Add the hot reload delta emitter and symbol matcher #20027

Draft
NatElkins wants to merge 15 commits into dotnet:main from NatElkins:hotreload-delta-emitter

NatElkins commented Jul 3, 2026

Contributor

Adds the F# hot reload delta emitter and symbol matcher: given a baseline (#20026), a fresh compile's module, and a typed-tree diff (#20025), this layer produces the three Edit and Continue delta blobs (metadata delta with EncLog/EncMap through the #20019 writer, IL delta, PDB delta).

The synthesized-type match ladder is the core correctness surface: exact matching via replayed generation-0 names first (#20024 makes those byte-stable), then verified alias candidates, then a guarded positional-pairing recovery prepass (shape-verified bijection over line-normalized names), and fail-closed ambiguity errors for anything the ladder cannot prove; a rude edit and restart is always preferred over a wrong mapping. Task resumable-helper chains are reconciled by shape so an inserted let!/do! updates the right baseline helper rows and appends genuinely new await machinery instead of corrupting existing rows.
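The ladder described above can be sketched roughly as follows. This is an illustrative Python model, not the PR's F# implementation: `match_types`, `normalize`, `AmbiguousMatch`, the `@<line>` name pattern, and the use of an opaque "shape" value per type are all assumptions made for the example.

```python
import re
from collections import defaultdict


class AmbiguousMatch(Exception):
    """The ladder could not prove a mapping; the caller should report a rude edit."""


def normalize(name):
    # Hypothetical line normalization: drop the digits that follow '@'.
    return re.sub(r"@\d+", "@", name)


def match_types(baseline, fresh, aliases):
    """baseline/fresh map synthesized type name -> shape; aliases maps a fresh
    name to candidate baseline names. Returns (fresh -> baseline mapping, added)."""
    mapping, added, claimed = {}, [], set()

    # Rung 1: exact match on replayed generation-0 names.
    pending = []
    for name in fresh:
        if name in baseline:
            mapping[name] = name
            claimed.add(name)
        else:
            pending.append(name)

    # Rung 2: alias candidates, accepted only when exactly one verifies by shape.
    unresolved = []
    for name in pending:
        verified = [c for c in aliases.get(name, ())
                    if c in baseline and c not in claimed
                    and baseline[c] == fresh[name]]
        if len(verified) > 1:
            raise AmbiguousMatch(f"{name}: several alias candidates verify")
        if verified:
            mapping[name] = verified[0]
            claimed.add(verified[0])
        else:
            unresolved.append(name)

    # Rung 3: positional pairing inside buckets keyed by line-normalized name.
    old_buckets, new_buckets = defaultdict(list), defaultdict(list)
    for name in baseline:
        if name not in claimed:
            old_buckets[normalize(name)].append(name)
    for name in unresolved:
        new_buckets[normalize(name)].append(name)

    for key, new_names in new_buckets.items():
        old_names = old_buckets.get(key, [])
        if not old_names:
            added.extend(new_names)  # no baseline counterpart: genuinely new
            continue
        same_size = len(old_names) == len(new_names)
        if not same_size or any(baseline[o] != fresh[n]
                                for o, n in zip(old_names, new_names)):
            # Fail closed: a restart beats a wrong mapping.
            raise AmbiguousMatch(f"bucket {key!r} is not a shape-verified bijection")
        mapping.update(zip(new_names, old_names))

    return mapping, added
```

In this model, a closure renamed from `clo@12-1` to `clo@13-1` with an unchanged shape is recovered by the positional rung, while a bucket holding two baseline types and one fresh type raises instead of guessing.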

Everything is internal (SurfaceAreaTest green) and nothing on main calls it yet; the session slice wires it up. Session, capture hook, and the public active-statement surface are deferred there.

Tests: 25 direct emitter tests that build baselines through the #20026 layer, compile edited variants, and assert delta shape: EncLog/EncMap row kinds, updated method tokens, user-string heap content, exact and positional-pairing synthesized-type mappings, and fail-closed outcomes for ambiguous mixed buckets.

Stacked PR: based on #20026 (which stacks on #20018 and #20024), plus #20025 and #20019 merged in. The diff shows those commits until they merge. The only commit that belongs to this PR, viewable as a single diff:

Everything else in the commit list arrives from the dependency PRs via merges. This is the largest slice in the train; the match ladder and the delta serializer are the parts that deserve the closest read.

Sequencing

This PR is part of splitting the F# hot reload work (#19941) into small, independently reviewable PRs. The planned order:

  1. Wave 1 (independent): Add ResetCompilerGeneratedNameState to compiler-generated name generators #20017, Add Roslyn-format EnC CustomDebugInformation codec and portable PDB method CDI emission #20018, Add ECMA-335 EnC metadata delta writer #20019, Add stable synthesized-name replay infrastructure for hot reload #20024, Add typed-tree differ and edit classification for hot reload #20025.
  2. Wave 2: Add hot reload baseline reading and recorded EnC state #20026.
  3. Wave 3 (this PR): the delta emitter and symbol matcher.
  4. Wave 4: the hot reload session, FCS surface, and the --test:HotReloadDeltas capture hook (F# hot reload: Edit-and-Continue delta emission behind --test:HotReloadDeltas #19941 in its final, much smaller form).
  5. Last, explicitly experimental: Add an experimental flag-gated in-process compile path for hot reload sessions #20031 (the flag-gated in-process compile perf path).

NatElkins added 11 commits July 2, 2026 17:51
…ethod CDI emission

Adds an internal AbstractIL module implementing, byte for byte, the three Portable PDB
CustomDebugInformation blob formats Roslyn persists per method for Edit and Continue
(EnC Local Slot Map, EnC Lambda and Closure Map, EnC State Machine State Map), with
serializers, deserializers, a portable PDB read-back helper, and an occurrence-key
packing helper for deterministic syntax-offset slots.

Plumbs an optional methodCustomDebugInfoRows side channel through the IL binary writer
options into the portable PDB generator so a compilation can attach CDI rows to named
methods. Names that do not identify exactly one method row are dropped. All existing
writer call sites pass an empty map, so emitted PDBs are byte-identical to before.

No in-tree caller populates the map yet; the consumer is the F# hot reload work in
dotnet#19941, following the same pattern as dotnet#20017 (land isolated, test-covered
infrastructure first, wire the feature later).

Tests: blob round-trips, Roslyn golden-byte encodings, cross-validation against
CDI blobs emitted by a real Roslyn compilation, fail-closed occurrence-key packing
(including an int32-overflow regression where a wrapped negative key previously
escaped the bound check), and end-to-end synthetic PDB emission proving correct
MethodDef parenting, zero rows for an empty map, and no rows for absent or
ambiguous names.

ProcessStartInfo.ArgumentList does not exist on net472, which the component tests
also target on Windows CI. Build the quoted argument string by hand instead.

Adds an internal, standalone ECMA-335 Edit-and-Continue metadata delta writer to
AbstractIL: delta #- table stream and heap construction (DeltaMetadataTables,
DeltaMetadataSerializer, DeltaTableLayout, DeltaIndexSizing), ECMA-335 II.24.2.6
coded-index encoding (DeltaMetadataEncoding), EncLog/EncMap emission, generation GUID
chaining, user-string and standalone-signature token calculators (IlxDeltaStreams),
and the coordinating writer (FSharpDeltaMetadataWriter) over a plain row-description
input model (DeltaMetadataTypes, ILDeltaHandles, ILMetadataHeaps).
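For readers unfamiliar with the coded-index encoding in ECMA-335 II.24.2.6, here is a minimal Python sketch of the rule: the row id is shifted left by the number of tag bits and the tag selects the target table. The function names are illustrative and do not mirror DeltaMetadataEncoding; only two of the coded-index kinds are shown.

```python
# Table order within each coded-index kind determines the tag value.
TYPE_DEF_OR_REF = ("TypeDef", "TypeRef", "TypeSpec")
MEMBER_REF_PARENT = ("TypeDef", "TypeRef", "ModuleRef", "MethodDef", "TypeSpec")


def tag_bits(tables):
    # Number of low bits needed to distinguish the tables (ceil(log2(n))).
    return (len(tables) - 1).bit_length()


def encode(tables, table, rid):
    # Coded index = row id shifted past the tag bits, OR-ed with the tag.
    return (rid << tag_bits(tables)) | tables.index(table)


def decode(tables, value):
    bits = tag_bits(tables)
    tag = value & ((1 << bits) - 1)
    if tag >= len(tables):
        raise ValueError(f"tag {tag} is not defined for this coded index")
    return tables[tag], value >> bits


def index_size(tables, row_counts):
    # 2 bytes when every target table has fewer than 2^(16 - tag bits) rows,
    # 4 bytes otherwise.
    limit = 1 << (16 - tag_bits(tables))
    largest = max(row_counts.get(t, 0) for t in tables)
    return 2 if largest < limit else 4
```

For example, a TypeRef row 5 encodes as `(5 << 2) | 1`, and a TypeDefOrRef index widens to 4 bytes once any of its three tables reaches 16384 rows.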

The writer's inputs are row records (names, tokens, signatures, RVAs) plus heap
offsets; it has no dependency on any semantic diffing or session machinery. It
compiles with no in-tree consumer by design: the consumer is the F# hot reload work
in dotnet#19941, following the same upstreaming pattern as dotnet#20017 and dotnet#20018
(land isolated, test-covered infrastructure first, wire the feature in a later PR).
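As a rough illustration of the token arithmetic such row records rely on, the Python sketch below composes metadata tokens from a table id and a row id. The helper names are hypothetical, and the user-string chaining rule (delta offset added to the size of all prior generations' heaps) is stated as an assumption about how the calculators behave, not taken from the PR's code.

```python
TABLE_METHOD_DEF = 0x06      # MethodDef table id
TABLE_STANDALONE_SIG = 0x11  # StandAloneSig table id
USER_STRING_TAG = 0x70       # high byte of a user-string token


def make_token(table, rid):
    # A metadata token is the table id in the top byte and a 24-bit row id.
    if not 0 < rid <= 0xFFFFFF:
        raise ValueError("row id must fit in 24 bits and be non-zero")
    return (table << 24) | rid


def split_token(token):
    return token >> 24, token & 0xFFFFFF


def next_row_token(table, baseline_row_count):
    # A row appended by a delta takes the first row id after the baseline.
    return make_token(table, baseline_row_count + 1)


def user_string_token(prior_heap_size, delta_offset):
    # Assumption: offsets chain across generations, so the token addresses
    # the string at (size of earlier heaps + offset inside this delta).
    offset = prior_heap_size + delta_offset
    if not 0 <= offset <= 0xFFFFFF:
        raise ValueError("user-string offset must fit in 24 bits")
    return (USER_STRING_TAG << 24) | offset
```

With a baseline holding 7 standalone signatures, `next_row_token(TABLE_STANDALONE_SIG, 7)` yields `0x11000008`, which is the "baseline+1" shape the tests in this commit assert.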

One line of ilwrite.fsi is touched to expose the pre-existing markerForUnicodeBytes
so the delta writer reuses the exact string-marker logic of the full writer. No
behavior change for any existing code path.

Tests (130): coded-index encodings asserted against the production definitions and
ECMA-335 II.24.2.6 order, System.Reflection.Metadata reader parity over emitted
deltas, EncLog/EncMap correctness, stream layout, heap and index sizing,
multi-generation heap-offset chaining asserted against computed expected values,
standalone-signature rows asserted at baseline+1 from a real seeded baseline, and
serializer failure paths.

xunit 3.2.2 no longer discovers internal test classes, so the module was
silently skipped after rebasing onto current main (pre-existing internal test
modules like CompilerService.Caches are likewise undiscovered there). Public
visibility restores discovery; 17 tests run and pass.

Add internal generated-name normalization and synthesized-name map replay support as a standalone slice. The new map state is side-channel based, all new compiler modules remain internal, and CompilerGlobalState preserves the existing no-map counter path while checking an accessor captured once per compiler state.

Route existing IlxGen generated-name allocations through inert helper wrappers, add pure name-map and normalizer tests, add a normal compilation determinism guard over emitted generated names, and document the extracted seams in P5_REPORT.md.

Verification: built FSharp.Compiler.Service, FSharp.Compiler.Service.Tests, FSharp.Compiler.ComponentTests, and FSharpSuite.Tests in Release; ran the migrated service test classes, the component determinism class, FSharpSuite DeterministicTests, and the FCS SurfaceArea class successfully.

Add an internal TypedTreeDiff module that snapshots CheckedImplFile bindings and entities, then classifies body, signature, inline, declaration add/remove, and type layout changes without depending on hot reload sessions, runtime capability negotiation, EnC capability names, baseline state, or delta emission.

Add focused FCS tests for unchanged/reference-equal files, body edits, signature edits, additions, deletions, layout changes, and logical-name arity handling. Wire the module and tests into compile order, add a release note, and include P6_REPORT.md.

Verification:

- ./.dotnet/dotnet build src/Compiler/FSharp.Compiler.Service.fsproj -c Debug /p:BUILDING_USING_DOTNET=true

- ./.dotnet/dotnet test tests/FSharp.Compiler.Service.Tests/FSharp.Compiler.Service.Tests.fsproj -c Debug /p:BUILDING_USING_DOTNET=true -- --filter-class "*TypedTreeDiffTests*"

- ./.dotnet/dotnet test tests/FSharp.Compiler.Service.Tests/FSharp.Compiler.Service.Tests.fsproj -c Debug /p:BUILDING_USING_DOTNET=true -- --filter-class "*SurfaceAreaTest*"

Add standalone baseline PE and portable PDB readers for hot reload, including token maps and MVID/PDB table snapshots.

Carry the F# synthesized-name snapshot module CDI codec and portable PDB read path, with recorded snapshots taking precedence over IL reconstruction.

Add focused component tests for snapshot round-trip, direct module CDI reading, baseline token maps, recorded fallback behavior, and EnC closure-name reconstruction.

Add the internal hot reload delta emitter, symbol matcher, and direct emitter tests.

Keep session and service integration deferred.

github-actions Bot commented Jul 3, 2026

Contributor

❗ Release notes required

You can open this PR in browser to add release notes: open in github.dev


✅ Found changes and release notes in following paths:

Warning

No PR link found in some release notes, please consider adding it.

| Change path | Release notes path | Description |
|---|---|---|
| src/Compiler | docs/release-notes/.FSharp.Compiler.Service/11.0.100.md | No current pull request URL (#20027) found, please consider adding it |

NatElkins added 4 commits July 3, 2026 08:50
…CI images

The Roslyn cross-validation test shells out to dotnet build to produce a
real Roslyn PDB. The process launch had two problems. It computed the
host path by hand from __SOURCE_DIRECTORY__, which misses on CI images
that carry no repo-local .dotnet at that depth, failing with
Win32Exception before the build starts. It also left UseShellExecute at
its default, which is true on net472 and rejects redirected streams, so
every Desktop test leg failed deterministically with
InvalidOperationException.

Resolve the host like the rest of the test framework via
TestFramework.initialConfig.DotNetExe, which prefers the repo-local
.dotnet and falls back to PATH, and set UseShellExecute to false
explicitly.

Verified: FSharp.Compiler.ComponentTests builds clean;
EncMethodDebugInformationTests 17 passed, 0 failed (net10.0); fantomas
clean on the touched file.

…erload

CI compiles the snapshot round-trip and baseline reader tests with
FS0193: the explicit Assert.Equal<string> type application commits
overload resolution to the scalar Equal(T, T) shape while both
arguments are string arrays. Use Assert.Equal<string[]>, the form the
neighboring name-map tests already use, so resolution lands on the
structural array comparison everywhere.

Verified: FSharp.Compiler.ComponentTests builds clean;
EncMethodDebugInformationTests 22 passed, 0 failed (net10.0); fantomas
clean on the touched file.