
Improve trace reconciliation performance #15725

Open · wants to merge 3 commits into master
Conversation

bendikberg

Purpose

When loading a file with many bindings stored in CallSite.SingleRunTraceData, trace reconciliation spends a lot of time reading nested data because of repeated list allocations. More generally, it does unnecessary work copying lists and iterating through IEnumerables for .Contains tests and the like.

These changes have a large impact on the graph execution time for graphs with a lot of orphaned serializable data, as seen in these profiling results.

(Profiling screenshots: devenv_Hu6ocw92K8, devenv_qPyrZ0YqIQ)

Declarations

Check these if you believe they are true

  • The codebase is in a better state after this PR
  • Is documented according to the standards
  • The level of testing this PR includes is appropriate
  • User facing strings, if any, are extracted into *.resx files
  • All tests pass using the self-service CI.
  • Snapshot of UI changes, if any.
  • Changes to the API follow Semantic Versioning and are documented in the API Changes document.
  • This PR modifies some build requirements and the readme is updated
  • This PR contains no files larger than 50 MB

Release Notes

N/A

Reviewers

@mjkkirschner


FYIs

@dimven


- var currentSerializables = traceData.SelectMany(td => td.RecursiveGetNestedData());
+ var currentSerializables = traceData.SelectMany(td => td.RecursiveGetNestedData()).ToHashSet();
  result.AddRange(beforeFirstRunSerializables.Where(hs => !currentSerializables.Contains(hs)).ToList());
bendikberg (Author):

This line with the added .ToHashSet() call and the line below are the likeliest culprits of the bad performance, as they would previously keep reloading the TraceData from the CallSite object for every .Contains test.
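For illustration, a minimal C# sketch of the difference (the data and variable names are placeholders, not the actual Dynamo types; in the real code the re-enumeration additionally re-reads the trace data from the CallSite, making it even more expensive):

```csharp
// A deferred LINQ query is re-enumerated on every Contains() call, while ToHashSet()
// materializes the pipeline once and makes each membership test O(1) on average.
using System;
using System.Collections.Generic;
using System.Linq;

class DeferredVsHashSet
{
    static void Main()
    {
        var traceData = Enumerable.Range(0, 1_000)
            .Select(i => new[] { $"serializable-{i}" })
            .ToList();
        var beforeFirstRunSerializables = new[] { "serializable-0", "orphan-1", "orphan-2" };

        // Deferred: every Contains() walks the whole SelectMany pipeline again.
        IEnumerable<string> deferred = traceData.SelectMany(td => td);
        var orphansSlow = beforeFirstRunSerializables.Where(s => !deferred.Contains(s)).ToList();

        // Materialized: the pipeline runs once; Contains() then hashes instead of scanning.
        var materialized = traceData.SelectMany(td => td).ToHashSet();
        var orphansFast = beforeFirstRunSerializables.Where(s => !materialized.Contains(s)).ToList();

        Console.WriteLine(string.Join(", ", orphansFast)); // orphan-1, orphan-2
    }
}
```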

Member:

if I'm understanding that correctly, then it could be solved by actualizing the currentSerializables outside the deferred call - right?

Is the cost to create the hashset worth the lookup cost?


github-actions bot commented Jan 2, 2025

UI Smoke Tests

Test: success. 11 passed, 0 failed.
TestComplete Test Result
Workflow Run: UI Smoke Tests
Check: UI Smoke Tests

@mjkkirschner self-assigned this Jan 2, 2025
- var orphanedSerializables = cs.GetOrphanedSerializables().ToList();
- if (callsiteToOrphanMap.ContainsKey(cs.CallSiteID))
+ var orphanedSerializables = cs.GetOrphanedSerializables();
+ if (callsiteToOrphanMap.TryGetValue(cs.CallSiteID, out var serializablesForCallsite))
Member:

how does this change improve performance?

bendikberg (Author):

Since the immediate action after the ContainsKey check is to mutate the value at that key, using TryGetValue avoids a redundant dictionary lookup.

See C# analyzer rule CA1854.
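A minimal sketch of the CA1854 pattern, using hypothetical types rather than the actual Dynamo ones:

```csharp
// ContainsKey followed by the indexer costs two hash lookups;
// TryGetValue tests for presence and retrieves the value in one.
using System;
using System.Collections.Generic;

class Ca1854Sketch
{
    static void Main()
    {
        var callsiteToOrphanMap = new Dictionary<Guid, List<string>>();
        var callSiteId = Guid.NewGuid();
        var orphanedSerializables = new List<string> { "a", "b" };

        // Before: two lookups, one for ContainsKey and one for the indexer.
        if (callsiteToOrphanMap.ContainsKey(callSiteId))
            callsiteToOrphanMap[callSiteId].AddRange(orphanedSerializables);

        // After: a single lookup both tests for the key and returns the value.
        if (callsiteToOrphanMap.TryGetValue(callSiteId, out var serializablesForCallsite))
            serializablesForCallsite.AddRange(orphanedSerializables);
        else
            callsiteToOrphanMap.Add(callSiteId, new List<string>(orphanedSerializables));
    }
}
```

The TryGetValue form also threads the retrieved value straight into the mutation, which matches the else branch further down in the diff.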

Member:

okay, seems like a pretty safe change, though under profiling does it make an actual difference here?

  }
  else
  {
-     callsiteToOrphanMap.Add(cs.CallSiteID, orphanedSerializables);
+     callsiteToOrphanMap.Add(cs.CallSiteID, orphanedSerializables.ToList());
Member:

so orphanedSerializables is already a List<string>:

var result = new List<string>();

though it's declared here as an IList. I assume that this calls the List constructor, passing in the existing list; have you verified there is any benefit to moving this ToList call around?

If it's really such a big performance impact, you could probably change this data structure to be a dict of IList.
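For reference, a small sketch of the copy semantics in question: ToList() and the List<T> copy constructor both allocate a new backing array and copy the elements, even when the IList<string> is already a List<string>, so moving the ToList call only changes where (and how often) that copy happens.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

class ListCopySketch
{
    static void Main()
    {
        IList<string> source = new List<string> { "a", "b", "c" };

        var copy1 = source.ToList();          // copies the elements into a new list
        var copy2 = new List<string>(source); // same cost: copies the elements

        Console.WriteLine(ReferenceEquals(copy1, source)); // False: distinct list
        Console.WriteLine(ReferenceEquals(copy2, source)); // False: distinct list

        // Storing the reference avoids the copy, but callers then share one mutable list.
        IList<string> shared = source;
        Console.WriteLine(ReferenceEquals(shared, source)); // True
    }
}
```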

@@ -897,7 +897,7 @@ internal IList<string> GetOrphanedSerializablesAndClearHistoricalTraceData()

  if (Nodes.All(n => n.GUID != nodeGuid))
  {
-     orphans.AddRange(nodeData.Value.SelectMany(CallSite.GetAllSerializablesFromSingleRunTraceData).ToList());
+     orphans.AddRange(nodeData.Value.SelectMany(CallSite.GetAllSerializablesFromSingleRunTraceData));
Member:

does calling AddRange immediately execute the deferred SelectMany call?
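An illustrative check, under the assumption that the SelectMany source is an ordinary deferred LINQ query: List<T>.AddRange enumerates its argument immediately, so the pipeline still executes exactly once here and the removed .ToList() only added an intermediate copy.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

class AddRangeSketch
{
    static void Main()
    {
        int selectorCalls = 0;
        IEnumerable<string> deferred = Enumerable.Range(0, 3)
            .SelectMany(i => { selectorCalls++; return new[] { $"s{i}" }; });

        Console.WriteLine(selectorCalls); // 0 - nothing has run yet

        var orphans = new List<string>();
        orphans.AddRange(deferred);       // enumeration happens here

        Console.WriteLine(selectorCalls); // 3 - the pipeline ran once during AddRange
    }
}
```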


- foreach (var ws in Workspaces.OfType<HomeWorkspaceModel>())
+ foreach (var maybeWs in Workspaces)
Member:

why do this?

Member:

I don't think there is any value in iterating all the nodes of a custom node workspace... maybe you can say a bit more about why you need this other map to improve performance.

{
foreach (var node in maybeWs.Nodes)
Member:

please use braces.

Member:

also consider that this could be a very large number of nodes.

if (!nodeToWorkspaceMap.TryGetValue(nodeGuid, out var nodeSpace))
continue;

//var nodeSpace =
Member:

??

@mjkkirschner (Member) commented Jan 7, 2025:

hey @bendikberg I would like some more description in the PR about what your changes are intending and how you validated that they make a substantial improvement. Maybe not all of these changes are equally important.


if (!wsOrphans.Any())
continue;

if (!workspaceOrphanMap.ContainsKey(ws.Guid))
mjkkirschner (Member), Jan 7, 2025:

what's the benefit of this change, and of the downstream changes below?

-     ws =>
-         ws.Nodes.FirstOrDefault(n => n.GUID == nodeGuid)
-         != null);
+ if (!nodeToWorkspaceMap.TryGetValue(nodeGuid, out var nodeSpace))
mjkkirschner (Member), Jan 7, 2025:

hmm... on first glance memoizing this lookup makes sense, but I'm unclear exactly how many times we recreate the map vs how fast we bail on first finding the node in the existing code.

Is the map recreated only once per workspace open? Does it need to be updated at some point?


// Add the node's orphaned serializables to the workspace
// orphan map.
if (workspaceOrphanMap.ContainsKey(nodeSpace.Guid))
Member:

same question about this lookup change.


public void RecursiveFillWithNestedData(List<string> listToFill)
Member:

seems this can be private or internal.


public void RecursiveFillWithNestedData(List<string> listToFill)
Member:

what is the intention of your change here?


@dimven (Contributor) commented Jan 8, 2025:

@mjkkirschner To add some more context: this is more relevant to D4R because it writes huge amounts of trace data tying the Dynamo wrapper elements to the matching Revit elements for continuous updates between Dynamo/Revit sessions. In cases where the graph stored a lot of references that are no longer valid, the first execution would be very slow. Look at the attached trace data images: the GetOrphanedSerializables method was taking 42% of the graph execution time, and after the changes that dropped to an insignificant number.

I'm not familiar with Dynamo for Civil3d but if it tracks the elements in a similar way, then this would be relevant there as well.

Comment on lines -477 to +479
- if (!beforeFirstRunSerializables.Any())
-     return result;
+ if (beforeFirstRunSerializables.Count == 0)
+     return new List<string>();
Contributor:

Is this any more performant?

@bendikberg (Author) commented Jan 14, 2025:

@mjkkirschner

> hey @bendikberg I would like some more description in the PR about what your changes are intending and how you validated that they make a substantial improvement. Maybe not all of these changes are equally important.

I made this PR during the D4R performance profiling project. The code was originally written as more of a research and test exercise, so it is in a pretty raw state; I assumed someone on the Dynamo team would grab the parts of this PR they deemed relevant and implement those.

What I was trying to do was figure out where and why the CallSite TraceData was slowing down the first run of a graph, so I touched a few different files related to TraceData handling.

Most of the .ToList and ContainsKey/TryGetValue juggling can be left out of this PR or skipped entirely when compared to the performance gained from the algorithmic optimizations. Those changes don't cause the same kind of exponential slowdowns, but they still do unnecessary work. I made them because I figured that, as long as I was improving TraceData performance on the whole, I would also include simple (but tiny) optimizations.

I made some benchmarks for the most impactful changes:


This gets slower as the number of TraceData items on a node increases (this can easily get very large for a Revit file).

RecursiveFillWithNestedData change

Tests for 0 sublevels, 0 children per item per level (total: 1)
        RecursiveFillManyLists         :   100 runs : mean:         13 us
        RecursiveFillOneList           :   100 runs : mean:          4 us
        NonRecursiveFill               :   100 runs : mean:          4 us
Tests for 1 sublevels, 100 children per item per level (total: 101)
        RecursiveFillManyLists         :   100 runs : mean:         32 us
        RecursiveFillOneList           :   100 runs : mean:         21 us
        NonRecursiveFill               :   100 runs : mean:         22 us
Tests for 100 sublevels, 1 children per item per level (total: 101)
        RecursiveFillManyLists         :   100 runs : mean:         48 us
        RecursiveFillOneList           :   100 runs : mean:         22 us
        NonRecursiveFill               :   100 runs : mean:         19 us
Tests for 1 sublevels, 10000 children per item per level (total: 10001)
        RecursiveFillManyLists         :   100 runs : mean:        656 us
        RecursiveFillOneList           :   100 runs : mean:        364 us
        NonRecursiveFill               :   100 runs : mean:        450 us
Tests for 1 sublevels, 100000 children per item per level (total: 100001)
        RecursiveFillManyLists         :   100 runs : mean:      15818 us
        RecursiveFillOneList           :   100 runs : mean:       2122 us
        NonRecursiveFill               :   100 runs : mean:       3612 us
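A hedged sketch of the pattern those numbers correspond to (simplified types, not the actual Dynamo implementation): the "one list" variant passes a single result list down the recursion, while the "many lists" variant allocates, returns, and merges a list at every level.

```csharp
using System;
using System.Collections.Generic;

class TraceItem
{
    public string Data;
    public List<TraceItem> NestedData = new List<TraceItem>();

    // "RecursiveFillManyLists" shape: a new list per call, copied upward at every level.
    public List<string> RecursiveGetNestedData()
    {
        var result = new List<string> { Data };
        foreach (var child in NestedData)
            result.AddRange(child.RecursiveGetNestedData());
        return result;
    }

    // "RecursiveFillOneList" shape: the caller's list is filled in place, no copies.
    public void RecursiveFillWithNestedData(List<string> listToFill)
    {
        listToFill.Add(Data);
        foreach (var child in NestedData)
            child.RecursiveFillWithNestedData(listToFill);
    }
}

class Program
{
    static void Main()
    {
        var root = new TraceItem { Data = "root" };
        root.NestedData.Add(new TraceItem { Data = "child" });

        var flat = new List<string>();
        root.RecursiveFillWithNestedData(flat);
        Console.WriteLine(string.Join(", ", flat)); // root, child
    }
}
```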

This scales poorly with the number of nodes on a canvas.

Node memoization

Node lookup

Test input: Workspaces 1 with 10 nodes. Selected workspace: 1. Lookup: 1 node ids RandomFromWorkspace
        Lookup linq (current)          :   100 runs : mean:          2 us
        Lookup dict                    :   100 runs : mean:          3 us
Test input: Workspaces 1 with 10 nodes. Selected workspace: 1. Lookup: 5 node ids RandomFromWorkspace
        Lookup linq (current)          :   100 runs : mean:         12 us
        Lookup dict                    :   100 runs : mean:          7 us
Test input: Workspaces 1 with 100 nodes. Selected workspace: 1. Lookup: 100 node ids AllInWorkspace
        Lookup linq (current)          :   100 runs : mean:       1974 us
        Lookup dict                    :   100 runs : mean:         36 us
Test input: Workspaces 5 with 100 nodes. Selected workspace: 5. Lookup: 10 node ids RandomFromWorkspace
        Lookup linq (current)          :   100 runs : mean:        707 us
        Lookup dict                    :   100 runs : mean:         99 us
Test input: Workspaces 5 with 100 nodes. Selected workspace: 5. Lookup: 50 node ids RandomFromWorkspace
        Lookup linq (current)          :   100 runs : mean:       3382 us
        Lookup dict                    :   100 runs : mean:        108 us
Test input: Workspaces 5 with 100 nodes. Selected workspace: 5. Lookup: 1000 node ids NotInWorkspace
        Lookup linq (current)          :   100 runs : mean:      67577 us
        Lookup dict                    :   100 runs : mean:        143 us
Test input: Workspaces 1 with 1000 nodes. Selected workspace: 1. Lookup: 1000 node ids AllInWorkspace
        Lookup linq (current)          :   100 runs : mean:     136409 us
        Lookup dict                    :   100 runs : mean:        221 us
Test input: Workspaces 1 with 1000 nodes. Selected workspace: 1. Lookup: 500 node ids RandomFromWorkspace
        Lookup linq (current)          :   100 runs : mean:      67649 us
        Lookup dict                    :   100 runs : mean:        218 us
Test input: Workspaces 1 with 1000 nodes. Selected workspace: 1. Lookup: 1000 node ids NotInWorkspace
        Lookup linq (current)          :   100 runs : mean:     130951 us
        Lookup dict                    :   100 runs : mean:        250 us
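A simplified sketch of the two strategies being benchmarked (types reduced to the essentials, not the real Dynamo classes): the "linq" variant scans every node of every workspace per lookup, while the "dict" variant builds a node-GUID-to-workspace map once and answers each lookup with a single hash probe.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

class Node { public Guid GUID; }
class Workspace { public Guid Guid; public List<Node> Nodes = new List<Node>(); }

class NodeLookupSketch
{
    // "Lookup linq (current)": scans workspaces and their nodes on every call.
    static Workspace FindByScan(IEnumerable<Workspace> workspaces, Guid nodeGuid) =>
        workspaces.FirstOrDefault(ws => ws.Nodes.Any(n => n.GUID == nodeGuid));

    // "Lookup dict": one pass over all nodes to build, then O(1) per lookup.
    static Dictionary<Guid, Workspace> BuildNodeToWorkspaceMap(IEnumerable<Workspace> workspaces)
    {
        var map = new Dictionary<Guid, Workspace>();
        foreach (var ws in workspaces)
            foreach (var node in ws.Nodes)
                map[node.GUID] = ws;
        return map;
    }

    static void Main()
    {
        var node = new Node { GUID = Guid.NewGuid() };
        var ws = new Workspace { Guid = Guid.NewGuid() };
        ws.Nodes.Add(node);
        var workspaces = new List<Workspace> { ws };

        var map = BuildNodeToWorkspaceMap(workspaces);
        Console.WriteLine(map[node.GUID] == FindByScan(workspaces, node.GUID)); // True
    }
}
```

As the reviewer notes above, the open question for the real code is when such a map would have to be rebuilt or updated, e.g. when nodes are added or removed.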

The main culprit

Tests for 10 callsites, each with 10 tracedata (datatype System.Guid) (Orphaned tracedata: 500)
        Lookup linq (current)          :    10 runs : mean:      26486 us
        Lookup dict (fill many lists)  :    50 runs : mean:        146 us
        Lookup dict (fill one list)    :    50 runs : mean:        157 us
Tests for 10 callsites, each with 1000 tracedata (datatype System.Guid) (Orphaned tracedata: 500)
        Lookup linq (current)          :    10 runs : mean:     225777 us
        Lookup dict (fill many lists)  :    50 runs : mean:       1642 us
        Lookup dict (fill one list)    :    50 runs : mean:       1227 us
Tests for 10 callsites, each with 10000 tracedata (datatype System.Guid) (Orphaned tracedata: 500)
        Lookup linq (current)          :     1 run        :    7132931 us
        Lookup dict (fill many lists)  :    50 runs : mean:      23625 us
        Lookup dict (fill one list)    :    50 runs : mean:      10033 us
Tests for 10 callsites, each with 10 tracedata (datatype System.Guid) (Orphaned tracedata: 50000)
        Lookup linq (current)          :    10 runs : mean:     262322 us
        Lookup dict (fill many lists)  :    50 runs : mean:       1413 us
        Lookup dict (fill one list)    :    50 runs : mean:       1303 us
Tests for 10 callsites, each with 1000 tracedata (datatype System.Guid) (Orphaned tracedata: 50000)
        Lookup linq (current)          :    10 runs : mean:   20235897 us
        Lookup dict (fill many lists)  :    50 runs : mean:       2595 us
        Lookup dict (fill one list)    :    50 runs : mean:       2028 us
Tests for 10 callsites, each with 10000 tracedata (datatype System.Guid) (Orphaned tracedata: 50000)
        Lookup linq (current)          :     1 run        :  526709143 us
        Lookup dict (fill many lists)  :    50 runs : mean:      24024 us
        Lookup dict (fill one list)    :    50 runs : mean:      11245 us

Benchmark source

For small input sizes this doesn't add up to many microseconds, but it scales very poorly with larger input. That is bad, since users might save arbitrary amounts of data in their files, which can make Dynamo hang for seconds or minutes.


TL;DR:

This PR can be reduced to only include the algorithmic improvements that I've shown in the benchmarks.
