Skip to content

Split find tracks kernel into compute and storage#1318

Draft
stephenswat wants to merge 1 commit intoacts-project:mainfrom
stephenswat:perf/split_find_tracks
Draft

Split find tracks kernel into compute and storage#1318
stephenswat wants to merge 1 commit intoacts-project:mainfrom
stephenswat:perf/split_find_tracks

Conversation

@stephenswat
Copy link
Copy Markdown
Member

No description provided.

@stephenswat stephenswat added the refactor Change the structure of the code label May 7, 2026
@sonarqubecloud
Copy link
Copy Markdown

sonarqubecloud Bot commented May 7, 2026

@stephenswat
Copy link
Copy Markdown
Member Author

Physics performance summary

Here is a summary of the physics performance effects of this PR. Command used:

traccc_seeding_example_cuda --input-directory=/data/Acts/odd-simulations-20240506/geant4_ttbar_mu200 --digitization-file=geometries/odd/odd-digi-geometric-config.json --conditions-file=geometries/odd/odd-digi-geometric-config.json --detector-file=geometries/odd/odd-detray_geometry_detray.json --grid-file=geometries/odd/odd-detray_surface_grids_detray.json --material-file=geometries/odd/odd-detray_material_detray.json --input-events=10 --use-acts-geom-source=on --check-performance --truth-finding-min-track-candidates=5 --truth-finding-min-pt=1.0 --truth-finding-min-z=-150 --truth-finding-max-z=150 --truth-finding-max-r=10 --seed-matching-ratio=0.99 --track-matching-ratio=0.5 --track-candidates-range=5:100 --seedfinder-vertex-range=-150:150

Seeding performance

Total number of seeds went from 298341 to 298338 (-0.0%)

Seeding plots



Track finding performance

Total number of found tracks went from 50211 to 50211 (+0.0%)

Finding plots









Track fitting performance

Fitting plots














Seeding to track finding relative performance

Seeding to track finding plots



Note

This is an automated message produced on the explicit request of a human being.

@stephenswat
Copy link
Copy Markdown
Member Author

Performance summary

Here is a summary of the performance effects of this PR:

Graphical

Tabular

KernelReciprocal ThroughputParallelism
bd18c46b2efdacDeltabd18c46b2efdac
propagate_to_next_surface5.80 ms5.80 ms0.0%4.334.33
find_tracks1.76 ms1.39 ms-20.9%1.831.79
count_doublets815.85 μs814.17 μs-0.2%1.611.61
ccl_kernel592.20 μs591.88 μs-0.1%1.711.71
count_triplets566.36 μs567.68 μs0.2%1.021.02
find_doublets535.05 μs537.89 μs0.5%3.083.08
Thrust::sort378.98 μs378.95 μs-0.0%7.327.33
condense_tracks251.48 μsnan5.75
find_triplets169.35 μs169.38 μs0.0%1.321.32
estimate_track_params146.80 μs146.66 μs-0.1%2.682.68
build_tracks123.72 μs123.63 μs-0.1%3.713.71
select_seeds59.31 μs58.98 μs-0.6%1.341.34
populate_grid23.90 μs23.96 μs0.3%1.221.22
count_grid_capacities22.14 μs22.13 μs-0.0%1.221.22
remove_duplicates20.01 μs20.02 μs0.0%25.6425.64
fill_sorted_measurements16.31 μs16.29 μs-0.1%1.131.13
update_triplet_weights14.89 μs14.78 μs-0.8%1.271.27
fill_finding_propagation_sort_keys8.76 μs8.82 μs0.7%7.767.67
form_spacepoints8.38 μs8.41 μs0.4%1.481.49
reduce_triplet_counts5.61 μs5.63 μs0.3%3.083.08
unknown5.02 μs5.08 μs1.2%4.284.25
fill_finding_duplicate_removal_sort_keys1.56 μs1.57 μs0.5%37.9138.02
DeviceScanKernel991.23 nsnan106.46
DeviceScanInitKernel66.44 nsnan768.00
Total11.08 ms10.97 ms-1.0%3.403.51

Important

All metrics in this report are given as reciprocal throughput, not as wallclock runtime.

Note

This is an automated message produced upon the explicit request of a human being.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

refactor Change the structure of the code

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant