Match subnets in O(n^3) via linear_sum_assignment. by anntzer · Pull Request #795 · soft-matter/trackpy

anntzer · 2025-11-04T11:39:34Z

To solve subnets, instead of brute-forcing the n! combinations, use scipy.optimize.linear_sum_assignment (commonly known as the Hungarian/Munkres algorithm, although scipy actually uses a different algorithm) which provides a solution in O(n^3). This is not an original idea; see e.g. #450 and
https://imagej.net/imagej-wiki-static/TrackMate_Algorithms.html#Solving_LAP

Locally, this method solves the "slow" example in adaptive-search.ipynb (tracks_regular = trackpy.link_df(cg, 0.75)) in ~50ms instead of 1min. linear_sum_assignment ("lsa") was extensively benchmarked in scipy PR#12541 (which was most about adding a sparse variant, but one can just look at the performance of lsa), which shows sub-second runtimes for thousands of inputs. This is also why this PR fully skips the use of MAX_SUB_NET (at most, one may consider setting an alternate value in the thousands for this strategy...).

(In fact, it may perhaps(?) be possible to get even better performance via by completely dropping the subnet paradigm and just passing the whole distance matrix (sparsified by removing edges greater than the search range) to scipy.sparse.csgraph.min_weight_full_bipartite_matching, but 1) one needs to figure out how to handle unmatched points (maybe by adding "dummy" points with a high but finite link cost), and 2) this would require more massive code surgery anyways. Edit: Maybe this doesn't actually work if we want to maintain the current approach of having too-long edges within a subnet be set to search_range**2.)

The docs will need to be updated, but this PR is just to discuss the implementation.

b-grimaud · 2025-11-17T16:21:18Z

I pulled this PR locally to try it out, and while I can see a slight difference in speed when running on individual files, it's not much different than numba when running on larger samples of my own data.

Just to make sure I'm not missing something, is it enough to pass lsa as linking strategy or is there something else than needs to be specified ? Are there cases where this method is more advantageous than others, e.g. a higher or lower number of features per frame ?

anntzer · 2025-11-18T22:06:24Z

Just to make sure I'm not missing something, is it enough to pass lsa as linking strategy or is there something else than needs to be specified ?

Yes. (Or in fact leaving the default value also works, if you don't have numba installed, because I changed the default only in that case.)

Are there cases where this method is more advantageous than others, e.g. a higher or lower number of features per frame ?

Yes, this is specifically better in the case where you have a lot of features that are within search_range of each other, but which all move together in some coordinated fashion (e.g., the "slow" example at https://soft-matter.github.io/trackpy/v0.7/tutorial/adaptive-search.html). In my specific use case, this occurs because I have multiple features that are all detected on a single larger solid object, these features simply track the solid's motion, and the spacing between the features is similar to the distance by which the object (and thus the features) moves from one frame to the next.

nkeim

Wow, @anntzer ! This is so exciting. It would be great to reduce our dependency on numba, too. Is there a minimum scipy version required?

We already compute a sparse distance matrix, in effect, thanks to search_range being passed to cKDTree etc. This is what makes the upstream part of the linking algorithm (which assigns trivial links and identifies the non-trivial subnets) O(N log N) instead of N^2. But I think that taking advantage of this fact would require, as you say, a major overhaul that would lock us into the LSA algorithm exclusively. Maybe in a later version of trackpy.

nkeim · 2026-06-18T13:16:00Z


+def subnet_linker_lsa(source_set, dest_set, search_range, max_size=None):
+    src = [*source_set]
+    dst = [*dest_set]


In #802 we now sort the candidates by uuid so that degenerate cases are stable. There are also new tests for this. Sorting will probably need to be added here.

Sure, done.

Did you already implement this? It looks like it didn't get pushed yet.

Oops, forgot to push, should be done now.

To solve subnets, instead of brute-forcing the n! combinations, use scipy.optimize.linear_sum_assignment (commonly known as the Hungarian/Munkres algorithm, although scipy actually uses a different algorithm) which provides a solution in O(n^3). This is not an original idea; see e.g. https://imagej.net/imagej-wiki-static/TrackMate_Algorithms.html#Solving_LAP Locally, this method solves the "slow" example in adaptive-search.ipynb (`tracks_regular = trackpy.link_df(cg, 0.75)`) in ~50ms instead of 1min. linear_sum_assignment ("lsa") was extensively benchmarked in scipy PR#12541 (which was most about adding a *sparse* variant, but one can just look at the performance of lsa), which shows sub-second runtimes for thousands of inputs. This is also why this PR fully skips the use of MAX_SUB_NET (at most, one may consider setting an alternate value in the thousands for this strategy...). The docs will need to be updated, but this PR is just to discuss the implementation.

anntzer · 2026-06-18T19:54:34Z

linear_sum_assignment is available since scipy 0.17.0, whereas trackpy already requires scipy 1.4.

anntzer force-pushed the lsa branch from c1c34ef to 11b56cb Compare November 4, 2025 16:03

nkeim reviewed Jun 18, 2026

View reviewed changes

TST: Add explicit tests for LSA subnet linking

eb8eb44

anntzer force-pushed the lsa branch from 031c6a2 to eb8eb44 Compare June 27, 2026 10:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Match subnets in O(n^3) via linear_sum_assignment.#795

Match subnets in O(n^3) via linear_sum_assignment.#795
anntzer wants to merge 2 commits into
soft-matter:masterfrom
anntzer:lsa

anntzer commented Nov 4, 2025 •

edited

Loading

Uh oh!

b-grimaud commented Nov 17, 2025

Uh oh!

anntzer commented Nov 18, 2025

Uh oh!

nkeim left a comment

Uh oh!

nkeim Jun 18, 2026

Uh oh!

anntzer Jun 18, 2026

Uh oh!

nkeim Jun 26, 2026

Uh oh!

anntzer Jun 27, 2026

Uh oh!

Uh oh!

Uh oh!

anntzer commented Jun 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

anntzer commented Nov 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

b-grimaud commented Nov 17, 2025

Uh oh!

anntzer commented Nov 18, 2025

Uh oh!

nkeim left a comment

Choose a reason for hiding this comment

Uh oh!

nkeim Jun 18, 2026

Choose a reason for hiding this comment

Uh oh!

anntzer Jun 18, 2026

Choose a reason for hiding this comment

Uh oh!

nkeim Jun 26, 2026

Choose a reason for hiding this comment

Uh oh!

anntzer Jun 27, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

anntzer commented Jun 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

anntzer commented Nov 4, 2025 •

edited

Loading