Speed up Seurat FindNeighbors: up to 4.23× faster, identical output

Benchmark charts

Switch benchmark platform; all charts update together

Speedup distribution

Each dot is one finalized dataset/thread run on Windows

tms_ss2

4.23×

splitseq_rosenberg

3.62×

heart_adult

3.43×

gastrulation_pijuansa…

3.38×

pbmc68k

3.33×

pbmc200k_glaucoma

3.13×

tms_ss2splitseq_rosenbergheart_adultgastrulation_pijuansa…pbmc68kpbmc200k_glaucoma

Thread sweep

Speedup across finalized thread counts on Windows

tms_ss2splitseq_rosenbergheart_adultgastrulation_pijuan…pbmc68kpbmc200k_glaucoma

Memory

Baseline vs optimized peak memory on Windows

baselineoptimized

What is accelerated

The public API stays the same; AutoZyme replaces only the supported fast path.

This task targets FindNeighbors in Seurat. The benchmarked result preserves the declared scientific output gate while reducing CPU runtime on the listed datasets.

Also searched as: KNN, kNN graph, nearest neighbors, SNN, shared nearest neighbor, neighborhood graph.

Supported scope

Fast path is taken only for: a Seurat object (inherits "Seurat"), nn.method == "annoy", annoy.metric == "euclidean", return.neighbor == FALSE, l2.norm == FALSE, and the zyme flag TRUE (lines 373-376). Read full supported scope

Fast path is taken only for: a Seurat object (inherits "Seurat"), nn.method == "annoy", annoy.metric == "euclidean", return.neighbor == FALSE, l2.norm == FALSE, and the zyme flag TRUE (lines 373-376). On that path it builds an Annoy Euclidean index from Embeddings(object[[reduction]])[, dims] in C++ (single-threaded build, std::thread-parallel k-NN search over OMP_NUM_THREADS / detectCores), then constructs the NN sparse Graph and, when compute.SNN is TRUE, the SNN graph via Seurat:::ComputeSNN(prune = prune.SNN). Correctly honored args: reduction (any reduction present in the object), dims (any column subset that exists), k.param, n.trees, prune.SNN, compute.SNN, graph.name, verbose. Equivalence is approximate-only: the metric is knn_jaccard >= 0.85, not bit-exact (Annoy is approximate and the kernel casts to float32). This covers the benchmarked default Annoy/Euclidean graph configuration.

Out-of-scope behavior

silent fallback to upstream

Show detailed speedup table 11 runs

Dataset	Tier	Platform	Threads	Baseline	Optimized	Speedup	Memory	Concordance	Pass
`gastrulation_pijuansala`	ood_large3	Windows	32	24.72 s	7.32 s	3.38×	40.6 → 37.1 GB	—	pass
`heart_adult`	large	Windows	32	1.83 min	31.99 s	3.43×	73.5 → 52.1 GB	—	pass
`pbmc200k_glaucoma`	medium	Windows	32	56.81 s	14.27 s	3.13×	28.7 → 20.2 GB	—	pass
`pbmc68k`	small	Windows	32	15.67 s	4.70 s	3.33×	7.3 → 5.3 GB	—	pass
`splitseq_rosenberg`	ood_large1	Windows	32	33.40 s	9.20 s	3.62×	20.5 → 14.2 GB	—	pass
`tms_ss2`	ood_large2	Windows	32	20.10 s	5.59 s	4.23×	24.1 → 23.8 GB	—	pass
`gastrulation_pijuansala`	ood_large3	macOS	14	19.94 s	6.35 s	3.13×	14.4 → 14.0 GB	—	pass
`pbmc200k_glaucoma`	medium	macOS	14	34.08 s	10.62 s	3.21×	10.0 → 9.5 GB	—	pass
`pbmc68k (inferred)`	small	macOS	14	14.07 s	4.44 s	3.17×	15.3 → 10.3 GB	—	pass
`splitseq_rosenberg`	ood_large1	macOS	14	25.17 s	7.75 s	3.25×	6.4 → 6.2 GB	—	pass
`tms_ss2`	ood_large2	macOS	14	16.14 s	5.09 s	3.15×	9.0 → 8.7 GB	—	pass

Frequently asked questions

Speeding up Seurat FindNeighbors

Why is Seurat FindNeighbors slow?

Seurat FindNeighbors is CPU-bound, and the stock implementation in Seurat leaves performance on the table in its core numerical work. On the benchmark datasets the original takes 20.10 s where the AutoZyme path takes 5.59 s (4.23× faster).

How do I make Seurat FindNeighbors faster?

Install AutoZyme and activate the Seurat patch, then keep using Seurat FindNeighbors exactly as before. AutoZyme transparently substitutes the faster, output-validated path, up to 4.23× faster on the benchmark datasets, with no pipeline or API changes.

Does the AutoZyme speedup change the Seurat FindNeighbors output?

No. The accelerated path returns bit-for-bit identical results to the original Seurat implementation (maximum absolute difference 0), checked by a frozen concordance gate on every benchmark dataset.

How do I install the Seurat speedup?

In R: install the autozyme package, then run library(autozyme) and autozyme::activate("seurat"). The patch applies automatically the next time you call FindNeighbors.