Speed up Seurat FindAllMarkers: up to 271.4× faster, identical output

Benchmark charts

Switch benchmark platform; all charts update together

Speedup distribution

Each dot is one finalized dataset/thread run on Windows

log scale

tms_ss2

271.4×

heart_adult

249.7×

splitseq_rosenberg

222.3×

gastrulation_pijuansa…

199.8×

pbmc200k_glaucoma

167.8×

pbmc68k

126.3×

tms_ss2heart_adultsplitseq_rosenberggastrulation_pijuansa…pbmc200k_glaucomapbmc68k

Thread sweep

Speedup across finalized thread counts on Windows

tms_ss2heart_adultsplitseq_rosenberggastrulation_pijuan…pbmc200k_glaucomapbmc68k

Memory

Baseline vs optimized peak memory on Windows

baselineoptimized

What is accelerated

The public API stays the same; AutoZyme replaces only the supported fast path.

This task targets FindAllMarkers in Seurat. The benchmarked result preserves the declared scientific output gate while reducing CPU runtime on the listed datasets.

Also searched as: FindMarkers, FindConservedMarkers, marker genes, differential expression, DEG, DE genes, wilcoxon, wilcox.

Supported scope

The shipped default path (fast_FindAllMarkers_fusion) computes a Wilcoxon-rank-sum (normal-approximation, presto-style) marker test per cluster-vs-rest using a single fused RcppParallel C++ kernel. Read full supported scope

The shipped default path (fast_FindAllMarkers_fusion) computes a Wilcoxon-rank-sum (normal-approximation, presto-style) marker test per cluster-vs-rest using a single fused RcppParallel C++ kernel. It is taken ONLY when ALL of these hold (gate at patch.R:466-480): zyme/turbo enabled (default TRUE); test.use=='wilcox'; slot=='data'; features is NULL (all features); node is NULL; latent.vars is NULL; mean.fxn is NULL; fc.name is NULL; only.pos is FALSE; densify is FALSE; max.cells.per.ident is Inf; min.diff.pct == -Inf; base == 2; no extra (...) args (length(dots)==0); group.by is NULL or 'ident'. Additional runtime guards fall back to upstream: data layer must be a single (joined) dgCMatrix (patch.R:487-493), and Idents() must name all cells (patch.R:499-501). Within that gate, the fast path DOES honor user-supplied values of the args it actually consumes: logfc.threshold (default 0.1, used at :570), min.pct (default 0.01, used at :569), return.thresh (default 1e-2, used at :587), min.cells.group (default 3, used at :561 to skip small clusters), assay, and base (only base==2). Per-cluster small-group skipping matches Seurat behavior (warn+skip rare clusters rather than global fallback). p-value adjustment is Bonferroni over n.features. This matches the benchmarked call (object + verbose=FALSE = all defaults) exactly, so the benchmark exercises the supported fast path.

Out-of-scope behavior

silent possibly wrong

Show detailed speedup table 10 runs

Dataset	Tier	Platform	Threads	Baseline	Optimized	Speedup	Memory	Concordance	Pass
`gastrulation_pijuansala`	ood_large3	Windows	32	33.03 min	9.55 s	199.8×	63.5 → 19.1 GB	—	pass
`heart_adult`	large	Windows	32	50.03 min	12.47 s	249.7×	74.4 → 29.1 GB	—	pass
`pbmc200k_glaucoma`	medium	Windows	32	11.91 min	4.26 s	167.8×	28.9 → 11.6 GB	—	pass
`pbmc68k`	small	Windows	32	1.61 min	751 ms	126.3×	6.6 → 2.9 GB	—	pass
`splitseq_rosenberg`	ood_large1	Windows	32	8.90 min	2.40 s	222.3×	18.0 → 7.2 GB	—	pass
`tms_ss2`	ood_large2	Windows	32	24.90 min	5.50 s	271.4×	38.0 → 11.7 GB	—	pass
`gastrulation_pijuansala`	ood_large3	macOS	1	7.42 min	14.11 s	31.5×	20.5 → 28.6 GB	—	fail
`pbmc68k_full`	medium	macOS	1	2.12 min	840 ms	151.6×	4.8 → 2.4 GB	—	pass
`splitseq_rosenberg`	ood_large1	macOS	1	5.63 min	2.67 s	126.8×	18.7 → 9.6 GB	—	pass
`tms_ss2`	ood_large2	macOS	1	21.48 min	5.83 s	221.3×	25.8 → 17.4 GB	—	pass

Frequently asked questions

Speeding up Seurat FindAllMarkers

Why is Seurat FindAllMarkers slow?

Seurat FindAllMarkers is CPU-bound, and the stock implementation in Seurat leaves performance on the table in its core numerical work. On the benchmark datasets the original takes 24.90 min where the AutoZyme path takes 5.50 s (271.4× faster).

How do I make Seurat FindAllMarkers faster?

Install AutoZyme and activate the Seurat patch, then keep using Seurat FindAllMarkers exactly as before. AutoZyme transparently substitutes the faster, output-validated path, up to 271.4× faster on the benchmark datasets, with no pipeline or API changes.

Does the AutoZyme speedup change the Seurat FindAllMarkers output?

No. The accelerated path returns bit-for-bit identical results to the original Seurat implementation (maximum absolute difference 0), checked by a frozen concordance gate on every benchmark dataset.

How do I install the Seurat speedup?

In R: install the autozyme package, then run library(autozyme) and autozyme::activate("seurat"). The patch applies automatically the next time you call FindAllMarkers.