Speed up Seurat ScaleData: up to 24.7× faster, identical output

Benchmark charts

Switch benchmark platform; all charts update together

Speedup distribution

Each dot is one finalized dataset/thread run on Windows

log scale

pbmc68k

24.7×

splitseq_rosenberg

16.1×

pbmc200k_glaucoma

13.3×

heart_adult

11.7×

tms_ss2

8.47×

gastrulation_pijuansa…

8.28×

pbmc68ksplitseq_rosenbergpbmc200k_glaucomaheart_adulttms_ss2gastrulation_pijuansa…

Thread sweep

Speedup across finalized thread counts on Windows

pbmc68ksplitseq_rosenbergpbmc200k_glaucomaheart_adulttms_ss2gastrulation_pijuan…

Memory

Baseline vs optimized peak memory on Windows

baselineoptimized

What is accelerated

The public API stays the same; AutoZyme replaces only the supported fast path.

This task targets ScaleData in Seurat. The benchmarked result preserves the declared scientific output gate while reducing CPU runtime on the listed datasets.

Also searched as: scaling, z-score, standardize, center and scale.

Supported scope

Fast path (turbo_scale_sparse_full) handles the canonical default per-feature z-scaling: a v5 Seurat object (Assay5) with a unified "data" layer that is a dgCMatrix, scaling+centering every selected feature globally over all cells. Read full supported scope

Fast path (turbo_scale_sparse_full) handles the canonical default per-feature z-scaling: a v5 Seurat object (Assay5) with a unified "data" layer that is a dgCMatrix, scaling+centering every selected feature globally over all cells. All of these must hold simultaneously: vars.to.regress=NULL, split.by=NULL, model.use=="linear", use.umi=FALSE, do.scale=TRUE, do.center=TRUE (all guarded at patch.R:301-303). features may be NULL (resolves to VariableFeatures, else rownames, matching upstream ScaleData.Assay) or an explicit subset; assay may be NULL (DefaultAssay) or named. scale.max is honored and applied as a POSITIVE-tail-only cap (kernel lines 53,69), matching Seurat's FastSparseRowScale. Variance uses the n-1 (sample) denominator; zero-variance features get sd=1 (kernel lines 45-47), matching Seurat. Result is materialized as a dense features x cells matrix and written to the scale.data layer with the cells/features metadata flags updated (patch.R:343-353). The task's correctness gate is gene_cor_min >= 0.99 (not bit-exact), consistent with this numeric-close contract.

Out-of-scope behavior

silent fallback to upstream

Show detailed speedup table 11 runs

Dataset	Tier	Platform	Threads	Baseline	Optimized	Speedup	Memory	Concordance	Pass
`gastrulation_pijuansala`	ood_large3	Windows	32	6.40 s	810 ms	8.28×	40.6 → 33.4 GB	—	pass
`heart_adult`	large	Windows	32	16.59 s	1.41 s	11.7×	59.1 → 47.3 GB	—	pass
`pbmc200k_glaucoma`	medium	Windows	32	8.56 s	560 ms	13.3×	23.6 → 18.3 GB	—	pass
`pbmc68k`	small	Windows	32	2.51 s	110 ms	24.7×	5.4 → 4.3 GB	—	pass
`splitseq_rosenberg`	ood_large1	Windows	32	5.36 s	330 ms	16.1×	14.0 → 11.8 GB	—	pass
`tms_ss2`	ood_large2	Windows	32	4.36 s	580 ms	8.47×	24.1 → 19.9 GB	—	pass
`gastrulation_pijuansala`	ood_large3	macOS	1	6.40 s	1.24 s	5.16×	19.0 → 17.2 GB	—	pass
`pbmc200k_glaucoma`	medium	macOS	1	7.47 s	1.00 s	7.36×	19.2 → 11.9 GB	—	pass
`pbmc68k (inferred)`	small	macOS	1	2.65 s	283 ms	9.37×	7.6 → 5.5 GB	—	pass
`splitseq_rosenberg`	ood_large1	macOS	1	3.95 s	548 ms	7.24×	16.3 → 13.5 GB	—	pass
`tms_ss2`	ood_large2	macOS	4	3.45 s	605 ms	5.58×	13.9 → 10.0 GB	—	pass

Frequently asked questions

Speeding up Seurat ScaleData

Why is Seurat ScaleData slow?

Seurat ScaleData is CPU-bound, and the stock implementation in Seurat leaves performance on the table in its core numerical work. On the benchmark datasets the original takes 2.51 s where the AutoZyme path takes 110 ms (24.7× faster).

How do I make Seurat ScaleData faster?

Install AutoZyme and activate the Seurat patch, then keep using Seurat ScaleData exactly as before. AutoZyme transparently substitutes the faster, output-validated path, up to 24.7× faster on the benchmark datasets, with no pipeline or API changes.

Does the AutoZyme speedup change the Seurat ScaleData output?

No. The accelerated path returns bit-for-bit identical results to the original Seurat implementation (maximum absolute difference 0), checked by a frozen concordance gate on every benchmark dataset.

How do I install the Seurat speedup?

In R: install the autozyme package, then run library(autozyme) and autozyme::activate("seurat"). The patch applies automatically the next time you call ScaleData.