qSNE: quadratic rate t-SNE optimizer with automatic parameter tuning for large datasets




Häkkinen A, Koiranen J, Casado J, Kaipio K, Lehtonen O, Petrucci E, Hynninen J, Hietanen S, Carpén O, Pasquini L, Biffoni M, Lehtonen R, Hautaniemi S

2020

Bioinformatics

Bioinformatics (Oxford, England)

Bioinformatics

36

20

5086

5092

7

1367-4803

1367-4811

DOIhttps://doi.org/10.1093/bioinformatics/btaa637

https://research.utu.fi/converis/portal/Publication/50793942



Motivation: Non-parametric dimensionality reduction techniques, such as t-distributed stochastic neighbor embedding (t-SNE), are the most frequently used methods in the exploratory analysis of single-cell datasets. Current implementations scale poorly to massive datasets and often require downsampling or interpolative approximations, which can leave less-frequent populations undiscovered and much information unexploited.

Results: We implemented a fast t-SNE package, qSNE, which uses a quasi-Newton optimizer, allowing quadratic convergence rate and automatic perplexity (level of detail) optimizer. Our results show that these improvements make qSNE significantly faster than regular t-SNE packages and enables full analysis of large datasets, such as mass cytometry data, without downsampling.


Last updated on 2024-26-11 at 23:11