A1 Refereed original research article in a scientific journal

qSNE: quadratic rate t-SNE optimizer with automatic parameter tuning for large datasets




AuthorsHäkkinen A, Koiranen J, Casado J, Kaipio K, Lehtonen O, Petrucci E, Hynninen J, Hietanen S, Carpén O, Pasquini L, Biffoni M, Lehtonen R, Hautaniemi S

Publication year2020

JournalBioinformatics

Journal name in sourceBioinformatics (Oxford, England)

Journal acronymBioinformatics

Volume36

Issue20

First page 5086

Last page5092

Number of pages7

ISSN1367-4803

eISSN1367-4811

DOIhttps://doi.org/10.1093/bioinformatics/btaa637

Self-archived copy’s web addresshttps://research.utu.fi/converis/portal/Publication/50793942


Abstract

Motivation: Non-parametric dimensionality reduction techniques, such as t-distributed stochastic neighbor embedding (t-SNE), are the most frequently used methods in the exploratory analysis of single-cell datasets. Current implementations scale poorly to massive datasets and often require downsampling or interpolative approximations, which can leave less-frequent populations undiscovered and much information unexploited.

Results: We implemented a fast t-SNE package, qSNE, which uses a quasi-Newton optimizer, allowing quadratic convergence rate and automatic perplexity (level of detail) optimizer. Our results show that these improvements make qSNE significantly faster than regular t-SNE packages and enables full analysis of large datasets, such as mass cytometry data, without downsampling.


Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.





Last updated on 2024-26-11 at 23:11