<?xml version="1.0"?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:dc="http://purl.org/dc/elements/1.1/"><rdf:Description rdf:about="https://repozitorij.uni-lj.si/IzpisGradiva.php?id=109337"><dc:title>Embedding to Reference t-SNE Space Addresses Batch Effects in Single-Cell Classification</dc:title><dc:creator>Poličar,	Pavlin Gregor	(Avtor)
	</dc:creator><dc:creator>Zupan,	Blaž	(Mentor)
	</dc:creator><dc:subject>batch effects</dc:subject><dc:subject>embedding</dc:subject><dc:subject>t-SNE</dc:subject><dc:subject>visualization</dc:subject><dc:subject>single-cell transcriptomics</dc:subject><dc:subject>data integration</dc:subject><dc:subject>domain adaptation</dc:subject><dc:description>Dimensionality reduction techniques, such as t-SNE, can construct informative visualizations of high-dimensional data. When working with multiple data sets, a straightforward application of these methods often fails; instead of revealing underlying classes, the resulting visualizations expose data set-specific clusters. To circumvent these batch effects, we propose a principled embedding procedure that enables the addition of new data points into existing t-SNE embeddings. We provide an open-source implementation of the proposed method and demonstrate the utility of our approach with an analysis of six recently published single-cell gene expression data sets containing up to tens of thousands of cells and thousands of genes. We present surprising evidence that our computationally more direct procedure solves the batch effect problem, one of the core challenges in the analysis of gene expression data, and enables the reuse of t-SNE embeddings, paving the way for interpretable visualizations of high-dimensional data sets.</dc:description><dc:date>2019</dc:date><dc:date>2019-08-30 13:20:04</dc:date><dc:type>Magistrsko delo/naloga</dc:type><dc:identifier>109337</dc:identifier><dc:language>sl</dc:language></rdf:Description></rdf:RDF>