Joint t-SNE for Comparable Projections of Multiple High-Dimensional Datasets

  • Yinqiao Wang
  • , Lu Chen
  • , Jaemin Jo
  • , Yunhai Wang

Research output: Contribution to journalArticlepeer-review

25 Scopus citations

Abstract

We present Joint t-Stochastic Neighbor Embedding (Joint t-SNE), a technique to generate comparable projections of multiple high-dimensional datasets. Although t-SNE has been widely employed to visualize high-dimensional datasets from various domains, it is limited to projecting a single dataset. When a series of high-dimensional datasets, such as datasets changing over time, is projected independently using t-SNE, misaligned layouts are obtained. Even items with identical features across datasets are projected to different locations, making the technique unsuitable for comparison tasks. To tackle this problem, we introduce edge similarity, which captures the similarities between two adjacent time frames based on the Graphlet Frequency Distribution (GFD). We then integrate a novel loss term into the t-SNE loss function, which we call vector constraints, to preserve the vectors between projected points across the projections, allowing these points to serve as visual landmarks for direct comparisons between projections. Using synthetic datasets whose ground-Truth structures are known, we show that Joint t-SNE outperforms existing techniques, including Dynamic t-SNE, in terms of local coherence error, Kullback-Leibler divergence, and neighborhood preservation. We also showcase a real-world use case to visualize and compare the activation of different layers of a neural network.

Original languageEnglish
Pages (from-to)623-632
Number of pages10
JournalIEEE Transactions on Visualization and Computer Graphics
Volume28
Issue number1
DOIs
StatePublished - 1 Jan 2022

Keywords

  • embedding
  • High-dimensional data
  • projection
  • t-stochastic neighbor embedding

Fingerprint

Dive into the research topics of 'Joint t-SNE for Comparable Projections of Multiple High-Dimensional Datasets'. Together they form a unique fingerprint.

Cite this