TY - GEN
T1 - SwiftTuna
T2 - 10th IEEE Pacific Visualization Symposium, PacificVis 2017
AU - Jo, Jaemin
AU - Kim, Wonjae
AU - Yoo, Seunghoon
AU - Kim, Bohyoung
AU - Seo, Jinwook
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/9/11
Y1 - 2017/9/11
N2 - For interactive exploration of large-scale data, a preprocessing scheme (e.g., data cubes) has often been used to summarize the data and provide low-latency responses. However, such a scheme suffers from a prohibitively large amount of memory footprint as more dimensions are involved in querying, and a strong prerequisite that specific data structures have to be built from the data before querying. In this paper, we present SwiftTuna, a holistic system that streamlines the visual information seeking process on large-scale multidimensional data. SwiftTuna exploits an in-memory computing engine, Apache Spark, to achieve both scalability and performance without building precomputed data structures. We also present a novel interactive visualization technique, tailed charts, to facilitate large-scale multidimensional data exploration. To support responsive querying on large-scale data, SwiftTuna leverages an incremental processing approach, providing immediate low-fidelity responses (i.e., prompt responses) as well as delayed high-fidelity responses (i.e., incremental responses). Our performance evaluation demonstrates that SwiftTuna allows data exploration of a real-world dataset with four billion records while preserving the latency between incremental responses within a few seconds.
AB - For interactive exploration of large-scale data, a preprocessing scheme (e.g., data cubes) has often been used to summarize the data and provide low-latency responses. However, such a scheme suffers from a prohibitively large amount of memory footprint as more dimensions are involved in querying, and a strong prerequisite that specific data structures have to be built from the data before querying. In this paper, we present SwiftTuna, a holistic system that streamlines the visual information seeking process on large-scale multidimensional data. SwiftTuna exploits an in-memory computing engine, Apache Spark, to achieve both scalability and performance without building precomputed data structures. We also present a novel interactive visualization technique, tailed charts, to facilitate large-scale multidimensional data exploration. To support responsive querying on large-scale data, SwiftTuna leverages an incremental processing approach, providing immediate low-fidelity responses (i.e., prompt responses) as well as delayed high-fidelity responses (i.e., incremental responses). Our performance evaluation demonstrates that SwiftTuna allows data exploration of a real-world dataset with four billion records while preserving the latency between incremental responses within a few seconds.
KW - Exploratory analysis
KW - Incremental visualization
KW - Information visualization
KW - Large-scale data exploration
KW - Scalability
UR - https://www.scopus.com/pages/publications/85031996789
U2 - 10.1109/PACIFICVIS.2017.8031587
DO - 10.1109/PACIFICVIS.2017.8031587
M3 - Conference contribution
AN - SCOPUS:85031996789
T3 - IEEE Pacific Visualization Symposium
SP - 131
EP - 140
BT - 2017 IEEE Pacific Visualization Symposium, PacificVis 2017 - Proceedings
A2 - Wu, Yingcai
A2 - Weiskopf, Daniel
A2 - Dwyer, Tim
PB - IEEE Computer Society
Y2 - 18 April 2017 through 21 April 2017
ER -