Analyzing design choices for distributed multidimensional indexing

Beomseok Nam, Alan Sussman

Research output: Contribution to journalArticlepeer-review

7 Scopus citations

Abstract

Scientific datasets are often stored on distributed archival storage systems, because geographically distributed sensor devices store the datasets in their local machines and also because the size of scientific datasets demands large amount of disk space. Multidimensional indexing techniques have been shown to greatly improve range query performance into large scientific datasets. In this paper, we discuss several ways of distributing a multidimensional index in order to speed up access to large distributed scientific datasets. This paper compares the designs, challenges, and problems for distributed multidimensional indexing schemes, and provides a comprehensive performance study of distributed indexing to provide guidelines to choose a distributed multidimensional index for a specific data analysis application.

Original languageEnglish
Pages (from-to)1552-1576
Number of pages25
JournalJournal of Supercomputing
Volume59
Issue number3
DOIs
StatePublished - Mar 2012
Externally publishedYes

Keywords

  • Data intensive computing
  • Decentralized indexing
  • Distributed indexing
  • Multidimensional indexing

Fingerprint

Dive into the research topics of 'Analyzing design choices for distributed multidimensional indexing'. Together they form a unique fingerprint.

Cite this