Optimizing skyline queries over incomplete data

Research output: Contribution to journalArticlepeer-review

Abstract

Skyline queries have been widely used as an attractive operator in multi-criteria decision making applications. Because of the intuitive notion of skyline queries, many skyline algorithms have been developed in various data settings. However, most of the skyline algorithms rely on the assumption of completeness, i.e., all values of points are known. In many cases, because this assumption does not hold, conventional skyline algorithms cannot be applied. To handle incomplete data, existing work redefines the dominance notion by using the common subspace between points. However, it can incur too many pairwise comparisons over incomplete data. To address this problem, we first propose a new sorting-based bucket skyline algorithm using two optimization techniques: bucket- and point-level orders. In case that too few or no skyline points exist over incomplete data, we develop a novel skyline ranking method that adjusts two user-specific parameters for retrieving meaningful skyline points. Lastly, we empirically evaluate the efficiency and effectiveness of our proposed algorithms over both synthetic and real-life datasets.

Original languageEnglish
Pages (from-to)14-28
Number of pages15
JournalInformation Sciences
Volume361-362
DOIs
StatePublished - 20 Sep 2016
Externally publishedYes

Keywords

  • Cyclicity
  • Dominance
  • Incomparability
  • Incomplete data
  • Intransitivity
  • Skyline queries

Fingerprint

Dive into the research topics of 'Optimizing skyline queries over incomplete data'. Together they form a unique fingerprint.

Cite this