Skip to main navigation Skip to search Skip to main content

EclipseMR: Distributed and Parallel Task Processing with Consistent Hashing

  • Vicente A.B. Sanchez
  • , Wonbae Kim
  • , Youngmoon Eom
  • , Kibeom Jin
  • , Moohyeon Nam
  • , Deukyeon Hwang
  • , Jik Soo Kim
  • , Beomseok Nam
  • Korea Military Academy
  • Myongji University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We present EclipseMR, a novel MapReduce framework prototype that efficiently utilizes a large distributed memory in cluster environments. EclipseMR consists of double-layered consistent hash rings-a decentralized DHT-based file system and an in-memory key-value store that employs consistent hashing. The in-memory key-value store in EclipseMR is designed not only to cache local data but also remote data as well so that globally popular data can be distributed across cluster serversand found by consistent hashing.In order to leverage large distributed memories and increase the cache hit ratio, we propose a locality-Aware fair (LAF) job scheduler that works as the load balancer for the distributed in-memorycaches. Based on hash keys, the LAF job scheduler predicts which servers have reusable data, and assigns tasks to the servers so that they can be reused. The LAF job scheduler makes its best efforts to strike a balance between data locality and load balance, which often conflict with each other. We evaluate EclipseMR by quantifying the performance effect of each component using several representative MapReduce applications and show EclipseMR is faster than Hadoop andSpark by a large margin for various applications.

Original languageEnglish
Title of host publicationProceedings - 2017 IEEE International Conference on Cluster Computing, CLUSTER 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages322-332
Number of pages11
ISBN (Electronic)9781538623268
DOIs
StatePublished - 22 Sep 2017
Externally publishedYes
Event2017 IEEE International Conference on Cluster Computing, CLUSTER 2017 - Honolulu, United States
Duration: 5 Sep 20178 Sep 2017

Publication series

NameProceedings - IEEE International Conference on Cluster Computing, ICCC
Volume2017-September
ISSN (Print)1552-5244

Conference

Conference2017 IEEE International Conference on Cluster Computing, CLUSTER 2017
Country/TerritoryUnited States
CityHonolulu
Period5/09/178/09/17

Keywords

  • Consistent Hashing
  • Distributed Caching
  • Distributed Hash Table
  • MapReduce

Fingerprint

Dive into the research topics of 'EclipseMR: Distributed and Parallel Task Processing with Consistent Hashing'. Together they form a unique fingerprint.

Cite this