Skip to main navigation Skip to search Skip to main content

Fast, Light-weight, and Accurate Performance Evaluation using Representative Datacenter Behaviors

  • Jaewon Lee
  • , Dongmoon Min
  • , Ilkwon Byun
  • , Hanhwi Jang
  • , Jangwoo Kim
  • Meta
  • Seoul National University
  • Ajou University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Datacenters rapidly evolve by adopting new features such as new hardware deployment and software patches. Adopting a new feature requires an accurate evaluation of its impact to minimize the risk to the multi-million dollar computing infrastructure. However, a comprehensive performance analysis of a datacenter is extremely challenging due to its cost and multitenancy. Evaluating the performance in a live datacenter is accurate but prohibitive to prevent any damage to production services. Using conventional load-testing benchmarks on small-scale testbeds is imprecise as they do not consider the effect of other co-located jobs. In this paper, we propose FLARE, a fast, lightweight, and accurate performance evaluation method using representative datacenter behaviors. The key idea is to extract a small set of representative job colocation scenarios from all possible job colocations in a target datacenter. FLARE systematically characterizes and groups job colocations according to performance and resource metrics, providing high-level insights into the datacenter's behaviors. Then, it reconstructs the colocations on a testbed and allows accurate feature evaluation with load-testing benchmarks. We evaluate FLARE using an in-house datacenter and three features: cache sizing, DVFS, and SMT configurations. FLARE accurately estimates the impact of features with less than 1% errors by incurring 50× and 10× lower evaluation costs compared to full datacenter and sampling-based evaluation, respectively.

Original languageEnglish
Title of host publicationMiddleware 2023 - Proceedings of the 24th ACM/IFIP International Middleware Conference
PublisherAssociation for Computing Machinery, Inc
Pages220-233
Number of pages14
ISBN (Electronic)9798400701771
DOIs
StatePublished - 27 Nov 2023
Externally publishedYes
Event24th ACM/IFIP International Middleware Conference, Middleware 2023 - Bologna, Italy
Duration: 11 Dec 202315 Dec 2023

Publication series

NameMiddleware 2023 - Proceedings of the 24th ACM/IFIP International Middleware Conference

Conference

Conference24th ACM/IFIP International Middleware Conference, Middleware 2023
Country/TerritoryItaly
CityBologna
Period11/12/2315/12/23

Keywords

  • datacenters
  • performance modeling
  • sampling-based evaluation

Fingerprint

Dive into the research topics of 'Fast, Light-weight, and Accurate Performance Evaluation using Representative Datacenter Behaviors'. Together they form a unique fingerprint.

Cite this