Feature combination to alleviate hubness problem of source code representation for bug localization

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Deep learning-based bug localization (DLBL) can effectively reduce software maintenance costs. However, the hubness problem inherent in the high-dimensional vectors that represent source code files in DLBL leads to inaccurate bug localization. To address this problem, we analyzed 10,359 defects and found that the call graph and flow of the program can distinguish buggy files from non-buggy files and provide functional semantic information for bug localization. Based on these observations, we propose a feature combination that alleviates the hubness problem of the source file representation by using functional semantic information. The proposed method models functional semantics with the call graph and program flow, based on the raw abstract syntax tree. We evaluated the effectiveness of the proposed approach on 19 widely used projects and conducted an ablation study. The experimental results show that the proposed method improves on current approaches by 12% to 45% by differentiating buggy files from non-buggy files. The ablation study confirms the significance of functional information: removing functional semantics degrades performance by 8.5%.
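The abstract does not define how hubness is measured. As a rough illustration only, and not the authors' method, hubness is commonly quantified by the k-occurrence count: how often each vector appears in the k-nearest-neighbor lists of the other vectors. A strongly right-skewed distribution of these counts indicates that a few "hub" vectors dominate the neighbor lists. The sketch below assumes Euclidean distance and plain Python lists; the names `k_occurrence`, `skewness`, and `random_vectors` are hypothetical helpers introduced for this example.

```python
import math
import random


def k_occurrence(vectors, k):
    """Count how often each vector appears in the k-nearest-neighbor
    lists of the other vectors (Euclidean distance, ties broken by index)."""
    n = len(vectors)
    counts = [0] * n
    for i in range(n):
        # Distances from vector i to every other vector.
        dists = [(math.dist(vectors[i], vectors[j]), j) for j in range(n) if j != i]
        dists.sort()
        for _, j in dists[:k]:
            counts[j] += 1
    return counts


def skewness(values):
    """Population skewness; a large positive value suggests a few hubs."""
    n = len(values)
    mean = sum(values) / n
    var = sum((v - mean) ** 2 for v in values) / n
    if var == 0:
        return 0.0
    third = sum((v - mean) ** 3 for v in values) / n
    return third / var ** 1.5


def random_vectors(n, dim, seed=0):
    """Hypothetical stand-in for file embeddings: i.i.d. Gaussian vectors."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n)]


if __name__ == "__main__":
    for dim in (3, 100):
        counts = k_occurrence(random_vectors(200, dim), k=5)
        print(f"dim={dim}: max k-occurrence={max(counts)}, skewness={skewness(counts):.2f}")
```

In a retrieval setting such as bug localization, a file whose vector has a high k-occurrence count would tend to be ranked near the top for many unrelated bug reports, which is the kind of inaccuracy the abstract attributes to hubness.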

Original language: English
Title of host publication: Proceedings - 2020 27th Asia-Pacific Software Engineering Conference, APSEC 2020
Publisher: IEEE Computer Society
Pages: 511-512
Number of pages: 2
ISBN (Electronic): 9781728195537
State: Published - Dec 2020
Event: 27th Asia-Pacific Software Engineering Conference, APSEC 2020 - Singapore, Singapore
Duration: 1 Dec 2020 to 4 Dec 2020

Publication series

Name: Proceedings - Asia-Pacific Software Engineering Conference, APSEC
Volume: 2020-December
ISSN (Print): 1530-1362

Conference

Conference: 27th Asia-Pacific Software Engineering Conference, APSEC 2020
Country/Territory: Singapore
City: Singapore
Period: 1/12/20 to 4/12/20

Keywords

  • Abstract Syntax Tree
  • Bug Localization
  • Code Representation
  • Functional Semantics
