TY - GEN
T1 - A novel approach to automatic query reformulation for IR-based bug localization
AU - Kim, Misoo
AU - Lee, Eunseok
N1 - Publisher Copyright:
© 2019 Association for Computing Machinery.
PY - 2019
Y1 - 2019
N2 - Automatic query reformulation techniques for Information Retrieval based Bug Localization (IRBL) have been proposed to improve the quality of queries and IRBL performance. Recently proposed techniques determine the quality of queries via the bugs' description and reformulate them using important terms in the top-N source files retrieved by the initial query. However, the bugs' description may not contain enough information about the bugs, and the retrieved top-N files may not always provide important terms. In this paper, we propose a novel automatic query reformulation approach to improve IRBL performance beyond that of a recent technique. Our method expands bug reports using attachments and expands queries by reducing the noisy terms in them. We experimented with 1,546 bug reports. According to our results, we found that the quality of 70 reports was wrongly determined, and our method improved IRBL performance by up to 118% for these reports. Moreover, compared with a state-of-the-art technique, our method resulted in improvements of approximately 17% in Top-1, 11% in MRR@10, and 10% in MAP@10.
AB - Automatic query reformulation techniques for Information Retrieval based Bug Localization (IRBL) have been proposed to improve the quality of queries and IRBL performance. Recently proposed techniques determine the quality of queries via the bugs' description and reformulate them using important terms in the top-N source files retrieved by the initial query. However, the bugs' description may not contain enough information about the bugs, and the retrieved top-N files may not always provide important terms. In this paper, we propose a novel automatic query reformulation approach to improve IRBL performance beyond that of a recent technique. Our method expands bug reports using attachments and expands queries by reducing the noisy terms in them. We experimented with 1,546 bug reports. According to our results, we found that the quality of 70 reports was wrongly determined, and our method improved IRBL performance by up to 118% for these reports. Moreover, compared with a state-of-the-art technique, our method resulted in improvements of approximately 17% in Top-1, 11% in MRR@10, and 10% in MAP@10.
KW - Automatic Debugging
KW - Automatic Query Reformulation
KW - Bug Report
KW - Information Retrieval-based Bug Localization
KW - Test File
UR - https://www.scopus.com/pages/publications/85065638336
U2 - 10.1145/3297280.3297451
DO - 10.1145/3297280.3297451
M3 - Conference contribution
AN - SCOPUS:85065638336
SN - 9781450359337
T3 - Proceedings of the ACM Symposium on Applied Computing
SP - 1752
EP - 1759
BT - Proceedings of the ACM Symposium on Applied Computing
PB - Association for Computing Machinery
T2 - 34th Annual ACM Symposium on Applied Computing, SAC 2019
Y2 - 8 April 2019 through 12 April 2019
ER -