TY - GEN
T1 - Using bug report similarity to enhance bug localisation
AU - Davies, Steven
AU - Roper, Marc
AU - Wood, Murray
PY - 2012/12/20
Y1 - 2012/12/20
N2 - Bug localisation techniques are proposed as a method to reduce the time developers spend on maintenance, allowing them to quickly find source code relevant to a bug. Some techniques are based on information retrieval methods, treating the source code as a corpus and the bug report as a query. While these have shown success, there remain a number of little-exploited additional sources of information which could enhance the techniques, including the textual similarity between bug reports themselves. Based on successful results in detecting duplicate bug reports, this work asks: if duplicate bugs reports, which by definition are fixed in the same source location, can be detected through the use of similar language, can bugs which are in the same location but not duplicates be detected in the same way? A technique using this information is implemented and evaluated on 372 bugs across 4 projects, and is found to improve performance on all projects. In particular, the technique increases the number of bugs where the first relevant method presented to developers is the first result from 6 to 27, and those in the top-10 from 50 to 57, showing that it can be successfully used to enhance existing bug localisation techniques.
AB - Bug localisation techniques are proposed as a method to reduce the time developers spend on maintenance, allowing them to quickly find source code relevant to a bug. Some techniques are based on information retrieval methods, treating the source code as a corpus and the bug report as a query. While these have shown success, there remain a number of little-exploited additional sources of information which could enhance the techniques, including the textual similarity between bug reports themselves. Based on successful results in detecting duplicate bug reports, this work asks: if duplicate bugs reports, which by definition are fixed in the same source location, can be detected through the use of similar language, can bugs which are in the same location but not duplicates be detected in the same way? A technique using this information is implemented and evaluated on 372 bugs across 4 projects, and is found to improve performance on all projects. In particular, the technique increases the number of bugs where the first relevant method presented to developers is the first result from 6 to 27, and those in the top-10 from 50 to 57, showing that it can be successfully used to enhance existing bug localisation techniques.
KW - bug localisation
KW - feature location
KW - mining software repositories
UR - https://www.scopus.com/pages/publications/84872360359
U2 - 10.1109/WCRE.2012.22
DO - 10.1109/WCRE.2012.22
M3 - Conference contribution
AN - SCOPUS:84872360359
SN - 9780769548913
T3 - Proceedings - Working Conference on Reverse Engineering, WCRE
SP - 125
EP - 134
BT - Proceedings - 19th Working Conference on Reverse Engineering, WCRE 2012
T2 - 19th Working Conference on Reverse Engineering, WCRE 2012
Y2 - 15 October 2012 through 18 October 2012
ER -