skip to main content
research-article

Data Mining for Software Engineering

Published: 01 August 2009 Publication History

Abstract

To improve software productivity and quality, software engineers are increasingly applying data mining algorithms to various software engineering tasks. However, mining SE data poses several challenges. The authors present various algorithms to effectively mine sequences, graphs, and text from such data.

Cited By

View all
  • (2024)Revisiting the reproducibility of empirical software engineering studies based on data retrieved from development repositoriesInformation and Software Technology10.1016/j.infsof.2023.107318164:COnline publication date: 10-Jan-2024
  • (2023)Machine/Deep Learning for Software Engineering: A Systematic Literature ReviewIEEE Transactions on Software Engineering10.1109/TSE.2022.317334649:3(1188-1231)Online publication date: 1-Mar-2023
  • (2021)GrumPy: an automated approach to simplify issue data analysis for newcomersProceedings of the XXXV Brazilian Symposium on Software Engineering10.1145/3474624.3476012(33-38)Online publication date: 27-Sep-2021
  • Show More Cited By

Recommendations

Reviews

Alexis Leon

Nowadays, almost every aspect of life is touched and controlled by software. As software becomes a more prominent presence, the task of developing it becomes more difficult. Because software is used for critical applications and for controlling sophisticated equipment and systems, even a small bug can potentially have catastrophic consequences. Today's software projects are becoming more complex in size, sophistication, and the technologies used. As demand for software increases, software developers must find ways to improve their productivity, efficiency, and the quality of their product. This paper explores the use of data mining algorithms to perform software engineering (SE) tasks, and improve the quality and productivity of software developers. Organizations have large amounts of SE data, such as documents, reports, source code, and bug reports; mining this data can help organizations solve many SE problems. Data mining algorithms can help software engineers find the correct usage of an application programming interface (API), the impact of a change in source code, and potential bugs in the software. This paper explains the mining technology, the various challenges of mining SE data, and common data mining algorithms. Using examples, Xie et al. explain the various data, graph, and text mining algorithms for mining SE data: iterative patterns, temporal rules, sequence diagrams, finite state machines, sequence association rules, discriminative graph mining, graph classification to assist in debugging, and an algorithm to detect duplicate bug reporting. The results from their experiments illustrate the superiority of these techniques. This excellent paper will be useful to software engineers, productivity improvement professionals, and people and organizations that develop tools for improving the quality and efficiency of software development. Online Computing Reviews Service

Access critical reviews of Computing literature here

Become a reviewer for Computing Reviews.

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Computer
Computer  Volume 42, Issue 8
August 2009
101 pages

Publisher

IEEE Computer Society Press

Washington, DC, United States

Publication History

Published: 01 August 2009

Author Tags

  1. Computational intelligence
  2. Data mining
  3. Design and test
  4. Software engineering

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 10 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Revisiting the reproducibility of empirical software engineering studies based on data retrieved from development repositoriesInformation and Software Technology10.1016/j.infsof.2023.107318164:COnline publication date: 10-Jan-2024
  • (2023)Machine/Deep Learning for Software Engineering: A Systematic Literature ReviewIEEE Transactions on Software Engineering10.1109/TSE.2022.317334649:3(1188-1231)Online publication date: 1-Mar-2023
  • (2021)GrumPy: an automated approach to simplify issue data analysis for newcomersProceedings of the XXXV Brazilian Symposium on Software Engineering10.1145/3474624.3476012(33-38)Online publication date: 27-Sep-2021
  • (2021)A Mining Software Repository Extended Cookbook: Lessons learned from a literature reviewProceedings of the XXXV Brazilian Symposium on Software Engineering10.1145/3474624.3474627(1-10)Online publication date: 27-Sep-2021
  • (2021)FPGA/GPU-based Acceleration for Frequent Itemsets Mining: A Comprehensive ReviewACM Computing Surveys10.1145/347228954:9(1-35)Online publication date: 8-Oct-2021
  • (2021)Mining Treatment-Outcome Constructs from Sequential Software Engineering DataIEEE Transactions on Software Engineering10.1109/TSE.2019.289295647:2(393-411)Online publication date: 10-Feb-2021
  • (2021)Analysing Time-Stamped Co-Editing Networks in Software Development Teams using git2netEmpirical Software Engineering10.1007/s10664-020-09928-226:4Online publication date: 1-Jul-2021
  • (2019)To apply Data Mining for Classification of Crowd sourced Software RequirementsProceedings of the 8th International Conference on Software and Information Engineering10.1145/3328833.3328837(42-46)Online publication date: 9-Apr-2019
  • (2019)git2netProceedings of the 16th International Conference on Mining Software Repositories10.1109/MSR.2019.00070(433-444)Online publication date: 26-May-2019
  • (2018)Mining container image repositories for software configuration and beyondProceedings of the 40th International Conference on Software Engineering: New Ideas and Emerging Results10.1145/3183399.3183403(49-52)Online publication date: 27-May-2018
  • Show More Cited By

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media