skip to main content
10.1109/MSR.2017.26acmconferencesArticle/Chapter ViewAbstractPublication PagesicseConference Proceedingsconference-collections
research-article

Cost-effective build outcome prediction using cascaded classifiers

Published: 20 May 2017 Publication History

Abstract

Software developers use continuous integration to find defects in the early stage and reduce risk. But this process can be resource and time consuming, which decreases the efficiency of development. In this work, we adopt cascaded classifiers to predict the build outcome and study what kinds of attributes are potentially useful for this process. We emphasize on the "failed" instances which bring more cost. Our experiments reveal that our approach outperforms other commonly used classifiers. It reduces 51.7% of the waiting time and server workload while identifying 85.2% of the defective builds.

References

[1]
P. M. Duvall, Continuous integration. Pearson Education India, 2007.
[2]
M. Beller, G. Gousios, and A. Zaidman. Oops, my tests broke the build: An analysis of travis ci builds with github. In No. e1984v1. PeerJ Preprints, 2016.
[3]
A. E. Hassan, and K. Zhang. Using decision trees to predict the certification result of a build. In 21st IEEE/ACM International Conference on Automated Software Engineering, 2006.
[4]
E. A. Santos, and A. Hindle. Judging a commit by its cover: correlating commit message entropy with build status on travis-CI. In Proceedings of the 13th working conference on mining software repositories, 2016.
[5]
J. Finlay, R. Pears, and A. M. Connor. Data stream mining for predicting software build outcomes using source code metrics. In Information and Software Technology 56.2 (2014): 183--198.
[6]
P. A. Viola, and M. J. Jones. Robust Real-Time Face Detection. In International Journal of Computer Vision 57.2 (2004): 137--154
[7]
M. Beller, G. Gousios, and A. Zaidman. TravisTorrent: Synthesizing Travis CI and GitHub for Full-Stack Research on Continuous Integration. In Proceedings of the 14th working conference on mining software repositories, 2017.
[8]
S. Kim, T. Zimmermann, and E. J. Whitehead Jr. Predicting faults from cached history. In Proceedings of the 29th international conference on Software Engineering, 2007.
[9]
Y. Freund, and R. E. Schapire. A desicion-theoretic generalization of online learning and an application to boosting. In European conference on computational learning theory, 1995.
[10]
I. H. Witten, and E. Frank. Data Mining: Practical machine learning tools and techniques with Java implementations. Morgan Kaufmann, 1999.
[11]
N. Friedman, D. Geiger, and M. Goldszmidt. Bayesian network classifiers. In Machine learning 29.2--3 (1997): 131--163.
[12]
J. R. Quinlan. C4. 5: programs for machine learning. Morgan Kaufmann, 1993.

Cited By

View all
  • (2024)Practitioners’ Challenges and Perceptions of CI Build Failure Predictions at AtlassianCompanion Proceedings of the 32nd ACM International Conference on the Foundations of Software Engineering10.1145/3663529.3663856(370-381)Online publication date: 10-Jul-2024
  • (2024)Commit Artifact Preserving Build PredictionProceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis10.1145/3650212.3680356(1236-1248)Online publication date: 11-Sep-2024
  • (2024)The Impact of Code Ownership of DevOps Artefacts on the Outcome of DevOps CI BuildsProceedings of the 21st International Conference on Mining Software Repositories10.1145/3643991.3644924(543-555)Online publication date: 15-Apr-2024
  • Show More Cited By
  1. Cost-effective build outcome prediction using cascaded classifiers

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MSR '17: Proceedings of the 14th International Conference on Mining Software Repositories
    May 2017
    567 pages
    ISBN:9781538615447

    Sponsors

    Publisher

    IEEE Press

    Publication History

    Published: 20 May 2017

    Check for updates

    Author Tags

    1. continuous integration
    2. cost-sensitive learning
    3. ensemble learning
    4. software engineering
    5. software mining

    Qualifiers

    • Research-article

    Conference

    ICSE '17
    Sponsor:

    Upcoming Conference

    ICSE 2025

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)3
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 01 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Practitioners’ Challenges and Perceptions of CI Build Failure Predictions at AtlassianCompanion Proceedings of the 32nd ACM International Conference on the Foundations of Software Engineering10.1145/3663529.3663856(370-381)Online publication date: 10-Jul-2024
    • (2024)Commit Artifact Preserving Build PredictionProceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis10.1145/3650212.3680356(1236-1248)Online publication date: 11-Sep-2024
    • (2024)The Impact of Code Ownership of DevOps Artefacts on the Outcome of DevOps CI BuildsProceedings of the 21st International Conference on Mining Software Repositories10.1145/3643991.3644924(543-555)Online publication date: 15-Apr-2024
    • (2024)RavenBuild: Context, Relevance, and Dependency Aware Build Outcome PredictionProceedings of the ACM on Software Engineering10.1145/36437711:FSE(996-1018)Online publication date: 12-Jul-2024
    • (2023)Accelerating Continuous Integration with Parallel Batch TestingProceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering10.1145/3611643.3616255(55-67)Online publication date: 30-Nov-2023
    • (2023)HybridCISave: A Combined Build and Test Selection Approach in Continuous IntegrationACM Transactions on Software Engineering and Methodology10.1145/357603832:4(1-39)Online publication date: 26-May-2023
    • (2023)The Why, When, What, and How About Predictive Continuous Integration: A Simulation-Based InvestigationIEEE Transactions on Software Engineering10.1109/TSE.2023.333051049:12(5223-5249)Online publication date: 1-Dec-2023
    • (2022)BuildSonic: Detecting and Repairing Performance-Related Configuration Smells for Continuous Integration BuildsProceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering10.1145/3551349.3556923(1-13)Online publication date: 10-Oct-2022
    • (2021)Reducing cost in continuous integration with a collection of build selection approachesProceedings of the 29th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering10.1145/3468264.3473103(1650-1654)Online publication date: 20-Aug-2021
    • (2020)An Experimental Evaluation of Imbalanced Learning and Time-Series Validation in the Context of CI/CD PredictionProceedings of the 24th International Conference on Evaluation and Assessment in Software Engineering10.1145/3383219.3383222(21-30)Online publication date: 15-Apr-2020
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media