Welcome to MSR 2012, the Ninth International Working Conference on Mining Software Repositories, held June 2-3 in Zurich, Switzerland, and co-located with the 2012 International Conference on Software Engineering (ICSE 2012).
The first decade of MSR is coming quickly to and end: This is the fifth year that MSR has been a Working Conference, building on the strong foundation of four years as a successful ICSE workshop. We are happy to report that the community of MSR researchers continues to grow, while at the same time the research being performed is maturing. This year saw a further increase in the total number of submissions: 64 full paper submissions (10 pages) and 22 short paper submissions (4 pages) for a total of 86. This total is 8 more than the 78 total submissions in 2011. Full paper submissions increased by 3 (64 vs. 61) while short paper submissions increased by 5 (22 vs. 17).
Proceeding Downloads
Software analytics in practice: approaches and experiences
A huge wealth of various data exists in the software development process, and hidden in the data is information about the quality of software and services as well as the dynamics of software development. With various analytic and computing technologies, ...
Towards improving bug tracking systems with game mechanisms
Low bug report quality and human conflicts pose challenges to keep bug tracking systems productive. This work proposes to address these issues by applying game mechanisms to bug tracking systems. We investigate the use of game mechanisms in Stack ...
GHTorrent: GitHub's data from a firehose
A common requirement of many empirical software engineering studies is the acquisition and curation of data from software repositories. During the last few years, GitHub has emerged as a popular project hosting, mirroring and collaboration platform. ...
MIC check: a correlation tactic for ESE data
Empirical software engineering researchers are concerned with understanding the relationships between outcomes of interest, e.g. defects, and process and product measures. The use of correlations to uncover strong relationships is a natural precursor to ...
A linked data platform for mining software repositories
- Iman Keivanloo,
- Christopher Forbes,
- Aseel Hmood,
- Mostafa Erfani,
- Christopher Neal,
- George Peristerakis,
- Juergen Rilling
The mining of software repositories involves the extraction of both basic and value-added information from existing software repositories. The repositories will be mined to extract facts by different stakeholders (e.g. researchers, managers) and for ...
How distributed version control systems impact open source software projects
Centralized Version Control Systems have been used by many open source projects for a long time. However, in recent years several widely-known projects have migrated their repositories to Distributed Version Control Systems, such as Mercurial, Bazaar, ...
An empirical study of supplementary bug fixes
A recent study finds that errors of omission are harder for programmers to detect than errors of commission. While several change recommendation systems already exist to prevent or reduce omission errors during software development, there have been very ...
Incorporating version histories in information retrieval based bug localization
Fast and accurate localization of software defects continues to be a difficult problem since defects can emanate from a large variety of sources and can often be intricate in nature. In this paper, we show how version histories of a software project can ...
Think locally, act globally: improving defect and effort prediction models
Much research energy in software engineering is focused on the creation of effort and defect prediction models. Such models are important means for practitioners to judge their current project situation, optimize the allocation of their resources, and ...
Characterizing verification of bug fixes in two open source IDEs
Data from bug repositories have been used to enable inquiries about software product and process quality. Unfortunately, such repositories often contain inaccurate, inconsistent, or missing data, which can originate misleading results. In this paper, we ...
Are faults localizable?
Many fault localization techniques have been proposed to facilitate debugging activities. Most of them attempt to pinpoint the location of faults (i.e., localize faults) based on a set of failing and correct executions and expect debuggers to ...
Green mining: a methodology of relating software change to power consumption
Power consumption is becoming more and more important with the increased popularity of smart-phones, tablets and laptops. The threat of reducing a customer's battery-life now hangs over the software developer who asks, "will this next change be the one ...
Analysis of customer satisfaction survey data
Cisco Systems, Inc., conducts a customer satisfaction survey (CSAT) each year to gauge customer sentiment regarding Cisco products, technical support, partner- and Cisco-provided technical services, order fulfillment, and a number of other aspects of ...
Mining usage data and development artifacts
Software repository mining techniques generally focus on analyzing, unifying, and querying different kinds of development artifacts, such as source code, version control meta-data, defect tracking data, and electronic communication. In this work, we ...
App store mining and analysis: MSR for app stores
This paper introduces app store mining and analysis as a form of software repository mining. Unlike other software repositories traditionally used in MSR work, app stores usually do not provide source code. However, they do provide a wealth of other ...
Mining challenge 2012: the Android platform
The MSR Challenge offers researchers and practitioners in the area of Mining Software Repositories a common data set and asks them to put their mining tools and approaches on a dare. This year, the challenge is on the Android platform. We provided the ...
Bug introducing changes: a case study with Android
Changes, a rather inevitable part of software development can cause maintenance implications if they introduce bugs into the system. By isolating and characterizing these bug introducing changes it is possible to uncover potential risky source code ...
Trendy bugs: topic trends in the Android bug reports
Studying vast volumes of bug and issue discussions can give an understanding of what the community has been most concerned about, however the magnitude of documents can overload the analyst. We present an approach to analyze the development of the ...
Do the stars align?: multidimensional analysis of Android's layered architecture
In this paper we mine the Android bug tracker repository and study the characteristics of the architectural layers of the Android system. We have identified the locality of the Android bugs in the architectural layers of the its infrastructure, and ...
The build dependency perspective of Android's concrete architecture
Android is an operating system designed specifically for mobile devices. It has a layered architecture. In this paper, we extract Android's concrete layered architecture by analyzing the build dependency relation between Android sub-projects and use it ...
MINCE: mining change history of Android project
An analysis of commit history of Android reveals that Android has a code base of 550K files, where on an average each file has been modified 8.7 times. 41% of files have been modified at-least once. In terms of contributors, it has an overall ...
Mining for localization in Android
Localization, and in particular translation, is a key aspect of modern end-user software applications. Open source systems have traditionally taken advantage of distributed and volunteer collaboration to carry localization tasks. In this paper, we will ...
The evolution of the social programmer
Social media has revolutionized how humans create and curate knowledge artifacts [1]. It has increased individual engagement, broadened community participation and led to the formation of new social networks. This paradigm shift is particularly evident ...
Why do software packages conflict?
Determining whether two or more packages cannot be installed together is an important issue in the quality assurance process of package-based distributions. Unfortunately, the sheer number of different configurations to test makes this task particularly ...
Discovering complete API rules with mutation testing
Specifications are important for many activities during software construction and maintenance process such as testing, verification, debugging and repairing. Despite their importance, specifications are often missing, informal or incomplete because they ...
Inferring semantically related words from software context
Code search is an integral part of software development and program comprehension. The difficulty of code search lies in the inability to guess the exact words used in the code. Therefore, it is crucial for keyword-based code search to expand queries ...
The evolution of data races
Concurrency bugs are notoriously difficult to find and fix. Several prior empirical studies have identified the prevalence and challenges of concurrency bugs in open source projects, and several existing tools can be used to identify concurrency errors ...
Error mining: bug detection through comparison with large code databases
Bugs are hard to find. Static analysis tools are capable of systematically detecting predefined sets of errors, but extending them to find new error types requires a deep understanding of the underlying programming language. Manual reviews on the other ...
Do faster releases improve software quality?: an empirical case study of Mozilla Firefox
Nowadays, many software companies are shifting from the traditional 18-month release cycle to shorter release cycles. For example, Google Chrome and Mozilla Firefox release new versions every 6 weeks. These shorter release cycles reduce the users' ...
Explaining software defects using topic models
Researchers have proposed various metrics based on measurable aspects of the source code entities (e.g., methods, classes, files, or modules) and the social structure of a software project in an effort to explain the relationships between software ...
Index Terms
- Proceedings of the 9th IEEE Working Conference on Mining Software Repositories