Towards automatic generation of short summaries of commits

S Jiang, C McMillan - 2017 IEEE/ACM 25th International Conference on Program …, 2017 - ieeexplore.ieee.org
Committing to a version control system means submitting a software change to the system. Each commit can have a message to describe the submission. Several approaches have been proposed to automatically generate the content of such messages. However, the quality of the automatically generated messages falls far short of what humans write. In studying the differences between auto-generated and human-written messages, we found that 82% of the human-written messages have only one sentence, while the automatically generated messages often have multiple lines. Furthermore, we found that commit messages often begin with a verb followed by a direct object. This finding inspired us to use a "verb+object" format in this paper to generate short commit summaries. We split the approach into two parts: verb generation and object generation. As a first step, we trained a classifier that maps a diff to a verb. We are seeking feedback from the community before we continue to work on generating direct objects for the commits.
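
To make the verb-generation step concrete, here is a minimal sketch of a classifier that maps a diff to a verb. It is not the authors' model: the abstract does not say which features or learning algorithm were used, so this sketch assumes a multinomial naive Bayes classifier over word tokens taken from added and removed lines. The names `tokenize`, `VerbClassifier`, and `TRAIN`, the three-verb label set, and the toy training diffs are all hypothetical and exist only for illustration.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(diff):
    """Turn a unified diff into tokens, tagging each word as added or deleted."""
    tokens = []
    for line in diff.splitlines():
        if line.startswith("+") and not line.startswith("+++"):
            prefix = "add:"
        elif line.startswith("-") and not line.startswith("---"):
            prefix = "del:"
        else:
            continue  # skip context lines and file headers
        tokens.extend(prefix + w.lower() for w in re.findall(r"[A-Za-z_]+", line))
    return tokens


class VerbClassifier:
    """Multinomial naive Bayes with add-one smoothing (an assumed model choice)."""

    def __init__(self):
        self.verb_counts = Counter()               # number of diffs per verb
        self.token_counts = defaultdict(Counter)   # token frequencies per verb
        self.vocab = set()

    def fit(self, examples):
        for diff, verb in examples:
            self.verb_counts[verb] += 1
            for tok in tokenize(diff):
                self.token_counts[verb][tok] += 1
                self.vocab.add(tok)
        return self

    def predict(self, diff):
        tokens = tokenize(diff)
        total = sum(self.verb_counts.values())
        best_verb, best_score = None, -math.inf
        for verb, count in self.verb_counts.items():
            # log prior plus smoothed log likelihood of each token
            score = math.log(count / total)
            denom = sum(self.token_counts[verb].values()) + len(self.vocab)
            for tok in tokens:
                score += math.log((self.token_counts[verb][tok] + 1) / denom)
            if score > best_score:
                best_verb, best_score = verb, score
        return best_verb


# Hypothetical toy data: (diff, first verb of the human-written message).
TRAIN = [
    ("+def parse(x):\n+    return int(x)", "add"),
    ("+import logging\n+logging.info('start')", "add"),
    ("-def old_helper():\n-    pass", "remove"),
    ("-import unused_module", "remove"),
    ("-    return a - b\n+    return a + b", "fix"),
    ("-    if x > 0:\n+    if x >= 0:", "fix"),
]

clf = VerbClassifier().fit(TRAIN)
print(clf.predict("+def render(page):\n+    return page"))
```

In a realistic setting the training pairs would come from a repository's history, with the label taken from the first word of each human-written message; the predicted verb would then be combined with a separately generated direct object to form the "verb+object" summary.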