Please use this identifier to cite or link to this item:
|Scopus||Web of Science®||Altmetric|
|Title:||An annotated dataset of stack overflow post edits|
|Citation:||Proceedings of the 2020 Genetic and Evolutionary Computation Conference (GECCO'20). GECCO 2020 Companion, 2020, pp.1923-1925|
|Publisher:||Association for Computing Machinery|
|Conference Name:||Genetic and Evolutionary Computation Conference (GECCO) (8 Jul 2020 - 12 Jul 2020 : virtual online)|
|Sebastian Baltes, Markus Wagner|
|Abstract:||To improve software engineering, software repositories have been mined for code snippets and bug fixes. Typically, this mining takes place at the level of files or commits. To be able to dig deeper and to extract insights at a higher resolution, we hereby present an annotated dataset that contains over 7 million edits of code and text on Stack Overflow. Our preliminary study indicates that these edits might be a treasure trove for mining information about fine-grained patches, e.g., for the optimisation of non-functional properties.|
|Keywords:||Software documentation; software evolution; patches; mining software repositories; stack overflow|
|Rights:||© 2020 Copyright held by the owner/author(s). Publication rights licensed to ACM.|
|Appears in Collections:||Aurora harvest 4|
Computer Science publications
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.