Please use this identifier to cite or link to this item:
|Scopus||Web of Science®||Altmetric|
|Title:||SOTorrent: Studying the origin, evolution, and usage of stack overflow code snippets|
|Citation:||Proceedings of the IEEE/ACM 16th International Conference on Mining Software Repositories (MSR 2019), 2019 / vol.2019-May, pp.191-194|
|Series/Report no.:||IEEE International Working Conference on Mining Software Repositories|
|Conference Name:||16th International Conference on Mining Software Repositories (MSR) (25 May 2019 - 27 May 2019 : Montreal, QC, Canada)|
|Sebastian Baltes, Christoph Treude, Stephan Diehl|
|Abstract:||Stack Overflow (SO) is the most popular question-and-answer website for software developers, providing a large amount of copyable code snippets. Like other software artifacts, code on SO evolves over time, for example when bugs are fixed or APIs are updated to the most recent version. To be able to analyze how code and the surrounding text on SO evolves, we built SOTorrent, an open dataset based on the official SO data dump. SOTorrent provides access to the version history of SO content at the level of whole posts and individual text and code blocks. It connects code snippets from SO posts to other platforms by aggregating URLs from surrounding text blocks and comments, and by collecting references from GitHub files to SO posts. Our vision is that researchers will use SOTorrent to investigate and understand the evolution and maintenance of code on SO and its relation to other platforms such as GitHub.|
|Description:||Part of Conference: ICSE '19: 41st International Conference on Software Engineering.|
|Appears in Collections:||Computer Science publications|
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.