Publication | Closed Access
Findings from GitHub
86
Citations
13
References
2016
Year
Unknown Venue
Software MaintenanceEngineeringSoftware EngineeringSource Code AnalysisSoftware AnalysisResearch PapersText MiningComputational Social ScienceEmpirical Software Engineering ResearchData ScienceOpen-source Software DevelopmentOpen-source SystemGithub RepositoriesLanguage StudiesContent AnalysisSoftware RepositorySoftware MiningKnowledge DiscoveryComputer ScienceSoftware DesignMining Open SourceProgram AnalysisSoftware TestingSoftware Versioning
GitHub, one of the most popular social coding platforms, is the platform of reference when mining Open Source repositories to learn from past experiences. In the last years, a number of research papers have been published reporting findings based on data mined from GitHub. As the community continues to deepen in its understanding of software engineering thanks to the analysis performed on this platform, we believe it is worthwhile to reflect how research papers have addressed the task of mining GitHub repositories over the last years. In this regard, we present a meta-analysis of 93 research papers which addresses three main dimensions of those papers: i) the empirical methods employed, ii) the datasets they used and iii) the limitations reported. Results of our meta-analysis show some concerns regarding the dataset collection process and size, the low level of replicability, poor sampling techniques, lack of longitudinal studies and scarce variety of methodologies.
| Year | Citations | |
|---|---|---|
Page 1
Page 1