Publication | Closed Access
The GHTorent dataset and tool suite
339
Citations
1
References
2013
Year
Unknown Venue
EngineeringData RepositoryData ExplorationData MashupsSemantic WebLarge-scale DatasetsData ScienceData MiningManagementData IntegrationLinked DataData ManagementOpen DataData ModelingBenchmark DatasetsDataset DetailsResearch Data ArchivingGhtorent ProjectCloud ComputingExtensive Rest ApiGhtorent DatasetBig Data
During the last few years, GitHub has emerged as a popular project hosting, mirroring and collaboration platform. GitHub provides an extensive REST API, which enables researchers to retrieve high-quality, interconnected data. The GHTorent project has been collecting data for all public projects available on Github for more than a year. In this paper, we present the dataset details and construction process and outline the challenges and research opportunities emerging from it.
| Year | Citations | |
|---|---|---|
Page 1
Page 1