Publication | Closed Access
Predicting reuse of end-user web macro scripts
13
Citations
28
References
2009
Year
Unknown Venue
Software MaintenanceWeb Macro ScriptsEngineeringModel ReuseSoftware EngineeringSource Code AnalysisSoftware AnalysisText MiningInformation RetrievalData ScienceScript TraitsSoftware MiningPredictive AnalyticsRepository LogsKnowledge DiscoveryComputer ScienceCode RepresentationSoftware DesignProgram AnalysisSoftware TestingReusabilityCode Reuse
Repositories of code written by end-user programmers are beginning to emerge, but when a piece of code is new or nobody has yet reused it, then current repositories provide users with no information about whether that code might be appropriate for reuse. Addressing this problem requires predicting reusability based on information that exists when a script is created. To provide such a model for web macro scripts, we identified script traits that might plausibly predict reuse, then used IBM CoScripter repository logs to statistically test how well each corresponded to reuse. We then built a machine learning model that combines the useful traits and evaluated how well it can predict four different types of reuse that we saw in the repository logs. Our model was able to predict reuse from a surprisingly small set of traits. It is simple enough to be explained in only 6-11 rules, making it potentially viable for integration in repository search engines for end-user programmers.
| Year | Citations | |
|---|---|---|
Page 1
Page 1