Publication | Closed Access
Cascade
235
Citations
20
References
2013
Year
Unknown Venue
EngineeringAutomated WorkflowSmall GroupSemantic WebData ScienceData MiningManagementData IntegrationInformation ArchitectsHuman ComputationData ManagementWeb-based CollaborationKnowledge DiscoveryComputer ScienceInformation ManagementCrowdsourcingCrowd ComputingMassive Data ProcessingBig Data
Taxonomies are a useful and ubiquitous way of organizing information. However, creating organizational hierarchies is difficult because the process requires a global understanding of the objects to be categorized. Usually one is created by an individual or a small group of people working together for hours or even days. Unfortunately, this centralized approach does not work well for the large, quickly changing datasets found on the web. Cascade is an automated workflow that allows crowd workers to spend as little at 20 seconds each while collectively making a taxonomy. We evaluate Cascade and show that on three datasets its quality is 80-90% of that of experts. Cascade has a competitive cost to expert information architects, despite taking six times more human labor. Fortunately, this labor can be parallelized such that Cascade will run in as fast as four minutes instead of hours or days.
| Year | Citations | |
|---|---|---|
Page 1
Page 1