Publication | Closed Access
Deploying GOOG-411: Early lessons in data, measurement, and testing
47
Citations
7
References
2008
Year
Correct Accept RateSearch Engine OptimizationEngineeringBusiness IntelligenceIntelligent Information RetrievalSpoken Language ProcessingData InfrastructureBusiness FinderLarge Language ModelSpeech RecognitionNatural Language ProcessingInformation RetrievalData ScienceData MiningComputational LinguisticsManagementLanguage EngineeringData IntegrationData ManagementMachine TranslationSearch TechnologyDistributed Search EngineKnowledge DiscoveryComputer ScienceInformation ManagementEarly LessonsSoftware TestingEarly Experience BuildingBig Data
We describe our early experience building and optimizing GOOG-411, a fully automated, voice-enabled, business finder. We show how taking an iterative approach to system development allows us to optimize the various components of the system, thereby progressively improving user-facing metrics. We show the contributions of different data sources to recognition accuracy. For business listing language models, we see a nearly linear performance increase with the logarithm of the amount of training data. To date, we have improved our correct accept rate by 25% absolute, and increased our transfer rate by 35% absolute.
| Year | Citations | |
|---|---|---|
Page 1
Page 1