Publication | Closed Access
Document normalization revisited
Citations: 25
References: 3
Year: 2002
Unknown Venue
Natural Language Processing, Document Processing, Text Normalization, Document Normalization, Information Retrieval, Data Science, Engineering, Computational Linguistics, Structured Document, Knowledge Discovery, Data Normalization, Computer Science, Document Collection, Average Precision, Statistics, Corpus Linguistics, Specific Value, Text Mining
Cosine Pivoted Document Length Normalization has reached a point of stability where many researchers indiscriminately apply a specific value of 0.2 regardless of the collection. Our efforts, however, demonstrate that applying this specific value without tuning for the document collection degrades average precision by as much as 20%.
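For readers unfamiliar with the technique, the following is a minimal sketch of how a slope value such as 0.2 enters the computation. It assumes the commonly cited pivoted form, in which the normalization factor is (1 - slope) * pivot + slope * old_norm, with the collection's average cosine norm used as the pivot. The function names and the toy weights are illustrative assumptions, not taken from the paper.

```python
import math


def cosine_norm(weights):
    """Euclidean length of a document's term-weight vector."""
    return math.sqrt(sum(w * w for w in weights))


def pivoted_norm(old_norm, pivot, slope=0.2):
    """Pivoted normalization factor: (1 - slope) * pivot + slope * old_norm.

    slope=1.0 reproduces plain cosine normalization; smaller slopes pull
    every document's factor toward the pivot.
    """
    return (1.0 - slope) * pivot + slope * old_norm


# Toy collection with made-up term weights (illustration only).
docs = {
    "short": [1.0, 1.0],
    "long": [1.0] * 50,
}
norms = {name: cosine_norm(w) for name, w in docs.items()}

# Assumption: the pivot is the average cosine norm over the collection.
pivot = sum(norms.values()) / len(norms)

# Show how the factor for short and long documents shifts with the slope.
for slope in (0.05, 0.2, 0.5, 1.0):
    factors = {name: pivoted_norm(n, pivot, slope) for name, n in norms.items()}
    print(slope, factors)
```

Because the pivot depends on the collection's own length distribution, the same slope penalizes long documents differently from one collection to the next, which is one way to see why a fixed value may not transfer well.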