Publication | Closed Access
SoftRank
322
Citations
13
References
2008
Year
Unknown Venue
Artificial IntelligenceRetrieval Augmented GenerationRanking AlgorithmEngineeringMachine LearningData ScienceData MiningInformation RetrievalKnowledge DiscoveryLearning To RankRelevance FeedbackEvaluation MetricsComputer ScienceIr MetricsSupervised LearningText MiningHence Ir Metrics
We address the problem of learning large complex ranking functions. Most IR applications use evaluation metrics that depend only upon the ranks of documents. However, most ranking functions generate document scores, which are sorted to produce a ranking. Hence IR metrics are innately non-smooth with respect to the scores, due to the sort. Unfortunately, many machine learning algorithms require the gradient of a training objective in order to perform the optimization of the model parameters,and because IR metrics are non-smooth,we need to find a smooth proxy objective that can be used for training. We present a new family of training objectives that are derived from the rank distributions of documents, induced by smoothed scores. We call this approach SoftRank. We focus on a smoothed approximation to Normalized Discounted Cumulative Gain (NDCG), called SoftNDCG and we compare it with three other training objectives in the recent literature. We present two main results. First, SoftRank yields a very good way of optimizing NDCG. Second, we show that it is possible to achieve state of the art test set NDCG results by optimizing a soft NDCG objective on the training set with a different discount function
| Year | Citations | |
|---|---|---|
Page 1
Page 1