Publication | Closed Access
Research on Enterprise Track of TREC 2007 at SJTU APEX Lab
16
Citations
4
References
2007
Year
Ranking AlgorithmEngineeringBusiness IntelligenceData WarehouseLearning To RankTrec 2007Text MiningDocument Search TaskInformation RetrievalData ScienceData MiningEnterprise TrackManagementSystems EngineeringData IntegrationEnterprise Information SystemQuantitative ManagementTechnology Readiness LevelBm25 FormulaComputer ScienceInformation ManagementKeyword SearchSearch Engine DesignAdvanced Information SystemSjtu Apex LabHtml PagesSearch Engine IndexingIndustrial InformaticsTechnical PerformanceData Modeling
For Document Search Task, we generally applied BM25 formula separately on different fields of HTML pages: Title, Anchor, H1, H2, Keywords, and Extracted Body. Various Static Ranking methods are also exploited. Scores are combined together using linear combination. Among all the techniques we have embedded in our system, our highlight is the static ranking approaches. Beside this, some data preprocessing methods and similarity function will also be introduced.
| Year | Citations | |
|---|---|---|
Page 1
Page 1