Publication | Closed Access
Locality Reconstruction Models for Book Representation
32
Citations
43
References
2018
Year
EngineeringSemantic WebText MiningWord EmbeddingsNatural Language ProcessingSingle Book TreeInformation RetrievalData ScienceData MiningPattern RecognitionComputational LinguisticsLocality Reconstruction ModelsLanguage StudiesDocument ClusteringSimilarity SearchKnowledge DiscoveryComputer ScienceLocality ReconstructionVector Space ModelTopic ModelContent RepresentationLinguisticsLengthy Documents
Books, as a representative of lengthy documents, convey rich semantics. Traditional document modeling methods, such as bag-of-words models, have difficulty capturing such rich semantics when only considering term-frequency features. In order to explore term spatial distributions over a book, a tree-structured book representation is investigated in this paper. Moreover, an efficient learning framework, Tree2Vector, is introduced for mapping tree-structured book data into vectorial space. In particular, we present two types of locality reconstruction (LR) models: Euclidean-type and cosine-type, during the transformation process of tree structures into vectorial representations. The LR is used for modeling the reconstruction process, in which each parent node in a tree is supposed to be reconstructed by its child nodes. The prominent advantage of this Tree2Vector framework is that it solely utilizes the local information within a single book tree. In addition, extensive experimental results demonstrate that Tree2Vector is able to deliver comparable or better performance in comparison to methods that consider the information of all trees in a database globally. Experimental results also suggest that cosine-type LR consistently performs better than Euclidean-type LR in applications of book and author recommendations.
| Year | Citations | |
|---|---|---|
Page 1
Page 1