Publication | Closed Access
Network representation learning with rich text information
893
Citations
21
References
2015
Year
Unknown Venue
Representation learning has proven effective across domains, and network representation learning seeks distributed vertex embeddings, yet most approaches ignore rich textual attributes inherent to vertices. The authors aim to extend DeepWalk by incorporating vertex text features through a matrix‑factorization framework, resulting in the proposed TADW method. TADW integrates vertex text features into a matrix‑factorization model and is evaluated against baselines on multi‑class vertex classification tasks. Experiments demonstrate that TADW surpasses baseline methods on all three datasets, particularly under noisy network conditions and low training ratios. Source code is available at https://github.com/albertyang33/TADW.
Representation learning has shown its effectiveness in many tasks such as image classification and text mining. Network representation learning aims at learning distributed vector representation for each vertex in a network, which is also increasingly recognized as an important aspect for network analysis. Most network representation learning methods investigate network structures for learning. In reality, network vertices contain rich information (such as text), which cannot be well applied with algorithmic frameworks of typical representation learning methods. By proving that DeepWalk, a state-of-the-art network representation method, is actually equivalent to matrix factorization (MF), we propose text-associated DeepWalk (TADW). TADW incorporates text features of vertices into network representation learning under the framework of matrix factorization. We evaluate our method and various baseline methods by applying them to the task of multi-class classification of vertices. The experimental results show that, our method outperforms other baselines on all three datasets, especially when networks are noisy and training ratio is small. The source code of this paper can be obtained from https://github.com/albertyang33/TADW.
| Year | Citations | |
|---|---|---|
Page 1
Page 1