Publication | Closed Access
Bit Sequences and Biclustering of Text Documents
83
Citations
35
References
2007
Year
Unknown Venue
EngineeringText Doc- UmentsSemantic WebCorpus LinguisticsText MiningNatural Language ProcessingInformation RetrievalData ScienceData MiningText SegmentationComputational LinguisticsLanguage StudiesNew TechniqueDocument ClusteringComputational LexicologyKnowledge DiscoveryTerminology ExtractionBiclustering StructureComputer ScienceKeyword ExtractionText ProcessingBit SequencesLinguisticsSemantic Similarity
We propose a new technique for clustering of text doc- uments that relies on a biclustering structure constructed on terms and documents. Our approach makes use of a greedy algorithm applied to bit sequences associated with each group of synonym terms. The use of bit sequences al- lows us to achieve superior time performance. Additionally, our algorithm provides meaningful cluster descriptions.
| Year | Citations | |
|---|---|---|
Page 1
Page 1