Publication | Open Access
Document overlap detection system for distributed digital libraries
48
Citations
3
References
2000
Year
Unknown Venue
Cluster ComputingEngineeringData DeduplicationInformation ForensicsText MiningString-searching AlgorithmInformation RetrievalData ScienceData MiningString ProcessingDocument EngineeringData IntegrationData ManagementDistributed Digital LibrariesDocument ClusteringKnowledge DiscoveryMatching-engine ComponentComputer ScienceContent Similarity DetectionCombinatorial Pattern MatchingPlagiarised DocumentsDocument Processing
In this paper we introduce the MatchDetectReveal(MDR) system, which is capable of identifying overlapping and plagiarised documents. Each component of the system is briefly described. The matching-engine component uses a modified suffix tree representation, which is able to identify the exact overlapping chunks and its performance is also presented.
| Year | Citations | |
|---|---|---|
Page 1
Page 1