Publication | Open Access
Mining Relevant Sequence Patterns with CP-Based Framework
15
Citations
21
References
2014
Year
Unknown Venue
EngineeringPattern DiscoveryPattern MiningDynamic CspMining MethodsSoftware AnalysisCorpus LinguisticsText MiningConstraint ProgrammingNatural Language ProcessingRelevant Sequence PatternsSyntaxSequential PatternData ScienceData MiningPattern RecognitionComputational LinguisticsLanguage StudiesKnowledge DiscoveryComputer ScienceFrequent Pattern MiningProgram AnalysisCombinatorial Pattern MatchingStructure MiningLinguisticsData Modeling
Sequential pattern mining under various constraints is a challenging data mining task. The paper provides a generic framework based on constraint programming to discover sequence patterns defined by constraints on local patterns (e.g., Gap, regular expressions) or constraints on patterns involving combination of local patterns such as relevant subgroups and top-k patterns. This framework enables the user to mine in a declarative way both kinds of patterns. The solving step is done by exploiting the machinery of Constraint Programming. For complex patterns involving combination of local patterns, we improve the mining step by using dynamic CSP. Finally, we present two case studies in biomedical information extraction and stylistic analysis in linguistics.
| Year | Citations | |
|---|---|---|
Page 1
Page 1