Publication | Closed Access
LattesMiner
10
Citations
14
References
2011
Year
Unknown Venue
Natural Language ProcessingLattes Cv SystemEngineeringInformation RetrievalData ScienceKnowledge ExtractionComputational LinguisticsKnowledge DiscoveryLattes CurriculaTerminology ExtractionLearning AnalyticsCurricular Information SystemSemantic WebData ExtractionInformation ExtractionNamed-entity RecognitionCorpus LinguisticsText Mining
The Lattes CV system, a curricular information system maintained by CNPq, is the core of the Lattes Platform. This system is undoubtedly the major source of information on Brazilian researchers. This paper describes "LattesMiner", a multilingual domain-specific language for automatic information extraction from Lattes curricula. It is composed by a set of classes written in Java that allows developers to implement their own applications with a high-level abstraction and expression power. LattesMiner can extract data belonging to the Lattes Platform from any individual researcher or group of researchers by its name or given (ID) number. The data extracted can be analyzed and used, for instance, to identify academic social networks, regional competences, profile of groups in diferent areas of research etc. We illustrate its use with a case study.
| Year | Citations | |
|---|---|---|
Page 1
Page 1