Publication | Closed Access
LinguaKit: A Big Data-Based Multilingual Tool for Linguistic Analysis and Information Extraction
36
Citations
17
References
2018
Year
Unknown Venue
EngineeringTaggingLinguistic CorrectionPart-of-speech TaggingSemantic WebSentiment AnalysisBig Data InfrastructureText MiningApplied LinguisticsNatural Language ProcessingInformation RetrievalData ScienceComputational LinguisticsLanguage StudiesMachine TranslationNlp TaskTerminology ExtractionCross-language RetrievalInformation ExtractionSemantic ParsingLanguage CorpusMultilingual SuiteLinguistic AnalysisLinguisticsBig Data
This paper presents LinguaKit, a multilingual suite of tools for analysis, extraction, annotation and linguistic correction, as well as its integration into a Big Data infrastructure. LinguaKit allows the user to perform different tasks such as PoS-tagging, syntactic parsing, coreference resolution (among others), including applications for relation extraction, sentiment analysis, summarization, extraction of multiword expressions, or entity linking to DBpedia. Most modules work in four languages: Portuguese, Spanish, English, and Galician. The system is programmed in Perl and is freely available under a GPLv3 license.
| Year | Citations | |
|---|---|---|
Page 1
Page 1