Publication | Closed Access
Towards explainable prediction of essay cohesion in Portuguese and English
16
Citations
23
References
2023
Year
Unknown Venue
EngineeringMachine LearningTextual CohesionEssay CohesionLarge Language ModelSyntactic StructureDeep Learning ModelLanguage LearningCorpus LinguisticsText MiningApplied LinguisticsNatural Language ProcessingParaphraseComputational LinguisticsLanguage EngineeringGrammarLanguage StudiesMachine TranslationNatural LanguageNlp TaskDistributional SemanticsAutomatic Cohesion EstimationsLexical Complexity PredictionLinguistics
Textual cohesion is an essential aspect of a formally written text, related to linguistic mechanisms that connect elements such as words, sentences, and paragraphs. Several studies have proposed approaches to estimate textual cohesion in essays automatically. There is limited research that aims to study the extent to which the use of machine learning approaches can predict the textual cohesion of essays written in different languages (not just English). This paper reports on the findings of a study that aimed to propose and evaluate approaches that automatically estimate the cohesion of essays in Portuguese and English. The study proposed regression-based models grounded in conventional feature-based machine learning methods and deep learning-based pre-trained language models. The study also examined the explainability of automated approaches to scrutinize their predictions. We analyzed two datasets composed of 4,570 (Portuguese) and 7,101 (English) essays. The results demonstrate that a deep learning-based model achieved the best performance on both datasets with a moderate Pearson correlation with human-rated cohesion scores. However, the explainability of the automatic cohesion estimations based on conventional machine learning models offered a stronger potential than that of the deep learning model.
| Year | Citations | |
|---|---|---|
Page 1
Page 1