Publication | Open Access
Health information text characteristics.
26
Citations
11
References
2006
Year
EngineeringCorpus LinguisticsText MiningNatural Language ProcessingInformation RetrievalHealth CommunicationDocument AnalysisDigital HealthComputational LinguisticsDocument ClassificationSurface MetricsLanguage StudiesPublic HealthBiomedical Text MiningDifficult Webmd DocumentsContent AnalysisMedical TextHealth Information SystemHealth LiteracyClinical DataHealth Information TechnologyHealth DataText ProcessingLinguisticsHealth InformaticsEmergency Medicine
Millions of people search online for medical text, but these texts are often too complicated to understand. Readability evaluations are mostly based on surface metrics such as character or words counts and sentence syntax, but content is ignored. We compared four types of documents, easy and difficult WebMD documents, patient blogs, and patient educational material, for surface and content-based metrics. The documents differed significantly in reading grade levels and vocabulary used. WebMD pages with high readability also used terminology that was more consumer-friendly. Moreover, difficult documents are harder to understand due to their grammar and word choice and because they discuss more difficult topics. This indicates that we can simplify many documents by focusing on word choice in addition to sentence structure, however, for difficult documents this may be insufficient.
| Year | Citations | |
|---|---|---|
Page 1
Page 1