Concepedia

TLDR

Incident reporting and investigation are key components of safety management systems, yet identifying risk factors is hampered by large, complex data sets and the need for manual categorization. The study proposes a data‑mining approach to automatically identify incident risk factors from the Aviation Safety Reporting System, aiming to reduce manual effort and address incomplete data. The method applies topical mining to transform incident narratives into features and uses data‑streaming algorithms to iteratively build and evaluate classification models for risk factor identification. The approach achieved 76–88 % accuracy across three classifiers, showing that automated text‑mining and data‑streaming can effectively identify incident risk factors and improve safety management.

Abstract

Incident reporting and investigation are components of safety management systems. Timely and accurate identification of risk factors is crucial to effective prevention strategies. However, risk factor identification is often hampered by size, complexity, and the need for human involvement in categorizing incident data. We present a data-mining approach to incident risk factor identification and analysis using data from the Aviation Safety Reporting System, which is part of the Federal Aviation Administration. Our approach is an attempt to overcome obstacles related to labor intensive manual identification of risk factors as well as incomplete data. First, topical mining techniques convert underused textual data (incident narratives) to serve as model input. Second, data-streaming algorithms are used to incrementally build and test classification models for risk factor identification. Three different classification algorithms were tested providing overall accuracy rates ranging from 76 percent to 88 percent, demonstrating the potential for effective use of large and unstructured incident data in safety management. Our research presents and demonstrates an approach to automated incident type identification and contributes to our understanding of the use of text-mining and data-streaming technologies in improving safety management systems.

References

YearCitations

Page 1