Publication | Open Access
An Open Architecture for End-to-End Document Analysis Benchmarking
30
Citations
10
References
2011
Year
Unknown Venue
Performance BenchmarkingEngineeringSoftware EngineeringSoftware AnalysisText MiningNatural Language ProcessingInformation RetrievalData ScienceBenchmark StudyWhole PipelineEnd-to-end Document AnalysisParallel ComputingMachine TranslationHigh-performance Data AnalyticsComputer SciencePerformance Analysis ToolData-intensive ComputingBenchmarking ToolAnalysis ProcessProgram AnalysisOpen Architecture
In this paper, we present a fully operational, scalable and open architecture allowing end-to-end document analysis benchmarking without needing to develop the whole pipeline. By decomposing the analysis process into coarse-grained tasks, and by building upon community provided state-of-the art algorithms, our architecture allows any combination of elementary document analysis algorithms, regardless their running system environment, programming language or data structures. Its flexible structure makes it straightforward to plug in new algorithms, compare them to other algorithms, and observe the effects on end-to-end tasks without need to install, compile or otherwise interact with any other software than one's own.
| Year | Citations | |
|---|---|---|
Page 1
Page 1