Transformation Algorithms for Data Streams

Abstract

Next generation data processing systems must deal with very high data ingest rates and massive volumes of data. Such conditions are typically encountered in the intelligence community (IC) where analysts must search through huge volumes of data in order to gather evidence to support or refute their hypotheses. Their effort is made all the more difficult given that the data appears as unstructured text that is written in multiple languages using characters that have different encodings. Human analysts have not been able to keep pace with reading the data and a large amount of data is discarded even though it might contain key information. The goal of our project is to assess the feasibility of incrementally replacing humans with automation in key areas of information processing. These areas include document ingest, content categorization, language translation, and context- and temporally-based information retrieval

References

Page 1

	Year	Citations

Page 1