This paper illustrates an automatic document processing system for the extraction of data contained in medical laboratory results printed on paper. The final goal of the research is to automate the collection of medical data and to enable an efficient management and dissemination of the information. The following processing steps of the system are described in detail: image preprocessing; layout analysis for the identification of the tables contained in the document; extraction and classification of the laboratory results. Among the many features of the system there are the use of an open source OCR engine, as a basis of further processing, and the storage in XML format of the data retrieved, for ease of sharing. The knowledge base used to guide the data extraction is also explained. The proposed approach has been tested on several document formats and performance analyzed.
An automatic document processing system for medical data extraction / Adamo, F.; Attivissimo, F; Di Nisio, A.; Spadavecchia, M.. - In: MEASUREMENT. - ISSN 0263-2241. - STAMPA. - 61:2(2015), pp. 88-99. [10.1016/j.measurement.2014.10.032]
An automatic document processing system for medical data extraction
Adamo, F.;Attivissimo, F;Di Nisio, A.;Spadavecchia, M.
2015-01-01
Abstract
This paper illustrates an automatic document processing system for the extraction of data contained in medical laboratory results printed on paper. The final goal of the research is to automate the collection of medical data and to enable an efficient management and dissemination of the information. The following processing steps of the system are described in detail: image preprocessing; layout analysis for the identification of the tables contained in the document; extraction and classification of the laboratory results. Among the many features of the system there are the use of an open source OCR engine, as a basis of further processing, and the storage in XML format of the data retrieved, for ease of sharing. The knowledge base used to guide the data extraction is also explained. The proposed approach has been tested on several document formats and performance analyzed.File | Dimensione | Formato | |
---|---|---|---|
2015 An automatic document processing system for medical data extraction.pdf
accesso aperto
Descrizione: Articolo principale
Tipologia:
Documento in Pre-print
Licenza:
Creative commons
Dimensione
926.72 kB
Formato
Adobe PDF
|
926.72 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.