Finding Needles in the Haystack: Identifying Patients with Rare Subtype of Multiple Myeloma Supported by a Data Warehouse and Information Extraction

Krebs J, Bittrich M, Dietrich G, Ertl M, Fette G, Kaspar M, Liman L, Einsele H, Puppe F, Knop S (2018)


Publication Type: Conference contribution

Publication year: 2018

Journal

Publisher: IOS Press BV

Book Volume: 253

Pages Range: 160-164

Conference Proceedings Title: Studies in Health Technology and Informatics

Event location: Osnabruck, DEU

ISBN: 9781614998952

DOI: 10.3233/978-1-61499-896-9-160

Abstract

Finding patient cases with extremely rare pathologies is a laborious task. To decrease time spent on manually searching through thousands of discharge letters and reports, a data warehouse with a fast fulltext search index was queried. Our use case is to find "macrofocal myeloma", i.e. Multiple Myeloma patients with few large lesions. We guessed the number of those patients in the University Hospital Würzburg at about 20. Most criteria were available in the data warehouse in an unstructured form requiring information extraction. 8 patient cases were found by searching for different spellings of "macrofocal myeloma" in discharge letters directly. With an indirect search combining several criteria, we found additional 23 candidate patient cases, from which 10 were classified by a domain expert as correct. The most difficult criteria were determining the degree of bone marrow infiltration. We achieved an F1 score of 93.2 % for this task. The number of patient cases to be screened manually for this disease decreased from about 25000 to 23.

Involved external institutions

How to cite

APA:

Krebs, J., Bittrich, M., Dietrich, G., Ertl, M., Fette, G., Kaspar, M.,... Knop, S. (2018). Finding Needles in the Haystack: Identifying Patients with Rare Subtype of Multiple Myeloma Supported by a Data Warehouse and Information Extraction. In Ursula Hubner, Ulrich Sax, Hans-Ulrich Prokosch, Bernhard Breil, Harald Binder, Antonia Zap, Brigitte Strahwald, Tim Beissbarth, Niels Grabe, Anke Scholer (Eds.), Studies in Health Technology and Informatics (pp. 160-164). Osnabruck, DEU: IOS Press BV.

MLA:

Krebs, Jonathan, et al. "Finding Needles in the Haystack: Identifying Patients with Rare Subtype of Multiple Myeloma Supported by a Data Warehouse and Information Extraction." Proceedings of the 63rd Annual Meeting of the German Association of Medical Informatics, Biometry and Epidemiology, GMDS 2018, Osnabruck, DEU Ed. Ursula Hubner, Ulrich Sax, Hans-Ulrich Prokosch, Bernhard Breil, Harald Binder, Antonia Zap, Brigitte Strahwald, Tim Beissbarth, Niels Grabe, Anke Scholer, IOS Press BV, 2018. 160-164.

BibTeX: Download