Fine-grained protein mutation extraction from biological literature

Böckmann R (2009)


Publication Status: Published

Publication Type: Conference contribution, Conference Contribution

Publication year: 2009

Pages Range: 401-405

Article Number: 4795993

Event location: Macau

ISBN: 9780769535593

DOI: 10.1109/ICECT.2009.10

Abstract

Automatic extraction of experimental data on protein mutants from large volumes of biological texts can help building corresponding databases to facilitate research in relevant studies. Mutation extraction cannot be fully solved by the surface pattern matching but requires linguistic analysis of the plain text. Based on the existing regular expression method, we improved the mutation extraction by applying the dependency parsing technique from natural language processing (NLP). Furthermore, we extract valuable data about experimental measurements from the texts and relate them to the identified mutations. Our method was evaluated on MedLine abstracts. The results show great potential for future exploration. © 2009 IEEE.

Authors with CRIS profile

How to cite

APA:

Böckmann, R. (2009). Fine-grained protein mutation extraction from biological literature. In Proceedings of the 2009 International Conference on Electronic Computer Technology, ICECT 2009 (pp. 401-405). Macau.

MLA:

Böckmann, Rainer. "Fine-grained protein mutation extraction from biological literature." Proceedings of the 2009 International Conference on Electronic Computer Technology, ICECT 2009, Macau 2009. 401-405.

BibTeX: Download