A Neural Named Entity Recognition and Multi-Type Normalization Tool for Biomedical Text Mining

Donghyeon Kim, Jinhyuk Lee, Chan Ho So, Hwisang Jeon, Minbyul Jeong, Yonghwa Choi, Wonjin Yoon, Mujeen Sung, Jaewoo Kang

Research output: Contribution to journal, Article

5 Citations (Scopus)

Abstract

The amount of biomedical literature is vast and growing quickly, and accurate text mining techniques could help researchers efficiently extract useful information from it. However, existing named entity recognition models used by text mining tools such as tmTool and ezTag are not effective enough and cannot accurately discover new entities. Traditional text mining tools also do not consider overlapping entities, which are frequently observed in multi-type named entity recognition results. We propose a neural biomedical named entity recognition and multi-type normalization tool called BERN. BERN uses high-performance BioBERT named entity recognition models, which recognize known entities and discover new ones. Probability-based decision rules are developed to identify the types of overlapping entities. Furthermore, various named entity normalization models are integrated into BERN to assign a distinct identifier to each recognized entity. BERN provides a Web service for tagging entities in PubMed articles or raw text. Researchers can use the BERN Web service for text mining tasks such as new named entity discovery, information retrieval, question answering, and relation extraction. The application programming interfaces and demonstrations of BERN are publicly available at https://bern.korea.ac.kr.
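The abstract does not spell out the probability-based decision rules, so the following is only a minimal sketch of one plausible rule of that kind, not BERN's actual method. It assumes each candidate mention carries a character span, a type label, and a model probability, and it keeps the most probable mention whenever spans overlap. The `Mention` class, its fields, and `resolve_overlaps` are hypothetical names introduced for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Mention:
    """Hypothetical container for one recognized entity mention."""
    start: int    # inclusive character offset
    end: int      # exclusive character offset
    etype: str    # entity type label, e.g. "gene" or "disease"
    prob: float   # assumed model confidence for this mention


def overlaps(a: Mention, b: Mention) -> bool:
    """Two half-open spans overlap when each starts before the other ends."""
    return a.start < b.end and b.start < a.end


def resolve_overlaps(mentions: list[Mention]) -> list[Mention]:
    """Greedy rule (an assumption, not the paper's rule): visit mentions
    from most to least probable and keep one only if it does not overlap
    a mention that was already kept. Results are returned in text order."""
    kept: list[Mention] = []
    for m in sorted(mentions, key=lambda m: m.prob, reverse=True):
        if not any(overlaps(m, k) for k in kept):
            kept.append(m)
    return sorted(kept, key=lambda m: m.start)


if __name__ == "__main__":
    # Made-up candidates: the first two compete for the same text region.
    candidates = [
        Mention(0, 5, "gene", 0.91),
        Mention(0, 12, "disease", 0.64),
        Mention(20, 28, "drug", 0.88),
    ]
    for m in resolve_overlaps(candidates):
        print(m.start, m.end, m.etype, m.prob)
```

In this toy input the "gene" and "disease" candidates overlap, so only the higher-probability "gene" mention survives alongside the non-overlapping "drug" mention. A real system might instead compare probabilities per token or apply type-specific thresholds.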

Original language: English
Article number: 8730332
Pages (from-to): 73729-73740
Number of pages: 12
Journal: IEEE Access
Volume: 7
DOI: https://doi.org/10.1109/ACCESS.2019.2920708
Publication status: Published - 2019 Jan 1

Keywords

  • Biomedical text mining
  • decision rules
  • multi-type
  • named entity recognition
  • neural networks
  • normalization
  • Web service

ASJC Scopus subject areas

  • Computer Science (all)
  • Materials Science (all)
  • Engineering (all)

Cite this

Kim, D., Lee, J., So, C. H., Jeon, H., Jeong, M., Choi, Y., Yoon, W., Sung, M., & Kang, J. (2019). A Neural Named Entity Recognition and Multi-Type Normalization Tool for Biomedical Text Mining. IEEE Access, 7, 73729-73740. [8730332]. https://doi.org/10.1109/ACCESS.2019.2920708