skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: A pre-training and self-training approach for biomedical named entity recognition

Journal Article · · PLoS ONE

Named entity recognition (NER) is a key component of many scientific literature mining tasks, such as information retrieval, information extraction, and question answering; however, many modern approaches require large amounts of labeled training data in order to be effective. This severely limits the effectiveness of NER models in applications where expert annotations are difficult and expensive to obtain. In this work, we explore the effectiveness of transfer learning and semi-supervised self-training to improve the performance of NER models in biomedical settings with very limited labeled data (250-2000 labeled samples). We first pre-train a BiLSTM-CRF and a BERT model on a very large general biomedical NER corpus such as MedMentions or Semantic Medline, and then we fine-tune the model on a more specific target NER task that has very limited training data; finally, we apply semi-supervised self-training using unlabeled data to further boost model performance. We show that in NER tasks that focus on common biomedical entity types such as those in the Unified Medical Language System (UMLS), combining transfer learning with self-training enables a NER model such as a BiLSTM-CRF or BERT to obtain similar performance with the same model trained on 3x-8x the amount of labeled data. We further show that our approach can also boost performance in a low-resource application where entities types are more rare and not specifically covered in UMLS.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE
Grant/Contract Number:
AC05-00OR22725
OSTI ID:
1765039
Alternate ID(s):
OSTI ID: 1766402
Journal Information:
PLoS ONE, Journal Name: PLoS ONE Vol. 16 Journal Issue: 2; ISSN 1932-6203
Publisher:
Public Library of ScienceCopyright Statement
Country of Publication:
United States
Language:
English

References (44)

An overview of MetaMap: historical perspective and recent advances journal May 2010
Rule-based pattern extractor and named entity recognition: A hybrid approach conference June 2010
SemMedDB: a PubMed-scale repository of biomedical semantic predications journal October 2012
Towards reliable named entity recognition in the biomedical domain journal June 2019
Transfer Learning in Biomedical Named Entity Recognition: An Evaluation of BERT in the PharmaCoNER task conference January 2019
A context pattern induction method for named entity extraction conference January 2006
End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF conference January 2016
Transfer Learning in Biomedical Natural Language Processing: An Evaluation of BERT and ELMo on Ten Benchmarking Datasets conference January 2019
A Bidirectional LSTM and Conditional Random Fields Approach to Medical Named Entity Recognition book August 2017
HUNER: improving biomedical NER with pretraining journal June 2019
A Neural Named Entity Recognition and Multi-Type Normalization Tool for Biomedical Text Mining journal January 2019
Semantic MEDLINE: An advanced information management application for biomedicine journal September 2011
Multi-domain evaluation framework for named entity recognition tools journal May 2017
MetaNER: Named Entity Recognition with Meta-Learning conference April 2020
Rule-Based Method for Entity Resolution journal January 2015
Few-Shot Named Entity Recognition via Meta-Learning journal January 2020
Strong Baselines for Neural Semi-Supervised Learning under Domain Shift
  • Ruder, Sebastian; Plank, Barbara
  • Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) https://doi.org/10.18653/v1/P18-1096
conference January 2018
BioBERT: a pre-trained biomedical language representation model for biomedical text mining journal September 2019
Deep Model Based Transfer and Multi-Task Learning for Biological Image Analysis journal June 2020
Biomedical Named Entity Recognition with Multilingual BERT conference January 2019
DTranNER: biomedical named entity recognition with deep learning-based label-label transition model journal February 2020
BioWordVec, improving biomedical word embeddings with subword information and MeSH journal May 2019
ChemSpot: a hybrid system for chemical named entity recognition journal April 2012
Artificial intelligence to organize patient portal messages: a journey from an ensemble deep learning text classification to rule-based named entity recognition conference November 2019
Neural Architectures for Named Entity Recognition
  • Lample, Guillaume; Ballesteros, Miguel; Subramanian, Sandeep
  • Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies https://doi.org/10.18653/v1/N16-1030
conference January 2016
A Survey on Transfer Learning journal October 2010
Transfer learning for biomedical named entity recognition with neural networks journal June 2018
Better Modeling of Incomplete Annotations for Named Entity Recognition conference January 2019
Extraction of protein interaction information from unstructured text using a context-free grammar journal October 2003
Transfer Learning for Low-Resource Neural Machine Translation conference January 2016
ScispaCy: Fast and Robust Models for Biomedical Natural Language Processing conference January 2019
Scientific Information Extraction with Semi-supervised Neural Tagging conference January 2017
A rule-based named-entity recognition method for knowledge extraction of evidence-based dietary recommendations journal June 2017
Using machine learning to maintain rule-based named-entity recognition and classification systems conference January 2001
Dynamic Transfer Learning for Named Entity Recognition book August 2019
Effect of Character and Word Features in Bidirectional LSTM-CRF for NER conference February 2020
A survey on semi-supervised learning journal November 2019
A novel method for prokaryotic promoter prediction based on DNA stability journal January 2005
Virtual Adversarial Training: A Regularization Method for Supervised and Semi-Supervised Learning journal August 2019
Transformers: State-of-the-Art Natural Language Processing conference January 2020
A Bootstrapping Approach With CRF and Deep Learning Models for Improving the Biomedical Named Entity Recognition in Multi-Domains journal January 2019
Pre-trained models for natural language processing: A survey journal September 2020
A simple semi-supervised algorithm for named entity recognition
  • Liao, Wenhui; Veeramachaneni, Sriharsha
  • Proceedings of the NAACL HLT 2009 Workshop on Semi-Supervised Learning for Natural Language Processing - SemiSupLearn '09 https://doi.org/10.3115/1621829.1621837
conference January 2009
Semi-supervised learning for named entity recognition using weakly labeled training data
  • Zafarian, Atefeh; Rokni, Ali; Khadivi, Shahram
  • 2015 International Symposium on Artificial Intelligence and Signal Processing (AISP), 2015 The International Symposium on Artificial Intelligence and Signal Processing (AISP) https://doi.org/10.1109/AISP.2015.7123504
conference March 2015

Similar Records

Creating Training Data for Scientific Named Entity Recognition with Minimal Human Effort
Conference · Tue Jan 01 00:00:00 EST 2019 · OSTI ID:1765039

CyBERT: Cybersecurity Claim Classification by Fine-Tuning the BERT Language Model
Journal Article · Thu Nov 04 00:00:00 EDT 2021 · Journal of Cybersecurity and Privacy · OSTI ID:1765039

Unified Medical Language System resources improve sieve-based generation and Bidirectional Encoder Representations from Transformers (BERT)–based ranking for concept normalization
Journal Article · Mon Jul 27 00:00:00 EDT 2020 · Journal of the American Medical Informatics Association · OSTI ID:1765039