Skip to main content
Glossary Term

Named-entity recognition

Named-entity recognition platforms and evaluation - GATE, OpenNLP, SpaCy, Transformers, Stanford NER, and NLTK are notable NER platforms - Precision, recall, and F1 score are commonly used measures for evaluating NER systems - State-of-the-art NER systems for English achieve near-human performance - The best system in MUC-7 scored 93.39% of F-measure - Human annotators scored 97.60% and 96.95% - NER systems have made significant advancements in accuracy and efficiency - Different types of errors and their importance need to be considered in evaluating NER systems Named entity types and hierarchies - BBN categories, Sekines extended hierarchy, and Freebase entity types are proposed for named entity types - BBN categories consist of 29 types and 64 subtypes - Sekines extended hierarchy includes 200 subtypes - Freebase entity types have been used for NER over social media text - Different hierarchies help organize and classify named entities in NER systems Approaches to Named Entity Recognition - NER systems use linguistic grammar-based techniques and statistical models like machine learning - Hand-crafted grammar-based systems have better precision but lower recall and require months of work - Statistical NER systems require a large amount of manually annotated training data - Semisupervised approaches have been suggested to reduce annotation effort - Conditional random fields are a typical choice for machine-learned NER Challenges, misconceptions, and research in Named Entity Recognition - Named-entity recognition is far from being solved despite high reported F1 numbers - Efforts are focused on reducing annotation labor through semi-supervised learning - Robust performance across domains is a key challenge - Scaling up to fine-grained entity types is another challenge - Crowdsourcing is a promising solution to obtain high-quality human judgments for NER - Researchers have compared NER performances from different statistical models and feature sets - Graph-based semi-supervised learning models have been proposed for language-specific NER tasks Applications and advances in Named Entity Recognition - NER has applications in question answering, information retrieval, and open-domain search queries - Chinese named entity recognition is relevant to language processing and intelligent information systems - NER has been applied to Twitter messages - Advances in NER include fine-grained recognition using conditional random fields and graph-based semi-supervised learning models