/guid/9202a8c04000641f800000000062a2be rename
Summary
Named entity recognition (NER) (also known as entity identification and entity extraction) is a...
Content
Named entity recognition (NER) (also known as entity identification and entity extraction) is a subtask of information extraction that seeks to locate and classify atomic elements in text into predefined categories such as the names of persons, organizations, locations, expressions of times, quantities, monetary values, percentages, etc.
Most research on NER systems has been structured as taking an unannotated block of text, such as this one:
And producing an annotated block of text, such as this one:
In this example, the annotations have been done using so-called ENAMEX tags that were developed for the Message Understanding Conference in the 1990s.
State-of-the-art NER systems produce near-human performance. For example, the best system entering MUC-7 scored 93.39% of f-measure while human annotators scored 97.60% and 96.95%. These results indicate the algorithms had roughly twice the error rate (6.61%) as human annotators (2.40% and 3.05%).
NER systems have been created that use linguistic grammar-based techniques as well as statistical models. Hand-crafted grammar-based systems typically obtain better results, but at the cost of months of work by experienced computational
Created by:
Freebase Data Team
Oct 23, 2006
Last edited by:
Freebase Data Team
Oct 23, 2006
Recent Discussions about None
There is no discussion about this document.
Start the Discussion »