Summary of the paper

Title Comparative Analysis of Portuguese Named Entities Recognition Tools
Authors Daniela Amaral, Evandro Fonseca, Lucelene Lopes and Renata Vieira
Abstract This paper describes an experiment comparing four tools for recognizing named entities in Portuguese texts. The experiment was conducted on the HAREM corpora, a gold standard for named entity recognition in Portuguese. The tools evaluated are based on natural language processing techniques and on machine learning. Specifically, one of the tools is based on Conditional Random Fields, a supervised machine learning model that has been used for named entity recognition in several languages, while the other tools follow more traditional natural language processing approaches. The comparison results indicate advantages for different tools depending on the class of named entity. Despite this balance among the tools, we conclude by pointing out foreseeable advantages of the machine learning based tool.
Topics Tools, Systems, Applications
Full paper Comparative Analysis of Portuguese Named Entities Recognition Tools
