Named Entity Recognition for Clinical Portuguese Corpus with Conditional Random Fields and Semantic Groups
Keyword(s):
Considering the difficulties of extracting entities from Electronic Health Records (EHR) texts in Portuguese, we explore the Conditional Random Fields (CRF) algorithm to build a Named Entity Recognition (NER) system based on a corpus of clinical Portuguese data annotated by experts. We acquaint the challenges and methods to classify Abbreviations, Disorders, Procedures and Chemicals within the texts. By selecting a meaningful set of features, and parameters with the best performance the results demonstrate that the method is promising and may support other biomedical tasks, nonetheless, further experiments with more features, different architectures and sophisticated preprocessing steps are needed.
2018 ◽
Vol 7
(1)
◽
pp. 1-15
◽
Named Entity Recognition Over Electronic Health Records Through a Combined Dictionary-based Approach
2016 ◽
Vol 100
◽
pp. 55-61
◽
2019 ◽
Vol 19
(S7)
◽
2020 ◽