Tagging Named Entities in Croatian Tweets
2017 ◽
Vol 4
(1)
◽
pp. 20-41
Keyword(s):
Set Size
◽
Named entity extraction tools designed for recognizing named entities in texts written in standard language (e.g., news stories or legal texts) have been shown to be inadequate for user-generated textual content (e.g., tweets, forum posts). In this work, we propose a supervised approach to named entity recognition and classification for Croatian tweets. We compare two sequence labelling models: a hidden Markov model (HMM) and conditional random fields (CRF). Our experiments reveal that CRF is the best model for the task, achieving a very good performance of over 87% micro-averaged F1 score. We analyse the contributions of different feature groups and influence of the training set size on the performance of the CRF model.
2019 ◽
2019 ◽
Vol 8
(2)
◽
pp. 4211-4216
2020 ◽
Vol 10
(2)
◽
pp. 1544
◽
2020 ◽
2014 ◽
Vol 571-572
◽
pp. 1202-1205
2020 ◽
Vol 9
(6)
◽
pp. 1-22