Classifying news versus opinions in newspapers: Linguistic features for domain independence
2017 ◽
Vol 23
(5)
◽
pp. 687-707
◽
Keyword(s):
AbstractNewspaper text can be broadly divided in the classes ‘opinion’ (editorials, commentary, letters to the editor) and ‘neutral’ (reports). We describe a classification system for performing this separation, which uses a set of linguistically motivated features. Working with various English newspaper corpora, we demonstrate that it significantly outperforms bag-of-lemma and PoS-tag models. We conclude that the linguistic features constitute the best method for achieving robustness against change of newspaper or domain.
2017 ◽
Vol 44
(2)
◽
pp. 184-202
◽
Keyword(s):
1972 ◽
Vol 3
(3)
◽
pp. 38-39
Keyword(s):
1974 ◽
Vol 5
(3)
◽
pp. 180-181
Keyword(s):
2012 ◽
Vol 2
(2)
◽
pp. 66-72
Keyword(s):
1998 ◽
Vol 85
(3)
◽
pp. 632-633
Keyword(s):
2001 ◽
Vol 35
(2)
◽
pp. 249-252
Keyword(s):
Keyword(s):