Using LSA for Pronominal Anaphora Resolution

A typical problem in the resolution of pronominal anaphora is the presence of more than one candidate for the antecedent of the pronoun. Considering two English sentences like (1) "People buy expensive cars because they offer more status" and (2) "People buy expensive cars because they want more status" we can see that the two NPs "people" and "expensive cars", from a purely syntactic perspective, are both legitimate candidates as antecedents for the pronoun "they". This problem has been traditionally solved by using world knowledge (e.g. schema theory), where, through an internal representation of the world, we "know" that cars "offer" status and people "want" status. The assumption in this paper is that the use of world knowledge does not explain how the disambiguation process works and alternative explanations should be explored. Using a knowledge poor approach (explicit information from the text rather than implicit world knowledge) the study investigates to what extent syntactic and semantic constraints can be used to resolve anaphora. For this purpose, 1,400 examples of the word "they" were randomly selected from a corpus of 10,000,000 words of expository text in English. Antecedent candidates for each case were then analyzed and classified in terms of their syntactic functions in the sentence (subject, object, etc.) and semantic features (+ human, + animate, etc.). It was found that syntactic constraints resolved 85% of the cases. When combined with semantic constraints the resolution rate rose to 98%. The implications of the findings for Natural Language Processing are discussed.

Download Full-text

A Machine Learning Approach to Pronominal Anaphora Resolution in Dialogue Based Intelligent Tutoring Systems

Computational Linguistics and Intelligent Text Processing - Lecture Notes in Computer Science ◽

10.1007/978-3-642-54906-9_25 ◽

2014 ◽

pp. 307-318 ◽

Cited By ~ 4

Author(s):

Nobal B. Niraula ◽

Vasile Rus

Keyword(s):

Machine Learning ◽

Intelligent Tutoring Systems ◽

Intelligent Tutoring ◽

Learning Approach ◽

Anaphora Resolution ◽

Tutoring Systems ◽

Machine Learning Approach ◽

Pronominal Anaphora

Download Full-text

Pronominal Anaphora Resolution in Punjabi Language

International Journal in Foundations of Computer Science & Technology ◽

10.5121/ijfcst.2014.4408 ◽

2014 ◽

Vol 4 (4) ◽

pp. 99-105

Author(s):

Priya Lakhmani ◽

Smita Pratistha Mathur ◽

Sudha Morwal

Keyword(s):

Anaphora Resolution ◽

Pronominal Anaphora

Download Full-text

Telugu Pronominal Anaphora Resolution

International Journal of Research and Applications ◽

10.17812/ijra.1.1(5)2014 ◽

2014 ◽

Vol 1 (1) ◽

pp. 23-30

Keyword(s):

Anaphora Resolution ◽

Pronominal Anaphora

Download Full-text

Chinese Pronominal Anaphora Resolution Based on Conditional Random Fields

2008 International Conference on Computer Science and Software Engineering ◽

10.1109/csse.2008.432 ◽

2008 ◽

Cited By ~ 1

Author(s):

Li Fei ◽

Shi Shuicai ◽

Chen Yuzhong ◽

Lv Xueqiang

Keyword(s):

Random Fields ◽

Conditional Random Fields ◽

Anaphora Resolution ◽

Pronominal Anaphora

Download Full-text

An Integrated Framework for Pronominal Anaphora Resolution in Malayalam

Advances in Science Technology and Engineering Systems Journal ◽

10.25046/aj040536 ◽

2019 ◽

Vol 4 (5) ◽

pp. 287-293

Author(s):

Ajees Arimbassery Pareed ◽

Sumam Mary Idicula

Keyword(s):

Anaphora Resolution ◽

Integrated Framework ◽

Pronominal Anaphora

Download Full-text

Pronominal Anaphora Resolution on Spanish Text

Handbook of Research on Natural Language Processing and Smart Service Systems - Advances in Computational Intelligence and Robotics ◽

10.4018/978-1-7998-4730-4.ch014 ◽

2021 ◽

pp. 309-326

Author(s):

Alonso García ◽

Martha Victoria González ◽

Francisco López-Orozco ◽

Lucero Zamora

Keyword(s):

Spanish Text ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Spanish Language ◽

Online Version ◽

Anaphora Resolution ◽

Personal Pronouns ◽

Technological Advances ◽

Pronominal Anaphora

Recent technological advances have allowed the development of numerous natural language processing applications with which users frequently interact. When interacting with this type of application, users often search for the economy of words, which promotes the use of pronouns, thereby highlighting the well-known anaphora problem. This chapter describes a proposal to approach the pronominal anaphora for the Spanish language. A set of rules (based on the Eagle standard) was designed to identify the referents of personal pronouns through the structure of the grammatical tags of the words. The proposed algorithm uses the online Freeling service to perform tokenization and tagging tasks. The performance of the algorithm was compared with an online version of Freeling, and the proposed algorithm shows better performance.

Download Full-text

Arabic Pronominal Anaphora Resolution Based on New Set of Features

Computational Linguistics and Intelligent Text Processing - Lecture Notes in Computer Science ◽

10.1007/978-3-319-75477-2_38 ◽

2018 ◽

pp. 533-544

Author(s):

Souha Mezghani Hammami ◽

Lamia Hadrich Belguith

Keyword(s):

Anaphora Resolution ◽

Pronominal Anaphora

Download Full-text

Computational Approach to Anaphora Resolution in Spanish Dialogues

Journal of Artificial Intelligence Research ◽

10.1613/jair.848 ◽

2001 ◽

Vol 15 ◽

pp. 263-287 ◽

Cited By ~ 1

Author(s):

M. Palomar ◽

P. Martinez-Barco

Keyword(s):

Noun Phrase ◽

Computational Approach ◽

Anaphora Resolution ◽

Sources Of Information ◽

Linguistic Information ◽

Structure Information ◽

Dialogue Structure ◽

Pronominal Anaphora ◽

Linguistic Constraints

This paper presents an algorithm for identifying noun-phrase antecedents of pronouns and adjectival anaphors in Spanish dialogues. We believe that anaphora resolution requires numerous sources of information in order to find the correct antecedent of the anaphor. These sources can be of different kinds, e.g., linguistic information, discourse/dialogue structure information, or topic information. For this reason, our algorithm uses various different kinds of information (hybrid information). The algorithm is based on linguistic constraints and preferences and uses an anaphoric accessibility space within which the algorithm finds the noun phrase. We present some experiments related to this algorithm and this space using a corpus of 204 dialogues. The algorithm is implemented in Prolog. According to this study, 95.9% of antecedents were located in the proposed space, a precision of 81.3% was obtained for pronominal anaphora resolution, and 81.5% for adjectival anaphora.

Download Full-text