Challenges in anaphora resolution in the news media genre

Date
2007
Authors
Nand, P
Item type
Conference Contribution
Abstract

This paper discusses the characteristics of anaphora found in the written news media genre and describes an initial implementation of an algorithm to resolve these anaphors. The paper forms part of wider research aimed at using resolved anaphors for text visualization. The algorithm described is lightweight and incremental, in that it builds a vocabulary as it processes documents, for use in future processing. The same input data is also run through two publicly available anaphora resolution algorithms, and the results are compared with those of the algorithm described in this paper. Finally, the paper discusses the challenges in developing lightweight algorithms to resolve anaphors in the news media genre and outlines further research to overcome some of these challenges.

Keywords
Anaphora, Natural Language Processing, Resolution, Antecedent, Ontology, Pleonastic, Noun
Source
Fifth International Conference on Information Technology in Asia, held at Hilton Sarawak, Sarawak, Malaysia, 2007-07-09 to 2007-07-12, published in: Proceedings of CITA'07, pp. 156-160 (5)
Rights statement
NOTICE: this is the author’s version of a work that was accepted for publication. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication.