Challenges in anaphora resolution in the news media genre

aut.conference.typePaper Published in Proceedings
aut.relation.endpage160
aut.relation.pages5
aut.relation.startpage156
dark.contributor.authorNand, P
dc.contributor.authorNand, P
dc.contributor.editorYeo, A
dc.contributor.editorLabadin, J
dc.contributor.editorChai, W
dc.contributor.editorEng, T
dc.date.accessioned2012-05-16T21:44:32Z
dc.date.available2012-05-16T21:44:32Z
dc.date.copyright2007
dc.date.issued2007
dc.description.abstractThis paper discusses the characteristics of anaphora found in written News Media genre and describes an initial implementation of an algorithm to resolve the anaphors. The paper forms part of wider research aimed at using resolved anaphors for text visualization. The algorithm described is light weight and incremental, in that, it builds vocabulary as it processes documents to be used in future. The input data is also tested on two publicly available anaphora resolution algorithms and the results compared to the algorithm described in this paper. Finally it discusses the challenges in developing light weight algorithms to resolve anaphors in the News Media genre and further research to overcome some of these challenges.
dc.identifier.citationFifth International Conference on Information Technology in Asia held at Hilton Sarawak, Sarawak, Malaysia, 2007-07-09to 2007-07-12, published in: Proceedings of CITA'07, pp.156 - 160 (5)
dc.identifier.isbn983-9257-66-8
dc.identifier.roid3585en_NZ
dc.identifier.urihttps://hdl.handle.net/10292/4185
dc.relation.urihttp://web.archive.org/web/20070630164828/http://www.cita07.org/
dc.rightsNOTICE: this is the author’s version of a work that was accepted for publication. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication.
dc.rights.accessrightsOpenAccess
dc.subjectAnaphora
dc.subjectNatural Language Processing
dc.subjectResolution
dc.subjectAntecedent
dc.subjectOntology
dc.subjectPleonastic
dc.subjectNoun
dc.titleChallenges in anaphora resolution in the news media genre
dc.typeConference Contribution
pubs.organisational-data/AUT
pubs.organisational-data/AUT/Design & Creative Technologies
pubs.organisational-data/AUT/Design & Creative Technologies/School of Computing & Mathematical Science
pubs.organisational-data/AUT/PBRF Researchers
pubs.organisational-data/AUT/PBRF Researchers/Design & Creative Technologies PBRF Researchers
pubs.organisational-data/AUT/PBRF Researchers/Design & Creative Technologies PBRF Researchers/DCT C & M Computing
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
CITA'071.pdf
Size:
138.35 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
licence.htm
Size:
29.98 KB
Format:
Unknown data format
Description: