A multi-strategy approach for location mining in Tweets: AUT NLP Group entry for ALTA-2014 shared task

Nand, P; Perera, R; Lingmin, H

Date

2014

Authors

Nand, P

Perera, R

Lingmin, H

Item type

Conference Contribution

Publisher

Association for Computational Linguistics (ACL)

Abstract

This paper describes the strategy and results of a location mining system entered in the ALTA-2014 shared task competition. The task required participants to identify the location mentions in 1003 Twitter test messages, given a separate annotated training set of 2000 messages. We present an architecture that combines a basic named entity recognizer with various rule-based modules and knowledge infusion to achieve an average F score of 0.747, which placed second in the competition. The pre-trained Stanford NER on its own gives an F score of 0.532; an ensemble of other techniques raises this to 0.747. The other major location resolver was the DBpedia location list, which identified a large percentage of locations with an individual F score of 0.935.
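As a rough illustration of the idea in the abstract, the sketch below unions the output of a named entity recognizer with matches from a location list and scores the result with a balanced F score. It is a minimal sketch under stated assumptions, not the authors' system: the `GAZETTEER` entries, the `MAX_NGRAM` limit, and the function names are hypothetical, the NER output is passed in by the caller rather than produced by Stanford NER, and the set-based F score may differ from the shared task's official evaluation.

```python
import re

# Hypothetical stand-in for a DBpedia-derived location list (not the authors' data).
GAZETTEER = {"auckland", "new zealand", "sydney"}
MAX_NGRAM = 3  # assumed longest gazetteer entry, in tokens


def gazetteer_locations(text, gazetteer=GAZETTEER):
    """Return lowercase word n-grams of the text that appear in the gazetteer."""
    tokens = re.findall(r"[a-z]+", text.lower())
    found = set()
    for n in range(1, MAX_NGRAM + 1):
        for i in range(len(tokens) - n + 1):
            candidate = " ".join(tokens[i:i + n])
            if candidate in gazetteer:
                found.add(candidate)
    return found


def combine(ner_locations, text):
    """Union of NER output (supplied by the caller) and gazetteer matches."""
    return {loc.lower() for loc in ner_locations} | gazetteer_locations(text)


def f_score(predicted, gold):
    """Balanced F score (harmonic mean of precision and recall) over two sets."""
    true_positives = len(predicted & gold)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(gold)
    return 2 * precision * recall / (precision + recall)
```

Calling `combine(["Auckland"], tweet)` on a tweet that also mentions New Zealand would add the gazetteer match to the single NER hit, which is the kind of recall gain the abstract attributes to the location list.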

Source

In Proceedings of the Australasian Language Technology Association Workshop, pages 163–170.

Publisher's version

http://www.aclweb.org/anthology/U14-1024

Rights statement

ACL materials are Copyright (C) 1963-2015 ACL; other materials are copyrighted by their respective copyright holders. All materials here are licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. Permission is granted to make copies for the purposes of teaching and research.

Permanent link

https://hdl.handle.net/10292/9104

Collections

School of Engineering, Computer and Mathematical Sciences - Te Kura Mātai Pūhanga, Rorohiko, Pāngarau
