A multi-strategy approach for location mining in tweets: AUT NLP Group entry for ALTA-2014 shared task

Nand, P; Perera, R; Sreekumar, A; Lingmin, H

Date

2014-11

Item type

Conference Contribution

Publisher

Association for Computational Linguistics (ACL)

Abstract

This paper describes the strategy and results of a location mining system entered in the ALTA-2014 shared task competition. The task required participants to identify location mentions in 1003 Twitter test messages, given a separate annotated training set of 2000 messages. We present an architecture that combines a basic named entity recognizer with various rule-based modules and knowledge infusion to achieve an average F-score of 0.747, which placed second in the competition. The pre-trained Stanford NER alone gives an F-score of 0.532; an ensemble of other techniques raises this to 0.747. The other major source of location resolution was the DBpedia location list, which identified a large percentage of the locations with an individual F-score of 0.935.
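The general idea of pairing a named entity recognizer with a location list, and scoring the result with an F-score, can be illustrated with a minimal sketch. This is not the authors' system: the tiny `GAZETTEER` set stands in for a resource such as the DBpedia location list, the NER output is simply passed in as a list of strings, and the function names `gazetteer_locations`, `combine` and `f_score` are hypothetical. The scoring shown is the standard balanced F-score over sets; the shared task's exact evaluation procedure may differ.

```python
import re

# Hypothetical stand-in for a large location list (e.g. one derived from DBpedia).
GAZETTEER = {"auckland", "new zealand", "sydney"}


def gazetteer_locations(text, gazetteer=GAZETTEER, max_words=3):
    """Return the lower-cased word n-grams of `text` that appear in the gazetteer."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    found = set()
    for n in range(1, max_words + 1):
        for i in range(len(words) - n + 1):
            candidate = " ".join(words[i:i + n])
            if candidate in gazetteer:
                found.add(candidate)
    return found


def combine(ner_locations, text):
    """Union of locations proposed by a NER tagger and by the gazetteer lookup."""
    return {loc.lower() for loc in ner_locations} | gazetteer_locations(text)


def f_score(predicted, gold):
    """Balanced F-score: harmonic mean of precision and recall over two sets."""
    true_positives = len(predicted & gold)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(gold)
    return 2 * precision * recall / (precision + recall)
```

In this sketch, a call such as `combine(["Harbour Bridge"], tweet_text)` merges whatever a tagger proposed with any gazetteer hits in the tweet, and `f_score` compares the merged set with a gold annotation. Taking a plain union favours recall; a real ensemble would likely need rules to filter false positives from the location list.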

Source

Published in: Proceedings of the Australasian Language Technology Association Workshop 2014, pp. 163-170

Publisher's version

http://www.aclweb.org/anthology/U14-1024

Rights statement

ACL materials are Copyright © 1963-2015 ACL; other materials are copyrighted by their respective copyright holders. All materials here are licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. Permission is granted to make copies for the purposes of teaching and research.

Permanent link

https://hdl.handle.net/10292/8452

Collections

School of Engineering, Computer and Mathematical Sciences - Te Kura Mātai Pūhanga, Rorohiko, Pāngarau
