Named Entity Recognition With Deep Learning

Yu, Haobin

Named Entity Recognition With Deep Learning

Files

Thesis(2.38 MB)

Date

2019

Authors

Yu, Haobin

Supervisor

Nand, Parma

Item type

Thesis

Degree name

Master of Computer and Information Sciences

Publisher

Auckland University of Technology

Abstract

Named entity recognition is a significant subtask of natural language processing. Traditional methods require large numbers of feature engineering and handcrafted additional dictionaries to achieve high performance. Nowadays, deep learning is widely applied for image and natural language processing domains. Through the deep neural network, features are automatically extracted from the training data, and this avoids most feature engineering. Some deep neural network based methods are also used for the modelling sequence labelled problem. Throughout these methods, the performance of named entity recognition can be improved. In this thesis, we have mainly used the bidirectional LSTM model based on deep learning as an architecture, word embedding and Convolutional Neural Network (CNN) as a word-level and char-level feature extractor. The conditional random field (CRF) layer is used to output the predicted labels. We used the CoNLL 2003 dataset and the Broad Twitter Corpus dataset training and testing the Bi-LSTM-CNN-CRF model respectively and got satisfactory results. In order to evaluate the Bi-LSTM-CNN-CRF model, we compared it with three popular machine learning methods. The three popular machine learning methods include the Support Vector Machine (SVM), the Hidden Markov Model (HMM), and the Conditional Random Field (CRF). The results of the evaluation show that the Bi-LSTM-CNN-CRF model with only given labelled text and pre-training word embedding surpasses the three machine learning methods that employ handcrafted feature engineering.

Keywords

Named entity recognition , Deep learning , Deep Neural Network , Convolutional Neural Network (CNN) , Bi-LSTM , Support Vector Machine (SVM) , Hidden Markov Model (HMM) , Conditional Random Field (CRF) , Word Embedding

Permanent link

https://hdl.handle.net/10292/12936

Collections

Masters Theses

Full item page