Information extraction from free text comments in questionnaires

Ramachandran, Kartik

Information extraction from free text comments in questionnaires

aut.author.twitter	@TheRealKrama
aut.embargo	No	en_NZ
aut.thirdpc.contains	No	en_NZ
dc.contributor.advisor	Tegginmath, Shoba
dc.contributor.advisor	Nand, Parma
dc.contributor.author	Ramachandran, Kartik
dc.date.accessioned	2018-01-25T22:40:41Z
dc.date.available	2018-01-25T22:40:41Z
dc.date.copyright	2018
dc.date.issued	2018
dc.date.updated	2018-01-25T11:45:35Z
dc.description.abstract	The last 15 years have seen a tremendous explosion in the amount of information available, encoded both in structured forms such as databases and XML files as well as free, naturally occurring forms such as HTML pages and word documents. This availability of free texts has created a need for automated text processing tools so that information can be extracted in a timely and effective manner. This research investigated the extraction of information from free text responses to open-ended questions in questionnaires. The research undertook to develop a framework for analyzing open question responses to extract structured information which can then be conflated with the closed question responses in order to produce a more informative report from the survey, in particular to determine the sentiment expressed in the response. Specifically, this research will help in understanding the positive or negative nature of the respondent’s answers through the creation of software tools using Natural Language Toolkit (NLTK) and data mining and Natural Language Processing techniques and will help surveyors (Health centers, doctors, data analysts) obtain additional information from surveys. There is also a discussion of existing sentiment analysis solutions as well as the different components and ways of analyzing sentiment and creating a Natural Language Processing tool which would be interesting to future developers of such systems. This research was successfully able to classify free text responses as positive or negative. While we appreciate that more time to fine tune the application and perform more training and testing would have been useful, the results obtained are promising. We have successfully developed a platform which can be used for generating a custom corpus and provide interested developers a starting framework to develop sentiment analysis tools.	en_NZ
dc.identifier.uri	https://hdl.handle.net/10292/11141
dc.language.iso	en	en_NZ
dc.publisher	Auckland University of Technology
dc.rights.accessrights	OpenAccess
dc.subject	Natural Language Processing	en_NZ
dc.subject	Data mining	en_NZ
dc.subject	Information extraction	en_NZ
dc.subject	NLTK	en_NZ
dc.title	Information extraction from free text comments in questionnaires	en_NZ
dc.type	Thesis	en_NZ
thesis.degree.grantor	Auckland University of Technology
thesis.degree.level	Masters Theses
thesis.degree.name	Master of Computer and Information Sciences	en_NZ

Files

Original bundle

Now showing 1 - 1 of 1

Name:: RamachandranK.pdf
Size:: 2.13 MB
Format:: Adobe Portable Document Format
Description:: Whole thesis

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 897 B
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Masters Theses