Information extraction from free text comments in questionnaires

aut.author.twitter: @TheRealKrama
aut.embargo: No [en_NZ]
aut.thirdpc.contains: No [en_NZ]
dc.contributor.advisor: Tegginmath, Shoba
dc.contributor.advisor: Nand, Parma
dc.contributor.author: Ramachandran, Kartik
dc.date.accessioned: 2018-01-25T22:40:41Z
dc.date.available: 2018-01-25T22:40:41Z
dc.date.copyright: 2018
dc.date.issued: 2018
dc.date.updated: 2018-01-25T11:45:35Z
dc.description.abstract: The last 15 years have seen an explosion in the amount of information available, encoded both in structured forms such as databases and XML files and in free, naturally occurring forms such as HTML pages and Word documents. This availability of free text has created a need for automated text processing tools so that information can be extracted in a timely and effective manner. This research investigated the extraction of information from free text responses to open-ended questions in questionnaires. It set out to develop a framework for analyzing open question responses to extract structured information, in particular the sentiment expressed in the response, which can then be combined with the closed question responses to produce a more informative survey report. Specifically, the research helps in understanding the positive or negative nature of respondents' answers through software tools built with the Natural Language Toolkit (NLTK), data mining, and Natural Language Processing techniques, and helps surveyors (health centers, doctors, data analysts) obtain additional information from surveys. The thesis also discusses existing sentiment analysis solutions, the different components and ways of analyzing sentiment, and the creation of a Natural Language Processing tool, which may interest future developers of such systems. The research successfully classified free text responses as positive or negative. Although more time to fine-tune the application and to perform more training and testing would have been useful, the results obtained are promising. We have developed a platform that can be used to generate a custom corpus and that gives interested developers a starting framework for developing sentiment analysis tools. [en_NZ]
dc.identifier.uri: https://hdl.handle.net/10292/11141
dc.language.iso: en [en_NZ]
dc.publisher: Auckland University of Technology
dc.rights.accessrights: OpenAccess
dc.subject: Natural Language Processing [en_NZ]
dc.subject: Data mining [en_NZ]
dc.subject: Information extraction [en_NZ]
dc.subject: NLTK [en_NZ]
dc.title: Information extraction from free text comments in questionnaires [en_NZ]
dc.type: Thesis [en_NZ]
thesis.degree.grantor: Auckland University of Technology
thesis.degree.level: Masters Theses
thesis.degree.name: Master of Computer and Information Sciences [en_NZ]
Files
Original bundle
Name: RamachandranK.pdf
Size: 2.13 MB
Format: Adobe Portable Document Format
Description: Whole thesis
License bundle
Name: license.txt
Size: 897 B
Format: Item-specific license agreed upon to submission