Real Text-CS - Corpus based domain independent content selection model

Perera, R; Nand, P

doi:10.1109/ICTAI.2014.95

Files

ICTAI2014.pdf

Size: 662.79 KB, File format: Adobe PDF

Date

2014-11

Authors

Perera, R

Nand, P

Item type

Conference Contribution

Publisher

IEEE

Abstract

Content selection is a highly domain-dependent task responsible for retrieving relevant information from a knowledge source for a given communicative goal. This paper presents a domain-independent content selection model that uses keywords as the communicative goal. We employ the DBpedia triple store as our knowledge source, and triples are selected based on the weight assigned to each triple. The weights are calculated using the log likelihood distance between a domain corpus and a general reference corpus. The method was evaluated using keywords extracted from the QALD dataset, and its performance was compared with cross entropy based statistical content selection. The evaluation results showed that the proposed method can perform 32% better than cross entropy based statistical content selection.
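As an illustration of the weighting idea described in the abstract, the following is a minimal sketch of the standard two-corpus log likelihood statistic, applied to rank a few toy triples. It is not taken from the paper: the function names `log_likelihood` and `rank_triples`, the choice to score a triple by its predicate label, and all term counts are assumptions made purely for the example.

```python
import math


def log_likelihood(freq_domain, freq_ref, size_domain, size_ref):
    """Standard two-corpus log likelihood statistic for a single term.

    freq_* are the term's counts in each corpus; size_* are total token counts.
    """
    total = freq_domain + freq_ref
    combined = size_domain + size_ref
    # Expected counts if the term were equally likely in both corpora.
    expected_domain = size_domain * total / combined
    expected_ref = size_ref * total / combined
    score = 0.0
    for observed, expected in ((freq_domain, expected_domain),
                               (freq_ref, expected_ref)):
        if observed > 0:  # a zero count contributes nothing to the sum
            score += observed * math.log(observed / expected)
    return 2.0 * score


def rank_triples(triples, domain_counts, ref_counts):
    """Hypothetical ranking: weight each triple by its predicate's score."""
    size_domain = sum(domain_counts.values())
    size_ref = sum(ref_counts.values())

    def weight(triple):
        predicate = triple[1]
        return log_likelihood(domain_counts.get(predicate, 0),
                              ref_counts.get(predicate, 0),
                              size_domain, size_ref)

    return sorted(triples, key=weight, reverse=True)


# Toy counts, invented for illustration only.
domain_counts = {"birthPlace": 30, "wikiPageID": 10, "other": 960}
ref_counts = {"birthPlace": 10, "wikiPageID": 10, "other": 980}
triples = [
    ("Some_Person", "wikiPageID", "12345"),
    ("Some_Person", "birthPlace", "Some_City"),
]
ranked = rank_triples(triples, domain_counts, ref_counts)
```

Note that this statistic is unsigned: a term that is unusually rare in the domain corpus scores as highly as one that is unusually frequent, so a practical selector would likely also check the direction of the difference.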

Keywords

Content selection, Natural Language Generation, Natural Language Processing, Semantic web

Source

Published in: IEEE 26th International Conference on Tools with Artificial Intelligence (ICTAI), pp. 599-606

DOI

10.1109/ICTAI.2014.95

Publisher's version

http://dx.doi.org/10.1109/ICTAI.2014.95

Rights statement

Copyright © 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

Permanent link

https://hdl.handle.net/10292/8596

Collections

School of Engineering, Computer and Mathematical Sciences - Te Kura Mātai Pūhanga, Rorohiko, Pāngarau
