Near closed frequent itemsets to accelerate the generation of association rules in a data stream environment

aut.embargoNoen
aut.thirdpc.containsNo
aut.thirdpc.permissionNo
aut.thirdpc.removedNo
dc.contributor.advisorPears, Russel
dc.contributor.authorViriyarattanaporn, Nathaphong
dc.date.accessioned2010-07-12T04:39:10Z
dc.date.available2010-07-12T04:39:10Z
dc.date.copyright2010
dc.date.issued2010
dc.date.updated2010-07-12T04:32:04Z
dc.description.abstractThe subject of this research is mining data stream. It is one of the most challenging and widely researched areas in Knowledge Discovery and Data Mining (KDD). A data stream is a continuous, voluminous, and unpredictable flow of data which occurs in many application domains. In a previous study, Data Stream Mining (DSM) algorithm was proposed to overcome these problems on association rules mining. It was built using various techniques such as closed frequent itemsets, tree data structures, itemsets pruning, and statistical sampling. We have developed Near Closed Nodes algorithms, which can be applied to algorithms for mining association rules that utilised closed itemsets structure. In this study, we look into the characteristics of closed frequent itemsets and propose a novel concept called Near Closed Nodes (NCN). This concept was thoroughly explored and later developed in conjunction with an existing DSM algorithm. By incorporating NCN into the DSM algorithm, we were able to increase the performance of both speed and memory usage. A comprehensive experimental study was performed to compare the performance of DSM and DSM-NCN using both simulated and real world datasets. Based on the results from the experimental study, we concluded that DSM-NCN outperformed DSM in most circumstances, especially when the datasets were dense.
dc.identifier.urihttps://hdl.handle.net/10292/965
dc.language.isoenen_NZ
dc.publisherAuckland University of Technology
dc.rights.accessrightsOpenAccess
dc.subjectAssociation rules
dc.subjectData stream mining
dc.subjectNear closed node
dc.subjectComputer science
dc.titleNear closed frequent itemsets to accelerate the generation of association rules in a data stream environment
dc.typeThesis
thesis.degree.grantorAuckland University of Technology
thesis.degree.levelMasters Dissertations
thesis.degree.nameMaster of Computer and Information Sciences
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ViriyarattanapornN.pdf
Size:
3.54 MB
Format:
Adobe Portable Document Format
Description:
Whole thesis
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
895 B
Format:
Item-specific license agreed upon to submission
Description: