Integration of Data Mining and Data Warehousing: a practical methodology

aut.relation.endpage46
aut.relation.issue3
aut.relation.startpage31
aut.relation.volume2
dc.contributor.authorUsman, M
dc.contributor.authorPears, R
dc.date.accessioned2013-01-14T22:42:03Z
dc.date.available2013-01-14T22:42:03Z
dc.date.copyright2010
dc.date.issued2010
dc.description.abstractThe ever growing repository of data in all fields poses new challenges to the modern analytical systems. Real-world datasets, with mixed numeric and nominal variables, are difficult to analyze and require effective visual exploration that conveys semantic relationships of data. Traditional data mining techniques such as clustering clusters only the numeric data. Little research has been carried out in tackling the problem of clustering high cardinality nominal variables to get better insight of underlying dataset. Several works in the literature proved the likelihood of integrating data mining with warehousing to discover knowledge from data. For the seamless integration, the mined data has to be modeled in form of a data warehouse schema. Schema generation process is complex manual task and requires domain and warehousing familiarity. Automated techniques are required to generate warehouse schema to overcome the existing dependencies. To fulfill the growing analytical needs and to overcome the existing limitations, we propose a novel methodology in this paper that permits efficient analysis of mixed numeric and nominal data, effective visual data exploration, automatic warehouse schema generation and integration of data mining and warehousing. The proposed methodology is evaluated by performing case study on real-world data set. Results show that multidimensional analysis can be performed in an easier and flexible way to discover meaningful knowledge from large datasets.
dc.identifier.citationJournal of Advancements in Computing Technology, vol.2(3), pp.31 - 46
dc.identifier.doi10.4156/ijact.vol2.issue3.4
dc.identifier.roid18737en_NZ
dc.identifier.urihttps://hdl.handle.net/10292/4999
dc.publisherInternational Journal of Advancements in Computing Technology (IJACT)
dc.rightsIntegrated Publishing Association which supports IJACIT is a RoMeo green publisher– RoMEO is a database of Publishers copyright and self archiving policies hosted be the University of Nottingham and we had also signed in the Budapest open access initiative to show our commitment towards open access publishing.
dc.rights.accessrightsOpenAccess
dc.subjectAutomatic Schema, Clustering, Data Warehouse, Multi-dimensional Analysis
dc.titleIntegration of Data Mining and Data Warehousing: a practical methodology
dc.typeJournal Article
pubs.elements-id15210
pubs.organisational-data/AUT
pubs.organisational-data/AUT/Design & Creative Technologies
pubs.organisational-data/AUT/Design & Creative Technologies/School of Computing & Mathematical Science
pubs.organisational-data/AUT/PBRF Researchers
pubs.organisational-data/AUT/PBRF Researchers/Design & Creative Technologies PBRF Researchers
pubs.organisational-data/AUT/PBRF Researchers/Design & Creative Technologies PBRF Researchers/DCT C & M Computing
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
IJACT2-3.41.pdf
Size:
526.75 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
licence.htm
Size:
29.98 KB
Format:
Unknown data format
Description: