Integration of Data Mining and Data Warehousing: a practical methodology

aut.researcherPears, Russel Lawrence
dc.contributor.authorUsman, M
dc.contributor.authorPears, R
dc.date.accessioned2011-08-11T22:49:38Z
dc.date.available2011-08-11T22:49:38Z
dc.date.copyright2010
dc.date.issued2010
dc.description.abstractThe ever growing repository of data in all fields poses new challenges to the modern analytical systems. Real-world datasets, with mixed numeric and nominal variables, are difficult to analyze and require effective visual exploration that conveys semantic relationships of data. Traditional data mining techniques such as clustering clusters only the numeric data. Little research has been carried out in tackling the problem of clustering high cardinality nominal variables to get better insight of underlying dataset. Several works in the literature proved the likelihood of integrating data mining with warehousing to discover knowledge from data. For the seamless integration, the mined data has to be modeled in form of a data warehouse schema. Schema generation process is complex manual task and requires domain and warehousing familiarity. Automated techniques are required to generate warehouse schema to overcome the existing dependencies. To fulfill the growing analytical needs and to overcome the existing limitations, we propose a novel methodology in this paper that permits efficient analysis of mixed numeric and nominal data, effective visual data exploration, automatic warehouse schema generation and integration of data mining and warehousing. The proposed methodology is evaluated by performing case study on real-world data set. Results show that multidimensional analysis can be performed in an easier and flexible way to discover meaningful knowledge from large datasets.
dc.identifier.citationInternational Journal of Advancements in Computing Technology, vol.2(3), pp.31 - 46
dc.identifier.doi10.4156/ijact.vol2.issue3.4
dc.identifier.issn2233-9337
dc.identifier.urihttps://hdl.handle.net/10292/1684
dc.publisherAdvanced Institute of Convergence IT (AICIT)
dc.rightsAICIT journals are open access journal which do not sell published papers and offer all published papers free of cost to all members, researchers, libraries, organizations, companies and universities.
dc.rights.accessrightsOpenAccess
dc.subjectAutomatic schema
dc.subjectClustering
dc.subjectData Warehouse
dc.subjectMulti-dimensional analysis
dc.titleIntegration of Data Mining and Data Warehousing: a practical methodology
dc.typeJournal Article
pubs.organisational-data/AUT
pubs.organisational-data/AUT/Design & Creative Technologies
pubs.organisational-data/AUT/PBRF Researchers
pubs.organisational-data/AUT/PBRF Researchers/Design & Creative Technologies PBRF Researchers
pubs.organisational-data/AUT/PBRF Researchers/Design & Creative Technologies PBRF Researchers/DCT C & M Computing
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
DataMiningDataWarehousing.pdf
Size:
526.75 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
licence.htm
Size:
29.98 KB
Format:
Unknown data format
Description: