AUT LibraryAUT
View Item 
  •   Open Research
  • AUT Faculties
  • Faculty of Design and Creative Technologies (Te Ara Auaha)
  • School of Engineering, Computer and Mathematical Sciences - Te Kura Mātai Pūhanga, Rorohiko, Pāngarau
  • View Item
  •   Open Research
  • AUT Faculties
  • Faculty of Design and Creative Technologies (Te Ara Auaha)
  • School of Engineering, Computer and Mathematical Sciences - Te Kura Mātai Pūhanga, Rorohiko, Pāngarau
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Integration of Data Mining and Data Warehousing: a practical methodology

Usman, M; Pears, R
Thumbnail
View/Open
DataMiningDataWarehousing.pdf (526.7Kb)
Permanent link
http://hdl.handle.net/10292/1684
Metadata
Show full metadata
Abstract
The ever growing repository of data in all fields poses new challenges to the modern analytical

systems. Real-world datasets, with mixed numeric and nominal variables, are difficult to analyze and require effective visual exploration that conveys semantic relationships of data. Traditional data mining techniques such as clustering clusters only the numeric data. Little research has been carried out in tackling the problem of clustering high cardinality nominal variables to get better insight of underlying dataset. Several works in the literature proved the likelihood of integrating data mining with warehousing to discover knowledge from data. For the seamless integration, the mined data has

to be modeled in form of a data warehouse schema. Schema generation process is complex manual

task and requires domain and warehousing familiarity. Automated techniques are required to generate warehouse schema to overcome the existing dependencies. To fulfill the growing analytical needs and to overcome the existing limitations, we propose a novel methodology in this paper that permits efficient analysis of mixed numeric and nominal data, effective visual data exploration, automatic warehouse schema generation and integration of data mining and warehousing. The proposed methodology is evaluated by performing case study on real-world data set. Results show that multidimensional analysis can be performed in an easier and flexible way to discover meaningful

knowledge from large datasets.
Keywords
Automatic schema; Clustering; Data Warehouse; Multi-dimensional analysis
Date
2010
Source
International Journal of Advancements in Computing Technology, vol.2(3), pp.31 - 46
Item Type
Journal Article
Publisher
Advanced Institute of Convergence IT (AICIT)
DOI
10.4156/ijact.vol2.issue3.4
Rights Statement
AICIT journals are open access journal which do not sell published papers and offer all published papers free of cost to all members, researchers, libraries, organizations, companies and universities.

Contact Us
  • Admin

Hosted by Tuwhera, an initiative of the Auckland University of Technology Library

 

 

Browse

Open ResearchTitlesAuthorsDateSchool of Engineering, Computer and Mathematical Sciences - Te Kura Mātai Pūhanga, Rorohiko, PāngarauTitlesAuthorsDate

Alternative metrics

 

Statistics

For this itemFor all Open Research

Share

 
Follow @AUT_SC

Contact Us
  • Admin

Hosted by Tuwhera, an initiative of the Auckland University of Technology Library