A methodology for integrating and exploiting data mining techniques in the design of data warehouses

Date
2010
Authors
Usman, M
Pears, R
Supervisor
Item type
Conference Contribution
Degree name
Journal Title
Journal ISSN
Volume Title
Publisher
IEEE
Abstract

Data Warehousing and Data Mining are two mature disciplines in their own right. Yet, they have developed largely separate from each other, despite the fact that techniques developed for pattern recognition such as Clustering and Visualization in the Data Mining discipline have much to offer in the design of Data Warehouses. This is somewhat surprising, given that the two disciplines have broadly the same set of objectives, although the techniques that they employ are admittedly quite different from each other. This may be due to the lack of a suitable methodology for integrating methods such as clustering and pattern visualization into data warehousing design. In this research, we propose such a methodology and report on its application to two case studies involving real world data taken from the UCI Machine Learning repository. We demonstrate how data clustering and visualization methods, working in conjunction with each other can be used to gain new insights and build more meaningful dimensions which may not be obvious to human data warehouse designers.

Description
Keywords
Automatic schema , Clustering , Data mining , Multidimensional analyis , Warehouseing
Source
2nd International Conference on Data Mining and Intelligent Information Technology Applications, Seoul, South Korea, pp.361 - 367 (7)
DOI
Rights statement
Copyright © 2010 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.