Show simple item record

dc.contributor.authorUsman, M
dc.contributor.authorPears, R
dc.date.accessioned2011-08-11T22:49:38Z
dc.date.available2011-08-11T22:49:38Z
dc.date.copyright2010
dc.date.issued2011-08-12
dc.identifier.citationInternational Journal of Advancements in Computing Technology, vol.2(3), pp.31 - 46
dc.identifier.issn2233-9337
dc.identifier.urihttp://hdl.handle.net/10292/1684
dc.description.abstractThe ever growing repository of data in all fields poses new challenges to the modern analytical systems. Real-world datasets, with mixed numeric and nominal variables, are difficult to analyze and require effective visual exploration that conveys semantic relationships of data. Traditional data mining techniques such as clustering clusters only the numeric data. Little research has been carried out in tackling the problem of clustering high cardinality nominal variables to get better insight of underlying dataset. Several works in the literature proved the likelihood of integrating data mining with warehousing to discover knowledge from data. For the seamless integration, the mined data has to be modeled in form of a data warehouse schema. Schema generation process is complex manual task and requires domain and warehousing familiarity. Automated techniques are required to generate warehouse schema to overcome the existing dependencies. To fulfill the growing analytical needs and to overcome the existing limitations, we propose a novel methodology in this paper that permits efficient analysis of mixed numeric and nominal data, effective visual data exploration, automatic warehouse schema generation and integration of data mining and warehousing. The proposed methodology is evaluated by performing case study on real-world data set. Results show that multidimensional analysis can be performed in an easier and flexible way to discover meaningful knowledge from large datasets.
dc.publisherAdvanced Institute of Convergence IT (AICIT)
dc.rightsAICIT journals are open access journal which do not sell published papers and offer all published papers free of cost to all members, researchers, libraries, organizations, companies and universities.
dc.subjectAutomatic schema
dc.subjectClustering
dc.subjectData Warehouse
dc.subjectMulti-dimensional analysis
dc.titleIntegration of Data Mining and Data Warehousing: a practical methodology
dc.typeJournal Article
dc.rights.accessrightsOpenAccess
dc.identifier.doi10.4156/ijact.vol2.issue3.4


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record