Optimised X-HYBRIDJOIN for near-real-time data warehousing

aut.conference.typePaper Published in Proceedings
aut.relation.endpage30
aut.relation.pages10
aut.relation.startpage21
aut.relation.volume124
aut.researcherNaeem, Muhammad Asif
dc.contributor.authorNaeem, M
dc.contributor.authorDobbie, G
dc.contributor.authorWeber, G
dc.contributor.editorZhang, R
dc.contributor.editorZhang, Y
dc.date.accessioned2014-07-31T02:34:59Z
dc.date.available2014-07-31T02:34:59Z
dc.date.copyright2012-01-30
dc.date.issued2012-01-30
dc.description.abstractStream-based join algorithms are needed in modern near-real-time data warehouses. A particular class of stream-based join algorithms, with MESHJOIN as a typical example, computes the join between a stream and a disk-based relation. Recently we have presented a new algorithm X-HYBRIDJOIN (Extended Hybrid Join) in that class. X-HYBRIDJOIN achieves better performance compared to earlier algorithms by pinning frequently accessed data from the disk-based relation in main memory. Apart from being held in main memory, X-HYBRIDJOIN treats this frequently accessed data no differently than other data from the disk-based relation. In this paper we investigate whether performance can be improved by treating the frequently accessed data differently. We present a new algorithm called Optimised X-HYBRIDJOIN, which consists of two phases. One phase, called the stream-probing phase, deals with the frequently accessed part of the disk-based relation. The other one is called the disk-probing phase and deals with the other part of the disk-based relation. In experiments we found that the performance of Optimised X-HYBRIDJOIN is significantly better than the performance of X-HYBRIDJOIN. We derive the cost model for our algorithm, which allows us to tune the components of Optimised X-HYBRIDJOIN. We performed an experimental study and we validate the cost model against the experimental results.
dc.identifier.citationProceeding ADC '12 Proceedings of the Twenty-Third Australasian Database Conference - Volume 124. Pages 21-30.
dc.identifier.isbn978-1-921770-05-0
dc.identifier.urihttps://hdl.handle.net/10292/7521
dc.publisherAustralian Computer Society
dc.relation.urihttp://dl.acm.org/citation.cfm?id=2483739.2483744
dc.rightsCopyright 2012, Australian Computer Society, Inc. This paper appeared at the 23rd Australasian Database Conference (ADC 2012), Melbourne, Australia, January-February 2012. Conferences in Research and Practice in Information Technology (CRPIT), Vol. 124, Rui Zhang and Yanchun Zhang, Ed. Reproduction for academic, not-for-profit purposes permitted provided this text is included.
dc.rights.accessrightsOpenAccess
dc.titleOptimised X-HYBRIDJOIN for near-real-time data warehousing
dc.typeConference Contribution
pubs.elements-id169386
pubs.organisational-data/AUT
pubs.organisational-data/AUT/Design & Creative Technologies
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Published-version.pdf
Size:
748 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
licence.htm
Size:
30.34 KB
Format:
Unknown data format
Description: