• An event-based near real-time data integration architecture

      Naeem, M; Dobbie, G; Weber, G (IEEE Computer Society, 2008)
      Extract-Transform-Load (ETL) tools feed data from operational databases into data warehouses. Traditionally, these ETL tools use batch processing and operate offline at regular time intervals, for example on a nightly or ...
    • Comparing global optimization and default settings of stream-based joins

      Naeem, M; Dobbie, G; Weber, G (Springer, 2009)
      One problem encountered in real-time data integration is the join of a continuous incoming data stream with a disk-based relation. In this paper we investigate a stream-based join algorithm, called mesh join (MESHJOIN), ...
    • HYBRIDJOIN for Near Real-time Data Warehousing

      Naeem, M; Dobbie, G; Weber, G (University of Auckland, 2010)
      In order to make timely and effective decisions, businesses need the latest information from data warehouse repositories. To keep these repositories up-to-date with respect to the end user updates, near real-time data ...
    • HYBRIDJOIN for near-real-time Data Warehousing

      Naeem, MA; Dobbie, G; Weber, G (IGI Publishers, 2011)
      An important component of near-real-time data warehouses is the near-real-time integration layer. One important element in near-real-time data integration is the join of a continuous input data stream with a diskbased ...
    • Optimised X-HYBRIDJOIN for near-real-time data warehousing

      Naeem, M; Dobbie, G; Weber, G (Australian Computer Society, 2012)
      Stream-based join algorithms are needed in modern near-real-time data warehouses. A particular class of stream-based join algorithms, with MESHJOIN as a typical example, computes the join between a stream and a disk-based ...
    • Skewed Distributions in Semi-stream Joins: How Much Can Caching Help?

      Naeem, MA; Dobbie, G; Lutteroth, C; Weber, G (Elsevier, 2016)
      Semi-stream join algorithms join a fast data stream with a disk-based relation. This is important, for example, in real-time data warehousing where a stream of transactions is joined with master data before loading it into ...
    • X-HYBRIDJOIN for near-real-time Data Warehousing

      Naeem, MA; Dobbie, G; Weber, G (Springer-Verlag, 2011)
      In order to make timely and effective decisions, businesses need the latest information from data warehouse repositories. To keep these repositories up-to-date with respect to end user updates, near-real-time data integration ...