About

Log in?

DTU users get better search results including licensed content and discounts on order fees.

Anyone can log in and get personalized features such as favorites, tags and feeds.

Log in as DTU user Log in as non-DTU user No thanks

DTU Findit

Journal article

Optimizing ETL by a Two-level Data Staging Method

From

Department of Management Engineering, Technical University of Denmark1

Systems Analysis, Department of Management Engineering, Technical University of Denmark2

DTU Climate Centre, Systems Analysis, Department of Management Engineering, Technical University of Denmark3

University College of Northern Denmark4

CITIES - Centre for IT-Intelligent Energy Systems, Centers, Technical University of Denmark5

In data warehousing, the data from source systems are populated into a central data warehouse (DW) through extraction, transformation and loading (ETL). The standard ETL approach usually uses sequential jobs to process the data with dependencies, such as dimension and fact data. It is a non-trivial task to process the so-called early-/late-arriving data, which arrive out of order.

This paper proposes a two-level data staging area method to optimize ETL. The proposed method is an all-in-one solution that supports processing different types of data from operational systems, including early-/late-arriving data, and fast-/slowly-changing data. The introduced additional staging area decouples loading process from data extraction and transformation, which improves ETL flexibility and minimizes intervention to the data warehouse.

This paper evaluates the proposed method empirically, which shows that it is more efficient and less intrusive than the standard ETL method.

Language: English
Year: 2016
Pages: 32-50
ISSN: 15483932 and 15483924
Types: Journal article
DOI: 10.4018/IJDWM.2016070103
ORCIDs: Liu, Xiufeng and Nielsen, Per Sieverts

DTU users get better search results including licensed content and discounts on order fees.

Log in as DTU user

Access

Analysis