ETL Process Modeling Conceptual for Data Warehouses: A Systematic Mapping Study

BACKGROUND: A data warehouse (DW) is an integrated collection of subject-oriented data in the support of decision making. Importantly, the integration of data sources is achieved through the use of ETL (Extract, Transform, and Load) processes. It is therefore extensively recognized that the appropri...

Descripción completa

Autores Principales: Muñoz, Lilia, Mazón, Jose Norberto, Juan, Trujillo
Formato: Artículo
Idioma: Inglés
Inglés
Publicado: 2018
Materias:
Acceso en línea: https://ieeexplore.ieee.org/abstract/document/5893784/
http://ridda2.utp.ac.pa/handle/123456789/4918
http://ridda2.utp.ac.pa/handle/123456789/4918
id RepoUTP4918
recordtype dspace
spelling RepoUTP49182021-07-06T15:35:09Z ETL Process Modeling Conceptual for Data Warehouses: A Systematic Mapping Study Muñoz, Lilia Mazón, Jose Norberto Juan, Trujillo Unified modeling language Data models Data warehouses Load modeling Systematics Libraries Data mining Unified modeling language Data models Data warehouses Load modeling Systematics Libraries Data mining BACKGROUND: A data warehouse (DW) is an integrated collection of subject-oriented data in the support of decision making. Importantly, the integration of data sources is achieved through the use of ETL (Extract, Transform, and Load) processes. It is therefore extensively recognized that the appropriate design of the ETL processes are key factors in the success of DW projects. OBJECTIVE: We assess existing research proposals about ETL process modeling for data warehouse in order to identify their main characteristics, notation, and activities. We also study if these modeling approaches are supported by some kind of prototype or tool. METHOD: We have undertaken a systematic mapping study of the research literature about modeling ETL processes. A mapping study provides a systematic and objective procedure for identifying the nature and extent of the available research by means of research questions. RESULTS: The study is based on a comprehensive set of papers obtained after using a multi-stage selection criteria and are published in international workshops, conferences and journals between 2000 and 2009. CONCLUSIONS: This systematic mapping study states that there is a clear classification of ETL process modeling approaches, but that they are not enough covered by researchers. Therefore, more effort is required to bridge the research gap in modeling ETL processes. BACKGROUND: A data warehouse (DW) is an integrated collection of subject-oriented data in the support of decision making. Importantly, the integration of data sources is achieved through the use of ETL (Extract, Transform, and Load) processes. It is therefore extensively recognized that the appropriate design of the ETL processes are key factors in the success of DW projects. OBJECTIVE: We assess existing research proposals about ETL process modeling for data warehouse in order to identify their main characteristics, notation, and activities. We also study if these modeling approaches are supported by some kind of prototype or tool. METHOD: We have undertaken a systematic mapping study of the research literature about modeling ETL processes. A mapping study provides a systematic and objective procedure for identifying the nature and extent of the available research by means of research questions. RESULTS: The study is based on a comprehensive set of papers obtained after using a multi-stage selection criteria and are published in international workshops, conferences and journals between 2000 and 2009. CONCLUSIONS: This systematic mapping study states that there is a clear classification of ETL process modeling approaches, but that they are not enough covered by researchers. Therefore, more effort is required to bridge the research gap in modeling ETL processes. 2018-06-14T13:37:43Z 2018-06-14T13:37:43Z 2018-06-14T13:37:43Z 2018-06-14T13:37:43Z 06/16/2011 06/16/2011 info:eu-repo/semantics/article info:eu-repo/semantics/publishedVersion https://ieeexplore.ieee.org/abstract/document/5893784/ 1548-0992 http://ridda2.utp.ac.pa/handle/123456789/4918 http://ridda2.utp.ac.pa/handle/123456789/4918 eng eng info:eu-repo/semantics/embargoedAccess application/pdf text/html
institution Universidad Tecnológica de Panamá
collection Repositorio UTP – Ridda2
language Inglés
Inglés
topic Unified modeling language
Data models
Data warehouses
Load modeling
Systematics
Libraries
Data mining
Unified modeling language
Data models
Data warehouses
Load modeling
Systematics
Libraries
Data mining
spellingShingle Unified modeling language
Data models
Data warehouses
Load modeling
Systematics
Libraries
Data mining
Unified modeling language
Data models
Data warehouses
Load modeling
Systematics
Libraries
Data mining
Muñoz, Lilia
Mazón, Jose Norberto
Juan, Trujillo
ETL Process Modeling Conceptual for Data Warehouses: A Systematic Mapping Study
description BACKGROUND: A data warehouse (DW) is an integrated collection of subject-oriented data in the support of decision making. Importantly, the integration of data sources is achieved through the use of ETL (Extract, Transform, and Load) processes. It is therefore extensively recognized that the appropriate design of the ETL processes are key factors in the success of DW projects. OBJECTIVE: We assess existing research proposals about ETL process modeling for data warehouse in order to identify their main characteristics, notation, and activities. We also study if these modeling approaches are supported by some kind of prototype or tool. METHOD: We have undertaken a systematic mapping study of the research literature about modeling ETL processes. A mapping study provides a systematic and objective procedure for identifying the nature and extent of the available research by means of research questions. RESULTS: The study is based on a comprehensive set of papers obtained after using a multi-stage selection criteria and are published in international workshops, conferences and journals between 2000 and 2009. CONCLUSIONS: This systematic mapping study states that there is a clear classification of ETL process modeling approaches, but that they are not enough covered by researchers. Therefore, more effort is required to bridge the research gap in modeling ETL processes.
format Artículo
author Muñoz, Lilia
Mazón, Jose Norberto
Juan, Trujillo
author_sort Muñoz, Lilia
title ETL Process Modeling Conceptual for Data Warehouses: A Systematic Mapping Study
title_short ETL Process Modeling Conceptual for Data Warehouses: A Systematic Mapping Study
title_full ETL Process Modeling Conceptual for Data Warehouses: A Systematic Mapping Study
title_fullStr ETL Process Modeling Conceptual for Data Warehouses: A Systematic Mapping Study
title_full_unstemmed ETL Process Modeling Conceptual for Data Warehouses: A Systematic Mapping Study
title_sort etl process modeling conceptual for data warehouses: a systematic mapping study
publishDate 2018
url https://ieeexplore.ieee.org/abstract/document/5893784/
http://ridda2.utp.ac.pa/handle/123456789/4918
http://ridda2.utp.ac.pa/handle/123456789/4918
_version_ 1785813575531495424
score 12.140644