SlideShare a Scribd company logo
Why shift from ETL to ELT?
Author: Prakash Jalihal
Contributor: Vedvrat Shikarpur
The process of data warehousing is undergoing rapid transformation, giving rise to various new
terminologies, especially due to the shift from the traditional ETL to the new ELT. For someone new to
the process, these additional terminologies and abbreviations might seem overwhelming, some may
even ask, “Why does it matter if the L comes before the T?”
The answer lies in the infrastructure and the setup. Here is what the fuss is all about, the sequencing of
the words and more importantly, why you should be shifting from ETL to ELT.
Understanding Data Warehousing Processes
Data Warehouse or Enterprise Data Warehouse (EDW) is a system implemented for the purpose of
reporting and data analysis. They are central repositories of integrated data from disparate sources used
for generating reports.
The popular definition from Bill Inmon is, “It is a subject oriented, integrated, time variant and non-
volatile collection of data used for decision making process.”1
 Subject oriented: A data warehouse can be used to analyze a particular subject area.
 Integrated: A data warehouse integrates data from one or more disparate data sources.
 Time variant: Historical data is stored in a data warehouse.
 Nonvolatile: Once data is input in a data warehouse, it cannot be changed or altered.
What is ETL?
ETL stands for extraction, transformation and loading, and is the process of extracting data from the
source system to the data warehouse. They are critical components for feeding a data warehouse, a
business intelligence system or a big data platform.
The ETL processes are:
 Extraction: Extracts raw data into databases or storage systems
 Transformation: Simplifies data to reconcile it across source systems, perform analysis and
enrich with external lookup information. This stage also matches the format required by the
target system.
 Loading: Sourcing the resultant data into various business intelligence (BI) tools, data
warehouse or EDW, etc.
Advantages of ETL:
1. Single view interface to integrate heterogeneous data
2. Ability to join data both at the source and at the integration server with the addition of the
option to apply any business rule from within a single interface.
3. Common data infrastructure for working on data movement and data quality.
4. Parallel Processing Engine for providing exceptional performance and scalability.
1
Beye Network: Is Inmon's Data Warehouse Definition Still Accurate?
Shortcomings of ETL:
1. Migration from server to enterprise edition might require vast time and resources due to the
innumerable architectural differences in the Server and Enterprise edition.
2. No automated error handling or recovery mechanism.
3. Expensive as a solution for small or midsized companies.2
What is ELT?
Until recently, it was normal to stage data into an intermediate system before pushing it into the target
system as the target was better optimized to retrieve and report (and not to perform hard crunching of
numbers or data). This is why many preferred the ETL process, where the intermediate system would be
optimized to perform calculations and data transformation (this is the reason we call this process
transformation). This approach kept the target reporting system independent of the implementation
method during transform stage, resulting in organizations implementing three separate systems to
satisfy the requirements of each stage.
Since hardware systems today are better equipped and capable of doing a lot more, reporting and
calculations can be performed using the same system. This is where the ELT implementation comes in.
ELT stands for Extract, Load, Transform. It is an alternative to ETL as it implements the data lake. In ELT
models, data is processed on entry to the data lake, resulting in faster loading times. In most cases, the
design of the transformational technology ties closely into the platform used for reporting, giving ETL
the advantage of a better hardware and software sync up.
Advantages:
1. No need for a separate transformation engine, the work is done by the target system itself.
2. Data transformation and loading happen in parallel, so less time and resources are spent (as
only filtered, clean data is loaded into the target system)
2
ETL Tools: Major business and technical advantages and disadvantages of using DataStage ETL tool
3. ELT works with high-end data engines such as Hadoop cluster, cloud or data appliances. This
gives is additional performance and security.
4. The processing capability of data warehousing infrastructure reduces time that data spends in
transit and makes the system more cost effective.
Disadvantages:
1. The specifics of ELT development vary on platform i.e. Hadoop clusters work by breaking a
problem into smaller chunks, then distributing those chunks across a large number of machines
for processing. Some problems can be easily split, others will be much harder.
2. Developers need to be aware of the nature of the system they’re using to perform
transformations. While some systems can handle nearly any transformation, others do not have
enough resources, requiring careful planning and design.3
Comparison: ETL vs ELT
Although ETL and ELT are vastly different in terms of architecture and implementation, the main
difference lies in the rethinking of approach taken to transferring data into reporting systems. ELT takes
full advantage of technology and along the way enhances the reporting solution with added values like
tracing of data points.4
Another main attraction of ELT is the reduction in load time and the time that data is in transit, making it
not just efficient but even cost-effective. Even though ELT requires a high-end system, it drastically
reduces the number of components required. 5
Thus, despite ELT implementation being more complex compared to the one way transaction-system-to-
reporting ETL, ELT is now being preferred. Designing a proper ELT system might take some work, but the
payoff is well worth it!
In banking terms, only the data of value ends up in the Data Warehouse for ETL processes. What this
mean is that you Extract the needed data into a staging area (in relational term often staging tables or
the so called global temporary tables), segregate it from unwanted data, perform data manipulation
(Transformation) and finally Load it into target tables in a Data Warehouse. Analysts then use
appropriate BI Tools to look at macroscopic trends in the data. This makes the process of data matching,
3
Ironside: ETL vs. ELT – What’s the Big Difference?
4
Blog: Performance Architects; Difference between ETL and ELT
5
TechTarget: Extract, Load, Transform (ELT) definition
Read more about HEXANIKA’s DRAAS solution at: http://hexanika.com/company-profile/
This is where ELT works best, for it is not just confined to data deemed to be of specific value. Hadoop
(HDFS systems) can store everything from structured data (transactional databases) and unstructured
data (coming for excel sheets, emails, logs, internet, and other). As raw data and transformed data are
saved on the same machine, data linkage and lineage processes are a lot faster and more accurate. This
also drastically reduces the Total Cost of Ownership (TCO) which is an attractive proposition for various
financial institutions using Big Data Storage systems.6
In summary, ELT allows you to extract and load all data as is into HDFS, and then you can do
Transformation through Schema on Read, thereby simplifying the process of Data Warehousing.
Hexanika: Efficient, Simple and Smart!
Hexanika is a FinTech Big Data software company, which has developed a revolutionary software
platform called SmartJoinTM
for financial institutions to address data sourcing and reporting challenges
for regulatory compliance. SmartJoinTM
improves data quality while the automated nature of
6
LinkedIn Pulse: ETL or ELT and the Use Case
SmartRegTM
keeps regulatory reporting in harmony with the dynamic regulatory requirements and keeps
pace with the new developments and latest regulatory updates.
Hexanika leverages the power of ELT using distributed parallel processing, Big Data/Hadoop technology
with a secure data cloud (IBM Cloud). Understanding the high implementation costs of new systems and
the complexities involved in redesigning existing solutions, Hexanika offers a unique build that adapts to
existing architectures. This makes our solution cost-effective, efficient, simple and smart!
Read more about our solution and architecture at: http://hexanika.com/big-data-solution-architecture/
CONTACT US
USA
249 East 48 Street,
New York, NY 10017
Tel: +1 646.733.6636
INDIA
Krupa Bungalow 1187/10,
Shivaji Nagar, Pune 411005
Tel: +91 985068686
Email: info@hexanika.com

More Related Content

What's hot

The Modern Data Team for the Modern Data Stack: dbt and the Role of the Analy...
The Modern Data Team for the Modern Data Stack: dbt and the Role of the Analy...The Modern Data Team for the Modern Data Stack: dbt and the Role of the Analy...
The Modern Data Team for the Modern Data Stack: dbt and the Role of the Analy...
Databricks
 
NoSQL databases
NoSQL databasesNoSQL databases
NoSQL databases
Harri Kauhanen
 
Designing modern dw and data lake
Designing modern dw and data lakeDesigning modern dw and data lake
Designing modern dw and data lake
punedevscom
 
Apache Flink: Real-World Use Cases for Streaming Analytics
Apache Flink: Real-World Use Cases for Streaming AnalyticsApache Flink: Real-World Use Cases for Streaming Analytics
Apache Flink: Real-World Use Cases for Streaming Analytics
Slim Baltagi
 
Data Engineer's Lunch #83: Strategies for Migration to Apache Iceberg
Data Engineer's Lunch #83: Strategies for Migration to Apache IcebergData Engineer's Lunch #83: Strategies for Migration to Apache Iceberg
Data Engineer's Lunch #83: Strategies for Migration to Apache Iceberg
Anant Corporation
 
Airbyte @ Airflow Summit - The new modern data stack
Airbyte @ Airflow Summit - The new modern data stackAirbyte @ Airflow Summit - The new modern data stack
Airbyte @ Airflow Summit - The new modern data stack
Michel Tricot
 
3D: DBT using Databricks and Delta
3D: DBT using Databricks and Delta3D: DBT using Databricks and Delta
3D: DBT using Databricks and Delta
Databricks
 
Delta Lake: Optimizing Merge
Delta Lake: Optimizing MergeDelta Lake: Optimizing Merge
Delta Lake: Optimizing Merge
Databricks
 
From Data Warehouse to Lakehouse
From Data Warehouse to LakehouseFrom Data Warehouse to Lakehouse
From Data Warehouse to Lakehouse
Modern Data Stack France
 
Siligong.Data - May 2021 - Transforming your analytics workflow with dbt
Siligong.Data - May 2021 - Transforming your analytics workflow with dbtSiligong.Data - May 2021 - Transforming your analytics workflow with dbt
Siligong.Data - May 2021 - Transforming your analytics workflow with dbt
Jon Su
 
ClickHouse in Real Life. Case Studies and Best Practices, by Alexander Zaitsev
ClickHouse in Real Life. Case Studies and Best Practices, by Alexander ZaitsevClickHouse in Real Life. Case Studies and Best Practices, by Alexander Zaitsev
ClickHouse in Real Life. Case Studies and Best Practices, by Alexander Zaitsev
Altinity Ltd
 
ETL Technologies.pptx
ETL Technologies.pptxETL Technologies.pptx
ETL Technologies.pptx
Gaurav Bhatnagar
 
ETL Testing Training Presentation
ETL Testing Training PresentationETL Testing Training Presentation
ETL Testing Training Presentation
Apurba Biswas
 
Some Iceberg Basics for Beginners (CDP).pdf
Some Iceberg Basics for Beginners (CDP).pdfSome Iceberg Basics for Beginners (CDP).pdf
Some Iceberg Basics for Beginners (CDP).pdf
Michael Kogan
 
Dataflow with Apache NiFi
Dataflow with Apache NiFiDataflow with Apache NiFi
Dataflow with Apache NiFi
DataWorks Summit/Hadoop Summit
 
What is ETL testing & how to enforce it in Data Wharehouse
What is ETL testing & how to enforce it in Data WharehouseWhat is ETL testing & how to enforce it in Data Wharehouse
What is ETL testing & how to enforce it in Data Wharehouse
BugRaptors
 
NOSQL vs SQL
NOSQL vs SQLNOSQL vs SQL
NOSQL vs SQL
Mohammed Fazuluddin
 
Building Reliable Lakehouses with Apache Flink and Delta Lake
Building Reliable Lakehouses with Apache Flink and Delta LakeBuilding Reliable Lakehouses with Apache Flink and Delta Lake
Building Reliable Lakehouses with Apache Flink and Delta Lake
Flink Forward
 
Sqoop
SqoopSqoop
NewSQL - The Future of Databases?
NewSQL - The Future of Databases?NewSQL - The Future of Databases?
NewSQL - The Future of Databases?
Elvis Saravia
 

What's hot (20)

The Modern Data Team for the Modern Data Stack: dbt and the Role of the Analy...
The Modern Data Team for the Modern Data Stack: dbt and the Role of the Analy...The Modern Data Team for the Modern Data Stack: dbt and the Role of the Analy...
The Modern Data Team for the Modern Data Stack: dbt and the Role of the Analy...
 
NoSQL databases
NoSQL databasesNoSQL databases
NoSQL databases
 
Designing modern dw and data lake
Designing modern dw and data lakeDesigning modern dw and data lake
Designing modern dw and data lake
 
Apache Flink: Real-World Use Cases for Streaming Analytics
Apache Flink: Real-World Use Cases for Streaming AnalyticsApache Flink: Real-World Use Cases for Streaming Analytics
Apache Flink: Real-World Use Cases for Streaming Analytics
 
Data Engineer's Lunch #83: Strategies for Migration to Apache Iceberg
Data Engineer's Lunch #83: Strategies for Migration to Apache IcebergData Engineer's Lunch #83: Strategies for Migration to Apache Iceberg
Data Engineer's Lunch #83: Strategies for Migration to Apache Iceberg
 
Airbyte @ Airflow Summit - The new modern data stack
Airbyte @ Airflow Summit - The new modern data stackAirbyte @ Airflow Summit - The new modern data stack
Airbyte @ Airflow Summit - The new modern data stack
 
3D: DBT using Databricks and Delta
3D: DBT using Databricks and Delta3D: DBT using Databricks and Delta
3D: DBT using Databricks and Delta
 
Delta Lake: Optimizing Merge
Delta Lake: Optimizing MergeDelta Lake: Optimizing Merge
Delta Lake: Optimizing Merge
 
From Data Warehouse to Lakehouse
From Data Warehouse to LakehouseFrom Data Warehouse to Lakehouse
From Data Warehouse to Lakehouse
 
Siligong.Data - May 2021 - Transforming your analytics workflow with dbt
Siligong.Data - May 2021 - Transforming your analytics workflow with dbtSiligong.Data - May 2021 - Transforming your analytics workflow with dbt
Siligong.Data - May 2021 - Transforming your analytics workflow with dbt
 
ClickHouse in Real Life. Case Studies and Best Practices, by Alexander Zaitsev
ClickHouse in Real Life. Case Studies and Best Practices, by Alexander ZaitsevClickHouse in Real Life. Case Studies and Best Practices, by Alexander Zaitsev
ClickHouse in Real Life. Case Studies and Best Practices, by Alexander Zaitsev
 
ETL Technologies.pptx
ETL Technologies.pptxETL Technologies.pptx
ETL Technologies.pptx
 
ETL Testing Training Presentation
ETL Testing Training PresentationETL Testing Training Presentation
ETL Testing Training Presentation
 
Some Iceberg Basics for Beginners (CDP).pdf
Some Iceberg Basics for Beginners (CDP).pdfSome Iceberg Basics for Beginners (CDP).pdf
Some Iceberg Basics for Beginners (CDP).pdf
 
Dataflow with Apache NiFi
Dataflow with Apache NiFiDataflow with Apache NiFi
Dataflow with Apache NiFi
 
What is ETL testing & how to enforce it in Data Wharehouse
What is ETL testing & how to enforce it in Data WharehouseWhat is ETL testing & how to enforce it in Data Wharehouse
What is ETL testing & how to enforce it in Data Wharehouse
 
NOSQL vs SQL
NOSQL vs SQLNOSQL vs SQL
NOSQL vs SQL
 
Building Reliable Lakehouses with Apache Flink and Delta Lake
Building Reliable Lakehouses with Apache Flink and Delta LakeBuilding Reliable Lakehouses with Apache Flink and Delta Lake
Building Reliable Lakehouses with Apache Flink and Delta Lake
 
Sqoop
SqoopSqoop
Sqoop
 
NewSQL - The Future of Databases?
NewSQL - The Future of Databases?NewSQL - The Future of Databases?
NewSQL - The Future of Databases?
 

Viewers also liked

Weducate initiative mcmd project implementation progress report 2015
Weducate initiative mcmd project implementation progress report 2015Weducate initiative mcmd project implementation progress report 2015
Weducate initiative mcmd project implementation progress report 2015
raymondtaaku
 
Shijia wang_portfolio
Shijia wang_portfolioShijia wang_portfolio
Shijia wang_portfolio
Shijia Wang
 
Maslow
MaslowMaslow
Maslow
lhs1907
 
What Does it Really Take to Launch, Lead and Grow a Startup?
What Does it Really Take to Launch, Lead and Grow a Startup?What Does it Really Take to Launch, Lead and Grow a Startup?
What Does it Really Take to Launch, Lead and Grow a Startup?
ALPHA Camp
 
1-3. Australia Startup Ecosystem_021517
1-3. Australia Startup Ecosystem_0215171-3. Australia Startup Ecosystem_021517
1-3. Australia Startup Ecosystem_021517
D.CAMP
 
Design 'super' sprint
Design 'super' sprintDesign 'super' sprint
Design 'super' sprint
NTNU Smart & Sustainable Cities
 
Mm datas comemorativas_genérico_17_br nicke
Mm datas comemorativas_genérico_17_br nickeMm datas comemorativas_genérico_17_br nicke
Mm datas comemorativas_genérico_17_br nicke
Meio & Mensagem
 

Viewers also liked (7)

Weducate initiative mcmd project implementation progress report 2015
Weducate initiative mcmd project implementation progress report 2015Weducate initiative mcmd project implementation progress report 2015
Weducate initiative mcmd project implementation progress report 2015
 
Shijia wang_portfolio
Shijia wang_portfolioShijia wang_portfolio
Shijia wang_portfolio
 
Maslow
MaslowMaslow
Maslow
 
What Does it Really Take to Launch, Lead and Grow a Startup?
What Does it Really Take to Launch, Lead and Grow a Startup?What Does it Really Take to Launch, Lead and Grow a Startup?
What Does it Really Take to Launch, Lead and Grow a Startup?
 
1-3. Australia Startup Ecosystem_021517
1-3. Australia Startup Ecosystem_0215171-3. Australia Startup Ecosystem_021517
1-3. Australia Startup Ecosystem_021517
 
Design 'super' sprint
Design 'super' sprintDesign 'super' sprint
Design 'super' sprint
 
Mm datas comemorativas_genérico_17_br nicke
Mm datas comemorativas_genérico_17_br nickeMm datas comemorativas_genérico_17_br nicke
Mm datas comemorativas_genérico_17_br nicke
 

Similar to Why shift from ETL to ELT?

ETL vs ELT
ETL vs ELT ETL vs ELT
ETL vs ELT
KavitaDubey18
 
What is ETL and Zero ETL | Extract, Transform, Load
What is ETL and Zero ETL | Extract, Transform, LoadWhat is ETL and Zero ETL | Extract, Transform, Load
What is ETL and Zero ETL | Extract, Transform, Load
MounikaPolabathina
 
Should ETL Become Obsolete
Should ETL Become ObsoleteShould ETL Become Obsolete
Should ETL Become Obsolete
Jerald Burget
 
A Comparitive Study Of ETL Tools
A Comparitive Study Of ETL ToolsA Comparitive Study Of ETL Tools
A Comparitive Study Of ETL Tools
Rhonda Cetnar
 
ETL Tools Ankita Dubey
ETL Tools Ankita DubeyETL Tools Ankita Dubey
ETL Tools Ankita Dubey
Ankita Dubey
 
Big data analytics beyond beer and diapers
Big data analytics   beyond beer and diapersBig data analytics   beyond beer and diapers
Big data analytics beyond beer and diapers
Kai Zhao
 
Data junction tool
Data junction toolData junction tool
Data junction tool
Sara shall
 
What Is ETL | Process of ETL 2023 | GrapesTech Solutions
What Is ETL | Process of ETL 2023 | GrapesTech SolutionsWhat Is ETL | Process of ETL 2023 | GrapesTech Solutions
What Is ETL | Process of ETL 2023 | GrapesTech Solutions
GrapesTech Solutions
 
What are the benefits of learning ETL Development and where to start learning...
What are the benefits of learning ETL Development and where to start learning...What are the benefits of learning ETL Development and where to start learning...
What are the benefits of learning ETL Development and where to start learning...
kzayra69
 
GROPSIKS.pptx
GROPSIKS.pptxGROPSIKS.pptx
GROPSIKS.pptx
avanceregine312
 
Data warehouse presentation
Data warehouse presentationData warehouse presentation
Data warehouse presentation
Gopalakrishnan Kulasekaran
 
Building the DW - ETL
Building the DW - ETLBuilding the DW - ETL
Building the DW - ETLganblues
 
Datawarehousing & DSS
Datawarehousing & DSSDatawarehousing & DSS
Datawarehousing & DSS
Deepali Raut
 
NEAR-REAL-TIME PARALLEL ETL+Q FOR AUTOMATIC SCALABILITY IN BIGDATA
NEAR-REAL-TIME PARALLEL ETL+Q FOR AUTOMATIC SCALABILITY IN BIGDATA NEAR-REAL-TIME PARALLEL ETL+Q FOR AUTOMATIC SCALABILITY IN BIGDATA
NEAR-REAL-TIME PARALLEL ETL+Q FOR AUTOMATIC SCALABILITY IN BIGDATA
cscpconf
 
NEAR-REAL-TIME PARALLEL ETL+Q FOR AUTOMATIC SCALABILITY IN BIGDATA
NEAR-REAL-TIME PARALLEL ETL+Q FOR AUTOMATIC SCALABILITY IN BIGDATANEAR-REAL-TIME PARALLEL ETL+Q FOR AUTOMATIC SCALABILITY IN BIGDATA
NEAR-REAL-TIME PARALLEL ETL+Q FOR AUTOMATIC SCALABILITY IN BIGDATA
csandit
 
Managing Data Integration Initiatives
Managing Data Integration InitiativesManaging Data Integration Initiatives
Managing Data Integration Initiatives
AllinConsulting
 
Top 60+ Data Warehouse Interview Questions and Answers.pdf
Top 60+ Data Warehouse Interview Questions and Answers.pdfTop 60+ Data Warehouse Interview Questions and Answers.pdf
Top 60+ Data Warehouse Interview Questions and Answers.pdf
Datacademy.ai
 
An Integrated ERP With Web Portal
An Integrated ERP With Web PortalAn Integrated ERP With Web Portal
An Integrated ERP With Web Portal
Tracy Morgan
 
An Integrated ERP with Web Portal
An Integrated ERP with Web Portal An Integrated ERP with Web Portal
An Integrated ERP with Web Portal
acijjournal
 

Similar to Why shift from ETL to ELT? (20)

ETL vs ELT
ETL vs ELT ETL vs ELT
ETL vs ELT
 
What is ETL and Zero ETL | Extract, Transform, Load
What is ETL and Zero ETL | Extract, Transform, LoadWhat is ETL and Zero ETL | Extract, Transform, Load
What is ETL and Zero ETL | Extract, Transform, Load
 
Should ETL Become Obsolete
Should ETL Become ObsoleteShould ETL Become Obsolete
Should ETL Become Obsolete
 
A Comparitive Study Of ETL Tools
A Comparitive Study Of ETL ToolsA Comparitive Study Of ETL Tools
A Comparitive Study Of ETL Tools
 
ETL Tools Ankita Dubey
ETL Tools Ankita DubeyETL Tools Ankita Dubey
ETL Tools Ankita Dubey
 
Big data analytics beyond beer and diapers
Big data analytics   beyond beer and diapersBig data analytics   beyond beer and diapers
Big data analytics beyond beer and diapers
 
Data junction tool
Data junction toolData junction tool
Data junction tool
 
What Is ETL | Process of ETL 2023 | GrapesTech Solutions
What Is ETL | Process of ETL 2023 | GrapesTech SolutionsWhat Is ETL | Process of ETL 2023 | GrapesTech Solutions
What Is ETL | Process of ETL 2023 | GrapesTech Solutions
 
What are the benefits of learning ETL Development and where to start learning...
What are the benefits of learning ETL Development and where to start learning...What are the benefits of learning ETL Development and where to start learning...
What are the benefits of learning ETL Development and where to start learning...
 
GROPSIKS.pptx
GROPSIKS.pptxGROPSIKS.pptx
GROPSIKS.pptx
 
Data warehouse presentation
Data warehouse presentationData warehouse presentation
Data warehouse presentation
 
Building the DW - ETL
Building the DW - ETLBuilding the DW - ETL
Building the DW - ETL
 
Datawarehousing & DSS
Datawarehousing & DSSDatawarehousing & DSS
Datawarehousing & DSS
 
NEAR-REAL-TIME PARALLEL ETL+Q FOR AUTOMATIC SCALABILITY IN BIGDATA
NEAR-REAL-TIME PARALLEL ETL+Q FOR AUTOMATIC SCALABILITY IN BIGDATA NEAR-REAL-TIME PARALLEL ETL+Q FOR AUTOMATIC SCALABILITY IN BIGDATA
NEAR-REAL-TIME PARALLEL ETL+Q FOR AUTOMATIC SCALABILITY IN BIGDATA
 
NEAR-REAL-TIME PARALLEL ETL+Q FOR AUTOMATIC SCALABILITY IN BIGDATA
NEAR-REAL-TIME PARALLEL ETL+Q FOR AUTOMATIC SCALABILITY IN BIGDATANEAR-REAL-TIME PARALLEL ETL+Q FOR AUTOMATIC SCALABILITY IN BIGDATA
NEAR-REAL-TIME PARALLEL ETL+Q FOR AUTOMATIC SCALABILITY IN BIGDATA
 
Managing Data Integration Initiatives
Managing Data Integration InitiativesManaging Data Integration Initiatives
Managing Data Integration Initiatives
 
Top 60+ Data Warehouse Interview Questions and Answers.pdf
Top 60+ Data Warehouse Interview Questions and Answers.pdfTop 60+ Data Warehouse Interview Questions and Answers.pdf
Top 60+ Data Warehouse Interview Questions and Answers.pdf
 
An Integrated ERP With Web Portal
An Integrated ERP With Web PortalAn Integrated ERP With Web Portal
An Integrated ERP With Web Portal
 
An Integrated ERP with Web Portal
An Integrated ERP with Web Portal An Integrated ERP with Web Portal
An Integrated ERP with Web Portal
 
Etl techniques
Etl techniquesEtl techniques
Etl techniques
 

More from HEXANIKA

Why is Regulatory Reporting tough?
Why is Regulatory Reporting tough?Why is Regulatory Reporting tough?
Why is Regulatory Reporting tough?
HEXANIKA
 
Scope of Data Integration
Scope of Data IntegrationScope of Data Integration
Scope of Data Integration
HEXANIKA
 
How Big Data helps banks know their customers better
How Big Data helps banks know their customers betterHow Big Data helps banks know their customers better
How Big Data helps banks know their customers better
HEXANIKA
 
Sandbox in Financial Services
Sandbox in Financial ServicesSandbox in Financial Services
Sandbox in Financial Services
HEXANIKA
 
High regulatory costs for small and mid sized banks
High regulatory costs for small and mid sized banksHigh regulatory costs for small and mid sized banks
High regulatory costs for small and mid sized banks
HEXANIKA
 
Automation in Banking
Automation in BankingAutomation in Banking
Automation in Banking
HEXANIKA
 
Regulatory Pain Points For Small And Medium Sized Banks
Regulatory Pain Points For Small And Medium Sized BanksRegulatory Pain Points For Small And Medium Sized Banks
Regulatory Pain Points For Small And Medium Sized Banks
HEXANIKA
 
Understanding SAR (Suspicious Activity Reporting)
Understanding SAR (Suspicious Activity Reporting)Understanding SAR (Suspicious Activity Reporting)
Understanding SAR (Suspicious Activity Reporting)
HEXANIKA
 
History of Big Data
History of Big DataHistory of Big Data
History of Big Data
HEXANIKA
 
FATCA: why is it so difficult even after so many years?
FATCA: why is it so difficult even after so many years?FATCA: why is it so difficult even after so many years?
FATCA: why is it so difficult even after so many years?
HEXANIKA
 
The Volcker Rule: Its Implications and Aftereffects
The Volcker Rule: Its Implications and AftereffectsThe Volcker Rule: Its Implications and Aftereffects
The Volcker Rule: Its Implications and Aftereffects
HEXANIKA
 
A summary of Solvency II Directives
A summary of Solvency II DirectivesA summary of Solvency II Directives
A summary of Solvency II Directives
HEXANIKA
 
A Review of BCBS 239: Helping banks stay compliant
A Review of BCBS 239: Helping banks stay compliantA Review of BCBS 239: Helping banks stay compliant
A Review of BCBS 239: Helping banks stay compliant
HEXANIKA
 
Dodd-Frank's Impact on Regulatory Reporting
Dodd-Frank's Impact on Regulatory ReportingDodd-Frank's Impact on Regulatory Reporting
Dodd-Frank's Impact on Regulatory Reporting
HEXANIKA
 
Regulatory impact on small and midsize banks
Regulatory impact on small and midsize banksRegulatory impact on small and midsize banks
Regulatory impact on small and midsize banks
HEXANIKA
 

More from HEXANIKA (15)

Why is Regulatory Reporting tough?
Why is Regulatory Reporting tough?Why is Regulatory Reporting tough?
Why is Regulatory Reporting tough?
 
Scope of Data Integration
Scope of Data IntegrationScope of Data Integration
Scope of Data Integration
 
How Big Data helps banks know their customers better
How Big Data helps banks know their customers betterHow Big Data helps banks know their customers better
How Big Data helps banks know their customers better
 
Sandbox in Financial Services
Sandbox in Financial ServicesSandbox in Financial Services
Sandbox in Financial Services
 
High regulatory costs for small and mid sized banks
High regulatory costs for small and mid sized banksHigh regulatory costs for small and mid sized banks
High regulatory costs for small and mid sized banks
 
Automation in Banking
Automation in BankingAutomation in Banking
Automation in Banking
 
Regulatory Pain Points For Small And Medium Sized Banks
Regulatory Pain Points For Small And Medium Sized BanksRegulatory Pain Points For Small And Medium Sized Banks
Regulatory Pain Points For Small And Medium Sized Banks
 
Understanding SAR (Suspicious Activity Reporting)
Understanding SAR (Suspicious Activity Reporting)Understanding SAR (Suspicious Activity Reporting)
Understanding SAR (Suspicious Activity Reporting)
 
History of Big Data
History of Big DataHistory of Big Data
History of Big Data
 
FATCA: why is it so difficult even after so many years?
FATCA: why is it so difficult even after so many years?FATCA: why is it so difficult even after so many years?
FATCA: why is it so difficult even after so many years?
 
The Volcker Rule: Its Implications and Aftereffects
The Volcker Rule: Its Implications and AftereffectsThe Volcker Rule: Its Implications and Aftereffects
The Volcker Rule: Its Implications and Aftereffects
 
A summary of Solvency II Directives
A summary of Solvency II DirectivesA summary of Solvency II Directives
A summary of Solvency II Directives
 
A Review of BCBS 239: Helping banks stay compliant
A Review of BCBS 239: Helping banks stay compliantA Review of BCBS 239: Helping banks stay compliant
A Review of BCBS 239: Helping banks stay compliant
 
Dodd-Frank's Impact on Regulatory Reporting
Dodd-Frank's Impact on Regulatory ReportingDodd-Frank's Impact on Regulatory Reporting
Dodd-Frank's Impact on Regulatory Reporting
 
Regulatory impact on small and midsize banks
Regulatory impact on small and midsize banksRegulatory impact on small and midsize banks
Regulatory impact on small and midsize banks
 

Recently uploaded

how to swap pi coins to foreign currency withdrawable.
how to swap pi coins to foreign currency withdrawable.how to swap pi coins to foreign currency withdrawable.
how to swap pi coins to foreign currency withdrawable.
DOT TECH
 
where can I find a legit pi merchant online
where can I find a legit pi merchant onlinewhere can I find a legit pi merchant online
where can I find a legit pi merchant online
DOT TECH
 
Which Crypto to Buy Today for Short-Term in May-June 2024.pdf
Which Crypto to Buy Today for Short-Term in May-June 2024.pdfWhich Crypto to Buy Today for Short-Term in May-June 2024.pdf
Which Crypto to Buy Today for Short-Term in May-June 2024.pdf
Kezex (KZX)
 
Webinar Exploring DORA for Fintechs - Simont Braun
Webinar Exploring DORA for Fintechs - Simont BraunWebinar Exploring DORA for Fintechs - Simont Braun
Webinar Exploring DORA for Fintechs - Simont Braun
FinTech Belgium
 
Exploring Abhay Bhutada’s Views After Poonawalla Fincorp’s Collaboration With...
Exploring Abhay Bhutada’s Views After Poonawalla Fincorp’s Collaboration With...Exploring Abhay Bhutada’s Views After Poonawalla Fincorp’s Collaboration With...
Exploring Abhay Bhutada’s Views After Poonawalla Fincorp’s Collaboration With...
beulahfernandes8
 
Intro_Economics_ GPresentation Week 4.pptx
Intro_Economics_ GPresentation Week 4.pptxIntro_Economics_ GPresentation Week 4.pptx
Intro_Economics_ GPresentation Week 4.pptx
shetivia
 
Isios-2024-Professional-Independent-Trustee-Survey.pdf
Isios-2024-Professional-Independent-Trustee-Survey.pdfIsios-2024-Professional-Independent-Trustee-Survey.pdf
Isios-2024-Professional-Independent-Trustee-Survey.pdf
Henry Tapper
 
what is the future of Pi Network currency.
what is the future of Pi Network currency.what is the future of Pi Network currency.
what is the future of Pi Network currency.
DOT TECH
 
Turin Startup Ecosystem 2024 - Ricerca sulle Startup e il Sistema dell'Innov...
Turin Startup Ecosystem 2024  - Ricerca sulle Startup e il Sistema dell'Innov...Turin Startup Ecosystem 2024  - Ricerca sulle Startup e il Sistema dell'Innov...
Turin Startup Ecosystem 2024 - Ricerca sulle Startup e il Sistema dell'Innov...
Quotidiano Piemontese
 
managementaccountingunitiv-230422140105-dd17d80b.ppt
managementaccountingunitiv-230422140105-dd17d80b.pptmanagementaccountingunitiv-230422140105-dd17d80b.ppt
managementaccountingunitiv-230422140105-dd17d80b.ppt
SuseelaPalanimuthu
 
234Presentation on Indian Debt Market.ppt
234Presentation on Indian Debt Market.ppt234Presentation on Indian Debt Market.ppt
234Presentation on Indian Debt Market.ppt
PravinPatil144525
 
when will pi network coin be available on crypto exchange.
when will pi network coin be available on crypto exchange.when will pi network coin be available on crypto exchange.
when will pi network coin be available on crypto exchange.
DOT TECH
 
Scope Of Macroeconomics introduction and basic theories
Scope Of Macroeconomics introduction and basic theoriesScope Of Macroeconomics introduction and basic theories
Scope Of Macroeconomics introduction and basic theories
nomankalyar153
 
US Economic Outlook - Being Decided - M Capital Group August 2021.pdf
US Economic Outlook - Being Decided - M Capital Group August 2021.pdfUS Economic Outlook - Being Decided - M Capital Group August 2021.pdf
US Economic Outlook - Being Decided - M Capital Group August 2021.pdf
pchutichetpong
 
Introduction to Value Added Tax System.ppt
Introduction to Value Added Tax System.pptIntroduction to Value Added Tax System.ppt
Introduction to Value Added Tax System.ppt
VishnuVenugopal84
 
What website can I sell pi coins securely.
What website can I sell pi coins securely.What website can I sell pi coins securely.
What website can I sell pi coins securely.
DOT TECH
 
How to get verified on Coinbase Account?_.docx
How to get verified on Coinbase Account?_.docxHow to get verified on Coinbase Account?_.docx
How to get verified on Coinbase Account?_.docx
Buy bitget
 
Empowering the Unbanked: The Vital Role of NBFCs in Promoting Financial Inclu...
Empowering the Unbanked: The Vital Role of NBFCs in Promoting Financial Inclu...Empowering the Unbanked: The Vital Role of NBFCs in Promoting Financial Inclu...
Empowering the Unbanked: The Vital Role of NBFCs in Promoting Financial Inclu...
Vighnesh Shashtri
 
BYD SWOT Analysis and In-Depth Insights 2024.pptx
BYD SWOT Analysis and In-Depth Insights 2024.pptxBYD SWOT Analysis and In-Depth Insights 2024.pptx
BYD SWOT Analysis and In-Depth Insights 2024.pptx
mikemetalprod
 
how to sell pi coins effectively (from 50 - 100k pi)
how to sell pi coins effectively (from 50 - 100k  pi)how to sell pi coins effectively (from 50 - 100k  pi)
how to sell pi coins effectively (from 50 - 100k pi)
DOT TECH
 

Recently uploaded (20)

how to swap pi coins to foreign currency withdrawable.
how to swap pi coins to foreign currency withdrawable.how to swap pi coins to foreign currency withdrawable.
how to swap pi coins to foreign currency withdrawable.
 
where can I find a legit pi merchant online
where can I find a legit pi merchant onlinewhere can I find a legit pi merchant online
where can I find a legit pi merchant online
 
Which Crypto to Buy Today for Short-Term in May-June 2024.pdf
Which Crypto to Buy Today for Short-Term in May-June 2024.pdfWhich Crypto to Buy Today for Short-Term in May-June 2024.pdf
Which Crypto to Buy Today for Short-Term in May-June 2024.pdf
 
Webinar Exploring DORA for Fintechs - Simont Braun
Webinar Exploring DORA for Fintechs - Simont BraunWebinar Exploring DORA for Fintechs - Simont Braun
Webinar Exploring DORA for Fintechs - Simont Braun
 
Exploring Abhay Bhutada’s Views After Poonawalla Fincorp’s Collaboration With...
Exploring Abhay Bhutada’s Views After Poonawalla Fincorp’s Collaboration With...Exploring Abhay Bhutada’s Views After Poonawalla Fincorp’s Collaboration With...
Exploring Abhay Bhutada’s Views After Poonawalla Fincorp’s Collaboration With...
 
Intro_Economics_ GPresentation Week 4.pptx
Intro_Economics_ GPresentation Week 4.pptxIntro_Economics_ GPresentation Week 4.pptx
Intro_Economics_ GPresentation Week 4.pptx
 
Isios-2024-Professional-Independent-Trustee-Survey.pdf
Isios-2024-Professional-Independent-Trustee-Survey.pdfIsios-2024-Professional-Independent-Trustee-Survey.pdf
Isios-2024-Professional-Independent-Trustee-Survey.pdf
 
what is the future of Pi Network currency.
what is the future of Pi Network currency.what is the future of Pi Network currency.
what is the future of Pi Network currency.
 
Turin Startup Ecosystem 2024 - Ricerca sulle Startup e il Sistema dell'Innov...
Turin Startup Ecosystem 2024  - Ricerca sulle Startup e il Sistema dell'Innov...Turin Startup Ecosystem 2024  - Ricerca sulle Startup e il Sistema dell'Innov...
Turin Startup Ecosystem 2024 - Ricerca sulle Startup e il Sistema dell'Innov...
 
managementaccountingunitiv-230422140105-dd17d80b.ppt
managementaccountingunitiv-230422140105-dd17d80b.pptmanagementaccountingunitiv-230422140105-dd17d80b.ppt
managementaccountingunitiv-230422140105-dd17d80b.ppt
 
234Presentation on Indian Debt Market.ppt
234Presentation on Indian Debt Market.ppt234Presentation on Indian Debt Market.ppt
234Presentation on Indian Debt Market.ppt
 
when will pi network coin be available on crypto exchange.
when will pi network coin be available on crypto exchange.when will pi network coin be available on crypto exchange.
when will pi network coin be available on crypto exchange.
 
Scope Of Macroeconomics introduction and basic theories
Scope Of Macroeconomics introduction and basic theoriesScope Of Macroeconomics introduction and basic theories
Scope Of Macroeconomics introduction and basic theories
 
US Economic Outlook - Being Decided - M Capital Group August 2021.pdf
US Economic Outlook - Being Decided - M Capital Group August 2021.pdfUS Economic Outlook - Being Decided - M Capital Group August 2021.pdf
US Economic Outlook - Being Decided - M Capital Group August 2021.pdf
 
Introduction to Value Added Tax System.ppt
Introduction to Value Added Tax System.pptIntroduction to Value Added Tax System.ppt
Introduction to Value Added Tax System.ppt
 
What website can I sell pi coins securely.
What website can I sell pi coins securely.What website can I sell pi coins securely.
What website can I sell pi coins securely.
 
How to get verified on Coinbase Account?_.docx
How to get verified on Coinbase Account?_.docxHow to get verified on Coinbase Account?_.docx
How to get verified on Coinbase Account?_.docx
 
Empowering the Unbanked: The Vital Role of NBFCs in Promoting Financial Inclu...
Empowering the Unbanked: The Vital Role of NBFCs in Promoting Financial Inclu...Empowering the Unbanked: The Vital Role of NBFCs in Promoting Financial Inclu...
Empowering the Unbanked: The Vital Role of NBFCs in Promoting Financial Inclu...
 
BYD SWOT Analysis and In-Depth Insights 2024.pptx
BYD SWOT Analysis and In-Depth Insights 2024.pptxBYD SWOT Analysis and In-Depth Insights 2024.pptx
BYD SWOT Analysis and In-Depth Insights 2024.pptx
 
how to sell pi coins effectively (from 50 - 100k pi)
how to sell pi coins effectively (from 50 - 100k  pi)how to sell pi coins effectively (from 50 - 100k  pi)
how to sell pi coins effectively (from 50 - 100k pi)
 

Why shift from ETL to ELT?

  • 1. Why shift from ETL to ELT? Author: Prakash Jalihal Contributor: Vedvrat Shikarpur The process of data warehousing is undergoing rapid transformation, giving rise to various new terminologies, especially due to the shift from the traditional ETL to the new ELT. For someone new to the process, these additional terminologies and abbreviations might seem overwhelming, some may even ask, “Why does it matter if the L comes before the T?” The answer lies in the infrastructure and the setup. Here is what the fuss is all about, the sequencing of the words and more importantly, why you should be shifting from ETL to ELT. Understanding Data Warehousing Processes Data Warehouse or Enterprise Data Warehouse (EDW) is a system implemented for the purpose of
  • 2. reporting and data analysis. They are central repositories of integrated data from disparate sources used for generating reports. The popular definition from Bill Inmon is, “It is a subject oriented, integrated, time variant and non- volatile collection of data used for decision making process.”1  Subject oriented: A data warehouse can be used to analyze a particular subject area.  Integrated: A data warehouse integrates data from one or more disparate data sources.  Time variant: Historical data is stored in a data warehouse.  Nonvolatile: Once data is input in a data warehouse, it cannot be changed or altered. What is ETL? ETL stands for extraction, transformation and loading, and is the process of extracting data from the source system to the data warehouse. They are critical components for feeding a data warehouse, a business intelligence system or a big data platform. The ETL processes are:  Extraction: Extracts raw data into databases or storage systems  Transformation: Simplifies data to reconcile it across source systems, perform analysis and enrich with external lookup information. This stage also matches the format required by the target system.  Loading: Sourcing the resultant data into various business intelligence (BI) tools, data warehouse or EDW, etc. Advantages of ETL: 1. Single view interface to integrate heterogeneous data 2. Ability to join data both at the source and at the integration server with the addition of the option to apply any business rule from within a single interface. 3. Common data infrastructure for working on data movement and data quality. 4. Parallel Processing Engine for providing exceptional performance and scalability. 1 Beye Network: Is Inmon's Data Warehouse Definition Still Accurate?
  • 3. Shortcomings of ETL: 1. Migration from server to enterprise edition might require vast time and resources due to the innumerable architectural differences in the Server and Enterprise edition. 2. No automated error handling or recovery mechanism. 3. Expensive as a solution for small or midsized companies.2 What is ELT? Until recently, it was normal to stage data into an intermediate system before pushing it into the target system as the target was better optimized to retrieve and report (and not to perform hard crunching of numbers or data). This is why many preferred the ETL process, where the intermediate system would be optimized to perform calculations and data transformation (this is the reason we call this process transformation). This approach kept the target reporting system independent of the implementation method during transform stage, resulting in organizations implementing three separate systems to satisfy the requirements of each stage. Since hardware systems today are better equipped and capable of doing a lot more, reporting and calculations can be performed using the same system. This is where the ELT implementation comes in. ELT stands for Extract, Load, Transform. It is an alternative to ETL as it implements the data lake. In ELT models, data is processed on entry to the data lake, resulting in faster loading times. In most cases, the design of the transformational technology ties closely into the platform used for reporting, giving ETL the advantage of a better hardware and software sync up. Advantages: 1. No need for a separate transformation engine, the work is done by the target system itself. 2. Data transformation and loading happen in parallel, so less time and resources are spent (as only filtered, clean data is loaded into the target system) 2 ETL Tools: Major business and technical advantages and disadvantages of using DataStage ETL tool
  • 4. 3. ELT works with high-end data engines such as Hadoop cluster, cloud or data appliances. This gives is additional performance and security. 4. The processing capability of data warehousing infrastructure reduces time that data spends in transit and makes the system more cost effective. Disadvantages: 1. The specifics of ELT development vary on platform i.e. Hadoop clusters work by breaking a problem into smaller chunks, then distributing those chunks across a large number of machines for processing. Some problems can be easily split, others will be much harder. 2. Developers need to be aware of the nature of the system they’re using to perform transformations. While some systems can handle nearly any transformation, others do not have enough resources, requiring careful planning and design.3 Comparison: ETL vs ELT Although ETL and ELT are vastly different in terms of architecture and implementation, the main difference lies in the rethinking of approach taken to transferring data into reporting systems. ELT takes full advantage of technology and along the way enhances the reporting solution with added values like tracing of data points.4 Another main attraction of ELT is the reduction in load time and the time that data is in transit, making it not just efficient but even cost-effective. Even though ELT requires a high-end system, it drastically reduces the number of components required. 5 Thus, despite ELT implementation being more complex compared to the one way transaction-system-to- reporting ETL, ELT is now being preferred. Designing a proper ELT system might take some work, but the payoff is well worth it! In banking terms, only the data of value ends up in the Data Warehouse for ETL processes. What this mean is that you Extract the needed data into a staging area (in relational term often staging tables or the so called global temporary tables), segregate it from unwanted data, perform data manipulation (Transformation) and finally Load it into target tables in a Data Warehouse. Analysts then use appropriate BI Tools to look at macroscopic trends in the data. This makes the process of data matching, 3 Ironside: ETL vs. ELT – What’s the Big Difference? 4 Blog: Performance Architects; Difference between ETL and ELT 5 TechTarget: Extract, Load, Transform (ELT) definition
  • 5. Read more about HEXANIKA’s DRAAS solution at: http://hexanika.com/company-profile/ This is where ELT works best, for it is not just confined to data deemed to be of specific value. Hadoop (HDFS systems) can store everything from structured data (transactional databases) and unstructured data (coming for excel sheets, emails, logs, internet, and other). As raw data and transformed data are saved on the same machine, data linkage and lineage processes are a lot faster and more accurate. This also drastically reduces the Total Cost of Ownership (TCO) which is an attractive proposition for various financial institutions using Big Data Storage systems.6 In summary, ELT allows you to extract and load all data as is into HDFS, and then you can do Transformation through Schema on Read, thereby simplifying the process of Data Warehousing. Hexanika: Efficient, Simple and Smart! Hexanika is a FinTech Big Data software company, which has developed a revolutionary software platform called SmartJoinTM for financial institutions to address data sourcing and reporting challenges for regulatory compliance. SmartJoinTM improves data quality while the automated nature of 6 LinkedIn Pulse: ETL or ELT and the Use Case
  • 6. SmartRegTM keeps regulatory reporting in harmony with the dynamic regulatory requirements and keeps pace with the new developments and latest regulatory updates. Hexanika leverages the power of ELT using distributed parallel processing, Big Data/Hadoop technology with a secure data cloud (IBM Cloud). Understanding the high implementation costs of new systems and the complexities involved in redesigning existing solutions, Hexanika offers a unique build that adapts to existing architectures. This makes our solution cost-effective, efficient, simple and smart! Read more about our solution and architecture at: http://hexanika.com/big-data-solution-architecture/ CONTACT US USA 249 East 48 Street, New York, NY 10017 Tel: +1 646.733.6636 INDIA Krupa Bungalow 1187/10, Shivaji Nagar, Pune 411005 Tel: +91 985068686 Email: info@hexanika.com