- 11.01.2017
- Data Fundamentals, Dev Dialogues
Webinar | Extract, Transform, Load (ETL) or Extract, Load, Transform (ELT)

What is ETL?
Traditional ETL tools extract data from source systems, transform it in a separate environment, and then load it into a target database for BI/reporting.
- E – extracting data from source
- T – transforming (i.e. converting) the raw data into a format that’s suitable for reporting and analytics
- L – loading data into a target platform (e.g. a relational database)
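
The three steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `RAW_ORDERS` records, the `orders` table and the function names are hypothetical, and an in-memory SQLite database stands in for the target platform. The point it shows is that the transformation runs outside the database, so only cleaned rows ever reach the target.

```python
import sqlite3

# Hypothetical raw records standing in for a source-system extract.
RAW_ORDERS = [
    {"id": "1", "amount": " 19.99 ", "country": "gb"},
    {"id": "2", "amount": "5.00", "country": "US"},
    {"id": "3", "amount": "n/a", "country": "de"},
]


def extract():
    """E: pull raw rows from the source (here, an in-memory list)."""
    return list(RAW_ORDERS)


def transform(rows):
    """T: clean and convert rows in application code, outside the target."""
    cleaned = []
    for row in rows:
        try:
            amount = float(row["amount"].strip())
        except ValueError:
            continue  # drop rows whose amount cannot be parsed
        cleaned.append((int(row["id"]), amount, row["country"].upper()))
    return cleaned


def load(conn, rows):
    """L: write only the reporting-ready rows to the target database."""
    conn.execute(
        "CREATE TABLE orders (id INTEGER PRIMARY KEY, amount REAL, country TEXT)"
    )
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)
    conn.commit()


def run_etl():
    # SQLite is an assumption used to keep the sketch self-contained.
    conn = sqlite3.connect(":memory:")
    load(conn, transform(extract()))
    return conn
```

Note that the unparseable row is discarded before loading, so the target holds no record that it ever existed.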

What is ELT?
So what options do you have if you have a lot of data from numerous sources and you want to shape, clean, filter, join and transform that data? ELT, as opposed to ETL, extracts data from source systems, loads it in its raw form into a target platform, and then lets you transform it in-database. This keeps all of the data accessible and easy to audit. ELT uses the power of the data warehousing platform itself to perform transformations and get the data into an analytics-ready format.
Matillion pushes all transformations down to the data warehouse, which means you only need one powerful piece of infrastructure. Further advancements in relational databases, such as Amazon Redshift’s new Spectrum functionality, make transforming your data in-database easier, faster and more cost-efficient. This design results in savings on infrastructure, better-performing workloads, and shorter development cycles.
Your data is migrated quickly and is immediately available for transformation and analysis based on current business questions and needs. It also means you don’t need to know from the start how you are going to use that data: you are free to transform it at a later stage, once its use case becomes clearer. This ability is increasingly appealing given the rise of iterative Agile development methodologies. Thus ELT may be more aligned with the current technical mindset.
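
As a rough sketch of the load-then-transform pattern, the Python below loads raw rows untouched and then reshapes them with SQL inside the database. The assumptions are loud ones: an in-memory SQLite database stands in for a warehouse such as Amazon Redshift, and the `raw_orders` and `orders` tables, the sample rows and the function names are all hypothetical. It is not how Matillion itself generates its pushdown SQL.

```python
import sqlite3

# Hypothetical raw rows; every value arrives as untyped text.
RAW_ORDERS = [("1", " 19.99 ", "gb"), ("2", "5.00", "US"), ("3", "n/a", "de")]


def load_raw(conn):
    """E + L: land the data exactly as extracted, with no cleaning."""
    conn.execute("CREATE TABLE raw_orders (id TEXT, amount TEXT, country TEXT)")
    conn.executemany("INSERT INTO raw_orders VALUES (?, ?, ?)", RAW_ORDERS)
    conn.commit()


def transform_in_database(conn):
    """T: the database engine does the work; the raw table is left intact."""
    conn.execute(
        """
        CREATE TABLE orders AS
        SELECT CAST(id AS INTEGER)        AS id,
               CAST(TRIM(amount) AS REAL) AS amount,
               UPPER(country)             AS country
        FROM raw_orders
        WHERE TRIM(amount) GLOB '[0-9]*'  -- keep amounts that start with a digit
        """
    )
    conn.commit()


def run_elt():
    # SQLite is an assumption used to keep the sketch self-contained.
    conn = sqlite3.connect(":memory:")
    load_raw(conn)
    transform_in_database(conn)
    return conn
```

Because `raw_orders` is never modified, the rejected row remains available for auditing, and a different transformation can be written against the same raw data later if the business question changes.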
So ELT is the best option?
The answer to this question is not a simple ‘Yes’ or ‘No’. ELT has some obstacles that you shouldn't ignore. Since your transformations are done in-database, you need available space and compute power to perform them. Without this, performance and queries will suffer. Cloud-based platforms such as Amazon Redshift, Snowflake and Google BigQuery, however, scale in a cost-efficient manner that helps address this challenge. As previously mentioned, continuous advancements such as Spectrum for Amazon Redshift make loading your data and then transforming it even easier and faster.
Another problem we often hear about is the time-consuming and labour-intensive process of script writing. This applies not just to loading data but also to transforming it, especially when dealing with unusual or unstructured data types, or where access is not simply file-based. The more data, data sources and required outputs there are, the more complex these scripts become, leaving them open to human error. When mistakes are made, it can take days or weeks to script, run and roll back. Luckily, there are a number of tools on the market that make this process quicker and easier, such as Matillion.

Matillion ELT
Matillion for Amazon Redshift, Snowflake and Google BigQuery uses an ELT approach to mitigate the historical constraints associated with ETL. Offering an alternative to ETL, Matillion allows users to pull data from multiple sources into one location for analysis. Matillion's wide range of connectors helps with unusual and unstructured data by reducing the need to hand-code scripts. Using an ELT tool allows companies to benefit from superior scalability and cost efficiency. Matillion is fast, and its intuitive design makes it accessible to both technical and non-technical users. This puts the right data in the right hands at the right time.

Conclusion
So, Extract, Transform, Load (ETL), or Extract, Load, Transform (ELT)? ETL can still benefit clients whose data and requirements are extremely static. However, the changing data landscape is characterised by ever-changing user requirements, more data and data sources, increasingly complex queries and transformations, and increased movement to the cloud. This means these use cases are dwindling, if not almost extinct. While we wouldn’t go as far as to say that ETL is dead, its future looks bleak in light of ELT's advantages. If ETL looks to improve its biggest drawback, performance, we would expect the result to be an ELT workflow.

Webinar Video
Watch the full-length webinar: https://youtu.be/YTloTRUFkhI
Ian Funnell
Manager of Developer Relations