Featured Resources
Blog
ETL and SQL: How They Work Together in Modern Data Integration
Explore how SQL and ETL power modern data workflows, when to use SQL scripts vs ETL tools, and how Matillion blends automation ...
WhitepapersUnlocking Data Productivity: A DataOps Guide for High-performance Data Teams
Download the DataOps White Paper today and start building data pipelines that are scalable, reliable, and built for success.
BlogWebhooks and Pushdown Python: Building Interactive and Efficient Data Applications
Part 5 of our blog series demonstrating the art of the possible, using Matillion products and features to build the MatiHelper ...
Matillion's "
Stage 1 Matillion connects to Twitter’s API using OAuth credentials. Before using the Twitter Query component you’ll first need to set up your
It’s always best to work down the property list from top to bottom. Some of the key setting are as follows:
The actual run-time will vary according to the complexity of the query, the amount of data returned, and whether Twitter apply rate limiting. Once it’s working, you can add this Orchestration job into your daily schedule, and the new Snowflake table will be dropped and recreated every day, containing all the newly matching data.
Data returned from an API will very likely not conform to the required schema. To cleanse the data, introduce transformations, such as fixing datatypes and removing duplicates. Matillion’s
In the above example, the newly-loaded staging data has a meaningful Twitter “ID” column containing two pieces of information: a timestamp and an ordinal sequence number starting from 1.
To extract the date part, use a SUBSTRING in the Calculator expression:
To extract the ordinal number, take advantage of Snowflake’s sophisticated regular expression handling in a second Calculator expression:
Cleansing and enhancing the data in this way makes it ready for blending with data from other sources.
Share: