Matillion uses the Extract-Load-Transform (ELT) approach to delivering quick results for a wide range of data processing purposes: everything from customer behavior analytics, financial analysis, and even reducing the cost of synthesizing DNA.
The QuickBooks Query component in Matillion ETL for Amazon Redshift presents an easy-to-use graphical interface, enabling you to easily pull data from QuickBooks Online directly into Amazon Redshift. Many of our customers are using this to integrate their company’s accounting and payment data with data from other areas of their business.
The connector is completely self-contained: no additional software installation is required. It’s within the scope of an ordinary Matillion license, so there is no additional cost for using the features.
Start by creating an Orchestration job to extract the required data from QuickBooks Online. In the Components search bar type QuickBooks. Drag and drop the QuickBooks Online Query component onto the job canvas.
The QuickBooks Online Query supports OAuth authentication. The first step in configuring the Amazon Redshift component is to provide the Authentication to QuickBooks Online. The Matillion QuickBooks Online Query component requires OAuth to be set up to authenticate Matillion to connect to QuickBooks Online data. Further details of configuring QuickBooks Online OAuth is available on our support center. Clicking on the 3 dots next to the Authentication property will bring a pop-up box showing all available QuickBooks Online accounts:
Next, select the Data Source (table) you want to pull data from, displayed in the Data Source drop down.
Select the columns from the data source that you are interested in. Matillion will supply a list of the columns available to you in the Data Source previously selected. This will form the new table which you will create in Amazon Redshift with the help of Matillion.
Data Source Filter
Leaving the data source filter empty will allow the query to return all rows (based on the setting in LIMIT). You may want to pull just the new records, for example, since the last time you pulled data in from this data source. If so, populate the data source filter accordingly. Here is an example of pulling just new rows based on a variable “last_update_date”:
Several connection options are available for the QuickBooks Online Query. In particular, if you are connecting to a Sandbox of QuickBooks Online you’ll need to select ‘UseSandbox’ and set it to True since the default setting is False.
Running the QuickBooks Online Query component in Matillion ETL for Amazon Redshift
Before you can run the component, you need to specify a Target Table name. This is the name of a new table that Matillion will create (or overwrite) to write the data into Amazon Redshift. Also, an S3 Staging Area must be specified. This is the S3 bucket will temporarily store the results of the query before you load them into Amazon Redshift.
This component also has a Limit property which forces an upper limit on the number of records returned.
You can run the Orchestration job, either manually or by using the Scheduler, to query your data and bring it into Amazon Redshift.
Transforming the Data/Transformation Job
Once you have extracted your data from QuickBooks Online and loaded into an Amazon Redshift table, you can start to transform the data.
In this way, you can build out the rest of your downstream transformations and analysis, taking advantage of Amazon Redshift’s power and scalability.
The post Using the QuickBooks Query component in Matillion ETL for Amazon Redshift appeared first on Matillion.