- Blog
- 08.06.2024
- Product, Data Fundamentals
AI integrations in Matillion Data Productivity Cloud Designer

Matillion’s Data Productivity Cloud (DPC) continues to evolve, providing users with reliable tools to streamline and enhance data operations. With the latest updates, Designer now integrates seamlessly with various AI providers and models, enhancing its capability to facilitate advanced data processing tasks.
Designer’s AI integrations leverage large language models (LLMs) for generating responses to user prompts. DPC’s AI components provide quick and easy access to powerful tools that drive some of the most successful data teams operating today.
OpenAI and Matillion DPC
Component: OpenAI Prompt
The OpenAI Prompt component allows users to generate responses to text prompts using OpenAI's models. It is ideal for applications requiring high-quality text generation.



AI Provider: OpenAI
Summary:
- Privacy Measures: OpenAI provides secure API access with encryption and strict data usage policies.
- Security Features: Role-based access control, audit logs, and compliance with industry standards.
- Pros: Strong API security and privacy measures.
- Cons: Higher cost, reliance on external API for data security.
Models:
- GPT-4 Turbo
- Pros: High accuracy and fast processing.
- Cons: Higher cost.
- Suggested Use Cases: Complex data analysis, real-time decision-making, customer support chatbots.
- Example: Use GPT-4 Turbo to generate detailed analytical reports from raw data inputs.
- GPT-4o
- Pros: Leading accuracy, comprehensive knowledge base.
- Cons: Higher cost.
- Suggested Use Cases: Detailed report generation, advanced customer interactions, complex data processing.
- Example: Deploy GPT-4o to handle intricate customer queries with high precision.
- GPT-3.5 Turbo
- Pros: Versatile, good performance.
- Cons: Slightly less accurate than GPT-4 models.
- Suggested Use Cases: Content creation, standard customer support, basic data summaries.
- Example: Use GPT-3.5 Turbo to create blog content and summarize customer feedback.
Azure OpenAI and Matillion DPC
Component: Azure OpenAI Prompt
The Azure OpenAI Prompt integrates OpenAI models within the Azure ecosystem, providing functionality similar to that of the OpenAI Prompt but tailored to Azure users.



AI Provider: Azure OpenAI
Summary:
- Isolation and Security: Azure integrates OpenAI models with Azure’s security infrastructure, ensuring isolated execution and comprehensive security.
- Security Features: Encryption, identity and access management, compliance with various industry standards.
- Pros: Integrated with Azure security, strong compliance.
- Cons: Suitable for customers with existing Azure deployments and expertise.
Models:
- The same core models as OpenAI with minor differences in supported features (learn more: Differences between OpenAI and Azure OpenAI GPT-4 Turbo GA Models).
Amazon Bedrock and Matillion DPC
Component: Amazon Bedrock Prompt
The Amazon Bedrock Prompt uses Amazon's Bedrock models for text generation and embeddings, which are suitable for AWS-based applications.



AI Provider: Amazon Bedrock
Summary:
- Isolation Benefits: Amazon Bedrock offers isolated execution environments, ensuring your workloads are securely processed without interference from other users.
- Security Features: Comprehensive security measures, including encryption at rest and in transit, role-based access controls, and compliance with various regulatory standards.
- Pros: Strong isolation and security, compliance with major standards.
- Cons: Requires understanding and configuring AWS security settings.
Models:
- Claude v2 (Anthropic)
- Pros: Ethical considerations, quality performance.
- Cons: Cost varies by provider.
- Suggested Use Cases: Sensitive content moderation, ethically-focused applications, educational content creation.
- Example: Use Claude v2 to create educational content that aligns with ethical guidelines.
- Claude v3 Sonnet (Anthropic)
- Pros: Enhanced quality, ethical focus.
- Cons: Higher cost.
- Suggested Use Cases: Advanced data processing, sensitive content generation, ethical AI applications.
- Example: Use Claude v3 Sonnet to generate high-quality content that requires ethical oversight.
- Claude v3 Haiku (Anthropic)
- Pros: Fast response, ethical considerations.
- Cons: Limited to specific use cases.
- Suggested Use Cases: Real-time data processing, chatbots, ethical content generation.
- Example: Use Claude v3 Haiku for real-time chat applications with ethical content considerations.
- Mistral 7B (Mistral AI)
- Pros: Fast response, efficient.
- Cons: Limited knowledge base.
- Suggested Use Cases: Real-time analytics, dynamic content generation, interactive user interfaces.
- Example: Use Mistral 7B to develop real-time analytics dashboards.
- Mixtral 8x7B (Mistral AI)
- Pros: Fast response, efficient.
- Cons: Limited knowledge base.
- Suggested Use Cases: Real-time analytics, dynamic content generation, interactive user interfaces.
- Example: Use Mixtral 8x7B for interactive content generation tools.
- Titan Text Express (Amazon)
- Pros: High-quality text generation, flexible usage.
- Cons: Complex setup.
- Suggested Use Cases: Advanced search functions, recommendation systems, personalized content delivery.
- Example: Use Titan Text Express to build sophisticated search and recommendation systems.
- Titan Text Lite (Amazon)
- Pros: Cost-effective, easy to implement.
- Cons: Less comprehensive than other models.
- Suggested Use Cases: Basic text generation, cost-sensitive applications, lightweight content processing.
- Example: Use Titan Text Lite for lightweight text processing tasks.
Major AI models and their components in Matillion DPC

Snowpark Container Services and Matillion DPC integration
Matillion DPC now also includes integrations for Snowflake’s Snowpark Container Services. The service allows users to host and run their large language models in a Docker container within Snowflake's infrastructure.
This service integrates with Matillion DPC via the Snowpark Container Prompt component to provide seamless data orchestration and processing capabilities. By leveraging Snowpark Container Services, Matillion DPC users can deploy custom models, perform complex data transformations, and easily manage large datasets within a secure and scalable environment near existing data.
Pros:
- Scalability: Easily scale your data processing capabilities.
- Security: Strong security features to protect your data.
- Flexibility: Supports custom models tailored to specific needs.
Cons:
- Complexity: Requires in-depth knowledge of Snowflake services.
- Cost: Requires careful planning for capacity and cost.
AI in Transformation Pipelines using Snowflake Cortex
Transformation Pipelines vs. Orchestration Pipelines in Matillion DPC
So far, we've been discussing orchestration pipelines, which manage the control flow of tasks. Now, let's introduce transformation pipelines.
Transformation pipelines in Matillion DPC are designed for data transformation tasks using a low-code approach that issues one or more SQL DML queries. These pipelines focus on SQL-based operations to transform, clean, and prepare data efficiently within the database.
On the other hand, orchestration pipelines manage the control flow, dependencies, and execution order of tasks, including the invoking of transformation pipelines. While orchestration pipelines provide structure and control, transformation pipelines handle the actual data manipulation.

Snowflake Cortex Components in Transformation Pipelines
- Cortex Summarize: Uses Snowflake Cortex to summarize the input text of one or more columns, writing the result to new output columns. Automates text summarization and maintains original text if summarization isn't feasible.
- Cortex Extract Answer: Extracts an answer to a given question from text input using Snowflake Cortex. Handles both plain text and JSON objects, providing a confidence score for each answer.
- Cortex Translate: Translates the input text of one or more columns from one supported language to another. Supports multiple languages and can handle multiple columns simultaneously.
- Cortex Sentiment Analyzes English-language input text and returns a sentiment score indicating the level of negative or positive sentiment. It’s useful for analyzing customer feedback or reviews.
- Cortex Completions: Generates a response (completion) to a given prompt using Snowflake Cortex. Allows customization through system and user prompts to generate contextually relevant responses.
Pros and cons of using Snowflake Cortex
Pros:
- Integration with Snowflake: Seamless integration with Snowflake's data cloud, enabling powerful data processing capabilities.
- Scalability: Leverages Snowflake’s infrastructure, providing scalable solutions for handling large datasets.
- Security: Adheres to Snowflake’s strong security measures, ensuring data privacy and compliance.
- Advanced AI Capabilities: Access to advanced language models and AI functionalities directly within the data platform.
Cons:
- Cost: Depending on the models and the amount of data processed, using Snowflake Cortex components can incur additional costs.
- Complexity: Understanding both Matillion DPC and Snowflake Cortex functionalities requires understanding, which might necessitate additional training or expertise.
- Language Limitations: Some components are limited to English-language text processing, which might not be suitable for all use cases.
Conclusion
The integration of AI models into Matillion’s Data Productivity Cloud opens up new possibilities for data processing and analysis. By leveraging the capabilities of LLMs and embedding models, users can enhance their data workflows, making them more efficient and intelligent. Whether using Amazon Bedrock, Azure OpenAI, or hosting models within Snowflake, each integration offers unique benefits tailored to different needs and environments.
To leverage these AI integrations in your workflows, consider the specific needs of your data operations and choose models and components that best meet your requirements. Experiment with different models to see which provides the best performance for your use case, and take advantage of the flexibility and power of Matillion DPC to enhance your data processing capabilities.
For more detailed information on model performance, refer to the model performance data on artificialanalysis.ai. Please note that these values are subject to change as new models and updates are continuously developed.
Learn more with Matillion's free library of AI Videos, Demos and tutorials.
James Kosterman
DevOps Sales Solutions Consultant
Featured Resources
What Is Massively Parallel Processing (MPP)? How It Powers Modern Cloud Data Platforms
Massively Parallel Processing (often referred to as simply MPP) is the architectural backbone that powers modern cloud data ...
BlogETL and SQL: How They Work Together in Modern Data Integration
Explore how SQL and ETL power modern data workflows, when to use SQL scripts vs ETL tools, and how Matillion blends automation ...
WhitepapersUnlocking Data Productivity: A DataOps Guide for High-performance Data Teams
Download the DataOps White Paper today and start building data pipelines that are scalable, reliable, and built for success.
Share: