OSS Edition
Our open-source version is a streamlined, self-hosted solution designed for individuals and small teams that prioritize control over their infrastructure. It offers the essential tools to build, run, and manage data pipelines, ideal for developers seeking flexibility with core functionality. For access to advanced features & enhanced scalability, Mage Pro is the next step.
Developer experience
Data Engineering Code Editor
OSS
Execute Python, SQL, and R code from a block within a pipeline
OSS
Mix and match dbt models and custom Python, SQL, or R code blocks within the same pipeline
OSS
Drag-and-drop code blocks in a visual dependency tree graph to customize the order of execution and flow of data between code blocks
OSS
View code side-by-side with the block's execution output while developing and building pipelines
Configure project, pipeline, or block level settings for limiting the data volume when running code blocks in development
New code editor with enhanced file browser, code management, and multi-row and multi-column layout
Autoscaling code execution framework for running blocks during pipeline development
Install and run VS Code extensions from the new code editor
New pipeline canvas editor for building complex graphs
Retrieval Augmented Generation (RAG) pipeline builder
Al Sidekick for creating pipelines, generating code blocks, and troubleshooting execution errors
Code Blocks
Control the flow of code executions using sensor blocks that pause a branch of code from running until a condition is met
OSS
Automatically retry block runs with customizable number of retries, delay, maximum delay, and exponential backoff
OSS
Search files, using full-text and natural language, across multiple projects and add code blocks to pipelines without duplicating code
Data Integrations
Data integrations with sources and destinations from 100+ third-party services
OSS
Data connectors and integrations with data lakes using Apache Iceberg table format
Data syncs are 12-18x faster using optimized concurrent read/writes with high throughput and capacity
Run no-code data integrations alongside custom code blocks together within a single pipeline
Change data capture (CDC) with select databases
Create custom data integration sources and destinations without changing source code
Real-time Data Streaming
Build streaming pipelines that process real-time data as it arrives using no-code configurations
OSS
Execute custom Python code blocks on incoming real-time data from streaming sources
OSS
SQL Data Models
Interpolate runtime variables, environment variables, secrets, and function macros within the SQL command of a SQL code block
OSS
Special SQL block connectors for DuckDB, OracleDB, Teradata, StarRocks, Couchbase, etc.
OSS
Execute SQL commands on data output from other Python, SQL, or R blocks
Upgraded developer experience for building, managing, and monitoring thousands of SQL models
Development workflow
Code Templates
Data Validation, Quality, and Unit Testing
Production Environment
Data Orchestration
Scheduling
Alerting
Monitoring
Data Governance