Build production-grade data pipelines
For data teams who want reliable ETL with native DuckDB and Ducklake integrations. No Airflow or Spark clusters to manage.
Build pipelines with full code flexibility and DAG visualization
Each pipeline is a flow — a directed acyclic graph where each step is a script in Python, TypeScript, SQL or any supported language.
20+ languages
Write each pipeline step in the language that fits best: Python, TypeScript, SQL, Go, Bash, Rust, PHP and more. Mix and match freely within a single flow.
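A minimal sketch of what a single Python step can look like. By convention, a Windmill script exposes a main() function whose parameters are the step's inputs and whose return value feeds downstream steps; the parameter names and body here are illustrative, not taken from a real pipeline.

```python
from datetime import date

# Hypothetical extract step: the parameters become the step's inputs in the
# flow, and the returned list is available to the next step in the DAG.
def main(source: str, since: str) -> list[dict]:
    cutoff = date.fromisoformat(since)
    # A real step would pull rows from `source` here; this one echoes inputs.
    return [{"source": source, "extracted_since": cutoff.isoformat()}]
```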

Parallel branches
Fan out extraction steps across independent sources and collect results automatically.

Restart from any step
Fix a bug and re-run from the failing step. No need to replay the entire pipeline.

Retries & error handlers
Configurable retry count with exponential backoff. Custom error handler scripts per step.
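Since an error handler is itself a script, it can run arbitrary alerting logic. Here is a hypothetical sketch; the shape of the error payload (message, step_id) is an illustrative assumption, not the exact schema.

```python
# Hypothetical per-step error handler: receives the failed step's error
# payload and forwards it to whatever alerting you use.
def main(error: dict) -> None:
    message = error.get("message", "unknown error")
    step = error.get("step_id", "unknown step")
    # Replace the print with a Slack/PagerDuty/email call in a real pipeline.
    print(f"Pipeline step {step} failed: {message}")
```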

Trigger from anywhere
Cron schedules, webhooks, Postgres CDC, Kafka topics, SQS queues — or just click "Run".

Data tables
Built-in relational storage with zero setup. Query from Python, TypeScript, SQL or DuckDB. Credentials are managed internally and never exposed.

The native DuckDB and Ducklake orchestrator
The only orchestrator with zero-config DuckDB, Ducklake and S3 support. Credentials and connections are handled automatically; just write your query.
DuckDB
Query S3 files with SQL. DuckDB scripts auto-connect to your workspace storage. No credentials to manage, no connection strings to configure.
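For illustration, here is the same kind of query written against DuckDB's httpfs extension. Bucket, file and column names are placeholders, and the commented-out credential statements are exactly what the auto-connection described above removes.

```python
import duckdb

con = duckdb.connect()
con.execute("INSTALL httpfs; LOAD httpfs;")
# Outside a Windmill workspace you would also have to configure credentials:
# con.execute("SET s3_access_key_id = '...'; SET s3_secret_access_key = '...';")

rows = con.execute("""
    SELECT category, SUM(amount) AS total
    FROM read_parquet('s3://my-bucket/sales/*.parquet')
    GROUP BY category
    ORDER BY total DESC
""").fetchall()
print(rows)
```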

Ducklake
Store massive datasets in S3 and query them with SQL. Full data lake with catalog support, versioning and ACID transactions.
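A hedged sketch of what working with a DuckLake catalog looks like from DuckDB. The metadata file, DATA_PATH bucket and table names are assumptions, and a standalone session would also need httpfs and S3 credentials configured.

```python
import duckdb

con = duckdb.connect()
con.execute("INSTALL ducklake; LOAD ducklake;")
# Attach a lake: table metadata lives in the catalog, data files live in S3.
con.execute("ATTACH 'ducklake:metadata.ducklake' AS lake (DATA_PATH 's3://my-bucket/lake/')")
con.execute("CREATE TABLE IF NOT EXISTS lake.events AS SELECT 1 AS id, 'signup' AS kind")
print(con.execute("SELECT kind, COUNT(*) FROM lake.events GROUP BY kind").fetchall())
```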

Workspace S3
Link your workspace to S3, Azure Blob, GCS, R2 or MinIO. Browse and preview Parquet, CSV and JSON directly from the UI.

Polars
Lightning-fast DataFrames in Python. Read and write Parquet directly from your workspace S3 bucket with zero config.
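A minimal sketch of that round trip. The bucket paths and storage_options values are placeholders for what a workspace-linked bucket would provide automatically.

```python
import polars as pl

# Placeholder credentials; with workspace S3 these would not appear in code.
opts = {
    "aws_access_key_id": "...",
    "aws_secret_access_key": "...",
    "aws_region": "us-east-1",
}

events = pl.read_parquet("s3://my-bucket/raw/events.parquet", storage_options=opts)
per_user = (
    events.filter(pl.col("status") == "ok")
    .group_by("user_id")
    .agg(pl.len().alias("events"))
)
per_user.write_parquet("s3://my-bucket/curated/events_by_user.parquet", storage_options=opts)
```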

Assets lineage
Pipeline steps pass datasets as lightweight JSON pointers to S3 objects. No serialization overhead, no memory limits.
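A hypothetical illustration of the pointer-passing idea: a step returns a small JSON object referencing an S3 object rather than the data itself, and the next step resolves it. The {"s3": ...} shape and the paths are assumptions, not Windmill's exact schema.

```python
# Upstream step (the main() of its own script): writes its output to S3
# (omitted here) and returns only a lightweight pointer.
def extract() -> dict:
    return {"s3": "staging/orders.parquet"}

# Downstream step (the main() of the next script): receives the pointer,
# reads the object it names, processes it, and passes a new pointer onward.
def transform(orders: dict) -> dict:
    print(f"reading s3 object {orders['s3']}")
    return {"s3": "curated/orders_by_user.parquet"}
```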

Challenging the status quo of data warehouses
Stop paying per query. Run DuckDB and Ducklake locally on your workers.
| | DuckDB and Ducklake on Windmill | Cloud data warehouse |
| --- | --- | --- |
| Compute | Local on your workers | Remote warehouse |
| Cost model | Flat, pay for infra only | Per-query pricing |
| Data storage | Your S3 bucket, open formats | Vendor-managed, proprietary |
| Vendor lock-in | No | Yes |
| Orchestration | Built-in (flows, retries, schedules) | Separate tool needed |
| Setup | Zero config, auto-connected | Credentials, drivers, networking |
| Data egress fees | No | Yes |
Windmill also orchestrates Snowflake, BigQuery and other warehouses. You can mix local DuckDB steps with remote warehouse queries in the same pipeline.
Production-grade performance that replaces Spark
Polars and DuckDB process data on a single node far faster than distributed frameworks for the vast majority of ETL workloads.
TPC-H benchmark, 8 queries on m4.xlarge (8 vCPUs, 32 GB RAM)
More you can build on Windmill
Data pipelines are just one use case. The same platform powers internal tools, AI agents, workflow automations and scheduled tasks.

Build production-grade internal tools with backend scripts, data tables and React, Vue or Svelte frontends.

Build AI agents with tool-calling, DAG orchestration, sandboxes and direct access to your scripts and resources.

Run scripts on cron schedules, webhooks or custom triggers with retries and error handlers built in.
Start building data pipelines today
Get started for free on Windmill Cloud or self-host the open-source version.
