Open Source

Autonomous data quality
investigation platform

Detect anomalies. Generate hypotheses. Test with SQL. Synthesize root causes.
All powered by LLMs working in parallel.

Quick install
# Clone and setup
git clone https://github.com/bordumb/dataing.git
cd dataing
just setup

# Run demo
just demo
Lineage Code Anomalies Data Sources LLM Insights

How it works

Dataing automates the tedious parts of data investigation

Anomaly Detection

Automatically detect data quality issues like null spikes, volume drops, schema drift, and duplicates.

Hypothesis Generation

LLMs generate multiple hypotheses about potential root causes based on context and patterns.

Parallel SQL Testing

Test hypotheses concurrently with safe, validated SQL queries against your data warehouse.

Lineage Integration

Connect to OpenLineage, dbt, Dagster, Airflow, or DataHub for full data lineage context.

Enterprise Ready

SSO/OIDC, SCIM, audit logging, and role-based access control for enterprise deployments.

Open Core

Community Edition is fully open source. Enterprise Edition adds advanced features.

Ready to automate your data investigations?

Get started in minutes with our quick setup guide.

Get Started Free