Autonomous data quality
investigation platform
Detect anomalies. Generate hypotheses. Test with SQL. Synthesize root causes.
All powered by LLMs working in parallel.
# Clone and setup
git clone https://github.com/bordumb/dataing.git
cd dataing
just setup
# Run demo
just demo
How it works
Dataing automates the tedious parts of data investigation
Anomaly Detection
Automatically detect data quality issues like null spikes, volume drops, schema drift, and duplicates.
Hypothesis Generation
LLMs generate multiple hypotheses about potential root causes based on context and patterns.
Parallel SQL Testing
Test hypotheses concurrently with safe, validated SQL queries against your data warehouse.
Lineage Integration
Connect to OpenLineage, dbt, Dagster, Airflow, or DataHub for full data lineage context.
Enterprise Ready
SSO/OIDC, SCIM, audit logging, and role-based access control for enterprise deployments.
Open Core
Community Edition is fully open source. Enterprise Edition adds advanced features.
Ready to automate your data investigations?
Get started in minutes with our quick setup guide.
Get Started Free