Automated Data Lineage.
For when Accuracy Matters.

Data lineage is hard. By focusing on it, we provide the accuracy and granularity required to power your operational use cases. Via our web app, CLI or API.

Get started
Video showing data lineage in Alvin

Integrates across your stack

DATA STORES
BQ BigQuery
PG Postgres
MS MySQL
HI Hive
SF Snowflake
RS Redshift
DB Databricks
BI TOOLS
LO Looker
MO Mode
TA Tableau
DATA OPS
AF Airflow
dbtdbt
HT HighTouch

The richest, most accurate lineage dataset available

Most companies that offer data lineage cover less than 70% of cases. It has been our core focus since Alvin was born. So you get the best available.

Sync Completely automated plug and play data lineage
SF sync_alt Lo sync_alt dbt Cross system: connect data stores, BI and orchestration tools
view_column Field level lineage
filter_list Filter lineage by date range and user
Ka S3 folder Add assets and lineage manually for unsupported sources
Video showing data lineage in Alvin.
Video showing impact analysis in Alvin.

Prevent broken pipelines with impact analysis

Avoiding breaking changes beats observing failures. Alvin keeps your pipelines running, and your data available.

sync Dry run your SQL to detect breaking changes
sync_alt Cross system impact
dbt Run impact analysis on your dbt models
git lab Get impact reports as comments on your Git pull requests

Usage analytics for your data assets

Search and filter your assets by usage, last used, tags, data sensitivity, parent, platform, type, any pretty much anything else you can think of.

filter_list Filter by: usage, tag, data owner, last used, parent, PII + more
tonality Save filter sets as views to quickly see what matters to you
attach_money Detect unused columns, table, dashboards and pipelines to cut costs
group See who in your team is using assets what assets
Video showing usage based search and filtering in Alvin.
Video showing data catalog in Alvin.

A modern data catalog, powered by automation

A data catalog is only as good as the data that powers it. Our data catalog is built on top of our high quality usage and lineage data.

search Discover data with usage powered search
sell Use rules to automatically tag your data assets
lock Auto detection of PII and sensitive data
book Business glossary to define your metrics

Interfaces for your entire data team

Web App

Our plug and play SaaS platform with use case driven modules.

API

Built on an easily extensible and open architecture

CLI

Setup and consume Alvin data via our CLI

Blog

Read more
Ready to see Alvin in action?
Empower your data team with the best tools.
Connect your data in minutes and unleash the power of data lineage.