Data lineage that delivers.

Highly accurate lineage and usage data across your entire data stack that 'just works', and a host of features that help your team deliver measurable business impact.

Save days of manual work per week, cut your cloud spend by 30%, and automate your data governance.

Automated Data Lineage.
For when Accuracy Matters.

Data lineage is hard. By focusing on it, we provide the accuracy and granularity required to power your operational use cases, available via our web app, CLI or API.


Integrates across your stack

DATA STORES
BigQuery
Postgres
MySQL
Hive
Snowflake
Redshift
Databricks

BI TOOLS
Looker
Mode
Tableau

DATA OPS
Airflow
dbt
HighTouch

The richest, most accurate lineage dataset available

Most companies that offer data lineage cover less than 70% of cases. Lineage has been our core focus since Alvin was born, so you get the best available.

Completely automated, plug-and-play data lineage
Cross-system: connect data stores, BI and orchestration tools
Field-level lineage
Filter lineage by date range and user
Add assets and lineage manually for unsupported sources

Prevent broken pipelines with impact analysis

Avoiding breaking changes beats observing failures. Alvin keeps your pipelines running and your data available.

Dry-run your SQL to detect breaking changes
Cross-system impact
Run impact analysis on your dbt models
Get impact reports as comments on your Git pull requests

Usage analytics for your data assets

Search and filter your assets by usage, last used, tags, data sensitivity, parent, platform, type, and pretty much anything else you can think of.

Filter by: usage, tag, data owner, last used, parent, PII + more
Save filter sets as views to quickly see what matters to you
Detect unused columns, tables, dashboards and pipelines to cut costs
See who in your team is using which assets

A modern data catalog, powered by automation

A data catalog is only as good as the data that powers it. Our data catalog is built on top of our high-quality usage and lineage data.

Discover data with usage-powered search
Use rules to automatically tag your data assets
Auto-detection of PII and sensitive data
Business glossary to define your metrics

Security first

Alvin has been built to the highest security standards. And we have the certification to prove it.

SOC 2 Type II certified
Metadata-only access roles, with no read access to your underlying data
SSO through Google, Microsoft and more

Interfaces for your entire data team

Web App

Our plug-and-play SaaS platform with use-case-driven modules.

API

Built on an easily extensible and open architecture

CLI

Set up and consume Alvin data via our CLI

Ready to see Alvin in action?
Empower your data team with best-of-breed data lineage.
Connect your data in minutes and unleash the power of data lineage.