A DBT-Based Column Lineage Tracking Approach for Regulatory and Audit Compliance

Shreekant Malviya

PDF

Published: Apr 28, 2025

Keywords:

dbt-based column lineage, Regulatory & audit compliance, PII/PCI/PHI data governance, OpenLineage / Apache Atlas integration, DSAR (Data Subject Access Requests).

Shreekant Malviya, Abhishek Jaiswal, Vivek Koli

Abstract

Regulatory programs are also becoming more and more intuitive about demonstrated column-level lineage of PII/PCI/PHI and financial qualities, which many organizations now have only table-based DAGs. The paper presents a compile-time solution based on the dbt-first proposal, currently processing compiled SQL and dbt artifacts to produce normalized column-to-column edges and storing them in an open governance store, which is mapped to control evidence. The strategy was tested with a production-similar project (≈250 models, 1,800 columns, ~9,000 edges) and Snowflake and BigQuery profiles in 1,200- 1,800-day schedules of tasks. The accuracy was measured on a stratified, dual-view ground truth (n=400): precision 0.971 (95% CI 0.952–0.987), recall 0.934 (0.905–0.960), F1 0.952. There was coverage of 96.7 overall (customer 97.8%, payments 96.4%, and health 95.1%) fields tagged with critical coverage. Operation impact was limited: compile/run deltas of between 7.6% and lineage metadata of less than 5 GB/month, and platform cost deltas of less than 250/month at scale stated. The compliance results were significantly improved: median DSAR turnaround decreased by 12.1 to 4.3 days (−64%); SOX evidence-pack assembly decreased to 18 minutes, and quarterly defect leakage remained 1.6%. Recall gains were seen in ablations with materialization of transient models (+3.2 pp), macro-depth limits and explicit select lists (+2.9 pp), and propagation of the best-effort UDF (+1.7 pp). Dynamic SQL, non-dbt pipelines are also limited; the key areas of future development are sub-minute streaming lineage, semantic equivalence +37 pp recall, and interoperability using standards.

Issue

Vol. 21 No. 01 (2025)

Section

Articles

This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License.

Announcement

Call for Papers for the New Issue
Last Date of Submission: October 31^st, 2025

Indexing

Call for Papers & Reviewers

The Journal of Electrical Systems (JES) is inviting researchers, scholars, and experts in the field of electrical systems to submit their original and unpublished research papers for consideration in our upcoming issues. We welcome high-quality contributions that address innovative ideas, advancements, and challenges in electrical systems and related areas.

Submission Deadline: March 31^st, 2025

Topics of Interest Include, but are not Limited to:

• Power Systems and Smart Grids
• Renewable Energy
• Control Systems
• Electronics and Communication
• Signal Processing
• Artificial Intelligence in Electrical Engineering
• Internet of Things (IoT) in Electrical Systems
• Electric Vehicles and Transportation
• Robotics and Automation

Authors are requested to submit their manuscripts electronically through our online submission system by the specified deadline.

Submission Guidelines:

Manuscripts should be prepared according to the JES guidelines available on our website.
All submissions will undergo a rigorous peer-review process.
Manuscripts must be original, not previously published or under consideration elsewhere.

Call for Reviewers:
JES is also seeking qualified and experienced individuals to join our esteemed panel of reviewers. If you are interested in contributing your expertise to ensure the quality of the papers published in JES, kindly submit your resume to editor@esrgroups.org. Reviewers play a crucial role in maintaining the high standards of our journal.

We look forward to receiving your valuable contributions and appreciate your interest in the Journal of Electrical Systems.

Important Links

Home

Aims and Scope

Instructions for Authors

Editorial Board

Downloads

Download Paper Template

Article Sidebar

Main Article Content

Abstract

Article Details