Data Versioning, Lineage, and Quality Monitoring for AI
With Janani Ravi
Duration: 1h 42m
Skill level: Intermediate
Released: 4/17/2025
Course details
Discover the importance of data versioning and how it impacts ML and AI workflows. Instructor Janani Ravi outlines key concepts such as snapshots, lineage, and branching, and explains how to manage data versions effectively. Explore how to use Data Version Control (DVC) alongside Git to initialize a repository, track files, and version data more efficiently. Get introduced to data lineage in Microsoft Fabric and uncover techniques and best practices for tracking lineage. Understand common issues with data and models, including processing, schema management, data loss, and bias, and learn how to monitor these aspects for quality. Along the way, learn how to track metrics that help ensure data and model integrity and performance. Whether you're a data scientist, an engineer, or someone who works in data management, this course equips you with the skills you need to maintain high standards of data versioning and quality monitoring in your projects.
This course was created by Loonycorn. We are pleased to host this training in our library.
Earn a sharable certificate
Share what you’ve learned and stand out as a professional in your industry with a certificate showcasing the knowledge you gained from the course.
LinkedIn Learning
Certificate of Completion
- Showcase on your LinkedIn profile under the “Licenses and Certificate” section
- Download or print out as a PDF to share with others
- Share as an image online to demonstrate your skill
What’s included
- Practice while you learn: 1 exercise file
- Learn on the go: access on tablet and phone