Video details

GOTO 2020 • Observability for Data Pipelines: Monitoring, Alerting & Tracing Lineage • Jiaqi Liu


This presentation was recorded at GOTO Chicago 2020. #GOTOcon #GOTOchgo
Jiaqi Liu - Leader in Bridging the Gap between Data Science and Engineering
ABSTRACT Data-intensive applications, with many layers of transformations and movement from different data sources, can often be challenging to maintain even after they are initially built and validated. To truly expand and develop a code base, developers must be able to test confidently during the development process and monitor the production system. Monitoring and testing data pipelines or real-time streaming processes can be very different from monitoring web services. Jiaqi draws on her experience building and maintaining both batch and real-time stream data pipelines to discuss how to leverage monitoring tools like Prometheus and Grafana to define and visualize metrics, how and when to alert on common health indicators, and how to gain visibility in monitoring not just the system health but the health of the data. General concepts she touches on include observability of pipeline health, interpretability of data results and building features into data pipelines that makes monitoring and testing just a little [...]
TIMECODES 00:00 Intro 00:56 Data Pipelines 03:20 What could go wrong? 05:16 Data pipeline concerns 06:14 Interpretability 07:42 Observability 08:44 Pipeline features 09:17 Immutable data 10:52 Data lineage 13:08 Dry run mode 15:21 Testing, monitoring & alerting
Download slides and read the full abstract here: #Prometheus #Grafana #Data #Programming #Backend #DevOps #Streams #Frontend
Looking for a unique learning experience? Attend the next GOTO conference near you! Get your ticket at
SUBSCRIBE TO OUR CHANNEL - new videos posted almost daily.