Data Lineage Tracking And Visualization Solution

Slides

Data lineage tracking is a significant problem for financial institutions that use modern big data tools. This presentation describes Spline, a data lineage tracking and visualization tool for Apache Spark. Spline captures lineage information from Spark's internal execution plans, stores it, and visualizes it in a user-friendly manner.

Presented at Spark Summit Europe 2017