What: In-person Data Engineering NL meetup
When: May 25, 17.45 CET
Where: BigData Republic Office, Coltbaan 4c in Nieuwegein
Whether you are a data scientist wondering why you have duplicates in your dataset, or you’re an engineer dealing with missing values: we have all suffered from at least a few data quality battles. In our next Meetup, Chiel Fernhout (DevOps engineer at Datafold) and Sebas Higler (Data Engineer at BigData Republic) will arm you with open-source weapons to fight these data quality monsters!
In the first talk, Chiel will introduce you to Datafold’s data-diff. Data-diff checks every change to a data pipeline and highlights how the change in source code will affect the data produced by the pipeline. Chiel will show you how the tool works and how it can be integrated with dbt to support Test-Driven Data Development. In particular, he will highlight how you can use data-diff in your CI workflow to deal with data quality issues.
After that, Sebas will discuss several ways to monitor data using dbt Core. He will cover Elementary, an open source Data Observability tool and dbt third-party package. This tool seamlessly integrates with dbt and lets you easily run and inspect a variety of data quality checks.
Like all of our sessions, we’ll have some nice food before the session and drinks afterwards!
We hope to see you there!
The Data Engineering NL Meetup team
Join the meetup
May 25, 17.45 Amsterdam time (CET)
- 17:45 Walk-in + food
- 18.30 Datafold's data-diff by Chiel Fernhout
- 19.00 Questions
- 19.10 Elementary by Sebas Higler
- 19.40 Questions + Discussion
- 19.55 Drinks & snacks
Meet the speakers
Chiel Fernhout - Chiel has seen many sides of the tech world. He started in the backend, moved to full stack Machine Learning (ML) Engineering and finally transitioned to DevOps. As a DevOps Engineer at Datafold he creates great testing automation tools for Data Engineers. In general, he tries to automate himself away ;)
Sebas Higler - Sebas has a background in both Software Engineering and Artificial Intelligence. He enjoys working with and thinking about data problems. In his current role as Data Engineer at BigData Republic he's involved in a project at IKEA using dbt to implement a new datamart.