The Data + AI Summit 2021 Call for Presentations is closing soon.

Submit your full-length session ideas, lightning talk ideas, and more for the world’s largest gathering of Data + AI practitioners.

The conference is at the end of May, but the CFP is due on Sunday, February 28.

Data engineering, data analytics, AI, data science, machine learning, and more. 

Coming from a data warehousing and BI background, Franco Patano wanted to have a catalogue of the Lakehouse, including schema and profiling statistics.

He created the Lakehouse Data Profiler notebook using Python and SQL to analyze the data and generate schema and statistics tables. He then uses the new SQL Analytics product from Databricks to dashboard and visualize the data profiling statistics. He discusses how to use these dashboards to optimize JOINs and other operations.

[ Lightning talk from Data + AI Summit 2020]