The Data + AI Summit 2021 Call for Presentations is closing soon.

Submit your full-length session ideas, lightning talk ideas, and more for the world’s largest gathering of Data + AI practitioners.

The conference is at the end of May, but the CFP is due on Sunday, February 28.

Topics include data engineering, data analytics, AI, data science, machine learning, and more.

https://databricks.com/dataaisummit/north-america-2021/call-for-presentations 

It’s easy to be distracted by the latest and greatest approaches in technology, but sometimes there’s a reason old approaches stand the test of time.

Star schemas and the Kimball approach are among those things that aren’t going anywhere. But as we move towards the “Data Lakehouse” paradigm, how appropriate is this modelling technique, and how can we harness the Delta Engine and Spark 3.0 to maximize its performance?

This session looks at the historical problems of building star schemas in a lake and steps through a series of technical examples that use features such as Delta file formats, Dynamic Partition Pruning, and Adaptive Query Execution to tackle them.

Coming from a data warehousing and BI background, Franco Patano wanted to have a catalogue of the Lakehouse, including schema and profiling statistics.

He created the Lakehouse Data Profiler notebook using Python and SQL to analyze the data and generate schema and statistics tables. He then uses the new SQL Analytics product from Databricks to dashboard and visualize the data profiling statistics. He discusses how to use these dashboards to optimize JOINs and other operations.
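To give a feel for what schema and profiling statistics can look like, here is a minimal sketch in plain Python. It is not Franco Patano's notebook, which runs on Databricks with Python and SQL. The function name `profile_rows`, the sample rows, and the chosen statistics are all assumptions made for illustration.

```python
def profile_rows(rows):
    """Build one statistics record per column from a list of dict rows.

    Hypothetical helper for illustration only; assumes scalar, hashable values.
    """
    # Collect column names in first-seen order.
    columns = []
    for row in rows:
        for name in row:
            if name not in columns:
                columns.append(name)

    profile = []
    for name in columns:
        values = [row.get(name) for row in rows]
        non_null = [v for v in values if v is not None]
        type_names = sorted({type(v).__name__ for v in non_null})
        # Only compute min/max when the column holds a single type,
        # since mixed types may not be comparable.
        comparable = bool(non_null) and len(type_names) == 1
        profile.append({
            "column": name,
            "inferred_type": "/".join(type_names) if type_names else "unknown",
            "row_count": len(values),
            "null_count": len(values) - len(non_null),
            "distinct_count": len(set(non_null)),
            "min": min(non_null) if comparable else None,
            "max": max(non_null) if comparable else None,
        })
    return profile


# Made-up sample data standing in for a table in the lake.
sample_rows = [
    {"customer_id": 1, "region": "EMEA", "amount": 120.5},
    {"customer_id": 2, "region": None, "amount": 80.0},
    {"customer_id": 2, "region": "APAC", "amount": None},
]

for stats in profile_rows(sample_rows):
    print(stats)
```

Statistics of this kind, such as distinct counts relative to row counts and null counts per column, are the sort of figures one might chart on a dashboard and consult when choosing join keys.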

[Lightning talk from Data + AI Summit 2020]

Databricks just launched a new web series, Data Brew, and this is the first episode.

For this first season, we will be focusing on lakehouses – combining the key features of data warehouses, such as ACID transactions, with the scalability of data lakes, directly against low-cost object stores.

In our inaugural episode, we’d like to welcome data warehouse luminaries Barry Devlin, Susan O’Connell, and Donald Farmer to discuss the evolution of data warehouses, data lakes, and lakehouses.

Join us for the debut of Data Brew, a new video/podcast series where we explore and debate the evolution of Data + AI. No hype, no spin, just a straight shot of strong opinions from some really smart people.