Ayman El-Ghazali recently presenting this Introduction to Databricks from the perspective of a SQL DBA at the NoVA SQL Users Group.

Code available at:https://github.com/thesqlpro/blogThis is an introduction to Databricks from the perspective of a SQL DBA. Come learn about the following topics:

  • Basics of how Spark works
  • Basics of how Databricks works (cluster setup, basic admin)
  • How to design and code an ETL Pipeline using Databricks
  • How to read/write from Azure Datalake and Database
  • Integration of Databricks into Azure Data Factory pipeline

Code available at:  https://github.com/thesqlpro/blog

Azure Databricks is fast, easy to use and scalable big data collaboration platform. Based on Apache Spark brings high performance and benefits of spark without need of having high technical knowledge.

You simply write Python/Scala scripts.

Learn the basics of Databricks and show common Blob Storage JSON to Blob Storage CSV transformation scenario in this video.

Samples from video: https://github.com/MarczakIO/Azure4Everyone-Databricks-Intro

Python and Scala are two of the most popular languages used in data science and analytics.

Not too long ago, the data science language debate was centered around R vs. Python. Now the chatter has shifted towards Python vs. Scala.

Both languages provide great support in order to create cutting edge data analytics projects efficiently.

This article from Analytics India Magazine lists the differences between these two popular languages.

In this video, learn how to stop treating Scala as a better Java and start exploring the world of Functional Programming.

There are also code examples to demonstrate a four step path that’ll let you ease yourself into the world of Functional Programming while continuing to deliver production quality code.