SQL Server 2019 Feature Notebooks

SQL Server 2019 Feature Notebooks

Here's a great collection of Jupyter notebooks that explore all the new features of SQL Server 2019. Here are some of the ones that caught... Details
Why do we need Delta Lake for Spark?

Why do we need Delta Lake for Spark?

Learning Journal has a video on why do we need something like Delta Lake. What kind of problem does it solve? Details
Real-time Analytics with Azure Cosmos DB and Apache Spark

Real-time Analytics with Azure Cosmos DB and Apache Spark

In this session from Build 2019, learn how to use the new Spark API feature integration that allows Spark to fully take advantage of Cosmos... Details
Spark vs. Tez: What's the Difference?

Spark vs. Tez: What’s the Difference?

At work recently, a question came up about whether Spark or Tez is better. Here's an interesting article with some interesting perspectives. On paper, Spark... Details
Apache Spark Turns 10: The Secret Sauce Behind One Of The World’s Most Popular Open Source Projects

Apache Spark Turns 10

As Apache Spark is 10 years old. This article in Analytics India Magazine explores what led to Spark's widespread adoption and what will keep it... Details
A Closer Look at Apache Spark

A Closer Look at Apache Spark

ComputerPhile has a great video where Rebecca Tickle explains the inner workings of Apache Spark and what makes it better than MapReduce. As an added bonus,... Details
Azure Databricks for Data Engineers and Data Developers

Azure Databricks for Data Engineers and Data Developers

Data engineering is about 70% of any data pipeline today, and without having the experience to implement a data engineering pipeline well, there is no... Details
Azure Databricks introduces R Studio Integration

Azure Databricks introduces R Studio Integration

Just when you thought Azure Databricks couldn't get any better, watch this video where Yatharth Gupta, Principal Program Manager for Azure Databricks, talks about the newly... Details
Apache Spark Tutorial: Resilient Distributed Datasets

Apache Spark Tutorial: Resilient Distributed Datasets

Here's a particularly interesting tutorial on Spark by Frank Kane, the other guy named Frank in Data Science. ;) Details