Experts Discuss the 4 Most Important Big Data Programming Languages

Experts Discuss the 4 Most Important Big Data Programming Languages

Here's an interesting read on the 4 most important big data programming languages: Python, R, Scala, and Java. While debates over programming languages tend to quickly... Details
Using Open Data to Build Family Trees

Using Open Data to Build Family Trees

Erica Joy (@EricaJoy) joins Ashley McNamara (@ashleymcnamara) to share her not-so-secret personal mission: making genealogy information open, queryable, and easily parsable. She shares a bit... Details
Real-time Analytics with Azure Cosmos DB and Apache Spark

Real-time Analytics with Azure Cosmos DB and Apache Spark

In this session from Build 2019, learn how to use the new Spark API feature integration that allows Spark to fully take advantage of Cosmos... Details
Databricks wants one tool to rule all AI systems – coincidentally, its own MLflow tool

Databricks Wants One Tool to Rule All AI Systems: MLflow

MLflow enables data scientists to track and distribute experiments, package and share models across frameworks, and deploy them – no matter if the target environment... Details
Can You Recognize Yourself from Your Data?

Can You Recognize Yourself from Your Data?

BBC Click explores the impact of GDPR one year later and offers a brief glimpse into what our smartphones know about us. Details
Spark vs. Tez: What's the Difference?

Spark vs. Tez: What’s the Difference?

At work recently, a question came up about whether Spark or Tez is better. Here's an interesting article with some interesting perspectives. On paper, Spark... Details
What’s in Hive 3.0?

What’s in Hive 3.0?

What is new in Apache Hive 3.0? from DataWorks Summit Details
How Big is Big Data?

How Big is Big Data?

Here’s an interesting look at how big big data is from Computerphile, for those not satisfied with my “Costco Test for Big Data.” Details
Code-free modern data warehouse using Azure SQL DW and Data Factory

Code-free modern data warehouse using Azure SQL DW and Data Factory

Gaurav Malhotra joins Scott Hanselman to show how to build a modern data warehouse solution from ingress of structured, unstructured, semi-structured data to code-free data... Details
Databricks open-sources Delta Lake to make data lakes more reliable

DataBricks Open-Sources Delta Lake to Make Data Lakes More Reliable

Databricks, announced that it has open-sourced Delta Lake, a storage layer that makes it easier to ensure data integrity as new data flows into an enterprise’s... Details