Load data using Petastorm

How to Load Data Using Petastorm

Petastorm is an open source data access library. This library enables single-node or distributed training and evaluation of deep learning models directly from datasets in... Details
The Criticality of Azure SQL Capacity Planning

The Criticality of Azure SQL Capacity Planning

Whether migrating an existing application or designing a brand new one, capacity planning process plays a critical role. Learn how to navigate across Azure SQL... Details
The Apache Spark File Format Ecosystem

The Apache Spark File Format Ecosystem

It’s all too easy to overlook the importance of storage and IO in the performance and optimization of Spark jobs. However, the choice of file... Details
Learning How to Transition Your SQL Server Skills to Azure SQL

Learning How to Transition Your SQL Server Skills to Azure SQL

Are you interested in learning how to translate your existing SQL Server expertise to Azure SQL? In this episode, Bob Ward, Anna Hoffman, and Marisa... Details
Role of Data analytics in Company’s culture

The Role of Data Analytics in a Company’s Culture

We can all agree that data analytics is critical to any company’s success, but what does it mean to have a data driven culture? Here’s... Details
SQL Server 2019 Improves Scalar UDF Performance

SQL Server 2019 Improves Scalar UDF Performance

Microsoft added several features to SQL Server 2019 that can improve performance without changes to code. One of these is T-SQL Scalar UDF Inlining. The... Details
Scalable Acceleration of XGBoost Training on Apache Spark GPU Clusters

Scalable Acceleration of XGBoost Training on Apache Spark GPU Clusters

XGBoost is one of the most popular machine learning library, and its Spark integration enables distributed training on a cluster of servers. This talk will... Details
Deep Dive into GPU Support in Apache Spark 3.x

Deep Dive into GPU Support in Apache Spark 3.x

GPU support in Apache Spark presents massive opportunities for significant speedup of ETL, ML and DL applications. Here’s a great video by Databricks on the... Details
How Resumable Indexes in SQL Server 2019 Makes Your Job Easier

How Resumable Indexes in SQL Server 2019 Makes Your Job Easier

Microsoft continues to push the envelope on feature capabilities with every release of SQL Server.  One of the more prominent features that was released with... Details
Big Data Engineer Roles & Responsibilities

Big Data Engineer Roles & Responsibilities

edureka! explains what a Big Data Engineer does. Details