Azure Open Datasets is platform to host data from the open domain such as weather, socioeconomic statistics, machine learning samples, open images, GitHub activity data, etc. on Azure.

Learn more about why are we hosting open data on Azure, how to explore the datasets and how to use them in Azure services such as Azure Machine Learning.

Learn More:

The AI Show’s Favorite links:

Erica Joy (@EricaJoy) joins Ashley McNamara (@ashleymcnamara) to share her not-so-secret personal mission: making genealogy information open, queryable, and easily parsable. She shares a bit about why this is so critical, common challenges, and tips for re-building your own family tree – or using open data to uncover whatever the information you need for your personal mission.

Explore open source at Microsoft

Erica’s favorite open source genealogy tools and services:

Talk about big data. CERN, best known for its nuclear research and large hadron collider, has released 300TB of research data on their new data portal.


Previously, CERN had released around 27 terabytes of research information in November 2014. In that case, the data posted was collected from experiments done in 2010.

Kati Lassila-Perini, a physicist who working on the Compact Muon Solenoid detector, said that “members of the CMS Collaboration put in lots of effort and thousands of person-hours each of service work in order to operate the CMS detector and collect this research data for our analysis.”

Furthermore, she added that “once we’ve exhausted our exploration of the data, we see no reason not to make them available publicly.”

She continued, “The benefits are numerous, from inspiring high-school students to the training of the particle physicists of tomorrow.”

It’s incredible that we live in an age where everyone with an internet connection has access to the raw data that previously only a chosen few of advanced researchers had access to.

Who knows where this can lead?