In this talk from PyCon Israel 2019, Matti Picus covers what has changed in NumPy and what is going to change.
Data School has a great video on the pandas library. In it, you’ll use pandas to answer questions about a real-world dataset. Through each exercise, you’ll learn important data science skills as well as “best practices” for using pandas. By the end of the tutorial, you’ll be more fluent at using pandas to correctly and efficiently answer your own data science questions.
Here’s an interesting talk from PyCon Germany by Joshua Görner, a Data Scientist at BMW.
From the video description:
Interactive notebooks like Jupyter have become more and more popular in the recent past and build the core of many data scientist’s workplace. Being accessed via web browser they allow scientists to easily structure their work by combining code and documentation. Yet notebooks often lead to isolated and disposable analysis artifacts. Keeping the computation inside those notebooks does not allow for convenient concurrent model training, model exposure or scheduled model retraining. Those issues can be addressed by taking advantage of recent developments in the discipline of software engineering. Over the past years containerization became the technology of choice for crafting and deploying applications. Building a data science platform that allows for easy access (via notebooks), flexibility and reproducibility (via containerization) combines the best of both worlds and addresses Data Scientist’s hidden needs.
Spend some time in Python and you’ll likely encounter its bytecode files — those ‘.pyc’ files Python likes to leave behind after it runs.
Have you ever wondered what’s really going on in those files? Watch this video from PyCon 2018 to learn what they contain.
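If you want to poke at a ‘.pyc’ file yourself before watching, here is a minimal sketch using only the standard library. It is not taken from the talk. It assumes CPython 3.7 or newer, where the file starts with a 16-byte header (a 4-byte magic number followed by 12 bytes of flags and source metadata) and the rest is a marshalled code object. The helper name `inspect_pyc` and the file name `example.py` are made up for illustration.

```python
import dis
import importlib.util
import marshal
import py_compile
import tempfile
from pathlib import Path

# Assumption: CPython 3.7+ layout (PEP 552), i.e. a 16-byte header.
HEADER_SIZE = 16


def inspect_pyc(source_text):
    """Compile source_text to a .pyc file and return (magic, code_object)."""
    with tempfile.TemporaryDirectory() as tmp:
        src = Path(tmp) / "example.py"  # hypothetical file name
        src.write_text(source_text)
        # py_compile.compile returns the path of the bytecode file it wrote.
        pyc_path = py_compile.compile(
            str(src), cfile=str(Path(tmp) / "example.pyc"), doraise=True
        )
        data = Path(pyc_path).read_bytes()

    magic = data[:4]                          # identifies the bytecode version
    code = marshal.loads(data[HEADER_SIZE:])  # the compiled module body
    return magic, code


magic, code = inspect_pyc("answer = 6 * 7\n")

# The magic number should match the one the running interpreter expects.
print(magic == importlib.util.MAGIC_NUMBER)

# Show the bytecode instructions stored in the file.
dis.dis(code)
```

The exact instructions printed by `dis.dis` vary between Python versions, which is one reason the magic number exists: it lets the interpreter reject bytecode compiled by an incompatible version.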
The pandas library is a powerful tool for multiple phases of the data science workflow, including data cleaning, visualization, and exploratory data analysis. However, proper data science requires careful coding, and pandas will not stop you from creating misleading plots, drawing incorrect conclusions, ignoring relevant data, including misleading data, or executing incorrect calculations.
In this tutorial session from PyCon Cleveland 2018, you’ll perform a variety of data science tasks on a handful of real-world datasets using pandas.