Developing Your Analytical Skills: How to Research and Present Information (Live Online)
- formation par Management Centre Europe
- En ligne
Python plays a crucial role in data engineering, data science and AI development due to its versatility, extensive libraries such as Pandas and PySpark, and its ability to handle large-scale data processing, making it an indispensable tool for extracting insights and building data pipelines. In this course, participants will gain a solid understanding of Python.
They will acquire the necessary skills and knowledge to utilize Python effectively, from basic syntax to implementing real-world solutions and visualizing the results. During the course participants will get hands-on experience with Pydantic, Pandas, Matplotlib, Seaborn, PySpark, ...
Python is a high-level, interpreted, interactive and object-oriented scripting language. This chapter introduces the history of Python and how to install Python and run your first lines of Python Code. There are quite some editors available for writing Python code but this course focusses on using Visual Studio Code as a code editor for Python. We'll also cover modern Python tooling including uv, a fast Python package installer and project manager.
We explore programming in Python by discussing some basic syntax, variables and conditional statements.
Collections allow you to store and organize data efficiently, making it easier to handle. Loops help you repeat actions on these collections.
We explore how to structure reusable code, handle unexpected situations and manage resources.
Python classes provide all the standard features of Object Oriented Programming. Classes can inherit from other base classes, have Constructors for the initialization of objects, and leverage modern Python features like dataclasses for simplified class creation with automatic method generation.
Modules in Python are reusable code libraries and Python ships with quite a large amount of build-in Modules. Learn how to create and import Modules.
You do not need to reinvent the wheel when coding in Python. Its Standard Library offers a rich collection of built-in modules that simplify common tasks, while external libraries provide specialized tools for modern development.
Pydantic is a powerful Python library that uses Python type annotations to validate data and settings management. It provides runtime type checking and automatic data conversion, making it essential for building robust data pipelines and APIs. This chapter covers how to define data models, validate complex data structures, and handle validation errors effectively.
Testing is a critical aspect of software development that ensures code reliability and maintainability. Python provides excellent testing frameworks, with pytest being the most popular choice for its simplicity and powerful features. This chapter covers writing effective unit tests, mocking dependencies, and implementing test-driven development practices for data engineering and app development.
Pandas is a Python library which makes loading and transforming data a lot easier. As long as all your data fits in memory, Pandas is your friend.
Data visualization is a critical skill for data scientists and engineers to communicate insights and identify patterns or anomalies in data. This chapter explores the Python visualization ecosystem, starting from basic plotting in Matplotlib and Pandas, moving to the sophisticated statistical aesthetics of Seaborn, and concluding with interactive, web-ready visualizations using Plotly.
Data lakes allows storing large data volumes in their original format, but Pandas doesn’t scale well. Apache Spark enables distributed processing, and PySpark brings it to Python (available in Micorsoft Fabric, Azure Synapse Analytics and Databricks).
This course combines deep technical expertise with hands-on, instructor-led training, helping you transform knowledge into practical, real-world skills that last.
This course is targeted at data engineers, data scientists and AI developers with no or little experience with Python. Familiarity with programming in general might come in handy.
Certification of Completion by U2U. KMO-Portefeuille