Improve your experience. We are very sorry but this website does not support Internet Explorer. We recommend using a different browser that is supported such as Google Chrome or Mozilla Firefox.

Hands-on Practical Python for Data Wrangling & Transformation

This introductory and beyond level course is for technical users newer to Python who want to learn advanced data handling and transformation skills, using the latest tools and techniques. The course is approximately 50% hands-on to 50% lecture ratio, combining expert lecture, real-world demonstrations and group discussions with machine-based practical labs and exercises. Student machines are required.



Python, renowned for its simplicity and robustness, has become an indispensable language in various fields, including data science, machine learning, and business analytics. Its extensive libraries for data manipulation and analysis make Python a go-to tool for individuals and organizations aiming to derive meaningful insights from data. Geared for technical users new to Python, Hands-On Practical Python for Data Wrangling & Transformation is a four-day, comprehensive hands-on course that will provide you with the hands-on practice and foundational skills needed to navigate Python programming and data wrangling effectively.

Throughout the course you will explore critical topics such as leveraging Python's built-in types, structuring and organizing code, manipulating file code, and deep-diving into data wrangling. You will also gain exposure to advanced topics, including SQL and RDBMS, and their integration with Python for efficient data handling and management. The focus remains firmly on delivering practical skills that can be directly applied in a professional setting.

The hands-on approach sets this course apart. A significant portion of the learning experience will be dedicated to practical lab exercises where you will apply Python, along with tools like NumPy, Pandas, Matplotlib, SQLite, and SQLAlchemy, to real-world data scenarios. These labs aim to simulate real job tasks, from data transformation to web scraping, preparing you to handle similar tasks in your current or future roles. The course also includes a few bonus, time-permitting chapters on applying Generative AI / AI / GPT to Python and Data Wrangling.

This course leverages an innovative Learning Experience Platform, promoting an interactive and collaborative learning environment, under the real-time live guidance of an industry expert. Upon course completion, you will have a strong foundation in Python programming and data wrangling, be capable of handling files and databases efficiently, and possess the skills to extract meaningful insights from complex datasets, directly benefiting your professional endeavors.

Course Objectives

This course is approximately 50% hands-on; combining expert lecture, real-world demonstrations and group discussions with machine-based practical labs and exercises. Our engaging instructors and mentors are highly experienced practitioners who bring years of current "on-the-job" experience into every classroom. After completing this course, students will be able to:

  • Master the essentials of Python programming: From basic syntax to complex functionalities, you'll develop the skills to create, test, and debug Python programs with ease.
  • Get comfortable with Python's built-in data types and structures: You'll understand how to effectively use lists, tuples, sets, and dictionaries in Python, providing the foundational building blocks for data manipulation and analysis.
  • Learn to structure and organize your code: We'll help you write clean, efficient, and well-organized Python code, a crucial skill for any programming role.
  • Grasp the art of data wrangling: By the end of the course, you'll be able to clean, transform, and enrich raw data to a form that's suitable for analysis – a skill in high demand in today's data-driven world.
  • Get hands-on experience with Python libraries: You'll learn to use popular Python libraries such as NumPy, Pandas, and Matplotlib, empowering you to perform complex data analysis and create stunning data visualizations.
  • Apply Python skills to real-world scenarios: Through our practical labs and capstone project, you'll get to apply your Python and data wrangling skills to real-world data scenarios. This experience will prepare you to tackle similar challenges in your professional life with confidence.

Who Should Attend

The ideal audience for this course are individuals in technical roles who have a basic understanding of data science and are looking to expand their skill set with Python programming and data wrangling. This may include data analysts, business intelligence professionals, junior data scientists, and IT professionals involved in data-focused roles. Additionally, researchers, academics, or other professionals seeking to streamline data analysis and management processes in their work might also find significant value in attending.

Course Outline

Lesson 1: Introduction to Python

  • Understand Python's significance and its application in modern enterprises
  • Python Basics and Syntax
  • Python Built-in Types
  • Variables, Lists, Dictionaries, and Tuples
  • Control Structures: If, For, While


  • Hands-on Python basics using Python, Jupyter Notebook

Lesson 2: Organizing and Structuring Code

  • Gain skills to write efficient and organized Python code
  • Writing Functions and Classes
  • Modules and Packages
  • Error Handling and Exceptions: Pythonic Coding Practices


  • Code organization and modularization

Lesson 3: Manipulating Files

  • Learn file handling in Python for reading and writing data
  • Reading and Writing Text Files
  • File Operations and Manipulation
  • Working with JSON and CSV Files
  • Directory Operations


  • File operations and data extraction

Lesson 4: Introduction to Data Wrangling with Python

  • Grasp the concept of Data Wrangling and its importance in Python
  • Introduction to Data Wrangling
  • Loading and Viewing Data
  • Data Cleaning Techniques
  • Data Transformation


  • Initial data wrangling exercises

Lesson 5: Deep Dive into NumPy, Pandas, and Matplotlib

  • Discover essential Python libraries for data analysis and visualization
  • Introduction to NumPy
  • Introduction to Pandas
  • Introduction to Matplotlib
  • Data Analysis and Visualization Using Above Libraries


  • Data manipulation and visualization tasks using Pandas, NumPy, Matplotlib

Lesson 6: Advanced Data Wrangling with Python

  • Gain advanced skills for wrangling data using Python
  • Merging and Joining DataFrames
  • Handling Missing Data
  • Date and Time Data
  • String Manipulations


  • Advanced data wrangling tasks using Python and Pandas

Lesson 7: Web Scraping and Data Gathering

  • Learn the techniques to extract data from the web
  • Introduction to Web Scraping Using BeautifulSoup
  • Regular Expressions in Python: APIs and JSON


  • Web scraping tasks

Lesson 8: Introduction to SQL and RDBMS

  • Understand SQL's role in data wrangling and Python's integration with it
  • SQL Basics
  • Python's sqlite3 module
  • SQL vs. NoSQL
  • Using SQLAlchemy with Python


  • Database interactions and data extraction tasks

Lesson 9: Real-world Data Wrangling

  • Apply learned skills to real-world data wrangling scenarios
  • Case Studies in Data Wrangling
  • Best Practices in Data Wrangling
  • Dealing with Large Datasets
  • Building a Data Wrangling Pipeline


  • Real-world data wrangling task

Lesson 10: Next Steps in Python and Data Wrangling

  • Overview of Advanced Python Topics
  • Overview of Machine Learning with Python
  • Overview of Big Data Tools (e.g., Spark)


  • Exploring Machine Learning and Big Data Tools: Use Scikitlearn to create a basic Machine
  • Learning model and then apply PySpark to handle a small simulated Big Data task

Lesson 11: Capstone Projects / Optional

Lab Projects

  • Hands-on Real-world Data Wrangling Project – Apply the skills learned throughout the course in a practical project.
    • Project 1: Building a Data Pipeline – Extract, transform, and load data from multiple sources
    • Project 2: Web Scraping and Data Analysis - Extract data from the web and perform analysis

Addendum: Post-Training Skills Development

  • Continued Learning Resources
  • Suggestions for Practical Applications of Skills Learned
  • Recommended Python and Data Science Communities and Forums
  • Additional Tools for Data Science (e.g., Scikit-Learn, TensorFlow, PyTorch, etc.)
  • Contributing to Open-Source Projects

Bonus Chapters: (Optional / Time Permitting)

Lesson 12: Generative AI for Python Programming and Data Wrangling

  • Understand the role of AI in code generation and its applications in Python and Data Wrangling
  • Introduction to Generative AI
  • Overview of GPT Technology
  • GPT Applications in Python Programming and Data Wrangling
  • Using AI for Code Completion, Error Detection, and Data Analysis


  • Exploring AI-assisted Python programming and data wrangling with GPT technology

Lesson 13: Advanced Python Skills Using AI Technologies

  • Enhance Python skills and productivity using AI-powered tools
  • Overview of AI Tools for Python
  • AI for Automated Testing and Debugging
  • Using AI for Code Optimization
  • Machine Learning-based Predictive Analytics with Python


  • Apply AI tools to improve Python programming and perform predictive analytics


In order to be successful in the course you should have:

  • Basic understanding of any programming language: Familiarity with concepts like variables, loops, and functions would be beneficial, even if not in Python.
  • Fundamental knowledge of Data Science: A general understanding of what data science is and why it's valuable would help provide context for the Python and data wrangling skills taught in this course.
  • Comfort with basic Mathematical Concepts: As Python is heavily used in data analysis, a comfort level with basic math and statistics would be beneficial, though advanced mathematical skills are not necessary.

Similar courses

Introduction to DAX for Power BI

Using Data Analysis Expressions to solve common business problems in Power BI

More Information
Microsoft Power BI: Data Analysis Practitioner

Analyze business data, visualize insights, and share those insights across the enterprise

More Information
Tableau Part 2

In this course, you will perform advanced data visualization and data blending with Tableau.

More Information
PL-400T00: Microsoft Power Platform Developer

The Microsoft Power Platform helps organizations optimize their operations by simplifying, automating and transforming business tasks and processes.

More Information
Cisco® Solutions: Implementation and Administration (CCNA 200-301)

In this course, you will implement and administer networks by using Cisco solutions.

More Information
CompTIA A+ Part 1 Certification (Exam 220-1101)

Install and configure mobile devices; Compare and contrast networking hardware; Configure internet connections and wireless networking,; Troubleshoot hardware and networks; Install motherboards, RAM, storage devices, CPUs and add-on cards; Deploy and configure connected devices; Summarize cloud-computing concepts and virtualization

More Information
Oracle PL/SQL

This course is designed to create PL/SQL blocks both anonymous and named. This course will cover PL/SQL objects and data types. It will also cover packages and how to debug and improve performance within PL/SQL. It will address deploying PL/SQL objects and using Oracle pre-define packages, procedures, and functions.

More Information
CompTIA Network+ Certification (Exam N10-008)

CompTIA Network+ validates the technical skills needed to securely establish, maintain and troubleshoot the essential networks that businesses rely on. CompTIA Network+ is the only certification that covers the specific skills that network professionals need.

More Information
Agile Fundamentals Workshop

In this course, you will understand and use Agile core terms, explain key Agile concepts and their importance in achieving agility, identify, engage, and leverage key stakeholders in an Agile environment, apply common Agile tools and techniques, embrace and advocate for an Agile mindset to benefit from an Agile approach, select the best practices for a project and apply them appropriately to benefit the project and organization.

More Information
CompTIA Cybersecurity Analyst (CySA+) Certification (Exam CS0-002 & CS0-003)

This course introduces tools and tactics to manage cybersecurity risks, identify various types of common threats, evaluate the organization's security, collect and analyze cybersecurity intelligence, and handle incidents as they occur.

More Information
Advanced Java 9

Students who attend this course will leave armed with new skills to leverage modules, scale applications into multi-core environments, and improve the performance of Java 9 applications. This course will teach students everything they need to successfully master and implement the latest features and benefits of Java 9 and become a more effective Java 9 developer.

More Information
55339: Programming in C#

In this course, students will review the basics of C# program structure, language syntax, and implementation details, and then consolidate their knowledge throughout the week as they build an application that incorporates several features of .NET. The course aims to follow the spirit of the Microsoft Official Curriculum course 20483, while bringing it completely up-to-date with the latest features of C#, .NET 6.0 and Visual Studio 2022.

More Information
55337: Introduction to Programming

In this course you will, explain core programming fundamentals such as computer storage and processing, create and use variables and constants in programs, discuss how to create and use functions in a program, use decisions structures in a computer program, create and use repetition (loops) in a computer program, explain pseudocode and its role in programming, implement object-oriented programming concepts, and identify application errors and explain how to debug an application and handle errors.

More Information
Introduction to R for Data Analysis

R is a functional programming environment for business analysts and data scientists. It's a language that many non-programmers can easily work with, naturally extending a skill set that is common to high-end Excel users. It's the perfect tool for when the analyst has a statistical, numerical, or probabilities-based problem based on real data and they've pushed Excel past its limits.

More Information
AngularJS Training: AngularJS Programming

In this course, you will create single page web applications using the MVC pattern of AngularJS, understand the programming model provided by the AngularJS framework, define Angular controllers and directives, and control Angular data bindings.

More Information
Comprehensive Angular 12 Programming

In this course, you will develop single page Angular applications using Typescript, set up a complete Angular development environment, create components, directives, services, pipes, forms and custom validators, handle advanced network data retrieval tasks using observables, consume data from REST web services using the Angular HTTP Client, handle push-data connections using the WebSockets protocol, work with Angular Pipes to format data, and use advanced Angular Component Router features.

More Information
Introduction to Programming with Python®

This course is designed for people who want to learn the Python programming language in preparation for using Python to develop software for a wide range of applications, such as data science, machine learning, artificial intelligence, and web development.

More Information
Data Wrangling with Python

This course teaches concepts by deep-dive on-hand exercises. Throughout the course, you will learn data wrangling with hands-on exercises and activities. You’ll find checklists, best practices, and critical points mentioned throughout the lessons, making things more interesting.

More Information
Developing Advanced Automation with Red Hat Ansible Automation Platform (DO374)

In this course, you will apply recommended practices for effective and efficient automation with Ansible, perform automation operations as rolling updates, use advanced features of Red Hat Ansible Automation Platform to work with data, including filters and plugins, create automation execution environments to contain and scale Red Hat Ansible Automation, and leverage capabilities of the automation content navigator to develop Ansible Playbooks.

More Information
Building Data Analytics Solutions using Amazon Redshift

In this course, you will build a data analytics solution using Amazon Redshift, a cloud data warehouse service.

More Information
Data Warehousing on AWS

In this course, you will learn new concepts, strategies, and best practices for designing a cloud-based data warehousing solution using Amazon Redshift, the petabyte-scale data warehouse in AWS.

More Information
Building Data Lakes on AWS

In this course, you will apply data lake methodologies in planning and designing a data lake, articulate the components and services required for building an AWS data lake, secure a data lake with appropriate permission, ingest, store, and transform data in a data lake and query, analyze, and visualize data within a data lake.

More Information
Building Batch Data Analytics Solutions on AWS

In this course, you will learn to build batch data analytics solutions using Amazon EMR, an enterprise-grade Apache Spark and Apache Hadoop managed service.

More Information
Developing on AWS

In this course, you will learn how to use the AWS SDK to develop secure and scalable cloud applications using multiple AWS services such as Amazon DynamoDB, Amazon Simple Storage Service, and AWS Lambda. You explore how to interact with AWS using code and learn about key concepts, best practices, and troubleshooting tips.

More Information
Cloud Operations on AWS

In this course, you will learn how to manage and operate automatable and repeatable deployments of networks and systems on AWS.

More Information
DevOps Engineering on AWS

In this course, you will learn how to use the combination of tools, practices, and cultural philosophy of DevOps to improve an organization’s ability to develop, deliver, and maintain applications and services at high velocity on AWS.

More Information
55315: Introduction to SQL Databases

In this course you will describe key database concepts in the context of SQL Server, characterize database languages used in SQL Server, describe data modeling techniques, discuss normalization and denormalization techniques, distinguish relationship types and effects in database design, describe the effects of database design on performance, and define commonly used database objects.

More Information
55366: Querying Data with Transact-SQL

In this course, you will create single table SELECT queries, create multiple table SELECT queries, insert, update, and delete data, query data using built-in functions, create queries that aggregate data, create subqueries, create queries that use table expressions, use UNION, INTERSECT, and EXCEPT on multiple sets of data, implement window functions in queries, use PIVOT and GROUPING SETS in queries, use stored procedures in queries, add error handling to queries, and use transactions in queries.

More Information
55321: SQL Server Integration Services

In this course you will, create sophisticated SSIS packages for extracting, transforming, and loading data, use containers to efficiently control repetitive tasks and transactions, configure packages to dynamically adapt to environment changes, use Data Quality Services to cleanse data, successfully troubleshoot packages, create and manage the SSIS Catalog, deploy, configure, and schedule packages, secure the SSIS Catalog.

More Information
Advanced Programming Techniques with Python (v1.11)

In this course, you will expand your Python proficiencies, select an object-oriented programming approach for Python applications, create object-oriented Python applications, create a desktop application, create data-driven applications, create and secure web service-connected applications, program Python for data science, implement unit testing and exception handling, and package an application for distribution.

More Information
R Programming for Data Science (v1.0)

This course will teach you the fundamentals of programming in R to get you started. It will also teach you how to use R to perform common data science tasks and achieve data-driven results for the business.

More Information
55316: Administering a SQL Database

In this course you will authenticate and authorize users, assign server and database roles, authorize users to access resources, use encryption and auditing features to protect data, describe recovery models and backup strategies, backup and restore SQL Server databases, automate database management, configure security for the SQL Server agent, manage alerts and notifications, managing SQL Server using PowerShell, trace access to SQL Server, monitor a SQL Server infrastructure, and import and export data.

More Information
Web Development with HTML5, CSS, and JavaScript

In this course, you will develop web content in HTML, enhance its formatting and layout using CSS, and add interactivity using JavaScript.

More Information
Building Modern Data Analytics Solutions on AWS

In this course, you will learn how to leverage AWS data Services to store, process, analyze, stream, and query data to make decisions with speed and agility at scale, how to modernize data solutions end to end, and obtain skills to put your data to work to make better, more informed decisions, respond faster to the unexpected, and uncover new opportunities.

More Information
Developing Serverless Solutions on AWS

In this course, you will practice and deploy serverless solutions on AWS.

More Information
Amazon SageMaker Studio for Data Scientists

In this course, you will learn to accelerate the process to prepare, build, train, deploy, and monitor ML solutions using Amazon SageMaker Studio.

More Information
CompTIA A+ Part 2 Certification (Exam 220-1102)

In this course, you will install, configure, optimize, troubleshoot, repair, upgrade, and perform preventive maintenance on personal computers, digital devices, and operating systems.

More Information
AZ-305T00: Designing Microsoft Azure Infrastructure Solutions

This course teaches Azure Solution Architects how to design infrastructure solutions.

More Information
CompTIA Cloud+ Certification (Exam CV0-003)

CompTIA Cloud+ shows you have the expertise needed to deploy and automate secure cloud environments and protect mission-critical applications and data.

More Information
DP-300T00: Administering Microsoft Azure SQL Solutions

This course provides students with the knowledge and skills to administer a SQL Server database infrastructure for cloud, on-premises and hybrid relational databases and who work with the Microsoft PaaS relational database offerings. Additionally, it will be of use to individuals who develop applications that deliver content from SQL-based relational databases.

More Information
55348: Administering Microsoft Endpoint Configuration Manager

In this 5-day course, you will learn day-to-day management tasks, including how to manage applications, client health, hardware and software inventory, operating system deployment, and software updates by using Configuration Manager. You also will learn how to optimize Endpoint Protection, manage compliance, and create management queries and reports. Although this course and the associated labs are written for Windows Server 2022, the skills taught will also be backwards compatible for Server 2016 and 2019.

More Information
Programming and Data Wrangling with VBA and Excel

In this course, you will develop and deploy VBA modules to solve business problems.

More Information
CompTIA Security+ Certification (Exam SY0-701)

This course maps to the CompTIA Security+ certification exam (SY0-701) and establishes the core knowledge required of any cybersecurity role, as well as providing a springboard to intermediate-level cybersecurity jobs.

More Information
AZ-040T00: Automating Administration with PowerShell

Gain fundamental knowledge and skills to use PowerShell for administering and automating administration of Windows servers.

More Information
55215: SharePoint Online Power User

Learn how to make SharePoint online relevant to your team by using a sites functionality to help you share information and collaborate with your colleagues.

More Information
PL-300T00: Microsoft Power BI Data Analyst

If you are someone with existing SQL or SQL Server knowledge (or someone highly versed in different data repositories), this is the Power BI course for you. This course covers the various methods and best practices that are in line with business and technical requirements for modeling, visualizing, and analyzing data with Power BI.

More Information
Oracle Database 19c: Administration Workshop

This course provides detailed information on the architecture of an Oracle Database instance and database, enabling you to manage your database resources effectively. You learn how to create database storage structures appropriate for the business applications supported by your database. In addition, you learn how to create users and administer database security to meet your business requirements. This course provides basic information on backup and recovery techniques.

More Information
CompTIA Data+ Certification (Exam DA0-001)

CompTIA Data+ is an early-career data analytics certification for professionals tasked with developing and promoting data-driven business decision-making that gives learners the confidence to bring data analysis to life.

More Information
SQL Querying: Fundamentals

In this course, you will compose SQL queries to retrieve desired information from a database.

More Information
SQL Querying: Advanced

In this course, you will work with advanced queries to manipulate and index tables. You will also create transactions so that you can choose to save or cancel the data entry process.

More Information
Introduction to SQL Databases 10985WV (55356)

This 2-day entry-level course examines the services and features of Microsoft SQL 2022. (This is NOT a SQL querying course, SQL Querying syntax will not be discussed). The content focuses on database tables, adding and changing data, creating and using stored procedures, entity relationships, and indexes.

More Information
Data Analysis Fundamentals

Doing data analysis work is about more than learning a software program (Excel, Power BI, Tableau, etc.) - you need to understand the concepts and theory too. This one day course gets you up to speed (and can be useful either before or after your software classes).

More Information
Using Data Science Tools in Python

In this course, you will use various Python tools to load, analyze, manipulate, and visualize business data.

More Information
ITIL 4 Foundation Certification with Exam

ITIL 4 is the next evolution of ITIL, providing a practical and flexible transition that allows organizations to adopt the new ways of working required by the modern digital world. It provides an end-to-end IT/digital operating model for the delivery and operation of tech-enabled products and services and enables IT teams to continue to play a crucial role in wider business strategy.

More Information
Crystal Reports 2020: Part 2

In this course, students will create complex reports & data sources using the tools in Crystal Reports 2020. Students will not only create more complex reports including sub-reports and cross-tabs, but will also increase their speed and efficiency.

More Information
Crystal Reports 2020: Part 1

In this course, students will create a basic report by connecting to a database and modifying the report's presentation.

More Information
55371: Windows Server Administration

This five-day instructor-led course teaches IT professionals the fundamental administration skills required to deploy and support Windows Server in most organizations. It is designed primarily for IT professionals who have some experience with Windows Server and will be responsible for managing identity, networking, storage and compute by using Windows Server, and who need to understand the scenarios, requirements, and options that are available and applicable to Windows Server.

More Information
AZ-800T00: Administering Windows Server Hybrid Core Infrastructure

This four-day course is intended for Windows Server Hybrid Administrators who have experience working with Windows Server and want to extend the capabilities of their on-premises environments by combining on-premises and hybrid technologies. Windows Server Hybrid Administrators implement and manage on-premises and hybrid solutions such as identity, management, compute, networking, and storage in a Windows Server hybrid environment.

More Information

Press enter to see more results