Data Preprocessing with Python for Absolute Beginners
Step-by-Step Guide with Hands-on Projects and Exercises
About the Book
Book description
This book is dedicated to data preparation and explains how to perform different data preparation techniques on various datasets using different data preparation libraries written in the Python programming language.
Key Features
- A crash course in Python to fill any gaps in prerequisite knowledge and a solid foundation on which to build your new skills
- A complete data preparation pipeline for your guided practice
- Three real-world projects covering each major task to cement your learned skills in data preparation, classification, and regression
Book Description
The book follows a straightforward approach. It is divided into nine chapters. Chapter 1 introduces the basic concept of data preparation and installation steps for the software that we will need to perform data preparation in this book. Chapter 1 also contains a crash course on Python, followed by a brief overview of different data types in Chapter 2. You will then learn how to handle missing values in the data, while the categorical encoding of numeric data is explained in Chapter 4.
The second half of the course presents data discretization and describes the handling of outliers’ process. Chapter 7 demonstrates how to scale features in the dataset. Subsequent chapters teach you to handle mixed and DateTime data type, balance data, and practice resampling. A full data preparation final project is also available at the end of the book.
Different types of data preprocessing techniques have been explained theoretically, followed by practical examples in each chapter. Each chapter also contains an exercise that students can use to evaluate their understanding of the chapter’s concepts. By the end of this course, you will have built a solid working knowledge in data preparation--the first steps to any data science or machine learning career and an essential skillset for any aspiring developer.
The code bundle for this course is available at https://www.aispublishing.net/book-data-preprocessing
What you will learn
- Explore different libraries for data preparation
- Understand data types
- Handle missing data
- Encode categorical data
- Discretize data
- Learn to handle outliers
- Practice feature scaling
- Handle mixed and DateTime variables and imbalanced datasets
- Employ your new skills to complete projects in data preparation, classification, and regression
Who this book is for
In addition to beginners in data preparation with Python, this book can also be used as a reference manual by intermediate and experienced programmers. It contains data preprocessing code samples using multiple data visualization libraries.

eBook Preview
Author Details

<a href="https://market.tutorialspoint.com/author/ai_sciences_oanduuml">AI Sciences OÜ</a>
Packt are an established, trusted, and innovative global technical learning publisher, founded in Birmingham, UK with over eighteen years experience delivering rich premium content from ground-breaking authors and lecturers on a wide range of emerging and established technologies for professional development.
Packt’s purpose is to help technology professionals advance their knowledge and support the growth of new technologies by publishing vital user focused knowledge-based content faster than any other tech publisher, with a growing library of over 9,000 titles, in book, e-book, audio and video learning formats, our multimedia content is valued as a vital learning tool and offers exceptional support for the development of technology knowledge.
We publish on topics that are at the very cutting edge of technology, helping IT professionals learn about the newest tools and frameworks in a way that suits them.
Our students work
with the Best


































Related eBooks
Annual Membership
Become a valued member of Tutorials Point and enjoy unlimited access to our vast library of top-rated Video Courses
Subscribe now
Online Certifications
Master prominent technologies at full length and become a valued certified professional.
Explore Now