The Orange Book of Machine Learning : The essentials of making predictions using supervised regression and classification for tabular data

予約

The Orange Book of Machine Learning : The essentials of making predictions using supervised regression and classification for tabular data

Ellis, Carl McBride

Packt Publishing Limited（2026/07発売）

ただいまウェブストアではご注文を受け付けておりません。 ⇒古書を探す

製本 Paperback:紙装版/ペーパーバック版
言語 ENG
商品コード 9781808081316

Full Description

Learn supervised machine learning for tabular data using Python, pandas, scikit-learn, CatBoost, LightGBM, XGBoost, TabPFN, and TabICL for regression, classification, and predictive modeling

Key Features

Explore, clean, and prepare tabular datasets for machine learning workflows
Build regression and classification models using modern machine learning tools
Improve predictions with calibration, conformal intervals, and optimization techniques

Book DescriptionMaster the essential tools and techniques for supervised machine learning on tabular data with this practical guide to regression and classification. Through clear explanations, code snippets, and hands-on notebooks, you'll learn how to use Python and leading machine learning libraries, including pandas, scikit-learn, CatBoost, LightGBM, XGBoost, TabPFN, and TabICL, to build predictive models for real-world datasets.
The book covers the complete workflow, from data exploration and cleaning to model development, evaluation, and optimization. You'll learn how to perform regression analysis for accurate point predictions and estimate uncertainty using conformal prediction intervals. For classification tasks, you'll explore probabilistic predictions and calibration techniques to improve model reliability. You'll also discover practical approaches to feature engineering, feature selection, and hyperparameter optimization to enhance model performance. In addition, the book introduces tabular foundation models and in-context learning techniques, providing insight into the latest advances in machine learning for structured data.
By the end of the book, you'll have the skills and confidence to develop, evaluate, and deploy supervised machine learning models for a wide range of tabular data applications.What you will learn

Perform exploratory data analysis and data cleaning
Apply cross-validation for reliable model evaluation
Build regression models and prediction intervals
Develop calibrated probabilistic classification models
Optimize models through hyperparameter tuning
Engineer and select features for improved performance
Use ensemble learning methods effectively
Explore tabular foundation models and in-context learning

Who this book is forThis book is designed for motivated self-learners, university students studying applied machine learning, junior data scientists, and academic researchers looking to incorporate machine learning into their analytical workflows. Readers should have a basic familiarity with Python and data analysis concepts. Whether you're developing predictive models for business, research, or educational purposes, this book provides the practical guidance needed to apply modern machine learning techniques to structured and tabular datasets.

Table of Contents

Introduction
Statistics
Exploratory data analysis (EDA)
Data cleaning
Cross-validation
Interpolation and smoothing
Regression
Classification
GLM and GAM
Ensemble estimators
Hyperparameter optimization (HPO)
Feature engineering and selection
Tabular foundation models (TFM)