50 ML Projects to Understand LLMs : Investigate transformer mechanisms through data analysis, visualization, and experimentation

個数:

50 ML Projects to Understand LLMs : Investigate transformer mechanisms through data analysis, visualization, and experimentation

  • オンデマンド(OD/POD)版です。キャンセルは承れません。
  • 【入荷遅延について】
    世界情勢の影響により、海外からお取り寄せとなる洋書・洋古書の入荷が、表示している標準的な納期よりも遅延する場合がございます。
    おそれいりますが、あらかじめご了承くださいますようお願い申し上げます。
  • ◆画像の表紙や帯等は実物とは異なる場合があります。
  • ◆ウェブストアでの洋書販売価格は、弊社店舗等での販売価格とは異なります。
    また、洋書販売価格は、ご注文確定時点での日本円価格となります。
    ご注文確定後に、同じ洋書の販売価格が変動しても、それは反映されません。
  • 製本 Paperback:紙装版/ペーパーバック版/ページ数 566 p.
  • 言語 ENG
  • 商品コード 9781808082559

Full Description

Most books teach you how to build LLMs from scratch or deploy them via APIs. This book uses guided machine learning projects to teach you how to understand, visualize, and investigate LLMs including GPT and BERT.

Key Features

Each project is built around three learning goals: machine learning techniques, LLM mechanisms, and Python coding with data visualization.
This is not a dense theoretical textbook; it's hands-on, practical, and project-oriented.
You will learn how to measure, visualize, and manipulate the internal components of LLMs directly.

Book DescriptionThrough 50 hands-on, guided projects solved in Python, you will investigate the internal mechanisms of large language models by treating their hidden states, attention patterns, and embeddings as data to analyze. Rather than accepting LLMs as black boxes, you will open them up, examine what's inside, and run experiments to understand why they behave the way they do. All projects are based on Python (using libraries such as NumPy, PyTorch, statsmodels, scikit-learn, Matplotlib, Pandas, and Seaborn) and come with full solutions and partial solution notebook files, so you can practice and improve your skills in data science, deep learning, data visualization, and scientific and statistical coding.What you will learn

Tokenization schemes and their statistical properties
Embedding spaces: cosine similarity, semantic axes, and analogy vectors
Output logits, softmax distributions, perplexity, and language biases
Layer-by-layer transformer dynamics and dimensionality
Attention mechanisms: QKV weights, attention scores, head ablation, and activation patching
MLP subblocks: neuron tuning, mutual information, subspace analysis, and statistics-based causal manipulations
Logit lens, indirect object identification, and causal tracing

Who this book is forThis book is for data scientists, ML engineers, and researchers who want to go beyond surface-level understanding of LLMs. Prior Python experience is required. Familiarity with machine learning or deep learning is helpful but not required — techniques are introduced as they arise throughout the projects.

Contents

Table of Contents

Introductions
Tokenization
Embeddings
Output logits
Transformer outputs
Attention
MLP