Engineering Lakehouses with Open Table Formats : Build scalable and efficient lakehouses with Apache Iceberg, Apache Hudi, and Delta Lake

個数:
  • ポイントキャンペーン

Engineering Lakehouses with Open Table Formats : Build scalable and efficient lakehouses with Apache Iceberg, Apache Hudi, and Delta Lake

  • 提携先の海外書籍取次会社に在庫がございます。通常3週間で発送いたします。
    重要ご説明事項
    1. 納期遅延や、ご入手不能となる場合が若干ございます。
    2. 複数冊ご注文の場合は、ご注文数量が揃ってからまとめて発送いたします。
    3. 美品のご指定は承りかねます。

    ●3Dセキュア導入とクレジットカードによるお支払いについて
  • 【入荷遅延について】
    世界情勢の影響により、海外からお取り寄せとなる洋書・洋古書の入荷が、表示している標準的な納期よりも遅延する場合がございます。
    おそれいりますが、あらかじめご了承くださいますようお願い申し上げます。
  • ◆画像の表紙や帯等は実物とは異なる場合があります。
  • ◆ウェブストアでの洋書販売価格は、弊社店舗等での販売価格とは異なります。
    また、洋書販売価格は、ご注文確定時点での日本円価格となります。
    ご注文確定後に、同じ洋書の販売価格が変動しても、それは反映されません。
  • 製本 Paperback:紙装版/ペーパーバック版/ページ数 416 p.
  • 言語 ENG
  • 商品コード 9781836207238
  • DDC分類 005.74

Full Description

Jump-start your journey toward mastering open data architectural patterns by learning the fundamentals and applications of open table formats

Key Features

Build lakehouses with open table formats using compute engines such as Apache Spark, Flink, Trino, and Python
Optimize lakehouses with techniques such as pruning, partitioning, compaction, indexing, and clustering
Find out how to enable seamless integration, data management, and interoperability using Apache XTable
Purchase of the print or Kindle book includes a free PDF eBook

Book DescriptionEngineering Lakehouses with Open Table Formats provides detailed insights into lakehouse concepts, and dives deep into the practical implementation of open table formats such as Apache Iceberg, Apache Hudi, and Delta Lake.
You'll explore the internals of a table format and learn in detail about the transactional capabilities of lakehouses. You'll also get hands on with each table format with exercises using popular computing engines, such as Apache Spark, Flink, Trino, and Python-based tools. The book addresses advanced topics, including performance optimization techniques and interoperability among different formats, equipping you to build production-ready lakehouses. With step-by-step explanations, you'll get to grips with the key components of lakehouse architecture and learn how to build, maintain, and optimize them.
By the end of this book, you'll be proficient in evaluating and implementing open table formats, optimizing lakehouse performance, and applying these concepts to real-world scenarios, ensuring you make informed decisions in selecting the right architecture for your organization's data needs.What you will learn

Explore lakehouse fundamentals, such as table formats, file formats, compute engines, and catalogs
Gain a complete understanding of data lifecycle management in lakehouses
Learn how to systematically evaluate and choose the right lakehouse table format
Optimize performance with sorting, clustering, and indexing techniques
Use the open table format data with ML frameworks like TensorFlow and MLflow
Interoperate across different table formats with Apache XTable and UniForm
Secure your lakehouse with access controls and ensure regulatory compliance

Who this book is forThis book is for data engineers, software engineers, and data architects who want to deepen their understanding of open table formats, such as Apache Iceberg, Apache Hudi, and Delta Lake, and see how they are used to build lakehouses. It is also valuable for professionals working with traditional data warehouses, relational databases, and data lakes who wish to transition to an open data architectural pattern. Basic knowledge of databases, Python, Apache Spark, Java, and SQL is recommended for a smooth learning experience.

Contents

Table of Contents

Open Data Lakehouse: A New Architectural Paradigm
Transactional Capabilities of the Lakehouse
Apache Iceberg Deep Dive
Apache Hudi Deep Dive
Delta Lake Deep Dive
Catalog and Metadata Management
Interoperability in Lakehouses
Performance Optimization and Tuning in a Lakehouse
Data Governance and Security in Lakehouses
Evaluating and Selecting Open Table Formats
Real-World Applications and Learnings

最近チェックした商品