The Azure Data Lakehouse Toolkit : Building and Scaling Data Lakehouses on Azure with Delta Lake, Apache Spark, Databricks, Synapse Analytics, and Snowflake (1st)

個数:
電子版価格
¥12,043
  • 電子版あり

The Azure Data Lakehouse Toolkit : Building and Scaling Data Lakehouses on Azure with Delta Lake, Apache Spark, Databricks, Synapse Analytics, and Snowflake (1st)

  • 提携先の海外書籍取次会社に在庫がございます。通常3週間で発送いたします。
    重要ご説明事項
    1. 納期遅延や、ご入手不能となる場合が若干ございます。
    2. 複数冊ご注文の場合は、ご注文数量が揃ってからまとめて発送いたします。
    3. 美品のご指定は承りかねます。

    ●3Dセキュア導入とクレジットカードによるお支払いについて
  • 【入荷遅延について】
    世界情勢の影響により、海外からお取り寄せとなる洋書・洋古書の入荷が、表示している標準的な納期よりも遅延する場合がございます。
    おそれいりますが、あらかじめご了承くださいますようお願い申し上げます。
  • ◆画像の表紙や帯等は実物とは異なる場合があります。
  • ◆ウェブストアでの洋書販売価格は、弊社店舗等での販売価格とは異なります。
    また、洋書販売価格は、ご注文確定時点での日本円価格となります。
    ご注文確定後に、同じ洋書の販売価格が変動しても、それは反映されません。
  • 製本 Paperback:紙装版/ペーパーバック版/ページ数 465 p.
  • 言語 ENG
  • 商品コード 9781484282328
  • DDC分類 006.76

Full Description

Design and implement a modern data lakehouse on the Azure Data Platform using Delta Lake, Apache Spark, Azure Databricks, Azure Synapse Analytics, and Snowflake. This book teaches you the intricate details of the Data Lakehouse Paradigm and how to efficiently design a cloud-based data lakehouse using highly performant and cutting-edge Apache Spark capabilities using Azure Databricks, Azure Synapse Analytics, and Snowflake. You will learn to write efficient PySpark code for batch and streaming ELT jobs on Azure. And you will follow along with practical, scenario-based examples showing how to apply the capabilities of Delta Lake and Apache Spark to optimize performance, and secure, share, and manage a high volume, high velocity, and high variety of data in your lakehouse with ease.

The patterns of success that you acquire from reading this book will help you hone your skills to build high-performing and scalable ACID-compliant lakehouses using flexible and cost-efficient decoupled storage and compute capabilities. Extensive coverage of Delta Lake ensures that you are aware of and can benefit from all that this new, open source storage layer can offer. In addition to the deep examples on Databricks in the book, there is coverage of alternative platforms such as Synapse Analytics and Snowflake so that you can make the right platform choice for your needs.

After reading this book, you will be able to implement Delta Lake capabilities, including Schema Evolution, Change Feed, Live Tables, Sharing, and Clones to enable better business intelligence and advanced analytics on your data within the Azure Data Platform.

What You Will Learn

Implement the Data Lakehouse Paradigm on Microsoft's Azure cloud platform
Benefit from the new Delta Lake open-source storage layer for data lakehouses 
Take advantage of schema evolution, change feeds, live tables, and more
Writefunctional PySpark code for data lakehouse ELT jobs
Optimize Apache Spark performance through partitioning, indexing, and other tuning options
Choose between alternatives such as Databricks, Synapse Analytics, and Snowflake

Who This Book Is For
Data, analytics, and AI professionals at all levels, including data architect and data engineer practitioners. Also for data professionals seeking patterns of success by which to remain relevant as they learn to build scalable data lakehouses for their organizations and customers who are migrating into the modern Azure Data Platform. 

Contents

Part I: Getting Started.- Chapter 1: The Data Lakehouse Paradigm.- Part II: Data Platforms.- Chapter 2:  Snowflake.- Chapter 3: Databricks.- Chapter 4: Synapse Analytics.- Part III: Apache Spark ELT.- Chapter 5: Pipelines and Jobs.- Chapter 6: Notebook Code.- Part IV: Delta Lake.-Chapter 7: Schema Evolution.- Chapter 8: Change Feed.- Chapter 9: Clones.- Chapter 10: Live Tables.- Chapter 11: Sharing.- Part V: Optimizing Performance.- Chapter 12: Dynamic Partition Pruning for Querying Star Schemas.- Chapter 13: Z-Ordering & Data Skipping.- Chapter 14: Adaptive Query Execution.- Chapter 15: ​Bloom Filter Index.- Chapter 16: Hyperspace.- Part VI: Advanced Capabilities.- Chapter 17: Auto Loader.- Chapter 18: Python Wheels.- Chapter 19: Security & Controls.

最近チェックした商品