Bookshock Ask Tez ✨
Spark in Action, Second Edition Covers Apache Spark 3 with Examples in Java, Python, and Scala cover

Spark in Action, Second Edition Covers Apache Spark 3 with Examples in Java, Python, and Scala

by Jean-Georges Perrin

Lowest price on Bookshock
$74.25
1 offer
In stock

Ask Tez about this book →

This title is temporarily out of stock. Email support@bookshock.ai or call (972) 638-0790 and we'll let you know when it's back.
Free US shipping
30-day free returns
Stripe-secured checkout

All offers (1)

PriceConditionSeller
$74.25Best price New Basi6 International LLC

Stock and pricing refresh on page load. Tez can also compare prices on Amazon, AbeBooks, and ThriftBooks if you ask.

About this book

Summary<br> The Spark distributed data processing platform provides an easy-to-implement tool for ingesting, streaming, and processing data from any source. In <i>Spark in Action, Second Edition</i>, you’ll learn to take advantage of Spark’s core features and incredible processing speed, with applications including real-time computation, delayed evaluation, and machine learning. Spark skills are a hot commodity in enterprises worldwide, and with Spark’s powerful and flexible Java APIs, you can reap all the benefits without first learning Scala or Hadoop.<br> <br> Foreword by Rob Thomas.<br> <br> Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.<br> <br> About the technology<br> Analyzing enterprise data starts by reading, filtering, and merging files and streams from many sources. The Spark data processing engine handles this varied volume like a champ, delivering speeds 100 times faster than Hadoop systems. Thanks to SQL support, an intuitive interface, and a straightforward multilanguage API, you can use Spark without learning a complex new ecosystem.<br> <br> About the book<br> <i>Spark in Action, Second Edition</i>, teaches you to create end-to-end analytics applications. In this entirely new book, you’ll learn from interesting Java-based examples, including a complete data pipeline for processing NASA satellite data. And you’ll discover Java, Python, and Scala code samples hosted on GitHub that you can explore and adapt, plus appendixes that give you a cheat sheet for installing tools and understanding Spark-specific terms.<br> <br> What's inside<br> <br> Writing Spark applications in Java<br> Spark application architecture<br> Ingestion through files, databases, streaming, and Elasticsearch<br> Querying distributed datasets with Spark SQL<br> <br> About the reader<br> This book does not assume previous experience with Spark, Scala, or Hadoop.<br> <br> About the author<br> <b>Jean-Georges Perrin</b> is an experienced data and software architect. He is France’s first IBM Champion and has been honored for 12 consecutive years.<br> <br> Table of Contents<br> <br> PART 1 - THE THEORY CRIPPLED BY AWESOME EXAMPLES<br> <br> 1 So, what is Spark, anyway?<br> <br> 2 Architecture and flow<br> <br> 3 The majestic role of the dataframe<br> <br> 4 Fundamentally lazy<br> <br> 5 Building a simple app for deployment<br> <br> 6 Deploying your simple app<br> <br> PART 2 - INGESTION<br> <br> 7 Ingestion from files<br> <br> 8 Ingestion from databases<br> <br> 9 Advanced ingestion: finding data sources and building<br> <br> your own<br> <br> 10 Ingestion through structured streaming<br> <br> PART 3 - TRANSFORMING YOUR DATA<br> <br> 11 Working with SQL<br> <br> 12 Transforming your data<br> <br> 13 Transforming entire documents<br> <br> 14 Extending transformations with user-defined functions<br> <br> 15 Aggregating your data<br> <br> PART 4 - GOING FURTHER<br> <br> 16 Cache and checkpoint: Enhancing Spark’s performances<br> <br> 17 Exporting data and building full data pipelines<br> <br> 18 Exploring deployment

Details

Format
Paperback
Pages
576
Publisher
Simon and Schuster
Language
EN
Edition
2nd
ISBN-13
9781617295522
ISBN-10
1617295523

Categories

Computers, Languages, Java, Data Science