Apache Spark 2.x Cookbook

Author: Rishi Yadav

Language: English

Binding: Paperback

Publisher: Packt Publishing Limited

Availability: In stock at the supplier

Ships in 9-15 days

1,257 CZK

Book information

Author: Rishi Yadav

Language: English

Binding: Paperback

Published: 2017

Pages: 294

EAN: 9781787127265

ISBN: 9781787127265

Enbook ID: 16434031

Publisher: Packt Publishing Limited

Weight: 560

Dimensions: 234 x 191 x 22

Full description

Perform lightning-fast Big Data processing using Apache Spark 2.x with the help of this practical guide

Key Features:

- Contains quick solutions to even the most complex Big Data processing problems using Apache Spark

- Leverage the power of Apache Spark as a unified compute engine and perform streaming analytics, machine learning and graph processing with ease

- From installing and setting up Spark to fine-tuning its performance, this practical guide is all you need to become a master in using Apache Spark

Book Description:

While Apache Spark 1.x gained a lot of traction and adoption in its early years, Spark 2.x delivers notable improvements in API design, schema awareness, performance, and Structured Streaming, and simplifies the building blocks for better, faster, smarter, and more accessible big data applications. This book covers all these features as structured recipes for analyzing and refining large and complex data sets.

Starting with installing and configuring Apache Spark with various cluster managers, you will learn to set up development environments. Further on, you will be introduced to working with RDDs, DataFrames, and Datasets to operate on schema-aware data, and to real-time streaming from sources such as Twitter Stream and Apache Kafka. You will also work through recipes on machine learning, including supervised learning, unsupervised learning, and recommendation engines in Spark.

Last but not least, the final few chapters delve deeper into the concepts of graph processing using GraphX, securing your implementations, cluster optimization, and troubleshooting.

What You Will Learn:

- Install and configure Apache Spark with various cluster managers and on AWS

- Set up a development environment for Apache Spark, including a Databricks Cloud notebook

- Find out how to operate on data in Spark with schemas

- Get to grips with real-time streaming analytics using Spark Streaming and Structured Streaming

- Master supervised learning and unsupervised learning using MLlib

- Build a recommendation engine using MLlib

- Process graphs using the GraphX and GraphFrames libraries

- Develop a set of common applications or project types, and solutions that solve complex big data problems

Who this book is for:

This book is for data engineers, data scientists, and Big Data professionals who want to leverage the power of Apache Spark 2.x for real-time Big Data processing. If you're looking for quick solutions to common problems while using Spark 2.x effectively, this book will also help you. The book assumes you have a basic knowledge of Scala as a programming language.
