Big Data SMACK - Raul Estrada, Isaac Ruiz

Big Data SMACK (eBook)

A Guide to Apache Spark, Mesos, Akka, Cassandra, and Kafka

Raul Estrada, Isaac Ruiz (Autoren)

eBook Download: PDF

2016 | 1st ed.
XXV, 264 Seiten
Apress (Verlag)
978-1-4842-2175-4 (ISBN)

This book is about how to integrate full-stack open source big data architecture and how to choose the correct technology-Scala/Spark, Mesos, Akka, Cassandra, and Kafka-in every layer. Big data architecture is becoming a requirement for many different enterprises. So far, however, the focus has largely been on collecting, aggregating, and crunching large datasets in a timely manner. In many cases now, organizations need more than one paradigm to perform efficient analyses.

Big Data SMACK explains each of the full-stack technologies and, more importantly, how to best integrate them. It provides detailed coverage of the practical benefits of these technologies and incorporates real-world examples in every situation. The book focuses on the problems and scenarios solved by the architecture, as well as the solutions provided by every technology. It covers the six main concepts of big data architecture and how integrate, replace, and reinforce every layer:

The language: Scala
The engine: Spark (SQL, MLib, Streaming, GraphX)
The container: Mesos, Docker
The view: Akka
The storage: Cassandra
The message broker: Kafka

What you'll learn

How to make big data architecture without using complex Greek letter architectures.
How to build a cheap but effective cluster infrastructure.
How to make queries, reports, and graphs that business demands.
How to manage and exploit unstructured and No-SQL data sources.
How use tools to monitor the performance of your architecture.
How to integrate all technologies and decide which replace and which reinforce.

Who This Book Is For

This book is for developers, data architects, and data scientists looking for how to integrate the most successful big data open stack architecture and how to choose the correct technology in every layer.

Raúl Estrada is the co-founder of Treu Technologies, an enterprise for Social Data Marketing and BigData research. Estrada is an Enterprise Architect with more than 15 years of experience in cluster management and Enterprise Software. Prior to founding Treu Technologies, Estrada worked as an Enterprise Architect in Application Servers & evangelist for Oracle Inc. Estrada loves functional languages like Elixir and Scala, and also has a Master degree on Computer Science.

Isaac Ruiz is a Java programmer since 2001, and a consultant and architect since 2003. Ruiz had participated in projects of different areas and varied scopes (education, communications, retail, and others). Ruiz specializes in systems integration and has participated in projects mainly related to the financial sector. Ruiz is a supporter of free software. Ruiz like to experiment with new technologies (frameworks, languages, methods).

Learn how to integrate full-stack open source big data architecture and to choose the correct technology-Scala/Spark, Mesos, Akka, Cassandra, and Kafka-in every layer. Big data architecture is becoming a requirement for many different enterprises. So far, however, the focus has largely been on collecting, aggregating, and crunching large data sets in a timely manner. In many cases now, organizations need more than one paradigm to perform efficient analyses.Big Data SMACK explains each of the full-stack technologies and, more importantly, how to best integrate them. It provides detailed coverage of the practical benefits of these technologies and incorporates real-world examples in every situation. This book focuses on the problems and scenarios solved by the architecture, as well as the solutions provided by every technology. It covers the six main concepts of big data architecture and how integrate, replace, and reinforce every layer:The language: ScalaThe engine: Spark (SQL, MLib, Streaming, GraphX)The container: Mesos, DockerThe view: AkkaThe storage: CassandraThe message broker: KafkaWhat You Will Learn:Make big data architecture without using complex Greek letter architecturesBuild a cheap but effective cluster infrastructureMake queries, reports, and graphs that business demandsManage and exploit unstructured and No-SQL data sourcesUse tools to monitor the performance of your architectureIntegrate all technologies and decide which ones replace and which ones reinforceWho This Book Is For:Developers, data architects, and data scientists looking to integrate the most successful big data open stack architecture and to choose the correct technology in every layer

Raúl Estrada is the co-founder of Treu Technologies, an enterprise for Social Data Marketing and BigData research. He is an Enterprise Architect with more than 15 years of experience in cluster management and Enterprise Software. Prior to founding Treu Technologies, Estrada worked as an Enterprise Architect in Application Servers & evangelist for Oracle Inc. He loves functional languages like Elixir and Scala, and also has a Master of Computer Science degree. Isaac Ruiz has been a Java programmer since 2001, and a consultant and architect since 2003. He has participated in projects of different areas and varied scopes (education, communications, retail, and others). Ruiz specializes in systems integration and has participated in projects mainly related to the financial sector. He is a supporter of free software. Ruiz likes to experiment with new technologies (frameworks, languages, methods).

Part 1. Introduction

Chapter 1. Big Data, Big Problems

Chapter 2. Big Data, Big Solutions

Part 2. Playing SMACK

Chapter 3. The Language: Scala

Chapter 4. The Model: Akka

Chapter 5. Storage. Apache Cassandra

Chapter 6. The View

Chapter 7. The Manager: Apache Mesos

Chapter 8. The Broker: Apache Kafka

Part 3. Improving SMACK

Chapter 9. Fast Data Patterns

Chapter 10. Big Data Pipelines

Chapter 11. Glossary.

Erscheint lt. Verlag	29.9.2016
Zusatzinfo	XXV, 264 p. 74 illus., 52 illus. in color.
Verlagsort	Berkeley
Sprache	englisch
Themenwelt	Mathematik / Informatik ► Informatik ► Datenbanken
Themenwelt	Mathematik / Informatik ► Informatik ► Netzwerke
Schlagworte	Akka • Apache Cassandra • Apache Kafka • Apache Mesos • Apache Spark • Big Data • data structures • Docker • Hadoop • No-SQL databases • Scala
ISBN-10	1-4842-2175-3 / 1484221753
ISBN-13	978-1-4842-2175-4 / 9781484221754

Haben Sie eine Frage zum Produkt?

PDF (Wasserzeichen)
Größe: 11,4 MB

DRM: Digitales Wasserzeichen
Dieses eBook enthält ein digitales Wasserzeichen und ist damit für Sie personalisiert. Bei einer missbräuchlichen Weitergabe des eBooks an Dritte ist eine Rückverfolgung an die Quelle möglich.

Dateiformat: PDF (Portable Document Format)
Mit einem festen Seitenlayout eignet sich die PDF besonders für Fachbücher mit Spalten, Tabellen und Abbildungen. Eine PDF kann auf fast allen Geräten angezeigt werden, ist aber für kleine Displays (Smartphone, eReader) nur eingeschränkt geeignet.

Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen dafür einen PDF-Viewer - z.B. den Adobe Reader oder Adobe Digital Editions.
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen dafür einen PDF-Viewer - z.B. die kostenlose Adobe Digital Editions-App.

Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.

Print-Ausgabe

Buch | Softcover

CHF 52,40