Nicht aus der Schweiz? Besuchen Sie lehmanns.de
Getting Started with Kudu - Jean-Marc Spaggiari, Mladen Kovacevic, Brock Noland, Ryan Bossheart

Getting Started with Kudu

Perform Fast Analytics on Fast Data
Buch | Softcover
200 Seiten
2018
O'Reilly Media (Verlag)
978-1-4919-8025-5 (ISBN)
CHF 62,80 inkl. MwSt
With this practical guide, you’ll learn how Kudu’s architecture and features solve a unique problem in the Hadoop ecosystem. If you’re familiar with other storage layer projects such HDFS, HBase, Spanner, and Cassandra, you’ll quickly learn—and appreciate—the unique contribution Kudu makes to this ecosystem.
Get up to speed with Apache Kudu, the column-oriented data store for Hadoop that not only provides an architectural simplification of several existing use cases, but also allows use cases not possible before. With this practical guide, enterprise architects working on big data implemetations will learn how Kudu’s architecture and features solve a unique problem in the Hadoop ecosystem. For example, Kudu makes Hadoop viable for real-time IoT use cases in addition to making a transition from a massively parallel processing (MPP) SQL database engine plausible.

If you’re familiar with other storage layer projects such HDFS, HBase, Spanner, and Cassandra, you’ll quickly learn—and appreciate—the unique contribution Kudu makes to this ecosystem.

Explore how Kudu is compatible with data processing frameworks in the Hadoop environment
Understand Kudu's architecture, internals, installation, and deployment
Learn how to fully administer a Kudu cluster
Become acquainted with low-level client APIs, how to integrate with SQL engines like Impala, and frameworks for integration
Learn about table and schema design
Get use cases, examples, best practices, and sample code

Jean-Marc Spaggiari, an early adopter of Kudu, works as a Principal Solutions Architect for Cloudera to support Hadoop, Kudu, HBase and other tools through technical support and consulting work. His deep knowledge of HBase and HDFS allows him to better understand Kudu and its applications. Jean-Marc's primary role is to support HBase users over their HBase cluster deployments, upgrades, configuration and optimization, as well as to support them regarding HBase related application development. He is also a very active HBase community member, testing every release from performance and stability standpoints. However, with Kudu being geared to quickly penetrate the market, he will also begin recommending, building demo applications and deploying proof of concepts around it. Prior to Cloudera, Jean-Marc worked as a Project Manager and as a Solutions Architect for CGI and insurances companies. He has almost 20 years of Java development experience. In addition to regularly attending Strata+Hadoop World and HBaseCon, he has spoken at various Hadoop User Group meetings and many conferences in North America, usually focusing on HBase related presentations and demonstrations. Jean-Marc is also the author of Architecting HBase Applications (O'Reilly). Mladen Kovacevic comes from a development background in RDBMS technology, and sees Kudu as a game changer in the Hadoop ecosystem. He has presented Kudu at several local meetups, presented on the state of Spark on Kudu during its beta while providing feedback early enough to ensure Spark with Kudu is a first-class citizen at its launch. He is a contributor to Apache Kudu and Kite SDK projects, and works as a Solutions Architect at Cloudera. Mladen's experience includes years of RDBMS engine development, systems optimization, performance and architecture, including optimizing Hadoop on the Power 8 platform while developing IBM's Big SQL technology. Brock Noland followed Kudu months before the first line of code was written, by following Todd Lipcon's paper reading habits. Brock is Chief Architect of phData, a pure-play Hadoop Managed Service Provider. Prior to founding phData, Brock spent four years at Cloudera as a Trainer, Solution Architect, Engineer, Sales Engineer, and Engineering Manager. Brock is a co-founder of Apache Sentry and Apache Project Committee Member on Apache Hive, Parquet, Crunch, Flume, and Incubator. Brock was a mentor to Kudu in the incubator and currently mentors Apache Impala (incubating). In addition he is a member of the Apache Software Foundation. Brock is frequent public speaker, having spoken at dozens of conferences including HBaseCon, numerous Hadoop User Groups, and other conferences. Ryan Bosshart is a Principal Systems Engineer at Cloudera. Ryan has spent the last 10 years building and architecting distributed systems. At Cloudera, Ryan leads the field storage specialization team where he focuses on Apache HDFS, HBase, and Kudu. He has worked with many early users of Kudu to build their relational, time-series, IOT, or real-time architectures. He has seen first-hand Kudu's ability to improve performance and simplify architectures. Ryan is a co-chair of the Twin Cities Spark and Hadoop User Group and the author of the training video Getting Started with Kudu (O'Reilly).

Erscheinungsdatum
Verlagsort Sebastopol
Sprache englisch
Maße 178 x 233 mm
Themenwelt Informatik Datenbanken Data Warehouse / Data Mining
ISBN-10 1-4919-8025-7 / 1491980257
ISBN-13 978-1-4919-8025-5 / 9781491980255
Zustand Neuware
Informationen gemäß Produktsicherheitsverordnung (GPSR)
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
Auswertung von Daten mit pandas, NumPy und IPython

von Wes McKinney

Buch | Softcover (2023)
O'Reilly (Verlag)
CHF 62,85
Datenanalyse für Künstliche Intelligenz

von Jürgen Cleve; Uwe Lämmel

Buch | Softcover (2024)
De Gruyter Oldenbourg (Verlag)
CHF 104,90