Databricks Data Intelligence Platform
Apress (Verlag)
979-8-8688-0443-4 (ISBN)
Databricks offers a scalable and efficient solution for processing large volumes of both structured and unstructured data, facilitating advanced analytics, machine learning, and real-time processing. In today's GenAI world, Databricks plays a crucial role in empowering organizations to extract value from their data effectively, driving innovation and gaining a competitive edge in the digital age. This book will not only help you master the Data Intelligence Platform but also help power your enterprise to the next level with a bespoke LLM unique to your organization.
Beginning with foundational principles, the book starts with a platform overview and explores features and best practices for ingestion, transformation, and storage with Delta Lake. Advanced topics include leveraging Databricks SQL for querying and visualizing large datasets, ensuring data governance and security with Unity Catalog, and deploying machine learning and LLMs using Databricks MLflow for GenAI. Through practical examples, insights, and best practices, this book equips solution architects and data engineers with the knowledge to design and implement scalable data solutions, making it an indispensable resource for modern enterprises.
Whether you are new to Databricks and trying to learn a new platform, a seasoned practitioner building data pipelines, data science models, or GenAI applications, or even an executive who wants to communicate the value of Databricks to customers, this book is for you. With its extensive feature and best practice deep dives, it also serves as an excellent reference guide if you are preparing for Databricks certification exams.
What You Will Learn
Foundational principles of Lakehouse architecture
Key features including Unity Catalog, Databricks SQL (DBSQL), and Delta Live Tables
Databricks Intelligence Platform and key functionalities
Building and deploying GenAI Applications from data ingestion to model serving
Databricks pricing, platform security, DBRX, and many more topics
Who This Book Is For
Solution architects, data engineers, data scientists, Databricks practitioners, and anyone who wants to deploy their Gen AI solutions with the Data Intelligence Platform. This is also a handbook for senior execs who need to communicate the value of Databricks to customers. People who are new to the Databricks Platform and want comprehensive insights will find the book accessible.
Nikhil Gupta is a seasoned data professional with over 18 years of experience in big data technologies, driving innovation and strategic growth in the field. As a Solution Architect at Databricks, he leverages his expertise to help customers across various industries, including retail, CPG, financial services, banking, and manufacturing, modernize their data and AI implementations on the Databricks platform. His expertise spans a range of big data technologies, including data warehousing, data lakes, and real-time data processing, making him a trusted advisor for Fortune 500 companies. Jason Yip is a data and machine learning architect. He currently serves as Director of Data and AI at Tredence, a leading data science and analytics company. He advises Fortune 500 companies on implementing data and Generative AI strategies on the cloud. He serves on multiple advisory boards at Databricks, including the Partner Product Advisory Board, and the Solution Architect Champion Advisory board. He is a top voice on Databricks and a former Microsoft employee who successfully led the Microsoft Corporate Finance big data transformation using Databricks.
1. Introduction.- 2. Lakehouse Platform.- 3. Databricks Platform Overview.- 4. Data Ingestion and Real-Time Analytics.- 5. Delta Lake: Deep Dive.- 6. Data Governance with Unity Catalog.- 7. Data Engineering and Analytics.- 8. Data Science, Machine Learning, and AI.- 9. Building GenAI Applications on Databricks Platform.- 10. Data Warehousing with DBSQL.- 11. Data Intelligence Platform.- 12. CI/CD and Application Development.- 13. Databricks Pricing and Observability using System Tables.- 14. Platform Security and Compliance.- 15. Advanced Topics and Industry Applications.- 16. Streaming Applications and HA/DR.- 17. Databricks in the Cloud Ecosystem.- 18. Conclusion.
Erscheinungsdatum | 16.10.2024 |
---|---|
Zusatzinfo | 235 Illustrations, color; 8 Illustrations, black and white; XVII, 473 p. 243 illus., 235 illus. in color. |
Verlagsort | Berlin |
Sprache | englisch |
Maße | 155 x 235 mm |
Themenwelt | Mathematik / Informatik ► Informatik ► Datenbanken |
Informatik ► Theorie / Studium ► Künstliche Intelligenz / Robotik | |
Mathematik / Informatik ► Mathematik ► Finanz- / Wirtschaftsmathematik | |
Schlagworte | Big Data • Database • Databricks • data engineering • data Intelligence Platform • Delta Lake • GenAI • Lakehouse • LLM • Machine learniing • MLFlow • Spark |
ISBN-13 | 979-8-8688-0443-4 / 9798868804434 |
Zustand | Neuware |
Haben Sie eine Frage zum Produkt? |
aus dem Bereich