Nicht aus der Schweiz? Besuchen Sie lehmanns.de
CHF 66,30 inkl. MwSt
Data mining of massive data sets is transforming the way we think about crisis response, marketing, entertainment, cybersecurity and national intelligence. This title examines the frontier of analyzing massive amounts of data, whether in a static database or streaming through a system.
Data mining of massive data sets is transforming the way we think about crisis response, marketing, entertainment, cybersecurity and national intelligence. Collections of documents, images, videos, and networks are being thought of not merely as bit strings to be stored, indexed, and retrieved, but as potential sources of discovery and knowledge, requiring sophisticated analysis techniques that go far beyond classical indexing and keyword counting, aiming to find relational and semantic interpretations of the phenomena underlying the data.


Frontiers in Massive Data Analysis examines the frontier of analyzing massive amounts of data, whether in a static database or streaming through a system. Data at that scale—terabytes and petabytes—is increasingly common in science (e.g., particle physics, remote sensing, genomics), Internet commerce, business analytics, national security, communications, and elsewhere. The tools that work to infer knowledge from data at smaller scales do not necessarily work, or work well, at such massive scale. New tools, skills, and approaches are necessary, and this report identifies many of them, plus promising research directions to explore. Frontiers in Massive Data Analysis discusses pitfalls in trying to infer knowledge from massive data, and it characterizes seven major classes of computation that are common in the analysis of massive data. Overall, this report illustrates the cross-disciplinary knowledge—from computer science, statistics, machine learning, and application disciplines—that must be brought to bear to make useful inferences from massive data.

Table of Contents


Front Matter
Summary
1 Introduction
2 Massive Data in Science, Technology, Commerce, National Defense, Telecommunications, and Other Endeavors
3 Scaling the Infrastructure for Data Management
4 Temporal Data and Real-Time Algorithms
5 Large-Scale Data Representations
6 Resources, Trade-offs, and Limitations
7 Building Models from Massive Data
8 Sampling and Massive Data
9 Human Interaction with Data
10 The Seven Computational Giants of Massive Data Analysis
11 Conclusions
Appendixes
Appendix A: Acronyms
Appendix B: Biographical Sketches of Committee Members

1 Front Matter; 2 Summary; 3 1 Introduction; 4 2 Massive Data in Science, Technology, Commerce, National Defense, Telecommunications, and Other Endeavors; 5 3 Scaling the Infrastructure for Data Management; 6 4 Temporal Data and Real-Time Algorithms; 7 5 Large-Scale Data Representations; 8 6 Resources, Trade-offs, and Limitations; 9 7 Building Models from Massive Data; 10 8 Sampling and Massive Data; 11 9 Human Interaction with Data; 12 10 The Seven Computational Giants of Massive Data Analysis; 13 11 Conclusions; 14 Appendixes; 15 Appendix A: Acronyms; 16 Appendix B: Biographical Sketches of Committee Members

Verlagsort Washington
Sprache englisch
Maße 152 x 229 mm
Themenwelt Informatik Datenbanken Data Warehouse / Data Mining
Mathematik / Informatik Mathematik
ISBN-10 0-309-28778-2 / 0309287782
ISBN-13 978-0-309-28778-4 / 9780309287784
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
Datenanalyse für Künstliche Intelligenz

von Jürgen Cleve; Uwe Lämmel

Buch | Softcover (2024)
De Gruyter Oldenbourg (Verlag)
CHF 104,90
Auswertung von Daten mit pandas, NumPy und IPython

von Wes McKinney

Buch | Softcover (2023)
O'Reilly (Verlag)
CHF 62,85