Big Data Preprocessing - Julián Luengo, Diego García-Gil, Sergio Ramírez-Gallego, Salvador García, Francisco Herrera

Big Data Preprocessing

Enabling Smart Data
Book | Hardcover
XIII, 186 pages
2020 | 1st ed. 2020
Springer International Publishing (publisher)
978-3-030-39104-1 (ISBN)
CHF 112.30 incl. VAT
  • Ships in 15-20 days
  • Free shipping
  • Payment by invoice also available
This book offers a comprehensible overview of Big Data preprocessing, including a formal description of each problem. It also focuses on the most relevant proposed solutions and illustrates actual implementations of algorithms that help the reader deal with these problems.
This book stresses the gap between big, raw data and the quality data that businesses demand. Data that meets these requirements is called Smart Data, and preprocessing is a key step in achieving it: this is where imperfections are handled, integration tasks are carried out and superfluous information is eliminated. The authors present the concept of Smart Data through data preprocessing in Big Data scenarios and connect it with the emerging paradigms of IoT and edge computing, where the end points generate Smart Data without relying completely on the cloud.
Finally, this book presents some novel areas of study that are attracting growing attention in Big Data preprocessing. Specifically, it considers the relation with Deep Learning (as a technique that also relies on large volumes of data), the difficulty of finding the appropriate selection and concatenation of preprocessing techniques, and some other open problems.
Practitioners and data scientists who work in this field and want an introduction to preprocessing in large data volume scenarios will want to purchase this book. Researchers in this field who want to know which algorithms are currently implemented to support their investigations may also be interested in this book.

Julián Luengo received the M.S. degree in computer science and the Ph.D. from the University of Granada, Granada, Spain, in 2006 and 2011, respectively. He is currently an Assistant Professor in the Department of Computer Science and Artificial Intelligence at the University of Granada, Spain. His research interests include machine learning and data mining, data preparation in knowledge discovery and data mining, missing values, noisy data, data complexity and fuzzy systems. Dr. Luengo has received several awards and honors for his personal work and for his publications in journals and conferences, such as the IFSA-EUSFLAT 2009 Best Student Paper Award. He is on the list of Highly Cited Researchers in the area of Computer Sciences (2015-2018) (Clarivate Analytics).

Diego García-Gil received the M.Sc. degree in computer science from the University of Granada, Granada, Spain, in 2015. He is currently pursuing the Ph.D. degree with the Department of Computer Science and Artificial Intelligence, University of Granada, Granada, Spain. His current research interests include machine learning, data mining, data preprocessing and Big Data.

Sergio Ramírez-Gallego received the M.Sc. degree in computer science from the University of Jaén, Jaén, Spain, in 2012. He obtained the Ph.D. degree from the Department of Computer Science and Artificial Intelligence, University of Granada, Spain, in 2018. His current research interests include data mining, data preprocessing, big data, and cloud computing.

Salvador García received the B.S. and Ph.D. degrees in Computer Science from the University of Granada, Granada, Spain, in 2004 and 2008, respectively. He is currently an Associate Professor in the Department of Computer Science and Artificial Intelligence, University of Granada, Granada, Spain. Dr. García has published more than 80 papers in international journals (more than 60 in Q1) and over 60 papers in international conference proceedings, with an h-index of 43 (data from Web of Science). He has organized several special sessions and workshops related to data preprocessing and evolutionary learning in conferences such as "Hybrid Intelligent Systems", "Intelligent Systems Design and Applications" and "International Joint Conference on Neural Networks". He has served on the international program committees and organizing committees of several regular international conferences including IEEE CEC, ICPR, ICDM, IJCAI, etc. In his editorial activities, he has co-edited two special issues in international journals; he is an associate editor of the journals "Information Fusion" (Elsevier), "Swarm and Evolutionary Computation" (Elsevier) and "AI Communications" (IOS Press), and co-Editor in Chief of the international journal "Progress in Artificial Intelligence" (Springer). He is a co-author of the books entitled "Data Preprocessing in Data Mining" and "Learning from Imbalanced Data Sets", published by Springer. His research interests include data science, data preprocessing, Big Data, evolutionary learning, Deep Learning, metaheuristics and biometrics.

Francisco Herrera (SM'15) received his M.Sc. in Mathematics in 1988 and Ph.D. in Mathematics in 1991, both from the University of Granada, Spain. He is currently a Professor in the Department of Computer Science and Artificial Intelligence at the University of Granada and Director of the DaSCI Institute (Andalusian Research Institute in Data Science and Computational Intelligence). He has supervised 44 Ph.D. students. He has published more than 400 journal papers, receiving more than 66000 citations (Google Scholar, h-index 132). He is co-author of the books "Genetic Fuzzy Systems" (World Scientific, 2001), "Data Preprocessing in Data Mining" (Springer, 2015), "The 2-tuple Linguistic Model. Computing with Words in Decision Making" (Springer, 2015), "Multilabel Classification. Problem analysis, metrics a

1. Introduction
2. Big Data: Technologies and Tools
3. Smart Data
4. Dimensionality Reduction for Big Data
5. Data Reduction for Big Data
6. Imperfect Big Data
7. Big Data Discretization
8. Imbalanced Data Preprocessing for Big Data
9. Big Data Software
10. Final Thoughts: From Big Data to Smart Data

Publication date
Additional info XIII, 186 p. 57 illus., 54 illus. in color.
Place of publication Cham
Language English
Dimensions 155 x 235 mm
Weight 457 g
Subject area Mathematics / Computer Science > Computer Science > Databases
Mathematics / Computer Science > Computer Science > Networks
Computer Science > Theory / Studies > Artificial Intelligence / Robotics
Keywords Big Data • classification • Data preprocessing • Data reduction • Data Science • dimensionality reduction • Flink • FlinkML • Imbalance Data • Imperfect Data • machine learning • map-reduce • MLlib • Spark
ISBN-10 3-030-39104-3 / 3030391043
ISBN-13 978-3-030-39104-1 / 9783030391041
Condition New