Scalable Big Data Analytics for Protein Bioinformatics
Springer International Publishing (Verlag)
978-3-030-07538-5 (ISBN)
The content of the book is divided into four parts. The first part provides background information on proteins and their representation levels, including a formal model of a 3D protein structure used in computational processes, and a brief overview of the technologies used in the solutions presented in the book. The second part of the book discusses Cloud services that are utilized in the development of scalable and reliable cloud applications for 3D protein structure similarity searching and protein structure prediction. The third part of the book shows the utilization of scalable Big Datacomputational frameworks, like Hadoop and Spark, in massive 3D protein structure alignments and identification of intrinsically disordered regions in protein structures. The fourth part of the book focuses on finding 3D protein structure similarities, accelerated with the use of GPUs and the use of multithreading and relational databases for efficient approximate searching on protein secondary structures.
The book introduces advanced techniques and computational architectures that benefit from recent achievements in the field of computing and parallelism. Recent developments in computer science have allowed algorithms previously considered too time-consuming to now be efficiently used for applications in bioinformatics and the life sciences. Given its depth of coverage, the book will be of interest to researchers and software developers working in the fields of structural bioinformatics and biomedical databases.
Dariusz Mrozek is currently an Associate Professor and Head of Division of Theory of Informatics in Institute of Informatics at the Silesian University of Technology (SUT) in Gliwice, Poland. He received his PhD degree from SUT in 2006. His research interests cover bioinformatics, information systems, parallel and Cloud computing, databases and Big data. He is now focused on the analysis of protein structures, functions and activities, and the use of novel computation techniques to get insights from biological data, including NGS and proteomics data. He is the author of 90+ papers published in conference proceedings and international journals, co-editor of thirteen books devoted to databases and data processing, and editor of two special issues in reputable scientific journals. He is a member of the IEEE Engineering in Medicine and Biology Society (EMBS), IEEE Systems, Man, and Cybernetics Society (SMCS), and IEEE Cloud Computing Community. Working in different research projects, he cooperated with qualified institutions, e.g. Imperial College of London (on the Chernobyl Tissue Bank), V P Komisarenko Institute of Endocrinology and Metabolism - Academy of Medical Sciences of the Ukraine, Medical Radiological Research Centre - Russian Academy of Medical Sciences, Helmholtz Zentrum Muenchen Deutsches Forschungszentrum Fuer Gesundheit und Umwelt Gmbh, Microsoft Research in the USA, Institute of Oncology in Gliwice, Poland, Medical University of Silesia, Katowice, Poland.
Formal Model of 3D Protein Structures for Functional Genomics, Comparative Bioinformatics, and Molecular Modeling.- Multithreaded PSS-SQL for Searching Databases of Secondary Structures.- GPU and CUDA for 3D Protein Structure Similarity Searching.- Cloud Computing for 3D Protein Structure Alignment.- General Discussion and Concluding Remarks.
"This excellent and practically oriented text can benefit researchers seeking to establish a cloud-based bioinformatics HPC facility. Note that most of the solutions are implemented as embarrassingly parallel processes and not as distributed parallel processes. The book will be of interest to researchers and scientific software developers of bioinformatics and biomedical databases." (Alexander Tzanov, Computing Reviews, June 06, 2019)
Erscheint lt. Verlag | 26.12.2018 |
---|---|
Reihe/Serie | Computational Biology |
Zusatzinfo | XXVI, 315 p. 151 illus., 110 illus. in color. |
Verlagsort | Cham |
Sprache | englisch |
Maße | 155 x 235 mm |
Gewicht | 710 g |
Themenwelt | Mathematik / Informatik ► Informatik ► Netzwerke |
Informatik ► Weitere Themen ► Bioinformatik | |
Naturwissenschaften ► Biologie ► Biochemie | |
Schlagworte | Amino Acid Sequence • Bioinformatics • Cloud Computing • GPU, CUDA • Multi-agent Systems • Multithreaded Processing • Parallel Processing • proteins • Protein Structure |
ISBN-10 | 3-030-07538-9 / 3030075389 |
ISBN-13 | 978-3-030-07538-5 / 9783030075385 |
Zustand | Neuware |
Haben Sie eine Frage zum Produkt? |
aus dem Bereich