Introduction to HPC with MPI for data science / Frank Nielsen.

This gentle introduction to High Performance Computing (HPC) for Data Science using the Message Passing Interface (MPI) standard has been designed as a first course for undergraduates on parallel programming on distributed memory models, and requires only basic programming notions. Divided into two...

Full description

Saved in:
Bibliographic Details
Main Author: Nielsen, Frank (Author)
Format: Ebook
Language:English
Published: Cham : Springer, 2016.
Series:Undergraduate topics in computer science,
Subjects:
Online Access:Springer eBooks

MARC

LEADER 00000czm a2200000 i 4500
003 OCoLC
005 20221102235807.0
006 m o d
007 cr cnu000|uu||
008 160216s2016 sz a ob 001 0 eng d
011 |a Direct Search Result 
011 |a EDS Title: Introduction to HPC with MPI for Data Science 
011 |a MARC Score : 11050(21150) : OK 
020 |z 3319219022  |q print 
020 |z 9783319219028  |q print 
035 |a (ATU)b24456652 
035 |a (EDS)EDS10123604 
035 |a (OCoLC)939528764 
040 |a GW5XE  |b eng  |e rda  |c GW5XE  |d YDXCP  |d AZU  |d OCLCF  |d COO  |d UAB  |d OCLCQ  |d K6U  |d IAD  |d JBG  |d ICW  |d S4S  |d VT2  |d Z5A  |d ILO  |d LIP  |d ICN  |d OTZ  |d LIV  |d ESU  |d IOG  |d ATU 
050 4 |a QA76.88 
082 0 4 |a 004.11  |2 23 
100 1 |a Nielsen, Frank,  |e author.  |9 866099 
245 1 0 |a Introduction to HPC with MPI for data science /  |c Frank Nielsen. 
264 1 |a Cham :  |b Springer,  |c 2016. 
300 |a 1 online resource (xxxiii, 282 pages) :  |b illustrations. 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a text file  |b PDF 
490 1 |a Undergraduate topics in computer science,  |x 1863-7310 
504 |a Includes bibliographical references and index. 
505 0 |a Preface -- Part 1: High Performance Computing (HPC) with the Message Passing Interface (MPI) -- A Glance at High Performance Computing (HPC) -- Introduction to MPI: The Message Passing Interface -- Topology of Interconnection Networks -- Parallel Sorting -- Parallel Linear Algebra.-The MapReduce Paradigm -- Part 11: High Performance Computing for Data Science -- Partition-based Clustering with k means -- Hierarchical Clustering -- Supervised Learning: Practice and Theory of Classification with k NN rule -- Fast Approximate Optimization to High Dimensions with Core-sets and Fast Dimension Reduction -- Parallel Algorithms for Graphs -- Appendix A: Written Exam -- Appendix B: SLURM: A resource manager and job scheduler on clusters of machines -- Appendix C: List of Figures -- Appendix D: List of Tables -- Appendix E: Index. 
520 |a This gentle introduction to High Performance Computing (HPC) for Data Science using the Message Passing Interface (MPI) standard has been designed as a first course for undergraduates on parallel programming on distributed memory models, and requires only basic programming notions. Divided into two parts the first part covers high performance computing using C++ with the Message Passing Interface (MPI) standard followed by a second part providing high-performance data analytics on computer clusters. In the first part, the fundamental notions of blocking versus non-blocking point-to-point communications, global communications (like broadcast or scatter) and collaborative computations (reduce), with Amdalh and Gustafson speed-up laws are described before addressing parallel sorting and parallel linear algebra on computer clusters. The common ring, torus and hypercube topologies of clusters are then explained and global communication procedures on these topologies are studied. This first part closes with the MapReduce (MR) model of computation well-suited to processing big data using the MPI framework. In the second part, the book focuses on high-performance data analytics. Flat and hierarchical clustering algorithms are introduced for data exploration along with how to program these algorithms on computer clusters, followed by machine learning classification, and an introduction to graph analytics. This part closes with a concise introduction to data core-sets that let big data problems be amenable to tiny data problems. Exercises are included at the end of each chapter in order for students to practice the concepts learned, and a final section contains an overall exam which allows them to evaluate how well they have assimilated the material covered in the book. 
546 |a English. 
588 0 |a Online resource; title from PDF title page (SpringerLink, viewed February 15, 2016). 
650 0 |a High performance computing.  |9 328480 
776 0 8 |i Printed edition:  |z 9783319219028 
776 1 8 |w (OCoLC)982239315  |w (OCoLC)990719642  |w (OCoLC)1005791794  |w (OCoLC)1011791058 
830 0 |a Undergraduate topics in computer science,  |x 1863-7310.  |9 248170 
856 4 0 |u https://ezproxy.aut.ac.nz/login?url=https://link.springer.com/10.1007/978-3-319-21903-5  |z Springer eBooks  |x TEMPORARY ERM URL 
907 |a .b24456652  |b 06-09-21  |c 23-11-17 
942 |c EB 
998 |a none  |b 08-12-17  |c m  |d z   |e -  |f eng  |g sz   |h 0 
999 |c 1441998  |d 1441998 
Availability
Requests
Request this item Request this AUT item so you can pick it up when you're at the library.
Interlibrary Loan With Interlibrary Loan you can request the item from another library. It's a free service.