Building the Next Generation of Parallel and Resilient Applications and LibrariesMichael A. Heroux
Computing Research Division, Sandia National Laboratories
Monday, September 16, 2013 10:00-11:00,
For decades parallel computing has been the focus of intense research and development in selected fields, and numerous large-scale parallel applications have been developed. SPMD via MPI has been a dominant approach to parallelism to date, but this approach alone will be insufficient going forward. Presently we are on the threshold of large-scale software re-design and implementation across most application areas, but the path to developing these applications is uncertain. There are many programming models, languages and architectures from which to pick, and the number of choices is growing. Furthermore, resilience is becoming an issue and may force application developers to explicitly manage system faults, beyond checkpoint-restart.
In this presentation we discuss some of the principles of parallel application development that have produced today’s codes, and how we can address these principles going forward. We also discuss what much change in order to move forward and give ideas for developing parallel applications and libraries now that will have sustained value in the future.
Speaker Bio: Dr. Michael A. Heroux is a Distinguished Member of the Technical Staff at Sandia National Laboratories, working on new algorithm development, and robust parallel implementation of solver components for problems of interest to Sandia and the broader scientific and engineering community. He leads development of the Trilinos Project, an effort to provide state of the art solution methods in a state of the art software framework. Trilinos is a 2004 R&D 100 award-winning product, freely available as Open Source and actively developed by dozens of researchers. In addition to Trilinos, Dr. Heroux works on the development of scalable parallel scientific and engineering applications and maintains his interest in the interaction of scientific/engineering applications and high performance computer architectures. He leads the Mantevo Project, a 2013 R&D 100 winner, which is focused on the development of Open Source, portable mini-applications and mini-drivers for scientific and engineering applications. Dr. Heroux is a telecommuter for Sandia, maintaining an office at home in rural central Minnesota and at St. John's University where he is Scientist in Residence in the Computer Science Department. He is a member of the Society for Industrial and Applied Mathematics (SIAM) and past chair of the SIAM Activity Group on Supercomputing. He is a Distinguished Member of the Association for Computing Machinery (ACM). He is the Editor-in-Chief for the ACM Transactions on Mathematical Software, Subject Area Editor for the Journal on Parallel and Distributed Computing and Associate Editor for the SIAM Journal on Scientific Computing.
Contact: R. Pozo
Note: Visitors from outside NIST must contact Cathy Graham; (301) 975-3800; at least 24 hours in advance.