TotalView® Achieves Massive Milestone Towards Exascale Debugging

TotalView Debugs 786,432 Processor Cores as Part of Scalability Initiative

SALT LAKE CITY, UT and BOULDER, CO -- (Marketwire) -- 11/12/12 -- At the SC12 conference, Rogue Wave Software, the largest independent provider of cross-platform software development tools and embedded components for the next generation of HPC applications, announced that TotalView® has achieved a significant debugging milestone during testing conducted as part of its strategic scalability initiative. During the testing, TotalView demonstrated its capability to debug a parallel job running on 786,432 processor cores. The tests were conducted on Sequoia, the IBM® Blue Gene/Q® supercomputer at Lawrence Livermore National Laboratory (LLNL). These scalability tests are key to advancing Rogue Wave's strategic business goal of providing leading tools that scale with its customers' applications on today's petascale computers and of ensuring that TotalView is well positioned for the industry's move towards exascale computing. Sequoia serves the National Nuclear Security Administration's Advanced Simulation and Computing (ASC) program, a cornerstone of the effort to ensure the safety, security, and reliability of the nation's nuclear deterrent without underground testing.

"We are actively working to increase the capabilities of our scientific codes to scale and take advantage of the phenomenal power of Sequoia. As part of this effort, we are looking for ways to get more on-node parallelism from existing codes and architecting our new codes to support the even more massive degrees of parallelism that we know will be needed in the future," stated Scott Futral, LLNL group leader for Development Environment. "Rogue Wave's dedication to pushing for ever-increasing scales with its TotalView debugger and the recent tests give us reason to be confident that TotalView will continue to be a critical development tool as we reach higher and higher scales with our own codes."

Rogue Wave's scalability initiative, a partnership with LLNL and LLNL's Tri-Lab partners (Los Alamos National Laboratory and Sandia National Laboratories), features a multi-architecture approach, targeting the Blue Gene/Q platform along with x86-based architectures such as the Cray® XE™. Extreme-scale testing allows TotalView engineers to identify bottlenecks and prioritize efforts in optimizing and tuning the debugging engine for scalability. During the most recent testing session, TotalView successfully scaled across 786,432 cores, with no indication of the debugger hitting any barriers.

Rogue Wave conducted this test using a hybrid MPI + OpenMP code that implements a method for solving a system of linear equations. This application, which uses MPI for distributed-memory, multi-process parallelism and OpenMP for shared-memory, thread-based parallelism, was selected because it shares important characteristics with many applications used on extreme-scale systems such as Sequoia. This kind of attention to the workloads of large-scale systems is another key aspect of scalability requirements.

Since there was no indication of any barrier being hit at the 786,432-core mark, the testing suggests that TotalView could have leveraged more of Sequoia's 1.5 million cores if additional compute nodes had been available. To push TotalView's scalability further, additional tests oversubscribed the machine by spinning up more than one thread per core. Rogue Wave will announce the result of this second set of tests, which demonstrate successful debugging of an even higher number of threads, on Thursday, November 15th, at 12:00 PM MST. Rogue Wave invites SC12 attendees to visit its booth, #3418, to participate in a competition to correctly guess the number of threads TotalView debugged.

About TotalView

TotalView® is a highly scalable debugger that provides troubleshooting for a wide variety of applications, including serial, parallel, multi-threaded, multi-process, and remote applications.
Designed for developer productivity, TotalView simplifies and shortens the process of developing, debugging, and optimizing complex code. It provides a unique combination of capabilities for pinpointing and fixing hard-to-reproduce bugs, memory leaks, and performance issues. TotalView raises the bar for debugging by providing several additional features at no extra cost, including CUDA and OpenACC debugging and deterministic reverse debugging, which allows users to pause, rewind, and play back sessions to accurately identify and correct errors.

About Rogue Wave Software

Rogue Wave Software, Inc. is the largest independent provider of cross-platform software development tools and embedded components for the next generation of HPC applications. Rogue Wave marries High Performance Computing with High Productivity Computing to enable developers to harness the power of parallel applications and multicore computing. Rogue Wave products reduce the complexity of prototyping, developing, debugging, and optimizing multi-processor and data-intensive applications. Rogue Wave customers are industry leaders in the Global 2000, ISVs, OEMs, government laboratories, and research institutions that leverage computationally complex and data-intensive applications to enable innovation and outperform competitors. Rogue Wave is a Battery Ventures portfolio company. For more information, visit www.roguewave.com.

For additional information, contact:
Jessica Fishman
Rogue Wave Software


Copyright © 2009 Marketwired. All rights reserved. All the news releases provided by Marketwired are copyrighted. Any forms of copying other than an individual user's personal reference without express written permission is prohibited. Further distribution of these materials is strictly forbidden, including but not limited to, posting, emailing, faxing, archiving in a public database, redistributing via a computer network or in a printed form.
