{"id":1199,"date":"2021-11-16T08:32:37","date_gmt":"2021-11-16T08:32:37","guid":{"rendered":"https:\/\/salarydistribution.com\/machine-learning\/2021\/11\/16\/worlds-fastest-supercomputers-changing-fast\/"},"modified":"2021-11-16T08:32:37","modified_gmt":"2021-11-16T08:32:37","slug":"worlds-fastest-supercomputers-changing-fast","status":"publish","type":"post","link":"https:\/\/salarydistribution.com\/machine-learning\/2021\/11\/16\/worlds-fastest-supercomputers-changing-fast\/","title":{"rendered":"World\u2019s Fastest Supercomputers Changing Fast"},"content":{"rendered":"<div data-url=\"https:\/\/blogs.nvidia.com\/blog\/2021\/11\/15\/cloud-computing-ai-supercomputers\/\" data-title=\"World\u2019s Fastest Supercomputers Changing Fast\" data-hashtags=\"\">\n<p>Modern computing workloads \u2014 including scientific simulations, visualization, data analytics, and machine learning \u2014 are pushing supercomputing centers, cloud providers and enterprises to rethink their computing architecture.<\/p>\n<p>The processor or the network or the software optimizations alone can\u2019t address the latest needs of researchers, engineers and data scientists. Instead, the data center is the new unit of computing, and organizations have to look at the full technology stack.<\/p>\n<p>The latest rankings of the world\u2019s most powerful systems show continued momentum for this full-stack approach in the latest generation of supercomputers.<\/p>\n<p>NVIDIA technologies accelerate over 70 percent, or 355, of the systems on the <a href=\"https:\/\/top500.org\/lists\/top500\/2021\/06\">TOP500 list<\/a> released at the SC21 high performance computing conference this week, including over 90 percent of all new systems. That\u2019s up from 342 systems, or 68 percent, of the machines on <a href=\"https:\/\/blogs.nvidia.com\/blog\/2021\/06\/28\/top500-ai-cloud-native\/\">the TOP500 list released in June<\/a>.<\/p>\n<p>NVIDIA also continues to have a strong presence on the Green500 list of the most energy-efficient systems, powering 23 of the top 25 systems on the list, unchanged from June. On average, NVIDIA GPU-powered systems deliver 3.5x higher power efficiency than non-GPU systems on the list.<\/p>\n<p>Highlighting the emergence of a new generation of cloud-native systems, <a href=\"https:\/\/azure.microsoft.com\/en-us\/blog\/microsoft-expands-its-aisupercomputer-lineup-with-general-availability-of-the-latest-80gb-nvidia-a100-gpus-in-azure-claims\/\">Microsoft\u2019s GPU-accelerated Azure supercomputer<\/a> ranked 10th on the list, the first top 10 showing for a cloud-based system.<\/p>\n<p>AI is revolutionizing scientific computing.\u00a0 The number of research papers leveraging HPC and machine learning has skyrocketed in recent years; growing from roughly 600 ML + HPC papers submitted in 2018 to nearly 5,000 in 2020.<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/blogs.nvidia.com\/wp-content\/uploads\/2021\/11\/Picture2.png\" alt=\"\" width=\"400\" height=\"308\"><\/p>\n<p>The ongoing convergence of HPC and AI workloads is also underscored by new benchmarks such as HPL-AI and MLPerf HPC.<\/p>\n<p>HPL-AI is an emerging benchmark of converged HPC and AI workloads that uses mixed-precision math \u2014 the basis of deep learning and many scientific and commercial jobs \u2014 while still delivering the full accuracy of double-precision math, which is the standard\u00a0 measuring stick for traditional HPC benchmarks.<\/p>\n<p>And MLPerf HPC addresses a style of computing that speeds and augments simulations on supercomputers with AI, with the benchmark measuring performance on three key workloads for HPC centers: astrophysics (Cosmoflow), weather (Deepcam) and molecular dynamics (Opencatalyst).<\/p>\n<p>NVIDIA addresses the full stack with GPU-accelerated processing, smart networking, GPU-optimized applications, and libraries that support the convergence of AI and HPC. This approach has supercharged workloads and enabled scientific breakthroughs.<\/p>\n<p>Let\u2019s look more closely at how NVIDIA is supercharging supercomputers.<\/p>\n<h2><b>Accelerated Computing<\/b><\/h2>\n<p>The combined power of the GPU\u2019s parallel processing capabilities and over 2,500 GPU-optimized applications allows users to speed up their HPC jobs, in many cases from weeks to hours.<\/p>\n<p>We\u2019re constantly optimizing the <a href=\"https:\/\/www.nvidia.com\/en-us\/technologies\/cuda-x\/\">CUDA-X libraries<\/a> and the <a href=\"https:\/\/www.nvidia.com\/en-us\/gpu-accelerated-applications\/\">GPU-accelerated applications<\/a>, so it\u2019s not unusual for users to see an x-factor performance gain on the same GPU architecture.<\/p>\n<p>As a result, the performance of the most widely used scientific applications \u2014 which we call the \u201cgolden suite\u201d \u2014 has improved 16x over the past six years, with more advances on the way.<\/p>\n<figure id=\"attachment_54145\" aria-describedby=\"caption-attachment-54145\" class=\"wp-caption alignleft\"><a href=\"https:\/\/blogs.nvidia.com\/wp-content\/uploads\/2021\/11\/Picture3.png\"><\/p>\n<p><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/blogs.nvidia.com\/wp-content\/uploads\/2021\/11\/Picture3.png\" alt=\"\" width=\"310\" height=\"453\"><\/p>\n<p><\/a><figcaption id=\"caption-attachment-54145\" class=\"wp-caption-text\">16x performance on top HPC, AI and ML apps from full-stack innovation.**<\/figcaption><\/figure>\n<p>And to help users quickly take advantage of higher performance, we offer the latest versions of the AI and HPC software through containers from the <a href=\"https:\/\/ngc.nvidia.com\/\">NGC catalog<\/a>. Users simply pull and run the application on their supercomputer, in the data center or the cloud.<\/p>\n<h2><b>Convergence of HPC and AI\u00a0<\/b><\/h2>\n<p>The infusion of AI in HPC helps researchers speed up their simulations while achieving the accuracy they\u2019d get with the traditional simulation approach.<\/p>\n<p>That\u2019s why an increasing number of researchers are taking advantage of AI to speed up their discoveries.<\/p>\n<p>That includes four of the finalists for this year\u2019s <a href=\"https:\/\/blogs.nvidia.com\/blog\/2021\/11\/15\/ai-gordon-bell-sc21\/\">Gordon Bell prize<\/a>, the most prestigious award in supercomputing. Organizations are racing to build exascale AI computers to support this new model, which combines HPC and AI.<\/p>\n<p>That strength is underscored by relatively new benchmarks, such as HPL-AI and MLPerf HPC, highlighting the ongoing convergence of HPC and AI workloads.<\/p>\n<p>To fuel this trend, last week NVIDIA announced a broad range of advanced <a href=\"https:\/\/blogs.nvidia.com\/blog\/2021\/11\/09\/new-accelerated-computing-libraries\/\">new libraries and software development kits<\/a> for HPC.<\/p>\n<p>Graphs \u2014 a key data structure in modern data science \u2014 can now be projected into deep-neural network frameworks with Deep Graph Library, or DGL, a new Python package.<\/p>\n<p><a href=\"https:\/\/blogs.nvidia.com\/blog\/2021\/11\/09\/modulus-framework\/\">NVIDIA Modulus<\/a> builds and trains physics-informed machine learning models that can learn and obey the laws of physics.<\/p>\n<p>And NVIDIA introduced three new libraries:<\/p>\n<ul>\n<li><a href=\"https:\/\/blogs.nvidia.com\/blog\/2021\/11\/09\/reopt-ai-software-supply-chain\/\">ReOpt<\/a> \u2013 to increase operational efficiency for the $10 trillion logistics industry.<\/li>\n<li><a href=\"https:\/\/blogs.nvidia.com\/blog\/2021\/11\/09\/cuquantum-world-record\/\">cuQuantum<\/a> \u2013 to accelerate quantum computing research.<\/li>\n<li><a href=\"https:\/\/developer.nvidia.com\/cunumeric\">cuNumeric<\/a> \u2013 to accelerate NumPy for scientists, data scientists, and machine learning and AI researchers in the Python community.<\/li>\n<\/ul>\n<p>Weaving it all together is <a href=\"https:\/\/www.nvidia.com\/en-us\/omniverse\/\">NVIDIA Omniverse<\/a> \u2014 the company\u2019s virtual world simulation and collaboration platform for 3D workflows.<\/p>\n<p>Omniverse is used to simulate digital twins of warehouses, plants and factories, of physical and biological systems, of the <a href=\"https:\/\/www.nvidia.com\/en-us\/edge-computing\/5g\/\">5G edge<\/a>, <a href=\"https:\/\/www.nvidia.com\/en-us\/autonomous-machines\/robotics\/\">robots<\/a>, <a href=\"https:\/\/www.nvidia.com\/en-us\/self-driving-cars\/\">self-driving cars<\/a> and even <a href=\"https:\/\/nvidianews.nvidia.com\/news\/nvidia-announces-platform-for-creating-ai-avatars\">avatars<\/a>.<\/p>\n<p>Using Omniverse, NVIDIA announced last week that it will build a supercomputer, called <a href=\"https:\/\/blogs.nvidia.com\/blog\/2021\/11\/12\/earth-2-supercomputer\/\">Earth-2<\/a>, devoted to predicting climate change by creating a digital twin of the planet.<\/p>\n<h2>Cloud-Native Supercomputing<\/h2>\n<p>As supercomputers take on more workloads across data analytics, AI, simulation and visualization, CPUs are stretched to support a growing number of communication tasks needed to operate large and complex systems.<\/p>\n<p><a href=\"https:\/\/blogs.nvidia.com\/blog\/2020\/05\/20\/whats-a-dpu-data-processing-unit\/\">Data processing units<\/a> alleviate this stress by offloading some of these processes.<\/p>\n<p>As a fully integrated data-center-on-a-chip platform, <a href=\"https:\/\/www.nvidia.com\/en-us\/networking\/products\/data-processing-unit\/\">NVIDIA BlueField DPUs<\/a> can offload and manage data center infrastructure tasks instead of making the host processor do the work, enabling stronger security and more efficient orchestration of the supercomputer.<\/p>\n<p>Combined with <a href=\"https:\/\/www.nvidia.com\/en-us\/networking\/infiniband-switching\/\">NVIDIA Quantum InfiniBand platform<\/a>, this architecture delivers optimal bare-metal performance while natively supporting multinode tenant isolation.<\/p>\n<figure id=\"attachment_54148\" aria-describedby=\"caption-attachment-54148\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/blogs.nvidia.com\/wp-content\/uploads\/2021\/11\/Picture4.png\"><\/p>\n<p><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/blogs.nvidia.com\/wp-content\/uploads\/2021\/11\/Picture4.png\" alt=\"\" width=\"800\" height=\"306\"><\/p>\n<p><\/a><figcaption id=\"caption-attachment-54148\" class=\"wp-caption-text\">NVIDIA\u2019s Quantum InfiniBand platform provides predictive, bare-metal performance isolation.<\/figcaption><\/figure>\n<p>Thanks to a <a href=\"https:\/\/blogs.nvidia.com\/blog\/2021\/11\/09\/zero-trust-cybersecurity\/\">zero-trust approach<\/a>, these new systems are also more secure.<\/p>\n<p>BlueField DPUs isolate applications from infrastructure. <a href=\"https:\/\/developer.nvidia.com\/networking\/doca\">NVIDIA DOCA 1.2<\/a> \u2014 the latest BlueField software platform \u2014 enables next-generation distributed firewalls and wider use of line-rate data encryption. And <a href=\"https:\/\/developer.nvidia.com\/morpheus-cybersecurity\">NVIDIA Morpheus<\/a>, assuming an interloper is already inside the data center, uses deep learning-powered data science to detect intruder activities in real time.<\/p>\n<p>And all of the trends outlined above will be accelerated by new networking technology.<\/p>\n<p><a href=\"https:\/\/nvidianews.nvidia.com\/news\/nvidia-quantum-2-takes-supercomputing-to-new-heights-into-the-cloud\">NVIDIA Quantum-2<\/a>, also announced last week, is a 400Gbps InfiniBand platform and consists of the Quantum-2 switch, the ConnectX-7 NIC, the <a href=\"https:\/\/www.nvidia.com\/content\/dam\/en-zz\/Solutions\/Data-Center\/documents\/datasheet-nvidia-bluefield-3-dpu.pdf\">BlueField-3 DPU<\/a>, as well as new software for the new networking architecture.<\/p>\n<p>NVIDIA Quantum-2 offers the benefits of bare-metal high performance and secure multi-tenancy, allowing the next generation of supercomputers to be secure, cloud-native and better utilized.<\/p>\n<p>\u00a0<\/p>\n<p><em>** Benchmark applications: Amber, Chroma, GROMACS, MILC, NAMD, PyTorch, Quantum Espresso; Random Forest FP32 , TensorFlow, VASP | GPU node: dual-socket CPUs with 4x P100, V100, or A100 GPUs.<\/em><\/p>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>https:\/\/blogs.nvidia.com\/blog\/2021\/11\/15\/cloud-computing-ai-supercomputers\/<\/p>\n","protected":false},"author":0,"featured_media":1200,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[3],"tags":[],"_links":{"self":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/posts\/1199"}],"collection":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/comments?post=1199"}],"version-history":[{"count":0,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/posts\/1199\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/media\/1200"}],"wp:attachment":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/media?parent=1199"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/categories?post=1199"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/tags?post=1199"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}