{"id":1984,"date":"2022-03-22T15:41:23","date_gmt":"2022-03-22T15:41:23","guid":{"rendered":"https:\/\/salarydistribution.com\/machine-learning\/2022\/03\/22\/nvidia-maxine-reinvents-real-time-communication-with-ai\/"},"modified":"2022-03-22T15:41:23","modified_gmt":"2022-03-22T15:41:23","slug":"nvidia-maxine-reinvents-real-time-communication-with-ai","status":"publish","type":"post","link":"https:\/\/salarydistribution.com\/machine-learning\/2022\/03\/22\/nvidia-maxine-reinvents-real-time-communication-with-ai\/","title":{"rendered":"NVIDIA Maxine Reinvents Real-Time Communication With AI"},"content":{"rendered":"<div data-url=\"https:\/\/blogs.nvidia.com\/blog\/2022\/03\/22\/maxine-reinvents-communication-ai\/\" data-title=\"NVIDIA Maxine Reinvents Real-Time Communication With AI\" data-hashtags=\"\">\n<p>Everyone wants to be heard. And with more people than ever in video calls or live streaming from their home offices, rich audio free from echo hiccups and background noises like barking dogs is key to better sounding online experiences.<\/p>\n<p><a href=\"https:\/\/developer.nvidia.com\/maxine\">NVIDIA Maxine<\/a> offers GPU-accelerated, AI-enabled software development kits to help developers build scalable, low-latency audio and video effects pipelines that improve call quality and user experience.<\/p>\n<p>Today, NVIDIA announced at <a href=\"https:\/\/www.nvidia.com\/gtc\/\">GTC<\/a> that Maxine is adding acoustic echo cancellation and AI-based upsampling for better sound quality.<\/p>\n<p>Acoustic Echo Cancellation eliminates acoustic echo from the audio stream in real time, preserving speech quality even during double-talk. With AI-based technology, Maxine achieves more effective echo cancellation than that achieved via traditional digital signal processing algorithms.<\/p>\n<p>Audio Super Resolution improves the quality of a low-bandwidth audio signal by restoring the energy lost in higher frequency bands using AI-based techniques. Maxine Audio Super Resolution supports upsampling the audio\u00a0 from 8 kHz (narrowband) to 16 kHz (wideband), from 16 kHz to 48 kHz (ultra-wideband) and from 8 kHz to 48 kHz. Lower sampling rates such as 8 kHz often result in muffled voices and emphasize artifacts such as sibilance and make the speech difficult to understand.<\/p>\n<p>Modern film and television studios often use 48 kHz (or higher) sampling rate for recording audio, in order to maintain fidelity of the original signal and preserve clarity. Audio Super Resolution can help restore the fidelity of old audio recordings, derived from magnetic tapes or other low bandwidth media.<\/p>\n<h2><b>Bridging the Sound Gap\u00a0<\/b><\/h2>\n<p>Most modern telecommunication takes place using wideband or ultra-wideband audio. Since NVIDIA Audio Super Resolution can upsample and restore the narrowband audio in real-time, the technology can effectively be used to bridge the quality gap between traditional copper wire phone lines and modern VoIP-based wideband communication systems.<\/p>\n<p>Real-time communication \u2014 whether for conference calls, call centers or live streaming of all kinds \u2014 is taking a big leap forward with Maxine.<\/p>\n<p>Since its initial release, Maxine has been adopted by many of the world\u2019s leading providers for video communications, content creation and live streaming.<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/blogs.nvidia.com\/wp-content\/uploads\/2022\/03\/maxine.jpg\" alt=\"\" width=\"624\" height=\"52\"><\/p>\n<p>The worldwide market for video conferencing is expected to increase to nearly $13 billion in 2028, up from about $6.3 billion in 2021, according to Fortune Business Insights.<\/p>\n<h2><b>WFH: A Way of Life\u00a0<\/b><\/h2>\n<p>The move to work from home, or WFH, has become an accepted norm across companies, and organizations are adapting to the new expectations.<\/p>\n<p>Analyst firm Gartner estimates that only a quarter of meetings for enterprises will be in person in 2024, a decline from 60 percent pre-pandemic.<\/p>\n<p>Virtual collaboration in the U.S. has played an important role as people have taken on hybrid and remote positions in the past two years amid the pandemic.<\/p>\n<p>But as organizations seek to maintain company culture and workplace experience, the stakes have risen for higher-quality media interactivity.<\/p>\n<h2><b>Solving the Cocktail Party Problem\u00a0\u00a0\u00a0\u00a0<\/b><\/h2>\n<p>But sometimes work and home life collide. As a result, meetings are often filled with background noises from kids, construction work outside or emergency vehicle sirens, causing brief interruptions in the flow of conference calls.<\/p>\n<p>Maxine helps solve an age-old audio issue known as the <a href=\"https:\/\/blogs.nvidia.com\/blog\/2018\/08\/28\/music-youtube-cocktail-party-problem-ai-artificial-intelligence-deep-learning\/\">cocktail party problem<\/a>. With AI, it can filter out unwanted background noises, allowing users to be better heard, whether they\u2019re in a home office or on the road.<\/p>\n<p>The Maxine GPU-accelerated platform provides an end-to-end deep learning pipeline that integrates with customizable state-of-the-art models, enabling high-quality features with a standard microphone and camera.<\/p>\n<h2><b>Sound Like Your Best Self<\/b><\/h2>\n<p>In addition to being impacted by background noise, audio quality in virtual activities can sometimes sound thin, missing low- and mid-level frequencies, or even be barely audible.<\/p>\n<p>Maxine enables upsampling of audio in real time so that voices sound fuller, deeper and more audible.<\/p>\n<h2><b>Logitech: Better Audio for Headsets and Blue Yeti Microphones<\/b><\/h2>\n<p>Logitech, a leading maker of peripherals, is implementing Maxine for better interactions with its popular headsets and microphones.<\/p>\n<p>Tapping into AI libraries, Logitech has integrated Maxine directly inside G Hub audio drivers to enhance communications with its devices without the need for additional software. Maxine takes advantage of the powerful Tensor Cores in NVIDIA RTX GPUs so consumers can enjoy real-time processing of their mic signal.<\/p>\n<p>Logitech is now utilizing Maxine\u2019s state-of-the-art denoising in its G Hub software. That has allowed it to remove echoes and background noises \u2014 such as fans, as well as keyboard and mouse clicks \u2014 that can distract from video conferences or live-streaming sessions.<\/p>\n<p>\u201cNVIDIA Maxine makes it fast and easy for Logitech G gamers to clean up their mic signal and eliminate unwanted background noises in a single click.\u201d said Ujesh Desai, GM of Logitech G. \u201cYou can even use G HUB to test your mic signal to make sure you have your Maxine settings dialed in.\u201d<\/p>\n<p>Logitech is now taking advantage of Maxine\u2019s state-of-the-art denoising in its G Hub software. That has allowed it to remove echoes and background noises \u2014 such as fans, as well as keyboard and mouse clicks \u2014 that can distract from video conferences or live-streaming sessions.<\/p>\n<p>\u201cNVIDIA Maxine makes it fast and easy for users to clean up their mic signal and eliminate unwanted background noises in a single click,\u201d said Ujesh Desai, vice president at Logitech. \u201cYou can even test your mic signal to find the perfect settings for your setup.\u201d<\/p>\n<h2><b>Tencent Cloud Boosts Content Creators<\/b><\/h2>\n<p>Tencent Cloud is helping content creators with their productions by offering technology from NVIDIA Maxine that makes it quick and easy to add creative backgrounds.<\/p>\n<p>NVIDIA Maxine\u2019s AI Green Screen feature enables users to create a more immersive presence with high-quality foreground and background separation \u2014 without the need for a traditional green screen. Once the real background is separated, it can easily be replaced with a virtual background, or blurred to create a depth-of-field effect. Tencent Cloud is offering this new capability as a software-as-a-service package for content creators.<\/p>\n<p>NVIDIA Maxine\u2019s AI Green Screen technology helps content creators with their productions by enabling more immersive high quality experiences, without the need for specialized equipment and lighting\u201d said Director of the Product Center, Vulture Li at Tencent Cloud audio and video platform.<\/p>\n<h2><b>Making Virtual Experiences Better<\/b><\/h2>\n<p>NVIDIA Maxine provides state-of-the-art real-time AI audio, video and augmented reality features that can be built into customizable, end-to-end deep learning pipelines.<\/p>\n<p>The AI-powered SDKs from Maxine help developers to create applications that include audio and image denoising, super resolution, gaze correction, 3D body pose estimation and translation features.<\/p>\n<p>Maxine also enables real-time voice-to-text translation for a growing number of languages. At GTC, NVIDIA demonstrated Maxine translating between English, French, German and Spanish.<\/p>\n<p>These effects will allow millions of people to enjoy high-quality and engaging live-streaming video across any device.<\/p>\n<\/p>\n<p>\u00a0<\/p>\n<p><i>Join us at <\/i><a href=\"https:\/\/www.nvidia.com\/gtc\/\"><i>GTC<\/i><\/a><i> this week to learn more about Maxine in the following session:<\/i><\/p>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>https:\/\/blogs.nvidia.com\/blog\/2022\/03\/22\/maxine-reinvents-communication-ai\/<\/p>\n","protected":false},"author":0,"featured_media":1985,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[3],"tags":[],"_links":{"self":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/posts\/1984"}],"collection":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/comments?post=1984"}],"version-history":[{"count":0,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/posts\/1984\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/media\/1985"}],"wp:attachment":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/media?parent=1984"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/categories?post=1984"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/tags?post=1984"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}