{"id":372,"date":"2020-10-08T02:17:51","date_gmt":"2020-10-08T02:17:51","guid":{"rendered":"https:\/\/machine-learning.webcloning.com\/2020\/10\/08\/bada-bing-bada-boom-microsoft-turns-to-turing-nlg-nvidia-gpus-to-instantly-suggest-full-phrase-queries\/"},"modified":"2020-10-08T02:17:51","modified_gmt":"2020-10-08T02:17:51","slug":"bada-bing-bada-boom-microsoft-turns-to-turing-nlg-nvidia-gpus-to-instantly-suggest-full-phrase-queries","status":"publish","type":"post","link":"https:\/\/salarydistribution.com\/machine-learning\/2020\/10\/08\/bada-bing-bada-boom-microsoft-turns-to-turing-nlg-nvidia-gpus-to-instantly-suggest-full-phrase-queries\/","title":{"rendered":"Bada Bing Bada Boom: Microsoft Turns to Turing-NLG, NVIDIA GPUs to Instantly Suggest Full-Phrase Queries"},"content":{"rendered":"<div data-url=\"https:\/\/blogs.nvidia.com\/blog\/2020\/10\/07\/microsoft-turing-nlg\/\" data-title=\"Bada Bing Bada Boom: Microsoft Turns to Turing-NLG, NVIDIA GPUs to Instantly Suggest Full-Phrase Queries\">\n<p>Hate hunting and pecking away at your keyboard every time you have a quick question? You\u2019ll love this.<\/p>\n<p>Microsoft\u2019s <a href=\"http:\/\/www.bing.com\/\">Bing search engine<\/a> has turned to Turing-NLG and NVIDIA GPUs to suggest full sentences for you as you type.<\/p>\n<p>Turing-NLG is a cutting-edge, large-scale unsupervised language model that has achieved strong performance on language modeling benchmarks.<\/p>\n<p>It\u2019s just the latest example of an AI technique called <a href=\"https:\/\/blogs.nvidia.com\/blog\/2018\/08\/02\/supervised-unsupervised-learning\/\">unsupervised learning<\/a>, which makes sense of vast quantities of data by extracting features and patterns without the need for humans to provide any pre-labeled data.<\/p>\n<p>Microsoft calls this Next Phrase Prediction, and it can feel like magic, making full-phrase suggestions in real time for long search queries.<\/p>\n<p>Turing-NLG is among several innovations \u2014 from model compression to state caching and hardware acceleration \u2014 that Bing has harnessed with Next Phrase Prediction.<\/p>\n<p>Over the summer, Microsoft worked with engineers at NVIDIA to optimize Turing-NLG to their needs, accelerating the model on NVIDIA GPUs to power the feature for users worldwide.<\/p>\n<p>A key part of this optimization was to run this massive AI model extremely fast to power real-time search experience. With a combination of hardware and model optimization Microsoft and NVIDIA achieved an average latency below 10 milliseconds.<\/p>\n<p>By contrast, it takes more than 100 milliseconds to blink your eye.<\/p>\n<p>Learn more about the next wave of <a href=\"https:\/\/blogs.bing.com\/search-quality-insights\/september-2020\/Introducing-the-next-wave-of-AI-at-Scale-innovations-in-Bing\">AI innovations at Bing<\/a>.<\/p>\n<p>Before the introduction of Next Phrase Prediction, the approach for handling query suggestions for longer queries was limited to completing the current word being typed by the user.<\/p>\n<p>Now type in \u201cThe best way to replace,\u201d and you\u2019ll immediately see three suggestions for completing the phrase: wood, plastic and metal. Type in \u201chow can I replace a battery for,\u201d and you\u2019ll see \u201ciphone, samsung, ipad and kindle\u201d all suggested.<\/p>\n<p><a href=\"https:\/\/blogs.nvidia.com\/wp-content\/uploads\/2020\/10\/microsoft-2.png\"><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/blogs.nvidia.com\/wp-content\/uploads\/2020\/10\/microsoft-2.png\" alt=\"\" width=\"480\" height=\"96\"><\/a><\/p>\n<p>With Next Phrase Prediction, Bing can now present users with full-phrase suggestions.<\/p>\n<p>The more characters you type, the closer Bing gets to what you probably want to ask.<\/p>\n<p>And because these suggestions are generated instantly, they\u2019re not limited to previously seen data or just the current word being typed.<\/p>\n<p>So, for some queries, Bing won\u2019t just save you a few keystrokes \u2014 but multiple words.<\/p>\n<p>As a result of this work, the coverage of autosuggestion completions increases considerably, Microsoft reports, improving the overall user experience \u201csignificantly.\u201d<\/p>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>http:\/\/feedproxy.google.com\/~r\/nvidiablog\/~3\/afZ7DLCSSGM\/<\/p>\n","protected":false},"author":0,"featured_media":373,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[3],"tags":[],"_links":{"self":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/posts\/372"}],"collection":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/comments?post=372"}],"version-history":[{"count":0,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/posts\/372\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/media\/373"}],"wp:attachment":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/media?parent=372"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/categories?post=372"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/tags?post=372"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}