{"id":3105,"date":"2023-08-08T18:50:44","date_gmt":"2023-08-08T18:50:44","guid":{"rendered":"https:\/\/salarydistribution.com\/machine-learning\/2023\/08\/08\/startup-pens-generative-ai-success-story-with-nvidia-nemo\/"},"modified":"2023-08-08T18:50:44","modified_gmt":"2023-08-08T18:50:44","slug":"startup-pens-generative-ai-success-story-with-nvidia-nemo","status":"publish","type":"post","link":"https:\/\/salarydistribution.com\/machine-learning\/2023\/08\/08\/startup-pens-generative-ai-success-story-with-nvidia-nemo\/","title":{"rendered":"Startup Pens Generative AI Success Story With NVIDIA NeMo"},"content":{"rendered":"<div data-url=\"https:\/\/blogs.nvidia.com\/blog\/2023\/08\/08\/writer-nemo-generative-ai\/\" data-title=\"Startup Pens Generative AI Success Story With NVIDIA NeMo\" data-hashtags=\"\">\n<p>Machine learning helped Waseem Alshikh plow through textbooks in college. Now he\u2019s putting <a href=\"https:\/\/www.nvidia.com\/en-us\/glossary\/data-science\/generative-ai\/\">generative AI<\/a> to work, creating content for hundreds of companies.<\/p>\n<p>Born and raised in Syria, Alshikh spoke no English, but he was fluent in software, a talent that served him well when he arrived at college in Lebanon.<\/p>\n<p>\u201cThe first day they gave me a stack of textbooks, each one a thousand pages thick, and all of it in English,\u201d he recalled.<\/p>\n<p>So, he wrote a program \u2014 a crude but effective statistical classifier that summarized the books \u2014 then he studied the summaries.<\/p>\n<h2><b>From Concept to Company<\/b><\/h2>\n<p>In 2014, he shared his story with May Habib, an entrepreneur he met while working in Dubai. They agreed to create a startup that could help marketing departments \u2014 which are always pressured to do more with less \u2014 use machine learning to quickly create copy for their web pages, blogs, ads and more.<\/p>\n<p>\u201cInitially, the tech was not there, until <a href=\"https:\/\/blogs.nvidia.com\/blog\/2022\/03\/25\/what-is-a-transformer-model\/\">transformer models<\/a> were announced \u2014 that was something we could build on,\u201d said Alshikh, the startup\u2019s CTO.<\/p>\n<figure id=\"attachment_65955\" aria-describedby=\"caption-attachment-65955\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/blogs.nvidia.com\/wp-content\/uploads\/2023\/08\/Waseem-and-May.jpg\"><\/p>\n<p><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/blogs.nvidia.com\/wp-content\/uploads\/2023\/08\/Waseem-and-May-654x500.jpg\" alt=\"Picture of cofounders of of gen AI startup Writer\" width=\"654\" height=\"500\"><\/p>\n<p><\/a><figcaption id=\"caption-attachment-65955\" class=\"wp-caption-text\">Writer co-founders Habib, CEO, and Alshikh, CTO.<\/figcaption><\/figure>\n<p>\u201cWe found a few engineers and spent almost six months building our first model, a neural network that barely worked and had about 128 million parameters,\u201d an often-used measure of an AI model\u2019s capability.<\/p>\n<p>Along the way, the young company won some business, changed its name to <a href=\"https:\/\/writer.com\/\">Writer<\/a> and connected with NVIDIA.<\/p>\n<h2><b>A Startup Accelerated<\/b><\/h2>\n<p>\u201cOnce we got introduced to <a href=\"https:\/\/developer.nvidia.com\/nemo\">NVIDIA NeMo<\/a>, we were able to build industrial-strength models with three, then 20 and now 40 billion parameters, and we\u2019re still scaling,\u201d he said.<\/p>\n<p>NeMo is an application framework that helps companies curate their training datasets, build and customize large language models (<a href=\"https:\/\/blogs.nvidia.com\/blog\/2023\/01\/26\/what-are-large-language-models-used-for\/\">LLMs<\/a>), and run them in production at scale. Organizations everywhere from <a href=\"https:\/\/blogs.nvidia.com\/blog\/2022\/09\/20\/kt-large-language-models\/\">Korea<\/a> to <a href=\"https:\/\/blogs.nvidia.com\/blog\/2022\/06\/19\/ai-sweden-nlp\/\">Sweden<\/a> are using it to customize LLMs for their local languages and industries.<\/p>\n<p>\u201cBefore NeMo, it took us four and a half months to build a new billion-parameter model. Now we can do it in 16 days \u2014 this is mind blowing,\u201d Alshikh said.<\/p>\n<h2><b>Models Make Opportunities<\/b><\/h2>\n<p>In the first six months of this year, the startup\u2019s team of fewer than 20 AI engineers used NeMo to develop 10 models, each with 30 billion parameters or more.<\/p>\n<p>That translates into big opportunities. Hundreds of businesses now use Writer\u2019s models that NeMo customized for finance, healthcare, retail and other vertical markets.<\/p>\n<figure id=\"attachment_65958\" aria-describedby=\"caption-attachment-65958\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/blogs.nvidia.com\/wp-content\/uploads\/2023\/08\/Writer-event-recap.jpg\"><\/p>\n<p><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/blogs.nvidia.com\/wp-content\/uploads\/2023\/08\/Writer-event-recap-672x326.jpg\" alt=\"Writer's Recap tool generates event summaries automatically.\" width=\"672\" height=\"326\"><\/p>\n<p><\/a><figcaption id=\"caption-attachment-65958\" class=\"wp-caption-text\">Writer\u2019s Recap tool creates written summaries from audio recordings of an interview or event.<\/figcaption><\/figure>\n<p>The startup\u2019s customer list includes household names like Deloitte, L\u2019Oreal, Intuit, Uber and many Fortune 500 companies.<\/p>\n<p>Writer\u2019s success with NeMo is just the start of the story. Dozens of other companies have already downloaded NeMo.<\/p>\n<p>The software will be available soon for anyone to use. It\u2019s part of <a href=\"https:\/\/www.nvidia.com\/en-us\/data-center\/products\/ai-enterprise\/\">NVIDIA AI Enterprise<\/a>, full-stack software optimized to accelerate generative AI workloads and backed by enterprise-grade support, security and application programming interface stability.<\/p>\n<figure id=\"attachment_65961\" aria-describedby=\"caption-attachment-65961\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/blogs.nvidia.com\/wp-content\/uploads\/2023\/08\/Writer-full-stack.jpg\"><\/p>\n<p><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/blogs.nvidia.com\/wp-content\/uploads\/2023\/08\/Writer-full-stack-672x317.jpg\" alt=\"Writer's full-stack AI platform includes NVIDIA NeMo\" width=\"672\" height=\"317\"><\/p>\n<p><\/a><figcaption id=\"caption-attachment-65961\" class=\"wp-caption-text\">Writer offers a full-stack platform for enterprise users.<\/figcaption><\/figure>\n<h2><b>A Trillion API Calls a Month<\/b><\/h2>\n<p>Some customers run Writer\u2019s models on their own systems or cloud services. Others ask Writer to host the models, or they use Writer\u2019s API.<\/p>\n<p>\u201cOur cloud infrastructure, managed basically by two people, hosts a trillion API calls a month \u2014 we\u2019re generating 90,000 words a second,\u201d Alshikh said. \u201cWe\u2019re delivering high-quality models that compete with products from companies with larger teams and bigger budgets.\u201d<\/p>\n<figure id=\"attachment_65964\" aria-describedby=\"caption-attachment-65964\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/blogs.nvidia.com\/wp-content\/uploads\/2023\/08\/NeMo-chart-final-scaled.jpg\"><\/p>\n<p><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/blogs.nvidia.com\/wp-content\/uploads\/2023\/08\/NeMo-chart-final-672x212.jpg\" alt=\"Chart describing NVIDIA NeMo\" width=\"672\" height=\"212\"><\/p>\n<p><\/a><figcaption id=\"caption-attachment-65964\" class=\"wp-caption-text\">NVIDIA NeMo supports an end-to-end flow for generative AI from data curation to inference.<\/figcaption><\/figure>\n<p>Writer uses the <a href=\"https:\/\/developer.nvidia.com\/triton-inference-server\">Triton Inference Server<\/a> that\u2019s packaged with NeMo to run models in production for its customers. Alshikh reports that Triton, <a href=\"https:\/\/blogs.nvidia.com\/blog\/2022\/10\/05\/ai-large-language-models-triton\/\">used by many companies running LLMs<\/a>, enables lower latency and greater throughput than alternative programs.<\/p>\n<p>\u201cThis means you can run a service for $20,000, instead of $100,000, so we can invest more in building meaningful features,\u201d he said.<\/p>\n<h2><b>A Wide Horizon<\/b><\/h2>\n<p>Writer is also a member of <a href=\"https:\/\/www.nvidia.com\/en-us\/startups\/\">NVIDIA Inception<\/a>, a program that nurtures cutting-edge startups. \u201cThanks to Inception, we got early access to NeMo and some amazing people who guided us through the process of finding and using the tools we need,\u201d he said.<\/p>\n<p>Now that Writer\u2019s text products are getting traction, Alshikh, who splits his time between homes in Florida and California, is searching the horizon for what\u2019s next. In today\u2019s broad frontier of generative AI, he sees opportunities in images, audio, video, 3D \u2014 maybe all of the above.<\/p>\n<p>\u201cWe see multimodality as the future,\u201d he said.<\/p>\n<p>Check out <a href=\"https:\/\/developer.nvidia.com\/nemo\">this page<\/a> to get started with NeMo. And learn about the early access program for multimodal NeMo <a href=\"https:\/\/developer.nvidia.com\/nemo-framework-open-beta\">here<\/a>.<\/p>\n<p>And if you enjoyed this story, let folks on social networks know using the following, a summary suggested by Writer:<\/p>\n<p>\u201cLearn how startup Writer uses NVIDIA NeMo software to generate content for hundreds of companies and rack up impressive revenues with a small staff and budget.\u201d<\/p>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>https:\/\/blogs.nvidia.com\/blog\/2023\/08\/08\/writer-nemo-generative-ai\/<\/p>\n","protected":false},"author":0,"featured_media":3106,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[3],"tags":[],"_links":{"self":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/posts\/3105"}],"collection":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/comments?post=3105"}],"version-history":[{"count":0,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/posts\/3105\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/media\/3106"}],"wp:attachment":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/media?parent=3105"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/categories?post=3105"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/tags?post=3105"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}