{"id":1343,"date":"2021-12-11T01:24:07","date_gmt":"2021-12-11T01:24:07","guid":{"rendered":"https:\/\/salarydistribution.com\/machine-learning\/2021\/12\/11\/live-transcriptions-of-f1-races-using-amazon-transcribe\/"},"modified":"2021-12-11T01:24:07","modified_gmt":"2021-12-11T01:24:07","slug":"live-transcriptions-of-f1-races-using-amazon-transcribe","status":"publish","type":"post","link":"https:\/\/salarydistribution.com\/machine-learning\/2021\/12\/11\/live-transcriptions-of-f1-races-using-amazon-transcribe\/","title":{"rendered":"Live transcriptions of F1 races using Amazon Transcribe"},"content":{"rendered":"<div id=\"\">\n<p>The Formula 1 (F1) live steaming service, F1 TV, has live automated closed captions in three different languages: English, Spanish, and French.<\/p>\n<p>For the 2021 season, FORMULA 1 has achieved another technological breakthrough, building a fully automated workflow to create closed captions in three languages and broadcasting to 85 territories using <a href=\"https:\/\/aws.amazon.com\/transcribe\/\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon Transcribe<\/a>. Amazon Transcribe is an automatic speech recognition (ASR) service that allows you to generate audio transcription.<\/p>\n<p>In this post, we share how Formula 1 joined forces with the <a href=\"https:\/\/aws.amazon.com\/professional-services\/\" target=\"_blank\" rel=\"noopener noreferrer\">AWS Professional Services<\/a> team to make it happen. We discuss how they used Amazon Transcribe and its custom vocabulary feature as well as custom-built postprocessing logic to improve their live transcription accuracy in three languages.<\/p>\n<h2>The challenge<\/h2>\n<p>For F1, everything is about extreme speed: with pit stops as short as 2 seconds, speeds of up to 375 KPH (233 MPH), and 5g forces on drivers under braking and through corners. In this fast-paced and dynamic environment, milliseconds dictate the difference between pole position or second on the grid. The role of the race commentators is to weave the multitude of parallel events and information into a single exciting narrative. This form of commentary greatly increases the engagement and excitement of viewers.<\/p>\n<p>F1 has a strong affinity to cutting edge technology, and partnered with AWS to build a scalable and sustainable closed caption solution for F1 TV, their Over-the-top (OTT) platform, that can support a growing calendar and language portfolio. F1 now provides real-time live captions in three languages across four series: F1 in British English, US Spanish and French; and F2, F3, and Porsche Supercup in British English and US Spanish. This was achieved using Amazon Transcribe to automatically convert the commentary into subtitles.<\/p>\n<p>This task provides many unique challenges. With the excitement of an F1 race, it\u2019s common to have commentators with differing accents move quickly from one topic to another as the race unfolds. Being a sport steeped in technology, commentators often refer to F1 domain-specific terminology such as DRS (Drag Reduction System), aerodynamic, downforce, or halo (a safety device) for example. Moreover, F1 is a global sport, traveling across the world and drawing drivers from many different countries. Looking only at the 2021 season, 16\/20 drivers had non-English names and 17\/20 had non-Spanish names or non-French names. With the advanced customization features available in Amazon Transcribe, we tailored the underlying language models to recognize domain-specific terms that are rare in general language use, which boosted transcription accuracy.<\/p>\n<p>In the following sections, we take a deep dive into how AWS Professional Services partnered with F1 to build a robust, state-of-the-art, real-time race commentary captioning system by enhancing Amazon Transcribe to understand the particularities of the F1 world. You will learn how to utilize Amazon Transcribe in real-time broadcasts and supercharge live captioning for your use case with custom vocabularies, postprocessing steps, and a human-in-the-loop validation layer.<\/p>\n<h2>Solution overview<\/h2>\n<p>The solution works as a proxy to Amazon Transcribe. Custom vocabularies are passed as parameters to Amazon Transcribe, and the resulting captions are postprocessed. The postprocessed text is then moderated by an F1 moderator before being transformed to captions that are displayed to the viewers. The following diagram shows the sequential process.<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-31660\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2021\/12\/09\/1-4575-Architecture.jpg\" alt=\"\" width=\"800\" height=\"355\"><\/p>\n<h2>Live transcriptions: Understanding use case specific terminology and context<\/h2>\n<p>The output of Automatic Speech Recognition (ASR) systems is highly context-dependent. ASR language models benefit from utilizing the words across a fully spoken sentence. For example, in the following sentence, the system uses the words \u2018WORLD CHAMPIONSHIP\u2019 towards the end of the sentence to inform context and allow \u2018FORMER ONE\u2019 to be correctly transcribed as \u2018FORMULA 1\u2019.<\/p>\n<table border=\"1px\" cellpadding=\"10px\">\n<tbody>\n<tr>\n<td width=\"575\">\n<p>GOOD AFTERNOON EVERYBODY WELCOME ALONG TO ROUND 4 OF THE <strong>FORMER ONE<\/strong><\/p>\n<p><strong>\u00a0<\/strong><\/p>\n<p>GOOD AFTERNOON EVERYBODY WELCOME ALONG TO ROUND 4 OF THE <strong>FORMULA 1 <\/strong>WORLD CHAMPIONSHIP IN 2019<\/p>\n<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Amazon Transcribe supports both <em>batch <\/em>and <em>streaming <\/em>transcription models. In batch transcription, the model issues a transcription using the full context provided in the audio segment. Amazon Transcribe streaming transcription enables you to send an audio stream and receive a transcription stream in real time. Generating subtitles for a live broadcast requires a streaming model because transcriptions should appear on screen shortly after the commentary is spoken. This real-time need presents unique challenges compared to batch transcriptions and often affects the quality of the results because the language model has limited knowledge of the future context.<\/p>\n<p>Amazon Transcribe is pre-trained to capture a wide range of use cases. However, F1 domain-specific terminology, names, and locations aren\u2019t present in the Amazon Transcribe general language model. Getting those words correct is nevertheless crucial for the understanding of the narrative, such as who is leading the race, circuit corners, and technical details.<\/p>\n<p>Amazon Transcribe allows you to develop with custom vocabularies and custom language models to improve transcription accuracy. You can use them separately for streaming transcriptions or together for batch transcriptions.<\/p>\n<p><a href=\"https:\/\/docs.aws.amazon.com\/transcribe\/latest\/dg\/how-vocabulary.html\" target=\"_blank\" rel=\"noopener noreferrer\">Custom vocabularies<\/a> consist of a list of specific words that you want Amazon Transcribe to recognize in the audio input. These are generally domain-specific words and phrases, such as proper nouns. You can inform Amazon Transcribe how to pronounce these terms with information such as SoundsLike (in regular orthography) or the IPA (<a href=\"https:\/\/en.wikipedia.org\/wiki\/International_Phonetic_Alphabet\" target=\"_blank\" rel=\"noopener noreferrer\">International Phonetic Alphabet<\/a>) description of the term. <a href=\"https:\/\/docs.aws.amazon.com\/transcribe\/latest\/dg\/how-it-works.html#table-language-matrix\" target=\"_blank\" rel=\"noopener noreferrer\">Custom vocabularies are available for all languages supported by Amazon Transcribe<\/a>. Custom vocabularies improve the ability of Amazon Transcribe to recognize terms without using the context in which they\u2019re spoken.<\/p>\n<p>The following table shows some examples of a custom vocabulary.<\/p>\n<table border=\"1px\" cellpadding=\"10px\">\n<tbody>\n<tr>\n<td width=\"132\"><strong>Phrase<\/strong><\/td>\n<td width=\"133\"><strong>DisplayAs<\/strong><\/td>\n<td width=\"175\"><strong>SoundsLike<\/strong><\/td>\n<td width=\"134\"><strong>IPA<\/strong><\/td>\n<\/tr>\n<tr>\n<td width=\"132\">Charles-Leclerc<\/td>\n<td width=\"133\">Charles Leclerc<\/td>\n<td width=\"175\"><\/td>\n<td width=\"134\">\u0283 \u0251 \u0279 l l \u0259 k l \u025b \u0279<\/td>\n<\/tr>\n<tr>\n<td width=\"132\">Charles-Leclerc<\/td>\n<td width=\"133\">Charles Leclerc<\/td>\n<td width=\"175\">shal-luh-klurk<\/td>\n<td width=\"134\"><\/td>\n<\/tr>\n<tr>\n<td width=\"132\">Lewis-Hamilton<\/td>\n<td width=\"133\">Lewis Hamilton<\/td>\n<td width=\"175\">loo-is-ha-muhl-tn<\/td>\n<td width=\"134\"><\/td>\n<\/tr>\n<tr>\n<td width=\"132\">Lewis-Hamilton<\/td>\n<td width=\"133\">Lewis Hamilton<\/td>\n<td width=\"175\">loo-uhs-ha-muhl-tn<\/td>\n<td width=\"134\"><\/td>\n<\/tr>\n<tr>\n<td width=\"132\">Ferrari<\/td>\n<td width=\"133\">Ferrari<\/td>\n<td width=\"175\"><\/td>\n<td width=\"134\">f \u025d \u0279 \u0251 \u0279 \u026a<\/td>\n<\/tr>\n<tr>\n<td width=\"132\">Ferrari<\/td>\n<td width=\"133\">Ferrari<\/td>\n<td width=\"175\">fuh-rehr-ee<\/td>\n<td width=\"134\"><\/td>\n<\/tr>\n<tr>\n<td width=\"132\">Mercedes<\/td>\n<td width=\"133\">Mercedes<\/td>\n<td width=\"175\">mer-sey-deez<\/td>\n<td width=\"134\"><\/td>\n<\/tr>\n<tr>\n<td width=\"132\">Mercedes<\/td>\n<td width=\"133\">Mercedes<\/td>\n<td width=\"175\"><\/td>\n<td width=\"134\">m \u025b \u0279 s e\u026a d i z<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>The custom vocabulary includes the following details:<\/p>\n<ul>\n<li><strong>Phrase<\/strong> \u2013 The term that should be recognized.<\/li>\n<li><strong>DisplayAs<\/strong> \u2013 How the word or phrase looks when it\u2019s output. If not declared, the output would be the phrase.<\/li>\n<li><strong>SoundsLike<\/strong> \u2013 The term broken into small pieces with the respective pronunciations in the specified language using standard orthography.<\/li>\n<li><strong>IPA<\/strong> \u2013 The International Phonetic Alphabet representation for the term.<\/li>\n<\/ul>\n<p><a href=\"https:\/\/docs.aws.amazon.com\/transcribe\/latest\/dg\/custom-language-models.html\" target=\"_blank\" rel=\"noopener noreferrer\">Custom language models<\/a> are valuable when there are larger corpuses of text data that can be used to train models. With the additional data, the models learn to predict the probabilities of sequences of words in the domain-specific context. For this project, F1 chose to use custom vocabulary given the unique words and phrases that are unique to F1 racing.<\/p>\n<h2>Postprocessing: the final layer of performance boosting<\/h2>\n<p>Due to the fast-paced nature of F1 commentary with rapidly changing context as well as commentator accents, inaccurate transcriptions may still occur. However, recurring mistakes can be easily fixed using text replacement. For example, \u201cKvyat and Albon\u201d may be misunderstood as \u201ccreate an album\u201d by the British English language model. Because \u201ccreate an album\u201d is an unlikely term to occur in F1 commentaries, we can safely replace them with their assumed real meanings in a postprocessing routine. On top of that, postprocessing terms can be defined as general, or based on location and race series filters. Such selection allows for more specific term replacement, reducing the chance of erroneous replacements with this approach.<\/p>\n<p>For this project, we gathered thousands of replacements for each language using hours of real-life F1 audio commentary that was analyzed by F1 domain specialists. On top of that, during every live event, F1 runs a transcribed commentary through a human-in-the-loop tool (described in the next section), which allows sentence rejection before the subtitles appear on screen. This data is used later to continuously improve the custom vocabulary and postprocessing rules. The following table shows examples of postprocessing rules for English captions. The location filter is a replacement filter based on race location, and the race series filter is based on the race series.<\/p>\n<table border=\"1px\" cellpadding=\"10px\">\n<tbody>\n<tr>\n<td width=\"201\"><strong>Original Term<\/strong><\/td>\n<td width=\"215\"><strong>Replacement<\/strong><\/td>\n<td width=\"85\"><strong>Location Filter<\/strong><\/td>\n<td width=\"103\"><strong>Race Series Filter<\/strong><\/td>\n<\/tr>\n<tr>\n<td width=\"201\">CHARLOTTE CLAIRE<\/td>\n<td width=\"215\">CHARLES LECLERC<\/td>\n<td width=\"85\"><\/td>\n<td width=\"103\">FORMULA 1<\/td>\n<\/tr>\n<tr>\n<td width=\"201\">CREATE AN ALBUM<\/td>\n<td width=\"215\">KVYAT AND ALBON<\/td>\n<td width=\"85\"><\/td>\n<td width=\"103\">FORMULA 1<\/td>\n<\/tr>\n<tr>\n<td width=\"201\">SCHWARTZMAN<\/td>\n<td width=\"215\">SHWARTZMAN<\/td>\n<td width=\"85\"><\/td>\n<td width=\"103\">FORMULA 2<\/td>\n<\/tr>\n<tr>\n<td width=\"201\">CURVE A PARABOLIC<\/td>\n<td width=\"215\">CURVA PARABOLICA<\/td>\n<td width=\"85\">Italy<\/td>\n<td width=\"103\"><\/td>\n<\/tr>\n<tr>\n<td width=\"201\">CIRCUIT THE CATALONIA<\/td>\n<td width=\"215\">CIRCUIT DE CATALUNYA<\/td>\n<td width=\"85\">Spain<\/td>\n<td width=\"103\"><\/td>\n<\/tr>\n<tr>\n<td width=\"201\">TYPE COMPOUNDS<\/td>\n<td width=\"215\">TYRE COMPOUNDS<\/td>\n<td width=\"85\"><\/td>\n<td width=\"103\"><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Another important function of postprocessing is the standardization and formatting of numbers. When generating transcriptions for live broadcasts such as television, it\u2019s a <a href=\"https:\/\/engagemedia.org\/help\/best-practices-for-online-subtitling\/\" target=\"_blank\" rel=\"noopener noreferrer\">best practice<\/a> to use digits when displaying numbers because they\u2019re faster to read and occupy less space on screen. In English, <a href=\"https:\/\/docs.aws.amazon.com\/transcribe\/latest\/dg\/how-numbers.html\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon Transcribe automatically displays numbers<\/a> bigger than 10 as digits, and numbers between 0\u201310 are converted to digits under specific conditions, such as when there are more than one in a row. For example, \u201cthree four five\u201d converts to 345. In an effort to standardize number transcriptions, we digitize all numbers.<\/p>\n<p>As of August 8, 2021, transcriptions only output numbers as digits instead of words for a defined list of languages in both batch and streaming (for more information, see <a href=\"https:\/\/docs.aws.amazon.com\/transcribe\/latest\/dg\/how-numbers.html\" target=\"_blank\" rel=\"noopener noreferrer\">Transcribing numbers and punctuation<\/a>). Notably, this list doesn\u2019t include Spanish (es-US and es-ES) or French (fr-FR and fr-CA). With the postprocessing routine, numbers were also formatted to handle integers, decimals, and ordinals, as well F1-specific lap time formatting.<\/p>\n<p>The following shows an example of number postprocessing for different languages that were built for F1.<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-31661\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2021\/12\/09\/2-4575-Table.jpg\" alt=\"\" width=\"800\" height=\"268\"><\/p>\n<h2>Human in the loop: Continuous improvement and adaptation<\/h2>\n<p>Amazon Transcribe custom vocabularies and postprocessing boost the service\u2019s real-time performance significantly. However, the fast-paced and quickly changing environment remains a challenge for automated transcriptions. It\u2019s better for a person reliant on closed captions to miss out on a phase of commentary, rather than see an incorrect transcription that may be misleading. To this end, F1 employs a human in the loop as a final validation, where a moderator has a number of seconds to verify if a word or an entire sentence should be removed before it\u2019s included in the video stream. Any removed sentences are then used to improve the custom vocabularies and postprocessing step for the next races.<\/p>\n<h2>Evaluation<\/h2>\n<p>Minor grammatical errors don\u2019t greatly decrease the understandability of a sentence. However, using the wrong F1 terminology breaks a sentence. Usually ASR systems are evaluated on <a href=\"https:\/\/aws.amazon.com\/blogs\/machine-learning\/evaluating-an-automatic-speech-recognition-service\/\" target=\"_blank\" rel=\"noopener noreferrer\">word error rate (WER)<\/a>, which quantifies how many insertions, deletions, and substitutions are required to change the predicted sentence to the correct one.<\/p>\n<p>Although WER is important, F1-specific terms are even more crucial. For this, we created an accuracy score that measures the accuracy of people names (such as Charles Leclerc), teams (McLaren), locations (Hungaroring), and other F1 terms (DRS) transcribed in a commentary. These scores allow us to evaluate how understandable the transcriptions are to F1 fans and, combined with WER, allow us to maintain high-quality transcriptions and improvements in Amazon Transcribe.<\/p>\n<h2>Results<\/h2>\n<p>The F1 TV enhanced live transcriptions system was released on March 26, 2021, during the Formula 1 Gulf Air Bahrain Grand Prix. By the first race, the solution had already achieved a strong reduction in WER and F1-specific accuracy improvements for all three languages, compared to the Amazon Transcribe standard model. In the following tables, we highlight the WER and F1 specific accuracy improvements for the different languages. The numbers compare the developed solution using Amazon Transcribe using custom vocabularies and postprocessing with Amazon Transcribe generic model. The lower the WER, the better.<\/p>\n<table border=\"1px\" cellpadding=\"10px\">\n<tbody>\n<tr>\n<td width=\"67\"><strong>\u00a0<\/strong><\/td>\n<td width=\"141\"><strong>Standard Amazon Transcribe WER<\/strong><\/td>\n<td width=\"207\"><strong>Amazon Transcribe with CV and Postprocessing WER<\/strong><\/td>\n<td width=\"146\"><strong>WER Improvement<\/strong><\/td>\n<\/tr>\n<tr>\n<td width=\"67\">English<\/td>\n<td width=\"141\">18.95%<\/td>\n<td width=\"207\">11.37%<\/td>\n<td width=\"146\">39.99%<\/td>\n<\/tr>\n<tr>\n<td width=\"67\">Spanish<\/td>\n<td width=\"141\">25.95%<\/td>\n<td width=\"207\">16.21%<\/td>\n<td width=\"146\">37.16%<\/td>\n<\/tr>\n<tr>\n<td width=\"67\">French<\/td>\n<td width=\"141\">37.40%<\/td>\n<td width=\"207\">16.80%<\/td>\n<td width=\"146\">55.08%<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<table border=\"1px\" cellpadding=\"10px\">\n<tbody>\n<tr>\n<td width=\"69\"><\/td>\n<td width=\"111\"><strong>Accuracy Group<\/strong><\/td>\n<td width=\"161\"><strong>Standard Amazon Transcribe Accuracy<\/strong><\/td>\n<td width=\"227\"><strong>Amazon Transcribe with CV and Postprocessing Accuracy<\/strong><\/td>\n<td width=\"113\"><strong>Accuracy Improvement<\/strong><\/td>\n<\/tr>\n<tr>\n<td rowspan=\"4\" width=\"69\"><strong>English<\/strong><\/td>\n<td width=\"111\">People Names<\/td>\n<td width=\"161\">40.17%<\/td>\n<td width=\"227\">92.25%<\/td>\n<td width=\"113\">129.68%<\/td>\n<\/tr>\n<tr>\n<td width=\"111\">Teams<\/td>\n<td width=\"161\">56.33%<\/td>\n<td width=\"227\">95.28%<\/td>\n<td width=\"113\">69.15%<\/td>\n<\/tr>\n<tr>\n<td width=\"111\">Locations<\/td>\n<td width=\"161\">61.82%<\/td>\n<td width=\"227\">94.33%<\/td>\n<td width=\"113\">52.59%<\/td>\n<\/tr>\n<tr>\n<td width=\"111\">Other F1 terms<\/td>\n<td width=\"161\">81.47%<\/td>\n<td width=\"227\">90.89%<\/td>\n<td width=\"113\">11.55%<\/td>\n<\/tr>\n<tr>\n<td rowspan=\"4\" width=\"69\"><strong>Spanish<\/strong><\/td>\n<td width=\"111\">People Names<\/td>\n<td width=\"161\">45.31%<\/td>\n<td width=\"227\">95.43%<\/td>\n<td width=\"113\">110.62%<\/td>\n<\/tr>\n<tr>\n<td width=\"111\">Teams<\/td>\n<td width=\"161\">39.40%<\/td>\n<td width=\"227\">95.46%<\/td>\n<td width=\"113\">142.28%<\/td>\n<\/tr>\n<tr>\n<td width=\"111\">Locations<\/td>\n<td width=\"161\">58.32%<\/td>\n<td width=\"227\">87.58%<\/td>\n<td width=\"113\">50.17%<\/td>\n<\/tr>\n<tr>\n<td width=\"111\">Other F1 terms<\/td>\n<td width=\"161\">63.87%<\/td>\n<td width=\"227\">85.25%<\/td>\n<td width=\"113\">33.47%<\/td>\n<\/tr>\n<tr>\n<td rowspan=\"4\" width=\"69\"><strong>French<\/strong><\/td>\n<td width=\"111\">People Names<\/td>\n<td width=\"161\">39.12%<\/td>\n<td width=\"227\">92.38%<\/td>\n<td width=\"113\">136.15%<\/td>\n<\/tr>\n<tr>\n<td width=\"111\">Teams<\/td>\n<td width=\"161\">33.20%<\/td>\n<td width=\"227\">90.84%<\/td>\n<td width=\"113\">173.61%<\/td>\n<\/tr>\n<tr>\n<td width=\"111\">Locations<\/td>\n<td width=\"161\">55.34%<\/td>\n<td width=\"227\">89.33%<\/td>\n<td width=\"113\">61.42%<\/td>\n<\/tr>\n<tr>\n<td width=\"111\">Other F1 terms<\/td>\n<td width=\"161\">61.15%<\/td>\n<td width=\"227\">86.77%<\/td>\n<td width=\"113\">41.90%<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Although the approach significantly improves the WER measures, its main influence is seen on F1 names, teams, and locations. Because the F1 specific terms are often in local languages, custom vocabularies, and postprocessing steps can quickly teach Amazon Transcribe to consider those terms and correctly transcribe them. The postprocessing step then further adapts the outcome transcriptions to F1\u2019s domain to provide highly accurate automated transcriptions. In the following examples, we present phrases in English, Spanish, and French where Amazon Transcribe custom vocabularies, postprocessing, and number handling techniques successfully improved the transcription accuracy.<\/p>\n<p>For Spanish, we have the original Amazon Transcribe output \u201cEL PILOTO BRITA\u0301NICO LORIS JAMIL TODOS ESTA\u0301 A DOS SEGUNDOS PUNTO TRES DEL LIDER. COMPLETO\u0301 SU U\u0301LTIMA VUELTA EN UNO VEINTINUEVE DOSCIENTOS TREINTA Y CUATRO\u201d compared to the final transcription \u201cEL PILOTO BRITA\u0301NICO LEWIS HAMILTON ESTA\u0301 A 2.3 s DEL LIDER. COMPLETO\u0301 SU U\u0301LTIMA VUELTA EN 1:29.234.\u201d<\/p>\n<p>The custom vocabulary and postprocessing combination converted \u201cLORIS JAMIL TODOS\u201d to \u201cLEWIS HAMILTON,\u201d and the number handling routine converted the lap time to digits and added the appropriate punctuation (1:29.234).<\/p>\n<p>For English, compare the original output \u201cTHE GERMAN DRIVER THE BASTION BETTER COMPLETED THE LAST LAP IN ONE 15 632\u201d to the final transcription \u201cTHE GERMAN DRIVER SEBASTIAN VETTEL COMPLETED THE LAST LAP IN 1:15.632.\u201d<\/p>\n<p>The custom vocabulary and postprocessing combination converted \u201cTHE BASTION BETTER\u201d to \u201cSEBASTIAN VETTEL.\u201d<\/p>\n<p>In French, we can compare the original output \u201cVICTOIRE POUR LES MISS MILLE TONNE DIX-HUIT POLE CENT TROIS PODIUM QUATRE VICTOIRES ICI\u201d to the final output \u201cVICTOIRE POUR LEWIS HAMILTON 18 POLE 103 PODIUM 4 VICTOIRES ICI.\u201d<\/p>\n<p>The custom vocabulary and postprocessing combination converted \u201cLES MISS MILLE TONNE\u201d to \u201cLEWIS HAMILTON,\u201d and the number handling routine converted the numbers to digits.<\/p>\n<p>The following short video shows live captions in action during the Formula 1 Gulf Air Bahrain Grand Prix 2021.<\/p>\n<h2>Summary<\/h2>\n<p>In this post, we explained how F1 is now able to provide live closed captions on their OTT (Over-The-Top) platform to benefit viewers with accessibility needs and those who want to ensure they do not miss any live commentary.<\/p>\n<p>In collaboration with AWS Professional Services, F1 has set up live transcriptions in English, Spanish, and French by using Amazon Transcribe and applying enhancements to capture domain-specific terminology.<\/p>\n<p>Whether for sport broadcasting, streaming educational content, or conferences and webinars, AWS Professional Services is ready to help your team develop a real-time captioning system that is accurate and customizable by making full use of your domain-specific knowledge and the advanced features of Amazon Transcribe. For more information, see <a href=\"https:\/\/aws.amazon.com\/professional-services\/\" target=\"_blank\" rel=\"noopener noreferrer\">AWS Professional Services<\/a> or reach out through your account manager to get in touch.<\/p>\n<hr>\n<h3>About the Authors<\/h3>\n<p><b data-stringify-type=\"bold\"><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-31712 alignleft\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2021\/12\/10\/Beibit-Baktygaliyev.jpg\" alt=\"\" width=\"100\" height=\"135\"><b data-stringify-type=\"bold\"><b data-stringify-type=\"bold\">Beibit Baktygaliyev <\/b><\/b><\/b>is a Senior Data Scientist with AWS Professional Services. As a technical lead, he helps customers to attain their business goals through innovative technology. In his spare time, Beibit enjoys sports and spending time with his family and friends.<\/p>\n<p><b data-stringify-type=\"bold\"><b data-stringify-type=\"bold\"><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-31711 alignleft\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2021\/12\/10\/Maira-Ladeira-Tanke.jpg\" alt=\"\" width=\"100\" height=\"132\"><\/b>M<\/b><b data-stringify-type=\"bold\"><b data-stringify-type=\"bold\">aira Ladeira Tanke<\/b><\/b>\u00a0is a Data Scientist at AWS Professional Services. She works with customers across industries to help them achieve business outcomes with AI and ML technologies. In her spare time, Maira likes to play with her cat Smila. She also loves to travel and spend time with her family and friends.<\/p>\n<p><b data-stringify-type=\"bold\"><b data-stringify-type=\"bold\"><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-31709 alignleft\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2021\/12\/10\/Sara-Kazdagli.jpg\" alt=\"\" width=\"100\" height=\"131\"><\/b>Sara Kazdagli<\/b> is a Professional Services consultant specialized in Data Analytics and Machine Learning. She helps customers across different industries to build innovative solutions and make data-driven decisions. Sara holds a MSc in Software Engineering and a MSc in Data Science. In her spare time, she like to go on hikes and walks with her Australian shepherd dog Kiba.<\/p>\n<p><b data-stringify-type=\"bold\"><b data-stringify-type=\"bold\"><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-31710 alignleft\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2021\/12\/10\/Pablo-Hermoso-Moreno.jpg\" alt=\"\" width=\"100\" height=\"134\"><\/b>Pablo Hermoso Moreno<\/b> is a Data Scientist in the AWS Professional Services Team. He works with clients across industries\u00a0using Machine Learning to tell stories with data and\u00a0reach more informed engineering decisions faster. Pablo\u2019s background is in Aerospace Engineering and having worked in the motorsport industry he has an interest in bridging physics and domain expertise with ML. In his spare time, he enjoys rowing and playing guitar.<\/p>\n<p>       <!-- '\"` -->\n      <\/div>\n","protected":false},"excerpt":{"rendered":"<p>https:\/\/aws.amazon.com\/blogs\/machine-learning\/live-transcriptions-of-f1-races-using-amazon-transcribe\/<\/p>\n","protected":false},"author":0,"featured_media":1344,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[3],"tags":[],"_links":{"self":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/posts\/1343"}],"collection":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/comments?post=1343"}],"version-history":[{"count":0,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/posts\/1343\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/media\/1344"}],"wp:attachment":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/media?parent=1343"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/categories?post=1343"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/tags?post=1343"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}