{"id":612,"date":"2020-11-24T08:04:40","date_gmt":"2020-11-24T08:04:40","guid":{"rendered":"https:\/\/machine-learning.webcloning.com\/2020\/11\/24\/customizing-your-machine-translation-using-amazon-translate-active-custom-translation\/"},"modified":"2020-11-24T08:04:40","modified_gmt":"2020-11-24T08:04:40","slug":"customizing-your-machine-translation-using-amazon-translate-active-custom-translation","status":"publish","type":"post","link":"https:\/\/salarydistribution.com\/machine-learning\/2020\/11\/24\/customizing-your-machine-translation-using-amazon-translate-active-custom-translation\/","title":{"rendered":"Customizing your machine translation using Amazon Translate Active Custom Translation"},"content":{"rendered":"<div id=\"\">\n<p>When translating the English phrase \u201cHow are you?\u201d to Spanish, would you prefer to use \u201c\u00bfC\u00f3mo est\u00e1s?\u201d or \u201c\u00bfC\u00f3mo est\u00e1 usted?\u201d instead?<\/p>\n<p><a href=\"https:\/\/aws.amazon.com\/translate\/\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon Translate<\/a> is a neural machine translation service that delivers fast, high-quality, and affordable language translation. Today, we\u2019re excited to introduce Active Custom Translation (ACT), a feature that gives you more control over your machine translation output. You can now influence what machine translation output you would like to get between \u201c\u00bfC\u00f3mo est\u00e1s?\u201d or \u201c\u00bfC\u00f3mo est\u00e1 usted?\u201d. To make ACT work, simply provide your translation examples in TMX, TSV, or CSV format to create parallel data (PD), and Amazon Translate uses your PD along with your <a href=\"https:\/\/docs.aws.amazon.com\/translate\/latest\/dg\/async.html\" target=\"_blank\" rel=\"noopener noreferrer\">batch translation job<\/a> to customize the translation output at runtime. If you have PD that shows \u201cHow are you?\u201d being translated to \u201c\u00bfC\u00f3mo est\u00e1 usted?\u201d, ACT knows to customize the translation to \u201c\u00bfC\u00f3mo est\u00e1 usted?\u201d.<\/p>\n<p>Today, professional translators use examples of previous translations to provide more customized translations for customers. Similar to profession translators, Amazon Translate can now provide customized translations by learning from your translation examples.<\/p>\n<p>Traditionally, this customization was done by creating a custom translation model\u00ad\u2014a specific-purpose translation engine built using customer data. Building custom translation models is complex, tedious, and expensive. It requires special expertise to prepare the data for training, testing, and validation. Then you build, deploy, and maintain the model by updating the model frequently. To save on model training and management costs, you may choose to delay updating your custom translation model, which means your models are always stale\u2014negatively affecting your custom translation experience. In spite of all this work, these custom models perform well when the translation job is within the domain of your data. However, they tend to perform worse than a generic model when the translation job is outside of the domain of your customization data.<\/p>\n<p>Amazon Translate ACT introduces an innovative way of providing customized translation output on the fly with your parallel data, without building a custom translation model. ACT output quality is always up to date with your PD. ACT provides the best translations for jobs both within the domain and outside the domain of PD. For example, if a source sentence isn\u2019t in the domain of the PD, the translation output is still as good as the generic translation with no significant deterioration in translation quality. You no longer need to go through the tedious process of building and retraining custom translation models for each incoming use case. Just update the PD, and the ACT output automatically adapts to the most recent PD, without needing any retraining.<\/p>\n<p>\u201cInnovation is in our DNA. Our customers look to AWS to lead in customization of machine translation. Current custom translation technology is inefficient, cumbersome, and expensive,\u201d says Marcello Federico, Principal Applied Scientist at Amazon Machine Learning, AWS. \u201cActive Custom Translation allows our customers to focus on the value of their latest data and forget about the lifecycle management of custom translation models. We innovated on behalf of the customer to make custom machine translation easy.\u201d<\/p>\n<h2>Don\u2019t just take our word for it<\/h2>\n<p><a href=\"https:\/\/custom.mt\/\" target=\"_blank\" rel=\"noopener noreferrer\">Custom.MT<\/a> implements machine translation for localization groups and translation companies. Konstantin Dranch, Custom.MT co-founder, shares, \u201cAmazon Translate\u2019s ACT is a breakthrough machine translation setup. A manual engine retraining takes 15\u201316 work hours, that\u2019s why most language teams in the industry update their engines only once a month or once a quarter. With ACT, retraining is continuous and engines improve every day based on edits by human translators. Even before the feature was released to the market, we saw tremendous interest from leading software localization teams. With a higher quality of machine translation, enterprise teams can save millions of USD in manual translations and improve other KPIs, such as international user engagement and time to market.\u201d<\/p>\n<p><a href=\"https:\/\/www.welocalize.com\/\" target=\"_blank\" rel=\"noopener noreferrer\">Welocalize<\/a> is a leading global localization and translation company. Senior Manager of AI Deployments at Welocalize Alex Yanishevsky says, \u201cWelocalize produces high-quality translations, so our customers can transform their content and data to grow globally and expand into international markets. Active Custom Translation from Amazon Translate allows us to customize our translations at runtime and provides us with significant flexibility in our production cycles. In addition, we see great business value and engine quality improvement since we can retrain engines frequently without incurring additional hosting or training charges.\u201d<\/p>\n<p><a href=\"https:\/\/www.onehourtranslation.com\/\" target=\"_blank\" rel=\"noopener noreferrer\">One Hour Translation<\/a> is a leading professional language services provider. Yair Tal, CEO of One Hour Translation, says, \u201cThe customer demand for customized Neural Machine Translation (NMT) is growing every month because of the cost savings. As one of the first to try Amazon Translate ACT, we have found that ACT provides the best translation output for many language pairs. With ACT, training and maintenance is simple and the Translate API integrates with our system seamlessly. Translate\u2019s pay-as-you-translate pricing helps our clients, both big and small, get translation output that is tailored for their needs without paying to train custom models.\u201d<\/p>\n<h2>Building an Active Custom Translation job<\/h2>\n<p>Active Custom Translation\u2019s capabilities are built right into the Amazon Translate experience. In this post, we walk you through the step-by-step process of using your data and getting a customized machine translated output securely. ACT is now available on batch translation, so first familiarize yourself with <a href=\"https:\/\/aws.amazon.com\/blogs\/machine-learning\/translating-documents-spreadsheets-and-presentations-in-office-open-xml-format-using-amazon-translate\/\" target=\"_blank\" rel=\"noopener noreferrer\">how to create a batch translation<\/a> job.<\/p>\n<p>You need data to customize your translation for terms or phrases that are unique to a specific domain, such as life sciences, law, or finance. You bring in examples of high-quality translations (source sentence and translated target sentence) in your preferred domain as a file in <a href=\"https:\/\/en.wikipedia.org\/wiki\/Translation_management_system\" target=\"_blank\" rel=\"noopener noreferrer\">TMX<\/a>, <a href=\"https:\/\/en.wikipedia.org\/wiki\/Tab-separated_values\" target=\"_blank\" rel=\"noopener noreferrer\">TSV<\/a>, or <a href=\"https:\/\/en.wikipedia.org\/wiki\/Comma-separated_values\" target=\"_blank\" rel=\"noopener noreferrer\">CSV<\/a> format. This data should also be <a href=\"https:\/\/en.wikipedia.org\/wiki\/UTF-8\" target=\"_blank\" rel=\"noopener noreferrer\">UTF-8<\/a> encoded. You use this data to create a PD. Amazon Translate uses this PD to customize your machine translation. Each PD can be up to 1 GB large. You can upload up to 1,000 PD per account per Region. The 1,000 parallel data limit can be increased upon request. You get free storage for parallel data for up to 200 GB. You pay the local <a href=\"http:\/\/aws.amazon.com\/s3\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon Simple Storage Service<\/a> (Amazon S3) rate for excess data stored.<\/p>\n<p>For our use case, I have my data in TSV format, and the name of my file is <code>Mydata.tsv<\/code>. I first upload this file to an S3 location (for this post, I store my data in <code>s3:\/\/input-s3bucket\/Paralleldata\/<\/code>).<\/p>\n<p>The following table summarizes the contents of the file.<\/p>\n<table border=\"1px\" width=\"0\" cellpadding=\"5px\">\n<tbody>\n<tr>\n<td width=\"282\"><strong>en<\/strong><\/td>\n<td width=\"294\"><strong>es<\/strong><\/td>\n<\/tr>\n<tr>\n<td width=\"282\">Amazon Translate is a neural machine translation service.<\/td>\n<td width=\"294\">Amazon Translate es un servicio de traducci\u00f3n autom\u00e1tica basado en redes neuronales.<\/td>\n<\/tr>\n<tr>\n<td width=\"282\">Neural machine translation is a form of language translation automation that uses deep learning models.<\/td>\n<td width=\"294\">La traducci\u00f3n autom\u00e1tica neuronal es una forma de automatizar la traducci\u00f3n de lenguajes utilizando modelos de aprendizaje profundo.<\/td>\n<\/tr>\n<tr>\n<td width=\"282\">How are you?<\/td>\n<td width=\"294\">\u00bfC\u00f3mo est\u00e1 usted?<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>We run this example in the US West (Oregon) Region, <code>us-west-2<\/code>.<\/p>\n<h3>CreateParallelData<\/h3>\n<p>Calling the <code>CreateParallelData<\/code> API creates a PD resource record in our database and asynchronously starts a workflow for processing the PD file and ingesting it into our service.<\/p>\n<h4>CLI<\/h4>\n<p>The following CLI commands are formatted for Unix, Linux, and macOS. For Windows, replace the backslash () Unix continuation character at the end of each line with a caret (^).<\/p>\n<p>Run the following CLI command:<\/p>\n<div class=\"hide-language\">\n<pre><code class=\"lang-bash\">aws translate create-parallel-data \r\n--name ${PARALLEL_DATA_NAME}\r\n--parallel-data-config S3Uri=${S3_URI},Format=${FORMAT} \r\n--region ${REGION}<\/code><\/pre>\n<\/div>\n<p>I use <code>Mydata.tsv<\/code> to create my PD <code>my-parallel-data-1<\/code>:<\/p>\n<div class=\"hide-language\">\n<pre><code class=\"lang-bash\">aws translate create-parallel-data \r\n--name my-parallel-data-1 \r\n--parallel-data-config S3Uri= s3:\/\/input-s3bucket\/Paralleldata\/Mydata.tsv,Format=TSV \r\n--region us-west-2 <\/code><\/pre>\n<\/div>\n<p>You get a response like the following code:<\/p>\n<div class=\"hide-language\">\n<pre><code class=\"lang-bash\">{\r\n    \"Name\": \"my-parallel-data-1\",\r\n    \"Status\": \"CREATING\"\r\n}<\/code><\/pre>\n<\/div>\n<p>This means that your PD is being created now.<\/p>\n<p>Run <code>aws translate create-parallel-data<\/code> help for more information.<\/p>\n<h4>Console<\/h4>\n<p>To use the Amazon Translate console, complete the following steps:<\/p>\n<ol>\n<li>On the Amazon Translate console, under <strong>Customization<\/strong>, choose <strong>Parallel data<\/strong>.<\/li>\n<li>Choose <strong>Create parallel data<\/strong>.<\/li>\n<\/ol>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-18792\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2020\/11\/23\/Customizing-your-machine-1.jpg\" alt=\"\" width=\"800\" height=\"316\"><\/p>\n<ol start=\"3\">\n<li>For <strong>Name<\/strong>, insert <code>my-parallel-data-1<\/code>.<\/li>\n<li>For <strong>Parallel data location in S3<\/strong>, enter your S3 location (for this post, <code>s3:\/\/input-s3bucket\/Paralleldata\/Mydata.tsv<\/code>).<\/li>\n<li>For <strong>File format<\/strong>\u00b8 you can choose CSV, TSV, or TMX. For this post, we choose <strong>Tab-separated values (.tsv)<\/strong>.<\/li>\n<\/ol>\n<p>Your data is always secure with Amazon Translate. It\u2019s encrypted using an AWS owned <a href=\"https:\/\/docs.aws.amazon.com\/kms\/latest\/developerguide\/overview.html\" target=\"_blank\" rel=\"noopener noreferrer\">encryption key<\/a> by default. You can encrypt it using a key from your current account or use a key from a different account.<\/p>\n<ol start=\"6\">\n<li>For this post, for <strong>Encryption key<\/strong>, we select <strong>Use AWS owned key<\/strong>.<\/li>\n<li>Choose <strong>Create parallel data<\/strong>.<\/li>\n<\/ol>\n<p><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2020\/11\/23\/Customizing-your-machine-2.jpg\" alt=\"\" width=\"800\" height=\"467\"><\/p>\n<h3>ListParallelData<\/h3>\n<p>Calling the <code>ListParallelData<\/code> API returns a list of PD that exists and their details (it doesn\u2019t include a pre-signed Amazon S3 URL for downloading the data)<\/p>\n<h4>CLI<\/h4>\n<p>Run the following CLI command:<\/p>\n<div class=\"hide-language\">\n<pre><code class=\"lang-bash\">aws translate list-parallel-data \r\n--region us-west-2<\/code><\/pre>\n<\/div>\n<p>You get a response like the following code:<\/p>\n<div class=\"hide-language\">\n<pre><code class=\"lang-bash\">{\r\n    \"ParallelDataPropertiesList\": [\r\n        {\r\n            \"Name\": \"my-parallel-data-1\",\r\n            \"Arn\": \"arn:aws:translate:us-west-2:123456789012:parallel-data\/my-parallel-data-1\",\r\n            \"Status\": \"ACTIVE\",\r\n            \"SourceLanguageCode\": \"en\",\r\n            \"TargetLanguageCodes\": [\r\n                \"es\"\r\n            ],\r\n            \"ParallelDataConfig\": {\r\n                \"S3Uri\": \"s3:\/\/input-s3bucket\/Paralleldata\/Mydata.tsv\",\r\n                \"Format\": \"TSV\"\r\n            },\r\n            \"ImportedDataSize\": 532,\r\n            \"ImportedRecordCount\": 3,\r\n            \"FailedRecordCount\": 0,\r\n            \"CreatedAt\": 1234567890.406,\r\n            \"LastUpdatedAt\": 1234567890.675\r\n        }\r\n    ]\r\n}<\/code><\/pre>\n<\/div>\n<p>The <code>\"Status\": \"ACTIVE\"<\/code> means your PD is ready for you to use.<\/p>\n<p>Run <code>aws translate list-parallel-data help<\/code> for more information.<\/p>\n<h4>Console<\/h4>\n<p>This following screenshot shows the result for list-parallel-data on the Amazon Translate console.<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-18794\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2020\/11\/23\/Customizing-your-machine-3.jpg\" alt=\"\" width=\"800\" height=\"322\"><\/p>\n<h3>GetParallelData<\/h3>\n<p>Calling the <code>GetParallelData<\/code> API returns details of the named parallel data and a pre-signed Amazon S3 URL for downloading the data.<\/p>\n<h4>CLI<\/h4>\n<p>Run the following CLI command:<\/p>\n<div class=\"hide-language\">\n<pre><code class=\"lang-bash\">aws translate get-parallel-data \r\n--name ${PARALLEL_DATA_NAME} \r\n--region ${REGION}<\/code><\/pre>\n<\/div>\n<p>For example, my code looks like the following:<\/p>\n<div class=\"hide-language\">\n<pre><code class=\"lang-bash\">aws translate get-parallel-data \r\n--name my-parallel-data-1 \r\n--region us-west-2<\/code><\/pre>\n<\/div>\n<p>You get a response like the following code:<\/p>\n<div class=\"hide-language\">\n<pre><code class=\"lang-bash\">{\r\n    \"ParallelDataProperties\": {\r\n        \"Name\": \"my-parallel-data-1\",\r\n        \"Arn\": \"arn:aws:translate:us-west-2:123456789012:parallel-data\/my-parallel-data-1\",\r\n        \"Status\": \"ACTIVE\",\r\n        \"SourceLanguageCode\": \"en\",\r\n        \"TargetLanguageCodes\": [\r\n            \"es\"\r\n        ],\r\n        \"ParallelDataConfig\": {\r\n            \"S3Uri\": \"s3:\/\/input-s3bucket\/Paralleldata\/Mydata.tsv\",\r\n            \"Format\": \"TSV\"\r\n        },\r\n        \"ImportedDataSize\": 532,\r\n        \"ImportedRecordCount\": 3,\r\n        \"FailedRecordCount\": 0,\r\n        \"CreatedAt\": 1234567890.406,\r\n        \"LastUpdatedAt\": 1234567890.675\r\n    },\r\n    \"DataLocation\": {\r\n        \"RepositoryType\": \"S3\",\r\n        \"Location\": \"xxx\"\r\n    }\r\n}<\/code><\/pre>\n<\/div>\n<p><code>\u201cLocation\u201d<\/code> contains the pre-signed Amazon S3 URL for downloading the data.<\/p>\n<p>Run <code>aws translate get-parallel-data help<\/code> for more information.<\/p>\n<h4>Console<\/h4>\n<p>On the Amazon Translate console, choose one of the PD files on the <strong>Parallel data <\/strong>page.<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-18795\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2020\/11\/23\/Customizing-your-machine-4.jpg\" alt=\"\" width=\"800\" height=\"322\"><\/p>\n<p>You\u2019re directed to another page that includes the detail for this parallel data file. The following screenshot shows the details for <code>get-parallel-data<\/code>.<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-18796\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2020\/11\/23\/Customizing-your-machine-5.jpg\" alt=\"\" width=\"800\" height=\"354\"><\/p>\n<h3>UpdateParallelData<\/h3>\n<p>Calling the <code>UpdateParallelData<\/code> API replaces the old parallel data with the new one.<\/p>\n<h4>CLI<\/h4>\n<p>Run the following CLI command:<\/p>\n<div class=\"hide-language\">\n<pre><code class=\"lang-bash\">aws translate update-parallel-data \r\n--name ${PARALLEL_DATA_NAME}\r\n--parallel-data-config S3Uri=${NEW_S3_URI},Format=${FORMAT} \r\n--region us-west-2<\/code><\/pre>\n<\/div>\n<p>For this post, <code>Mydata1.tsv<\/code> is my new parallel data. My code looks like the following:<\/p>\n<div class=\"hide-language\">\n<pre><code class=\"lang-bash\">aws translate update-parallel-data \r\n--name my-parallel-data-1 \r\n--parallel-data-config S3Uri= s3:\/\/input-s3bucket\/Paralleldata\/Mydata1.tsv,Format=TSV \r\n--region us-west-2<\/code><\/pre>\n<\/div>\n<p>You get a response like the following code:<\/p>\n<div class=\"hide-language\">\n<pre><code class=\"lang-bash\">{\r\n    \"Name\": \"my-parallel-data-1\",\r\n    \"Status\": \"ACTIVE\",\r\n    \"LatestUpdateAttemptStatus\": \"UPDATING\",\r\n    \"LatestUpdateAttemptAt\": 1234567890.844\r\n}<\/code><\/pre>\n<\/div>\n<p>The <code>\"LatestUpdateAttemptStatus\": \"UPDATING\"<\/code> means your parallel data is being updated now.<\/p>\n<p>Wait for a few minutes and run get-parallel-data again. You can see the parallel data get updated, such as in the following code:<\/p>\n<div class=\"hide-language\">\n<pre><code class=\"lang-bash\">{\r\n    \"ParallelDataProperties\": {\r\n            \"Name\": \"my-parallel-data-1\",\r\n            \"Arn\": \"arn:aws:translate:us-west-2:123456789012:parallel-data\/my-parallel-data-1\",\r\n            \"Status\": \"ACTIVE\",\r\n            \"SourceLanguageCode\": \"en\",\r\n            \"TargetLanguageCodes\": [\r\n                \"es\"\r\n            ],\r\n            \"ParallelDataConfig\": {\r\n                \"S3Uri\": \"s3:\/\/input-s3bucket\/Paralleldata\/Mydata1.tsv\",\r\n                \"Format\": \"TSV\"\r\n            },\r\n        ...\r\n    }\r\n}<\/code><\/pre>\n<\/div>\n<p>We can see that the parallel data has been updated from <code>Mydata.tsv<\/code> to <code>Mydata1.tsv<\/code>.<\/p>\n<p>Run <code>aws translate update-parallel-data help<\/code> for more information.<\/p>\n<h4>Console<\/h4>\n<p>On the Amazon Translate console, choose the parallel data file and choose <strong>Update<\/strong>.<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-18797\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2020\/11\/23\/Customizing-your-machine-6.jpg\" alt=\"\" width=\"800\" height=\"354\"><\/p>\n<p>You can replace the new parallel data file with the existing one by specifying the new Amazon S3 URL.<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-18797\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2020\/11\/23\/Customizing-your-machine-6.jpg\" alt=\"\" width=\"800\" height=\"354\"><\/p>\n<h2>Creating your first Active Custom Translation job<\/h2>\n<p>In this section, we discuss the different ways you can create your ACT job.<\/p>\n<h3>StartTextTranslationJob<\/h3>\n<p>Calling the <code>StartTextTranslationJob<\/code> starts a batch translation. When you add parallel data to a batch translation job, you create an ACT job. Amazon Translate customizes your ACT output to match the style, tone, and word choices it finds in your PD. ACT is a premium product, so see <a href=\"https:\/\/aws.amazon.com\/translate\/pricing\/\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon Translate pricing<\/a> for pricing information. You can only specify one parallel data file to use with the text translation job.<\/p>\n<h4>CLI<\/h4>\n<p>Run the following command:<\/p>\n<div class=\"hide-language\">\n<pre><code class=\"lang-bash\">aws translate start-text-translation-job \r\n--input-data-config ContentType=${CONTENT_TYPE},S3Uri=${INPUT_S3_URI} \r\n--output-data-config S3Uri=${OUTPUT_S3_URI} \r\n--data-access-role-arn ${DATA_ACCESS_ROLE}\r\n--source-language-code=${SOURCE_LANGUAGE_CODE} --target-language-codes=${TARGET_LANGUAGE_CODE} \r\n--parallel-data-names ${PARALLEL_DATA_NAME}\r\n--region ${REGION}\r\n--job-name ${JOB_NAME}<\/code><\/pre>\n<\/div>\n<p>For example, my code looks like the following:<\/p>\n<div class=\"hide-language\">\n<pre><code class=\"lang-bash\">aws translate start-text-translation-job \r\n--input-data-config ContentType=application\/vnd.openxmlformats-officedocument.spreadsheetml.sheet,S3Uri= s3:\/\/input-s3bucket\/inputfile\/ \r\n--output-data-config S3Uri= s3:\/\/output-s3bucket\/Output\/ \r\n--data-access-role-arn arn:aws:iam::123456789012:role\/TranslateBatchAPI \r\n--source-language-code=en --target-language-codes=es \r\n--parallel-data-names my-parallel-data-1 \r\n--region us-west-2 \r\n--job-name ACT1<\/code><\/pre>\n<\/div>\n<p>You get a response like the following code:<\/p>\n<div class=\"hide-language\">\n<pre><code class=\"lang-bash\">{\r\n    \"JobId\": \"4446f95f20c88a4b347449d3671fbe3d\",\r\n    \"JobStatus\": \"SUBMITTED\"\r\n}<\/code><\/pre>\n<\/div>\n<p>This output means the job has been submitted successfully.<\/p>\n<p>Run <code>aws translate start-text-translation-job<\/code> help for more information.<\/p>\n<h4>Console<\/h4>\n<p>For instructions on running a batch translation job on the Amazon Translate console, see <a href=\"https:\/\/aws.amazon.com\/blogs\/machine-learning\/translating-documents-spreadsheets-and-presentations-in-office-open-xml-format-using-amazon-translate\/\" target=\"_blank\" rel=\"noopener noreferrer\">Translating documents, spreadsheets, and presentations in Office Open XML format using Amazon Translate<\/a>. Choose <code>my-parallel-data-1<\/code> as the parallel data to create your first ACT job, ACT1.<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2020\/11\/23\/Customizing-your-machine-8.jpg\" alt=\"\" width=\"597\" height=\"829\"><\/p>\n<p>Congratulations! You have created your first ACT job. ACT is available in the following Regions:<\/p>\n<ul>\n<li>US East (Northern Virginia)<\/li>\n<li>US West (Oregon)<\/li>\n<li>Europe (Ireland)<\/li>\n<\/ul>\n<h2>Running your Active Custom Translation job<\/h2>\n<p>ACT works on asynchronous batch translation for language pairs that have English as either the source or target language.<\/p>\n<p>Now, let\u2019s try to translate the following text from English to Spanish and see how ACT helps to customize the output:<\/p>\n<p><strong>\u201cHow are you?\u201d is one of the most common questions you\u2019ll get asked when meeting someone. The most common response is \u201cgood\u201d<\/strong><\/p>\n<p>The following is the output you get when you translate without any customization:<\/p>\n<p><strong>\u201c\u00bfC\u00f3mo est\u00e1s?\u201d es una de las preguntas m\u00e1s comunes que se le har\u00e1n cuando conozca a alguien. La respuesta m\u00e1s com\u00fan es \u201cBuena\u201d<\/strong><\/p>\n<p>The following is the output you get when you translate using ACT with my-parallel-data-1 as the PD:<\/p>\n<p><strong>\u201c\u00bfC\u00f3mo est\u00e1 usted?\u201d es una de las preguntas m\u00e1s comunes que te har\u00e1n cuando te re\u00fanas con alguien. La respuesta m\u00e1s com\u00fan es \u201cBuena\u201d<\/strong><\/p>\n<h2>Conclusion<\/h2>\n<p>Amazon Translate ACT introduces a powerful way of providing personalized translation output with the following benefits:<\/p>\n<ul>\n<li>You don\u2019t have to build a custom translation model<\/li>\n<li>You only pay for what you translate using ACT<\/li>\n<li>There is no additional model building or model hosting cost<\/li>\n<li>Your data is always secure and always under your control<\/li>\n<li>You get the best machine translation even when your source text is outside the domain of your parallel data<\/li>\n<li>You can update your parallel data as often as you need for no additional cost<\/li>\n<\/ul>\n<p>Try ACT today. Bring your parallel data and start customizing your machine translation output. For more information about Amazon Translate ACT, see <a href=\"https:\/\/docs.aws.amazon.com\/translate\/latest\/dg\/async.html\" target=\"_blank\" rel=\"noopener noreferrer\">Asynchronous Batch Processing<\/a>.<\/p>\n<h2>Related resources<\/h2>\n<p>For additional resources, see the following:<\/p>\n<p>\u00a0<\/p>\n<hr>\n<h3>About the Authors<\/h3>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-11440 alignleft\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2020\/03\/20\/watson-srivathsan-100.jpg\" alt=\"\" width=\"100\" height=\"132\"><strong>Watson G. Srivathsan<\/strong>\u00a0is the Sr. Product Manager for Amazon Translate, AWS\u2019s natural language processing service. On weekends you will find him exploring the outdoors in the Pacific Northwest.<\/p>\n<p>\u00a0<\/p>\n<p>\u00a0<\/p>\n<p>\u00a0<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-18805 alignleft\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2020\/11\/23\/Xingyao-Wang.jpg\" alt=\"\" width=\"100\" height=\"133\"><strong>Xingyao Wang<\/strong> is the Software Develop Engineer for Amazon Translate, AWS\u2019s natural language processing service. She likes to hang out with her cats at home.<\/p>\n<\/p><\/div>\n","protected":false},"excerpt":{"rendered":"<p>https:\/\/aws.amazon.com\/blogs\/machine-learning\/customizing-your-machine-translation-using-amazon-translate-active-custom-translation\/<\/p>\n","protected":false},"author":0,"featured_media":613,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[3],"tags":[],"_links":{"self":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/posts\/612"}],"collection":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/comments?post=612"}],"version-history":[{"count":0,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/posts\/612\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/media\/613"}],"wp:attachment":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/media?parent=612"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/categories?post=612"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/tags?post=612"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}