{"id":442,"date":"2020-10-24T01:02:56","date_gmt":"2020-10-24T01:02:56","guid":{"rendered":"https:\/\/machine-learning.webcloning.com\/2020\/10\/24\/arcanum-makes-hungarian-heritage-accessible-with-amazon-rekognition\/"},"modified":"2020-10-24T01:02:56","modified_gmt":"2020-10-24T01:02:56","slug":"arcanum-makes-hungarian-heritage-accessible-with-amazon-rekognition","status":"publish","type":"post","link":"https:\/\/salarydistribution.com\/machine-learning\/2020\/10\/24\/arcanum-makes-hungarian-heritage-accessible-with-amazon-rekognition\/","title":{"rendered":"Arcanum makes Hungarian heritage accessible with Amazon Rekognition"},"content":{"rendered":"<div id=\"\">\n<p><a href=\"https:\/\/www.arcanum.hu\/en\/\" target=\"_blank\" rel=\"noopener noreferrer\">Arcanum<\/a> specializes in digitizing Hungarian language content, including newspapers, books, maps, and art. With over 30 years of experience, Arcanum serves more than 30,000 global subscribers with access to Hungarian culture, history, and heritage.<\/p>\n<p>Amazon Rekognition Solutions Architects worked with Arcanum to add highly scalable image analysis to <a href=\"https:\/\/hungaricana.hu\/en\/\" target=\"_blank\" rel=\"noopener noreferrer\">Hungaricana<\/a>, a free service provided by Arcanum, which enables you to search and explore Hungarian cultural heritage, including 600,000 faces over 500,000 images. For example, you can find <a href=\"https:\/\/www.hungaricana.hu\/en\/search\/results\/?list=eyJxdWVyeSI6ICJNXHUwMGYzciBKXHUwMGYza2FpIn0\" target=\"_blank\" rel=\"noopener noreferrer\">historical works by author M\u00f3r J\u00f3kai<\/a> or <a href=\"https:\/\/gallery.hungaricana.hu\/en\/search\/results\/?list=eyJxdWVyeSI6ICJDSU1LRT0oRXNrXHUwMGZjdlx1MDE1MSkifQ\" target=\"_blank\" rel=\"noopener noreferrer\">photos on topics like weddings<\/a>. The Arcanum team chose <a href=\"https:\/\/aws.amazon.com\/rekognition\/\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon Rekognition<\/a> to free valuable staff from time and cost-intensive manual labeling, and improved label accuracy to make 200,000 previously unsearchable images (approximately 40% of image inventory), available to users.<\/p>\n<p>Amazon Rekognition makes it easy to add image and video analysis to your applications using highly scalable machine learning (ML) technology that requires no previous ML expertise to use. Amazon Rekognition also provides highly accurate <a href=\"https:\/\/docs.aws.amazon.com\/rekognition\/latest\/dg\/faces.html\" target=\"_blank\" rel=\"noopener noreferrer\">facial recognition<\/a> and <a href=\"https:\/\/docs.aws.amazon.com\/rekognition\/latest\/dg\/collections.html\" target=\"_blank\" rel=\"noopener noreferrer\">facial search<\/a> capabilities to detect, analyze, and compare faces.<\/p>\n<p>Arcanum uses this facial recognition feature in their image database services to help you find particular people in Arcanum\u2019s articles. This post discusses their challenges and why they chose Amazon Rekognition as their solution.<\/p>\n<h2>Automated image labeling challenges<\/h2>\n<p>Arcanum dedicated a team of three people to start tagging and labeling content for Hungaricana. The team quickly learned that they would need to invest more than 3 months of time-consuming and repetitive human labor to provide accurate search capabilities to their customers. Considering the size of the team and scope of the existing project, Arcanum needed a better solution that would automate image and object labelling at scale.<\/p>\n<h2>Automated image labeling solutions<\/h2>\n<p>To speed up and automate image labeling, Arcanum turned to Amazon Rekognition to enable users to search photos by keywords (for example, type of historic event, place name, or a person relevant to Hungarian history).<\/p>\n<p>For the Hungaricana project, preprocessing all the images was challenging. Arcanum ran a TensorFlow face search across all 28 million pages on a machine with 8 GPUs in their own offices to extract only faces from images.<\/p>\n<p>The following screenshot shows what an extract looks like (image provided by Arcanum Database Ltd).<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"alignnone wp-image-17394 size-full\" title=\"Extract\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2020\/10\/22\/1-Arcanum.jpg\" alt=\"\" width=\"900\" height=\"508\"><\/p>\n<p>The images containing only faces are sent to Amazon Rekognition, invoking the <a href=\"https:\/\/boto3.amazonaws.com\/v1\/documentation\/api\/latest\/reference\/services\/rekognition.html#Rekognition.Client.index_faces\" target=\"_blank\" rel=\"noopener noreferrer\">IndexFaces<\/a> operation to <a href=\"https:\/\/docs.aws.amazon.com\/rekognition\/latest\/dg\/add-faces-to-collection-procedure.html\" target=\"_blank\" rel=\"noopener noreferrer\">add a face to the collection<\/a>. For each face that is detected in the specified face collection, Amazon Rekognition extracts facial features into a feature vector and stores it in an <a href=\"https:\/\/aws.amazon.com\/rds\/aurora\/\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon Aurora<\/a> database. Amazon Rekognition uses feature vectors when it performs face match and search operations using the <a href=\"https:\/\/boto3.amazonaws.com\/v1\/documentation\/api\/latest\/reference\/services\/rekognition.html#Rekognition.Client.search_faces\" target=\"_blank\" rel=\"noopener noreferrer\">SearchFaces<\/a> and <a href=\"https:\/\/boto3.amazonaws.com\/v1\/documentation\/api\/latest\/reference\/services\/rekognition.html#Rekognition.Client.search_faces_by_image\" target=\"_blank\" rel=\"noopener noreferrer\">SearchFacesByImage<\/a> operations.<\/p>\n<p>The image preprocessing helped create a very efficient and cost-effective way to index faces. The following diagram summarizes the preprocessing workflow.<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"alignnone wp-image-17395 size-full\" title=\"Preprocessing workflow\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2020\/10\/22\/2-Preprocessing.jpg\" alt=\"\" width=\"900\" height=\"452\"><\/p>\n<p>As for the web application, the workflow starts with a Hungaricana user making a face search request. The following diagram illustrates the application workflow.<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"alignnone wp-image-17396 size-full\" title=\"Application workflow\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2020\/10\/22\/3-Application.jpg\" alt=\"\" width=\"900\" height=\"580\"><\/p>\n<p>The workflow includes the following steps:<\/p>\n<ol>\n<li>The user requests a facial match by uploading the image. The web request is automatically distributed by the <a href=\"https:\/\/aws.amazon.com\/elasticloadbalancing\" target=\"_blank\" rel=\"noopener noreferrer\">Elastic Load Balancer<\/a> to the webserver fleet.<\/li>\n<li>\n<a href=\"https:\/\/aws.amazon.com\/ec2\/\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon Elastic Compute Cloud<\/a> (Amazon EC2) powers application servers that handle the user request.<\/li>\n<li>The uploaded image is stored in <a href=\"http:\/\/aws.amazon.com\/s3\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon Simple Storage Service<\/a> (Amazon S3).<\/li>\n<li>Amazon Rekognition indexes the face and runs <a href=\"https:\/\/boto3.amazonaws.com\/v1\/documentation\/api\/latest\/reference\/services\/rekognition.html#Rekognition.Client.search_faces\" target=\"_blank\" rel=\"noopener noreferrer\">SearchFaces<\/a> to look for a face similar to the new face ID.<\/li>\n<li>The output of the search face by image operation is stored in <a href=\"https:\/\/aws.amazon.com\/elasticache\/\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon ElastiCache<\/a>, a fully managed in-memory data store.<\/li>\n<li>The metadata of the indexed faces are stored in an Aurora relational database built for the cloud.<\/li>\n<li>The resulting face thumbnails are served to the customer via the fast content-delivery network (CDN) service <a href=\"https:\/\/aws.amazon.com\/cloudfront\/\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon CloudFront<\/a>.<\/li>\n<\/ol>\n<h2>Experimenting and live testing Hungaricana<\/h2>\n<p>During our test of Hungaricana, the application performed extremely well. The searches not only correctly identified people, but also provided links to all publications and sources in Arcanum\u2019s privately owned database where found faces are present. For example, the following screenshot shows the result of the famous composer and pianist <a href=\"https:\/\/en.wikipedia.org\/wiki\/Franz_Liszt\" target=\"_blank\" rel=\"noopener noreferrer\">Franz Liszt<\/a>.<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"alignnone wp-image-17397 size-full\" title=\"Lizt face search\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2020\/10\/22\/4-FaceSearch.jpg\" alt=\"\" width=\"900\" height=\"687\"><\/p>\n<p>The application provided 42 pages of 6\u00d74 results. The results are capped to 1,000. The 100% scores are the confidence scores returned by Amazon Rekognition and are rounded up to whole numbers.<\/p>\n<p>The application of Hungaricana has always promptly, and with a high degree of certainty, presented results and links to all corresponding publications.<\/p>\n<h2>Business results<\/h2>\n<p>By introducing Amazon Rekognition into their workflow, Arcanum enabled a better customer experience, including building family trees, searching for historical figures, and researching historical places and events.<\/p>\n<p>The concept of face searching using artificial intelligence certainly isn\u2019t new. But Hungaricana uses it in a very creative, unique way.<\/p>\n<p>Amazon Rekognition allowed Arcanum to realize three distinct advantages:<\/p>\n<ul>\n<li>\n<strong>Time savings<\/strong> \u2013 The time to market speed increased dramatically. Now, instead of spending several months of intense manual labor to label all the images, the company can do this job in a few days. Before, basic labeling on 150,000 images took months for three people to complete.<\/li>\n<li>\n<strong>Cost savings <\/strong>\u2013 Arcanum saved around $15,000 on the Hungaricana project. Before using Amazon Rekognition, there was no automation, so a human workforce had to scan all the images. Now, employees can shift their focus to other high-value tasks.<\/li>\n<li>\n<strong>Improved accuracy <\/strong>\u2013 Users now have a much better experience regarding hit rates. Since Arcanum started using Amazon Rekognition, the number of hits has doubled. Before, out of 500,000 images, about 200,000 weren\u2019t searchable. But with Amazon Rekognition, search is now possible for all 500,000 images.<\/li>\n<\/ul>\n<p><em>\u00a0<\/em>\u201cAmazon Rekognition made Hungarian culture, history, and heritage more accessible to the world,\u201d says El\u0151d Biszak, Arcanum CEO. \u201cIt has made research a lot easier for customers building family trees, searching for historical figures, and researching historical places and events. We cannot wait to see what the future of artificial intelligence has to offer to enrich our content further.\u201d<\/p>\n<h2>Conclusion<\/h2>\n<p>In this post, you learned how to add highly scalable face and image analysis to an enterprise-level image gallery to improve label accuracy, reduce costs, and save time.<\/p>\n<p>You can test Amazon Rekognition features such as facial analysis, face comparison, or celebrity recognition on images specific to your use case on the <a href=\"https:\/\/console.aws.amazon.com\/rekognition\/home\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon Rekognition console<\/a>.<\/p>\n<p>For video presentations and tutorials, see <a href=\"https:\/\/aws.amazon.com\/rekognition\/getting-started\/\" target=\"_blank\" rel=\"noopener noreferrer\">Getting Started with Amazon Rekognition<\/a>. For more information about Amazon Rekognition, see <a href=\"https:\/\/docs.aws.amazon.com\/rekognition\/index.html\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon Rekognition Documentation<\/a>.<\/p>\n<p>\u00a0<\/p>\n<hr>\n<h3>About the Authors<\/h3>\n<p><strong><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-17398 alignleft\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2020\/10\/22\/Mikasino.jpg\" alt=\"\" width=\"100\" height=\"137\"> Sini\u0161a Mika\u0161inovi\u0107<\/strong> is a Senior Solutions Architect at AWS Luxembourg, covering Central and Eastern Europe\u2014a region full of opportunities, talented and innovative developers, ISVs, and startups. He helps customers adopt AWS services as well as acquire new skills, learn best practices, and succeed globally with the power of AWS. His areas of expertise are Game Tech and Microsoft on AWS. Sini\u0161a is a PowerShell enthusiast, a gamer, and a father of a small and very loud boy. He flies under the flags of Croatia and Serbia.<\/p>\n<p>\u00a0<\/p>\n<p>\u00a0<\/p>\n<p>\u00a0<\/p>\n<p><strong><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-17399 alignleft\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2020\/10\/22\/Peroc.jpg\" alt=\"\" width=\"101\" height=\"136\"><\/strong><strong>Cameron Peron<\/strong> is Senior Marketing Manager for AWS Amazon Rekognition and the AWS AI\/ML community. He evangelizes how AI\/ML innovation solves complex challenges facing community, enterprise, and startups alike. Out of the office, he enjoys staying active with kettlebell-sport, spending time with his family and friends, and is an avid fan of Euro-league basketball.<\/p>\n<\/p><\/div>\n","protected":false},"excerpt":{"rendered":"<p>https:\/\/aws.amazon.com\/blogs\/machine-learning\/arcanum-makes-hungarian-heritage-accessible-with-amazon-rekognition\/<\/p>\n","protected":false},"author":0,"featured_media":443,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[3],"tags":[],"_links":{"self":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/posts\/442"}],"collection":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/comments?post=442"}],"version-history":[{"count":0,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/posts\/442\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/media\/443"}],"wp:attachment":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/media?parent=442"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/categories?post=442"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/tags?post=442"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}