{"id":1970,"date":"2022-03-15T19:54:49","date_gmt":"2022-03-15T19:54:49","guid":{"rendered":"https:\/\/salarydistribution.com\/machine-learning\/2022\/03\/15\/securely-search-unstructured-data-on-windows-file-systems-with-the-amazon-kendra-connector-for-amazon-fsx-for-windows-file-server\/"},"modified":"2022-03-15T19:54:49","modified_gmt":"2022-03-15T19:54:49","slug":"securely-search-unstructured-data-on-windows-file-systems-with-the-amazon-kendra-connector-for-amazon-fsx-for-windows-file-server","status":"publish","type":"post","link":"https:\/\/salarydistribution.com\/machine-learning\/2022\/03\/15\/securely-search-unstructured-data-on-windows-file-systems-with-the-amazon-kendra-connector-for-amazon-fsx-for-windows-file-server\/","title":{"rendered":"Securely search unstructured data on Windows file systems with the Amazon Kendra connector for Amazon FSx for Windows File Server"},"content":{"rendered":"<div id=\"\">\n<p>Critical information can be scattered across multiple data sources in your organization, including sources such as Windows file systems stored on <a href=\"https:\/\/docs.aws.amazon.com\/fsx\/latest\/WindowsGuide\/what-is.html\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon FSx for Windows File Server<\/a>. 
You can now use the <a href=\"https:\/\/docs.aws.amazon.com\/kendra\/latest\/dg\/data-source-fsx.html\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon Kendra connector for FSx for Windows File Server<\/a> to index documents (HTML, PDF, MS Word, MS PowerPoint, and plain text) stored in your Windows file system on FSx for Windows File Server and search for information across this content using intelligent search in <a href=\"https:\/\/aws.amazon.com\/kendra\/\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon Kendra<\/a>.<\/p>\n<p>Organizations store unstructured data in files on shared Windows file systems and secure it by using Windows Access Control Lists (ACLs) to ensure that users can read, write, and create files as per their access permissions configured in the enterprise Active Directory (AD) domain. Finding specific information from this data not only requires searching through the files, but also ensuring that the user is authorized to access it. The Amazon Kendra connector for FSx for Windows File Server indexes the files stored on FSx for Windows File Server and ingests the ACLs in the Amazon Kendra index, so that the response of a search query made by a user includes results only from those documents that the user is authorized to read.<\/p>\n<p>This post takes the example of a set of documents stored securely on a file system using ACLs on FSx for Windows File Server. These documents are ingested in an Amazon Kendra index by configuring and synchronizing this file system as a data source of the index using the connector for FSx for Windows File Server. Then we demonstrate that when a user makes a search query, the Amazon Kendra index uses the ACLs based on the user name and groups the user belongs to, and returns only those documents the user is authorized to access. 
We also include details of the configuration and screenshots at every stage so you can use this as a reference when configuring the Amazon Kendra connector for FSx for Windows File Server in your setup.<\/p>\n<h2>Prerequisites<\/h2>\n<p>To try out the Amazon Kendra connector for FSx for Windows File Server, you need the following:<\/p>\n<h2>Solution architecture<\/h2>\n<p>The following diagram illustrates the solution architecture:<br \/><a href=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image001.jpg\"><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-33958\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image001.jpg\" alt=\"\" width=\"833\" height=\"438\"><\/a><\/p>\n<p>The documents in this example are stored on a file system (3 in the diagram) on FSx for Windows File Server (4). The files are set up with ACLs based on the user and group configurations in the AD domain created using <a href=\"https:\/\/aws.amazon.com\/directoryservice\/\" target=\"_blank\" rel=\"noopener noreferrer\">AWS Directory Service<\/a> (1) to which FSx for Windows File Server belongs. This file system on FSx for Windows File Server is configured as a data source for Amazon Kendra (5). <a href=\"https:\/\/aws.amazon.com\/single-sign-on\/\" target=\"_blank\" rel=\"noopener noreferrer\">AWS Single Sign On<\/a> (AWS SSO) is enabled with the AD as the identity source, and the Amazon Kendra index is set up to use AWS SSO (2) for user name and group lookup for the user context of the search queries from the customer search solution deployments (6). 
The FSx for Windows File Server file system, AWS Managed Microsoft AD server, the <a href=\"https:\/\/aws.amazon.com\/vpc\/\" target=\"_blank\" rel=\"noopener noreferrer\">Amazon Virtual Private Cloud<\/a> (Amazon VPC) and subnets configured in this example are created using the <a href=\"https:\/\/aws.amazon.com\/quickstart\/architecture\/amazon-fsx-windows-file-server\/\" target=\"_blank\" rel=\"noopener noreferrer\">Quick Start for FSx for Windows File Server<\/a>.<\/p>\n<h2>FSx for Windows File Server configuration<\/h2>\n<p>The following screenshot shows the file system on FSx for Windows File Server configured as a part of an AWS Managed Microsoft AD domain that is used in our example, as seen on the Amazon FSx console.<br \/><a href=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image003-masked.jpg\"><img decoding=\"async\" loading=\"lazy\" class=\"alignnone wp-image-33990 size-full\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image003-masked.jpg\" alt=\"\" width=\"1043\" height=\"801\"><\/a><\/p>\n<h2>AWS Managed Microsoft AD configuration<\/h2>\n<p>The AD to which FSx for Windows File Server belongs is configured as an AWS Managed Microsoft AD, as seen in the following screenshot of the Directory Service console.<br \/><a href=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image005.jpg\"><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-33960\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image005.jpg\" alt=\"\" width=\"1392\" height=\"715\"><\/a><\/p>\n<h2>Users, groups and ACL configuration for sample dataset<\/h2>\n<p>For this post, we used a dataset consisting of a few AWS publicly available whitepapers and stored them in directories based on their categories 
(<code>Best_Practices<\/code>, <code>Databases<\/code>, <code>General<\/code>, <code>Machine_Learning<\/code>, <code>Security<\/code>, and <code>Well_Architected<\/code>) on a file system on FSx for Windows File Server. The following screenshot shows the folders as seen from a Windows bastion host that is part of the AD domain to which the file system belongs.<\/p>\n<p><a href=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image007.jpg\"><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-33965\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image007.jpg\" alt=\"\" width=\"762\" height=\"412\"><\/a><\/p>\n<p>Users and groups are configured in the AD domain as follows:<\/p>\n<ul>\n<li><strong>kadmin<\/strong> \u2013 <code>group_kadmin<\/code><\/li>\n<li><strong>patricia<\/strong> \u2013 <code>group_sa<\/code>, <code>group_kauthenticated<\/code><\/li>\n<li><strong>james<\/strong> \u2013 <code>group_db_sa<\/code>, <code>group_kauthenticated<\/code><\/li>\n<li><strong>john<\/strong> \u2013 <code>group_ml_sa<\/code>, <code>group_kauthenticated<\/code><\/li>\n<li><strong>mary, julie, tom<\/strong> \u2013 <code>group_kauthenticated<\/code><\/li>\n<\/ul>\n<p>The following screenshot shows users and groups configured in the AWS Managed Microsoft AD domain as seen from the Windows bastion host.<\/p>\n<p><a href=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image009.jpg\"><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-33966\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image009.jpg\" alt=\"\" width=\"850\" height=\"629\"><\/a><\/p>\n<p>The ACLs for the files in each directory are set up based on the user and group configurations in the AD domain to which FSx for Windows 
File Server belongs:<\/p>\n<ul>\n<li><strong>All authenticated users (group_kauthenticated)<\/strong> \u2013 Can access the documents in <code>Best_Practices<\/code> and <code>General<\/code> directories<\/li>\n<li><strong>Solutions Architects (group_sa)<\/strong> \u2013 Can access the documents in <code>Best_Practices<\/code>, <code>General<\/code>, <code>Security<\/code>, and <code>Well_Architected<\/code> directories<\/li>\n<li><strong>Database subject matter expert Solutions Architects (group_db_sa)<\/strong> \u2013 Can access the documents in <code>Best_Practices<\/code>, <code>General<\/code>, <code>Security<\/code>, <code>Well_Architected<\/code>, and <code>Databases<\/code> directories<\/li>\n<li><strong>Machine learning subject matter expert Solutions Architects (group_ml_sa)<\/strong> \u2013 Can access the documents in <code>Best_Practices<\/code>, <code>General<\/code>, <code>Security<\/code>, <code>Well_Architected<\/code>, and <code>Machine_Learning<\/code> directories<\/li>\n<li><strong>Admin (group_kadmin)<\/strong> \u2013 Can access the documents in any of the six directories<\/li>\n<\/ul>\n<p>The following screenshot shows the ACL configurations for each of the directories of our sample data, as seen from the Windows bastion host.<br \/><a href=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image011.jpg\"><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-33967\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image011.jpg\" alt=\"\" width=\"1216\" height=\"975\"><\/a><\/p>\n<h2>AWS Single Sign-On configuration<\/h2>\n<p>AWS SSO is configured with the AD domain as the identity source. 
The following screenshot shows the settings on the AWS SSO console.<br \/><a href=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image013-masked.jpg\"><img decoding=\"async\" loading=\"lazy\" class=\"alignnone wp-image-33991 size-full\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image013-masked.jpg\" alt=\"\" width=\"1122\" height=\"752\"><\/a><\/p>\n<p>The groups are synchronized in AWS SSO from the AD, as shown in the following screenshot.<br \/><a href=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image015.jpg\"><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-33969\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image015.jpg\" alt=\"\" width=\"1107\" height=\"429\"><\/a><\/p>\n<p>The following screenshot shows the members of the <code>group_kauthenticated<\/code> group synchronized from the AD.<br \/><a href=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image017.jpg\"><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-33970\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image017.jpg\" alt=\"\" width=\"1085\" height=\"799\"><\/a><\/p>\n<h2>Data source configuration using Amazon Kendra connector for FSx for Windows File Server<\/h2>\n<p>We configure a data source using the Amazon Kendra connector for FSx for Windows File Server in an Amazon Kendra index on the Amazon Kendra console. 
You can <a href=\"https:\/\/docs.aws.amazon.com\/kendra\/latest\/dg\/create-index.html\" target=\"_blank\" rel=\"noopener noreferrer\">create a new Amazon Kendra index<\/a> or use an existing one and add a new data source.<\/p>\n<p>When you add a data source for an Amazon Kendra index, select the FSx for Windows File Server connector by choosing <strong>Add connector<\/strong> under <strong>Amazon FSx<\/strong>.<br \/><a href=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image019.jpg\"><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-33971\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image019.jpg\" alt=\"\" width=\"1383\" height=\"662\"><\/a><\/p>\n<p>The steps to add a data source name and resource tags are similar to adding any other data source, as shown in the following screenshot.<br \/><a href=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image021.jpg\"><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-33972\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image021.jpg\" alt=\"\" width=\"1388\" height=\"856\"><\/a><\/p>\n<p>The specific file system on Amazon FSx and the type of the file system (FSx for Windows File Server in this case) are configured in the <strong>Source<\/strong> section. 
The authentication credentials of a user with admin privileges to the file system are configured using an <a href=\"https:\/\/aws.amazon.com\/secrets-manager\/\" target=\"_blank\" rel=\"noopener noreferrer\">AWS Secrets Manager<\/a> secret.<br \/><a href=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image023.jpg\"><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-33973\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image023.jpg\" alt=\"\" width=\"1138\" height=\"606\"><\/a><\/p>\n<p>The VPC and security group settings of the data source configuration include the details of the VPC, subnets, and security group of Amazon FSx and the AD server. In the following screenshot, we also create a new IAM role for the data source.<br \/><a href=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image025.jpg\"><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-33974\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image025.jpg\" alt=\"\" width=\"1137\" height=\"843\"><\/a><\/p>\n<p>The next step in data source configuration involves mapping the Amazon FSx connector fields to the Amazon Kendra facets or field names. In the following screenshot, we leave the configuration unchanged. 
The step after this involves reviewing the configuration and confirming that the data source should be created.<br \/><a href=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image027.jpg\"><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-33975\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image027.jpg\" alt=\"\" width=\"1154\" height=\"869\"><\/a><\/p>\n<p>After you configure the file system on FSx for Windows File Server where the example data is stored as a data source, you configure Custom Document Enrichment (CDE) basic operations for this data source so that the Amazon Kendra index field <code>_category<\/code> is set based on the directory in which a document is stored. The data source sync is started after the CDE configuration, so that the <code>_category<\/code> attributes for the documents are set during the ingestion workflow.<\/p>\n<p><a href=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image029.jpg\"><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-33976\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image029.jpg\" alt=\"\" width=\"1399\" height=\"800\"><\/a><\/p>\n<p>As shown in the following screenshot, the Amazon Kendra index user access control settings are configured for user and group lookup through AWS SSO integration. 
JSON token-based user access control is enabled to search based on user and group names from the Amazon Kendra Search console.<\/p>\n<p><a href=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image031-masked.jpg\"><img decoding=\"async\" loading=\"lazy\" class=\"alignnone wp-image-33989 size-full\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image031-masked.jpg\" alt=\"\" width=\"1413\" height=\"756\"><\/a><\/p>\n<p>In the facet definition for the Amazon Kendra index, make sure that the facetable and displayable boxes are checked for <code>_category<\/code>. This allows you to use the <code>_category<\/code> values set by the CDE basic operations as facets while searching.<\/p>\n<p><a href=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image033.jpg\"><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-33978\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image033.jpg\" alt=\"\" width=\"1381\" height=\"867\"><\/a><\/p>\n<h2>Search with Amazon Kendra<\/h2>\n<p>After the data source sync is complete, we can start searching from the Amazon Kendra Search console, by choosing <strong>Search indexed content<\/strong> in the navigation pane on the Amazon Kendra console. Because we\u2019re using AWS whitepapers as the dataset to ingest in the Amazon Kendra index, we use \u201cWhat\u2019s DynamoDB?\u201d as the search query. 
Only authenticated users are authorized access to the files on the file system on FSx for Windows File Server; therefore, when we use this search query without setting any user name or group, we don\u2019t get any results.<br \/><a href=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image035.jpg\"><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-33979\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image035.jpg\" alt=\"\" width=\"878\" height=\"373\"><\/a><\/p>\n<p>Now let\u2019s set the user name to <code>mary@kendra-01.com<\/code>. The user <code>mary<\/code> belongs to <code>group_kauthenticated<\/code>, and therefore is authorized to access the documents in the <code>Best_Practices<\/code> and <code>General<\/code> directories. In the following screenshot, the search response includes documents with the facet <code>category<\/code> set to <strong>Best Practices<\/strong> and <strong>General<\/strong>. The CDE basic operations set the facet <code>category<\/code> depending on the directory names contained in the <code>source_uri<\/code>. This confirms that the ACLs ingested in Amazon Kendra by the connector for FSx for Windows File Server are being enforced in the search results based on the user name.<br \/><a href=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image037.jpg\"><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-33980\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image037.jpg\" alt=\"\" width=\"1245\" height=\"829\"><\/a><\/p>\n<p>Now we change the user name to <code>patricia@kendra-01.com<\/code>. 
The user <code>patricia<\/code> belongs to <code>group_sa<\/code>, with access to the <code>Security<\/code> and <code>Well_Architected<\/code> directories, in addition to <code>Best_Practices<\/code> and <code>General<\/code> directories. The search response includes results from these additional directories.<br \/><a href=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image039.jpg\"><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-33981\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image039.jpg\" alt=\"\" width=\"1249\" height=\"813\"><\/a><\/p>\n<p>Now we can observe how the results from the search response change as we change the user name to <code>james@kendra-01.com<\/code>, <code>john@kendra-01.com<\/code>, and <code>kadmin@kendra-01.com<\/code> in the following screenshots.<br \/><a href=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image041.jpg\"><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-33982\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image041.jpg\" alt=\"\" width=\"1231\" height=\"827\"><\/a> <a href=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image043.jpg\"><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-33983\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image043.jpg\" alt=\"\" width=\"1227\" height=\"817\"><\/a> <a href=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image045.jpg\"><img decoding=\"async\" loading=\"lazy\" class=\"alignnone size-full wp-image-33984\" 
src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2022\/03\/09\/ML-8347-image045.jpg\" alt=\"\" width=\"1234\" height=\"834\"><\/a><\/p>\n<h2>Clean up<\/h2>\n<p>If you deployed any AWS infrastructure to experiment with the Amazon Kendra connector for FSx for Windows File Server, clean up the infrastructure as follows:<\/p>\n<ol>\n<li>If you used the <a href=\"https:\/\/aws.amazon.com\/quickstart\/architecture\/amazon-fsx-windows-file-server\/\" target=\"_blank\" rel=\"noopener noreferrer\">Quick Start for FSx for Windows File Server<\/a>, delete the <a href=\"http:\/\/aws.amazon.com\/cloudformation\" target=\"_blank\" rel=\"noopener noreferrer\">AWS CloudFormation<\/a> stack you created so that it deletes all the resources it created.<\/li>\n<li>If you created a new Amazon Kendra index, delete it.<\/li>\n<li>If you only added a new data source using the connector, delete that data source.<\/li>\n<li>Delete the AWS SSO configuration.<\/li>\n<\/ol>\n<h2>Conclusion<\/h2>\n<p>The Amazon Kendra connector for FSx for Windows File Server enables secure and intelligent search of information scattered in unstructured content. 
The data is securely stored on file systems on FSx for Windows File Server with ACLs and shared with users based on their Microsoft AD domain credentials.<\/p>\n<p>For more information on the Amazon Kendra connector for FSx for Windows File Server, refer to <a href=\"https:\/\/docs.aws.amazon.com\/kendra\/latest\/dg\/getting-started-fsx.html\" target=\"_blank\" rel=\"noopener noreferrer\">Getting started with an Amazon FSx data source (console)<\/a> and <a href=\"https:\/\/docs.aws.amazon.com\/kendra\/latest\/dg\/data-source-fsx.html\" target=\"_blank\" rel=\"noopener noreferrer\">Using an Amazon FSx data source<\/a>.<\/p>\n<p>For information on Custom Document Enrichment, refer to <a href=\"https:\/\/docs.aws.amazon.com\/kendra\/latest\/dg\/custom-document-enrichment.html\" target=\"_blank\" rel=\"noopener noreferrer\">Customizing document metadata during the ingestion process<\/a> and <a href=\"https:\/\/aws.amazon.com\/blogs\/machine-learning\/enrich-your-content-and-metadata-to-enhance-your-search-experience-with-custom-document-enrichment-in-amazon-kendra\/\" target=\"_blank\" rel=\"noopener noreferrer\">Enrich your content and metadata to enhance your search experience with custom document enrichment in Amazon Kendra<\/a>.<\/p>\n<hr>\n<h3>About the Author<\/h3>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-20223 alignleft\" src=\"https:\/\/d2908q01vomqb2.cloudfront.net\/f1f836cb4ea6efb2a0b1b99f41ad8b103eff4b59\/2020\/12\/24\/Abhinav-Jawadekar.jpg\" alt=\"Abhinav Jawadekar\" width=\"100\" height=\"133\"><strong>Abhinav Jawadekar<\/strong> is a Senior Partner Solutions Architect at Amazon Web Services. 
Abhinav works with AWS Partners to help them in their cloud journey.<\/p>\n      <\/div>\n","protected":false},"excerpt":{"rendered":"<p>https:\/\/aws.amazon.com\/blogs\/machine-learning\/securely-search-unstructured-data-on-windows-file-systems-with-amazon-kendra-connector-for-amazon-fsx-for-windows-file-server\/<\/p>\n","protected":false},"author":0,"featured_media":1971,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[3],"tags":[],"_links":{"self":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/posts\/1970"}],"collection":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/comments?post=1970"}],"version-history":[{"count":0,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/posts\/1970\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/media\/1971"}],"wp:attachment":[{"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/media?parent=1970"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/categories?post=1970"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/salarydistribution.com\/machine-learning\/wp-json\/wp\/v2\/tags?post=1970"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}