class Aws::Kendra::Types::Urls

Provides the configuration information of the URLs to crawl.

*When selecting websites to index, you must adhere to the [Amazon Acceptable Use Policy] and all other Amazon terms. Remember that you must only use the Amazon Kendra web crawler to index your own webpages, or webpages that you have authorization to index.*

[1]: aws.amazon.com/aup/

@note When making an API call, you may pass Urls

data as a hash:

    {
      seed_url_configuration: {
        seed_urls: ["SeedUrl"], # required
        web_crawler_mode: "HOST_ONLY", # accepts HOST_ONLY, SUBDOMAINS, EVERYTHING
      },
      site_maps_configuration: {
        site_maps: ["SiteMap"], # required
      },
    }

@!attribute [rw] seed_url_configuration

Provides the configuration of the seed or starting point URLs of the
websites you want to crawl.

You can choose to crawl only the website host names, or the website
host names with subdomains, or the website host names with
subdomains and other domains that the webpages link to.

You can list up to 100 seed URLs.
@return [Types::SeedUrlConfiguration]

@!attribute [rw] site_maps_configuration

Provides the configuration of the sitemap URLs of the websites you
want to crawl.

Only URLs belonging to the same website host names are crawled. You
can list up to three sitemap URLs.
@return [Types::SiteMapsConfiguration]

@see docs.aws.amazon.com/goto/WebAPI/kendra-2019-02-03/Urls AWS API Documentation

Constants

SENSITIVE