Airgentic Help
The Data Sources screen shows your data sources (such as Web Crawl), their sync status, and lets you run a sync manually or on a schedule. This guide describes the Data Sync section and how it fits with crawler settings.

Data sync pulls content from your configured data sources and updates the index used by the AI. For a web crawl data source, that means fetching pages, processing them, and indexing the results so the AI can answer questions from your site.
You'll see a table of data sources, each with Sync Status and Actions, and below that the Web Crawl Schedule and Delete URL sections.
If you need to index content that isn't available via web crawl—such as internal PDFs, Word documents, or standalone HTML files—use the Upload Documents button at the top of the page. This opens a dedicated screen where you can drag and drop files, manage uploaded documents, and trigger indexing.
See the Upload Documents guide for full details on supported file types, public vs. secure storage, and how indexing works.
In the table at the top of the page:
Below the data sources table, Web Crawl Schedule lets you enable or disable automatic syncs and set the time and days of the week. Times use your local timezone. The next run time is shown in the accordion header when the schedule is enabled.
To control what is crawled (URLs, scope, images, metadata), use the gear icon in the Actions column for the Web Crawl row. That opens the Crawler settings screen (General, Crawl Scope, Image Extraction, Field Mappings). Configure crawler settings first if you're adding a new site or changing which pages are included.