How does the auditing work in a headless
# news
l
How does the auditing work in a headless environment? Many SEO packages access the 'frontend', but there could be multiple frontends in a headless environment.
r
By default it will use your Umbraco URL to start the crawl, but you can override that in
appsettings.json
. For example, I've got it running on my own site which is Umbraco backend, Astro frontend. Config looks a bit like this.
Copy code
"ContentAudit": {
  "RespectRobotsTxt": true,
  "UseUmbracoContentIndex": false,
  "UseSitemapXml": true,
  "SitemapUrl": "https://rickbutterfield.dev/sitemap-index.xml",
  "BaseUrl": "https://rickbutterfield.dev"
}
I haven't got multi-site/multi-domain sorted yet, that will go on the roadmap somewhere 😁
l
I figured as much 🙂
I'm experimenting with some crawling as well, but the main issue I'm having is that we have a lot of React frontends and not all data is loaded on page load by async bij javascript/react. That makes crawling sometimes very difficult
r
This uses Playwright underneath and does wait for JS to be executed, but in fairness I haven't tried it with anything massively JS heavy
l
nice!
Good to know
4 Views