What is the search crawler? [EAP is over - the crawler is now generally available] | Community
Skip to main content

What is the search crawler? [EAP is over - the crawler is now generally available]

  • January 11, 2022
  • 33 replies
  • 0 views

Show first post

33 replies

  • Author
  • May 10, 2022

@cedric21 Again apologies for the very long wait!

We are looking into your issue and will provide an answer soon.


  • Author
  • May 10, 2022

@dan28 The limit has been increased to 50000 records. The update to the documentation seems to have gotten stuck somewhere in the update workflow. I will follow up on it, but 50000 is the new limit. 


  • Author
  • May 10, 2022

@cedric21 We are expecting to deploy a fix today or latest tomorrow. Could you let me know if it works or not in a couple of days?


  • Author
  • May 11, 2022

@Korak Purkayastha You can have multiple crawlers from the same or multiple Zendesk accounts using the same sitemap- I.e. indexing the same external website. Only things you need to be aware off is that:

  1. The sitemap always needs to be hosted on the same domain as the pages it points to.
  2. You need to add the domain verification tag for each crawler to the homepage of the domain.

The above goes for both when you have multiple crawlers crawling the same site or only one crawler crawling a site.


  • June 1, 2022

@gorka when will you post about the error Locale not detected?


  • Author
  • June 15, 2022

@Jordan Brown and @Korak Purkayastha

Appologies for the long turn around. Here's the post about how language detection works.
We recently made some changes to the language detection and that's probably why you experience the changes.

TLDR is that we look for a locale in the lang tag, Content-language header and <meta tag> in that order and we match the first found with any exact match in the translations you've set up any help center in the account to be in. If not locale is detected we use CLD to determine the language and we match that to all locales you have enables that has that language.

If there is no match to HC locales as described or no locale or language can be determined the page returns the error "Locale not detected".


  • Author
  • June 15, 2022

@Korak Purkayastha

I confirmed that there was no changes to the "lang" tag and it still matches with the locale enabled in Guide.

Is it still happening?

If it is, could you let me know which account it is and enable account access for us?


  • June 18, 2022

Hi @gorka,

Regarding the product limits mentioned here.

Product limits

  • General Federated search limit of 50000 external records.
  • Record max title length - 255 characters
  • Record max body length - 10000 characters

Does this mean that each External Content Source can have up to 50,000 records?  Or that the Federated Search can only get up to 50,000 results from all active External Content Sources?