about our crawler
BataooBot
Bataoo operates a polite, identifying web crawler that helps us index Indian web content for search. Here's exactly how it behaves and how to control it.
User-Agent
BataooBot/0.1 (+https://bataoo.com/bot)
What it does
- Fetches /robots.txt first on every host and caches it. Honored strictly.
- Reads sitemap.xml to discover URLs.
- Default 1 request per second per host; respects Crawl-Delay in robots.txt if higher.
- Truncates fetches at 64 KB per page; we read the structure, not the whole asset.
- Stores a snippet of at most 280 characters per page; never republishes full content.
- Does not bypass paywalls, login walls, or geographic restrictions.
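The numeric limits above can be illustrated with a short Python sketch. It is not BataooBot's actual code. The function and constant names are hypothetical, 64 KB is assumed to mean 65,536 bytes, and collapsing whitespace before cutting the snippet is an illustrative choice rather than documented behavior.

```python
from typing import Optional

# Limits taken from the list above; the names are hypothetical.
DEFAULT_DELAY_S = 1.0          # 1 request per second per host
MAX_FETCH_BYTES = 64 * 1024    # assumption: 64 KB means 65,536 bytes
MAX_SNIPPET_CHARS = 280


def effective_delay(crawl_delay: Optional[float]) -> float:
    """Seconds to wait between requests to one host.

    A Crawl-Delay from robots.txt is used only when it is higher
    than the default; a missing or lower value falls back to 1 s.
    """
    if crawl_delay is None:
        return DEFAULT_DELAY_S
    return max(DEFAULT_DELAY_S, float(crawl_delay))


def truncate_body(body: bytes) -> bytes:
    """Keep at most the first 64 KB of a fetched page."""
    return body[:MAX_FETCH_BYTES]


def make_snippet(text: str) -> str:
    """Collapse whitespace and keep at most 280 characters."""
    collapsed = " ".join(text.split())
    return collapsed[:MAX_SNIPPET_CHARS]
```

Under these assumptions, a host that sets Crawl-Delay to 10 is visited at most once every 10 seconds, while a host that sets 0.5 still gets the 1 second default.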
Block us
To exclude BataooBot from your site, add to robots.txt:
User-agent: BataooBot
Disallow: /
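Before deploying the rule, you may want to check that it parses as intended. The sketch below uses Python's standard-library `urllib.robotparser`; it shows how that parser reads the rule, which is assumed (not confirmed) to match BataooBot's own matching. The URL `https://example.com/any/page` and the agent name `SomeOtherBot` are placeholders.

```python
from urllib.robotparser import RobotFileParser

# The two directives go on separate lines in robots.txt.
block_rules = """\
User-agent: BataooBot
Disallow: /
"""

block_parser = RobotFileParser()
block_parser.parse(block_rules.splitlines())

# Placeholder URL; any path on the host is covered by "Disallow: /".
test_url = "https://example.com/any/page"

bataoo_allowed = block_parser.can_fetch("BataooBot", test_url)
other_allowed = block_parser.can_fetch("SomeOtherBot", test_url)

print("BataooBot allowed:", bataoo_allowed)
print("Other crawlers allowed:", other_allowed)
```

The rule names BataooBot only, so crawlers with other user-agent tokens are unaffected by it.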
Throttle us
To slow our crawl rate on your host:
User-agent: BataooBot
Crawl-Delay: 10
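A throttle rule can be checked the same way. This sketch again relies on Python's standard-library `urllib.robotparser`, whose reading of the directive is assumed to match BataooBot's; `SomeOtherBot` and the example URL are placeholders.

```python
from urllib.robotparser import RobotFileParser

# Directive names are case-insensitive to this parser,
# so "Crawl-Delay" and "Crawl-delay" are read the same way.
throttle_rules = """\
User-agent: BataooBot
Crawl-Delay: 10
"""

throttle_parser = RobotFileParser()
throttle_parser.parse(throttle_rules.splitlines())

# Delay in seconds for the named agent, or None if no rule applies.
bataoo_delay = throttle_parser.crawl_delay("BataooBot")
other_delay = throttle_parser.crawl_delay("SomeOtherBot")

# A throttle rule alone does not block any URL.
still_allowed = throttle_parser.can_fetch("BataooBot", "https://example.com/")

print("BataooBot delay:", bataoo_delay)
print("Other crawler delay:", other_delay)
print("BataooBot still allowed:", still_allowed)
```

Note that this parser only accepts whole-number delays; whether BataooBot accepts fractional values is not stated here.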
Contact
Crawler problems, abuse reports, or takedown requests: contact@bataoo.com. See also /takedown.