
Cloudflare actually offers this as a free-tier feature. Even if you don't want to use it for your site, you can just set up a throwaway domain on Cloudflare and periodically copy the robots.txt it generates from your scraper allow/block preferences, since they'll be keeping it up to date with all the latest scrapers.
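For the "periodically copy" part, here's a rough sketch of how that could be scripted. It's only an illustration: `throwaway.example` is a placeholder for whatever throwaway domain you set up, and the function names, the User-Agent string, and the local `robots.txt` path are all made up for the example. It fetches the generated file and overwrites the local copy only when the content has changed (and never with an empty response).

```python
import urllib.request
from pathlib import Path

# Placeholder URL (assumption): point this at your own throwaway domain.
SOURCE_URL = "https://throwaway.example/robots.txt"


def fetch_robots(url: str = SOURCE_URL, timeout: float = 10.0) -> str:
    """Download the generated robots.txt and return it as text."""
    req = urllib.request.Request(url, headers={"User-Agent": "robots-sync/0.1"})
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return resp.read().decode("utf-8", errors="replace")


def update_robots(new_text: str, dest: Path) -> bool:
    """Write new_text to dest if it differs from what is there.

    Returns True when the file was (re)written, False otherwise.
    """
    if not new_text.strip():
        # Don't clobber a good file with an empty or whitespace-only response.
        return False
    old_text = dest.read_text(encoding="utf-8") if dest.exists() else None
    if old_text == new_text:
        return False
    # Write to a temp file first, then swap it in, so readers never
    # see a half-written robots.txt.
    tmp = dest.with_name(dest.name + ".tmp")
    tmp.write_text(new_text, encoding="utf-8")
    tmp.replace(dest)
    return True
```

You'd then call something like `update_robots(fetch_robots(), Path("robots.txt"))` from a cron job or scheduled CI run, and probably eyeball the diff before deploying it, since the output reflects whatever allow/block preferences are set on the throwaway domain.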

