Maybe some logic honeypot would be good, such as a infinite content paging list with some random trigger hidden at pages with non-sense titles. When one IP hits these triggers, it is automatically banned.
Bots will trigger it by walking through all pages, but real human would not click in since the paging is non-sense and titles are non-sense.
Yeah, but I don't want to ban bots. Also, they are not actively crawling anything, but rather mirroring the content on demand. At least that's my observation so far... thanks anyways.
Bots will trigger it by walking through all pages, but real human would not click in since the paging is non-sense and titles are non-sense.