Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I suspect that the tricks used include getting the Gatekeeper to implicitly accept the exercise as a roleplay and working on their sense of fair play to require them to acquiesce to arguments about the friendliness of the AI.

The human sense of "fair play" can be abused in many ways.



Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: