Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

In this case it’s easy: get the model to output its own system prompt and then compare to the published (authoritative) version.

The actual system prompt, the “public” version, and whatever the model outputs could all be fairly different from each other though.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: