Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> Heretic is a tool that removes censorship (aka "safety alignment") from transformer-based language models without expensive post-training.

I've noticed such "safety alignment" with the current LLMs. Not just insisting on providing the orthodox answer but - if presented with verifiable facts - nothing. “I'm sorry Dave but I can't help you with that” - or words to such effect.

Also: Youtube keeps automatically erasing rude words. How can you do serious historical research with this nonsense?



God forbid offended advertisers. Better to erase history than to lose some shiny pennies.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: