Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> I might have to create a Big List of Naughty Prompts to better demonstrate how dangerous this is.

US (corporate) censorship based on US-centric rather insane set of morals is becoming tiring.



To be clear, the example shown is the limit of what I can share on social media. Grok 4.1 can say far worse.


It’s amusing that censorship in social media is preventing you from posting what you want to post and yet you are asking for censorship of something else (or at least that’s what I understand by your calling this “dangerous”)


In this case, "can share" refers to myself not being comfortable with it.


Have you considered the possible perspective that you yourself deserve censure? You’re the one who asked something (which I infer you deem) questionable to Grok.

Why have such thoughts to begin with?


To be very clear, getting Grok to say henious shit not something I want to subject to random people who follow me on social media even if it's not explicitly against the ToS. If I were to do a writeup or a repository on this, I would need to be very delicate and likely need to involve lawyers, which may make it a nonstarter.

> Why have such thoughts to begin with?

Because my duty to test out how new models respond to adversarial output outweighs my discomfort in doing so. This is not to "own" Elon Musk or be puritanical, it's more as an assessment as a developer who would consider using new LLM APIs and needs to be aware of all their flaws. End users will most definitely try to have sex with the LLM and I need to know how it will respond and whether that needs to be handled downstream.

It has not been an issue (because the models handled adversarial outputs well) until very recently when the safety guardrails completely collapsed in an attempt to court a certain new demographic because LLM user growth is slowing down. I never claim to be a happy person, but it's a skill I'm good at.


I can respect that a whole lot more than the people who think “decency “ causes political division.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: