This is exactly the same struggle for me. Writing technical content about PostgreSQL and balancing my voice without sounding LLM-written is genuinely difficult.
As English is not my first language, I do run into the problem where the line between "fix my clumsy sentence" and "rewrite my thought" is very thin. Same with writing "boring" technical explanations versus more approachable content. I'm getting pushback for both.
I’ll take a clumsy sentence written by a non-native speaker any day over LLM generated mush. At least I know you chose those words specifically so it gives me some insight into your state of mind and intended meaning.
Any native English speaker who doesn’t live under a rock is very accustomed to reading and hearing English from non-native speakers and familiar with the common quirks and mistakes. English is quite forgiving as a language, we understand you. When in doubt, simplify it.
it's a couple mutually-conflicting languages in a trenchcoat; forgiveness and flexibility are perhaps its defining properties.
To the broader issue: "polish" (in any language) is only valuable insofar as it makes the ideas clearer, attests to innate qualities of the author and/or the investment of their time, or carries its own aesthetic value. As LLMs make (a certain kind of polish) cheap to produce, the value of the middle category attenuates to nothing.
In some specific work contexts, such as writing pull request descriptions, not sounding like AI is something I've given up on trying to optimize. It's simply not worth the effort for me being non-native and writing detailed PR descriptions being so arduous, and the agent already has full context anyway. Obviously any fluff or inaccuracies are aggressively weeded out but I don't care anymore about the AI voice.
> any fluff or inaccuracies are aggressively weeded out
this work is paramount. Without clear evidence of human filtering, a long, well formatted message/PR/doc is likely to reduce my estimate of the value/veracity/relevance of its content.
This. My personal style has always been llm-like, including the generous use of em-dashes, and "not-only-this-that" style mannerisms. It's increasingly difficult to retain reputation.
It's not that simple. LLMs were trained on lots of writing, and the "LLM voice" resembles in many ways good English prose, or at least effective public communications voice.
For years, even before LLMs, there have been trends of varied popularity to, for lack of a better word, regress - intentionally omitting capitalization, punctuation, or other important details which convey meaning. I rejected those, and likewise I reject the call to omit the emdash or otherwise alter my own manner of speaking - a manner cultivated through 30+ years of reading and writing English text.
If content is intellectually lacking, call that out, but I am absolutely sick of people calling out writing because they "think it's LLM-written". I'm sick of review tools giving false positives and calling students' work "AI written" because they used eloquent words instead of Up Goer Five[0] vocabulary.
I am just as afraid of a society where we all dumb ourselves down to not appear as machines as I am of one where machine-generated spam overtakes all human messaging.
Well that isn't what I am suggesting. I'm suggesting people ditch x. Reddit. Probably also ditch hn in the next couple months. If you can run a headless agent to post somewhere, just don't bother visiting that site, honestly a great rule of thumb right there.
That should leave you with media sources like nyt and your local library, which seems healthier to me. And maybe it might encourage a new type of forum to emerge where there is some decentralized vetting that you are a human, like verifying by inputting the random hash posted outside the local maker space.
I hope editorial departments everywhere are taking careful notes on the ars technica fiasco. Agree there's room for some kind of quick "verified human" checkmark. It would at least give readers the ability to quickly filter, and eliminate all the spurious "this sounds like vibeslop" accusations.
i think it depends on what is meant by "good" or "bad". llmism may not be substantive writing, but it's approachable writing. a McDonald's lunch of familiar prose with likewise nationwide popularity and nutritional value.
One of the most common criticisms is the use of the emdash. This is a classic bit of English prose that is not problematic except as a stereotype used to dismiss writing for form rather than for content.
Let's grab a few books off the shelf (literally).
Douglas Adams' The Hitchhiker's Guide to the Galaxy has four emdashes on the very first page:
> It is also the story of a book, a book called THGTTG - not an Earth book, never...
Isaac Asimov's classic The Last Question: three emdashes on the first page (as printed in The Complete Stories, Volume I)
> ...they knew what lay behind the cold, clicking, flashing face -- miles and miles of face -- of that giant computer.
Mark Z. Danielewski, House of Leaves: Three emdashes on page 1
> Much like its subject, The Navidson Record itself is also uneasily contained -- whether by category or lection.
Robert Caro, Master of the Senate: Five emdashes on page one
> Its drab tan damask walls...were unrelieved by even a single touch of color -- no painting, no mural -- or, seemingly, by any other ornament
Other page 1s:
* Murakami - 1Q84: 1
* Murray/Cox - Apollo: 1
* Meadows - Thinking in Systems: 1
* Dostoyevsky - The Brothers Karamazov (Pevear/Volokhonsky translation): 4
* Caro - The Power Broker: 5
* Hofstadter - Godel, Escher, Bach: 3
Honestly, when I started this post I expected to have to dig deeper than page 1. The emdash is an important part of English-language literature and I reject the claim that we should ignore all writing that contains it.
No one is asking that we reject all prose with emdash. Not all emdash-users are LLMs, but many LLMs are profligate emdash-users, so adjust your priors accordingly.
Secondarily, I think there's a part of the discourse missing: the presence of a syntactic emdash in a sentence on the internet is not itself a strong signal of LLM-writing - but the presence of an actual emdash glyph (—) should raise some eyebrows, esp. in fora that aren't commonly authored in rich text editors (here, twitter, ...)
Before LLMs, the em-dash glyph was a decent tell simply that... the author was using a Mac, because it's a simple and easy-to-remember (or even guess!) key-combo on there. Not that you can't type it on other keyboards, but the Mac one for whatever reason had a combo of users-who-wanted-to-type-it and layout-that-makes-it-easy that resulted in a high proportion of correct em-dash employers being Mac users.
(option-underscore, or option-shift-dash if you prefer to think of it that way)
On iOS, you can type it by simply holding down on the "dash" button then selecting the em-dash from the list of options it presents. It may also correct double-dash to em-dash a lot of the time, not sure.
I have used the correct em-dash everywhere I can for over a decade, which amounts to nearly everywhere.
I agree. pgwire-replication is useful when you need to build a customized and closely controlled pipeline. That said, it only gives you the first part of handling the data (reading from the source); you still need to implement the rest yourself.
OP here - still have to try (generally operate on VM/bare metal level); but my understanding is that ioctl call would get passed to the underlying volume; i.e. you would have to mount volume
OP here - yes, this is my use case too: integration and regression testing, as well as providing learning environments. It makes working with larger datasets a breeze.
Actually, a "ghost station" shell has existed under Satellite 3 since 1998, though it was never finished or opened to passengers. The tunnel was built that far just to give the trains space to turn around.
I wouldn't usually use the 'non-native speaker argument', but thank you! Just yesterday I was accused of sounding like AI - https://news.ycombinator.com/item?id=46262777 - my default mode is that I oscillate between sounding too boring/technical, or when trying to do my best, sounding like AI
Your article is obviously written by Slavic writer, haha. Characteristic sound of Slavic tint to the prose. If it is LLM, then prompt engineering is good. I believe it is mostly human-written.
Author here – it’s actually funny, as you pointed out parts that are my own (TM) attempts to make it a bit lighthearted.
LLM is indeed used for correction and improving some sentences, but the rest is my honest attempt at making writing approachable. If you’re willing to invest the time, you can see my fight with technical writing over time if you go through my blog.
(Writing this in the middle of a car wash on my iPhone keyboard ;-)