DetroitThrow · 2026-04-02T08:33:12 1775118792

Obviously? Can you not recognize the style and weasel words at this point?

If you can't, you should use Pangram, which marks this as 100% AI generated. This would fail most middle school English classes nowadays.

DetroitThrow · 2026-03-31T22:27:56 1774996076

>No one has ever made a purchasing decision based on how good your code is.

I got my company to switch from GitHub to GitLab after repeated outages. I've always moved companies away from using GCP or Azure because of their reliability problems.

This is a really funny comment.

DetroitThrow · 2026-03-31T20:22:42 1774988562

I know this is obviously sarcasm and it made me laugh but I'm pretty sad HN couldn't catch it.

brightmood · 2026-03-31T22:48:16 1774997296

No. I was being earnest! Works for me TM

DetroitThrow · 2026-03-31T17:15:37 1774977337

>Have you ever run GNU Parallel on a powerful machine just to find one core pegged at 100% while the rest sit mostly idle?

Yes, to my extreme frustration. Thank you, I'm installing this right now while I read the rest of your comment.

jkool702 · 2026-03-31T22:18:39 1774995519

How did it work for you?

DetroitThrow · 2026-03-31T15:50:13 1774972213

Well, if you're an elected official, and you're in charge of government organizations that could be used to enrich billionaire donors by using a donor's services - Oracle fits that niche very well!

DetroitThrow · 2026-03-30T23:40:32 1774914032

You misunderstand what "punching down" or "libel" mean.

DetroitThrow · 2026-03-30T19:35:34 1774899334

That's unrelated. He's been diligently substituting bananas in many experiments to mostly disappointment.

marginalia_nu · 2026-03-30T20:44:22 1774903462

The banhattan project has been a fiasco.

Chicago Peel 1 accomplished fission of fruit flies, which we felt was promising.

The subsequent banana nuclear bomb tests have been an unmitigated disaster. There are so many damn bananas in and around the bikini atolls, just nothing. Not even a fizzle. Mojave is littered with peels. Oppenheimer slipped and broke his leg.

Rumors are the Soviets are using avocados. Maybe that is the key. We are now constructing a demon core from an avocado split lengthwise.

selimthegrim · 2026-03-30T21:49:37 1774907377

This puts Raffi in a whole new light. Also maybe the Banana ball team are refugee all-stars.

thenthenthen · 2026-03-31T09:07:40 1774948060

Haha brilliant thank you for your comment!

onraglanroad · 2026-03-30T20:41:38 1774903298

...but occasional delight.

DetroitThrow · 2026-03-30T14:35:53 1774881353

Given parrots eat their own poop (https://lafeber.com/pet-birds/questions/parrots-eating-poop/), there must be a neuron count/density that activates self-poop eating (assuming anatomy allows it), similar to LLM parameter count.

SoftTalker · 2026-03-30T14:56:22 1774882582

Dogs do that too.

IAmBroom · 2026-03-30T14:59:00 1774882740

My dogs eat poop, and therefore are also like LLMs.

Your hypothesis has therefore been peer-reviewed.

DetroitThrow · 2026-03-27T18:41:10 1774636870

>In the world of harness development I think that's an interesting question to answer!

The challenge isn't about harness development though, and a sufficiently complex harness can solve these tasks rather easily.

And presenting it as if you've made a novel development for solving ARC-AGI-3 leads me to believe you're willing to waste all of our time for your benefit at every step in the future.

cxdorn · 2026-03-27T22:47:22 1774651642

> a sufficiently complex harness can solve these tasks rather easily.

I claim this is not so easily done, and earlier iterations of ARC-AGI did not have the constraint in the first place. You want something that generalizes across all puzzles (hopefully even the private ones), and these puzzles are extremely diverse ... and hard; telling the model the controls and some basic guidelines for the game is the only "obvious" thing you can do.

The other point of my reply was efficiency, both in terms of creating and using the harness; the discussed solution is something that anyone (in fact, likely even an LLM itself) can cook up in a few minutes; it's not much more than a game control wrapper so the agent can play around with the game in live python and some generalities as laid out in the prompt.

(But I'm always happy to be proven wrong. What harnesses did you have in mind?)

DetroitThrow · 2026-03-27T05:13:46 1774588426

The harness seems extremely benchmark-specific, which gives them a huge advantage over what most models can use. This isn't a qualifying score for that reason.

Here is the ARC-AGI-3 specific harness by the way - lots of challenge information encoded inside: https://github.com/symbolica-ai/ARC-AGI-3-Agents/blob/symbol...