Hmm...
The key is to decompose a big, hard problem into easier, atomic sub-problems. However, the decomposition itself is the difficult part, and this paper is not about that: they decompose the task with a human-written prompt.
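As a purely illustrative sketch of what "decompose with a human-written prompt" could look like: the sub-step prompts, the model name, and the chaining logic below are my assumptions, not the paper's actual setup; only the OpenAI client calls are real API.

```python
from openai import OpenAI

client = OpenAI()

# The human author fixes the decomposition by writing the sub-step prompts;
# the model only has to solve each small step, never the whole problem at once.
SUBSTEPS = [
    "List the facts given in this problem, one per line:\n{context}",
    "Given those facts, compute the intermediate quantities needed:\n{context}",
    "Using the work above, state the final answer in one sentence:\n{context}",
]

def ask(prompt: str) -> str:
    resp = client.chat.completions.create(
        model="gpt-4o-mini",  # placeholder model name, not the paper's
        messages=[{"role": "user", "content": prompt}],
    )
    return resp.choices[0].message.content

def solve(problem: str) -> str:
    context = problem
    answer = ""
    for step in SUBSTEPS:
        answer = ask(step.format(context=context))
        context = context + "\n\n" + answer  # carry earlier results into the next sub-step
    return answer
```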
Prompts are a really interesting way of programming: we can express logic over abstract adjectives like ‘happy’ and ‘unsatisfied’, in a fairly free-form way that would be hard to pin down in ordinary code.
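A tiny sketch of that "abstract adjective as logic" idea: a prompt turned into a boolean predicate. The yes/no convention, wrapper function, and model name are assumptions for illustration, not from the paper.

```python
from openai import OpenAI

client = OpenAI()

def is_described_as(adjective: str, text: str) -> bool:
    """Use a prompt as a fuzzy predicate over an abstract adjective ('happy', 'unsatisfied', ...)."""
    resp = client.chat.completions.create(
        model="gpt-4o-mini",  # placeholder model name
        messages=[{
            "role": "user",
            "content": f"Answer 'yes' or 'no' only. Is the following text best described as {adjective}?\n\n{text}",
        }],
    )
    return resp.choices[0].message.content.strip().lower().startswith("yes")

# The fuzzy predicate then composes with ordinary boolean logic, e.g.:
# unhappy = [r for r in reviews if is_described_as("unsatisfied", r)]
```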
"As an analogy, imagine that you could put your dog or cat into hibernate mode whenever you left on a trip. Your dog or cat might not notice, but even if they did, they might not mind. Now imagine that you could put your child into hibernate mode whenever you were too busy to spend time with them. Your child would absolutely notice, and even if you told them it was for their own good, they would make certain inferences about how much you valued them. That’s the situation the human characters in the story find themselves in."
Fascinating.
"Note on "tuned": OpenAI shared they trained the o3 we tested on 75% of the Public Training set. They have not shared more details. We have not yet tested the ARC-untrained model to understand how much of the performance is due to ARC-AGI data."
I really want to see how many training pairs were needed to achieve this score. If it only takes a few, say 100 pairs, that would be amazing!
Trained with roughly 300 raw tasks taken directly from the ARC public training set (75% of its 400 tasks), without any data augmentation such as generating many more pairs with some kind of ARC generator? That's amazing.
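To make the augmentation remark concrete, here is a rough sketch of the kind of geometric/color augmentation people commonly apply to ARC grids; the grid representation and the specific transforms are my assumptions for illustration, not a description of what OpenAI actually did, and color permutation is not valid for every task.

```python
import random

# An ARC example is an (input, output) pair of grids: lists of lists of color indices 0-9.
# Applying the same rotation/flip/color permutation to both grids of a pair usually
# yields another valid pair, so a few hundred raw pairs can be expanded into many more.
Grid = list[list[int]]

def rotate90(grid: Grid) -> Grid:
    """Rotate a grid 90 degrees clockwise."""
    return [list(row) for row in zip(*grid[::-1])]

def flip_horizontal(grid: Grid) -> Grid:
    return [row[::-1] for row in grid]

def permute_colors(grid: Grid, mapping: dict[int, int]) -> Grid:
    return [[mapping[c] for c in row] for row in grid]

def augment_pair(inp: Grid, out: Grid, rng: random.Random) -> tuple[Grid, Grid]:
    """Apply one random, consistent transformation to both grids of a pair."""
    k = rng.randrange(4)          # number of 90-degree rotations
    flip = rng.random() < 0.5
    colors = list(range(10))
    rng.shuffle(colors)
    mapping = dict(enumerate(colors))

    def transform(g: Grid) -> Grid:
        for _ in range(k):
            g = rotate90(g)
        if flip:
            g = flip_horizontal(g)
        return permute_colors(g, mapping)

    return transform(inp), transform(out)

# Usage: new_inp, new_out = augment_pair(inp, out, random.Random(0))
```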