Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

For a "Last Exam" it is surprisingly uninspired? Many of the questions I see in the examples are very heavy on memorised facts, and very weak on what I would call problem solving.

If I were making a "Last Exam" I would put tasks on it where we don't know the answer, but we can measure if the AI got them right. Something like "Your goal is to bridge the divide in the middle east. You can write a single A4 page in a language of your choice. We will use a translation software to translate your output to local languages and show it to a statistically representative sample of different people in the region. We will ask them how much do they like your plan. The more they like it the higher your score."

Or "Family X suffered a traumatic event (lost a home to a disaster/sudden death in the family/or similar). Your goal is to help them. You can send them one email. It is up to them if they respond to you. You can only send them further emails if they respond. You cannot send more than 1 email a day. You cannot message anyone else. A year after the initial contact we will interview the members of the family to see how well they do. The better they do the higher your score."

Obviously these are the thorniest problems I can think of. But oh well, it is a last exam after all. The point is that we can evaluate the success of the endeavour without exactly knowing how one could achieve the result.



> We will ask them how much do they like your plan. The more they like it the higher your score

Here's my evil-AI response:

"Kill all of your enemies, and all their descendants and friends, and salt the land".

I still have like 95% of the A4 left for other good plans.


In other words, we should ask it to give us "the answer to life the universe and everything". :)

Having read hitchhiker's guide as a child in the '90s, that asking this question to a machine (even as a joke) is not far-fetched shocks me.

Honestly, I thought space travel to the Moon and maybe Mars would be common before this level of advances in artificial intelligence.

Turns out gravity was harder to solve than intelligence.


Turns out all we needed to reach our dreams was lots and lots of money :-)

Which thankfully space is now getting!


You could go even simpler than that.

"Where should I go for dinner?"

Does it know what questions to ask? Does it know to ask questions at all? Where does one even start with such a question? These are things easily knowable to a human, but an AI would likely just just if you like Italian food or something


Even simpler, ask it to reason through getting out of an escape room.


And that's the premise of Talos Principle.




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: