One of the big problems with Attention Mechanisms is that the Query needs to look over every single key, which for long contexts becomes very expensive.
A little side project I've been working on is training a model that sits on top of the LLM, looks at each key, assigns it a lifespan, and evicts it once that lifespan has expired if it's no longer needed. Still working on it, but my first-pass test cut the number of retained keys by 90%!
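For anyone wondering what the eviction step might look like mechanically, here's a rough sketch (the function, names, and tensor shapes are my own illustration, not the project's actual code):

```python
import torch

def evict_expired(keys, values, ages, predicted_lifespans):
    """Drop KV-cache entries whose age has passed their predicted lifespan.

    keys, values:         (seq_len, d) cached key / value tensors
    ages:                 (seq_len,) steps each token has spent in the cache
    predicted_lifespans:  (seq_len,) per-token lifespan from the side model
    """
    keep = ages < predicted_lifespans   # token is still within its lifespan
    keep[-1] = True                     # never evict the most recent token
    return keys[keep], values[keep], ages[keep]
```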
In general, prediction markets can’t be “correct” or “incorrect” - for instance, if a prediction market says there’s a 60% chance of an event occurring, and it doesn’t occur, was the market right or wrong? Well, it’s hard to say - certainly the market said the event was more likely to occur than not, but only just, and who knows? Maybe the event _only just_ failed to occur, and very nearly did!
So generally we say a prediction market is “correct” if it is “well calibrated”, which is to say that if we took all the events that the market said had a 60% chance of occurring, then approximately 60% of these events occurred (with the same holding true for all other percentages).
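For the curious, a calibration check is simple to compute: bucket the predictions and compare each bucket's stated probability with how often those events actually happened. A rough sketch (the function and the 10% bucket width are my own choices):

```python
from collections import defaultdict

def calibration_table(predictions, outcomes):
    """Bucket predictions to the nearest 10% and compare each bucket's
    stated probability with the fraction of events that actually occurred."""
    buckets = defaultdict(list)
    for prob, happened in zip(predictions, outcomes):
        buckets[round(prob, 1)].append(happened)
    return {f"{bucket:.0%} predicted": f"{sum(hits) / len(hits):.0%} occurred"
            for bucket, hits in sorted(buckets.items())}

# A market that said 60% for ten events, six of which happened, comes out
# as {'60% predicted': '60% occurred'} -- i.e. well calibrated at that level.
```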
On this note, an interesting phenomenon that used to occur was “favorite-longshot bias”, where markets would consistently overestimate the likelihood of longshot events occurring - so events that the market predicted would occur 10% of the time would only occur 5% of the time. What’s fascinating is that once people realized that this bias existed, they began to exploit it by making bets against longshots, which had the effect of moving the market and removing the bias, making the markets well calibrated. It’s a pretty neat example of the efficient market hypothesis in action!
Some of the longshot bias still exists and can't be removed due to technical constraints on the platforms. A lot of the time there is a minimum contract price, which effectively means the probability of unlikely events cannot be modeled as lower than 1% or 0.1% or whatever. But there are contracts for events much less likely than that.
There are also issues with the time value of money for long-shot events. Someone has to be willing to buy a share of "No", and if that works out to a return lower than the risk-free rate (e.g. buying T-bills), there will be no incentive to take the "No" position. That makes anything roughly under 3-4% per year pretty unreliable.
> for instance, if a prediction market says there’s a 60% chance of an event occurring, and it doesn’t occur, was the market right or wrong? Well, it’s hard to say - certainly the market said the event was more likely to occur than not, but only just, and who knows? Maybe the event _only just_ failed to occur, and very nearly did!
For most events like this, you'd want to see the market spike to 0% or 100% as the deadline approached. And in particular for an event that happens, you want to see the spike to 100% before it happens. Remaining at 60% until after the fact is wrong because the occurrence of the event becomes more certain as it gets closer.
Being "well-calibrated" as you describe is a very bad quality metric in the sense that two sets of predictions can achieve the same calibration profile while differing markedly in quality. The farther the predictions are from 50%, the better they are, but your calibration metric doesn't take this into account.
The issue there is time. The Nobel prizes will be announced in around 9 months. Buying a share of "No" would currently cost 98.2 cents, working out to a (basically) risk-free return of around 2.4%. Alternatively someone who wants a very low-risk investment product could just buy 1-year t-bills with a return of... ~3.5%. And that doesn't require messing around with buying crypto and the inherent risk of trusting Polymarket with your money.
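Checking that arithmetic (numbers from the comment above; simple rather than compound annualization):

```python
cost = 0.982        # price of one "No" share, in dollars
payout = 1.00       # what it pays if the market resolves "No"
months = 9          # rough time until resolution

period_return = payout / cost - 1            # ~1.8% over nine months
annualized = period_return * 12 / months     # ~2.4% per year

print(f"{period_return:.1%} over {months} months, ~{annualized:.1%}/year")
# Compare with ~3.5% on a 1-year T-bill, with none of the platform risk.
```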
Anything under 3%/year of time until decision is going to have pretty limited predictive value within that range. Anything starting above that range will end up hitting that floor rather than going to zero because of the difficulty of finding a counterparty.
While Polymarket does offer interest through holding rewards, it looks like it doesn't for this particular market.
That doesn't mean there aren't other explanations. It could mean that No holders expect to incur an opportunity cost greater than the risk free rate. Combine that with how there's low liquidity (there's less than $300 on the book buying Yes, and at 2 cents or less), and so we could just be seeing the effect of random fish temporarily distorting the price. It could also mean that the risk of a smart contract failing is making it not worth the hassle for a market maker to come in at such a slim margin and low volume.
They're offering interest on roughly a dozen hand-picked markets, according to their documentation. (I wasn't aware of that, so I stand corrected on the general assertion that they never do.)
> That doesn't mean there aren't other explanations.
Why do you need other explanations, when the observed probability can be precisely and fully explained by opportunity cost?
I don't have to "need" other explanations in order for them to exist. The current price does happen to accurately reflect what the risk free rate would imply. But look at the graph history: it hovered around 1% for a large chunk of December.
How much volume on this bet? Let's ignore black swan events and say it's a guaranteed 3% return. On how much? $1? $10? $1m?
I'd weigh the accuracy by how much money is at stake...
Even then, a "perfect" prediction market need not be accurate, if people use it for hedging. If some low probability event is really bad for me, I may pay over odds (pushing the implied probability up) to get paid if it happens. The equilibrium probability may be efficient, reasonable and biased.
Well, Nobel peace prizes aren't usually awarded to people calling for invasions of their home country either, or cheering for the extrajudicial double-tap killing of smugglers/random fishermen.
Who's to say a dead person can't have done the most to "promote peace conferences" as mentioned in Nobel's will? These days, I'd say dead people make a larger net contribution to peace than most politicians.
To be fair you’re not really providing a hard stance against the estimate. You say it is unlikely, and indeed the prediction is a 3% chance. That’s unlikely.
Well markets are evaluated on a number of different metrics depending on what you’re trying to determine.
If you want to be pedantic about it and select one metric, markets are evaluated on their Brier score or some other proper scoring rule, not accuracy.
However, I prefer calibration as a high level way to explain prediction market performance to people, as it’s more intuitive.
Yeah it's a good way to introduce the idea. But I don't think someone would really grasp it until they understand why both calibration and "discrimination" are necessary in determining if a prediction market is accurate.
I suspect that you are arguing semantics, where parent and grandparent focus on the nuance of what is ACTUALLY being measured. I am saying it like this because, while I have never used prediction markets, I briefly looked into them to see if I could use them well. The question of accuracy came up, which is why I happen to align with the posters above.
Noob question from me: what’s the difference between accuracy and calibration? A well-calibrated market would be more accurate and vice versa, no?
Edit: just found the answer myself: “accuracy measures the percentage of correct predictions out of total predictions, while calibration assesses whether a prediction market's assigned probabilities align with the actual observed frequency of those outcomes”
Suppose there are 1000 events and 500 will have outcome A and 500 will have outcome B. If you predict a 50% chance of A for every event you'll be perfectly calibrated. On the other hand, if you predict a 90% chance of a certain outcome and you're right for 800 events, you're not perfectly calibrated but you have a lower Brier score (lower is better).
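To put numbers on the two scenarios above (the Brier score is just the mean squared error between the stated probabilities and the 0/1 outcomes):

```python
def brier(predictions, outcomes):
    """Mean squared error between predicted probabilities and 0/1 outcomes
    (lower is better)."""
    return sum((p - o) ** 2 for p, o in zip(predictions, outcomes)) / len(outcomes)

# Scenario 1: 50% on every event, 500 of 1000 occur -- perfectly calibrated.
print(brier([0.5] * 1000, [1] * 500 + [0] * 500))   # 0.25

# Scenario 2: 90% on every event, but only 800 of 1000 occur -- miscalibrated,
# yet the sharper forecasts still get the better (lower) score.
print(brier([0.9] * 1000, [1] * 800 + [0] * 200))   # 0.8*0.01 + 0.2*0.81 ≈ 0.17
```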
A forecaster can be calibrated but almost only assign probabilities in the 40-60% range. This is not as useful as one assigning calibrated probabilities over the full range.
We try to measure the increased usefulness of the latter with proper scoring rules.
I know one anecdote is not data, but his investment in BYD all the way back in 2008 does counter that viewpoint somewhat - his investment success in the BYD case isn’t from other investors following him in, it’s from him identifying BYD as a successful company far before any other major investors did.
Overly specific LLM research into KV cache eviction.
The vast majority of tokens in a sequence will be irrelevant to an attention mechanism outside of a very small window.
Right now however we tend to either keep all cache values forever, or dump them all once they hit a certain age.
My theory is that you can train a model to look at the key vectors and, from that information alone, work out how long to keep the token in the cache. Results so far look promising, and it’s easy to add after the fact without retraining the core model itself.
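For concreteness, here is a minimal sketch of what such a bolt-on predictor could look like, assuming a small MLP over the key vectors (my own illustration of the idea, not the actual architecture):

```python
import torch
import torch.nn as nn

class LifespanHead(nn.Module):
    """Tiny MLP mapping each cached key vector to a predicted lifespan,
    i.e. how many future steps the token should stay in the KV cache.
    It trains separately, so the base model's weights stay untouched."""

    def __init__(self, d_key: int, hidden: int = 64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(d_key, hidden),
            nn.ReLU(),
            nn.Linear(hidden, 1),
            nn.Softplus(),      # lifespans must be non-negative
        )

    def forward(self, keys: torch.Tensor) -> torch.Tensor:
        # keys: (seq_len, d_key) -> (seq_len,) predicted lifespans
        return self.net(keys).squeeze(-1)
```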
I made a tool for this! It's an essay writing platform that tracks the edits and keystrokes rather than the final output, so its AI detection accuracy is _much_ higher than other tools:
https://collie.ink/
I've been exploring this concept in LLMs for the last week or so, to see if I can RL train one into being inherently curious.
I haven't got anything beyond my own working notes and some basic plots, but I've unceremoniously dumped them into a document here in case anyone else finds them interesting. If so, I'd _love_ to chat with you: enjeyw @ google's email provider.
I mean I kind of get it - overgeneralising (and projecting my own feelings), but I think HN favours introducing and discussing foundational concepts over things that are closer to memorising/rote learning. I think AI Math vs Leetcode broadly fits into that category.
I have no reasonable theory as to how Trump/RFK will be able to reveal credible information about Autism that wasn’t already available from public research papers.
About 10 years ago I became more aware that reducing my consumption of meat was good for the world. This was good for Beyond Meat’s prospects.
About 5 years ago I became more aware that reducing my consumption of ultra processed food was good for me. This was very bad for Beyond Meat’s prospects.
A little side project I've been working on is training a model that sits on top of the LLM, looks at each key, assigns it a lifespan, and evicts it once that lifespan has expired if it's no longer needed. Still working on it, but my first-pass test cut the number of retained keys by 90%!
https://github.com/enjeyw/smartkv