> html Would be willing to bet this is the issue. Adding html files to context f...

gcr · 2025-11-18T16:42:20 1763484140

why?

EDIT: why must users care?

kulahan · 2025-11-18T16:45:14 1763484314

Gotta learn all the quirks of the model before it's replaced in 8 minutes.

NaomiLehman · 2025-11-18T17:08:26 1763485706

Quirks? like context window?

kulahan · 2025-11-18T18:06:32 1763489192

I'm saying it's egregious to expect all users to know the fact that an HTML document, for some reason, uses an enormous amount of context in an LLM designed specifically for working with code.

SPICLK2 · 2025-11-18T16:44:11 1763484251

https://stackoverflow.com/questions/1732348/regex-match-open...

croes · 2025-11-18T17:19:20 1763486360

The accepted answer is one that doesn’t care about the questioner‘s use case and instead gives a pretty excessive "Don‘t do it"

lukan · 2025-11-18T19:09:25 1763492965

It does also give the right solution, using an xml parser.

croes · 2025-11-18T20:41:58 1763498518

We don’t know the use case.

Maybe the questioner is also in full control of the HTML creation and they don’t need a parser for all possible HTML edge cases.

SPICLK2 · 2025-11-19T15:20:10 1763565610

Maybe they are, but they would also need to ensure a well-defined subset of HTML and also show that the subset is a reglar (Chomsky Type 3) grammar.

It seems that even the very conceptually simple example given by the questioner is impossible.