They do, but that's kind of the article's point - someone still has to write and maintain the per-model chat template and tool call parsing inside vllm/sglang. Every time a new model ships with a slightly different format, the inference server needs an update. The M×N problem doesn't disappear, it just gets pushed one layer down.
I would like to see them include some analysis of the first recorded audio conversations such as on wax cylinders or the 1930s WPA/Federal Writers’ Project. Was sentence complexity the same as it is now?
The focus on clarity of thought is a modern shift, or at least the "modern preference". Hemingway was a big proponent on short, direct sentences and few or no adjectives.
In the 1800s and early 1900s, complex sentence structure signaled intelligence for both the author and reader. It was a form of entertainment, in a way, when books were few and nights were long. Try reading Henry James for an idea about what this looked like in practice. Shakespeare is another obvious example of "heightened language" besides archaic words the play are written in iambic pentameter and the spoken text is far from natural (yet incredibly precise).
> the play are written in iambic pentameter and the spoken text is far from natural (yet incredibly precise)
(Iambic pentameters are 10-syllable lines with alternate syllables unstressed and stressed, like "if MUSic BE the FOOD of LOVE ...", the so-called heartbeat rhythm)
Shakespeare actually used a variety of different styles to demarcate different characters, moods, etc. As a very rough rule-of-thumb in Shakespeare, posh characters speak in iambic pentameters, commoners and clowns speak in prose. In A Midsummer Night's Dream, for example, the Athenians speak in iambic pentameters and the clowns speak in prose. When the clowns put on a play for the Athenians, the clowns and the Athenians swap speaking styles, so the Athenians make snarky comments in prose (just like a badly behaved audience) on the badly rhymed acting of the clowns. The fairies, meanwhile, speak in trochaic verse, so their king and queen sound stylistically different from their Athenian equivalents, almost like Shakespeare has given them a foreign accent. When two characters are arguing, the ten syllables of a normal line are sometimes split between them to emphasize the back and forth nature. If a character is flustered or annoyed, their lines may be obviously different from the 10 syllable norm, again to emphasize their mood.
For actors learning their lines, the syllable counts almost act as stage direction hints: if they aren't 10 syllables, then some mood or other needs to be taken into account.
increasing the (social) pressure on maintainers to get PRs merged seems like the last thing you should be doing in light of preventing malicious code ending up in dependencies like this
i'd much rather see a million open PRs than a single malicious PR sneak through due to lack of thorough review.
By 1066, not quite. That was an invading army led by the King of Norway to press his claim on the throne of England. I’m sure many of the soldiers in that army had been Vikings but at that time they were soldiers of a Christian king, which would have been considered much more legitimate than being a heathen raider.
I guess the Normans were also of Nordic descent but they had given up the Viking way of life a century before.
“Viking” isn’t a matter of heredity though. If your grandfather was a bricklayer, you’re not a bricklayer as a matter of heredity, you have to actually lay bricks. Likewise, if your grandfather was a viking, you have to actually go raiding and pillaging to be one yourself. Which is not something you’re likely to do if the king of France gave your grandfather an entire duchy in exchange for a promise to stop doing that sort of thing.
What's gonna bake your noodle is, Viking raids were the VC-funded startups of medieval northern Europe. Norse kings were very generous with their kingdom's treasure, to the raiders with the most fearsome reputations.
I scrolled through the entire readme and didn't see any mention of sqlite_vec. My feedback for the readme would be to optimize for signal- if it is a layer on top of sqlite_vec say what it does on top of that etc
But it is not a layer on top of sqlite_vec, so your logic seems to be:
If the tool uses sqlite_vec (which it doesn't)
Then it should say so in the readme.
You didn't find evidence sqlite_vec in the readme, so your conclusion was that it should be added.
This is seemingly based off of your not liking the author mentioned that it would be the "sqlite of RAG" (which, notably, does not at all imply the use of sqlite, in fact, it suggests this is an alternative to sqlite).
Nothing is very clear here.. the benchmarks might just be comparing WAL mode on vs off, or something else entirely, SQLite does not have 150ms latency on such a small database.
The original commenter wasn't making statements about sqlite being involved, they were saying that a specific library should be mentioned if it was involved, which it wasn't. Unless you are saying sqlite_vec is part of the dependency chain through GRDB?
It would be like commenting "If any other developers were involved in this project you should mention them."
sqlite-vec is a great vector index — Wax actually uses SQLite under the hood too.
The difference is the layer. sqlite-vec gives you vec_distance_cosine() in SQL. Wax gives you: hand it a .mov file, get
back token-budgeted, LLM-ready context from keyframes and transcripts, with EXIF-accurate timestamps and hybrid
BM25+vector search via RRF fusion — all on-device.
It's the difference between a B-tree and an ORM. You'd still need to write the entire ingestion pipeline, media parsing,
frame hierarchy, token counting, and context assembly on top of sqlite-vec. That's what Wax is.
Thanks for clarifying. If mv2s is a sqlite3 db file under the hood that is something I would like to see in the readme as it would make me more likely to use.
This seems like a very white-centric categorization to assume that a Tamil Brahmin should necessarily see himself as in the same racial solidarity group as a Somali, Haitian, or Venezuelan as opposed to a European.
He’s going to be a victim of the same anti immigration anti foreigner anti minority movement that the other groups you mentioned will be victim to. Remember the DHS tweeted a supremacist call to deport 100 million people. That is only possible if you denaturalize all the citizens who were once just immigrant workers on visas. The fact that he’s “Tamil Brahmin” isn’t relevant - his right to exist in America is at risk, by his very own actions of supporting the administration.
The vice president's wife is a Telugu Brahmin, and the director of the FBI is a Gujarati Patidar. What evidence do you have for Trump and ICE having a problem with Hindustanis?
Take a look at the ecosystem of far right influencers and personalities guiding the MAGA movement, and what they post on Twitter. Go look at replies on deportation and ethnic cleansing (“remigration”) posts made by Elon or DHS or others. You’ll see a torrent of vile supremacist content, and most of it is directed at Indians. That includes insults aimed at Usha Vance, various Indian CEOs, and Indian people at large. It’s why Vivek quit social media - almost every single reply to any post he made was racist. It’s what the right always was, unfortunately, even though it looked for a bit like it might be moving on from that past.
When the deportation push shifts from illegal immigrants to legal ones - which it has to because their stated goal of deporting 100 million Americans is not possible otherwise - it’ll come for Indians too. In fact I would say Indians are perhaps the most hated group on the far right.
As for those Indians still ignoring what the administration stands for and where it’s going, such as the ones working for Trump, they’re useful idiots. Doing the dirty work that will come for them in the future.
reply