Why would any content producer want to make it easier for AI bots to scrape their site? Perhaps that's too broad, but there's no way any free / open ad-supported content producer would ever want to support this. Someone needs to figure out microtransactions soon before all the content producers shut up shop, and the growth of open / openish knowledge grinds to a halt.
It's easy to say this, and I am definitely down on the boomers, but I'd say the folks running Europe before WWI were probably a worse generation than the boomers.
TSLA investors don't care (as long as Musk is still there to keep them believing). Years of bad news, and the stock is only 10% off it's all time highs.
I maintain a fork of sqlite-vec (because there hasn't been activity on the main repo for more than a year): sqlite-vec is great for smaller dimensionality or smaller cardinality datasets, but know that it's brute-force, and query latency scales exactly linearly. You only avoid full table scans if you add filterable columns to your vec0 table and include them in your WHERE clause. There's no probabilistic lookup algorithm in sqlite-vec.
You're absolutely right—sqlite-vec currently only supports brute-force search, and its latency does scale linearly with dataset size. We did some rough comparisons using its benchmark tools: on the SIFT dataset, latency was around 100ms; on GIST, it was closer to 1000ms. In contrast, with zvec's HNSW implementation, we get ~1ms latency on SIFT and ~3ms on GIST, while achieving recall@100 of 99.9% on SIFT and 97.7% on GIST.
You're right that we didn't include sqlite-vec in our initial benchmarks—apples-to-apples comparisons are always better. I've actually added basic zvec tests to my fork of sqlite-vec (https://github.com/luoxiaojian/sqlite-vec), so feel free to give it a try. We'll also be publishing a more complete performance comparison in an upcoming blog post—stay tuned!
You joke but you’d be amazed at the Reddit front page. It’s hard to tell anymore if the comments are even people, but I have noticed many fake posts of some Trump tweet he never actually made getting traction.
It’s so easy to verify his public statements. Did he really say that? Just go look.
Yet time and time again people get baited into rage mode. It’s more satisfying to post than it is to do 30 seconds of research.
reply