This is where I think we need better tooling around tiered validation - there's probably quite a bit you can run locally if we had the right separation; splitting the cheap validation from the expensive has compounding benefits for LLMs.
If you found two disjoint sections that seemed positive on their own, did you try looping both separately in the same model? Wondering how localized the structures are.
For all the places it's bad at, AI has been fantastic for making targeted data experiences a lot more accessible to build (see MotherDuck and dives, etc), as long as you can keep the actual data access grounded. Years of tableau/looker have atrophied my creativity a bit, trying to get back to having more fun.
Nice! I’ve been working on https://treeseek.ca which is a different use case from most of the other open data tree sites I’ve seen — I want to be instantly geolocated and shown the nearest trees to me. I do a lot of walking and am often mesmerized by a particular tree, and I wanted something to help me identify them as quickly as possible, with more confidence and speed than e.g. iNaturalist (which i do also use).
This is an app that’s been bouncing around in my head for over a decade but finally got it working well enough for my own purposes about a year and a half ago.
Oh that's great! I was finding fun tree collections and wanted to go see them - unfortunately not in SF so not likely - but your app has some nice data around me that I can check out! Are you primarily using OSM data?
I was thinking of a google maps kind of "here you are, here's your walking path of interesting trees" potentially, or something else that could tie the overview to the street experience - on the backlog!
So the tree data itself mainly comes from municipal open data, just like yours does. Street Trees datasets are pretty common across cities. I just added SF yesterday after replying here :)
Otherwise the map tiles are coming from OpenFreeMap [1] which are indeed based on OSM.
Next steps I'm interested in are including economic + ecological benefits of the trees, highlighting potential pests / invasive species, maybe some other basic info about the species sourced from Wikipedia.
I like how you've got different icons for different types of trees; I've been thinking about how to encode DBH data as well but haven't settled on anything yet.
A belated - nice to see this, I think an intermediate IR is the right way to gate access rather than raw SQL (full disclosure; also hacking on open-source solutions in that space) - how are you balancing expressiveness vs control? Some of the more impressive text to SQL demos rely on agents being able to do fairly complicated calculations - which it seems like IL could support - but I'm still seeing the IL example as containing some of the footguns they also hit - join type (inner/vs outer, group by etc). How are you balancing having enough safe moves to be useful, while not introducing unsafe combinations of moves?
There's a lot of useful autonomous things that don't require unrestricted outbound communication, but agreed that the "safe" claw configuration probably falls quite a bit short of the popular perception of a full AI assistant at this point.
Experimenting with visual/audio combinations to explore aspects of a space dataset I’ve been having lots of fun with. Added in a LLM chat view with Duck DB WASM as well to try out tool use - text to SQL seems to be relatively solved with a light semantic layer; some interesting optimization around what tools to expose and result handling that need some more iteration.
I'd really love a minimalist version, I'm not sure how small it's feasible for them to shrink it. As long as it doesn't get bigger and devices keep getting faster, I suppose?
DuckDB can't get enough love! Beyond being a really performant database, being so friendly [sql and devx] is really underrated and hard to pull off well and is key to it being so fun - you get a compounding ecosystem because it's so easy to get started. Hoping that they can manage to keep it vibrant without it slowing down the pace of innovation at all.
The web/WASM integration is also fabulous. Looking forward to more "small engines" getting into that space to provide some competition and keep pushing it forward.
reply