
One of the other PhD students in my department has an NDSS 2026 paper about combining the strengths of both LLMs and traditional decompilers! https://lukedramko.github.io/files/idioms.pdf

Could you clarify what you mean about Nix missing concurrency and parallelism? I often run builds using nix-output-monitor and it definitely looks like things are running in parallel, although I could be mistaken.

I meant as part of the language itself, not just the runtime or for specific parts. Say I'm processing 100 JSON files: it'd be great if I could fire that off wrapped in 'parallel' or whatever, similar to Clojure and others, I guess.
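To make the shape concrete, here's roughly what I have in mind, sketched in Python with concurrent.futures rather than Nix (the directory and per-file work are hypothetical, just an illustration):

  import json
  from concurrent.futures import ProcessPoolExecutor
  from pathlib import Path

  def process(path):
      # placeholder per-file work: parse one JSON file and summarize it
      with open(path) as f:
          data = json.load(f)
      return path.name, len(data)

  if __name__ == "__main__":
      files = sorted(Path("data").glob("*.json"))  # hypothetical directory of ~100 files
      with ProcessPoolExecutor() as pool:
          # the "just wrap it in parallel" step I wish the Nix language offered
          results = list(pool.map(process, files))

As far as I know there's no equivalent of Clojure's pmap inside the Nix expression language itself; you have to shell out or lean on the build scheduler for that kind of fan-out.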

I think you can do this with Virgil, but I'm having trouble finding the exact doc page at the moment: https://github.com/titzer/virgil


The description is in the paper, but not all of it is implemented.

https://arxiv.org/abs/2410.11094

Bradley implemented a prototype of the packing solver, but it doesn't handle the full generality of what is proposed in the paper.



I tried this five years ago back when I was an engineer on the PyTorch project, and it didn't work well enough to be worth it. Has it improved since then?


It works well enough that I didn’t realize this wasn’t first party till right now.


It works, but there are a fair number of caveats, especially for someone working on things like PyTorch: the runtime is close but not the same, and its support for certain architectures etc. can create annoying bugs.


For me, no. I spent days trying to get it to recreate a production environment workflow. It is too different from production.


It has. It's improved to work with ~75% of steps, and it's fast enough to be worth trying before a push.


This sounds cool but is extremely uninteresting without performance measurements. Are there any?


Same question but for Jai.


Jai does not compile to C. It has a bytecode representation that is used primarily for compile-time execution of code, a native backend used mostly for iteration speed and debug builds, and an LLVM target for optimized release builds.


Noob question: if it just compiles to threads, is there any need for special syntax in the first place? My understanding was that no language support should be required for blocking on a thread.


One advantage is that it gives you the opportunity to move to a more sophisticated implementation later without breaking backwards compatibility (assuming the abstraction does not leak).


Async/await should do a little more under the hood than what the typical OS threading APIs provide, for example forwarding function parameters and return values automatically instead of making the user write their own boilerplate structs for that.
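A rough illustration in Python rather than C (the work/awork helpers are made up for the example): with a raw thread you have to smuggle the result out through shared state yourself, which is roughly what the boilerplate struct does for you in C, whereas with async/await the arguments and return value flow through directly.

  import asyncio
  import threading

  def work(x):
      return x * 2

  # Raw OS-style thread: the return value of work() is discarded,
  # so we have to stash it in shared state ourselves.
  result_box = {}

  def run_and_store():
      result_box["r"] = work(21)

  t = threading.Thread(target=run_and_store)
  t.start()
  t.join()
  print(result_box["r"])  # 42

  # async/await: parameters and the return value are forwarded for you.
  async def awork(x):
      return x * 2

  print(asyncio.run(awork(21)))  # 42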


Hey Eric, great to see you've now published this! I know we chatted about this briefly last year, but it would be awesome to see how the performance of jax-js compares against that of other autodiff tools on a broader and more standard set of benchmarks: https://github.com/gradbench/gradbench


For sure! It looks like this is benchmarking the autodiff CPU time, not the actual kernels, though, which (correct me if I'm wrong) isn't really relevant for an ML library; it's more useful if you have a really complex scientific expression.


Nope, both are measured! In fact, the time to do the autodiff transformation isn't even reflected in the charts shown on the README and the website; those charts only show the time to actually run the computations.


Hm okay, this seems like an interesting set of benchmarks. Let me know if there's anything I can do to help make jax-js more compatible with your Docker setup.


It should be fairly straightforward; feel free to open a PR following the instructions in CONTRIBUTING.md :)


I don't think this is straightforward, but it may be a skill issue on my part. It would require dockerizing headless Chrome with WebGPU support, dynamically injecting custom bundled JavaScript into the page, and then extracting the results with Chrome IPC.


Ahh no you're right, I forgot about the difficulties for GPU specifically; apologies for my overly curt earlier message. More accurately: I think this is definitely possible (Troels and I have talked a bit about this previously) and I'd be happy to work together if this is something you're interested in. I probably won't work on this if you're not interested on your end, though.


I'm a big fan of svg-term myself: https://github.com/marionebl/svg-term-cli


Hm, very interesting! This only converts asciinema recordings, though, right? It doesn't automatically record anything?


If you already have asciinema installed, you can invoke it through svg-term like this:

  svg-term --command 'cowsay hey there'
But that has the aforementioned issues about not pausing enough, so I usually just record with asciinema first and then invoke svg-term.

