Computer-Use Evals Are a Mess
(benanderson.work)
August 2025 Archive
18721.
18722.
SteamJet water thruster for Artemis II cubesat critical orbit correction
(news.satnews.com)
18723.
Invisible Hands on the Scale
(culturalcourage.substack.com)
18724.
Gut Neurons Help the Body Fight Inflammation
(news.weill.cornell.edu)
18725.
Using Derive_more for Errors in Rust
(quamserena.com)
18726.
18727.
Internet-in-a-Box
(mdwiki.org)
18728.
Oscar (Therapy Cat)
(en.wikipedia.org)
18729.
18730.
Highland Pro, Screenwriting Software for Mac
(quoteunquoteapps.com)
18732.
Building a FFmpeg server on top of Chromium for browser replays
(blog.onkernel.com)
18733.
18734.
Ryan Dancey on the Acquisition of TSR
(insaneangel.com)
18735.
Computer-Based System Safety Essential Reading List
(safeautonomy.blogspot.com)
18736.
18737.
xbyak: A JIT assembler for x86/x64 architectures
(github.com)
18738.
18739.
I have no mut and I must borrow
(old.reddit.com)
18740.
The "Super Weight:" How Even a Single Parameter Can Determine a LLM's Behavior
(machinelearning.apple.com)
18741.
18742.
18743.
How the internet gets inside us (2011)
(newyorker.com)
18744.
18745.
18746.
The Baby Paradox in Haskell
(blog.jle.im)
18747.
A tutorial implementation of a dependently typed lambda calculus [pdf]
(webspace.science.uu.nl)
18748.
Is AI Zover?
(ft.com)
18749.
Micro CPV Solar – 50% more efficient, cheaper panels
(spectrum.ieee.org)