Jonathan's avatar
Jonathan
_@jonathansm.com
npub1uqee...jckg
Hacker, cypherpunk. All memes are my own.
Jonathan's avatar
jsm 7 months ago
Google’s latest version of Gemini 2.5 Pro is freakily smart. I was going to send an email pointing out where I thought someone was incorrect about the ability of moral facts to affect our brains backed up with research. I fed the email through Gemini 2.5 Pro to review it and after several back and forths became convinced that I was actually incorrect and completely misunderstanding the core of the argument.
Jonathan's avatar
jsm 7 months ago
Even Anthropic’s experts who work on their AI models are filing AI generated court documents with hallucinations in their own court case. This happens all the time but normally it’s done by lawyers who don’t really understand the technology and don’t even know it can hallucinate. Here it’s inexcusable because this guy makes these models and should really know better.
Jonathan's avatar
jsm 7 months ago
Random thought, why do we have a sex offender registry for only sex related crimes? I suppose I would want to know if I guy who molested kids moved in next door but I would also want to know if a guy who broke into multiple homes moved in next door. I don’t see what distinguishes sex related crimes from other crimes that warrants only sex crimes getting a registry. We should pick a standard and apply it equally, either every criminal goes on a public registry or no one does.
Jonathan's avatar
jsm 7 months ago
So after everyone (rightfully) freaking out about Kamala Harris imposing price controls on drugs now Trump has done the exact same thing and instituted drug price controls? I’m giving up. Nothing in the timeline we’ve fallen into makes sense anymore.
Jonathan's avatar
jsm 7 months ago
ChatGPT taking every opportunity to trip over itself complimenting how right and intelligent you are does have its upsides. Model refusal rates seem to be significantly lower.
Jonathan's avatar
jsm 7 months ago
I’m sitting next to an old man in a wheelchair who’s spent the last half hour watching the most mindless TikTok slop with the volume at full blast. There are so many things I don’t understand. I would be embarrassed to play anything out loud on my phone in a public place and would be mortified if it was brain rot. Apparently he just doesn’t care.
Jonathan's avatar
jsm 8 months ago
I used an LLM to brainstorm some deviously persuasive rhetorical tactics for an email and was pleased until I realized that they'll probably be used on me soon. The average human is not ready for intelligence equivalent to a top performing human to be unleashed on them to tailor every message and request to their precise idiosyncrasies.
Jonathan's avatar
jsm 8 months ago
Finally just some solid policy decisions. This should have been done a long time ago. The next big step should be enforcing data uploads in common formats that has to be hosted alongside the paper to make everything transparent and reproducible.
Jonathan's avatar
jsm 8 months ago
Google's LLMs are cracking me up today. I fed some text into Gemini 2.5 Pro that contained a misspelling. Gemini guessed that what the writer meant by "veriority" was probably "verisimilitude". Gemini has so much faith in human intelligence. Yes Gemini, the person who can't spell "variety" definitely actually meant "verisimilitude" and also definitely knows what that means. Gemma 3 seems a little too fine-tuned to be agentic, and kept hallucinating that it was checking web results even when it had no tools available. The amount of grovelling that Google RL'd it on is really quite impressive. It was hitting every part of an apology: acknowledging its precise mistake, self-flagellating, and promising to do better in the future.