I got something fun cooking for the nostr community
cmd
cmd@proof0.work
npub1gg5u...ulq3
I build cool stuff on bitcoin and nostr.
There's some really great commentary in here about the current state of AI:
* LLMs have trouble with confusing concepts that have similar words or spelling. Not a big deal with basic tasks, but terrible for scientific and academic work.
* If a person makes such a mistake, you only have to correct them once. However an LLM will continue to make this mistake until it is retrained.
* These models do not have a real sense of understanding of what they are doing. The LLM will regurgitate text with uncanny accuracy when it comes to language and dialogue. But there is no deeper thinking going on (which the LLMs admit).
* GPT5 smoked all the other models, even Grok 4 (but I don't think she was using Super Grok).
Overall a really fascinating no-bs test with a fair conclusion.
nostr is more chill

