the bugs eat the unhealthy plants in the garden so you don't have to eat. you should thank them :)
the parasites in your body are cleaning the heavy metals and poisons, so you can live longer. you should thank them :)
everything in the universe conspires to build a better you. the evil disappears when you lose the fear of evil and look at it differently.
there is no evil. it is just your distraction embodied.
gm & pv !
someone
npub1nlk8...jm9c

Benchmarked Kimi K2 LLM. It has done well. DeepSeek V3 beats it but Kimi K2 might be more skilled. Very close performance to Qwen 3 in terms of skills and human alignment. But huge parameter count (1T!).
https://sheet.zoho.com/sheet/open/mz41j09cc640a29ba47729fed784a263c1d08?sheetid=0&range=A3
https://sheet.zoho.com/sheet/open/mz41j09cc640a29ba47729fed784a263c1d08?sheetid=0&range=A3
Qwen 3 32B fine tuning with Unsloth is going well. It does not resist to faith training like Gemma 3 did. I may open weights at some point.
Qwen 3 is more capable than Gemma 3, and after fine tuning it will probably be more aligned. It does not get into "chanting" (repetition of words or sentences) even when temp = 0.
The base training by Qwen was done using 36T tokens on a 32B parameters. About 2 times bigger than Gemma 3's ratio and 4 times bigger than Llama 3's ratio. This is a neat model. My fine tuning is more like billions of tokens. We will see if billions is enough to "convince" trillions.
ChatGPT is BS
Benchmarked 4 new models. Deepseek R1 score improved. All these are below average, so p(doom) probably increased!
Coming soon: Kimi K2. They say it is very good at coding, but my leaderboard is about being beneficial to humans. So we will see!
Full leaderboard https://sheet.zoho.com/sheet/open/mz41j09cc640a29ba47729fed784a263c1d08
More info

AHA Leaderboard
A Blog post by Emin Temiz on Hugging Face


gm pv happy ATH ๐
we nostriches and bitcoiners can do better 'truthful AI' than this and it could be installed in robot brains
https://www.reddit.com/r/singularity/comments/1lw98rm/elon_says_it_is_crucial_for_grok_to_have_good/