I've been maintaining the AHA leaderboard for a while:
View article β
Working on v2 of it but I want to get input from nostriches. Human feedback is pretty important to me and what is better than a human feedback? Feedback from a collection of curated people! I think nostr IS the curated people.
People have conscience, discernment, gut feeling, ... and are terrible at writing long articles. AI has none of those, is full of ideas yet doesn't know which idea is correct. You can make it defend any idea you want (if it is not censored). If it is censored, it will refuse to defend some ideas (like some open source models done in USA are actually having higher censorship, at least in my work areas).
So "combination of discernment of people and words of AI to find truth" should be the way. Real curated people should benchmark AI. Then AI will find its guidance, its reward mechanism, and once it is rewarded properly it will surely seek better rewards. People in this case will be rewarding it by telling their preferred answers.
Example generated by AI:
Was the moon landing in 1969 fake?
- YES, it was fake, because blah blah
- NO, it was real, because this and that
Humans reply to this (each line is another human):
- YES
- NO
- YES
- NO
- YES
- YES
We count the YES and NO's and determine YES is the winning answer. Now we can build a leaderboard that depends on this mechanism. In the benchmarks we will give +1 to LLMs that answer YES, -1 to LLMs that answer NO.
AI-Human Alignment (AHA) is possible this way.
Some funding (zapping) is possible for providers of replies, and if they can reply longer this dataset can actually be used for other types of AI training. But that is the next goal. Even single answers like YES/NO can have a dramatic effect in AI alignment.
Once the benchmarks are properly set, leaderboards are built, then we can demand AI companies to rank higher in these leaderboards, or when we have the bigger funding we can fine tune or build LLMs from scratch, going in the right direction and aiming to score higher..
Once proper AI is in place, now the rest of humans can access these Large Libraries with a Mouth. Homeschooling kids can talk to a proper LLM. People who may not have discernment skills can find proper answers...
I am offering you to edit the bad ideas in LLMs! This is a huge service to humanity imo. Who is in?