based on the fact that quantization is so effective (which basically means there's still a lot of waste in today's LLMs), I think we have at least another 5x efficiency to go for the same intelligence compared to today's models
see https://docs.unsloth.ai/basics/unsloth-dynamic-ggufs-on-aider-polyglot
Login to reply