llama.cpp adds Multi-Token Prediction and doubles Qwen3.6 27B throughput for local inference