AI llama.cpp merges speculative checkpointing and local AI inference takes a significant leap forward 4 min 1.6K views