AI llama.cpp merges speculative checkpointing and local AI inference takes a significant leap forward 4 min 2.3K views