Meta releases quantized Llama models, Hugging Face Evaluation Guidebook, a paper on Speculative Streaming: Fast LLM Inference Without Auxiliary Models, and many more!
Share this post
Deep Learning Weekly: Issue 377
Share this post
Meta releases quantized Llama models, Hugging Face Evaluation Guidebook, a paper on Speculative Streaming: Fast LLM Inference Without Auxiliary Models, and many more!