Gemma 2, From bare metal to a 70B model: infrastructure set-up and scripts, Evaluating Open LLMs with MixEval, a paper on A Fully Open, Vision-Centric Exploration of Multimodal LLMs, and more!
Deep Learning Weekly: Issue 360