OpenAI's MLE-bench, Log Datasets & Evaluate LLM Performance with Opik, An Opinionated Evals Reading List, a paper on Pyramidal Flow Matching for Efficient Video Generative Modeling, and many more!
Share this post
Deep Learning Weekly: Issue 375
Share this post
OpenAI's MLE-bench, Log Datasets & Evaluate LLM Performance with Opik, An Opinionated Evals Reading List, a paper on Pyramidal Flow Matching for Efficient Video Generative Modeling, and many more!