#benchmark

Evaluating Local Models with Custom Datasets

I wanted to share some information about comparing open local models. I started with public benchmarks and leaderboards, even open and closed datasets, etc. - and got a sense of which models did well on these public benchmarks. But given these models...

Jan 30, 20255 min read23

Evaluating Local Models with Custom Datasets

Command Palette