Evaluating Local Models with Custom Datasets
I wanted to share some notes on comparing open local models. I started with public benchmarks and leaderboards, covering both open and closed datasets, and got a sense of which models did well on them. But given these models...
Jan 30, 2025 · 5 min read
