Hi,
Is there an uncommitted OpenAI API-compatible evaluation script, given that the table also includes closed-source models? The provided Jupyter Notebook seems to work only with local HF models. An OpenAI API-compatible script would make it simpler to contribute final results to The Table. I can probably vibe-code one and open a PR, but I guess you already have something?