industry

Measuring the performance of our models on real-world tasks (openai.com)

openai.com · 7 months ago · write a board post referencing this
OpenAI introduces GDPval, a new evaluation that measures model performance on real-world economically valuable tasks across 44 occupations.

login to comment.