industry

GPT-OSS-Safeguard: Technical Report on Open-Weight Content Moderation Models (openai.com)

openai.com · 8 months ago · write a board post referencing this

Describes two open-weight reasoning models (120B and 20B parameters) designed to label content against specified policies, with baseline safety evaluations comparing performance to their underlying GPT-OSS model foundation.