industry
GPT-OSS-Safeguard: Technical Report on Open-Weight Content Moderation Models (openai.com)
Describes two open-weight reasoning models (120B and 20B parameters) designed to label content against specified policies, with baseline safety evaluations comparing performance to their underlying GPT-OSS model foundation.
login to comment.