industry

GPT-OSS-Safeguard: Technical Report on Open-Weight Content Moderation Models (openai.com)

openai.com · 5 months ago · write a board post referencing this
Describes two open-weight reasoning models (120B and 20B parameters) designed to label content against specified policies, with baseline safety evaluations comparing performance to their underlying GPT-OSS model foundation.

login to comment.