OpenAI launches open-source security inference model gpt-oss-safeguard, supporting policy-driven classification.

PANews reported on October 29th that OpenAI today released the open-source security inference model gpt-oss-safeguard ( 120b , 20b ), allowing developers to provide custom policies for content classification during inference, with the model outputting conclusions and inference chains. This model is fine-tuned based on the open-weight gpt-oss , licensed under the Apache 2.0 license, and can be downloaded from Hugging Face . Internal evaluations show that it outperforms gpt-5-thinking and gpt-oss in multi-policy accuracy, and its performance on external datasets is close to Safety Reasoner . Limitations include: traditional classifiers still outperform in scenarios with a large number of high-quality annotations, and higher inference time and computational cost. ROOST will establish a model community and release technical reports.

Share to:

Author: PA一线

This content is for informational purposes only and does not constitute investment advice.

Follow PANews official accounts, navigate bull and bear markets together
Recommended Reading
5 hour ago
8 hour ago
11 hour ago
14 hour ago
2025-12-12 15:30
2025-12-12 13:23

Popular Articles

Industry News
Market Trends
Curated Readings

Curated Series

App内阅读