Blog: https://openai.com/12-days/)
OpenAI announced two new large language models, o3 and o3-mini, showcasing significantly improved performance on various benchmarks, including coding, mathematics, and reasoning tasks. These models surpass previous models (like o1) in accuracy and efficiency. While not yet publicly released, OpenAI is initiating public safety testing, inviting researchers to help evaluate the models' safety and identify potential issues before wider release. o3-mini is particularly notable for its cost-effectiveness, achieving comparable performance to o1 at a fraction of the cost. The company also highlighted advancements in its safety testing procedures, employing a new "deliberative alignment" technique to improve the accuracy of safety evaluations.
ai , artificial intelligence , arxiv , research , paper , publication , llm, genai, generative ai , large visual models, large language models, large multi modal models, nlp, text, machine learning, ml, nividia, openai, anthropic, microsoft, google, technology, cutting-edge, meta, llama, chatgpt, gpt, elon musk, sam altman, deployment, engineering, scholar, science, apple, samsung, anthropic, turing