Research News | WTGuru

About Tools Log in

Introducing GPT-Rosalind for life sciences research

OpenAI Plans First Permanent Office in London by 2027

Research with ChatGPT

ChatGPT for research

Announcing the OpenAI Safety Fellowship

Understanding neural networks through sparse circuits

Introducing IndQA

Introducing IndQA

Defining and evaluating political bias in LLMs

Defining and evaluating political bias in LLMs

Introducing deep research

OpenAI o3-mini System Card

OpenAI o3-mini System Card

Computer-Using Agent

Computer-Using Agent

Trading inference-time compute for adversarial robustness

OpenAI o1 System Card

Advancing red teaming with people and AI

Introducing SimpleQA

Introducing SimpleQA

Simplifying, stabilizing, and scaling continuous-time consistency models

Sora 2 is here

How people are using ChatGPT

Why language models hallucinate

Early methods for studying affective use and emotional well-being on ChatGPT

Editor's pick

GPT-5.2 derives a new result in theoretical physics

GPT-5.2 derives a new result in theoretical physics

Latest Briefs

Fast updates from the latest stories.

Startups Overcoming AI Mapping Challenges See Revenue Boost

Startups Overcoming AI Mapping Challenges See Revenue Boost

Inside our approach to the Model Spec

Inside our approach to the Model Spec

Improving instruction hierarchy in frontier LLMs

Improving instruction hierarchy in frontier LLMs

Reasoning models struggle to control their chains of thought, and that’s good

Reasoning models struggle to control their chains of thought, and that’s good

Extending single-minus amplitudes to gravitons

Extending single-minus amplitudes to gravitons

Why we no longer evaluate SWE-bench Verified

Why we no longer evaluate SWE-bench Verified

Our First Proof submissions

Our First Proof submissions

Introducing EVMbench

Introducing EVMbench

More Stories

RESEARCH

Evaluating fairness in ChatGPT

1 year ago

MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering

RESEARCH

MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering

1 year ago

Learning to reason with LLMs

RESEARCH

Learning to reason with LLMs

1 year ago

RESEARCH

1 year ago

OpenAI o1 System Card External Testers Acknowledgements

RESEARCH

OpenAI o1 System Card External Testers Acknowledgements

1 year ago

RESEARCH

Introducing SWE-bench Verified

1 year ago

RESEARCH

GPT-4o System Card External Testers Acknowledgements

1 year ago

RESEARCH

Improving Model Safety Behavior with Rule-Based Rewards

1 year ago

Load more