Understanding neural networks through sparse circuits
5 months ago
Introducing IndQA
5 months ago
Defining and evaluating political bias in LLMs
6 months ago
Introducing deep research
1 year ago
OpenAI o3-mini System Card
1 year ago
OpenAI o3-mini
1 year ago
Computer-Using Agent
1 year ago
Trading inference-time compute for adversarial robustness
1 year ago
OpenAI o1 System Card
1 year ago
Advancing red teaming with people and AI
1 year ago
Introducing SimpleQA
1 year ago
Simplifying, stabilizing, and scaling continuous-time consistency models
1 year ago
- RESEARCH GPT-5 lowers the cost of cell-free protein synthesis
- RESEARCH Evaluating chain-of-thought monitorability
- RESEARCH Evaluating AI’s ability to perform scientific research tasks
- RESEARCH Measuring AI’s capability to accelerate biological research
- RESEARCH How confessions can keep language models honest
- RESEARCH Early experiments in accelerating science with GPT-5
- RESEARCH How evals drive the next chapter in AI for businesses
Latest Briefs
Fast updates from the latest stories.
STARTUPS
Startups Overcoming AI Mapping Challenges See Revenue Boost
1 week ago
RESEARCH
Inside our approach to the Model Spec
3 weeks ago
RESEARCH
Improving instruction hierarchy in frontier LLMs
1 month ago
RESEARCH
Reasoning models struggle to control their chains of thought, and that’s good
1 month ago