The AI Productivity Index

The AI Productivity Index (APEX) assesses whether frontier models are capable of performing economically valuable tasks across four jobs: investment banking associate, management consultant, big law associate, and primary care physician (MD).

GPT 5 (High)

GPT 5 (High)

67% ± 2.4%

GPT 5.2 Pro (High)

GPT 5.2 Pro (High)

66.8% ± 2.6%

Gemini 3 Pro (High)

Gemini 3 Pro (High)

64.3% ± 2.3%

50%
60%
70%
80%
GPT 5 (High)

GPT 5 (High)

56.1% ± 3.3%

o3 Pro (High)

o3 Pro (High)

55.2% ± 3.2%

GPT 5.1 (High)

GPT 5.1 (High)

55.1% ± 3.2%

50%
60%
70%