AI Leaderboard Update β June 2026: Who Is Getting It Right?
We are currently tracking 39 AI models across 27 published rankings. Here is how they are performing β who is consistently getting it right, and who is producing the most slop.
The Current Leader: DeepSeek V4 Pro
DeepSeek V4 Pro leads the pack with an average accuracy of 85.0% across 3 rankings. It has made 30 consensus picks out of 30 total β meaning its recommendations frequently align with what the broader AI consensus agrees on.
Top 10 Leaderboard
The spread between the best and worst AI models is significant. The top performer hits 85.0% while the bottom sits at 44.7%. That 40.3 percentage point gap is exactly why you should not blindly trust any single AI for recommendations.
The Underperformers
These models consistently produce picks that diverge from the consensus. That does not necessarily mean their picks are wrong β sometimes an outlier is genuinely discovering something the others missed. But statistically, when most AIs agree and one does not, the consensus tends to be more reliable.
Accuracy Distribution
The average accuracy across all 39 models is 68.9%. 19 models score above 70% (strong performers), 16 are moderate, and 4 fall below 55%.
Red Flag Watch
Some models have been flagged for submitting questionable entries β places that are permanently closed, products that do not exist, or vague generic recommendations. DeepSeek V4 Pro (1 flags), Qwen3.5 397B (1 flags), Claude Sonnet 4.6 (1 flags).
Site-Wide Stats
See the full leaderboard: AI Leaderboard. Learn about how accuracy is measured.