Gemini 3.1 Pro Preview vs GPT-5.3 Codex (xhigh): Which Large Language Model Is Best?

Verdict: Gemini 3.1 Pro Preview wins by 3 points.

Gemini 3.1 Pro Preview takes the lead in this comparison, scoring 57 points to 54 for GPT-5.3 Codex (xhigh). The 3-point gap suggests that Gemini 3.1 Pro Preview has the edge in general intelligence as measured by our benchmark set.

For users focused on reasoning and coding capabilities, Gemini 3.1 Pro Preview from Google currently represents the state of the art. Its higher score indicates stronger aggregate performance across our benchmark set.

However, GPT-5.3 Codex (xhigh) remains a formidable contender. Ranked #3, it is still a top-tier choice. Both models are proprietary, so licensing does not separate them, but depending on your specific needs, such as ecosystem integration, GPT-5.3 Codex (xhigh) may still be the right tool for your pipeline.

Comparison Data

Feature	Gemini 3.1 Pro Preview	GPT-5.3 Codex (xhigh)
Rank	#1	#3
Score	57	54
Developer	Google	OpenAI
License	Proprietary	Proprietary
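
The verdict's arithmetic can be reproduced directly from the table. The sketch below is only an illustration: the scores are copied from the Score row, while the `SCORES` dictionary and the `score_gap` function are hypothetical names introduced here, not part of any leaderboard API.

```python
# Scores copied from the comparison table; the data layout and function
# name are illustrative assumptions, not a real leaderboard interface.
SCORES = {
    "Gemini 3.1 Pro Preview": 57,
    "GPT-5.3 Codex (xhigh)": 54,
}


def score_gap(scores: dict[str, int]) -> tuple[str, int]:
    """Return the highest-scoring model and its lead over the runner-up."""
    # Sort models by score, highest first.
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    (leader, top), (_, runner_up) = ranked[0], ranked[1]
    return leader, top - runner_up


leader, gap = score_gap(SCORES)
print(f"{leader} leads by {gap} points")
```

Run as written, this prints the same 3-point lead stated in the verdict. Rerunning it with updated scores is a quick way to check whether the verdict still holds after the leaderboard changes.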

Conclusion

Both models are excellent choices in the large language model landscape. New models are released frequently, so we recommend checking the full leaderboard for the most up-to-date rankings.