OpenAI o3-pro vs Claude 4 Opus vs Gemini 2.5 Pro
OpenAI finally unveiled their much expensive O3, the O3-Pro. The model is available in their API and Pro plan, which costs $200
Gemini 2.5 Pro vs. Claude 4 Sonnet: Coding comparison
The Gemini 2.5 Pro and Sonnet are two series of models that are arguably ahead of the competition in terms of coding
Claude Code vs. OpenAI Codex
Claude Code and OpenAI Codex are two prominent command-line interface (CLI) agents for pair programming. In this blog post, we will compare
Claude 4 Opus vs. Gemini 2.5 pro vs. OpenAI o3: Coding comparison
The Claude 4 series is here. Finally, Anthropic has given us the prized Opus, the model that became everyone’s darling overnight. After
Gemini 2.5 Pro vs. Claude 3.7 Sonnet: Coding Comparison
Google just launched Gemini 2.5 Pro on March 26th, claiming to be the best in coding, reasoning and overall everything. But I
Deepseek v3 0324 vs. Claude 3.7 Sonnet: Coding Comparison
Deepseek has silently released a bombshell update to the Deepseek v3 base model. And surprisingly, it went under the carpet amid the
Deepseek v3 0324: Finally, the Sonnet 3.5 at Home
Deepseek v3 o324, a new checkpoint, has been released by Deepseek in silence, with no marketing or hype, just a tweet, a
CoT Reasoning Models – Which One Reigns Supreme in 2025?
A comprehensive analysis for o3-Mini-High vs Claude Sonnet 3.7 Thinking vs Grok 3 Think vs Deep Seek R1 on multiple reasoning, math,
OpenAI GPT-4.5 vs. Claude 3.7 Sonnet
After so long, OpenAI finally unveiled GPT-4.5, its biggest-ever base model. The initial vibe checks from taste testers have been outstanding. The
Claude 3.7 Sonnet thinking vs. Deepseek r1
So, Anthropic finally broke the silence and released Claude 3.7 Sonnet, a hybrid model that can think step-by-step like a thinking model
Claude 3.7 Sonnet vs. Grok 3 vs. o3-mini-high
Just a week after Grok’s release, we now have the Claude 3.7 Sonnet, which certainly has eaten into Grok’s hype pie. Grok was definitely
Notes on the new Deepseek v3
Deepseek released their flagship model, v3, a 607B mixture-of-experts model with 37B active parameters. Currently, it is the best open-source model, beating