Blog
Claude Code vs. OpenAI Codex
Claude Code and OpenAI Codex are two prominent command-line interface (CLI) agents for pair programming. In this blog post, we will compare
May 30, 2025
Claude 4 Opus vs. Gemini 2.5 Pro vs. OpenAI o3: Coding comparison
The Claude 4 series is here. Finally, Anthropic has given us the prized Opus, the model that became everyone’s darling overnight. After
May 26, 2025
Qwen 3 vs. Deepseek r1: Complete comparison
The Alibaba Qwen team recently released the Qwen 3 series, including two standout models: the 235B-parameter MoE model (with 22B active parameters) and
May 4, 2025
OpenAI o3 vs. Gemini 2.5 Pro vs. o4-mini
OpenAI o3 and o4-mini are out. They are two state-of-the-art reasoning models. They’re expensive, multimodal, and super efficient at tool use. Significantly,
April 22, 2025
GPT-4.1 vs. Deepseek v3 vs. Sonnet 3.7 vs. GPT-4.5
GPT-4.1 is here. As many speculated, the mysterious Quasar Alpha on OpenRouter was GPT-4.1, and Optimus Alpha was GPT-4.1 mini. And
April 17, 2025
Notes on Llama 4: The Hits, the Misses, and the Disasters
Llama 4 is here, and this time the Llama family has three different models: Llama 4 Scout, Maverick, and Behemoth. While
April 10, 2025
Gemini 2.5 Pro vs. Claude 3.7 Sonnet: Coding Comparison
Google just launched Gemini 2.5 Pro on March 26th, claiming it is the best at coding, reasoning, and just about everything else. But I
March 30, 2025
Gemini 2.5 Pro vs. Claude 3.7 Sonnet (thinking) vs. Grok 3 (think)
Google dropped its best-ever creation, Gemini 2.5 Pro Experimental, on March 25. It is a stupidly incredible reasoning model shining on every
March 28, 2025
Deepseek v3 0324 vs. Claude 3.7 Sonnet: Coding Comparison
Deepseek has quietly released a bombshell update to the Deepseek v3 base model. And surprisingly, it went under the radar amid the
March 27, 2025
Deepseek v3 0324: Finally, the Sonnet 3.5 at Home
Deepseek v3 0324, a new checkpoint, has been released by Deepseek in silence, with no marketing or hype, just a tweet, a
March 26, 2025
Gemma 3 27b vs. QwQ 32b vs. Mistral 24b vs. Deepseek r1
While everyone is occupied with the next best frontier model, the smaller models often get ignored. Yes, we are calling 32b models
March 20, 2025
Grok 3 vs. GPT 4.5
If you’re closely following the AI scene, you know xAI and OpenAI are currently arch-nemeses. And there’s no point in
March 13, 2025