LLM Archives

Claude, LLM

Gemini 2.5 Pro vs. Claude 4 Sonnet: Coding comparison

Gemini 2.5 Pro and Claude Sonnet are two model series that are arguably ahead of the competition in terms of coding

Shrijal · June 11, 2025 · 8 min read

Claude, LLM, OpenAI

Claude Code vs. OpenAI Codex

Claude Code and OpenAI Codex are two prominent command-line interface (CLI) agents for pair programming. In this blog post, we will compare

Harsh · May 30, 2025 · 8 min read

Claude, LLM

Claude 4 Opus vs. Gemini 2.5 Pro vs. OpenAI o3: Coding comparison

The Claude 4 series is here. Finally, Anthropic has given us the prized Opus, the model that became everyone’s darling overnight. After

Shrijal · May 26, 2025 · 13 min read

LLM

Qwen 3 vs. Deepseek r1: Complete comparison

The Alibaba Qwen team has recently released the Qwen 3 series, including two standout models: the 235B-parameter MoE model (with 22B active parameters) and

Harsh · May 4, 2025 · 16 min read

LLM

OpenAI o3 vs. Gemini 2.5 Pro vs. o4-mini

OpenAI o3 and o4-mini are out. They are two state-of-the-art reasoning models. They’re expensive, multimodal, and super efficient at tool use. Significantly,

Shrijal · April 22, 2025 · 13 min read

LLM

GPT-4.1 vs. Deepseek v3 vs. Sonnet 3.7 vs. GPT-4.5

GPT 4.1 is here. As many speculated, the mysterious Quasar Alpha on OpenRouter was GPT-4.1, and Optimus Alpha was GPT-4.1 mini. And

Sunil Kumar Dash · April 17, 2025 · 26 min read

LLM

Notes on Llama 4: The Hits, the Misses, and the Disasters

Llama 4 is here, and this time the Llama family has three different models: Llama 4 Scout, Maverick, and Behemoth. While

Sunil Kumar Dash · April 10, 2025 · 10 min read

Claude, Gemini, LLM

Gemini 2.5 Pro vs. Claude 3.7 Sonnet: Coding Comparison

Google just launched Gemini 2.5 Pro on March 26th, claiming it is the best at coding, reasoning, and just about everything else. But I

Shrijal · March 30, 2025 · 8 min read

Gemini, LLM

Gemini 2.5 Pro vs. Claude 3.7 Sonnet (thinking) vs. Grok 3 (think)

Google dropped its best-ever creation, Gemini 2.5 Pro Experimental, on March 25. It is a stupidly incredible reasoning model shining on every

Sunil Kumar Dash · March 28, 2025 · 21 min read

Claude, Deepseek, LLM

Deepseek v3 0324 vs. Claude 3.7 Sonnet: Coding Comparison

Deepseek has silently released a bombshell update to the Deepseek v3 base model. Surprisingly, it flew under the radar amid the

Harsh · March 27, 2025 · 12 min read

AI Use Case, Claude, Deepseek, LLM

Deepseek v3 0324: Finally, the Sonnet 3.5 at Home

Deepseek v3 0324, a new checkpoint, has been released by Deepseek in silence, with no marketing or hype, just a tweet, a

Sunil Kumar Dash · March 26, 2025 · 11 min read

LLM

Gemma 3 27b vs. QwQ 32b vs. Mistral 24b vs. Deepseek r1

While everyone is occupied with the next best frontier model, the smaller models often get ignored. Yes, we are calling 32b models

Shrijal · March 20, 2025 · 13 min read
