Notes on Llama 4: The Hits, the Misses, and the Disasters
Llama 4 is here, and this time the Llama family has three different models: Llama 4 Scout, Maverick, and Behemoth. While
Gemini 2.5 Pro vs. Claude 3.7 Sonnet: Coding Comparison
Google just launched Gemini 2.5 Pro on March 26th, claiming it is the best at coding, reasoning, and just about everything else. But I
Gemini 2.5 Pro vs. Claude 3.7 Sonnet (thinking) vs. Grok 3 (think)
Google dropped its best-ever creation, Gemini 2.5 Pro Experimental, on March 25. It is a stupidly incredible reasoning model shining on every
Deepseek v3 0324 vs. Claude 3.7 Sonnet: Coding Comparison
Deepseek has silently released a bombshell update to the Deepseek v3 base model. And surprisingly, it went under the radar amid the
Deepseek v3 0324: Finally, the Sonnet 3.5 at Home
Deepseek v3 0324, a new checkpoint, has been released by Deepseek in silence, with no marketing or hype, just a tweet, a
Gemma 3 27b vs. QwQ 32b vs. Mistral 24b vs. Deepseek r1
While everyone is occupied with the next best frontier model, the smaller models often get ignored. Yes, we are calling 32b models
Grok 3 vs. GPT 4.5
If you’re closely following the AI scene, you know xAI and OpenAI are currently each other’s arch-nemeses. And there’s no point in
CoT Reasoning Models – Which One Reigns Supreme in 2025?
A comprehensive analysis of o3-Mini-High vs Claude Sonnet 3.7 Thinking vs Grok 3 Think vs DeepSeek R1 on multiple reasoning, math,
OpenAI GPT-4.5 vs. Claude 3.7 Sonnet
After so long, OpenAI finally unveiled GPT-4.5, its biggest-ever base model. The initial vibe checks from taste testers have been outstanding. The
Claude 3.7 Sonnet thinking vs. Deepseek r1
So, Anthropic finally broke the silence and released Claude 3.7 Sonnet, a hybrid model that can think step-by-step like a thinking model
Claude 3.7 Sonnet vs. Grok 3 vs. o3-mini-high
Just a week after Grok’s release, we now have Claude 3.7 Sonnet, which has certainly eaten into Grok’s hype pie. Grok was definitely
Grok 3 vs. Deepseek r1
After much anticipation, xAI has finally released the third iteration of Grok. It is apparently the smartest LLM in the world, scoring