OpenAI Agents SDK vs LangGraph vs Autogen vs CrewAI
Finally, OpenAI gave in and launched a new agentic framework called Agents SDK. It’s a software development kit that lets developers build
Grok 3 vs. GPT 4.5
If you’re closely following the AI scene, you know XAI and OpenAI are currently each other’s arch-nemesis. And there’s no point in
CoT Reasoning Models – Which One Reigns Supreme in 2025?
A comprehensive analysis for o3-Mini-High vs Claude Sonnet 3.7 Thinking vs Grok 3 Think vs Deep Seek R1 on multiple reasoning, math,
OpenAI GPT-4.5 vs. Claude 3.7 Sonnet
After so long, OpenAI finally unveiled GPT-4.5, its biggest-ever base model. The initial vibe checks from taste testers have been outstanding. The
Claude 3.7 Sonnet vs. Grok 3 vs. o3-mini-high
Just a week after Grok’s release, we now have the Claude 3.7 Sonnet, which certainly has eaten into Grok’s hype pie. Grok was definitely
OpenAI o3-mini vs o1 vs Deepseek r1
OpenAI launched its latest model, the o3-mini, last Friday. It is the first member of the o3 family of models. There are
Notes on the new Deepseek r1
Before anything, let’s bow to Richard Sutton; he was so early for this. Pure RL, neither Monte-Carlo tree search (MCTS) nor Process
Notes on the new Deepseek v3
Deepseek released their flagship model, v3, a 607B mixture-of-experts model with 37B active parameters. Currently, it is the best open-source model, beating
Gemini 2.0 vs Flash vs OpenAI o1 and Claude 3.5 Sonnet
Google has finally woken up and decided to drop the bombshell Gemini 2.0, completing the AI trifecta. Google has launched two new
OpenAI o1 vs Claude 3.5 Sonnet: Which One’s Really Worth Your $20?
It’s been a week since OpenAI o1 was out of preview, and along with that, OpenAI has also introduced a new tier,
Notes on Chatgpt Search: Better than Perplexity?
After much speculation, OpenAI launched the Chatgpt Search feature. It can now search the Internet based on your query and find relevant
Swarm: The Agentic Framework from OpenAI
OpenAI recently made an unexpected move by unveiling Swarm, an experimental and lightweight framework designed to simplify the creation of multi-agent workflows.