Blog
Notes on the new Deepseek r1
Before anything, let’s bow to Richard Sutton; he was so early for this. Pure RL, neither Monte-Carlo tree search (MCTS) nor Process
January 24, 2025
Notes on the new Deepseek v3
Deepseek released their flagship model, v3, a 607B mixture-of-experts model with 37B active parameters. Currently, it is the best open-source model, beating
January 1, 2025
Gemini 2.0 vs Flash vs OpenAI o1 and Claude 3.5 Sonnet
Google has finally woken up and decided to drop the bombshell Gemini 2.0, completing the AI trifecta. Google has launched two new
December 17, 2024
OpenAI o1 vs Claude 3.5 Sonnet: Which One’s Really Worth Your $20?
It’s been a week since OpenAI o1 was out of preview, and along with that, OpenAI has also introduced a new tier,
December 12, 2024
Notes on Chatgpt Search: Better than Perplexity?
After much speculation, OpenAI launched the Chatgpt Search feature. It can now search the Internet based on your query and find relevant
November 8, 2024
Notes on Anthropic’s Computer Use Ability
Anthropic has updated its Haiku and Sonnet lineup. Now, we have Haiku 3.5—a smaller model that outperforms Opus 3, the former state-of-the-art—and
October 23, 2024
Function Calling Optimizations (GPT4 vs Opus vs Haiku vs Sonnet)
Code: https://github.com/SamparkAI/Composio-Function-Calling-Benchmark/. New: Checkout updated model scores with GPT-4o In the last blog, we introduced the ClickUp function calling benchmark and experimented
May 12, 2024