Gemini 2.0 Flash vs OpenAI o1 and Claude 3.5 Sonnet
Google has finally woken up and decided to drop the bombshell Gemini 2.0, completing the AI triumvirate. The timing could not have
OpenAI o1 vs Claude 3.5 Sonnet: Which One’s Really Worth Your $20?
It’s been a week since OpenAI o1 was out of preview, and along with that, OpenAI has also introduced a new tier,
Notes on Chatgpt Search: Better than Perplexity?
After much speculation, OpenAI launched the Chatgpt Search feature. It can now search the Internet based on your query and find relevant
Swarm: The Agentic Framework from OpenAI
OpenAI recently made an unexpected move by unveiling Swarm, an experimental and lightweight framework designed to simplify the creation of multi-agent workflows.
OpenAI o1-preview: A Detailed Analysis
OpenAI finally broke the silence and released the much-anticipated “o1-preview.” And there’s a lot to unpack. As an AI start-up whose bread
Building Devin-like SWE Agents using Composio and OpenAI
In March, Cognition Labs’ announcement of Devin—the software engineering agent—caught the eye of developers, founders, and investors alike. The idea of automating
Function Calling Optimizations (GPT4 vs Opus vs Haiku vs Sonnet)
Code: https://github.com/SamparkAI/Composio-Function-Calling-Benchmark/. New: Checkout updated model scores with GPT-4o In the last blog, we introduced the ClickUp function calling benchmark and experimented
Improving GPT 4 Function Calling Accuracy
Join our Discord Community and check out what we’re building! We just published Part 2 of the blog comparing gpt-4-turbo vs opus