Guide to Evaluating Large Language Models: Metrics and Best Practices
A model is only as good as the metrics used to evaluate it. Large Language Models (LLMs) have transformed AI with their
Swarm: The Agentic Framework from OpenAI
OpenAI recently made an unexpected move by unveiling Swarm, an experimental and lightweight framework designed to simplify the creation of multi-agent workflows.
OpenAI o1-preview: A Detailed Analysis
OpenAI finally broke the silence and released the much-anticipated “o1-preview.” And there’s a lot to unpack. As an AI start-up whose bread
Building Devin-like SWE Agents using Composio and OpenAI
In March, Cognition Labs’ announcement of Devin—the software engineering agent—caught the eye of developers, founders, and investors alike. The idea of automating
Function Calling Optimizations (GPT-4 vs Opus vs Haiku vs Sonnet)
Code: https://github.com/SamparkAI/Composio-Function-Calling-Benchmark/. New: Check out updated model scores with GPT-4o. In the last blog, we introduced the ClickUp function calling benchmark and experimented
Improving GPT-4 Function Calling Accuracy
Join our Discord community and check out what we’re building! We just published Part 2 of the blog comparing gpt-4-turbo vs opus