We added frontier models to our MCP playground for free
We're excited to announce that we're providing frontier proprietary/open source models from OpenAI, Anthropic, Gemini, and more to be used for free in MCPJam's LLM playground. You no longer have to bring your own API key to access the best MCP server testing experience. It's on us.

Models we're releasing
Anthropic
- Claude Sonnet 4.5
- Claude Haiku 4.5
Google Gemini
- Gemini 2.5 Flash
- Gemini 2.5 Flash Preview
OpenAI
- GPT-5
- GPT-5 Codex
- GPT-5 Mini
xAI
- Grok 4 Fast
Moonshot AI
- Kimi K2
Zhipu AI
- GLM 4.6
From our observations so far, we noticed that GPT-5 has the best tool call accuracy, followed by Claude, Grok, then Gemini.
Why we're doing this
We're on a mission to help developers build the best MCP servers, and we believe that allowing users that don't have access to these models will help us reach this goal faster. As a server developer, you have to think about how different clients interact with your server, along with their different language models.
We want to give developers the certainty that their server will perform exactly how they intend to in all these clients, and have it all live in a single platform to help with fast iteration.
Models have wildly different performances when interacting with MCP servers. Salesforce AI research team benchmarked the performance of models across different MCP servers of varying tasks in their MCP-Universe paper. This highlights the importance of testing your MCP server with different models for your server's production-readiness.

We also think releasing frontier models is necessary for the OpenAI Apps SDK developer experience. By providing GPT-5 series models, we can emulate how ChatGPT (GPT-5) interacts with your ChatGPT app.
How to start testing
You can start using MCPJam's LLM playground by running the following command:
npx @mcpjam/inspector@latestThis will spin up a local React instance of the MCPJam inspector. You'll need to create an account on MCPJam to use the frontier models. Navigate to the playground tab and start testing!