Explore Models For Conversational AI Development

Dive into the world of ChatBotKit's AI models. Here we showcase a wide array of sophisticated AI technologies, each designed to cater to different aspects of conversational AI and chatbot development.

gpt-4.5-next

GPT-4.5 excels at tasks that benefit from creative, open-ended thinking and conversation, such as writing, learning, or exploring new ideas.

Technical Details

{
  "provider": "openai",
  "family": "gpt-4.5",
  "features": [
    "chat",
    "functions"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 112000,
  "maxOutputTokens": 16000,
  "pricing": {
    "tokenRatio": 8.3333
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta
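
In the entries on this page that list token limits, maxInputTokens and maxOutputTokens add up to maxTokens, which suggests the context window is split into a prompt budget and a reply budget. The following is a minimal Python sketch of checking that split, assuming this reading of the fields is correct. The helper names `budget_is_consistent` and `fits_input_budget` are hypothetical and not part of any ChatBotKit SDK.

```python
import json

# Token limits for gpt-4.5-next, copied from the entry above (other fields trimmed).
GPT_45_NEXT = json.loads("""
{
  "maxTokens": 128000,
  "maxInputTokens": 112000,
  "maxOutputTokens": 16000
}
""")


def budget_is_consistent(model: dict) -> bool:
    """Return True when the input and output budgets sum to maxTokens."""
    return model["maxInputTokens"] + model["maxOutputTokens"] == model["maxTokens"]


def fits_input_budget(model: dict, prompt_tokens: int) -> bool:
    """Return True when a prompt of prompt_tokens fits the input budget."""
    return 0 <= prompt_tokens <= model["maxInputTokens"]


print(budget_is_consistent(GPT_45_NEXT))        # True
print(fits_input_budget(GPT_45_NEXT, 120_000))  # False
```

A prompt of 120,000 tokens is below maxTokens but above maxInputTokens, so under this assumption it would not fit.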

gpt-4.5-classic

GPT-4.5 excels at tasks that benefit from creative, open-ended thinking and conversation, such as writing, learning, or exploring new ideas.

Technical Details

{
  "provider": "openai",
  "family": "gpt-4.5",
  "features": [
    "chat",
    "functions"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 112000,
  "maxOutputTokens": 16000,
  "pricing": {
    "tokenRatio": 8.3333
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

gpt-4.5

GPT-4.5 excels at tasks that benefit from creative, open-ended thinking and conversation, such as writing, learning, or exploring new ideas.

Technical Details

{
  "provider": "openai",
  "family": "gpt-4.5",
  "features": [
    "chat",
    "functions"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 112000,
  "maxOutputTokens": 16000,
  "pricing": {
    "tokenRatio": 8.3333
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

o3-mini-next

o3-mini is a cost-efficient reasoning model that's optimized for coding, math, and science, and supports tools and Structured Outputs.

Technical Details

{
  "provider": "openai",
  "family": "o3",
  "features": [
    "chat",
    "functions"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 200000,
  "maxInputTokens": 100000,
  "maxOutputTokens": 100000,
  "pricing": {
    "tokenRatio": 0.2444
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

o3-mini-classic

o3-mini is a cost-efficient reasoning model that's optimized for coding, math, and science, and supports tools and Structured Outputs.

Technical Details

{
  "provider": "openai",
  "family": "o3",
  "features": [
    "chat",
    "functions"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 200000,
  "maxInputTokens": 100000,
  "maxOutputTokens": 100000,
  "pricing": {
    "tokenRatio": 0.2444
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

o3-mini

o3-mini is a cost-efficient reasoning model that's optimized for coding, math, and science, and supports tools and Structured Outputs.

Technical Details

{
  "provider": "openai",
  "family": "o3",
  "features": [
    "chat",
    "functions"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 200000,
  "maxInputTokens": 100000,
  "maxOutputTokens": 100000,
  "pricing": {
    "tokenRatio": 0.2444
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

o1-next

o1 is OpenAI's most powerful reasoning model that supports tools, Structured Outputs, and vision. The model has 200K context and an October 2023 knowledge cutoff.

Technical Details

{
  "provider": "openai",
  "family": "o1",
  "features": [
    "chat"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 124000,
  "maxOutputTokens": 4000,
  "pricing": {
    "tokenRatio": 3.3333
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

o1-classic

o1 is OpenAI's most powerful reasoning model that supports tools, Structured Outputs, and vision. The model has 200K context and an October 2023 knowledge cutoff.

Technical Details

{
  "provider": "openai",
  "family": "o1",
  "features": [
    "chat"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 124000,
  "maxOutputTokens": 4000,
  "pricing": {
    "tokenRatio": 3.3333
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

o1

o1 is OpenAI's most powerful reasoning model that supports tools, Structured Outputs, and vision. The model has 200K context and an October 2023 knowledge cutoff.

Technical Details

{
  "provider": "openai",
  "family": "o1",
  "features": [
    "chat"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 124000,
  "maxOutputTokens": 4000,
  "pricing": {
    "tokenRatio": 3.3333
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

o1-mini-next

o1-mini is a fast, cost-efficient reasoning model tailored to coding, math, and science use cases. The model has 128K context and an October 2023 knowledge cutoff.

Technical Details

{
  "provider": "openai",
  "family": "o1",
  "features": [
    "chat"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 124000,
  "maxOutputTokens": 4000,
  "pricing": {
    "tokenRatio": 0.6667
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

o1-mini-classic

o1-mini is a fast, cost-efficient reasoning model tailored to coding, math, and science use cases. The model has 128K context and an October 2023 knowledge cutoff.

Technical Details

{
  "provider": "openai",
  "family": "o1",
  "features": [
    "chat"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 124000,
  "maxOutputTokens": 4000,
  "pricing": {
    "tokenRatio": 0.6667
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

o1-mini

o1-mini is a fast, cost-efficient reasoning model tailored to coding, math, and science use cases. The model has 128K context and an October 2023 knowledge cutoff.

Technical Details

{
  "provider": "openai",
  "family": "o1",
  "features": [
    "chat"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 124000,
  "maxOutputTokens": 4000,
  "pricing": {
    "tokenRatio": 0.6667
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

gpt-4o-mini-next

GPT-4o mini is OpenAI's most cost-efficient small model that's smarter and cheaper than GPT-3.5 Turbo, and has vision capabilities. The model has 128K context and an October 2023 knowledge cutoff.

Technical Details

{
  "provider": "openai",
  "family": "gpt-4o",
  "features": [
    "chat",
    "functions",
    "vision"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 124000,
  "maxOutputTokens": 4000,
  "pricing": {
    "tokenRatio": 0.0333
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

gpt-4o-mini-classic

GPT-4o mini is OpenAI's most cost-efficient small model that's smarter and cheaper than GPT-3.5 Turbo, and has vision capabilities. The model has 128K context and an October 2023 knowledge cutoff.

Technical Details

{
  "provider": "openai",
  "family": "gpt-4o",
  "features": [
    "chat",
    "functions",
    "vision"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 124000,
  "maxOutputTokens": 4000,
  "pricing": {
    "tokenRatio": 0.0333
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

gpt-4o-mini

GPT-4o mini is OpenAI's most cost-efficient small model that's smarter and cheaper than GPT-3.5 Turbo, and has vision capabilities. The model has 128K context and an October 2023 knowledge cutoff.

Technical Details

{
  "provider": "openai",
  "family": "gpt-4o",
  "features": [
    "chat",
    "functions",
    "vision"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 124000,
  "maxOutputTokens": 4000,
  "pricing": {
    "tokenRatio": 0.0333
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

gpt-4o-next

GPT-4o is faster and cheaper than GPT-4 Turbo with stronger vision capabilities. The model has 128K context and an October 2023 knowledge cutoff.

Technical Details

{
  "provider": "openai",
  "family": "gpt-4o",
  "features": [
    "chat",
    "functions",
    "vision"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 124000,
  "maxOutputTokens": 4000,
  "pricing": {
    "tokenRatio": 0.5556
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

gpt-4o-classic

GPT-4o is faster and cheaper than GPT-4 Turbo with stronger vision capabilities. The model has 128K context and an October 2023 knowledge cutoff.

Technical Details

{
  "provider": "openai",
  "family": "gpt-4o",
  "features": [
    "chat",
    "functions",
    "vision"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 124000,
  "maxOutputTokens": 4000,
  "pricing": {
    "tokenRatio": 0.8333
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

gpt-4o

GPT-4o is faster and cheaper than GPT-4 Turbo with stronger vision capabilities. The model has 128K context and an October 2023 knowledge cutoff.

Technical Details

{
  "provider": "openai",
  "family": "gpt-4o",
  "features": [
    "chat",
    "functions",
    "vision"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 124000,
  "maxOutputTokens": 4000,
  "pricing": {
    "tokenRatio": 0.5556
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}
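
The page does not define pricing.tokenRatio. One plausible reading is that it is a multiplier that converts raw model tokens into billed platform tokens, so a higher ratio means a more expensive model. The sketch below works under that assumption only. `billed_tokens` is a hypothetical helper, and the ratios are copied from the three gpt-4o entries above.

```python
# tokenRatio values copied from the gpt-4o entries above.
TOKEN_RATIOS = {
    "gpt-4o-next": 0.5556,
    "gpt-4o-classic": 0.8333,
    "gpt-4o": 0.5556,
}


def billed_tokens(model_name: str, raw_tokens: int) -> float:
    """Scale raw tokens by the model's tokenRatio (assumed to be a multiplier)."""
    return raw_tokens * TOKEN_RATIOS[model_name]


# Compare what the same 10,000-token exchange would cost on each variant.
for name in TOKEN_RATIOS:
    print(f"{name}: {billed_tokens(name, 10_000):.0f}")
```

If the assumption holds, the same exchange costs about 50% more on gpt-4o-classic than on gpt-4o or gpt-4o-next.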

gpt-4-turbo-next

GPT-4 Turbo is offered at 128K context with an April 2023 knowledge cutoff and basic support for vision.

Technical Details

{
  "provider": "openai",
  "family": "gpt-4-turbo",
  "features": [
    "chat",
    "functions",
    "vision"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 124000,
  "maxOutputTokens": 4000,
  "pricing": {
    "tokenRatio": 1.6667
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

gpt-4-turbo-classic

GPT-4 Turbo is offered at 128K context with an April 2023 knowledge cutoff and basic support for vision.

Technical Details

{
  "provider": "openai",
  "family": "gpt-4-turbo",
  "features": [
    "chat",
    "functions"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 124000,
  "maxOutputTokens": 4000,
  "pricing": {
    "tokenRatio": 1.6667
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

gpt-4-turbo

GPT-4 Turbo is offered at 128K context with an April 2023 knowledge cutoff and basic support for vision.

Technical Details

{
  "provider": "openai",
  "family": "gpt-4-turbo",
  "features": [
    "chat",
    "functions",
    "vision"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 124000,
  "maxOutputTokens": 4000,
  "pricing": {
    "tokenRatio": 1.6667
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}
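
The features array is what distinguishes otherwise similar entries: here gpt-4-turbo-classic omits "vision" while the other two variants list it. The following small Python sketch filters by required features, using only the feature lists copied from the three entries above. `supports` is a hypothetical helper name.

```python
# Feature lists copied from the three gpt-4-turbo entries above.
FEATURES = {
    "gpt-4-turbo-next": ["chat", "functions", "vision"],
    "gpt-4-turbo-classic": ["chat", "functions"],
    "gpt-4-turbo": ["chat", "functions", "vision"],
}


def supports(model_name: str, required: set) -> bool:
    """Return True when the model lists every required feature."""
    return required <= set(FEATURES[model_name])


# Keep only the variants that can handle both chat and image input.
vision_models = [name for name in FEATURES if supports(name, {"chat", "vision"})]
print(vision_models)  # ['gpt-4-turbo-next', 'gpt-4-turbo']
```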

gpt-4-next

The GPT-4 model was built with broad general knowledge and domain expertise.

Technical Details

{
  "provider": "openai",
  "family": "gpt-4",
  "features": [
    "chat",
    "functions"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 8192,
  "maxInputTokens": 6144,
  "maxOutputTokens": 2048,
  "pricing": {
    "tokenRatio": 3.3333
  },
  "interactionMaxMessages": 100,
  "temperature": 0.7,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

gpt-4-classic

The GPT-4 model was built with broad general knowledge and domain expertise.

Technical Details

{
  "provider": "openai",
  "family": "gpt-4",
  "features": [
    "chat",
    "functions"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 8192,
  "maxInputTokens": 6144,
  "maxOutputTokens": 2048,
  "pricing": {
    "tokenRatio": 3.3333
  },
  "interactionMaxMessages": 100,
  "temperature": 0.7,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

gpt-4

The GPT-4 model was built with broad general knowledge and domain expertise.

Technical Details

{
  "provider": "openai",
  "family": "gpt-4",
  "features": [
    "chat",
    "functions"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 8192,
  "maxInputTokens": 6144,
  "maxOutputTokens": 2048,
  "pricing": {
    "tokenRatio": 3.3333
  },
  "interactionMaxMessages": 100,
  "temperature": 0.7,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

gpt-3.5-turbo-next

GPT-3.5 Turbo is a fast and inexpensive model for simpler tasks.

Technical Details

{
  "provider": "openai",
  "family": "gpt-3.5-turbo",
  "features": [
    "chat",
    "functions"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 16384,
  "maxInputTokens": 12288,
  "maxOutputTokens": 4096,
  "pricing": {
    "tokenRatio": 0.0833
  },
  "interactionMaxMessages": 100,
  "temperature": 0.7,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

gpt-3.5-turbo-classic

GPT-3.5 Turbo is a fast and inexpensive model for simpler tasks.

Technical Details

{
  "provider": "openai",
  "family": "gpt-3.5-turbo",
  "features": [
    "chat",
    "functions"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 4096,
  "maxInputTokens": 3072,
  "maxOutputTokens": 1024,
  "pricing": {
    "tokenRatio": 0.2222
  },
  "interactionMaxMessages": 100,
  "temperature": 0.7,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

gpt-3.5-turbo

GPT-3.5 Turbo is a fast and inexpensive model for simpler tasks.

Technical Details

{
  "provider": "openai",
  "family": "gpt-3.5-turbo",
  "features": [
    "chat",
    "functions"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 16384,
  "maxInputTokens": 12288,
  "maxOutputTokens": 4096,
  "pricing": {
    "tokenRatio": 0.0833
  },
  "interactionMaxMessages": 100,
  "temperature": 0.7,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

gpt-3.5-turbo-instruct

GPT-3.5 Turbo is a fast and inexpensive model for simpler tasks.

Technical Details

{
  "provider": "openai",
  "family": "gpt-3.5-turbo",
  "features": [],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 4096,
  "maxInputTokens": 3072,
  "maxOutputTokens": 1024,
  "pricing": {
    "tokenRatio": 0.1111
  },
  "interactionMaxMessages": 100,
  "temperature": 0.7,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

mistral-large-latest

Top-tier reasoning for high-complexity tasks. The most powerful model of the Mistral AI family.

Technical Details

{
  "provider": "mistral",
  "family": "large",
  "features": [
    "chat",
    "functions"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 32000,
  "maxInputTokens": 24000,
  "maxOutputTokens": 8000,
  "pricing": {
    "tokenRatio": 0.6667
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

mistral-small-latest

Cost-efficient reasoning for low-latency workloads.

Technical Details

{
  "provider": "mistral",
  "family": "large",
  "features": [
    "chat",
    "functions"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 32000,
  "maxInputTokens": 24000,
  "maxOutputTokens": 8000,
  "pricing": {
    "tokenRatio": 0.1667
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

deepseek-r1-distill-llama-70b

Top-tier reasoning for high-complexity tasks. The most powerful model of the DeepSeek AI family.

Technical Details

{
  "provider": "groq",
  "family": "chat",
  "features": [
    "chat",
    "functions"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 96000,
  "maxOutputTokens": 32000,
  "pricing": {
    "tokenRatio": 0.055
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

llama-3.3-70b-versatile

Llama 3.3 is an auto-regressive language model that uses an optimized transformer architecture.

Technical Details

{
  "provider": "groq",
  "family": "chat",
  "features": [
    "chat",
    "functions"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 96000,
  "maxOutputTokens": 32000,
  "pricing": {
    "tokenRatio": 0.0439
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

sonar-deep-research

Deep Research conducts comprehensive, expert-level research and synthesizes it into accessible, actionable reports.

Technical Details

{
  "provider": "perplexity",
  "family": "chat",
  "features": [
    "chat"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 120000,
  "maxOutputTokens": 8000,
  "pricing": {
    "tokenRatio": 0.4444
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

sonar-reasoning-pro

Premier reasoning offering powered by DeepSeek R1 with Chain of Thought (CoT) and advanced search grounding.

Technical Details

{
  "provider": "perplexity",
  "family": "chat",
  "features": [
    "chat"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 120000,
  "maxOutputTokens": 8000,
  "pricing": {
    "tokenRatio": 0.4444
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

sonar-reasoning

Premier reasoning offering powered by DeepSeek R1 with Chain of Thought (CoT).

Technical Details

{
  "provider": "perplexity",
  "family": "chat",
  "features": [
    "chat"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 120000,
  "maxOutputTokens": 8000,
  "pricing": {
    "tokenRatio": 0.2778
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

sonar-pro

Premier search offering with search grounding, supporting advanced queries and follow-ups.

Technical Details

{
  "provider": "perplexity",
  "family": "chat",
  "features": [
    "chat"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 120000,
  "maxOutputTokens": 8000,
  "pricing": {
    "tokenRatio": 0.8333
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

sonar

Lightweight offering with search grounding, quicker and cheaper than Sonar Pro.

Technical Details

{
  "provider": "perplexity",
  "family": "chat",
  "features": [
    "chat"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 120000,
  "maxOutputTokens": 8000,
  "pricing": {
    "tokenRatio": 0.0556
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

gemini-2.0-flash

A capable multi-modal model with great performance across all tasks, with a 1 million token context window, and built for the era of Agents.

Technical Details

{
  "provider": "google",
  "family": "chat",
  "features": [
    "chat",
    "functions"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 1048576,
  "maxInputTokens": 1040384,
  "maxOutputTokens": 8192,
  "pricing": {
    "tokenRatio": 0.0222
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

gemini-2.0-flash-lite

The smallest and most cost-effective model, built for at-scale usage.

Technical Details

{
  "provider": "google",
  "family": "chat",
  "features": [
    "chat",
    "functions"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 1048576,
  "maxInputTokens": 1040384,
  "maxOutputTokens": 8192,
  "pricing": {
    "tokenRatio": 0.0167
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

gemini-1.5-flash

Fast multi-modal model with great performance for diverse, repetitive tasks and a 1 million token context window.

Technical Details

{
  "provider": "google",
  "family": "chat",
  "features": [
    "chat",
    "functions"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 1048576,
  "maxInputTokens": 1040384,
  "maxOutputTokens": 8192,
  "pricing": {
    "tokenRatio": 0.0167
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

gemini-1.5-pro

Highest intelligence Gemini 1.5 series model, with a breakthrough 2 million token context window.

Technical Details

{
  "provider": "google",
  "family": "chat",
  "features": [
    "chat",
    "functions"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 2097152,
  "maxInputTokens": 2088960,
  "maxOutputTokens": 8192,
  "pricing": {
    "tokenRatio": 0.2778
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta
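
The Gemini entries differ mainly in context size and tokenRatio, so one natural selection rule is to pick the lowest-ratio model whose input budget still fits the prompt. The following is a minimal sketch, assuming a lower tokenRatio means lower cost (the page does not say so explicitly). `cheapest_fit` is a hypothetical helper, and the numbers are copied from the four entries above.

```python
from typing import Optional

# (maxInputTokens, tokenRatio) copied from the four Gemini entries above.
GEMINI = {
    "gemini-2.0-flash": (1_040_384, 0.0222),
    "gemini-2.0-flash-lite": (1_040_384, 0.0167),
    "gemini-1.5-flash": (1_040_384, 0.0167),
    "gemini-1.5-pro": (2_088_960, 0.2778),
}


def cheapest_fit(prompt_tokens: int) -> Optional[str]:
    """Return the lowest-ratio model whose input budget fits the prompt."""
    candidates = [
        (ratio, name)
        for name, (max_input, ratio) in GEMINI.items()
        if prompt_tokens <= max_input
    ]
    # Ties on ratio are broken alphabetically by model name.
    return min(candidates)[1] if candidates else None


print(cheapest_fit(500_000))    # gemini-1.5-flash
print(cheapest_fit(1_500_000))  # gemini-1.5-pro
print(cheapest_fit(3_000_000))  # None
```

Only gemini-1.5-pro lists an input budget above roughly one million tokens, so it is the only candidate for the second prompt despite its higher ratio.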

claude-v3-opus

Anthropic's most powerful AI model, with top-level performance on highly complex tasks. It can navigate open-ended prompts and sight-unseen scenarios with remarkable fluency and human-like understanding.

Technical Details

{
  "provider": "bedrock",
  "family": "claude",
  "features": [
    "chat"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 200000,
  "maxInputTokens": 150000,
  "maxOutputTokens": 50000,
  "pricing": {
    "tokenRatio": 4.1667
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

claude-v3.5-sonnet

Anthropic's most intelligent and advanced model, Claude 3.5 Sonnet, demonstrates exceptional capabilities across a diverse range of tasks and evaluations while also outperforming Claude 3 Opus.

Technical Details

{
  "provider": "bedrock",
  "family": "claude",
  "features": [
    "chat"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 200000,
  "maxInputTokens": 150000,
  "maxOutputTokens": 50000,
  "pricing": {
    "tokenRatio": 0.8333
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

claude-v3-sonnet

Claude 3 Sonnet strikes the ideal balance between intelligence and speed, particularly for enterprise workloads. It offers maximum utility and is engineered to be dependable for scaled AI deployments.

Technical Details

{
  "provider": "bedrock",
  "family": "claude",
  "features": [
    "chat"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 200000,
  "maxInputTokens": 150000,
  "maxOutputTokens": 50000,
  "pricing": {
    "tokenRatio": 0.8333
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

claude-v3.5-haiku

Anthropic's fastest, most compact model for near-instant responsiveness. It answers simple queries and requests with speed.

Technical Details

{
  "provider": "bedrock",
  "family": "claude",
  "features": [
    "chat"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 200000,
  "maxInputTokens": 150000,
  "maxOutputTokens": 50000,
  "pricing": {
    "tokenRatio": 0.2222
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

claude-v3-haiku

Anthropic's fastest, most compact model for near-instant responsiveness. It answers simple queries and requests with speed.

Technical Details

{
  "provider": "bedrock",
  "family": "claude",
  "features": [
    "chat"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 200000,
  "maxInputTokens": 150000,
  "maxOutputTokens": 50000,
  "pricing": {
    "tokenRatio": 0.0694
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

claude-v3

Technical Details

{
  "provider": "bedrock",
  "family": "claude",
  "features": [
    "chat"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 200000,
  "maxInputTokens": 150000,
  "maxOutputTokens": 50000,
  "pricing": {
    "tokenRatio": 0.8333
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

claude-v2.1

Claude 2.1 is a large language model (LLM) by Anthropic with a 200K token context window, reduced hallucination rates, and improved accuracy over long documents.

Technical Details

{
  "provider": "bedrock",
  "family": "claude",
  "features": [],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 200000,
  "maxInputTokens": 150000,
  "maxOutputTokens": 50000,
  "pricing": {
    "tokenRatio": 1.3333
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

claude-v2

Claude 2.0 is a leading LLM from Anthropic that enables a wide range of tasks from sophisticated dialogue and creative content generation to detailed instruction.

Technical Details

{
  "provider": "bedrock",
  "family": "claude",
  "features": [],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 100000,
  "maxInputTokens": 75000,
  "maxOutputTokens": 25000,
  "pricing": {
    "tokenRatio": 1.3333
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

claude-instant-v1

Claude Instant is Anthropic's faster, lower-priced yet very capable LLM.

Technical Details

{
  "provider": "bedrock",
  "family": "claude",
  "features": [],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 100000,
  "maxInputTokens": 75000,
  "maxOutputTokens": 25000,
  "pricing": {
    "tokenRatio": 0.1333
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

beta

custom

Any custom model created by the user.

Technical Details

{
  "provider": "chatbotkit",
  "family": "chatbotkit",
  "features": [],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 4096,
  "maxInputTokens": 3072,
  "maxOutputTokens": 1024,
  "pricing": {
    "tokenRatio": 0.0056
  },
  "interactionMaxMessages": 100,
  "temperature": 0,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

text-qaa-web-001

Fast and efficient question and answer model with web search grounding.

Technical Details

{
  "provider": "chatbotkit",
  "family": "chat",
  "features": [
    "chat"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 120000,
  "maxOutputTokens": 8000,
  "pricing": {
    "tokenRatio": 0.0556
  },
  "interactionMaxMessages": 4,
  "temperature": 0.2,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}
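
Unlike the general-purpose entries, which set interactionMaxMessages to 100, this model sets it to 4. The page does not spell out the semantics. Assuming the field caps how many of the most recent messages are passed to the model per interaction, the effect can be sketched as follows. `trim_history` and the message shape are hypothetical, not a ChatBotKit API.

```python
def trim_history(messages: list, max_messages: int) -> list:
    """Keep only the most recent max_messages entries of a conversation."""
    if max_messages <= 0:
        return []
    return messages[-max_messages:]


# Six messages in, four out: the two oldest are dropped.
history = [{"type": "user", "text": f"message {i}"} for i in range(1, 7)]
recent = trim_history(history, max_messages=4)
print([m["text"] for m in recent])
# ['message 3', 'message 4', 'message 5', 'message 6']
```

A small cap like this would suit question-and-answer use, where only the latest exchange matters and older turns add cost without adding context.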

text-qaa-005

This model belongs to the GPT-4o mini family of ChatBotKit models. It is designed for question and answer applications. The model has a token limit of 128000 and provides a balance between cost and quality. It is a custom model based on the GPT model architecture.

Technical Details

{
  "provider": "chatbotkit",
  "family": "gpt-4o",
  "features": [
    "chat",
    "functions",
    "vision"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 124000,
  "maxOutputTokens": 4000,
  "pricing": {
    "tokenRatio": 0.8333
  },
  "interactionMaxMessages": 4,
  "forceFunction": "query",
  "temperature": 0.2,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

text-qaa-004

This model belongs to the GPT-4o family of ChatBotKit models. It is designed for question and answer applications. The model has a token limit of 128000 and provides a balance between cost and quality. It is a custom model based on the GPT model architecture.

Technical Details

{
  "provider": "chatbotkit",
  "family": "gpt-4o",
  "features": [
    "chat",
    "functions",
    "vision"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 124000,
  "maxOutputTokens": 4000,
  "pricing": {
    "tokenRatio": 0.8333
  },
  "interactionMaxMessages": 4,
  "forceFunction": "query",
  "temperature": 0.2,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

text-qaa-003

This model belongs to the GPT-4 Turbo family of ChatBotKit models. It is designed for question and answer applications. The model has a token limit of 128000 and provides a balance between cost and quality. It is a custom model based on the GPT model architecture.

Technical Details

{
  "provider": "chatbotkit",
  "family": "gpt-4-turbo",
  "features": [
    "chat",
    "functions",
    "vision"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 124000,
  "maxOutputTokens": 4000,
  "pricing": {
    "tokenRatio": 1.6667
  },
  "interactionMaxMessages": 4,
  "forceFunction": "query",
  "temperature": 0.2,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

text-qaa-002

This model belongs to the GPT-4 family of ChatBotKit models. It is designed for question and answer applications. The model has a token limit of 8192 and provides a balance between cost and quality. It is a custom model based on the GPT model architecture.

Technical Details

{
  "provider": "chatbotkit",
  "family": "gpt-4",
  "features": [
    "chat",
    "functions"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 8192,
  "maxInputTokens": 6144,
  "maxOutputTokens": 2048,
  "pricing": {
    "tokenRatio": 3.3333
  },
  "interactionMaxMessages": 4,
  "forceFunction": "query",
  "temperature": 0.2,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

text-qaa-001

This model belongs to the GPT-3.5 Turbo family of ChatBotKit models. It is designed for question and answer applications. The model has a token limit of 4096 and provides a balance between cost and quality. It is a custom model based on the GPT model architecture.

Technical Details

{
  "provider": "chatbotkit",
  "family": "gpt-3.5-turbo",
  "features": [
    "chat",
    "functions"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 4096,
  "maxInputTokens": 3072,
  "maxOutputTokens": 1024,
  "pricing": {
    "tokenRatio": 0.0833
  },
  "interactionMaxMessages": 4,
  "forceFunction": "query",
  "temperature": 0.2,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

text-algo-004

This model belongs to the GPT-4o family of ChatBotKit models.

Technical Details

{
  "provider": "chatbotkit",
  "family": "gpt-4o",
  "features": [
    "chat",
    "functions",
    "vision"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 128000,
  "maxInputTokens": 124000,
  "maxOutputTokens": 4000,
  "pricing": {
    "tokenRatio": 0.8333
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

text-algo-003

This model belongs to the GPT-4 family of ChatBotKit models.

Technical Details

{
  "provider": "chatbotkit",
  "family": "gpt-4",
  "features": [
    "chat",
    "functions"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 8192,
  "maxInputTokens": 6144,
  "maxOutputTokens": 2048,
  "pricing": {
    "tokenRatio": 3.3333
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

text-algo-002

This model belongs to the GPT-3.5 Turbo family of ChatBotKit models.

Technical Details

{
  "provider": "chatbotkit",
  "family": "gpt-3.5-turbo",
  "features": [
    "chat",
    "functions"
  ],
  "region": "us",
  "availableRegions": [
    "us"
  ],
  "maxTokens": 4096,
  "maxInputTokens": 3072,
  "maxOutputTokens": 1024,
  "pricing": {
    "tokenRatio": 0.0833
  },
  "interactionMaxMessages": 100,
  "temperature": 1,
  "frequencyPenalty": 0,
  "presencePenalty": 0
}

dalle3

This model is based on the DALL-E 3 architecture. It is a high-quality model that can generate images from text. It is tunable and offers a balance between cost and quality.

Technical Details

{
  "provider": "openai",
  "family": "dalle",
  "features": [],
  "region": "us",
  "availableRegions": [
    "us"
  ]
}

dalle2

This model is based on the DALL-E 2 architecture. It is a high-quality model that can generate images from text. It is tunable and offers a balance between cost and quality.

Technical Details

{
  "provider": "openai",
  "family": "dalle",
  "features": [],
  "region": "us",
  "availableRegions": [
    "us"
  ]
}

stablediffusion

This model is based on the Stable Diffusion architecture. It is a high-quality model that can generate images from text. It is tunable and offers a balance between cost and quality.

Technical Details

{
  "provider": "awsbedrock",
  "family": "stablediffusion",
  "features": [],
  "region": "us",
  "availableRegions": [
    "us"
  ]
}
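
The three image-model entries above carry only provider, family, features, and region fields, with no token limits, pricing, or sampling settings. Code that reads this catalog therefore has to treat those fields as optional. The following is a minimal Python sketch of one way to do that. The `ModelInfo` class and `parse_model` function are hypothetical names introduced for illustration.

```python
import json
from dataclasses import dataclass
from typing import Optional


@dataclass
class ModelInfo:
    provider: str
    family: str
    features: list
    max_tokens: Optional[int] = None  # absent for the image models


def parse_model(raw: str) -> ModelInfo:
    """Parse one Technical Details object, tolerating missing token limits."""
    data = json.loads(raw)
    return ModelInfo(
        provider=data["provider"],
        family=data["family"],
        features=data["features"],
        max_tokens=data.get("maxTokens"),  # None when the key is missing
    )


# Entry copied from the stablediffusion details above.
image_model = parse_model(
    '{ "provider": "awsbedrock", "family": "stablediffusion", "features": [], '
    '"region": "us", "availableRegions": [ "us" ] }'
)
print(image_model.max_tokens is None)  # True
```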
