Skip to main content
Chibi supports a wide range of AI providers. You can configure which models to use for text, images, and voice, as well as fine-tune their behavior.

Master API Keys

To use a specific provider, you must set its corresponding API key in your .env file.
ProviderVariableDescription
OpenAIOPENAI_API_KEYRequired for GPT-5, o3, DALL-E 3, Whisper.
AnthropicANTHROPIC_API_KEYRequired for Claude 4.5 models.
Google GeminiGEMINI_API_KEYRequired for Gemini 2.5/3.0 and Imagen 4.0.
DeepSeekDEEPSEEK_API_KEYRequired for DeepSeek Chat.
MistralMISTRALAI_API_KEYRequired for Mistral models.
xAI (Grok)GROK_API_KEYRequired for Grok models.
AlibabaALIBABA_API_KEYRequired for Qwen (text) and Wan (image) models.
CloudflareCLOUDFLARE_API_KEYRequired for Workers AI models.
CloudflareCLOUDFLARE_ACCOUNT_IDRequired if using Cloudflare. Your Account ID.
MiniMaxMINIMAX_API_KEYRequired for MiniMax models (Text, Image & Speech).
ZhipuAIZHIPUAI_API_KEYRequired for GLM models (Chinese LLM leader).
MoonshotMOONSHOTAI_API_KEYRequired for Kimi models.
ElevenLabsELEVEN_LABS_API_KEYRequired for ElevenLabs TTS.
SunoSUNO_API_ORG_API_KEYRequired for Suno music generation (via sunoapi.org).
Google SearchGOOGLE_SEARCH_API_KEYRequired for Google Search tool.
Google SearchGOOGLE_SEARCH_CXRequired Custom Search Engine ID.

Model Configuration

You can specify which models Chibi should use by default.
VariableDescriptionDefault
DEFAULT_MODELThe default LLM for text chat.None (Auto-select)
DEFAULT_PROVIDERThe default provider for text chat.None (Auto-select)
DEFAULT_IMAGE_MODELThe default model for image generation.None (Auto-select)
MODERATION_PROVIDERThe provider for command moderation.None
MODERATION_MODELThe model used for command moderation.None
MODELS_WHITELISTComma-separated list of allowed models. If empty, all are allowed.None

Available Models (Examples)

Please note that the full list of supported models is much larger, it is enormous. Here only few examples are provided:
  • OpenAI: gpt-5.2, gpt-5.1, o3, o4-mini
  • Anthropic: claude-sonnet-4-5-20250929, claude-haiku-4-5-20251001
  • Grok: grok-4-1-fast-reasoning, grok-4-1-reasoning, grok-beta
  • Gemini: gemini-2.5-pro, gemini-3-pro
  • Alibaba: qwen3-max, qwen-max
  • MiniMax: MiniMax-M2.5, MiniMax-M2.5-highspeed; Image-01 (image)
  • ZhipuAI: glm-5, glm-4-flash, glm-4, glm-4-vision

Text Generation Parameters

Fine-tune how the LLM generates text.
VariableDescriptionDefault
TEMPERATUREControls randomness (0.0 to 2.0).0.5
MAX_TOKENSThe maximum number of tokens to generate.32000
TIMEOUTRequest timeout in seconds.600
RETRIESNumber of retries on failure.3

Image Generation Settings

Configure the quality and dimensions of generated images.
VariableDescriptionDefault
IMAGE_SIZEDefault resolution (e.g., 1024x1024).1024x1024
IMAGE_QUALITYQuality setting (mostly for DALL-E 3).standard
IMAGE_ASPECT_RATIOAspect ratio (e.g., 16:9).16:9
IMAGE_GENERATIONS_LIMITDaily limit per user.5

Provider-Specific Image Sizes

Some providers require specific resolutions. You can override the default IMAGE_SIZE for them:
  • IMAGE_SIZE_NANO_BANANA (for Gemini Image models)
  • IMAGE_SIZE_IMAGEN (for Imagen 4.0)
  • IMAGE_SIZE_ALIBABA (for Wan/Qwen Image)

Voice & Audio (STT/TTS)

Configure Speech-to-Text (STT) and Text-to-Speech (TTS) capabilities.
VariableDescriptionDefault
STT_PROVIDERProvider for transcribing voice messages.None (Auto-select)
STT_MODELModel used for transcription.None (Auto-select)
TTS_PROVIDERProvider for generating voice responses.None (Auto-select)
TTS_MODELModel used for speech generation.None (Auto-select)

MiniMax TTS Specific Settings

For MiniMax Text-to-Speech, you can configure the following:
VariableDescriptionDefault
MINIMAX_TTS_MODELThe specific MiniMax TTS model to use.speech-2.8-turbo
MINIMAX_TTS_VOICEThe voice to use for MiniMax TTS.Korean_HaughtyLady