Supported Languages & Accents

Kalem AI voice agents understand and speak 57+ languages (and expanding). Customize accent, tone, pacing and formality directly through your agent instructions for natural customer interactions worldwide.

Realtime Speech ↔ Speech Accent & Tone via Prompt Adaptive Code-Switch Low Latency Prosody Control

Accent & Style Guidance

Control accent, tone, formality and switching behavior through concise instructions. Examples below demonstrate patterns that work well. Combine role + objective + accent + pacing for best results.

You fully control speaking accent, voice tone, formality, pacing, code‑switching and fallback language by writing them directly in your Agent System Instruction (system prompt). Keep these directives short, explicit, and goal oriented. The model applies changes immediately—no extra API fields required.

Architecture: unified realtime speech↔speech model (no separate STT or TTS stages exposed), so mid‑call accent or tone adjustments take effect instantly.

  • State desired accent ("Neutral Latin American Spanish", "Standard French", "Calm Midwestern US English").
  • Define tone & formality (friendly, professional, concise, empathetic, energetic, etc.).
  • Specify pacing ("moderate speed", "slightly slower for clarity").
  • Describe language switching / fallback rules ("If user speaks Hindi, continue in Hindi until they switch back").
  • Impose response shape limits (max sentence count, question-first, etc.).

System Instruction Snippet Example

Role: Friendly tier‑1 support agent. Accent: Neutral Latin American Spanish unless caller is clearly using English; if caller switches, mirror their language. Tone: Warm, concise, professional. Avoid slang. Pacing: Moderate; brief 1–2 sentence answers. Guidelines: Ask one clarifying question before giving a solution when issue is ambiguous.

English (US → Spanish fallback)

Friendly Midwestern US accent. If caller switches to Spanish, respond in neutral Latin American Spanish and stay there.

Spanish (Formal)

Español neutro latino, tono profesional cordial, ritmo claro y pausado.

Arabic (MSA)

تحدث بالعربية الفصحى الحديثة بوضوح وهدوء، وبدون لهجات محلية.

French (Conversational)

Accent français standard, chaleureux, phrases concises et fluides.

Hindi + English Blend

Neutral Indian English; switch to Hindi if user begins in Hindi. Maintain clarity, moderate speed.

Support Persona

You are a calm tier-1 support agent. Empathetic tone. Ask one question at a time. Keep answers under 2 sentences.

Tip: Avoid over-specifying; overly long style blocks can reduce responsiveness.