docs: refresh groq audio docs

This commit is contained in:
Peter Steinberger
2026-04-04 10:01:12 +01:00
parent ac8d91edff
commit 459ede5a7e
2 changed files with 13 additions and 3 deletions

View File

@@ -129,6 +129,7 @@ Note: Binary detection is best-effort across macOS/Linux/Windows; ensure the CLI
## Notes & limits
- Provider auth follows the standard model auth order (auth profiles, env vars, `models.providers.*.apiKey`).
- Groq setup details: [Groq](/providers/groq).
- Deepgram picks up `DEEPGRAM_API_KEY` when `provider: "deepgram"` is used.
- Deepgram setup details: [Deepgram (audio transcription)](/providers/deepgram).
- Mistral setup details: [Mistral](/providers/mistral).

View File

@@ -55,12 +55,13 @@ export GROQ_API_KEY="gsk_..."
Groq also provides fast Whisper-based audio transcription. When configured as a
media-understanding provider, OpenClaw uses Groq's `whisper-large-v3-turbo`
model to transcribe voice messages.
model to transcribe voice messages through the shared `tools.media.audio`
surface.
```json5
{
media: {
understanding: {
tools: {
media: {
audio: {
models: [{ provider: "groq" }],
},
@@ -75,6 +76,14 @@ If the Gateway runs as a daemon (launchd/systemd), make sure `GROQ_API_KEY` is
available to that process (for example, in `~/.openclaw/.env` or via
`env.shellEnv`).
## Audio notes
- Shared config path: `tools.media.audio`
- Default Groq audio base URL: `https://api.groq.com/openai/v1`
- Default Groq audio model: `whisper-large-v3-turbo`
- Groq audio transcription uses the OpenAI-compatible `/audio/transcriptions`
path
## Available models
Groq's model catalog changes frequently. Run `openclaw models list | grep groq`