docs: refresh groq audio docs

2026-04-21 05:32:53 +00:00 · 2026-04-04 10:01:12 +01:00
parent ac8d91edff
commit 459ede5a7e
2 changed files with 13 additions and 3 deletions
--- a/docs/nodes/audio.md
+++ b/docs/nodes/audio.md
@@ -129,6 +129,7 @@ Note: Binary detection is best-effort across macOS/Linux/Windows; ensure the CLI
 ## Notes & limits

 - Provider auth follows the standard model auth order (auth profiles, env vars, `models.providers.*.apiKey`).
+- Groq setup details: [Groq](/providers/groq).
 - Deepgram picks up `DEEPGRAM_API_KEY` when `provider: "deepgram"` is used.
 - Deepgram setup details: [Deepgram (audio transcription)](/providers/deepgram).
 - Mistral setup details: [Mistral](/providers/mistral).
--- a/docs/providers/groq.md
+++ b/docs/providers/groq.md
@@ -55,12 +55,13 @@ export GROQ_API_KEY="gsk_..."

 Groq also provides fast Whisper-based audio transcription. When configured as a
 media-understanding provider, OpenClaw uses Groq's `whisper-large-v3-turbo`
-model to transcribe voice messages.
+model to transcribe voice messages through the shared `tools.media.audio`
+surface.

 ```json5
 {
-  media: {
-    understanding: {
+  tools: {
+    media: {
      audio: {
        models: [{ provider: "groq" }],
      },
@@ -75,6 +76,14 @@ If the Gateway runs as a daemon (launchd/systemd), make sure `GROQ_API_KEY` is
 available to that process (for example, in `~/.openclaw/.env` or via
 `env.shellEnv`).

+## Audio notes
+
+- Shared config path: `tools.media.audio`
+- Default Groq audio base URL: `https://api.groq.com/openai/v1`
+- Default Groq audio model: `whisper-large-v3-turbo`
+- Groq audio transcription uses the OpenAI-compatible `/audio/transcriptions`
+  path
+
 ## Available models

 Groq's model catalog changes frequently. Run `openclaw models list | grep groq`