mirror of
https://github.com/moltbot/moltbot.git
synced 2026-04-18 04:04:27 +00:00
4.6 KiB
4.6 KiB
summary, read_when, title
| summary | read_when | title | ||
|---|---|---|---|---|
| Deepgram transcription for inbound voice notes |
|
Deepgram |
Deepgram (Audio Transcription)
Deepgram is a speech-to-text API. In OpenClaw it is used for inbound audio/voice note
transcription via tools.media.audio.
When enabled, OpenClaw uploads the audio file to Deepgram and injects the transcript
into the reply pipeline ({{Transcript}} + [Audio] block). This is not streaming;
it uses the pre-recorded transcription endpoint.
| Detail | Value |
|---|---|
| Website | deepgram.com |
| Docs | developers.deepgram.com |
| Auth | DEEPGRAM_API_KEY |
| Default model | nova-3 |
Getting started
Add your Deepgram API key to the environment:```
DEEPGRAM_API_KEY=dg_...
```
```json5
{
tools: {
media: {
audio: {
enabled: true,
models: [{ provider: "deepgram", model: "nova-3" }],
},
},
},
}
```
Send an audio message through any connected channel. OpenClaw transcribes it
via Deepgram and injects the transcript into the reply pipeline.
Configuration options
| Option | Path | Description |
|---|---|---|
model |
tools.media.audio.models[].model |
Deepgram model id (default: nova-3) |
language |
tools.media.audio.models[].language |
Language hint (optional) |
detect_language |
tools.media.audio.providerOptions.deepgram.detect_language |
Enable language detection (optional) |
punctuate |
tools.media.audio.providerOptions.deepgram.punctuate |
Enable punctuation (optional) |
smart_format |
tools.media.audio.providerOptions.deepgram.smart_format |
Enable smart formatting (optional) |