Overview
Base speech-to-text endpoint using x402 payments. Provide either a YouTube URL/ID or a direct audio file URL; the server downloads the audio, runs speech-to-text, and returns the transcript text. Pricing: 0.01 USDC per started minute of audio, computed from the estimated duration and charged upfront via x402. Default discovery price without a specific URL is 0.60 USDC as an upper-bound quote. When called without a specific URL, the endpoint returns a default discovery quote. Clients should re-run discovery with a concrete URL to get an exact quote based on the real audio duration.
Protocol
x402
Price
$0.60
Payment Asset
USD Coin
Payment Network
Base
Category
uncategorized
Provider
56e8ec99-70ab-4ac7-ac99-95eadb1013b6-00-4jxhwwfedokw
Indexed
2026-03-02 01:12:43
Health
Status
degraded
Latency (p50)
611ms
Uptime (30d)
100.0%
Reliability Score
90/100
Last Checked
2026-07-04 18:44:27
Last Healthy
2026-04-08 15:24:20
Consecutive Failures
0
Input Schema
{
"bodyFields": {
"file_url": {
"description": "Direct URL to an audio file (e.g. .mp3, .webm). Must be an http(s) URL. For paid calls, either youtube or file_url is required.",
"required": false,
"type": "string"
},
"youtube": {
"description": "YouTube video URL (e.g. https://www.youtube.com/watch?v=...) or 11-character video ID. For paid calls, either youtube or file_url is required.",
"required": false,
"type": "string"
}
},
"bodyType": "json",
"method": "POST",
"type": "http"
}
Output Schema
{
"action": {
"description": "Action performed. For this endpoint: 'stt'.",
"type": "string"
},
"duration_minutes": {
"description": "Duration rounded up to whole minutes, used for pricing (0.01 USDC per minute).",
"type": "number"
},
"duration_seconds": {
"description": "Audio duration in seconds when available (YouTube metadata or estimated from file size).",
"type": "number"
},
"file_url": {
"description": "Direct file URL that was processed when source = 'file_url'.",
"type": "string"
},
"ok": {
"description": "True on success.",
"type": "boolean"
},
"price_usdc": {
"description": "Price charged for this call in USDC (6 decimals). Computed as 0.01 USDC per started minute of audio.",
"type": "number"
},
"source": {
"description": "Source type used for transcription: 'youtube' or 'file_url'.",
"type": "string"
},
"text": {
"description": "Full transcript text returned by the speech-to-text engine (top-level 'text' field).",
"type": "string"
},
"video_id": {
"description": "YouTube video ID that was processed.",
"type": "string"
},
"youtube_url": {
"description": "Resolved YouTube URL that was processed.",
"type": "string"
}
}