Transcribe audio files to text using Google Cloud Speech-to-Text. Supports multiple audio formats and languages.
Request
Audio file to transcribe (MP3, WAV, FLAC, etc.)
Response
The transcribed text from the audio file
The detected language code (e.g., “en-US”)
Credit Cost
5 credits per minute of audio (rounded up to the nearest minute).
For example:
- 30 seconds of audio = 5 credits (1 minute)
- 2.5 minutes of audio = 15 credits (3 minutes)
Example Request
curl -X POST https://api.gistmag.co.uk/tts/stt \
-F "audio=@recording.wav" \
-F "api_key=your_api_key_here"
- WAV (recommended)
- MP3
- FLAC
- OGG
- Other formats supported by Google Cloud Speech-to-Text
Notes
- The API automatically detects the language in the audio
- Audio is processed securely and not stored
- Maximum file size limits apply based on Google Cloud Speech-to-Text constraints