
Transcribe Audio

Convert speech to text with speaker labels.

How to use Transcribe Audio

About 3 minutes, start to finish.

  1. Open the transcriber

    Visit /tools/audio/transcribe-audio.

  2. Upload your audio

    MP3, MP4, WAV, M4A, OGG, FLAC up to 25 MB.

  3. Pick a language

    Auto-detect or specify. For multilingual recordings, leave it on auto.

  4. Click Transcribe

    Whisper large-v3 produces a timestamped transcript in ~3-4 minutes per hour of audio.

  5. Copy or download

    Output as plain TXT (clean reading) or with start-of-segment timestamps.
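
If you want to check a file before uploading, the format list, the 25 MB limit, and the ~3-4 minutes per hour figure from the steps above can be sketched in a few lines of Python. This is a rough illustration, not the tool's own validation: the names `check_upload` and `estimated_wait_minutes` are made up for this example, and counting a megabyte as 1,000,000 bytes is an assumption.

```python
from pathlib import Path

# Limits copied from the steps above. Treating 1 MB as 1,000,000 bytes
# is an assumption; the page does not say which definition it uses.
ALLOWED_EXTENSIONS = {".mp3", ".mp4", ".wav", ".m4a", ".ogg", ".flac"}
MAX_BYTES = 25 * 1_000_000


def check_upload(filename: str, size_bytes: int) -> list[str]:
    """Return a list of problems; an empty list means the file looks uploadable."""
    problems = []
    ext = Path(filename).suffix.lower()
    if ext not in ALLOWED_EXTENSIONS:
        problems.append(f"unsupported format: {ext or '(none)'}")
    if size_bytes > MAX_BYTES:
        problems.append(
            f"file is {size_bytes / 1_000_000:.1f} MB, over the 25 MB limit"
        )
    return problems


def estimated_wait_minutes(audio_minutes: float) -> tuple[float, float]:
    """Scale the quoted ~3-4 minutes per hour of audio to a given duration."""
    hours = audio_minutes / 60
    return (3 * hours, 4 * hours)
```

For example, `check_upload("interview.wav", 30_000_000)` reports one problem (the size), and `estimated_wait_minutes(60)` gives the quoted range of 3 to 4 minutes.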

Frequently asked about Transcribe Audio

Speaker labels?
Available on Pro and above via a Pyannote speaker-diarisation pass. Works best with 2-4 distinct voices; accuracy degrades with overlapping speech.
vs Otter.ai?
Same Whisper backbone. Otter's edge is real-time meeting capture and calendar integration; ours is the bundled price and async transcription quality. See /compare/otter-ai for the full breakdown.
Length limit?
25 MB per file (OpenAI Whisper API cap). At ~128 kbps MP3 that's ~25 minutes of audio. Split longer recordings first.
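
The arithmetic behind that estimate, and a rough way to plan how many pieces a longer recording needs, might look like the sketch below. It assumes a constant bitrate, ignores container overhead, and counts a megabyte as 1,000,000 bytes; `max_minutes` and `chunks_needed` are illustrative names, not part of any tool.

```python
import math

MAX_BYTES = 25 * 1_000_000  # 25 MB cap; decimal megabytes assumed


def max_minutes(bitrate_kbps: float, max_bytes: int = MAX_BYTES) -> float:
    """Longest constant-bitrate recording that fits under the cap, in minutes."""
    bytes_per_second = bitrate_kbps * 1000 / 8
    return max_bytes / bytes_per_second / 60


def chunks_needed(duration_minutes: float, bitrate_kbps: float) -> int:
    """Number of equal-length pieces needed so each stays under the cap."""
    return math.ceil(duration_minutes / max_minutes(bitrate_kbps))
```

At 128 kbps, `max_minutes(128)` comes out at about 26 minutes, in line with the rough figure above, and a 60-minute recording at that bitrate would need `chunks_needed(60, 128)` = 3 pieces. Re-encoding at a lower bitrate is the other lever: halving the bitrate doubles the length that fits.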
