Audio Transcriber

Transcribe audio files to text locally in your browser. Powered by Whisper AI. No upload, no server — completely private.

How It Works

1

1. Upload Audio

Drop one or more audio files, or use the file picker. Supports MP3, WAV, M4A, OGG, FLAC, WEBM, and MP4.

2

2. Choose a Model

Select a Whisper model. Tiny is fastest; Small is more accurate. The model downloads once and is cached in your browser for future use.

3

3. Transcribe

Click Transcribe Audio. Whisper runs entirely in your browser using WebAssembly. Your audio never leaves your device.

Frequently Asked Questions

Is my audio sent to a server?

No. All transcription runs locally using WebAssembly. Your audio never leaves your device.

Does the model download every time?

No. Once downloaded, the Whisper model is cached in your browser and reused automatically on future visits.

How large are the models?

Tiny is ~40 MB, Base is ~74 MB, Small is ~244 MB. Larger models are more accurate but slower and require more memory.

Can I transcribe multiple files?

Yes. Add multiple files to the queue and they will be processed one after another automatically.

What audio formats are supported?

MP3, WAV, M4A, OGG, FLAC, WEBM, and MP4. All formats are automatically converted to the 16kHz format Whisper requires.

What browsers are supported?

Chrome, Edge, Firefox, and Safari with WebAssembly support. Chrome and Edge offer the best performance with WebGPU acceleration.

Can I transcribe in languages other than English?

Yes. You can select a specific language or use Auto-detect to let Whisper identify the language automatically.