MP4 to SPX Converter

Extract Speex audio from MP4 video files online

Drop files here. 1 GB maximum file size or Sign Up
to

Settings

Set the overall output Speex audio bitrate. Designed for human speech encoding, Speex reaches transparency at ultra-low bitrate with a maximum bitrate of 44 kbps.
Set the number of audio channels. This setting is most useful when downmixing channels (e.g., from 5.1 to stereo).
Set the sample rate of the audio. Music with a full spectrum (20 Hz — 20 kHz) requires values not lower than 44.1 kHz to achieve transparency. More info can be found on the wiki.

mp4

MP4 (MPEG-4 Part 14) is the most widely used multimedia container format in the world, standardized by the Moving Picture Experts Group as part of the MPEG-4 specification in 2003. Built on the ISO base media file format (MPEG-4 Part 12), which itself drew from the Apple QuickTime container, MP4 uses a hierarchical atom/box structure that can encapsulate virtually any type of media data. The container most commonly packages H.264 or H.265 video with AAC audio, though it also supports a wide range of alternative codecs including AV1, VP9, MPEG-4 Visual, AC-3, and ALAC. The design supports advanced features such as streaming hints for progressive download and adaptive streaming, chapter markers, multiple audio and subtitle tracks, metadata tags, and embedded thumbnail images. A standardized structure and broad codec support have made MP4 the default choice for online video platforms, mobile devices, digital cameras, and operating system media libraries. HTML5 video with H.264 in MP4 is supported by every major web browser, establishing the combination as the universal baseline for web video delivery. Efficient packaging overhead, combined with the compression capabilities of modern codecs it carries, enables high-quality video distribution at practical file sizes across bandwidth-constrained networks and storage-limited devices.
read more

spx

Speex is an open-source audio codec purpose-built for speech compression, developed by Jean-Marc Valin under the Xiph.Org Foundation. First released in October 2002, it targets voice-over-IP, conferencing, and any scenario where spoken word needs to travel efficiently over a network. SPX files wrap Speex-encoded audio inside an Ogg container, pairing the codec's speech optimization with Ogg's streaming capabilities. Three sampling rates are supported — narrowband at 8 kHz, wideband at 16 kHz, and ultra-wideband at 32 kHz — along with variable bitrate encoding that adapts in real time to speech complexity. A standout advantage is its patent-free, BSD-licensed nature, which allowed developers to embed it freely in both commercial and open-source products. Speex also bundles acoustic echo cancellation, noise suppression, and automatic gain control, features that rival codecs typically delegate to external libraries. Although its creators officially recommend Opus) as a successor since 2012, Speex remains deployed in legacy VoIP systems, archived recordings, and embedded devices where its lightweight decoder footprint is still valued.
read more
Facebook Amazon Microsoft Tesla Nestle Walmart L'Oreal

Speech-Optimized Codec

Speex excels at encoding human voice. Converting MP4 audio to SPX gives you the best possible speech quality at minimal file sizes.

Open and Free

Speex is open-source and royalty-free. Extract voice audio from your MP4 without any licensing concerns or restrictions.

Cloud Encoding

The MP4 to SPX extraction and Speex encoding runs on our servers. No codec installations needed on your device.

How to convert MP4 to SPX

1

Select files from Computer, Google Drive, Dropbox, URL or by dragging it on the page.

2

Choose spx or any other format you need as a result (more than 200 formats supported)

3

Let the file convert and you can download your spx file right afterwards

About formats

MP4 (MPEG-4 Part 14) is the most widely used multimedia container format in the world, standardized by the Moving Picture Experts Group as part of the MPEG-4 specification in 2003. Built on the ISO base media file format (MPEG-4 Part 12), which itself drew from the Apple QuickTime container, MP4 uses a hierarchical atom/box structure that can encapsulate virtually any type of media data. The container most commonly packages H.264 or H.265 video with AAC audio, though it also supports a wide range of alternative codecs including AV1, VP9, MPEG-4 Visual, AC-3, and ALAC. The design supports advanced features such as streaming hints for progressive download and adaptive streaming, chapter markers, multiple audio and subtitle tracks, metadata tags, and embedded thumbnail images. A standardized structure and broad codec support have made MP4 the default choice for online video platforms, mobile devices, digital cameras, and operating system media libraries. HTML5 video with H.264 in MP4 is supported by every major web browser, establishing the combination as the universal baseline for web video delivery. Efficient packaging overhead, combined with the compression capabilities of modern codecs it carries, enables high-quality video distribution at practical file sizes across bandwidth-constrained networks and storage-limited devices.
Initial release: 2003
Speex is an open-source audio codec purpose-built for speech compression, developed by Jean-Marc Valin under the Xiph.Org Foundation. First released in October 2002, it targets voice-over-IP, conferencing, and any scenario where spoken word needs to travel efficiently over a network. SPX files wrap Speex-encoded audio inside an Ogg container, pairing the codec's speech optimization with Ogg's streaming capabilities. Three sampling rates are supported — narrowband at 8 kHz, wideband at 16 kHz, and ultra-wideband at 32 kHz — along with variable bitrate encoding that adapts in real time to speech complexity. A standout advantage is its patent-free, BSD-licensed nature, which allowed developers to embed it freely in both commercial and open-source products. Speex also bundles acoustic echo cancellation, noise suppression, and automatic gain control, features that rival codecs typically delegate to external libraries. Although its creators officially recommend Opus) as a successor since 2012, Speex remains deployed in legacy VoIP systems, archived recordings, and embedded devices where its lightweight decoder footprint is still valued.
Initial release: October 15, 2002

Frequently Asked Questions

Why convert MP4 to SPX?

Speex is an open-source codec designed specifically for speech encoding — delivering excellent voice quality at very low bitrates for VoIP and voice apps.

What plays SPX files?

VLC, Foobar2000, and most media players with Ogg support handle SPX. VoIP applications and voice recording tools use Speex natively.

Is Speex better than MP3 for speech?

For speech specifically, Speex outperforms MP3 at low bitrates. It was designed for voice, not music — and it excels in that niche.

Can I batch convert?

Upload multiple MP4 files simultaneously. Each audio track is extracted and encoded to SPX independently.

Is Speex open-source?

Yes — Speex is completely free and open-source, released under a BSD-like license. No royalties or licensing fees apply.

Does SPX work for music?

Speex is optimized for speech, not music. For music content, choose Vorbis, OPUS, or FLAC instead.

MP4 to SPX Quality Rating

3.8 (8 votes)
You need to convert and download at least 1 file to provide feedback!