Vision & Generative Media

What Is a Mel Spectrogram?

A mel spectrogram represents audio as a time-frequency image with frequencies scaled to match human hearing. It is a common input feature for speech and audio models.

Further reading

Read more about mel spectrogram — articles and blogs from around the web: