Spectrogram

Last Edited: Dec 24, 2023

What Is an Audio Spectrogram?

An audio spectrogram is a visual representation of sound. More generally, a spectrogram is a time-varying spectral representation that shows how the spectral density of a signal varies with time. Spectrograms (also called voicegrams, sonograms, or spectral waterfalls) are typically used to identify phonetic sounds. They are also used in speech processing, sonar, seismology, and other fields.

Creation Methods

Spectrograms can be created in two ways: approximated with a filter bank built from bandpass filters, or calculated from the time signal using a short-time Fourier transform (STFT). This thesis uses the STFT to obtain the audio spectrogram. The objective of this research is a real-time implementation that displays the spectrogram of an audio signal on a video monitor using the Xilinx Virtex-5 ML506 Evaluation Board, which has powerful audio and video capabilities. The FPGA processes the input audio signal to calculate its STFT. Once the STFT is calculated, it must be converted into a form suitable for display on the video monitor. This research has several applications in scientific and commercial devices.
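The STFT step can be illustrated in software. The following is a minimal sketch in pure Python, not the FPGA implementation described above; the frame size, hop size, Hann window, sample rate, and all function names are assumptions chosen for illustration.

```python
import cmath
import math


def fft(x):
    """Recursive radix-2 FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return [complex(x[0])]
    even = fft(x[0::2])
    odd = fft(x[1::2])
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(-2j * math.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out


def stft_magnitude(signal, frame_size=256, hop=128):
    """Return one list of magnitudes (bins 0..frame_size/2) per frame."""
    # Hann window (an assumed choice) to reduce leakage at frame edges.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * i / frame_size)
              for i in range(frame_size)]
    frames = []
    for start in range(0, len(signal) - frame_size + 1, hop):
        chunk = [signal[start + i] * window[i] for i in range(frame_size)]
        spectrum = fft(chunk)
        # Keep only the non-negative frequencies of the real input.
        frames.append([abs(c) for c in spectrum[: frame_size // 2 + 1]])
    return frames


# Illustrative input: a 1 kHz tone sampled at 8 kHz.
SAMPLE_RATE = 8000
tone = [math.sin(2 * math.pi * 1000 * n / SAMPLE_RATE) for n in range(2048)]
spec = stft_magnitude(tone)

# The strongest bin of the first frame, converted back to hertz.
peak_bin = max(range(len(spec[0])), key=lambda k: spec[0][k])
peak_hz = peak_bin * SAMPLE_RATE / 256
```

Each inner list is one column of the spectrogram: time advances from frame to frame, and frequency increases with the bin index.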

Sound Restoration

Sound engineers often use audio spectrograms in sound restoration. The key to successful audio restoration is your ability to analyze the situation correctly, much as a doctor recognizes symptoms that point to a specific illness. Fortunately, spectrogram technology makes this task easier by representing audio visually. Any good visualization tool for audio repair and restoration aims to give you more information about an audible problem. A spectrogram display not only helps inform your editing decisions but can also provide new, exciting ways to edit audio. You can use it in tandem with a waveform display.

Sound Analysis

A spectrogram can be described as a very sophisticated audio analyzer. It is a detailed, accurate image of your audio, displayed in either 2D or 3D. The graph plots the audio against time and frequency, with brightness (or height, in 3D) indicating amplitude. Whereas a waveform shows how your signal’s amplitude changes over time, the spectrogram shows this change for every frequency component in the signal. If you are used to the waveform display, it may take a while to get your head around this way of “seeing” the audio.
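One common way to turn amplitude into brightness is to use a decibel scale, because raw magnitudes span too wide a range to display directly. The sketch below shows that idea under stated assumptions: the reference level, the -80 dB floor, the 8-bit gray range, and the function names are illustrative, not taken from any particular product.

```python
import math


def magnitude_to_brightness(magnitude, reference=1.0, floor_db=-80.0):
    """Map a linear magnitude to a 0-255 gray level on a decibel scale."""
    if magnitude <= 0:
        return 0  # silence maps to black
    db = 20.0 * math.log10(magnitude / reference)
    db = max(floor_db, min(0.0, db))  # clamp to the range [floor_db, 0]
    span = -floor_db                  # width of the displayed range in dB
    return round(255 * (db - floor_db) / span)


def column_to_pixels(magnitudes, reference=1.0):
    """Convert one spectrogram column (one time slice) to gray levels."""
    return [magnitude_to_brightness(m, reference) for m in magnitudes]


# Example: a column with one strong component among weak ones.
pixels = column_to_pixels([0.0001, 0.1, 1.0, 0.0])
```

With these assumed settings, a component at the reference level is drawn at full brightness, and anything 80 dB or more below it is drawn black.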

FFT Algorithm

Not all spectrograms are created equal. An algorithm called the Fast Fourier Transform, or FFT for short, computes the data behind this visual display. Many products that feature a spectrogram display allow you to adjust the size of the FFT, but what does this mean for audio repair and restoration? Changing the FFT size changes how the algorithm computes the spectrogram, so the same audio will look different. Depending on the type of audio you’re working with, one setting may reveal a problem more clearly than another.

As a rule, higher FFT sizes give you more detail in frequency (frequency resolution), while lower FFT sizes give you more detail in time (time resolution). If you’re trying to identify a plosive, mic-handling noise, or additional muddy low-frequency information, a higher FFT size in your spectrogram settings will help. Choose a lower FFT size if you’re trying to identify a high-frequency event or working with a transient signal (such as a percussion or drum loop).
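The trade-off can be put into numbers. For an FFT of N samples at sample rate fs, the spacing between frequency bins is fs / N, and each analysis frame spans N / fs seconds. The sketch below tabulates both; the 48 kHz sample rate, the FFT sizes, and the function name are assumptions for illustration.

```python
def fft_resolution(sample_rate, fft_size):
    """Return (bin spacing in Hz, frame length in milliseconds)."""
    bin_spacing_hz = sample_rate / fft_size
    frame_length_ms = 1000.0 * fft_size / sample_rate
    return bin_spacing_hz, frame_length_ms


# Compare a few FFT sizes at an assumed 48 kHz sample rate.
for size in (256, 1024, 4096):
    hz, ms = fft_resolution(48000, size)
    print(f"FFT size {size}: {hz:.2f} Hz per bin, {ms:.2f} ms per frame")
```

Quadrupling the FFT size makes the bins four times narrower but the frame four times longer, which is why low-frequency detail and sharp transients call for opposite settings.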
