A time-frequency representation of sound is commonly obtained through the Short-Time Fourier Transform. Identifying and extracting the prominent frequency components of the spectrogram is important for sinusoidal modeling and sound processing. Borrowing a known image processing technique, known as seam carving, we propose an algorithm to track and extract the sinusoidal components from the sound spectrogram. Experiments show how this technique is well suited for sound whose prominent frequency components vary both in amplitude and in frequency. Moreover, seam carving naturally produces some auditory continuity effects. We compare this algorithm with two other sine extraction techniques, based on peak detection on spectrogram frames. The seam carving skips this step and turns out to be applicable to a variety of sounds,although being more computationally expensive.

Capizzi, G., Rocchesso, D., Baldan, S. (2020). Streams as Seams: Carving trajectories out of the time-frequency matrix. In Simone Spagnol and Charalampos Saitis (a cura di), Proceedings of the 17th Sound and Music Computing Conference (pp. 442-449). Torino.

Streams as Seams: Carving trajectories out of the time-frequency matrix

Capizzi, Giovanni;Rocchesso, Davide
;
2020-01-01

Abstract

A time-frequency representation of sound is commonly obtained through the Short-Time Fourier Transform. Identifying and extracting the prominent frequency components of the spectrogram is important for sinusoidal modeling and sound processing. Borrowing a known image processing technique, known as seam carving, we propose an algorithm to track and extract the sinusoidal components from the sound spectrogram. Experiments show how this technique is well suited for sound whose prominent frequency components vary both in amplitude and in frequency. Moreover, seam carving naturally produces some auditory continuity effects. We compare this algorithm with two other sine extraction techniques, based on peak detection on spectrogram frames. The seam carving skips this step and turns out to be applicable to a variety of sounds,although being more computationally expensive.
2020
Settore INF/01 - Informatica
Settore ING-INF/03 - Telecomunicazioni
Capizzi, G., Rocchesso, D., Baldan, S. (2020). Streams as Seams: Carving trajectories out of the time-frequency matrix. In Simone Spagnol and Charalampos Saitis (a cura di), Proceedings of the 17th Sound and Music Computing Conference (pp. 442-449). Torino.
File in questo prodotto:
File Dimensione Formato  
SMCCIM_2020_paper_60.pdf

accesso aperto

Descrizione: articolo principale
Tipologia: Versione Editoriale
Dimensione 6.25 MB
Formato Adobe PDF
6.25 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/424453
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact