When designing displays for the human senses, perceptual spaces are of great importance to give intuitive access to physical attributes. Similar to how perceptual spaces based on hue, saturation, and lightness were constructed for visual color, research has explored perceptual spaces for sounds of a given timbral family based on timbre, brightness, and pitch. To promote an embodied approach to the design of auditory displays, we introduce the Vowel-Type-Pitch (VTP) space, a cylindrical sound space based on human sung vowels, whose timbres can be synthesized by the composition of acoustic formants and can be categorically labeled. Vowels are arranged along the circular dimension, while voice type and pitch of the vowel correspond to the remaining two axes of the cylindrical VTP space. The decoupling and perceptual effectiveness of the three dimensions of the VTP space are tested through a vowel labeling experiment, whose results are visualized as maps on circular slices of the VTP cylinder. We discuss implications for the design of auditory and multi-sensory displays that account for human perceptual capabilities.

Rocchesso, D., Andolina, S., Ilardo, G., Palumbo, S.D., Galluzzo, Y., Randazzo, M. (2022). A perceptual sound space for auditory displays based on sung-vowel synthesis. SCIENTIFIC REPORTS, 12(1), 1-13 [10.1038/s41598-022-23736-2].

A perceptual sound space for auditory displays based on sung-vowel synthesis

Rocchesso, Davide;Andolina, Salvatore
;
Galluzzo, Ylenia;Randazzo, Mario
2022-11-12

Abstract

When designing displays for the human senses, perceptual spaces are of great importance to give intuitive access to physical attributes. Similar to how perceptual spaces based on hue, saturation, and lightness were constructed for visual color, research has explored perceptual spaces for sounds of a given timbral family based on timbre, brightness, and pitch. To promote an embodied approach to the design of auditory displays, we introduce the Vowel-Type-Pitch (VTP) space, a cylindrical sound space based on human sung vowels, whose timbres can be synthesized by the composition of acoustic formants and can be categorically labeled. Vowels are arranged along the circular dimension, while voice type and pitch of the vowel correspond to the remaining two axes of the cylindrical VTP space. The decoupling and perceptual effectiveness of the three dimensions of the VTP space are tested through a vowel labeling experiment, whose results are visualized as maps on circular slices of the VTP cylinder. We discuss implications for the design of auditory and multi-sensory displays that account for human perceptual capabilities.
12-nov-2022
Rocchesso, D., Andolina, S., Ilardo, G., Palumbo, S.D., Galluzzo, Y., Randazzo, M. (2022). A perceptual sound space for auditory displays based on sung-vowel synthesis. SCIENTIFIC REPORTS, 12(1), 1-13 [10.1038/s41598-022-23736-2].
File in questo prodotto:
File Dimensione Formato  
Rocchesso_et_al-2022-Scientific_Reports.pdf

accesso aperto

Tipologia: Versione Editoriale
Dimensione 7.13 MB
Formato Adobe PDF
7.13 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10447/573525
Citazioni
  • ???jsp.display-item.citation.pmc??? 1
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 1
social impact