Acoustic analysis of pathological voices compressed with MPEG system
Impacto
Scholar |
Otros documentos de la autoría: González-Álvarez, Julio; Cervera Crespo, Teresa; Llau, M. José
Metadatos
Mostrar el registro completo del ítemcomunitat-uji-handle:10234/9
comunitat-uji-handle2:10234/8033
comunitat-uji-handle3:10234/8636
comunitat-uji-handle4:
INVESTIGACIONEste recurso está restringido
https://doi.org/10.1016/S0892-1997(03)00007-9 |
Metadatos
Título
Acoustic analysis of pathological voices compressed with MPEG systemFecha de publicación
2003-02Editor
ElsevierCita bibliográfica
GONZALEZ, Julio; CERVERA, Teresa; LLAU, M. José. Acoustic Analysis of Pathological Voices Compressedwith MPEG System. Journal of voice, 2003, 17.2: 126-139.Tipo de documento
info:eu-repo/semantics/articleVersión de la editorial
https://www.sciencedirect.com/science/article/pii/S0892199703000079?via%3DihubVersión
info:eu-repo/semantics/publishedVersionPalabras clave / Materias
Resumen
The MPEG-1 Layer 3 compression schema of audio signal, commonly known as mp3, has caused a great impact in recent years as it has reached high compression rates while conserving a high sound quality. Music and speech ... [+]
The MPEG-1 Layer 3 compression schema of audio signal, commonly known as mp3, has caused a great impact in recent years as it has reached high compression rates while conserving a high sound quality. Music and speech samples compressed at high bitrates are perceptually indistinguishable from the original samples, but very little was known about how compression acoustically affects the voice signal. A previous work1 with normal voices showed a high fidelity at high-bitrate compressions both in voice parameters and the amplitude-frequency spectrum. In the present work, dysphonic voices were tested through two studies. In the first study, spectrograms, long-term average spectra (LTAS), and fast Fourier transform (FFT) spectra of compressed and original samples of running speech were compared. In the second study, intensities, formant frequencies, formant bandwidths, and a multidimensional set of voice parameters were tested in a set of sustained phonations. Results showed that compression at high bitrates (96 and 128 kbps) preserved the relevant acoustic properties of the pathological voices. With compressions at lower bitrates, fidelity decreases, introducing some important alterations. Results from both works, Gonzalez and Cervera1 and this paper, open up the possibility of using MPEG-compression at high bitrates to store or transmit high-quality speech recordings, without altering their acoustic properties. [-]
Proyecto de investigación
Fundació Caixa Castelló-Bancaixa and the Universitat Jaume I, Castellon, Spain (Project P1.1A2002-01)Derechos de acceso
Copyright © 2003 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
http://rightsstatements.org/vocab/InC/1.0/
info:eu-repo/semantics/restrictedAccess
http://rightsstatements.org/vocab/InC/1.0/
info:eu-repo/semantics/restrictedAccess
Aparece en las colecciones
- PSB_Articles [1321]