TY - JOUR
T1 - A low complexity time-scaling expansion algorithm of speech signals suitable for real time implementation
AU - Duchen-Sanchez, Gonzalo
AU - Garcia-Hernandez, Jose Juan
AU - Nakano-Miyatake, Mariko
AU - Perez-Meana, Hector
N1 - Funding Information:
We thank the National Science and Technology Council (CONACyT) and the National Polytechnic Institute of Mexico (IPN) for the financial support provided during this research. We also thank the National Rehabilitation Centre of Mexico for the speech data provided for the evaluation of the proposed systems.
PY - 2009/1
Y1 - 2009/1
N2 - This paper presents the development and implementation of a variable-rate time-scaling expansion system for speech signals based on pitch information, in which only the voiced segments are expanded while the unvoiced and silence segments are kept unchanged. The proposed system was first evaluated by computer simulation and then implemented on a digital signal processor (DSP). Time-domain, frequency-domain, mean opinion score (MOS) and diagnostic rhyme test (DRT) evaluations were carried out to test the actual performance of the developed algorithm; they show that the proposed system improves the learning level of foreign language students as well as the comprehension ability of elderly people. Objective tests were also carried out to assess the similarity between the original and the expanded signals. Through iterative refinement of the C source code, a real-time implementation was obtained. The current implementation requires 11 kwords of program memory and about 9 million floating-point operations per second (MFLOPS).
AB - This paper presents the development and implementation of a variable-rate time-scaling expansion system for speech signals based on pitch information, in which only the voiced segments are expanded while the unvoiced and silence segments are kept unchanged. The proposed system was first evaluated by computer simulation and then implemented on a digital signal processor (DSP). Time-domain, frequency-domain, mean opinion score (MOS) and diagnostic rhyme test (DRT) evaluations were carried out to test the actual performance of the developed algorithm; they show that the proposed system improves the learning level of foreign language students as well as the comprehension ability of elderly people. Objective tests were also carried out to assess the similarity between the original and the expanded signals. Through iterative refinement of the C source code, a real-time implementation was obtained. The current implementation requires 11 kwords of program memory and about 9 million floating-point operations per second (MFLOPS).
KW - DSP implementation
KW - Pitch period
KW - Real-time implementation
KW - Time-scale modification
KW - Voiced segment
UR - http://www.scopus.com/inward/record.url?scp=56549126351&partnerID=8YFLogxK
U2 - 10.1016/j.dsp.2008.07.010
DO - 10.1016/j.dsp.2008.07.010
M3 - Article
SN - 1051-2004
VL - 19
SP - 104
EP - 117
JO - Digital Signal Processing: A Review Journal
JF - Digital Signal Processing: A Review Journal
IS - 1
ER -