Sound Production Modeling Using Concatenated Acoustic Tubes


03.22.10 Posted in EEN502Project by

Results and Code for Part A

Results and Code for Part B

Code for Part C – Hey

Hey Sound

Code for Part C – Wow

Wow Sound


Part A seems to have come out alright. Nothing was extraordinarily surprising – all figures were kind of what I expected. I think the same about part B. The result signal looks very much like a glottal pulse.

Part C is where I experienced some difficulties. As can be heard from the sound samples above, they are not entirely convincing. That being said, the ‘wow’ sounds OK, but the ‘hey’ sounds more like ‘hi’. I didn’t spend too much time on finding an appropriate energy envelope, but I did spend time trying to make the pitch of the synthesized word sound realistic. For each part of the word, I created new glottal pulses with appropriate frequencies. When transitioning between phonemes, a single, new area function is found by interpolating between the others. To achieve a more realistic sound, I tried playing around with different parameters of each phoneme or transition between phonemes: the time/duration in seconds, the pitch, and the envelope. I feel that the ‘wow’ came out well, but I have some issues with my ‘hey’. It seems my ‘a’ is more like a long ‘a’ as in ‘aww’ where I needed an ‘a’ as in ‘apex’. Darn!




Comments are closed.

Social Networks
Links
Search the Archives: