Skip to end of metadata
Go to start of metadata

CSI-speech project 2010-2013 is a part of the

Computational Science Programme of the Academy of Finland

We develop new inversion algorithms related to the production and perception of speech

These are the main goals of the project:

  1. Non-invasive measurement of occupational voice loading.
  2. Highly natural speech synthesis including affective factors.
  3. Robust speech recognition in natural environments.
  4. A description of the brain mechanisms of speech perception.

Three Centres of Excellence are involved in the project: CoE in Inverse Problems Research, CoE in Adaptive Informatics Research, and CoE in Computational Complex Systems Research.

Books:

Jennifer L Mueller and Samuli Siltanen: Linear and Nonlinear Inverse Problems with Practical Applications, SIAM 2012.

Published journal articles:

Auvinen H., Raitio T., Airaksinen M., Siltanen S., Story B., Paavo A.: Automatic Glottal Inverse Filtering with the Markov Chain Monte Carlo Method, Computer Speech and Language (In Press), 2013.

Ikehata M, Niemi E and Siltanen S: Inverse obstacle scattering with limited-aperture data. Inverse Problems and Imaging 6(1), pp. 77-94, 2012.

Ismo Miettinen, Paavo Alku, Nelli Salminen, Patrick J May, Hannu Tiitinen: Responsiveness of the human auditory cortex to degraded speech sounds: reduction of amplitude resolution vs. additive noise. Brain Research, Vol. 1367, pp. 298-309, 2011.

Nakamura G, Ronkanen P, Siltanen S and Tanuma K: Recovering conductivity at the boundary in three-dimensional electrical impedance tomography. Inverse Problems and Imaging 5(2), pp. 485-510, 2011.

Astala K, Mueller J L, Paivarinta L, Peramaki A and Siltanen S: Direct electrical impedance tomography for nonsmooth conductivities. Inverse Problems and Imaging 5(3), pp. 531-549, 2011.

Paavo Alku, Tom Bäckström, Carlo Magi: Generalization of linear prediction with the autocorrelation method. Electronics Letters, Vol. 47, Issue 2, pp. 145-147, 2011.

Santeri Yrttiaho, Patrick J. C. May, Hannu Tiitinen, Paavo Alku: Cortical encoding of aperiodic and periodic speech sounds: evidence for distinct neural populations. NeuroImage, Vol. 55, Issue 3, pp. 1252-1259, 2011.

Hannu Pulakka, Paavo Alku: Bandwidth extension of telephone speech using a neural network and a filterbank implementation for highband mel spectrum. IEEE Transactions on Audio, Speech, and Language Processing, Vol. 19, No. 7, pp. 2170-2183, 2011.

Tuomo Raitio, Antti Suni, Junichi Yamagishi, Hannu Pulakka, Jani Nurminen, Martti Vainio, Paavo Alku: HMM-based speech synthesis utilizing glottal inverse filtering. IEEE Transactions on Audio, Speech, and Language Processing, Vol. 19, No. 1, pp. 153-165, 2010.

Ismo Miettinen, Hannu Tiitinen, Paavo Alku, Patrick May: Sensitivity of the human auditory cortex to acoustic degradation of speech and non-speech sounds. BMC Neuroscience 2010, 11:24.

Rahim Saeidi, Jouni Pohjalainen, Tomi Kinnunen, Paavo Alku: Temporally weighted linear prediction features for tackling additive noise in speaker verification. IEEE Signal Processing Letters, Vol. 17, No. 6, pp. 599-602, 2010.

Krupchyk K, Lassas M and Siltanen S: Determining electrical and heat transfer parameters using coupled boundary measurements. SIAM Journal on Mathematical Analysis 43(5), pp. 2096-2115.

Nelli Salminen, Hannu Tiitinen, Ismo Miettinen, Paavo Alku, Patrick May: Asymmetrical representation of auditory space in human cortex. Brain Research, Vol. 1306, pp. 93-99, 2010.

Nelli H. Salminen, Patrick J.C. May, Paavo Alku, Hannu Tiitinen: A population rate code of auditory space in the human cortex. PLoS One 4(10), 2010.

Laura E. Matilainen, Sanna S. Talvitie, Eero Pekkonen, Paavo Alku, Patrick J.C. May, Hannu Tiitinen: The effects of healthy aging on auditory processing in humans as indexed by transient brain responses. Clinical Neurophysiology, Vol. 121, pp. 902-911, 2010.

Sanna S. Talvitie, Laura E. Matilainen, Eero Pekkonen, Paavo Alku, Patrick J.C. May, Hannu Tiitinen: The effects of cortical ischemic stroke on auditory processing in humans as indexed by transient brain responses. Clinical Neurophysiology, Vol. 121, pp. 912-920, 2010.

Santeri Yrttiaho, Hannu Tiitinen, Paavo Alku, Ismo Miettinen, Patrick May: Temporal integration of vowel periodicity in the auditory cortex. Journal of the Acoustical Society of America, Vol. 128, No. 1, pp. 224-234, 2010.

Refereed conference papers:

Auvinen H. Raitio T., Siltanen S., Alku P.: Utilizing Markov Chain Monte Carlo (MCMC) Method for Improved Glottal Inverse Filtering. Proc. Interspeech 2012, ISSN: 1990-9770, 2012.

Alku Paavo, Pohjalainen Jouni, Vainio Martti, Laukkanen Anne-Maria, Story Brad: Improved formant frequency estimation from high-pitched vowels by downgrading the contribution of the glottal source with weighted linear prediction. Proceedings of InterSpeech 2012, 13th Annual Conference of the International Speech Communication Association, 1-4, 2012.

Hannu Pulakka, Ulpu Remes, Kalle Palomäki, Mikko Kurimo, Paavo Alku: Speech bandwidth extension using Gaussian Mixture Model-based estimation of the highband Mel spectrum. In CD Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'11), Prague, Czech Republic, May 22-27, 2011.

Tuomo Raitio, Antti Suni, Hannu Pulakka, Martti Vainio, Paavo Alku: Utilizing glottal source pulse library for generating improved excitation signal for HMM-based speech synthesis. In CD Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP'11), Prague, Czech Republic, May 22-27, 2011.

George Kafentzis, Yannis Stylianou, Paavo Alku: Glottal inverse filtering using Stablilised Weighted Linear Prediction. Proc. of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP’11), Prague, Czech Republic, May 22-27, 2011.

Hannu Pulakka, Ulpu Remes, Santeri Yrttiaho, Kalle Palomäki, Mikko Kurimo, Paavo Alku: Low-frequency bandwidth extension of telephone speech using sinusoidal synthesis and gaussian mixture model. Proc. of Interspeech’11, Florence, Italy, Aug. 28-31, 2011.

Sami Keronen, Jouni Pohjalainen, Paavo Alku, Mikko Kurimo: Noise robust feature extraction based on Extended Weighted Linear Prediction in LVCSR. Proc. of Interspeech’11, Florence, Italy, Aug. 28-31, 2011.

Jouni Pohjalainen, Tuomo Raitio, Paavo Alku: Detection of shouted speech in the presence of ambient noise. Proc. of Interspeech’11, Florence, Italy, Aug. 28-31, 2011.

Tuomo Raitio, Antti Suni, Martti Vainio, Paavo Alku: Analysis of HMM-based Lombard speech synthesis. Proc. of Interspeech’11, Florence, Italy, Aug. 28-31, 2011.

Antti Suni, Tuomo Raitio, Martti Vainio, Paavo Alku: The GlottHMM entry for Blizzard Challenge 2011: Utilizing source unit selection in HMM-based speech synthesis for improved excitation generation. Proc. of the ISCA Blizzard Challenge 2011 Workshop, Turin, Italy, Sept. 2, 2011.

Seppanen A, Nissinen A, Kolehmainen V, Siltanen S and Laukkanen A-M 2011, Electrical impedance tomography imaging of larynx. Models and analysis of vocal emissions for biomedical applications, 7th International Workshop, Firenze, Italy, August 25-27, 2011.

Sami Keronen, Ulpu Remes, Kalle J. Palomäki, Tuomas Virtanen, and Mikko Kurimo: Comparison of noise robust methods in large vocabulary speech recognition. In Proceedings of the 18th European Signal Processing Conference, EUSIPCO 2010, Aalborg, Denmark, August 2010.

Hannu Pulakka, Ville Myllylä, Laura Laaksonen, Paavo Alku: Bandwidth extension of telephone speech using a filter bank implementation for highband Mel spectrum. Proc. of the European Signal Processing Conference 2010 (EUSIPCO), Aalborg, Denmark, August 23-27, 2010.

Martti Vainio, Matti Airas, Juhani Järvikivi, Paavo Alku: Laryngeal voice quality in the expression of focus. Proc. of Interspeech’10, Makuhari, Japan, Sept. 26-30, 2010.

Tuomo Raitio, Antti Suni, Hannu Pulakka, Martti Vainio, Paavo Alku: Comparison of formant enhancement methods for HMM-based speech synthesis. Proc. of the 7th ISCA Speech Synthesis Workshop, Kyoto, Japan, Sept. 22-24, 2010.

Antti Suni, Tuomo Raitio, Martti Vainio, Paavo Alku: The GlottHMM speech synthesis entry for Blizzard Challenge 2010. Proc. of the Blizzard Challenge 2010 Workshop, Kyoto, Japan, Sept. 25, 2010.

Forthcoming publications:

Paavo Alku: Glottal inverse filtering analysis of human voice production, A review of estimation and parameterization methods of the glottal excitation and their applications. (Invited article). Sadhana. In press.

M. Gehre, T. Kluth A. Lipponen, B. Jin, A. Seppänen, J.P. Kaipio, P. Maass: Sparsity Reconstruction in Electrical Impedance Tomography: An Experimental Evaluation, Journal of Computational and Applied Mathematics, In press.

Conference presentations:

Seppänen A, Nissinen A, Kolehmainen V, Siltanen S, Laukkanen A-M, "Electrical Impedance Tomography imaging of larynx",
Finnish-Japanese-Korean Workshop on Inverse Problems. 14.12.2011, Helsinki, Finland.

Laukkanen A-M.: "Phonation related vocal fold loading as a challenge for imaging and quantification",
Finnish-Japanese-Korean Workshop on Inverse Problems. 14.12.2011, Helsinki, Finland.

Samuli Siltanen: Plenary talk: "Electrical Impedance Tomography", Fields-MITACS Conference on Mathematics of Medical Imaging, University of Toronto, Canada, June 23, 2011.

Samuli Siltanen: "Feasibility of electrical impedance tomography for the imaging of the larynx - preliminary results", Advanced Voice Function Assessment (AVFA) Workshop, Goethe-University of Frankfurt am Main, Germany, May 15, 2011.

Samuli Siltanen: "Fake tooth, artificial voice, and inverse calculus" (in Finnish), Kumpula Colloquium, University of Helsinki, Finland. This presentation was aimed at a general audience.

Samuli Siltanen: "Industrial Mathematics" (In Finnish), The Science Forum, University of Helsinki, Finland, January 13, 2011. This presentation was aimed at a general audience.A. Seppänen, V. Kolehmainen, S. Siltanen, A-M. Laukkanen: "Electrical impedance tomography for non-invasive measurement of occupational voice loading", Annual seminar of Academy of Finland Computational Science Research Programme (Lastu), Tuusula, April 6-8, 2011.

Liu D, Seppänen A, Nissinen A, Kolehmainen V, Siltanen S, Laukkanen A-M: Preliminary results on 3D electrical impedance tomography imaging of vocal folds. International Conference on Voice Physiology and Biomechanics, Erlangen, Germany, July 5-7, 2012.

Kankare Elina, Liu Dong, Laukkanen Anne-Maria. Comparison of text reading and spontaneous speech with different parameters of electroglottography, perceptual analysis and VAPP. 5th International Congress of the World Voice Consortium, Luxor, Egypt, October 27th to 31st, 2012.

Alku Paavo, Pohjalainen Jouni, Vainio Martti, Laukkanen Anne-Maria, Story Brad. Improved formant frequency estimation from high-pitched vowels by downgrading the contribution of the glottal source with weighted linear prediction. InterSpeech 2012, Portland, Oregon 11.09.2012.

Peltokoski Joanna, Geneid Ahmed, Laukkanen Anne-Maria, Kankare Elina, Tyrmi Jaana, Liu Dong. Finnish Resonance Tubes. Immediate effects of resonance tube training on supraglottic area and vocal fold vibration.
The 5th World Voice Congress October 27-31, 2012, Luxor, Egypt 31.10.2012.

Radolf V, Nissinen A, Laukkanen A-M, Havlík R, Horáček J. Computer simulation of musical singer voice based on MRI and acoustic measurements. The 18th International Conference Engineering Mechanics 2012, Svratka, Czech Republic, 14 – 17 May 2012. 15.05.2012.

Radolf Vojtech, Horacek Jaromir, Bula Vitek, Vesely Jan, Laukkanen Anne-Maria. Experimental investigation of air pressure and acoustic characteristics of human voice. Comparison of measurements in vivo and in vitro.. International Conference on Voice Physiology and Biomechanics,  Erlangen, Germany, 07.08.2012.

Invited lectures:

A. Seppänen: "Statistical inversion in electrical impedance tomography", School of Mathematical Sciences, Fudan University, Shanghai, China, September 15th, 2010.

A. Seppänen: Three lectures: 1. "Electrical impedance tomography", 2. "Non-stationary inverse problems with application to industrial process tomography", 3. "Localization of internal electrodes in electrical impedance tomography", School of Mathematics and Statistics, Lanzhou University, Lanzhou, China, September 19th - 20th, 2010.

A. Seppänen: "Bayesian inversion in electrical impedance tomography", University of Rome, "La Sapienza", Italy, March 28th, 2011.

Visibility in mainstream media:

2011 December Yliopisto (university magazine)
2012 March Prisma Studio (Finnish science TV show)
2012 October Helsingin Sanomat (main Finnish newspaper)
2012 October Tekniikka & Talous (Finnish engineering weekly paper)
2012 October Aristoteleen kantapää (Finnish national radio)
2012 October Medical News Today
2012 November Akuutti (Finnish national television)

  • No labels