doctor with hat

Tomi H. Kinnunen

Associate Professor, PhD, Docent

Researcher in speech technology, specialized in speaker and language recognition

School of Computing
University of Eastern Finland
P.O. Box 111, FI-80101 Joensuu, FINLAND

E-mail: tkinnu@cs.uef.fi




JOBS !!!


PROGRAM CODE:
DATA:


PUBLICATIONS:

Refereed journal papers:
  1. R. González Hautamäki, M. Sahidullah, V. Hautamäki, and Tomi Kinnunen, Acoustical and perceptual study of voice disguise by age modification in speaker verification, Speech Communication (to appear), doi: 10.1016/j.specom.2017.10.002 [PDF]
  2. M. Sahidullah, D. Thomsen, R. Gonzalez Hautamäki, T. Kinnunen, Z.-H. Tan, R. Parts, and Martti Pitkänen, Robust Voice Liveness Detection and Speaker Verification Using Throat Microphones, IEEE/ACM Trans. on Audio, Speech, and Language Processing. (To appear) [PDF]
  3. A. Sholokhov, M. Sahidullah, T. Kinnunen, “Semi-Supervised Speech Activity Detection with an Application to Automatic Speaker Verification, Computer Speech & Language, 47: 132--156, January 2018 [PDF] [Program code]
  4. Z. Wu, J. Yamagishi, T. Kinnunen, C. Hanilçi, M. Sahidullah, A. Sizov, N. Evans, M. Todisco, H. Delgado, “ASVspoof: the Automatic Speaker Verification Spoofing and Countermeasures Challenge”, IEEE J. on Selected Topics in Signal Processing, 11(4): 588---604, June 2017 [PDF]
  5. A. Sizov, K. A. Lee, T. Kinnunen, “Direct Optimization of the Detection Cost for i-Vector based Spoken Language Recognition, IEEE/ACM Transactions on Audio, Speech and Language Processing, 25(3): 588---597, March 2017 [PDF]
  6. C. Hanilçi, T. Kinnunen, M. Sahidullah, A. Sizov, “Spoofing Detection Goes Noisy: An Analysis of Synthetic Speech Detection in the Presence of Additive Noise”, Speech Communication, 85: 83---97, December 2016 [PDF]
  7. M. Sahidullah and T. Kinnunen, Local Spectral Variability Features for Speaker Verification”, Digital Signal Processing, 50: 1--11, March 2016 [PDF] [Program code]
  8. H. Behravan, V. Hautamäki, S. M. Siniscalchi, T. Kinnunen, C.-H. Lee,  “i-Vector Modeling of Speech Attributes for Automatic Foreign Accent Recognition”, IEEE/ACM Transactions on Audio, Speech and Language Processing, 24(1): 29---41, January 2016 [PDF]
  9. R. Gonzalez Hautamäki, T. Kinnunen, V. Hautamäki and A.-M. Laukkanen, “Automatic versus Human Speaker Verification: the Case of Voice Mimicry”, Speech Communication, 72: 13--31, September 2015 [PDF]
  10. A. Sizov, E. Khoury, T. Kinnunen, Z. Wu and S. Marcel, Joint Speaker Verification and Anti-Spoofing in the i-Vector Space”, IEEE Transactions on Information Forensics and Security, 10(4): 821--832, April 2015 [PDF] [i-vector data (hosted at IDIAP)] [Code (hosted at IDIAP)]
  11. Z. Wu, N. Evans, T. Kinnunen, J. Yamagishi, F. Alegre, H. Li, “Spoofing and Countermeasures for Speaker Verification: a Survey”, Speech Communication, 66: 130--153, February 2015 [PDF]
  12. H. Behravan, V. Hautamäki, T. Kinnunen, “Factors Affecting i-Vector Based Foreign Accent Recognition: a Case Study in Spoken Finnish”, Speech Communication, 66: 118--129, February 2015 [PDF]
  13. C. Hanilçi and T. Kinnunen, “Source Cell-Phone Recognition from Recorded Speech Using Non-Speech Segments”, Digital Signal Processing, 35: 75--85, December 2014 [PDF]
  14. J. Pohjalainen, C. Hanilçi, T. Kinnunen, P. Alku, Mixture Linear Prediction in Speaker Verification Under Vocal Effort Mismatch”, IEEE Signal Processing Letters, 21(12): 1516--1520, December 2014 [PDF] [MATLAB CODE]
  15. P. Rajan, A. Afanasyev, V. Hautamäki, T. Kinnunen, From Single to Multiple Enrollment i-Vectors: Practical PLDA Scoring Variants for Speaker Verification, Digital Signal Processing, 31: 93--10, August 2014 [PDF]
  16. V. Hautamäki, T. Kinnunen, F. Sedlak, K.A. Lee, B. Ma, H. Li, “Sparse Classifier Fusion for Speaker Verification”, IEEE Transactions on Audio, Speech and Language Processing, 21(8): 1622--1631, August 2013 [PDF]
  17. O. Schleusing, T. Kinnunen, B. Story, J.-M. Vesin, “Joint Source-Filter Optimization for Accurate Vocal Tract Estimation Using Differential Evolution”, IEEE Transactions on Audio, Speech and Language Processing, 21(8): 1560--1572, August 2013 [PDF]
  18. Md. J. Alam, T. Kinnunen, P. Kenny, P. Ouellet, D. O'Shaughnessy, “Multitaper MFCC and PLP Features for Speaker Verification Using i-Vectors”, Speech Communication, 55(2): 237--251, February 2013  [PDF] [ ISCA-Award for the best paper published in Speech Communication (2013 - 2015) ]
  19. Z. Wu, T. Kinnunen, E.S. Chng, H. Li, “Mixture of Factor Analyzers Using Priors from Non-Parallel Speech for Voice Conversion”, IEEE Signal Processing Letters, 19(12): 914--917, December 2012 [PDF]
  20. P. Mowlaee, R. Saeidi, M.G. Christensen, Z.-H. Tan, T. Kinnunen, P. Fränti, S.H. Jensen, “A Joint Approach for Single-Channel Speaker Identification and Speech Separation”, IEEE Transactions on Audio, Speech and Language Processing, 20(9): 2586--2601, November 2012 [PDF] [supplementary audio material].
  21. T. Kinnunen, R. Saeidi, F. Sedlak, K.A. Lee, J. Sandberg, M. Hansson-Sandsten, H. Li,  “Low-Variance Multitaper MFCC Features: a Case Study in Robust Speaker Verification”, IEEE Transactions on Audio, Speech and Language Processing, 20(7): 1990--2001, September 2012 [PDF][Multitaper Matlab code].
  22. C. Hanilçi, T. Kinnunen, F. Ertas, R. Saeidi, J. Pohjalainen, P. Alku, “Regularized All-Pole Models for Speaker Verification Under Noisy Environments”, IEEE Signal Processing Letters 19(3), 163--166, March 2012 [PDF]. Find also extended analysis and the RLP program codes from Odyssey 2012 version.
  23. T. Kinnunen, I. Sidoroff, M. Tuononen, P. Fränti, “Comparison of Clustering Methods: a Case Study of Text-Independent Speaker Modeling”, Pattern Recognition Letters32(13): 1604--1617, October 2011 [PDF]
  24. K.A. Lee, C.H. You, H. Li, T. Kinnunen, K.C. Sim, “Using Discrete Probabilities with Bhattacharyya Measure for SVM-based Speaker Verification”, IEEE Transactions on Audio, Speech and Language Processing, 19(4): 861--870, May 2011 [PDF].
  25. R. Saeidi, J. Pohjalainen, T. Kinnunen, P. Alku, “Temporally Weighted Linear Prediction Features for Tackling Additive Noise in Speaker Verification”, IEEE Signal Processing Letters, 17(6), pp. 599--602, June 2010 [PDF].
  26. J. Sandberg, M. Hansson-Sandsten, T. Kinnunen, R. Saeidi, P. Flandrin, P. Borgnat, “Multitaper Estimation of Frequency-Warped Cepstra with Application to Speaker Verification”, IEEE Signal Processing Letters, 17(4): 343--346, April 2010. [PDF][Multitaper Matlab code]
  27. T. Kinnunen and H. Li, An Overview of Text-Independent Speaker Recognition: from Features to Supervectors”, Speech Communication 52(1): 12--40, January 2010 [PDF].
  28. T. Kinnunen, J. Saastamoinen, V. Hautamäki, M. Vinni, P. Fränti, Comparative Evaluation of Maximum a Posteriori Vector Quantization and Gaussian Mixture Models in Speaker Verification”, Pattern Recognition Letters 30(4): 341--347, March 2009. [PDF]
  29. V. Hautamäki, T. Kinnunen and P. Fränti, Text-Independent Speaker Recognition Using Graph Matching, Pattern Recognition Letters, 29(9): 1427--1432, 2008. [PDF]
  30. V. Hautamäki,  T. Kinnunen, I. Kärkkäinen, M. Tuononen, J. Saastamoinen, P. Fränti, “Maximum a Posteriori Estimation of the Centroid Model for Speaker Verification, IEEE Signal Processing Letters, 15: 162--165, 2008. [PDF]
  31. T. Kinnunen, E. Karpov, P. Fränti, Real-Time Speaker Identification and Verification, IEEE Transactions on Audio, Speech and Language Processing, 14(1): 277--288, Jan 2006. [PDF
 Refereed conference papers:
  1. H. Delgado, M. Todisco, N. Evans, M. Sahidullah, W. M. Liu, F. Alegre, T. Kinnunen, B. Fauve, ”Impact of Bandwidth and Channel Variation on Presentation Attack Detection for Speaker Verification”, Proc. Int. Conf. of the Biometrics Special Interest Group (BIOSIG 2017), to appear. [PDF]
  2. T. Kinnunen, M. Sahidullah, H. Delgado, M. Todisco, N. Evans, J. Yamagishi, K.A. Lee, ”The ASVspoof 2017 Challenge: Assessing the Limits of Replay Spoofing Attack Detection”, Proc. Interspeech 2017, pp. 2--6, Stockholm, Sweden, August 2017 [PDF]
  3. V. Vestman, D. Gowda, M. Sahidullah, P. Alku, T. Kinnunen, ”Time-Varying Autoregressions for Speaker Verification in Reverberant Conditions”, Proc. Interspeech 2017, pp. 1512--1516, Stockholm, Sweden, August 2017 [PDF]
  4. K. A. Lee, V. Hautamäki, T. Kinnunen, A. Larcher, C. Zhang, A. Nautsch, T. Stafylakis, G. Liu, M. Rouvier, W. Rao, F. Alegre, J. Ma, M. W. Mak, A. K. Sarkar, H. Delgado, R. Saeidi, H. Aronowitz, A. Sizov, H. Sun, T. H. Nguyen, G. Wang, B. Ma, V. Vestman, M. Sahidullah, M. Halonen, A. Kanervisto, G. Le Lan, F. Bahmaninezhad, S. Isadskiy, C. Rathgeb, C. Busch, G. Tzimiropoulos, Q. Qian, Z. Wang, Q. Zhao, T. Wang, H. Li, J. Xue, S. Zhu, R. Jin, T. Zhao, P.-M. Bousquet, M. Ajili, W. B. Kheder, D. Matrouf, Z. H. Lim, C. Xu, H. Xu, X. Xiao, E. S. Chng, B. Fauve, K. Sriskandaraja, V. Sethu, W. W. Lin, D. A. L. Thomsen, Z.-H. Tan, M. Todisco, N. Evans, H. Li, J. H. L. Hansen, J.-F. Bonastre, E. Ambikairajah, ”The I4U Mega Fusion and Collaboration for NIST Speaker Recognition Evaluation 2016”, Proc. Interspeech 2017, pp. 1328--1332, Stockholm, Sweden, August 2017  [PDF]
  5.  A.K. Sarkar, M. Sahidullah, Z.-H. Tan, T. Kinnunen, ”Improving Speaker Verification Performance in Presence of Spoofing Attacks Using Out-of-Domain Spoofed Data”, Proc. Interspeech 2017, pp. 2611--2615, Stockholm, Sweden, August 2017 [PDF]
  6. T. Kinnunen, L. Juvela, P. Alku, J. Yamagishi, ”Non-parallel Voice Conversion Using i-Vector PLDA: Towards Unifying Speaker Verification and Transformation”, Proc. ICASSP 2017, pp. 5535--5539, New Orleans, US, March 2017 [PDF]
  7. T. Kinnunen, M. Sahidullah, M. Falcone, L. Costantini, R. Gonzalez Hautamäki, D. Thomsen, A. Sarkar, Z.-H. Tan, H. Delgado, M. Todisco, N. Evans, V. Hautamäki, K. A. Lee, ”RedDots Replayed: A New Replay Spoofing Attack Corpus for Text-Dependent Speaker Verification Research”, Proc. ICASSP 2017, pp. 5395--5399, New Orleans, US, March 2017 [PDF] - see also ASVspoof2017 challenge
  8. A. Kanervisto, V. Vestman, M. Sahidullah, V. Hautamäki, T. Kinnunen, ”Effects of Gender Information in Text-Independent and Text-Dependent Speaker Verification”, Proc. ICASSP 2017, pp. 5360--5364, New Orleans, US, March 2017 [PDF]
  9. H. Delgado, M. Todisco, M. Sahidullah, A.K. Sarkar, N. Evans, T. Kinnunen, Z.-H. Tan, ”Further Optimisations of Constant Q Cepstral Processing for Integrated Utterance Verification and Text-Dependent Speaker Verification”, Proc. IEEE Workshop on Spoken Language Technology (SLT), San Diego, US, December 2016. [PDF]
  10. S. Sieranoja, T. Kinnunen, P. Fränti, ”GPS Trajectory Biometrics: From Where You Were to How You Move”,  Proc. Joint Int. Workshop on Structural, Syntactic, and Statistical Pattern Recognition (S+SSPR), pp. 450--560, Mérida, Mexico, December 2016 [PDF]
  11. T. Kinnunen, A. Sholokhov, E. Khoury, D. Thomsen, M. Sahidullah and Z.-H. Tan, ”HAPPY Team Entry to NIST OpenSAD Challenge: A Fusion of Short-Term Unsupervised and Segment i-Vector Based Speech Activity Detectors”,  Proc. Interspeech, pp. 2992--2996, San Francisco, US, September 2016 [PDF]
  12. M. Sahidullah, H. Delgado, M. Todisco, H. Yu, T. Kinnunen, N. Evans and Z.-H. Tan, ”Integrated Spoofing Countermeasures and Automatic Speaker Verification: an Evaluation on ASVspoof 2015”, Proc. Interspeech, pp. 1700--1704, San Francisco, US, September 2016 [PDF]
  13. M. Sahidullah, R. González Hautamäki, D.A.L. Thomsen, T. Kinnunen, Z.-H. Tan, V. Hautamäki, R. Parts, M. Pitkänen, ”Robust Speaker Recognition with Combined Use of Acoustic and Throat Microphone Speech”, Proc. Interspeech, pp. 1720--1724, San Francisco, US, September 2016 [PDF]
  14. T. Kinnunen, M. Sahidullah, I. Kukanov, H. Delgado, M. Todisco, A. sarkar, N. Thomsen, V. Hautamäki, N. Evans, Z.-H. Tan, ”Utterance Verification for Text-Dependent Speaker Recognition: a Comparative Assessment Using the RedDots Corpus”, Proc. Interspeech, pp. 430--434, San Francisco, US, September 2016 [PDF]
  15. R. Gonzalez Hautamäki, M. Sahidullah, T. Kinnunen, V. Hautamäki, ”Age-related voice disguise and its impact on speaker verification accuracy”, Proc. Odyssey: the Speaker and Language Recognition Workshop, pp. 277--282, Bilbao, Spain, June 2016 [PDF]
  16. A. Sizov, K.A. Lee, T. Kinnunen, ”Discriminating languages in a probabilistic latent subspace”, Proc. Odyssey: the Speaker and Language Recognition Workshop, pp. 81--88, Bilbao, Spain, June 2016 [PDF]
  17. A. H. Poorjam, R. Saeidi, T. Kinnunen, V. Hautamäki, ”Incorporating uncertainty as a quality measure in i-vector based language recognition”, Proc. Odyssey: the Speaker and Language Recognition Workshop, pp. 74--80, Bilbao, Spain, June 2016 [PDF]
  18. H. Behravan, T. Kinnunen, V. Hautamäki, ”Out-of-set i-vector selection for open-set language identification”, Proc. Odyssey: the Speaker and Language Recognition Workshop, 2016, pp. 303--310, Bilbao, Spain, June 2016 [PDF]
  19. A. Sholokhov, T. Kinnunen, S. Cumani, ”Discriminative multi-domain PLDA for speaker verification”, Proc. ICASSP 2016, pp. 5030--5034, Shanghai, China, March 2016  [PDF]
  20. Z. Wu, T. Kinnunen, N. Evans, J. Yamagishi, C. Hanilçi, M. Sahidullah, A. Sizov, ”ASVspoof 2015: the first automatic speaker verification spoofing and countermeasures challenge”, Proc. Interspeech 2015, pp. 2037--2041, Dresden, Germany, September 2015 [PDF]
  21. M. Sahidullah, T. Kinnunen, C. Hanilçi, ”A comparison of features for synthetic speech detection”, Proc. Interspeech 2015, pp. 2087--2091, Dresden, Germany, September 2015 [PDF]
  22. C. Hanilçi, T. Kinnunen,  M. Sahidullah, A. Sizov, ”Classifiers for synthetic speech detection: a comparison”, Proc. Interspeech 2015, pp. 2057--2061, Dresden, Germany, September 2015 [PDF]
  23. R. Saeidi, T. Niemi, H. Karppelin, J. Pohjalainen, T. Kinnunen, P. Alku, ”Speaker recognition for speech under face cover”, Proc. Interspeech 2015, pp. 1012--1016, Dresden, Germany, September 2015 [PDF]
  24. A. Fedorova, O. Glembek, T. Kinnunen, P. Matějka, ”Exploring ANN back-ends for i-vector based speaker age estimation”, Proc. Interspeech 2015, pp. 3036--3040, Dresden, Germany, September 2015 [PDF]
  25. E. Khoury, T. Kinnunen, A. Sizov, Z. Wu, S. Marcel, ”Introducing i-vectors for joint anti-spoofing and speaker verification”, Proc. Interspeech 2014, pp. 61--65, Singapore, September 2014 [PDF]
  26. H. Behravan, V. Hautamäki, S.M. Siniscalchi, E. Khoury, T. Kurki, T. Kinnunen, C.-H. Lee, ”Dialect levelling in Finnish: a universal speech attribute approach”, Proc. Interspeech 2014, pp. 2165--2169, Singapore, September 2014 [PDF]
  27. A. Sizov, K.A. Lee, T. Kinnunen, ”Unifying probabilistic linear discriminant analysis variants in biometric authentication”, Proc. Joint Int. Workshop on Structural, Syntactic, and Statistical Pattern Recognition (S+SSPR 2014), pp. 464--475, Joensuu, Finland, August 2014 [PDF] [Fast PLDA implementation]
  28. V. Hautamäki, A. Pöllänen, T. Kinnunen, K.A. Lee, H. Li and P. Fränti, ”A comparison of categorical attribute data clustering methods”, Proc. Joint Int. Workshop on Structural, Syntactic, and Statistical Pattern Recognition (S+SSPR 2014), pp. 53--62, Joensuu, Finland, August 2014 [PDF]
  29. R. Gonzalez Hautamäki, T. Kinnunen, V. Hautamäki, A.-M. Laukkanen, ”Comparison of human listeners and speaker verification systems using voice mimicry data”, Proc. Odyssey 2014: The Speaker & Language Recognition Workshop, pp. 137--144, Joensuu, Finland, June 2014 [PDF]
  30. C.S. Greenberg, D. Bansé, G.R. Doddington, D. Garcia-Romero, J. J. Godfrey, T. Kinnunen, A.F. Martin, A. McCree, M. Przybocki, D.A. Reynolds, ”The NIST 2014 speaker recognition i-vector machine learning challenge”, Proc. Odyssey 2014: The Speaker & Language Recognition Workshop, pp. 224--230, Joensuu, Finland, June 2014. [PDF]
  31. H. Behravan, V. Hautamäki, S.M. Siniscalchi, T. Kinnunen, C.-H. Lee, ”Introducing attribute features to foreign accent recognition”, Proc. ICASSP 2014, pp. 5332--5336, Florence, Italy, May 2014 [PDF]
  32. A. Sholokhov, T. Pekhovsky, O. Kudashev, A. Shulipa, T. Kinnunen, ”Bayesian analysis of similarity matrices for speaker diarization”, Proc. ICASSP 2014, pp. 106--110, Florence, Italy, May 2014 [PDF]
  33. Z. Wu, T. Virtanen, T. Kinnunen, E.S. Chng, H. Li, ”Exemplar-based voice conversion using non-negative spectrogram deconvolution”, Proc. 8th ISCA Speech Synthesis Workshop (SSW'13), pp. 201--206, Barcelona, Spain, September 2013. [PDF]
  34. T. Kinnunen, Md. J. Alam, P. Matejka, P. Kenny, J. Cernocky, D. O'Shaughnessy, ”Frequency warping and robust speaker verification: a comparison of alternative mel-scale representations”, Proc. Interspeech 2013, pp. 3122--3126, Lyon, France, August 2013 [PDF]
  35. C. Hanilci, T. Kinnunen, P. Rajan, J. Pohjalainen, P. Alku, F. Ertas, Comparison of spectrum estimators in speaker verification: mismatch conditions induced by vocal effort”, Proc. Interspeech 2013, pp. 2881--2885, Lyon, France, August 2013 [PDF]
  36. P. Rajan, T. Kinnunen, C. Hanilci, J. Pohjalainen, P. Alku, ”Using group delay functions from all-pole models for speaker recognition”, Proc. Interspeech 2013, pp. 2489--2493, Lyon, France, August 2013  [PDF]
  37. Z. Wu, A. Larcher, K.A. Lee, E.S. Chng, T. Kinnunen, H. Li, ”Vulnerability evaluation of speaker verification under voice conversion spoofing: the effect of text constraints”, Proc. Interspeech 2013, pp. 950--954, Lyon, France, August 2013. [PDF]
  38. Z. Wu, T. Virtanen, T. Kinnunen, E.S. Chng, H. Li, ”Exemplar-based unit selection for voice conversion utilizing temporal information”, Proc. Interspeech 2013, pp. 3057--3061, Lyon, France, August 2013 [PDF]
  39. V. Hautamäki, K.A. Lee, D. van Leeuwen, R. Saeidi, A. Larcher, T. Kinnunen, T. Hasan, S.O. Sadjadi, G. Liu, H. Boril, J.H.L. Hansen, B. Fauve, "Automatic Regularization of Cross-entropy Cost for Speaker Recognition Fusion", Proc. Interspeech 2013, pp. 1609--1613, Lyon, France, August 2013. [PDF]
  40. N. Evans, T. Kinnunen, J. Yamagishi, ”Spoofing and countermeasures for automatic speaker verification”, Proc. Interspeech 2013, pp. 925--929, Lyon, France, August 2013 [PDF]
  41. R. Gonzalez Hautamäki, T. Kinnunen, V. Hautamäki, T. Leino, A.-M. Laukkanen, ”I-vectors meet imitators: on vulnerability of speaker verification systems against voice mimicry”, Proc. Interspeech 2013, pp. 930--934, Lyon, France, August 2013 [PDF]
  42. R. Gonzalez Hautamäki, V. Hautamäki, P. Rajan and T. Kinnunen, ”Merging human and automatic system decisions to improve speaker recognition performance”, Proc. Interspeech 2013, pp. 2519--2523, Lyon, France, August 2013 [PDF]
  43. R. Saeidi, K. A. Lee, T. Kinnunen, T. Hasan, B. Fauve, P. -M. Bousquet, E. Khoury,P. L. Sordo Martinez, J. M. K. Kua, C. H. You, H. Sun, A. Larcher, P. Rajan, V. Hautamäki, C. Hanilci, B. Braithwaite, R. Gonzales-Hautamäki, S. O. Sadjadi, G. Liu, H. Boril, N. Shokouhi, D. Matrouf, L. El Shafey, P.Mowlaee, J. Epps, T. Thiruvaran, D. A. van Leeuwen, B. Ma, H. Li, J. H. L. Hansen, J.-F. Bonastre, S. Marcel, J. Mason, E. Ambikairajah, "I4U submission to NIST SRE 2012: A large-scale collaborative effort for noise-robust speaker verification", Proc. Interspeech 2013, pp. 1986--1990, Lyon, France, August 2013. [PDF]
  44. H. Behravan, V. Hautamäki, T. Kinnunen, ”Foreign Accent Detection from Spoken Finnish Using i-Vectors”, Proc. Interspeech 2013, pp. 79--83, Lyon, France, August 2013 [PDF]
  45. P. Rajan, T. Kinnunen, V. Hautamäki, ”Effect of multicondition training on i-vector PLDA configurations for speaker recogntion”, Proc. Interspeech 2013, pp. 3694--3697, Lyon, France, August 2013 [PDF]
  46. T. Kinnunen, P. Rajan, ”A practical, self-adaptive voice activity detector for speaker verification with noisy telephone and microphone data”,  Proc. Int. Conf. on Acoustics,
    Speech and Signal Processing (ICASSP 2013), pp. 7229--7233, Vancouver, Canada, May 2013. [PDF] [MATLAB CODE]
  47. C. Hanilci, T. Kinnunen, R. Saeidi, J. Pohjalainen, P. Alku, F. Ertas, ”Speaker Identification From Shouted Speech: Analysis and Compensation”, Proc. Int. Conf. on Acoustics, Speech and
    Signal Processing (ICASSP 2013), pp. 8027--8031, Vancouver, Canada, May 2013 [PDF]
  48. Z. Wu, T. Kinnunen, E.S. Chng, H. Li, E. Ambikairajah, ”A Study on spoofing attack in state-of-the-art speaker verification: the telephone speech case”, Proc. 2012 Asia-Pacific Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC 2012), pp. 1--5, Hollywood, USA, December 2012 [PDF] (BEST PAPER AWARD)
  49. T. Kinnunen, R. Saeidi, J. Leppänen, J.P. Saarinen, ”Audio context recognition in variable mobile environments from short segments using speaker and language recognizers”, Proc. Odyssey: the speaker and language recognition workshop, Singapore, June 2012. [PDF]
  50. C. Hanilci, T. Kinnunen, R. Saeidi, J. Pohjalainen, P. Alku, F. Ertas, ”Regularization of all-pole models for speaker verification under additive noise”, Proc. Odyssey: the speaker and language recognition workshop, Singapore, June 2012. [PDF]
  51. V. Hautamäki, K.A. Lee, A. Larcher, T. Kinnunen, B. Ma, H. Li, ”Variational Bayes logistic regression as regularized fusion for NIST SRE 2010”, Proc. Odyssey: the speaker and language recognition workshop, Singapore, June 2012. [PDF]
  52. T. Kinnunen, H. Leisma, M. Machunik, T. Kakkonen, J.-L. Lebrun, SWAN - Scientific Writing AssistaNt. A Tool for Helping Scholars to Write Reader-Friendly Manuscripts”, demonstrator in the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2012), Avignon, France, April 2012. [PDF]
  53. T. Kinnunen, Z.-Z. Wu, K. A. Lee, F. Sedlak, E. S. Chng, H. Li, “Vulnerability of Speaker Verification Systems Against Voice Conversion Spoofing Attacks: the Case of Telephone Speech”, Proc. ICASSP 2012, pp. 4401--4404, Kyoto, Japan, March 2012  [PDF].
  54. S. Siddiq, T. Kinnunen, M. Vainio, S. Werner, “Intonational Speaker Verification: a Study on Parameters and Performance Under Noisy Conditions”, Proc. ICASSP 2012, pp. 4777--4780, Kyoto, Japan, March 2012 [PDF].
  55. C. Hanilci, T. Kinnunen, R. Saeidi, J. Pohjalainen, P. Alku, F. Ertas, J. Sandberg, M. Hansson-Sandsten, “Comparing Spectrum Estimators in Speaker Verification Under Additive Noise”,  Proc. ICASSP 2012, pp. 4769--4772, Kyoto, Japan, March 2012 [PDF].
  56. J. Alam, T. Kinnunen, P. Kenny, P. Ouellet, D. O'Shaughnessy, “Multitaper MFCC Features for Speaker Verification Using i-Vectors”, Proc. IEEE Automatic Speech Recognition and Understanding (ASRU 2011), pp. 547--552, Hawaii, December 2011. [PDF] [Multitaper Matlab code]
  57. J. Rodriguez-Fuentes, M. Penagarikano, A. Varona, M. Diez, G. Bordel, D. Martinez, J. Villalba, A. Miguel, A. Ortega, E. Lleida, A. Abad, O. Koller, I. Trancoso, P. Lopez-Otero, L. Docio-Fernandez, C. Garcia-Mateo, R. Saeidi, M. Soufifar, T. Kinnunen, T. Svendsen, P. Fränti, “Multi-Site Heterogenous System Fusions for the Albayzin 2010 Language Recognition Evaluation”, Proc. IEEE Automatic Speech Recognition and Understanding (ASRU 2011), pp. 377--382, Hawaii, December 2011. [PDF]
  58. Evgeny Karpov, Zaur Nasibov, Tomi Kinnunen, Pasi Fränti, "Combining voice activity detectors using decision fusion", Proc. Speech and Computer (SPECOM 2011), Kazan, Russia, September 2011, pp. 278--283.
  59. V. Hautamäki, K.A. Lee, T. Kinnunen, B. Ma, H. Li, “Regularized Logistic Regression Fusion for Speaker Verification”, Proc. Interspeech 2011, Florence, Italy, pp. 2745-2748, August 2011 [PDF]
  60. P. Mowlaee, R. Saeidi, Z.-H. Tan, M.G. Christensen, T. Kinnunen, P. Fränti, S.H. Jensen, “Sinusoidal Approach for the Single-Channel Speech Separation and Recognition Challenge”, Proc. Interspeech 2011, Florence, Italy, August 2011, pp. 677-680. [PDF]
  61. F. Sedlak, T. Kinnunen, V. Hautamäki, K.A. Lee, H. Li, Classifier Subset Selection and Fusion for Speaker Verification”, Proc. ICASSP 2011, pp. 4544--4547, Prague, Czech Republic, May 2011. [PDF] [video and slides]
  62. J. Pohjalainen, P. Alku, T. Kinnunen, Shout Detection in Noise”, Proc. ICASSP 2011, pp. 4968--4971, Prague, Czech Republic, May 2011. [PDF]
  63. T. Kinnunen, R. Saeidi, J. Sandberg, M. Hansson-Sandsten, What Else is New Than the Hamming Window? Robust MFCCs for Speaker Recognition via Multitapering”,  Proc. Interspeech 2010, pp. 2734--2737, Makuhari, Japan, Sept. 2010. [PDF] [Multitaper Matlab code]
  64. J. Pohjalainen, R. Saeidi, T. Kinnunen, P. Alku,  Extended Weighted Linear Prediction (XLP) Analysis of Speech and its Application to Speaker Verification in Adverse Conditions”, Proc. Interspeech 2010, pp. 1477--1480, Makuhari, Japan, Sept. 2010. [PDF]
  65. Z.-Z. Wu, T. Kinnunen, E.S. Chng, H. Li, Text-Independent F0 Transformation with Non-Parallel Data for Voice Conversion”, Proc. Interspeech 2010, pp. 1732--1735, Makuhari, Japan, Sept. 2010. [PDF]
  66. V. Hautamäki, T. Kinnunen, M. Nosratighods, K.A. Lee, B. Ma, H. Li, “Approaching Human Listener Accuracy with Modern Speaker Verification”, Proc Interspeech 2010, pp. 1473--1476, Makuhari, Japan, Sept. 2010. [PDF]
  67. R. Saeidi, P. Mowlaee, T. Kinnunen, Z.-H. Tan, M.G. Christensen, S.H. Jensen, P. Fränti, Improving Monaural Speaker Identification by Double-Talk Detection”, Proc Interspeech 2010, pp. 1069--1072, Makuhari, Japan, Sept. 2010. [PDF]
  68. R. Saeidi, J. Pohjalainen, T. Kinnunen, P. Alku,  Temporally Weighted Linear Prediction Features for Speaker Verification in Additive Noise”, Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, pp. 40-46, June 2010. [PDF]
  69. R. Saeidi, P. Mowlaee, T. Kinnunen, Z. H Tan, M. G. Christensen, S. H. Jensen and P. Fränti, Signal-to-signal ratio independent speaker identification for co-channel speech signals, Int. Conf. on Pattern Recognition (ICPR 2010), pp. 4545--456, Istanbul, Turkey, August 2010. [PDF]
  70. K. A. Lee, H. Li, C. H. You, T. Kinnunen, K. C. Sim, “Discrete Expected Likelihood Kernel for SVM-Based Speaker Verification”, Proc. 18th European Signal Processing Conference (EUSIPCO 2010), pp. 591--595, Aalborg, Denmark, August 2010 [PDF]
  71. T. Kinnunen, F. Sedlak, R. Bednarik, “Towards Task-Independent Person Authentication Using Eye Movement Signals”, Proc. of the 2010 Symposium on Eye-Tracking Research and Applications (ETRA 2010), pp. 187--190, Austin, Texas, March 2010. [PDF]
  72. R. Saeidi, T. Kinnunen, H.R.S. Mohammadi, R. Rodman, P. Fränti, “Joint Frame and Gaussian Selection for Text-Independent Speaker Verification, Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2010), pp. 4530--4533, Dallas, Texas, USA, March 2010. [PDF]
  73. T. Kinnunen and P. Alku, “On separating glottal source and vocal tract information in telephony speaker verification, Proc. Int. conference on acoustics, speech, and signal processing (ICASSP 2009), pp. 4545--4548, Taipei, Taiwan, April 2009. [PDF]
  74. T. Kinnunen, J. Saastamoinen, V. Hautamäki, M. Vinni and P. Fränti, Comparing maximum a posteriori vector quantization and Gaussian mixture models in speaker verification”, Proc. Int. conference on acoustics, speech, and signal processing (ICASSP 2009) , pp. 4229--4232, Taipei, Taiwan, April 2009. [PDF]
  75. P. Fränti, J. Saastamoinen, I. Kärkkäinen, T. Kinnunen, V. Hautamäki, I. Sidoroff, “Developing Speaker Recognition System:  from Prototype to Practical Application”,  Int. Conf. Forensic Applications and Techniques in Telecommunications, Information and Multimedia (e-Forensics'09), Adelaide, Australia, LNICST vol. 8, 101-114, January 2009.
  76. K.A. Lee, C. You, H. Li, T. Kinnunen and D. Zhu, Characterizing Speech Utterances for Speaker Verification with Sequence Kernel SVM, Proc. Interspeech 2008 , pp. 1397-1400, Brisbane, Australia, 2008. [PDF]
  77. T. Kinnunen, K.A. Lee and H. Li, Dimension Reduction of the Modulation Spectrogram for Speaker Verification, Proc. Odyssey: The Speaker and Language Recognition Workshop, Stellenbosch, South Africa, January 2008. [PDF]
  78. T. Kinnunen, E. Chernenko, M. Tuononen, P. Fränti, H. Li, “Voice Activity Detection Using MFCC Features and Support Vector Machine”, Proc. Speech and Computer 2007 (SPECOM), vol. 2, 556-561, Moscow, Russia, October 2007. [PDF]
  79. K.A. Lee, C. You, H. Li, T. Kinnunen, ”A GMM-based Probabilistic Sequence Kernel for Speaker Verification”, Proc. Interspeech 2007, p. 294-297, Antwerp, Belgium, August 2007.[PDF]
  80. T. Kinnunen, B. Zhang, J. Zhu, Y. Wang, ”Speaker Verification with Adaptive Spectral Subband Centroids”, Proc. Int. Conf. Biometrics, pp. 58-66, Lecture Notes in Computer Science 4642, Seoul, Korea, August 2007. [PDF]
  81. R. Saeidi, R.S. Mohammadi, R. Rodman, T. Kinnunen, "A New Segmentation Algorithm Combined with Transient Frames Power for Text-Independent Speaker Verification", Proc. IEEE Int. Conf. Acoustics, Speech and Signal Processing (ICASSP), Vol. 4, pp. 305-308, Honolulu, Hawaii, April 2007. [PDF]
  82. T. Kinnunen, C.W.E. Koh, L. Wang, H. Li, and E.S. Chng, "Temporal Discrete Cosine Transform: Towards Longer Term Temporal Features for Speaker Verification", Proc. 5th International Symposium on Chinese Spoken Language Processing (ISCSLP'2006), LNAI 4274, pp. 547-558, Singapore, December 2006.[PDF]
  83. T. Kinnunen, V. Hautamäki, P. Fränti, “On the Use of Long-Term Average Spectrum in Automatic Speaker Recognition” , Proc. 5th International Symposium on Chinese Spoken Language Processing (ISCSLP'2006), pp. 559-567, Singapore, December 2006. [PDF]
  84. K.A. Lee, H. Sun, R. Tong, B. Ma, M. Dong, C. You, D. Zhu, C.W.E. Koh, L. Wang, T.Kinnunen, E.S. Chng and H. Li, "The IIR Submission to CSLP 2006 Speaker Recognition Evaluation", Proc. 5th International Symposium on Chinese Spoken Language Processing (ISCSLP'2006), LNAI 4274, pp. 494-505, Singapore, December 2006. [PDF]
  85. R. Tong, B. Ma, K.A. Lee, C. You, D. Zhou, T. Kinnunen, H. Sun, M. Dong, E.S. Chng, H. Li, The IIR NIST 2006 Speaker Recognition System: Fusion of Acoustic and Tokenization Features”, Proc. 5th International Symposium on Chinese Spoken Language Processing (ISCSLP'2006), pp. 566-577, Singapore, December 2006. [PDF]
  86. T. Kinnunen, "Joint Acoustic-Modulation Frequency for Speaker Recognition", Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP2006), Vol. I, pp. 665-668, May 14-19, 2006, Toulouse, France. [PDF]
  87. T. Kinnunen, R. Gonzalez-Hautamäki, Long-Term F0 Modeling for Text-Independent Speaker Recognition, Proc. Int. Conf. on Speech and Computer (SPECOM'2005), pp. 567-570, Patras, Greece, October 2005. [PDF]
  88. J. Saastamoinen, Z. Fiedler, T. Kinnunen and P. Fränti, "On factors affecting MFCC-based speaker recognition accuracy", Proc. Int. Conf. on Speech and Computer (SPECOM'05), pp. 503-506, Patras, Greece, October 2005 [PDF].
  89.  H. Gupta, V. Hautamäki, T. Kinnunen and P. Fränti, "Field evaluation of text-dependent speaker recognition in an access control application", Proc. Int. Conf. on Speech and Computer (SPECOM'05), pp. 551-554, Patras, Greece, October 2005. [PDF]
  90. O. Grebenskaya, T. Kinnunen, P. Fränti, “Speaker Clustering in Speech Recognition”, Proc. 2005 Finnish Signal Processing Symposium (FINSIG’05), pp. 46-49, Kuopio, Finland, August 25, 2005. [PDF]
  91. V. Hautamäki, S. Cherednichenko, I. Kärkkäinen, T. Kinnunen, P. Fränti, “Improving K-Means by Outlier Removal”, Proc. 14th Scandinavian Conference on Image Analysis (SCIA 2005), Lecture Notes in Computer Science 3540, pp. 978-987, Joensuu, Finland, June 19-22, 2005.
  92. R. Bednarik, T. Kinnunen, A. Mihaila, P. Fränti, “Eye-Movements as a Biometric“, Proc. 14th Scandinavian Conference on Image Analysis (SCIA 2005), Lecture Notes in Computer Science 3540, pp. 780-789, Joensuu, Finland, June 19-22, 2005. [PDF]
  93. T. Niemi-Laitinen, J. Saastamoinen, T. Kinnunen, P Fränti, ”Applying MFCC-Based Automatic Speaker Recognition to GSM and Forensic Data“, Proc. Human Language Technologies (HLT’2005), p. 317-322, Tallinn, Estonia, April 4-5, 2005 [PDF]
  94. T. Kinnunen, E. Karpov, P. Fränti, “Efficient Online Cohort Selection Method for Speaker Verification”, Proc. 8th Int. Conf. on Spoken Language Processing (ICSLP 2004), Vol. III, pp. 2401-2402, Jeju Island, Korea, Oct. 4-8, 2004 [PDF].
  95. T. Kinnunen, E. Karpov, P. Fränti, Real-Time Speaker Identification, Proc. 8th Int. Conf. on Spoken Language Processing (ICSLP'2004), Vol. III, pp. 1805-1808, Jeju Island, Korea, Oct. 4-8, 2004. [PDF]
  96. T. Kinnunen, V. Hautamäki, P. Fränti, Fusion of Spectral Feature Sets for Accurate Speaker IdentificationProc. 9th International Conference Speech and Computer (SPECOM'2004), pp. 361-365, St. Petersburg, Russia, September 20-22, 2004. [PDF]
  97. E. Karpov, T. Kinnunen, P. Fränti, "Symmetric Distortion Measure for Speaker Recognition",  Proc. 9th International Conference Speech and Computer (SPECOM'2004), pp. 366-370, St. Petersburg, Russia, September 20-22, 2004. [PDF]
  98. T. Kinnunen, V. Hautamäki, P. Fränti, "On the Fusion of Dissimilarity-Based Classifiers for Speaker Identification", Proc. 8th European Conference on Speech Communiation and Technology (Eurospeech 2003), pp. 2641-2644, Geneva, Switzerland, September 1-4, 2003. [PDF]
  99. T. Kinnunen, E. Karpov, P. Fränti: "A Speaker Pruning Algorithm for Real-Time Speaker Identification", Proc. 4th International Conference on  Audio- and Video-Based Biometric Person Authentication (AVBPA 2003), pp. 639-646, Guilford, UK, June 9-11, 2003. [PDF]
  100. T. Kinnunen: "Designing a Speaker-Discriminative Adaptive Filter Bank for Speaker Recognition", Proc. 7th International Conference on Spoken Language Processing (ICSLP 2002), pp. 2325-2328, Denver, Colorado, USA, September 16-20, 2002. [PDF]
  101. T. Kinnunen, I. Kärkkäinen: "Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification", Proc. Joint IAPR International Workshop on Statistical Pattern Recognition (S+SPR 2002), pp. 681-688, Windsor, Canada, August 6-9, 2002. [PDF]
  102. T. Kinnunen, I. Kärkkäinen, P. Fränti: "Is Speech Data Clustered? - Statistical Analysis of Cepstral Features", Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001)), vol. 4, pp. 2627-2630, Aalborg, Denmark, September 3-7, 2001. [PDF]
  103. T. Kinnunen, P. Fränti: "Speaker Discriminative Weighting Method for VQ-Based Speaker Identification", Proc. 3rd International Conference on audio-and video-based biometric person authentication (AVBPA 2001), pp. 150-156, Halmstad, Sweden, June 6-8, 2001. [PDF]
  104. T. Kinnunen, T. Kilpeläinen, P. Fränti: "Comparison of Clustering Algorithms in Speaker Identification", Proc. IASTED Int. Conf. Signal Processing and Communications (SPC 2000), pp. 222-227, Marbella, Spain, September 19-22, 2000. [PDF]

Theses:

Other publications