
Tomi Kinnunen, PhD, Docent

Researcher in speech technology, specialized in speaker and language recognition

School of Computing
University of Eastern Finland
P.O. Box 111, FI-80101 Joensuu, FINLAND

E-mail: tkinnu@cs.uef.fi



PROGRAM CODE:
DATA:


PUBLICATIONS:

Refereed journal papers:
  1. C. Hanilçi, T. Kinnunen, M. Sahidullah, A. Sizov, “Spoofing Detection Goes Noisy: An Analysis of Synthetic Speech Detection in the Presence of Additive Noise”, Speech Communication (To appear) [PDF]
  2. M. Sahidullah and T. Kinnunen, “Local Spectral Variability Features for Speaker Verification”, Digital Signal Processing, 50: 1--11, March 2016 [PDF] [Program code]
  3. H. Behravan, V. Hautamäki, S. M. Siniscalchi, T. Kinnunen, C.-H. Lee, “i-Vector Modeling of Speech Attributes for Automatic Foreign Accent Recognition”, IEEE/ACM Transactions on Audio, Speech and Language Processing, 24(1): 29--41, January 2016 [PDF]
  4. R. Gonzalez Hautamäki, T. Kinnunen, V. Hautamäki and A.-M. Laukkanen, “Automatic versus Human Speaker Verification: the Case of Voice Mimicry”, Speech Communication, 72: 13--31, September 2015 [PDF]
  5. A. Sizov, E. Khoury, T. Kinnunen, Z. Wu and S. Marcel, “Joint Speaker Verification and Anti-Spoofing in the i-Vector Space”, IEEE Transactions on Information Forensics and Security, 10(4): 821--832, April 2015 [PDF] [i-vector data (hosted at IDIAP)] [Code (hosted at IDIAP)]
  6. Z. Wu, N. Evans, T. Kinnunen, J. Yamagishi, F. Alegre, H. Li, “Spoofing and Countermeasures for Speaker Verification: a Survey”, Speech Communication, 66: 130--153, February 2015 [PDF]
  7. H. Behravan, V. Hautamäki, T. Kinnunen, “Factors Affecting i-Vector Based Foreign Accent Recognition: a Case Study in Spoken Finnish”, Speech Communication, 66: 118--129, February 2015 [PDF]
  8. C. Hanilçi and T. Kinnunen, “Source Cell-Phone Recognition from Recorded Speech Using Non-Speech Segments”, Digital Signal Processing, 35: 75--85, December 2014 [PDF]
  9. J. Pohjalainen, C. Hanilçi, T. Kinnunen, P. Alku, “Mixture Linear Prediction in Speaker Verification Under Vocal Effort Mismatch”, IEEE Signal Processing Letters, 21(12): 1516--1520, December 2014 [PDF] [MATLAB CODE]
  10. P. Rajan, A. Afanasyev, V. Hautamäki, T. Kinnunen, “From Single to Multiple Enrollment i-Vectors: Practical PLDA Scoring Variants for Speaker Verification”, Digital Signal Processing, 31: 93--10, August 2014 [PDF]
  11. V. Hautamäki, T. Kinnunen, F. Sedlak, K.A. Lee, B. Ma, H. Li, “Sparse Classifier Fusion for Speaker Verification”, IEEE Transactions on Audio, Speech and Language Processing, 21(8): 1622--1631, August 2013 [PDF]
  12. O. Schleusing, T. Kinnunen, B. Story, J.-M. Vesin, “Joint Source-Filter Optimization for Accurate Vocal Tract Estimation Using Differential Evolution”, IEEE Transactions on Audio, Speech and Language Processing, 21(8): 1560--1572, August 2013 [PDF]
  13. Md. J. Alam, T. Kinnunen, P. Kenny, P. Ouellet, D. O'Shaughnessy, “Multitaper MFCC and PLP Features for Speaker Verification Using i-Vectors”, Speech Communication, 55(2): 237--251, February 2013  [PDF] [ ISCA-Award for the best paper published in Speech Communication (2013 - 2015) ]
  14. Z. Wu, T. Kinnunen, E.S. Chng, H. Li, “Mixture of Factor Analyzers Using Priors from Non-Parallel Speech for Voice Conversion”, IEEE Signal Processing Letters, 19(12): 914--917, December 2012 [PDF]
  15. P. Mowlaee, R. Saeidi, M.G. Christensen, Z.-H. Tan, T. Kinnunen, P. Fränti, S.H. Jensen, “A Joint Approach for Single-Channel Speaker Identification and Speech Separation”, IEEE Transactions on Audio, Speech and Language Processing, 20(9): 2586--2601, November 2012 [PDF] [supplementary audio material].
  16. T. Kinnunen, R. Saeidi, F. Sedlak, K.A. Lee, J. Sandberg, M. Hansson-Sandsten, H. Li,  “Low-Variance Multitaper MFCC Features: a Case Study in Robust Speaker Verification”, IEEE Transactions on Audio, Speech and Language Processing, 20(7): 1990--2001, September 2012 [PDF][Multitaper Matlab code].
  17. C. Hanilçi, T. Kinnunen, F. Ertas, R. Saeidi, J. Pohjalainen, P. Alku, “Regularized All-Pole Models for Speaker Verification Under Noisy Environments”, IEEE Signal Processing Letters, 19(3): 163--166, March 2012 [PDF]. See also the extended analysis and the RLP program code in the Odyssey 2012 version.
  18. T. Kinnunen, I. Sidoroff, M. Tuononen, P. Fränti, “Comparison of Clustering Methods: a Case Study of Text-Independent Speaker Modeling”, Pattern Recognition Letters, 32(13): 1604--1617, October 2011 [PDF]
  19. K.A. Lee, C.H. You, H. Li, T. Kinnunen, K.C. Sim, “Using Discrete Probabilities with Bhattacharyya Measure for SVM-based Speaker Verification”, IEEE Transactions on Audio, Speech and Language Processing, 19(4): 861--870, May 2011 [PDF].
  20. R. Saeidi, J. Pohjalainen, T. Kinnunen, P. Alku, “Temporally Weighted Linear Prediction Features for Tackling Additive Noise in Speaker Verification”, IEEE Signal Processing Letters, 17(6), pp. 599--602, June 2010 [PDF].
  21. J. Sandberg, M. Hansson-Sandsten, T. Kinnunen, R. Saeidi, P. Flandrin, P. Borgnat, “Multitaper Estimation of Frequency-Warped Cepstra with Application to Speaker Verification”, IEEE Signal Processing Letters, 17(4): 343--346, April 2010. [PDF][Multitaper Matlab code]
  22. T. Kinnunen and H. Li, “An Overview of Text-Independent Speaker Recognition: from Features to Supervectors”, Speech Communication 52(1): 12--40, January 2010 [PDF].
  23. T. Kinnunen, J. Saastamoinen, V. Hautamäki, M. Vinni, P. Fränti, “Comparative Evaluation of Maximum a Posteriori Vector Quantization and Gaussian Mixture Models in Speaker Verification”, Pattern Recognition Letters 30(4): 341--347, March 2009. [PDF]
  24. V. Hautamäki, T. Kinnunen and P. Fränti, “Text-Independent Speaker Recognition Using Graph Matching”, Pattern Recognition Letters, 29(9): 1427--1432, 2008. [PDF]
  25. V. Hautamäki, T. Kinnunen, I. Kärkkäinen, M. Tuononen, J. Saastamoinen, P. Fränti, “Maximum a Posteriori Estimation of the Centroid Model for Speaker Verification”, IEEE Signal Processing Letters, 15: 162--165, 2008. [PDF]
  26. T. Kinnunen, E. Karpov, P. Fränti, “Real-Time Speaker Identification and Verification”, IEEE Transactions on Audio, Speech and Language Processing, 14(1): 277--288, Jan 2006. [PDF]

Refereed conference papers:
  1. H. Delgado, M. Todisco, M. Sahidullah, A.K. Sarkar, N. Evans, T. Kinnunen, Z.-H. Tan, ”Further Optimisations of Constant Q Cepstral Processing for Integrated Utterance Verification and Text-Dependent Speaker Verification”, Proc. IEEE Workshop on Spoken Language Technology (SLT), San Diego, US, December 2016. [PDF]
  2. S. Sieranoja, T. Kinnunen, P. Fränti, ”GPS Trajectory Biometrics: From Where You Were to How You Move”,  Proc. Joint Int. Workshop on Structural, Syntactic, and Statistical Pattern Recognition (S+SSPR), Mérida, Mexico, December 2016 (To appear) [PDF]
  3. T. Kinnunen, A. Sholokhov, E. Khoury, D. Thomsen, M. Sahidullah and Z.-H. Tan, ”HAPPY Team Entry to NIST OpenSAD Challenge: A Fusion of Short-Term Unsupervised and Segment i-Vector Based Speech Activity Detectors”,  Proc. Interspeech, pp. 2992--2996, San Francisco, US, September 2016 [PDF]
  4. M. Sahidullah, H. Delgado, M. Todisco, H. Yu, T. Kinnunen, N. Evans and Z.-H. Tan, ”Integrated Spoofing Countermeasures and Automatic Speaker Verification: an Evaluation on ASVspoof 2015”, Proc. Interspeech, pp. 1700--1704, San Francisco, US, September 2016 [PDF]
  5. M. Sahidullah, R. González Hautamäki, D.A.L. Thomsen, T. Kinnunen, Z.-H. Tan, V. Hautamäki, R. Parts, M. Pitkänen, ”Robust Speaker Recognition with Combined Use of Acoustic and Throat Microphone Speech”, Proc. Interspeech, pp. 1720--1724, San Francisco, US, September 2016 [PDF]
  6. T. Kinnunen, M. Sahidullah, I. Kukanov, H. Delgado, M. Todisco, A. Sarkar, N. Thomsen, V. Hautamäki, N. Evans, Z.-H. Tan, ”Utterance Verification for Text-Dependent Speaker Recognition: a Comparative Assessment Using the RedDots Corpus”, Proc. Interspeech, pp. 430--434, San Francisco, US, September 2016 [PDF]
  7. R. Gonzalez Hautamäki, M. Sahidullah, T. Kinnunen, V. Hautamäki, ”Age-related voice disguise and its impact on speaker verification accuracy”, Proc. Odyssey: the Speaker and Language Recognition Workshop, pp. 277--282, Bilbao, Spain, June 2016 [PDF]
  8. A. Sizov, K.A. Lee, T. Kinnunen, ”Discriminating languages in a probabilistic latent subspace”, Proc. Odyssey: the Speaker and Language Recognition Workshop, pp. 81--88, Bilbao, Spain, June 2016 [PDF]
  9. A. H. Poorjam, R. Saeidi, T. Kinnunen, V. Hautamäki, ”Incorporating uncertainty as a quality measure in i-vector based language recognition”, Proc. Odyssey: the Speaker and Language Recognition Workshop, pp. 74--80, Bilbao, Spain, June 2016 [PDF]
  10. H. Behravan, T. Kinnunen, V. Hautamäki, ”Out-of-set i-vector selection for open-set language identification”, Proc. Odyssey: the Speaker and Language Recognition Workshop, pp. 303--310, Bilbao, Spain, June 2016 [PDF]
  11. A. Sholokhov, T. Kinnunen, S. Cumani, ”Discriminative multi-domain PLDA for speaker verification”, Proc. ICASSP 2016 (to appear) [PDF]
  12. Z. Wu, T. Kinnunen, N. Evans, J. Yamagishi, C. Hanilçi, M. Sahidullah, A. Sizov, ”ASVspoof 2015: the first automatic speaker verification spoofing and countermeasures challenge”, Proc. Interspeech 2015, pp. 2037--2041, Dresden, Germany, September 2015 [PDF]
  13. M. Sahidullah, T. Kinnunen, C. Hanilçi, ”A comparison of features for synthetic speech detection”, Proc. Interspeech 2015, pp. 2087--2091, Dresden, Germany, September 2015 [PDF]
  14. C. Hanilçi, T. Kinnunen,  M. Sahidullah, A. Sizov, ”Classifiers for synthetic speech detection: a comparison”, Proc. Interspeech 2015, pp. 2057--2061, Dresden, Germany, September 2015 [PDF]
  15. R. Saeidi, T. Niemi, H. Karppelin, J. Pohjalainen, T. Kinnunen, P. Alku, ”Speaker recognition for speech under face cover”, Proc. Interspeech 2015, pp. 1012--1016, Dresden, Germany, September 2015 [PDF]
  16. A. Fedorova, O. Glembek, T. Kinnunen, P. Matějka, ”Exploring ANN back-ends for i-vector based speaker age estimation”, Proc. Interspeech 2015, pp. 3036--3040, Dresden, Germany, September 2015 [PDF]
  17. E. Khoury, T. Kinnunen, A. Sizov, Z. Wu, S. Marcel, ”Introducing i-vectors for joint anti-spoofing and speaker verification”, Proc. Interspeech 2014, pp. 61--65, Singapore, September 2014 [PDF]
  18. H. Behravan, V. Hautamäki, S.M. Siniscalchi, E. Khoury, T. Kurki, T. Kinnunen, C.-H. Lee, ”Dialect levelling in Finnish: a universal speech attribute approach”, Proc. Interspeech 2014, pp. 2165--2169, Singapore, September 2014 [PDF]
  19. A. Sizov, K.A. Lee, T. Kinnunen, ”Unifying probabilistic linear discriminant analysis variants in biometric authentication”, Proc. Joint Int. Workshop on Structural, Syntactic, and Statistical Pattern Recognition (S+SSPR 2014), pp. 464--475, Joensuu, Finland, August 2014 [PDF] [Fast PLDA implementation]
  20. V. Hautamäki, A. Pöllänen, T. Kinnunen, K.A. Lee, H. Li and P. Fränti, ”A comparison of categorical attribute data clustering methods”, Proc. Joint Int. Workshop on Structural, Syntactic, and Statistical Pattern Recognition (S+SSPR 2014), pp. 53--62, Joensuu, Finland, August 2014 [PDF]
  21. R. Gonzalez Hautamäki, T. Kinnunen, V. Hautamäki, A.-M. Laukkanen, ”Comparison of human listeners and speaker verification systems using voice mimicry data”, Proc. Odyssey 2014: The Speaker & Language Recognition Workshop, pp. 137--144, Joensuu, Finland, June 2014 [PDF]
  22. C.S. Greenberg, D. Bansé, G.R. Doddington, D. Garcia-Romero, J. J. Godfrey, T. Kinnunen, A.F. Martin, A. McCree, M. Przybocki, D.A. Reynolds, ”The NIST 2014 speaker recognition i-vector machine learning challenge”, Proc. Odyssey 2014: The Speaker & Language Recognition Workshop, pp. 224--230, Joensuu, Finland, June 2014. [PDF]
  23. H. Behravan, V. Hautamäki, S.M. Siniscalchi, T. Kinnunen, C.-H. Lee, ”Introducing attribute features to foreign accent recognition”, Proc. ICASSP 2014, pp. 5332--5336, Florence, Italy, May 2014 [PDF]
  24. A. Sholokhov, T. Pekhovsky, O. Kudashev, A. Shulipa, T. Kinnunen, ”Bayesian analysis of similarity matrices for speaker diarization”, Proc. ICASSP 2014, pp. 106--110, Florence, Italy, May 2014 [PDF]
  25. Z. Wu, T. Virtanen, T. Kinnunen, E.S. Chng, H. Li, ”Exemplar-based voice conversion using non-negative spectrogram deconvolution”, Proc. 8th ISCA Speech Synthesis Workshop (SSW'13), pp. 201--206, Barcelona, Spain, September 2013. [PDF]
  26. T. Kinnunen, Md. J. Alam, P. Matejka, P. Kenny, J. Cernocky, D. O'Shaughnessy, ”Frequency warping and robust speaker verification: a comparison of alternative mel-scale representations”, Proc. Interspeech 2013, pp. 3122--3126, Lyon, France, August 2013 [PDF]
  27. C. Hanilci, T. Kinnunen, P. Rajan, J. Pohjalainen, P. Alku, F. Ertas, ”Comparison of spectrum estimators in speaker verification: mismatch conditions induced by vocal effort”, Proc. Interspeech 2013, pp. 2881--2885, Lyon, France, August 2013 [PDF]
  28. P. Rajan, T. Kinnunen, C. Hanilci, J. Pohjalainen, P. Alku, ”Using group delay functions from all-pole models for speaker recognition”, Proc. Interspeech 2013, pp. 2489--2493, Lyon, France, August 2013  [PDF]
  29. Z. Wu, A. Larcher, K.A. Lee, E.S. Chng, T. Kinnunen, H. Li, ”Vulnerability evaluation of speaker verification under voice conversion spoofing: the effect of text constraints”, Proc. Interspeech 2013, pp. 950--954, Lyon, France, August 2013. [PDF]
  30. Z. Wu, T. Virtanen, T. Kinnunen, E.S. Chng, H. Li, ”Exemplar-based unit selection for voice conversion utilizing temporal information”, Proc. Interspeech 2013, pp. 3057--3061, Lyon, France, August 2013 [PDF]
  31. V. Hautamäki, K.A. Lee, D. van Leeuwen, R. Saeidi, A. Larcher, T. Kinnunen, T. Hasan, S.O. Sadjadi, G. Liu, H. Boril, J.H.L. Hansen, B. Fauve, "Automatic Regularization of Cross-entropy Cost for Speaker Recognition Fusion", Proc. Interspeech 2013, pp. 1609--1613, Lyon, France, August 2013. [PDF]
  32. N. Evans, T. Kinnunen, J. Yamagishi, ”Spoofing and countermeasures for automatic speaker verification”, Proc. Interspeech 2013, pp. 925--929, Lyon, France, August 2013 [PDF]
  33. R. Gonzalez Hautamäki, T. Kinnunen, V. Hautamäki, T. Leino, A.-M. Laukkanen, ”I-vectors meet imitators: on vulnerability of speaker verification systems against voice mimicry”, Proc. Interspeech 2013, pp. 930--934, Lyon, France, August 2013 [PDF]
  34. R. Gonzalez Hautamäki, V. Hautamäki, P. Rajan and T. Kinnunen, ”Merging human and automatic system decisions to improve speaker recognition performance”, Proc. Interspeech 2013, pp. 2519--2523, Lyon, France, August 2013 [PDF]
  35. R. Saeidi, K. A. Lee, T. Kinnunen, T. Hasan, B. Fauve, P.-M. Bousquet, E. Khoury, P. L. Sordo Martinez, J. M. K. Kua, C. H. You, H. Sun, A. Larcher, P. Rajan, V. Hautamäki, C. Hanilci, B. Braithwaite, R. Gonzales-Hautamäki, S. O. Sadjadi, G. Liu, H. Boril, N. Shokouhi, D. Matrouf, L. El Shafey, P. Mowlaee, J. Epps, T. Thiruvaran, D. A. van Leeuwen, B. Ma, H. Li, J. H. L. Hansen, J.-F. Bonastre, S. Marcel, J. Mason, E. Ambikairajah, "I4U submission to NIST SRE 2012: A large-scale collaborative effort for noise-robust speaker verification", Proc. Interspeech 2013, pp. 1986--1990, Lyon, France, August 2013. [PDF]
  36. H. Behravan, V. Hautamäki, T. Kinnunen, ”Foreign Accent Detection from Spoken Finnish Using i-Vectors”, Proc. Interspeech 2013, pp. 79--83, Lyon, France, August 2013 [PDF]
  37. P. Rajan, T. Kinnunen, V. Hautamäki, ”Effect of multicondition training on i-vector PLDA configurations for speaker recognition”, Proc. Interspeech 2013, pp. 3694--3697, Lyon, France, August 2013 [PDF]
  38. T. Kinnunen, P. Rajan, ”A practical, self-adaptive voice activity detector for speaker verification with noisy telephone and microphone data”, Proc. Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP 2013), pp. 7229--7233, Vancouver, Canada, May 2013. [PDF] [MATLAB CODE]
  39. C. Hanilci, T. Kinnunen, R. Saeidi, J. Pohjalainen, P. Alku, F. Ertas, ”Speaker Identification From Shouted Speech: Analysis and Compensation”, Proc. Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP 2013), pp. 8027--8031, Vancouver, Canada, May 2013 [PDF]
  40. Z. Wu, T. Kinnunen, E.S. Chng, H. Li, E. Ambikairajah, ”A Study on spoofing attack in state-of-the-art speaker verification: the telephone speech case”, Proc. 2012 Asia-Pacific Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC 2012), pp. 1--5, Hollywood, USA, December 2012 [PDF] (BEST PAPER AWARD)
  41. T. Kinnunen, R. Saeidi, J. Leppänen, J.P. Saarinen, ”Audio context recognition in variable mobile environments from short segments using speaker and language recognizers”, Proc. Odyssey: the speaker and language recognition workshop, Singapore, June 2012. [PDF]
  42. C. Hanilci, T. Kinnunen, R. Saeidi, J. Pohjalainen, P. Alku, F. Ertas, ”Regularization of all-pole models for speaker verification under additive noise”, Proc. Odyssey: the speaker and language recognition workshop, Singapore, June 2012. [PDF]
  43. V. Hautamäki, K.A. Lee, A. Larcher, T. Kinnunen, B. Ma, H. Li, ”Variational Bayes logistic regression as regularized fusion for NIST SRE 2010”, Proc. Odyssey: the speaker and language recognition workshop, Singapore, June 2012. [PDF]
  44. T. Kinnunen, H. Leisma, M. Machunik, T. Kakkonen, J.-L. Lebrun, ”SWAN - Scientific Writing AssistaNt. A Tool for Helping Scholars to Write Reader-Friendly Manuscripts”, demonstrator in the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2012), Avignon, France, April 2012. [PDF]
  45. T. Kinnunen, Z.-Z. Wu, K. A. Lee, F. Sedlak, E. S. Chng, H. Li, “Vulnerability of Speaker Verification Systems Against Voice Conversion Spoofing Attacks: the Case of Telephone Speech”, Proc. ICASSP 2012, pp. 4401--4404, Kyoto, Japan, March 2012  [PDF].
  46. S. Siddiq, T. Kinnunen, M. Vainio, S. Werner, “Intonational Speaker Verification: a Study on Parameters and Performance Under Noisy Conditions”, Proc. ICASSP 2012, pp. 4777--4780, Kyoto, Japan, March 2012 [PDF].
  47. C. Hanilci, T. Kinnunen, R. Saeidi, J. Pohjalainen, P. Alku, F. Ertas, J. Sandberg, M. Hansson-Sandsten, “Comparing Spectrum Estimators in Speaker Verification Under Additive Noise”,  Proc. ICASSP 2012, pp. 4769--4772, Kyoto, Japan, March 2012 [PDF].
  48. J. Alam, T. Kinnunen, P. Kenny, P. Ouellet, D. O'Shaughnessy, “Multitaper MFCC Features for Speaker Verification Using i-Vectors”, Proc. IEEE Automatic Speech Recognition and Understanding (ASRU 2011), pp. 547--552, Hawaii, December 2011. [PDF] [Multitaper Matlab code]
  49. J. Rodriguez-Fuentes, M. Penagarikano, A. Varona, M. Diez, G. Bordel, D. Martinez, J. Villalba, A. Miguel, A. Ortega, E. Lleida, A. Abad, O. Koller, I. Trancoso, P. Lopez-Otero, L. Docio-Fernandez, C. Garcia-Mateo, R. Saeidi, M. Soufifar, T. Kinnunen, T. Svendsen, P. Fränti, “Multi-Site Heterogenous System Fusions for the Albayzin 2010 Language Recognition Evaluation”, Proc. IEEE Automatic Speech Recognition and Understanding (ASRU 2011), pp. 377--382, Hawaii, December 2011. [PDF]
  50. Evgeny Karpov, Zaur Nasibov, Tomi Kinnunen, Pasi Fränti, "Combining voice activity detectors using decision fusion", Proc. Speech and Computer (SPECOM 2011), Kazan, Russia, September 2011, pp. 278--283.
  51. V. Hautamäki, K.A. Lee, T. Kinnunen, B. Ma, H. Li, “Regularized Logistic Regression Fusion for Speaker Verification”, Proc. Interspeech 2011, Florence, Italy, pp. 2745-2748, August 2011 [PDF]
  52. P. Mowlaee, R. Saeidi, Z.-H. Tan, M.G. Christensen, T. Kinnunen, P. Fränti, S.H. Jensen, “Sinusoidal Approach for the Single-Channel Speech Separation and Recognition Challenge”, Proc. Interspeech 2011, Florence, Italy, August 2011, pp. 677-680. [PDF]
  53. F. Sedlak, T. Kinnunen, V. Hautamäki, K.A. Lee, H. Li, “Classifier Subset Selection and Fusion for Speaker Verification”, Proc. ICASSP 2011, pp. 4544--4547, Prague, Czech Republic, May 2011. [PDF] [video and slides]
  54. J. Pohjalainen, P. Alku, T. Kinnunen, “Shout Detection in Noise”, Proc. ICASSP 2011, pp. 4968--4971, Prague, Czech Republic, May 2011. [PDF]
  55. T. Kinnunen, R. Saeidi, J. Sandberg, M. Hansson-Sandsten, “What Else is New Than the Hamming Window? Robust MFCCs for Speaker Recognition via Multitapering”, Proc. Interspeech 2010, pp. 2734--2737, Makuhari, Japan, Sept. 2010. [PDF] [Multitaper Matlab code]
  56. J. Pohjalainen, R. Saeidi, T. Kinnunen, P. Alku, “Extended Weighted Linear Prediction (XLP) Analysis of Speech and its Application to Speaker Verification in Adverse Conditions”, Proc. Interspeech 2010, pp. 1477--1480, Makuhari, Japan, Sept. 2010. [PDF]
  57. Z.-Z. Wu, T. Kinnunen, E.S. Chng, H. Li, “Text-Independent F0 Transformation with Non-Parallel Data for Voice Conversion”, Proc. Interspeech 2010, pp. 1732--1735, Makuhari, Japan, Sept. 2010. [PDF]
  58. V. Hautamäki, T. Kinnunen, M. Nosratighods, K.A. Lee, B. Ma, H. Li, “Approaching Human Listener Accuracy with Modern Speaker Verification”, Proc. Interspeech 2010, pp. 1473--1476, Makuhari, Japan, Sept. 2010. [PDF]
  59. R. Saeidi, P. Mowlaee, T. Kinnunen, Z.-H. Tan, M.G. Christensen, S.H. Jensen, P. Fränti, “Improving Monaural Speaker Identification by Double-Talk Detection”, Proc. Interspeech 2010, pp. 1069--1072, Makuhari, Japan, Sept. 2010. [PDF]
  60. R. Saeidi, J. Pohjalainen, T. Kinnunen, P. Alku, “Temporally Weighted Linear Prediction Features for Speaker Verification in Additive Noise”, Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, pp. 40-46, June 2010. [PDF]
  61. R. Saeidi, P. Mowlaee, T. Kinnunen, Z.-H. Tan, M. G. Christensen, S. H. Jensen and P. Fränti, “Signal-to-signal ratio independent speaker identification for co-channel speech signals”, Int. Conf. on Pattern Recognition (ICPR 2010), pp. 4545--456, Istanbul, Turkey, August 2010. [PDF]
  62. K. A. Lee, H. Li, C. H. You, T. Kinnunen, K. C. Sim, “Discrete Expected Likelihood Kernel for SVM-Based Speaker Verification”, Proc. 18th European Signal Processing Conference (EUSIPCO 2010), pp. 591--595, Aalborg, Denmark, August 2010 [PDF]
  63. T. Kinnunen, F. Sedlak, R. Bednarik, “Towards Task-Independent Person Authentication Using Eye Movement Signals”, Proc. of the 2010 Symposium on Eye-Tracking Research and Applications (ETRA 2010), pp. 187--190, Austin, Texas, March 2010. [PDF]
  64. R. Saeidi, T. Kinnunen, H.R.S. Mohammadi, R. Rodman, P. Fränti, “Joint Frame and Gaussian Selection for Text-Independent Speaker Verification”, Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2010), pp. 4530--4533, Dallas, Texas, USA, March 2010. [PDF]
  65. T. Kinnunen and P. Alku, “On separating glottal source and vocal tract information in telephony speaker verification”, Proc. Int. conference on acoustics, speech, and signal processing (ICASSP 2009), pp. 4545--4548, Taipei, Taiwan, April 2009. [PDF]
  66. T. Kinnunen, J. Saastamoinen, V. Hautamäki, M. Vinni and P. Fränti, “Comparing maximum a posteriori vector quantization and Gaussian mixture models in speaker verification”, Proc. Int. conference on acoustics, speech, and signal processing (ICASSP 2009), pp. 4229--4232, Taipei, Taiwan, April 2009. [PDF]
  67. P. Fränti, J. Saastamoinen, I. Kärkkäinen, T. Kinnunen, V. Hautamäki, I. Sidoroff, “Developing Speaker Recognition System: from Prototype to Practical Application”, Int. Conf. Forensic Applications and Techniques in Telecommunications, Information and Multimedia (e-Forensics'09), Adelaide, Australia, LNICST vol. 8, 101-114, January 2009.
  68. K.A. Lee, C. You, H. Li, T. Kinnunen and D. Zhu, “Characterizing Speech Utterances for Speaker Verification with Sequence Kernel SVM”, Proc. Interspeech 2008, pp. 1397-1400, Brisbane, Australia, 2008. [PDF]
  69. T. Kinnunen, K.A. Lee and H. Li, “Dimension Reduction of the Modulation Spectrogram for Speaker Verification”, Proc. Odyssey: The Speaker and Language Recognition Workshop, Stellenbosch, South Africa, January 2008. [PDF]
  70. T. Kinnunen, E. Chernenko, M. Tuononen, P. Fränti, H. Li, “Voice Activity Detection Using MFCC Features and Support Vector Machine”, Proc. Speech and Computer 2007 (SPECOM), vol. 2, 556-561, Moscow, Russia, October 2007. [PDF]
  71. K.A. Lee, C. You, H. Li, T. Kinnunen, ”A GMM-based Probabilistic Sequence Kernel for Speaker Verification”, Proc. Interspeech 2007, p. 294-297, Antwerp, Belgium, August 2007.[PDF]
  72. T. Kinnunen, B. Zhang, J. Zhu, Y. Wang, ”Speaker Verification with Adaptive Spectral Subband Centroids”, Proc. Int. Conf. Biometrics, pp. 58-66, Lecture Notes in Computer Science 4642, Seoul, Korea, August 2007. [PDF]
  73. R. Saeidi, R.S. Mohammadi, R. Rodman, T. Kinnunen, "A New Segmentation Algorithm Combined with Transient Frames Power for Text-Independent Speaker Verification", Proc. IEEE Int. Conf. Acoustics, Speech and Signal Processing (ICASSP), Vol. 4, pp. 305-308, Honolulu, Hawaii, April 2007. [PDF]
  74. T. Kinnunen, C.W.E. Koh, L. Wang, H. Li, and E.S. Chng, "Temporal Discrete Cosine Transform: Towards Longer Term Temporal Features for Speaker Verification", Proc. 5th International Symposium on Chinese Spoken Language Processing (ISCSLP'2006), LNAI 4274, pp. 547-558, Singapore, December 2006.[PDF]
  75. T. Kinnunen, V. Hautamäki, P. Fränti, “On the Use of Long-Term Average Spectrum in Automatic Speaker Recognition”, Proc. 5th International Symposium on Chinese Spoken Language Processing (ISCSLP'2006), pp. 559-567, Singapore, December 2006. [PDF]
  76. K.A. Lee, H. Sun, R. Tong, B. Ma, M. Dong, C. You, D. Zhu, C.W.E. Koh, L. Wang, T. Kinnunen, E.S. Chng and H. Li, "The IIR Submission to CSLP 2006 Speaker Recognition Evaluation", Proc. 5th International Symposium on Chinese Spoken Language Processing (ISCSLP'2006), LNAI 4274, pp. 494-505, Singapore, December 2006. [PDF]
  77. R. Tong, B. Ma, K.A. Lee, C. You, D. Zhou, T. Kinnunen, H. Sun, M. Dong, E.S. Chng, H. Li, “The IIR NIST 2006 Speaker Recognition System: Fusion of Acoustic and Tokenization Features”, Proc. 5th International Symposium on Chinese Spoken Language Processing (ISCSLP'2006), pp. 566-577, Singapore, December 2006. [PDF]
  78. T. Kinnunen, "Joint Acoustic-Modulation Frequency for Speaker Recognition", Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP2006), Vol. I, pp. 665-668, May 14-19, 2006, Toulouse, France. [PDF]
  79. T. Kinnunen, R. Gonzalez-Hautamäki, "Long-Term F0 Modeling for Text-Independent Speaker Recognition", Proc. Int. Conf. on Speech and Computer (SPECOM'2005), pp. 567-570, Patras, Greece, October 2005. [PDF]
  80. J. Saastamoinen, Z. Fiedler, T. Kinnunen and P. Fränti, "On factors affecting MFCC-based speaker recognition accuracy", Proc. Int. Conf. on Speech and Computer (SPECOM'05), pp. 503-506, Patras, Greece, October 2005 [PDF].
  81. H. Gupta, V. Hautamäki, T. Kinnunen and P. Fränti, "Field evaluation of text-dependent speaker recognition in an access control application", Proc. Int. Conf. on Speech and Computer (SPECOM'05), pp. 551-554, Patras, Greece, October 2005. [PDF]
  82. O. Grebenskaya, T. Kinnunen, P. Fränti, “Speaker Clustering in Speech Recognition”, Proc. 2005 Finnish Signal Processing Symposium (FINSIG’05), pp. 46-49, Kuopio, Finland, August 25, 2005. [PDF]
  83. V. Hautamäki, S. Cherednichenko, I. Kärkkäinen, T. Kinnunen, P. Fränti, “Improving K-Means by Outlier Removal”, Proc. 14th Scandinavian Conference on Image Analysis (SCIA 2005), Lecture Notes in Computer Science 3540, pp. 978-987, Joensuu, Finland, June 19-22, 2005.
  84. R. Bednarik, T. Kinnunen, A. Mihaila, P. Fränti, “Eye-Movements as a Biometric”, Proc. 14th Scandinavian Conference on Image Analysis (SCIA 2005), Lecture Notes in Computer Science 3540, pp. 780-789, Joensuu, Finland, June 19-22, 2005. [PDF]
  85. T. Niemi-Laitinen, J. Saastamoinen, T. Kinnunen, P. Fränti, “Applying MFCC-Based Automatic Speaker Recognition to GSM and Forensic Data”, Proc. Human Language Technologies (HLT’2005), p. 317-322, Tallinn, Estonia, April 4-5, 2005 [PDF]
  86. T. Kinnunen, E. Karpov, P. Fränti, “Efficient Online Cohort Selection Method for Speaker Verification”, Proc. 8th Int. Conf. on Spoken Language Processing (ICSLP 2004), Vol. III, pp. 2401-2402, Jeju Island, Korea, Oct. 4-8, 2004 [PDF].
  87. T. Kinnunen, E. Karpov, P. Fränti, “Real-Time Speaker Identification”, Proc. 8th Int. Conf. on Spoken Language Processing (ICSLP'2004), Vol. III, pp. 1805-1808, Jeju Island, Korea, Oct. 4-8, 2004. [PDF]
  88. T. Kinnunen, V. Hautamäki, P. Fränti, “Fusion of Spectral Feature Sets for Accurate Speaker Identification”, Proc. 9th International Conference Speech and Computer (SPECOM'2004), pp. 361-365, St. Petersburg, Russia, September 20-22, 2004. [PDF]
  89. E. Karpov, T. Kinnunen, P. Fränti, "Symmetric Distortion Measure for Speaker Recognition",  Proc. 9th International Conference Speech and Computer (SPECOM'2004), pp. 366-370, St. Petersburg, Russia, September 20-22, 2004. [PDF]
  90. T. Kinnunen, V. Hautamäki, P. Fränti, "On the Fusion of Dissimilarity-Based Classifiers for Speaker Identification", Proc. 8th European Conference on Speech Communication and Technology (Eurospeech 2003), pp. 2641-2644, Geneva, Switzerland, September 1-4, 2003. [PDF]
  91. T. Kinnunen, E. Karpov, P. Fränti: "A Speaker Pruning Algorithm for Real-Time Speaker Identification", Proc. 4th International Conference on Audio- and Video-Based Biometric Person Authentication (AVBPA 2003), pp. 639-646, Guildford, UK, June 9-11, 2003. [PDF]
  92. T. Kinnunen: "Designing a Speaker-Discriminative Adaptive Filter Bank for Speaker Recognition", Proc. 7th International Conference on Spoken Language Processing (ICSLP 2002), pp. 2325-2328, Denver, Colorado, USA, September 16-20, 2002. [PDF]
  93. T. Kinnunen, I. Kärkkäinen: "Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification", Proc. Joint IAPR International Workshop on Statistical Pattern Recognition (S+SPR 2002), pp. 681-688, Windsor, Canada, August 6-9, 2002. [PDF]
  94. T. Kinnunen, I. Kärkkäinen, P. Fränti: "Is Speech Data Clustered? - Statistical Analysis of Cepstral Features", Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001), vol. 4, pp. 2627-2630, Aalborg, Denmark, September 3-7, 2001. [PDF]
  95. T. Kinnunen, P. Fränti: "Speaker Discriminative Weighting Method for VQ-Based Speaker Identification", Proc. 3rd International Conference on audio-and video-based biometric person authentication (AVBPA 2001), pp. 150-156, Halmstad, Sweden, June 6-8, 2001. [PDF]
  96. T. Kinnunen, T. Kilpeläinen, P. Fränti: "Comparison of Clustering Algorithms in Speaker Identification", Proc. IASTED Int. Conf. Signal Processing and Communications (SPC 2000), pp. 222-227, Marbella, Spain, September 19-22, 2000. [PDF]

Theses:

Other publications