doctor with hat

Tomi Kinnunen, PhD, Docent

Researcher in speech technology, specialized in speaker and language recognition

Speech and Image Processing Unit (SIPU)
School of Computing
University of Eastern Finland
P.O. Box 111, FI-80101 Joensuu, FINLAND

E-mail: tkinnu@cs.uef.fi



PROGRAM CODE:
DATA:


PUBLICATIONS:

Refereed journal papers:
  1. M. Sahidullah and T. Kinnunen, Local Spectral Variability Features for Speaker Verification”, Digital Signal Processing (to appear) [PDF] [Program code]
  2. H. Behravan, V. Hautamäki, S. M. Siniscalchi, T. Kinnunen, C.-H. Lee,  “i-Vector Modeling of Speech Attributes for Automatic Foreign Accent Recognition”, IEEE/ACM Transactions on Audio, Speech and Language Processing, 24(1): 29---41, January 2016 [PDF]
  3. R. Gonzalez Hautamäki, T. Kinnunen, V. Hautamäki and A.-M. Laukkanen, “Automatic versus Human Speaker Verification: the Case of Voice Mimicry”, Speech Communication, 72: 13--31, September 2015 [PDF]
  4. A. Sizov, E. Khoury, T. Kinnunen, Z. Wu and S. Marcel, Joint Speaker Verification and Anti-Spoofing in the i-Vector Space”, IEEE Transactions on Information Forensics and Security, 10(4): 821--832, April 2015 [PDF] [i-vector data (hosted at IDIAP)] [Code (hosted at IDIAP)]
  5. Z. Wu, N. Evans, T. Kinnunen, J. Yamagishi, F. Alegre, H. Li, “Spoofing and Countermeasures for Speaker Verification: a Survey”, Speech Communication, 66: 130--153, February 2015 [PDF]
  6. H. Behravan, V. Hautamäki, T. Kinnunen, “Factors Affecting i-Vector Based Foreign Accent Recognition: a Case Study in Spoken Finnish”, Speech Communication, 66: 118--129, February 2015 [PDF]
  7. C. Hanilçi and T. Kinnunen, “Source Cell-Phone Recognition from Recorded Speech Using Non-Speech Segments”, Digital Signal Processing, 35: 75--85, December 2014 [PDF]
  8. J. Pohjalainen, C. Hanilçi, T. Kinnunen, P. Alku, Mixture Linear Prediction in Speaker Verification Under Vocal Effort Mismatch”, IEEE Signal Processing Letters, 21(12): 1516--1520, December 2014 [PDF] [MATLAB CODE]
  9. P. Rajan, A. Afanasyev, V. Hautamäki, T. Kinnunen, From Single to Multiple Enrollment i-Vectors: Practical PLDA Scoring Variants for Speaker Verification, Digital Signal Processing, 31: 93--10, August 2014 [PDF]
  10. V. Hautamäki, T. Kinnunen, F. Sedlak, K.A. Lee, B. Ma, H. Li, “Sparse Classifier Fusion for Speaker Verification”, IEEE Transactions on Audio, Speech and Language Processing, 21(8): 1622--1631, August 2013 [PDF]
  11. O. Schleusing, T. Kinnunen, B. Story, J.-M. Vesin, “Joint Source-Filter Optimization for Accurate Vocal Tract Estimation Using Differential Evolution”, IEEE Transactions on Audio, Speech and Language Processing, 21(8): 1560--1572, August 2013 [PDF]
  12. Md. J. Alam, T. Kinnunen, P. Kenny, P. Ouellet, D. O'Shaughnessy, “Multitaper MFCC and PLP Features for Speaker Verification Using i-Vectors”, Speech Communication, 55(2): 237--251, February 2013  [PDF]
  13. Z. Wu, T. Kinnunen, E.S. Chng, H. Li, “Mixture of Factor Analyzers Using Priors from Non-Parallel Speech for Voice Conversion”, IEEE Signal Processing Letters, 19(12): 914--917, December 2012 [PDF]
  14. P. Mowlaee, R. Saeidi, M.G. Christensen, Z.-H. Tan, T. Kinnunen, P. Fränti, S.H. Jensen, “A Joint Approach for Single-Channel Speaker Identification and Speech Separation”, IEEE Transactions on Audio, Speech and Language Processing, 20(9): 2586--2601, November 2012 [PDF] [supplementary audio material].
  15. T. Kinnunen, R. Saeidi, F. Sedlak, K.A. Lee, J. Sandberg, M. Hansson-Sandsten, H. Li,  “Low-Variance Multitaper MFCC Features: a Case Study in Robust Speaker Verification”, IEEE Transactions on Audio, Speech and Language Processing, 20(7): 1990--2001, September 2012 [PDF][Multitaper Matlab code].
  16. C. Hanilçi, T. Kinnunen, F. Ertas, R. Saeidi, J. Pohjalainen, P. Alku, “Regularized All-Pole Models for Speaker Verification Under Noisy Environments”, IEEE Signal Processing Letters 19(3), 163--166, March 2012 [PDF]. Find also extended analysis and the RLP program codes from Odyssey 2012 version.
  17. T. Kinnunen, I. Sidoroff, M. Tuononen, P. Fränti, “Comparison of Clustering Methods: a Case Study of Text-Independent Speaker Modeling”, Pattern Recognition Letters32(13): 1604--1617, October 2011 [PDF]
  18. K.A. Lee, C.H. You, H. Li, T. Kinnunen, K.C. Sim, “Using Discrete Probabilities with Bhattacharyya Measure for SVM-based Speaker Verification”, IEEE Transactions on Audio, Speech and Language Processing, 19(4): 861--870, May 2011 [PDF].
  19. R. Saeidi, J. Pohjalainen, T. Kinnunen, P. Alku, “Temporally Weighted Linear Prediction Features for Tackling Additive Noise in Speaker Verification”, IEEE Signal Processing Letters, 17(6), pp. 599--602, June 2010 [PDF].
  20. J. Sandberg, M. Hansson-Sandsten, T. Kinnunen, R. Saeidi, P. Flandrin, P. Borgnat, “Multitaper Estimation of Frequency-Warped Cepstra with Application to Speaker Verification”, IEEE Signal Processing Letters, 17(4): 343--346, April 2010. [PDF][Multitaper Matlab code]
  21. T. Kinnunen and H. Li, An Overview of Text-Independent Speaker Recognition: from Features to Supervectors”, Speech Communication 52(1): 12--40, January 2010 [PDF].
  22. T. Kinnunen, J. Saastamoinen, V. Hautamäki, M. Vinni, P. Fränti, Comparative Evaluation of Maximum a Posteriori Vector Quantization and Gaussian Mixture Models in Speaker Verification”, Pattern Recognition Letters 30(4): 341--347, March 2009. [PDF]
  23. V. Hautamäki, T. Kinnunen and P. Fränti, Text-Independent Speaker Recognition Using Graph Matching, Pattern Recognition Letters, 29(9): 1427--1432, 2008. [PDF]
  24. V. Hautamäki,  T. Kinnunen, I. Kärkkäinen, M. Tuononen, J. Saastamoinen, P. Fränti, “Maximum a Posteriori Estimation of the Centroid Model for Speaker Verification, IEEE Signal Processing Letters, 15: 162--165, 2008. [PDF]
  25. T. Kinnunen, E. Karpov, P. Fränti, Real-Time Speaker Identification and Verification, IEEE Transactions on Audio, Speech and Language Processing, 14(1): 277--288, Jan 2006. [PDF
 Refereed conference papers:
  1. R. Gonzalez Hautamäki, M. Sahidullah, T. Kinnunen, V. Hautamäki, ”Age-related voice disguise and its impact on speaker verification accuracy”, Proc. Odyssey: the Speaker and Language Recognition Workshop, 2016 (to appear) [PDF]
  2. A. Sizov, K.A. Lee, T. Kinnunen, ”Discriminating languages in a probabilistic latent subspace”, Proc. Odyssey: the Speaker and Language Recognition Workshop, 2016 (to appear) [PDF]
  3. A. H. Poorjam, R. Saeidi, T. Kinnunen, V. Hautamäki, ”Incorporating uncertainty as a quality measure in i-vector based language recognition”, Proc. Odyssey: the Speaker and Language Recognition Workshop, 2016 (to appear) [PDF]
  4. H. Behravan, T. Kinnunen, V. Hautamäki, ”Out-of-set i-vector selection for open-set language identification”, Proc. Odyssey: the Speaker and Language Recognition Workshop, 2016 (to appear) [PDF]
  5. A. Sholokhov, T. Kinnunen, S. Cumani, ”Discriminative multi-domain PLDA for speaker verification”, Proc. ICASSP 2016 (to appear) [PDF]
  6. Z. Wu, T. Kinnunen, N. Evans, J. Yamagishi, C. Hanilçi, M. Sahidullah, A. Sizov, ”ASVspoof 2015: the first automatic speaker verification spoofing and countermeasures challenge”, Proc. Interspeech 2015, pp. 2037--2041, Dresden, Germany, September 2015 [PDF]
  7. M. Sahidullah, T. Kinnunen, C. Hanilçi, ”A comparison of features for synthetic speech detection”, Proc. Interspeech 2015, pp. 2087--2091, Dresden, Germany, September 2015 [PDF]
  8. C. Hanilçi, T. Kinnunen,  M. Sahidullah, A. Sizov, ”Classifiers for synthetic speech detection: a comparison”, Proc. Interspeech 2015, pp. 2057--2061, Dresden, Germany, September 2015 [PDF]
  9. R. Saeidi, T. Niemi, H. Karppelin, J. Pohjalainen, T. Kinnunen, P. Alku, ”Speaker recognition for speech under face cover”, Proc. Interspeech 2015, pp. 1012--1016, Dresden, Germany, September 2015 [PDF]
  10. A. Fedorova, O. Glembek, T. Kinnunen, P. Matějka, ”Exploring ANN back-ends for i-vector based speaker age estimation”, Proc. Interspeech 2015, pp. 3036--3040, Dresden, Germany, September 2015 [PDF]
  11. E. Khoury, T. Kinnunen, A. Sizov, Z. Wu, S. Marcel, ”Introducing i-vectors for joint anti-spoofing and speaker verification”, Proc. Interspeech 2014, pp. 61--65, Singapore, September 2014 [PDF]
  12. H. Behravan, V. Hautamäki, S.M. Siniscalchi, E. Khoury, T. Kurki, T. Kinnunen, C.-H. Lee, ”Dialect levelling in Finnish: a universal speech attribute approach”, Proc. Interspeech 2014, pp. 2165--2169, Singapore, September 2014 [PDF]
  13. A. Sizov, K.A. Lee, T. Kinnunen, ”Unifying probabilistic linear discriminant analysis variants in biometric authentication”, Proc. Joint Int. Workshop on Structural, Syntactic, and Statistical Pattern Recognition (S+SSPR 2014), pp. 464--475, Joensuu, Finland, August 2014 [PDF] [Fast PLDA implementation]
  14. V. Hautamäki, A. Pöllänen, T. Kinnunen, K.A. Lee, H. Li and P. Fränti, ”A comparison of categorical attribute data clustering methods”, Proc. Joint Int. Workshop on Structural, Syntactic, and Statistical Pattern Recognition (S+SSPR 2014), pp. 53--62, Joensuu, Finland, August 2014 [PDF]
  15. R. Gonzalez Hautamäki, T. Kinnunen, V. Hautamäki, A.-M. Laukkanen, ”Comparison of human listeners and speaker verification systems using voice mimicry data”, Proc. Odyssey 2014: The Speaker & Language Recognition Workshop, pp. 137--144, Joensuu, Finland, June 2014 [PDF]
  16. C.S. Greenberg, D. Bansé, G.R. Doddington, D. Garcia-Romero, J. J. Godfrey, T. Kinnunen, A.F. Martin, A. McCree, M. Przybocki, D.A. Reynolds, ”The NIST 2014 speaker recognition i-vector machine learning challenge”, Proc. Odyssey 2014: The Speaker & Language Recognition Workshop, pp. 224--230, Joensuu, Finland, June 2014. [PDF]
  17. H. Behravan, V. Hautamäki, S.M. Siniscalchi, T. Kinnunen, C.-H. Lee, ”Introducing attribute features to foreign accent recognition”, Proc. ICASSP 2014, pp. 5332--5336, Florence, Italy, May 2014 [PDF]
  18. A. Sholokhov, T. Pekhovsky, O. Kudashev, A. Shulipa, T. Kinnunen, ”Bayesian analysis of similarity matrices for speaker diarization”, Proc. ICASSP 2014, pp. 106--110, Florence, Italy, May 2014 [PDF]
  19. Z. Wu, T. Virtanen, T. Kinnunen, E.S. Chng, H. Li, ”Exemplar-based voice conversion using non-negative spectrogram deconvolution”, Proc. 8th ISCA Speech Synthesis Workshop (SSW'13), pp. 201--206, Barcelona, Spain, September 2013. [PDF]
  20. T. Kinnunen, Md. J. Alam, P. Matejka, P. Kenny, J. Cernocky, D. O'Shaughnessy, ”Frequency warping and robust speaker verification: a comparison of alternative mel-scale representations”, Proc. Interspeech 2013, pp. 3122--3126, Lyon, France, August 2013 [PDF]
  21. C. Hanilci, T. Kinnunen, P. Rajan, J. Pohjalainen, P. Alku, F. Ertas, Comparison of spectrum estimators in speaker verification: mismatch conditions induced by vocal effort”, Proc. Interspeech 2013, pp. 2881--2885, Lyon, France, August 2013 [PDF]
  22. P. Rajan, T. Kinnunen, C. Hanilci, J. Pohjalainen, P. Alku, ”Using group delay functions from all-pole models for speaker recognition”, Proc. Interspeech 2013, pp. 2489--2493, Lyon, France, August 2013  [PDF]
  23. Z. Wu, A. Larcher, K.A. Lee, E.S. Chng, T. Kinnunen, H. Li, ”Vulnerability evaluation of speaker verification under voice conversion spoofing: the effect of text constraints”, Proc. Interspeech 2013, pp. 950--954, Lyon, France, August 2013. [PDF]
  24. Z. Wu, T. Virtanen, T. Kinnunen, E.S. Chng, H. Li, ”Exemplar-based unit selection for voice conversion utilizing temporal information”, Proc. Interspeech 2013, pp. 3057--3061, Lyon, France, August 2013 [PDF]
  25. V. Hautamäki, K.A. Lee, D. van Leeuwen, R. Saeidi, A. Larcher, T. Kinnunen, T. Hasan, S.O. Sadjadi, G. Liu, H. Boril, J.H.L. Hansen, B. Fauve, "Automatic Regularization of Cross-entropy Cost for Speaker Recognition Fusion", Proc. Interspeech 2013, pp. 1609--1613, Lyon, France, August 2013. [PDF]
  26. N. Evans, T. Kinnunen, J. Yamagishi, ”Spoofing and countermeasures for automatic speaker verification”, Proc. Interspeech 2013, pp. 925--929, Lyon, France, August 2013 [PDF]
  27. R. Gonzalez Hautamäki, T. Kinnunen, V. Hautamäki, T. Leino, A.-M. Laukkanen, ”I-vectors meet imitators: on vulnerability of speaker verification systems against voice mimicry”, Proc. Interspeech 2013, pp. 930--934, Lyon, France, August 2013 [PDF]
  28. R. Gonzalez Hautamäki, V. Hautamäki, P. Rajan and T. Kinnunen, ”Merging human and automatic system decisions to improve speaker recognition performance”, Proc. Interspeech 2013, pp. 2519--2523, Lyon, France, August 2013 [PDF]
  29. R. Saeidi, K. A. Lee, T. Kinnunen, T. Hasan, B. Fauve, P. -M. Bousquet, E. Khoury,P. L. Sordo Martinez, J. M. K. Kua, C. H. You, H. Sun, A. Larcher, P. Rajan, V. Hautamäki, C. Hanilci, B. Braithwaite, R. Gonzales-Hautamäki, S. O. Sadjadi, G. Liu, H. Boril, N. Shokouhi, D. Matrouf, L. El Shafey, P.Mowlaee, J. Epps, T. Thiruvaran, D. A. van Leeuwen, B. Ma, H. Li, J. H. L. Hansen, J.-F. Bonastre, S. Marcel, J. Mason, E. Ambikairajah, "I4U submission to NIST SRE 2012: A large-scale collaborative effort for noise-robust speaker verification", Proc. Interspeech 2013, pp. 1986--1990, Lyon, France, August 2013. [PDF]
  30. H. Behravan, V. Hautamäki, T. Kinnunen, ”Foreign Accent Detection from Spoken Finnish Using i-Vectors”, Proc. Interspeech 2013, pp. 79--83, Lyon, France, August 2013 [PDF]
  31. P. Rajan, T. Kinnunen, V. Hautamäki, ”Effect of multicondition training on i-vector PLDA configurations for speaker recogntion”, Proc. Interspeech 2013, pp. 3694--3697, Lyon, France, August 2013 [PDF]
  32. T. Kinnunen, P. Rajan, ”A practical, self-adaptive voice activity detector for speaker verification with noisy telephone and microphone data”,  Proc. Int. Conf. on Acoustics,
    Speech and Signal Processing (ICASSP 2013), pp. 7229--7233, Vancouver, Canada, May 2013. [PDF] [MATLAB CODE]
  33. C. Hanilci, T. Kinnunen, R. Saeidi, J. Pohjalainen, P. Alku, F. Ertas, ”Speaker Identification From Shouted Speech: Analysis and Compensation”, Proc. Int. Conf. on Acoustics, Speech and
    Signal Processing (ICASSP 2013), pp. 8027--8031, Vancouver, Canada, May 2013 [PDF]
  34. Z. Wu, T. Kinnunen, E.S. Chng, H. Li, E. Ambikairajah, ”A Study on spoofing attack in state-of-the-art speaker verification: the telephone speech case”, Proc. 2012 Asia-Pacific Signal & Information Processing Association Annual Summit and Conference (APSIPA ASC 2012), pp. 1--5, Hollywood, USA, December 2012 [PDF] (BEST PAPER AWARD)
  35. T. Kinnunen, R. Saeidi, J. Leppänen, J.P. Saarinen, ”Audio context recognition in variable mobile environments from short segments using speaker and language recognizers”, Proc. Odyssey: the speaker and language recognition workshop, Singapore, June 2012. [PDF]
  36. C. Hanilci, T. Kinnunen, R. Saeidi, J. Pohjalainen, P. Alku, F. Ertas, ”Regularization of all-pole models for speaker verification under additive noise”, Proc. Odyssey: the speaker and language recognition workshop, Singapore, June 2012. [PDF]
  37. V. Hautamäki, K.A. Lee, A. Larcher, T. Kinnunen, B. Ma, H. Li, ”Variational Bayes logistic regression as regularized fusion for NIST SRE 2010”, Proc. Odyssey: the speaker and language recognition workshop, Singapore, June 2012. [PDF]
  38. T. Kinnunen, H. Leisma, M. Machunik, T. Kakkonen, J.-L. Lebrun, SWAN - Scientific Writing AssistaNt. A Tool for Helping Scholars to Write Reader-Friendly Manuscripts”, demonstrator in the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2012), Avignon, France, April 2012. [PDF]
  39. T. Kinnunen, Z.-Z. Wu, K. A. Lee, F. Sedlak, E. S. Chng, H. Li, “Vulnerability of Speaker Verification Systems Against Voice Conversion Spoofing Attacks: the Case of Telephone Speech”, Proc. ICASSP 2012, pp. 4401--4404, Kyoto, Japan, March 2012  [PDF].
  40. S. Siddiq, T. Kinnunen, M. Vainio, S. Werner, “Intonational Speaker Verification: a Study on Parameters and Performance Under Noisy Conditions”, Proc. ICASSP 2012, pp. 4777--4780, Kyoto, Japan, March 2012 [PDF].
  41. C. Hanilci, T. Kinnunen, R. Saeidi, J. Pohjalainen, P. Alku, F. Ertas, J. Sandberg, M. Hansson-Sandsten, “Comparing Spectrum Estimators in Speaker Verification Under Additive Noise”,  Proc. ICASSP 2012, pp. 4769--4772, Kyoto, Japan, March 2012 [PDF].
  42. J. Alam, T. Kinnunen, P. Kenny, P. Ouellet, D. O'Shaughnessy, “Multitaper MFCC Features for Speaker Verification Using i-Vectors”, Proc. IEEE Automatic Speech Recognition and Understanding (ASRU 2011), pp. 547--552, Hawaii, December 2011. [PDF] [Multitaper Matlab code]
  43. J. Rodriguez-Fuentes, M. Penagarikano, A. Varona, M. Diez, G. Bordel, D. Martinez, J. Villalba, A. Miguel, A. Ortega, E. Lleida, A. Abad, O. Koller, I. Trancoso, P. Lopez-Otero, L. Docio-Fernandez, C. Garcia-Mateo, R. Saeidi, M. Soufifar, T. Kinnunen, T. Svendsen, P. Fränti, “Multi-Site Heterogenous System Fusions for the Albayzin 2010 Language Recognition Evaluation”, Proc. IEEE Automatic Speech Recognition and Understanding (ASRU 2011), pp. 377--382, Hawaii, December 2011. [PDF]
  44. Evgeny Karpov, Zaur Nasibov, Tomi Kinnunen, Pasi Fränti, "Combining voice activity detectors using decision fusion", Proc. Speech and Computer (SPECOM 2011), Kazan, Russia, September 2011, pp. 278--283.
  45. V. Hautamäki, K.A. Lee, T. Kinnunen, B. Ma, H. Li, “Regularized Logistic Regression Fusion for Speaker Verification”, Proc. Interspeech 2011, Florence, Italy, pp. 2745-2748, August 2011 [PDF]
  46. P. Mowlaee, R. Saeidi, Z.-H. Tan, M.G. Christensen, T. Kinnunen, P. Fränti, S.H. Jensen, “Sinusoidal Approach for the Single-Channel Speech Separation and Recognition Challenge”, Proc. Interspeech 2011, Florence, Italy, August 2011, pp. 677-680. [PDF]
  47. F. Sedlak, T. Kinnunen, V. Hautamäki, K.A. Lee, H. Li, Classifier Subset Selection and Fusion for Speaker Verification”, Proc. ICASSP 2011, pp. 4544--4547, Prague, Czech Republic, May 2011. [PDF] [video and slides]
  48. J. Pohjalainen, P. Alku, T. Kinnunen, Shout Detection in Noise”, Proc. ICASSP 2011, pp. 4968--4971, Prague, Czech Republic, May 2011. [PDF]
  49. T. Kinnunen, R. Saeidi, J. Sandberg, M. Hansson-Sandsten, What Else is New Than the Hamming Window? Robust MFCCs for Speaker Recognition via Multitapering”,  Proc. Interspeech 2010, pp. 2734--2737, Makuhari, Japan, Sept. 2010. [PDF] [Multitaper Matlab code]
  50. J. Pohjalainen, R. Saeidi, T. Kinnunen, P. Alku,  Extended Weighted Linear Prediction (XLP) Analysis of Speech and its Application to Speaker Verification in Adverse Conditions”, Proc. Interspeech 2010, pp. 1477--1480, Makuhari, Japan, Sept. 2010. [PDF]
  51. Z.-Z. Wu, T. Kinnunen, E.S. Chng, H. Li, Text-Independent F0 Transformation with Non-Parallel Data for Voice Conversion”, Proc. Interspeech 2010, pp. 1732--1735, Makuhari, Japan, Sept. 2010. [PDF]
  52. V. Hautamäki, T. Kinnunen, M. Nosratighods, K.A. Lee, B. Ma, H. Li, “Approaching Human Listener Accuracy with Modern Speaker Verification”, Proc Interspeech 2010, pp. 1473--1476, Makuhari, Japan, Sept. 2010. [PDF]
  53. R. Saeidi, P. Mowlaee, T. Kinnunen, Z.-H. Tan, M.G. Christensen, S.H. Jensen, P. Fränti, Improving Monaural Speaker Identification by Double-Talk Detection”, Proc Interspeech 2010, pp. 1069--1072, Makuhari, Japan, Sept. 2010. [PDF]
  54. R. Saeidi, J. Pohjalainen, T. Kinnunen, P. Alku,  Temporally Weighted Linear Prediction Features for Speaker Verification in Additive Noise”, Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, pp. 40-46, June 2010. [PDF]
  55. R. Saeidi, P. Mowlaee, T. Kinnunen, Z. H Tan, M. G. Christensen, S. H. Jensen and P. Fränti, Signal-to-signal ratio independent speaker identification for co-channel speech signals, Int. Conf. on Pattern Recognition (ICPR 2010), pp. 4545--456, Istanbul, Turkey, August 2010. [PDF]
  56. K. A. Lee, H. Li, C. H. You, T. Kinnunen, K. C. Sim, “Discrete Expected Likelihood Kernel for SVM-Based Speaker Verification”, Proc. 18th European Signal Processing Conference (EUSIPCO 2010), pp. 591--595, Aalborg, Denmark, August 2010 [PDF]
  57. T. Kinnunen, F. Sedlak, R. Bednarik, “Towards Task-Independent Person Authentication Using Eye Movement Signals”, Proc. of the 2010 Symposium on Eye-Tracking Research and Applications (ETRA 2010), pp. 187--190, Austin, Texas, March 2010. [PDF]
  58. R. Saeidi, T. Kinnunen, H.R.S. Mohammadi, R. Rodman, P. Fränti, “Joint Frame and Gaussian Selection for Text-Independent Speaker Verification, Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2010), pp. 4530--4533, Dallas, Texas, USA, March 2010. [PDF]
  59. T. Kinnunen and P. Alku, “On separating glottal source and vocal tract information in telephony speaker verification, Proc. Int. conference on acoustics, speech, and signal processing (ICASSP 2009), pp. 4545--4548, Taipei, Taiwan, April 2009. [PDF]
  60. T. Kinnunen, J. Saastamoinen, V. Hautamäki, M. Vinni and P. Fränti, Comparing maximum a posteriori vector quantization and Gaussian mixture models in speaker verification”, Proc. Int. conference on acoustics, speech, and signal processing (ICASSP 2009) , pp. 4229--4232, Taipei, Taiwan, April 2009. [PDF]
  61. P. Fränti, J. Saastamoinen, I. Kärkkäinen, T. Kinnunen, V. Hautamäki, I. Sidoroff, “Developing Speaker Recognition System:  from Prototype to Practical Application”,  Int. Conf. Forensic Applications and Techniques in Telecommunications, Information and Multimedia (e-Forensics'09), Adelaide, Australia, LNICST vol. 8, 101-114, January 2009.
  62. K.A. Lee, C. You, H. Li, T. Kinnunen and D. Zhu, Characterizing Speech Utterances for Speaker Verification with Sequence Kernel SVM, Proc. Interspeech 2008 , pp. 1397-1400, Brisbane, Australia, 2008. [PDF]
  63. T. Kinnunen, K.A. Lee and H. Li, Dimension Reduction of the Modulation Spectrogram for Speaker Verification, Proc. Odyssey: The Speaker and Language Recognition Workshop, Stellenbosch, South Africa, January 2008. [PDF]
  64. T. Kinnunen, E. Chernenko, M. Tuononen, P. Fränti, H. Li, “Voice Activity Detection Using MFCC Features and Support Vector Machine”, Proc. Speech and Computer 2007 (SPECOM), vol. 2, 556-561, Moscow, Russia, October 2007. [PDF]
  65. K.A. Lee, C. You, H. Li, T. Kinnunen, ”A GMM-based Probabilistic Sequence Kernel for Speaker Verification”, Proc. Interspeech 2007, p. 294-297, Antwerp, Belgium, August 2007.[PDF]
  66. T. Kinnunen, B. Zhang, J. Zhu, Y. Wang, ”Speaker Verification with Adaptive Spectral Subband Centroids”, Proc. Int. Conf. Biometrics, pp. 58-66, Lecture Notes in Computer Science 4642, Seoul, Korea, August 2007. [PDF]
  67. R. Saeidi, R.S. Mohammadi, R. Rodman, T. Kinnunen, "A New Segmentation Algorithm Combined with Transient Frames Power for Text-Independent Speaker Verification", Proc. IEEE Int. Conf. Acoustics, Speech and Signal Processing (ICASSP), Vol. 4, pp. 305-308, Honolulu, Hawaii, April 2007. [PDF]
  68. T. Kinnunen, C.W.E. Koh, L. Wang, H. Li, and E.S. Chng, "Temporal Discrete Cosine Transform: Towards Longer Term Temporal Features for Speaker Verification", Proc. 5th International Symposium on Chinese Spoken Language Processing (ISCSLP'2006), LNAI 4274, pp. 547-558, Singapore, December 2006.[PDF]
  69. T. Kinnunen, V. Hautamäki, P. Fränti, “On the Use of Long-Term Average Spectrum in Automatic Speaker Recognition” , Proc. 5th International Symposium on Chinese Spoken Language Processing (ISCSLP'2006), pp. 559-567, Singapore, December 2006. [PDF]
  70. K.A. Lee, H. Sun, R. Tong, B. Ma, M. Dong, C. You, D. Zhu, C.W.E. Koh, L. Wang, T.Kinnunen, E.S. Chng and H. Li, "The IIR Submission to CSLP 2006 Speaker Recognition Evaluation", Proc. 5th International Symposium on Chinese Spoken Language Processing (ISCSLP'2006), LNAI 4274, pp. 494-505, Singapore, December 2006. [PDF]
  71. R. Tong, B. Ma, K.A. Lee, C. You, D. Zhou, T. Kinnunen, H. Sun, M. Dong, E.S. Chng, H. Li, The IIR NIST 2006 Speaker Recognition System: Fusion of Acoustic and Tokenization Features”, Proc. 5th International Symposium on Chinese Spoken Language Processing (ISCSLP'2006), pp. 566-577, Singapore, December 2006. [PDF]
  72. T. Kinnunen, "Joint Acoustic-Modulation Frequency for Speaker Recognition", Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP2006), Vol. I, pp. 665-668, May 14-19, 2006, Toulouse, France. [PDF]
  73. T. Kinnunen, R. Gonzalez-Hautamäki, Long-Term F0 Modeling for Text-Independent Speaker Recognition, Proc. Int. Conf. on Speech and Computer (SPECOM'2005), pp. 567-570, Patras, Greece, October 2005. [PDF]
  74. J. Saastamoinen, Z. Fiedler, T. Kinnunen and P. Fränti, "On factors affecting MFCC-based speaker recognition accuracy", Proc. Int. Conf. on Speech and Computer (SPECOM'05), pp. 503-506, Patras, Greece, October 2005 [PDF].
  75.  H. Gupta, V. Hautamäki, T. Kinnunen and P. Fränti, "Field evaluation of text-dependent speaker recognition in an access control application", Proc. Int. Conf. on Speech and Computer (SPECOM'05), pp. 551-554, Patras, Greece, October 2005. [PDF]
  76. O. Grebenskaya, T. Kinnunen, P. Fränti, “Speaker Clustering in Speech Recognition”, Proc. 2005 Finnish Signal Processing Symposium (FINSIG’05), pp. 46-49, Kuopio, Finland, August 25, 2005. [PDF]
  77. V. Hautamäki, S. Cherednichenko, I. Kärkkäinen, T. Kinnunen, P. Fränti, “Improving K-Means by Outlier Removal”, Proc. 14th Scandinavian Conference on Image Analysis (SCIA 2005), Lecture Notes in Computer Science 3540, pp. 978-987, Joensuu, Finland, June 19-22, 2005.
  78. R. Bednarik, T. Kinnunen, A. Mihaila, P. Fränti, “Eye-Movements as a Biometric“, Proc. 14th Scandinavian Conference on Image Analysis (SCIA 2005), Lecture Notes in Computer Science 3540, pp. 780-789, Joensuu, Finland, June 19-22, 2005. [PDF]
  79. T. Niemi-Laitinen, J. Saastamoinen, T. Kinnunen, P Fränti, ”Applying MFCC-Based Automatic Speaker Recognition to GSM and Forensic Data“, Proc. Human Language Technologies (HLT’2005), p. 317-322, Tallinn, Estonia, April 4-5, 2005 [PDF]
  80. T. Kinnunen, E. Karpov, P. Fränti, “Efficient Online Cohort Selection Method for Speaker Verification”, Proc. 8th Int. Conf. on Spoken Language Processing (ICSLP 2004), Vol. III, pp. 2401-2402, Jeju Island, Korea, Oct. 4-8, 2004 [PDF].
  81. T. Kinnunen, E. Karpov, P. Fränti, Real-Time Speaker Identification, Proc. 8th Int. Conf. on Spoken Language Processing (ICSLP'2004), Vol. III, pp. 1805-1808, Jeju Island, Korea, Oct. 4-8, 2004. [PDF]
  82. T. Kinnunen, V. Hautamäki, P. Fränti, Fusion of Spectral Feature Sets for Accurate Speaker IdentificationProc. 9th International Conference Speech and Computer (SPECOM'2004), pp. 361-365, St. Petersburg, Russia, September 20-22, 2004. [PDF]
  83. E. Karpov, T. Kinnunen, P. Fränti, "Symmetric Distortion Measure for Speaker Recognition",  Proc. 9th International Conference Speech and Computer (SPECOM'2004), pp. 366-370, St. Petersburg, Russia, September 20-22, 2004. [PDF]
  84. T. Kinnunen, V. Hautamäki, P. Fränti, "On the Fusion of Dissimilarity-Based Classifiers for Speaker Identification", Proc. 8th European Conference on Speech Communiation and Technology (Eurospeech 2003), pp. 2641-2644, Geneva, Switzerland, September 1-4, 2003. [PDF]
  85. T. Kinnunen, E. Karpov, P. Fränti: "A Speaker Pruning Algorithm for Real-Time Speaker Identification", Proc. 4th International Conference on  Audio- and Video-Based Biometric Person Authentication (AVBPA 2003), pp. 639-646, Guilford, UK, June 9-11, 2003. [PDF]
  86. T. Kinnunen: "Designing a Speaker-Discriminative Adaptive Filter Bank for Speaker Recognition", Proc. 7th International Conference on Spoken Language Processing (ICSLP 2002), pp. 2325-2328, Denver, Colorado, USA, September 16-20, 2002. [PDF]
  87. T. Kinnunen, I. Kärkkäinen: "Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification", Proc. Joint IAPR International Workshop on Statistical Pattern Recognition (S+SPR 2002), pp. 681-688, Windsor, Canada, August 6-9, 2002. [PDF]
  88. T. Kinnunen, I. Kärkkäinen, P. Fränti: "Is Speech Data Clustered? - Statistical Analysis of Cepstral Features", Proc. 7th European Conference on Speech Communication and Technology (Eurospeech 2001)), vol. 4, pp. 2627-2630, Aalborg, Denmark, September 3-7, 2001. [PDF]
  89. T. Kinnunen, P. Fränti: "Speaker Discriminative Weighting Method for VQ-Based Speaker Identification", Proc. 3rd International Conference on audio-and video-based biometric person authentication (AVBPA 2001), pp. 150-156, Halmstad, Sweden, June 6-8, 2001. [PDF]
  90. T. Kinnunen, T. Kilpeläinen, P. Fränti: "Comparison of Clustering Algorithms in Speaker Identification", Proc. IASTED Int. Conf. Signal Processing and Communications (SPC 2000), pp. 222-227, Marbella, Spain, September 19-22, 2000. [PDF]

Theses:

Other publications