Name: Ville Hautamäki, PhD
Occupation: Senior researcher at the University of Eastern Finland.
Currently (2018-2019), co-PI of the consortium project "Deep reinforcement learning for physical agents (DEEPEN)", funded by the Academy of Finland.
Previously (2015), PI of a one-year foreign accent recognition project funded by MATINE, the Finnish Defence Forces.
(2011-2014), PI of a three-year dialect and accent recognition post-doc project funded by the Academy of Finland.
Research interests: Artificial Intelligence, Reinforcement Learning, Machine Learning, Bayesian Inference, Bioinformatics, and Speech Technology in general, with speaker recognition and language recognition in particular.
My Google Scholar profile

Journal publications:

  1. Rosa Gonzalez Hautamäki, Md Sahidullah, Ville Hautamäki and Tomi Kinnunen, "Acoustical and perceptual study of voice disguise by age modification in speaker verification", Speech Communication, 2017 (accepted).
  2. Hamid Behravan, Ville Hautamäki, Sabato Siniscalchi, Tomi Kinnunen, and Chin-Hui Lee, "i-Vector Modeling of Speech Attributes for Automatic Foreign Accent Recognition", IEEE Transactions on Audio, Speech and Language Processing, Vol. 24, No 1, pp. 29-41, January 2016.
  3. Rosa Gonzalez Hautamäki, Tomi Kinnunen, Ville Hautamäki and Anne-Maria Laukkanen, "Automatic versus human speaker verification: the case of voice mimicry", Speech Communication, Vol. 72, pp. 13-31, September, 2015.
  4. Hamid Behravan, Ville Hautamäki, and Tomi Kinnunen, "Factors Affecting i-Vector Based Foreign Accent Recognition: a Case Study in Spoken Finnish", Speech Communication, Vol. 66, pp. 118-129, February, 2015.
  5. Padmanabhan Rajan, Anton Afanasyev, Ville Hautamäki, and Tomi Kinnunen, "From single to multiple enrollment i-vectors: practical PLDA scoring variants for speaker verification", Digital Signal Processing, Vol. 31, pp. 93-101, 2014.
  6. Ville Hautamäki, Tomi Kinnunen, Filip Sedlak, Kong Aik Lee, Bin Ma, Haizhou Li, "Sparse Classifier Fusion for Speaker Verification", IEEE Transactions on Audio, Speech and Language Processing, Vol. 21, No. 8, pp. 1622-1631, August, 2013.
  7. Q. Zhao, V. Hautamäki, I. Kärkkäinen, and P. Fränti, "Random Swap EM algorithm for Gaussian Mixture Models", Pattern Recognition Letters, Vol. 19, No. 12, pp. 914-917, December, 2012. Supplementary material [C++ implementation]
  8. T. Kinnunen, J. Saastamoinen, V. Hautamäki, M. Vinni, and P. Fränti, "Comparative Evaluation of Maximum a Posteriori Vector Quantization and Gaussian Mixture Models in Speaker Verification", Pattern Recognition Letters, Vol. 30, No. 4, pp. 341-347, March, 2009.
  9. V. Hautamäki, T. Kinnunen, and P. Fränti, "Text-Independent Speaker Recognition Using Graph Matching", Pattern Recognition Letters, Vol. 29, No. 9, pp. 1427-1432, July, 2008.
  10. V. Hautamäki, T. Kinnunen, I. Kärkkäinen, J. Saastamoinen, M. Tuononen and P. Fränti, "Maximum a Posteriori Adaptation of the Centroid Model for Speaker Verification", IEEE Signal Processing Letters, Vol. 15, pp. 162-165. 2008.
  11. P. Fränti, O. Virmajoki and V. Hautamäki, "Fast agglomerative clustering using k nearest neighbor graph", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 28, No. 11, pp. 1875-188, November, 2006.
  12. J. Saastamoinen, E. Karpov, V. Hautamäki and P. Fränti, "Accuracy of MFCC based speaker recognition in series 60 device", Journal of Applied Signal Processing, Vol. 17, pp. 2816-2827, September, 2005.

Conference publications:

  1. K. A. Lee, V. Hautamäki, T. Kinnunen, A. Larcher, C. Zhang, A. Nautsch, T. Stafylakis, G. Liu, M. Rouvier, W. Rao, F. Alegre, J. Ma, M. W. Mak, A. K. Sarkar, H. Delgado, R. Saeidi, H. Aronowitz, A. Sizov, H. Sun, T. H. Nguyen, G. Wang, B. Ma, V. Vestman, M. Sahidullah, M. Halonen, A. Kanervisto, G. Le Lan, F. Bahmaninezhad, S. Isadskiy, C. Rathgeb, C. Busch, G. Tzimiropoulos, Q. Qian, Z. Wang, Q. Zhao, T. Wang, H. Li, J. Xue, S. Zhu, R. Jin, T. Zhao, P.-M. Bousquet, M. Ajili, W. B. Kheder, D. Matrouf, Z. H. Lim, C. Xu, H. Xu, X. Xiao, E. S. Chng, B. Fauve, K. Sriskandaraja, V. Sethu, W. W. Lin, D. A. L. Thomsen, Z.-H. Tan, M. Todisco, N. Evans, H. Li, J. H. L. Hansen, J.-F. Bonastre, E. Ambikairajah, "The I4U Mega Fusion and Collaboration for NIST Speaker Recognition Evaluation 2016", Proc. Interspeech 2017 (to appear).
  2. Tomi Kinnunen, Md Sahidullah, Mauro Falcone, Luca Costantini, Rosa Gonzalez Hautamäki, Dennis Thomsen, Achintya Sarkar, Zheng-Hua Tan, Hector Delgado, Massimiliano Todisco, Nicholas Evans, Ville Hautamäki and Kong Aik Lee, "RedDots Replayed: A New Replay Spoofing Attack Corpus for Text-dependent Speaker Verification Research", ICASSP 2017 (accepted).
  3. Anssi Kanervisto, Ville Vestman, Md Sahidullah, Ville Hautamäki, Tomi Kinnunen, "Effects of Gender Information in Text-independent and Text-dependent Speaker Verification", ICASSP 2017 (accepted).
  4. Ivan Kukanov, Ville Hautamäki, Sabato Siniscalchi, and Kehuang Li, "Deep learning with Maximal Figure-of-Merit Cost to Advance Multi-label Speech Attribute Detection", SLT, San Diego, USA, December 2016.
  5. Kong Aik Lee, Haizhou Li, Li Deng, Ville Hautamäki, Rao Wei, Xiong Xiao, Anthony Larcher, Hanwu Sun, Trung Hieu Nguyen, Guangsen Wang, Aleksandr Sizov, Jianshu Chen, Ivan Kukanov, Amir Poorjam, Trung Ngo Trong, Cheng-Lin Xu, Haihua Xu, Bin Ma, Eng Siong Chng and Sylvain Meignier, "The 2015 NIST Language Recognition Evaluation: the Shared View of I2R, Fantastic4 and SingaMS", Interspeech, pp. 3211-3215, San Francisco, USA, September 2016.
  6. Tomi Kinnunen, Md Sahidullah, Ivan Kukanov, Hector Delgado, Massimiliano Todisco, Achintya Sarkar, Nicolai Thomsen, Ville Hautamäki, Nicholas Evans and Zheng-Hua Tan, "Utterance Verification for Text-Dependent Speaker Recognition: a Comparative Assessment Using the RedDots Corpus", Interspeech, pp. 430-434, San Francisco, USA, September 2016.
  7. Md Sahidullah, Rosa Gonzalez Hautamäki, Dennis Alexander Lehmann Thomsen, Tomi Kinnunen, Zheng-Hua Tan, Ville Hautamäki, Robert Parts and Martti Pitkanen, "Robust Speaker Recognition with Combined Use of Acoustic and Throat Microphone Speech", Interspeech, pp. 1720-1724, San Francisco, USA, September 2016.
  8. Kristiina Jokinen, Trung Ngo Trong and Ville Hautamäki, "Variation in Spoken North Sami Language", Interspeech, pp. 3299-3303, San Francisco, USA, September 2016.
  9. Trung Ngo Trong, Ville Hautamäki and Kong Aik Lee, "Deep Language: a comprehensive deep learning approach to end-to-end language recognition", Speaker Odyssey, Bilbao, Spain, 2016 (accepted).
  10. Rosa Gonzalez Hautamäki, Md Sahidullah, Tomi Kinnunen and Ville Hautamäki, "Age-Related Voice Disguise and its Impact on Speaker Verification Accuracy", Speaker Odyssey, Bilbao, Spain, 2016 (accepted).
  11. Amir Hossein Poorjam, Rahim Saeidi, Tomi Kinnunen and Ville Hautamäki, "Incorporating uncertainty as a Quality Measure in I-Vector Based Language Recognition", Speaker Odyssey, Bilbao, Spain, 2016 (accepted).
  12. Hamid Behravan, Tomi Kinnunen and Ville Hautamäki, "Out-of-set i-Vector Selection for Open-set Language Identification", Speaker Odyssey, Bilbao, Spain, 2016 (accepted).
  13. Ville Hautamäki, Sabato Siniscalchi, Hamid Behravan, Valerio Mario Salerno and Ivan Kukanov, "Boosting Universal Speech Attributes Classification with Deep Neural Network for Foreign Accent Characterization", Interspeech 2015, pp. 408-412, Dresden, Germany, September 2015.
  14. Hamid Behravan, Ville Hautamäki, Sabato Siniscalchi, Elie Khoury, Tommi Kurki, Tomi Kinnunen, and Chin-Hui Lee, "Dialect Levelling in Finnish: A Universal Speech Attribute Approach", Interspeech 2014, pp. 2165-2169, Singapore, September, 2014.
  15. Ville Hautamäki, Antti Pöllänen, Tomi Kinnunen, Kong Aik Lee, Haizhou Li and Pasi Fränti, "A Comparison of Categorical Attribute Data Clustering Methods", S+SSPR 2014, pp. 53-62, Joensuu, Finland, August, 2014.
  16. Rosa Gonzalez Hautamäki, Tomi Kinnunen, Ville Hautamäki and Anne-Maria Laukkanen, "Comparison of human listeners and speaker verification systems using voice mimicry data", Speaker Odyssey 2014, pp. 137-144, Joensuu, Finland, June, 2014.
  17. Hamid Behravan, Ville Hautamäki, Sabato Siniscalchi, Tomi Kinnunen, and Chin-Hui Lee, "Introducing Attribute Features to Foreign Accent Recognition", ICASSP 2014, pp. 5369-5373, Florence, Italy, May, 2014.
  18. You-Chi Cheng, Ville Hautamäki, Zhen Huang, Kehuang Li, and Chin-Hui Lee, "An I-Vector Based Descriptor for Alphabetical Gesture Recognition", ICASSP 2014, pp. 6643-6647, Florence, Italy, May, 2014.
  19. Ville Hautamäki, You-Chi Cheng, Padmanabhan Rajan, and Chin-Hui Lee, "Minimax i-vector extractor for short duration speaker verification", Interspeech 2013, Lyon, France, August 2013.
  20. Ville Hautamäki, Kong Aik Lee, David van Leeuwen, Rahim Saeidi, Anthony Larcher, Tomi Kinnunen, Taufiq Hasan, Seyed Omid Sadjadi, Gang Liu, Hynek Boril, John H.L. Hansen and Benoit Fauve, "Automatic regularization of cross-entropy cost for speaker recognition fusion", Interspeech 2013, Lyon, France, August 2013.
  21. Rosa Gonzalez Hautamäki, Tomi Kinnunen, Ville Hautamäki, Timo Leino and Anne-Maria Laukkanen, "I-vectors meet imitators: on vulnerability of speaker verification systems against voice mimicry", Interspeech 2013, Lyon, France, August 2013.
  22. Rosa Gonzalez Hautamäki, Ville Hautamäki, Padmanabhan Rajan and Tomi Kinnunen, "Merging human and automatic system decisions to improve speaker recognition performance", Interspeech 2013, Lyon, France, August 2013.
  23. Rahim Saeidi, Kong Aik Lee, Tomi Kinnunen, Taufiq Hasan, Benoit Fauve, Pierre-Michel Bousquet, Elie Khoury, Pablo L. Sordo Martinez, Karen Kua, Changhuai You, Hanwu Sun, Anthony Larcher, Paddy Rajan, Ville Hautamäki, Cemal Hanilci, Billy Braithwaite, Rosa Gonzalez Hautamäki, Seyed Omid Sadjadi, Liu Gang and Hynek Boril, "I4U submission to NIST SRE 2012: A large-scale collaborative effort for noise-robust speaker verification", Interspeech 2013, Lyon, France, August 2013.
  24. Zhen Huang, You-Chi Cheng, Kehuang Li, Ville Hautamäki and Chin-Hui Lee, "A Blind Segmentation Approach to Acoustic Event Detection Based on I-Vector", Interspeech 2013, pp. 2282-2286, Lyon, France, August 2013.
  25. Hamid Behravan, Ville Hautamäki and Tomi Kinnunen, "Foreign Accent Detection from Spoken Finnish Using i-Vectors", Interspeech 2013, Lyon, France, August 2013.
  26. Padmanabhan Rajan, Tomi Kinnunen and Ville Hautamäki, "Effect of multicondition training on i-vector PLDA configurations for speaker recognition", Interspeech 2013 (accepted).
  27. Ville Hautamäki, Kong Aik Lee, Anthony Larcher, Tomi Kinnunen, Bin Ma, and Haizhou Li, "Variational Bayes Logistic Regression as Regularized Fusion for NIST SRE 2010", In Speaker Odyssey 2012, June, Singapore, 2012.
  28. Van Hai Do, Xiong Xiao, Ville Hautamäki, Eng Siong Chng, "Speech Attribute Recognition using Context-Dependent Modeling", In APSIPA ASC 2011, October, Xi'an, China. [PDF]
  29. Qinpei Zhao, V. Hautamäki and Pasi Fränti, "RSEM: an accelerated algorithm on repeated EM", ICIG 2011.
  30. Ville Hautamäki, Kong Aik Lee, Tomi Kinnunen, Bin Ma, and Haizhou Li, "Regularized Logistic Regression Fusion for Speaker Verification", In Interspeech 2011, pp. 2745-2748, August, Florence, Italy. [PDF]
  31. Kong Aik Lee, Chang Huai You, Ville Hautamäki, Anthony Larcher, and Haizhou Li, "Spoken Language Recognition in the Latent Topic Simplex", In Interspeech 2011, pp. 2893-2896, August, Florence, Italy. [PDF]
  32. Filip Sedlak, Tomi Kinnunen, Ville Hautamäki, Kong Aik Lee, and Haizhou Li, "Classifier Subset Selection and Fusion for Speaker Verification", In ICASSP 2011. [PDF] [video and slides]
  33. Ville Hautamäki, Tomi Kinnunen, Mohaddeseh Nosratighods, Kong Aik Lee, Bin Ma, and Haizhou Li, "Approaching Human Listener Accuracy with Modern Speaker Verification", In Interspeech 2010, Makuhari, Japan, pp. 1473-1476, September 2010. [PDF]
  34. Raymond W. M. Ng, Cheung-Chi Leung, Ville Hautamäki, Tan Lee, Bin Ma, and Haizhou Li, "Towards long-range prosodic attribute modeling for language recognition", In Interspeech 2010, Makuhari, Japan, pp. 1792-1795, September 2010.
  35. P. Fränti, A. Tabarcea, J. Kuittinen, and V. Hautamäki, "Location-based Search Engine for Multimedia Phones", In ICME 2010.
  36. A. Tabarcea, V. Hautamäki, and P. Fränti, "Ad-hoc Georeferencing of Web-pages Using Street-name Prefix Trees", In 6th International Conference on Web Information Systems and Technologies (WEBIST 2010).
  37. Q. Zhao, V. Hautamäki, I. Kärkkäinen, and P. Fränti, "Random Swap EM algorithm for Finite Mixture Models in Image Segmentation", In Proc. IEEE Int. Conf. on Image Processing (ICIP 2009), Cairo, Egypt, pp. 2397-2400, November 2009.
  38. T. Kinnunen, J. Saastamoinen, V. Hautamäki, M. Vinni, and P. Fränti, "Comparing Maximum A Posteriori Vector Quantization and Gaussian Mixture Models in Speaker Verification", In Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4545-4548, April 2009.
  39. P. Fränti, J. Saastamoinen, I. Kärkkäinen, T. Kinnunen, V. Hautamäki, and I. Sidoroff, "Developing Speaker Recognition System: from Prototype to Practical Application", In Proc. e-Forensics 2009 (accepted).
  40. Q. Zhao, V. Hautamäki, P. Fränti, "Knee Point Detection in BIC for Detecting the Number of Clusters", Advanced Concepts for Intelligent Vision Systems (ACIVS 2008), Juan-les-Pins, France, pp. 664-673, October 2008. [PDF]
  41. V. Hautamäki, P. Nykänen and P. Fränti, "Time-series Clustering by Approximate Prototypes", 19th International Conference on Pattern Recognition (ICPR 2008), Tampa, Florida, USA, December, 2008. [PDF]
  42. P. Fränti, O. Virmajoki and V. Hautamäki, "Probabilistic Clustering by Random Swap Algorithm", 19th International Conference on Pattern Recognition (ICPR 2008), Tampa, Florida, USA, December, 2008.
  43. V. Hautamäki, M. Tuononen, T. Niemi-Laitinen and P. Fränti, "Improving Speaker Verification by Periodicity Based Voice Activity Detection", Proc. 12th International Conference on Speech and Computer (SPECOM 2007), Vol. 2, pp. 645-650, Moscow, October 2007. [PDF]
  44. T. Kinnunen, V. Hautamäki and P. Fränti, "On the Use of Long-Term Average Spectrum in Automatic Speaker Recognition", Proc. 5th International Symposium on Chinese Spoken Language Processing (ISCSLP 2006), Vol II, Singapore, pp. 559-567, December 2006.
  45. R. Timofte, V. Hautamäki and P. Fränti, "Speaker, Vocabulary and Context Independent Word Spotting System for Continuous Speech", Proc. 5th International Symposium on Chinese Spoken Language Processing (ISCSLP 2006), Vol II, Singapore, pp. 396-407, December 2006.
  46. H. Gupta, V. Hautamäki, T. Kinnunen and P. Fränti, "Field Evaluation of Text-Dependent Speaker Recognition in an Access Control Application", Proc. 10th International Conference on Speech and Computer (SPECOM 2005), pp. 551-554, Patras, Greece, October 2005. [PDF]
  47. V. Hautamäki, S. Cherednichenko, I. Kärkkäinen, T. Kinnunen and P. Fränti, "Improving K-Means by Outlier Removal", Proc. 14th Scandinavian Conference on Image Analysis (SCIA 2005), pp. 978-987, Joensuu, Finland, June 2005. [PDF]
  48. T. Kinnunen, V. Hautamäki, P. Fränti, "Fusion of Spectral Feature Sets for Accurate Speaker Identification", Proc. 9th International Conference Speech and Computer (SPECOM 2004), pp. 361-365, St. Petersburg, Russia, September, 2004. [PDF]
  49. J. Saastamoinen, E. Karpov, V. Hautamäki and P. Fränti, "Automatic Speaker Recognition for Series 60 Mobile Devices", In Proc. 9th International Conference Speech and Computer (SPECOM'2004), pp. 353-360, St. Petersburg, Russia, September 20-22, 2004.
  50. V. Hautamäki, I. Kärkkäinen and P. Fränti, "Outlier Detection Using k-Nearest Neighbour Graph", 17th International Conference on Pattern Recognition (ICPR 2004), pp. 430-433, Cambridge, United Kingdom, August, 2004. [PDF] code
  51. P. Fränti, O. Virmajoki and V. Hautamäki, "Fast PNN-based clustering using k-nearest neighbor graph", IEEE International Conference on Data Mining (ICDM 2003), Melbourne, Florida, USA, pp. 525-528, November 2003.
  52. T. Kinnunen, V. Hautamäki and P. Fränti, "On the fusion of dissimilarity-based classifiers for speaker identification", European Conference on Speech Communication and Technology (Eurospeech 2003), Geneva, Switzerland, pp. 2641-2644, September 2003. [PDF]
  53. P. Fränti and V. Hautamäki, "Compression of aerial images for reduced-color devices", SPIE Conference on Image and Video Communications and Processing, Santa Clara, USA, SPIE Vol. 5022, Part II, pp. 651-662, January 2003. [PDF]

Theses:

  1. Ville Hautamäki, Improving Pattern Recognition Methods for Speaker Recognition, PhD thesis, University of Joensuu, Department of Computer Science, October 2008. [PDF]
  2. Ville Hautamäki, Efficient Color Quantization by Hierarchical Clustering Algorithms, Master's thesis, University of Joensuu, Department of Computer Science, February 2005. [PDF]
  3. Ville Hautamäki and Jussi Heino, Evaluation of image compression methods for aerial photos, Bachelor's Thesis, University of Joensuu, Department of Computer Science, December 2000. [PDF]

Other publications:

  1. T. Kinnunen, V. Hautamäki, "Automaattinen puhujantunnistus", in O. Aaltonen, R. Aulanko, A. Iivonen, A. Klippi, M. Vainio (Eds.), Puhuva Ihminen - puhetieteiden perusteet, Otava, 2009. ("Automatic speaker recognition"; a book chapter in Finnish about basics of speaker recognition for non-technical audience).
  2. P. Fränti, J. Saastamoinen, I. Kärkkäinen, T. Kinnunen, V. Hautamäki, I. Sidoroff, Implementing Speaker Recognition System: from Matlab to Practice, Report series / University of Joensuu, Department of Computer Science and Statistics, A-2007-4 (ISBN 978-952-219-061-1, ISSN 1796-7317), November 2007.

Updated: Tue Nov 24 08:44:59 EET 2009