Professor Naomi Harte
Professor in Speech Technology, Electronic & Elect. Engineering

Biography

Naomi is Professor in Speech Technology in the School of Engineering in Trinity College. She is Co-PI and a founding member of the ADAPT SFI Centre. In ADAPT, she has led a major Research Theme centered on Multimodal Interaction involving researchers from Universities across Ireland and was instrumental in developing the future vision for the Centre for 2021-2026. She is also a lead academic of the hugely successful Sigmedia Research Group in the School of Engineering. She was appointed as an SFI Engineering Initiative Lecturer in Digital Media in TCD in 2008 (Stokes Programme). Prior to returning to academia, Naomi worked in high-tech start-ups in the field of DSP Systems Development, including her own company. She also previously worked in McMaster University in Canada. She was a Visiting Professor at ICSI in 2015, and became a Fellow of TCD in 2017. She earned a Google Faculty Award in 2018 and was shortlisted for the AI Ireland Awards in 2019. She currently serves on the Editorial Board of Computer Speech and Language and was General Chair of INTERSPEECH 2023 in Dublin.

Naomi's research centres around Human Speech Communication. She likes to consider speech as something we both hear and see, with a strong multimodal aspect to her work. Her research involves the design and application of mathematical algorithms to enhance or augment speech communication between humans and technology. Much of that work is underpinned by signal processing and machine learning, but also requires an understanding of how humans interact. Her current research projects include audio-visual speech recognition, speech synthesis evaluation, multimodal speech analysis, and birdsong. Her industrial background brings a real-world approach to her research.

Publications and Further Research Outputs

Peer-Reviewed Publications
Non-Peer-Reviewed Publications

Peer-Reviewed Publications

Gonzales, Michael Gian and Corcoran, Peter and Harte, Naomi and Schukat, Michael, Joint Speech-Text Embeddings for Multitask Speech Processing, IEEE Access, 12, 2024, p145955 â" 145967 Journal Article, 2024 DOI

Lopez-Espejo, Ivan and Rosello, Eros and Edraki, Amin and Harte, Naomi and Jensen, Jesper, Noise-Robust Hearing Aid Voice Control, IEEE Signal Processing Letters, 2024 Journal Article, 2024 DOI

Storey, Edward and Harte, Naomi and Bell, Peter, Language Bias in Self-Supervised Learning For Automatic Speech Recognition, 2024, pp37 â" 42 Conference Paper, 2024 DOI

Sébastien Le Maguer, Simon King, Naomi Harte, The limits of the Mean Opinion Score for speech synthesis evaluation, Computer Speech and Language, 84, 2024 Journal Article, 2024

Russell, Sam O'Connor and Gessinger, Iona and Krason, Anna and Vigliocco, Gabriella and Harte, Naomi, What automatic speech recognition can and cannot do for conversational speech transcription, Research Methods in Applied Linguistics, 3, (3), 2024 Journal Article, 2024 DOI

Kotey, S., Dahyot, R., Harte, N., Fine Grained Spoken Document Summarization Through Text Segmentation, 2022 IEEE Spoken Language Technology Workshop, SLT 2022 - Proceedings, 2023, p647-654 Conference Paper, 2023 DOI

Kotey, S., Dahyot, R., Harte, N., Query Based Acoustic Summarization for Podcasts, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2023-August, 2023, p1483-1487 Conference Paper, 2023 DOI

Pandey, A., Edlund, J., Le Maguer, S., Harte, N., Listener sensitivity to deviating obstruents in WaveNet, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2023-August, 2023, p1080-1084 Conference Paper, 2023 DOI

Le Maguer, S., Anderson, M., Harte, N., Sp1NY: A Quick and Flexible Speech visualisation Tool in Python, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2023-August, 2023, p2012-2013 Conference Paper, 2023

Anderson, M., Kinnunen, T., Harte, N., Learnable Frontends That Do Not Learn: Quantifying Sensitivity To Filterbank Initialisation, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2023-June, 2023 Conference Paper, 2023 DOI

Gonzales, M.G., Corcoran, P., Harte, N., Schukat, M., Joint Speech-Text Embeddings with Disentangled Speaker Features, 2023 34th Irish Signals and Systems Conference, ISSC 2023, 2023 Conference Paper, 2023 DOI

A Karaali and N Harte and CR Jung, Deep Multi-Scale Feature Learning for Defocus Blur Estimation, IEEE Transactions on Image Processing, 2022 Journal Article, 2022 DOI

Pandey, A., Le Maguer, S., Carson-Berndsen, J., Harte, N., Production characteristics of obstruents in WaveNET and older TTS systems, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2022-September, 2022, p2373-2377 Conference Paper, 2022 DOI

Le Maguer, S., King, S., Harte, N., Back to the Future: Extending the Blizzard Challenge 2013, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2022-September, 2022, p2378-2382 Conference Paper, 2022 DOI

Ilaria Torre, Simon Holk, Elmira Yadollahi, Iolanda Leite, Rachel McDonnell, Naomi Harte, Smiling in the Face and Voice of Avatars and Robots: Evidence for a smiling McGurk Effect, IEEE Transactions on Affective Computing, 2022, p1-12 Journal Article, 2022 DOI

Reverdy, J., O'Connor Russell, S., Duquenne, L., Garaialde, D., Cowan, B., Harte, N., RoomReader: A Multimodal Corpus of Online Multiparty Conversational Interactions, 2022 Language Resources and Evaluation Conference, LREC 2022, 2022, p2517-2527 Conference Paper, 2022

Sterpu, G., Harte, N., Taris: An online speech recognition framework with sequence to sequence neural networks for both audio-only and audio-visual speech, Computer Speech and Language, 74, 2022 Journal Article, 2022 DOI

Anderson, M., Harte, N., Learnable Acoustic Frontends in Bird Activity Detection, International Workshop on Acoustic Signal Enhancement, IWAENC 2022 - Proceedings, 2022 Conference Paper, 2022 DOI

Jassim, W.A., Harte, N., Comparison of discrete transforms for deep-neural-networks-based speech enhancement, IET Signal Processing, 16, (4), 2022, p438-448 Journal Article, 2022 DOI

Torre, I. and Deichler, A. and Nicholson, M. and McDonnell, R. and Harte, N., To smile or not to smile: The effect of mismatched emotional expressions in a Human-Robot cooperative task, 2022, pp8-13 Conference Paper, 2022 DOI

G Sterpu and C Saam and N Harte, Learning to count words in fluent speech enables online speech recognition, 2021 IEEE Spoken Language Technology Workshop (SLT), 2021, pp38-45 Conference Paper, 2021

M Anderson and N Harte, Bioacoustic Event Detection with prototypical networks and data augmentation, 2021 Report, 2021

Torre, Ilaria and Carrigan, Emma and Domijan, Katarina and McDonnell, Rachel and Harte, Naomi, Dimensional perception of a 'smiling McGurk effect', 9th International Conference on Affective Computing and Intelligent Interaction (ACII), 2021, pp1-8 Conference Paper, 2021

Mark Anderson, John Kennedy, Naomi Harte, Low Resource Species Agnostic Bird Activity Detection, 2021 IEEE Workshop on Signal Processing Systems (SiPS), 2021, pp34-39 Conference Paper, 2021 URL

Ilaria Torre, Emma Carrigan, Katarina Domijan, Rachel McDonnell, Naomi Harte, The Effect of Audio-Visual Smiles on Social Influence in a Cooperative Human-Agent Interaction Task, ACM Transactions on Computer-Human Interaction (TOCHI), 28, (6), 2021, p1-38 Journal Article, 2021 DOI

Ayushi Pandey, Sébastien Le Maguer, Julie Berndsen, Naomi Harte, Mind your p's and k's--Comparing obstruents across TTS voices of the Blizzard Challenge 2013, Proc. 11th ISCA Speech Synthesis Workshop (SSW 11), 2021, pp166-171 Conference Paper, 2021

Le Maguer, Sebastien and Harte, Naomi, Investigation of Auditory Nerve Model Based Analysis for Vocoded Speech Synthesis, 2020, pp1--6 Conference Paper, 2020 DOI

Jassim, Wissam A and Harte, Naomi, Estimation of a priori signal-to-noise ratio using neurograms for speech enhancement, The Journal of the Acoustical Society of America, 147, (6), 2020, p3830--3848 Journal Article, 2020

Sterpu, G., Saam, C., Harte, N., Should we hard-code the recurrence concept or learn it instead ? Exploring the Transformer architecture for Audio-Visual Speech Recognition, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2020-October, 2020, p3506-3509 Conference Paper, 2020 DOI

Sterpu G., Saam C., Harte N., How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition, IEEE/ACM Transactions on Audio Speech and Language Processing, 28, 2020, p1052 - 1064 Journal Article, 2020 DOI

Roddy, Matthew and Harte, Naomi, Neural Generation of Dialogue Response Timings, Annual Conference of the Association for Computational Linguistics (ACL), 2020, pp2442-2452 Conference Paper, 2020

Le Maguer, S., Harte, N., Can auditory nerve models tell us what's different about wavenet vocoded speech?, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2020-October, 2020, p230-234 Conference Paper, 2020 DOI

Fernandez-Lopez, Adriana and Karaali, Ali and Harte, Naomi and Sukno, Federico M, ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020, pp6294--6298 Conference Paper, 2020

Ilaria Torre, Emma Carrigan, Rachel McDonnell, Katarina Domijan, Killian McCabe, Naomi Harte, The effect of multimodal emotional expression and agent appearance on trust in human-agent interaction, Proceedings - MIG 2019: ACM Conference on Motion, Interaction, and Games, ACM Conference on Motion, Interaction, and Games, 2019, 2019 Conference Paper, 2019 TARA - Full Text URL DOI

Motion, Interaction and Games in, Motion, Interaction and Games, 2019, pp1--6 , [Torre, Ilaria and Carrigan, Emma and McDonnell, Rachel and Domijan, Katarina and McCabe, Killian and Harte, Naomi] Book Chapter, 2019

Clark, L. and Cowan, B.R. and Edwards, J. and Edlund, J. and Szekely, E. and Munteanu, C. and Murad, C. and Healey, P. and Aylett, M. and Harte, N. and Torre, I. and Moore, R.K. and Doyle, P., Mapping theoretical and methodological perspectives for understanding speech interface interactions, CHI EA '19 Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems , (3299009), 2019 Conference Paper, 2019 TARA - Full Text DOI

Ilaria Torre, Emma Carrigan, Killian McCabe, Rachel McDonnell, Naomi Harte, Survival at the museum: A cooperation experiment with emotionally expressive virtual characters, ICMI '18 Proceedings of the 20th ACM International Conference on Multimodal Interaction , 2018, pp423-427 Conference Paper, 2018 DOI TARA - Full Text

Cullen, A. and Harte, N., A longitudinal database of Irish political speech with annotations of speaker ability, Language Resources and Evaluation, 52, (2), 2018, p401-432 Journal Article, 2018 DOI

Edmonds, C.J., Harte, N., Gardner, M., How does drinking water affect attention and memory? The effect of mouth rinsing and mouth drying on children's performance, Physiology and Behavior, 194, 2018, p233-238 Journal Article, 2018 DOI

Roddy, M. and Skantze, G. and Harte, N., Multimodal continuous turn-taking prediction using multiscale Rnns, ICMI '18 Proceedings of the 20th ACM International Conference on Multimodal Interaction, 2018, pp186-190 Conference Paper, 2018 TARA - Full Text DOI

Laura Dungan, Ali Karaali, Naomi Harte, The Impact Of Reduced Video Quality On Visual Speech Recognition, IEEE International Conference on Image Processing, Athens, Greece, 2018 Conference Paper, 2018 DOI

Sterpu, G. and Saam, C. and Harte, N., Can DNNs Learn to Lipread Full Sentences?, 2018 25th IEEE International Conference on Image Processing (ICIP), (8451388), 2018, pp16-20 Conference Paper, 2018 DOI

Sterpu, G. and Saam, C. and Harte, N., Attention-based audio-visual fusion for robust automatic speech recognition, ICMI '18 Proceedings of the 20th ACM International Conference on Multimodal Interaction , 20th ACM International Conference on Multimodal Interaction , 2018, pp111-115 Conference Paper, 2018 TARA - Full Text DOI

Roddy, M. and Skantze, G. and Harte, N., Investigating speech features for continuous turn-taking prediction using LSTMs, Proc. Interspeech 2018, Interspeech 2018, 2018-September, 2018, pp586-590 Conference Paper, 2018 DOI

Cullen, A. and Hines, A. and Harte, N., Perception and prediction of speaker appeal â" A single speaker study, Computer Speech and Language, 52, 2018, p23-40 Journal Article, 2018 DOI

Dungan, L. and Karaali, A. and Harte, N., The impact of reduced video quality on visual speech recognition, 2018 25th IEEE International Conference on Image Processing (ICIP), 2018 25th IEEE International Conference on Image Processing (ICIP), (8451754), 2018, pp2560-2564 Conference Paper, 2018 DOI

G Sterpu and N Harte, Towards Lipreading Sentences with Active Appearance Models, arXiv preprint arXiv:1805.11688, 2018 Journal Article, 2018

O'Reilly, C. and Analuddin, K. and Kelly, D.J. and Harte, N., Measuring vocal difference in bird population pairs, Journal of the Acoustical Society of America, 143, (3), 2018, p1658-1671 Journal Article, 2018 DOI

Wissam A. Jassim and Naomi Harte, Voice Activity Detection Using Neurograms, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, Alberta, Canada, 15-20 April 2018, 2018 Conference Paper, 2018

A Cullen and N Harte, Thin slicing to predict viewer impressions of TED Talks, âŠÂ of the 14th International Conference onÂ âŠ, 2017 Journal Article, 2017

Jassim, W.A. and Paramesran, R. and Harte, N., Speech emotion classification using combined neurogram and INTERSPEECH 2010 paralinguistic challenge features, IET Signal Processing, 11, (5), 2017, p587-595 Journal Article, 2017 DOI

Roddy, M. and Harte, N., Detecting conversational gaze aversion using unsupervised learning, 2017-January, (8081172), 2017, pp76-80 Conference Paper, 2017 DOI

C O'Reilly and N Harte, Pitch tracking of bird vocalizations and an automated process using YIN-bird, Cogent Biology, 2017 Journal Article, 2017

M Roddy and N Harte, Towards predicting dialog acts from previous speakers' non-verbal cues, BIBTEX 2017, 2017, pp1-- Conference Paper, 2017 TARA - Full Text

Sloan, C. and Harte, N. and Kelly, D. and Kokaram, A.C. and Hines, A., Objective Assessment of Perceptual Audio Quality Using ViSQOLAudio, IEEE Transactions on Broadcasting, 63, (4), 2017, p693-705 Journal Article, 2017 DOI TARA - Full Text

O'Reilly, C. and Kokuer, M. and Jancovic, P. and Drennan, R. and Harte, N., Automatic frequency feature extraction for bird species delimitation, 2017-January, (8081511), 2017, pp1759-1763 Conference Paper, 2017 DOI

N Harte and P Jancovic and Karl-L. Schuchmann, Interspeech 2016 Special Session on Bird and Animal Vocalisations Organisers, In:Interspeech 2016, 2016 Meetings /Conferences Organised, 2016

Jan Skoglund Andrew J. HINES Naomi A. HARTE Anil Kokaram, 'Objective speech quality metric', US, US20150199959A1, 2016, Google LLC Patent, 2016 URL

AJ Hines and J Skoglund and N Harte and A Kokaram, Detection of chopped speech, US Patent 9,263,061, 2016 Journal Article, 2016

O'Reilly C, Marples N.M, Kelly D.J, Harte N, YIN-bird: Improved pitch tracking for bird vocalisations, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2016, 08-12-September-2016, 2016, pp2641 - 2645 Conference Paper, 2016 DOI URL

Andrew J. HINES Jan Skoglund Naomi HARTE Anil Kokaram, 'Detection of chopped speech', US, 2016, Google LLC Patent, 2016 URL

Hines A, Skoglund J, Kokaram A.C, Harte N, Monitoring voip speech quality for chopped and clipped speech, Komunikacie, 18, (1), 2016, p3 - 10 Journal Article, 2016 URL

Sloan C, Harte N, Kelly D, Kokaram A.C, Hines A, Bitrate classification of twice-encoded audio using objective quality features, 2016 8th International Conference on Quality of Multimedia Experience, QoMEX 2016, 2016, 2016, pp7498956- Conference Paper, 2016 URL DOI

Hines A, Skoglund J, Kokaram A.C, Harte N, ViSQOL: an objective speech quality model, Eurasip Journal on Audio, Speech, and Music Processing, 2015, (1), 2015, p13- Journal Article, 2015 DOI URL TARA - Full Text

Harte N, Gillen E, Hines A, TCD-VoIP, a research database of degraded speech for assessing quality in VoIP applications, 7th International Workshop on Quality of Multimedia Experience, QoMEX 2015, 26-29 May 2015 , IEEE, 2015, 7148100- Conference Paper, 2015 DOI

Hines A, Gillen E, Kelly D, Skoglund J, Kokaram A, Harte N, ViSQOLAudio: An objective audio quality metric for low bitrate codecs, Journal of the Acoustical Society of America, 137, (6), 2015, pEL449 - EL455 Journal Article, 2015 URL DOI TARA - Full Text

C. O'Reilly, D. J. Kelly, N. M. Marples and N. Harte , Quantifying difference in vocalizations of bird populations, Proceedings of Interspeech 2015, 2015, 2015, p3417 - 3421 Journal Article, 2015

Harte N, Gillen E, TCD-TIMIT: An audio-visual corpus of continuous speech, IEEE Transactions on Multimedia, 17, (5), 2015, p603 - 615 Journal Article, 2015 DOI URL

Hines A, Gillen E, Harte N, Measuring and monitoring speech quality for voice over IP with POLQA, ViSQOL and P.563, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2015, 2015-January, 2015, pp438 - 442 Conference Paper, 2015 URL

Kelly F, Harte N, Forensic comparison of ageing voices from automatic and auditory perspectives, International Journal of Speech, Language and the Law, 22, (2), 2015, p167 - 202 Journal Article, 2015 DOI URL

Pitie, F., Kelly, D., Foucu, T., Harte, N., Kokaram, A. , Assessment of Audio/Video synchronisation in streaming media, 2014 6th International Workshop on Quality of Multimedia Experience, QoMEX 2014, 2014 6th International Workshop on Quality of Multimedia Experience, QoMEX 2014, 2014, pp171-176 Conference Paper, 2014 DOI

Cullen, Ailbhe, Hines, Andrew and Harte, Naomi, Building a Database of Political Speech - Does culture matter in charisma annotations? , 1 4th International Workshop on Audio/Visual Emotion Challenge, AVEC 2014, AVEC'14: 4th International Audio/Visual Emotion Challenge and Workshop., Orlando, FL., 2014, pp27 - 31 Conference Paper, 2014 DOI

Francois Pitie and Damien Kelly and Thierry Foucu and Naomi Harte and Anil C. Kokaram , Assessment of Audio/Video synchronisation in streaming media., International Workshop on Quality of Multimedia Experience, Singapore, 2014, pp171 - 176 Conference Paper, 2014

Finnian Kelly, Rahim Saeidi, Naomi Harte, David van Leeuwen, Effect of long-term ageing on i-vector speaker verification, Computer Speech & Language, InterSpeech, Singapore, 2014, pp1068 - 1084 Conference Paper, 2014

Andrew Hines, Eoin Gillen, Jan Skoglund, Damien Kelly, Anil Kokaram and Naomi Harte, Perceived Audio Quality for Streaming Stereo Music. , ACM Multimedia, Orlando, FL, USA, 2014, pp1173 - 1176 Conference Paper, 2014 DOI TARA - Full Text

Ailbhe Cullen and Naomi Harte, Late Integration of Features for Acoustic Emotion Recognition, European Signal Processing Conference (EUSIPCO)., 2013, pp1 - 5 Conference Paper, 2013

Finnian Kelly and Naomi Harte, Auditory detectability of vocal ageing and its effect on forensic automatic speaker recognition, InterSpeech, Lyon, France, 2013, pp2846 - 2850 Conference Paper, 2013

K Pan and F Kelly and N Harte and N Harte and S Murphy and DJ Kelly and ..., Shape Models for Image Segmentation in Microscopy, mee.tcd.ie, 2013 Book, 2013

Sooknanan, Ken, Doyle, Jennifer, Wilson, James, Harte, Naomi, Kokaram, Anil and Corrigan, David, Mosaics For Burrow Detection in Underwater Surveillance Video, IEEE Oceans 2013, San Diego, USA, 2013, pp9 - 12 Conference Paper, 2013

Finnian Kelly, Niko Brummer and Naomi Harte, Eigenageing Compensation for Speaker Verification. , InterSpeech , Lyon, France, 2013, pp1624 - 1628 Conference Paper, 2013

Finnian Kelly, Andrzej Drygajlo and Naomi Harte , Speaker verification in score-ageing-quality classification space, Computer Speech & Language, 27, (5), 2013, p1068-1084 Journal Article, 2013

Hines, A., Skoglund, J., Kokaram, A., Harte, N. , Robustness of speech quality metrics to background noise and network degradations: Comparing ViSQOL, PESQ and POLQA, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, 2013, pp3697-3701 Conference Paper, 2013 DOI

Kelly, Finnian and Harte, Naomi in, editor(s)Michael Fairhurst , Age Factors in Biometric Processing, IET, 2013, [Kelly, Finnian and Harte, Naomi] Book Chapter, 2013

A Hines, J Skoglund, A Kokaram, N Harte, Robustness of speech quality metrics to background noise and network degradations: Comparing ViSQOL, PESQ and POLQA, IEEE International Conference on Acoustics, Speech, and Signal Processing, Vancouver, Canada, 2013, pp3697 - 3701 Conference Paper, 2013

Harte, Naomi, Murphy, Sadhbh, Kelly, David J. and Marples, Nicola M., Identifying new bird species from differences in birdsong. , INTERSPEECH, Lyon France., 2013, pp2900-2904 Conference Paper, 2013

Ailbhe Cullen, John Kane, Thomas Drugman, and Naomi Harte , Creaky Voice and the Classification of Affect, Workshop on Affective Social Speech Signals (WASSS), Grenoble, France, 2013 Conference Paper, 2013 DOI

A Hines, J Skoglund, A Kokaram, N Harte, Monitoring the Effects of Temporal Clipping on VoIP Speech Quality, Interspeech, Lyon, France, 2013, 2013, pp1188 - 1192 Conference Paper, 2013

F Kelly and N Harte and M Fairhurst, The impact of ageing on speech-based biometric systems, Age Factors in Biometric Processing, 2013 Journal Article, 2013

K. Sooknanan, A. Kokaram, D. Corrigan, G. Baugh, N. Harte and J. Wilson, Indexing and Selection of Well-Lit Details in Underwater Video Mosaics Using Vignetting Estimation, Program Book - OCEANS 2012 MTS/IEEE Yeosu: The Living Ocean and Coast - Diversity of Resources and Sustainable Activities, International OCEANS Conference, Yeosu, South Korea, May, IEEE, 2012, ppArticle number 6263541 Conference Paper, 2012 DOI

Andrew Hines, Naomi Harte, Speech Intelligibility prediction using a Neurogram Similarity Index Measure, Speech Communication, 54, (2), 2012, p306-320 Journal Article, 2012 TARA - Full Text

Andrew Hines, Naomi Harte, Improved Speech Intelligibility with a Chimaera Hearing Aid Algorithm, Interspeech, Portland, OR, ISCA, 2012, pp1 - 4 Conference Paper, 2012

K. Sooknanan, A. Kokaram, D. Corrigan, G. Baugh, J. Wilson and N. Harte , Improving Underwater Visibility Using Vignetting Correction, Proceedings of SPIE - The International Society for Optical Engineering, Visual Information Processing and Communication, Burlingame, California, USA, January, 8305, SPIE, 2012, ppArticle number 83050M Conference Paper, 2012 DOI

Andrew Hines, Jan Skoglund, Anil Kokaram, Naomi Harte, ViSQOL: The Virtual Speech Quality Objective Listener, The International Workshop on Acoustic Signal Enhancement (IWAENC), Aachen, Germany, 4-6 Sept. 2012, 2012, pp1 - 4 Conference Paper, 2012 TARA - Full Text

Cappelletta, L., Harte, N., Phoneme-to-viseme mapping for visual speech recognition, ICPRAM 2012 - Proceedings of the 1st International Conference on Pattern Recognition Applications and Methods, 2, 2012, p322-329 Conference Paper, 2012

Corrigan, D. ; Kokaram, A. ; Harte, N. , Algorithms for the Digital Restoration of Torn Film , Image Processing, IEEE Transactions on, 21, (2), 2012, p573-587 Journal Article, 2012 DOI

Hines, A., Skoglund, J., Kokaram, A., Harte, N., VISQOL: The virtual speech quality objective listener, International Workshop on Acoustic Signal Enhancement, IWAENC 2012, 2012 Conference Paper, 2012

Kelly, F., Drygajlo, A., Harte, N., Compensating for ageing and quality variation in speaker verification, 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, 1, 2012, p498-501 Conference Paper, 2012

F. Kelly , A. Drygajlo and N. Harte, Speaker Verification with Long-Term Ageing Data , International Conference on Biometrics (ICB), New Delhi, 2012, pp478 - 483 Conference Paper, 2012

L Cappelletta and N Harte, Non Phonetic Viseme Definition for Visual-Only Speech Recognition, 2012, - Miscellaneous, 2012

Luca Cappelletta and Naomi Harte, Viseme Definitions Comparison for Visual-Only Speech Recognition, European Signal Processing Conference (Eusipco), 2011, pp2109 - 2113 Conference Paper, 2011

Finnian Kelly, Naomi Harte, Effects of Long-Term Ageing on Speaker Verification, Proceedings of the COST 2101 European conference on Biometrics and ID management, Springer-Verlag, 2011, pp113--124 Conference Paper, 2011

C Berry and A Kokaram and N Harte, An extended multiresolution approach to mouth specific aam fitting for speech recognition, 2011 19th European SignalÂ âŠ, 2011 Journal Article, 2011

Andrew Hines and Naomi Harte , Simulated performance intensity functions , Engineering in Medicine and Biology Society Conference (EMBC), EMBS (IEEE). , 2011, pp7139 - 7142 Conference Paper, 2011

Andrew Hines and Naomi Harte, Comparing hearing aid algorithm performance using Simulated Performance Intensity Functions , Speech perception and auditory disorders, Int. Symposium on Audiological and Auditory Research (ISAAR), 2011 Conference Paper, 2011

Craig Berry, Anil Kokaram, Naomi Harte, An Extended Multiresolution Approach to Mouth Specific AAM Fitting for Speech Recognition. , European Signal Processing Conference (Eusipco), 2011 Conference Paper, 2011 DOI

Andrew Hines and Naomi Harte, Speech intelligibility from image processing, Speech Communication, 52, (9), 2010, p736 - 752 Journal Article, 2010

Finnian Kelly and Naomi Harte, Auditory Features Revisited for Robust Speech Recognition. , International Conference on Pattern Recognition (ICPR). , Istanbul, Turkey, Aug 2010, 2010, pp4456 - 4459 Conference Paper, 2010

Finnian Kelly and Naomi Harte, Training GMMs for Speaker Verification. , IET Irish Signals and Systems Conference, Cork, Ireland, June 2010, 2010, pp163 - 168 Conference Paper, 2010

Luca Cappelletta and Naomi Harte, Nostril Detection for Robust Mouth Tracking, Irish Signals and Systems Conference, Cork, Ireland, 2010, pp239 - 244 Conference Paper, 2010

Finnian Kelly and Naomi Harte, A Comparison of Auditory Features for Robust Speech Recognition. , European Signal Processing Conference (EUSIPCO 2010). , Aalborg, Denmark, August 2010, 2010 Conference Paper, 2010 DOI

A Hines and N Harte, Reproduction of the Performance/Intensity Function using image processing and an auditory nerve computational model, 2010 Conference Paper, 2010 URL

K Finnian and N Harte, A comparison of auditory features for robust speech recognition, presentation, 18th European Signal ProcessingÂ âŠ, 2010 Journal Article, 2010

Andrew Hines and Naomi Harte, Evaluating Sensorineural Hearing Loss With An Auditory Nerve Model Using A Mean Structural Similarity Measure. , European Signal Processing Conference (EUSIPCO '10). , Aalborg, Denmark, 2010 Conference Paper, 2010 TARA - Full Text

Andrew Hines, Naomi Harte, Error Metrics for Impaired Auditory Nerve Responses of Different Phoneme Groups, Interspeech 2009, Brighton, 2009, 2009, pp1119 - 1122 Conference Paper, 2009 TARA - Full Text

Naomi Harte, Daire Lennon, and Anil Kokaram, On Parsing Visual Sequences with the Hidden Markov Model, EURASIP Journal on Image and Video Processing , Volume 2009, 2009 Journal Article, 2009 DOI TARA - Full Text

Craig Berry, Naomi Harte, Region of Interest Extraction using Colour Based Methods on the CUAVE Database , IET Irish Signals and Systems Conference ISSC, Dublin, 10-12 June , 2009 Conference Paper, 2009 TARA - Full Text

N Hurley and N Harte and C Fearon and S Rickard, Speech Source Separation in Hardware, 2009, - Miscellaneous, 2009

Andrew Hines, Naomi Harte , Measurement of phonemic degradation in sensorineural hearing loss using a computational model of the auditory periphery , IET Irish Signals and Systems Conference ISSC 2009, UCD, June 10-11, 2009, pp1-6 Conference Paper, 2009 TARA - Full Text URL

Harte, N., Lennon, D., Kokaram, A., On parsing visual sequences with the hidden markov model, Eurasip Journal on Image and Video Processing, 2009, 2009 Journal Article, 2009 DOI

David Corrigan, Naomi Harte, Anil Kokaram, Pathological Motion Detection for Robust Missing Data Treatment, EURASIP Journal on Advances in Signal Processing, 2008, 2008, pArticle ID 542436 Journal Article, 2008 DOI TARA - Full Text

Action Recognition in Multimedia Streams in, editor(s)Petros Maragos, Alexandros Potamianos, Patrick Gros , Multimodal Processing and Interaction, Springer Verlag. , 2008, pp127 - 142, [Daire Lennon, Naomi Harte, and Anil Kokaram, Rozenn Dahyot, Francois Pitie] Book Chapter, 2008

D Lennon and N Harte and A Kokaram, Rotation detection using the curl equation, 2007 IEEE InternationalÂ â", 2007 Journal Article, 2007

Harte, Naomi; Rankin, Andrew; Baugh, Gary; Kokaram, Anil;, Detection of Illegal Dumping from CCTV at Recycling Centres, International Machine Vision and Image Processing, International Machine Vision and Image Processing Conference, Kildare, Ireland , 2007, (5-7 Sept. ), 2007, pp204 Conference Paper, 2007 TARA - Full Text URL

Corrigan, David; Harte, Naomi; Kokaram, Anil;, Automated Segmentation of Torn Frames using the Graph Cuts Technique, Image Processing, IEEE International Conference on Image Processing, 2007. ICIP 2007., San Antonio, TX, USA , 2007, (Sept. 16-Oct. 19), 2007, pp557-560 Conference Paper, 2007 URL DOI TARA - Full Text

D Lennon and N Harte and A Kokaram and E Doyle and ..., A hmm framework for motion based parsing for video from observational psychology, IEEE Irish Machine VisionÂ âŠ, 2006 Journal Article, 2006

Daire Lennon, Naomi Harte, Anil Kokaram, Erika Doyle, Ray Fuller, A HMM Framework for Motion based parsing for video from Observational Psychology, IEEE Irish Machine Vision and Image Processing Conference, Irish Machine Vision and Image Processing Conference , 2006 Conference Paper, 2006 TARA - Full Text URL

Corrigan, D. Harte, N. and Kokaram, A. , Pathological motion detection for robust missing data treatment in degraded archived media, Image Processing, IEEE International Conference on Image Processing 2006, Atlanta, GA , 8-11 Oct. 2006 , 2006, pp621 - 624 Conference Paper, 2006 TARA - Full Text DOI URL

Naomi Harte and Anil Kokaram, Automated Removal of Overshoot Artefact from Images, EUSIPCO , European Signal Processing Conference , 2006 Conference Paper, 2006 URL

Naomi Harte, Shahab U. Ansari, Ian Bruce, Exploiting Voicing Cues for Contrast Enhanced Frequency Shaping of Speech for Impaired Listeners, Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE International Conference on Acoustics, Speech, and Signal Processing, Toulouse, 14-19 May 2006 , 5, IEEE, 2006, ppV Conference Paper, 2006 DOI TARA - Full Text URL

Naomi Harte, Niall Hurley, Conor Fearon, Scott Rickard., Towards a Hardware Realization of Time-Frequency Source Separation of Speech, Proceedings of IEEE European Conference on Circuit Theory and Design, IEEE European Conference on Circuit Theory and Design, 28 Aug -2 Sept. 2005, IEEE, 2005 Conference Paper, 2005 TARA - Full Text URL DOI

Ansari, S., Harte, N., and Bruce, I., , Efficiently combining improved contrast-enhancing frequency shaping and multiband compression to enhance speech intelligibility in hearing aids, Lake Ontario Auditory Neuroscience (LOAN) Meeting, Hamilton, Canada, 2005 Conference Paper, 2005

Niall Hurley, Naomi Harte, Conor Fearon, Scott Rickard,, Blind Source Separation of Speech in Hardware, Workshop on Signal Processing Systems, Nov 2005, IEEE, 2005, pp442- 445 Conference Paper, 2005 URL TARA - Full Text DOI

N.Harte, S. Bates, B. Murray, The IntelliRate Oversampling Architecture for a Gigabit Ethernet Transceiver, Proceedings of Irish Signals and Systems Conference , Irish Signals and Systems Conference , 2002 Conference Paper, 2002

McCourt, P. Harte, N. Vaseghi, S. , Discriminitive Multi-Resolution Sub-Band and Segmental Phonetic Model Combination, Electronics Letters, 36, (3), 2000, p270-271 Journal Article, 2000 URL TARA - Full Text DOI

Paul McCourt, Naomi Harte, Saeed Vaseghi, Combined Temporal and Spectral Multi-Resolution Phonetic Modelling, Proc. Eurospeech, Eurospeech, Budapest, Hungary, September 5-9, 1999, 1999, pp1111-1114 Conference Paper, 1999 URL

McCourt, P., Harte, N., Vaseghi, S., COMBINED TEMPORAL AND SPECTRAL MULTI-RESOLUTION PHONETIC MODELLING, 6th European Conference on Speech Communication and Technology, EUROSPEECH 1999, 1999, p1111-1114 Conference Paper, 1999

NA Harte, Segmental phonetic features and models for speech recognition., ethos.bl.uk, 1999 Book, 1999

McMahon, P.; Harte, N.; Vaseghi, S.; McCourt, P, Discriminative spectral-temporal multiresolution features for speech recognition, IEEE International Conference on Acoustics, Speech, and Signal Processing, Mar 1999, vol.2, 1999, pppp.581-584 Conference Paper, 1999

P.Hanna, N.Harte, J. Ming, S.Vaseghi, F.J.Smith, Variation of features of interframe dependent HMM for speech recognition, IEE Electronic Letters, Apr., 1998, p858-859 Journal Article, 1998 URL TARA - Full Text

N.Harte, S.Vaseghi, B.Milner, Joint Recognition and Segmentation using Phonetically Derived Features and a Hybrid Phoneme Model, Proc International Conference on Spoken Language Processing, Proc International 5th International Conference on Spoken Language Processing, Sydney, Australia, Nov 30 - Dec 4, 1998 Conference Paper, 1998 URL

P.McCourt, S.Vaseghi, N.Harte, Multi-Resolution Cepstral Features for Phoneme Recognition Across Speech Sub-Bands, Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing, International Conference on Acoustics, Speech, and Signal Processing, Seattle, USA, 12-15 May 1998, 1, IEEE, 1998, pp557-560 Conference Paper, 1998 TARA - Full Text URL DOI

N.Harte, S.Vaseghi, P.McCourt, A Novel Model for Phoneme Recognition using Phonetically Derived Features, Proceedings of European Signal Processing Conference (EUSIPCO), , European Signal Processing Conference (EUSIPCO), , 1998, pp1485 - 1488 Conference Paper, 1998

Harte, N., Vascghi, S., McCourt, P., A novel model for phoneme recognition using phonetically derived features, European Signal Processing Conference, 1998-January, 1998 Conference Paper, 1998

Harte, N., Vaseghi, S., Milner, B., JOINT RECOGNITION AND SEGMENTATION USING PHONETICALLY DERIVED FEATURES AND A HYBRID PHONEME MODEL, 5th International Conference on Spoken Language Processing, ICSLP 1998, 1998 Conference Paper, 1998

SVNHB Milner, MULTI-RESOLUTION PHONETIC/SEGMENTAL FEATURES AND MODELS FOR HMM-BASED SPEECH RECOGNITION, 1997 IEEE International ConferenceÂ âŠ, 1997 Journal Article, 1997

S.Vaseghi, N.Harte, B.Milner, Multi-Resolution Phonetic/Segmental Features and Models for HMM-Based Speech Recognition, Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE International Conference on Acoustics, Speech, and Signal Processing, 2, 1997, pp1263 Conference Paper, 1997 URL

N. Harte ; S. Vaseghi ; B. Milner , Dynamic features for segmental speech recognition, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, 1996, p933-- Journal Article, 1996

N.Harte, S.Vaseghi, B.Milner, Dynamic Features for Segmental Speech Recognition, Proc International Conference on Spoken Language Processing, International Conference on Spoken Language Processing, Philadelphia, 3-6 Oct 1996, 1996, pp933-936 Conference Paper, 1996 URL DOI TARA - Full Text

Non-Peer-Reviewed Publications

Dr. Silvia Giordani, Poster Making and Presentation, TCD, Chemistry Dept, 2007 Poster, 2007

Fine-Davis, M., Welcome Address, Mental Health and the Workplace: Challenges and Opportunities, Trinity College, Dublin, 13 March, 2000 Conference Paper, 2000

Projects

Title
- Dynamic Visual Features and Improved Audio-Visual Fusion for Automatic Speech Recognition
Summary
- Human speech is bimodal in nature. Incorporating visual features in Automatic Speech Recognition systems can improve performance in real environments. This work addresses core challenges in audio-visual speech recognition. It will develop new dynamic visual features that better capture the correlations in key mouth movements used by humans in lipreading. This is crucial in improving Hidden Markov Model performance. It will explore a new audio-fusion strategy motivated by the differing visibility of visemes allowing the influence of the audio and video stream to change over time.
Funding Agency
- SFI
Date From
- Oct. 2009
Date To
- Sept. 2013

Title
- Robust Speaker Verification
Summary
- Biometrics involves the use of intrinsic physical or behavioural traits of humans to verify their identity. Traits used in biometrics typically include face, fingerprints, hand geometry, handwriting, iris, retinal, vein, and voice. Many are concerned that these technologies are potentially invasive and open to fraud. Speaker verification, using voice or voice and video, has been recognised as an important alternative in the world of biometrics. It is less invasive and requires less expensive installations that iris and fingerprint authentication systems. The changes that occur in the human voice due to ageing have been well documented. The impact of these changes on speaker verification is less clear. In this work, we examine the effect of long-term vocal ageing on a speaker verification systems.
Funding Agency
- IRCSET
Date From
- 2009
Date To
- 2012

Title
- Audio-Visual Fusion for Human Computer Interaction.
Summary
- This project will thus focus on key challenges in Audio Visual Speech Recognition: . Given state of the art audio and visual features, do early or late integration strategies work better? . How well does such an integration scheme translate to less controlled situations, where the speech is less constrained, intonation or prosody is more natural, or the speech is emotionally influenced? . Can these algorithms work on a real handheld device?
Funding Agency
- IRCSET
Date From
- 2011
Date To
- 2014

Title
- Speech Quality for VoIP
Summary
- This project is developing new metrics to measure speech quality for VoIP applications, particularly Google Chrome WebRTC
Funding Agency
- Google Inc
Date From
- April 2011
Date To
- April 2012

Title
- Advanced Metrics for Audio-Visual Signal Quality in Internet Communications
Funding Agency
- Enterprise Ireland/Google
Date From
- Sept 2013
Date To
- Dec 2014

Keywords

Audio-visual speech processing; Birdsong Analysis; Emotion in Speech; Human-Computer Interaction; Information/Communication Systems; Multimedia; Signal Processing; Speaker Recognition; SPEECH; Speech Biometrics; Speech processing/technology; Speech Quality; SPEECH RECOGNITION

Recognition

Representations

International Expert Reviewer for Swiss National Science Foundation (SNSF)

Peer reviewing for top conferences and journals, e.g.: IEEE ICASSP, Interspeech, ACM ICMI, EUSIPCO, IEEE ASRU, IEEE ICIP, ACL, Speech Communication, JASA, IEEE Trans Multimedia ongoing

Senior Technical Program Committee for ACM ICMI 2019

TCD Representative to MIDAS (MicroElectronics Design Association of Ireland)

Irish representative to the EU COST Action 2101 entitled "Biometrics for Identity Documents and Smart cards"

Regular Session Chair at Interspeech ongoing

Irish representative to the EU COST Action IC1006 Integrating Biometrics and Forensics for the Digital Age

ICT Evaluator for FP6 ICT Call FP6-2004-SME-COOP in Co-operative research (Research involving SMEs, Universities and research organisations). Acted as Group Rapporteur.

Expert Evaluator for FP7 Call FP7-REGIONS-2012-2013-1 in Transnational cooperation between regional research-driven clusters

PhD External Examiner University of Cambridge

PhD External Examiner, Victoria University, New Zealand

PhD External Examiner, University of York

PhD External Examiner, Athlone Institute of Technology

PhD External Examiner, University of East Anglia

Awards and Honours

AI Awards (Shortlisted in Best Application of AI in an Academic Research Body) 2019

Google Faculty Award 2018

Fellow of Trinity College Dublin 2017

Cognitec Best Student Paper Award for PhD Student Finnian Kelly, International Conference on Biometrics (ICB) 2012

Shortlisted for Provost Teaching Award 2011

British Telecom Research Scholarship 1997-1999

IEE Leslie H. Paddle Scholarship 1995-1998

Glen Dimplex British Council Chevening Scholarship 1995-1996

Awarded a Gold Medal for Distinction in Engineering upon graduation. 1995

Maurice F. Fitzgerald Prize - first overall in the Engineering Faculty in the Degree exams. 1995

David Clark Prize - first place in the Microelectronic and Electrical Engineering Degree exams. 1995

Memberships

IEEE (Institute of Electrical and Electronics Engineers)

ISCA (International Speech Communication Association)

IEEE Women in Engineering

IEEE Signal Processing Society

Trinity College Dublin, The University of Dublin

Trinity Search

Trinity Menu

Trinity Research

Professor Naomi Harte
Professor in Speech Technology, Electronic & Elect. Engineering

Biography

Publications and Further Research Outputs

Peer-Reviewed Publications

Non-Peer-Reviewed Publications

Research Expertise

Projects

Keywords

Recognition

Representations

Awards and Honours

Memberships

Sitemap

Contact Us

Our Location

Trinity Search

Trinity Menu

Professor Naomi Harte Professor in Speech Technology, Electronic & Elect. Engineering

Biography

Publications and Further Research Outputs

Peer-Reviewed Publications

Non-Peer-Reviewed Publications

Research Expertise

Projects

Keywords

Recognition

Representations

Awards and Honours

Memberships

Professor Naomi Harte
Professor in Speech Technology, Electronic & Elect. Engineering