Professor Naomi Harte

Professor in Speech Technology, Electronic & Electrical Engineering

nharte@tcd.ie, +353 1 896 1861, www.sigmedia.tv

Biography

Naomi is Professor in Speech Technology in the School of Engineering at Trinity College Dublin (TCD). She is Co-PI and a founding member of the ADAPT SFI Centre. In ADAPT, she has led a major Research Theme centred on Multimodal Interaction, involving researchers from universities across Ireland, and was instrumental in developing the future vision for the Centre for 2021-2026. She is also a lead academic of the hugely successful Sigmedia Research Group in the School of Engineering. She was appointed as an SFI Engineering Initiative Lecturer in Digital Media in TCD in 2008 (Stokes Programme). Prior to returning to academia, Naomi worked in high-tech start-ups in the field of DSP Systems Development, including her own company. She also previously worked at McMaster University in Canada. She was a Visiting Professor at ICSI in 2015 and became a Fellow of TCD in 2017. She earned a Google Faculty Award in 2018 and was shortlisted for the AI Ireland Awards in 2019. She currently serves on the Editorial Board of Computer Speech and Language and was General Chair of INTERSPEECH 2023 in Dublin.

Naomi's research centres on Human Speech Communication. She likes to consider speech as something we both hear and see, and her work has a strong multimodal aspect. Her research involves the design and application of mathematical algorithms to enhance or augment speech communication between humans and technology. Much of that work is underpinned by signal processing and machine learning, but it also requires an understanding of how humans interact. Her current research projects include audio-visual speech recognition, speech synthesis evaluation, multimodal speech analysis, and birdsong. Her industrial background brings a real-world approach to her research.

Publications and Further Research Outputs


Pitie, F., Kelly, D., Foucu, T., Harte, N., Kokaram, A., Assessment of Audio/Video synchronisation in streaming media, 2014 6th International Workshop on Quality of Multimedia Experience, QoMEX 2014, 2014, pp171-176Conference Paper, 2014, DOI
Hines, A., Skoglund, J., Kokaram, A., Harte, N., Robustness of speech quality metrics to background noise and network degradations: Comparing ViSQOL, PESQ and POLQA, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2013, pp3697-3701Conference Paper, 2013, DOI
Francois Pitie and Damien Kelly and Thierry Foucu and Naomi Harte and Anil C. Kokaram , Assessment of Audio/Video synchronisation in streaming media., International Workshop on Quality of Multimedia Experience, Singapore, 2014, pp171 - 176Conference Paper, 2014
Hines A, Skoglund J, Kokaram A.C, Harte N, ViSQOL: an objective speech quality model, Eurasip Journal on Audio, Speech, and Music Processing, 2015, (1), 2015, p13-Journal Article, 2015, DOI , URL , TARA - Full Text
Hines A, Gillen E, Kelly D, Skoglund J, Kokaram A, Harte N, ViSQOLAudio: An objective audio quality metric for low bitrate codecs, Journal of the Acoustical Society of America, 137, (6), 2015, pEL449 - EL455Journal Article, 2015, DOI , URL , TARA - Full Text
Harte N, Gillen E, TCD-TIMIT: An audio-visual corpus of continuous speech, IEEE Transactions on Multimedia, 17, (5), 2015, p603 - 615Journal Article, 2015, DOI , URL
Harte N, Gillen E, Hines A, TCD-VoIP, a research database of degraded speech for assessing quality in VoIP applications, 7th International Workshop on Quality of Multimedia Experience, QoMEX 2015, 26-29 May 2015 , IEEE, 2015, 7148100-Conference Paper, 2015, DOI
C. O'Reilly, D. J. Kelly, N. M. Marples and N. Harte , Quantifying difference in vocalizations of bird populations, Proceedings of Interspeech 2015, 2015, 2015, p3417 - 3421Journal Article, 2015
Sloan C, Harte N, Kelly D, Kokaram A.C, Hines A, Bitrate classification of twice-encoded audio using objective quality features, 2016 8th International Conference on Quality of Multimedia Experience, QoMEX 2016, 2016, 2016, pp7498956-Conference Paper, 2016, DOI , URL
Hines A, Skoglund J, Kokaram A.C, Harte N, Monitoring voip speech quality for chopped and clipped speech, Komunikacie, 18, (1), 2016, p3 - 10Journal Article, 2016, URL
O'Reilly C, Marples N.M, Kelly D.J, Harte N, YIN-bird: Improved pitch tracking for bird vocalisations, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2016, 08-12-September-2016, 2016, pp2641 - 2645Conference Paper, 2016, DOI , URL
Kelly F, Harte N, Forensic comparison of ageing voices from automatic and auditory perspectives, International Journal of Speech, Language and the Law, 22, (2), 2015, p167 - 202Journal Article, 2015, DOI , URL
Hines A, Gillen E, Harte N, Measuring and monitoring speech quality for voice over IP with POLQA, ViSQOL and P.563, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2015, 2015-January, 2015, pp438 - 442Conference Paper, 2015, URL
Sloan, C. and Harte, N. and Kelly, D. and Kokaram, A.C. and Hines, A., Objective Assessment of Perceptual Audio Quality Using ViSQOLAudio, IEEE Transactions on Broadcasting, 63, (4), 2017, p693-705Journal Article, 2017, DOI , URL , TARA - Full Text
Roddy, M. and Harte, N., Detecting conversational gaze aversion using unsupervised learning, 2017-January, (8081172), 2017, pp76-80Conference Paper, 2017, DOI , URL
O'Reilly, C. and Kokuer, M. and Jancovic, P. and Drennan, R. and Harte, N., Automatic frequency feature extraction for bird species delimitation, 2017-January, (8081511), 2017, pp1759-1763Conference Paper, 2017, DOI , URL
Cullen, A. and Harte, N., A longitudinal database of Irish political speech with annotations of speaker ability, Language Resources and Evaluation, 52, (2), 2018, p401-432Journal Article, 2018, DOI , URL
Jassim, W.A. and Paramesran, R. and Harte, N., Speech emotion classification using combined neurogram and INTERSPEECH 2010 paralinguistic challenge features, IET Signal Processing, 11, (5), 2017, p587-595Journal Article, 2017, DOI , URL
Wissam A. Jassim and Naomi Harte, Voice Activity Detection Using Neurograms, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, Alberta, Canada, 15-20 April 2018, 2018Conference Paper, 2018
Laura Dungan, Ali Karaali, Naomi Harte, The Impact Of Reduced Video Quality On Visual Speech Recognition, IEEE International Conference on Image Processing, Athens, Greece, 2018Conference Paper, 2018, DOI
Cullen, A. and Hines, A. and Harte, N., Perception and prediction of speaker appeal – A single speaker study, Computer Speech and Language, 52, 2018, p23-40Journal Article, 2018, DOI , URL
O'Reilly, C. and Analuddin, K. and Kelly, D.J. and Harte, N., Measuring vocal difference in bird population pairs, Journal of the Acoustical Society of America, 143, (3), 2018, p1658-1671Journal Article, 2018, DOI , URL
Clark, L. and Cowan, B.R. and Edwards, J. and Edlund, J. and Szekely, E. and Munteanu, C. and Murad, C. and Healey, P. and Aylett, M. and Harte, N. and Torre, I. and Moore, R.K. and Doyle, P., Mapping theoretical and methodological perspectives for understanding speech interface interactions, CHI EA '19 Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems , (3299009), 2019Conference Paper, 2019, DOI , URL , TARA - Full Text
Roddy, M. and Skantze, G. and Harte, N., Multimodal continuous turn-taking prediction using multiscale RNNs, ICMI '18 Proceedings of the 20th ACM International Conference on Multimodal Interaction, 2018, pp186-190Conference Paper, 2018, DOI , URL , TARA - Full Text
Sterpu, G. and Saam, C. and Harte, N., Attention-based audio-visual fusion for robust automatic speech recognition, ICMI '18 Proceedings of the 20th ACM International Conference on Multimodal Interaction, 2018, pp111-115Conference Paper, 2018, DOI , URL , TARA - Full Text
Ilaria Torre, Emma Carrigan, Killian McCabe, Rachel McDonnell, Naomi Harte, Survival at the museum: A cooperation experiment with emotionally expressive virtual characters, ICMI '18 Proceedings of the 20th ACM International Conference on Multimodal Interaction , 2018, pp423-427Conference Paper, 2018, DOI , URL , TARA - Full Text
Sterpu, G. and Saam, C. and Harte, N., Can DNNs Learn to Lipread Full Sentences?, 2018 25th IEEE International Conference on Image Processing (ICIP), (8451388), 2018, pp16-20Conference Paper, 2018, DOI , URL
Dungan, L. and Karaali, A. and Harte, N., The impact of reduced video quality on visual speech recognition, 2018 25th IEEE International Conference on Image Processing (ICIP), (8451754), 2018, pp2560-2564Conference Paper, 2018, DOI , URL
Roddy, M. and Skantze, G. and Harte, N., Investigating speech features for continuous turn-taking prediction using LSTMs, Proc. Interspeech 2018, Interspeech 2018, 2018-September, 2018, pp586-590Conference Paper, 2018, DOI , URL
D Lennon and N Harte and A Kokaram and E Doyle and ..., A HMM framework for motion based parsing for video from observational psychology, IEEE Irish Machine Vision …, 2006Journal Article, 2006, URL
F Kelly and N Harte, A comparison of auditory features for robust speech recognition, presentation, 18th European Signal Processing …, 2010Journal Article, 2010
AJ Hines and J Skoglund and N Harte and A Kokaram, Detection of chopped speech, US Patent 9,263,061, 2016Journal Article, 2016, URL
A Cullen and N Harte, Thin slicing to predict viewer impressions of TED Talks, … of the 14th International Conference on …, 2017Journal Article, 2017, URL
D Lennon and N Harte and A Kokaram, Rotation detection using the curl equation, 2007 IEEE International …, 2007Journal Article, 2007, URL
G Sterpu and N Harte, Towards Lipreading Sentences with Active Appearance Models, arXiv preprint arXiv:1805.11688, 2018Journal Article, 2018, URL
C Berry and A Kokaram and N Harte, An extended multiresolution approach to mouth specific AAM fitting for speech recognition, 2011 19th European Signal …, 2011Journal Article, 2011, URL
C O'Reilly and N Harte, Pitch tracking of bird vocalizations and an automated process using YIN-bird, Cogent Biology, 2017Journal Article, 2017, URL
F Kelly and N Harte and M Fairhurst, The impact of ageing on speech-based biometric systems, Age Factors in Biometric Processing, 2013Journal Article, 2013
K Pan and F Kelly and N Harte and S Murphy and DJ Kelly and ..., Shape Models for Image Segmentation in Microscopy, mee.tcd.ie, 2013Book, 2013, URL
NA Harte, Segmental phonetic features and models for speech recognition., ethos.bl.uk, 1999Book, 1999, URL
A Hines and N Harte, Reproduction of the Performance/Intensity Function using image processing and an auditory nerve computational model, 2010Conference Paper, 2010, URL
L Cappelletta and N Harte, Non Phonetic Viseme Definition for Visual-Only Speech Recognition, 2012, -Miscellaneous, 2012
N Harte and P Jancovic and Karl-L. Schuchmann, Interspeech 2016 Special Session on Bird and Animal Vocalisations Organisers, In:Interspeech 2016, 2016Meetings /Conferences Organised, 2016, URL
M Roddy and N Harte, Towards predicting dialog acts from previous speakers' non-verbal cues, BIBTEX 2017, 2017, pp1--Conference Paper, 2017, URL , TARA - Full Text
N Hurley and N Harte and C Fearon and S Rickard, Speech Source Separation in Hardware, 2009, -Miscellaneous, 2009
N. Harte ; S. Vaseghi ; B. Milner , Dynamic features for segmental speech recognition, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, 1996, p933--Journal Article, 1996, URL
S Vaseghi and N Harte and B Milner, Multi-Resolution Phonetic/Segmental Features and Models for HMM-Based Speech Recognition, 1997 IEEE International Conference …, 1997Journal Article, 1997
Sterpu G., Saam C., Harte N., How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition, IEEE/ACM Transactions on Audio Speech and Language Processing, 28, 2020, p1052 - 1064Journal Article, 2020, DOI
Roddy, Matthew and Harte, Naomi, Neural Generation of Dialogue Response Timings, Annual Conference of the Association for Computational Linguistics (ACL), 2020, pp2442-2452Conference Paper, 2020, URL
Fernandez-Lopez, Adriana and Karaali, Ali and Harte, Naomi and Sukno, Federico M, ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020, pp6294--6298Conference Paper, 2020, URL
Motion, Interaction and Games in, Motion, Interaction and Games, 2019, pp1--6 , [Torre, Ilaria and Carrigan, Emma and McDonnell, Rachel and Domijan, Katarina and McCabe, Killian and Harte, Naomi]Book Chapter, 2019, URL
Jassim, Wissam A and Harte, Naomi, Estimation of a priori signal-to-noise ratio using neurograms for speech enhancement, The Journal of the Acoustical Society of America, 147, (6), 2020, p3830--3848Journal Article, 2020
Andrew J. Hines, Jan Skoglund, Naomi Harte, Anil Kokaram, 'Detection of chopped speech', US, 2016, Google LLCPatent, 2016, URL
Jan Skoglund, Andrew J. Hines, Naomi A. Harte, Anil Kokaram, 'Objective speech quality metric', US, US20150199959A1, 2016, Google LLCPatent, 2016, URL
Le Maguer, Sebastien and Harte, Naomi, Investigation of Auditory Nerve Model Based Analysis for Vocoded Speech Synthesis, 2020, pp1--6Conference Paper, 2020, DOI
Ilaria Torre, Emma Carrigan, Rachel McDonnell, Katarina Domijan, Killian McCabe, Naomi Harte, The effect of multimodal emotional expression and agent appearance on trust in human-agent interaction, Proceedings - MIG 2019: ACM Conference on Motion, Interaction, and Games, ACM Conference on Motion, Interaction, and Games, 2019, 2019Conference Paper, 2019, DOI , URL , TARA - Full Text
A Karaali and N Harte and CR Jung, Deep Multi-Scale Feature Learning for Defocus Blur Estimation, IEEE Transactions on Image Processing, 2022Journal Article, 2022, DOI , URL
Ilaria Torre, Emma Carrigan, Katarina Domijan, Rachel McDonnell, Naomi Harte, The Effect of Audio-Visual Smiles on Social Influence in a Cooperative Human-Agent Interaction Task, ACM Transactions on Computer-Human Interaction (TOCHI), 28, (6), 2021, p1-38Journal Article, 2021, DOI , URL
M Anderson and N Harte, Bioacoustic Event Detection with prototypical networks and data augmentation, 2021Report, 2021, URL
G Sterpu and C Saam and N Harte, Learning to count words in fluent speech enables online speech recognition, 2021 IEEE Spoken Language Technology Workshop (SLT), 2021, pp38-45Conference Paper, 2021, URL
Torre, Ilaria and Carrigan, Emma and Domijan, Katarina and McDonnell, Rachel and Harte, Naomi, Dimensional perception of a 'smiling McGurk effect', 9th International Conference on Affective Computing and Intelligent Interaction (ACII), 2021, pp1-8Conference Paper, 2021, URL
Mark Anderson, John Kennedy, Naomi Harte, Low Resource Species Agnostic Bird Activity Detection, 2021 IEEE Workshop on Signal Processing Systems (SiPS), 2021, pp34-39Conference Paper, 2021, URL
Ayushi Pandey, Sébastien Le Maguer, Julie Berndsen, Naomi Harte, Mind your p's and k's--Comparing obstruents across TTS voices of the Blizzard Challenge 2013, Proc. 11th ISCA Speech Synthesis Workshop (SSW 11), 2021, pp166-171Conference Paper, 2021
Ilaria Torre, Simon Holk, Elmira Yadollahi, Iolanda Leite, Rachel McDonnell, Naomi Harte, Smiling in the Face and Voice of Avatars and Robots: Evidence for a smiling McGurk Effect, IEEE Transactions on Affective Computing, 2022, p1-12Journal Article, 2022, DOI
Torre, I. and Deichler, A. and Nicholson, M. and McDonnell, R. and Harte, N., To smile or not to smile: The effect of mismatched emotional expressions in a Human-Robot cooperative task, 2022, pp8-13Conference Paper, 2022, DOI , URL
Sébastien Le Maguer, Simon King, Naomi Harte, The limits of the Mean Opinion Score for speech synthesis evaluation, Computer Speech and Language, 84, 2024Journal Article, 2024
Kotey, S., Dahyot, R., Harte, N., Fine Grained Spoken Document Summarization Through Text Segmentation, 2022 IEEE Spoken Language Technology Workshop, SLT 2022 - Proceedings, 2023, p647-654Conference Paper, 2023, DOI
Gonzales, M.G., Corcoran, P., Harte, N., Schukat, M., Joint Speech-Text Embeddings with Disentangled Speaker Features, 2023 34th Irish Signals and Systems Conference, ISSC 2023, 2023Conference Paper, 2023, DOI
Anderson, M., Kinnunen, T., Harte, N., Learnable Frontends That Do Not Learn: Quantifying Sensitivity To Filterbank Initialisation, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2023-June, 2023Conference Paper, 2023, DOI
Pandey, A., Edlund, J., Le Maguer, S., Harte, N., Listener sensitivity to deviating obstruents in WaveNet, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2023-August, 2023, p1080-1084Conference Paper, 2023, DOI
Pandey, A., Le Maguer, S., Carson-Berndsen, J., Harte, N., Production characteristics of obstruents in WaveNET and older TTS systems, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2022-September, 2022, p2373-2377Conference Paper, 2022, DOI
Anderson, M., Harte, N., Learnable Acoustic Frontends in Bird Activity Detection, International Workshop on Acoustic Signal Enhancement, IWAENC 2022 - Proceedings, 2022Conference Paper, 2022, DOI
Reverdy, J., O'Connor Russell, S., Duquenne, L., Garaialde, D., Cowan, B., Harte, N., RoomReader: A Multimodal Corpus of Online Multiparty Conversational Interactions, 2022 Language Resources and Evaluation Conference, LREC 2022, 2022, p2517-2527Conference Paper, 2022
Sterpu, G., Harte, N., Taris: An online speech recognition framework with sequence to sequence neural networks for both audio-only and audio-visual speech, Computer Speech and Language, 74, 2022Journal Article, 2022, DOI
Le Maguer, S., King, S., Harte, N., Back to the Future: Extending the Blizzard Challenge 2013, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2022-September, 2022, p2378-2382Conference Paper, 2022, DOI
Jassim, W.A., Harte, N., Comparison of discrete transforms for deep-neural-networks-based speech enhancement, IET Signal Processing, 16, (4), 2022, p438-448Journal Article, 2022, DOI
Le Maguer, S., Anderson, M., Harte, N., Sp1NY: A Quick and Flexible Speech visualisation Tool in Python, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2023-August, 2023, p2012-2013Conference Paper, 2023
Kotey, S., Dahyot, R., Harte, N., Query Based Acoustic Summarization for Podcasts, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2023-August, 2023, p1483-1487Conference Paper, 2023, DOI
Edmonds, C.J., Harte, N., Gardner, M., How does drinking water affect attention and memory? The effect of mouth rinsing and mouth drying on children's performance, Physiology and Behavior, 194, 2018, p233-238Journal Article, 2018, DOI
Harte, N., Lennon, D., Kokaram, A., On parsing visual sequences with the hidden markov model, Eurasip Journal on Image and Video Processing, 2009, 2009Journal Article, 2009, DOI
McCourt, P., Harte, N., Vaseghi, S., COMBINED TEMPORAL AND SPECTRAL MULTI-RESOLUTION PHONETIC MODELLING, 6th European Conference on Speech Communication and Technology, EUROSPEECH 1999, 1999, p1111-1114Conference Paper, 1999
Harte, N., Vaseghi, S., McCourt, P., A novel model for phoneme recognition using phonetically derived features, European Signal Processing Conference, 1998-January, 1998Conference Paper, 1998
Harte, N., Vaseghi, S., Milner, B., JOINT RECOGNITION AND SEGMENTATION USING PHONETICALLY DERIVED FEATURES AND A HYBRID PHONEME MODEL, 5th International Conference on Spoken Language Processing, ICSLP 1998, 1998Conference Paper, 1998
Kelly, F., Drygajlo, A., Harte, N., Compensating for ageing and quality variation in speaker verification, 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, 1, 2012, p498-501Conference Paper, 2012
Le Maguer, S., Harte, N., Can auditory nerve models tell us what's different about wavenet vocoded speech?, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2020-October, 2020, p230-234Conference Paper, 2020, DOI
Sterpu, G., Saam, C., Harte, N., Should we hard-code the recurrence concept or learn it instead ? Exploring the Transformer architecture for Audio-Visual Speech Recognition, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2020-October, 2020, p3506-3509Conference Paper, 2020, DOI
Cappelletta, L., Harte, N., Phoneme-to-viseme mapping for visual speech recognition, ICPRAM 2012 - Proceedings of the 1st International Conference on Pattern Recognition Applications and Methods, 2, 2012, p322-329Conference Paper, 2012
Hines, A., Skoglund, J., Kokaram, A., Harte, N., VISQOL: The virtual speech quality objective listener, International Workshop on Acoustic Signal Enhancement, IWAENC 2012, 2012Conference Paper, 2012
Storey, Edward and Harte, Naomi and Bell, Peter, Language Bias in Self-Supervised Learning For Automatic Speech Recognition, 2024, pp37 – 42Conference Paper, 2024, DOI , URL
Russell, Sam O'Connor and Gessinger, Iona and Krason, Anna and Vigliocco, Gabriella and Harte, Naomi, What automatic speech recognition can and cannot do for conversational speech transcription, Research Methods in Applied Linguistics, 3, (3), 2024Journal Article, 2024, DOI , URL
Lopez-Espejo, Ivan and Rosello, Eros and Edraki, Amin and Harte, Naomi and Jensen, Jesper, Noise-Robust Hearing Aid Voice Control, IEEE Signal Processing Letters, 2024Journal Article, 2024, DOI , URL
Gonzales, Michael Gian and Corcoran, Peter and Harte, Naomi and Schukat, Michael, Joint Speech-Text Embeddings for Multitask Speech Processing, IEEE Access, 12, 2024, p145955 – 145967Journal Article, 2024, DOI , URL
Sam O'Connor Russell and Naomi Harte, Towards Multimodal Turn-taking for Naturalistic Human-Robot Interaction, Second International Multimodal Communication Symposium (MMSYM), Frankfurt, Germany, 25-27/09/2024, 2024Conference Paper, 2024, TARA - Full Text
Corrigan, David; Harte, Naomi; Kokaram, Anil;, Automated Segmentation of Torn Frames using the Graph Cuts Technique, Image Processing, IEEE International Conference on Image Processing, 2007. ICIP 2007., San Antonio, TX, USA , 2007, (Sept. 16-Oct. 19), 2007, pp557-560Conference Paper, 2007, DOI , URL , TARA - Full Text
Harte, Naomi; Rankin, Andrew; Baugh, Gary; Kokaram, Anil;, Detection of Illegal Dumping from CCTV at Recycling Centres, International Machine Vision and Image Processing, International Machine Vision and Image Processing Conference, Kildare, Ireland , 2007, (5-7 Sept. ), 2007, pp204Conference Paper, 2007, URL , TARA - Full Text
Corrigan, D., Harte, N. and Kokaram, A., Pathological motion detection for robust missing data treatment in degraded archived media, Image Processing, IEEE International Conference on Image Processing 2006, Atlanta, GA , 8-11 Oct. 2006 , 2006, pp621 - 624Conference Paper, 2006, DOI , URL , TARA - Full Text
David Corrigan, Naomi Harte, Anil Kokaram, Pathological Motion Detection for Robust Missing Data Treatment, EURASIP Journal on Advances in Signal Processing, 2008, 2008, pArticle ID 542436Journal Article, 2008, DOI , TARA - Full Text
Action Recognition in Multimedia Streams in, editor(s)Petros Maragos, Alexandros Potamianos, Patrick Gros , Multimodal Processing and Interaction, Springer Verlag. , 2008, pp127 - 142, [Daire Lennon, Naomi Harte, and Anil Kokaram, Rozenn Dahyot, Francois Pitie]Book Chapter, 2008
Naomi Harte and Anil Kokaram, Automated Removal of Overshoot Artefact from Images, EUSIPCO , European Signal Processing Conference , 2006Conference Paper, 2006, URL
Daire Lennon, Naomi Harte, Anil Kokaram, Erika Doyle, Ray Fuller, A HMM Framework for Motion based parsing for video from Observational Psychology, IEEE Irish Machine Vision and Image Processing Conference, Irish Machine Vision and Image Processing Conference , 2006Conference Paper, 2006, URL , TARA - Full Text
Naomi Harte, Shahab U. Ansari, Ian Bruce, Exploiting Voicing Cues for Contrast Enhanced Frequency Shaping of Speech for Impaired Listeners, Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE International Conference on Acoustics, Speech, and Signal Processing, Toulouse, 14-19 May 2006 , 5, IEEE, 2006, ppVConference Paper, 2006, DOI , URL , TARA - Full Text
Ansari, S., Harte, N., and Bruce, I., Efficiently combining improved contrast-enhancing frequency shaping and multiband compression to enhance speech intelligibility in hearing aids, Lake Ontario Auditory Neuroscience (LOAN) Meeting, Hamilton, Canada, 2005Conference Paper, 2005
Naomi Harte, Niall Hurley, Conor Fearon, Scott Rickard, Towards a Hardware Realization of Time-Frequency Source Separation of Speech, Proceedings of IEEE European Conference on Circuit Theory and Design, IEEE European Conference on Circuit Theory and Design, 28 Aug -2 Sept. 2005, IEEE, 2005Conference Paper, 2005, DOI , URL , TARA - Full Text
Niall Hurley, Naomi Harte, Conor Fearon, Scott Rickard, Blind Source Separation of Speech in Hardware, Workshop on Signal Processing Systems, Nov 2005, IEEE, 2005, pp442- 445Conference Paper, 2005, DOI , URL , TARA - Full Text
N.Harte, S. Bates, B. Murray, The IntelliRate Oversampling Architecture for a Gigabit Ethernet Transceiver, Proceedings of Irish Signals and Systems Conference , Irish Signals and Systems Conference , 2002Conference Paper, 2002
McCourt, P., Harte, N., Vaseghi, S., Discriminative Multi-Resolution Sub-Band and Segmental Phonetic Model Combination, Electronics Letters, 36, (3), 2000, p270-271Journal Article, 2000, DOI , URL , TARA - Full Text
Paul McCourt, Naomi Harte, Saeed Vaseghi, Combined Temporal and Spectral Multi-Resolution Phonetic Modelling, Proc. Eurospeech, Eurospeech, Budapest, Hungary, September 5-9, 1999, 1999, pp1111-1114Conference Paper, 1999, URL
P.McCourt, S.Vaseghi, N.Harte, Multi-Resolution Cepstral Features for Phoneme Recognition Across Speech Sub-Bands, Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing, International Conference on Acoustics, Speech, and Signal Processing, Seattle, USA, 12-15 May 1998, 1, IEEE, 1998, pp557-560Conference Paper, 1998, DOI , URL , TARA - Full Text
P.Hanna, N.Harte, J. Ming, S.Vaseghi, F.J.Smith, Variation of features of interframe dependent HMM for speech recognition, IEE Electronic Letters, Apr., 1998, p858-859Journal Article, 1998, URL , TARA - Full Text
N.Harte, S.Vaseghi, P.McCourt, A Novel Model for Phoneme Recognition using Phonetically Derived Features, Proceedings of European Signal Processing Conference (EUSIPCO), 1998, pp1485 - 1488Conference Paper, 1998
N.Harte, S.Vaseghi, B.Milner, Joint Recognition and Segmentation using Phonetically Derived Features and a Hybrid Phoneme Model, Proc International Conference on Spoken Language Processing, Proc International 5th International Conference on Spoken Language Processing, Sydney, Australia, Nov 30 - Dec 4, 1998Conference Paper, 1998, URL
S.Vaseghi, N.Harte, B.Milner, Multi-Resolution Phonetic/Segmental Features and Models for HMM-Based Speech Recognition, Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE International Conference on Acoustics, Speech, and Signal Processing, 2, 1997, pp1263Conference Paper, 1997, URL
N.Harte, S.Vaseghi, B.Milner, Dynamic Features for Segmental Speech Recognition, Proc International Conference on Spoken Language Processing, International Conference on Spoken Language Processing, Philadelphia, 3-6 Oct 1996, 1996, pp933-936Conference Paper, 1996, DOI , URL , TARA - Full Text
McMahon, P.; Harte, N.; Vaseghi, S.; McCourt, P, Discriminative spectral-temporal multiresolution features for speech recognition, IEEE International Conference on Acoustics, Speech, and Signal Processing, Mar 1999, vol.2, 1999, pp581-584Conference Paper, 1999
Andrew Hines, Naomi Harte, Error Metrics for Impaired Auditory Nerve Responses of Different Phoneme Groups, Interspeech 2009, Brighton, 2009, 2009, pp1119 - 1122Conference Paper, 2009, TARA - Full Text
Craig Berry, Naomi Harte, Region of Interest Extraction using Colour Based Methods on the CUAVE Database , IET Irish Signals and Systems Conference ISSC, Dublin, 10-12 June , 2009Conference Paper, 2009, TARA - Full Text
Andrew Hines, Naomi Harte , Measurement of phonemic degradation in sensorineural hearing loss using a computational model of the auditory periphery , IET Irish Signals and Systems Conference ISSC 2009, UCD, June 10-11, 2009, pp1-6Conference Paper, 2009, URL , TARA - Full Text
Naomi Harte, Daire Lennon, and Anil Kokaram, On Parsing Visual Sequences with the Hidden Markov Model, EURASIP Journal on Image and Video Processing , Volume 2009, 2009Journal Article, 2009, DOI , TARA - Full Text
Andrew Hines and Naomi Harte, Speech intelligibility from image processing, Speech Communication, 52, (9), 2010, p736 - 752Journal Article, 2010
Finnian Kelly and Naomi Harte, Training GMMs for Speaker Verification. , IET Irish Signals and Systems Conference, Cork, Ireland, June 2010, 2010, pp163 - 168Conference Paper, 2010
Finnian Kelly and Naomi Harte, A Comparison of Auditory Features for Robust Speech Recognition. , European Signal Processing Conference (EUSIPCO 2010). , Aalborg, Denmark, August 2010, 2010Conference Paper, 2010, DOI
Finnian Kelly and Naomi Harte, Auditory Features Revisited for Robust Speech Recognition. , International Conference on Pattern Recognition (ICPR). , Istanbul, Turkey, Aug 2010, 2010, pp4456 - 4459Conference Paper, 2010
Andrew Hines and Naomi Harte, Evaluating Sensorineural Hearing Loss With An Auditory Nerve Model Using A Mean Structural Similarity Measure. , European Signal Processing Conference (EUSIPCO '10). , Aalborg, Denmark, 2010Conference Paper, 2010, TARA - Full Text
Luca Cappelletta and Naomi Harte, Nostril Detection for Robust Mouth Tracking, Irish Signals and Systems Conference, Cork, Ireland, 2010, pp239 - 244Conference Paper, 2010
Andrew Hines, Naomi Harte, Speech Intelligibility prediction using a Neurogram Similarity Index Measure, Speech Communication, 54, (2), 2012, p306-320. Journal Article, 2012
Finnian Kelly, Naomi Harte, Effects of Long-Term Ageing on Speaker Verification, Proceedings of the COST 2101 European conference on Biometrics and ID management, Springer-Verlag, 2011, pp113 - 124. Conference Paper, 2011
Luca Cappelletta and Naomi Harte, Viseme Definitions Comparison for Visual-Only Speech Recognition, European Signal Processing Conference (EUSIPCO), 2011, pp2109 - 2113. Conference Paper, 2011
Craig Berry, Anil Kokaram, Naomi Harte, An Extended Multiresolution Approach to Mouth Specific AAM Fitting for Speech Recognition, European Signal Processing Conference (EUSIPCO), 2011. Conference Paper, 2011
Andrew Hines and Naomi Harte, Simulated performance intensity functions, Engineering in Medicine and Biology Society Conference (EMBC), EMBS (IEEE), 2011, pp7139 - 7142. Conference Paper, 2011
Andrew Hines and Naomi Harte, Comparing hearing aid algorithm performance using Simulated Performance Intensity Functions, Speech perception and auditory disorders, Int. Symposium on Audiological and Auditory Research (ISAAR), 2011. Conference Paper, 2011
Corrigan, D.; Kokaram, A.; Harte, N., Algorithms for the Digital Restoration of Torn Film, Image Processing, IEEE Transactions on, 21, (2), 2012, p573-587. Journal Article, 2012
F. Kelly, A. Drygajlo and N. Harte, Speaker Verification with Long-Term Ageing Data, International Conference on Biometrics (ICB), New Delhi, 2012, pp478 - 483. Conference Paper, 2012
Andrew Hines, Naomi Harte, Improved Speech Intelligibility with a Chimaera Hearing Aid Algorithm, Interspeech, Portland, OR, ISCA, 2012, pp1 - 4. Conference Paper, 2012
Andrew Hines, Jan Skoglund, Anil Kokaram, Naomi Harte, ViSQOL: The Virtual Speech Quality Objective Listener, The International Workshop on Acoustic Signal Enhancement (IWAENC), Aachen, Germany, 4-6 Sept. 2012, 2012, pp1 - 4. Conference Paper, 2012
K. Sooknanan, A. Kokaram, D. Corrigan, G. Baugh, N. Harte and J. Wilson, Indexing and Selection of Well-Lit Details in Underwater Video Mosaics Using Vignetting Estimation, Program Book - OCEANS 2012 MTS/IEEE Yeosu: The Living Ocean and Coast - Diversity of Resources and Sustainable Activities, International OCEANS Conference, Yeosu, South Korea, May, IEEE, 2012, Article number 6263541. Conference Paper, 2012
K. Sooknanan, A. Kokaram, D. Corrigan, G. Baugh, J. Wilson and N. Harte, Improving Underwater Visibility Using Vignetting Correction, Proceedings of SPIE - The International Society for Optical Engineering, Visual Information Processing and Communication, Burlingame, California, USA, January, 8305, SPIE, 2012, Article number 83050M. Conference Paper, 2012
A Hines, J Skoglund, A Kokaram, N Harte, Robustness of speech quality metrics to background noise and network degradations: Comparing ViSQOL, PESQ and POLQA, IEEE International Conference on Acoustics, Speech, and Signal Processing, Vancouver, Canada, 2013, pp3697 - 3701. Conference Paper, 2013
A Hines, J Skoglund, A Kokaram, N Harte, Monitoring the Effects of Temporal Clipping on VoIP Speech Quality, Interspeech, Lyon, France, 2013, 2013, pp1188 - 1192. Conference Paper, 2013
Ailbhe Cullen and Naomi Harte, Late Integration of Features for Acoustic Emotion Recognition, European Signal Processing Conference (EUSIPCO), 2013, pp1 - 5. Conference Paper, 2013
Ailbhe Cullen, John Kane, Thomas Drugman, and Naomi Harte, Creaky Voice and the Classification of Affect, Workshop on Affective Social Speech Signals (WASSS), Grenoble, France, 2013. Conference Paper, 2013
Finnian Kelly, Andrzej Drygajlo and Naomi Harte, Speaker verification in score-ageing-quality classification space, Computer Speech & Language, 27, (5), 2013, p1068-1084. Journal Article, 2013
Finnian Kelly and Naomi Harte, Auditory detectability of vocal ageing and its effect on forensic automatic speaker recognition, Interspeech, Lyon, France, 2013, pp2846 - 2850. Conference Paper, 2013
Finnian Kelly, Niko Brummer and Naomi Harte, Eigenageing Compensation for Speaker Verification, Interspeech, Lyon, France, 2013, pp1624 - 1628. Conference Paper, 2013
Sooknanan, Ken, Doyle, Jennifer, Wilson, James, Harte, Naomi, Kokaram, Anil and Corrigan, David, Mosaics For Burrow Detection in Underwater Surveillance Video, IEEE Oceans 2013, San Diego, USA, 2013, pp9 - 12. Conference Paper, 2013
Harte, Naomi, Murphy, Sadhbh, Kelly, David J. and Marples, Nicola M., Identifying new bird species from differences in birdsong, Interspeech, Lyon, France, 2013, pp2900 - 2904. Conference Paper, 2013
Kelly, Finnian and Harte, Naomi, in Michael Fairhurst (editor), Age Factors in Biometric Processing, IET, 2013. Book Chapter, 2013
Cullen, Ailbhe, Hines, Andrew and Harte, Naomi, Building a Database of Political Speech - Does culture matter in charisma annotations?, 4th International Workshop on Audio/Visual Emotion Challenge, AVEC 2014, AVEC'14: 4th International Audio/Visual Emotion Challenge and Workshop, Orlando, FL, 2014, pp27 - 31. Conference Paper, 2014
Finnian Kelly, Rahim Saeidi, Naomi Harte, David van Leeuwen, Effect of long-term ageing on i-vector speaker verification, Computer Speech & Language, Interspeech, Singapore, 2014, pp1068 - 1084. Conference Paper, 2014
Andrew Hines, Eoin Gillen, Jan Skoglund, Damien Kelly, Anil Kokaram and Naomi Harte, Perceived Audio Quality for Streaming Stereo Music, ACM Multimedia, Orlando, FL, USA, 2014, pp1173 - 1176. Conference Paper, 2014

Fine-Davis, M., Welcome Address, Mental Health and the Workplace: Challenges and Opportunities, Trinity College, Dublin, 13 March, 2000. Conference Paper
Dr. Silvia Giordani, Poster Making and Presentation, TCD, Chemistry Dept, 2007. Poster

Title

Audio-Visual Fusion for Human Computer Interaction.

Summary

This project focuses on key challenges in audio-visual speech recognition. Given state-of-the-art audio and visual features, do early or late integration strategies work better? How well does such an integration scheme translate to less controlled situations, where the speech is less constrained, intonation or prosody is more natural, or the speech is emotionally influenced? Can these algorithms work on a real handheld device?

Funding Agency

IRCSET

Date From

2011

Date To

2014
Title

Robust Speaker Verification

Summary

Biometrics involves the use of intrinsic physical or behavioural traits of humans to verify their identity. Traits used in biometrics typically include face, fingerprints, hand geometry, handwriting, iris, retina, vein, and voice. Many are concerned that these technologies are potentially invasive and open to fraud. Speaker verification, using voice or voice and video, has been recognised as an important alternative in the world of biometrics. It is less invasive and requires less expensive installations than iris and fingerprint authentication systems. The changes that occur in the human voice due to ageing have been well documented. The impact of these changes on speaker verification is less clear. In this work, we examine the effect of long-term vocal ageing on speaker verification systems.

Funding Agency

IRCSET

Date From

2009

Date To

2012
Title

Dynamic Visual Features and Improved Audio-Visual Fusion for Automatic Speech Recognition

Summary

Human speech is bimodal in nature. Incorporating visual features in Automatic Speech Recognition systems can improve performance in real environments. This work addresses core challenges in audio-visual speech recognition. It will develop new dynamic visual features that better capture the correlations in key mouth movements used by humans in lipreading, which is crucial to improving Hidden Markov Model performance. It will also explore a new audio-visual fusion strategy, motivated by the differing visibility of visemes, that allows the influence of the audio and video streams to change over time.

Funding Agency

SFI

Date From

Oct. 2009

Date To

Sept. 2013
Title

Advanced Metrics for Audio-Visual Signal Quality in Internet Communications

Summary

Funding Agency

Enterprise Ireland/Google

Date From

Sept 2013

Date To

Dec 2014
Title

Speech Quality for VoIP

Summary

This project is developing new metrics to measure speech quality for VoIP applications, particularly Google Chrome WebRTC.

Funding Agency

Google Inc

Date From

April 2011

Date To

April 2012

Artificial intelligence and machine learning, Electrical engineering

Recognition

Glen Dimplex British Council Chevening Scholarship 1995-1996
Awarded a Gold Medal for Distinction in Engineering upon graduation. 1995
Shortlisted for Provost Teaching Award 2011
Maurice F. Fitzgerald Prize - first overall in the Engineering Faculty in the Degree exams. 1995
British Telecom Research Scholarship 1997-1999
David Clark Prize - first place in the Microelectronic and Electrical Engineering Degree exams. 1995
Cognitec Best Student Paper Award for PhD Student Finnian Kelly, International Conference on Biometrics (ICB) 2012
Google Faculty Award 2018
AI Awards (Shortlisted in Best Application of AI in an Academic Research Body) 2019
Fellow of Trinity College Dublin 2017
IEE Leslie H. Paddle Scholarship 1995-1998

ISCA (International Speech Communication Association)
IEEE Signal Processing Society
IEEE Women in Engineering
IEEE (Institute of Electrical and Electronics Engineers)

Senior Technical Program Committee for ACM ICMI 2019
PhD External Examiner University of Cambridge
PhD External Examiner, Athlone Institute of Technology
Irish representative to the EU COST Action 2101 entitled "Biometrics for Identity Documents and Smart cards"
Regular Session Chair at Interspeech ongoing
International Expert Reviewer for Swiss National Science Foundation (SNSF)
PhD External Examiner, University of York
ICT Evaluator for FP6 ICT Call FP6-2004-SME-COOP in Co-operative research (Research involving SMEs, Universities and research organisations). Acted as Group Rapporteur.
PhD External Examiner, Victoria University, New Zealand
Irish representative to the EU COST Action IC1006 Integrating Biometrics and Forensics for the Digital Age
Expert Evaluator for FP7 Call FP7-REGIONS-2012-2013-1 in Transnational cooperation between regional research-driven clusters
Peer reviewing for top conferences and journals, e.g.: IEEE ICASSP, Interspeech, ACM ICMI, EUSIPCO, IEEE ASRU, IEEE ICIP, ACL, Speech Communication, JASA, IEEE Trans Multimedia ongoing
TCD Representative to MIDAS (MicroElectronics Design Association of Ireland)
PhD External Examiner, University of East Anglia