Professor Naomi Harte

Professor in Speech Technology, Electronic & Elect. Engineering

+353 1 896 1861, www.sigmedia.tv

Biography

Naomi is Professor in Speech Technology in the School of Engineering at Trinity College. She is Co-PI and a founding member of the ADAPT SFI Centre. In ADAPT, she has led a major Research Theme centred on Multimodal Interaction, involving researchers from universities across Ireland, and was instrumental in developing the future vision for the Centre for 2021-2026. She is also a lead academic of the Sigmedia Research Group in the School of Engineering. She was appointed as an SFI Engineering Initiative Lecturer in Digital Media in TCD in 2008 (Stokes Programme). Prior to returning to academia, Naomi worked in high-tech start-ups in the field of DSP Systems Development, including her own company. She also previously worked at McMaster University in Canada. She was a Visiting Professor at ICSI in 2015, and became a Fellow of TCD in 2017. She earned a Google Faculty Award in 2018 and was shortlisted for the AI Ireland Awards in 2019. She currently serves on the Editorial Board of Computer Speech and Language and will chair Interspeech 2023 in Dublin.

Naomi's research centres on Human Speech Communication. She considers speech as something we both hear and see, and her work has a strong multimodal aspect. Her research involves the design and application of mathematical algorithms to enhance or augment speech communication between humans and technology. Much of that work is underpinned by signal processing and machine learning, but it also requires an understanding of how humans interact. Her current research projects include audio-visual speech recognition, speech synthesis evaluation, multimodal speech analysis, and birdsong. Her industrial background brings a real-world approach to her research.

Publications and Further Research Outputs

  • Pitie, F., Kelly, D., Foucu, T., Harte, N., Kokaram, A., Assessment of Audio/Video synchronisation in streaming media, 2014 6th International Workshop on Quality of Multimedia Experience, QoMEX 2014, 2014, pp171-176Conference Paper, 2014, DOI
  • Hines, A., Skoglund, J., Kokaram, A., Harte, N., Robustness of speech quality metrics to background noise and network degradations: Comparing ViSQOL, PESQ and POLQA, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2013, pp3697-3701Conference Paper, 2013, DOI
  • Francois Pitie and Damien Kelly and Thierry Foucu and Naomi Harte and Anil C. Kokaram , Assessment of Audio/Video synchronisation in streaming media., International Workshop on Quality of Multimedia Experience, Singapore, 2014, pp171 - 176Conference Paper, 2014
  • Hines A, Skoglund J, Kokaram A.C, Harte N, ViSQOL: an objective speech quality model, Eurasip Journal on Audio, Speech, and Music Processing, 2015, (1), 2015, p13-Journal Article, 2015, DOI , URL , TARA - Full Text
  • Hines A, Gillen E, Kelly D, Skoglund J, Kokaram A, Harte N, ViSQOLAudio: An objective audio quality metric for low bitrate codecs, Journal of the Acoustical Society of America, 137, (6), 2015, pEL449 - EL455Journal Article, 2015, DOI , URL , TARA - Full Text
  • Harte N, Gillen E, TCD-TIMIT: An audio-visual corpus of continuous speech, IEEE Transactions on Multimedia, 17, (5), 2015, p603 - 615Journal Article, 2015, DOI , URL
  • Harte N, Gillen E, Hines A, TCD-VoIP, a research database of degraded speech for assessing quality in VoIP applications, 7th International Workshop on Quality of Multimedia Experience, QoMEX 2015, 26-29 May 2015 , IEEE, 2015, 7148100-Conference Paper, 2015, DOI
  • C. O'Reilly, D. J. Kelly, N. M. Marples and N. Harte , Quantifying difference in vocalizations of bird populations, Proceedings of Interspeech 2015, 2015, 2015, p3417 - 3421Journal Article, 2015
  • Sloan C, Harte N, Kelly D, Kokaram A.C, Hines A, Bitrate classification of twice-encoded audio using objective quality features, 2016 8th International Conference on Quality of Multimedia Experience, QoMEX 2016, 2016, 2016, pp7498956-Conference Paper, 2016, DOI , URL
  • Hines A, Skoglund J, Kokaram A.C, Harte N, Monitoring voip speech quality for chopped and clipped speech, Komunikacie, 18, (1), 2016, p3 - 10Journal Article, 2016, URL
  • O'Reilly C, Marples N.M, Kelly D.J, Harte N, YIN-bird: Improved pitch tracking for bird vocalisations, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2016, 08-12-September-2016, 2016, pp2641 - 2645Conference Paper, 2016, DOI , URL
  • Kelly F, Harte N, Forensic comparison of ageing voices from automatic and auditory perspectives, International Journal of Speech, Language and the Law, 22, (2), 2015, p167 - 202Journal Article, 2015, DOI , URL
  • Hines A, Gillen E, Harte N, Measuring and monitoring speech quality for voice over IP with POLQA, ViSQOL and P.563, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2015, 2015-January, 2015, pp438 - 442Conference Paper, 2015, URL
  • Sloan, C. and Harte, N. and Kelly, D. and Kokaram, A.C. and Hines, A., Objective Assessment of Perceptual Audio Quality Using ViSQOLAudio, IEEE Transactions on Broadcasting, 63, (4), 2017, p693-705Journal Article, 2017, DOI , URL , TARA - Full Text
  • Roddy, M. and Harte, N., Detecting conversational gaze aversion using unsupervised learning, 2017-January, (8081172), 2017, pp76-80Conference Paper, 2017, DOI , URL
  • O'Reilly, C. and Kokuer, M. and Jancovic, P. and Drennan, R. and Harte, N., Automatic frequency feature extraction for bird species delimitation, 2017-January, (8081511), 2017, pp1759-1763Conference Paper, 2017, DOI , URL
  • Cullen, A. and Harte, N., A longitudinal database of Irish political speech with annotations of speaker ability, Language Resources and Evaluation, 52, (2), 2018, p401-432Journal Article, 2018, DOI , URL
  • Jassim, W.A. and Paramesran, R. and Harte, N., Speech emotion classification using combined neurogram and INTERSPEECH 2010 paralinguistic challenge features, IET Signal Processing, 11, (5), 2017, p587-595Journal Article, 2017, DOI , URL
  • Wissam A. Jassim and Naomi Harte, Voice Activity Detection Using Neurograms, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, Alberta, Canada, 15-20 April 2018, 2018Conference Paper, 2018
  • Laura Dungan, Ali Karaali, Naomi Harte, The Impact Of Reduced Video Quality On Visual Speech Recognition, IEEE International Conference on Image Processing, Athens, Greece, 2018Conference Paper, 2018, DOI
  • Cullen, A. and Hines, A. and Harte, N., Perception and prediction of speaker appeal - A single speaker study, Computer Speech and Language, 52, 2018, p23-40Journal Article, 2018, DOI , URL
  • O'Reilly, C. and Analuddin, K. and Kelly, D.J. and Harte, N., Measuring vocal difference in bird population pairs, Journal of the Acoustical Society of America, 143, (3), 2018, p1658-1671Journal Article, 2018, DOI , URL
  • Clark, L. and Cowan, B.R. and Edwards, J. and Edlund, J. and Szekely, E. and Munteanu, C. and Murad, C. and Healey, P. and Aylett, M. and Harte, N. and Torre, I. and Moore, R.K. and Doyle, P., Mapping theoretical and methodological perspectives for understanding speech interface interactions, CHI EA '19 Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems , (3299009), 2019Conference Paper, 2019, DOI , URL , TARA - Full Text
  • Roddy, M. and Skantze, G. and Harte, N., Multimodal continuous turn-taking prediction using multiscale Rnns, ICMI '18 Proceedings of the 20th ACM International Conference on Multimodal Interaction, 2018, pp186-190Conference Paper, 2018, DOI , URL , TARA - Full Text
  • Sterpu, G. and Saam, C. and Harte, N., Attention-based audio-visual fusion for robust automatic speech recognition, ICMI '18 Proceedings of the 20th ACM International Conference on Multimodal Interaction , 20th ACM International Conference on Multimodal Interaction , 2018, pp111-115Conference Paper, 2018, DOI , URL , TARA - Full Text
  • Ilaria Torre, Emma Carrigan, Killian McCabe, Rachel McDonnell, Naomi Harte, Survival at the museum: A cooperation experiment with emotionally expressive virtual characters, ICMI '18 Proceedings of the 20th ACM International Conference on Multimodal Interaction , 2018, pp423-427Conference Paper, 2018, DOI , URL , TARA - Full Text
  • Sterpu, G. and Saam, C. and Harte, N., Can DNNs Learn to Lipread Full Sentences?, 2018 25th IEEE International Conference on Image Processing (ICIP), (8451388), 2018, pp16-20Conference Paper, 2018, DOI , URL
  • Dungan, L. and Karaali, A. and Harte, N., The impact of reduced video quality on visual speech recognition, 2018 25th IEEE International Conference on Image Processing (ICIP), (8451754), 2018, pp2560-2564Conference Paper, 2018, DOI , URL
  • Roddy, M. and Skantze, G. and Harte, N., Investigating speech features for continuous turn-taking prediction using LSTMs, Proc. Interspeech 2018, Interspeech 2018, 2018-September, 2018, pp586-590Conference Paper, 2018, DOI , URL
  • D Lennon and N Harte and A Kokaram and E Doyle and ..., A hmm framework for motion based parsing for video from observational psychology, IEEE Irish Machine Vision …, 2006Journal Article, 2006, URL
  • F Kelly and N Harte, A comparison of auditory features for robust speech recognition, presentation, 18th European Signal Processing …, 2010Journal Article, 2010
  • AJ Hines and J Skoglund and N Harte and A Kokaram, Detection of chopped speech, US Patent 9,263,061, 2016Journal Article, 2016, URL
  • A Cullen and N Harte, Thin slicing to predict viewer impressions of TED Talks, … of the 14th International Conference on …, 2017Journal Article, 2017, URL
  • D Lennon and N Harte and A Kokaram, Rotation detection using the curl equation, 2007 IEEE International …, 2007Journal Article, 2007, URL
  • G Sterpu and N Harte, Towards Lipreading Sentences with Active Appearance Models, arXiv preprint arXiv:1805.11688, 2018Journal Article, 2018, URL
  • C Berry and A Kokaram and N Harte, An extended multiresolution approach to mouth specific aam fitting for speech recognition, 2011 19th European Signal …, 2011Journal Article, 2011, URL
  • C O'Reilly and N Harte, Pitch tracking of bird vocalizations and an automated process using YIN-bird, Cogent Biology, 2017Journal Article, 2017, URL
  • F Kelly and N Harte and M Fairhurst, The impact of ageing on speech-based biometric systems, Age Factors in Biometric Processing, 2013Journal Article, 2013
  • K Pan and F Kelly and N Harte and S Murphy and DJ Kelly and ..., Shape Models for Image Segmentation in Microscopy, mee.tcd.ie, 2013Book, 2013, URL
  • NA Harte, Segmental phonetic features and models for speech recognition., ethos.bl.uk, 1999Book, 1999, URL
  • A Hines and N Harte, Reproduction of the Performance/Intensity Function using image processing and an auditory nerve computational model, 2010Conference Paper, 2010, URL
  • L Cappelletta and N Harte, Non Phonetic Viseme Definition for Visual-Only Speech Recognition, 2012, -Miscellaneous, 2012
  • N Harte and P Jancovic and Karl-L. Schuchmann, Interspeech 2016 Special Session on Bird and Animal Vocalisations Organisers, In:Interspeech 2016, 2016Meetings /Conferences Organised, 2016, URL
  • M Roddy and N Harte, Towards predicting dialog acts from previous speakers' non-verbal cues, BIBTEX 2017, 2017, pp1--Conference Paper, 2017, URL , TARA - Full Text
  • N Hurley and N Harte and C Fearon and S Rickard, Speech Source Separation in Hardware, 2009, -Miscellaneous, 2009
  • N. Harte ; S. Vaseghi ; B. Milner , Dynamic features for segmental speech recognition, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, 1996, p933--Journal Article, 1996, URL
  • S Vaseghi and N Harte and B Milner, MULTI-RESOLUTION PHONETIC/SEGMENTAL FEATURES AND MODELS FOR HMM-BASED SPEECH RECOGNITION, 1997 IEEE International Conference …, 1997Journal Article, 1997
  • Sterpu G., Saam C., Harte N., How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition, IEEE/ACM Transactions on Audio Speech and Language Processing, 28, 2020, p1052 - 1064Journal Article, 2020, DOI
  • Roddy, Matthew and Harte, Naomi, Neural Generation of Dialogue Response Timings, Annual Conference of the Association for Computational Linguistics (ACL), 2020, pp2442-2452Conference Paper, 2020, URL
  • Fernandez-Lopez, Adriana and Karaali, Ali and Harte, Naomi and Sukno, Federico M, ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020, pp6294--6298Conference Paper, 2020, URL
  • Motion, Interaction and Games in, Motion, Interaction and Games, 2019, pp1--6 , [Torre, Ilaria and Carrigan, Emma and McDonnell, Rachel and Domijan, Katarina and McCabe, Killian and Harte, Naomi]Book Chapter, 2019, URL
  • Jassim, Wissam A and Harte, Naomi, Estimation of a priori signal-to-noise ratio using neurograms for speech enhancement, The Journal of the Acoustical Society of America, 147, (6), 2020, p3830--3848Journal Article, 2020
  • Andrew J. HINES Jan Skoglund Naomi HARTE Anil Kokaram, 'Detection of chopped speech', US, 2016, Google LLCPatent, 2016, URL
  • Jan Skoglund Andrew J. HINES Naomi A. HARTE Anil Kokaram, 'Objective speech quality metric', US, US20150199959A1, 2016, Google LLCPatent, 2016, URL
  • Le Maguer, Sebastien and Harte, Naomi, Investigation of Auditory Nerve Model Based Analysis for Vocoded Speech Synthesis, 2020, pp1--6Conference Paper, 2020, DOI
  • Ilaria Torre, Emma Carrigan, Rachel McDonnell, Katarina Domijan, Killian McCabe, Naomi Harte, The effect of multimodal emotional expression and agent appearance on trust in human-agent interaction, Proceedings - MIG 2019: ACM Conference on Motion, Interaction, and Games, ACM Conference on Motion, Interaction, and Games, 2019, 2019Conference Paper, 2019, DOI , URL , TARA - Full Text
  • A Karaali and N Harte and CR Jung, Deep Multi-Scale Feature Learning for Defocus Blur Estimation, IEEE Transactions on Image Processing, 2022Journal Article, 2022, DOI , URL
  • Ilaria Torre, Emma Carrigan, Katarina Domijan, Rachel McDonnell, Naomi Harte, The Effect of Audio-Visual Smiles on Social Influence in a Cooperative Human-Agent Interaction Task, ACM Transactions on Computer-Human Interaction (TOCHI), 28, (6), 2021, p1-38Journal Article, 2021, DOI , URL
  • M Anderson and N Harte, Bioacoustic Event Detection with prototypical networks and data augmentation, 2021Report, 2021, URL
  • G Sterpu and C Saam and N Harte, Learning to count words in fluent speech enables online speech recognition, 2021 IEEE Spoken Language Technology Workshop (SLT), 2021, pp38-45Conference Paper, 2021, URL
  • Torre, Ilaria and Carrigan, Emma and Domijan, Katarina and McDonnell, Rachel and Harte, Naomi, Dimensional perception of a 'smiling McGurk effect', 9th International Conference on Affective Computing and Intelligent Interaction (ACII), 2021, pp1-8Conference Paper, 2021, URL
  • Mark Anderson, John Kennedy, Naomi Harte, Low Resource Species Agnostic Bird Activity Detection, 2021 IEEE Workshop on Signal Processing Systems (SiPS), 2021, pp34-39Conference Paper, 2021, URL
  • Ayushi Pandey, Sébastien Le Maguer, Julie Berndsen, Naomi Harte, Mind your p's and k's--Comparing obstruents across TTS voices of the Blizzard Challenge 2013, Proc. 11th ISCA Speech Synthesis Workshop (SSW 11), 2021, pp166-171Conference Paper, 2021
  • Ilaria Torre, Simon Holk, Elmira Yadollahi, Iolanda Leite, Rachel McDonnell, Naomi Harte, Smiling in the Face and Voice of Avatars and Robots: Evidence for a smiling McGurk Effect, IEEE Transactions on Affective Computing, 2022, p1-12Journal Article, 2022, DOI
  • Torre, I. and Deichler, A. and Nicholson, M. and McDonnell, R. and Harte, N., To smile or not to smile: The effect of mismatched emotional expressions in a Human-Robot cooperative task, 2022, pp8-13Conference Paper, 2022, DOI , URL
  • Sébastien Le Maguer, Simon King, Naomi Harte, The limits of the Mean Opinion Score for speech synthesis evaluation, Computer Speech and Language, 84, 2024Journal Article, 2024
  • Kotey, S., Dahyot, R., Harte, N., Fine Grained Spoken Document Summarization Through Text Segmentation, 2022 IEEE Spoken Language Technology Workshop, SLT 2022 - Proceedings, 2023, p647-654Conference Paper, 2023, DOI
  • Gonzales, M.G., Corcoran, P., Harte, N., Schukat, M., Joint Speech-Text Embeddings with Disentangled Speaker Features, 2023 34th Irish Signals and Systems Conference, ISSC 2023, 2023Conference Paper, 2023, DOI
  • Anderson, M., Kinnunen, T., Harte, N., Learnable Frontends That Do Not Learn: Quantifying Sensitivity To Filterbank Initialisation, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2023-June, 2023Conference Paper, 2023, DOI
  • Pandey, A., Edlund, J., Le Maguer, S., Harte, N., Listener sensitivity to deviating obstruents in WaveNet, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2023-August, 2023, p1080-1084Conference Paper, 2023, DOI
  • Pandey, A., Le Maguer, S., Carson-Berndsen, J., Harte, N., Production characteristics of obstruents in WaveNET and older TTS systems, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2022-September, 2022, p2373-2377Conference Paper, 2022, DOI
  • Anderson, M., Harte, N., Learnable Acoustic Frontends in Bird Activity Detection, International Workshop on Acoustic Signal Enhancement, IWAENC 2022 - Proceedings, 2022Conference Paper, 2022, DOI
  • Reverdy, J., O'Connor Russell, S., Duquenne, L., Garaialde, D., Cowan, B., Harte, N., RoomReader: A Multimodal Corpus of Online Multiparty Conversational Interactions, 2022 Language Resources and Evaluation Conference, LREC 2022, 2022, p2517-2527Conference Paper, 2022
  • Sterpu, G., Harte, N., Taris: An online speech recognition framework with sequence to sequence neural networks for both audio-only and audio-visual speech, Computer Speech and Language, 74, 2022Journal Article, 2022, DOI
  • Le Maguer, S., King, S., Harte, N., Back to the Future: Extending the Blizzard Challenge 2013, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2022-September, 2022, p2378-2382Conference Paper, 2022, DOI
  • Jassim, W.A., Harte, N., Comparison of discrete transforms for deep-neural-networks-based speech enhancement, IET Signal Processing, 16, (4), 2022, p438-448Journal Article, 2022, DOI
  • Le Maguer, S., Anderson, M., Harte, N., Sp1NY: A Quick and Flexible Speech visualisation Tool in Python, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2023-August, 2023, p2012-2013Conference Paper, 2023
  • Kotey, S., Dahyot, R., Harte, N., Query Based Acoustic Summarization for Podcasts, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2023-August, 2023, p1483-1487Conference Paper, 2023, DOI
  • Edmonds, C.J., Harte, N., Gardner, M., How does drinking water affect attention and memory? The effect of mouth rinsing and mouth drying on children's performance, Physiology and Behavior, 194, 2018, p233-238Journal Article, 2018, DOI
  • Harte, N., Lennon, D., Kokaram, A., On parsing visual sequences with the hidden markov model, Eurasip Journal on Image and Video Processing, 2009, 2009Journal Article, 2009, DOI
  • McCourt, P., Harte, N., Vaseghi, S., COMBINED TEMPORAL AND SPECTRAL MULTI-RESOLUTION PHONETIC MODELLING, 6th European Conference on Speech Communication and Technology, EUROSPEECH 1999, 1999, p1111-1114Conference Paper, 1999
  • Harte, N., Vaseghi, S., McCourt, P., A novel model for phoneme recognition using phonetically derived features, European Signal Processing Conference, 1998-January, 1998Conference Paper, 1998
  • Harte, N., Vaseghi, S., Milner, B., JOINT RECOGNITION AND SEGMENTATION USING PHONETICALLY DERIVED FEATURES AND A HYBRID PHONEME MODEL, 5th International Conference on Spoken Language Processing, ICSLP 1998, 1998Conference Paper, 1998
  • Kelly, F., Drygajlo, A., Harte, N., Compensating for ageing and quality variation in speaker verification, 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012, 1, 2012, p498-501Conference Paper, 2012
  • Le Maguer, S., Harte, N., Can auditory nerve models tell us what's different about wavenet vocoded speech?, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2020-October, 2020, p230-234Conference Paper, 2020, DOI
  • Sterpu, G., Saam, C., Harte, N., Should we hard-code the recurrence concept or learn it instead ? Exploring the Transformer architecture for Audio-Visual Speech Recognition, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2020-October, 2020, p3506-3509Conference Paper, 2020, DOI
  • Cappelletta, L., Harte, N., Phoneme-to-viseme mapping for visual speech recognition, ICPRAM 2012 - Proceedings of the 1st International Conference on Pattern Recognition Applications and Methods, 2, 2012, p322-329Conference Paper, 2012
  • Hines, A., Skoglund, J., Kokaram, A., Harte, N., VISQOL: The virtual speech quality objective listener, International Workshop on Acoustic Signal Enhancement, IWAENC 2012, 2012Conference Paper, 2012
  • Corrigan, David; Harte, Naomi; Kokaram, Anil;, Automated Segmentation of Torn Frames using the Graph Cuts Technique, Image Processing, IEEE International Conference on Image Processing, 2007. ICIP 2007., San Antonio, TX, USA , 2007, (Sept. 16-Oct. 19), 2007, pp557-560Conference Paper, 2007, DOI , URL , TARA - Full Text
  • Harte, Naomi; Rankin, Andrew; Baugh, Gary; Kokaram, Anil;, Detection of Illegal Dumping from CCTV at Recycling Centres, International Machine Vision and Image Processing, International Machine Vision and Image Processing Conference, Kildare, Ireland , 2007, (5-7 Sept. ), 2007, pp204Conference Paper, 2007, URL , TARA - Full Text
  • Corrigan, D. Harte, N. and Kokaram, A. , Pathological motion detection for robust missing data treatment in degraded archived media, Image Processing, IEEE International Conference on Image Processing 2006, Atlanta, GA , 8-11 Oct. 2006 , 2006, pp621 - 624Conference Paper, 2006, DOI , URL , TARA - Full Text
  • David Corrigan, Naomi Harte, Anil Kokaram, Pathological Motion Detection for Robust Missing Data Treatment, EURASIP Journal on Advances in Signal Processing, 2008, 2008, pArticle ID 542436Journal Article, 2008, DOI , TARA - Full Text
  • Action Recognition in Multimedia Streams in, editor(s)Petros Maragos, Alexandros Potamianos, Patrick Gros , Multimodal Processing and Interaction, Springer Verlag. , 2008, pp127 - 142, [Daire Lennon, Naomi Harte, and Anil Kokaram, Rozenn Dahyot, Francois Pitie]Book Chapter, 2008
  • Naomi Harte and Anil Kokaram, Automated Removal of Overshoot Artefact from Images, EUSIPCO , European Signal Processing Conference , 2006Conference Paper, 2006, URL
  • Daire Lennon, Naomi Harte, Anil Kokaram, Erika Doyle, Ray Fuller, A HMM Framework for Motion based parsing for video from Observational Psychology, IEEE Irish Machine Vision and Image Processing Conference, Irish Machine Vision and Image Processing Conference , 2006Conference Paper, 2006, URL , TARA - Full Text
  • Naomi Harte, Shahab U. Ansari, Ian Bruce, Exploiting Voicing Cues for Contrast Enhanced Frequency Shaping of Speech for Impaired Listeners, Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE International Conference on Acoustics, Speech, and Signal Processing, Toulouse, 14-19 May 2006 , 5, IEEE, 2006, ppVConference Paper, 2006, DOI , URL , TARA - Full Text
  • Ansari, S., Harte, N., and Bruce, I., Efficiently combining improved contrast-enhancing frequency shaping and multiband compression to enhance speech intelligibility in hearing aids, Lake Ontario Auditory Neuroscience (LOAN) Meeting, Hamilton, Canada, 2005Conference Paper, 2005
  • Naomi Harte, Niall Hurley, Conor Fearon, Scott Rickard., Towards a Hardware Realization of Time-Frequency Source Separation of Speech, Proceedings of IEEE European Conference on Circuit Theory and Design, IEEE European Conference on Circuit Theory and Design, 28 Aug -2 Sept. 2005, IEEE, 2005Conference Paper, 2005, DOI , URL , TARA - Full Text
  • Niall Hurley, Naomi Harte, Conor Fearon, Scott Rickard, Blind Source Separation of Speech in Hardware, Workshop on Signal Processing Systems, Nov 2005, IEEE, 2005, pp442-445Conference Paper, 2005, DOI , URL , TARA - Full Text
  • N.Harte, S. Bates, B. Murray, The IntelliRate Oversampling Architecture for a Gigabit Ethernet Transceiver, Proceedings of Irish Signals and Systems Conference , Irish Signals and Systems Conference , 2002Conference Paper, 2002
  • McCourt, P., Harte, N., Vaseghi, S., Discriminative Multi-Resolution Sub-Band and Segmental Phonetic Model Combination, Electronics Letters, 36, (3), 2000, p270-271Journal Article, 2000, DOI , URL , TARA - Full Text
  • Paul McCourt, Naomi Harte, Saeed Vaseghi, Combined Temporal and Spectral Multi-Resolution Phonetic Modelling, Proc. Eurospeech, Eurospeech, Budapest, Hungary, September 5-9, 1999, 1999, pp1111-1114Conference Paper, 1999, URL
  • P.McCourt, S.Vaseghi, N.Harte, Multi-Resolution Cepstral Features for Phoneme Recognition Across Speech Sub-Bands, Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing, International Conference on Acoustics, Speech, and Signal Processing, Seattle, USA, 12-15 May 1998, 1, IEEE, 1998, pp557-560Conference Paper, 1998, DOI , URL , TARA - Full Text
  • P.Hanna, N.Harte, J. Ming, S.Vaseghi, F.J.Smith, Variation of features of interframe dependent HMM for speech recognition, IEE Electronic Letters, Apr., 1998, p858-859Journal Article, 1998, URL , TARA - Full Text
  • N.Harte, S.Vaseghi, P.McCourt, A Novel Model for Phoneme Recognition using Phonetically Derived Features, Proceedings of European Signal Processing Conference (EUSIPCO), European Signal Processing Conference (EUSIPCO), 1998, pp1485 - 1488Conference Paper, 1998
  • N.Harte, S.Vaseghi, B.Milner, Joint Recognition and Segmentation using Phonetically Derived Features and a Hybrid Phoneme Model, Proc International Conference on Spoken Language Processing, Proc International 5th International Conference on Spoken Language Processing, Sydney, Australia, Nov 30 - Dec 4, 1998Conference Paper, 1998, URL
  • S.Vaseghi, N.Harte, B.Milner, Multi-Resolution Phonetic/Segmental Features and Models for HMM-Based Speech Recognition, Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE International Conference on Acoustics, Speech, and Signal Processing, 2, 1997, pp1263Conference Paper, 1997, URL
  • N.Harte, S.Vaseghi, B.Milner, Dynamic Features for Segmental Speech Recognition, Proc International Conference on Spoken Language Processing, International Conference on Spoken Language Processing, Philadelphia, 3-6 Oct 1996, 1996, pp933-936Conference Paper, 1996, DOI , URL , TARA - Full Text
  • McMahon, P.; Harte, N.; Vaseghi, S.; McCourt, P, Discriminative spectral-temporal multiresolution features for speech recognition, IEEE International Conference on Acoustics, Speech, and Signal Processing, Mar 1999, vol.2, 1999, pp581-584Conference Paper, 1999
  • Andrew Hines, Naomi Harte, Error Metrics for Impaired Auditory Nerve Responses of Different Phoneme Groups, Interspeech 2009, Brighton, 2009, 2009, pp1119 - 1122Conference Paper, 2009, TARA - Full Text
  • Craig Berry, Naomi Harte, Region of Interest Extraction using Colour Based Methods on the CUAVE Database , IET Irish Signals and Systems Conference ISSC, Dublin, 10-12 June , 2009Conference Paper, 2009, TARA - Full Text
  • Andrew Hines, Naomi Harte , Measurement of phonemic degradation in sensorineural hearing loss using a computational model of the auditory periphery , IET Irish Signals and Systems Conference ISSC 2009, UCD, June 10-11, 2009, pp1-6Conference Paper, 2009, URL , TARA - Full Text
  • Naomi Harte, Daire Lennon, and Anil Kokaram, On Parsing Visual Sequences with the Hidden Markov Model, EURASIP Journal on Image and Video Processing , Volume 2009, 2009Journal Article, 2009, DOI , TARA - Full Text
  • Andrew Hines and Naomi Harte, Speech intelligibility from image processing, Speech Communication, 52, (9), 2010, p736 - 752Journal Article, 2010
  • Finnian Kelly and Naomi Harte, Training GMMs for Speaker Verification. , IET Irish Signals and Systems Conference, Cork, Ireland, June 2010, 2010, pp163 - 168Conference Paper, 2010
  • Finnian Kelly and Naomi Harte, A Comparison of Auditory Features for Robust Speech Recognition. , European Signal Processing Conference (EUSIPCO 2010). , Aalborg, Denmark, August 2010, 2010Conference Paper, 2010, DOI
  • Finnian Kelly and Naomi Harte, Auditory Features Revisited for Robust Speech Recognition. , International Conference on Pattern Recognition (ICPR). , Istanbul, Turkey, Aug 2010, 2010, pp4456 - 4459Conference Paper, 2010
  • Andrew Hines and Naomi Harte, Evaluating Sensorineural Hearing Loss With An Auditory Nerve Model Using A Mean Structural Similarity Measure. , European Signal Processing Conference (EUSIPCO '10). , Aalborg, Denmark, 2010Conference Paper, 2010, TARA - Full Text
  • Luca Cappelletta and Naomi Harte, Nostril Detection for Robust Mouth Tracking, Irish Signals and Systems Conference, Cork, Ireland, 2010, pp239 - 244Conference Paper, 2010
  • Andrew Hines, Naomi Harte, Speech Intelligibility prediction using a Neurogram Similarity Index Measure, Speech Communication, 54, (2), 2012, p306-320Journal Article, 2012, DOI , URL , TARA - Full Text
  • Finnian Kelly, Naomi Harte, Effects of Long-Term Ageing on Speaker Verification, Proceedings of the COST 2101 European conference on Biometrics and ID management, Springer-Verlag, 2011, pp113--124Conference Paper, 2011
  • Luca Cappelletta and Naomi Harte, Viseme Definitions Comparison for Visual-Only Speech Recognition, European Signal Processing Conference (Eusipco), 2011, pp2109 - 2113Conference Paper, 2011
  • Craig Berry, Anil Kokaram, Naomi Harte, An Extended Multiresolution Approach to Mouth Specific AAM Fitting for Speech Recognition. , European Signal Processing Conference (Eusipco), 2011Conference Paper, 2011, DOI
  • Andrew Hines and Naomi Harte , Simulated performance intensity functions , Engineering in Medicine and Biology Society Conference (EMBC), EMBS (IEEE). , 2011, pp7139 - 7142Conference Paper, 2011
  • Andrew Hines and Naomi Harte, Comparing hearing aid algorithm performance using Simulated Performance Intensity Functions , Speech perception and auditory disorders, Int. Symposium on Audiological and Auditory Research (ISAAR), 2011Conference Paper, 2011
  • Corrigan, D. ; Kokaram, A. ; Harte, N. , Algorithms for the Digital Restoration of Torn Film , Image Processing, IEEE Transactions on, 21, (2), 2012, p573-587Journal Article, 2012, DOI
  • F. Kelly , A. Drygajlo and N. Harte, Speaker Verification with Long-Term Ageing Data , International Conference on Biometrics (ICB), New Delhi, 2012, pp478 - 483Conference Paper, 2012
  • Andrew Hines, Naomi Harte, Improved Speech Intelligibility with a Chimaera Hearing Aid Algorithm, Interspeech, Portland, OR, ISCA, 2012, pp1 - 4Conference Paper, 2012
  • Andrew Hines, Jan Skoglund, Anil Kokaram, Naomi Harte, ViSQOL: The Virtual Speech Quality Objective Listener, The International Workshop on Acoustic Signal Enhancement (IWAENC), Aachen, Germany, 4-6 Sept. 2012, 2012, pp1 - 4Conference Paper, 2012, TARA - Full Text
  • K. Sooknanan, A. Kokaram, D. Corrigan, G. Baugh, N. Harte and J. Wilson, Indexing and Selection of Well-Lit Details in Underwater Video Mosaics Using Vignetting Estimation, Program Book - OCEANS 2012 MTS/IEEE Yeosu: The Living Ocean and Coast - Diversity of Resources and Sustainable Activities, International OCEANS Conference, Yeosu, South Korea, May, IEEE, 2012, ppArticle number 6263541Conference Paper, 2012, DOI
  • K. Sooknanan, A. Kokaram, D. Corrigan, G. Baugh, J. Wilson and N. Harte , Improving Underwater Visibility Using Vignetting Correction, Proceedings of SPIE - The International Society for Optical Engineering, Visual Information Processing and Communication, Burlingame, California, USA, January, 8305, SPIE, 2012, ppArticle number 83050MConference Paper, 2012, DOI
  • A Hines, J Skoglund, A Kokaram, N Harte, Robustness of speech quality metrics to background noise and network degradations: Comparing ViSQOL, PESQ and POLQA, IEEE International Conference on Acoustics, Speech, and Signal Processing, Vancouver, Canada, 2013, pp3697 - 3701Conference Paper, 2013
  • A Hines, J Skoglund, A Kokaram, N Harte, Monitoring the Effects of Temporal Clipping on VoIP Speech Quality, Interspeech, Lyon, France, 2013, pp1188 - 1192Conference Paper, 2013
  • Ailbhe Cullen and Naomi Harte, Late Integration of Features for Acoustic Emotion Recognition, European Signal Processing Conference (EUSIPCO)., 2013, pp1 - 5Conference Paper, 2013
  • Ailbhe Cullen, John Kane, Thomas Drugman, and Naomi Harte , Creaky Voice and the Classification of Affect, Workshop on Affective Social Speech Signals (WASSS), Grenoble, France, 2013Conference Paper, 2013, DOI
  • Finnian Kelly, Andrzej Drygajlo and Naomi Harte , Speaker verification in score-ageing-quality classification space, Computer Speech & Language, 27, (5), 2013, p1068-1084Journal Article, 2013
  • Finnian Kelly and Naomi Harte, Auditory detectability of vocal ageing and its effect on forensic automatic speaker recognition, InterSpeech, Lyon, France, 2013, pp2846 - 2850Conference Paper, 2013
  • Finnian Kelly, Niko Brummer and Naomi Harte, Eigenageing Compensation for Speaker Verification. , InterSpeech , Lyon, France, 2013, pp1624 - 1628Conference Paper, 2013
  • Sooknanan, Ken, Doyle, Jennifer, Wilson, James, Harte, Naomi, Kokaram, Anil and Corrigan, David, Mosaics For Burrow Detection in Underwater Surveillance Video, IEEE Oceans 2013, San Diego, USA, 2013, pp9 - 12Conference Paper, 2013
  • Harte, Naomi, Murphy, Sadhbh, Kelly, David J. and Marples, Nicola M., Identifying new bird species from differences in birdsong. , INTERSPEECH, Lyon France., 2013, pp2900-2904Conference Paper, 2013
  • Kelly, Finnian and Harte, Naomi, in Michael Fairhurst (editor), Age Factors in Biometric Processing, IET, 2013, Book Chapter, 2013
  • Cullen, Ailbhe, Hines, Andrew and Harte, Naomi, Building a Database of Political Speech - Does culture matter in charisma annotations?, AVEC'14: 4th International Audio/Visual Emotion Challenge and Workshop, Orlando, FL, 2014, pp27 - 31, Conference Paper, 2014, DOI
  • Finnian Kelly, Rahim Saeidi, Naomi Harte, David van Leeuwen, Effect of long-term ageing on i-vector speaker verification, Computer Speech & Language, InterSpeech, Singapore, 2014, pp1068 - 1084Conference Paper, 2014
  • Andrew Hines, Eoin Gillen, Jan Skoglund, Damien Kelly, Anil Kokaram and Naomi Harte, Perceived Audio Quality for Streaming Stereo Music. , ACM Multimedia, Orlando, FL, USA, 2014, pp1173 - 1176Conference Paper, 2014, DOI , TARA - Full Text
  • Dr. Silvia Giordani, Poster Making and Presentation, TCD, Chemistry Dept, 2007Poster

Research Expertise

  • Title
    Robust Speaker Verification
    Summary
    Biometrics involves the use of intrinsic physical or behavioural traits of humans to verify their identity. Traits used in biometrics typically include face, fingerprints, hand geometry, handwriting, iris, retina, vein, and voice. Many are concerned that these technologies are potentially invasive and open to fraud. Speaker verification, using voice or voice and video, has been recognised as an important alternative in the world of biometrics. It is less invasive and requires less expensive installations than iris and fingerprint authentication systems. The changes that occur in the human voice due to ageing have been well documented. The impact of these changes on speaker verification is less clear. In this work, we examine the effect of long-term vocal ageing on speaker verification systems.
    Funding Agency
    IRCSET
    Date From
    2009
    Date To
    2012
  • Title
    Audio-Visual Fusion for Human Computer Interaction.
    Summary
    This project focuses on key challenges in Audio-Visual Speech Recognition: (1) Given state-of-the-art audio and visual features, do early or late integration strategies work better? (2) How well does such an integration scheme translate to less controlled situations, where the speech is less constrained, intonation or prosody is more natural, or the speech is emotionally influenced? (3) Can these algorithms work on a real handheld device?
    Funding Agency
    IRCSET
    Date From
    2011
    Date To
    2014
  • Title
    Dynamic Visual Features and Improved Audio-Visual Fusion for Automatic Speech Recognition
    Summary
    Human speech is bimodal in nature. Incorporating visual features in Automatic Speech Recognition systems can improve performance in real environments. This work addresses core challenges in audio-visual speech recognition. It will develop new dynamic visual features that better capture the correlations in key mouth movements used by humans in lipreading. This is crucial in improving Hidden Markov Model performance. It will explore a new audio-fusion strategy motivated by the differing visibility of visemes allowing the influence of the audio and video stream to change over time.
    Funding Agency
    SFI
    Date From
    Oct. 2009
    Date To
    Sept. 2013
  • Title
    Advanced Metrics for Audio-Visual Signal Quality in Internet Communications
    Summary
    Funding Agency
    Enterprise Ireland/Google
    Date From
    Sept 2013
    Date To
    Dec 2014
  • Title
    Speech Quality for VoIP
    Summary
    This project is developing new metrics to measure speech quality for VoIP applications, particularly Google Chrome WebRTC
    Funding Agency
    Google Inc
    Date From
    April 2011
    Date To
    April 2012

Artificial intelligence and machine learning, Electrical engineering

Recognition

  • Glen Dimplex British Council Chevening Scholarship 1995-1996
  • Awarded a Gold Medal for Distinction in Engineering upon graduation. 1995
  • Shortlisted for Provost Teaching Award 2011
  • Maurice F. Fitzgerald Prize - first overall in the Engineering Faculty in the Degree exams. 1995
  • British Telecom Research Scholarship 1997-1999
  • David Clark Prize - first place in the Microelectronic and Electrical Engineering Degree exams. 1995
  • Cognitec Best Student Paper Award for PhD Student Finnian Kelly, International Conference on Biometrics (ICB) 2012
  • Google Faculty Award 2018
  • AI Awards (Shortlisted in Best Application of AI in an Academic Research Body) 2019
  • Fellow of Trinity College Dublin 2017
  • IEE Leslie H. Paddle Scholarship 1995-1998
  • ISCA (International Speech Communication Association)
  • IEEE Signal Processing Society
  • IEEE Women in Engineering
  • IEEE (Institute of Electrical and Electronics Engineers)
  • Senior Technical Program Committee for ACM ICMI 2019
  • PhD External Examiner University of Cambridge
  • PhD External Examiner, Athlone Institute of Technology
  • Irish representative to the EU COST Action 2101 entitled "Biometrics for Identity Documents and Smart cards"
  • Regular Session Chair at Interspeech ongoing
  • International Expert Reviewer for Swiss National Science Foundation (SNSF)
  • PhD External Examiner, University of York
  • ICT Evaluator for FP6 ICT Call FP6-2004-SME-COOP in Co-operative research (Research involving SMEs, Universities and research organisations). Acted as Group Rapporteur.
  • PhD External Examiner, Victoria University, New Zealand
  • Irish representative to the EU COST Action IC1006 Integrating Biometrics and Forensics for the Digital Age
  • Expert Evaluator for FP7 Call FP7-REGIONS-2012-2013-1 in Transnational cooperation between regional research-driven clusters
  • Peer reviewing for top conferences and journals, e.g.: IEEE ICASSP, Interspeech, ACM ICMI, EUSIPCO, IEEE ASRU, IEEE ICIP, ACL, Speech Communication, JASA, IEEE Trans Multimedia ongoing
  • TCD Representative to MIDAS (MicroElectronics Design Association of Ireland)
  • PhD External Examiner, University of East Anglia