Projects

Speech communication plays a key role in human intelligence. We study intelligent processing of the speech and audio that people exchange, with the aim of building automatic recognition, understanding, and interaction systems. Specifically, we work on (1) automatic speech transcription of real-world conversations, (2) analysis of audio scenes composed of multiple sound sources, and (3) humanoid robots that conduct natural dialogue by combining verbal and non-verbal information.

 

Automatic Speech Recognition and Rich Transcription

Automatic speech recognition (ASR) of lectures and meetings, together with natural language processing (NLP) that segments transcripts and extracts their information structure, to realize intelligent transcription and captioning systems.
Automatic Speech Recognition (ASR) ...Diet project ...Ainu project
Speech Emotion Recognition (SER)
Natural Language Processing for Rich Transcription

Spoken Dialogue Systems for Human-Robot Interaction

Spoken dialogue models and systems that integrate verbal and non-verbal information, for humanoid robots (androids) that behave like and interact naturally with human beings.
Spoken Language Understanding (SLU)
Interaction Analysis and Model
Spoken Dialogue Systems (SDS) ...ERICA project

Acoustic Signal Processing for Audio Scene Analysis

Analysis of real-world environments in which multiple people and a variety of sound sources are present, based on multi-modal sensing and statistical acoustic signal processing.
Source Separation and Speech Enhancement
Robust Speech Recognition
Multi-modal Conversation Analysis

CALL (Computer Assisted Language Learning)

A next-generation CALL system that automatically checks the pronunciation of foreign-language learners and serves as a virtual language teacher for simulated conversation practice.
Computer Assisted Language Learning (CALL)