An audio-video front-end for multimedia applications
Title | An audio-video front-end for multimedia applications |
Publication Type | Conference Papers |
Year of Publication | 2000 |
Authors | Zotkin DN, Duraiswami R, Davis LS, Haritaoglu I |
Conference Name | 2000 IEEE International Conference on Systems, Man, and Cybernetics |
Date Published | 2000/// |
Publisher | IEEE |
ISBN Number | 0-7803-6583-6 |
Keywords | Acoustic noise, acoustical source location, Application software, audio cues, audio-video front-end, CAMERAS, Computer vision, Microphones, multimedia applications, multimedia systems, multimodal sensor fusion system, multimodal user interfaces, Position measurement, REAL TIME, Real time systems, real-time systems, sensor fusion, sound, Speech recognition, User interfaces, video cameras, video gaming, video-based person tracking, Videoconference, videoconferencing, Virtual reality, visual cues, Working environment noise |
Abstract | Applications such as video gaming, virtual reality, multimodal user interfaces and videoconferencing, require systems that can locate and track persons in a room through a combination of visual and audio cues, enhance the sound that they produce, and perform identification. We describe the development of a particular multimodal sensor fusion system that is portable, runs in real time and achieves these objectives. The system employs novel algorithms for acoustical source location, video-based person tracking and overall system control, which are also described |
DOI | 10.1109/ICSMC.2000.885945 |