An audio-video front-end for multimedia applications

Title: An audio-video front-end for multimedia applications
Publication Type: Conference Papers
Year of Publication: 2000
Authors: Zotkin DN, Duraiswami R, Davis LS, Haritaoglu I
Conference Name: 2000 IEEE International Conference on Systems, Man, and Cybernetics
Date Published: 2000
Publisher: IEEE
ISBN Number: 0-7803-6583-6
Keywords: Acoustic noise, acoustical source location, Application software, audio cues, audio-video front-end, CAMERAS, Computer vision, Microphones, multimedia applications, multimedia systems, multimodal sensor fusion system, multimodal user interfaces, Position measurement, REAL TIME, Real time systems, real-time systems, sensor fusion, sound, Speech recognition, User interfaces, video cameras, video gaming, video-based person tracking, Videoconference, videoconferencing, Virtual reality, visual cues, Working environment noise
Abstract

Applications such as video gaming, virtual reality, multimodal user interfaces, and videoconferencing require systems that can locate and track persons in a room through a combination of visual and audio cues, enhance the sound that they produce, and perform identification. We describe the development of a particular multimodal sensor fusion system that is portable, runs in real time, and achieves these objectives. The system employs novel algorithms for acoustical source location, video-based person tracking, and overall system control, which are also described.

DOI: 10.1109/ICSMC.2000.885945