A video-based framework for the analysis of presentations/posters
Title | A video-based framework for the analysis of presentations/posters |
Publication Type | Journal Articles |
Year of Publication | 2005 |
Authors | Zandifar A, Duraiswami R, Davis LS |
Journal | International Journal on Document Analysis and Recognition |
Volume | 7 |
Issue | 2 |
Pagination | 178 - 187 |
Date Published | 2005 |
Abstract | Detection and recognition of textual information in an image or video sequence is important for many applications. The increased resolution and capabilities of digital cameras and faster mobile processing allow for the development of interesting systems. We present an application based on the capture of information presented at a slide-show presentation or at a poster session. We describe the development of a system to process the textual and graphical information in such presentations. The application integrates video and image processing, document layout understanding, optical character recognition (OCR), and pattern recognition. The digital imaging device captures slides/poster images, and the computing module preprocesses and annotates the content. Various problems related to metric rectification, key-frame extraction, text detection, enhancement, and system integration are addressed. The results are promising for applications such as a mobile text reader for the visually impaired. By using powerful text-processing algorithms, we can extend this framework to other applications, e.g., document and conference archiving, camera-based semantics extraction, and ontology creation. |