New Approaches to Spoken Document Retrieval |
| |
Authors: | Martin Wechsler Eugen Munteanu Peter Schäuble |
| |
Institution: | (1) Mckinsey and Company, Switzerland;(2) Eugen Muntiany, Eurospider Information Technology A6, Zurich, Switzerland;(3) Eurospider Information Technology A6, Zurich, Switzerland |
| |
Abstract: | This paper presents four novel techniques for open-vocabulary spoken document retrieval: a method to detect slots that possibly contain a query feature; a method to estimate occurrence probabilities; a technique that we call collection-wide probability re-estimation and a weighting scheme which takes advantage of the fact that long query features are detected more reliably. These four techniques have been evaluated using the TREC-6 spoken document retrieval test collection to determine the improvements in retrieval effectiveness with respect to a baseline retrieval method. Results show that the retrieval effectiveness can be improved considerably despite the large number of speech recognition errors. |
| |
Keywords: | spoken document retrieval speech recognition retrieval effectiveness |
本文献已被 SpringerLink 等数据库收录! |