Information Retrieval from Unsegmented Broadcast News Audio

Abstract : This paper describes a system for retrieving relevant portions of broadcast news shows starting with only the audio data. A novel method of automatically detecting and removing commercials is presented and shown to increase the performance of the system while also reducing the computational effort required. A sophisticated large vocabulary speech recogniser which produces high-quality transcriptions of the audio and a window-based retrieval system with post-retrieval merging are also described. Results are presented using the 1999 TREC-8 Spoken Document Retrieval data for the task where no story boundaries are known. Experiments investigating the effectiveness of all aspects of the system are described, and the relative benefits of automatically eliminating commercials, enforcing broadcast structure during retrieval, using relevance feedback, changing retrieval parameters and merging during post-processing are shown. An Average Precision of 46.8%, when duplicates are scored as irrelevant, is shown to be achievable using this system, with the corresponding word error rate of the recogniser being 20.5%.
Liste complète des métadonnées

Littérature citée [41 références]  Voir  Masquer  Télécharger
Contributeur : Pierre Jourlin <>
Soumis le : lundi 8 juillet 2019 - 14:19:08
Dernière modification le : mardi 9 juillet 2019 - 01:22:16


Accord explicite pour ce dépôt


  • HAL Id : hal-02171698, version 1



Sue Johnson, Pierre Jourlin, Karen Jones, Philip Woodland. Information Retrieval from Unsegmented Broadcast News Audio. International Journal of Speech Technology, Springer Verlag, 2001, 4, pp.251 - 268. ⟨hal-02171698⟩



Consultations de la notice


Téléchargements de fichiers