Information Retrieval from Unsegmented Broadcast News Audio - Avignon Université Accéder directement au contenu
Article Dans Une Revue International Journal of Speech Technology Année : 2001

Information Retrieval from Unsegmented Broadcast News Audio

Sue E Johnson
  • Fonction : Auteur
Pierre Jourlin
Karen Jones
  • Fonction : Auteur
  • PersonId : 876310
Philip C Woodland
  • Fonction : Auteur

Résumé

This paper describes a system for retrieving relevant portions of broadcast news shows starting with only the audio data. A novel method of automatically detecting and removing commercials is presented and shown to increase the performance of the system while also reducing the computational effort required. A sophisticated large vocabulary speech recogniser which produces high-quality transcriptions of the audio and a window-based retrieval system with post-retrieval merging are also described. Results are presented using the 1999 TREC-8 Spoken Document Retrieval data for the task where no story boundaries are known. Experiments investigating the effectiveness of all aspects of the system are described, and the relative benefits of automatically eliminating commercials, enforcing broadcast structure during retrieval, using relevance feedback, changing retrieval parameters and merging during post-processing are shown. An Average Precision of 46.8%, when duplicates are scored as irrelevant, is shown to be achievable using this system, with the corresponding word error rate of the recogniser being 20.5%.
Fichier principal
Vignette du fichier
Johnson2001_Article_InformationRetrievalFromUnsegm.pdf (216.09 Ko) Télécharger le fichier
Origine : Accord explicite pour ce dépôt
Loading...

Dates et versions

hal-02171698 , version 1 (08-07-2019)

Identifiants

  • HAL Id : hal-02171698 , version 1

Citer

Sue E Johnson, Pierre Jourlin, Karen Jones, Philip C Woodland. Information Retrieval from Unsegmented Broadcast News Audio. International Journal of Speech Technology, 2001, 4, pp.251 - 268. ⟨hal-02171698⟩

Collections

UNIV-AVIGNON LIA
41 Consultations
47 Téléchargements

Partager

Gmail Facebook X LinkedIn More