E-reputation monitoring on Twitter with active learning automatic annotation - Avignon Université
Report Year: 2014

E-reputation monitoring on Twitter with active learning automatic annotation

Jean-Valère Cossu
  • Role: Author
  • PersonId: 957209
Marc El Bèze
  • Role: Author
  • PersonId: 949557
Eric Sanjuan

Abstract

Opinion and trend mining on microblogs such as Twitter has recently attracted research interest in several fields, including Information Retrieval and Machine Learning. This paper develops an active learning approach for automatically annotating French-language tweets that deal with the image (i.e., representation, web reputation) of entities such as politicians, celebrities, companies or brands. Our main contribution is the methodology followed to build and provide an original annotated French dataset expressing opinions on two French politicians over time. Since the performance of natural language processing tasks is limited by the amount and quality of the data available to them, one promising alternative for some tasks is the propagation of pseudo-expert annotations. The paper focuses on key issues that arise in active learning while building a large annotated dataset: noise introduced by human annotators, the abundance of data, and the label distribution across data and entities.
Main file
activelearning.pdf (211.55 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01002818, version 1 (10-06-2014)

Identifiers

  • HAL Id: hal-01002818, version 1

Cite

Jean-Valère Cossu, Marc El Bèze, Juan-Manuel Torres-Moreno, Eric Sanjuan. E-reputation monitoring on Twitter with active learning automatic annotation. 2014. ⟨hal-01002818⟩
293 views
617 downloads
