E-reputation monitoring on Twitter with active learning automatic annotation

Abstract : Opinion and trend mining on micro blogs like twitter recently attracted research interest in several fields including Information Retrieval and Machine Learning. This paper is intended to develop a so-called active learning for automatically annotating French language tweets that deal with the image (i.e., representation, web reputation) of entities : such as politicians, celebrities, companies or brands. Our main contribution is the methodology followed to build and provide an original annotated French data-set expressing opinion on two French politicians over time. Since the performance of natural language processing tasks are limited by the amount and quality of data available to them, one promising alternative for some tasks is the propagation of pseudo-expert annotations. The paper is focused on key issues about active learning while building a large annotated data set, from noise introduced by humans annotators, abundance of data and the label distribution across data and entities.
Type de document :
Rapport
2014
Liste complète des métadonnées

Littérature citée [14 références]  Voir  Masquer  Télécharger

https://hal-univ-avignon.archives-ouvertes.fr/hal-01002818
Contributeur : Jean-Valère Cossu <>
Soumis le : mardi 10 juin 2014 - 13:02:22
Dernière modification le : vendredi 26 janvier 2018 - 10:46:27
Document(s) archivé(s) le : mercredi 10 septembre 2014 - 11:11:15

Fichier

activelearning.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01002818, version 1

Collections

Citation

Jean-Valère Cossu, Marc El Bèze, Torres-Moreno Juan-Manuel, Eric Sanjuan. E-reputation monitoring on Twitter with active learning automatic annotation. 2014. 〈hal-01002818〉

Partager

Métriques

Consultations de la notice

358

Téléchargements de fichiers

466