An interaction approach between services for extracting relevant data from Tweets corpora

Abstract : We present a system based on the need of special infrastructure adequate to software agents to operate, to compose and make sense from the contents of the Web resources through the development of a multi-agent system oriented services interactions. Our method follows the different construction ontology techniques and updates them by extracting new terms and integrate them to the ontology.It is based on the detection phrases via the ontological database DBPedia. The system treats each syntagme extracted from the corpus of messages and verifies whether it is possible to associate them directly to a DBPedia knowledge. In case of failure, these service agents interact with each other in order to find the best possible answer to the problem, by operating directly in the phrase, trying to semantically modify it, until the association with ontological knowledge becomes possible. The advantage of our approach is its modularity : it is both possible to add / modify / delete a service or define a new one, and then influence the outcome product. We could compare the results extracted from a heterogeneous body of messages from the Twitter social network with Tagme method, based mainly on storage and annotation of encyclopaedic corpus.
Type de document :
Communication dans un congrès
CILC2016. 8th International Conference on Corpus Linguistics, Mar 2016, MALAGA, Spain. EPiC Series in Language and Linguistics, 1, pp.97 - 110, 2016, CILC2016. 8th International Conference on Corpus Linguistics
Liste complète des métadonnées

https://hal-univ-paris8.archives-ouvertes.fr/hal-01489730
Contributeur : Anna Pappa <>
Soumis le : lundi 24 avril 2017 - 17:01:34
Dernière modification le : mardi 22 mai 2018 - 20:40:06
Document(s) archivé(s) le : mardi 25 juillet 2017 - 12:10:20

Fichier

dref_pappa.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01489730, version 1

Collections

Citation

Mehdy Dref, Anna Pappa. An interaction approach between services for extracting relevant data from Tweets corpora. CILC2016. 8th International Conference on Corpus Linguistics, Mar 2016, MALAGA, Spain. EPiC Series in Language and Linguistics, 1, pp.97 - 110, 2016, CILC2016. 8th International Conference on Corpus Linguistics. 〈hal-01489730〉

Partager

Métriques

Consultations de la notice

86

Téléchargements de fichiers

48