Empirical study of machine learning based approach for opinion mining in tweets

Grigori Sidorov, Sabino Miranda-Jiménez, Francisco Viveros-Jiménez, Alexander Gelbukh, Noé Castro-Sánchez, Francisco Velásquez, Ismael Díaz-Rangel, Sergio Suárez-Guerra, Alejandro Treviño, Juan Gordon

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

92 Citas (Scopus)

Resumen

Opinion mining deals with determining of the sentiment orientation- positive, negative, or neutral-of a (short) text. Recently, it has attracted great interest both in academia and in industry due to its useful potential applications. One of the most promising applications is analysis of opinions in social networks. In this paper, we examine how classifiers work while doing opinion mining over Spanish Twitter data. We explore how different settings (n-gram size, corpus size, number of sentiment classes, balanced vs. unbalanced corpus, various domains) affect precision of the machine learning algorithms. We experimented with Naïve Bayes, Decision Tree, and Support Vector Machines. We describe also language specific preprocessing-in our case, for Spanish language-of tweets. The paper presents best settings of parameters for practical applications of opinion mining in Spanish Twitter. We also present a novel resource for analysis of emotions in texts: a dictionary marked with probabilities to express one of the six basic emotions(Probability Factor of Affective use (PFA)(Spanish Emotion Lexicon that contains 2,036 words.

Idioma originalInglés
Título de la publicación alojadaAdvances in Artificial Intelligence - 11th Mexican International Conference on Artificial Intelligence, MICAI 2012, Revised Selected Papers
Páginas1-14
Número de páginas14
EdiciónPART 1
DOI
EstadoPublicada - 2013
Evento11th Mexican International Conference on Artificial Intelligence, MICAI 2012 - San Luis Potosi, México
Duración: 27 oct. 20124 nov. 2012

Serie de la publicación

NombreLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NúmeroPART 1
Volumen7629 LNAI
ISSN (versión impresa)0302-9743
ISSN (versión digital)1611-3349

Conferencia

Conferencia11th Mexican International Conference on Artificial Intelligence, MICAI 2012
País/TerritorioMéxico
CiudadSan Luis Potosi
Período27/10/124/11/12

Huella

Profundice en los temas de investigación de 'Empirical study of machine learning based approach for opinion mining in tweets'. En conjunto forman una huella única.

Citar esto