TY - JOUR
T1 - A tweets classifier based on cosine similarity
AU - Focil-Arias, Carolina
AU - Zúñiga, Jorge
AU - Sidorov, Grigori
AU - Batyrshin, Ildar
AU - Gelbukh, Alexander
PY - 2017
Y1 - 2017
N2 - The 2017 Microblog Cultural Contextualization task consists of three challenges: (1) Content Analysis, (2) Microblog search, and (3) TimeLine illustration. This paper describes the use of cosine similarity, which measures the similarity between two vectors of an inner product space. Two approaches were used, (1) word2vec and (2) Bag-of-Words (BoW), to extract all tweets relevant to each event related to the four festivals: Charrues, Transmusicales, Avignon and Edinburgh.
AB - The 2017 Microblog Cultural Contextualization task consists of three challenges: (1) Content Analysis, (2) Microblog search, and (3) TimeLine illustration. This paper describes the use of cosine similarity, which measures the similarity between two vectors of an inner product space. Two approaches were used, (1) word2vec and (2) Bag-of-Words (BoW), to extract all tweets relevant to each event related to the four festivals: Charrues, Transmusicales, Avignon and Edinburgh.
KW - Bag-of-Words
KW - Cosine similarity
KW - Information retrieval
KW - Natural language processing
KW - Opinion mining
KW - Word2vec
UR - http://www.scopus.com/inward/record.url?scp=85034753114&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85034753114
SN - 1613-0073
VL - 1866
JO - CEUR Workshop Proceedings
JF - CEUR Workshop Proceedings
T2 - 18th Working Notes of CLEF Conference and Labs of the Evaluation Forum, CLEF 2017
Y2 - 11 September 2017 through 14 September 2017
ER -