An Empirical Investigation of Performances of Different Word Embedding Algorithms in Comment Clustering
dc.authorid | Yildiz, Tugba/0000-0002-8552-2806 | |
dc.authorwosid | Yildiz, Tugba/ABC-5958-2020 | |
dc.contributor.author | Dorani, Eimal | |
dc.contributor.author | Duru, Nevcihan | |
dc.contributor.author | Yildiz, Tugba | |
dc.date.accessioned | 2024-07-18T20:47:19Z | |
dc.date.available | 2024-07-18T20:47:19Z | |
dc.date.issued | 2019 | |
dc.department | İstanbul Bilgi Üniversitesi | en_US |
dc.description | Innovations in Intelligent Systems and Applications Conference (ASYU) -- OCT 31-NOV 02, 2019 -- Izmir, TURKEY | en_US |
dc.description.abstract | With the rapid growth of the usage and interest in social network services, evaluating comment clustering has become increasingly important for various commercial and scientific applications. Analyzing, organizing and ascertaining the overall theme of a large volume of comments is a challenging and time-consuming task which has attracted much attention recently. In this study, we proposed a method to address the comment clustering problem. Extensive experiments have been conducted on seven different comment datasets using TF-IDF and different word embedding algorithms, namely Word2vec, Glove and FastText; the internal clustering validation have been conducted to evaluate the performance of each method in clustering of the comments. We observed that word embedding produced significantly better results in comment clustering than TF-IDF. In addition, word2vec has shown the best performance among all; however, we found that Glove is the most stable and consistent across all datasets such that the performance improved as dataset size increased. | en_US |
dc.description.sponsorship | Yasar Univ,IEEE Turkey Sect,Yildiz Teknik Univ,Idea,Siemens | en_US |
dc.identifier.doi | 10.1109/asyu48272.2019.8946379 | |
dc.identifier.endpage | 380 | en_US |
dc.identifier.isbn | 978-1-7281-2868-9 | |
dc.identifier.scopus | 2-s2.0-85078347723 | en_US |
dc.identifier.scopusquality | N/A | en_US |
dc.identifier.startpage | 377 | en_US |
dc.identifier.uri | https://doi.org/10.1109/asyu48272.2019.8946379 | |
dc.identifier.uri | https://hdl.handle.net/11411/7771 | |
dc.identifier.wos | WOS:000631252400070 | en_US |
dc.identifier.wosquality | N/A | en_US |
dc.indekslendigikaynak | Web of Science | en_US |
dc.indekslendigikaynak | Scopus | en_US |
dc.language.iso | en | en_US |
dc.publisher | IEEE | en_US |
dc.relation.ispartof | 2019 Innovations in Intelligent Systems and Applications Conference (Asyu) | en_US |
dc.relation.publicationcategory | Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı | en_US |
dc.rights | info:eu-repo/semantics/closedAccess | en_US |
dc.subject | Comment Clustering | en_US |
dc.subject | Word2vec | en_US |
dc.subject | Glove | en_US |
dc.subject | Fasttext | en_US |
dc.subject | Tf=İdf | en_US |
dc.title | An Empirical Investigation of Performances of Different Word Embedding Algorithms in Comment Clustering | en_US |
dc.type | Conference Object | en_US |