Clustering Research Papers Using Genetic Algorithm Optimized Self-Organizing Maps

Ahmed, Reham Fathy M.; Salama, Cherif; Mahdi, Hani;

Abstract


With the huge amount of published research papers, retrieving relevant information is a difficult task for any researcher. Effective clustering algorithms can help improve and simplify the retrieval process. Here, we propose an approach for automatic clustering for text document using a Self-Organizing Map (SOM). It is one of unsupervised artificial neural network that widely used for data analysis, data compression, clustering, and data mining. The quality and accuracy of a SOM algorithm depends on the selection of values for some of its parameters which are its initial learning rate, SOM matrix dimensions, and the number of iterations. Best values are typically selected using trial and error; however, in the current paper we suggest a more systematic approach to parameters optimization using the genetic algorithm. The proposed method is applied to cluster 3 scientific papers datasets using their keywords. Similar research papers were mapped closer to each other. Clustering results were validated using the Dunn index.


Other data

Title Clustering Research Papers Using Genetic Algorithm Optimized Self-Organizing Maps
Authors Ahmed, Reham Fathy M.; Salama, Cherif ; Mahdi, Hani
Keywords cluster validity indices | Document clustering | Genetic Algorithm | Self-Organizing Maps | Word2vec
Issue Date 15-Dec-2020
Journal Proceedings of ICCES 2020 - 2020 15th International Conference on Computer Engineering and Systems 
Conference 2020 15th International Conference on Computer Engineering and Systems (ICCES)
ISBN 9780738105598
DOI 10.1109/ICCES51560.2020.9334573
Scopus ID 2-s2.0-85100850612

Recommend this item

Similar Items from Core Recommender Database

Google ScholarTM

Check

Citations 3 in scopus


Items in Ain Shams Scholar are protected by copyright, with all rights reserved, unless otherwise indicated.