Deep Learning for Traits Detection Using Social Networks Interactions
Eman Mohammed Hamdi El-Sayed Ibrahim;
Abstract
Social media networks are one of the main platforms used on a daily basis by millions of people. Feelings, emotions, and opinions are expressed by posting text, images, and videos to express self or to communicate with other people. Using text, the words reflect emotions and indicate behavior towards different topics. Detecting emotions and sentiments helps in many directions such that marketing, political orientation and product reviews. A huge amount of textual data is currently available. There is a need for detecting emotions from social media networks which enhances the machine understanding of humans’ perspective.
This thesis contributes to detect positive and negative emotions from short text -tweets- by proposing a deep Convolutional Neural Network (CNN) using different types of word embeddings. CNN is the main building block of the proposed model and is responsible for extracting high-level features from low-level features. Word embeddings are the features fed to the model. The model is constructed by three CNN streams, where each CNN stream contains an embedding layer, a convolutional layer and a max-pooling layer. CNN streams are concatenated and followed by a fully connected layer for classifying text into a positive or a negative emotion class. The used textual features are different types of word embeddings including randomly initialized word embeddings and pre-trained word embeddings. The used pre-trained word embeddings are of different variants such as Word2Vec, Glove and fastText models. The word embeddings in both setups are trainable and updated through the training phase. After training, the model learns relations between words and generates task-specific word embeddings.
The proposed model has been tested on the Stanford Twitter Sentiment (STS) dataset for classifying emotions. Experiments indicated that the achieved accuracy is 78.5% when using the randomly initialized word embeddings. The accuracy increases 3.6% when using fasttext pretrained word embeddings, 4.5% when using Glove pretrained word embeddings and 6.4% when using Word2Vec pretrained word embeddings. The best tuning for the model is when using Word2Vec pretrained word embeddings which achieves 84.9% accuracy. It is concluded that using CNNs in emotion detection from text is very promising as even when using randomly initialized word embeddings it achieves 78.5% accuracy without any external dataset. Also, not only randomly initialized word embedding can achieve good accuracy in emotion detection from text, it is proven that the power of the pretrained word embeddings helps to achieve a higher competitive accuracy in emotion detection from text.
This thesis contributes to detect positive and negative emotions from short text -tweets- by proposing a deep Convolutional Neural Network (CNN) using different types of word embeddings. CNN is the main building block of the proposed model and is responsible for extracting high-level features from low-level features. Word embeddings are the features fed to the model. The model is constructed by three CNN streams, where each CNN stream contains an embedding layer, a convolutional layer and a max-pooling layer. CNN streams are concatenated and followed by a fully connected layer for classifying text into a positive or a negative emotion class. The used textual features are different types of word embeddings including randomly initialized word embeddings and pre-trained word embeddings. The used pre-trained word embeddings are of different variants such as Word2Vec, Glove and fastText models. The word embeddings in both setups are trainable and updated through the training phase. After training, the model learns relations between words and generates task-specific word embeddings.
The proposed model has been tested on the Stanford Twitter Sentiment (STS) dataset for classifying emotions. Experiments indicated that the achieved accuracy is 78.5% when using the randomly initialized word embeddings. The accuracy increases 3.6% when using fasttext pretrained word embeddings, 4.5% when using Glove pretrained word embeddings and 6.4% when using Word2Vec pretrained word embeddings. The best tuning for the model is when using Word2Vec pretrained word embeddings which achieves 84.9% accuracy. It is concluded that using CNNs in emotion detection from text is very promising as even when using randomly initialized word embeddings it achieves 78.5% accuracy without any external dataset. Also, not only randomly initialized word embedding can achieve good accuracy in emotion detection from text, it is proven that the power of the pretrained word embeddings helps to achieve a higher competitive accuracy in emotion detection from text.
Other data
| Title | Deep Learning for Traits Detection Using Social Networks Interactions | Other Titles | التعلم العميق للكشف عن سمات باستخدام تفاعلات الشبكات الاجتماعية | Authors | Eman Mohammed Hamdi El-Sayed Ibrahim | Issue Date | 2020 |
Attached Files
| File | Size | Format | |
|---|---|---|---|
| BB3582.pdf | 1.43 MB | Adobe PDF | View/Open |
Similar Items from Core Recommender Database
Items in Ain Shams Scholar are protected by copyright, with all rights reserved, unless otherwise indicated.