Enhancing CluStream Algorithm for Clustering Big Data Streaming over Sliding Window
Sayed, Doaa; Rady, Sherine; Aref, M.;
Abstract
Data stream mining becomes a hot research issue in the ongoing time. The main challenge in data stream mining is the knowledge extraction in real-time from an immense, data stream in only one scan. Data stream clustering demonstrates an significant task in data stream processing. This paper introduces SCluStream an algorithm for determining clusters over a sliding window to manage such challenges. The algorithm is an improvement over CluStream which does not involve this sliding window concept. In the sliding window model, only the most recent data is utilized while the old data is eliminated, which allows for faster execution. A better clustering technique is also involved which managed to contribute to accuracy improvement. The proposed algorithm has been tested on two real datasets; charitable donation data set and forest cover type data set. The results showed that comparing SCluStream to CluStream has proven that the former algorithm is more efficient for clustering big data streams in regard to the accuracy as well as the utilized time and memory usages.
Other data
Title | Enhancing CluStream Algorithm for Clustering Big Data Streaming over Sliding Window | Authors | Sayed, Doaa; Rady, Sherine ; Aref, M. | Keywords | mining in data streams;Data stream clustering;Window models;Time series in big data;sliding window | Issue Date | 1-Jul-2020 | Conference | 2020 12th International Conference on Electrical Engineering, ICEENG 2020 | ISBN | 9781728130521 | DOI | 10.1109/ICEENG45378.2020.9171705 | Scopus ID | 2-s2.0-85092004321 |
Attached Files
File | Description | Size | Format | Existing users please Login |
---|---|---|---|---|
ICEENG.2020.Doaa-et-al.pdf | 233.56 kB | Adobe PDF | Request a copy |
Similar Items from Core Recommender Database
Items in Ain Shams Scholar are protected by copyright, with all rights reserved, unless otherwise indicated.