Enhancing CluStream Algorithm for Clustering Big Data Streaming over Sliding Window

Sayed, Doaa; Rady, Sherine; Aref, M.;

Abstract


Data stream mining becomes a hot research issue in the ongoing time. The main challenge in data stream mining is the knowledge extraction in real-time from an immense, data stream in only one scan. Data stream clustering demonstrates an significant task in data stream processing. This paper introduces SCluStream an algorithm for determining clusters over a sliding window to manage such challenges. The algorithm is an improvement over CluStream which does not involve this sliding window concept. In the sliding window model, only the most recent data is utilized while the old data is eliminated, which allows for faster execution. A better clustering technique is also involved which managed to contribute to accuracy improvement. The proposed algorithm has been tested on two real datasets; charitable donation data set and forest cover type data set. The results showed that comparing SCluStream to CluStream has proven that the former algorithm is more efficient for clustering big data streams in regard to the accuracy as well as the utilized time and memory usages.


Other data

Title Enhancing CluStream Algorithm for Clustering Big Data Streaming over Sliding Window
Authors Sayed, Doaa; Rady, Sherine ; Aref, M. 
Keywords mining in data streams;Data stream clustering;Window models;Time series in big data;sliding window
Issue Date 1-Jul-2020
Conference 2020 12th International Conference on Electrical Engineering, ICEENG 2020
ISBN 9781728130521
DOI 10.1109/ICEENG45378.2020.9171705
Scopus ID 2-s2.0-85092004321

Attached Files

File Description SizeFormat Existing users please Login
ICEENG.2020.Doaa-et-al.pdf233.56 kBAdobe PDF    Request a copy
Recommend this item

Similar Items from Core Recommender Database

Google ScholarTM

Check

Citations 7 in scopus


Items in Ain Shams Scholar are protected by copyright, with all rights reserved, unless otherwise indicated.