ON DATA MINING IN BIOINFORMATICS

Wael Zakaria Abd Allah Mohamed;

Abstract


Using data mining techniques in bioinformatics is an attractive re­ search area. This thesis is devoted to extract the association rules (as one of the basic data mining tasks) included in DNA microarray datasets. The thesis presents a study of various algorithms concerning the row and column enumeration based methods, and it concentrates on analysing the MAXCONF algorithm. Biological evaluations show that the MAXCONF algorithm provides excellent results compared with other algorithms applied to DNA microarray. It mines maxi­ mal high confident association rules, MHCR, from DNA microarray dataset. However, it has two drawbacks which are represented in consuming expensive computations and producing many useless rules. Therefore, a slight modification of the algorithm is proposed for fixing these drawbacks. Moreover, two versions of the MAXCONF algorithm have been implemented: the first (MAXCONFJ) mines MHCR from up-expressed genes dataset. The second (MAXCONF2) mines MHCR from up/down-expressed genes dataset.
Two new sequential algorithms (MMHCR and MCR-Miner) are
proposed for mining maximal high confident association rules from up-expressed and up/down-expressed genes dataset respectively. Re­ ally, these algorithms are the column enumeration based analog of MAXCONFl and MAXCONF2 algorithms. In order to build these


Other data

Title ON DATA MINING IN BIOINFORMATICS
Other Titles عن تنقيب البيانات في المعلوماتية الحيوية
Authors Wael Zakaria Abd Allah Mohamed
Keywords Data mining. DA minoarra' tcdmologv. rmnmg association rules, class association rules. umc:l'r da:->sifiuu iun.
Issue Date 2015

Attached Files

File SizeFormat
B4796.pdf1.39 MBAdobe PDFView/Open
Recommend this item

Similar Items from Core Recommender Database

Google ScholarTM

Check

views 16 in Shams Scholar
downloads 5 in Shams Scholar


Items in Ain Shams Scholar are protected by copyright, with all rights reserved, unless otherwise indicated.