Fuzzy C means Based Evaluation Algorithms For Cancer Gene Expression Data Clustering

Omar Al-Janabee; Basad Al-Sarray

doi:https://doi.org/10.52866/ijcsm.2022.02.01.004

Details

Publication Date

Mon Feb 21 2022

Journal Name

Iraqi Journal For Computer Science And Mathematics

DOI

https://doi.org/10.52866/ijcsm.2022.02.01.004

Choose Citation Style

Statistics

View publication

13

Statistics

(1)

Fuzzy C means Based Evaluation Algorithms For Cancer Gene Expression Data Clustering

Omar Al-Janabee

Basad Al-Sarray

...Show More Authors

The influx of data in bioinformatics is primarily in the form of DNA, RNA, and protein sequences. This condition places a significant burden on scientists and computers. Some genomics studies depend on clustering techniques to group similarly expressed genes into one cluster. Clustering is a type of unsupervised learning that can be used to divide unknown cluster data into clusters. The k-means and fuzzy c-means (FCM) algorithms are examples of algorithms that can be used for clustering. Consequently, clustering is a common approach that divides an input space into several homogeneous zones; it can be achieved using a variety of algorithms. This study used three models to cluster a brain tumor dataset. The first model uses FCM, which is used to cluster genes. FCM allows an object to belong to two or more clusters with a membership grade between zero and one and the sum of belonging to all clusters of each gene is equal to one. This paradigm is useful when dealing with microarray data. The total time required to implement the first model is 22.2589 s. The second model combines FCM and particle swarm optimization (PSO) to obtain better results. The hybrid algorithm, i.e., FCM–PSO, uses the DB index as objective function. The experimental results show that the proposed hybrid FCM–PSO method is effective. The total time of implementation of this model is 89.6087 s. The third model combines FCM with a genetic algorithm (GA) to obtain better results. This hybrid algorithm also uses the DB index as objective function. The experimental results show that the proposed hybrid FCM–GA method is effective. Its total time of implementation is 50.8021 s. In addition, this study uses cluster validity indexes to determine the best partitioning for the underlying data. Internal validity indexes include the Jaccard, Davies Bouldin, Dunn, Xie–Beni, and silhouette. Meanwhile, external validity indexes include Minkowski, adjusted Rand, and percentage of correctly categorized pairings. Experiments conducted on brain tumor gene expression data demonstrate that the techniques used in this study outperform traditional models in terms of stability and biological significance.

View Publication

Publication Date

Tue Mar 01 2016

Journal Name

Journal Of Pharmaceutical Sciences

Development and Evaluation of Biodegradable Particles Coloaded With Antigen and the Toll-Like Receptor Agonist, Pentaerythritol Lipid A, as a Cancer Vaccine

Kawther

...Show More Authors

View Publication

(21)

Publication Date

Fri Jan 01 2016

Journal Name

Machine Learning And Data Mining In Pattern Recognition

A New Strategy for Case-Based Reasoning Retrieval Using Classification Based on Association

Ahmed

...Show More Authors

View Publication Preview PDF

(7)

(5)

Publication Date

Thu Mar 30 2023

Journal Name

Iraqi Journal Of Science

A Tri-Gene Ontology Migration Operator for Improving the Performance of Meta-heuristics in Complex Detection Problems

Isra H.

Dhia A. Jumaa

Bara'a Ali

...Show More Authors

Detecting protein complexes in protein-protein interaction (PPI) networks is a challenging problem in computational biology. To uncover a PPI network into a complex structure, different meta-heuristic algorithms have been proposed in the literature. Unfortunately, many of such methods, including evolutionary algorithms (EAs), are based solely on the topological information of the network rather than on biological information. Despite the effectiveness of EAs over heuristic methods, more inherent biological properties of proteins are rarely investigated and exploited in these approaches. In this paper, we proposed an EA with a new mutation operator for complex detection problems. The proposed mutation operator is formulate

(3)

Publication Date

Sun Jun 04 2017

Journal Name

Baghdad Science Journal

Detection of zpx gene of Cronobacter sakazakii isolated from Clinical samples for Iraqi children under Two Years

cronobacter sakazakii

Stool

Urine

Blood

Cerebrospinal fluid

zpx gene.

Assist. Lecturer Tharieyt Abdulrahman

Assist Prof. Dr. Luma abdal Hady

...Show More Authors

The study included 200 samples were collected from children under two years included (50 samples from each of Cerebrospinal fluid, Blood, Stool and Urine) from, (Central Children Hospital and Children's Protections Educational Hospital) The Iraqi Ministry of Health, the Department of Health Baghdad .the period from the first of 2015 September to the first of December 2015, Were obtained isolates bacterial subjected to the cultural, microscopic and biochemical examination and diagnosed to the species by using vitek2 system .The results showed there were contamination in 6.5% of clinical samples. The diagnosed colonies which gave pink color on the MacConkey agar, golden yellow color on the Trypton Soy agar and green color on t

View Publication Preview PDF

Publication Date

Sun Jun 07 2015

Journal Name

Baghdad Science Journal

On The Nearby-Tip Strain Investigation and Failure-Propability Evaluation for Impacted Thin Plates Using the 2-Random-Variables Multi-Canonical-Based Joint Propability Distributions

Joint Probability Distributions

Multi-Canonical Probability Functions

Nearby-Tip Strains

Thin Plates

Fracture and Failure Analyses

Rasha A.

Muthanna A.

...Show More Authors

The study of the validity and probability of failure in solids and structures is highly considered as one of the most incredibly-highlighted study fields in many science and engineering applications, the design analysts must therefore seek to investigate the points where the failing strains may be occurred, the probabilities of which these strains can cause the existing cracks to propagate through the fractured medium considered, and thereafter the solutions by which the analysts can adopt the approachable techniques to reduce/arrest these propagating cracks.In the present study a theoretical investigation upon simply-supported thin plates having surface cracks within their structure is to be accomplished, and the applied impact load to the

View Publication Preview PDF

Publication Date

Wed Sep 23 2020

Journal Name

Artificial Intelligence Research

Hybrid approaches to feature subset selection for data classification in high-dimensional feature space

Maysa

John Q

...Show More Authors

This paper proposes two hybrid feature subset selection approaches based on the combination (union or intersection) of both supervised and unsupervised filter approaches before using a wrapper, aiming to obtain low-dimensional features with high accuracy and interpretability and low time consumption. Experiments with the proposed hybrid approaches have been conducted on seven high-dimensional feature datasets. The classifiers adopted are support vector machine (SVM), linear discriminant analysis (LDA), and K-nearest neighbour (KNN). Experimental results have demonstrated the advantages and usefulness of the proposed methods in feature subset selection in high-dimensional space in terms of the number of selected features and time spe

View Publication

Publication Date

Thu Nov 30 2023

Journal Name

Iraqi Geological Journal

Inverting Gravity Data to Density and Velocity Models for Selected Area in Southwestern Iraq

Athir

Osamah

...Show More Authors

The gravity method is a measurement of relatively noticeable variations in the Earth’s gravitational field caused by lateral variations in rock's density. In the current research, a new technique is applied on the previous Bouguer map of gravity surveys (conducted from 1940–1950) of the last century, by selecting certain areas in the South-Western desert of Iraqi-territory within the provinces' administrative boundary of Najaf and Anbar. Depending on the theory of gravity inversion where gravity values could be reflected to density-contrast variations with the depths; so, gravity data inversion can be utilized to calculate the models of density and velocity from four selected depth-slices 9.63 Km, 1.1 Km, 0.682 Km and 0.407 Km.

View Publication Preview PDF

(3)

Publication Date

Fri Jan 01 2016

Journal Name

Statistics And Its Interface

Search for risk haplotype segments with GWAS data by use of finite mixture models

ALI

Jian

...Show More Authors

The region-based association analysis has been proposed to capture the collective behavior of sets of variants by testing the association of each set instead of individual variants with the disease. Such an analysis typically involves a list of unphased multiple-locus genotypes with potentially sparse frequencies in cases and controls. To tackle the problem of the sparse distribution, a two-stage approach was proposed in literature: In the first stage, haplotypes are computationally inferred from genotypes, followed by a haplotype coclassification. In the second stage, the association analysis is performed on the inferred haplotype groups. If a haplotype is unevenly distributed between the case and control samples, this haplotype is labeled

View Publication

Publication Date

Fri Aug 01 2014

Journal Name

Journal Of Economics And Administrative Sciences

Efficiency Measurement Model for Postgraduate Programs and Undergraduate Programs by Using Data Envelopment Analysis

خالد زغيتون

...Show More Authors

Measuring the efficiency of postgraduate and undergraduate programs is one of the essential elements in educational process. In this study, colleges of Baghdad University and data for the academic year (2011-2012) have been chosen to measure the relative efficiencies of postgraduate and undergraduate programs in terms of their inputs and outputs. A relevant method to conduct the analysis of this data is Data Envelopment Analysis (DEA). The effect of academic staff to the number of enrolled and alumni students to the postgraduate and undergraduate programs are the main focus of the study.

View Publication Preview PDF

Publication Date

Tue Dec 05 2023

Journal Name

Baghdad Science Journal

A Novel System for Confidential Medical Data Storage Using Chaskey Encryption and Blockchain Technology

Blockchain

BFlow

Chaskey

Healthcare

IoT

Security

Aymen Mudheher

Lamia Chaari

Samiha

...Show More Authors

Secure storage of confidential medical information is critical to healthcare organizations seeking to protect patient's privacy and comply with regulatory requirements. This paper presents a new scheme for secure storage of medical data using Chaskey cryptography and blockchain technology. The system uses Chaskey encryption to ensure integrity and confidentiality of medical data, blockchain technology to provide a scalable and decentralized storage solution. The system also uses Bflow segmentation and vertical segmentation technologies to enhance scalability and manage the stored data. In addition, the system uses smart contracts to enforce access control policies and other security measures. The description of the system detailing and p

View Publication Preview PDF

(4)

1 2 ... 62 63 64 65 ... 917 918