The influx of data in bioinformatics is primarily in the form of DNA, RNA, and protein sequences, which places a significant burden on scientists and computing resources. Many genomics studies depend on clustering techniques to group similarly expressed genes into one cluster. Clustering is a form of unsupervised learning that partitions unlabeled data into clusters; the k-means and fuzzy c-means (FCM) algorithms are common examples. In general, clustering divides an input space into several homogeneous zones and can be achieved using a variety of algorithms. This study used three models to cluster a brain tumor gene expression dataset. The first model uses FCM to cluster genes. FCM allows an object to belong to two or more clusters with a membership grade between zero and one, and the membership grades of each gene across all clusters sum to one; this property makes FCM well suited to microarray data. The total time required to run the first model is 22.2589 s. The second model combines FCM with particle swarm optimization (PSO) to obtain better results; the hybrid FCM–PSO algorithm uses the Davies–Bouldin (DB) index as its objective function. The experimental results show that the proposed hybrid FCM–PSO method is effective, with a total running time of 89.6087 s. The third model combines FCM with a genetic algorithm (GA); this hybrid also uses the DB index as its objective function, and the experimental results show that the proposed hybrid FCM–GA method is effective, with a total running time of 50.8021 s. In addition, this study uses cluster validity indexes to determine the best partitioning of the underlying data: the internal validity indexes include the Jaccard, Davies–Bouldin, Dunn, Xie–Beni, and silhouette indexes, while the external validity indexes include the Minkowski score, the adjusted Rand index, and the percentage of correctly categorized pairs. Experiments on brain tumor gene expression data demonstrate that the techniques used in this study outperform traditional models in terms of stability and biological significance.
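To make the membership constraint concrete, the following is a minimal fuzzy c-means sketch in plain NumPy, not the authors' implementation; the fuzzifier m, tolerance, and random initialization are assumptions. Each row of the membership matrix U sums to one, which is exactly the property the abstract describes:

```python
import numpy as np

def fuzzy_c_means(X, c, m=2.0, max_iter=100, tol=1e-5, seed=0):
    """Minimal FCM: returns cluster centers and a membership matrix U
    of shape (n_samples, c) whose rows each sum to one."""
    rng = np.random.default_rng(seed)
    U = rng.random((X.shape[0], c))
    U /= U.sum(axis=1, keepdims=True)                # rows sum to one
    for _ in range(max_iter):
        Um = U ** m
        centers = (Um.T @ X) / Um.sum(axis=0)[:, None]   # weighted means
        d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        d = np.fmax(d, 1e-12)                        # avoid division by zero
        inv = d ** (-2.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=1, keepdims=True) # standard FCM update
        if np.abs(U_new - U).max() < tol:
            return centers, U_new
        U = U_new
    return centers, U
```

Hard labels follow from `U.argmax(axis=1)`, and the Davies–Bouldin score of that partition can then be computed with `sklearn.metrics.davies_bouldin_score`, the same index the hybrid FCM–PSO and FCM–GA models optimize.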
Psychological research centers provide indirect contact with professionals across the domains of human life, the work environment, family life, and the psychological support infrastructure for psychiatric patients. This research aims to detect job apathy patterns in the behavior of employee groups at the University of Baghdad and the Iraqi Ministry of Higher Education and Scientific Research. The investigation presents an approach that uses data mining techniques to acquire new knowledge, and it differs from statistical studies in supporting the researchers' evolving needs. These techniques filter out redundant or irrelevant attributes to discover interesting patterns, as sketched below. The principal issue is to identify several important and effective questions taken from …
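The abstract does not specify the exact mining pipeline, so the following is only an illustrative attribute-filtering step of the kind it alludes to, assuming numeric survey responses in a pandas DataFrame; the thresholds and the function name are hypothetical:

```python
import numpy as np
import pandas as pd

def filter_attributes(df: pd.DataFrame, var_tol=1e-3, corr_tol=0.95):
    """Drop near-constant attributes, then one attribute of each highly
    correlated (redundant) pair. Assumes all columns are numeric."""
    df = df.loc[:, df.var() > var_tol]           # drop near-constant columns
    corr = df.corr().abs()
    # Keep only the upper triangle so each pair is examined once.
    upper = corr.where(np.triu(np.ones(corr.shape, dtype=bool), k=1))
    redundant = [c for c in upper.columns if (upper[c] > corr_tol).any()]
    return df.drop(columns=redundant)
```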
Spatial data observed on a group of areal units is common in scientific applications. The usual hierarchical approach to modeling this kind of dataset is to introduce a spatial random effect with an autoregressive prior. However, the usual Markov chain Monte Carlo scheme for this hierarchical framework requires the spatial effects to be sampled from their full conditional posteriors one by one, resulting in poor mixing; more importantly, it makes the model computationally inefficient for datasets with a large number of units. In this article, we propose a Bayesian approach that uses the spectral structure of the adjacency matrix to construct a low-rank expansion for modeling spatial dependence. We propose a pair of computationally efficient estimation …
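A minimal sketch of the low-rank idea, not the article's exact construction: take the leading eigenvectors of the adjacency matrix as a spatial basis, so the n-vector of spatial effects is approximated by only q coefficients and can be updated jointly rather than one unit at a time. The toy adjacency matrix and the rank q below are assumptions:

```python
import numpy as np

def low_rank_spatial_basis(A, q):
    """Return the q leading eigenvectors of the symmetric adjacency
    matrix A; they serve as a basis for a reduced-rank spatial effect."""
    eigvals, eigvecs = np.linalg.eigh(A)         # ascending eigenvalues
    order = np.argsort(eigvals)[::-1][:q]        # take the largest q
    return eigvecs[:, order]

n, q = 200, 15
rng = np.random.default_rng(1)
A = (rng.random((n, n)) < 0.05).astype(float)
A = np.maximum(A, A.T)                           # symmetrize a toy adjacency
Phi = low_rank_spatial_basis(A, q)
delta = rng.normal(size=q)                       # q << n coefficients
w = Phi @ delta                                  # low-rank spatial effect
```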
Cryptography is the process of transforming a message to prevent unauthorized access to data. One of the main problems, and an important component of secret-key cryptography, is the key itself: for a high level of secure communication, the key plays a central role, and both parties must hold a copy of the secret key, which, unfortunately, is not easy to achieve. The Triple Data Encryption Standard (3DES) algorithm is weak due to its weak key generation, so the key must be reconfigured to make the algorithm more secure, effective, and strong. A stronger encryption key enhances the security of the Triple Data Encryption Standard algorithm. This paper proposes a combination of two efficient encryption algorithms to …
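The abstract is cut off before the two combined algorithms are named, so the following is only a minimal sketch of the general idea of strengthening a 3DES key, assuming a SHA-256 based derivation and the pycryptodome library; the derive_3des_key helper and the sample secret are hypothetical:

```python
import hashlib
from Crypto.Cipher import DES3                   # pycryptodome
from Crypto.Random import get_random_bytes
from Crypto.Util.Padding import pad, unpad

def derive_3des_key(secret: bytes) -> bytes:
    """Hypothetical strengthening step: hash the shared secret with
    SHA-256, keep 24 bytes, and fix the DES parity bits."""
    return DES3.adjust_key_parity(hashlib.sha256(secret).digest()[:24])

key = derive_3des_key(b"shared secret agreed by both parties")
iv = get_random_bytes(8)                         # DES block size is 8 bytes
ct = DES3.new(key, DES3.MODE_CBC, iv).encrypt(
    pad(b"message to protect", DES3.block_size))
pt = unpad(DES3.new(key, DES3.MODE_CBC, iv).decrypt(ct), DES3.block_size)
assert pt == b"message to protect"
```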
Anomaly detection is still a difficult task. To address this problem, we propose to strengthen the DBSCAN algorithm by converting all data to a graph concept frame (CFG). As is well known, DBSCAN groups records of the same kind into a cluster, while records that fall outside any cluster are treated as noise or anomalies; that is, DBSCAN can detect abnormal points that lie farther than a set threshold from any cluster (extreme values). However, not all anomalies are of this kind: some records are neither isolated nor far from a specific group, yet they do not recur and are considered abnormal with respect to the known group. The analysis showed that DBSCAN using the …
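To illustrate the baseline behavior the abstract describes, that plain DBSCAN flags only points far from any dense cluster, here is a minimal scikit-learn sketch on invented data; the eps and min_samples values are assumptions, and this does not include the paper's CFG conversion:

```python
import numpy as np
from sklearn.cluster import DBSCAN

rng = np.random.default_rng(0)
X = np.vstack([
    rng.normal(0.0, 0.3, size=(100, 2)),         # dense cluster A
    rng.normal(5.0, 0.3, size=(100, 2)),         # dense cluster B
    rng.uniform(-3.0, 8.0, size=(5, 2)),         # scattered outliers
])
labels = DBSCAN(eps=0.5, min_samples=5).fit_predict(X)
anomalies = X[labels == -1]                      # DBSCAN labels noise as -1
print(len(anomalies), "points flagged as noise/anomalies")
```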
It has increasingly been recognised that future developments in geospatial data handling will centre on geospatial data on the web: Volunteered Geographic Information (VGI). The evaluation of VGI data quality, including positional and shape similarity, has become a recurrent subject in the scientific literature over the last ten years. The OpenStreetMap (OSM) project is the most popular of the leading VGI platforms: an online geospatial database that produces and supplies free, editable geospatial datasets for a worldwide audience. The goal of this paper is to present a comprehensive overview of the quality assurance of OSM data. In addition, the credibility of open-source geospatial data is discussed, highlighting the diff…
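As one concrete example of the positional and shape similarity measures this kind of quality assessment relies on, here is a hedged Shapely sketch comparing a hypothetical OSM road centreline against an authoritative reference line; the coordinates are invented and this is not a method from the paper:

```python
from shapely.geometry import LineString

osm_road = LineString([(0, 0), (1, 1.02), (2, 2.01)])   # volunteered trace
ref_road = LineString([(0, 0), (1, 1.00), (2, 2.00)])   # reference line

print(osm_road.hausdorff_distance(ref_road))    # worst-case displacement
print(osm_road.length - ref_road.length)        # crude shape (length) check
```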
Cloud computing (CC) is a fast-growing technology that offers computing, networking, and storage services that can be accessed and used over the internet. Cloud services save users money because they are pay-per-use, and they save time because they are on-demand and elastic, a defining aspect of cloud computing. However, several security issues must be addressed before users store data in the cloud. Because users have no direct control over data outsourced to the cloud, particularly personal and sensitive data (health, financial, military, etc.), and do not know where the data are stored, they must be able to verify that the cloud stores and maintains the outsourced data appropriately. The study's primary goals are to make …
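The abstract is truncated before the study's scheme is described, so the following is only a minimal standard-library sketch of the underlying requirement, letting a data owner check that the cloud returned outsourced data intact, using an HMAC tag kept locally; the key, data, and verify helper are hypothetical:

```python
import hashlib, hmac, os

key = os.urandom(32)                  # secret retained by the data owner
data = b"sensitive record outsourced to the cloud"
tag = hmac.new(key, data, hashlib.sha256).digest()   # tag kept locally

def verify(returned_blob: bytes) -> bool:
    """Recompute the tag over what the cloud returns and compare."""
    candidate = hmac.new(key, returned_blob, hashlib.sha256).digest()
    return hmac.compare_digest(candidate, tag)

print(verify(data))            # True: data returned intact
print(verify(data + b"!"))     # False: corruption or tampering detected
```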