The influx of data in bioinformatics is primarily in the form of DNA, RNA, and protein sequences. This condition places a significant burden on scientists and computers. Some genomics studies depend on clustering techniques to group similarly expressed genes into one cluster. Clustering is a type of unsupervised learning that can be used to divide unknown cluster data into clusters. The k-means and fuzzy c-means (FCM) algorithms are examples of algorithms that can be used for clustering. Consequently, clustering is a common approach that divides an input space into several homogeneous zones; it can be achieved using a variety of algorithms. This study used three models to cluster a brain tumor dataset. The first model uses FCM, which is used to cluster genes. FCM allows an object to belong to two or more clusters with a membership grade between zero and one and the sum of belonging to all clusters of each gene is equal to one. This paradigm is useful when dealing with microarray data. The total time required to implement the first model is 22.2589 s. The second model combines FCM and particle swarm optimization (PSO) to obtain better results. The hybrid algorithm, i.e., FCM–PSO, uses the DB index as objective function. The experimental results show that the proposed hybrid FCM–PSO method is effective. The total time of implementation of this model is 89.6087 s. The third model combines FCM with a genetic algorithm (GA) to obtain better results. This hybrid algorithm also uses the DB index as objective function. The experimental results show that the proposed hybrid FCM–GA method is effective. Its total time of implementation is 50.8021 s. In addition, this study uses cluster validity indexes to determine the best partitioning for the underlying data. Internal validity indexes include the Jaccard, Davies Bouldin, Dunn, Xie–Beni, and silhouette. Meanwhile, external validity indexes include Minkowski, adjusted Rand, and percentage of correctly categorized pairings. Experiments conducted on brain tumor gene expression data demonstrate that the techniques used in this study outperform traditional models in terms of stability and biological significance.
This research proposes the application of the dragonfly and fruit fly algorithms to enhance estimates generated by the Fama-MacBeth model and compares their performance in this context for the first time. To specifically improve the dragonfly algorithm's effectiveness, three parameter tuning approaches are investigated: manual parameter tuning (MPT), adaptive tuning by methodology (ATY), and a novel technique called adaptive tuning by performance (APT). Additionally, the study evaluates the estimation performance using kernel weighted regression (KWR) and explores how the dragonfly and fruit fly algorithms can be employed to enhance KWR. All methods are tested using data from the Iraq Stock Exchange, based on the Fama-French three-f
... Show MoreProjects suspensions are between the most insistent tasks confronted by the construction field accredited to the sector’s difficulty and its essential delay risk foundations’ interdependence. Machine learning provides a perfect group of techniques, which can attack those complex systems. The study aimed to recognize and progress a wellorganized predictive data tool to examine and learn from delay sources depend on preceding data of construction projects by using decision trees and naïve Bayesian classification algorithms. An intensive review of available data has been conducted to explore the real reasons and causes of construction project delays. The results show that the postpo
Autism spectrum disorder(ASD) is a neurological condition marked by impaired communication abilities, social detachment, and repetitive behaviors in individuals. Global health organization facing difficulties in establishing an effective ASD diagnostic system that facilitates precise analysis and early autism prediction. It is a scientific issue that necessitates resolution. This research presents an approach for the early prediction of children with ASD utilizing significant variables through machine learning (ML) methods. Three stages comprise the suggested technique. First, a 1250-case ASD dataset was identified and preprocessed. Five extremely effective traits with high Pearson c
This study presents the multi criteria single-machine model. The machine scheduling problem (MSP) for ntasks on a single machine involves minimizing a function of three criteria: total completion time (C_j),maximum earliest (E_max), and tardiness (〖ΣT〗_j), This is an NP-hard issue. Within this work's theoretical section, we present the mathematical formulation of The presented topic thenhighlights the usefulness of the dominance rule (DR), which may be used to develop effective solutions. Whilein the practical part, one of the important exact methods; The proposed MSP tricriteria are solved by applyingthe Branch and Bound (BAB) method, which finds a set of efficient solutions for 1//F(ΣC_j ,ΣT_j ,E_max) upto n=100 jobs. The BAB appro
... Show MoreWith the proliferation of both Internet access and data traffic, recent breaches have brought into sharp focus the need for Network Intrusion Detection Systems (NIDS) to protect networks from more complex cyberattacks. To differentiate between normal network processes and possible attacks, Intrusion Detection Systems (IDS) often employ pattern recognition and data mining techniques. Network and host system intrusions, assaults, and policy violations can be automatically detected and classified by an Intrusion Detection System (IDS). Using Python Scikit-Learn the results of this study show that Machine Learning (ML) techniques like Decision Tree (DT), Naïve Bayes (NB), and K-Nearest Neighbor (KNN) can enhance the effectiveness of an Intrusi
... Show MoreMany economists believe that the development and promotion of small and medium-sized enterprises is one of the most important sources of economic and social development in countries in general and in developing countries in particular. This is considered to be an essential starting point for increasing production capacity and contributing to tackling poverty and unemployment. In view of the importance of these projects, most developing countries have concentrated their efforts on them. They have encouraged the establishment of small and medium industries, especially after they have proved their ability and efficiency in dealing with the major problems facing different economies.
The banking system plays an important role by finan
... Show MoreThe paper examines key aspects of the use of phraseologi-cal units related to colors in Russian culture and speech. It explores their role in shaping cultural identity, reflecting national characteristics and men-tality. The study analyzes the frequency and contexts of the use of color-related phraseological units in contemporary speech, as well as the influ-ence of media and literature on their popularization. The author highlights the significance of phraseological units in preserving cultural heritage and fostering a deeper understanding of language and culture.
The development of better tools for diagnosis and more accurate prognosis of cancer includes the search for biomarkers; molecules whose presence, absence or change in quantity or structure is associated with a particular tumour or prognosis/therapeutic outcome. While biomarkers need not be functionally relevant, if cell survival, then they could also provide new targets for therapeutic drugs. In recent years attention has been applied to a group of proteins known as cancer testis antigens (CT antigens) [1]. These proteins are products of genes whose expression was normally confined to the testis, yet they are expressed in tumour cells. CT genes are bound to serve a wide array of roles in the testes, which have many highly differentiated cel
... Show More