Data Mining Techniques for Iraqi Biochemical Dataset Analysis

Sarah Sameer; Suhad Faisal Behadili

doi:10.21123/bsj.2022.19.2.0385

Details

Publication Date

Fri Apr 01 2022

Journal Name

Baghdad Science Journal

Volume

19

Issue Number

2

DOI

10.21123/bsj.2022.19.2.0385

Biomedical

Classification And Regression Tree (CART)

Data mining

Hierarchical clustering

K-means.

This research aims to analyze and simulate real biochemical test data in order to uncover the relationships among the tests and how each of them impacts the others. The data were acquired from a private Iraqi biochemical laboratory. These data have many dimensions, a high rate of null values, and a large number of patients. Several experiments were applied to the data, beginning with unsupervised techniques such as hierarchical clustering and k-means, but the results were not clear. A preprocessing step was then performed to make the dataset analyzable by supervised techniques: Linear Discriminant Analysis (LDA), Classification And Regression Tree (CART), Logistic Regression (LR), K-Nearest Neighbor (K-NN), Naïve Bayes (NB), and Support Vector Machine (SVM). Among the six supervised algorithms, CART gave clear results with high accuracy. It is worth noting that the preprocessing steps take considerable effort for this type of data: the raw dataset has a null-value ratio of 94.8%, which drops to 0% after preprocessing. To apply the CART algorithm, several selected tests were treated as classes; the tests were chosen according to the accuracy obtained with them. Consequently, the results enable physicians to trace and connect test results with each other, which extends their impact on patients’ health.
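The abstract reports a null-value ratio of 94.8% before preprocessing and 0% after, but does not say how the nulls were handled. The sketch below only illustrates how such a ratio can be measured and driven to zero. It assumes two steps that are not taken from the paper: dropping columns that are mostly empty, then filling the remaining gaps with the column median. The function names, the `max_null` threshold, and the sample values are all hypothetical.

```python
from statistics import median


def null_ratio(rows):
    """Return the fraction of cells that are None across all rows."""
    total = sum(len(r) for r in rows)
    missing = sum(1 for r in rows for v in r if v is None)
    return missing / total if total else 0.0


def drop_sparse_columns(rows, max_null=0.5):
    """Keep only columns whose share of None values is at most max_null.

    The threshold is an illustrative assumption, not a value from the paper.
    Returns the reduced rows and the indices of the kept columns.
    """
    n_rows = len(rows)
    n_cols = len(rows[0]) if rows else 0
    keep = [
        c for c in range(n_cols)
        if sum(1 for r in rows if r[c] is None) / n_rows <= max_null
    ]
    return [[r[c] for c in keep] for r in rows], keep


def impute_median(rows):
    """Replace each remaining None with the median of its column.

    Median imputation is an assumed strategy, not the paper's stated method.
    """
    n_cols = len(rows[0]) if rows else 0
    medians = []
    for c in range(n_cols):
        observed = [r[c] for r in rows if r[c] is not None]
        medians.append(median(observed) if observed else 0.0)
    return [[medians[c] if v is None else v for c, v in enumerate(r)] for r in rows]


# Hypothetical lab results: each row is a patient, each column is a test.
raw = [
    [5.1, None, None, 0.9],
    [None, None, 140.0, 1.1],
    [4.8, None, None, None],
    [5.5, 7.2, None, 1.0],
]

before = null_ratio(raw)                        # share of missing cells in raw data
dense, kept = drop_sparse_columns(raw, 0.5)     # discard mostly-empty tests
clean = impute_median(dense)                    # fill the remaining gaps
after = null_ratio(clean)                       # share of missing cells afterwards
```

A table cleaned this way has no missing cells, which is the precondition for feeding it to classifiers such as CART. Whether dropping columns or imputing values is appropriate depends on the clinical meaning of each test.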

Publication Date

Sun Jan 01 2023

Journal Name

2nd International Conference On Mathematical Techniques And Applications: Icmta2021

Review of clustering for gene expression data

Omar

Basad

Publication Date

Tue Mar 22 2016

Journal Name

Iraqi Journal Of Market Research And Consumer Protection

CONSUMPTION EFFECT OF PROTEIN SUPPLEMENTS IN THE BIOCHEMICAL PARAMETER FOR SOME YOUNG MUSCLE BUILDERS

Building muscles

Protein supplements

Alanine amino transferase

Kidney function

Body mass index

Zahraa I. Abudal

Suhayla K.

Lamia shaker

Many people take protein supplements in an effort to gain muscle. However, there is some controversy as to whether this is really effective. There is evidence suggesting that consuming high levels of protein may in fact have negative health effects. The current study included 29 young Iraqi muscle builders in two groups (those taking protein supplements and those not taking them; age range 17-31 years). The cases were selected from family, friends, college students, and gyms from November 2014 to March 2015. A careful history was obtained from each volunteer, including age, duration of sports, type of supplements, and family history of diseases. Some biochemical parameters like (glucose, urea, uric acid, creatinine, bilirubin, serum protei

Publication Date

Thu Mar 17 2016

Journal Name

International Journal Of Computer Applications

Analysis of Wind Speed Data and Annual Energy Potential at Three locations in Iraq

Ali M.

Publication Date

Tue Dec 25 2018

Journal Name

Journal Of Engineering Science And Technology

RIETVELD TEXTURE REFINEMENT ANALYSIS OF LINDE TYPE A ZEOLITE FROM X-RAY DIFFRACTION DATA

Sama

BASMA

STUART

Publication Date

Sun Jan 01 2017

Journal Name

Statistical Applications In Genetics And Molecular Biology

Mixture model-based association analysis with case-control data in genome wide association studies

genome wide association studies

haplotype mixture model

odds ratios

testing for inheritance patterns

ALI

Jian

Multilocus haplotype analysis of candidate variants with genome wide association studies (GWAS) data may provide evidence of association with disease, even when the individual loci themselves do not. Unfortunately, when a large number of candidate variants are investigated, identifying risk haplotypes can be very difficult. To meet the challenge, a number of approaches have been put forward in recent years. However, most of them are not directly linked to the disease-penetrances of haplotypes and thus may not be efficient. To fill this gap, we propose a mixture model-based approach for detecting risk haplotypes. Under the mixture model, haplotypes are clustered directly according to their estimated d

Publication Date

Tue Dec 01 2020

Journal Name

Baghdad Science Journal

A Modified Support Vector Machine Classifiers Using Stochastic Gradient Descent with Application to Leukemia Cancer Type Dataset

Classification

Dimension Reduction

Feature Selection

Leukemia Diagnosis

Stochastic Gradient Descend.

Ghadeer JM

Support vector machines (SVMs) are supervised learning models that analyze data for classification or regression. For classification, SVM is widely used by selecting an optimal hyperplane that separates two classes. SVM has very good accuracy and is extremely robust compared with some other classification methods such as logistic linear regression, random forest, k-nearest neighbor and naïve model. However, working with large datasets can cause many problems, such as long running times and inefficient results. In this paper, the SVM has been modified by using a stochastic gradient descent process. The modified method, stochastic gradient descent SVM (SGD-SVM), was checked using two simulation datasets. Since the classification of different ca

Publication Date

Tue Dec 01 2020

Journal Name

Journal Of Economics And Administrative Sciences

A proposed guideline for auditing revenues in the Iraqi environment according to IFRS 15

Revenue

International Financial Reporting Standards (IFRS 15)

Revenue Auditing

The proposed guideline for auditing revenues according to IFRS 15.

Mohammad Ibrahim

Bushra N. Abdullah

Deficiencies in revenue-related accounting standards, including American accounting standards as well as international accounting standards, prompted the issuance of the International Financial Reporting Standard IFRS 15 "Revenue from contracts with customers" as part of the convergence plan between the FASB and the International Accounting Standards Board (IASB), according to the requirements of the joint venture between the two councils. The standard aims to define the basis for reporting useful information to the users of the financial statements about the nature, amount, timing and uncertainty of the revenues and cash flows arising from a contract with the customer. The standard is base

Publication Date

Tue Dec 01 2020

Journal Name

Gulf Economist

The Bayesian Estimation in Competing Risks Analysis for Discrete Survival Data under Dynamic Methodology with Application to Dialysis Patients in Basra/ Iraq

Discrete hazard function

Dynamic modeling

Competing risks

Time-Varying effect

MAP method

MCMC method

Dialysis patients.

Asmaa

Omar Abdulmohsin

Survival analysis is a type of data analysis that describes the time period until the occurrence of an event of interest, such as death or other events of importance in determining what will happen to the phenomenon studied. There may be more than one endpoint for the event, in which case it is called competing risks. The purpose of this research is to apply the dynamic approach in the analysis of discrete survival time in order to estimate the effect of covariates over time, as well as to model the nonlinear relationship between the covariates and the discrete hazard function through the use of the multinomial logistic model and the multivariate Cox model. For the purpose of conducting the estimation process for both the discrete

Publication Date

Wed Dec 01 2010

Journal Name

Al-khwarizmi Engineering Journal

Design and Implementation of Iraqi Virtual Library

Bahaa I.

Mohammed Najm

Jalal B.

Jean- Noël

Barry

In developing countries, individual students and researchers are not able to afford the high price of subscriptions to international publishers such as JSTOR, ELSEVIER,…; therefore the governments and/or universities of those countries aim to purchase one global subscription to the international publishers to provide their educational resources at a cheaper price, or even freely, to all students and researchers of those institutions. To realize this concept, we must build a system that sits between the publishers and the users (students or researchers) and acts as a gatekeeper and a director of information: this system must register its users and must have adequate security to e

Publication Date

Mon Mar 01 2010

Journal Name

Journal Of Computer Science

Dropping down the Maximum Item Set: Improving the Stylometric Authorship Attribution Algorithm in the Text Mining for Authorship Investigation

Mustafa T.K.
