XGBOOST AND COST-SENSITIVE CART FOR IMBALANCED MULTICLASS DIABETES CLASSIFICATION IN IRAQ

Nabila A. Alsharif; Inaam Aboud Hussain; Loaiy F. Naji

Publication Date

Tue Feb 03 2026

Journal Name

Journal Of Mechanics Of Continua And Mathematical Sciences

Volume

21

Issue Number

2

Classification

XGBoost

CART

Class imbalance

Diabetes

Pre diabetic

Diabetes imposes a substantial public health burden. According to the International Diabetes Federation, there were about 3.4 million diabetes-related deaths worldwide in 2024. In Iraq, the Federation reports that one in nine adults lived with diabetes in 2024, with 14,683 adult deaths attributable to diabetes and a total diabetes-related health expenditure of 2,078 million United States dollars. The dataset analyzed in this study contains 1,000 records collected in 2020 from two Iraqi teaching hospitals. It includes multiple clinical and laboratory measurements and three outcome classes: Non diabetic, Pre diabetic, and Diabetic. The Pre diabetic class has low prevalence, and the overall class distribution is imbalanced. The data are challenging because they contain many outliers, non-homogeneous covariance matrices across classes, exact duplicate rows, and linear correlations among certain variables. The study objective was to train and evaluate models that discriminate among the three classes and yield accurate, well-calibrated predictions for future cases in similar clinical settings. Because the diagnostic properties of the data limited the applicability of classical discriminant functions, two supervised learners were employed: Classification and Regression Trees (CART) and Extreme Gradient Boosting (XGBoost). Preprocessing removed exact duplicate rows and excluded VLDL, which is algebraically derived from triglycerides (in mmol per liter, VLDL equals triglycerides divided by 2.2) and would therefore introduce redundancy and multicollinearity.
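The two preprocessing steps named in the abstract, removing exact duplicate rows and excluding VLDL as a deterministic function of triglycerides, can be illustrated with a minimal sketch. The records, the column names (`TG`, `VLDL`, `HbA1c`, `CLASS`), and the helper functions below are hypothetical and chosen only for illustration; they are not the study's data or code.

```python
# Hypothetical records; column names and values are assumptions for illustration.
records = [
    {"TG": 2.2, "VLDL": 1.0, "HbA1c": 5.1, "CLASS": "Non diabetic"},
    {"TG": 2.2, "VLDL": 1.0, "HbA1c": 5.1, "CLASS": "Non diabetic"},  # exact duplicate
    {"TG": 4.4, "VLDL": 2.0, "HbA1c": 6.1, "CLASS": "Pre diabetic"},
    {"TG": 3.3, "VLDL": 1.5, "HbA1c": 9.0, "CLASS": "Diabetic"},
]


def drop_exact_duplicates(rows):
    """Keep the first occurrence of each row that is identical in every column."""
    seen = set()
    unique = []
    for row in rows:
        key = tuple(sorted(row.items()))  # order-independent, hashable row signature
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


def vldl_is_redundant(rows, tol=1e-9):
    """Check whether VLDL equals TG / 2.2 (both in mmol/L) for every row."""
    return all(abs(row["VLDL"] - row["TG"] / 2.2) <= tol for row in rows)


def drop_column(rows, name):
    """Return copies of the rows without the named column."""
    return [{k: v for k, v in row.items() if k != name} for row in rows]


deduped = drop_exact_duplicates(records)
redundant = vldl_is_redundant(deduped)
# Drop VLDL only when it carries no information beyond triglycerides.
cleaned = drop_column(deduped, "VLDL") if redundant else deduped
```

In real laboratory data the relation may hold only up to rounding, so the tolerance `tol` would probably need to be looser than in this toy example.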
On the held-out test set, XGBoost achieved higher Accuracy (98.18 percent versus 97.58 percent for CART) and higher Balanced Accuracy (93.84 percent versus 88.16 percent for CART). XGBoost therefore provided the strongest overall operating point for this three-class task, while CART remains useful when simple and transparent rules are required.
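Accuracy and Balanced Accuracy can diverge on imbalanced data because Balanced Accuracy is the unweighted mean of the per-class recalls, so a small class such as Pre diabetic counts as much as a large one. The sketch below shows the calculation on a hypothetical confusion matrix; the counts are invented for illustration and are not the study's results.

```python
# Hypothetical 3x3 confusion matrix (rows = true class, columns = predicted class).
# The counts are invented for illustration and are NOT the study's results.
labels = ["Non diabetic", "Pre diabetic", "Diabetic"]
confusion = [
    [18, 1, 1],   # true Non diabetic
    [2, 6, 2],    # true Pre diabetic (minority class)
    [0, 1, 69],   # true Diabetic
]


def accuracy(cm):
    """Fraction of all cases on the diagonal (correctly classified)."""
    correct = sum(cm[i][i] for i in range(len(cm)))
    total = sum(sum(row) for row in cm)
    return correct / total


def per_class_recall(cm):
    """Recall of each class: correct predictions divided by true cases in that class."""
    return [cm[i][i] / sum(cm[i]) for i in range(len(cm))]


def balanced_accuracy(cm):
    """Unweighted mean of per-class recalls."""
    recalls = per_class_recall(cm)
    return sum(recalls) / len(recalls)


acc = accuracy(confusion)
bal_acc = balanced_accuracy(confusion)
for name, recall in zip(labels, per_class_recall(confusion)):
    print(f"{name}: recall = {recall:.3f}")
print(f"Accuracy = {acc:.3f}, Balanced Accuracy = {bal_acc:.3f}")
```

In this toy matrix the weak recall on the minority class pulls Balanced Accuracy well below Accuracy, which is the kind of gap the two reported metrics are meant to expose.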

Publication Date

Sat Jul 01 2017

Journal Name

2017 Computing Conference

Protecting a sensitive dataset using a time based password in big data

Omar Z.

G. J.

H. S.

Publication Date

Sun Dec 30 2012

Journal Name

Al-kindy College Medical Journal

Initial Recognition and Prophecy of Diabetic Nephropathy in Type I Diabetes in a Sample of Iraqi Patients

diabetic nephropathy

diabetes mellitus

case control

Wijdan

Background: Diabetic nephropathy (DN) is rapidly becoming the leading cause of end-stage renal disease (ESRD). The onset and course of DN can be ameliorated to a very significant degree if intervention is instituted very early in the development of this complication.
Objective: The aim of this study was to characterize risk factors associated with nephropathy in type I diabetes and to construct a module for early prediction of DN by analyzing these risk factors.
Methods: Case-control design of 400 patients with type I diabetes mellitus (IDDM), aged 19-45 years. The cases were 200 diabetic patients with overt proteinuria, while the controls were 200 diabetic patients with no proteinuria or micr

Publication Date

Wed Dec 01 2010

Journal Name

Baghdad Science Journal

Relation between Body Iron Store and Insulin Resistance in Type 2 Diabetes

Type 2 diabetes.

Body iron store

Wafa F

Inaam A

Mayada S.

This study investigated the clinical impact of the interaction between body iron status (serum iron and ferritin) and type 2 diabetes. Thirty-six females were enrolled: eighteen with type 2 diabetes and eighteen apparently healthy. The two groups were matched for age and body mass index (BMI). The eighteen diabetic females were also matched for age, BMI, pharmacological treatment (oral hypoglycemic agent), and chronic diabetes complications. The biochemical parameters measured for both groups (controls and diabetic patients) were fasting insulin (Io), fasting blood glucose (Go), serum iron, and ferritin. A significant increase in all parameters was observed in patients compared with healthy controls. The insulin resistance (IR) which was calculat

Publication Date

Tue Jul 31 2018

Journal Name

Journal Of Theoretical And Applied Information Technology

Classification and monitoring of autism using svm and vmcm

Autism

Eye tracking

Classification

VMCM

SVM

PROF. KESRA

WAJIH ABDUL GHANI

AMMAR IBRAHIM

Autism is a lifelong developmental deficit that affects how people perceive the world and interact with each other. An estimated one in more than 100 people has autism. Autism affects almost four times as many boys as girls. The tools commonly used to analyze autism datasets are fMRI, EEG, and, more recently, eye tracking. A preliminary study of patients' eye-tracking trajectories showed that a rudimentary statistical analysis (principal component analysis) provides interesting results on the statistical parameters studied, such as the time spent in a region of interest. Another study, which applied tools from Euclidean and non-Euclidean geometry to patients' eye trajectories, also showed interesting results. In this

Publication Date

Wed Dec 08 2021

Journal Name

Scientific Reports

Weakly Supervised Sensitive Heatmap framework to classify and localize diabetic retinopathy lesions

Mohammed

Ameer Hussein

Mustafa

MD Samiul

Abstract: Vision loss happens due to diabetic retinopathy (DR) in severe stages. Thus, an automatic detection method applied to diagnose DR in an earlier phase may help medical doctors to make better decisions. DR is considered one of the main risks, leading to blindness. Computer-Aided Diagnosis systems play an essential role in detecting features in fundus images. Fundus images may include blood vessels, exudates, microaneurysms, hemorrhages, and neovascularization. In this paper, our model combines automatic detection for the diabetic retinopathy classification with localization methods depending on weakly-supervised learning. The model has four stages; in stage one, various preprocessing techniques are app

Publication Date

Tue Nov 06 2018

Journal Name

Iraqi National Journal Of Nursing Specialties

Lipid Profile and Insulin Resistance in Patients with Type-II Diabetes Mellitus

Glycoprotein

Insulin Resistance

Type-II Diabetes Mellitus

Hadil

Layla

majeed

awaz

Objective: To investigate the relation between dyslipidemia and insulin resistance, one of the metabolic disorders in patients with type-II diabetes mellitus, and to compare the results with a control group.
Methodology: Blood samples were collected from 35 patients with type-II diabetes mellitus; in addition, 35 healthy individuals were enrolled as a control group. All subjects were aged 20-50. Serum was used to determine glucose, insulin, and the lipid profile (cholesterol (Ch), triglyceride (TG), high-density lipoprotein (HDL-Ch), low-density lipoprotein (LDL-Ch), and very low-density lipoprotein (VLDL)) for the patient and control groups. Insulin resistance (IR) was calculated acco

Publication Date

Sun Jan 14 2018

Journal Name

Journal Of Engineering

Efficient Cost Management in the Housing Projects

Safaa AL-Deen

sajeda kadum

Kreem Hassn

In housing projects, managing cost indicators at the planning and design level is among the most important quality indicators, because it supports the adoption of efficient planning and design strategies. This research therefore highlights the most effective and influential cost indicators in housing projects and determines strategies for managing them, in order to raise the quality and efficiency of housing projects and to suit the income level of the target group, while taking housing quality standards into consideration and meeting the basic requirements of housing. This paper highlights the importance of cost management, the types of housing cost, the method

Publication Date

Fri Oct 24 2025

Journal Name

Chemical Papers

Development of an advanced flow injection method using curcumin nanoparticle fluorescence for sensitive detection of cobalt (II) and nitrite ions

Wafaa Waleed

Turkey Nagham

Publication Date

Sun Mar 26 2017

Journal Name

Iraqi Journal Of Pharmaceutical Sciences (P-ISSN 1683-3597, E-ISSN 2521-3512)

Gestational Diabetes Mellitus and Hormonal Alteration

Sura

Amer

Aufaira

Gestational diabetes mellitus is defined as carbohydrate intolerance first detected during pregnancy. Pregnancy is a period of intense hormonal change. The aim of the present study was to investigate a possible relation between gestational diabetes mellitus and changes in serum hormones such as luteinizing hormone (LH), follicle stimulating hormone (FSH), progesterone, and prolactin. Thirty patients with gestational diabetes mellitus aged 22-40 years attending the National Center for Treatment and Research of Diabetes, Al-Mustansiriya University, Baghdad, and 29 controls aged 20-39 years participated. Hormonal tests, including FSH, LH, progesterone, and prolactin, were performed using Enzyme Linked Fluorescent Assay (ELFA) k

Publication Date

Tue Sep 01 2020

Journal Name

Al-khwarizmi Engineering Journal

Two-Stage Classification of Breast Tumor Biomarkers for Iraqi Women

Iyden Kamil

Ali Hussein

Javier

Objective: Breast cancer is regarded as a deadly disease in women and causes many deaths. Early diagnosis of breast cancer with appropriate tumor biomarkers may facilitate early treatment of the disease, thus reducing the mortality rate. The purpose of the current study is to improve early diagnosis of breast cancer by proposing a two-stage classification of breast tumor biomarkers for a sample of Iraqi women.

Methods: In this study, a two-stage classification system is proposed and tested with four machine learning classifiers. In the first stage, breast features (demographic, blood and salivary-based attributes) are classified into normal or abnormal cases, while in the second stage the abnormal breast cases are
