Bayes Classification and Entropy Discretization of Large Datasets using Multi-Resolution Data Aggregation

Safaa Alwajidi; Li Yang

doi:10.25046/aj050557

Details

Publication Date

Wed Jan 01 2020

Journal Name

Advances In Science, Technology And Engineering Systems Journal

Volume

5

Issue Number

5

Big data analysis has important applications in many areas such as sensor networks and connected healthcare. The high volume and velocity of big data bring many challenges to data analysis. One possible solution is to summarize the data and provide a manageable data structure that holds a scalable summary of the data for efficient and effective analysis. This research extends our previous work on developing an effective technique to create, organize, access, and maintain summarizations of big data, and develops algorithms for Bayes classification and entropy discretization of large data sets using the multi-resolution data summarization structure. Bayes classification and data discretization play essential roles in many learning algorithms such as decision tree and nearest neighbor search. The proposed method can handle streaming data efficiently and, for entropy discretization, provide the optimal split value.
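As a rough illustration of entropy discretization over summarized data, the sketch below picks a split value from per-bin class counts rather than from raw records. It is not the authors' multi-resolution structure: the flat list of bins, the function names `entropy` and `best_split`, and the choice to test only bin edges as candidate cuts are all assumptions made for this example.

```python
import math
from typing import Dict, List, Tuple


def entropy(counts: Dict[str, int]) -> float:
    """Shannon entropy (in bits) of a class-count dictionary."""
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return -sum((c / total) * math.log2(c / total)
                for c in counts.values() if c > 0)


def best_split(bins: List[Tuple[float, Dict[str, int]]]) -> Tuple[float, float]:
    """Find the bin edge that minimizes the weighted class entropy.

    bins: (upper_edge, class_counts) pairs sorted by upper_edge; this flat
    layout is a hypothetical stand-in for an aggregated data summary.
    Returns (split_value, weighted_entropy).
    """
    # Total class counts over all bins.
    total: Dict[str, int] = {}
    for _, counts in bins:
        for label, c in counts.items():
            total[label] = total.get(label, 0) + c
    n = sum(total.values())

    left: Dict[str, int] = {}
    best = (float("nan"), float("inf"))
    # Every edge except the last one is a candidate cut point.
    for edge, counts in bins[:-1]:
        for label, c in counts.items():
            left[label] = left.get(label, 0) + c
        right = {label: total[label] - left.get(label, 0) for label in total}
        n_left = sum(left.values())
        n_right = n - n_left
        if n_left == 0 or n_right == 0:
            continue  # a cut with an empty side is not a real split
        score = (n_left / n) * entropy(left) + (n_right / n) * entropy(right)
        if score < best[1]:
            best = (edge, score)
    return best
```

Because only bin edges are tested, the result of this sketch is the best cut at the resolution of the summary, which may differ from the best cut over the raw values.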

Publication Date

Tue Sep 01 2020

Journal Name

Al-khwarizmi Engineering Journal

Two-Stage Classification of Breast Tumor Biomarkers for Iraqi Women

Iyden Kamil

Ali Hussein

Javier

Objective: Breast cancer is a deadly disease that causes many deaths among women. Early diagnosis of breast cancer with appropriate tumor biomarkers may facilitate early treatment of the disease, thus reducing the mortality rate. The purpose of the current study is to improve early diagnosis of breast cancer by proposing a two-stage classification of breast tumor biomarkers for a sample of Iraqi women.

Methods: In this study, a two-stage classification system is proposed and tested with four machine learning classifiers. In the first stage, breast features (demographic, blood and salivary-based attributes) are classified into normal or abnormal cases, while in the second stage the abnormal breast cases are

Publication Date

Sun Jan 10 2016

Journal Name

British Journal Of Applied Science & Technology

The Effect of Classification Methods on Facial Emotion Recognition Accuracy

Facial emotions

feature selection

data clustering

modified K-Means clustering algorithm

LDA algorithm

Statistical classifier

Neural Network

Support Vector Machine (SVM)

Suhaila N.

Interest in developing accurate automatic facial emotion recognition methods is growing rapidly, and it remains an active research field in computer vision, artificial intelligence, and automation. However, building an automated system that matches the human ability to recognize facial emotion is challenging because of the lack of an effective facial feature descriptor and the difficulty of choosing a proper classification method. In this paper, a geometric-based feature vector is proposed. For classification, three types of methods are tested: statistical, artificial neural network (NN), and Support Vector Machine (SVM). A modified K-Means clustering algorithm

Publication Date

Fri Mar 01 2024

Journal Name

Iaes International Journal Of Artificial Intelligence (ij-ai)

Analyzing the behavior of different classification algorithms in diabetes prediction

Israa N.

Diabetes is one of the deadliest diseases in the world and can lead to stroke, blindness, organ failure, and amputation of lower limbs. Research indicates that diabetes can be controlled if it is detected at an early stage. Scientists are becoming more interested in using classification algorithms to diagnose diseases. In this study, we analyzed the performance of five classification algorithms, namely naïve Bayes, support vector machine, multilayer perceptron artificial neural network, decision tree, and random forest, using a diabetes dataset that contains information on 2000 female patients. Various metrics were applied in evaluating the performance of the classifiers such as precision, area under the c

Publication Date

Mon Dec 01 2014

Journal Name

Journal Of Economics And Administrative Sciences

Comparison between some of linear classification models with practical application

Linear discriminant analysis

binary response logistic regression and misclassification probability.

حمزة اسماعيل

Linear discriminant analysis and logistic regression are the most widely used multivariate statistical methods for analyzing data with categorical outcome variables. Both are appropriate for developing linear classification models. Linear discriminant analysis assumes that the explanatory variables follow a multivariate normal distribution, whereas logistic regression makes no assumptions about the distribution of the explanatory data. Hence, logistic regression is assumed to be the more flexible and more robust method when these assumptions are violated.

In this paper, we focus on a comparison of three forms of classification for data that belongs

Publication Date

Tue May 20 2008

Journal Name

Journal Of Planner And Development

Estimating Water Quality from Satellite Image and Reflectance Data

Sensing

أ.م.عبد الرزاق طه

Remote sensing techniques are useful in environmental engineering and other sciences because they save time, cost, and effort, and because they collect more accurate information under a monitoring mechanism. In this research, a number of statistical models were used to determine the best relationships between each water quality parameter and the mean reflectance values generated for different channels of a radiometer operated to simulate the Thematic Mapper satellite image. Among these models are regression models, which enable us to ascertain and utilize a relation between a variable of interest, called the dependent variable, and one or more independent variables

Publication Date

Mon Dec 05 2022

Journal Name

Baghdad Science Journal

Cloud Data Security through BB84 Protocol and Genetic Algorithm

Attribute based Encryption

BB84 Protocol Cloud

Data Security

Genetic Encryption/Decryption

Quantum Key Distribution

Jaydip

Vipin

In the current digitalized world, cloud computing has become a feasible solution for the virtualization of computing resources. Although cloud computing offers many advantages for outsourcing an organization’s information, strong security remains its main concern. Preventing identity authentication theft is a vital part of protecting cloud computing data. In such attacks, intruders violate the security protocols and attack the organization’s or user’s data. Cloud data disclosure leaves cloud users feeling insecure while using the cloud platform. Traditional cryptographic techniques are not able to stop such kinds of attacks. BB84 protocol is the first quantum cry

Publication Date

Tue Jun 01 2010

Journal Name

Al-khwarizmi Engineering Journal

Land Use/Cover Change Analysis Using Remote Sensing Data: A Case Study, Zhengzhou Area, Henan Province, China

Bassam F.

In the last two decades, arid and semi-arid regions of China have undergone rapid land use/cover change (LUCC) due to increasing demand for food resulting from a growing population. In this study, we established the land use/cover classification together with its remote sensing characteristics by analyzing the dynamics of LUCC in the Zhengzhou area for the period 1988-2006. A laminar extraction technique was applied to identify typical attributes of land use/cover types. A prominent result of the study is a gradual increase in urbanization accompanied by a gradual reduction in crop field area, due to the growing economy in Zhengzhou. The results also reflect degradati

Publication Date

Sat Dec 31 2022

Journal Name

Journal Of Economics And Administrative Sciences

Using Some Estimation Methods for Mixed-Random Panel Data Regression Models with Serially Correlated Errors with Application

FGLS estimation method

mixed-stochastic parameter regression model

first-order serial correlation

(MG) estimation method

Musaab

Mohammed

This research studies panel data models with mixed random parameters, which contain two types of parameters: random and fixed. The random parameters arise from differences in the marginal propensities of the cross sections, while the fixed parameters arise from differences in the intercepts and in the random errors of each section. The random errors exhibit heteroscedasticity in addition to first-order serial correlation. The main objective of this research is to use efficient methods suited to panel data in the case of small samples, and to achieve this goal, the feasible general least squa

Publication Date

Sun Jan 02 2011

Journal Name

Journal Of The Faculty Of Medicine Baghdad

Relationships of neonatal septicemia with the mean serum levels of IL-8 and IL-1 in three large hospitals in Baghdad

Neonatal septicemia (NNS)

Interleukins (ILs)

Yasmeen J

Nedhal S

Publication Date

Sun Feb 03 2019

Journal Name

Journal Of The College Of Education For Women

Detection of selected cells in multi choice sheets

منى مجيد
