Big data analysis has important applications in many areas such as sensor networks and connected healthcare. The high volume and velocity of big data bring many challenges to data analysis. One possible solution is to summarize the data and provide a manageable data structure that holds a scalable summarization of the data for efficient and effective analysis. This research extends our previous work on developing an effective technique to create, organize, access, and maintain summarizations of big data, and develops algorithms for Bayes classification and entropy discretization of large data sets using the multi-resolution data summarization structure. Bayes classification and data discretization play essential roles in many learning algorithms such as decision trees and nearest neighbor search. The proposed method can handle streaming data efficiently and, for entropy discretization, provides the optimal split value.
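As an illustration of the entropy discretization step, the following is a minimal sketch of choosing the split value of a single numeric attribute by minimizing the weighted child entropy (equivalently, maximizing information gain). It operates on raw values rather than the paper's multi-resolution summarization structure, and the function names are illustrative only.

```python
import numpy as np

def entropy(labels):
    """Shannon entropy of a label array."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

def best_entropy_split(values, labels):
    """Return the split value that minimizes the weighted child entropy
    (maximizes information gain) for one numeric attribute."""
    order = np.argsort(values)
    values, labels = values[order], labels[order]
    best_split, best_h = None, np.inf
    # Candidate splits are midpoints between consecutive distinct values.
    for i in range(1, len(values)):
        if values[i] == values[i - 1]:
            continue
        split = (values[i] + values[i - 1]) / 2.0
        left, right = labels[:i], labels[i:]
        h = (len(left) * entropy(left) + len(right) * entropy(right)) / len(labels)
        if h < best_h:
            best_h, best_split = h, split
    return best_split, best_h

# Toy usage: the best split falls between the two class clusters (near 5.5).
x = np.array([1.0, 2.0, 3.0, 8.0, 9.0, 10.0])
y = np.array([0, 0, 0, 1, 1, 1])
print(best_entropy_split(x, y))
```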
In this paper, we derive estimators of the parameters, reliability function, and hazard function of a new mixed distribution (Rayleigh-Logarithmic) with two parameters and an increasing failure rate, using the Bayes method with a squared error loss function, a Jeffreys prior, and the conditional probability of the observed random variable. The main objective of this study is to assess the efficiency of the derived Bayesian estimators compared with the maximum likelihood estimators of these functions, using Monte Carlo simulation under different Rayleigh-Logarithmic parameter values and sample sizes. The results show that the Bayes estimator is more efficient than the maximum likelihood estimator for all sample sizes, and this is illustrated with an application.
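A minimal Monte Carlo sketch of the kind of comparison described, assuming a plain Rayleigh model rather than the Rayleigh-Logarithmic mixture itself: under a Jeffreys prior and squared error loss, the Bayes estimator of the Rayleigh scale parameter theta = sigma^2 is the posterior mean T/(n-1) with T = sum(x_i^2)/2, while the MLE is T/n; their mean squared errors are compared over repeated samples. The parameter values and sample size are illustrative only.

```python
import numpy as np

rng = np.random.default_rng(1)

def simulate(sigma2=2.0, n=30, reps=5000):
    """Compare MLE and Bayes (Jeffreys prior, squared error loss)
    estimators of the Rayleigh scale parameter theta = sigma^2."""
    sigma = np.sqrt(sigma2)
    mse_mle = mse_bayes = 0.0
    for _ in range(reps):
        x = rng.rayleigh(scale=sigma, size=n)
        t = np.sum(x ** 2) / 2.0      # sufficient statistic
        theta_mle = t / n             # maximum likelihood estimate
        theta_bayes = t / (n - 1)     # posterior mean under the Jeffreys prior
        mse_mle += (theta_mle - sigma2) ** 2
        mse_bayes += (theta_bayes - sigma2) ** 2
    mse_mle /= reps
    mse_bayes /= reps
    return mse_mle, mse_bayes, mse_mle / mse_bayes   # relative efficiency

print(simulate())
```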
Deep learning algorithms have recently achieved a lot of success, especially in the field of computer vision. This research describes a classification method applied to a dataset of multiple types of images: Synthetic Aperture Radar (SAR) images and non-SAR images. For this classification, transfer learning was used, followed by fine-tuning. In addition, architectures pre-trained on the well-known ImageNet database were used. The VGG16 model was used as a feature extractor, and a new classifier was trained on the extracted features. The input data consisted of five classes: one SAR image class (houses) and four non-SAR image classes (Cats, Dogs, Horses, and Humans). The Conv…
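A hedged sketch of this feature-extraction setup in Keras: the VGG16 convolutional base pre-trained on ImageNet is frozen and a small new classifier is trained on top. The input size, classifier layers, and directory layout are assumptions for illustration, not the paper's exact configuration.

```python
import tensorflow as tf
from tensorflow.keras import layers, models
from tensorflow.keras.applications import VGG16

# Load the VGG16 convolutional base pre-trained on ImageNet, without the
# fully connected top, and freeze it so only the new classifier is trained.
base = VGG16(weights="imagenet", include_top=False, input_shape=(224, 224, 3))
base.trainable = False

model = models.Sequential([
    base,
    layers.Flatten(),
    layers.Dense(256, activation="relu"),
    layers.Dropout(0.5),
    layers.Dense(5, activation="softmax"),   # five classes: houses (SAR), cats, dogs, horses, humans
])
model.compile(optimizer="adam",
              loss="categorical_crossentropy",
              metrics=["accuracy"])

# Hypothetical directory layout: one sub-folder per class under data/train.
train_ds = tf.keras.utils.image_dataset_from_directory(
    "data/train", image_size=(224, 224), batch_size=32, label_mode="categorical")
# model.fit(train_ds, epochs=10)
```

Fine-tuning would then typically unfreeze the last convolutional block of the base and continue training with a small learning rate.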
The aim of the present study was to distinguish between healthy children and children with epilepsy by electroencephalography (EEG). Two biomarkers, the Hurst exponent (H) and Tsallis entropy (TE), were used to investigate the background EEG activity of 10 healthy children and 10 children with epilepsy. EEG artifacts were removed using a Savitzky-Golay (SG) filter. As hypothesized, there were significant changes in irregularity and complexity in the epileptic EEG compared with the healthy control subjects under a t-test (p < 0.05). The increases in complexity observed in the H and TE results of the epileptic subjects make them suggested EEG biomarkers associated with epilepsy and a reliable tool for detection and identification of this disorder.
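A small sketch of the two biomarkers and the artifact filtering, under simple assumptions: a histogram-based Tsallis entropy with a chosen q, a crude rescaled-range (R/S) Hurst estimate, and scipy's Savitzky-Golay filter. The parameter values and the synthetic trace are illustrative, not those used in the study.

```python
import numpy as np
from scipy.signal import savgol_filter

def tsallis_entropy(x, q=2.0, bins=64):
    """Tsallis entropy S_q = (1 - sum(p_i^q)) / (q - 1) of the signal's
    amplitude distribution, estimated from a histogram."""
    hist, _ = np.histogram(x, bins=bins)
    p = hist / hist.sum()
    p = p[p > 0]
    return (1.0 - np.sum(p ** q)) / (q - 1.0)

def hurst_rs(x):
    """Crude rescaled-range (R/S) estimate of the Hurst exponent."""
    n = len(x)
    sizes = np.unique(np.logspace(3, np.log2(n // 2), num=10, base=2).astype(int))
    rs = []
    for s in sizes:
        segments = x[: (n // s) * s].reshape(-1, s)
        dev = np.cumsum(segments - segments.mean(axis=1, keepdims=True), axis=1)
        r = dev.max(axis=1) - dev.min(axis=1)
        sd = segments.std(axis=1)
        rs.append(np.mean(r[sd > 0] / sd[sd > 0]))
    slope, _ = np.polyfit(np.log(sizes), np.log(rs), 1)
    return slope

# Toy EEG-like trace, smoothed with a Savitzky-Golay filter to suppress artifacts.
rng = np.random.default_rng(0)
eeg = np.cumsum(rng.standard_normal(4096))
eeg_clean = savgol_filter(eeg, window_length=31, polyorder=3)
print("TE:", tsallis_entropy(eeg_clean), "H:", hurst_rs(eeg_clean))
```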
The use of remote sensing techniques for managing and monitoring environmental areas is increasing due to improvements in the sensors carried by Earth observation satellites. The resolution merge process combines a high-resolution single-band image with a low-resolution multi-band image to produce one image that is high in both spatial and spectral resolution. In this work, different merging methods were tested to evaluate their enhancement capabilities for extracting different environmental areas; Principal Component Analysis (PCA), Brovey, modified Intensity-Hue-Saturation (IHS), and High Pass Filter methods were tested and subjected to visual and statistical comparison for evaluation. Both visual…
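Of the tested methods, the Brovey transform is the simplest to state: each multispectral band is scaled by the ratio of the panchromatic band to the sum of the multispectral bands. A minimal sketch, assuming the multispectral bands have already been resampled to the panchromatic grid:

```python
import numpy as np

def brovey_merge(ms, pan, eps=1e-6):
    """Brovey transform pan-sharpening.

    ms  : (bands, H, W) multispectral image resampled to the panchromatic grid
    pan : (H, W) high-resolution panchromatic band
    Each output band is ms_i * pan / sum_of_ms_bands, keeping the spectral
    ratios of the multispectral image while taking spatial detail from pan.
    """
    ms = ms.astype(np.float64)
    pan = pan.astype(np.float64)
    total = ms.sum(axis=0) + eps
    return ms * (pan / total)[None, :, :]

# Toy usage with random data standing in for resampled imagery.
rng = np.random.default_rng(0)
ms = rng.uniform(0, 255, size=(3, 64, 64))
pan = rng.uniform(0, 255, size=(64, 64))
print(brovey_merge(ms, pan).shape)   # (3, 64, 64)
```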
Two unsupervised classifiers for optimal multi-thresholding are presented: fast Otsu and k-means. These non-parametric methods provide an efficient procedure for separating regions (classes) by selecting optimal levels, either on the gray levels of the image histogram (the Otsu classifier) or on the gray levels of the image intensities (the k-means classifier); these levels represent the threshold values of the classes. To compare the experimental results of the two classifiers, the computation time was recorded, along with the number of iterations the k-means classifier needs to converge to the optimal class centers. The variation in the recorded computation time for the k-means classifier is discussed.
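A minimal sketch of the two thresholding approaches, using scikit-image's multi-level Otsu on the histogram and a simple one-dimensional k-means on the pixel intensities; the initialization and iteration limit are illustrative choices, not the paper's implementation.

```python
import numpy as np
from skimage.filters import threshold_multiotsu   # multi-level Otsu on the histogram

def kmeans_gray_levels(image, k=3, iters=50):
    """1-D k-means on pixel intensities; thresholds are midpoints
    between the sorted cluster centers."""
    pixels = image.ravel().astype(np.float64)
    centers = np.linspace(pixels.min(), pixels.max(), k)   # simple initialization
    for _ in range(iters):
        labels = np.argmin(np.abs(pixels[:, None] - centers[None, :]), axis=1)
        new_centers = np.array([pixels[labels == j].mean() if np.any(labels == j)
                                else centers[j] for j in range(k)])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    centers = np.sort(centers)
    return (centers[:-1] + centers[1:]) / 2.0   # k-1 threshold values

# Toy image with three intensity populations.
rng = np.random.default_rng(0)
img = np.concatenate([rng.normal(50, 5, 1000),
                      rng.normal(120, 5, 1000),
                      rng.normal(200, 5, 1000)]).reshape(60, 50)
print("Otsu thresholds   :", threshold_multiotsu(img, classes=3))
print("k-means thresholds:", kmeans_gray_levels(img, k=3))
```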
Text categorization refers to the process of grouping texts or documents into classes or categories according to their content. The text categorization process consists of three phases: preprocessing, feature extraction, and classification. In comparison with English, only a few studies have addressed categorizing and classifying the Arabic language. For a variety of applications, such as text classification and clustering, Arabic text representation is a difficult task because Arabic is noted for its richness, diversity, and complicated morphology. This paper presents a comprehensive analysis and comparison of research from the last five years based on the dataset, year, algorithms, and the accuracy achieved.
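For context, a generic three-phase pipeline (preprocessing, feature extraction, classification) might look like the sketch below; the light diacritic stripping, TF-IDF features, and naive Bayes classifier are illustrative choices, not a method from any of the surveyed papers.

```python
import re
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import Pipeline

def preprocess(text):
    """Very light Arabic preprocessing: strip diacritics (harakat) and tatweel."""
    text = re.sub(r"[\u064B-\u0652]", "", text)
    return text.replace("\u0640", "")

# Toy corpus: a sports sentence and an economics sentence.
docs = ["خبر رياضي عن كرة القدم", "مقال اقتصادي عن الأسواق المالية"]
labels = ["sports", "economy"]

clf = Pipeline([
    ("tfidf", TfidfVectorizer(preprocessor=preprocess)),   # feature extraction
    ("nb", MultinomialNB()),                                # classification
])
clf.fit(docs, labels)
print(clf.predict(["تحليل جديد للأسواق المالية"]))
```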
Regression analysis is a cornerstone of statistics and mostly depends on the ordinary least squares method. However, as is well known, this method requires several conditions to operate accurately, and when they are not satisfied the results can be unreliable; the absence of certain conditions can even make it impossible to complete the analysis. Among those conditions is the absence of multicollinearity, and we detect that problem among the independent variables using the Farrar-Glauber test, in addition to the requirement that the data be linear. Because this last condition was not satisfied, we resorted to the…
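A sketch of the Farrar-Glauber overall chi-square test as it is commonly stated, chi2 = -[n - 1 - (2k + 5)/6] ln|R| with k(k-1)/2 degrees of freedom, where R is the correlation matrix of the k regressors and n is the number of observations; the data below are synthetic and only illustrate the detection step, not the study's dataset.

```python
import numpy as np
from scipy import stats

def farrar_glauber_chi2(X):
    """Farrar-Glauber overall test for multicollinearity.

    X : (n, k) matrix of regressors (one column per independent variable).
    Returns the chi-square statistic, degrees of freedom, and p-value for
    H0: the regressors are orthogonal (|R| = 1, no multicollinearity).
    """
    n, k = X.shape
    R = np.corrcoef(X, rowvar=False)            # correlation matrix of the regressors
    stat = -(n - 1 - (2 * k + 5) / 6.0) * np.log(np.linalg.det(R))
    df = k * (k - 1) // 2
    p = stats.chi2.sf(stat, df)
    return stat, df, p

# Toy data: x2 is almost a copy of x1, so collinearity should be detected.
rng = np.random.default_rng(0)
x1 = rng.normal(size=200)
x2 = x1 + rng.normal(scale=0.05, size=200)
x3 = rng.normal(size=200)
print(farrar_glauber_chi2(np.column_stack([x1, x2, x3])))   # large statistic, tiny p-value
```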
A Digital Elevation Model (DEM) is one of the developed techniques for relief representation. DEM construction is the modeling of the Earth's surface from existing data, and DEMs are a fundamental information requirement that is widely utilized in GIS data structures. The main aim of this research is to present a methodology for assessing DEM generation methods. The DEM data are extracted from open sources, e.g. Google Earth, and the tested data are compared with data produced by formal institutions such as the General Directorate of Surveying. The study area was chosen in the south of Iraq (Al-Gharraf, Dhi Qar governorate). The DEM creation methods are kriging…
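A minimal sketch of one way such an assessment can be scored, comparing DEM elevations against reference checkpoint elevations with simple vertical-accuracy statistics; the function and the checkpoint values are illustrative, not the study's actual data or its full methodology.

```python
import numpy as np

def dem_accuracy(dem_z, ref_z):
    """Vertical accuracy of a DEM against reference checkpoint elevations:
    mean error (bias), RMSE, and standard deviation of the differences."""
    diff = np.asarray(dem_z, float) - np.asarray(ref_z, float)
    return {"mean_error": diff.mean(),
            "rmse": np.sqrt(np.mean(diff ** 2)),
            "std": diff.std(ddof=1)}

# Toy checkpoints: elevations sampled from the tested DEM vs surveyed values (metres).
dem_z = [4.1, 3.8, 5.2, 4.7, 3.9]
ref_z = [4.0, 4.0, 5.0, 4.5, 4.1]
print(dem_accuracy(dem_z, ref_z))
```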