Using VGG Models with Intermediate Layer Feature Maps for Static Hand Gesture Recognition

Osamah Y. Fadhil

doi:10.21123/bsj.2023.7364

Details

Publication Date

Sun Oct 01 2023

Journal Name

Baghdad Science Journal

Volume

20

Issue Number

5

Convolutional Neural Networks

Deep Learning

Hand Gesture Recognition

VGG-16

VGG-19

Osamah Y. Fadhil

Bashar S Mahdi

Ayad R. Abbas

A hand gesture recognition system provides a robust and innovative solution to nonverbal communication through human–computer interaction. Deep learning models have excellent potential for use in recognition applications. To overcome related issues, most previous studies have proposed new model architectures or have fine-tuned pre-trained models. Furthermore, these studies relied on one standard dataset for both training and testing, so their reported accuracy, while reasonable, reflects a single dataset. Unlike these works, the current study investigates two deep learning models with intermediate layers to recognize static hand gesture images. Both models were tested on different datasets, adjusted to suit each dataset, and then trained using three methods. First, the models were initialized with random weights and trained from scratch. Second, the pre-trained models were examined as feature extractors. Finally, the pre-trained models were fine-tuned with intermediate layers. Fine-tuning was conducted at three levels: the fifth, fourth, and third blocks, respectively. The models were evaluated through recognition experiments using hand gesture images in the Arabic sign language acquired under different conditions. This study also provides a new hand gesture image dataset, which was used in these experiments together with two other datasets. The experimental results indicated that the proposed models can be used with intermediate layers to recognize hand gesture images. Furthermore, the analysis of the results showed that fine-tuning the fifth and fourth blocks of these two models achieved the best accuracy. In particular, for the first model, the testing accuracies on the three datasets were 96.51%, 72.65%, and 55.62% when fine-tuning the fourth block and 96.50%, 67.03%, and 61.09% when fine-tuning the fifth block. The testing accuracy for the second model was approximately similar.
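The three fine-tuning levels described in the abstract can be illustrated with a small, framework-free sketch. It relies on the standard VGG-16 layout (five convolutional blocks with 2, 2, 3, 3, and 3 convolutional layers) and on Keras-style layer names such as block4_conv1. Two things are assumptions made for illustration and are not stated in the abstract: that "fine-tuning block k" means blocks k through 5 are updated while earlier blocks stay frozen, and the helper names layer_names and freeze_plan, which are hypothetical.

```python
# Standard VGG-16 convolutional layout: block index -> number of conv layers.
VGG16_BLOCKS = {1: 2, 2: 2, 3: 3, 4: 3, 5: 3}


def layer_names(blocks=VGG16_BLOCKS):
    """List conv layer names in order, using Keras-style 'blockN_convM' naming."""
    return [
        f"block{b}_conv{i}"
        for b, n in sorted(blocks.items())
        for i in range(1, n + 1)
    ]


def freeze_plan(first_trainable_block=None, blocks=VGG16_BLOCKS):
    """Map each conv layer name to True (trainable) or False (frozen).

    Assumption (not from the abstract): fine-tuning "block k" updates
    blocks k..5 and keeps blocks 1..k-1 at their pre-trained weights.
    Passing None freezes every block, i.e. the feature-extractor setting.
    """
    if first_trainable_block is not None and first_trainable_block not in blocks:
        raise ValueError(f"unknown block: {first_trainable_block}")
    plan = {}
    for name in layer_names(blocks):
        # Recover the block index from a name such as 'block4_conv2'.
        block = int(name[len("block"):name.index("_")])
        plan[name] = (
            first_trainable_block is not None and block >= first_trainable_block
        )
    return plan


# Compare the three fine-tuning levels mentioned in the abstract.
for level in (5, 4, 3):
    trainable = [name for name, on in freeze_plan(level).items() if on]
    print(f"fine-tune from block {level}: {len(trainable)} trainable conv layers")
```

In a real framework, the same plan would typically be applied by setting a per-layer trainable flag on the pre-trained network before compiling it; the exact procedure the authors used is not given in the abstract.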

Publication Date

Wed Oct 09 2019

Journal Name

Engineering, Technology & Applied Science Research

Serviceability of Reinforced Concrete Gable Roof Beams with Openings under Static Loads

M. A. J.

A. F.

This paper presents an analytical study of the serviceability of reinforced concrete gable roof beams with openings of different sizes, based on an experimental study of 13 concrete gable roof beams with openings under static loading. A unified calculation procedure is developed for deflection and crack widths under static loading at the service stage; it covers prismatic beams with one opening subjected to a concentrated force in flexure. The deflection was calculated with two methods: the first used the relevant equations, and the second was the Direct Stiffness Method, in which the beam is treated as a structural member with several segments constituting the portions with solid sec

Publication Date

Wed Aug 01 2018

Journal Name

Engineering And Technology Journal

A Proposed Method for the Sound Recognition Process

Mustafa

Publication Date

Sun Oct 29 2023

Journal Name

Journal Of Al-qadisiyah For Computer Science And Mathematics

Optimization Techniques for Human Multi-Biometric Recognition System

Maryam

Researchers are increasingly using multimodal biometrics to strengthen the security of biometric applications. In this study, a strong multimodal human identification model was developed to address the growing problem of spoofing attacks in biometric security systems. The model incorporates three biometric modalities (face, iris, and fingerprint) and uses metaheuristic optimization methods, namely the Genetic Algorithm (GA), Ant Colony Optimization (ACO), and Particle Swarm Optimization (PSO), for feature selection. Image pre-processing, feature extraction, critical image feature selection, and multibiometric recognition are the four main steps in the workflow of the system. To determine its performance, the model wa

Publication Date

Sat Nov 02 2019

Journal Name

Advances In Intelligent Systems And Computing

Spin-Image Descriptors for Text-Independent Speaker Recognition

Suhaila N.

Building a system to identify individuals through their speech recording can find its application in diverse areas, such as telephone shopping, voice mail and security control. However, building such systems is a tricky task because of the vast range of differences in the human voice. Thus, selecting strong features becomes very crucial for the recognition system. Therefore, a speaker recognition system based on new spin-image descriptors (SISR) is proposed in this paper. In the proposed system, circular windows (spins) are extracted from the frequency domain of the spectrogram image of the sound, and then a run length matrix is built for each spin, to work as a base for feature extraction tasks. Five different descriptors are generated fro

Publication Date

Wed Dec 01 2021

Journal Name

Journal Of Physics: Conference Series

Disc damage likelihood scale recognition for Glaucoma detection

Mohammed S.G.

Glaucoma is a visual disorder and one of the leading causes of visual impairment. It disrupts the transmission of visual information to the brain. Unlike other eye conditions such as myopia and cataracts, the damage caused by glaucoma cannot be cured. The Disc Damage Likelihood Scale (DDLS) can be used to assess glaucoma. The proposed methodology uses a simple method to extract the neuroretinal rim (NRM) region, divides the region into four sectors, calculates the width of each sector, and selects the minimum value for use in the DDLS factor. The feature was fed to the SVM classification algorithm, the DDLS successfully classified Glaucoma d

Publication Date

Wed Mar 01 2023

Journal Name

Baghdad Science Journal

Existence of Fixed Points for Expansive Mappings in Complete Strong Altering JS-metric space

Altering distance

b-metric space

Dislocated metric space

Expansive mapping

Fixed points

Strong Altering JS-metric

X. M. JEFFIN

P.

B. Ananda

The paper aims at initiating and exploring the concept of extended metric known as the Strong Altering JS-metric, a stronger version of the Altering JS-metric. The interrelation of Strong Altering JS-metric with the b-metric and dislocated metric has been analyzed and some examples have been provided. Certain theorems on fixed points for expansive self-mappings in the setting of complete Strong Altering JS-metric space have also been discussed.

Publication Date

Fri Jul 04 2025

Journal Name

Computational And Theoretical Chemistry

Coronene and BN isosters of coronene: Revealing the electron density distribution using magnetic shielding maps

Muntadar

Marija

Publication Date

Thu Jun 30 2022

Journal Name

Journal Of Economics And Administrative Sciences

Analysis of Models (NAGARCH & APGARCH) by Using Simulations

NAGARCH

APGARCH

Simulation

Asymmetric

Heba

Suhail

Simulation experiments are a means of solving problems in many fields: a model of the real system is designed so that its behavior can be followed and identified through models and formulas implemented in software and run over a number of iterations. The aim of this study is to build a model for behavior that exhibits heteroskedasticity by studying the APGARCH and NAGARCH models under Gaussian and non-Gaussian distributions for different sample sizes (500, 1000, 1500, 2000) through the stages of time series analysis (identification, estimation, diagnostic checking, and prediction). The data was generated using the estimations of the parameters resulting f

Publication Date

Sun Mar 19 2023

Journal Name

Journal Of Educational And Psychological Researches

E-Learning and Its Relationship with Academic Passion among Middle School Students

E-learning

academic passion

middle school students

Aya

The current research aims to identify the level of e-learning among middle school students, their level of academic passion, and the correlation between the two. To achieve these objectives, the researcher developed two questionnaires to measure the study variables (e-learning and academic passion). These two tools were applied to the research sample of 380 male and female students in the first and second intermediate classes. The research concluded that there is a relationship between e-learning and academic passion among students.

Publication Date

Thu Nov 01 2018

Journal Name

International Journal Of Science And Research (ij

Mathematical Models for Predicting of Organic and Inorganic Pollutants in Diyala River Using Analysis Neural Network

Diyala river

BOD

TDS

ANN

nawar

The Diyala River is one of the most important tributaries in Iraq, and it suffers from pollution. This research therefore aimed to predict organic pollutants, represented by biological oxygen demand (BOD), and inorganic pollutants, represented by total dissolved solids (TDS), for the Diyala River. The data were collected for the period 2011-2016 at the last station on the river, known as D17, before the river meets the Tigris River in Baghdad. Analysis Neural Network (ANN) was used to find the mathematical models. Seven parameters were used to predict BOD (EC, Alk, Cl, K, TH, NO3, DO) after removing the less important parameters, while the parameters used to predict TDS were fourte
