BEYOND WORDS: HARNESSING SPEECH SOUND FOR SPEAKER AGE AND GENDER DETECTION USING 1D CNN ARCHITECTURE WITH SELF-ATTENTION MECHANISM

Umniah Hameed jaid

doi:10.5455/jjcit.71-1703265368

Details

Publication Date

Mon Jan 01 2024

Journal Name

Jordanian Journal Of Computers And Information Technology



Beyond the immediate content of speech, the voice can provide rich information about a speaker's demographics, including age and gender. Estimating a speaker's age and gender offers a wide range of applications, spanning from voice forensic analysis to personalized advertising, healthcare monitoring, and human-computer interaction. However, pinpointing precise age remains difficult due to age ambiguity: utterances from individuals at adjacent ages are frequently indistinguishable. To address this, we propose a novel, end-to-end approach that uses Mozilla's Common Voice dataset and transforms raw audio into high-quality feature representations using Wav2Vec2.0 embeddings. These are then fed into our self-attention-based convolutional neural network (CNN) model. To address age ambiguity, we evaluate the effects of different loss functions, such as focal loss and Kullback-Leibler (KL) divergence loss. Additionally, we evaluate the accuracy of the estimation at different durations of speech. Experimental results on the Common Voice dataset underscore the efficacy of our approach, showing an age-estimation accuracy of 87% for male speakers, 91% for female speakers, and 89% overall, and an accuracy of 99.1% for gender prediction.
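The two loss functions named in the abstract can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation. The function names, the `gamma` and `sigma` parameters, and the Gaussian-smoothed soft label used as the KL target are all assumptions made here for illustration; the abstract does not say how the target distribution is built.

```python
import math

def softmax(logits):
    """Convert raw class scores into probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def focal_loss(logits, target, gamma=2.0):
    """Focal loss for one sample: -(1 - p_t)^gamma * log(p_t).

    With gamma = 0 this reduces to ordinary cross-entropy; larger gamma
    down-weights samples the model already classifies confidently.
    """
    p_t = softmax(logits)[target]
    return -((1.0 - p_t) ** gamma) * math.log(p_t)

def soft_age_target(true_class, num_classes, sigma=1.0):
    """Hypothetical soft label: a Gaussian centred on the true age class,
    so that adjacent age groups receive some probability mass."""
    weights = [math.exp(-((k - true_class) ** 2) / (2.0 * sigma ** 2))
               for k in range(num_classes)]
    total = sum(weights)
    return [w / total for w in weights]

def kl_divergence_loss(logits, target_dist, eps=1e-12):
    """KL(target || prediction) for one sample, summed over classes."""
    probs = softmax(logits)
    return sum(t * math.log((t + eps) / (p + eps))
               for t, p in zip(target_dist, probs) if t > 0.0)

if __name__ == "__main__":
    logits = [2.0, 0.5, -1.0, 0.0]          # made-up scores for 4 age classes
    print("focal loss:", focal_loss(logits, target=0))
    target = soft_age_target(true_class=0, num_classes=4)
    print("KL loss:", kl_divergence_loss(logits, target))
```

With a soft target of this kind, a prediction that lands on a neighbouring age class is penalized less than one that lands far away, which is one plausible way a KL-based loss could reduce the effect of age ambiguity.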


Publication Date

Wed Dec 18 2019

Journal Name

Baghdad Science Journal

Eye Detection using Helmholtz Principle

Contour Extraction

Gestalt Theory

Helmholtz Principle

Skin Detection.

Ahmed


Eye detection is used in many applications, such as pattern recognition, biometrics, and surveillance systems. In this paper, a new method is presented to detect and extract the overall shape of one eye from an image based on two principles: Helmholtz and Gestalt. According to the Helmholtz principle of perception, an observed geometric shape is perceptually "meaningful" if its repetition number is very small in an image with random distribution. To achieve this goal, the Gestalt principle states that humans see things either by grouping their similar elements or by recognizing patterns. In general, according to Gestalt Principle, humans see things through genera


Publication Date

Wed Mar 02 2022

Journal Name

Journal Of Educational And Psychological Researches

Attention Deficit Hyperactivity Disorder (ADHD) of Primary School Pupils

attention deficit

hyperactivity

disorder

pupil

primary school

Muayad .H. Al-jumaiali


The aim of this research is to diagnose attention deficit hyperactivity disorder among primary school pupils in Baquba city, Diyala province. The sample of the study consisted of (25) male and female pupils. The American Guide of Attention Deficit Hyperactivity Scale (DSM-IV, 1994) was used in this study, in addition to Conners' (1996) scale, to measure attention deficit hyperactivity disorder for teachers and parents. The results revealed that (19) male and female pupils were diagnosed with attention deficit hyperactivity to various degrees.


Publication Date

Mon Sep 30 2024

Journal Name

Iraqi Journal Of Science

Attention-Deficit Hyperactivity Disorder Prediction by Artificial Intelligence Techniques

Prediction

Attention Deficit Hyperactivity Disorder (ADHD)

Artificial Intelligence

KNN

AdaBoost

XGBoost

Pearson correlation.

Rasha H.

Wisal Hashim


Attention-Deficit Hyperactivity Disorder (ADHD), a neurodevelopmental disorder affecting millions of people globally, is defined by symptoms of hyperactivity, impulsivity, and inattention that can significantly affect an individual's daily life. The diagnostic process for ADHD is complex, requiring a combination of clinical assessments and subjective evaluations. However, recent advances in artificial intelligence (AI) techniques have shown promise in predicting ADHD and providing an early diagnosis. In this study, we will explore the application of two AI techniques, K-Nearest Neighbors (KNN) and Adaptive Boosting (AdaBoost), in predicting ADHD using the Python programming language. The classification accuracies obtained w


Publication Date

Wed Jun 16 2021

Journal Name

Cognitive Computation

Deep Transfer Learning for Improved Detection of Keratoconus using Corneal Topographic Maps

Ali H.

Nebras H.

Zahraa M.

Javier


Clinical keratoconus (KCN) detection is a challenging and time-consuming task. In the diagnosis process, ophthalmologists must review demographic and clinical ophthalmic examinations. The latter include slit-lamp, corneal topographic maps, and Pentacam indices (PI). We propose an Ensemble of Deep Transfer Learning (EDTL) based on corneal topographic maps. We consider four pretrained networks, SqueezeNet (SqN), AlexNet (AN), ShuffleNet (SfN), and MobileNet-v2 (MN), and fine-tune them on a dataset of KCN and normal cases, each including four topographic maps. We also consider a PI classifier. Then, our EDTL method combines the output probabilities of each of the five classifiers to obtain a decision b


Publication Date

Thu Apr 20 2023

Journal Name

Fire

An Efficient Wildfire Detection System for AI-Embedded Applications Using Satellite Imagery

George L.

Ryeim B.

Sanaa S.

Rebecca D.

Joshua M.

Rhode V.

Bahaa I.


Wildfire risk has globally increased during the past few years due to several factors. An efficient and fast response to wildfires is extremely important to reduce the damaging effect on humans and wildlife. This work introduces a methodology for designing an efficient machine learning system to detect wildfires using satellite imagery. A convolutional neural network (CNN) model is optimized to reduce the required computational resources. Due to the limitations of images containing fire and seasonal variations, an image augmentation process is used to develop adequate training samples for the change in the forest’s visual features and the seasonal wind direction at the study area during the fire season. The selected CNN model (Mob


Publication Date

Mon Feb 04 2019

Journal Name

Journal Of The College Of Education For Women

Peace Indicating Words in Pre-Islamic Poetry

Assistant Professor Dr. Alaa Jasim (علاء جاسم)


In pre-Islamic poetry, there are many words that indicate peacefulness of one sort or another, in addition to the inspirations of semantic modeling in which the poet sets himself in various horizons. Among these words are: brother, comrade, friend, companion, lover, people, prince, home, land, country, blessing, honesty, contract, company, justice, thankfulness, forgiveness, pardoning, guest, goodness, faithfulness, silence, death, peace, and others. In addition, their derivatives from various aspects indicate peacefulness either directly or indirectly.


Publication Date

Sat Sep 21 2019

Journal Name

Journal Of The College Of Education For Women

Translation of Polysemous Words in Harry Potter

Equivalence

fantasy novels

Harry Potter

polysemy

translation

Muthana Hameed


The paper examines polysemous words in Harry Potter (HP). It studies selected polysemous words to determine the extent to which the translators of HP succeed in rendering the intended meaning according to the context of the source text. Evidently, the selected translators in this study were not aware of the phenomenon of polysemy in HP. They adopt a literal translation strategy to convey most of the polysemous senses. The method of data collection is divided into two stages. Firstly, determining the situational context of the fantasy and identifying the polysemous sense in order to make all the contextual meanings of the source text clear. Secondly, reviewing the selected translation to


Publication Date

Mon Jan 09 2023

Journal Name

2023 15th International Conference On Developments In Esystems Engineering (dese)

Low-Distortion MMSE Estimator for Speech Enhancement Based on Hahn Moments

Ammar S.

Basheera M.

Sadiq H.

Marwah A.

Abir


Publication Date

Sat Jun 01 2013

Journal Name

Journal of Baghdad College of Economic Sciences University (مجلة كلية بغداد للعلوم الاقتصادية الجامعة)

Proposed family speech recognition

Speech recognition

Speech Analysis

Speaker Recognition Using Neural Networks

Denoise

Wavelet.

Sawsan


Speech recognition is a very important field that can be used in many applications, such as controlling access to protected areas, banking, transactions over telephone networks, database access services, voice email, investigations, and house control and management. Speech recognition systems can be used in two modes: to identify a particular person or to verify a person's claimed identity. Family speaker recognition is a modern field within speaker recognition. Many speakers from the same family have similar voice characteristics and are hard to tell apart. Today, the scope of speech recognition is limited to speech collected from cooperative users in real-world office environments, without adverse microphone or channel impairments.

Publication Date

Wed Dec 08 2021

Journal Name

J. Inf. Hiding Multim. Signal Process.

Predication of Most Significant Features in Medical Image by Utilized CNN and Heatmap.

Neural Networks

Deep learning

Convolutional Neural Networks

Medical Images

CNN

Lubab

Samera

Mahir

Shaimaa


Advances in machine learning and image processing methods, along with the availability of medical imaging data, have greatly increased the use of machine learning strategies in the medical field. Neural networks, and in recent years convolutional neural networks (CNNs) in particular, provide powerful descriptors for computer-aided diagnosis systems. Even so, several issues arise when working with medical images: many medical images have a poor noise-to-signal ratio (NSR) compared to scenes obtained with a digital camera, and generally a low spatial resolution, which makes the contrast between different body tissues very low and makes it difficult to co
