BEYOND WORDS: HARNESSING SPEECH SOUND FOR SPEAKER AGE AND GENDER DETECTION USING 1D CNN ARCHITECTURE WITH SELF-ATTENTION MECHANISM

Umniah Hameed jaid

doi:10.5455/jjcit.71-1703265368

Details

Publication Date

Mon Jan 01 2024

Journal Name

Jordanian Journal Of Computers And Information Technology

DOI

10.5455/jjcit.71-1703265368

Choose Citation Style

Statistics

View publication

3

Statistics

BEYOND WORDS: HARNESSING SPEECH SOUND FOR SPEAKER AGE AND GENDER DETECTION USING 1D CNN ARCHITECTURE WITH SELF-ATTENTION MECHANISM

Umniah Hameed jaid

...Show More Authors

Beyond the immediate content of speech, the voice can provide rich information about a speaker's demographics, including age and gender. Estimating a speaker's age and gender offers a wide range of applications, spanning from voice forensic analysis to personalized advertising, healthcare monitoring, and human-computer interaction. However, pinpointing precise age remains intricate due to age ambiguity. Specifically, utterances from individuals at adjacent ages are frequently indistinguishable. Addressing this, we propose a novel, end-to-end approach that deploys Mozilla's Common Voice dataset to transform raw audio into high-quality feature representations using Wav2Vec2.0 embeddings. These are then channeled into our self-attention-based convolutional neural network (CNN) model. To address age ambiguity, we evaluate the effects of different loss functions such as focal loss and Kullback-Leibler (KL) divergence loss. Additionally, we evaluate the accuracy of the estimation at different durations of speech. Experimental results from the Common Voice dataset underscore the efficacy of our approach, showcasing an accuracy of 87% for male speakers, 91% for female speakers and 89% overall accuracy, and an accuracy of 99.1% for gender prediction.

View Publication

Publication Date

Mon Feb 10 2025

Journal Name

International Linguistics Research

A Semiotic Study of Reduplicative Words in Selected American Slang Expressions

American slang

semiotics

reduplicative expressions

Barthes’ model

cultural connotation

Mahmood Atiya

Sura Abd Ulghafoor

...Show More Authors

This study explores the semiotic aspects of American slang, specifically focusing on the phenomenon of reduplicative expressions in informal speech. Despite the extensive research on American slang, limited attention has been given to the cultural and mythical meanings embedded within reduplicative expressions. To address this gap, the study investigates how these expressions convey denotative, connotative, and mythical meanings within casual American discourse. The objectives of the study include: 1. To what extent does Barthes’ semiotic model hold potential for application in this study? 2. How are reduplicative slang expressions widely used in everyday American life? 3. To what extent do qualitative and quantitative methods hav

View Publication

Publication Date

Sun Jan 01 2017

Journal Name

Analytical Methods

Determination of pharmaceuticals in freshwater sediments using ultrasonic-assisted extraction with SPE clean-up and HPLC-DAD or LC-ESI-MS/MS detection

Omar S.A.

Alistair B.A.

...Show More Authors

A robust and sensitive analytical method is presented for the extraction and determination of six pharmaceuticals in freshwater sediments.

View Publication

(24)

(23)

Publication Date

Sat Nov 15 2025

Journal Name

Journal Of Baghdad College Of Dentistry

Flattening of the posterior slope of the articular eminence of completely edentulous patients compared to patients with maintained occlusion in relation to age using computed tomography

Luma A

Lamia H

...Show More Authors

Background: The posterior slope of the articular eminence of completely edentulous patients compared to patients with maintained occlusion shows significant flattening. This study aimed to correlate between the flattening of the posterior slope of the articular eminence, with dental status, age, genders, on both sides using computed tomography. Materials and Methods: The sample of the present study was a total of 117 Iraqi subjects, who admitted to the maxillofacial department at Al-Sadr Teaching Hospital in Al-Najaf city. The examination was performed on CT scanner; the eminence inclination was measured in two methods using sagittal section. Results: Clinically, the inclination of articular eminence was higher in edentulous subjects than i

View Publication Preview PDF

Publication Date

Tue Jan 17 2017

Journal Name

International Journal Of Science And Research (ijsr)

Detection System of Varicose Disease using Probabilistic Neural Network

Mays M. Hoobi

...Show More Authors

Publication Date

Thu Dec 01 2022

Journal Name

Iraqi Journal Of Science

PLAGIARISM DETECTION SYSTEM IN SCIENTIFIC PUBLICATION USING LSTM NETWORKS

Mohammed

...Show More Authors

(3)

Publication Date

Sat Jan 02 2010

Journal Name

Journal Of Al-nahrain University

HIDDEN FEATURES DETECTION USING HISTOGRAM MODIFICATION IN MRI IMAGES

Magnetic Resonance Imaging (MRI)

HIDDEN FEATURES

Samar O.

...Show More Authors

Magnetic Resonance Imaging (MRI) uses magnetization and radio waves, rather than x-rays to make very detailed, cross- sectional pictures of the brain. In this work we are going to explain some procedures belongs contrast and brightness improvement which is very important in the improvement the image quality such as the manipulation with the image histogram. Its has been explained in this worked the histogram shrink i.e. reducing the size of the gray level gives a dim low contrast picture is produced, where, the histogram stretching of the gray level was distributed on a wide scale but there is no increase in the number of pixels in the bright region. The histogram equalization has also been discuss together with its effects of the improveme

Publication Date

Sun Sep 01 2013

Journal Name

International Journal Of Advanced Research In Computer Science And Software Engineering

Real Time Motion Detection in Surveillance Camera Using MATLAB

motion detection

real time video

surveillance camera

comparing frames

MATLAB

Furat N.

...Show More Authors

Surveillance cameras are video cameras used for the purpose of observing an area. They are often connected to a recording device or IP network, and may be watched by a security guard or law enforcement officer. In case of location have less percentage of movement (like home courtyard during night); then we need to check whole recorded video to show where and when that motion occur which are wasting in time. So this paper aims at processing the real time video captured by a Webcam to detect motion in the Scene using MATLAB 2012a, with keeping in mind that camera still recorded which means real time detection. The results show accuracy and efficiency in detecting motion

Preview PDF

Publication Date

Mon Jan 01 2024

Journal Name

Computers, Materials & Continua

Credit Card Fraud Detection Using Improved Deep Learning Models

Sulaiman S.S.

...Show More Authors

View Publication

(18)

(10)

Publication Date

Tue Sep 21 2021

Journal Name

Journal Of Healthcare Engineering

Complexity and Entropy Analysis to Improve Gender Identification from Emotional-Based EEGs

Mohannad K. Sabir

...Show More Authors

Investigating gender differences based on emotional changes becomes essential to understand various human behaviors in our daily life. Ten students from the University of Vienna have been recruited by recording the electroencephalogram (EEG) dataset while watching four short emotional video clips (anger, happiness, sadness, and neutral) of audiovisual stimuli. In this study, conventional filter and wavelet (WT) denoising techniques were applied as a preprocessing stage and Hurst exponent $()$

View Publication

(10)

(7)

Publication Date

Sat Jun 01 2013

Journal Name

Journal Of The College Of Languages (jcl)

Investigating the Mastering of the Pronunciation of Weak and Strong Forms of English Function Words

Mahdii Khalaf Hussein

...Show More Authors

The weak and strong forms are so called because it is not their lexical content that primary matter, but the role they have in the sentence. The problematic confusion, our students encounter, in recognizing and producing the correct pronunciation of weak and strong forms of the English function words is the main incentive behind conducting this study. In order to gather the data, this paper used two types of tests: a recognition test and a production test. The general results reached through the analysis of the students' answers seem to conform to the researcher's assumption: students face a critical problem in recognizing and producing correct pronunciation of the weak and strong forms of the English funct

View Publication Preview PDF

1 2 ... 36 37 38 39 ... 2134 2135