BEYOND WORDS: HARNESSING SPEECH SOUND FOR SPEAKER AGE AND GENDER DETECTION USING 1D CNN ARCHITECTURE WITH SELF-ATTENTION MECHANISM

Umniah Hameed jaid

doi:10.5455/jjcit.71-1703265368

Details

Publication Date

Mon Jan 01 2024

Journal Name

Jordanian Journal Of Computers And Information Technology

Beyond the immediate content of speech, the voice can provide rich information about a speaker's demographics, including age and gender. Estimating a speaker's age and gender has a wide range of applications, from voice forensic analysis to personalized advertising, healthcare monitoring, and human-computer interaction. However, estimating age precisely remains difficult because of age ambiguity: utterances from individuals of adjacent ages are frequently indistinguishable. To address this, we propose a novel end-to-end approach that transforms raw audio from Mozilla's Common Voice dataset into high-quality feature representations using Wav2Vec2.0 embeddings. These are then fed into our self-attention-based convolutional neural network (CNN) model. To address age ambiguity, we evaluate the effects of different loss functions, such as focal loss and Kullback-Leibler (KL) divergence loss. Additionally, we evaluate the accuracy of the estimation at different durations of speech. Experimental results on the Common Voice dataset demonstrate the efficacy of our approach, with an accuracy of 87% for male speakers, 91% for female speakers, and 89% overall, and an accuracy of 99.1% for gender prediction.
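The two loss functions named in the abstract can be illustrated with a small pure-Python sketch. This is not the authors' implementation: the helper names, the `gamma` value, and in particular the Gaussian-smoothed age target used with the KL divergence are assumptions made here for illustration, since the abstract does not say how the target distribution is built.

```python
import math


def softmax(logits):
    """Convert raw class scores into probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def focal_loss(probs, target, gamma=2.0):
    """Focal loss for one sample: -(1 - p_t)^gamma * log(p_t).

    gamma=0 reduces to ordinary cross-entropy; larger gamma down-weights
    samples the model already classifies confidently.
    """
    p_t = probs[target]
    return -((1.0 - p_t) ** gamma) * math.log(p_t)


def soft_age_target(num_classes, target, sigma=1.0):
    """Hypothetical soft label: a Gaussian centred on the true age class,
    so that adjacent age groups receive some probability mass (assumption)."""
    weights = [math.exp(-((k - target) ** 2) / (2.0 * sigma ** 2))
               for k in range(num_classes)]
    total = sum(weights)
    return [w / total for w in weights]


def kl_divergence(target_dist, probs, eps=1e-12):
    """KL(target || predicted) for one sample."""
    return sum(q * math.log((q + eps) / (p + eps))
               for q, p in zip(target_dist, probs) if q > 0)


# Example: six age classes, true class index 2.
probs = softmax([0.2, 1.5, 2.0, 0.3, -1.0, -2.0])
print("focal loss:", focal_loss(probs, target=2))
print("KL loss:   ", kl_divergence(soft_age_target(6, 2), probs))
```

With a soft target, a prediction that lands on a neighbouring age class is penalised less than one that lands far away, which is one plausible way a distribution-based loss could ease the age-ambiguity problem described above.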


Publication Date

Fri Nov 01 2024

Journal Name

Practice Periodical On Structural Design And Construction

Risks Associated with Using Drones in Construction for Safety Management

Ali Amer

M. K. S.


Publication Date

Sun Mar 02 2008

Journal Name

Baghdad Science Journal

A study of analysis and comparison to the low nutrient density foods that more normality for children age (3 –5 years)

Marriam M.


During early childhood, and after weaning, the child acquires food habits that may stay with him throughout his life. Here the parents' role arises: teaching the child sound food habits and hygienic practices, and providing whatever is beneficial to health in quantities sufficient for the body. In this way, the experiences the child gains at home will be of great help in his future life in choosing suitable food, once he becomes more independent of his parents in making his decisions and choices. The results of this study showed that the children's average consumption of high-energy foods was the highest in comparison with the others, followed by the consumption of soft drinks


Publication Date

Sat Jun 01 2024

Journal Name

Journal Of Engineering

Copy Move Image Forgery Detection using Multi-Level Local Binary Pattern Algorithm

Support vector machine

Copy-Move

MICC-F2000

Local binary pattern

Multi Local binary pattern

Marwa Emad

Nada


Digital image manipulation has become increasingly prevalent due to the widespread availability of sophisticated image editing tools. In copy-move forgery, a portion of an image is copied and pasted into another area within the same image. The proposed methodology begins by extracting Local Binary Pattern (LBP) features from the image. Two main statistical functions, Standard Deviation (STD) and Angular Second Moment (ASM), are computed for each LBP feature, capturing additional statistical information about the local textures. Next, a multi-level LBP feature selection is applied to select the most relevant features. This process involves performing LBP computation at multiple scales or levels, capturing textures at different
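As a rough illustration of the first steps described above, the sketch below computes basic 3x3 LBP codes and then a standard deviation and an angular second moment over them. It is a minimal sketch, not the paper's pipeline: the function names are hypothetical, the neighbour ordering is an arbitrary choice, and computing ASM as the sum of squared probabilities of the LBP-code histogram is an assumption, since the abstract does not give the exact definition used.

```python
import math


def lbp_image(img):
    """Basic 3x3 LBP codes for the interior pixels of a 2D list of gray levels."""
    # Neighbour offsets, clockwise starting from the top-left pixel.
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]
    rows, cols = len(img), len(img[0])
    codes = []
    for r in range(1, rows - 1):
        row_codes = []
        for c in range(1, cols - 1):
            center = img[r][c]
            code = 0
            for bit, (dr, dc) in enumerate(offsets):
                # Set the bit when the neighbour is at least as bright as the centre.
                if img[r + dr][c + dc] >= center:
                    code |= 1 << bit
            row_codes.append(code)
        codes.append(row_codes)
    return codes


def std_and_asm(codes):
    """Population standard deviation of the LBP codes, and ASM of their
    normalised histogram (sum of squared probabilities; an assumption)."""
    flat = [v for row in codes for v in row]
    n = len(flat)
    mean = sum(flat) / n
    std = math.sqrt(sum((v - mean) ** 2 for v in flat) / n)
    hist = {}
    for v in flat:
        hist[v] = hist.get(v, 0) + 1
    asm = sum((count / n) ** 2 for count in hist.values())
    return std, asm


# Example on a small synthetic patch.
patch = [
    [10, 10, 10, 10],
    [10, 50, 60, 10],
    [10, 70, 80, 10],
    [10, 10, 10, 10],
]
codes = lbp_image(patch)
print(codes, std_and_asm(codes))
```

A multi-level variant would repeat the same computation at several neighbourhood radii or image scales and concatenate the resulting statistics into one feature vector.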

Publication Date

Sun Jan 01 2017

Journal Name

Spe

SPE-188966-MS: Drilling problems detection in Basrah oil fields using smartphones

Saleh I.

M S


Publication Date

Tue Sep 01 2009

Journal Name

Al-khwarizmi Engineering Journal

Analysis of Wave Propagation in Detection of Aorta Dieses Using Lumps Analysis

A. M

A. Salam


In this paper, a theoretical attempt is made to determine whether changes in the aorta diameter at different locations along the aorta can be detected by measurements at the brachial artery. The aorta is divided into six main parts, each consisting of 4 lumps of 0.018 m length. It is assumed that a desired section of the aorta has a radius change of 100%, 200%, or 500%. The results show that part 2 (lumps 5-8) exhibits a significant change compared with the other parts. This indicates that the position nearest to the artery produces a significant change in the artery wave pressure, while the other parts of the aorta have a small effect.


Publication Date

Tue Jun 20 2023

Journal Name

Baghdad Science Journal

Detection of Autism Spectrum Disorder Using A 1-Dimensional Convolutional Neural Network

Autism Spectrum Disorder

Classification

Deep Learning

Machine Learning

One-Dimensional-Convolutional Neural Network

Aythem Khairi

Mohammed M.

Ahmed Adil


Autism Spectrum Disorder (ASD) is a neurodevelopmental disorder that impairs speech, social interaction, and behavior. Machine learning is a field of artificial intelligence that focuses on creating algorithms that can learn patterns and classify ASD based on input data. The results of using machine learning algorithms to categorize ASD have been inconsistent, and more research is needed to improve classification accuracy. To address this, deep learning models such as the 1D CNN have been proposed as an alternative for ASD classification. The proposed techniques are evaluated on three different publicly available ASD datasets (children, adults, and adolescents). Results strongly suggest that 1D


Publication Date

Thu Dec 03 2020

Journal Name

Civileng

Evaluation of Concrete Material Properties at Early Age

Osamah

Emad

Tilak

Jessey

Kamiran


This article investigates the development over time of the following material properties of concrete: compressive strength, tensile strength, modulus of elasticity, and fracture energy. These properties were determined at seven different hydration ages (18 h, 30 h, 48 h, 72 h, 7 days, 14 days, 28 days) for four pure cement concrete mixes, with a total of 336 specimens tested throughout the study. The experimental data obtained were used to assess the relationship of the above properties with the concrete compressive strength and how these relationships are affected by age. Further, this study investigates prediction models available in the literature, and recommendations are made for models that are found suitable for application to early age conc


Publication Date

Sat Dec 31 2022

Journal Name

International Journal On “technical And Physical Problems Of Engineering”

Age Estimation Utilizing Deep Learning Convolutional Neural Network

Estimation

Age

Deep Learning

IMDB

CNN

Mohammed S. H.


Estimating an individual's age from a photograph of their face is critical in many applications, including intelligence and defense, border security, human-machine interaction, and soft biometric recognition. Recent progress in this discipline has focused on deep learning. These solutions require deep neural networks to be created and trained specifically for this task. In addition, pre-trained deep neural networks for facial recognition are used in the research process and fine-tuned for accurate outcomes. The purpose of this study was to offer a method for estimating human ages from the frontal view of the face in a manner that is as accurate as possible and takes


Publication Date

Tue Jun 14 2016

Journal Name

Al-academy

Expressive and aesthetic role of the scenes in the initiation speech the picture (Cinema Togravea)

Athra'a


Construction is the opening of the important pillars of the construction of the film as a whole for this, we find that the first of any narrative of my film begin at the borders of this construction is the window that we look through the contents tale and puzzle narrative is of significance that degrade traveler when reservoirs expression later in reasoning and find justifications ills that came by those initiation, this initiation may be the window that lead us to the core, understanding the story through signals received to the recipient to sail because of the paths of pickling what is which is encoded, but this initiation may serve as keys that understanding the be puppies and signals that beset and surrounded to what He holds inevita


Publication Date

Sat Feb 07 2026

Journal Name

Algorithms

An In-Depth Review of Speech Enhancement Algorithms: Classifications, Underlying Principles, Challenges, and Emerging Trends

Nisreen Talib

Basheera M.


Speech enhancement aims to improve speech quality and intelligibility in noisy environments and is important in applications such as hearing aids, mobile communications, and automatic speech recognition (ASR). This paper presents a structured review of speech enhancement techniques, classified according to channel configuration and signal processing framework. Both traditional and modern approaches are discussed, including classical signal processing methods, machine learning techniques, and recent deep learning-based models. Furthermore, common noise types, widely used speech datasets, and standard metrics for evaluating speech quality and intelligibility are reviewed. Key challenges such as non-stationary noise, data li
