Beyond the immediate content of speech, the voice can provide rich information about a speaker's demographics, including age and gender. Estimating a speaker's age and gender offers a wide range of applications, spanning from voice forensic analysis to personalized advertising, healthcare monitoring, and human-computer interaction. However, pinpointing precise age remains intricate due to age ambiguity. Specifically, utterances from individuals at adjacent ages are frequently indistinguishable. Addressing this, we propose a novel, end-to-end approach that deploys Mozilla's Common Voice dataset to transform raw audio into high-quality feature representations using Wav2Vec2.0 embeddings. These are then channeled into our self-attention-based convolutional neural network (CNN) model. To address age ambiguity, we evaluate the effects of different loss functions such as focal loss and Kullback-Leibler (KL) divergence loss. Additionally, we evaluate the accuracy of the estimation at different durations of speech. Experimental results from the Common Voice dataset underscore the efficacy of our approach, showcasing an accuracy of 87% for male speakers, 91% for female speakers and 89% overall accuracy, and an accuracy of 99.1% for gender prediction.
Introduction: Attention is a fundamental cognitive function in sports, particularly in volleyball, where players must process multiple stimuli and make rapid decisions. Effective attentional control can enhance an athlete’s ability to react to dynamic game situations. The nomination of ideas strategy. Objective: This study aims to examine the effectiveness of the nomination of ideas strategy in enhancing divided and selective attention and its subsequent impact on volleyball skill performance. Methodology: A controlled experimental design was employed, involving volleyball players divided into an experimental group and a control group. The experimental group integrated the nomination of ideas strategy into their training sessions,
... Show MoreA strong sign language recognition system can break down the barriers that separate hearing and speaking members of society from speechless members. A novel fast recognition system with low computational cost for digital American Sign Language (ASL) is introduced in this research. Different image processing techniques are used to optimize and extract the shape of the hand fingers in each sign. The feature extraction stage includes a determination of the optimal threshold based on statistical bases and then recognizing the gap area in the zero sign and calculating the heights of each finger in the other digits. The classification stage depends on the gap area in the zero signs and the number of opened fingers in the other signs as well as
... Show MoreChange detection is a technology ascertaining the changes of
specific features within a certain time Interval. The use of remotely
sensed image to detect changes in land use and land cover is widely
preferred over other conventional survey techniques because this
method is very efficient for assessing the change or degrading trends
of a region. In this research two remotely sensed image of Baghdad
city gathered by landsat -7and landsat -8 ETM+ for two time period
2000 and 2014 have been used to detect the most important changes.
Registration and rectification the two original images are the first
preprocessing steps was applied in this paper. Change detection using
NDVI subtractive has been computed, subtrac
The research is a contemporary stylistic study of the text that expresses the psychological
meanings in Nahj Al-Balaghah.
The research is divided to three chapters.
The first chapter : characteristics of psychological text.
It is about the power to influence the addressee ,that is to say ,the text influence power
connects with vulnerability and emotions inside the addressee spirit and his response to
violent effects shaking his feelings and that is represented in outbreaking of words vitality and
homogeneously. The Imam's wrath , grief , love to rightness , hatred of falsehood and
deviation from Islam arouses the words and emotions through :
1. Reduction in words and expansion in meaning by means of inspiration
Language is the realistic and sensitive basis for any communication between two or more parties. It is an important workshop that prepares meanings and coding them according to a linguistic structure governed by agreed rules that speak to and coexist with everyone.
Whereas the forms of communication are: personal, mediator and mass, none of them can move away from language in their dealings and communication patterns. Since each has its own characteristics and skills, it must be launched in its fields through verbal and non-verbal symbols and wears the elements of influential language as intended.
It makes the recipient face two things: whether he fails to understand those symbols hence its purpose fail, or he meditates s
... Show MoreInterpreting is a process adopted by a skillful and well qualified interpreter to convey orally the meaning from a source language into a target language simultaneously .In this process the interpreter has no time to think or check the exact meaning of the words, phrases and sentences. The main technique used by the interpreter is based on his/her competence .This type of translation is used in press conferences and political speeches of high rank figures.
This paper deals with analyzing the interpretation of Obama's farewell speech adopted by two authentic TV Channels(Sky News and AL- Jazeera).The aim of this paper is to investigate the quality of each interpreting by adopting Nida's (1996:164
... Show MoreThe expressive speech in the art of acting is considered to be the leading action for thinking and the interior interaction of the individual, in the organization of his relationship with the elements of the environment and its content, and the more acquired the individual's performance today skills in organizing and export of his conflicts and natural occasions , whenever , becomes more adaptable and an interpreted physically and acoustically inside pluralities in personalities, suited to the modern age. This research has addressed a range of topics related to expressive speech in the art of acting, since the first chapter included a systematic framework of the research, represented by the problem , the importance and aims of research t
... Show MoreOnline communication on social networks has become a never-given-up way of expressing and sharing views and opinions within the realm of all topics on earth, and that is that! A basis essential in this is the limits at which "freedom of expression" should not be trespassed so as not to fall into the expression of "hate speech". These two ends make a base in the UN regulations pertaining to human rights: One is free to express, but not to hate by expression. Hereunder, a Critical Discourse Analysis in terms of Fairclough's dialectical-relational approach (2001) is made of Facebook posts (being made by common people, and not of official nature) targeting Islam and Muslims. This is made so as to recognize these instances of "speech" a
... Show MoreA growing and pressing need for Speech Enhancement Algorithms (SEAs) has emerged with the proliferation of hearing devices and mobile devices that aim to improve speech intelligibility without sacrificing speech quality. Recently, a tremendous number of studies have been conducted in the field of speech enhancement. This study aims to map the field of speech enhancement by conducting a systematic literature review to provide comprehensive details of recently proposed SEAs. This systematic review aims to highlight research trends in SEAs and direct researchers to the most important topics published between 2015 and 2024. It attempts to address seven key research questions related to this topic. Moreover, it covers articles available
... Show More