Beyond the immediate content of speech, the voice can provide rich information about a speaker's demographics, including age and gender. Estimating a speaker's age and gender offers a wide range of applications, spanning from voice forensic analysis to personalized advertising, healthcare monitoring, and human-computer interaction. However, pinpointing precise age remains intricate due to age ambiguity. Specifically, utterances from individuals at adjacent ages are frequently indistinguishable. Addressing this, we propose a novel, end-to-end approach that deploys Mozilla's Common Voice dataset to transform raw audio into high-quality feature representations using Wav2Vec2.0 embeddings. These are then channeled into our self-attention-based convolutional neural network (CNN) model. To address age ambiguity, we evaluate the effects of different loss functions such as focal loss and Kullback-Leibler (KL) divergence loss. Additionally, we evaluate the accuracy of the estimation at different durations of speech. Experimental results from the Common Voice dataset underscore the efficacy of our approach, showcasing an accuracy of 87% for male speakers, 91% for female speakers and 89% overall accuracy, and an accuracy of 99.1% for gender prediction.
Experienced economic environmentRadical changes at the end of the last century and the beginning of the present century, resulting in new concepts and expectations in all aspects of economic, political, social and even behavioral.Each of these concepts is the result of rapid developments in the intangible space. Competition is no longer limited to the mere possession of tangible material resources, but because of its link to knowledge and technology content and to the comprehensive quality standards and efficient and effective policies of States. With the increasing pace of growth and interdependence among the global economies, this resulted in the birth of a new economic system led by technological development and financial liberalizati
... Show MoreSound effects are considered to be a key element in children’s theatre, for it relays the context and amplifies its understandability, acceptability and its impact on the audience, so it’s a fundamental method in portraying the characters within the idea or the story, to produce the title and content with completeness in its relations that are associated with the rest of the fundamental elements represented in lighting, costumes, dialogue, decoration, etc. And this research included a set of subjects that are related to implementing the sound effects used in the Iraqi children’s theatre plays, chapter one included the problem and the need for studying this subject, as well as its importance and aim, and specifying the basic phrases
... Show MoreThis study aims to isolate the pathogenic yeasts from genital tract and investigate their relationship with the age .The results clarified that the most pathogenic yeast isolated from genital tract was Candida albicans , also the results of C.albicanas isolates susceptibility test, to different antifungal revealed that they were sensitive to Miconazole, Ketoconazole and Clotrimazol and were resistant to Nystatin and Grisofulvin. The study of relationship of vaginal infection with the age showed that the incidence of infection with Candida was high among females age group (19-39 years).
Data-driven models perform poorly on part-of-speech tagging problems with the square Hmong language, a low-resource corpus. This paper designs a weight evaluation function to reduce the influence of unknown words. It proposes an improved harmony search algorithm utilizing the roulette and local evaluation strategies for handling the square Hmong part-of-speech tagging problem. The experiment shows that the average accuracy of the proposed model is 6%, 8% more than HMM and BiLSTM-CRF models, respectively. Meanwhile, the average F1 of the proposed model is also 6%, 3% more than HMM and BiLSTM-CRF models, respectively.
Investigating the strength and the relationship between the Self-organized learning strategies and self-competence among talented students was the aim of this study. To do this, the researcher employed the correlation descriptive approach, whereby a sample of (120) male and female student were selected from various Iraqi cities for the academic year 2015-2016. the researcher setup two scales based on the previous studies: one to measure the Self-organized learning strategies which consist of (47) item and the other to measure the self-competence that composed of (50) item. Both of these scales were applied on the targeted sample to collect the required data
in this paper we adopted ways for detecting edges locally classical prewitt operators and modification it are adopted to perform the edge detection and comparing then with sobel opreators the study shows that using a prewitt opreators
The present search aims to develop a test for selective attention, cognitive load and thinking mistakes and measuring these concepts among Baghdad university students. To make a comparison between the selective attention, cognitive load, and the mistakes of thinking among students in term of gender. To identify the relationship among the selective attention, cognitive load and the mistakes of thinking of university students. To achieve these purposes, the searcher has developed a test for selective attention, cognitive load, and the mistakes of thinking. Then, these tools were applied to a sample of (200) university students were selected from (21) college. The researcher used t-test of one sample, t-test of two independent
... Show MoreThe lexical connotation is one of the types of connotation that linguists have dealt with, and stipulated in their studies, meaning access to the real meanings of the words, that the lexicon can address after tracing the real meaning of the metaphorical meanings, if any, and this is known to the semantics additional significance, and the rhetorical meaning Figuratively.
The miraculous Qur'an in its systems often refers to the metaphorical uses of the words as well as the real use. The significance of the words in the Holy Qur'an came in a variety of contexts, making each word a special significance that belongs to it exclusively. This is the miracle of the Holy Qur'an. The coming of the slow walk, with its eight words (came, came, cam
Cryptography is a method used to mask text based on any encryption method, and the authorized user only can decrypt and read this message. An intruder tried to attack in many manners to access the communication channel, like impersonating, non-repudiation, denial of services, modification of data, threatening confidentiality and breaking availability of services. The high electronic communications between people need to ensure that transactions remain confidential. Cryptography methods give the best solution to this problem. This paper proposed a new cryptography method based on Arabic words; this method is done based on two steps. Where the first step is binary encoding generation used t
... Show More