Beyond the immediate content of speech, the voice carries rich information about a speaker's demographics, including age and gender. Estimating a speaker's age and gender has a wide range of applications, from voice forensic analysis to personalized advertising, healthcare monitoring, and human-computer interaction. However, estimating precise age remains difficult because of age ambiguity: utterances from individuals of adjacent ages are frequently indistinguishable. To address this, we propose a novel end-to-end approach, trained on Mozilla's Common Voice dataset, that transforms raw audio into high-quality feature representations using Wav2Vec 2.0 embeddings. These are then fed into our self-attention-based convolutional neural network (CNN) model. To address age ambiguity, we evaluate the effects of different loss functions, such as focal loss and Kullback-Leibler (KL) divergence loss. Additionally, we evaluate the accuracy of the estimation for different durations of speech. Experimental results on the Common Voice dataset demonstrate the efficacy of our approach, with an age-estimation accuracy of 87% for male speakers, 91% for female speakers, and 89% overall, and an accuracy of 99.1% for gender prediction.
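The two loss functions named above can be illustrated with a minimal, framework-free sketch. It is not the authors' implementation: the Gaussian soft label in `soft_age_target`, the `gamma` and `sigma` values, and all function names are assumptions chosen only to show how a KL divergence loss can tolerate confusion between adjacent age classes while focal loss down-weights easy examples.

```python
import math

def softmax(logits):
    """Convert raw scores into a probability distribution."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def focal_loss(probs, target, gamma=2.0):
    """Focal loss for one example: -(1 - p_t)^gamma * log(p_t)."""
    p_t = max(probs[target], 1e-12)
    return -((1.0 - p_t) ** gamma) * math.log(p_t)

def soft_age_target(num_classes, target, sigma=1.0):
    """Hypothetical soft label: a Gaussian bump centred on the true age class."""
    weights = [math.exp(-((k - target) ** 2) / (2 * sigma ** 2))
               for k in range(num_classes)]
    total = sum(weights)
    return [w / total for w in weights]

def kl_divergence(target_dist, probs):
    """KL(target || predicted), skipping target entries with zero mass."""
    return sum(t * math.log(t / max(p, 1e-12))
               for t, p in zip(target_dist, probs) if t > 0)

# Example: six age classes, true class 2, prediction leaning towards class 3.
probs = softmax([0.1, 0.5, 1.5, 2.0, 0.3, 0.0])
print(focal_loss(probs, target=2))
print(kl_divergence(soft_age_target(6, target=2), probs))
```

With `gamma=0` the focal loss reduces to ordinary cross-entropy, which makes it easy to compare the two settings on the same predictions.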
Discourse is defined as a message directed by the sender to another party, the recipient. Its aim is to convey, clarify, or explain a particular point or subject. It may take the form of direct oral communication through speech, in which case the recipient can engage with the sender directly to exchange ideas, or it may be written, in which case it does not require direct interaction between the speaker and the recipient. Because discourse differs in its sources and topics, in the categories of audience it addresses, and in their number, it has been divided into several types.
Schools of discourse analysis emerged in the early 1980s and have spread and ha
This study examined the site of the placenta and the umbilical artery blood flow in different age groups and related them to the newborn baby and the mother. A total of 117 placentae were investigated using ultrasound and 30 placentae were studied using Doppler ultrasound during the period from August 2007 to August 2008, for full-term placentae of mothers aged 15 to over 45 years. Ultrasound showed that, in a good percentage of cases, the placenta was sited on the posterior wall of the uterus for male babies and on the anterior wall for female babies, and that previa and fundal sites were more common with female than with male babies. The Doppler ultrasound revealed that a mother in any age group can conceive and have a healthy placenta because the readings in a
Rap songs often feature artists who utilize explicit language to convey feelings such as happiness, sorrow, and anger, reflecting audience expectations and trends within the music industry. This study intends to conduct a socio-pragmatic analysis of explicit, derogatory, and offensive language in the songs of the American artist Doja Cat, employing Hughes’ (1996) Swearing Word Theory, Jay’s (1996) Taboo Words Theory, Luhr’s (2002) classification of social factors for sociolinguistic examination, Salager’s (1997) categories of hedges for pragmatic assessment, and Austin’s (1965, 1989) theory of speech acts. The researchers collected the data using the AntConc corpus analysis tool. The data shows the singer’s frequent use
The research aims to develop a proposed mechanism for financial reporting on sustainable investment that takes into account the specific nature of these investments.
To achieve this goal, the researcher used a what-if scenario in which the future financial statements were prepared for the year 2026, after the sustainable project is completed and in operation, as the project requires four years to complete.
The researcher relied on the findings of other researchers, collected from various recent sources relevant to the research topic and published on the internet, and on the financial data and information obtained to assess the reality of the company's activity and its environmental, social, and economic i
This study focused on the various forms of violence that older people encounter in late life, the significance of measures that enhance their welfare, the influence of the early stages of life, and how old age affects people's experiences throughout their lives. It also stressed the importance of raising community awareness of the seriousness of old age in order to teach children the right habits and traditions, which have faded over time. The researcher indicated that expanding avenues of collaboration among family members, social organizations, and government institutions will support this group of people and provide them with good care.
Detecting moving objects and subtracting them from the background is one of the most important areas of video analysis. The development of cameras and their widespread use in security, surveillance, and other fields have made this problem increasingly pressing. The main difficulty lies in the unstable classification of pixels as foreground or background. This paper proposes a background subtraction algorithm based on the histogram. The classification threshold is calculated adaptively according to many tests. The performance of the proposed algorithm was compared with state-of-the-art methods in complex dynamic scenes.
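The general idea of histogram-based background subtraction with an adaptive threshold can be sketched as follows. This is an illustration under stated assumptions, not the paper's algorithm, whose details the abstract does not give: the per-pixel mode-bin background model, the use of Otsu's method to choose the threshold, the `min_diff` floor, and all function names are hypothetical choices. Frames are assumed to be grayscale images stored as lists of rows of integers in the range 0 to 255.

```python
def background_from_histograms(frames, bins=16, levels=256):
    """Per pixel, use the centre of the most frequent intensity bin across frames."""
    height, width = len(frames[0]), len(frames[0][0])
    bin_width = levels // bins
    background = [[0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            hist = [0] * bins
            for frame in frames:
                hist[min(frame[y][x] // bin_width, bins - 1)] += 1
            mode_bin = max(range(bins), key=hist.__getitem__)
            background[y][x] = mode_bin * bin_width + bin_width // 2
    return background

def otsu_threshold(values, levels=256):
    """Otsu's method: pick the threshold that maximises between-class variance."""
    hist = [0] * levels
    for v in values:
        hist[v] += 1
    total = len(values)
    sum_all = sum(i * h for i, h in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0, sum0 = 0, 0
    for t in range(levels):
        w0 += hist[t]
        if w0 == 0:
            continue
        w1 = total - w0
        if w1 == 0:
            break
        sum0 += t * hist[t]
        mean0 = sum0 / w0
        mean1 = (sum_all - sum0) / w1
        var_between = w0 * w1 * (mean0 - mean1) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t

def foreground_mask(frame, background, min_diff=10):
    """Mark pixels whose difference from the background exceeds the threshold.

    min_diff is a hypothetical floor that keeps small noise in a static
    scene from being classified as foreground.
    """
    diffs = [[abs(p - b) for p, b in zip(row, brow)]
             for row, brow in zip(frame, background)]
    threshold = max(otsu_threshold([d for row in diffs for d in row]), min_diff)
    return [[1 if d > threshold else 0 for d in row] for row in diffs]

# Example: a static 4x4 scene, then one bright pixel appears.
history = [[[100] * 4 for _ in range(4)] for _ in range(5)]
current = [[100] * 4 for _ in range(4)]
current[1][2] = 220
mask = foreground_mask(current, background_from_histograms(history))
print(mask)
```

Because the threshold is recomputed from each frame's difference values, it adapts to the scene rather than being fixed in advance; a production system would also need to update the background model over time.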
