Beyond the immediate content of speech, the voice can provide rich information about a speaker's demographics, including age and gender. Estimating a speaker's age and gender has a wide range of applications, spanning voice forensic analysis, personalized advertising, healthcare monitoring, and human-computer interaction. However, pinpointing a precise age remains difficult because of age ambiguity: utterances from individuals at adjacent ages are frequently indistinguishable. To address this, we propose a novel end-to-end approach, evaluated on Mozilla's Common Voice dataset, that transforms raw audio into high-quality feature representations using Wav2Vec2.0 embeddings. These are then fed into our self-attention-based convolutional neural network (CNN) model. To address age ambiguity, we evaluate the effects of different loss functions, such as focal loss and Kullback-Leibler (KL) divergence loss. Additionally, we evaluate the accuracy of the estimation at different durations of speech. Experimental results on the Common Voice dataset underscore the efficacy of our approach, showing an accuracy of 87% for male speakers, 91% for female speakers, and 89% overall, along with an accuracy of 99.1% for gender prediction.
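The two loss functions named in the abstract above can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The discrete age bins, the Gaussian soft labels (`soft_age_labels`, `sigma`), and the focusing parameter `gamma` are assumptions introduced here for illustration only.

```python
import numpy as np

def softmax(logits):
    # Numerically stable softmax over the last axis.
    z = logits - logits.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def focal_loss(logits, target, gamma=2.0):
    # Focal loss: cross-entropy down-weighted by (1 - p_t)^gamma,
    # so easy, confidently classified examples contribute less.
    p = softmax(logits)
    pt = p[np.arange(len(target)), target]
    return float(np.mean(-((1.0 - pt) ** gamma) * np.log(pt + 1e-12)))

def soft_age_labels(target, num_classes, sigma=1.0):
    # Hypothetical soft labels: a Gaussian centred on the true age bin,
    # so adjacent bins receive some probability mass (assumption).
    bins = np.arange(num_classes)
    w = np.exp(-0.5 * ((bins[None, :] - target[:, None]) / sigma) ** 2)
    return w / w.sum(axis=-1, keepdims=True)

def kl_loss(logits, soft_targets):
    # KL(soft_targets || softmax(logits)), averaged over the batch.
    p = softmax(logits)
    eps = 1e-12
    kl = soft_targets * (np.log(soft_targets + eps) - np.log(p + eps))
    return float(np.mean(np.sum(kl, axis=-1)))

if __name__ == "__main__":
    logits = np.array([[2.0, 1.0, 0.1], [0.5, 2.5, 0.3]])
    target = np.array([0, 1])
    print("focal loss:", focal_loss(logits, target))
    print("KL loss:", kl_loss(logits, soft_age_labels(target, 3)))
```

With `gamma=0` the focal loss reduces to ordinary cross-entropy. The soft-label KL loss penalizes a prediction in a neighbouring age bin less than one in a distant bin, which is one common way to express the age ambiguity the abstract describes.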
A mobile ad hoc network (MANET) is a temporary network formed by a collection of mobile nodes. Routing and broadcasting are the major operations in a MANET. Broadcasting in particular can lead to the broadcast storm problem if the forwarding mechanism is not properly designed. The challenges in a MANET are therefore to reduce broadcast redundancy and to provide a high delivery ratio under a high transmission error rate. In our proposed research, we introduce and investigate a new broadcasting mechanism called Dual Covered Broadcast. This method takes advantage of broadcast redundancy in order to improve the packet delivery ratio, especially under environments w
Recently, new trends in mosque architecture have appeared. These trends differ from traditional ones in characteristics at both the two- and three-dimensional levels. Traditional mosque architecture is affected by several factors, so the research problem is (lack of knowledge about the factors forming traditional mosque architecture and their effect on contemporary trends in mosque architecture). The research hypothesis is (the functional, aesthetic, and symbolic religious factors of style are more active in forming contemporary trends in mosque architecture than the religious and environmental factors). The research conclusion is that the symbolic functional factor is the most effective factor i
Modify Multi-Connect Architecture (MMCA) associative memory
Gender and culture are among the factors that influence how different types of communication, especially images, are understood and interpreted. The current study, which is part of a master's thesis, investigates the role of gender and culture in interpreting and understanding caricatures that deal with women's issues in Arab societies. To this end, the researchers adopted Barthes' (1957) concepts of denotation and connotation from his theory of mythologies, in addition to Langacker's (1987) theory of domains. The research concludes that the female subjects have better cognitive abilities in making use of the signs within the selected caricatures. The other finding the study reached is that the respondents
Sound in cinema and television occupies a large space in terms of use and expression. In addition to the functional role of sound elements such as dialogue, music, effects, and silence in shaping and supporting the narrative structure of the image in dramatic work, sound has today, in light of technical developments, become an aesthetic value in the structure and formulation of the contents and ideas presented in the work. Sound has also opened up a variety of forms to the makers of the work in its artistic use, which enhances the emotional and expressive dimension of the image, and the researcher, as a result of many new developments in the expression o
The Sound of the letter (ق) in the Contemporary Arabic Dialects