Beyond the immediate content of speech, the voice can provide rich information about a speaker's demographics, including age and gender. Estimating a speaker's age and gender offers a wide range of applications, spanning from voice forensic analysis to personalized advertising, healthcare monitoring, and human-computer interaction. However, pinpointing precise age remains intricate due to age ambiguity. Specifically, utterances from individuals at adjacent ages are frequently indistinguishable. Addressing this, we propose a novel, end-to-end approach that deploys Mozilla's Common Voice dataset to transform raw audio into high-quality feature representations using Wav2Vec2.0 embeddings. These are then channeled into our self-attention-based convolutional neural network (CNN) model. To address age ambiguity, we evaluate the effects of different loss functions such as focal loss and Kullback-Leibler (KL) divergence loss. Additionally, we evaluate the accuracy of the estimation at different durations of speech. Experimental results from the Common Voice dataset underscore the efficacy of our approach, showcasing an accuracy of 87% for male speakers, 91% for female speakers and 89% overall accuracy, and an accuracy of 99.1% for gender prediction.
Abstract
This Research aims for harnessing critical and innovative thinking approaches besides innovative problem solving tools in pursuing continual quality improvement initiatives for the benefit of achieving operations results effectively in water treatment plants in Baghdad Water Authority. Case study has been used in fulfilling this research in the sadr city water treatment plant, which was chosen as a study sample as it facilitates describing and analyzing its current operational situation, collecting and analyzing its own data, in order to get its own desired improvement opportunity be done. Many statistical means and visual thinking promoting methods has been used to fulfill research task.
... Show MorePostmodern arguments, formed a critic case of what modernity brought in several levels. Postmodern practice was considered as a proactive case having amorphous concepts and features to what entiled as an intellectual trends postmodern philosophically and intellectually. But, what postmodernism architecture broughts in it essence, was not isolation from the intellectual context and entrepreneurship case, and it was not disconnecting from the intellectual and philosophical era of that period. Lliteratures and philosophical argument precede what (Robert Venturi) and (Charles A Jencks) had brought, albeit it was closer to critics and correction the path of modernity from crystallizing a direction that exceeds modrinity to wh
... Show MoreIndividuals across different industries, including but not limited to agriculture, drones, pharmaceuticals and manufacturing, are increasingly using thermal cameras to achieve various safety and security goals. This widespread adoption is made possible by advancements in thermal imaging sensor technology. The current literature provides an in-depth exploration of thermography camera applications for detecting faults in sectors such as fire protection, manufacturing, aerospace, automotive, non-destructive testing and structural material industries. The current discussion builds on previous studies, emphasising the effectiveness of thermography cameras in distinguishing undetectable defects by the human eye. Various methods for defect
... Show MorePavement crack and pothole identification are important tasks in transportation maintenance and road safety. This study offers a novel technique for automatic asphalt pavement crack and pothole detection which is based on image processing. Different types of cracks (transverse, longitudinal, alligator-type, and potholes) can be identified with such techniques. The goal of this research is to evaluate road surface damage by extracting cracks and potholes, categorizing them from images and videos, and comparing the manual and the automated methods. The proposed method was tested on 50 images. The results obtained from image processing showed that the proposed method can detect cracks and potholes and identify their severity levels wit
... Show MoreVoice Activity Detection (VAD) is considered as an important pre-processing step in speech processing systems such as speech enhancement, speech recognition, gender and age identification. VAD helps in reducing the time required to process speech data and to improve final system accuracy by focusing the work on the voiced part of the speech. An automatic technique for VAD using Fuzzy-Neuro technique (FN-AVAD) is presented in this paper. The aim of this work is to alleviate the problem of choosing the best threshold value in traditional VAD methods and achieves automaticity by combining fuzzy clustering and machine learning techniques. Four features are extracted from each speech segment, which are short term energy, zero-crossing rate, auto
... Show MoreDeconstructionism opened the door wide to multiple readings and restore the reader his authority that he lost in the modernism, thus became more able to decipher the plastic discourse through reconstruction according to what he wants or what the plastic discourse gives him of possibilities beyond consumerism and thus the author has been canceled. The problem of the current research is limited to the following question: does deconstructionism in postmodern arts have a role in teaching the artistic tasting for the learner? The aim of the current research is to reveal the deconstruction work mechanisms in postmodern arts and their role in teaching the artistic tasting for the learner. As for the theoretical framework, the first section focu
... Show MoreThe design of components subjected to contact stress as local compressive stress is important in engineering application especially in ball and socket Joining. Two kinds of contact stress are introduced in the ball and socket joint, the first is from normal contact while the other is from sliding contact. Although joining two long links (drive shaft in steering cars) will cause the effect of flexural and tensional buckling stress in hollow columns through the ball and socket ends on the failure condition of the joining mechanism. In this paper the consideration of the combined effect of buckling Load and contact stress on the ball and socket joints have been taken, epically on the stress distribution in the contact area. Different
... Show MoreGenerally, radiologists analyse the Magnetic Resonance Imaging (MRI) by visual inspection to detect and identify the presence of tumour or abnormal tissue in brain MR images. The huge number of such MR images makes this visual interpretation process, not only laborious and expensive but often erroneous. Furthermore, the human eye and brain sensitivity to elucidate such images gets reduced with the increase of number of cases, especially when only some slices contain information of the affected area. Therefore, an automated system for the analysis and classification of MR images is mandatory. In this paper, we propose a new method for abnormality detection from T1-Weighted MRI of human head scans using three planes, including axial plane, co
... Show More