Beyond the immediate content of speech, the voice can provide rich information about a speaker's demographics, including age and gender. Estimating a speaker's age and gender offers a wide range of applications, spanning from voice forensic analysis to personalized advertising, healthcare monitoring, and human-computer interaction. However, pinpointing precise age remains intricate due to age ambiguity. Specifically, utterances from individuals at adjacent ages are frequently indistinguishable. Addressing this, we propose a novel, end-to-end approach that deploys Mozilla's Common Voice dataset to transform raw audio into high-quality feature representations using Wav2Vec2.0 embeddings. These are then channeled into our self-attention-based convolutional neural network (CNN) model. To address age ambiguity, we evaluate the effects of different loss functions such as focal loss and Kullback-Leibler (KL) divergence loss. Additionally, we evaluate the accuracy of the estimation at different durations of speech. Experimental results from the Common Voice dataset underscore the efficacy of our approach, showcasing an accuracy of 87% for male speakers, 91% for female speakers and 89% overall accuracy, and an accuracy of 99.1% for gender prediction.
In this work laser detection and tracking system (LDTS) is designed and implemented using a fuzzy logic controller (FLC). A 5 mW He-Ne laser system and an array of nine PN photodiodes are used in the detection system. The FLC is simulated using MATLAB package and the result is stored in a lock up table to use it in the real time operation of the system. The results give a good system response in the target detection and tracking in the real time operation.
Speech is the ability of communication or expression of thoughts among people in spoken words. Human communication via speech is essential since any impairment in this process may have serious social and occupational consequences. Malocclusion is a possible cause of speech impairment in addition to many other etiological factors like hearing loss, neurological disorders, physical disorders, and drug abuse. This article throws light upon the association between speech disorders and malocclusion.
This piece of research deals with assimilation as one of the phonological processes in the language. It is a trial to give more attention to this important process in English language with deep explanation to its counterpart in Arabic. in addition, this study sheds light on the points of similarities and differences concerning this process in the two languages. Assimilation in English means two sounds are involved, and one becomes more like the other.
The assimilating phoneme picks up one or more of the features of another nearby phoneme. The English phoneme /n/ has t
... Show MoreHow I was eager to research the ruling on three of the most dangerous types to Islam and Muslims (the heretic, the sorcerer, the innovator, and related terms).
Because it is the most dangerous deadly disease that destroys the hearts of Muslims, and may even expel a Muslim from the circle of Islam, and how many Muslims have done or committed such a thing without knowing it. Indeed, how many Muslims have left Islam and whose wife has abandoned him without realizing it, and among them are those who have committed it without knowing it. As well as related words associated with heresy.( )
Because people debated such matters between extremists and lenient ones, most of whom were extremists, and they did not reach a conclusion. So I decid
Alms (or Zakat) is one of the Pillar of Islam and it was atask imposed on
Muslims. Becomes of the importance of this task and its influence on the human
Psychic in particular and on the Society in general this study aims at Studying the
words that it refers to in the Holy Quran, At the beginning the researcher has
introduced the words it refers to, and the significance of each in the Holy Quran and
the Speciality of each one of such words, then the Structures they donet have been
also introduced, whether such structures are descriptive, adverbial or verbal.This was
introduced in addition to explaining the influence of changing the Shape of such
words in emphasizing the meaning and the influence of Portraiting styl
A simple, rapid, sensitive and inexpensive approach is described in this work based on a combination of solid‐phase extraction of 8‐hydroxyquinoline (8HQ), for speciation and preconcentration of Cr(III) and Cr(VI) in river water, and the direct determination of these species using a flow injection system with chemiluminescence detection (FI–CL) and a 4‐diethylamino phenyl hydrazine (DEAPH)–hydrogen peroxide system. At different pH, the two forms of chromium [Cr(III) and Cr(VI)] have different exchange capacities for 8HQ, therefore two columns were constructed; the pH of column 1 was adjusted to pH 3 for retaining Cr(III) and column 2 was adjusted to pH 1 for retaining of Cr(VI). The sorbe
Standardized uptake values, often known as SUVs, are frequently utilized in the process of measuring 18F-fluorodeoxyglucose (FDG) uptake in malignancies . In this work, we investigated the relationships between a wide range of parameters and the standardized uptake values (SUV) found in the liver. Examinations with 18F-FDG PET/CT were performed on a total of 59 patients who were suffering from liver cancer. We determined the SUV in the liver of patients who had a normal BMI (between 18.5 and 24.9) and a high BMI (above 30) obese. After adjusting each SUV based on the results of the body mass index (BMI) and body surface area (BSA) calculations, which were determined for each patient based on their height and weight. Under a variety of dif
... Show More