Hybrid Deep Learning Model for Singing Voice Separation

R. Amer; A. Al Tmeme

doi:10.13164/mendel.2021.2.044

Details

Publication Date

Tue Dec 21 2021

Journal Name

Mendel

Volume

27

DOI

10.13164/mendel.2021.2.044

Choose Citation Style

Statistics

View publication

25

Statistics

(4)

Hybrid Deep Learning Model for Singing Voice Separation

R. Amer

A. Al Tmeme

...Show More Authors

Monaural source separation is a challenging issue due to the fact that there is only a single channel available; however, there is an unlimited range of possible solutions. In this paper, a monaural source separation model based hybrid deep learning model, which consists of convolution neural network (CNN), dense neural network (DNN) and recurrent neural network (RNN), will be presented. A trial and error method will be used to optimize the number of layers in the proposed model. Moreover, the effects of the learning rate, optimization algorithms, and the number of epochs on the separation performance will be explored. Our model was evaluated using the MIR-1K dataset for singing voice separation. Moreover, the proposed approach achieves (4.81) dB GNSDR gain, (7.28) dB GSIR gain, and (3.39) dB GSAR gain in comparison to current approaches

View Publication

Publication Date

Fri Apr 14 2023

Journal Name

Journal Of Big Data

A survey on deep learning tools dealing with data scarcity: definitions, challenges, solutions, tips, and applications

Ali H.

...Show More Authors

Abstract<p>Data scarcity is a major challenge when training deep learning (DL) models. DL demands a large amount of data to achieve exceptional performance. Unfortunately, many applications have small or inadequate data to train DL frameworks. Usually, manual labeling is needed to provide labeled data, which typically involves human annotators with a vast background of knowledge. This annotation process is costly, time-consuming, and error-prone. Usually, every DL framework is fed by a significant amount of labeled data to automatically learn representations. Ultimately, a larger amount of data would generate a better DL model and its performance is also application dependent. This issue is the main barrier for</p> ... Show More

View Publication Preview PDF

(661)

(664)

Publication Date

Tue Oct 19 2021

Journal Name

Big Data Summit 2: Hpc & Ai Empowering Data Analytics 2018 | Conference Paper

Deep Bayesian for Opinion-target identification

Omar Mustafa

...Show More Authors

The use of deep learning.

View Publication

Publication Date

Thu Oct 01 2020

Journal Name

Journal Of Engineering Science And Technology

Automatic voice activity detection using fuzzy-neuro classifier

Defuzzification

Fuzzy clustering

Neural network

Speech signal

Voice activity detection

Suhaila N.

...Show More Authors

Voice Activity Detection (VAD) is considered as an important pre-processing step in speech processing systems such as speech enhancement, speech recognition, gender and age identification. VAD helps in reducing the time required to process speech data and to improve final system accuracy by focusing the work on the voiced part of the speech. An automatic technique for VAD using Fuzzy-Neuro technique (FN-AVAD) is presented in this paper. The aim of this work is to alleviate the problem of choosing the best threshold value in traditional VAD methods and achieves automaticity by combining fuzzy clustering and machine learning techniques. Four features are extracted from each speech segment, which are short term energy, zero-crossing rate, auto

View Publication Preview PDF

(7)

Publication Date

Fri Jan 01 2021

Journal Name

Artificial Intelligence For Covid-19

An Efficient Mixture of Deep and Machine Learning Models for COVID-19 and Tuberculosis Detection Using X-Ray Images in Resource Limited Settings

Ali H.

Rami N.

Zahraa M.

Javier

...Show More Authors

View Publication

(35)

(27)

Publication Date

Tue Oct 30 2018

Journal Name

Journal Of Engineering

Deep Oxidative Desulfurization of Model fuels by Prepared Nano TiO2 with Phosphotungstic acid

Oxidative desulfurization

Nanoparticles of TiO2

liquid-liquid-solid system

Basma Abbas

Sameera

Fadhil Abed

...Show More Authors

In this study, nano TiO₂ was prepared with titanium isopropoxide (TTIP) as a resource to titanium oxide. The catalyst was synthesized using phosphotungstic acid (PTA) and, stearyl trimethyl ammonium bromide (STAB) was used as the structure-directing material. Characterization of the product was done by the X-ray diffraction (XRD), X-ray fluorescent spectroscopy (XRF), nitrogen adsorption/desorption measurements, Atomic Force Microscope (AFM) and Fourier transform infrared (FTIR) spectra, were used to characterize the calcined TiO₂ nanoparticles by STAB and PWA. The TiO₂ nanomaterials were prepared in three crystalline forms (amorphous, anatase, anatase-rutile). The results showed that the

View Publication Preview PDF

(10)

Publication Date

Tue Oct 30 2018

Journal Name

Journal Of Engineering

Deep Oxidative Desulfurization of Model fuels by Prepared Nano TiO2 with Phosphotungstic acid

Basma Abbas

Sameera

Fadhil Abed

...Show More Authors

In this study, nano TiO2 was prepared with titanium isopropoxide (TTIP) as a resource to titanium oxide. The catalyst was synthesized using phosphotungstic acid (PTA) and, stearyl trimethyl ammonium bromide (STAB) was used as the structure-directing material. Characterization of the product was done by the X-ray diffraction (XRD), X-ray fluorescent spectroscopy (XRF), nitrogen adsorption/desorption measurements, Atomic Force Microscope (AFM) and Fourier transform infrared (FTIR) spectra, were used to characterize the calcined TiO2 nanoparticles by STAB and PWA. The TiO2 nanomaterials were prepared in three crystalline forms (amorphous, anatase, anatase-rutile). The results showed that the nanoparticles of anatase TiO2 have good cata

(10)

Publication Date

Wed Dec 30 2015

Journal Name

College Of Islamic Sciences

Development and innovation in the theory of singing In the Umayyad era

د. سجا جاسم

...Show More Authors

The Umayyad era is characterized by the diversity of the subjects and their multiplicity in the literary phenomena. These phenomena are singing phenomena, although they were known in previous eras, they took a distinctive form in the era.
In this light, the researcher tried to prove that singing theory in the Umayyad period was characterized by development and renewal. The research was entitled (evolution and renewal in the theory of singing in the Umayyad era).

View Publication Preview PDF

Publication Date

Sun May 22 2022

Journal Name

International Journal Of Early Childhood Special Education

The impact of using learning acceleration model on the achievement of mathematics for third intermediate grade students

Lina Fouad

Azraa Radi

...Show More Authors

The current study aims at identifying the impact of using learning acceleration model on the achievement of mathematics for third intermediategrade students. Forachieving this, the researchers chose the School (Al-Kholood Secondary School for Girls) affiliated to the General Directorate of Babylon Education / Hashemite Education Department for the academic year (2021/2021), The sample reached to (70) female students from the third intermediate grade, with (35) female students for each of the two research groups. The two researchers prepared an achievement test consisting of (25) objective items of multiple choice type, The psychometric properties of the test were confirmed, and after the completion of the experiment, the achievement test wa

Publication Date

Sat Jan 01 2022

Journal Name

International Journal Of Nonlinear Analysis And Applications

Human recognition by utilizing voice recognition and visual recognition

Deep learning Convolutional Neural Networks Human Recognition voice recognition visual recognition

Sukaina

Samera

Mahir

...Show More Authors

Audio-visual detection and recognition system is thought to become the most promising methods for many applications includes surveillance, speech recognition, eavesdropping devices, intelligence operations, etc. In the recent field of human recognition, the majority of the research be- coming performed presently is focused on the reidentification of various body images taken by several cameras or its focuses on recognized audio-only. However, in some cases these traditional methods can- not be useful when used alone such as in indoor surveillance systems, that are installed close to the ceiling and capture images right from above in a downwards direction and in some cases people don't look straight the cameras or it cannot be added in some

View Publication Preview PDF

Publication Date

Mon Feb 04 2019

Journal Name

Iraqi Journal Of Physics

Frequency analyses of human voice using fast Fourier transform

Human voice

larynx

fundamental frequency

Fast Fourier Transform

Frequency spectrum.

Jinan F.

...Show More Authors

Quantitative analysis of human voice has been subject of interest and the subject gained momentum when human voice was identified as a modality for human authentication and identification. The main organ responsible for production of sound is larynx and the structure of larynx along with its physical properties and modes of vibration determine the nature and quality of sound produced. There has been lot of work from the point of view of fundamental frequency of sound and its characteristics. With the introduction of additional applications of human voice interest grew in other characteristics of sound and possibility of extracting useful features from human voice. We conducted a study using Fast Fourier Transform (FFT) technique to analy

View Publication Preview PDF

(3)

1 2 ... 11 12 13 14 ... 728 729