An Overview of Audio-Visual Source Separation Using Deep Learning

Noorulhuda Mudhafar Sulaiman; Ahmed Al Tmeme; Mohammed Najah  Mahdi

doi:10.22153/kej.2023.06.003

Details

Publication Date

Fri Dec 01 2023

Journal Name

Al-khwarizmi Engineering Journal

Volume

19

Issue Number

4

DOI

10.22153/kej.2023.06.003

Choose Citation Style

Statistics

View publication

14

View original publication

1

Click abstract more

1

Abstract Views

758

Galley Views

816

Statistics

(5)

(2)

An Overview of Audio-Visual Source Separation Using Deep Learning

Noorulhuda Mudhafar Sulaiman

Ahmed Al Tmeme

Mohammed Najah Mahdi

...Show More Authors

In this article, the research presents a general overview of deep learning-based AVSS (audio-visual source separation) systems. AVSS has achieved exceptional results in a number of areas, including decreasing noise levels, boosting speech recognition, and improving audio quality. The advantages and disadvantages of each deep learning model are discussed throughout the research as it reviews various current experiments on AVSS. The TCD TIMIT dataset (which contains top-notch audio and video recordings created especially for speech recognition tasks) and the Voxceleb dataset (a sizable collection of brief audio-visual clips with human speech) are just a couple of the useful datasets summarized in the paper that can be used to test AVSS systems. In its basic form, this review aims to highlight the growing importance of AVSS in improving the quality of audio signals.

View Publication Preview PDF

Quick Preview PDF

Publication Date

Sat Aug 30 2025

Journal Name

International Journal Of Social Sciences And English Literature

Critical Discourse Analysis of Online Platforms: An Overview

Nawal

Shahad Saad

Alham Fadhl

...Show More Authors

The rise of online platforms has transformed the discourse landscape, enabling users to create and share content actively, thereby shaping public perceptions and societal narratives. Understanding the dynamics of this discourse is essential for comprehending its socio-political implications. This review aims to provide a comprehensive overview of Critical Discourse Analysis (CDA) concerning online platforms, exploring how language is utilized across various digital contexts to influence identity formation and social inequalities. Methodologically, the review systematically searches electronic databases, including Google Scholar and ProQuest, using keywords related to CDA and online platforms. A total of 30 relevant studies are purpo

View Publication

(1)

Publication Date

Wed May 10 2023

Journal Name

Diagnostics

A Deep Feature Fusion of Improved Suspected Keratoconus Detection with Deep Learning

Ali H.

Laith

Zahraa M.

Hazem

Nebras H.

Alexandru

Rossen M.

Hidenori

Yuantong

Siamak

...Show More Authors

Detection of early clinical keratoconus (KCN) is a challenging task, even for expert clinicians. In this study, we propose a deep learning (DL) model to address this challenge. We first used Xception and InceptionResNetV2 DL architectures to extract features from three different corneal maps collected from 1371 eyes examined in an eye clinic in Egypt. We then fused features using Xception and InceptionResNetV2 to detect subclinical forms of KCN more accurately and robustly. We obtained an area under the receiver operating characteristic curves (AUC) of 0.99 and an accuracy range of 97–100% to distinguish normal eyes from eyes with subclinical and established KCN. We further validated the model based on an independent dataset with

View Publication

(36)

(32)

Publication Date

Mon Jan 09 2023

Journal Name

2023 15th International Conference On Developments In Esystems Engineering (dese)

Deep Learning-Based Skin Cancer Identification

Sandhua M

Abir

Dhiya

Basheera M.

Sadiq H.

...Show More Authors

View Publication

(7)

(4)

Publication Date

Tue Jul 01 2025

Journal Name

Mastering The Minds Of Machines

Deep Reinforcement Learning: Bridging Learning and Control in Intelligent Systems

Davut

Hung Vo

Raed Abu

Nada Khalil

Zhe

Canan Batur

Ali

Aseel

Laith

...Show More Authors

View Publication

(1)

Publication Date

Mon Jan 01 2024

Journal Name

Bio Web Of Conferences

Forecasting Cryptocurrency Market Trends with Machine Learning and Deep Learning

Fadhil H.M.

...Show More Authors

Cryptocurrency became an important participant on the financial market as it attracts large investments and interests. With this vibrant setting, the proposed cryptocurrency price prediction tool stands as a pivotal element providing direction to both enthusiasts and investors in a market that presents itself grounded on numerous complexities of digital currency. Employing feature selection enchantment and dynamic trio of ARIMA, LSTM, Linear Regression techniques the tool creates a mosaic for users to analyze data using artificial intelligence towards forecasts in real-time crypto universe. While users navigate the algorithmic labyrinth, they are offered a vast and glittering selection of high-quality cryptocurrencies to select. The

View Publication

(5)

(4)

Publication Date

Tue Jun 01 2021

Journal Name

Al-khwarizmi Engineering Journal

Effect of Environmental Factors on the Accuracy of a Quality Inspection System Based on Transfer Learning

Ahmed

Faiz F.

Wisam S.

...Show More Authors

In this research, a study is introduced on the effect of several environmental factors on the performance of an already constructed quality inspection system, which was designed using a transfer learning approach based on convolutional neural networks. The system comprised two sets of layers, transferred layers set from an already trained model (DenseNet121) and a custom classification layers set. It was designed to discriminate between damaged and undamaged helical gears according to the configuration of the gear regardless to its dimensions, and the model showed good performance discriminating between the two products at ideal conditions of high-resolution images.

So, this study aimed at testing the system performance at poor s

View Publication Preview PDF

(1)

Publication Date

Sun Jun 07 2015

Journal Name

Baghdad Science Journal

Steganography in Audio Using Wavelet and DES

Steganography in Audio

Secret message

DES algorithm

LSB algorithm

Wavelet transform

Rasha H.

...Show More Authors

In this paper, method of steganography in Audio is introduced for hiding secret data in audio media file (WAV). Hiding in audio becomes a challenging discipline, since the Human Auditory System is extremely sensitive. The proposed method is to embed the secret text message in frequency domain of audio file. The proposed method contained two stages: the first embedding phase and the second extraction phase. In embedding phase the audio file transformed from time domain to frequency domain using 1-level linear wavelet decomposition technique and only high frequency is used for hiding secreted message. The text message encrypted using Data Encryption Standard (DES) algorithm. Finally; the Least Significant bit (LSB) algorithm used to hide secr

View Publication Preview PDF

Publication Date

Sun Nov 01 2020

Journal Name

Iop Conference Series: Materials Science And Engineering

Face Recognition and Emotion Recognition from Facial Expression Using Deep Learning Neural Network

Ali

Zubaidah

Zainab

...Show More Authors

Abstract<p>Face recognition, emotion recognition represent the important bases for the human machine interaction. To recognize the person’s emotion and face, different algorithms are developed and tested. In this paper, an enhancement face and emotion recognition algorithm is implemented based on deep learning neural networks. Universal database and personal image had been used to test the proposed algorithm. Python language programming had been used to implement the proposed algorithm.</p>

View Publication

(8)

(2)

Publication Date

Tue Aug 10 2021

Journal Name

Design Engineering

Lossy Image Compression Using Hybrid Deep Learning Autoencoder Based On kmean Clusteri

Image compression

Convolutional Autoencoder (CAE)

k-mean algorithm

PSNR

Compression Rate (CR)

MSE

Clustering

CLIC

Kodak

deep learning

lossy

Mohammed S. H.

...Show More Authors

Image compression plays an important role in reducing the size and storage of data while increasing the speed of its transmission through the Internet significantly. Image compression is an important research topic for several decades and recently, with the great successes achieved by deep learning in many areas of image processing, especially image compression, and its use is increasing Gradually in the field of image compression. The deep learning neural network has also achieved great success in the field of processing and compressing various images of different sizes. In this paper, we present a structure for image compression based on the use of a Convolutional AutoEncoder (CAE) for deep learning, inspired by the diversity of human eye

Publication Date

Fri Nov 21 2025

Journal Name

Journal Of Advances In Information Technology

Towards Accurate SDG Research Categorization: A Hybrid Deep Learning Approach Using Scopus Metadata

text classification

Sustainable Development Goals (SDGs)

deep learning

hybrid bidirectional Long Short-Term Memory-Convolutional Neural Network (LSTM-CNN)

Global Vector (GloVe) embeddings

Jalal Sadoon Hameed

Furat N.

Mohammed

...Show More Authors

The complexity and variety of language included in policy and academic documents make the automatic classification of research papers based on the United Nations Sustainable Development Goals (SDGs) somewhat difficult. Using both pre-trained and contextual word embeddings to increase semantic understanding, this study presents a complete deep learning pipeline combining Bidirectional Long Short-Term Memory (BiLSTM) and Convolutional Neural Network (CNN) architectures which aims primarily to improve the comprehensibility and accuracy of SDG text classification, thereby enabling more effective policy monitoring and research evaluation. Successful document representation via Global Vector (GloVe), Bidirectional Encoder Representations from Tra

View Publication Preview PDF

1 2 ... 5 6 7 8 ... 2673 2674