An Overview of Audio-Visual Source Separation Using Deep Learning

Noorulhuda Mudhafar Sulaiman; Ahmed Al Tmeme; Mohammed Najah Mahdi

doi:10.22153/kej.2023.06.003

Details

Publication Date

Fri Dec 01 2023

Journal Name

Al-khwarizmi Engineering Journal

Volume

19

Issue Number

4

This article presents a general overview of deep learning-based audio-visual source separation (AVSS) systems. AVSS has achieved exceptional results in several areas, including noise reduction, improved speech recognition, and enhanced audio quality. The review surveys recent AVSS experiments and discusses the advantages and disadvantages of each deep learning model. It also summarizes datasets that can be used to test AVSS systems, including the TCD TIMIT dataset (high-quality audio and video recordings created specifically for speech recognition tasks) and the VoxCeleb dataset (a sizable collection of brief audio-visual clips of human speech). Overall, this review aims to highlight the growing importance of AVSS in improving the quality of audio signals.

Publication Date

Mon Mar 09 2026

Journal Name

Journal Of Asian Architecture And Building Engineering

Visual storytelling and place-based learning: a generative approach to architectural cultural awareness

visual storytelling

place-based education

cultural awareness

qualitative coding (MAXQDA)

al-Madrasah Al-Mustansiriya

Amal

Hoda A. S.

In architectural learning, it is difficult to stimulate cultural awareness through traditional educational approaches, which results in historic places being neglected as knowledge sources. This research explores the premise that sketch-based visual storytelling may act as a generative approach to connect cognition, emotion, and behavior in historical contexts. The study adopts a qualitative methodology to explore a learning experience comprising two phases, the first in a formal educational setting and the second in a historical and cultural context, with the aim of investigating the role of sketch-based storytelling in enhancing cultural awareness. MAXQDA was employed to code the students’ storyboards on three levels of cultural awareness, m

Publication Date

Fri Dec 01 2023

Journal Name

Applied Energy

Deep clustering of Lagrangian trajectory for multi-task learning to energy saving in intelligent buildings using cooperative multi-agent

Jasim

Intelligent buildings provided various incentives to get highly inefficient energy-saving caused by the non-stationary building environments. In the presence of such dynamic excitation, with higher levels of nonlinearity and a coupling effect between temperature and humidity, the HVAC system transitions from underdamped to overdamped indoor conditions. This led to highly inefficient energy use and fluctuating indoor thermal comfort. To address these concerns, this study develops a novel framework based on deep clustering of Lagrangian trajectories for multi-task learning (DCLTML) and adds a pre-cooling coil in the air handling unit (AHU) to alleviate the coupling issue. The proposed DCLTML exhibits great overall control and is

Publication Date

Sun Dec 02 2018

Journal Name

Arab Science Heritage Journal

A Look at the Art of Farming in Islam

عماد الدين عبد الرزاق

This research, titled "A Look at the Art of Farming in Islam" and written by Mehdi Mohaqiq, merits translation into Arabic because it introduces the arts and sciences of agriculture in the Islamic heritage. It deals with the following topics:

- The attention Muslim scholars paid to studying Greek works

- The concern of the caliphs, judges, and notables with agricultural affairs

Publication Date

Mon Aug 01 2022

Journal Name

International Journal Of Electrical And Computer Engineering (ijece)

A survey of deepfakes in terms of deep learning and multimedia forensics

Wildan Jameel

Suhad Malallah

Ayad Rodhan

Artificial intelligence techniques reach us in several forms, some of which are useful but can be exploited in ways that harm us. One of these forms is deepfakes. Deepfakes are used to completely modify video (or image) content so that it displays something that was not in it originally. The danger of deepfake technology lies in its impact on society through the loss of confidence in everything that is published. Therefore, in this paper, we focus on deepfake detection technology from the viewpoint of two concepts: deep learning and forensic tools. The purpose of this survey is to give the reader a deeper overview of i) the environment of deepfake creation and detection, ii) how deep learning and forensic tools contributed to the detection

Publication Date

Thu Nov 01 2012

Journal Name

Ijcsi International Journal Of Computer Science

Implementing a novel approach an convert audio compression to text coding via hybrid technique

Data Compression

Audio Compression

4-bit coding algorithm

6-bit coding algorithm.

Omar Adil

Mazin Abed

Ahmed Jasim

Compression is the reduction in size of data in order to save space or transmission time. For data transmission, compression can be performed on just the data content or on the entire transmission unit (including header data), depending on a number of factors. In this study, we consider an audio compression method based on text coding, in which an audio file is converted to a text file to reduce the time needed to transfer data over a communication channel. Approach: we propose two coding methods that are applied to optimize the solution using CFG. Results: we tested our application using a 4-bit coding algorithm; the results of this method were not satisfactory, so we proposed a new approach to compress audio fil

Publication Date

Tue Jun 15 2021

Journal Name

Iraqi Journal Of Pharmaceutical Sciences ( P-issn 1683 - 3597 E-issn 2521 - 3512)

Natural Products as A Promising Therapy for SARS COV-2; An Overview

SARS CoV-2

Remdesivir

Antiviral

Phytochemical

Interleukin.

Noor S.

Iman S.

The recently emerged SARS CoV-2 pandemic has gripped our world since December 2019. Continuous efforts have been made to find effective immunization and precise treatment strategies. Apart from the therapeutic options that were tried in SARS CoV-2, increased attention is directed toward natural products, mainly phytochemicals, as collaborative measures for this crisis. In this review, most of the mentioned compounds, especially flavonoids (baicalin, hesperidin, quercetin, luteolin) and phenolics (resveratrol, curcumin, and theaflavin), exert their effect by interfering with the action of one or more of these proteins (spike protein, papain-like protease, 3-chymotrypsin-like cysteine protease, and RNA dependent RNA

Publication Date

Sat Apr 01 2023

Journal Name

The Ocular Surface

Detecting dry eye from ocular surface videos based on deep learning

Hazem

Rossen

Suphi

Ali

Alexandru

Hidenori

Siamak

Publication Date

Thu Dec 16 2021

Journal Name

Translational Vision Science & Technology

A Hybrid Deep Learning Construct for Detecting Keratoconus From Corneal Maps

Ali H.

Zahraa M.

Zaid

Alexandru

Marcelo M.

Rossen M.

Siamak

Publication Date

Mon Nov 26 2018

Journal Name

Arab Science Heritage Journal

A New Look at the Main Theme of Surat al-Baqarah

محمد خاقاني

There is a long-established discussion among Quran interpreters that deals with interrelationship and Quran systems. Although many interpreters and Quran experts assume that there is a link between the parts of a verse and its adjacent verses, there is a lot of disagreement about other aspects of such interlinks and interrelationships. Some reject the thematic unity of verses, and others reject the relationship between a Surah and its subject. Such disagreements and discussions therefore call for perspectives that take all aspects and points of view into consideration.

Publication Date

Sat Dec 01 2018

Journal Name

Al-nahrain Journal Of Science

Image Classification Using Bag of Visual Words (BoVW)

SIFT

Euclidean distance

classification

k-nearest neighbor

Bag of Visual Words.

Rafal

This paper presents two main stages for image classification. The training stage consists of collecting images of interest and applying BoVW to these images (feature extraction and description using SIFT, followed by vocabulary generation), while the testing stage classifies a new unlabeled image using the nearest neighbor classification method on the feature descriptors. The supervised bag of visual words gives good results, shown clearly in the experimental part, where unlabeled images are classified correctly even though only a small number of images are used in the training process.
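
The two stages described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: hand-written 2-D vectors stand in for 128-D SIFT descriptors, k-means with a farthest-point start is assumed for vocabulary generation (the abstract does not name the method), and a 1-nearest-neighbor rule with Euclidean distance is applied to the word histograms. All function names and the toy data are hypothetical.

```python
import math
from collections import Counter


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def nearest(vec, candidates):
    """Index of the candidate vector closest to vec."""
    return min(range(len(candidates)), key=lambda i: euclidean(vec, candidates[i]))


def build_vocabulary(descriptors, k, iters=10):
    """Cluster descriptors into k visual words (assumed: plain k-means)."""
    centers = [list(descriptors[0])]
    while len(centers) < k:
        # Deterministic farthest-point start: take the descriptor that is
        # farthest from its closest already-chosen center.
        nxt = max(descriptors, key=lambda d: min(euclidean(d, c) for c in centers))
        centers.append(list(nxt))
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for d in descriptors:
            clusters[nearest(d, centers)].append(d)
        for i, members in enumerate(clusters):
            if members:  # keep the old center if a cluster is empty
                centers[i] = [sum(col) / len(members) for col in zip(*members)]
    return centers


def bovw_histogram(descriptors, vocab):
    """Normalized count of how often each visual word occurs in one image."""
    counts = Counter(nearest(d, vocab) for d in descriptors)
    return [counts[i] / len(descriptors) for i in range(len(vocab))]


def classify(hist, train_hists, train_labels):
    """Label of the training histogram nearest to hist (1-NN)."""
    return train_labels[nearest(hist, train_hists)]


# Toy training set: each image is (list of stand-in descriptors, label).
train = [
    ([[0, 0], [0, 1], [1, 0], [10, 10]], "edges"),
    ([[1, 1], [0, 1], [1, 0], [9, 10]], "edges"),
    ([[10, 10], [9, 9], [10, 9], [0, 0]], "blobs"),
    ([[9, 10], [10, 9], [9, 9], [1, 1]], "blobs"),
]

# Training stage: pool all descriptors, build the vocabulary, encode images.
pooled = [d for descs, _ in train for d in descs]
vocab = build_vocabulary(pooled, k=2)
train_hists = [bovw_histogram(descs, vocab) for descs, _ in train]
train_labels = [label for _, label in train]

# Testing stage: encode a new unlabeled image and assign the nearest label.
query = [[0, 1], [1, 1], [0, 0], [10, 9]]
predicted = classify(bovw_histogram(query, vocab), train_hists, train_labels)
print(predicted)
```

With real images, the stand-in vectors would be replaced by descriptors from a SIFT extractor, and the vocabulary size would typically be far larger than two words.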
