Deep Learning and Fusion Techniques for High-Precision Image Matting:

Liqaa M.  Shoohi; Jamila H.  Saud

Details

Publication Date

Thu Mar 13 2025

Journal Name

Academia Open

Volume

10

Issue Number

1

Choose Citation Style

Statistics

View publication

20

Statistics

Deep Learning and Fusion Techniques for High-Precision Image Matting:

Deep image matting

computer vision

deep learning

fusion techniques

U-net

Liqaa M. Shoohi

Jamila H. Saud

...Show More Authors

General Background: Deep image matting is a fundamental task in computer vision, enabling precise foreground extraction from complex backgrounds, with applications in augmented reality, computer graphics, and video processing. Specific Background: Despite advancements in deep learning-based methods, preserving fine details such as hair and transparency remains a challenge. Knowledge Gap: Existing approaches struggle with accuracy and efficiency, necessitating novel techniques to enhance matting precision. Aims: This study integrates deep learning with fusion techniques to improve alpha matte estimation, proposing a lightweight U-Net model incorporating color-space fusion and preprocessing. Results: Experiments using the AdobeComposition-1k dataset demonstrate superior performance compared to traditional methods, achieving higher accuracy, faster processing speed, and improved boundary preservation. Novelty: The proposed model effectively combines deep learning with fusion techniques, enhancing matting quality while maintaining robustness across various environmental conditions. Implications: These findings highlight the potential of integrating fusion techniques with deep learning for image matting, offering valuable insights for future research in automated image processing applications, including augmented reality, gaming, and interactive video technologies. Highlights:   Better Precision: Fusion techniques enhance fine detail preservation. Faster Processing: Lightweight U-Net improves speed and accuracy. Wide Applications: Useful for AR, gaming, and video processing.   Keywords: Deep image matting, computer vision, deep learning, fusion techniques, U-Net

View Publication Preview PDF

Quick Preview PDF

Publication Date

Fri Mar 18 2022

Journal Name

Aro-the Scientific Journal Of Koya University

Detecting Deepfakes with Deep Learning and Gabor Filters

Wildan Jameel

Suhad Malallah

Ayad Rodhan

...Show More Authors

The proliferation of many editing programs based on artificial intelligence techniques has contributed to the emergence of deepfake technology. Deepfakes are committed to fabricating and falsifying facts by making a person do actions or say words that he never did or said. So that developing an algorithm for deepfakes detection is very important to discriminate real from fake media. Convolutional neural networks (CNNs) are among the most complex classifiers, but choosing the nature of the data fed to these networks is extremely important. For this reason, we capture fine texture details of input data frames using 16 Gabor filters indifferent directions and then feed them to a binary CNN classifier instead of using the red-green-blue

View Publication

(9)

(1)

Publication Date

Sat Apr 15 2023

Journal Name

Journal Of Robotics

A New Proposed Hybrid Learning Approach with Features for Extraction of Image Classification

support vector machine (SVM)

image classification

hybrid learning

VGG-16 deep learning model

Mohanad

Muna

Dheyaa

...Show More Authors

Image classification is the process of finding common features in images from various classes and applying them to categorize and label them. The main problem of the image classification process is the abundance of images, the high complexity of the data, and the shortage of labeled data, presenting the key obstacles in image classification. The cornerstone of image classification is evaluating the convolutional features retrieved from deep learning models and training them with machine learning classifiers. This study proposes a new approach of “hybrid learning” by combining deep learning with machine learning for image classification based on convolutional feature extraction using the VGG-16 deep learning model and seven class

View Publication

(5)

(4)

Publication Date

Wed Jul 06 2022

Journal Name

Journal Of Al-qadisiyah For Computer Science And Mathematics

Pixel Based Techniques for Gray Image Compression: A review

Zahraa. H.

Ghadah K.

...Show More Authors

Currently, with the huge increase in modern communication and network applications, the speed of transformation and storing data in compact forms are pressing issues. Daily an enormous amount of images are stored and shared among people every moment, especially in the social media realm, but unfortunately, even with these marvelous applications, the limited size of sent data is still the main restriction's, where essentially all these applications utilized the well-known Joint Photographic Experts Group (JPEG) standard techniques, in the same way, the need for construction of universally accepted standard compression systems urgently required to play a key role in the immense revolution. This review is concerned with Different

View Publication

(1)

Publication Date

Sun Jun 20 2021

Journal Name

Baghdad Science Journal

Arabic Speech Classification Method Based on Padding and Deep Learning Neural Network

Arabic alphabet

deep learning

speech classification

COVID-19

spectrogram

Asroni

Ku Ruhana

Cahya

Hasan Basri

...Show More Authors

Deep learning convolution neural network has been widely used to recognize or classify voice. Various techniques have been used together with convolution neural network to prepare voice data before the training process in developing the classification model. However, not all model can produce good classification accuracy as there are many types of voice or speech. Classification of Arabic alphabet pronunciation is a one of the types of voice and accurate pronunciation is required in the learning of the Qur’an reading. Thus, the technique to process the pronunciation and training of the processed data requires specific approach. To overcome this issue, a method based on padding and deep learning convolution neural network is proposed to

View Publication Preview PDF

(18)

(2)

Publication Date

Mon Oct 02 2023

Journal Name

Journal Of Engineering

Microgrid Integration Based on Deep Learning NARMA-L2 Controller for Maximum Power Point Tracking

Microgrid

Solar PV

HER

Maximum power point tracking

Deep learning

PO-MPPT

INC-MPPT

Enas Hamid

Nadia Qasim

Hanan Mikhael D.

...Show More Authors

This paper presents a hybrid energy resources (HER) system consisting of solar PV, storage, and utility grid. It is a challenge in real time to extract maximum power point (MPP) from the PV solar under variations of the irradiance strength. This work addresses challenges in identifying global MPP, dynamic algorithm behavior, tracking speed, adaptability to changing conditions, and accuracy. Shallow Neural Networks using the deep learning NARMA-L2 controller have been proposed. It is modeled to predict the reference voltage under different irradiance. The dynamic PV solar and nonlinearity have been trained to track the maximum power drawn from the PV solar systems in real time.

Moreover, the proposed controller i

View Publication Preview PDF

Publication Date

Wed Mar 15 2023

Journal Name

International Journal Of Advances In Intelligent Informatics

An automatic lip reading for short sentences using deep learning nets

Maha Abd

Kadhim

...Show More Authors

One study whose importance has significantly grown in recent years is lip-reading, particularly with the widespread of using deep learning techniques. Lip reading is essential for speech recognition in noisy environments or for those with hearing impairments. It refers to recognizing spoken sentences using visual information acquired from lip movements. Also, the lip area, especially for males, suffers from several problems, such as the mouth area containing the mustache and beard, which may cover the lip area. This paper proposes an automatic lip-reading system to recognize and classify short English sentences spoken by speakers using deep learning networks. The input video extracts frames and each frame is passed to the Viola-Jone

View Publication

(6)

(3)

Publication Date

Thu Dec 16 2021

Journal Name

Translational Vision Science & Technology

A Hybrid Deep Learning Construct for Detecting Keratoconus From Corneal Maps

Ali H.

Zahraa M.

Zaid

Alexandru

Marcelo M.

Rossen M.

Siamak

...Show More Authors

View Publication

(31)

Publication Date

Fri Apr 30 2021

Journal Name

Al-kindy College Medical Journal

The Role of MRI-US Fusion Techniques in Detection of Clinically Significant Prostate Cancer

Prostate cancer

Magnetic Resonance Imaging

US-MRI fusion techniques

Samir

...Show More Authors

Prostate cancer is the commonest male cancer and the second leading cause of cancer-related death in men. Over many decades, prostate cancer detection represented a continuous challenge to urologists. Although all urologists and pathologists agree that tissue diagnosis is essential especially before commencing active surgical or radiation treatment, the best way to obtain the biopsy was always the big hurdle. The heterogenicity of the tumor pathology is very well seen in its radiological appearance. Ultrasound has been proven to be of limited sensitivity and specificity in detecting prostate cancer. However, it was the only available targeting technique for years and was used to guide biopsy needle passed transrectally or transperineally

View Publication Preview PDF

Publication Date

Thu Aug 07 2025

Journal Name

Journal Of Image And Graphics

Analysis Evolution of Image Caption Techniques: Combining Conventional and Modern Methods for Improvement

Convolutional Neural Networks (CNN)

image caption

conventional methods

modern methods

hybrid approach

Nuha M.

Nada

...Show More Authors

This study explores the challenges in Artificial Intelligence (AI) systems in generating image captions, a task that requires effective integration of computer vision and natural language processing techniques. A comparative analysis between traditional approaches such as retrieval- based methods and linguistic templates) and modern approaches based on deep learning such as encoder-decoder models, attention mechanisms, and transformers). Theoretical results show that modern models perform better for the accuracy and the ability to generate more complex descriptions, while traditional methods outperform speed and simplicity. The paper proposes a hybrid framework that combines the advantages of both approaches, where conventional methods prod

View Publication Preview PDF

Publication Date

Fri Jul 18 2014

Journal Name

International Journal Of Computer Applications

3-Level Techniques Comparison based Image Recognition

3-level Techniques

image recognition

stationary wavelet transform

wavelet transform

feature extraction.

Zainab

Ahlam

...Show More Authors

Image recognition is one of the most important applications of information processing, in this paper; a comparison between 3-level techniques based image recognition has been achieved, using discrete wavelet (DWT) and stationary wavelet transforms (SWT), stationary-stationary-stationary (sss), stationary-stationary-wavelet (ssw), stationary-wavelet-stationary (sws), stationary-wavelet-wavelet (sww), wavelet-stationary- stationary (wss), wavelet-stationary-wavelet (wsw), wavelet-wavelet-stationary (wws) and wavelet-wavelet-wavelet (www). A comparison between these techniques has been implemented. according to the peak signal to noise ratio (PSNR), root mean square error (RMSE), compression ratio (CR) and the coding noise e (n) of each third

View Publication

1 2 ... 4 5 6 7 ... 1822 1823