Improving Pre-trained CNN-LSTM Models for Image Captioning with Hyper-Parameter Optimization

Nuha M. Khassaf; Nada Hussein M. Ali

doi:10.48084/etasr.8455

Details

Publication Date

Wed Oct 09 2024

Journal Name

Engineering, Technology & Applied Science Research

Volume

14

Issue Number

5

DOI

10.48084/etasr.8455


CNN pre-trained models

LSTM

activation function

hyper-parameters

overfitting


Image captioning, the automatic generation of text that describes an image's visual content, has become feasible with advances in object recognition and image classification. Deep learning has received much interest from the scientific community and can be very useful in real-world applications. The proposed image captioning approach combines pre-trained Convolutional Neural Network (CNN) models with Long Short-Term Memory (LSTM) to generate image captions. The process has two stages. In the first stage, the CNN-LSTM models are trained with baseline hyper-parameters. In the second stage, the CNN-LSTM models are trained again after optimizing and adjusting the hyper-parameters of the first stage. The improvements include a new activation function, regular parameter tuning, and an improved learning rate in the later stages of training. Experimental results on the Flickr8k dataset showed a noticeable improvement in the second stage, with clear gains in the evaluation metrics BLEU-1 to BLEU-4, METEOR, and ROUGE-L. These gains confirmed the effectiveness of the changes and highlighted the importance of hyper-parameter tuning for the performance of CNN-LSTM models in image captioning tasks.
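The BLEU-1 to BLEU-4 scores named in the abstract can be illustrated with a minimal sketch. This is not the authors' evaluation code: it is a simplified sentence-level BLEU with uniform n-gram weights and no smoothing, and the function names (`ngrams`, `modified_precision`, `bleu`) are hypothetical.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return all contiguous n-grams of a token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def modified_precision(candidate, references, n):
    """Clipped n-gram precision: each candidate n-gram counts at most as
    often as it appears in the reference that contains it most often."""
    cand_counts = Counter(ngrams(candidate, n))
    if not cand_counts:
        return 0.0
    max_ref = Counter()
    for ref in references:
        for gram, count in Counter(ngrams(ref, n)).items():
            max_ref[gram] = max(max_ref[gram], count)
    clipped = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
    return clipped / sum(cand_counts.values())


def bleu(candidate, references, max_n):
    """Sentence-level BLEU-max_n with uniform weights and no smoothing
    (returns 0.0 as soon as any n-gram precision is zero)."""
    precisions = [modified_precision(candidate, references, n)
                  for n in range(1, max_n + 1)]
    if min(precisions) == 0.0:
        return 0.0
    c = len(candidate)
    # Brevity penalty uses the reference length closest to the candidate length.
    r = min((abs(len(ref) - c), len(ref)) for ref in references)[1]
    brevity_penalty = 1.0 if c > r else math.exp(1.0 - r / c)
    log_mean = sum(math.log(p) for p in precisions) / max_n
    return brevity_penalty * math.exp(log_mean)


# Usage: compare a generated caption with one reference caption.
generated = "a dog runs on grass".split()
reference = ["a dog runs on the grass".split()]
scores = {n: bleu(generated, reference, n) for n in range(1, 5)}
```

Corpus-level BLEU, as usually reported for Flickr8k, aggregates the clipped counts over all captions before taking the ratio, so the per-sentence values above are only indicative.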


Publication Date

Fri Jul 21 2023

Journal Name

Journal Of Engineering

Implementation of Digital Image processing in Calculating Normal Approach for Spherical Indenter Considering Elastic/Plastic Contact

Rana Abdul Rahman

Ahmed


In this work, the normal approach between two bodies, a spherical indenter and a rough flat surface, was studied and calculated with the aid of an image processing technique. Four metals with different work-hardening indices were used as surface specimens, and by capturing images at a resolution of 0.006565 mm/pixel a good estimate of the normal approach may be obtained. The compression tests were done in the strength of materials laboratory of the mechanical engineering department, and a Monsanto tensometer was used to conduct the indentation tests. A light-section measuring microscope (BK 70x50) was used to calculate the surface parameters of the texture profile, such as the standard deviation of asperity peak heights


Publication Date

Wed Feb 01 2023

Journal Name

Baghdad Science Journal

Breast Cancer MRI Classification Based on Fractional Entropy Image Enhancement and Deep Feature Extraction

Breast MRI scans

Classification

CNN

Deep features

LSTM

علي

Asaad F.

Hamid A.

Rabha W.


Computer-aided disease diagnosis has been extensively studied and applied in diagnosing and monitoring several chronic diseases. Early detection and risk assessment of breast diseases based on clinical data help doctors make an early diagnosis and monitor disease progression. The purpose of this study is to exploit the Convolutional Neural Network (CNN) to classify breast MRI scans as pathological or healthy. The study presents a fully automated and efficient deep feature extraction algorithm that exploits the spatial information obtained from both T2W-TSE and STIR MRI sequences to discriminate between pathological and healthy breast MRI scans. The breast MRI scans are preprocessed prior to the feature


Publication Date

Sun Jan 01 2023

Journal Name

Journal Of The Mechanical Behavior Of Materials

Investigation of the performance of integrated intelligent models to predict the roughness of Ti6Al4V end-milled surface with uncoated cutting tool

Salah

Jaharah A.

Che Hassan Che

Adnan Naji Jameel

M. N.

Alessandro

Samaher M.

Oday I.

Mohd Shukor


Titanium alloys are broadly used in the medical and aerospace sectors. However, they are categorized as hard-to-machine alloys owing to their higher chemical reactivity and lower thermal conductivity. This research aims to study the impact of the dry end-milling process with an uncoated tool on the surface roughness of Ti6Al4V alloy. It also seeks to develop a new hybrid neural model based on training a back propagation neural network (BPNN) with swarm optimization-gravitation search hybrid algorithms (PSO-GS


Publication Date

Tue Aug 01 2023

Journal Name

Innovative Food Science & Emerging Technologies

Non-thermal pasteurization of milk by an innovative energy-saving moderate electrical field equipped with elongated electrodes and process optimization

Ali Wali M.

Azhar J.

Asaad R.

Mohsen


Publication Date

Sun Jan 01 2023

Journal Name

Desalination And Water Treatment

Optimization of chemical oxygen demand removal from petroleum refinery wastewater by electrocoagulation using tubular electrochemical reactor with a novel design

Ghazi Faisal

Thamer J.

Ali H.


Publication Date

Mon Jun 01 2009

Journal Name

Al-khwarizmi Engineering Journal

Image Zooming Using Inverse Slantlet Transform

Ahlam


Digital images are widely used in computer applications. This paper introduces a method of image zooming based on the inverse slantlet transform and image scaling. The slantlet transform (SLT) is based on the principle of designing different filters for different scales.

First, SLT is applied to the color image. The idea is to transform the color image into the slantlet domain, where the large coefficients mainly represent the signal and the smaller ones represent the noise. These coefficients are then suitably modified by scaling up the image with box and Bartlett filters, so that the image is scaled up by 2×2, and the inverse slantlet transform is applied to the modified coefficients to obtain the reconstructed image.
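The two scaling filters mentioned above can be sketched as follows. This is an illustration under assumptions, not the paper's method: it works directly on pixel values rather than on slantlet coefficients, the function names are hypothetical, and the last sample of a row is handled by edge replication, which the abstract does not specify.

```python
def upscale_box(image):
    """2x upscaling with a box filter: every pixel is replicated into a
    2x2 block (nearest-neighbour zooming)."""
    out = []
    for row in image:
        wide = [pixel for pixel in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out


def upscale_bartlett_row(row):
    """2x upscaling of one row with a Bartlett (triangle) filter: original
    samples are kept and each inserted sample is the mean of its two
    neighbours. The final inserted sample replicates the edge value."""
    out = []
    for i, pixel in enumerate(row):
        neighbour = row[i + 1] if i + 1 < len(row) else pixel
        out.append(pixel)
        out.append((pixel + neighbour) / 2)
    return out


# Usage on a tiny grayscale patch and a single row.
zoomed_box = upscale_box([[1, 2], [3, 4]])
zoomed_row = upscale_bartlett_row([0, 2, 4])
```

Applying `upscale_bartlett_row` to every row and then to every column of the result gives the separable two-dimensional version of the triangle filter.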


Publication Date

Sun Jun 01 2014

Journal Name

Baghdad Science Journal

Image Steganography by Using Multiwavelet Transform

Discrete Multiwavelet Transform (DMWT)

image hiding

SPIHT algorithm

Iman M.G.


Steganography is the art of secret communication. Its purpose is to hide the presence of information, using, for example, images as covers. The frequency domain is well suited for embedding in images, since hiding data in frequency-domain coefficients is robust to many attacks. This paper proposes hiding a secret image whose size is a quarter of that of the cover image. The Set Partitioning in Hierarchical Trees (SPIHT) codec is used to code the secret image to achieve security. The proposed method applies the Discrete Multiwavelet Transform (DMWT) to the cover image. The coded bit stream of the secret image is embedded in the high-frequency subbands of the transformed cover image. Two scaling factors in the frequency domain control the quality of the stego


Publication Date

Wed Jan 30 2019

Journal Name

Journal Of The College Of Education For Women

Image Hiding Using Discrete Cosine Transform

Iman

Farah Jasim


Steganography is a means of hiding information within a more obvious form of communication. It uses host data to hide a piece of information in such a way that it is imperceptible to a human observer. The major goals of effective steganography are high embedding capacity, imperceptibility, and robustness. This paper introduces a scheme for hiding secret images that could be as much as 25% of the host image data. The proposed algorithm applies the orthogonal discrete cosine transform to the host image. A scaling factor (a) in the frequency domain controls the quality of the stego images. Experimental results of secret image recovery after applying JPEG coding to the stego images are included.
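The role of a frequency-domain scaling factor can be illustrated with a small sketch. It is not the paper's algorithm: it assumes a one-dimensional signal instead of an image, additive embedding into the highest-frequency coefficients, and extraction that has access to the host, and all function names are hypothetical. Only the use of an orthogonal DCT and a scaling factor `a` comes from the abstract.

```python
import math


def dct(x):
    """Orthonormal DCT-II of a list of floats."""
    n = len(x)
    out = []
    for k in range(n):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        out.append(scale * sum(x[i] * math.cos(math.pi * (i + 0.5) * k / n)
                               for i in range(n)))
    return out


def idct(c):
    """Inverse of the orthonormal DCT-II above."""
    n = len(c)
    out = []
    for i in range(n):
        total = 0.0
        for k in range(n):
            scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
            total += scale * c[k] * math.cos(math.pi * (i + 0.5) * k / n)
        out.append(total)
    return out


def embed(host, secret, a):
    """Add a * secret to the highest-frequency DCT coefficients of host."""
    coeffs = dct(host)
    start = len(coeffs) - len(secret)
    for j, value in enumerate(secret):
        coeffs[start + j] += a * value
    return idct(coeffs)


def extract(stego, host, secret_len, a):
    """Recover the secret from the coefficient difference (needs the host)."""
    start = len(host) - secret_len
    stego_c, host_c = dct(stego), dct(host)
    return [(s - h) / a for s, h in zip(stego_c[start:], host_c[start:])]


def mse(x, y):
    """Mean squared error between two equal-length signals."""
    return sum((p - q) ** 2 for p, q in zip(x, y)) / len(x)


# Usage: a smaller a distorts the host less but leaves a weaker embedded signal.
host = [52.0, 55.0, 61.0, 66.0, 70.0, 61.0, 64.0, 73.0]
secret = [1.0, -2.0]
stego = embed(host, secret, a=0.1)
recovered = extract(stego, host, len(secret), a=0.1)
```

Because the transform is orthonormal, the distortion of the stego signal in this sketch grows with the square of `a`, which is one way to read the trade-off between stego quality and recoverability after lossy coding.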


Publication Date

Mon Jan 01 2007

Journal Name

2007 Ieee International Conference On Signal Processing And Communications

Fast Multi-level Image Vector Quantization

George L.A.


Publication Date

Wed Feb 19 2020

Journal Name

International Journal Of Innovation, Creativity And Change

Secure Image Steganography Through Multilevel Security

Security system

Steganography

Image carries

Stego-image

BIM

Mohammed


Data concealment has emerged as an area of deep and wide research interest. It endeavours to conceal data covertly, avoiding detection by embedding the secret data into cover images that appear inconspicuous. These covers may be images or videos used to conceal the messages while still retaining visual quality. Over the past ten years, numerous studies have examined steganographic methods for images, emphasising payload and image quality. Nevertheless, a trade-off exists between these two indicators, and reaching a more favourable balance between them is a daunting task. Additionally, the current
