Improving Pre-trained CNN-LSTM Models for Image Captioning with Hyper-Parameter Optimization

Nuha M. Khassaf; Nada Hussein M. Ali

doi:10.48084/etasr.8455

Details

Publication Date

Wed Oct 09 2024

Journal Name

Engineering, Technology & Applied Science Research

Volume

14

Issue Number

5

CNN pre-trained models

LSTM

activation function

hyper-parameters

overfitting

Image captioning, the automatic generation of text that describes an image’s visual content, has become feasible thanks to advances in object recognition and image classification. Deep learning has received much interest from the scientific community and can be very useful in real-world applications. The proposed image captioning approach combines pre-trained Convolutional Neural Network (CNN) models with Long Short-Term Memory (LSTM) to generate image captions. The process includes two stages. The first stage trains the CNN-LSTM models using baseline hyper-parameters, and the second stage retrains them after optimizing and adjusting the hyper-parameters of the first stage. Improvements include the use of a new activation function, regular parameter tuning, and an improved learning rate in the later stages of training. The experimental results on the Flickr8k dataset showed a noticeable improvement in the second stage, with a clear increase in the evaluation metrics BLEU-1 to BLEU-4, METEOR, and ROUGE-L. This increase confirmed the effectiveness of the changes and highlighted the importance of hyper-parameter tuning in improving the performance of CNN-LSTM models in image captioning tasks.
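
The BLEU scores named in the abstract can be illustrated with a minimal sketch. It assumes the standard sentence-level definition (clipped n-gram precision, uniform weights, brevity penalty, no smoothing); the abstract does not say which implementation the authors used, and the function names `ngrams` and `bleu` are hypothetical.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count all n-grams of length n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu(candidate, references, max_n=4):
    """Sentence-level BLEU-max_n with uniform weights (assumed standard form)."""
    log_precisions = []
    for n in range(1, max_n + 1):
        cand = ngrams(candidate, n)
        total = sum(cand.values())
        if total == 0:
            return 0.0
        # Clip each candidate n-gram count by its maximum count in any reference.
        max_ref = Counter()
        for ref in references:
            for gram, count in ngrams(ref, n).items():
                max_ref[gram] = max(max_ref[gram], count)
        clipped = sum(min(count, max_ref[gram]) for gram, count in cand.items())
        if clipped == 0:
            return 0.0  # no smoothing: one empty precision zeroes the score
        log_precisions.append(math.log(clipped / total))
    # Brevity penalty against the reference whose length is closest to the candidate.
    ref_len = min((abs(len(r) - len(candidate)), len(r)) for r in references)[1]
    bp = 1.0 if len(candidate) > ref_len else math.exp(1 - ref_len / len(candidate))
    return bp * math.exp(sum(log_precisions) / max_n)
```

Calling `bleu(caption.split(), [ref.split() for ref in refs], max_n=1)` gives BLEU-1, and `max_n=4` gives BLEU-4; a generated caption identical to a reference scores 1.0.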

Publication Date

Sat Nov 30 2024

Journal Name

Iraqi Journal Of Science

Admissible Classes of Seven-Parameter Mittag-Leffler Operator with Third-Order Differential Subordination Properties

Maryam K.

Abdulrahman H.

The main purpose of this paper is to characterize new admissible classes of linear operators in terms of the seven-parameter Mittag-Leffler function and to discuss sufficient conditions for obtaining certain third-order differential subordination and superordination results. In addition, some related sandwich theorems involving these classes are obtained.

Publication Date

Sun Mar 07 2010

Journal Name

Baghdad Science Journal

Study the Characteristic of the Coupling Parameter (Γ) in Dusty Plasma by Computer modeling

Coulomb coupling parameter

dust grain

dust charge

structure parameter

phase transition

screening length

radius grain

potential energy

Hamid H.

Bayan G.

Computer modeling has been used to investigate the Coulomb coupling parameter Γ. The effects of the structure parameter K, grain charge Z, plasma density N, and dust grain temperature Td on the Coulomb coupling parameter were studied. Γ was found to increase with increasing Z and N and to decrease with increasing K and T. The critical value of Γ at which the plasma undergoes a phase transition from the liquid to the solid state was also studied.
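
The trends reported above can be reproduced with a minimal sketch. It assumes the commonly used screened (Yukawa) form Γ = (Z e)² / (4π ε0 a kB Td) · exp(-K), with the mean inter-grain distance a = (3 / (4π N))^(1/3). The abstract does not state the exact model used in the paper, so both the formula and the function name `coupling_parameter` are assumptions made for illustration.

```python
import math

E_CHARGE = 1.602176634e-19   # elementary charge, C
EPS0 = 8.8541878128e-12      # vacuum permittivity, F/m
K_B = 1.380649e-23           # Boltzmann constant, J/K


def coupling_parameter(z, n, t_d, kappa=0.0):
    """Screened Coulomb coupling parameter (assumed Yukawa form).

    z     -- number of elementary charges on a grain (Z)
    n     -- grain number density in m^-3 (N)
    t_d   -- dust grain temperature in K (Td)
    kappa -- dimensionless structure (screening) parameter (K)
    """
    # Wigner-Seitz radius: mean distance between neighbouring grains.
    a = (3.0 / (4.0 * math.pi * n)) ** (1.0 / 3.0)
    # Ratio of Coulomb interaction energy to thermal energy.
    gamma = (z * E_CHARGE) ** 2 / (4.0 * math.pi * EPS0 * a * K_B * t_d)
    # Debye screening weakens the effective coupling.
    return gamma * math.exp(-kappa)
```

Under this assumed form, Γ grows as Z² and as N^(1/3), and falls as 1/Td and as exp(-K), which matches the qualitative behaviour described in the abstract.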

Publication Date

Tue Mar 01 2016

Journal Name

International Journal Of Computer Science And Mobile Computing

Content-Based Cartoon Image Retrieval

Ghadah

Publication Date

Sun Mar 02 2008

Journal Name

Baghdad Science Journal

Tamper Detection in Color Image

Ali Kadhim

In this work, a fragile watermarking scheme is presented. The scheme is applied to digital color images in the spatial domain. The image is divided into blocks, and each block has its own authentication mark embedded in it, so that it is possible to determine which parts of the image are authentic and which have been modified. Authentication is carried out without the need for the original image. The results show that the quality of the watermarked image remains very good and that the watermark survives some types of unintended modification, such as compression with common software like WinRAR and ZIP.
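
A block-wise fragile watermark of this kind can be sketched as follows. This is an illustration under stated assumptions, not the authors' scheme: it assumes the mark is a SHA-256 hash of each block's seven most significant bits, stored in the block's least significant bits, and it works on one color channel given as a list of rows. The block size and the names `embed` and `tampered_blocks` are hypothetical.

```python
import hashlib

BLOCK = 4  # assumed block side length in pixels


def _block_mark(pixels):
    """Derive one authentication bit per pixel from the pixels' 7 upper bits."""
    digest = hashlib.sha256(bytes(p & 0xFE for p in pixels)).digest()
    return [(digest[i // 8] >> (i % 8)) & 1 for i in range(len(pixels))]


def _blocks(height, width):
    """Yield the pixel coordinates of each full BLOCK x BLOCK tile."""
    for top in range(0, height - BLOCK + 1, BLOCK):
        for left in range(0, width - BLOCK + 1, BLOCK):
            yield [(top + r, left + c) for r in range(BLOCK) for c in range(BLOCK)]


def embed(channel):
    """Return a copy of the channel with each block's mark written into its LSBs."""
    out = [row[:] for row in channel]
    for coords in _blocks(len(out), len(out[0])):
        mark = _block_mark([out[r][c] for r, c in coords])
        for (r, c), bit in zip(coords, mark):
            out[r][c] = (out[r][c] & 0xFE) | bit
    return out


def tampered_blocks(channel):
    """List top-left corners of blocks whose LSBs no longer match their mark."""
    bad = []
    for coords in _blocks(len(channel), len(channel[0])):
        pixels = [channel[r][c] for r, c in coords]
        if [p & 1 for p in pixels] != _block_mark(pixels):
            bad.append(coords[0])
    return bad
```

Verification needs only the watermarked image, since the mark is recomputed from the block itself. Because only the least significant bit of each pixel changes, the visual distortion stays small, and lossless archiving leaves the mark intact.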

Publication Date

Mon Apr 15 2019

Journal Name

Proceedings Of The International Conference On Information And Communication Technology

Orthogonal polynomial embedded image kernel

Sadiq H.

Abd Rahman

Abir Jaafar

Basheera M.

Wissam A.

Publication Date

Wed Oct 04 2023

Journal Name

Exergy - New Technologies And Applications

High Synthetic Image Coding System

image coding

image compression

Discrete Cosine Transform (DCT)

Discrete Wavelet Transform (DWT)

entropy coding

quantization

Abdallah

Loay

Compressing an image and reconstructing it without degrading its original quality remains a challenge today. A coding system that considers both quality and compression rate is implemented in this work. The implemented system applies a high synthetic entropy coding scheme to store the compressed image at the smallest possible size without affecting its original quality. This coding scheme is applied with two transform-based techniques, one using the Discrete Cosine Transform and the other using the Discrete Wavelet Transform. The implemented system was tested on different standard color images, and the results obtained with different evaluation metrics are shown. A comparison was made with some previous rel

Publication Date

Sun Jan 01 2023

Journal Name

2nd International Conference On Mathematical Techniques And Applications: Icmta2021

Polynomial image compression: A review

Ghadah

Publication Date

Mon Sep 30 2019

Journal Name

College Of Islamic Sciences

Visual image in al-Farazdaq's poetry

Poet

Umayyad era

sight

photo.

د. انتهاء عباس

The research shows that the visual image plays an important role in al-Farazdaq's aesthetic perception. It gives him a sense of artistic and mental perception that raises astonishment and admiration through his ability to link visual elements through the suggestive image, carrying us to a new imagined vision full of visual images.

Publication Date

Fri May 29 2020

Journal Name

International Journal Of Psychosocial Rehabilitation

Image Fusion Techniques: A Review

Mohammed

Image fusion is used to gather important data from an array of input images and place it in a single output image, making it more meaningful and usable than any of the input images. Image fusion boosts the quality and applicability of data. The accuracy of the fused image depends on the application. It is widely used in smart robotics, audio camera fusion, photonics, system control and output, construction and inspection of electronic circuits, complex computer and software diagnostics, and smart assembly-line robots. This paper provides a literature review of different image fusion techniques in the spatial domain and frequency domain, such as averaging, min-max, block substitution, Intensity-Hue-Saturation(IH
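
The simplest spatial-domain rules named above, averaging and min-max, can be sketched as pixel-wise operations. This is a minimal illustration that assumes two registered grayscale images of equal size given as lists of rows; the function name `fuse` and its `rule` parameter are hypothetical.

```python
def fuse(img_a, img_b, rule="average"):
    """Fuse two same-sized grayscale images pixel by pixel.

    rule -- "average" (mean of the two pixels), "max" (select the brighter
            pixel), or "min" (select the darker pixel)
    """
    rules = {
        "average": lambda a, b: (a + b) / 2,
        "max": max,
        "min": min,
    }
    combine = rules[rule]
    same_shape = len(img_a) == len(img_b) and all(
        len(row_a) == len(row_b) for row_a, row_b in zip(img_a, img_b)
    )
    if not same_shape:
        raise ValueError("input images must have the same shape")
    return [
        [combine(a, b) for a, b in zip(row_a, row_b)]
        for row_a, row_b in zip(img_a, img_b)
    ]
```

Averaging suppresses noise but lowers contrast, while the max and min rules keep the stronger or weaker response from either input at each pixel.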

Publication Date

Tue Aug 31 2021

Journal Name

Inmateh Agricultural Engineering

DETERMINING THE EFFICIENCY OF A SMART SPRAYING ROBOT FOR CROP PROTECTION USING IMAGE PROCESSING TECHNOLOGY

machine learning

image processing

agricultural robot

forward speed

Mustafa Ahmed Jalal

Noor Ahmed

A system was used to detect injuries in plant leaves by combining machine learning and the principles of image processing. A small agricultural robot was implemented for fine spraying by identifying infected leaves using image processing technology at four different forward speeds (35, 46, 63 and 80 cm/s). The results revealed that increasing the speed of the agricultural robot led to a decrease in the amount of supplements sprayed and in the detection percentage of infected plants. They also revealed a decrease in the percentage of supplements sprayed by 46.89, 52.94, 63.07 and 76% at the different forward speeds compared to the traditional method.
