Improving Pre-trained CNN-LSTM Models for Image Captioning with Hyper-Parameter Optimization

Nuha M. Khassaf; Nada Hussein M. Ali

doi:10.48084/etasr.8455

Details

Publication Date

Wed Oct 09 2024

Journal Name

Engineering, Technology & Applied Science Research

Volume

14

Issue Number

5

DOI

10.48084/etasr.8455

Choose Citation Style

Statistics

View publication

10

Statistics

(7)

(5)

Improving Pre-trained CNN-LSTM Models for Image Captioning with Hyper-Parameter Optimization

CNN pre-trained models

LSTM

activation function

hyper-parameters

overfitting

Nuha M. Khassaf

Nada Hussein M. Ali

...Show More Authors

The issue of image captioning, which comprises automatic text generation to understand an image’s visual information, has become feasible with the developments in object recognition and image classification. Deep learning has received much interest from the scientific community and can be very useful in real-world applications. The proposed image captioning approach involves the use of Convolution Neural Network (CNN) pre-trained models combined with Long Short Term Memory (LSTM) to generate image captions. The process includes two stages. The first stage entails training the CNN-LSTM models using baseline hyper-parameters and the second stage encompasses training CNN-LSTM models by optimizing and adjusting the hyper-parameters of the previous stage. Improvements include the use of a new activation function, regular parameter tuning, and an improved learning rate in the later stages of training. The experimental results on the flickr8k dataset showed a noticeable and satisfactory improvement in the second stage, where a clear increment was achieved in the evaluation metrics Bleu1-4, Meteor, and Rouge-L. This increment confirmed the effectiveness of the alterations and highlighted the importance of hyper-parameter tuning in improving the performance of CNN-LSTM models in image caption tasks.

View Publication

Publication Date

Fri Jan 01 2016

Journal Name

Results In Physics

Optimization of dye extraction from Cordyline fruticosa via response surface methodology to produce a natural sensitizer for dye-sensitized solar cells

Mahmoud A.M.

Norasikin A.

Abu Bakar

Abd. Amir H.

Muneer M.

Kamaruzzaman

...Show More Authors

View Publication

(28)

(23)

Publication Date

Thu Nov 04 2021

Journal Name

Nat. Volatiles & Essent. Oils

Using The Aqueous Extract Of Allium Sativum In Improvement Of Some Physiological And Immunological Parameter In Albino Rats

Allium sativum

aqueous extract

Lymphocyte

Salah M. M.

Duha Zeki

Asmaa I.

Munqith

...Show More Authors

The current study was designated to investigate the effect ofAllium sativumon some physiological and immunological parameters in rats. thirty adult rats were divided into three groups (10 rat for each). G1: served as healthy control, G2 :rats were treated with 150 mg\kg of Allium sativum, G3: treated with 300 mg\kg of Allium sativum. All treated animals were givenorally for 30 days. The aspartate , ) ALT ( alanine transaminase on some parameters were investigated such as garlic effects of total and differential counts of white blood , ) LDH ( lactate dehydrogenase ), AST ( transaminase cells(WBC) like Lymphocyte, Monocyte, Neutrophil, Basophil, Eosinophil,as

Publication Date

Wed Jul 01 2020

Journal Name

Journal Of Engineering

Study the Effect of Catalyst -to- Oil Ratio Parameter (COR) on Catalytic Cracking of Heavy Vacuum Gas Oil

Catalytic Cracking Reaction

Heavy vacuum gas oil

Catalyst to oil ratio parameter

Saleem Mohammad

...Show More Authors

This work deals with the production of light fuel cuts of (gasoline, kerosene and gas oil) by catalytic cracking treatment of secondary product mater (heavy vacuum gas oil) which was produced from the vacuum distillation unit in any petroleum refinery. The objective of this research was to study the effect of the catalyst -to- oil ratio parameter on catalytic cracking process of heavy vacuum gas oil feed at constant temperature (450 °C). The first step of this treatment was, catalytic cracking of this material by constructed batch reactor occupied with auxiliary control devices, at selective range of the catalyst –to- oil ratio parameter ( 2, 2.5, 3 and 3.5) respectively. The conversion of heavy vacuum gas

View Publication Preview PDF

Publication Date

Tue Jun 24 2025

Journal Name

Food And Bioprocess Technology

Classification of Apple Slices Treated by Atmospheric Plasma Jet for Post-harvest Processes Using Image Processing and Convolutional Neural Networks

Apple slice

Convolutional neural network

Atmospheric plasma

Hardness

Mustafa A. J.

Łukasz

Ghaith H.

Zeki

Osman

Piotr

...Show More Authors

Abstract<p>Apple slice grading is useful in post-harvest operations for sorting, grading, packaging, labeling, processing, storage, transportation, and meeting market demand and consumer preferences. Proper grading of apple slices can help ensure the quality, safety, and marketability of the final products, contributing to the post-harvest operations of the overall success of the apple industry. The article aims to create a convolutional neural network (CNN) model to classify images of apple slices after immersing them in atmospheric plasma at two different pressures (1 and 5 atm) and two different immersion times (3 and again 6 min) once and in filtered water based on the hardness of the slices usin</p> ... Show More

View Publication

Publication Date

Tue May 26 2026

Journal Name

Journal Of Baghdad College Of Dentistry

Validity of 3D Reconstructed Computed Tomographic Image in Using Craniometrical Measurements of the Skull for Sex Differentiation (An Iraqi Study)

Noor M

Ahlam A

...Show More Authors

Background: The skull offers a high resistance of adverse environmental conditions over time, resulting in the greater stability of the dimorphic features as compared to other skeletal bony pieces. Sex determination of human skeletal considered an initial step in its identification. The present study is undertaken to evaluate the validity of 3D reconstructed computed tomographic images in sex differentiation by using craniometrical measurements at various parts of the skull. Materials and Method: 3D reconstructed computed tomographic scanning of 100 Iraqi subject, (50 males and 50 females) were analyzed with their age range from20-70 years old. Craniometrical linear measurements were located and marked on both side of the 3D skull images.

View Publication Preview PDF

Publication Date

Tue Dec 01 2015

Journal Name

The Journal Of The Acoustical Society Of America

Underdetermined reverberant acoustic source separation using weighted full-rank nonnegative tensor models

A.

W.

S.

...Show More Authors

In this paper, a fusion of K models of full-rank weighted nonnegative tensor factor two-dimensional deconvolution (K-wNTF2D) is proposed to separate the acoustic sources that have been mixed in an underdetermined reverberant environment. The model is adapted in an unsupervised manner under the hybrid framework of the generalized expectation maximization and multiplicative update algorithms. The derivation of the algorithm and the development of proposed full-rank K-wNTF2D will be shown. The algorithm also encodes a set of variable sparsity parameters derived from Gibbs distribution into the K-wNTF2D model. This optimizes each sub-model in K-wNTF2D with the required sparsity to model the time-varying variances of the sources in the s

View Publication

(7)

(3)

Publication Date

Fri Dec 01 2023

Journal Name

Al-khwarizmi Engineering Journal

PDF Comparison based on Various FSO Channel Models under Different Atmospheric Turbulence

Mahdi

Lwaa Faisal

Gaurav

...Show More Authors

Recently, wireless communication environments with high speeds and low complexity have become increasingly essential. Free-space optics (FSO) has emerged as a promising solution for providing direct connections between devices in such high-spectrum wireless setups. However, FSO communications are susceptible to weather-induced signal fluctuations, leading to fading and signal weakness at the receiver. To mitigate the effects of these challenges, several mathematical models have been proposed to describe the transition from weak to strong atmospheric turbulence, including Rayleigh, lognormal, Málaga, Nakagami-m, K-distribution, Weibull, Negative-Exponential, Inverse-Gaussian, G-G, and Fisher-Snedecor F distributions. This paper extensive

View Publication Preview PDF

(6)

(2)

Publication Date

Tue May 26 2026

Journal Name

Philosophy Journal

Philosophy of Civilization Read and critique and analysis of the selected models

أِ.د. علي عبود المحمداوي

...Show More Authors

View Publication

Publication Date

Fri Dec 31 2021

Journal Name

Political Sciences Journal

Role of the executive in federal experiences: a study of selected models

provinces

Executive Authority

Federal

Asst.Prof.Abdulaziz Elewi

...Show More Authors

Receipt date:06/23/2020 accepted date:7/15/2020 Publication date:12/31/2021

Creative Commons License This work is licensed under a Creative Commons Attribution 4.0 International License

The executive authority differs from one country to another, as it differs from a federal state to another according to the nature of the applied political systems, so this research focused on federal states according to their political systems, then going into the details of the executive authority and its role In the federal states by referring to the four federal experiments

View Publication Preview PDF

Publication Date

Fri Sep 01 2023

Journal Name

Journal Of Engineering

Dual Stages of Speech Enhancement Algorithm Based on Super Gaussian Speech Models

Speech Enhancement Algorithms (SEA)

Gaussian speech model

Laplacian speech model

Discrete Tchebichef Transform (DTT)

Discrete Tchebichef-Krawtchouk Transform (DTKT)

Humam Awad

Shams Moaied

Basheera M.

Sadiq H.

Abir Jaafar

...Show More Authors

Various speech enhancement Algorithms (SEA) have been developed in the last few decades. Each algorithm has its advantages and disadvantages because the speech signal is affected by environmental situations. Distortion of speech results in the loss of important features that make this signal challenging to understand. SEA aims to improve the intelligibility and quality of speech that different types of noise have degraded. In most applications, quality improvement is highly desirable as it can reduce listener fatigue, especially when the listener is exposed to high noise levels for extended periods (e.g., manufacturing). SEA reduces or suppresses the background noise to some degree, sometimes called noise suppression alg

View Publication Preview PDF

(6)

1 2 ... 70 71 72 73 ... 1065 1066