Analysis Evolution of Image Caption Techniques: Combining Conventional and Modern Methods for Improvement

Nuha M. Khassaf; Nada Hussein M. Ali

doi:10.18178/joig.13.4.406-418

Details

Publication Date

Thu Aug 07 2025

Journal Name

Journal Of Image And Graphics

Volume

13

Issue Number

4

DOI

10.18178/joig.13.4.406-418

Choose Citation Style

Statistics

View publication

35

Statistics

Analysis Evolution of Image Caption Techniques: Combining Conventional and Modern Methods for Improvement

Convolutional Neural Networks (CNN)

image caption

conventional methods

modern methods

hybrid approach

Nuha M. Khassaf

Nada Hussein M. Ali

...Show More Authors

This study explores the challenges in Artificial Intelligence (AI) systems in generating image captions, a task that requires effective integration of computer vision and natural language processing techniques. A comparative analysis between traditional approaches such as retrieval- based methods and linguistic templates) and modern approaches based on deep learning such as encoder-decoder models, attention mechanisms, and transformers). Theoretical results show that modern models perform better for the accuracy and the ability to generate more complex descriptions, while traditional methods outperform speed and simplicity. The paper proposes a hybrid framework that combines the advantages of both approaches, where conventional methods produce an initial description, which is then contextually, and refined using modern models. Preliminary estimates indicate that this approach could reduce the initial computational cost by up to 20% compared to relying entirely on deep models while maintaining high accuracy. The study recommends further research to develop effective coordination mechanisms between traditional and modern methods and to move to the experimental validation phase of the hybrid model in preparation for its application in environments that require a balance between speed and accuracy, such as real-time computer vision applications.

View Publication Preview PDF

Quick Preview PDF

Publication Date

Sat Jan 01 2011

Journal Name

Journal Of Engineering

Simulation Model for the Assessment of Direct and Indirect Georeferencing Techniques in Analytical Photogrammetry

Bashar

Hussien

Luma K.

...Show More Authors

B Saleem, H Alwan, L Khalid, Journal of Engineering, 2011 - Cited by 2

View Publication

Publication Date

Wed Jan 01 2025

Journal Name

Journal Of Central European Agriculture

Power requirements for corn silage harvesters and application of precision agricultural techniques: a review

agricultural practices

precision agriculture

artificial intelligence

energy requirements

Mustafa

Osman

Hasan

...Show More Authors

The energy requirements of corn silage harvesters and the application of precision agricultural techniques are essential for efficient and productive agricultural practices. The article aims to review previous studies on the energy requirements needed for different corn silage harvesting machines, and on the other hand, to present methods for measuring corn silage productivity directly in the field and monitoring it based on microcontrollers and artificial intelligence techniques. The process of making corn silage is done by cutting green fodder plants into small pieces, so special harvesters are used for this, called corn silage harvesters. The purpose of harvesting corn silage is to efficiently collect and store as many digestible nutrien

View Publication

Publication Date

Thu Dec 01 2011

Journal Name

Journal Of Engineering

SIMULATION MODEL FOR THE ASSESSMENT OF DIRECT AND INDIRECT GEOREFERENCING TECHNIQUES IN ANALYTICAL PHOTOGRAMMETRY

Bundle Adjustment

Direct Georeferencing (DG)

GPS/ INS

Indirect Georeferencing.

H.

B.

L.

...Show More Authors

This paper compares between the direct and indirect georeferencing techniques in Photogrammetry bases on a simulation model. A flight plan is designed which consists of three strips with nine overlapped images for each strip by a (Canon 500D) digital camera with a resolution of 15 Mega Pixels.

The triangulation computations are carried out by using (ERDAS LPS) software, and the direct measurements are taken directly on the simulated model to substitute using GPS/INS in real case. Two computational tests have been implemented to evaluate the positional accuracy for the whole model and the Root Mean Square Error (RMSE) relating to (30) check points show that th

View Publication Preview PDF

Publication Date

Sat Oct 01 2022

Journal Name

Journal Of Engineering

Evaluation of ANFIS and Regression Techniques in Estimating Soil Compression Index for Cohesive soils

ANFIS

Regression

Cohesive Soils

Compression Index

Yaseen Ahmed Hamaamin

Kamal Ahmed Rashed

Younis Mustafa Ali

Tavga

...Show More Authors

Generally, direct measurement of soil compression index (Cc) is expensive and time-consuming. To save time and effort, indirect methods to obtain Cc may be an inexpensive option. Usually, the indirect methods are based on a correlation between some easier measuring descriptive variables such as liquid limit, soil density, and natural water content. This study used the ANFIS and regression methods to obtain Cc indirectly. To achieve the aim of this investigation, 177 undisturbed samples were collected from the cohesive soil in Sulaymaniyah Governorate in Iraq. Results of this study indicated that ANFIS models over-performed the Regression method in estimating Cc with R²of 0.66 and 0.48 for both ANFIS and Regre

View Publication Preview PDF

Publication Date

Mon Sep 30 2024

Journal Name

Al-mustansiriyah Journal Of Science

A Transfer Learning Approach for Arabic Image Captions

Haneen serag

Narjis

Abdul Rahman A.

...Show More Authors

Publication Date

Mon Apr 17 2023

Journal Name

Wireless Communications And Mobile Computing

A Double Clustering Approach for Color Image Segmentation

Asma Khazaal Abdulsahib

Siti Sakira Kamaruddin

and Mustafa Musa Jabar

...Show More Authors

One of the significant stages in computer vision is image segmentation which is fundamental for different applications, for example, robot control and military target recognition, as well as image analysis of remote sensing applications. Studies have dealt with the process of improving the classification of all types of data, whether text or audio or images, one of the latest studies in which researchers have worked to build a simple, effective, and high-accuracy model capable of classifying emotions from speech data, while several studies dealt with improving textual grouping. In this study, we seek to improve the classification of image division using a novel approach depending on two methods used to segment the images. The first

View Publication

(3)

(2)

Publication Date

Sat Nov 02 2019

Journal Name

Advances In Intelligent Systems And Computing

Spin-Image Descriptors for Text-Independent Speaker Recognition

Suhaila N.

...Show More Authors

Building a system to identify individuals through their speech recording can find its application in diverse areas, such as telephone shopping, voice mail and security control. However, building such systems is a tricky task because of the vast range of differences in the human voice. Thus, selecting strong features becomes very crucial for the recognition system. Therefore, a speaker recognition system based on new spin-image descriptors (SISR) is proposed in this paper. In the proposed system, circular windows (spins) are extracted from the frequency domain of the spectrogram image of the sound, and then a run length matrix is built for each spin, to work as a base for feature extraction tasks. Five different descriptors are generated fro

View Publication

(7)

(2)

Publication Date

Mon Oct 30 2023

Journal Name

Iraqi Journal Of Science

Machine Learning Approach for Facial Image Detection System

Hind Moutaz

...Show More Authors

HM Al-Dabbas, RA Azeez, AE Ali, Iraqi Journal of Science, 2023

View Publication

(7)

Publication Date

Fri Jul 01 2016

Journal Name

International Journal Of Computer Science And Mobile Computing

. Interpolative Absolute Block Truncation Coding for Image Compression

Ghadah

...Show More Authors

Publication Date

Mon Jan 01 2024

Journal Name

Lecture Notes On Data Engineering And Communications Technologies

Utilizing Deep Learning Technique for Arabic Image Captioning

Haneen serag

...Show More Authors

View Publication

1 2 ... 26 27 28 29 ... 2865 2866