Minion gated recurrent unit for continual learning

Abdullah M. Zyarah; Dhireesha Kudithipudi

doi:10.1016/j.neucom.2026.132847

Details

Publication Date

Wed Apr 01 2026

Journal Name

Neurocomputing

Volume

673

DOI

10.1016/j.neucom.2026.132847

Choose Citation Style

Statistics

View publication

1

Statistics

(1)

Minion gated recurrent unit for continual learning

Abdullah M. Zyarah

Dhireesha Kudithipudi

...Show More Authors

The increasing demand for continual learning in sequential data processing has led to progressively complex training methodologies and larger recurrent network architectures. Consequently, this has widened the knowledge gap between continual learning with recurrent neural networks (RNNs) and their ability to operate on devices with limited memory and compute. To address this challenge, we investigate the effectiveness of simplifying RNN architectures, particularly gated recurrent unit (GRU), and its impact on both single-task and multitask sequential learning. We propose a new variant of GRU, namely the minion recurrent unit (MiRU). MiRU replaces conventional gating mechanisms with scaling coefficients to regulate dynamic updates of hidden states and historical context, reducing computational costs and memory requirements. Despite its simplified architecture, MiRU maintains performance comparable to the standard GRU while achieving more than 1.92 speed-up and reducing parameter usage by 2.88, as demonstrated through evaluations on sequential image classification and natural language processing benchmarks. The impact of model simplification on its learning capacity is also investigated by performing continual learning tasks with a rehearsal-based strategy and global inhibition. We find that MiRU demonstrates stable performance in multitask learning even when using only rehearsal, unlike the standard GRU and its variants. These features position MiRU as a promising candidate for edge-device applications.

View Publication

Publication Date

Tue Nov 19 2024

Journal Name

Aip Conference Proceedings

CT scan and deep learning for COVID-19 detection

Saja A.

Alyaa

...Show More Authors

View Publication

Publication Date

Mon Dec 20 2021

Journal Name

Baghdad Science Journal

Generative Adversarial Network for Imitation Learning from Single Demonstration

Deep Learning

Few-shot Learning

Generative Adversarial Network

Imitation Learning

One-shot Learning

Chanh Minh

Phan Xuan

Eiji

...Show More Authors

Imitation learning is an effective method for training an autonomous agent to accomplish a task by imitating expert behaviors in their demonstrations. However, traditional imitation learning methods require a large number of expert demonstrations in order to learn a complex behavior. Such a disadvantage has limited the potential of imitation learning in complex tasks where the expert demonstrations are not sufficient. In order to address the problem, we propose a Generative Adversarial Network-based model which is designed to learn optimal policies using only a single demonstration. The proposed model is evaluated on two simulated tasks in comparison with other methods. The results show that our proposed model is capable of completing co

View Publication Preview PDF

Publication Date

Mon Jun 30 2025

Journal Name

Ingénierie Des Systèmes D Information

Comparative Analysis of Four Programming Languages for Machine Learning

Alaa

saadya

Firas

...Show More Authors

View Publication

(2)

Publication Date

Mon Mar 14 2022

Journal Name

Periodicals Of Engineering And Natural Sciences (pen)

Mathematical simulation of memristive for classification in machine learning

Ammar A

Nada A.Z.

Amenah D.

...Show More Authors

View Publication

Publication Date

Thu Sep 01 2022

Journal Name

Iraqi Journal Of Computers, Communications, Control And Systems Engineering

A Framework for Predicting Airfare Prices Using Machine Learning

Fadhil,

Abdullah,

Younis,

...Show More Authors

Many academics have concentrated on applying machine learning to retrieve information from databases to enable researchers to perform better. A difficult issue in prediction models is the selection of practical strategies that yield satisfactory forecast accuracy. Traditional software testing techniques have been extended to testing machine learning systems; however, they are insufficient for the latter because of the diversity of problems that machine learning systems create. Hence, the proposed methodologies were used to predict flight prices. A variety of artificial intelligence algorithms are used to attain the required, such as Bayesian modeling techniques such as Stochastic Gradient Descent (SGD), Adaptive boosting (ADA), Decision Tre

View Publication Preview PDF

Publication Date

Sat Jan 01 2022

Journal Name

Journal Of Cybersecurity And Information Management

Machine Learning-based Information Security Model for Botnet Detection

Fadhil H.M.

...Show More Authors

Botnet detection develops a challenging problem in numerous fields such as order, cybersecurity, law, finance, healthcare, and so on. The botnet signifies the group of co-operated Internet connected devices controlled by cyber criminals for starting co-ordinated attacks and applying various malicious events. While the botnet is seamlessly dynamic with developing counter-measures projected by both network and host-based detection techniques, the convention techniques are failed to attain sufficient safety to botnet threats. Thus, machine learning approaches are established for detecting and classifying botnets for cybersecurity. This article presents a novel dragonfly algorithm with multi-class support vector machines enabled botnet

View Publication

(12)

(7)

Publication Date

Fri Sep 30 2022

Journal Name

Iraqi Journal Of Computer, Communication, Control And System Engineering

A Framework for Predicting Airfare Prices Using Machine Learning

Fadhil H.M.

...Show More Authors

Many academics have concentrated on applying machine learning to retrieve information from databases to enable researchers to perform better. A difficult issue in prediction models is the selection of practical strategies that yield satisfactory forecast accuracy. Traditional software testing techniques have been extended to testing machine learning systems; however, they are insufficient for the latter because of the diversity of problems that machine learning systems create. Hence, the proposed methodologies were used to predict flight prices. A variety of artificial intelligence algorithms are used to attain the required, such as Bayesian modeling techniques such as Stochastic Gradient Descent (SGD), Adaptive boosting (ADA), Deci

View Publication

(20)

(7)

Publication Date

Mon Apr 07 2025

Journal Name

Al-nahrain Journal For Engineering Sciences

Design of Reverse Osmosis Water Treatment Unit Using Lanxess Lewaplus2

Khalid M. Mousa

Saad A. Ali

...Show More Authors

Basrah is the richest town and the economic capital of Iraq. It suffers from lack of drinking water. This project is a dream to supply drinking water to Basrah citizens within WHO standards. Water should pass sedimentation and filtration stages before interring reverse osmosis unit. The design is carried out using lewaplus2 software. Several parameters should be selected in the design step membrane type, number of stages, number per element in each stage, and the recovery percentage. An optimization is carried out using Minitab ver. 18 for the acceptable limit of TDS and minimum cost and it was found that the optimum conditions were 52% for first stage, the numbers of vessels are 20 for both the first and second stage. In addition,

View Publication

Publication Date

Thu Apr 18 2019

Journal Name

Al-kindy College Medical Journal

Prevalence of H pylori in obese attending Obesity therapy Unit

Obesity

helicobacter pylori

BMI

Mumtaz K

...Show More Authors

Background: Obesity is an increasing health problem in developed countries and has grown into a major global epidemic. Recent studies suggested colonization of the stomach by Hpylori might affect gastric expression of appetite- and satiety-related hormone and patients cured of H pylori infection gained weight. Obesity and Helicobacter pylori (H. pylori) are important because of the problems they lead and their frequency of occurrence.

Objectives: To find out the prevalence of H. pylori infection in obese.

Type of the study:A cross-sectional study

Methods: A total of 32 obese female admitted to the study. Body mass indices (BMI) of all subjects wer

View Publication Preview PDF

(2)

Publication Date

Wed Sep 30 2009

Journal Name

Iraqi Journal Of Chemical And Petroleum Engineering

Recovery of Catalyst from Tar Formed in Phenol Production Unit

Wadood

Sami

...Show More Authors

This work was conducted to study the recovery of catalyst and desirable components from tar formed in phenol production unit and more particularly relates to such a method whereby better recovery of copper salts, phenol, benzoic acid and benzoate salts from tar by aqueous acid solution was accomplished.
The effect of solvent type, solvent concentration (5, 10, 15, 20, 25 and 30 wt%), agitation speed (100, 200, 300 and 400 rpm), agitation time (5, 10, 15, 20 and 25 min), temperature (90, 100, 110, 120, 130 and 140 oC) , phase ratio (1/1, 2/1, 3/1, 4/1 and 5/1) and number of extraction (1, 2, 3, 4, and 5) were examined in order to increase the catalyst and desirable components extraction.
Four types of solvent were used; hydrochloric

View Publication Preview PDF

1 2 ... 4 5 6 7 ... 660 661