Wrapper and Hybrid Feature Selection Methods Using Metaheuristic Algorithms for English Text Classification: A Systematic Review

Osamah Mohammed Alyasiri; Yu-N Cheah; Ammar Kamal Abasi; Omar Mustafa Al-Janabi

doi:10.1109/ACCESS.2022.3165814

Details

Publication Date

Sat Jan 01 2022

Journal Name

Ieee Access

Volume

10

DOI

10.1109/ACCESS.2022.3165814

Choose Citation Style

Statistics

View publication

44

View original publication

2

Click abstract more

2

View pdf

5

Statistics

(56)

(47)

Wrapper and Hybrid Feature Selection Methods Using Metaheuristic Algorithms for English Text Classification: A Systematic Review

Metaheuristics

Feature extraction

Text categorization

Classification algorithms

Systematics

Search problems

Business

Osamah Mohammed Alyasiri

Yu-N Cheah

Ammar Kamal Abasi

Omar Mustafa Al-Janabi

...Show More Authors

Feature selection (FS) constitutes a series of processes used to decide which relevant features/attributes to include and which irrelevant features to exclude for predictive modeling. It is a crucial task that aids machine learning classifiers in reducing error rates, computation time, overfitting, and improving classification accuracy. It has demonstrated its efficacy in myriads of domains, ranging from its use for text classification (TC), text mining, and image recognition. While there are many traditional FS methods, recent research efforts have been devoted to applying metaheuristic algorithms as FS techniques for the TC task. However, there are few literature reviews concerning TC. Therefore, a comprehensive overview was systematically studied by exploring available studies of different metaheuristic algorithms used for FS to improve TC. This paper will contribute to the body of existing knowledge by answering four research questions (RQs): 1) What are the different approaches of FS that apply metaheuristic algorithms to improve TC? 2) Does applying metaheuristic algorithms for TC lead to better accuracy than the typical FS methods? 3) How effective are the modified, hybridized metaheuristic algorithms for text FS problems?, and 4) What are the gaps in the current studies and their future directions? These RQs led to a study of recent works on metaheuristic-based FS methods, their contributions, and limitations. Hence, a final list of thirty-seven (37) related articles was extracted and investigated to align with our RQs to generate new knowledge in the domain of study. Most of the conducted papers focused on addressing the TC in tandem with metaheuristic algorithms based on the wrapper and hybrid FS approaches. Future research should focus on using a hybrid-based FS approach as it intuitively handles complex optimization problems and potentiality provide new research opportunities in this rapidly developing field.

View Publication Preview PDF

Quick Preview PDF

Publication Date

Mon Jan 01 2024

Journal Name

Fusion: Practice And Applications

A Hybrid Meta-Heuristic Approach for Test Case Prioritization and Optimization

Fadhil H.M.

...Show More Authors

The application of the test case prioritization method is a key part of system testing intended to think it through and sort out the issues early in the development stage. Traditional prioritization techniques frequently fail to take into account the complexities of big-scale test suites, growing systems and time constraints, therefore cannot fully fix this problem. The proposed study here will deal with a meta-heuristic hybrid method that focuses on addressing the challenges of the modern time. The strategy utilizes genetic algorithms alongside a black hole as a means to create a smooth tradeoff between exploring numerous possibilities and exploiting the best one. The proposed hybrid algorithm of genetic black hole (HGBH) uses the

View Publication

(3)

(1)

Publication Date

Fri Dec 01 2023

Journal Name

Al-khwarizmi Engineering Journal

Development of an ANN Model for RGB Color Classification using the Dataset Extracted from a Fabricated Colorimeter

Colorimeter

RGB classifier

ANN

TensorFlow

ML.

Shahad A.

Furat I.

Ahmed

...Show More Authors

Codes of red, green, and blue data (RGB) extracted from a lab-fabricated colorimeter device were used to build a proposed classifier with the objective of classifying colors of objects based on defined categories of fundamental colors. Primary, secondary, and tertiary colors namely red, green, orange, yellow, pink, purple, blue, brown, grey, white, and black, were employed in machine learning (ML) by applying an artificial neural network (ANN) algorithm using Python. The classifier, which was based on the ANN algorithm, required a definition of the mentioned eleven colors in the form of RGB codes in order to acquire the capability of classification. The software's capacity to forecast the color of the code that belongs to an object under de

Preview PDF

Publication Date

Fri Dec 01 2023

Journal Name

Al-khwarizmi Engineering Journal

Development of an ANN Model for RGB Color Classification using the Dataset Extracted from a Fabricated Colorimeter

Shahad A.

Furat I.

Ahmed

...Show More Authors

Codes of red, green, and blue data (RGB) extracted from a lab-fabricated colorimeter device were used to build a proposed classifier with the objective of classifying colors of objects based on defined categories of fundamental colors. Primary, secondary, and tertiary colors namely red, green, orange, yellow, pink, purple, blue, brown, grey, white, and black, were employed in machine learning (ML) by applying an artificial neural network (ANN) algorithm using Python. The classifier, which was based on the ANN algorithm, required a definition of the mentioned eleven colors in the form of RGB codes in order to acquire the capability of classification. The software's capacity to forecast the color of the code that belongs to an ob

View Publication Preview PDF

Publication Date

Mon May 15 2017

Journal Name

Journal Of Theoretical And Applied Information Technology

Anomaly detection in text data that represented as a graph using dbscan algorithm

Anomaly Detection

Enhanced DBSCAN algorithm

Unsupervised anomaly detection and Concept Frame Graph (CFG)

Asma Khazaal Abdulsahib

...Show More Authors

Anomaly detection is still a difficult task. To address this problem, we propose to strengthen DBSCAN algorithm for the data by converting all data to the graph concept frame (CFG). As is well known that the work DBSCAN method used to compile the data set belong to the same species in a while it will be considered in the external behavior of the cluster as a noise or anomalies. It can detect anomalies by DBSCAN algorithm can detect abnormal points that are far from certain set threshold (extremism). However, the abnormalities are not those cases, abnormal and unusual or far from a specific group, There is a type of data that is do not happen repeatedly, but are considered abnormal for the group of known. The analysis showed DBSCAN using the

Preview PDF

(4)

Publication Date

Mon Oct 02 2023

Journal Name

Journal Of Engineering

Tools for Drought Identification and Assessment: A Review

Drought

Types

Description

Indicators

SPI

Hawraa

Thamer Ahmed

...Show More Authors

Drought is a natural phenomenon in many arid, semi-arid, or wet regions. This showed that no region worldwide is excluded from the occurrence of drought. Extreme droughts were caused by global weather warming and climate change. Therefore, it is essential to review the studies conducted on drought to use the recommendations made by the researchers on drought. The drought was classified into meteorological, agricultural, hydrological, and economic-social. In addition, researchers described the severity of the drought by using various indices which required different input data. The indices used by various researchers were the Joint Deficit Index (JDI), Effective Drought Index (EDI), Streamflow Drought Index (SDI), Sta

View Publication Preview PDF

(3)

Publication Date

Wed Sep 01 2021

Journal Name

International Journal Of Nonlinear Analysis And Application

Suggested methods for prediction using semiparametric regression function

Semi- parametric method

Neural Network models (NN)

regression

Ferritin level

COVID 19

multilayer perceptron (MLP).

Mohamed A.S.

...Show More Authors

Ferritin is a key organizer of protected deregulation, particularly below risky hyperferritinemia, by straight immune-suppressive and pro-inflammatory things. , We conclude that there is a significant association between levels of ferritin and the harshness of COVID-19. In this paper we introduce a semi- parametric method for prediction by making a combination between NN and regression models. So, two methodologies are adopted, Neural Network (NN) and regression model in design the model; the data were collected from مستشفى دار التمريض الخاص for period 11/7/2021- 23/7/2021, we have 100 person, With COVID 12 Female & 38 Male out of 50, while 26 Female & 24 Male non COVID out of 50. The input variables of the NN m

Preview PDF

Publication Date

Mon Jul 01 2013

Journal Name

2013 35th Annual International Conference Of The Ieee Engineering In Medicine And Biology Society (embc)

Protocol for site selection and movement assessment for the myoelectric control of a multi-functional upper-limb prosthesis

Ali H.

Javier

Guido

Nicholas

...Show More Authors

View Publication

(2)

(1)

Publication Date

Mon Jun 19 2023

Journal Name

Journal Of Engineering

Data Classification using Quantum Neural Network

Signal classification

artificial neural network

quantum computing

data analysis and fuzziness.

Ghassan H.

Zainab T.

Hassan Saadallah

...Show More Authors

In this paper, integrated quantum neural network (QNN), which is a class of feedforward

neural networks (FFNN’s), is performed through emerging quantum computing (QC) with artificial neural network(ANN) classifier. It is used in data classification technique, and here iris flower data is used as a classification signals. For this purpose independent component analysis (ICA) is used as a feature extraction technique after normalization of these signals, the architecture of (QNN’s) has inherently built in fuzzy, hidden units of these networks (QNN’s) to develop quantized representations of sample information provided by the training data set in various graded levels of certainty. Experimental results presented here show that

View Publication Preview PDF

Publication Date

Sun Sep 24 2023

Journal Name

Journal Of Al-qadisiyah For Computer Science And Mathematics

Human Recognition Using Ear Features: A Review

Maha A.

Kadhim M.

...Show More Authors

Over the past few years, ear biometrics has attracted a lot of attention. It is a trusted biometric for the identification and recognition of humans due to its consistent shape and rich texture variation. The ear presents an attractive solution since it is visible, ear images are easily captured, and the ear structure remains relatively stable over time. In this paper, a comprehensive review of prior research was conducted to establish the efficacy of utilizing ear features for individual identification through the employment of both manually-crafted features and deep-learning approaches. The objective of this model is to present the accuracy rate of person identification systems based on either manually-crafted features such as D

View Publication

Publication Date

Mon Oct 03 2022

Journal Name

International Journal Of Interactive Mobile Technologies (ijim)

A New Feature-Based Method for Similarity Measurement under the Linux Operating System

Almarsoomi F.A.

...Show More Authors

This paper presents a new algorithm in an important research field which is the semantic word similarity estimation. A new feature-based algorithm is proposed for measuring the word semantic similarity for the Arabic language. It is a highly systematic language where its words exhibit elegant and rigorous logic. The score of sematic similarity between two Arabic words is calculated as a function of their common and total taxonomical features. An Arabic knowledge source is employed for extracting the taxonomical features as a set of all concepts that subsumed the concepts containing the compared words. The previously developed Arabic word benchmark datasets are used for optimizing and evaluating the proposed algorithm. In this paper,

View Publication

1 2 ... 16 17 18 19 ... 2225 2226