BotDetectorFW: an optimized botnet detection framework based on five features-distance measures supported by comparisons of four machine learning classifiers using CICIDS2017 dataset

Jabbar A.F. Jabbar; imad j. mohammed mohammed

doi:10.11591/ijeecs.v21.i1.pp377-390

Details

Publication Date

Fri Jan 01 2021

Journal Name

Indonesian Journal Of Electrical Engineering And Computer Science

Volume

21

DOI

10.11591/ijeecs.v21.i1.pp377-390

Choose Citation Style

Statistics

View publication

18

View original publication

1

Statistics

(8)

(3)

BotDetectorFW: an optimized botnet detection framework based on five features-distance measures supported by comparisons of four machine learning classifiers using CICIDS2017 dataset

Jabbar A.F. Jabbar

imad j. mohammed mohammed

...Show More Authors

<p><span>A Botnet is one of many attacks that can execute malicious tasks and develop continuously. Therefore, current research introduces a comparison framework, called BotDetectorFW, with classification and complexity improvements for the detection of Botnet attack using CICIDS2017 dataset. It is a free online dataset consist of several attacks with high-dimensions features. The process of feature selection is a significant step to obtain the least features by eliminating irrelated features and consequently reduces the detection time. This process implemented inside BotDetectorFW using two steps; data clustering and five distance measure formulas (cosine, dice, driver & kroeber, overlap, and pearson correlation) using C#, followed by selecting the best N features used as input into four classifier algorithms evaluated using machine learning (WEKA); multilayerperceptron, JRip, IBK, and random forest. In BotDetectorFW, the thoughtful and diligent cleaning of the dataset within the preprocessing stage beside the normalization, binary clustering of its features, followed by the adapting of feature selection based on suitable feature distance techniques, and finalized by testing of selected classification algorithms. All together contributed in satisfying the high-performance metrics using fewer features number (8 features as a minimum) compared to and outperforms other methods found in the literature that adopted (10 features or higher) using the same dataset. Furthermore, the results and performance evaluation of BotDetectorFM shows a competitive impact in terms of classification accuracy (ACC), precision (Pr), recall (Rc), and f-measure (F1) metrics.</span></p>

View Publication

Publication Date

Sat Jul 31 2021

Journal Name

Iraqi Journal Of Science

A Decision Tree-Aware Genetic Algorithm for Botnet Detection

Thurayaa B.

Sarab M.

Bara'a A.

...Show More Authors

In this paper, the botnet detection problem is defined as a feature selection problem and the genetic algorithm (GA) is used to search for the best significant combination of features from the entire search space of set of features. Furthermore, the Decision Tree (DT) classifier is used as an objective function to direct the ability of the proposed GA to locate the combination of features that can correctly classify the activities into normal traffics and botnet attacks. Two datasets namely the UNSW-NB15 and the Canadian Institute for Cybersecurity Intrusion Detection System 2017 (CICIDS2017), are used as evaluation datasets. The results reveal that the proposed DT-aware GA can effectively find the relevant features from

(8)

(2)

Publication Date

Tue Dec 28 2021

Journal Name

2021 2nd Information Technology To Enhance E-learning And Other Application (it-ela)

Pedestrian and Objects Detection by Using Learning Complexity-Aware Cascades

Mohammed F. Alrifaie

...Show More Authors

View Publication Preview PDF

(8)

(2)

Publication Date

Wed Mar 10 2021

Journal Name

Baghdad Science Journal

Detecting Textual Propaganda Using Machine Learning Techniques

Social Networks

Disinformation

Propaganda

Term Frequency

Bag of Words.

Akib Mohi Ud Din

Qamar Rayees

Syed Tanzeel

...Show More Authors

Social Networking has dominated the whole world by providing a platform of information dissemination. Usually people share information without knowing its truthfulness. Nowadays Social Networks are used for gaining influence in many fields like in elections, advertisements etc. It is not surprising that social media has become a weapon for manipulating sentiments by spreading disinformation. Propaganda is one of the systematic and deliberate attempts used for influencing people for the political, religious gains. In this research paper, efforts were made to classify Propagandist text from Non-Propagandist text using supervised machine learning algorithms. Data was collected from the news sources from July 2018-August 2018. After annota

View Publication Preview PDF

(27)

(15)

Publication Date

Wed Jan 01 2025

Journal Name

Journal Of Engineering And Sustainable Development

Improving Performance Classification in Wireless Body Area Sensor Networks Based on Machine Learning Techniques

Data analytics

Learning Vector Quantization

Machine Learning

Support Vector Machine

Wireless Body Area Network

sabreen

Mohammed Ali

...Show More Authors

Wireless Body Area Sensor Networks (WBASNs) have garnered significant attention due to the implementation of self-automaton and modern technologies. Within the healthcare WBASN, certain sensed data hold greater significance than others in light of their critical aspect. Such vital data must be given within a specified time frame. Data loss and delay could not be tolerated in such types of systems. Intelligent algorithms are distinguished by their superior ability to interact with various data systems. Machine learning methods can analyze the gathered data and uncover previously unknown patterns and information. These approaches can also diagnose and notify critical conditions in patients under monitoring. This study implements two s

View Publication

(3)

(2)

Publication Date

Tue Jul 09 2024

Journal Name

Diagnostics

A Novel Hybrid Machine Learning-Based System Using Deep Learning Techniques and Meta-Heuristic Algorithms for Various Medical Datatypes Classification

deep learning

autoencoder

classification

medical dataset

COVID-19

brain tumor

meta-heuristic algorithm

Yezi Ali

Mehmet Serdar

Alok

...Show More Authors

Medicine is one of the fields where the advancement of computer science is making significant progress. Some diseases require an immediate diagnosis in order to improve patient outcomes. The usage of computers in medicine improves precision and accelerates data processing and diagnosis. In order to categorize biological images, hybrid machine learning, a combination of various deep learning approaches, was utilized, and a meta-heuristic algorithm was provided in this research. In addition, two different medical datasets were introduced, one covering the magnetic resonance imaging (MRI) of brain tumors and the other dealing with chest X-rays (CXRs) of COVID-19. These datasets were introduced to the combination network that contained deep lea

View Publication

(8)

(10)

Publication Date

Mon Dec 20 2021

Journal Name

Baghdad Science Journal

Recurrent Stroke Prediction using Machine Learning Algorithms with Clinical Public Datasets: An Empirical Performance Evaluation

Artificial Neural Network

Bayesian Rule List

Machine Learning

Recurrent Stroke Prediction

Support Vector Machine

Fadratul Hafinaz

Mohd Adib

...Show More Authors

Recurrent strokes can be devastating, often resulting in severe disability or death. However, nearly 90% of the causes of recurrent stroke are modifiable, which means recurrent strokes can be averted by controlling risk factors, which are mainly behavioral and metabolic in nature. Thus, it shows that from the previous works that recurrent stroke prediction model could help in minimizing the possibility of getting recurrent stroke. Previous works have shown promising results in predicting first-time stroke cases with machine learning approaches. However, there are limited works on recurrent stroke prediction using machine learning methods. Hence, this work is proposed to perform an empirical analysis and to investigate machine learning al

View Publication Preview PDF

(14)

(8)

Publication Date

Sun Jan 30 2022

Journal Name

Iraqi Journal Of Science

A Survey on Arabic Text Classification Using Deep and Machine Learning Algorithms

Farah A.

Nada A.Z.

...Show More Authors

Text categorization refers to the process of grouping text or documents into classes or categories according to their content. Text categorization process consists of three phases which are: preprocessing, feature extraction and classification. In comparison to the English language, just few studies have been done to categorize and classify the Arabic language. For a variety of applications, such as text classification and clustering, Arabic text representation is a difficult task because Arabic language is noted for its richness, diversity, and complicated morphology. This paper presents a comprehensive analysis and a comparison for researchers in the last five years based on the dataset, year, algorithms and the accuracy th

(18)

(8)

Publication Date

Sat Sep 30 2023

Journal Name

نسق

Problems and Difficulties Faced by Iraqi University Students in Employing Distance Learning

Noor

...Show More Authors

Publication Date

Sat Mar 08 2025

Journal Name

Fusion: Practice And Applications

Fast Numeric Sign Detection Using Adaptive Thresholding and Geometry of Optimized Fingers

ASL

Global thresholding

Chain coding Edge detection

Elbow point extraction

Gaps\blots removal

Mela G.

Loay E.

...Show More Authors

A strong sign language recognition system can break down the barriers that separate hearing and speaking members of society from speechless members. A novel fast recognition system with low computational cost for digital American Sign Language (ASL) is introduced in this research. Different image processing techniques are used to optimize and extract the shape of the hand fingers in each sign. The feature extraction stage includes a determination of the optimal threshold based on statistical bases and then recognizing the gap area in the zero sign and calculating the heights of each finger in the other digits. The classification stage depends on the gap area in the zero signs and the number of opened fingers in the other signs as well as

Publication Date

Wed Sep 22 2021

Journal Name

Samarra Journal Of Pure And Applied Science

Toward Constructing a Balanced Intrusion Detection Dataset

Amer Abulmajeed Abdulrahman

Mahmood Khalel

...Show More Authors

Several Intrusion Detection Systems (IDS) have been proposed in the current decade. Most datasets which associate with intrusion detection dataset suffer from an imbalance class problem. This problem limits the performance of classifier for minority classes. This paper has presented a novel class imbalance processing technology for large scale multiclass dataset, referred to as BMCD. Our algorithm is based on adapting the Synthetic Minority Over-Sampling Technique (SMOTE) with multiclass dataset to improve the detection rate of minority classes while ensuring efficiency. In this work we have been combined five individual CICIDS2017 dataset to create one multiclass dataset which contains several types of attacks. To prove the eff

View Publication

(11)

1 2 ... 4 5 6 7 ... 2765 2766