U-Net for genomic sequencing: A novel approach to DNA sequence classification

Raghad K Mohammed; Azmi Tawfeq Hussein Alrawi; Ali Jbaeer Dawood

doi:10.1016/j.aej.2024.03.066

Details

Publication Date

Sat Jun 01 2024

Journal Name

Alexandria Engineering Journal

Volume

96

DOI

10.1016/j.aej.2024.03.066

Choose Citation Style

Statistics

View publication

11

Statistics

(3)

U-Net for genomic sequencing: A novel approach to DNA sequence classification

DNA sequence classification U-net architecture Deep learning Genomics Sequence data

Raghad K Mohammed

Azmi Tawfeq Hussein Alrawi

Ali Jbaeer Dawood

...Show More Authors

The precise classification of DNA sequences is pivotal in genomics, holding significant implications for personalized medicine. The stakes are particularly high when classifying key genetic markers such as BRAC, related to breast cancer susceptibility; BRAF, associated with various malignancies; and KRAS, a recognized oncogene. Conventional machine learning techniques often necessitate intricate feature engineering and may not capture the full spectrum of sequence dependencies. To ameliorate these limitations, this study employs an adapted UNet architecture, originally designed for biomedical image segmentation, to classify DNA sequences.The attention mechanism was also tested LONG WITH u-Net architecture to precisely classify DNA sequences into BRAC, BRAF, and KRAS categories. Our comprehensive methodology includes rigorous data preprocessing, model training, and a multi-faceted evaluation approach. The adapted U-Net model exhibited exceptional performance, achieving an overall accuracy of 0.96. The model also achieved high precision and recall rates across the classes, with precision ranging from 0.93 to 1.00 and recall between 0.95 and 0.97 for the key markers BRAC, BRAF, and KRAS. The F1-score for these critical markers ranged from 0.95 to 0.98. These empirical results substantiate the architecture’s capability to capture local and global features in DNA sequences, affirming its applicability for critical, sequence-based bioinformatics challenges

View Publication Preview PDF

Quick Preview PDF

Publication Date

Tue Mar 03 2009

Journal Name

Journal Of Economics And Administrative Sciences

Comparison of repetitive estimation methodsSelf-data

جنان عباس

...Show More Authors

In this study, we review the ARIMA (p, d, q), the EWMA and the DLM (dynamic linear moodelling) procedures in brief in order to accomdate the ac(autocorrelation) structure of data .We consider the recursive estimation and prediction algorithms based on Bayes and KF (Kalman filtering) techniques for correlated observations.We investigate the effect on the MSE of these procedures and compare them using generated data.

View Publication Preview PDF

Publication Date

Sat Jun 29 2013

Journal Name

Journal Of Statistics Applications & Probability

Analyzing Skewed Data with the Epsilon Skew Gamma distribution

Skewed Data

Epsilon Skew

Gamma Distribution

Ebtisam

...Show More Authors

A new distribution, the Epsilon Skew Gamma (ESΓ ) distribution, which was first introduced by Abdulah [1], is used on a near Gamma data. We first redefine the ESΓ distribution, its properties, and characteristics, and then we estimate its parameters using the maximum likelihood and moment estimators. We finally use these estimators to fit the data with the ESΓ distribution

(5)

Publication Date

Mon Jun 01 2026

Journal Name

Iraoi Journal Of Statistical Sciences

حول تقليص تقدير المركبات الرئيسة مع التطبيق

Shrinkage Estimation

Data Reduction

Eigen Vectors.

Omar Abdulmohsin

عمر

...Show More Authors

This research deals with a shrinking method concerned with the principal components similar to that one which used in the multiple regression “Least Absolute Shrinkage and Selection: LASS”. The goal here is to make an uncorrelated linear combinations from only a subset of explanatory variables that may have a multicollinearity problem instead taking the whole number say, (K) of them. This shrinkage will force some coefficients to equal zero, after making some restriction on them by some "tuning parameter" say, (t) which balances the bias and variance amount from side, and doesn't exceed the acceptable percent explained variance of these components. This had been shown by MSE criterion in the regression case and the percent explained

View Publication Preview PDF

Publication Date

Mon Feb 18 2019

Journal Name

Iraqi Journal Of Physics

Data visualization and distinct features extraction of the comet Ison 2013

Interaction of Comet

Histogram

data visualization.

Salman. Z.

...Show More Authors

The distribution of the intensity of the comet Ison C/2013 is studied by taking its histogram. This distribution reveals four distinct regions that related to the background, tail, coma and nucleus. One dimensional temperature distribution fitting is achieved by using two mathematical equations that related to the coordinate of the center of the comet. The quiver plot of the gradient of the comet shows very clearly that arrows headed towards the maximum intensity of the comet.

View Publication Preview PDF

Publication Date

Tue Jan 03 2023

Journal Name

College Of Islamic Sciences

Ruling on selling big data (Authentical Fiqh Study): Ruling on selling big data (Authentical Fiqh Study)

Judgment

sale

data

huge

sayings

jurists.

حسين

...Show More Authors

Abstract:

Research Topic: Ruling on the sale of big data

Its objectives: a statement of what it is, importance, source and governance.

The methodology of the curriculum is inductive, comparative and critical

One of the most important results: it is not permissible to attack it and it is a valuable money, and it is permissible to sell big data as long as it does not contain data to users who are not satisfied with selling it

Recommendation: Follow-up of studies dealing with the provisions of the issue

Subject Terms

Judgment, Sale, Data, Mega, Sayings, Jurists

View Publication Preview PDF

Publication Date

Sat Jun 06 2020

Journal Name

Journal Of The College Of Education For Women

Obese Women and Choosing Ready-made Clothes: Difficulties and Choices

obesity

clothing

ready-made clothes

binary classification.

Shaimaa Khaleel

...Show More Authors

Thisstudy aims to determine the specifications of obese women accordingto the heightand type of obesity. It also aimstoidentify the significance of differences in choosing ready-made clothes for the research sample. Finally, the significance of differences in choosing ready-made clothes according to the variable of binaryclassification ofobesity is also identified.The study sample includes obese women: employees, non-employees and students with the age group (18-50) years.The weights and lengths of the sample have been taken to suit the group of obese women.Aquestionnaire in the form of an open question was distributed among (50) obese womenso as to extract the items of the questionnaire. After that, the questionnaire was distributed amo

View Publication Preview PDF

Publication Date

Fri Oct 01 2021

Journal Name

Journal Of Al-rafidain University College For Sciences ( Print Issn: 1681-6870 ,online Issn: 2790-2293 )

The Use of Logistic Regression Model in Estimating the Probability of Being Affected By Breast Cancer Based On the Levels of Interleukins and Cancer Marker CA15-3

Logistic Regression

K-Means

Classification

Breast Cancer

ALI

Sara

...Show More Authors

Breast cancer has got much attention in the recent years as it is a one of the complex diseases that can threaten people lives. It can be determined from the levels of secreted proteins in the blood. In this project, we developed a method of finding a threshold to classify the probability of being affected by it in a population based on the levels of the related proteins in relatively small case-control samples. We applied our method to simulated and real data. The results showed that the method we used was accurate in estimating the probability of being diseased in both simulation and real data. Moreover, we were able to calculate the sensitivity and specificity under the null hypothesis of our research question of being diseased o

View Publication

Publication Date

Tue Dec 15 2020

Journal Name

Aip Conference Proceedings

The water bodies in the Southern East of Iraq before and after 2018

Marshlands

classification

support vector machine

drought

mosaic

Zainab Assif Abdullah

Hameed M

...Show More Authors

This study is concerned with the recent changes that occurred in the last three years (2017-2019) in the marshes region in southern Iraq as a result of the changes in the global climate, the study included all the water bodies in the five governorates that are located in the southern regions of Iraq (Wasit, Maysan, Dhi-Qar, Qadisiyah and Basrah), which represent the marshes lands in Iraq. Scenes of the Landsat 8 satellite are used to create a mosaic to cover the five governorates within a time window with the slightest difference between the date of the scene capture, not to exceed 8 days. The results of calculating the changes in water areas were obtained using the classifier support vector machine, where high accuracy ratios were recorded

View Publication Preview PDF

(5)

(4)

Publication Date

Wed Mar 30 2022

Journal Name

Iraqi Journal Of Science

Monitoring Vegetation Area in Baghdad Using Normalized Difference Vegetation Index

(NDVI)

Classification methods

Image processing

Remote Sensing.

Abeer N.

Alaa S.

...Show More Authors

Vegetation monitoring is considered an important application in remote sensing task due to variation of vegetation types and their distribution. The vegetation concentration around the Earth is increase in 5% in 2000 according to NASA monitoring. This increase is due to the Indian vegetable programs. In this research, the vegetation monitoring in Baghdad city was done using Normalized Difference Vegetation Index (NDVI) for temporal Landsat satellite images (Landsat 5 TM& Landsat 8 OIL). These images had been used and utilize in different times during the period from 2000, 2010, 2015 & 2017. The outcomes of the study demonstrate that a change in the vegetation Cover (VC) in Baghdad city. (NDVI) generally shows a

View Publication Preview PDF

(15)

(9)

Publication Date

Mon Apr 01 2019

Journal Name

Arpn Journal Of Engineering And Applied Sciences

Assessment of vegetable cover in south Iraq by remote sensing methods

date palm

wheat

barley

image processing

classification.

Halla S.

Alaa S.

...Show More Authors

The vegetable cover plays an important role in the environment and Earth resource sciences. In south Iraq, the region is classified as arid or semiarid area due to the low precipitations and high temperature among the year. In this paper, the Landat-8 satellite imagery will be used to study and estimate the vegetable area in south Iraq. For this purpose many vegetation indices will be examined to estimate and extract the area of vegetation contain in and image. Also, the weathering parameters must be investigated to find the relationship between these parameters and the arability of vegetation cover crowing in the specific area. The remote sensing packages and Matlab written subroutines may be use to evaluate the results.

Preview PDF

1 2 ... 1305 1306 1307 1308 ... 1322 1323