Classification of imbalanced data is an important problem. Many classification algorithms have been developed, such as back-propagation (BP) neural networks, decision trees, and Bayesian networks, and they have been used widely in many fields. These algorithms suffer from the problem of imbalanced data, in which some classes contain many more instances than others. Imbalanced data lead to poor performance and to bias toward one class at the expense of the others. In this paper, we propose three techniques based on over-sampling (O.S.) for processing an imbalanced dataset, redistributing it, and converting it into a balanced dataset. These techniques are the Improved Synthetic Minority Over-Sampling Technique (Improved SMOTE), Borderline-SMOTE + Imbalanced Ratio (IR), and Adaptive Synthetic Sampling (ADASYN) + IR. Each technique generates synthetic samples for the minority class to achieve balance between the minority and majority classes, and then calculates the IR between the minority and majority classes. Experimental results show that the Improved SMOTE algorithm outperforms the Borderline-SMOTE + IR and ADASYN + IR algorithms because it achieves a high degree of balance between the minority and majority classes.
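The over-sampling idea described above can be illustrated with a minimal sketch. This is plain SMOTE-style interpolation, not the Improved SMOTE, Borderline-SMOTE + IR, or ADASYN + IR variants of the paper, whose details are not given in the abstract. The function names, the neighbour count `k`, and the definition of IR as the majority count divided by the minority count are assumptions made for illustration.

```python
import numpy as np

def imbalance_ratio(y):
    """Assumed definition: largest class count divided by smallest class count."""
    counts = np.bincount(y)
    counts = counts[counts > 0]          # ignore labels that never occur
    return counts.max() / counts.min()

def smote_like_oversample(X_min, n_new, k=3, seed=0):
    """Generate n_new synthetic minority samples by interpolating between
    a minority point and one of its k nearest minority neighbours.
    Requires k < len(X_min). Hypothetical helper, plain SMOTE-style."""
    rng = np.random.default_rng(seed)
    # Pairwise Euclidean distances between minority samples
    d = np.linalg.norm(X_min[:, None, :] - X_min[None, :, :], axis=2)
    np.fill_diagonal(d, np.inf)          # a point is not its own neighbour
    neighbours = np.argsort(d, axis=1)[:, :k]
    base = rng.integers(0, len(X_min), size=n_new)           # seed points
    nb = neighbours[base, rng.integers(0, k, size=n_new)]    # chosen neighbours
    gap = rng.random((n_new, 1))         # interpolation factor in [0, 1)
    return X_min[base] + gap * (X_min[nb] - X_min[base])

# Toy data: 20 majority samples (label 0) and 5 minority samples (label 1)
rng = np.random.default_rng(1)
X_maj = rng.normal(0.0, 1.0, size=(20, 2))
X_min = rng.normal(3.0, 1.0, size=(5, 2))
y = np.array([0] * 20 + [1] * 5)

X_new = smote_like_oversample(X_min, n_new=15, k=3)
y_bal = np.concatenate([y, np.ones(15, dtype=int)])
print(imbalance_ratio(y), imbalance_ratio(y_bal))
```

Under the assumed definition, an IR of 1 means the classes are balanced, so the IR computed after over-sampling serves as a simple check on how well the redistribution worked.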
The phenomenon of poverty is a very substantial topic that determines the future of societies and governments and the way they deal with education, health, and the economy. Sometimes poverty takes multidimensional forms through education and health. The research aims to study multidimensional poverty in Iraq using penalized regression methods to analyze Big Data sets from demographic surveys collected by the Central Statistical Organization in Iraq. We choose a classical penalized regression method, Ridge Regression; moreover, we choose another penalized method, the Smooth Integration of Counting and Absolute Deviation (SICA), to analyze Big Data sets related to the different forms of poverty in Iraq. Euclidian Distanc
In this study, the Earth's surface at Razzaza Lake was studied over 25 years using remote sensing methods. Images from the Landsat 5 (TM) and Landsat 8 (OLI) satellites were used to study and determine the components of the land cover. The study covered the years 1995-2021 at intervals of 5 years, because the region is uninhabited and the change in land cover is therefore slow. The land cover was divided into three main classes and seven subclasses and classified using the maximum likelihood classifier, with training sets collected to represent the classes that make up the land cover. The changes detected in the land cover were studied by taking 1995 as the reference year. It was found that there was a significant reduction in the water mass
The usual methods of distance determination in astronomy (parallax, spectroscopic, and expansion methods) are seldom applicable to nebulae. In this work, the distances from the Earth to individual nebulae are calculated and discussed. The accuracy of the distances is tested using the Aladin Sky Atlas and by comparing the nebula properties derived from these distances with statistical distance determinations. The results showed that angular expansion may occur in a part of the nebula that is moving at a velocity different from the observed velocity. Also the results of the comparison of our spectroscopic distances with the trig
Reservoir quality assessment is important for detecting hydrocarbon-bearing zones and guiding future enhancement strategies. This study presents a detailed petrophysical evaluation of the Mishrif Formation in the Buzurgan Oilfield, which was selected for its strategic value: its significant remaining reserves make it an ideal candidate for advanced evaluation techniques. The study aims to determine shale content, porosity, permeability, water saturation, net-to-gross ratio, and lithology. Well log and core data were used together to establish accurate property estimates. Permeability prediction through conventional methods, such as core permeability-porosity correlations, was highly scattered due to the heterogenei
The basis of each individual's personality lies in the early years of his or her life. If the child's personality has been well organized and the child's motives have been fully expressed and effectively directed, the child will have a strong will, happy self-confidence, and a strong personality. If there is a failure in the early years, the individual will be unable to meet his or her responsibilities in life and may fall victim to many psychological disorders. The family is a learning process through which children acquire the customs, traditions, attitudes, and values prevailing in their social environment. (Pre-and-after) play and its relationship to parenting methods of (democratic-bullying-overprotection- and neglect), which wi
Background: The problem of the difficult gallbladder is not clearly defined and is associated with a real lack of therapeutic approaches that decrease morbidity. Moreover, the difficult gallbladder has been reported as a contributing risk factor for biliary injury because of the increased difficulty of surgical dissection within Calot's triangle. The aim of this study is to determine the surgical outcomes of open fundus-first cholecystectomy in lowering the rate of lethal intraoperative risks.
Subjects and Methods: Our prospective study was conducted from January 2019 to December 2022 at Ibn Sina Specialized Hospital, Khartoum, Sudan, on two hundred and fifty-three patients underw
The purpose of this paper is to identify the values of some physical and bio-kinematic variables during the performance of the jump spike serve skill, and to identify the effect of a proposed training program using intermittent training on the development of some physical and bio-kinematic variables and on the accuracy of the jump spike serve skill among the research sample. The experimental method was used, and the research was conducted on a deliberately chosen sample of (10) players of the Army Club who were at the advanced level in volleyball. It was concluded that the proposed training program using intermittent training has a positive effect on some of the physical and Bio- Kinematic variabl
ZrO2:MgO nanostructured thin films have been synthesized by a radio frequency magnetron plasma sputtering technique at different MgO ratios (0, 6, 8 and 10)% for use as a gas sensor for nitrogen dioxide (NO2). The samples were investigated by X-ray diffraction (XRD), atomic force microscopy (AFM), scanning electron microscopy (SEM), and energy-dispersive X-ray (EDX) analysis, and their sensing properties were also investigated. The average particle size of all prepared samples was found to be lower than 33.22 nm, and the structure was a monoclinic phase. The distribution of grain size was found to be lower than 36.3 nm and uninformed particles on the surface. Finally, the data of sensing properties have been discussed, where the
In this work, the effect of annealing temperature on the electrical properties of a p-Se/n-Si solar cell is studied, in which the p-Se layer is deposited on crystalline silicon by a DC planar magnetron sputtering technique. The chamber was pumped down to 2×10^-5 mbar before admitting the gas, which was Ar. The sputtering pressure was varied within the range of 4×10^-1 to 8×10^-2 mbar by adjusting the pumping speed through the opening of the throttle valve. The electrical properties studied include C-V and I-V measurements. From the C-V measurements, the built-in potential (Vbi) is calculated, while from the I-V measurements, the efficiency of the solar cell is calculated.
A simple, straightforward mathematical method has been developed to cluster grid nodes on a boundary segment of arbitrary geometry that can be fitted by a suitable polynomial. The method is carried out in two steps. In the first step, the length of the boundary segment is evaluated using the mean value theorem, and the grid nodes are then clustered as desired using suitable linear clustering functions. In the second step, once the coordinates of the cell nodes have been computed and the incremental distance between each pair of adjacent nodes has been evaluated, the original coordinate of each node is computed using the same fitted polynomial and the mean value theorem, applied in reverse.
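The two steps can be sketched roughly as follows, under several assumptions that are not taken from the paper: the boundary is a curve y = p(x), the arc length is evaluated with a trapezoidal rule rather than the authors' mean value theorem formulation, the clustering uses a geometric growth of node spacing (hypothetical parameter `ratio`) as a stand-in for their clustering functions, and the reverse step is done by interpolating the tabulated arc length. The function name and all parameters are illustrative.

```python
import numpy as np

def cluster_nodes_on_polynomial(coeffs, a, b, n_nodes, ratio=1.2, n_fine=2001):
    """Place n_nodes on the curve y = p(x), x in [a, b], with spacing along
    the curve growing by `ratio` from node to node (illustrative sketch)."""
    p = np.poly1d(coeffs)                # coefficients, highest power first
    dp = p.deriv()

    # Step 1a: tabulate arc length s(x) = integral of sqrt(1 + p'(x)^2) dx
    x_fine = np.linspace(a, b, n_fine)
    ds_dx = np.sqrt(1.0 + dp(x_fine) ** 2)
    increments = 0.5 * (ds_dx[1:] + ds_dx[:-1]) * np.diff(x_fine)
    s_fine = np.concatenate(([0.0], np.cumsum(increments)))
    length = s_fine[-1]

    # Step 1b: distribute nodes along the arc length, clustered near x = a
    steps = ratio ** np.arange(n_nodes - 1)
    s_nodes = np.concatenate(([0.0], np.cumsum(steps))) * (length / steps.sum())

    # Step 2: map each arc-length position back to (x, y) on the polynomial
    x_nodes = np.interp(s_nodes, s_fine, x_fine)
    return x_nodes, p(x_nodes), length

# Example: 6 nodes on the parabola y = x^2 between x = 0 and x = 1
x_nodes, y_nodes, length = cluster_nodes_on_polynomial([1.0, 0.0, 0.0], 0.0, 1.0, 6)
print(length)
print(np.column_stack([x_nodes, y_nodes]))
```

With `ratio` greater than 1 the nodes crowd toward the start of the segment; a value below 1 would crowd them toward the end instead.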
The method is utilized to predict