The need for an efficient method to find the furthermost appropriate document corresponding to a particular search query has become crucial due to the exponential development in the number of papers that are now readily available to us on the web. The vector space model (VSM) a perfect model used in “information retrieval”, represents these words as a vector in space and gives them weights via a popular weighting method known as term frequency inverse document frequency (TF-IDF). In this research, work has been proposed to retrieve the most relevant document focused on representing documents and queries as vectors comprising average term term frequency inverse sentence frequency (TF-ISF) weights instead of representing them as vectors of term TF-IDF weight and two basic and effective similarity measures: Cosine and Jaccard were used. Using the MS MARCO dataset, this article analyzes and assesses the retrieval effectiveness of the TF-ISF weighting scheme. The result shows that the TF-ISF model with the Cosine similarity measure retrieves more relevant documents. The model was evaluated against the conventional TF-ISF technique and shows that it performs significantly better on MS MARCO data (Microsoft-curated data of Bing queries).
Abstract
Due to the continuing demand for larger bandwidth, the optical transport becoming general in the access network. Using optical fiber technologies, the communications infrastructure becomes powerful, providing very high speeds to transfer a high capacity of data. Existing telecommunications infrastructures is currently widely used Passive Optical Network that apply Wavelength Division Multiplexing (WDM) and is awaited to play an important role in the future Internet supporting a large diversity of services and next generation networks. This paper presents a design of WDM-PON network, the simulation and analysis of transmission parameters in the Optisystem 7.0 environment for bidirectional traffic. The sim
... Show MoreDiyala River is a tributary of Tigris River, it is one of the important rivers in Iraq. It covers a total distance of 445 km (275 miles). 32600 km2is the area that drains by Diyala River between Iraqi-Iranian borders. This research aims to evaluate the water quality index WQI of Diyala River, where three stations were chosen along the river. These stations are D12 at Jalawlaa City at the beginning of Diyala River, the second station is D15 at Baaquba City at the mid distance of the river, and the third station is D17 which is the last station before the confluence of Diyala River with Tigris River at Baghdad city. Bhargava method was used in order to evaluate the water quality index for both irrigation and drink
... Show MoreThe study aims to evaluate the removal of sulfur content from Iraqi light naphtha produced in Al-Dora refinery by adsorption desulfurization DS technique using modified activated carbon MAC loaded with nickel Ni and copper Cu as single binary metals. The experiments were carried in a batch unit with various operating parameters; MAC dosage, agitation speed, and a contact time of 300 min at constant initial sulfur concentration 155 ppm and temperature. The results showed higher DS% by AC/Ni-Cu (66.45)% at 500 rpm and 1 g dosage than DS (29.03)% by activated carbon AC, increasing MAC dosage, agitation speed, and contact time led to increasing DS% values. The adsorption capacity of MAC results was recorded (16, 15, and 20) mg sulfu
... Show MoreFacial recognition has been an active field of imaging science. With the recent progresses in computer vision development, it is extensively applied in various areas, especially in law enforcement and security. Human face is a viable biometric that could be effectively used in both identification and verification. Thus far, regardless of a facial model and relevant metrics employed, its main shortcoming is that it requires a facial image, against which comparison is made. Therefore, closed circuit televisions and a facial database are always needed in an operational system. For the last few decades, unfortunately, we have experienced an emergence of asymmetric warfare, where acts of terrorism are often committed in secluded area with no
... Show MoreText based-image clustering (TBIC) is an insufficient approach for clustering related web images. It is a challenging task to abstract the visual features of images with the support of textual information in a database. In content-based image clustering (CBIC), image data are clustered on the foundation of specific features like texture, colors, boundaries, shapes. In this paper, an effective CBIC) technique is presented, which uses texture and statistical features of the images. The statistical features or moments of colors (mean, skewness, standard deviation, kurtosis, and variance) are extracted from the images. These features are collected in a one dimension array, and then genetic algorithm (GA) is applied for image clustering.
... Show MoreAn oil spill is a leakage of pipelines, vessels, oil rigs, or tankers that leads to the release of petroleum products into the marine environment or on land that happened naturally or due to human action, which resulted in severe damages and financial loss. Satellite imagery is one of the powerful tools currently utilized for capturing and getting vital information from the Earth's surface. But the complexity and the vast amount of data make it challenging and time-consuming for humans to process. However, with the advancement of deep learning techniques, the processes are now computerized for finding vital information using real-time satellite images. This paper applied three deep-learning algorithms for satellite image classification
... Show MoreThe presence of construction wastes such as clay bricks, glass, wood, plastic, and others in large quantities causes serious environmental problems in the world. Where these wastes can be used to preserve the natural resources used in construction and reduce the impact of this problem on the environment, it also works to reduce the problem of high loads of concrete blocks. Clay bricks aggregate (AB) can be recycled as coarse aggregate and replaced with volumetric proportions of coarse aggregate by ( 5% and 10%), as well as the use of clay brick powder (PB) by replacing its weight of cement (5% and 10%) and reduced in the manufacture of concrete blocks (blocks). Four mixtures will be prepared and tested to learn how to re
... Show More