The need for an efficient method to find the most relevant document for a given search query has become crucial due to the exponential growth in the number of documents readily available on the web. The vector space model (VSM), a widely used model in information retrieval, represents documents as vectors in space and weights their terms via a popular weighting scheme known as term frequency-inverse document frequency (TF-IDF). In this research, a method is proposed to retrieve the most relevant document by representing documents and queries as vectors of average term frequency-inverse sentence frequency (TF-ISF) weights instead of vectors of TF-IDF weights; two simple and effective similarity measures, Cosine and Jaccard, were used. Using the MS MARCO dataset (Microsoft-curated data of Bing queries), this article analyzes and assesses the retrieval effectiveness of the TF-ISF weighting scheme. The results show that the TF-ISF model with the Cosine similarity measure retrieves more relevant documents. The model was evaluated against the conventional TF-IDF technique and performs significantly better on the MS MARCO data.
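A minimal sketch of the weighting and similarity measures described above. It assumes TF-ISF mirrors TF-IDF with sentences playing the role of documents, and that a document's vector is the average of its per-sentence weights; the function names and the simple whitespace tokenizer are illustrative, not the paper's implementation.

```python
import math
from collections import Counter

def tf_isf(doc_sentences):
    """Average TF-ISF weights for a document split into sentences.

    ISF(t) = log(N / n_t), where N is the number of sentences and
    n_t the number of sentences containing term t (TF-IDF with
    sentences in place of documents).
    """
    n = len(doc_sentences)
    sent_tokens = [s.lower().split() for s in doc_sentences]
    sf = Counter()                      # sentence frequency per term
    for toks in sent_tokens:
        sf.update(set(toks))
    weights = {}
    for toks in sent_tokens:
        tf = Counter(toks)
        for t, f in tf.items():
            isf = math.log(n / sf[t])
            weights[t] = weights.get(t, 0.0) + (f / len(toks)) * isf
    return {t: w / n for t, w in weights.items()}   # average over sentences

def cosine(u, v):
    """Cosine similarity between two sparse term-weight vectors (dicts)."""
    dot = sum(u[t] * v.get(t, 0.0) for t in u)
    nu = math.sqrt(sum(x * x for x in u.values()))
    nv = math.sqrt(sum(x * x for x in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0

def jaccard(u, v):
    """Set-based Jaccard similarity over the terms of the two vectors."""
    a, b = set(u), set(v)
    return len(a & b) / len(a | b) if a | b else 0.0
```

Either similarity can then be used to rank documents against a query vector built the same way.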
Zernike Moments have been widely used in shape-based image retrieval studies because of their powerful shape representation. However, their strengths and weaknesses have not been clearly highlighted in previous studies, so this representational power could not be fully exploited. In this paper, a method to fully capture the shape representation properties of Zernike Moments is implemented and tested on a single object in binary and grey-level images. The proposed method works by determining the boundary of the shape object and then resizing the object shape to the boundary of the image. Three case studies were conducted. Case 1 applies Zernike Moments to the original shape object image. In Case 2, the centroid of the s
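The normalization step described above can be sketched as follows: locate the object's bounding box and rescale the crop to fill the image frame, so the subsequent Zernike Moments see a position- and scale-normalized shape. This is an illustrative NumPy-only sketch (nearest-neighbour resize, nonzero pixels assumed to belong to the object), not the paper's implementation.

```python
import numpy as np

def normalize_shape(img, out_size=64):
    """Crop the object's bounding box and rescale it to the image boundary.

    Assumes `img` is a 2-D array in which nonzero pixels belong to the
    single object of interest.
    """
    ys, xs = np.nonzero(img)
    crop = img[ys.min():ys.max() + 1, xs.min():xs.max() + 1]
    # nearest-neighbour resize of the crop to (out_size, out_size)
    ri = (np.arange(out_size) * crop.shape[0] / out_size).astype(int)
    ci = (np.arange(out_size) * crop.shape[1] / out_size).astype(int)
    return crop[np.ix_(ri, ci)]
```

Moments computed on the normalized image are then insensitive to where in the frame the object originally sat and how large it was.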
Document source identification in printer forensics involves determining the origin of a printed document based on characteristics such as the printer model, serial number, defects, or unique printing artifacts. This process is crucial in forensic investigations, particularly in cases involving counterfeit documents or unauthorized printing. However, consistent pattern identification across various printer types remains challenging, especially when efforts are made to alter printer-generated artifacts. Machine learning models are often used in these tasks, but selecting discriminative features while minimizing noise is essential. Traditional KNN classifiers require a careful selection of distance metrics to capture relevant printing
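The sensitivity of KNN to the choice of distance metric can be illustrated with a small pluggable-metric classifier. The feature vectors, labels, and metric names here are hypothetical; the sketch only shows the mechanism, not the paper's forensic feature set.

```python
import numpy as np

def knn_predict(X_train, y_train, x, k=3, metric="euclidean"):
    """Classify a feature vector (e.g. printing-artifact features)
    by majority vote among its k nearest training neighbours."""
    if metric == "euclidean":
        d = np.linalg.norm(X_train - x, axis=1)
    elif metric == "manhattan":
        d = np.abs(X_train - x).sum(axis=1)
    else:
        raise ValueError(f"unknown metric: {metric}")
    nearest = np.argsort(d)[:k]
    labels, counts = np.unique(np.asarray(y_train)[nearest],
                               return_counts=True)
    return labels[np.argmax(counts)]
```

Swapping the metric changes which artifact dimensions dominate the neighbourhood, which is exactly why metric selection matters in this task.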
In the present investigation, bed porosity and solid holdup in a viscous three-phase inverse fluidized bed (TPIFB) are determined for aqueous solutions of carboxymethyl cellulose (CMC) using polyethylene and polypropylene particles of low density and 5 mm diameter in a vertical Perspex column of 9.2 cm inner diameter and 200 cm height. The effects of gas velocity Ug, liquid velocity UL, liquid viscosity μL, and particle density ρs on bed porosity BP and solid holdup εs were determined. Bed porosity increases with increasing gas velocity, liquid velocity, and liquid viscosity. Solid holdup decreases with increasing gas, liquid
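The two reported quantities are linked by the standard phase-holdup balance for a three-phase bed, εg + εl + εs = 1, so bed porosity (the void fraction available to gas and liquid) follows directly from the solid holdup. This is a generic identity, not a correlation from the study.

```python
def bed_porosity(solid_holdup):
    """Bed porosity = fraction of bed volume not occupied by solids."""
    return 1.0 - solid_holdup

def holdup_balance(eps_g, eps_l, eps_s, tol=1e-9):
    """Check the three-phase identity: gas + liquid + solid holdups = 1."""
    return abs(eps_g + eps_l + eps_s - 1.0) < tol
```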
Signature verification involves vague situations in which a signature could resemble many reference samples or might differ because of handwriting variance. By representing the features and similarity score produced by the matching algorithm as fuzzy sets and capturing the degrees of membership, non-membership, and indeterminacy, a neutrosophic engine can significantly aid signature verification by addressing the inherent uncertainties and ambiguities present in signatures. However, type-1 neutrosophic logic assigns these membership functions fixed values, which may not adequately capture the varying degrees of uncertainty in signature characteristics. Type-1 neutrosophic representation is also unable to adjust to various
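The type-1 neutrosophic representation described above can be sketched as a fixed (truth, indeterminacy, falsity) triple attached to a match score, with a crisp decision rule on top. The thresholds and field names are illustrative assumptions, not the paper's parameters; the fixed values are precisely the limitation the abstract criticizes.

```python
from dataclasses import dataclass

@dataclass
class NeutrosophicScore:
    truth: float          # degree the signature matches (membership)
    indeterminacy: float  # degree the evidence is ambiguous
    falsity: float        # degree the signature does not match (non-membership)

def verify(score, t_min=0.6, i_max=0.3, f_max=0.3):
    """Accept only when truth dominates and both indeterminacy and
    falsity stay below their (fixed, type-1) thresholds."""
    return (score.truth >= t_min and
            score.indeterminacy <= i_max and
            score.falsity <= f_max)
```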
This research studies dimension-reduction methods that overcome the curse of dimensionality, which arises when traditional methods fail to provide good parameter estimates, so the problem must be dealt with directly. Two approaches were used to handle high-dimensional data: the non-classical sliced inverse regression (SIR) method, together with a proposed weighted standard SIR (WSIR) method, and principal component analysis (PCA), the general method used for dimension reduction. Both SIR and PCA are based on linear combinations of a subset of the original explanatory variables, which may suffer from the problems of heterogeneity and linear
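Of the two families mentioned, PCA is the easiest to sketch: centre the data and project onto the top-k right singular vectors, which are the directions of maximal variance. This is a generic SVD-based PCA, not the paper's SIR/WSIR procedure.

```python
import numpy as np

def pca_reduce(X, k):
    """Project X (n samples x p features) onto its top-k principal components.

    The principal directions are the rows of Vt from the SVD of the
    centred data matrix.
    """
    Xc = X - X.mean(axis=0)
    U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ Vt[:k].T
```

SIR differs in that it slices the response variable and works with inverse regression curves, but both ultimately return linear combinations of the explanatory variables.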
significant bits either in the spatial domain or the frequency domain, sequentially or pseudo-randomly, through the cover media. Based on this fact, statistical steganalysis uses different techniques to detect the hidden message. A steganographic scheme is proposed in which the hidden message is embedded in the second least significant bits in the frequency domain of the cover media, in order to avoid detection of the hidden message by known statistical steganalysis techniques.
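The core bit manipulation of such a scheme can be sketched as follows: write each message bit into the second least significant bit (bit index 1) of integer frequency-domain coefficients, e.g. quantized DCT values. The sequential embedding order and the coefficient array here are illustrative assumptions; the idea is only that avoiding the first LSB sidesteps steganalysis tuned to first-LSB statistics.

```python
import numpy as np

def embed_bits_2lsb(coeffs, bits):
    """Embed message bits into the 2nd LSB of integer coefficients
    (assumed to be quantized frequency-domain values)."""
    out = coeffs.copy()
    flat = out.ravel()
    for i, b in enumerate(bits):
        # clear bit 1, then set it to the message bit
        flat[i] = (flat[i] & ~0b10) | (int(b) << 1)
    return out

def extract_bits_2lsb(coeffs, n):
    """Read back the first n embedded bits from the 2nd LSB."""
    return [int(c >> 1) & 1 for c in coeffs.ravel()[:n]]
```

Each modified coefficient changes by at most 2, which keeps the distortion in the reconstructed cover small.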