The Automated VSMs to Categorize Arabic Text Data Sets

Authors

  • Mamoun Suleiman Al Rababaa Al albayt University, Mafraq
  • Essam Said Hanandeh Zarqa University, Zarqa

DOI:

https://doi.org/10.24297/ijct.v13i1.2925

Keywords:

Arabic data sets, Data mining, Text categorisation, Term weighting, VSM.

Abstract

Text Categorization is one of the most important tasks in information retrieval and data mining. This paper aims at investigating different variations of vector space models (VSMs) using KNN algorithm. we used 242 Arabic abstract documents that were used by (Hmeidi & Kanaan, 1997). The bases of our comparison are the most popular text evaluation measures; we use Recall measure, Precision measure, and F1 measure. The Experimental results against the Saudi data sets reveal that Cosine outperformed over of the Dice and Jaccard coefficients.

Downloads

Download data is not yet available.

Downloads

Published

2014-03-31

How to Cite

Al Rababaa, M. S., & Hanandeh, E. S. (2014). The Automated VSMs to Categorize Arabic Text Data Sets. INTERNATIONAL JOURNAL OF COMPUTERS &Amp; TECHNOLOGY, 13(1), 4074–4081. https://doi.org/10.24297/ijct.v13i1.2925

Issue

Section

Research Articles