Effects of Classification Techniques on Medical Reports Classification

Authors

  • Elfadil Abdalla Mohamed Alain University of Science and Technology
  • Fathi H. Saad NHS Oxfordshire, Oxford, UK
  • Omer I. E. Mohamed Alkhawarzmi College, UAE

DOI:

https://doi.org/10.24297/ijct.v13i2.2906

Keywords:

Document classification, positive-class based learning, partially supervised classification, labelled and unlabeled data, medical text mining, and features reduction.

Abstract

Text classification is the process of assigning pre-defined category labels to documents based on what a classifications has learned from training examples. This paper investigates the partially supervised classification approach in the medical field. The approaches that have been evaluated include Rocchio, Naïve Bayesian (NB), Spy, Support vector machine (SVM), and Expectation Maximization (EM). A combination of these methods has been conducted.  The experimental result showed that the combination which uses EM in step 2 is always produces better results than those uses SVM using small set of training samples. We also found that reducing the features based on tf-tdf values is decreasing the classification performance dramatically. Moreover, reducing the features based on their frequencies improve the classification performance significantly while also increasing efficiency, but it may require some experimentation 

Downloads

Download data is not yet available.

Author Biographies

Elfadil Abdalla Mohamed, Alain University of Science and Technology

Dr. Elfadil AbdallaMIS department

Omer I. E. Mohamed, Alkhawarzmi College, UAE

Department of Computer Science

Downloads

Published

2014-04-16

How to Cite

Mohamed, E. A., Saad, F. H., & Mohamed, O. I. E. (2014). Effects of Classification Techniques on Medical Reports Classification. INTERNATIONAL JOURNAL OF COMPUTERS &Amp; TECHNOLOGY, 13(2), 4206–4221. https://doi.org/10.24297/ijct.v13i2.2906

Issue

Section

Research Articles