Ham-spam filtering using different PCA scenarios

Dagher, Issam; Antoun, Rima

UOBScholar Hub

UOB Libraries created the UOBScholar Hub, an Institutional Repository (IR) for archiving and collecting all the research output of the UOB community. It aims to improve the visibility, usage and impact of research conducted at UOB. Materials included are: academic journal articles, conference papers and presentations, books and book chapters, ongoing research papers, reports and patents.

Please use this identifier to cite or link to this item: https://scholarhub.balamand.edu.lb/handle/uob/595

Title:	Ham-spam filtering using different PCA scenarios
Authors:	Dagher, Issam Antoun, Rima
Affiliations:	Department of Computer Engineering
Keywords:	Feature extraction Information filtering Pattern classification Text analysis Unsolicited e-mail
Subjects:	Principal component analysis
Issue Date:	2017
Publisher:	IEEE
Part of:	2016 IEEE Intl Conference on Computational Science and Engineering (CSE) and IEEE Intl Conference on Embedded and Ubiquitous Computing (EUC) and 15th Intl Symposium on Distributed Computing and Applications for Business Engineering (DCABES)
Start page:	542
End page:	545
Conference:	IEEE International Conference on Computational Science and Engineering CSE2016 (19th : 24-26 Aug. 2016 : Paris, France)
Abstract:	The objective of this paper is to discuss different scenarios for Principal Component Analysis classifier implemented for email filtering process (Ham vs. spam emails). The study highlights on the variation of the accuracy of these classifiers with respect to the variation in feature preprocessing. Four scenarios were considered: Scenario 1: Ham and Spam classes are represented with different features. Scenario 2: Ham and Spam classes are represented with same features. Scenario 3: Ham and Spam classes are represented with common terms. Scenario 4: Ham and Spam classes are represented with common Features and Characteristic terms. Different experiments were done using a public corpus extracted from the University of California-Irvine Machine Learning Repository. Different training and test sets were used. A comparison with Support Vector Machine and Bayes detector was done to prove its superior behavior.
URI:	https://scholarhub.balamand.edu.lb/handle/uob/595
Ezproxy URL:	Link to full text
Type:	Conference Paper
Appears in Collections:	Department of Computer Engineering

Show full item record

Record view(s)

66

checked on Dec 29, 2024

Google Scholar^TM

Check

UOBScholar Hub

Record view(s)

Google ScholarTM

Google Scholar^TM