Please use this identifier to cite or link to this item: https://scholarhub.balamand.edu.lb/handle/uob/595
Title: Ham-spam filtering using different PCA scenarios
Authors: Dagher, Issam 
Antoun, Rima
Affiliations: Department of Computer Engineering 
Keywords: Feature extraction
Information filtering
Pattern classification
Text analysis
Unsolicited e-mail
Subjects: Principal component analysis
Issue Date: 2017
Publisher: IEEE
Part of: 2016 IEEE Intl Conference on Computational Science and Engineering (CSE) and IEEE Intl Conference on Embedded and Ubiquitous Computing (EUC) and 15th Intl Symposium on Distributed Computing and Applications for Business Engineering (DCABES)
Start page: 542
End page: 545
Conference: IEEE International Conference on Computational Science and Engineering CSE2016 (19th : 24-26 Aug. 2016 : Paris, France) 
Abstract: 
The objective of this paper is to discuss different scenarios for Principal Component Analysis classifier implemented for email filtering process (Ham vs. spam emails). The study highlights on the variation of the accuracy of these classifiers with respect to the variation in feature preprocessing. Four scenarios were considered: Scenario 1: Ham and Spam classes are represented with different features. Scenario 2: Ham and Spam classes are represented with same features. Scenario 3: Ham and Spam classes are represented with common terms. Scenario 4: Ham and Spam classes are represented with common Features and Characteristic terms. Different experiments were done using a public corpus extracted from the University of California-Irvine Machine Learning Repository. Different training and test sets were used. A comparison with Support Vector Machine and Bayes detector was done to prove its superior behavior.
URI: https://scholarhub.balamand.edu.lb/handle/uob/595
Ezproxy URL: Link to full text
Type: Conference Paper
Appears in Collections:Department of Computer Engineering

Show full item record

Record view(s)

40
checked on Apr 23, 2024

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.