Keith A. Pray - Professional and Academic Site
About Me
·
·
·
LinkedIn Profile Facebook Profile GoodReads Profile
Professional
Academic
Teaching
                                          
Printer Friendly Version
Initial Experiments

[ Intro ] [ shorttest1 ] [ simpletest-no-missing ] [ simpletest1 ] [ test1 ] [  ]

Up: Report ]

Initial Experiment Results

      All experiments here were done with the full training data set and tested with the full test data set. For the details of each experiment, please see the test.txt file for that test.

      The best results from this set of testing was the Kernel Density Estimator test (uses one Gaussian kernel per observed data value) instead of a normal distribution. Since this test was run out of my own curiosity for the complex Naive Bayes Classifier provided with Weka, I am not including it with the other test results using a simple Naive Bayes Classifier which more closely resembles the C code provided by the book. I provide the results here for completeness. The overall success rate for this classifier was 83.0047%.

      The next best results were obtained from Default settings. This used all the training data and all the testing data to achieve an 82.9924% success rate. It is interesting to note that when training examples with missing attribute values were removed from the training set, the success rate went down to 82.4104%.

Summary of these results:

Test Result
Short (500 train, 500 test) 82.8343
Default 82.9924
No missing attributes 82.4104
Complex Naive Bayes (Weka) 83.0047

      Based on the discussion in class (October 19, 2000) I decided to try two data pre-processing approaches.

  1. normalizing the continuous attributes
  2. discretizing the continuous attributes


by: Keith A. Pray
Last Modified: July 4, 2004 8:59 AM
© 2004 - 1975 Keith A. Pray.
All rights reserved.

Current Theme: 

Kapowee Hosted | Kapow Generated in 0.008 second | XHTML | CSS