Initial Experiments

Intro ] [ shorttest1 ] [ simpletest-no-missing ] [ simpletest1 ] [ test1 ] [  ]

Up: Report ]

Naive Bayes Classifier

Class >50K: Prior probability = 0.24

age:  Normal Distribution. 
Mean = 44.105 
StandardDev = 10.364 
WeightSum = 7841
Precision = 1.0138888888888888

workclass:  Discrete Estimator. 
Counts =  4964 725 623 372 618 354 1 1  
(Total = 7658)

fnlwgt:  Normal Distribution. 
Mean = 188004.5163 
StandardDev = 102535.4004 
WeightSum = 7841 
Precision = 68.01958700974731

education:  Discrete Estimator. 
Counts =  2222 1388 61 1676 424 266 362 28 41 34 960 7 63 307 17 1  
(Total = 7857)

education-num:  Normal Distribution. Mean = 11.6117 StandardDev = 2.385 
WeightSum = 7841 Precision = 1.0

marital-status:  Discrete Estimator. 
Counts =  6693 464 492 67 86 35 11  
(Total = 7848)

occupation:  Discrete Estimator. 
Counts =  284 930 138 984 1969 1860 87 251 508 116 321 2 212 2  
(Total = 7664)

relationship:  Discrete Estimator. 
Counts =  746 68 5919 857 38 219  
(Total = 7847)

race:  Discrete Estimator. 
Counts =  7118 277 37 26 388  
(Total = 7846)

sex:  Discrete Estimator. 
Counts =  1180 6663  
(Total = 7843)

capital-gain:  Normal Distribution. 
Mean = 4029.7337 
StandardDev = 14582.8927 
WeightSum = 7841 
Precision = 847.4491525423729

capital-loss:  Normal Distribution. 
Mean = 194.5982 
StandardDev = 594.0944 
WeightSum = 7841 
Precision = 47.86813186813187

hours-per-week:  Normal Distribution. 
Mean = 45.4355 
StandardDev = 10.9806 
WeightSum = 7841 
Precision = 1.053763440860215

native-country:  Discrete Estimator. 
Counts =  7172 8 31 13 40 45 1 41 25 9 17 21 26 19 2 62 26 13 11 6 34 5 6 13
	  3 3 5 21 5 3 4 4 3 4 4 7 10 3 3 7 1  
(Total = 7736)


Class <=50K: Prior probability = 0.76

age:  Normal Distribution. 
Mean = 36.849 
StandardDev = 13.8028 
WeightSum = 24720 
Precision = 1.0138888888888888

workclass:  Discrete Estimator. 
Counts =  17734 1818 495 590 1477 946 15 8  
(Total = 23083)

fnlwgt:  Normal Distribution. 
Mean = 190340.8916 
StandardDev = 106480.1251 
WeightSum = 24720 
Precision = 68.01958700974731

education:  Discrete Estimator. 
Counts =  3135 5905 1116 8827 154 803 1022 488 607 401 765 163 872 108 318 52  
(Total = 24736)

education-num:  Normal Distribution. 
Mean = 9.5951 
StandardDev = 2.4361 
WeightSum = 24720 
Precision = 1.0

marital-status:  Discrete Estimator. 
Counts =  8285 3981 10193 960 909 385 14  
(Total = 24727)

occupation:  Discrete Estimator. 
Counts =  646 3171 3159 2668 2099 2282 1285 1753 3264 880 1278 149 439 9  
(Total = 23082)

relationship:  Discrete Estimator. 
Counts =  824 5002 7276 7450 945 3229  
(Total = 24726)

race:  Discrete Estimator. 
Counts =  20700 764 276 247 2738  
(Total = 24725)

sex:  Discrete Estimator. 
Counts =  9593 15129  
(Total = 24722)

capital-gain:  Normal Distribution. 
Mean = 149.7434 
StandardDev = 965.1905 
WeightSum = 24720 
Precision = 847.4491525423729

capital-loss:  Normal Distribution. 
Mean = 53.0209 
StandardDev = 310.1864 
WeightSum = 24720 
Precision = 47.86813186813187

hours-per-week:  Normal Distribution. 
Mean = 38.8175 
StandardDev = 12.3267 
WeightSum = 24720 
Precision = 1.053763440860215

native-country:  Discrete Estimator. 
Counts =  22000 13 61 103 83 94 15 61 39 22 65 56 71 26 13 138 49 49 72 63
	  611 34 20 18 69 17 25 32 41 58 11 62 33 10 16 11 98 18 30 15 2  
(Total = 24324)



=== Error on training data ===

Correctly Classified Instances       27149               83.3789 %
Incorrectly Classified Instances      5412               16.6211 %
Mean absolute error                      0.1743
Root mean squared error                  0.3731
Relative absolute error                 47.6767 %
Root relative squared error             87.2648 %
Total Number of Instances            32561     


=== Confusion Matrix ===

     a     b   <-- classified as
  4032  3809 |     a = >50K
  1603 23117 |     b = <=50K


=== Error on test data ===

Correctly Classified Instances       13514               83.0047 %
Incorrectly Classified Instances      2767               16.9953 %
Mean absolute error                      0.1764
Root mean squared error                  0.376 
Relative absolute error                 48.5636 %
Root relative squared error             88.5068 %
Total Number of Instances            16281     


=== Confusion Matrix ===

     a     b   <-- classified as
  1940  1906 |     a = >50K
   861 11574 |     b = <=50K

 

by: Keith A. Pray
Last Modified: July 4, 2004 8:59 AM
© 2004 - 1975 Keith A. Pray.
All rights reserved.