|
Initial Experiments
Naive Bayes Classifier
Class >50K: Prior probability = 0.24
age: Normal Distribution.
Mean = 44.105
StandardDev = 10.364
WeightSum = 7841
Precision = 1.0138888888888888
workclass: Discrete Estimator.
Counts = 4964 725 623 372 618 354 1 1
(Total = 7658)
fnlwgt: Normal Distribution.
Mean = 188004.5163
StandardDev = 102535.4004
WeightSum = 7841
Precision = 68.01958700974731
education: Discrete Estimator.
Counts = 2222 1388 61 1676 424 266 362 28 41 34 960 7 63 307 17 1
(Total = 7857)
education-num: Normal Distribution.
Mean = 11.6117
StandardDev = 2.385
WeightSum = 7841
Precision = 1.0
marital-status: Discrete Estimator.
Counts = 6693 464 492 67 86 35 11
(Total = 7848)
occupation: Discrete Estimator.
Counts = 284 930 138 984 1969 1860 87 251 508 116 321 2 212 2
(Total = 7664)
relationship: Discrete Estimator.
Counts = 746 68 5919 857 38 219
(Total = 7847)
race: Discrete Estimator.
Counts = 7118 277 37 26 388
(Total = 7846)
sex: Discrete Estimator.
Counts = 1180 6663
(Total = 7843)
capital-gain: Normal Distribution.
Mean = 4029.7337
StandardDev = 14582.8927
WeightSum = 7841
Precision = 847.4491525423729
capital-loss: Normal Distribution.
Mean = 194.5982
StandardDev = 594.0944
WeightSum = 7841
Precision = 47.86813186813187
hours-per-week: Normal Distribution.
Mean = 45.4355
StandardDev = 10.9806
WeightSum = 7841
Precision = 1.053763440860215
native-country: Discrete Estimator.
Counts = 7172 8 31 13 40 45 1 41 25 9 17 21 26 19 2 62 26 13 11 6 34 5 6 13
3 3 5 21 5 3 4 4 3 4 4 7 10 3 3 7 1
(Total = 7736)
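Each discrete estimator above lists one count per attribute value, and a conditional probability is presumably that count divided by the listed total. For sex the total is 7843 = 7841 + 2, which is consistent with add-one smoothing of the raw counts, though that is inferred from the numbers and not stated in the output. A minimal sketch, with values indexed by position because the output does not label them:

```python
# Counts for the sex attribute in class >50K, copied from the output above.
# The output does not name the attribute values, so they are indexed by position.
sex_counts_gt50k = [1180, 6663]


def discrete_probabilities(counts):
    """Turn a list of counts into conditional probabilities P(value | class)."""
    total = sum(counts)
    return [c / total for c in counts]


probs = discrete_probabilities(sex_counts_gt50k)
print("total =", sum(sex_counts_gt50k))   # should agree with the listed Total
print("P(value | >50K) =", [round(p, 4) for p in probs])
```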
Class <=50K: Prior probability = 0.76
age: Normal Distribution.
Mean = 36.849
StandardDev = 13.8028
WeightSum = 24720
Precision = 1.0138888888888888
workclass: Discrete Estimator.
Counts = 17734 1818 495 590 1477 946 15 8
(Total = 23083)
fnlwgt: Normal Distribution.
Mean = 190340.8916
StandardDev = 106480.1251
WeightSum = 24720
Precision = 68.01958700974731
education: Discrete Estimator.
Counts = 3135 5905 1116 8827 154 803 1022 488 607 401 765 163 872 108 318 52
(Total = 24736)
education-num: Normal Distribution.
Mean = 9.5951
StandardDev = 2.4361
WeightSum = 24720
Precision = 1.0
marital-status: Discrete Estimator.
Counts = 8285 3981 10193 960 909 385 14
(Total = 24727)
occupation: Discrete Estimator.
Counts = 646 3171 3159 2668 2099 2282 1285 1753 3264 880 1278 149 439 9
(Total = 23082)
relationship: Discrete Estimator.
Counts = 824 5002 7276 7450 945 3229
(Total = 24726)
race: Discrete Estimator.
Counts = 20700 764 276 247 2738
(Total = 24725)
sex: Discrete Estimator.
Counts = 9593 15129
(Total = 24722)
capital-gain: Normal Distribution.
Mean = 149.7434
StandardDev = 965.1905
WeightSum = 24720
Precision = 847.4491525423729
capital-loss: Normal Distribution.
Mean = 53.0209
StandardDev = 310.1864
WeightSum = 24720
Precision = 47.86813186813187
hours-per-week: Normal Distribution.
Mean = 38.8175
StandardDev = 12.3267
WeightSum = 24720
Precision = 1.053763440860215
native-country: Discrete Estimator.
Counts = 22000 13 61 103 83 94 15 61 39 22 65 56 71 26 13 138 49 49 72 63
611 34 20 18 69 17 25 32 41 58 11 62 33 10 16 11 98 18 30 15 2
(Total = 24324)
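The priors and per-class Normal parameters above can be combined into a class posterior in the usual naive Bayes way. The sketch below uses only two numeric attributes and the textbook Gaussian density. The tool that produced this output may evaluate its normal estimators differently (for example, using the listed Precision), so the numbers are illustrative and not a reproduction of its predictions. The instance is hypothetical.

```python
import math

# Parameters copied from the model output above: (mean, standard deviation).
MODEL = {
    ">50K": {
        "prior": 0.24,
        "age": (44.105, 10.364),
        "hours-per-week": (45.4355, 10.9806),
    },
    "<=50K": {
        "prior": 0.76,
        "age": (36.849, 13.8028),
        "hours-per-week": (38.8175, 12.3267),
    },
}


def normal_pdf(x, mean, std):
    """Textbook Gaussian density (an assumption about how likelihoods are scored)."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def posterior(instance):
    """Normalized class scores using only the numeric attributes in `instance`."""
    scores = {}
    for label, params in MODEL.items():
        score = params["prior"]
        for attr, value in instance.items():
            mean, std = params[attr]
            score *= normal_pdf(value, mean, std)
        scores[label] = score
    total = sum(scores.values())
    return {label: s / total for label, s in scores.items()}


# Hypothetical instance, not taken from the dataset.
print(posterior({"age": 45, "hours-per-week": 50}))
```

A full classifier would multiply in the remaining attributes as well, typically summing log-probabilities to avoid underflow.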
=== Error on training data ===
Correctly Classified Instances 27149 83.3789 %
Incorrectly Classified Instances 5412 16.6211 %
Mean absolute error 0.1743
Root mean squared error 0.3731
Relative absolute error 47.6767 %
Root relative squared error 87.2648 %
Total Number of Instances 32561
=== Confusion Matrix ===
a b <-- classified as
4032 3809 | a = >50K
1603 23117 | b = <=50K
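The summary figures can be re-derived from the training confusion matrix. The sketch below treats >50K as the positive class (a choice made here, not by the output) and also computes precision and recall, which the summary does not report.

```python
# Training confusion matrix from the output above (rows = actual class).
tp, fn = 4032, 3809     # actual >50K:  predicted >50K, predicted <=50K
fp, tn = 1603, 23117    # actual <=50K: predicted >50K, predicted <=50K

total = tp + fn + fp + tn
accuracy = (tp + tn) / total
precision = tp / (tp + fp)   # share of predicted >50K that were actually >50K
recall = tp / (tp + fn)      # share of actual >50K that were predicted >50K

print(f"instances = {total}")
print(f"accuracy  = {accuracy:.4%}")
print(f"precision = {precision:.4f}")
print(f"recall    = {recall:.4f}")
```

Recall for >50K is only about 0.51, so the overall accuracy is carried largely by the majority class.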
=== Error on test data ===
Correctly Classified Instances 13514 83.0047 %
Incorrectly Classified Instances 2767 16.9953 %
Mean absolute error 0.1764
Root mean squared error 0.376
Relative absolute error 48.5636 %
Root relative squared error 88.5068 %
Total Number of Instances 16281
=== Confusion Matrix ===
a b <-- classified as
1940 1906 | a = >50K
861 11574 | b = <=50K
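To put the test accuracy in context, it can be compared with a baseline that always predicts the larger class, and with the training accuracy reported earlier. This is a small sketch using only numbers from the output. The majority-class baseline is an added point of comparison and not something the output reports.

```python
# Test confusion matrix from the output above (rows = actual class).
test_matrix = [
    [1940, 1906],    # actual >50K:  predicted >50K, predicted <=50K
    [861, 11574],    # actual <=50K: predicted >50K, predicted <=50K
]

total = sum(sum(row) for row in test_matrix)
correct = test_matrix[0][0] + test_matrix[1][1]
test_accuracy = correct / total

# Baseline: always predict the class with more actual instances.
majority_baseline = max(sum(row) for row in test_matrix) / total

# Training accuracy, from the training summary earlier in the output.
train_accuracy = 27149 / 32561

print(f"test accuracy     = {test_accuracy:.4%}")
print(f"majority baseline = {majority_baseline:.4%}")
print(f"train - test gap  = {train_accuracy - test_accuracy:.4%}")
```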
|
|
|