Ph.D. Research : Ph.D. Home : Documents : References : A Comparison Of Unsupervised Dimension Reduction Algorithms For Classification Notes

Keith's Ph.D. Research

·	Ph.D. Home
·	Keith's Professional Site

About

·	People
·	WPI CS KDDRG
·	WPI CS Department

Documents

·	Presentations
·	References
·	To Do List
·	More...

Experiments

·	Data
·	Results
·	More...

Printer Friendly Version

References

[ Intro ] [ A Comparison Of Unsupervised Dimension Reduction Algorithms For Classification Notes ] [ Beck AT 1961.pdf ]

[ Up: Documents ]

2007-01-29
----------

Notes on Page 71 (first page of paper)

Overall it seems we should try this (DPDR) on our data.

Add to TODO list: 
    find (write) implementation for Weka, 
    Ensure SVM in Weka uses RBF kernel 
    (or something that relies only on Euclidean or cosine 
    similarity measures)

This paper deals only with feature extraction methods as a 
way to reduce the dimensionality of data. It is not about 
attribute selection.

Notes on Page 72

The authors refer to "powerful" classifiers such as SVMs but 
do not define "powerful".

Notes on Page 73

The data used in this study is largely similar to our own 
pancreatic patient data. Both have very few instance and a 
large number of attributes. In fact, the data used in this 
study has far more attribute per instance than our own.
One thing to note when viewing the results is that these data 
sets deal with gene expression levels, that are known to be 
string indicating factors in the target attribute. I am 
currently not as certain our data is complete with respect to 
capturing all the factors determining our target attributes.

One interesting point is that for all the feature extraction 
methods used, the target dimensionality is a parameter with the 
exception of DPDR. All have the number of instances as one of 
the values tried, with the exception of MDS, even though it 
certainly looks possible. I wonder why the authors chose to not 
have a common point of reference. I did not see mention of
specifying the number of dimensions DPDR results in being used 
as a value for this parameter.

Notes on Page 74

The results shown in Table 3 are quite good. See previous note 
on confidence of our own pancreatic patient data.

Intro

A Comparison Of Unsupervised Dimension Reduction Algorithms For Classification Notes

Beck AT 1961.pdf

Carolina Ruiz Bagging Boosting.ppt

Distance Preserving Dimension Reduction For Manifold Learning Notes

FeatureSelectionForMachineLearning-ComparingACorrelation-basedFilterApproachToTheWrapper.pdf

Hayward Thesis.pdf

Hayward Thesis Notes

Matrices Vector Spaces And Information Retrieval Notes

Shivin Misra thesis final.pdf

Stuart Floyd MS Thesis Final.pdf

Stuart Floyd Thesis Presentation.pdf