Matlab Toolbox for Dimensionality Reduction (v0.7.1b)
=====================================================
Information
-------------------------
Author: Laurens van der Maaten
Affiliation: University of California, San Diego / Delft University of Technology
Contact: lvdmaaten@gmail.com
Release date: June 25, 2010
Version: 0.7.1b
Installation
-------------------------
Copy the drtoolbox/ folder into the $MATLAB_DIR/toolbox directory (where $MATLAB_DIR indicates your Matlab installation directory). Start Matlab and select 'Set path...' from the File menu. Click the 'Add with subfolders...' button, select the folder $MATLAB_DIR/toolbox/drtoolbox in the file dialog, and press Open. Subsequently, press the Save button in order to save your changes to the Matlab search path. The toolbox is now installed.
Some of the functions in the toolbox use MEX-files. Precompiled versions of these MEX-files are distributed with this release, but the compiled version for your platform might be missing. In order to compile all MEX-files, type cd([matlabroot '/toolbox/drtoolbox']) in your Matlab prompt, and execute the function MEXALL.
Features
-------------------------
This Matlab toolbox implements 32 techniques for dimensionality reduction. These techniques are all available through the COMPUTE_MAPPING function or trhough the GUI. The following techniques are available:
- Principal Component Analysis ('PCA')
- Linear Discriminant Analysis ('LDA')
- Multidimensional scaling ('MDS')
- Probabilistic PCA ('ProbPCA')
- Factor analysis ('FactorAnalysis')
- Sammon mapping ('Sammon')
- Isomap ('Isomap')
- Landmark Isomap ('LandmarkIsomap')
- Locally Linear Embedding ('LLE')
- Laplacian Eigenmaps ('Laplacian')
- Hessian LLE ('HessianLLE')
- Local Tangent Space Alignment ('LTSA')
- Diffusion maps ('DiffusionMaps')
- Kernel PCA ('KernelPCA')
- Generalized Discriminant Analysis ('KernelLDA')
- Stochastic Neighbor Embedding ('SNE')
- Symmetric Stochastic Neighbor Embedding ('SymSNE')
- t-Distributed Stochastic Neighbor Embedding ('tSNE')
- Neighborhood Preserving Embedding ('NPE')
- Linearity Preserving Projection ('LPP')
- Stochastic Proximity Embedding ('SPE')
- Linear Local Tangent Space Alignment ('LLTSA')
- Conformal Eigenmaps ('CCA', implemented as an extension of LLE)
- Maximum Variance Unfolding ('MVU', implemented as an extension of LLE)
- Landmark Maximum Variance Unfolding ('LandmarkMVU')
- Fast Maximum Variance Unfolding ('FastMVU')
- Locally Linear Coordination ('LLC')
- Manifold charting ('ManifoldChart')
- Coordinated Factor Analysis ('CFA')
- Gaussian Process Latent Variable Model ('GPLVM')
- Autoencoders using stack-of-RBMs pretraining ('AutoEncoderRBM')
- Autoencoders using evolutionary optimization ('AutoEncoderEA')
Furthermore, the toolbox contains 6 techniques for intrinsic dimensionality estimation. These techniques are available through the function INTRINSIC_DIM. The following techniques are available:
- Eigenvalue-based estimation ('EigValue')
- Maximum Likelihood Estimator ('MLE')
- Estimator based on correlation dimension ('CorrDim')
- Estimator based on nearest neighbor evaluation ('NearNb')
- Estimator based on packing numbers ('PackingNumbers')
- Estimator based on geodesic minimum spanning tree ('GMST')
In addition to these techniques, the toolbox contains functions for prewhitening of data (the function PREWHITEN), exact and estimate out-of-sample extension (the functions OUT_OF_SAMPLE and OUT_OF_SAMPLE_EST), and a function that generates toy datasets (the function GENERATE_DATA).
The graphical user interface of the toolbox is accessible through the DRGUI function.
Usage
-------------------------
Basically, you only need one function: mappedX = compute_mapping(X, technique, no_dims);
Try executing the following code:
[X, labels] = generate_data('helix', 2000);
figure, scatter3(X(:,1), X(:,2), X(:,3), 5, labels); title('Original dataset'), drawnow
no_dims = round(intrinsic_dim(X, 'MLE'));
disp(['MLE estimate of intrinsic dimensionality: ' num2str(no_dims)]);
mappedX = compute_mapping(X, 'Laplacian', no_dims, 7);
figure, scatter(mappedX(:,1), mappedX(:,2), 5, labels); title('Result of dimensionality reduction'), drawnow
It will create a helix dataset, estimate the intrinsic dimensionality of the dataset, run Laplacian Eigenmaps on the dataset, and plot the results. All functions in the toolbox can work both on data matrices as on PRTools datasets (http://prtools.org). For more information on the options for dimensionality reduction, type HELP COMPUTE_MAPPING in your Matlab prompt. Information on the intrinsic dimensionality estimators can be obtained by typing the HELP INTRINSIC_DIM.
Other functions that are useful are the GENERATE_DATA function and the OUT_OF_SAMPLE and OUT_OF_SAMPLE_EST functions. The GENERATE_DATA function provides you with a number of artificial datasets to test the techniques. The OUT_OF_SAMPLE function allows for out-of-sample extension for the techniques PCA, LDA, LPP, NPE, LLTSA, Kernel PCA, and autoencoders. The OUT_OF_SAMPLE_EST function allows you to perform an out-of-sample extension using an estimation technique, that is generally applicable.
Many of the available functions are also available through the GUI, which can be executed by running the function DRGUI.
Pitfalls
-------------------------
When you run certain code, you might receive an error that a certain file is missing. This is because in some parts of the code, MEX-functions are used. I provide a number of precompiled versions of these MEX-functions in the toolbox. However, the MEX-file for your platform might be missing. To fix this, type in your Matlab:
mexall
Now you have compiled versions of the MEX-files as well. This fix also solves slow execution of the shortest path computations in Isomap.
If you encounter an error considering CSDP while running the FastMVU-algorithm, the binary of CSDP for your platform is missing. If so, please obtain a binary distribution of CSDP from https://projects.coin-or.org/Csdp/ and place it in the drtoolbox/techniques directory. Make sure it has the right name for your platform (csdp.exe for Windows, csdpmac for Mac OS X (PowerPC), csdpmaci for Mac OS X (Intel), and csdplinux for Linux).
Many methods for dimensionality reduction perform spectral analyses of sparse matrices. You might think that eigenanalysis is a well-studied problem that can easily be solved. However, eigenanalysis of large matrices turns out to be tedious. The toolbox allows you to use two different methods for eigenanalysis:
- The original Matlab functions (based on Arnoldi methods)
- The JDQR functions (based on Jacobi-Davidson methods)
For problems up to 10,000 datapoints, we recommend using the 'Matlab' setting. For larger problems, switching to 'JDQR' is often worth trying.
Papers
-------------------------
For more information on the implemented techniques and for a theoretical and empirical comparison, please have a look at the following papers:
- L.J.P. van der Maaten, E.O. Postma, and H.J. van den Herik. Dimensionality Reduction: A Comparative Review. Tilburg University Technical Report, TiCC-TR 2009-005, 2009.
Version history
-------------------------
Version 0.7.1b:
- Small bugfixes.
Version 0.7b:
- Many small bugfixes and speed improvements.
- Added out-of-sample extension for manifold charting.
- Added first version of graphical user interface for the toolbox. The GUI was developed by Maxim Vedenev with the help of Susanth Vemulapalli and Maarten Huybrecht. I made some changes in the initial version of the GUI code.
- Added implementation of Gaussian Process Latent Variable Model (GPLVM).
- Removed Simple PCA as probabilistic PCA is more appropriate.
Version 0.6b:
- Resolved bug in LLE that was introduced with v0.6b.
- Added implementation of t-SNE.
- Resolved small bug in data generation function.
- Improved RBM implementation in au
没有合适的资源?快使用搜索试试~ 我知道了~
美赛ABCDEF题常见参考代码.zip
共1453个文件
m:923个
mat:81个
txt:64个
1.该资源内容由用户上传,如若侵权请联系客服进行举报
2.虚拟产品一经售出概不退款(资源遇到问题,请及时私信上传者)
2.虚拟产品一经售出概不退款(资源遇到问题,请及时私信上传者)
版权申诉
0 下载量 15 浏览量
2024-04-26
07:21:53
上传
评论
收藏 40.15MB ZIP 举报
温馨提示
美赛ABCDEF题常见参考代码.zip 美赛ABCDEF题常见参考代码.zip
资源推荐
资源详情
资源评论
收起资源包目录
美赛ABCDEF题常见参考代码.zip (1453个子文件)
renyuanshusan.asv 34KB
DataAnalysis.asv 28KB
GPSR_Basic.asv 21KB
figures_1_2_3.asv 8KB
wavenn.asv 3KB
main.asv 3KB
main1.asv 3KB
chapter8.2.asv 3KB
LPINT.asv 2KB
LPINT.asv 2KB
EZR.asv 2KB
Untitled2.asv 1KB
Untitled2.asv 1KB
GP.asv 958B
max_coherent.asv 343B
About.bmp 930KB
tfdomain.bmp 224KB
Coastguard_cif_000003.bmp 100KB
Coastguard_cif_000002.bmp 100KB
Coastguard_cif_SI_3_0.70_000004.bmp 100KB
Coastguard_cif_SI_3_0.70_000002.bmp 100KB
Coastguard_cif_SI_3_0.70_000005.bmp 100KB
Coastguard_cif_000006.bmp 100KB
Coastguard_cif_000004.bmp 100KB
Coastguard_cif_000007.bmp 100KB
Coastguard_cif_000000.bmp 100KB
Coastguard_cif_000001.bmp 100KB
Coastguard_cif_SI_3_0.70_000001.bmp 100KB
Coastguard_cif_000005.bmp 100KB
lena256.bmp 65KB
svmtrain.c 11KB
svmpredict.c 9KB
mexCCACollectData.c 8KB
svm_model_matlab.c 8KB
kernel_function.c 7KB
mexCCACollectData2.c 6KB
find_nn.c 4KB
libsvmread.c 4KB
computegr.c 3KB
libsvmwrite.c 2KB
KRUSK.C 2KB
AnalysisCHM.chm 396KB
COPYRIGHT 1KB
Copyright 487B
svm.cpp 61KB
dijkstra.cpp 27KB
decode.cpp 23KB
encode.cpp 11KB
PRISM.cpp 6KB
csdplinux 1.62MB
csdpmac 81KB
csdpmaci 88KB
main.css 1KB
temp.dat 150B
KRUSK.DLL 40KB
lphr.dll 30KB
lphd.dll 30KB
dijkstra.dll 9KB
mexCCACollectData2.dll 8KB
computegr.dll 7KB
mexCCACollectData.dll 7KB
用蒙特卡洛法实现对排队等待问题的计算机模拟.doc 360KB
matlab多元非线性回归教程.doc 333KB
matlab多元非线性回归教程.doc 333KB
传染病模型SI、SIS、SIR.doc 88KB
CS_BP程序注释.doc 66KB
程序运行后保存的一个结果文本汇报.doc 57KB
linxl.doc 39KB
用Matlab求解下列各初值问题.doc 37KB
相似性度量方法介绍.doc 34KB
GARCH-Matlab.doc 34KB
Markov.doc 33KB
粗大误差处理方法.doc 29KB
程序实现的大致思路.doc 28KB
bp.doc 25KB
基于遗传算法修正的BP神经网络算法.docx 851KB
改进灰色马尔科夫模型湖北省用水量预测模型.docx 256KB
模糊综合评价原理+案例讲解与Matlab实现.docx 166KB
多项式逻辑回归.docx 21KB
离散型遗传算法求解组合优化代码.docx 13KB
.DS_Store 22KB
.DS_Store 16KB
.DS_Store 14KB
.DS_Store 14KB
.DS_Store 14KB
.DS_Store 12KB
.DS_Store 10KB
.DS_Store 8KB
.DS_Store 6KB
.DS_Store 6KB
.DS_Store 6KB
.DS_Store 6KB
.DS_Store 6KB
.DS_Store 6KB
.DS_Store 6KB
.DS_Store 6KB
.DS_Store 6KB
.DS_Store 6KB
.DS_Store 6KB
.DS_Store 6KB
共 1453 条
- 1
- 2
- 3
- 4
- 5
- 6
- 15
资源评论
MarcoPage
- 粉丝: 3359
- 资源: 4033
下载权益
C知道特权
VIP文章
课程特权
开通VIP
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功