核密度估计matlab程序.zip_核密度估计matlab,matlab核密度估计资源-CSDN文库

共151个文件

m：71个

cpp：21个

dll：17个

需积分: 34 36 浏览量 2021-07-13 22:11:17 上传评论 7 收藏 2.16MB ZIP 举报

核密度估计（Kernel Density Estimation, KDE）是一种非参数统计方法，用于估计数据分布的形状。在MATLAB中实现KDE可以帮助我们理解数据的集中趋势、峰值和异常值，而无需预先假设数据遵循特定的概率分布。这个zip文件包含了用于执行核密度估计的MATLAB程序及相关支持文件。 `makemex`文件是MATLAB中的编译命令，用于将C或C++源代码编译为可执行的MEX文件。在本案例中，可能有一些函数是用C或C++编写并以DLL（动态链接库）的形式提供的，如adjustWeights.dll、adjustBW.dll、adjustPoints.dll、llGrad.dll和knn.dll。这些DLL文件是预编译的二进制代码，用于加速核密度估计的计算过程。在使用KDE程序之前，必须运行`makemex`以确保所有依赖项都已正确编译和加载。 `reduce.m`、`plot.m`、`evalIFGT.m`和`evalFGT.m`是MATLAB脚本或函数文件。它们分别承担不同的职责： 1. `reduce.m`：可能是一个数据降维或预处理函数，用于处理输入数据，以便更好地进行核密度估计。 2. `plot.m`：很可能是绘制核密度图的函数，它能可视化数据的分布情况，帮助用户直观地理解数据特征。 3. `evalIFGT.m`和`evalFGT.m`：这两个函数可能涉及到阈值评估或者与累积分布函数（CDF）相关的操作。在核密度估计中，可能需要计算数据点超过某个阈值的概率，这些函数可能就是为此目的设计的。在使用这些MATLAB程序进行核密度估计时，通常的步骤包括： 1. 加载数据：导入需要分析的数据集到MATLAB环境中。 2. 调整参数：选择合适的核函数（如高斯核）和带宽（决定核密度估计的精度和平滑程度）。 3. 运行核密度估计：使用提供的MATLAB函数进行估计，这可能涉及调用编译后的DLL文件来加速计算。 4. 数据可视化：通过`plot.m`或其他图形工具显示核密度图，分析数据分布。 5. 结果应用：根据得到的核密度估计，进行预测、风险控制或预期收益计算。需要注意的是，`license.gpl`文件表明这些代码可能遵循GPL许可证，这意味着代码是开源的，并且如果在项目中使用，可能需要遵守相应的开源许可条款。这个MATLAB程序包提供了一套工具，用于执行核密度估计和相关分析，适用于各种科学和工程领域，尤其是那些需要理解数据分布特点和进行预测分析的任务。正确理解和使用这些工具能够提升数据分析的效率和准确性。

资源详情

资源评论

资源推荐

收起资源包目录

核密度估计matlab程序.zip （151个子文件）

BallTreeDensityClass.cc 26KB

BallTreeClass.cc 16KB

prodSampleEpsilon.cpp 13KB

iseEpsilon.cpp 12KB

prodSampleGibbsMS.cpp 12KB

prodSampleGibbs.cpp 10KB

klGradRS.cpp 8KB

prodSampleExact.cpp 7KB

entropyGradISE.cpp 7KB

reduceSolve.cpp 7KB

prodSampleGibbsMS1.cpp 4KB

prodSampleGibbsMS2.cpp 4KB

prodSampleGibbs1.cpp 4KB

prodSampleGibbs2.cpp 4KB

entropyGradRS.cpp 3KB

llGrad.cpp 2KB

adjustBW.cpp 1KB

knn.cpp 1KB

DualTree.cpp 1KB

BallTreeDensity.cpp 926B

adjustWeights.cpp 867B

adjustPoints.cpp 688B

BallTree.cpp 630B

prodSampleGibbsMS1.dll 248KB

prodSampleGibbsMS2.dll 248KB

prodSampleEpsilon.dll 247KB

iseEpsilon.dll 247KB

prodSampleGibbs2.dll 246KB

prodSampleGibbs1.dll 246KB

prodSampleExact.dll 245KB

entropyGradISE.dll 244KB

adjustBW.dll 242KB

DualTree.dll 242KB

adjustWeights.dll 242KB

adjustPoints.dll 242KB

BallTreeDensity.dll 242KB

BallTree.dll 225KB

llGrad.dll 67KB

knn.dll 66KB

reduceSolve.dll 8KB

license.gpl 24KB

kernels.h 8KB

BallTreeDensity.h 6KB

BallTree.h 5KB

reduceSolveM.m 13KB

reduce.m 8KB

plot.m 7KB

evalIFGT.m 7KB

evalFGT.m 7KB

joinTrees.m 6KB

Contents.m 5KB

ksize.m 5KB

productApprox.m 4KB

reduceKD.m 3KB

modes.m 3KB

klGrad.m 3KB

makemex.m 3KB

ksizeHall.m 3KB

productExact.m 3KB

encode.m 3KB

llGrad.m 2KB

kde.m 2KB

maxlogerr.m 2KB

llHess.m 2KB

ksizeCalcUseful.m 2KB

hist.m 2KB

prodSampleImportPair.m 2KB

reduceKD2.m 2KB

prodSampleImportGauss.m 2KB

kld.m 2KB

randKernel.m 2KB

prodSampleImportMix.m 2KB

miGrad.m 2KB

entropy.m 2KB

DualTree.m 2KB

evaluate.m 1KB

ksizeROT.m 1KB

golden.m 1KB

ise.m 1KB

condition.m 1KB

ksizeLSCV.m 1KB

demo_kde_3.m 1KB

adjustBW.m 1KB

ksizeMSP.m 1KB

adjustWeights.m 1KB

entropyGrad.m 1006B

resample.m 1004B

sample.m 999B

demo_kde_2.m 975B

display.m 953B

findBWCrit.m 947B

covar.m 885B

evalAvgLogL.m 862B

maketmp.m 858B

knn.m 857B

demo_kde_1.m 812B

quantize.m 728B

rescale.m 689B

marginal.m 658B

getBW.m 651B

共 151 条

============================================================================== MATLAB KDE Class Description & Specification ============================================================================== The KDE class is a general matlab class for k-dimensional kernel density estimation. It is written in a mix of matlab ".m" files and MEX/C++ code. Thus, to use it you will need to be able to compile C++ code for Matlab. Note that the default compiler for Windows does *not* support C++, so you will need GCC under Linux, or GCC or Visual C++ for Windows. Bloodshed (http://www.bloodshed.net) supplies a nice development environment along with the MinGW (http://www.mingw.org) compiler. See the page http://gnumex.sourceforge.net/ for help setting up MEX with MinGW. Kernels supported are: Gaussian, Epanetchnikov (truncated quadratic), and Laplacian (Double exponential) For multivariate density estimates, the code supports product kernels -- kernels which are products of the kernel function in each dimension. For example, for Gaussian kernels this is equivalent to requiring a diagonal covariance. It can also support non-uniform kernel bandwidths -- i.e. bandwidths which vary over kernel centers. The implementation uses "kd-trees", a heirarchical representation for point sets which caches sufficient statistics about point locations etc. in order to achieve potential speedups in computation. For the Epanetchnikov kernel this can translate into speedups with no loss of precision; but for kernels with infinite support it provides an approximation tolerance level, which allows tradeoffs between evaluation quality and computation speed. In particular, we implement Alex Gray's "Dual Tree" evaluation algorithm; see [Gray and Moore, "Very Fast Multivariate Kernel Density Estimation using via Computational Geometry", in Proceedings, Joint Stat. Meeting 2003] for more details. This gives a tolerance parameter which is a percent error (from the exact, N^2 computation) on the value at any evaluated point. In general, "tolerance" parameters in the matlab code / notes refers to this percent tolerance. This percentage error translates to an absolute additive error on the mean log-likelihood, for example. An exception to this is the gradient calcuation functions, which calculate using an absolute tolerance value. This is due to the difficulty of finding a percentage bound when the function calculated is not strictly positive. We have also recently implemented the so-called Improved Fast Gauss Transform, described in [Yang, Duraiswami, and Gumerov, "Improved Fast Gauss Transform", submitted to the Siam Journal of Scientific Computing]. This often performs MUCH faster than the dual tree algorithm mentioned above, but the error bounds which control the computation are often quite loose, and somewhat unwieldy (for example, it is difficult to obtain the fractional error bounds provided & used by the dual tree methods and other functions in the KDE toolbox). Thus for the moment we have left the IFGT separate, with alternate controls for computational complexity (see below, and the file "evalIFGT.m"). ============================================================================== Getting Started ============================================================================== Unzip the KDE class to a directory called @kde. Compile the MEX functions. This can be done by running "makemex" from inside matlab, in the "@kde/mex" directory. If this fails, make sure that MEX / C++ compilation works. The KDE toolbox is tested in Matlab R13, but apparently has problems in R12; I'm planning to investigate this. NOTE: MS Visual C++ has a bug in dealing with "static const" variables; I think there is a patch available, or you can change these to #defines. Operate from the class' parent directory, or add it to your MATLAB path (e.g. if you unzip to "myhome/@kde", cd in matlab to the "myhome" dir, or add it to the path.) Objects of type KDE may be created by e.g. p = kde( rand(2,1000), [.05;.03] ); % Gaussian kernel, 2D % BW = .05 in dim 1, .03 in dim 2. p = kde( rand(2,1000), .05, ones(1,1000) ) % Same as above, but uniform BW and % specifying weights p = kde( rand(2,1000), .05, ones(1,1000), 'Epanetchnikov') % Quadratic kernel % Just 'E' or 'e' also works p = kde( rand(2,1000), 'rot' ); % Gaussian kernel, 2D, % BW chosen by "rule of thumb" (below) To see the kernel shape types, you can use: plot(-3:.01:3, evaluate(kde(0,1,1,T),-3:.01:3) ); % where T = 'G', 'E', or 'L' Kernel sizes may be selected automatically using e.g. p = ksize(p, 'lcv'); % 1D Likelihood-based search for BW p = ksize(p, 'rot'); % "Rule of Thumb"; Silverman '86 / Scott '92 p = ksize(p, 'hall'); % Plug-in type estimator Density estimates may be visualized using e.g. plot(p); or mesh(hist(p)); See help kde/plot and help kde/hist for more information. Also, the demonstration programs @kde/examples/demo_kde_#.m may be helpful. ============================================================================== KDE Matlab class definition ============================================================================== The following is a simple list of all accessible functions for the KDE class. Constructors: ===================================================== kde( ) : empty kde kde( kde ) : re-construct kde from points, weights, bw, etc. kde( points, bw ) : construct Gauss kde with weights 1/N kde( points, bw, weights) : construct Gaussian kde kde( points, bw, weights,type): potentially non-Gaussian marginal( kde, dim) : marginalize to the given dimensions condition( kde, dim, A) : marginalize to ~dim and weight by K(x_i(dim),a(dim)) resample( kde, [kstype] ) : draw N samples from kde & use to construct a new kde reduce( kde, ...) : construct a "reduced" density estimate (fewer points) joinTrees( t1, t2 ) : make a new tree with t1 and t2 as the children of a new root node Accessors: (data access, extremely limited or no processing req'd) ===================================================== getType(kde) : return the kernel type of the KDE ('Gaussian', etc) getBW(kde,index) : return the bandwidth assoc. with x_i (Ndim x length(index)) adjustBW : set the bandwidth(s) of the KDE (by reference!) Note: cannot change from a uniform -> non-uniform bandwidth ksize : automatic bandwidth selection via a number of methods LCV : 1D search using max leave-one-out likelihood criterion HALL : Plug-in estimator with good asymptotics; MISE criterion ROT,MSP : Fast standard-deviaion based methods; AMISE criterion LOCAL : Like LCV, but makes BW propto k-th NN distance (k=sqrt(N)) getPoints(kde) : Ndim x Npoints array of kernel locations adjustPoints(p,delta) : shift points of P by delta (by reference!) getWeights : [1 x Npts] array of kernel weights adjustWeights : set kernel weights (by reference!) rescale(kde,alpha) : rescale a KDE by the (vector) alpha getDim : get the dimension of the data getNpts : get the # of kernel locations getNeff : "effective" # of kernels (accounts for non-uniform weights) sample(P,Np,KSType) : draw Np new samples from P and set BW according to KSType Display: (visualization / Description) ===================================================== plot(kde...) : plot the specified dimensions of the KDE locations hist(kde...) : discretize the kde at uniform bin lengths display : text output describing the KDE double : boolean evaluation of the KDE (non-empty) Statistics: (useful stats & operations on a kde) ===================================================== covar : find the (weighted) covariance of the kernel centers mean : find the (weighted) mean of the kernel centers modes