PyPI Download | pypolyagamma-1.1.5.tar.gz Resource - CSDN Library

103 views · uploaded 2022-01-29 11:26:59 · 212 KB · GZ

37 files in total

cpp: 9

h: 7

py: 5

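Download from PyPI | pypolyagamma-1.1.5.tar.gz: a Python library for sampling Pólya-gamma random variates

PyPI (the Python Package Index) is the central repository where Python developers publish and download packages. The file `pypolyagamma-1.1.5.tar.gz` hosted there is the source distribution of version 1.1.5 of pypolyagamma, a library for sampling from the Pólya-gamma distribution.

The Pólya-gamma distribution is a continuous probability distribution on the positive reals. Its main use in statistics and machine learning is data augmentation for Bayesian inference. In models with Gaussian latent variables and Bernoulli, binomial, negative binomial, or multinomial observations under a logistic link, introducing Pólya-gamma auxiliary variables makes the model conditionally conjugate, so it can be fit with a simple Gibbs sampler. pypolyagamma provides the Python interface for drawing these variates, along with ready-made classes for simple count regression models.

The listing does not document what changed in version 1.1.5 relative to earlier releases.

pypolyagamma is not a pure-Python package. It is a Cython port of C++ code, and the archive contains `.pyx`, `.cpp`, `.h`, and `.hpp` sources, so installing from this source distribution requires a working C/C++ compiler. According to the bundled README, the installer downloads and builds the GSL sources it needs, so GSL does not have to be installed separately, and OpenMP-based parallel sampling is an optional build setting.

The package can be installed from PyPI with `pip install pypolyagamma`.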


pypolyagamma-1.1.5.tar.gz (37 files)

pypolyagamma-1.1.5

MANIFEST.in 262B

PKG-INFO 578B

pypolyagamma

parallel.pyx 1KB

utils.py 5KB

cpp

PolyaGammaAlt.h 3KB

PolyaGammaAlt.cpp 7KB

PolyaGammaSP.cpp 6KB

include

GRNG.hpp 3KB

RNG.cpp 11KB

RNG.hpp 6KB

GRNG.cpp 4KB

SRNG.hpp 3KB

PolyaGamma.h 2KB

InvertY.cpp 2KB

PolyaGammaSmallB.h 731B

PolyaGammaSmallB.cpp 2KB

PolyaGammaOMP.h 2KB

PolyaGamma.cpp 6KB

PolyaGammaSP.h 1KB

PolyaGammaPar.h 8KB

PolyaGammaHybrid.h 2KB

InvertY.hpp 2KB

distributions.py 23KB

parallel.cpp 731KB

__init__.py 865B

pypolyagamma.pyx 987B

pypolyagamma.cpp 693KB

deps

README.md 226B

test

test_basics.py 3KB

setup.cfg 79B

setup.py 6KB

pypolyagamma.egg-info

PKG-INFO 578B

requires.txt 23B

SOURCES.txt 1KB

top_level.txt 13B

dependency_links.txt 1B

README.md 8KB

# PyPólyaGamma

[![Test status](https://travis-ci.org/slinderman/pypolyagamma.svg?branch=master)](https://travis-ci.org/slinderman/pypolyagamma)

This is a Cython port of Jesse Windle's code at https://github.com/jwindle/BayesLogit. It provides a Python interface for efficiently sampling Pólya-gamma random variates. Install with:

    pip install pypolyagamma

Please open issues if you have any trouble!

# Background

Pólya-gamma augmentation is a method of performing fast and simple Bayesian inference in models with Gaussian latent variables and count observations. While such models are non-conjugate, if the observation model has the right form (specifically, if it is a Bernoulli, binomial, negative binomial, or multinomial with a logistic link function), we can introduce a set of Pólya-gamma auxiliary variables that render it conditionally conjugate. This facilitates fast Gibbs sampling algorithms on an extended space of Gaussian latent variables and Pólya-gamma auxiliary variables, where integrating out the auxiliary variables leaves the original model intact.

Given the auxiliary variables, the latent Gaussian variables have a Gaussian conditional distribution. Likewise, given the Gaussian latent variables and the observed count data, the auxiliary variables have a Pólya-gamma conditional distribution. Thus, to implement the Gibbs sampling algorithm, we must be able to efficiently sample Pólya-gamma random variates. This library provides code to do exactly that.

The augmented density, the non-Gaussian marginal, and the Gaussian conditionals are illustrated in the figure below. In this case, the posterior is from a simple binomial model. Next, we'll show how to perform Gibbs sampling for such a model.

![Marginals](https://raw.githubusercontent.com/slinderman/pypolyagamma/simplegsl/aux/marginals.png)

See below for more references and links.
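To make "Pólya-gamma random variate" more concrete, here is a minimal NumPy-only sketch that approximates PG(b, c) draws by truncating the infinite sum-of-gammas representation given in Polson, Scott, and Windle (2013). This is not the sampler this library uses; it is a slow, slightly biased illustration. The names `pg_draw_truncated`, `pg_mean`, and `n_terms` are hypothetical and are not part of pypolyagamma.

```python
import numpy as np


def pg_draw_truncated(b, c, size, n_terms=200, rng=None):
    """Approximate PG(b, c) draws with a truncated sum of gammas.

    Series: omega = 1/(2 pi^2) * sum_k g_k / ((k - 1/2)^2 + c^2 / (4 pi^2)),
    with g_k ~ Gamma(b, 1) independent. Cutting the sum off after n_terms
    terms drops positive terms, so the draws are biased slightly low.
    """
    rng = np.random.default_rng() if rng is None else rng
    k = np.arange(1, n_terms + 1)
    denom = (k - 0.5) ** 2 + c ** 2 / (4.0 * np.pi ** 2)  # shape (n_terms,)
    g = rng.gamma(shape=b, scale=1.0, size=(size, n_terms))
    return (g / denom).sum(axis=1) / (2.0 * np.pi ** 2)


def pg_mean(b, c):
    """Exact mean of PG(b, c): b / (2c) * tanh(c / 2), with limit b / 4 at c = 0."""
    if c == 0:
        return b / 4.0
    return b / (2.0 * c) * np.tanh(c / 2.0)


# Compare the Monte Carlo mean of the approximate draws with the exact mean.
rng = np.random.default_rng(0)
for b, c in [(1.0, 0.0), (10.0, 2.0)]:
    draws = pg_draw_truncated(b, c, size=5000, rng=rng)
    print(f"PG({b}, {c}): sample mean {draws.mean():.4f}, exact mean {pg_mean(b, c):.4f}")
```

The truncated series needs `n_terms` gamma draws per variate, which is why a dedicated sampler like the one this library wraps is preferable inside a Gibbs loop.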
# Demo

For convenience, we have created classes for simple count regression models, like the Bernoulli, binomial, negative binomial, and multinomial (with stick breaking) observation models. For example, you can fit a Bernoulli regression as follows:

```python
from pypolyagamma import BernoulliRegression

D_out = 1     # Output dimension
D_in = 2      # Input dimension
reg = BernoulliRegression(D_out, D_in)

# Given X, an NxD_in array of real-valued inputs,
# and Y, an NxD_out array of binary observations, fit
# the linear model y_n ~ Bern(sigma(A x_n + b)),
# where sigma is the logistic function. A is D_out x D_in,
# b is D_out x 1, and the entries in y_n are cond. indep.
samples = []
for _ in range(100):
    reg.resample((X, Y))
    samples.append((reg.A, reg.b))
```

Under the hood, this will instantiate Pólya-gamma auxiliary variables and perform conditionally-conjugate Gibbs sampling. We can visualize the inferred parameters in terms of the implied probability for each point in the input space.

![Bernoulli Regression](https://raw.githubusercontent.com/slinderman/pypolyagamma/v1.1/aux/bernoulli_regression.png)

Here's how you can manually perform inference in a simple binomial model with `N=10` counts and probability `p=logistic(x)`, with a standard normal prior on `x`. First, sample a count from the model:

```python
import numpy as np
import numpy.random as npr
from pypolyagamma import logistic, PyPolyaGamma

# Consider a simple binomial model with unknown probability
# Model the probability as the logistic of a scalar Gaussian.
N = 10
mu = 0.0
sigmasq = 1.0
x_true = npr.normal(mu, np.sqrt(sigmasq))
p_true = logistic(x_true)
y = npr.binomial(N, p_true)
```

Now we can run a Gibbs sampler to estimate the posterior distribution of `x` given `y`.

```python
# Gibbs sample the posterior distribution p(x | y)
# Introduce PG(N,0) auxiliary variables to render
# the model conjugate. First, initialize the PG
# sampler and the model parameters.
N_samples = 10000
pg = PyPolyaGamma(seed=0)
xs = np.zeros(N_samples)
omegas = np.ones(N_samples)

# Now run the Gibbs sampler
for i in range(1, N_samples):
    # Sample omega given x, y from its PG conditional
    omegas[i] = pg.pgdraw(N, xs[i-1])

    # Sample x given omega, y from its Gaussian conditional
    sigmasq_hat = 1. / (1. / sigmasq + omegas[i])
    mu_hat = sigmasq_hat * (mu / sigmasq + (y - N / 2.))
    xs[i] = npr.normal(mu_hat, np.sqrt(sigmasq_hat))
```

For this simple example, we can compute the true posterior and compare the samples to the target density.

![Binomial](https://raw.githubusercontent.com/slinderman/pypolyagamma/master/aux/binomial.png)

# Manual Installation

If you're a developer, you can also install from source:

    git clone git@github.com:slinderman/pypolyagamma.git
    cd pypolyagamma
    pip install -e .

To check if it worked, run:

    nosetests

If all the tests pass then you're good to go!

Under the hood, the installer will download [GSL](https://www.gnu.org/software/gsl/), untar it, and place it in `deps/gsl`. It will then configure GSL and compile the Pólya-gamma code along with the required GSL source files. This way, you don't need GSL to be installed and available on your library path.

## Parallel sampling with OpenMP

By default, the simple installation above will not support parallel sampling. If you are compiling with GNU `gcc` and `g++`, you can enable OpenMP support with the flag:

    USE_OPENMP=True pip install -e .

Mac users: you can install `gcc` and `g++` with Homebrew. Just make sure that they are your default compilers, e.g. by setting the environment variables `CC` and `CXX` to point to the GNU versions of `gcc` and `g++`, respectively. With Homebrew, these versions will be in `/usr/local/bin` by default.
To sample in parallel, call the `pgdrawvpar` method:

```python
import numpy as np
import pypolyagamma

n = 10               # Number of variates to sample
b = np.ones(n)       # Vector of shape parameters
c = np.zeros(n)      # Vector of tilting parameters
out = np.empty(n)    # Outputs

# Construct a set of PolyaGamma objects for sampling
nthreads = 8
seeds = np.random.randint(2**16, size=nthreads)
ppgs = [pypolyagamma.PyPolyaGamma(seed) for seed in seeds]

# Sample in parallel
pypolyagamma.pgdrawvpar(ppgs, b, c, out)
```

If you haven't installed with OpenMP, this function will revert to the serial sampler.

# References

- [Polson, Nicholas G., James G. Scott, and Jesse Windle. "Bayesian inference for logistic models using Pólya–Gamma latent variables." _Journal of the American Statistical Association_ 108.504 (2013): 1339-1349.](http://www.tandfonline.com/doi/pdf/10.1080/01621459.2013.829001)

- [Windle, Jesse, Nicholas G. Polson, and James G. Scott. "Sampling Polya-Gamma random variates: alternate and approximate techniques." _arXiv preprint arXiv:1405.0506_ (2014).](http://arxiv.org/pdf/1405.0506)

- [Linderman, Scott, Matthew Johnson, and Ryan P. Adams. "Dependent Multinomial Models Made Easy: Stick-Breaking with the Polya-gamma Augmentation." _Advances in Neural Information Processing Systems (NIPS)_. 2015.](http://papers.nips.cc/paper/5660-dependent-multinomial-models-made-easy-stick-breaking-with-the-polya-gamma-augmentation.pdf) Check out our github repo, [pgmult](https://github.com/HIPS/pgmult)

- [Linderman, Scott W., Ryan P. Adams, and Jonathan W. Pillow. "Bayesian latent structure discovery from multi-neuron recordings." _Advances in Neural Information Processing Systems (NIPS)_ 2016.](https://arxiv.org/pdf/1610.08465) Check out our github repo, [pyglm](https://github.com/slinderman/pyglm)

- [Linderman, Scott W., Matthew Johnson, Andrew C. Miller, Ryan P. Adams, David M. Blei, and Liam Paninski. "Bayesian learning and inference in recurrent switching linear dynamical systems." _Artificial Intelligence and Statistics (AISTATS)_ 2017.](https://arxiv.org/pdf/1610.08466.pdf)
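Returning to the binomial demo: the README says the true posterior can be computed for that model but does not show how. One possible way, sketched here with NumPy only, is to evaluate the unnormalized log posterior on a uniform grid and normalize it numerically. The function name `binomial_logistic_posterior`, the grid range, and the example value `y=7` are assumptions made for illustration and are not part of pypolyagamma.

```python
import numpy as np


def binomial_logistic_posterior(y, N, mu=0.0, sigmasq=1.0, grid=None):
    """Normalized posterior density p(x | y) evaluated on a uniform grid.

    Model: x ~ Normal(mu, sigmasq), y ~ Binomial(N, logistic(x)).
    """
    if grid is None:
        grid = np.linspace(-6.0, 6.0, 2001)
    # Log prior, up to an additive constant.
    log_prior = -0.5 * (grid - mu) ** 2 / sigmasq
    # log logistic(x) = -log(1 + exp(-x)); log(1 - logistic(x)) = -log(1 + exp(x)).
    log_lik = -y * np.logaddexp(0.0, -grid) - (N - y) * np.logaddexp(0.0, grid)
    log_post = log_prior + log_lik
    # Subtract the max before exponentiating for numerical stability.
    dens = np.exp(log_post - log_post.max())
    dx = grid[1] - grid[0]
    dens /= dens.sum() * dx  # Riemann-sum normalization on the uniform grid
    return grid, dens


grid, dens = binomial_logistic_posterior(y=7, N=10)
dx = grid[1] - grid[0]
post_mean = (grid * dens).sum() * dx
print(f"posterior mean of x given y=7, N=10: {post_mean:.3f}")
```

To compare against the Gibbs output, pass the observed `y` from the demo and overlay `dens` on a density-normalized histogram of `xs`, after discarding an initial burn-in portion of the chain.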
