# bigvis
[![Travis-CI Build Status](https://travis-ci.org/hadley/bigvis.svg?branch=master)](https://travis-ci.org/hadley/bigvis)
[![Coverage Status](https://img.shields.io/codecov/c/github/hadley/bigvis/master.svg)](https://codecov.io/github/hadley/bigvis?branch=master)
The bigvis package provides tools for exploratory data analysis of __large datasets__ (10-100 million obs). The aim is to have most operations take less than 5 seconds on commodity hardware, even for 100,000,000 data points.
Since bigvis is not currently available on CRAN, the easiest way to try it out is to:
```R
# install.packages("devtools")
devtools::install_github("hadley/bigvis")
```
## Workflow
The bigvis package is structured around the following workflow:
* `bin()` and `condense()` to get a compact summary of the data
* if the estimates are rough, you might want to `smooth()`. See `best_h()` and `rmse_cvs()` to figure out a good starting bandwidth
* if you're working with counts, you might want to `standardise()`
* visualise the results with `autoplot()` (you'll need to load `ggplot2` to use this)
## Weighted statistics
Bigvis also provides a number of standard statistics efficiently implemented on weighted/binned data: `weighted.median`, `weighted.IQR`, `weighted.var`, `weighted.sd`, `weighted.ecdf` and `weighted.quantile`.
## Acknowledgements
This package wouldn't be possible without:
* the fantastic [Rcpp](http://dirk.eddelbuettel.com/code/rcpp.html) package, which makes it amazingly easy to integrate R and C++
* JJ Allaire and Carlos Scheidegger who have indefatigably answered my many C++ questions
* the generous support of Revolution Analytics who supported the early development.
* Yue Hu, who implemented a proof of concepts that showed that it might be possible to work with this much data in R.
没有合适的资源?快使用搜索试试~ 我知道了~
资源推荐
资源详情
资源评论
收起资源包目录
Exploratory_data_analysis_for_large_datasets_(10-1_bigvis.zip (103个子文件)
RcppExports.cpp 12KB
smooth-1d.cpp 6KB
bin-structure.cpp 5KB
bin.cpp 5KB
condense.cpp 5KB
smooth-nd.cpp 4KB
stats.cpp 3KB
mean.cpp 3KB
median.cpp 2KB
count.cpp 2KB
group-tempvar.cpp 2KB
group.cpp 2KB
frange.cpp 2KB
BinnedVectors.cpp 1KB
kernel.cpp 1KB
BigVis.cpp 997B
summary.cpp 750B
double-diff-sum.cpp 699B
lowerBound.cpp 517B
BinnedVector.cpp 237B
Summary2d.cpp 63B
DESCRIPTION 892B
.gitignore 56B
.gitignore 8B
bigvis.h 4KB
group.h 3KB
summary.h 3KB
group-hex.h 2KB
Summary2d.h 1KB
stats.h 579B
Makevars 31B
README.md 2KB
notes.md 2KB
NAMESPACE 1KB
weighted-stats.r 4KB
autoplot.r 4KB
h.r 4KB
RcppExports.R 3KB
smooth.r 3KB
condensed.r 3KB
peel.r 3KB
ranged.r 2KB
rebin.r 2KB
test-condense.r 2KB
test-binned-vectors.r 2KB
challenge.r 2KB
rmse.r 2KB
condense.r 2KB
bin.r 2KB
id.r 1KB
mt.r 1KB
standardise.r 1KB
dgrid.r 1KB
test-smooth.r 1KB
movies.r 1KB
condense-gen.r 1KB
origin.r 1012B
width.r 942B
test-weighted-stats.r 940B
test-stat.r 923B
test-summary-moments.r 825B
breaks.r 809B
bigvis.r 667B
test-origin.r 592B
test-group-1d.r 546B
utils.r 486B
adjust.r 450B
test-breaks.r 448B
test-frange.r 374B
test-group-2d.r 292B
test-ranged.r 198B
.Rbuildignore 81B
best_h.Rd 2KB
smooth.Rd 2KB
rmse_cvs.Rd 1KB
peel.Rd 1KB
movies.Rd 1KB
standardise.Rd 1KB
condensed.Rd 1KB
weighted.var.Rd 1KB
ranged.Rd 1KB
transform.condensed.Rd 1KB
bin.Rd 1KB
condense.Rd 1KB
mt.Rd 1022B
autoplot.condensed.Rd 1017B
weighted.ecdf.Rd 1017B
frange.Rd 1009B
dchallenge.Rd 1007B
weighted.quantile.Rd 987B
h_grid.Rd 845B
find_origin.Rd 774B
weighted.IQR.Rd 745B
breaks.Rd 743B
weighted.median.Rd 705B
find_width.Rd 598B
round_any.condensed.Rd 575B
dgrid.Rd 509B
bigvis.Rd 465B
is.ranged.Rd 312B
共 103 条
- 1
- 2
资源评论
好家伙VCC
- 粉丝: 2107
- 资源: 9145
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
最新资源
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功