没有合适的资源?快使用搜索试试~ 我知道了~
资源推荐
资源详情
资源评论
Think Stats
Exploratory Data Analysis in Python
Version 2.0.27
Think Stats
Exploratory Data Analysis in Python
Version 2.0.27
Allen B. Downey
Green Tea Press
Needham, Massachusetts
Copyright © 2014 Allen B. Downey.
Green Tea Press
9 Washburn Ave
Needham MA 02492
Permission is granted to copy, distribute, and/or modify this document under the
terms of the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 Inter-
national License, which is available at
http://creativecommons.org/licenses/
by-nc-sa/4.0/
.
The original form of this book is L
A
T
E
X source code. Compiling this code has the
effect of generating a device-independent representation of a textbook, which can
be converted to other formats and printed.
The L
A
T
E
X source for this book is available from
http://thinkstats2.com
.
Preface
This book is an introduction to the practical tools of exploratory data anal-
ysis. The organization of the book follows the process I use when I start
working with a dataset:
• Importing and cleaning: Whatever format the data is in, it usually
takes some time and effort to read the data, clean and transform it, and
check that everything made it through the translation process intact.
• Single variable explorations: I usually start by examining one variable
at a time, finding out what the variables mean, looking at distributions
of the values, and choosing appropriate summary statistics.
• Pair-wise explorations: To identify possible relationships between
variables, I look at tables and scatter plots, and compute correlations
and linear fits.
• Multivariate analysis: If there are apparent relationships between
variables, I use multiple regression to add control variables and in-
vestigate more complex relationships.
• Estimation and hypothesis testing: When reporting statistical results,
it is important to answer three questions: How big is the effect? How
much variability should we expect if we run the same measurement
again? Is it possible that the apparent effect is due to chance?
• Visualization: During exploration, visualization is an important tool
for finding possible relationships and effects. Then if an apparent ef-
fect holds up to scrutiny, visualization is an effective way to commu-
nicate results.
This book takes a computational approach, which has several advantages
over mathematical approaches:
剩余241页未读,继续阅读
资源评论
syzhjjw
- 粉丝: 0
- 资源: 6
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功