Practical Data Analysis with Python
Practical Data Analysis with Python by Anita Raichand (data structure analysis)
Table of Contents
Introduction
Practical Data Analysis with Python - Data Munging
Background
Data Munging and Carpentry
Practical Data Analysis with Python - Grouping and Aggregating Data
Grouping and querying data
Grouping and aggregating
Practical Data Analysis with Python - Visualization
Data visualization
Practical Data Analysis with Python - Time Series
Introduction
The aim of this book is to show how to apply data analysis principles to a practical
use case scenario using Python as the data analysis language. We'll go on this journey by
looking at the data workflow from munging to grouping data to visualizing, and we'll also
include some time-series analysis. The format includes asking questions of the
data and showing the programming steps needed to answer each question. By the end of
this book, you will be able to apply these techniques to your own data.
This first edition of the book was published in January of 2015.
Practical Data Analysis with Python - Data Munging
Background
Bay Area Bike Share commenced its pilot phase of operation in the San Francisco Bay
Area in August 2013 with plans to expand. It is the first bike sharing scheme in California.
As it is meant for short trips, the bikes should be returned to a dock in thirty minutes or
less, or an additional fee would be incurred according to the website. There are two types
of memberships: customer and subscriber. A subscriber holds an annual membership, while a
customer is defined as someone using either the twenty-four hour or three day pass.
Currently (Sept 2014), it costs nine dollars for twenty-four hours, twenty-two dollars for
three days, and eighty-eight dollars for the year. Overtime fees are four dollars for an extra
thirty minutes and seven dollars for each thirty minutes after that. Data on the first six
months of operations were released as part of a data challenge. The data included three
files for trip history, weather information, and dock availability. The merged data was used
for the following analysis.
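The overtime pricing described above can be worked through as a small calculation. The sketch below uses only the rates quoted in the text; the billing rule (the first thirty minutes are free and any partial thirty-minute block is charged as a full block) is an assumption made for illustration, not the operator's published algorithm, and the function name `overtime_fee` is hypothetical.

```python
import math

# Rates as quoted in the text (Sept 2014).
FREE_MINUTES = 30          # trips of this length or shorter incur no fee
BLOCK_MINUTES = 30         # overtime is billed per thirty-minute block
FIRST_OVERTIME_FEE = 4     # dollars for the first extra thirty minutes
LATER_OVERTIME_FEE = 7     # dollars for each thirty minutes after that


def overtime_fee(trip_minutes):
    """Return the overtime fee in dollars for a single trip.

    Assumption: a partial overtime block is billed as a full block.
    """
    extra = trip_minutes - FREE_MINUTES
    if extra <= 0:
        return 0
    blocks = math.ceil(extra / BLOCK_MINUTES)
    return FIRST_OVERTIME_FEE + LATER_OVERTIME_FEE * (blocks - 1)


for minutes in (25, 45, 61, 95):
    print(minutes, "minutes ->", overtime_fee(minutes), "dollars")
```

Under these assumptions a 45-minute trip costs four dollars in overtime, while a 95-minute trip spans three overtime blocks and costs 4 + 7 + 7 = 18 dollars.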
Data Munging and Carpentry
First, we'll read in the data, inspect the columns and data types, and think about
what questions we want to ask of our data and what we are interested in learning from
it. Be curious and empathetic in thinking about what the various stakeholders,
including the City, the customers, and other interested people, would be interested in
gleaning, keeping fiscal, civic, and social goals in mind. In addition to that, there
will be quite a bit of cleaning and data carpentry needed to get the data into a format
useful for analysis.
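As a minimal sketch of the "read in and inspect" step, the snippet below loads a few rows with Pandas and prints the columns and data types. The rows are made up and the column names (`trip_id`, `duration`, `start_date`, `subscription_type`) are hypothetical stand-ins; the real merged file is not shown in this excerpt, so substitute your own file path and columns.

```python
import io

import pandas as pd

# Made-up rows standing in for the merged data set; the column names are
# hypothetical and only illustrate the inspection step.
sample_csv = io.StringIO(
    "trip_id,duration,start_date,subscription_type\n"
    "1,420,2013-08-29 14:13,Subscriber\n"
    "2,1980,2013-08-29 14:42,Customer\n"
    "3,600,2013-08-30 09:05,Subscriber\n"
)

# parse_dates asks read_csv to convert the column to datetimes on load.
df = pd.read_csv(sample_csv, parse_dates=["start_date"])

print(df.columns.tolist())   # which variables are available
print(df.dtypes)             # how each column was interpreted
print(df.head())             # a first look at the rows
```

Checking `dtypes` early is useful because dates read as plain strings will not support the time-series operations used later.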
The data set comes from three CSV files from the Bay Area Bike Share data challenge. We
merged the data in R, as we started a similar analysis there but really wanted to use
IPython and the superb time series functionality in Pandas. We'll leave it to the reader to
merge the CSV files; you'll then be able to reproduce the analysis.
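For readers who would rather do the merge in Pandas than in R, one possible approach is sketched below. It is not the authors' R code: the three tiny tables are made up, and the join keys (`date`, `start_station`) and other column names are assumptions that you would need to replace with the keys found in the actual challenge files.

```python
import io

import pandas as pd

# Made-up stand-ins for the three challenge files (trip history, weather,
# dock availability). All column names here are hypothetical.
trips = pd.read_csv(io.StringIO(
    "trip_id,date,start_station\n"
    "1,2013-08-29,A\n"
    "2,2013-08-29,B\n"
    "3,2013-08-30,A\n"
))
weather = pd.read_csv(io.StringIO(
    "date,mean_temp_f\n"
    "2013-08-29,68\n"
    "2013-08-30,70\n"
))
docks = pd.read_csv(io.StringIO(
    "start_station,dock_count\n"
    "A,15\n"
    "B,19\n"
))

# Left joins keep every trip; validate raises an error if a lookup table
# unexpectedly contains duplicate keys.
merged = (
    trips
    .merge(weather, on="date", how="left", validate="many_to_one")
    .merge(docks, on="start_station", how="left", validate="many_to_one")
)
print(merged)
```

A left join from the trip table keeps one row per trip, which matches an analysis centered on trips; a different join type may suit other questions.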
Optionally, one can cache or log the code in IPython with the following two commands.
%load_ext ipycache
%logstart
Activating auto-logging. Current session state plus future input saved.
Filename       : ipython_log.py
Mode           : rotate
Output logging : False
Raw input log  : False
Timestamping   : False
State          : active