Data Science Programming in Python
Copyright © 2016 by Anita Raichand
All rights reserved. No part of this publication may be reproduced, distributed, or
transmitted in any form or by any means, including photocopying, recording, or other
electronic or mechanical methods, without the prior written permission of the author,
except in the case of brief quotations embodied in critical reviews and certain other
noncommercial uses permitted by copyright law.
Table of Contents
Introduction
Data Science Programming in Python - Data Munging
Background
Data Munging and Carpentry
Data Science Programming in Python - Grouping and Aggregating Data
Grouping and querying data
Grouping and aggregating
Data Science Programming in Python - Visualization
Data visualization
Data Science Programming in Python - Time Series
Afterword
Introduction - Data Science Programming in Python
The aim of this book is to show how to apply data analysis principles to a practical use
case scenario using Python as the data analysis language. We'll go on this journey by
looking at the data workflow from munging to grouping data to visualizing, and we'll
include some time-series analysis as well. The format consists of asking questions of the
data and showing the programming steps needed to answer each question. By the end of
this book, you will be able to apply these techniques to your own data.
About the book
This book is written in a literate programming style where text, code, and output are
presented together. This will maximize your learning and understanding of code and the
data analysis workflow. The book teaches the type of interactive coding and iterative
analysis that is essential to be successful in data science programming.
Coding Tips
In the code snippets, a backslash character (\) means that the same line of code is wrapped
to the next line in the book. You do not need to type this character into an interpreter.
Use a REPL (en.wikipedia.org/wiki/Read–eval–print_loop) to get an interactive
environment where you can write code and see the resulting output.
Try the methods you learn in this book on your own data to reinforce learning. Use a
Python interpreter to code and your favorite editor to take notes.
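To illustrate the backslash convention described in the first tip, here is a minimal sketch. The variable names are made up for this example and do not come from the book. It shows that a statement wrapped with a trailing backslash is equivalent to the same statement typed on one line.

```python
# A statement wrapped onto a second line with a trailing backslash,
# the way long lines are printed in the book.
total = 1 + 2 + 3 + \
        4 + 5

# The same statement typed on a single line, as you would at an interpreter prompt.
total_one_line = 1 + 2 + 3 + 4 + 5

# Both forms evaluate to the same value.
print(total == total_one_line)
```

If you do keep the backslash, it must be the last character on the line; otherwise Python reports a syntax error.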
Data Science Programming in Python - Data Munging
Background
Bay Area Bike Share commenced its pilot phase of operation in the San Francisco Bay
Area in August 2013 with plans to expand. It is the first bike sharing scheme in California.
As it is meant for short trips, the bikes should be returned to a dock in thirty minutes or
less, or an additional fee is incurred, according to the website. There are two types
of memberships: customer and subscriber. A subscriber holds an annual membership, while a
customer is defined as someone using either the twenty-four-hour or the three-day pass.
Currently (Sept 2014), it costs nine dollars for twenty-four hours, twenty-two dollars for
three days, and eighty-eight dollars for the year. Overtime fees are four dollars for an extra
thirty minutes and seven dollars for each thirty minutes after that. Data on the first six
months of operations were released as part of a data challenge. The data included three
files for trip history, weather information, and dock availability. The merged data was used
for the following analysis.
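The overtime pricing above can be expressed as a small calculation. The following is a sketch only: the function name and constants are hypothetical, and it assumes that any partial thirty-minute block is billed as a full block, which the text does not state.

```python
import math

FREE_MINUTES = 30        # trips of thirty minutes or less incur no extra fee
BLOCK_MINUTES = 30       # overtime is charged per thirty-minute block
FIRST_BLOCK_FEE = 4      # dollars for the first extra thirty minutes
LATER_BLOCK_FEE = 7      # dollars for each thirty minutes after that


def overtime_fee(trip_minutes):
    """Return the overtime fee in dollars for a trip (hypothetical helper)."""
    extra_minutes = trip_minutes - FREE_MINUTES
    if extra_minutes <= 0:
        return 0
    # Assumption: a partial block is billed as a full block.
    blocks = math.ceil(extra_minutes / BLOCK_MINUTES)
    return FIRST_BLOCK_FEE + LATER_BLOCK_FEE * (blocks - 1)


for minutes in (25, 45, 61, 95):
    print(minutes, overtime_fee(minutes))
```

Under these assumptions, a 45-minute trip falls in the first overtime block and a 61-minute trip reaches the second block.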
Data Munging and Carpentry
First, we'll read in the data, inspect the columns and data types, and think about
what questions we want to ask of the data and what we are interested in learning from
it. Be curious and empathetic in thinking about what the various stakeholders,
including the City, the customers, and other interested people, would want to
glean, keeping fiscal, civic, and social goals in mind. In addition to that, there
will be quite a bit of cleaning and data carpentry needed to get the data into a format
useful for analysis.
The data set comes from three csv files from the Bay Area Bike Share data challenge. We
merged the data in R, as we started a similar analysis there, but really wanted to use
IPython and the superb time series functionality in Pandas.
Optionally, one can cache or log the code in IPython with the following two commands.
%load_ext ipycache
%logstart
Activating auto-logging. Current session state plus future input saved.
Filename       : ipython_log.py
Mode           : rotate
Output logging : False
Raw input log  : False
Timestamping   : False
State          : active
Import the libraries that are needed.
import numpy as np
import pandas as pd
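With the libraries imported, reading the data and inspecting its columns and data types might look like the sketch below. The merged file itself is not shown in this section, so the sample rows and the column names (trip_id, duration_sec, start_date, subscription_type) are hypothetical stand-ins, loaded from an in-memory string rather than from disk.

```python
import io

import pandas as pd

# Hypothetical sample standing in for the merged bike share data;
# the real file and its column names are not shown in this section.
sample_csv = io.StringIO(
    "trip_id,duration_sec,start_date,subscription_type\n"
    "1,600,2013-08-29 09:08:00,Subscriber\n"
    "2,2400,2013-08-29 12:30:00,Customer\n"
)

# Read the data, parsing the date column so time series methods can be used later.
trips = pd.read_csv(sample_csv, parse_dates=["start_date"])

# Inspect the column names, the data types, and the first rows.
print(trips.columns.tolist())
print(trips.dtypes)
print(trips.head())
```

For a file on disk, the same pd.read_csv call takes a file path in place of the in-memory buffer.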