介绍社交网络中数据挖掘的基本情况~ Preface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiii 1. Introduction: Hacking on Twitter Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 Installing Python Development Tools 1 Collecting and Manipulating Twitter Data 3 Tinkering with Twitter’s API 4 Frequency Analysis and Lexical Diversity 7 Visualizing Tweet Graphs 14 Synthesis: Visualizing Retweets with Protovis 15 Closing Remarks 17 2. Microformats: Semantic Markup and Common Sense Collide . . . . . . . . . . . . . . . . . . 19 XFN and Friends 19 Exploring Social Connections with XFN 22 A Breadth-First Crawl of XFN Data 23 Geocoordinates: A Common Thread for Just About Anything 30 Wikipedia Articles + Google Maps = Road Trip? 30 Slicing and Dicing Recipes (for the Health of It) 35 Collecting Restaurant Reviews 37 Summary 40 3. Mailboxes: Oldies but Goodies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41 mbox: The Quick and Dirty on Unix Mailboxes 42 mbox + CouchDB = Relaxed Email Analysis 48 Bulk Loading Documents into CouchDB 51 Sensible Sorting 52 Map/Reduce-Inspired Frequency Analysis 55 Sorting Documents by Value 61 couchdb-lucene: Full-Text Indexing and More 63 Threading Together Conversations 67 Look Who’s Talking 73 ix
- s0308012272012-12-08挖掘的知识值得一看,英文巴德还可以学习英文。
- Daniel_JKLu2013-06-02这本书对我帮助很大,谢谢楼主!
- 粉丝: 0
- 资源: 9
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助