Prepared exclusively for Joyce Liu - Liu Jiao
What readers are saying about Pragmatic Data Crunching
This book is full of useful practical tips; even more important, it distills a lifetime’s
worth of real-life experience about the kinds of everyday, down-to-earth
data manipulation issues that hardly anybody writes about. It’s the next-best
thing to having twenty years’ experience in programming and data handling.
Alex Martelli
Author, Python in a Nutshell
Greg has done a wonderful job of bringing together a much-needed essay of
techniques, tips, and tricks on data manipulation. His examples are current
and his humor enjoyable. I recommend you get a copy of this book if you are
embarking on a journey that involves working among differing data formats.
Brent Gorda
I wish I had read this book or its equivalent about fifteen years ago.
Michelle Wahl Craig
Faculty, University of Toronto
A very pleasant read. It covers the core concepts with just enough background
information to prevent it from sounding like a beginner’s handbook.
Jason Montojo
Blueprint
Overall, a nice short book, covering important topics in a reasonable level of
depth and breadth. Nice job. I’d certainly recommend it.
David Ascher
ActiveState
Data Crunching
Solve Everyday Problems Using Java, Python, and More
Greg Wilson
The Pragmatic Bookshelf
Raleigh, North Carolina Dallas, Texas
Many of the designations used by manufacturers and sellers to distinguish their products are claimed
as trademarks. Where those designations appear in this book, and The Pragmatic Programmers, LLC
was aware of a trademark claim, the designations have been printed in initial capital letters or in all
capitals. The Pragmatic Starter Kit, The Pragmatic Programmer, Pragmatic Programming, Pragmatic
Bookshelf and the linking g device are trademarks of The Pragmatic Programmers, LLC.
Every precaution was taken in the preparation of this book. However, the publisher assumes no
responsibility for errors or omissions, or for damages that may result from the use of information
(including program listings) contained herein.
Our Pragmatic courses, workshops, and other products can help you and your team create better
software and have more fun. For more information, as well as the latest Pragmatic titles, please visit
us at http://www.pragmaticprogrammer.com
Copyright © 2005 Greg Wilson. All rights reserved.
No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any
form, or by any means, electronic, mechanical, photocopying, recording, or otherwise, without the
prior consent of the publisher.
Printed in the United States of America.
ISBN 0-9745140-7-1
Printed on acid-free paper with 85% recycled, 30% post-consumer content.
First printing, April 2005
Version: 2005-6-8
Contents
1 Introduction
1.1 Name That Molecule
1.2 There’s One in Every Crowd...
1.3 And the Moral Is...
1.4 Questions About Data Crunching
1.5 Road Map
2 Text
2.1 Reversing a File
2.2 Reformatting Data
2.3 Handling Multiline Records
2.4 Checking for Collisions
2.5 Including One File in Another
2.6 The Unix Shell
2.7 Very Large Data Sets
2.8 Summary
3 Regular Expressions
3.1 The Shell
3.2 Basic Patterns
3.3 Extracting Matched Values
3.4 Practical Applications
3.5 Speaking in Tongues
3.6 Other Systems
3.7 Summary
4 XML
4.1 A Quick Introduction
4.2 SAX
4.3 DOM
4.4 XPath
4.5 XSLT
4.6 Summary