• Hadoop: The Definitive Guide (Chinese edition)

    This book is very comprehensive and is regarded as the bible of Hadoop textbooks, though it is a fairly tiring read.

    Synopsis: Discover how Apache Hadoop can unleash the power of your data. This comprehensive resource shows you how to build and maintain reliable, scalable, distributed systems with the Hadoop framework -- an open source implementation of MapReduce, the algorithm on which Google built its empire. Programmers will find details for analyzing datasets of any size, and administrators will learn how to set up and run Hadoop clusters. This revised edition covers recent changes to Hadoop, including new features such as Hive, Sqoop, and Avro. It also provides illuminating case studies that illustrate how Hadoop is used to solve specific problems. Looking to get the most out of your data? This is your book.

      • Use the Hadoop Distributed File System (HDFS) for storing large datasets, then run distributed computations over those datasets with MapReduce
      • Become familiar with Hadoop’s data and I/O building blocks for compression, data integrity, serialization, and persistence
      • Discover common pitfalls and advanced features for writing real-world MapReduce programs
      • Design, build, and administer a dedicated Hadoop cluster, or run Hadoop in the cloud
      • Use Pig, a high-level query language for large-scale data processing
      • Analyze datasets with Hive, Hadoop’s data warehousing system
      • Take advantage of HBase, Hadoop’s database for structured and semi-structured data
      • Learn ZooKeeper, a toolkit of coordination primitives for building distributed systems

    "Now you have the opportunity to learn about Hadoop from a master -- not only of the technology, but also of common sense and plain talk."

    5
    138
    40.3MB
    2012-02-11
    9
  • Hadoop in Action

    This book is recommended for beginners.

    Synopsis: "Hadoop in Action" teaches readers how to use Hadoop and write MapReduce programs. The intended readers are programmers, architects, and project managers who have to process large amounts of data offline. "Hadoop in Action" will lead the reader from obtaining a copy of Hadoop to setting it up in a cluster and writing data analytic programs. The book begins by making the basic idea of Hadoop and MapReduce easier to grasp by applying the default Hadoop installation to a few easy-to-follow tasks, such as analyzing changes in word frequency across a body of documents. The book continues through the basic concepts of MapReduce applications developed using Hadoop, including a close look at framework components, use of Hadoop for a variety of data analysis tasks, and numerous examples of Hadoop in action.

    "Hadoop in Action" will explain how to use Hadoop and present design patterns and practices of programming MapReduce. MapReduce is a complex idea both conceptually and in its implementation, and Hadoop users are challenged to learn all the knobs and levers for running Hadoop. This book takes you beyond the mechanics of running Hadoop, teaching you to write meaningful programs in a MapReduce framework. This book assumes the reader will have a basic familiarity with Java, as most code examples will be written in Java. Familiarity with basic statistical concepts (e.g. histogram, correlation) will help the reader appreciate the more advanced data processing examples.

    0
    68
    5.09MB
    2012-02-11
    10
  • Linux kernel 0.11

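    Linux 0.11 was released on December 8, 1991. The entire kernel source code of version 0.11 is only about 325 KB, yet it contains essentially everything that is core to Linux. The structure of the 0.11 boot and startup code is also basically the same as in current kernels.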

    0
    48
    78KB
    2011-06-11
    3
  • Detailed annotations of the Linux kernel

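    Detailed annotations of the early Linux kernel, version 0.99. The current 2.6 kernel is rather too large and complex for students who are new to the Linux kernel. This book explains the core of the Linux kernel well.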

    0
    61
    3.91MB
    2011-06-11
    0
  • C++ Primer, 4th Edition: answers

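    Answers to the end-of-chapter exercises in C++ Primer, 4th Edition. The coverage is very comprehensive. Image clarity is average. PDF format.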

    0
    144
    6.59MB
    2009-10-18
    2
  • Source code for Core Java 2, Volume 1

    Source code for Core Java 2, Volume 1.

    0
    95
    458KB
    2009-10-16
    0