HbaseReferenceBook: HBase Reference Guide (English Edition)

2018-08-27 13:33:53 · 13.83 MB · PDF

Apache HBase™ Reference Guide, covering HBase configuration, shell usage, design, security, and more.
35. On the number of column families
36. Rowkey Design
37. Number of Versions
38. Supported Datatypes
39. Joins
40. Time To Live (TTL)
41. Keeping Deleted Cells
42. Secondary Indexes and Alternate Query Paths
43. Constraints
44. Schema Design Case Studies
45. Operational and Performance Configuration Options

HBase and MapReduce
46. HBase, MapReduce, and the CLASSPATH
47. MapReduce Scan Caching
48. Bundled HBase MapReduce Jobs
49. HBase as a MapReduce Job Data Source and Data Sink
50. Writing HFiles Directly During Bulk Import
51. RowCounter Example
52. Map-Task Splitting
53. HBase MapReduce Examples
54. Accessing Other HBase Tables in a MapReduce Job
55. Speculative Execution
56. Cascading

Securing Apache HBase
57. Using Secure HTTP (HTTPS) for the Web UI
58. Secure Client Access to Apache HBase
59. Simple User Access to Apache HBase
60. Securing Access to HDFS and ZooKeeper
61. Securing Access To Your Data
62. Security Configuration Example

Architecture
63. Overview
64. Catalog Tables
65. Client
66. Client Request Filters
67. Master
68. RegionServer
69. Regions
70. Bulk Loading
71. HDFS
72. Timeline-consistent High Available Reads
73. Storing Medium-sized Objects (MOB)

Apache HBase APIs
74. Examples

Apache HBase External APIs
75. REST
76. Thrift
77. C/C++ Apache HBase Client
78. Using Java Data Objects (JDO) with HBase
79. Scala
80. Jython

Thrift API and Filter Language
81. Filter Language

HBase and Spark
82. Basic Spark
83. Spark Streaming
84. Bulk Load
85. SparkSQL/DataFrames

Apache HBase Coprocessors
86. Coprocessor Overview
87. Types of Coprocessors
88. Loading Coprocessors
89. Examples
90. Guidelines For Deploying A Coprocessor
91. Monitor Time Spent in Coprocessors

Apache HBase Performance Tuning
92. Operating System
93. Network
94. Java
95. HBase Configurations
96. ZooKeeper
97. Schema Design
98. HBase General Patterns
99. Writing to HBase
100. Reading from HBase
101. Deleting from HBase
102. HDFS
103. Amazon EC2
104. Collocating HBase and MapReduce
105. Case Studies

Troubleshooting and Debugging Apache HBase
106. General Guidelines
107. Logs
108. Resources
109. Tools
110. Client
111. MapReduce
112. NameNode
113. Network
114. RegionServer
115. Master
116. ZooKeeper
117. Amazon EC2
118. HBase and Hadoop version issues
119. IPC Configuration Conflicts with Hadoop
120. HBase and HDFS
121. Running unit or integration tests
122. Case Studies
123. Cryptographic Features
124. Operating System Specific Issues
125. JDK Issues

Apache HBase Case Studies
126. Overview
127. Schema Design
128. Performance/Troubleshooting

Apache HBase Operational Management
129. HBase Tools and Utilities
130. Region Management
131. Node Management
132. HBase Metrics
133. HBase Monitoring
134. Cluster Replication
135. Running Multiple Workloads On a Single Cluster
136. HBase Backup
137. HBase Snapshots
138. Storing Snapshots in Microsoft Azure Blob Storage
139. Capacity Planning and Region Sizing
140. Table Rename

Building and Developing Apache HBase
141. Getting Involved
142. Apache HBase Repositories
143. IDEs
144. Building Apache HBase
145. Releasing Apache HBase
146. Voting on Release Candidates
147. Generating the HBase Reference Guide
148. Updating hbase.apache.org
149. Tests
150. Developer Guidelines

Unit Testing HBase Applications
151. JUnit
152. Mockito
153. MRUnit
154. Integration Testing with an HBase Mini-Cluster

ZooKeeper
155. Using existing ZooKeeper ensemble
156. SASL Authentication with ZooKeeper

Community
157. Decisions
158. Community Roles
159. Commit Message format

Appendix
Appendix A: Contributing to Documentation
Appendix B: FAQ
Appendix C: hbck In Depth
Appendix D: Access Control Matrix
Appendix E: Compression and Data Block Encoding In HBase
160. Enable Data Block Encoding
Appendix F: SQL over HBase
Appendix G: YCSB
Appendix H: HFile format
Appendix I: Other Information About HBase
Appendix J: HBase History
Appendix K: HBase and the Apache Software Foundation
Appendix L: Apache HBase Orca
Appendix M: Enabling Dapper-like Tracing in HBase
161. Client Modifications
162. Tracing from HBase Shell
Appendix N: 0.95 RPC Specification

Preface

This is the official reference guide for the HBase version it ships with. Herein you will find either the definitive documentation on an HBase topic as of its standing when the referenced HBase version shipped, or it will point to the location in Javadoc or JIRA where the pertinent information can be found.

This reference guide is a work in progress. The source for this guide can be found in the src/main/asciidoc directory of the HBase source. This reference guide is marked up using AsciiDoc, from which the finished guide is generated as part of the 'site' build target. Run mvn site to generate this documentation. Amendments and improvements to the documentation are welcomed. To report a problem, file a new documentation bug against Apache HBase. For an overview of AsciiDoc and suggestions to get started contributing to the documentation, see the relevant section later in this documentation.

If this is your first foray into the wonderful world of Distributed Computing, then you are in for some interesting times.
First off, distributed systems are hard; making a distributed system hum requires a disparate skillset that spans systems (hardware and software) and networking.

Your cluster's operation can hiccup for any of a myriad of reasons: bugs in HBase itself, misconfigurations (of HBase, but also of the operating system), and hardware problems, whether a bug in your network card drivers or an underprovisioned RAM bus (to mention two recent examples of hardware issues that manifested as "HBase is slow"). You will also need to recalibrate if, up to this point, your computing has been bound to a single box. Here is one good starting point: Fallacies of Distributed Computing.

That said, you are welcome. It's a fun place to be.

Yours, the HBase Community.

Please use JIRA to report non-security-related bugs.

To protect existing HBase installations from new vulnerabilities, please do not use JIRA to report security-related bugs. Instead, send your report to the mailing list private@apache.org, which allows anyone to send messages but restricts who can read them. Someone on that list will contact you to follow up on your report.

The phrases /supported/, /not supported/, /tested/, and /not tested/ occur in several places throughout this guide. In the interest of clarity, here is a brief explanation of what is generally meant by these phrases in the context of HBase.

Commercial technical support for Apache HBase is provided by many Hadoop vendors. Note that this is not the sense in which the term /support/ is used in the context of the Apache HBase project.
The Apache HBase team assumes no responsibility for your HBase clusters, your configuration, or your data.

Supported
    In the context of Apache HBase, /supported/ means that HBase is designed to work in the way described, and deviation from the defined behavior or functionality should be reported as a bug.

Not Supported
    In the context of Apache HBase, /not supported/ means that a use case or use pattern is not expected to work and should be considered an antipattern. If you think this designation should be reconsidered for a given feature or use pattern, file a JIRA or start a discussion on one of the mailing lists.

Tested
    In the context of Apache HBase, /tested/ means that a feature is covered by unit or integration tests, and has been proven to work as expected.

Not Tested
    In the context of Apache HBase, /not tested/ means that a feature or use pattern may or may not work in a given way, and may or may not corrupt your data or cause operational issues. It is an unknown, and there are no guarantees. If you can provide proof that a feature designated as /not tested/ does work in a given way, please submit the tests and/or the metrics so that other users can gain certainty about such features or use patterns.

Getting Started

Chapter 1. Introduction

Quickstart will get you up and running on a single-node, standalone instance of HBase, followed by a pseudo-distributed single-machine instance, and finally a fully-distributed cluster.

Chapter 2. Quick Start - Standalone HBase

This guide describes the setup of a standalone HBase instance running against the local filesystem. This is not an appropriate configuration for a production instance of HBase, but it will allow you to experiment with HBase. This section shows you how to create a table in HBase using the hbase shell CLI, insert rows into the table, perform put and scan operations against the table, enable or disable the table, and start and stop HBase.
Apart from downloading HBase, this procedure should take less than 10 minutes.

Warning: Local Filesystem and Durability

The following is fixed in HBase 0.98.3 and beyond. See HBASE-11272 and HBASE-11218.

Using HBase with a local filesystem does not guarantee durability. The HDFS local filesystem implementation will lose edits if files are not properly closed. This is very likely to happen when you are experimenting with new software, starting and stopping the daemons often and not always cleanly. You need to run HBase on HDFS to ensure all writes are preserved. Running against the local filesystem is intended as a shortcut to get you familiar with how the general system works, as the very first phase of evaluation. See HBASE-3696 and its associated issues for more details about the issues of running on the local filesystem.

Note: Loopback IP - HBase 0.94.x and earlier

The advice below is for HBase 0.94.x and older versions only. This is fixed in HBase 0.96.0 and beyond.

Prior to HBase 0.94.x, HBase expected the loopback IP address to be 127.0.0.1. Ubuntu and some other distributions default to 127.0.1.1, and this will cause problems for you. See "Why does HBase care about /etc/hosts?" for details.

Example 1. Example /etc/hosts File for Ubuntu

The following /etc/hosts file works correctly for HBase 0.94.x and earlier, on Ubuntu. Use this as a template if you run into trouble.

    127.0.0.1 localhost
    127.0.0.1 ubuntu.ubuntu-domain ubuntu

2.1. JDK Version Requirements

HBase requires that a JDK be installed. See Java for information about supported JDK versions.
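
For the loopback advice that applies to HBase 0.94.x and earlier, a quick check of a hosts file can help spot the 127.0.1.1 mapping before it causes trouble. The following is a minimal sketch, not part of HBase: the function name `find_loopback_problems` and the sample file contents are hypothetical, and the script only inspects text passed to it (on a real machine you would read /etc/hosts yourself).

```python
import ipaddress


def find_loopback_problems(hosts_text, hostname):
    """Return IPv4 loopback addresses other than 127.0.0.1 mapped to `hostname`.

    Hypothetical helper for illustration; it only parses hosts-file text.
    """
    problems = []
    for raw_line in hosts_text.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = raw_line.split("#", 1)[0].strip()
        if not line:
            continue
        fields = line.split()
        address, names = fields[0], fields[1:]
        if hostname not in names:
            continue
        try:
            ip = ipaddress.ip_address(address)
        except ValueError:
            continue  # Not a plain IP address; ignore the line.
        # Anything in 127.0.0.0/8 is loopback, but only 127.0.0.1 is expected.
        if ip.version == 4 and ip.is_loopback and address != "127.0.0.1":
            problems.append(address)
    return problems


# Assumed sample contents: an Ubuntu-style default and the corrected template.
UBUNTU_DEFAULT = """\
127.0.0.1 localhost
127.0.1.1 ubuntu.ubuntu-domain ubuntu
"""

FIXED = """\
127.0.0.1 localhost
127.0.0.1 ubuntu.ubuntu-domain ubuntu
"""

if __name__ == "__main__":
    print(find_loopback_problems(UBUNTU_DEFAULT, "ubuntu"))  # ['127.0.1.1']
    print(find_loopback_problems(FIXED, "ubuntu"))           # []
```

An empty result means the hostname is not mapped to a non-standard loopback address in the text that was checked; it says nothing about DNS or other name-resolution sources.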
