Apache Hive (TM) @VERSION@
======================
The Apache Hive (TM) data warehouse software facilitates querying and
managing large datasets residing in distributed storage. Built on top
of Apache Hadoop (TM), it provides:
* Tools to enable easy data extract/transform/load (ETL)
* A mechanism to impose structure on a variety of data formats
* Access to files stored either directly in Apache HDFS (TM) or in other
data storage systems such as Apache HBase (TM)
* Query execution using Apache Hadoop MapReduce or Apache Tez
frameworks.
Hive implements a dialect of SQL (Hive QL) that focuses on analytics
and presents a rich set of SQL semantics including OLAP functions,
subqueries, common table expressions and more. Hive allows SQL
developers or users with SQL tools to easily query, analyze and
process data stored in Hadoop.
Hive also allows programmers familiar with the MapReduce framework
to plug in their custom mappers and reducers to perform more
sophisticated analysis that may not be supported by the built-in
capabilities of the language. QL can also be extended with custom
scalar functions (UDF's), aggregations (UDAF's), and table
functions (UDTF's).
Hive users have a choice of 2 runtimes when executing SQL queries.
Users can choose to use the Apache Hadoop MapReduce framework,
which is mature and proven at large scales. MapReduce is a purely
batch framework, and queries run using the MapReduce framework
may experience higher latencies (tens of seconds), even
over small datasets. Alternatively, users can choose to use the
newer Apache Tez framework to process SQL queries. Tez is
designed for interactive query and has substantially reduced
overheads versus MapReduce. Users are free to switch back and
forth between these frameworks at any time. In either case,
Hive is best suited for use cases where the amount of data
processed is large enough to require a distributed system.
Hive is not designed for online transaction processing and does
not support row level insert/updates. It is best used for batch
jobs over large sets of immutable data (like web logs). What
Hive values most are scalability (scale out with more machines
added dynamically to the Hadoop cluster), extensibility (with
MapReduce framework and UDF/UDAF/UDTF), fault-tolerance, and
loose-coupling with its input formats.
General Info
============
For the latest information about Hive, please visit out website at:
http://hive.apache.org/
Getting Started
===============
- Installation Instructions and a quick tutorial:
https://cwiki.apache.org/confluence/display/Hive/GettingStarted
- A longer tutorial that covers more features of HiveQL:
https://cwiki.apache.org/confluence/display/Hive/Tutorial
- The HiveQL Language Manual:
https://cwiki.apache.org/confluence/display/Hive/LanguageManual
Requirements
============
- Java 1.6, 1.7
- Hadoop 1.x, 2.x
Upgrading from older versions of Hive
=====================================
- Hive @VERSION@ includes changes to the MetaStore schema. If
you are upgrading from an earlier version of Hive it is imperative
that you upgrade the MetaStore schema by running the appropriate
schema upgrade scripts located in the scripts/metastore/upgrade
directory.
- We have provided upgrade scripts for MySQL, PostgreSQL, Oracle,
Microsoft SQL Server, and Derby databases. If you are using a
different database for your MetaStore you will need to provide
your own upgrade script.
Useful mailing lists
====================
1. user@hive.apache.org - To discuss and ask usage questions. Send an
empty email to user-subscribe@hive.apache.org in order to subscribe
to this mailing list.
2. dev@hive.apache.org - For discussions about code, design and features.
Send an empty email to dev-subscribe@hive.apache.org in order to
subscribe to this mailing list.
3. commits@hive.apache.org - In order to monitor commits to the source
repository. Send an empty email to commits-subscribe@hive.apache.org
in order to subscribe to this mailing list.
没有合适的资源?快使用搜索试试~ 我知道了~
apache-hive-1.1.0-cdh5.7.1-bin.tar.gz
需积分: 16 15 下载量 106 浏览量
2018-05-22
20:55:05
上传
评论
收藏 100.85MB GZ 举报
温馨提示
共748个文件
sql:182个
txt:171个
jar:153个
部署安装mysql5.6, hadoop-2.6.0-cdh5.7.1 伪分布式已启动,即在hadoop上部署hive
资源推荐
资源详情
资源评论
收起资源包目录
apache-hive-1.1.0-cdh5.7.1-bin.tar.gz (748个子文件)
flights_tiny.txt.1 5KB
futurama_episodes.avro 3KB
episodes.avro 597B
doctors.avro 521B
dec.avro 343B
map_null_val.avro 341B
dec_old.avro 331B
map_null_schema.avro 187B
type_evolution.avro 167B
grad.avsc 304B
beeline 881B
fastbinary.c 26KB
debug.cmd 3KB
templeton.cmd 3KB
hiveserver2.cmd 3KB
hwi.cmd 2KB
metastore.cmd 2KB
hiveserver.cmd 2KB
help.cmd 1KB
jar.cmd 1KB
orcfiledump.cmd 1KB
rcfilecat.cmd 1KB
schemaTool.cmd 1KB
lineage.cmd 1KB
cli.cmd 1KB
execHiveCmd.cmd 1KB
php_thrift_protocol.cpp 29KB
php_thrift_protocol.cpp 10KB
2000_cols_data.csv 40KB
UserVisits.dat 7KB
employee.dat 105B
employee2.dat 64B
in_file.dat 24B
test2.dat 23B
test.dat 11B
lt100.txt.deflate 267B
upgrade.order.derby 160B
FacebookService-remote 4KB
php_thrift_protocol.h 964B
php_thrift_protocol.h 930B
hcat 5KB
hive 8KB
hiveserver2 885B
hive-exec-1.1.0-cdh5.7.1.jar 18.38MB
hive-jdbc-1.1.0-cdh5.7.1-standalone.jar 12.03MB
hadoop-hdfs-2.6.0-cdh5.7.1.jar 9.66MB
groovy-all-2.4.4.jar 6.67MB
hive-metastore-1.1.0-cdh5.7.1.jar 5.2MB
hbase-protocol-1.2.0-cdh5.7.1.jar 4.18MB
accumulo-core-1.6.0.jar 4.17MB
calcite-core-1.0.0-incubating.jar 3.14MB
derby-10.11.1.1.jar 2.96MB
parquet-hadoop-bundle-1.5.0-cdh5.7.1.jar 2.59MB
guava-14.0.1.jar 2.09MB
hive-service-1.1.0-cdh5.7.1.jar 1.94MB
ant-1.9.1.jar 1.9MB
datanucleus-core-3.2.10.jar 1.8MB
datanucleus-rdbms-3.2.9.jar 1.73MB
netty-all-4.0.23.Final.jar 1.7MB
jetty-all-server-7.6.0.v20120127.jar 1.61MB
jetty-all-7.6.0.v20120127.jar 1.6MB
htrace-core-3.2.0-incubating.jar 1.42MB
zookeeper-3.4.5-cdh5.7.1.jar 1.29MB
xercesImpl-2.9.1.jar 1.17MB
leveldbjni-all-1.8.jar 1021KB
snappy-java-1.0.4.1.jar 973KB
jaxb-impl-2.2.3-1.jar 869KB
jackson-databind-2.2.2.jar 846KB
commons-math-2.1.jar 813KB
hive-serde-1.1.0-cdh5.7.1.jar 807KB
apacheds-kerberos-codec-2.0.0-M15.jar 675KB
janino-2.7.6.jar 598KB
jersey-server-1.14.jar 586KB
commons-collections-3.2.2.jar 575KB
hbase-common-1.2.0-cdh5.7.1.jar 563KB
joda-time-1.6.jar 522KB
log4j-1.2.16.jar 470KB
avro-1.7.6-cdh5.7.1.jar 466KB
jersey-core-1.14.jar 456KB
apache-log4j-extras-1.2.17.jar 438KB
mail-1.4.1.jar 437KB
antlr-2.7.7.jar 435KB
httpclient-4.2.5.jar 423KB
calcite-linq4j-1.0.0-incubating.jar 422KB
commons-vfs2-2.0.jar 406KB
jasper-compiler-5.5.23.jar 399KB
velocity-1.5.jar 383KB
datanucleus-api-jdo-3.2.6.jar 332KB
libfb303-0.9.2.jar 306KB
hive-common-1.1.0-cdh5.7.1.jar 294KB
commons-configuration-1.6.jar 292KB
commons-lang-2.6.jar 278KB
commons-httpclient-3.0.1.jar 273KB
hive-hcatalog-core-1.1.0-cdh5.7.1.jar 251KB
plexus-utils-1.5.6.jar 245KB
curator-recipes-2.6.0.jar 242KB
junit-4.11.jar 239KB
commons-compress-1.4.1.jar 236KB
ST4-4.0.4.jar 231KB
jackson-core-asl-1.9.2.jar 223KB
共 748 条
- 1
- 2
- 3
- 4
- 5
- 6
- 8
资源评论
leofionn
- 粉丝: 313
- 资源: 5
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功