Apache Hive TM

The Apache Hive ™ data warehouse software facilitates querying and managing large datasets residing in distributed storage. Hive provides a mechanism to project structure onto this data and query the data using a SQL-like language called HiveQL. At the same time this language also allows traditional map/reduce programmers to plug in their custom mappers and reducers when it is inconvenient or inefficient to express this logic in HiveQL.

Getting Started

Check out the Getting Started Guide on the Hive wiki.

Getting Involved

Hive is an open source volunteer project under the Apache Software Foundation. Previously it was a subproject of Apache Hadoop, but has now graduated to become a top-level project of its own. We encourage you to learn about the project and contribute your expertise. Here are some starter links:

Give us feedback: What can we do better?
Join the mailing list: Meet the community.
Become a Hive Fan on Facebook.

Apache Hive, Hive, Apache, the Apache feather logo, and the Apache Hive project logo are trademarks of The Apache Software Foundation. Other names appearing on the site may be trademarks of their respective owners.

General

Documentation

Community

Development

PMC

ASF

Apache Hive TM

Getting Started

Getting Involved