Showing posts with label HIVE. Show all posts
Showing posts with label HIVE. Show all posts
Metastore is the internal database for Hive, it is the central repository of Hive Metadata. In this Metastore all the tables information, tables schema, column information, partitioning information will get stored. By default Hive will come up with Derby database as its Metastore. As the default Metastore is Derby, we need not to configure the same, if we want to change the database then we must configure explicitly. 

If we want to configure the Hive Metastore explicitly(other than Derby), we can do so in "hive-site.xml". As part of Metastore configuration there are two parts: 
  • Setting of Connection Url
  • Setting of Driver name for the same 
The following figure shows you the configuration details. 
 
Hive Metastore
Configuration of HIVE Metastore

           

Hive Metastore

Posted at  01:43  |  in  HIVE  |  Read More»

Metastore is the internal database for Hive, it is the central repository of Hive Metadata. In this Metastore all the tables information, tables schema, column information, partitioning information will get stored. By default Hive will come up with Derby database as its Metastore. As the default Metastore is Derby, we need not to configure the same, if we want to change the database then we must configure explicitly. 

If we want to configure the Hive Metastore explicitly(other than Derby), we can do so in "hive-site.xml". As part of Metastore configuration there are two parts: 
  • Setting of Connection Url
  • Setting of Driver name for the same 
The following figure shows you the configuration details. 
 
Hive Metastore
Configuration of HIVE Metastore

           

As of now we have seen Hive Introduction. We know that HIVE is used to pose queries on the processed data. Generally in RDBMS we pose queries on Tables. Hive also stores the data in the form of Tables, these tables are stored on HDFS. The following are the two types of tables in HIVE.
  • Managed Tables
  • External Tables
Hive also stores the metadata of these tables in Hive Metastore. In this post let us see in detail about how to create a table, and how to import data into them.

Managed Tables: 

As the name implies, these tables are managed by Hive Warehouse system. All the Managed tables will be stored in the sub-directories of Hive Warehouse directory. The following is the command tothe Managed Tables.
Managed Table Example
Managed Table Creation

The following is the example of Managed tables:
Managed Table example
Managed table example

External Tables:

As the name implies these are not managed by Warehouse. All the external tabular data is stored in the location specified in the at the time of creation. The following is the syntax to create the External tables. 
External Table Syntax
External Table Syntax
If we don't specify the HDFS location or external keyword in the command, it will create a Managed table instead of External table. The following is the example of external table. 
External Table example
External Table Example
  
       


 

Hive Tables

Posted at  23:27  |  in  HIVE  |  Read More»

As of now we have seen Hive Introduction. We know that HIVE is used to pose queries on the processed data. Generally in RDBMS we pose queries on Tables. Hive also stores the data in the form of Tables, these tables are stored on HDFS. The following are the two types of tables in HIVE.
  • Managed Tables
  • External Tables
Hive also stores the metadata of these tables in Hive Metastore. In this post let us see in detail about how to create a table, and how to import data into them.

Managed Tables: 

As the name implies, these tables are managed by Hive Warehouse system. All the Managed tables will be stored in the sub-directories of Hive Warehouse directory. The following is the command tothe Managed Tables.
Managed Table Example
Managed Table Creation

The following is the example of Managed tables:
Managed Table example
Managed table example

External Tables:

As the name implies these are not managed by Warehouse. All the external tabular data is stored in the location specified in the at the time of creation. The following is the syntax to create the External tables. 
External Table Syntax
External Table Syntax
If we don't specify the HDFS location or external keyword in the command, it will create a Managed table instead of External table. The following is the example of external table. 
External Table example
External Table Example
  
       


 

HIVE is one of the component of Hadoop. It was built by Jeff Hammerbacher and his team at facebook. Hive is a framework for data warehousing on top of HADOOP. This was initially built to run queries on the huge data of facebook. The main use of Hive framework is , using this we can place the processed data into tables, and then we can pose queries on those tables. 

Hive Introduction - Hadooptutor
Hive Logo
To work with Hive framework one should have a good idea on SQL, and basic java programming skills. We have HiveQL(Hive Query Language), a dialect of SQL to work with Hive. Hive stores the data in Tabular format.

Hive Introduction

Posted at  22:48  |  in  HIVE  |  Read More»

HIVE is one of the component of Hadoop. It was built by Jeff Hammerbacher and his team at facebook. Hive is a framework for data warehousing on top of HADOOP. This was initially built to run queries on the huge data of facebook. The main use of Hive framework is , using this we can place the processed data into tables, and then we can pose queries on those tables. 

Hive Introduction - Hadooptutor
Hive Logo
To work with Hive framework one should have a good idea on SQL, and basic java programming skills. We have HiveQL(Hive Query Language), a dialect of SQL to work with Hive. Hive stores the data in Tabular format.

About-Privacy Policy-Contact us
Copyright © 2013 Hadoop Tutor. Blogger Template by Bloggertheme9
Proudly Powered by Blogger.
back to top