For enabling kerberos for zookeeper, we need to follow the below steps: 1) Setup external zookeeper. 2) create file conf/jaas.conf which will contain server keytab and principal. Server {   com.sun.security.auth.module.Krb5LoginModule required   useKeyTab=true   keyTab="/zookeeper/conf/zkpr.keytab"   storeKey=true   useTicketCache=false   principal="zookeeper/localhost@EXAMPLE.COM"; }; 3) create file conf/java.env export JVMFLAGS="-Djava.security.auth.login.config=/zookeeper/conf/jaas.conf" export...
Read more

Here are some of the errors you face while setting up kerberos enabled hadoop 1) Be sure to validate your ticket and keytab file. Ticket Validation: klist Output: Ticket cache: FILE:/tmp/krb5cc_1001 Default principal: zookeeper/localhost@EXAMPLE.COM Valid starting       Expires              Service principal 2017-05-22T18:40:52  2017-05-23T04:40:52  krbtgt/EXAMPLE.COM@EXAMPLE.COM renew until 2017-05-29T18:40:52 Keytab validation: kinit...
Read more

For YARN,add following properties in yarn-site.xml: <!-- resource manager secure configuration info --> <property>   <name>yarn.resourcemanager.principal</name>   <value><PRINCIPAL></value> </property> <property>   <name>yarn.resourcemanager.keytab</name>   <value><KEYTAB_PATH></value> </property> <!-- remember the principal for the node manager is the principal for the host this...
Read more

1) Add following properties in hdfs-site.xml <!-- NameNode security config --> <property>   <name>dfs.namenode.keytab.file</name>   <value><KEYTAB_PATH></value> <!-- path to the HDFS keytab --> </property> <property>   <name>dfs.namenode.kerberos.principal</name>   <value><PRINCIPAL></value> </property> <property>   <name>dfs.datanode.keytab.file</name>  ...
Read more

HDFS typically stands for Hadoop distributed file system.As compared to traditional RDBMS, HDFS follows the distributed approach and has the following advantages: 1)  Data doesn't need to be centralized. However,it is distributed across network.As per traditional RDBMS, data needs to be first accumulated at one place with the help of relational tables.In HDFS, there is no need to accumulate data at one place.Instead,job is split and executed parallely at different nodes where data is stored. 2)...
Read more

Big Data as the name implies is a technology which deals with huge amount of data.While learning about big data,the following question arises: 1)What is Big data? 2)What are the sources of such huge amount of data? 3)Why the need arises now for big data technologies? 4)What are the different kinds of Big Data Technologies? Lets find the answers for these questions: 1) Big Data is a accepted appellation acclimated to call the exponential advance and availability of data, both structured and...
Read more

TCL also known as Transaction Control Language.It is used to manage different transactions occurring within a database.Various statements included under this category are: 1) COMMIT 2) ROLLBACK COMMIT: This is used to commit or saving the changes permanently you have made.Syntax: COMMIT; ROLLBACK: It is used to discard the changes you have made.Syntax: ROLLBACK; IMPORTANT POINTS: 1) In case of Mysql,always use START TRANSACTION at the beginning. 2) When you use commit/rollback.It...
Read more

Powered by Blogger.