{"id":5593,"date":"2016-02-04T10:09:20","date_gmt":"2016-02-04T02:09:20","guid":{"rendered":"http:\/\/rmohan.com\/?p=5593"},"modified":"2016-02-04T12:43:58","modified_gmt":"2016-02-04T04:43:58","slug":"hdfs-command-syntax","status":"publish","type":"post","link":"https:\/\/mohan.sg\/?p=5593","title":{"rendered":"HDFS Command Syntax"},"content":{"rendered":"<p><code><b>HDFS Command Syntax Overview: <\/b><br \/>\n<b>hadoop fs <\/b><\/code><code><code>: <\/code>Ex.: hadoop fs -ls<\/code><code><code>\u00a0<\/code><\/code><br \/>\n<code><code><b>hadoop version<\/b> : check hadoop installed properly<\/code><\/code><br \/>\n<b><\/b><\/p>\n<p><code><b>HELP:<\/b><br \/>\n<b>help [cmd]<\/b>: hopefully this is self-describing<b>\u00a0<\/b><\/code><\/p>\n<p><code><b>Inspect files:<\/b><br \/>\nls\/lsr : list all files in<br \/>\ncat : print on stdout<br \/>\ntail [-f] : output the last part of the<\/code><br \/>\n<code>test : return attributes of file and directory<\/code><br \/>\n<code>touchz : create new emty file size 0<br \/>\ndu\/dus : show space utilization<\/code><br \/>\n<code>count : no. of directories, files, and bytes<\/code><br \/>\n<code>setrep : (-r) change the replication factor of file\/directory<\/code><br \/>\n<code>stat : info about the specified path<\/code><br \/>\n<code><b>Create\/remove files:<\/b><br \/>\nmkdir : create a directory<br \/>\nmv : move (rename) files<br \/>\ncp : copy files<br \/>\nrm\/rmr : remove files<br \/>\n<b>Copy\/Put files from remote m\/c into the HADOOP cluster:<\/b><br \/>\ncopyFromLocal : copy a local file to the HDFS<br \/>\ncopyToLocal : copy a file on the HDFS to the local disk<\/code><br \/>\n<code>cp : copies one or more files<\/code><br \/>\n<code>get : copies files to the local file system<\/code><br \/>\n<code>put : copies files from the local file system<\/code><br \/>\n<code>mv : moves one or more files<\/code><\/p>\n<p><code><b>Hadoop Namenode Commands:<\/b><br \/>\nhadoop namenode -format: Format HDFS filesystem from Namenode<br \/>\nhadoop namenode -upgrade: Upgrade the NameNode<br \/>\nstart-dfs.sh Start: HDFS Daemons<br \/>\nstop-dfs.sh Stop: HDFS Daemons<br \/>\nstart-mapred.sh: Start: MapReduce Daemons<br \/>\nstop-mapred.sh Stop: MapReduce Daemons<br \/>\nhadoop namenode -recover -force: Recover namenode metadata after a cluster failure (may lose data)\u00a0<\/code><\/p>\n<p><code><b>Hadoop Configuration Files:<\/b><br \/>\ncore-site.xml : Parameters for entire Hadoop cluster<br \/>\nhdfs-site.xml : Parameters for HDFS and its clients<br \/>\nmapred-site.xml : Parameters for MapReduce and its clients<\/code><br \/>\n<code>yarn-site.xml : <\/code><code><code>Parameters for nodemanager and resource manager<\/code> <\/code><br \/>\n<code>masters : Host machines for secondary Namenode<br \/>\nslaves : List of slave hosts<\/code><br \/>\n<code><code>hadoop-env.sh : Sets ENV variables for Hadoop\u00a0<\/code><\/code><br \/>\n<b><code>set JAVA_HOME=%JAVA_HOME%<br \/>\nset HADOOP_PREFIX=D:\\Hadoop<\/code><\/b><br \/>\n<code><b>Hadoop Job Commands<\/b><br \/>\nhadoop job -submit : Submit the job<br \/>\nhadoop job -status : Print job status completion percentage<br \/>\nhadoop job -list all : List all jobs<br \/>\nhadoop job -list-active-trackers : List all available TaskTrackers<br \/>\nhadoop job -set-priority : Set priority for a job. Valid priorities : VERY_HIGH, HIGH, NORMAL, LOW, VERY_LOW<br \/>\nhadoop job -kill-task : Kill a task<br \/>\nhadoop job -history : Display job history including job details, failed and killed jobs<br \/>\n<b>Hadoop mradmin Commands<\/b><br \/>\nhadoop mradmin -safemode get : Check Job tracker status<br \/>\nhadoop mradmin -refreshQueues : Reload mapreduce configuration<br \/>\nhadoop mradmin -refreshNodes : Reload active TaskTrackers<br \/>\nhadoop mradmin -refreshServiceAcl : Force Jobtracker to reload service ACL<br \/>\nhadoop mradmin -refreshUserToGroupsMappings : Force jobtracker to reload user group mappings<br \/>\n<b>Hadoop fsck Commands<\/b><br \/>\nhadoop fsck \/ : Filesystem check on HDFS<br \/>\nhadoop fsck \/ -files : Display files during check<br \/>\nhadoop fsck \/ -files -blocks : Display files and blocks during check<br \/>\nhadoop fsck \/ -files -blocks -locations : Display files, blocks and its locationhadoop fsck \/ -files -blocks -locations -racks : Display network topology for data-node locations<br \/>\nhadoop fsck -delete : Delete corrupted files<br \/>\nhadoop fsck -move : Move corrupted files to \/lost+found directory<\/code><br \/>\n<code><b>Hadoop Balancer Commands<\/b><br \/>\nstart-balancer.sh : Balance the cluster<br \/>\nhadoop dfsadmin -setBalancerBandwidth : Adjust bandwidth used by the balancer<br \/>\nhadoop balancer -threshold 20 : Limit balancing to only 20% resources in the cluster<\/code><br \/>\n<code><b>Hadoop Safe Mode (Maintenance Mode) Commands<\/b><br \/>\nThe following dfsadmin commands helps the cluster to enter or leave safe mode, which is also called as maintenance mode.<br \/>\nIn this mode, Namenode does not accept any changes to the name space, it does not replicate or delete blocks.<br \/>\nhadoop dfsadmin -safemode enter : Enter safe mode<br \/>\nhadoop dfsadmin -safemode leave : Leave safe mode<br \/>\nhadoop dfsadmin -safemode get : Get the status of mode<br \/>\nhadoop dfsadmin -safemode wait : Wait until HDFS finishes data block replication<br \/>\nhadoop dfsadmin -report : total usage on the cluster<\/code><\/p>\n<p><code><b>Launching Hadoop Jobs:<\/b><br \/>\nhadoop jar [mainClass] args... : <\/code><code><code>Launch job via jar file<\/code><\/code><br \/>\n<code>hadoop jar com.twitter.scalding.Tool [mainClass] args <\/code><code><code>: A Scalding job is launched using<\/code><\/code><code><code>\u00a0<\/code><\/code><br \/>\n<code><code>mapred job -kill <\/code>: If you need to kill a map-reduce job\u00a0<\/code><code>\u00a0<\/code><br \/>\n<code><\/code><\/p>\n<p><code><b>Commonly Used Administration Commands:<\/b><br \/>\nFormat the namenode: hadoop namenode -format<br \/>\nStarting Secondary namenode: hadoop secondrynamenode<br \/>\nRun namenode : hadoop namenode<br \/>\nRun data node: hadoop datanode<br \/>\nCluster Balancing: hadoop balancer<br \/>\nRun MapReduce job tracker node: hadoop jobtracker<br \/>\nRun MapReduce task tracker node: hadoop tasktracker<\/code><\/p>\n<p><code><b>Start\/Stop Yarn (starts resourcemanager and nodemanager)and DFS (Starts namenode and data node) from sbin directory:<\/b><br \/>\n<b>start-yarn, stop-yarn<br \/>\nstart-dfs, stop-dfs<\/b><\/code><br \/>\n<code><br \/>\n<b>Start and Stop ALL daemon from sbin directory:<\/b><br \/>\n<b>start-all, stop-all<\/b> <\/code><\/p>\n<p><code><br \/>\n<b>Check All 5 daemons (Namenode,Secoundary Node,Job Tracker, DataNode, Task Tracker ) are up:<\/b><\/code><br \/>\n<code><b>jps <\/b><\/code><\/p>\n","protected":false},"excerpt":{"rendered":"<p>HDFS Command Syntax Overview: hadoop fs : Ex.: hadoop fs -ls hadoop version : check hadoop installed properly <\/p>\n<p>HELP: help [cmd]: hopefully this is self-describing <\/p>\n<p>Inspect files: ls\/lsr : list all files in cat : print on stdout tail [-f] : output the last part of the test : return attributes of file [&#8230;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[60],"tags":[],"_links":{"self":[{"href":"https:\/\/mohan.sg\/index.php?rest_route=\/wp\/v2\/posts\/5593"}],"collection":[{"href":"https:\/\/mohan.sg\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mohan.sg\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mohan.sg\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/mohan.sg\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=5593"}],"version-history":[{"count":1,"href":"https:\/\/mohan.sg\/index.php?rest_route=\/wp\/v2\/posts\/5593\/revisions"}],"predecessor-version":[{"id":5594,"href":"https:\/\/mohan.sg\/index.php?rest_route=\/wp\/v2\/posts\/5593\/revisions\/5594"}],"wp:attachment":[{"href":"https:\/\/mohan.sg\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=5593"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mohan.sg\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=5593"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mohan.sg\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=5593"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}