Pig Grunt Shell¶
Shell Commands¶
sh Command¶
grunt> sh shell command parameters
grunt> sh ls
fs Command¶
grunt> sh File System command parameters
grunt> fs -ls /
Utility Commands¶
clear Command¶
grunt> clear
help Command¶
grunt> help
history Command¶
grunt> history
set Command¶
Key | Description and values |
---|---|
default_parallel | You can set the number of reducers for a map job by passing any whole number as a value to this key. |
debug | You can turn off or turn on the debugging freature in Pig by passing on/off to this key. |
job.name | You can set the Job name to the required job by passing a string value to this key. |
job.priority | You can set the job priority to a job by passing one of the following values to this key − very_low, low, normal, high, very_high |
stream.skippath | For streaming, you can set the path from where the data is not to be transferred, by passing the desired path in the form of a string to this key. |
quit Command¶
grunt> quit
exec Command¶
grunt> exec [–param param_name = param_value] [–param_file file_name] [script]
vi student.txt
001,Rajiv,Hyderabad
002,siddarth,Kolkata
003,Rajesh,Delhi
hdfs dfs -mkdir /pig_data
hdfs dfs -put ~/pig/student.txt /pig_data
vi sample_script.pig
student = LOAD 'hdfs://localhost:9000/pig_data/student.txt' USING PigStorage(',')
as (id:int,name:chararray,city:chararray);
Dump student;
grunt> exec sample_script.pig
kill Command¶
grunt> kill JobId
grunt> kill Id_0055
run Command¶
grunt> run [–param param_name = param_value] [–param_file file_name] script
grunt> run sample_script.pig
grunt> Dump;
Note − The difference between exec and the run command is that if we use run, the statements from the script are available in the command history.