Parallel BASH is a modified version of BASH intended for text processing on computer clusters. It enables the use of common UNIX text-processing tools (e.g., awk, perl, grep) across multicore or distributed systems, and is particularly suited to scalable processing of large files (multiple gigabytes or more).
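To make the idea concrete, here is a minimal sketch of the split, process, and combine pattern such a tool automates. It is written in ordinary Bash and is not Parallel BASH syntax, which this description does not show. The function name `parallel_count` is hypothetical, and the sketch assumes GNU coreutils (`split -n l/N`) and an `xargs` that supports `-0` and `-P`.

```shell
#!/usr/bin/env bash
# Sketch only: plain Bash with GNU split and xargs, NOT Parallel BASH syntax.
# parallel_count is a hypothetical helper name used for illustration.
# It counts lines matching a pattern by splitting the input into chunks,
# running grep on each chunk in parallel (map), and summing the counts (reduce).

parallel_count() {
    local pattern=$1 input=$2 jobs=${3:-4}
    local tmpdir
    tmpdir=$(mktemp -d) || return 1

    # Split into $jobs chunks without breaking lines (GNU split: -n l/N).
    split -n "l/$jobs" "$input" "$tmpdir/chunk."

    # Map: one grep -c per chunk, at most $jobs running at once.
    # Reduce: awk sums the per-chunk counts into a single total.
    printf '%s\0' "$tmpdir"/chunk.* |
        xargs -0 -n 1 -P "$jobs" grep -c -- "$pattern" |
        awk '{ total += $1 } END { print total + 0 }'

    rm -rf "$tmpdir"
}
```

A call such as `parallel_count 'ERROR' access.log 8` (the file name is illustrative) would count matching lines using up to eight concurrent grep processes on one machine. Distributing chunks across cluster nodes is the part a tool like Parallel BASH would have to add on top of this pattern.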
by otis, 2009-07-21 16:50
bash · shell · script · MapReduce · hdfs · parallel computing · distributed computing