That's a good question. I just picked up the interest two days ago although I've been reading about some of the worlds fastest computers and how they are using Linux clusters because they are so inexpensive to build. Then just did a couple of google searches on "Linux Cluster HOWTO" and noticed two technologies that popped out, being "PVM" and "MPI" both of which are freely available.
Then I noticed that the "pvm" and "xpvm" RPMs are on the RedHat 7.2 CD (I think they've been included for several releases). Installed the RPMs two nights ago but didn't have much luck getting them running as the documents I did find on google seemed to lack a basic understanding of how the master spawned and communicated with PVM on the nodes. Last night I figured that really easy part out. If you start PVM on any one of your machines it will automatically start up "pvmd" on the local machine and then try and start "pvmd" on all the other nodes in your pvm hosts file using "rsh" (I set an environment variable to make it use ssh instead). First set up ssh to be able to log into all of the other nodes without requiring a password.
Another key thing is to have all of your nodes host names you include in your PVM hosts file in your local /etc/hosts file (appears having their names in DNS is not enough). After getting this far it should spawn "pvmd" on all nodes automatically if they are not already running.
Of course you have to have the PVM RPM installed on all nodes, you need your compiler installed etc. Then there are some example programs in /usr/share/pvm3/examples. Read the "Readme" file in that directory on how to compile them. The binaries will go to "/usr/share/pvm3/bin/LINUX" and they have to be in that directory on every node.
I didn't set it up this way but I think most large scale Linux clusters would have the "code" directors NFS mounted so you wouldn't have to distribute everything. I just did a little shell script with a "for machine in mach1 mach2 mach3 mach4" loop to compile the programs on every machine using "ssh" and takes a parameter of the program to be compiled. Again, I'm sure this is not the optimal method.
Apparently O'Reilly has a couple of good books on clustering that I should probably get.
Here is another piece of software I would like to try that I saw some Universities using and I think you can download and use for no cost:
http://www.openpbs.organd the commercial version:
http://www.pbspro.comAlso, the "xpvm" will graphically show your nodes and how they are communicating with each other and graph the jobs/load over time, etc...
Let me know what you come up with...
[ January 31, 2002: Message edited by: VoidMain ]