To manage and administer an Accelerator host cluster, three tools are available to perform operations across all hosts simultaneously. The tools, clusterexec
, clusterupload
, and clusterdownload
are platform-independent tools and part of the eRunner package. By default, the eRunner daemon (UNIX) and eRunner service (Windows) are installed on agent machines and eMake machines. The tools are installed on Cluster Manager machines and eMake machines. Only the server side (erunnerd) is installed on agents to allow them to serve requests from tools. If you decided not to install eRunner during Cluster Manager installation, you do not have access to these tools. These Cluster Manager administration tools allow you to:
-
Start and stop agents
-
Reboot hosts
-
Run commands on hosts
-
Upload files
-
Download files
These tools are particularly useful for automating the cluster upgrade process to update the build environment (for example, build tools, compiler, header files, system libraries, and so on) and for collecting debug information. Other uses include restarting agents, restarting a subset of agents, or obtaining host status information.
Specific command-line tool functions are:
-
clusterexec
—Lets you run arbitrary commands on a host machine or all hosts in a cluster -
clusterupload
—Lets you upload executables and files (for example, compiler, libraries, and so on) to all hosts in a cluster -
clusterdownload
—Lets you download files from host machines to a central location
clusterupload
, clusterexec
, and clusterdownload
communicate with the eRunner service on Windows hosts or the eRunner daemon on UNIX machines. The eRunner service/daemon listens for connections on port 2411 by default.
Because these tools are platform-neutral, clusterexec
running on Windows can communicate with eRunner on Linux hosts so those hosts can execute the commands. Conversely, use clusterupload
on Linux to upload files to a Windows host. However, some rules should be followed when using these tools cross-platform from UNIX to Windows. For more detailed information, see Using clusterexec.
clusterupload and clusterdownload do not support ACLs. During upload/download, they are ignored. An effect of ignoring ACLs is that Cygwin symlinks are not handled properly.
|
Using clusterexec
This command executes shell commands given by cmd1
, cmd2
, and so on, on one or more hosts—typically, all hosts in a cluster. The syntax for running clusterexec
is:
% clusterexec [options] "cmd1 arg2; cmd2; ..."
For example, if you are running Linux and want to find out how long the operating system on each host in the cluster has been running since the last reboot, you could run the uptime
commands on each host in the cluster, lin-cluster
. Using clusterexec
, the syntax would be:
% clusterexec --cm=lin-cluster uptime
Each command may be the following type:
-
A fully-qualified path to an executable on the hosts.
-
The executable name on the hosts. The eRunner service that processes the
clusterexec
command does aPATH
search based on the following: -
For Linux, the
PATH
thaterunner
searches is defined in/etc/init.d/erunner
and includes the well-known binary and system binary installations on the host. -
For Windows, the
path
thaterunner
searches is thePATH
system environment variable on the host. -
A built-in command defined in the eRunner service. See the “eRunner Built-in Commands” table below for a description of these commands.
Some platform-specific rules to observe:
-
clusterexec
commands run as theroot
user on Linux, or as theLocalSystem
user on Windows -
When using
clusterexec
on Linux to run a command on a Windows host, the shell may alter the command, for example, the UNIX shell treats backslash (\) characters as escapes. This issue may apply also if the user is running in a UNIX-like shell on Windows (for example,cygwin bash
). To correct this situation, use one of the following methods: -
Use single quotes instead of double quotes, for example:
clusterexec --hosts=win1 'c:\\winnt\\system32\\xcopy c:\\a.txt
-
Use double-escape backslashes, for example:
clusterexec --hosts=win1 "c:\\\\winnt\\\\system32\\\\xcopy
-
Use forward slashes (/) instead of backslashes where possible, for example:
clusterexec --hosts=win1 "c:/winnt/system32/xcopy c:/a.txt
The
--hosts
argument can take patterns in the form[X-Y]
to indicate a range of host Name/IP. This argument can also take patterns in the form[X,Y,Z]
to indicate a list of hosts with X,Y, and Z substituted at the pattern location to operate onhost1
,host2
,host3
,host5
,host7
,host-a
,host-b
, andhost-c
, you can use a--hosts
argument such ashost[1-3] host[5,7] host-[a-c]
. Patterns must be specified in[ ]
or{ }
brackets.If a command fails,
clusterexec
does not run subsequent commands by default. Use the-k
option to override this behavior and keep going after failure.
clusterexec
command-line options are described in the following table:
Switch Options | Description | ||
---|---|---|---|
|
Prints a usage message summarizing information in this table |
||
|
Set of hosts on which to run commands. value should be in the form |
||
|
Cluster Manager that is contacted to get the relevant hosts. This is used only when |
||
|
Platform of desired hosts. This is used only when |
||
|
Only use hosts with at least one pingable enabled agent in Cluster Manager. This option applies only when |
||
|
Name of a file containing commands to run. If this option is specified, an inline script (as shown above), must not be specified |
||
|
Continue running commands after failure. If more than one command is specified, continue to run subsequent commands, even if an earlier command fails |
||
|
Default = true (merge
|
||
|
Displays |
||
|
Send the entire command string to the shell ( |
||
|
Abort execution after value seconds |
Commands may refer to executables on hosts or commands built into the eRunner service/daemon on the hosts. Valid built-in commands are listed in the following table:
Commands | Description |
---|---|
|
Stops the agent service on the hosts |
|
Starts the agent service on the hosts |
|
Restarts the agent service on the hosts |
|
Reboot the hosts. If delay is specified, reboot each host delay ms (milliseconds) apart |
|
Shuts down eRunner service on the hosts |
|
Restarts eRunner service on the hosts |
|
Prints the eRunner service version on the hosts |
|
Query or set the eRunner service log level on the hosts. Valid values for level are |
Using clusterupload
The clusterupload
command is a convenient tool for transferring a file or files to all hosts in a cluster. This command can upload one or more files to one or more hosts in the cluster. The syntax for running clusterupload
is:
clusterupload [options] <source> <target>
where target could be a file or a directory. The interpretation depends on whether or not target already exists on the host (as a file or as a directory). To interpret the logic, source is the relative or absolute path of the file/directory on the local machine and target is the absolute path to the file/directory on the hosts. Then,
-
if source is a file and target is a directory, source is copied into target .
-
if source is a file and target is a file, source is copied over target .
-
if source is a directory, and target is a file, an error is reported.
-
if source is a directory ending in / or \ , the source contents are copied into the target if target is a directory. If target is not a directory, an error is reported.
-
if source is a directory not ending in / or \, the last component of the source path becomes a subdirectory under target , and the source contents are copied into that subdirectory.
Target paths need not exist in advance—relevant directories are created as needed. If you specify a relative path for target , clusterupload
gives you an error—relative paths are not allowed.
For taking multiple sources , the syntax is:
clusterupload [options] <source1> <source2> <target>
where target is assumed to be a directory. (The target is always assumed to be a directory when multiple sources are specified.)
Switch option values are described in the following table:
Switch Options | Description |
---|---|
|
Prints a usage message summarizing information in this table |
|
Verbose mode |
|
Set of hosts within the cluster to update. value should be in the form |
|
The Cluster Manager contacted to get relevant hosts. This is used only when |
|
Platform of desired hosts. This is used only when |
|
Only upload to hosts pingable and enabled in Cluster Manager. This option applies only when |
|
Filename containing a list of source and target files to upload. If this option is specified, source and target must not be specified. Each line in the file must be in the form: sourcePath ⇒ targetPath . If |
|
Stops agents when committing uploaded data to its final location |
|
Displays |
|
Aborts execution after value seconds |
Using clusterdownload
The clusterdownload
command is a convenient tool for transferring a file or files from hosts in a cluster to a central location. This command downloads one or more files to one or more hosts in the cluster. The syntax for running clusterdownload
is:
clusterdownload [options] <sourcedir> <targetdir>
The target directory does not need to exist in advance; it is created as needed. Relative paths are supported for targetDir but not for sourceDir .
For downloading multiple sources, you can use a pattern in sourceDir or use the --include
option:
clusterdownload [options] "/opt/ecloud/i686_Linux/logs/*.log" <target>
If wildcard characters are used, quotes are required so the command is not expanded first by the shell. — or use —
|
The second example above copies all .log
files from the specified directory.
Switch option values are described in the following table:
Switch Options | Description |
---|---|
|
Prints a usage message summarizing information in this table |
|
Print version information |
|
Verbose mode: shows progress information as well as connection requests |
|
Set of hosts within the cluster from which files need to come. value should be in the form: |
|
Cluster Manager that is contacted to get the relevant hosts. This is used only when |
|
Platform of desired hosts. This is used only when |
|
Abort the connection after value seconds. Any partially downloaded files are discarded |
|
Recurse into subdirectories |
|
Update only. Do not overwrite newer files. File modification times are used |
|
Preserve file timestamps |
|
Show what would have been transferred |
|
Only update existing files |
|
Show progress during the file transfer. When this option is used, |
|
Exclude files matching pattern . Patterns can include the usual ? and * meta-characters. Patterns apply to filenames, not to paths. When a file matches both the include and exclude patterns, it is excluded |
|
Include files matching pattern . Patterns can include the usual ? and * meta-characters. Patterns apply to filenames, not to paths. When a file matches both the include and exclude patterns, it is excluded |
|
Set the logfile name. This can include an absolute or relative path. If no path is specified, the default location on Linux is |
|
There is no default log level. If you specify |
Sample Uses of Cluster Tools
In the following example, clusterupload
is used to upload a new version of bash , and clusterexec
is used to run a command on Accelerator cluster hosts.
-
Upload
bash
to/usr/bin
on a Linux cluster:% clusterupload --cm=dilbert-cm ~/download/bash /usr/bin/bash
-
Find out how long hosts have been up on a Linux cluster:
% clusterexec --cm=dilbert-cm uptime
The result would be similar to:dilbert1.electric-cloud.com output:16:02:46 up 5:47, 0 users, load average: 0.00, 0.04, 0.07dilbert2.electric-cloud.com output:16:02:45 up 5:47, 0 users, load average: 0.08, 0.04, 0.06dilbert3.electric-cloud.com output:16:02:46 up 6:16, 0 users, load average: 0.00, 0.02, 0.05dilbert4.electric-cloud.com output:16:02:45 up 6:16, 0 users, load average: 0.00, 0.01, 0.02dilbert6.electric-cloud.com output:16:02:45 up 6:17, 0 users, load average: 0.00, 0.04, 0.05dilbert5.electric-cloud.com output:16:02:45 up 6:17, 0 users, load average: 0.00, 0.00, 0.01
In the next examples, clusterexec
is used to gather various statistics on an Accelerator host cluster.
-
Find out how long hosts have been up on a Windows cluster:
C:\> clusterexec --cm=win-cm "net statistics workstation"
The output would be similar to: WIN2 output:Workstation Statistics for \\Statistics since 10/25/2005 3:54 PM. . .The command completed successfully.` WIN1 output: `Workstation Statistics for \\Statistics since 10/25/2005 4:02 PM. . .The command completed successfully.
-
Get the last 5 lines of agent log files on host1:
% clusterexec --hosts=host1 -s "tail -5 /var/log/ecagent*.log"` The result would be: host1 output: `==> /var/log/ecagent1.log <==*** -numagents 2*** -version*** -webport 8001***-------------------------------------------------------==> /var/log/ecagent2.log <==*** -numagents 2*** -version*** -webport 2421***-------------------------------------------------------
In the example above: To obtain the last 5 lines of log files for all agents, an
sh
invocation is necessary. Use the*
wildcard to achieve this result. To use the wildcard, a shell must invoke the expansion;clusterexec
does not automatically invoke a shell in which to run commands.clusterexec
does not handle pipelines. For pipelines, the-s
option should be used, for example:clusterexec --hosts=host1 -s "ps -ef | grep agent"
By default, clusterexec
stops running commands after the first failure, for example:
clusterexec --hosts="lin1" "badCmd.sh; /bin/echo hi"
will not run the echo
command on lin1
if badCmd.sh
returns a non-zero exit code.
A simple example for a Windows cluster download:
clusterdownload --cm=mycm c:/ECloud/ecagent*log c:/tmp
This example downloads all agent log files in c:\ECloud
from all agent hosts on the cluster to directory /tmp
. Suppose your cluster has host1
and host2
and each host has 2 agents:
On the host where you ran clusterdownload
, the result is:
/tmp/host1/ecagent1.log/tmp/host1/ecagent2.log/tmp/host2/ecagent1.log/tmp/host2/ecagent2.log
To get agent logs from Linux hosts, use:
clusterdownload --cm=mycm "/var/log/ecagent*.log" /tmp
If you want to get all the log files under /opt/ecloud,
you could use:
clusterdownload --cm=mycm -r "/opt/ecloud/*.log" /tmp` -- or -- `clusterdownload --cm=mycm --include=*.log -r /opt/ecloud /tmp
If wildcard characters are used, quotes are required so the command is not expanded first by the shell. |
Stopping, Starting, or Restarting the Daemon
On Linux platforms, you can stop, start, or restart the eRunner daemon by using the following commands:
/etc/init.d/erunner stop
/etc/init.d/erunner start
/etc/init.d/erunner restart
Checking Whether the Daemon is Running
On Linux platforms, you can check whether the eRunner daemon is running by using the following command:
/etc/init.d/erunner status
If the daemon is running, the following message appears:
erunner running with PID <PROCESS_ID>
If the daemon is not running, the following message appears:
erunner is not running