|actionTitle||Request Software Install|
Software packages are available as modules and are accessible from the compute nodes of any of our three clusters. They are not available on the login nodes so you will need to be in an interactive session to access them. A list of installed software can be seen by clicking the link below, but may not be as current as using the
module avail command (more information under Module Commands).
|title||Click here to view a list of all software modules|
If multiple versions of software are available on the system, the newest is made the default. This means loading a module without specifying the version will select the most recent. We strongly recommend including version information in your module statements. This ensures that you maintain a consistent environment for your analyses in the event of a software upgrade.
To see, access, and get information on individual software packages available on the system, use the module commands detailed below. If you have trouble accessing the module command, see our FAQ section.
$ module avail
| Display all the software and versions installed on the system| module load modulename
$ module avail module_name
|Display all installed versions of the software "module_name" |
|Display the software you have loaded in your environment|
$ module what-is module_name
|Displays some descriptive information about a specific module|
$ module load module_name
|Load a software module in your environment|| module purge Unload all the software modules Unload a specific software package |
$ module unload module_name
|Unload a specific software package from your environment|| module unload modulename| module
| Unload all the software modules from your environment|
| Display a help menu for the module command|
Puma, Ocelote, and El Gato all run CentOS7 and have the following compilers available:
$ module load intel/2020.1
$ module load intel/2020.4
$ module load gnu/5.4.0
$ module load gnu7/7.3.0
$ module load gnu8/8.3.0
This one is loaded by default
Common Software Packages
Installing additional software
To submit a request to have software installed on the
the : http://uits.arizona.edu/forms/hpc-software-install-request
UArizona HPC systems, use
time frame for how long it takes to install software
with email@example.com can
, there are many variables that determine this. If you haven't heard back in a week, it is reasonable for you to follow up
your home directories with
may also install software packages into
the space that is allocated to you with your HPC account. However, you cannot install software that requires root permission
Follow this link for detailed how to install your own software
or use a method like "yum install" that accesses system paths.
Using and Installing Perl
Follow this link for more information on using Perl.
Using and Installing Python
Follow this link for more information on using Python.
Using R Packages
You can install your own R packages which is similar to using virtualenv with Python
- Make directory to store packages
$ mkdir -p ~/R/library
- Tell R where the directory is by creating an environment file:
$ echo 'R_LIBS=~/R/library/' >> ~/.Renviron
- For example to install and load the package "ggplot2":
$ module load R
- After this you'll only need the library command to load your custom package
- For more information:
There are three ways to run Matlab
- The command line version using modules. This is the most common as you will typically submit a job using PBS.
- Graphical mode using the Ocelote Desktop of OnDemand
- Using the Matlab integration with Python in a Jupyter notebook.
Here are the details for method 1.
MATLAB performs its own hardware discovery and it might try to access all the cores and the memory of the node even if the full node wasn't allocated. That will result in scheduler killing the job. To prevent that the full Ocelote node of 28 cores and 168GB of memory should be allocated to run a MATLAB job.
Like any other application, MATLAB has to be loaded as a module before you can use it. To see all the installed versions of the MATLAB use command module avail matlab.
The typical procedure for performing calculations on UA HPC systems is to run your program non-interactively on compute nodes. The easiest way to run MATLAB non-interactively is to use input/output redirection. This method uses Linux operators < and > to point MATLAB to the input file and tell where to write the output (see the example script). The other method is to invoke MATLAB from the PBS script and execute specified statement using -r option. For details please refer to the manual page of matlab command:
#PBS -N job_name
#PBS -W group_list=group_name
#PBS -q standard
#PBS -l select=1:ncpus=28:mem=168gb:pcmem=6gb
#PBS -l walltime=01:00:00
#PBS -l cput=28:00:00
module load matlab
matlab -nodisplay -nosplash < script_name.m > output.txt
The options -nodisplay and -nosplash in the example prevent MATLAB from opening elements of GUI. To view the full list of options for matlab command load the MATLAB module and type matlab -h in Linux prompt, or use the link above to the manual page on MathWorks website."
Apache Spark is a fast and general-purpose cluster computing system. However it has not been installed for support in a multi-node environment as yet. That functionality is planned for the future. It provides high-level APIs in Java, Scala, Python and R, and an optimized engine that supports general execution graphs. It also supports a rich set of higher-level tools including Spark SQL for SQL and structured data processing, MLlib for machine learning, GraphX for graph processing, and Spark Streaming.
There are some simple tests here.
OceloteGCC is available without loading a module. gcc --version shows that it is 5.2.0. If you also need the GNU Scientific Libraries (gsl), that is available using module load gsl which will get you version 2.1The Intel Compiler suite 2016 is available in both 32 and 64 bit versions. The math kernel libraries (mkl) are provided as separate modules, also in 32 and 64 bit versions.PGI compilers (Portland Group) are installed as a module. We have provided these in particular for their OpenACC support. We do not build software with the PGI compilers.MPI compilers. There are more choices now so pay attention.
The principles are similar for ElGato except that the intel and intel-mpi compilers are only available for the 2013 versions.
openmpi is available both for version 1.6.5 and version 1.8.1
El Gato has a separate web site with easy to follow instructions.
AVX2 The new cluster has Intel V3 Haswell processors. A key feature of these is AVX2. Read this Intel document.
- There are standard Red Hat versions of mpich, mvapich, mvapich2 and openmpi. Some extra options are invoked when you load one - use module avail to see the specific name.
- The same four compilers are available with more detailed options for gcc, intel and open64. Again use module avail for the appropriate choices.
For AVX2 support, compile with the -xHOST option. Note that -xHOST alone does not enable aggressive optimization, so compilation with -O3 is also suggested. The -fast flag invokes -xHOST, but should be avoided since it also turns on interprocedural optimization (-ipo), which may cause problems in some instances.
For GNU compilers, AVX support is only available in version 4.6 or later. For AVX support, compile with -mavx