Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • wget – retrieves the contents of an Internet URL
  • cp – copies directories or files located on any local file system
  • scp – copies directories or files to/from a remote system
  • rsync – copies directories or files on either local or remote systems

...

There are 3 local file systems available on any TACC compute cluster (stampede2, lonestar6, etc.), each with different characteristics. All these local file systems and your account has a directory in each of the three.

The 3 file local systems have different characteristics, but all are fast and set up for parallel I/O.

...


HomeWorkScratch
quota10 GB1024 GB = 1 TB2+ PB (basically infinite)
policybacked upnot backed up,
not purged
not backed up,
purged if not accessed recently (~10 days)
access commandcdcdwcds
environment variable$HOME

$WORK (different sub-directory for each cluster)

$STOCKYARD (root of the shared Work file system)

$SCRATCH
root file system/home/work/scratch
use forSmall files such as scripts that you don't want to lose.Medium-sized files you don't want to copy over all the time. For example, custom programs you install (these can get large), or annotation file used for analysis.Large files accessed from batch jobs. Your starting files will be copied here from somewhere else, and your final results files will be copied elsewhere (e.g. stockyard, corral, your BRCF POD, or your organization's storage area.

When you first login, the system gives you information about disk quotas and your compute allocation balance in "SU" (system units).

Code Block
--------------------- Project balances for user abattenh ----------------------
| Name           Avail SUs     Expires | Name           Avail SUs     Expires |
| OTH21095            1000 905  20222023-09-30 | MCB21106            1496 995  20222023-1209-3130 |
| OTH21164             1000215  20222024-1205-31 | OTH21180 OTH21037            4224899  20222024-1203-31 |
------------------------ Disk quotas for user abattenh ------------------------ 
| Disk         Usage (GB)     Limit    %Used   File Usage       Limit   %Used |
| /home1 scratch             0.07       110.70     0.00           52567           0    0.00 |
| /home1              0.0      11.7     0.01          232           0    0.00 |
| /work             169.0    1024.0    16.50        7936279361     3000000    2.65 |
------------------------------------------------------------------------------- 

changing TACC file systems

...

cdw cds cd
Code Block
languagebash
titleChanging file systems at TACC
at TACC
cdw    # cd $WORK
cds    # cd $SCRATCH
cd     # cd $HOME


Tip

The cd (change directory) command with no arguments takes you to your home Home directory on any Linux/Unix system.

The cdw and cds commands are specific to the TACC environment.

...

File systems are storage areas where files and directories are arranged in a hierarchy. Any computer can have one or more file systems " mounted" (accessible as local storage). The df command can be used on any Unix system to show all the "top-level" mounted file systems. TACC has a lot of temporary file systems, so lets just look at the first 15 and tell df to use "human readable" size formatting with the -h option:

Code Block
languagebash
df -h | head -15

The rightmost "Mounted on" column give the top-level access path. Find /home1, /work, and /scratch and note their sizes Size numbers!

What do we mean by "hierarchy"? It is like a tree, with the root file system (denoted by the leading / ) as the trunk, sub-directories as branches, sub-sub-directories as branches from branches (and so forth), with files as leaves off any branch.

...

But everyone has a Home directory, so you must only be seeing a part of the Home directory hierarchy. To see the absolute path of a directory you're in, use the pwd -P command. Note that absolute paths always start with a forward slash ( / ) denoting its the root file system.

Code Block
languagebash
pwd -P
# will show something like this
# /home1/01063/abattenh

...

Here's a depiction of the three file systems as seen from your Home directory ( ~ ), showing where the path-valued environment variables represent, and where the three symbolic links (~/CoreNGS, ~/scratch, ~/work) you created in your Home directory point. Notice that both the Work and Scratch file systems have a top-level hierarchy like we saw in Home above.

On many Linux systems, you can use the tree command to view the full file system hierarchy starting from a specified directory:

Image Added

Staging your data

...