...
The following is a categorized list of software available through the UT GSAF and/or TACC's life sciences group. Each page lists a summary of the software, the hardware it is currently installed on, links to user documentation, and helpful tips.
Many of these are available via the TACC module
system (use module keyword
or module spider
to search).
The current listing of modules and versions available on Lonestar is also posted here.
In addition, the BioITeam maintains executables in /corral-repl/utexas/BioITeam/bin
; we advise adding this directory to your path.
Contents
- Getting an account of GSAF's Galaxy instance NOTE: Galaxy at this site is no longer actively supported.
- Getting an account on GSAF server-fourierseq
- Getting started with Unix and Perl
- Tips for working with TACC resources
- TACC Lonestar workflow scripts
- GSAF:References (by organism)
- GSAF:General purpose tools
- GSAF:Galaxy Workflows
- GSAF:Microarray data analysis tools
- GSAF:NGS Data Quality Control Tools
- GSAF:Mappers / Aligners
- GSAF:Gene prediction tools
- GSAF:SNP discoveryGSAF:
- RNA-Seq analysis
- Splice Junction discovery
- GSAF:Genome Alignment and Visualization
- GSAF:De novo assembly
- Transcriptome De novo assembly
- GSAF:454 Analysis tools
- GSAF:Useful scripts
- GSAF:Software users group meetings
- Variant calling
- ZOHO Information
- Extracting barcode split data from SOLiD 5500 XSQ files
- R and R packages
...
General purpose tools
Anchor | ||||
---|---|---|---|---|
|
- Blast
- Bioconductor
- Bioperl
- BioMart Perl APIs
- NCBI Eutils
- R
- Python Library
- Graphics programs
- BOOST libraries
- Phred, Phrap, Consed, cross_match, daev
- Picard
- Hmmer
- Data compression programs
- Clustering programs - MCL and usearch, uclust
- ssh - generating keys
Microarray data analysis tools
Anchor | ||||
---|---|---|---|---|
|
NGS Data Quality Control Tools
Anchor | ||||
---|---|---|---|---|
|
Mappers/Aligners
Anchor | ||||
---|---|---|---|---|
|
- mapreads SOLiD data only, ungapped alignment
- MAQ - best for short-read SNP calling; ungapped alignment
- muscle - "old school" aligner - good for 454 amplicons
- SOAP - very fast and versatile: any read length, gapped, paired-end, SNP calling
- SSAHA & SSAHA2 - like Maq, fast for ungapped mapping - SNP calling, contig placement to reference, etc.
- Bowtie - very fast, ungapped alignment. Does not support color space data
- SHRiMP - A sensitive and accurate mapper. Supports color space data and gapped alignment.
- BFAST - BLAT-like short read mapper. Natively supports SOLiD colorspace short reads.
- BWA - The successor to MAQ; a BW mapper, but which allows for gaps and handles colorspace natively.
- GMAP and GSNAP - Mappers for cDNA and very sensitive detection of short indels.
- Mosaik - A suite of alignment and reference-guided assembly tools.
- See Category:Mapper for more details.
Gene prediction tools
Anchor | ||||
---|---|---|---|---|
|
SNP discovery and Annotation
Anchor | ||||
---|---|---|---|---|
|
- Corona-Lite - SOLiD data only
- Breakdancer
- Genome Analysis Tool Kit (GATK)
- MAQ - best for short-read SNP calling; ungapped alignment
- Picard
- SOAP - very versatile: any read length, gapped, paired-end, SNP calling
- SAMTOOLS
- Annovar
RNA-Seq Analysis
Anchor | ||||
---|---|---|---|---|
|
- Tophat-Cufflinks-Cuffdiff, ignoring novel transcripts
- Tophat-Cufflinks-Cuffdiff, allowing for novel transcripts
- Removing duplicates from alignment output
Splice Junction discovery
Anchor | ||||
---|---|---|---|---|
|
...
Genome Alignment and Visualization
Anchor | ||||
---|---|---|---|---|
|
...
- Circos
- IGV
- MaqView
- Mauve
- Affymetrix Integrated Genome Browser: easy to install genome browser. Download here
- SAMStat
De novo assembly
Anchor | ||||
---|---|---|---|---|
|
- MIRA
- Velvet
- ABYSS
- Using consed, including editing Mira assemblies
- ABI's SOLiD de novo pipeline
- Phred, Phrap, Consed, cross_match, daev
- Allpaths-LG
- Mosaik - A suite of alignment and reference-guided assembly tools.
- (Newbler, the Roche/454 assembler, is under 454 Analysis tools)
Transcriptome de novo assembly
Anchor | ||||
---|---|---|---|---|
|
ABI pipelines
Anchor | ||||
---|---|---|---|---|
|
...
- Convert ABI SOLiD data to fasta fastq
- General parser scripts - scripts for parsing and filtering of fasta, fastq files and output files from different mappers; base space, color space conversion scripts.
- Small rna analysis
- Generation of wig files from mapreads output
- Conversion of mapreads output to GFF, SAM, or BAM format - These utilities can be used to convert mapreads mapping output to base space format
- Generation of gene counts from results of mapping to genome - These scripts can be used to identify the reads that correspond to genes, after mapping to the genome.
- Get Tm (melting temperature), length, and %GC from a bunch of sequences
- Conversion of gene ID's from one form to another (i.e. NCBI to Ensembl & vice-versa)
- Quick tips on GO analysis
- Median polish to consolidate quantitations
- Make a quick venn diagram based on lists in 3 files
- Plot a read length histogram based on sequences in a fasta file
- Reverse complement for fasta files
- Tricks to preprocess SOLiD and 454 data
- Convert BLAST results to GFF
Software users group meetings
Anchor | ||||
---|---|---|---|---|
|
...