Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Code Block
titlefastx_Clipper example
fastx_clipper -a GATCGGAAGAGCACACGTCTGAACTCCA -i data/Sample1_R1.fastq -o Sample1_R1.cutadaptfastxclipped.fastq -l 20


More sophisticated tools like Trimgalore and cutadapt may be suitable, particularly with dealing with paired end data.

Code Block
titleCutadapt example
module load cutadapt
#IF WE HAD PAIRED END DATA
cutadapt -a GATCGGAAGAGCACACGTCTGAACTCCA -o <R1_outputfile> -p <R2_outputfile> <inputR1file> <innputR2file><inputR2file>




Expand
titleThe gory details on the *-a* adapter sequence argument

Please refer to https://utexas.atlassian.net/wiki/display/GSAF/Illumina+-+all+flavors for Illumina library adapter layout.


Below information holds true from old ligation-based RNA-libraries. In this case, the R1 adaptor is different from the R2 adaptor.
 
For current stranded RNA libraries (such as dUTP, used by GSAF), the R1 adaptor would be the same as the R2 adaptor.
 
Fastqc reports will give you an idea about which adaptor is present in your data. Further, it's always a good idea to grep -c <partofdaptorseq> <fastqfile> to make sure you have the right adaptor sequence before trimming.


...