Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Migrated to Confluence 5.3

...

Code Block
shuffleSequences_fastq.pl R1.fastq R2.fastq > R1_R2.fastq &

2. Make hash of the reads using Velveth. Hash length (or kmer length) matters. You might have to run velvet several times to pick the best kmer length. It should be long enough to avoid false positives, but not long that you miss out reads in your assembly. Recommendation: try k-mer lengths between 61-73.

...

Contigs.fa will have all the contigs that were assembled. The contig header will have information such as length and coverage for that contig.
Note that the length and coverage information provided in the header of each contig should be understood in k-mers and in k-mer coverage respectively. E.g. for a 500bp contig and a k-mer length of 21, the length in the header will be 480.

To pull out the top 50 largest contig headers:

...