Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

We're really interested in places in the genome where we think there are mutations. In the Variant calling tutorial we identified such locations but lacked a good way to visualize them. This is your opportunity to visualize them. We have already transferred the SRR030257.vcf file back to your local computer, but before we can visualize them, we need to (guess what?) index it.

...

It will look like nothing has happened aside from the appearance of "Done" in the messages box, but you can now close the "Run" window and choose File > Load from File. If you navigate to your IGV directory, you will now see a brand new SRR030257.vcf.idx file. You can now load the SRR030257.vcf file, and it will show up as a new track near the top of your window.

...

  • Zoom in using the slider in the upper right. Do this until you see mapped reads and finally individual bases appear.
  • Navigate by clicking and dragging in the window. This is how you move left and right along the genome.
  • Navigate more quickly. Use page-up page-downhomeend.
  • Jump to the next point of interest. Click on a track name on the left side of the window (Ex: bowtieSRR030257.vcf), to select it. You can then use control-f and control-b to jump forward and backward within that list of features. Try this on the variant calls track.
  • Jump right to a gene. (If you have gene features loaded.) Type its name into the search box. Try "topA".
  • Load multiple BAM alignments or VCF files at once. Try this to compare a few different regions between the bowtie and BWA results.
  • Change the appearance of genes. Right click on the gene track and try "expanded". Experiment with the other options.
  • Change the appearance of reads. Right click on a BAM track and choose "show all bases" and "expanded". Experiment with the other options.

...

  1. Download and install the Integrative Genome Viewer from the Broad Institute. 
  2. Select "Human hg19" as the reference genome from the top left drop down (you may need to select "more" to have hg19 as an option)
  3. Get some data: File > Load from Server… > 1000 genomes > Alignments > ACB > exome > HG01880
  4. Navigate to the rightmost exons of the GABBR1 gene.
  5. Zoom in until you find some SNPs. (Hint look just to the left of the 2nd exon).
  6. What type of library is this? (Hint: zoom out)
  7. If you knew this was a cancer patient, consider how strongly you would think this may be a potentially causative mutation.
  8. Imagine it was actually in the exon rather than just into the intron... would that make you consider it more?
  9. Load and look at the SNP track: File > Load from server > Annotations > Variants and Repeats > dbSNP 1.3.7
  10. The track may load with the Refseq genes, making it useful to resize that window to view both the gene and the dbSNP information simultaneously.
  11. Consider if this makes you think it more likely or less likely that this is a causative mutation.

 

...

Optional Tutorial Exercises ...

 

Expand
titleTo visualize mapped data without calling variants

You will need to index your reference FASTA and convert your SAM output files into sorted and indexed BAM files. The "why?" behind these steps is described more fully in the Variant calling tutorial. If you are in your mapping directory, these commands will perform the necessary steps.

Warning
titleSubmit to the TACC queue or run in an idev shell
Code Block
samtools faidx NC_012967.1.fasta
samtools view -b -S -o bowtie/SRR030257.bam bowtie/SRR030257.sam
samtools sort bowtie/SRR030257.bam bowtie/SRR030257.sorted
samtools index bowtie/SRR030257.sorted.bam

Repeat the last three commands for each SAM output file that you want to visualize in IGV.

...