Transcript Quantification with Kallisto
Skip the mapping step with Kallisto
*Thanks to Anna Battenhouse to the text and figures!
Kallisto is a tool from the Pachter lab that performs quanitfication of transcripts without requiring alignment.
- No explicit alignment to reference genome or transciptome
- Instead, uses “pseudoalignment” to transcriptome
- For each read, determine not where in each transcript it aligns, but rather which transcripts it is compatible with
- Simultaneously addresses 2 aspects of “multi-mapping” reads in traditional RNAseq pipelines
- Multiple possible genomic loci (addressed during alignment)
- Multiple possible transcripts of origin (addressed during quantification)
- Pseudoalignments are sufficient to quantify transcript abundances
- Expectation Maximization (EM) algorithm is applied to a “simple” RNAseq Likelihood function
- Report estimated abundances as Transcripts per Million (TPM) + counts
No P-value reported or differential expression (DE) support, but…
- kallisto re-runs EM on multiple bootstrap re-samples to estimate variance
- then kallisto bootstraps are used by add-on sleuth DE package
Why Kallisto?
Speed and performance are greatly improved with Kallisto.
Welcome to the University Wiki Service! Please use your IID (yourEID@eid.utexas.edu) when prompted for your email address during login or click here to enter your EID. If you are experiencing any issues loading content on pages, please try these steps to clear your browser cache.