Motif finding - lesson plan
Learning objectives
- Understand concept of motif and why they are important for understanding biology
- Understand difference between motifs and frequent k-mers (chapter 2 vs. chapter 1)
- Understand concept of consensus string/motif
- Able to construct brute force algorithm to find motifs
- Understand motif profiles as probability distributions
- Able to score a sequence against a profile using probabilities
- Understand the necessity of entropy and information content
- Understand need for pseudo-counts
- Understand difference between Las Vegas and Monte Carlo randomized algorithms
- Understand Gibbs sampling procedure
- Understand why Gibbs sampler allows sub-optimal solutions
Class activities
- Describe biological problem
- Compute entropy of figure 2.2
- Score of motif table - brute force algorithm to consensus string
- From consensus string to probabilities - probabilistic scoring
- Randomized search
- Gibbs sampler
Exercises
Rosalind section 3