Phylogeny inference and data analysis, spring 2011
Phylogeny inference and data analysis, spring 2011
Lecturer
Scope
4-12 cu.
Time schedule, prerequisites, content
Basic probabilistics and statistical inference are assumed as prerequisites.
The course is tailored for Bioinformatics Master program (MBI) and thus some knowledge on sequence analysis, working with sequence databases, sequence alignment, distance matrix based phylogenies by MEGA-software, are assumed to be known.
The course is, however, open for all students: additional sessions will be arranged for students who are not familiar with these basics (introduced in other MBI-courses).
Part I
III period:
Tuesdays 15-17 and Thursdays 14-16 in B120 and in computer class C128.
Time schedule for extra sessions (for non-MBI -students) in computer class during three first course weeks in negotiable (for example before or after lecture times).
The aim of the course is to elucidate both biological and statistical aspects of phylogenies, i.e. evolutionary trees and networks, which are elementary structures of differences among biological entities (species, individuals, genes, sequences in general), amenable to statistical inference. The major categories of phylogeny inference methods are distance matrix, parsimony, maximum likelihood, Bayesian and network approaches.
Exam (40 points) and homework assignments (20points): 4cr, essay (statistically or biologically focused): 2cr.
50-60 points: grade 5, 30 points: grade 1.
Course is focused on practical working => exam in computer class.
Week 1 program is for non-MBI (Bioinformatics Master program) students. Lectures and practicals in computer class overlap with topics which have in been in Introduction to bioinformatics / Molecules for bioinformatics: Basic phylogeny concepts, basic practical computer working and getting familiar with molecular sequence databases.
18.01., 15-17, computer class C128.
Week 2
Tue 25.01, 15-17, B120, Description of the course, assignments, how to proceed with data collection.
Thu 27.01, 14-16, B120, Alignment with Clustal, editing with Genedoc, MEGA
Week 3
Tue 01.02, 15-16, C128, Practical session with MEGA, ,
Wed 02.02, 12-20, C128, Extra help for those that need, data collections and alignments for datasets 1 and 2.
Thu 03.02, 14-16, D340, Phylogeny books and programs, Distance matrix methods (this is given as a paper copy), Nucleotide substitution modelling.
Week 4
Tue 08.02, 15-16, C128, MrBayes demo, checking that everybody can start with the program. Use this NEXUS-file for training: .
Wed 09.02, 12-14, C128, Extra help, checking results,dataset 1 should be ready for MrBayes analyses (i.e. you have already done MEGA-analyses) and dataset 2 collected.
Thu 10.02, 14-16, B120, Parsimony, Maximum likelihood.
Week 5
Tue 15.02, 15.00-16, C128, Datasets 3 and 4, Splitstree and Network.
Thu 17.02, 14-16, B120, Lectures on methods continue.
Week 6
Tue 22.02, 15-17, C128, Results from datasets 1 and 2 + lectures on methods (2h session probabaly not enough, we continue until everything is done.)
Thu 24.02, 14-16, B120, Results from dataset 4 + lectures on networks and phylogeny examples (phylodynamics, phylogeography, phylogenomics, phyloprofiling).
Week 7
Thu 03.03, 16-20, C128, EXAM.
Lectures, links, note also the material in "software tools" | Recommended review (and other) papers |
---|---|
| Before phylogenies, seq collection and alignments: |
Distance matrix methods in phylogeny inference |
|
|
|
Examples |
---|
Phylogenies from non-sequence material, |
Assignments (homework) |
---|
Working in 2-4 student groups. You get 20 points for the total 60 points (40 from exam) by doing these. Note that the course is focused on practicals and you need these skills in exam (reasonable aspects, not sophisticated details).
Dataset 1 |
Dataset 2 |
Dataset 3 |
Dataset 4 |
Software tools |
---|
Sequence alignment
|
Phylogeny software
|
Part II
IV period: Practical project work in computer class and at home, time schedule negotiable (4-6 cr), topics to negotiated at the end of part I
The project is based on sequence data collected from databases.
Registration
This is registration to part I. At the end of III-period you should decide if you continue to part II.
Did you forget to register? What to do.