Computational Molecular Evolution
18–29 May 2024
Wellcome Genome Campus, UK
Hands-on training in the latest methods and skills for molecular evolutionary analysis of sequence data
Run biennially at the Wellcome Genome Campus, this hands-on computational course aims to provide early-career stage researchers with the theoretical knowledge and practical skills to carry out molecular evolutionary analyses on sequence data. The previous iteration of this course was run in Heraklion, Greece, 2023.
In addition, the course will offer a unique opportunity for direct interaction with some of the world-leading scientists and authors of famous analysis tools in phylogenomics and population genomics.
The demand for such training is large and growing, as are the sequence datasets and researchers’ awareness of the important insights that can be gained from phylogenetic and molecular evolutionary techniques.
Experience from previous years has led to preference being given to candidates who:
- are doctoral students and postdocs in the early stages of their research career;
- already have some familiarity with phylogenetic methods (i.e. have already used some of the relevant tools);
- have already collected/assembled a molecular sequence dataset to analyse in their work;
- are instructors teaching a molecular evolution course to biology students.
We accept a mixture of bioinformaticians and empirical biologists to encourage interactions between the disciplines.
The course is also suitable for established researchers and instructors who would like to further their understanding of statistical methods and computational algorithms that underlie modern phylogenetic and population genetic software tools such as RAxML-NG, MrBayes/RevBayes, PAML, BEAST, and BPP.
The overall objective is to raise the standard of research that uses statistical methods and software programs in the field of molecular evolution and phylogenetics, by providing high-quality, in-depth, face-to-face training on this topic for as many researchers as possible.
The extensive programme comprises a mixture of lectures and computer practicals covering the following topics:
- Overview of phylogenetic terminology, methods and analyses
- Multiple sequence alignment
- Markov models of nucleotide, amino acid, and codon substitution
- Phylogeny reconstruction (including maximum likelihood and Bayesian methods)
- Hypothesis testing and computer simulation
- Coalescent and multispecies coalescent
- Analysis of genomic data under multispecies coalescent
- Protein and nucleotide sequence analysis
By the end of the course participants will have an improved theoretical understanding of relevant methods in phylogenomics and population genomics and will be able to:
- Critically interpret and apply relevant techniques to their evolutionary research.
- Interpret evolutionary trees and recognise the power of molecular phylogenies for understanding real-world biological questions, relating to evolutionary history, current-day biodiversity and future diversification of living organisms.
- Extract genome sequence data from public databases, and create multiple sequence alignments.
- Employ appropriate bioinformatics skills for analysing genome-scale datasets, including linux command line, shell scripts, and compiling and running specialist programs.
- Select and apply phylogenetic software packages (such as RAxML-NG, PAML, MrBayes, BEAST, BPP) to infer phylogenetic trees, estimate species divergence times, and test phylogenetic hypotheses.
- Explain the underlying principles of major phylogenetic methods such as distance matrix-based, maximum likelihood, and Bayesian methods, including the MCMC method.
- Explain the use of Markov models of nucleotide, amino acid and codon substitution, hypothesis testing using the likelihood ratio test, coalescent and multispecies coalescent models in species tree estimation and species delimitation.
- Apply likelihood ratio tests to infer the existence and location of molecular adaptation affecting protein-coding genes.
- Estimate species trees, delimit species boundaries and estimate the time and rate of between-species gene flow using coalescent-based analyses of genomic sequence data.
The training team will also organise ad hoc sessions on special topics during the course depending on the interests of the participants. These will be led by course instructors or teaching assistants. Ad hoc sessions in past courses have included introductions to source control using github, high-performance computing (HPC), calculation of Bayes factors, and gene-tree species-tree reconciliation.
Please note: This course focuses on molecular evolutionary and phylogenetic analyses of molecular sequence data, including genomic data generated using next-generation sequencing (NGS) technology. This course does not cover pre-processing of NGS data such as read mapping, variant calling, or genome assembly.
Instructors and speakers
Lead Instructors and Organisers
Aglaia (Cilia) Antoniou
Institute of Marine Biology, Biotechnology and Aquaculture, Hellenic Centre for Marine Research (IMBBC, HCMR), Heraklion, Greece
University of Washington, USA
Institute of Computer Science Foundation for Research and Technology, Hellas, Heraklion, Greece
University College London, United Kingdom
How to apply
While the course will predominantly rely upon command-line programs, knowledge of Unix/Linux commands is not a prerequisite for this course. Participants who are selected for this course will be required to complete pre-course exercises using Unix/Linux commands.The information about these pre-course exercises will be provided to the successful applicants.
How to Apply
Please click the Apply button above to begin the online application process. Places are limited and will be awarded on merit. If you have any problems with the online application process, please contact us.
Please note: Applications must be supported by a recommendation from a scientific or clinical sponsor (e.g. supervisor, line manager or head of department) merely stating their support for the applicant to attend the course and potential funding, if available – no lengthy standard recommendation letters are required. A request for a supporting statement will be sent to the nominated sponsor automatically during the application process. Applicants must ensure that their sponsor provides this supporting statement by the application deadline. Applications without a supporting statement will not be considered.
Successful applicants will be provided with a support letter for their visa application, if required.
Please visit the following websites for further information on visiting the UK:
|Cost||Accommodation / meals|
|*Course fee||£1900||This is a residential course and the fee includes all accommodation and meals.|
*The course fee is subsidised by Wellcome Connecting Science and applies to non-commercial applicants. Please contact us for the commercial fee.
Limited bursaries are available (up to 50% reduction on the course fee) and are awarded on merit. If you would like to apply for a bursary, please complete the bursary section of the online application form.
Where there are many bursary applications, the selection committee may issue smaller amounts.
Bursaries can be applied for as part of the course application form. Applicants will be notified of a bursary award along with their place on the course, usually within one month of the application deadline. The decision of the selection committee is final.
Please note that both the applicant and sponsor are required to provide a justification for the bursary as part of the application.
If you wish to book onsite accommodation either side of the course dates, please contact the Conference Centre directly.
Additional funding opportunities
Visit our support page for additional financial support currently available.
Accommodation services phishing scam – please be vigilant. More information.