Learning Outcomes:
On completion of this module, students should be able to identify and apply some of the standard methods in the computational analysis of biosequences. They should understand why sequence regions are typically conserved or variable in populations and in evolution, and how that information may be applied. They should understand some of the challenges and solutions involved in dealing with large biological datasets, such as the problem of multiple testing across many molecular variants.
Indicative Module Content:
Protein and DNA sequence databases.
Sequence motifs.
Sequence alignment.
Trees based on sequence similarity.
Analysing genome-wide variation in genes.
Protein structure and interactions.
Somatic mutation databases in cancer.