Comp 555: BioAlgorithms -- Fall 2013
Homework Information: Some of the problems are probably too long to attempt the night before the due date, so plan accordingly. No late homework will be accepted. However, your lowest homework score will be dropped. Feel free to work with others, but the work you hand in should be your own.
Question 1. Consider the following distance matrix
Question 2. Consider the following SNP panel where rows are haplotypes and columns are SNPs.
Compute and depict the maximal compatible intervals using each of the following:
Programming Problem. (Please submit code by emailing kemal@cs.unc.edu with the subject "COMP 555 PS5") Consider the following file of genotypes, Gtypes.csv, with four samples. Each row of this file gives the genotypes at one genomic position. The columns correspond to a marker name, a chromosome, and the genotype calls for all four samples. The column labelled "G2" is a second-generation cross with the following pedigree: G2 = FVB/NJ x (PWK/PhJ x WSB/EiJ), and is thus a descendant of the samples genotyped in the other three columns. This animal has one chromosome inherited from its maternal parent (FVB/NJ), and the second chromosome is a mix of its grandparents (PWK/PhJ and WSB/EiJ). The objective of this programming project is to use a Hidden Markov Model (HMM) to infer the genomic origin (PWK/PhJ or WSB/EiJ) of the second chromosome at each marker. This problem closely resembles the "Fair-Bet Casino" problem discussed both in the textbook and in class. There are two possible states at every marker, P and W. The HMM emits a genotype value with the following likelihoods:
Here 'f' represents the FVB "nucleotide" genotype call, and 'p'/'w' represent the PWK/WSB nucleotide when it differs from 'f'. The probability of transitioning from the P state to the W state, or vice versa, is 0.01.
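As with the Fair-Bet Casino problem, the most likely sequence of P/W states can be decoded with the Viterbi algorithm. The following is a minimal sketch, not a reference solution. The function name viterbi, the uniform starting distribution, and the symbols and probabilities in example_emission are all assumptions made for illustration; only the two states and the 0.01 switch probability come from the problem statement. Substitute the emission likelihoods given above and the genotype symbols derived from Gtypes.csv.

```python
import math


def viterbi(observations, emission, switch_prob=0.01, states=("P", "W")):
    """Return the most likely state path for a two-state HMM.

    emission[state][symbol] is the probability that `state` emits `symbol`.
    Log probabilities are used to avoid underflow on long marker sequences.
    """
    if not observations:
        return []

    stay = math.log(1.0 - switch_prob)
    switch = math.log(switch_prob)

    def log_emit(state, symbol):
        p = emission[state].get(symbol, 0.0)
        return math.log(p) if p > 0 else float("-inf")

    # Assumption: both states are equally likely at the first marker.
    score = {s: math.log(1.0 / len(states)) + log_emit(s, observations[0])
             for s in states}
    backpointers = []

    for symbol in observations[1:]:
        new_score, pointer = {}, {}
        for s in states:
            # Best predecessor of state s at this marker.
            best_prev = max(
                states,
                key=lambda r: score[r] + (stay if r == s else switch),
            )
            pointer[s] = best_prev
            transition = stay if best_prev == s else switch
            new_score[s] = score[best_prev] + transition + log_emit(s, symbol)
        score = new_score
        backpointers.append(pointer)

    # Trace back from the best final state.
    path = [max(states, key=lambda s: score[s])]
    for pointer in reversed(backpointers):
        path.append(pointer[path[-1]])
    path.reverse()
    return path


# Hypothetical placeholder symbols and likelihoods, NOT the assignment's table.
example_emission = {
    "P": {"x": 0.9, "y": 0.1},
    "W": {"x": 0.1, "y": 0.9},
}

if __name__ == "__main__":
    calls = ["x"] * 5 + ["y"] * 5
    print("".join(viterbi(calls, example_emission)))  # PPPPPWWWWW
```

Because switching states costs log(0.01), a single discordant marker does not by itself trigger a state change under these placeholder values; only a sustained run of evidence for the other state does.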