Genome 540 Homework Assignment 8

Due Sunday Mar 6

  • Your output should provide
  • You must turn in your results and your computer program, using the template (file format) described below. Please put everything into ONE plain text file - do not send an archive of files or a tar file, or a word processing document file. Compress it (using either Unix compress, or gzip -- if you don't have access to either of these programs let us know), and send it as an attachment to both Phil at phg@u.washington.edu, and Tobias at mann@gs.washington.edu.

Here is the template.

<gs540_hw assignment='8' name='student name' email='student email'>
  <results>
    <result type='first line' file='filename'>
      first line of the .fna file that you use for viterbi training.
      NOTE: please do not change the filename or the first line in any way.
      Please include all characters of the first line (including the '>' character).
    </result>
    <result type='viterbi iteration' iteration='<some number>'>
      <result type='gene histogram'>
        The gene histogram should give the number of top strand and bottom strand
        genes found at this iteration, and should be reported as below.
        top strand genes=<some number>,bottom strand genes=<some number>
      </result>
      <model type='hmm'>
        The model object should specify an HMM by giving state labels, initial state
        probabilities (= transition probabilities from the begin state), state
        transition probabilities, and symbol emission probabilities.
        <states>
          give your state labels, separated by commas:
          1,2
        </states>
        <initial_state_probabilities>
          initial state probabilities should give, for each state, the state label and
          probability of starting in that state (i.e. the probability of transitioning
          into that state from the begin state) separated by an equals sign.
          Entries should be separated by commas.
          1=0.90000,2=0.10000
        </initial_state_probabilities>
        <transition_probabilities state='1'>
          transition probabilities should give, for each state, the state label and
          probability of transitioning to that state from the state indicated in the
          attributes list.
          The present field (with state='1') gives the probabilities of transitioning
          from state 1 to states 1 and 2:
          1=0.99000,2=0.01000
        </transition_probabilities>
        <transition_probabilities state='2'>
          1=0.20000,2=0.80000
        </transition_probabilities>
        <emission_probabilities state='1'>
          For each symbol emitted by the state indicated in the attributes for this
          field, give the probability of emitting that symbol.
          TTT=.15000,TCT=.15000,TAT=.15000,TGT=.15000, ...
        </emission_probabilities>
        <emission_probabilities state='2'>
          TTT=.20000,TCT=.15000,TAT=.15000,TGT=.10000, ...
        </emission_probabilities>
      </model>
    </result>
  </results>
  <program>
    <comments>
      put comments about your code here
    </comments>
    <file>
      file contents here
    </file>
  </program>
</gs540_hw>
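
The model fields in the template can be generated mechanically from the trained parameters. The following is a minimal sketch in Python, not part of the assignment: it assumes the HMM is held in plain dictionaries, and the function names format_probs, render_model and compress_submission are hypothetical. It writes every probability with five decimal places and a leading zero (0.15000), whereas the emission examples in the template omit the leading zero (.15000); the assignment does not say whether that difference matters, so check before relying on it.

```python
import gzip


def format_probs(probs):
    """Render {label: probability} as 'label=0.90000,label=0.10000'."""
    return ",".join(f"{label}={p:.5f}" for label, p in probs.items())


def render_model(states, initial, transitions, emissions):
    """Build the <model type='hmm'> element of the template as text.

    states:      list of state labels (strings), e.g. ["1", "2"]
    initial:     {state: probability of starting in that state}
    transitions: {from_state: {to_state: probability}}
    emissions:   {state: {symbol: probability}}
    """
    lines = ["<model type='hmm'>"]
    lines += ["<states>", ",".join(states), "</states>"]
    lines += [
        "<initial_state_probabilities>",
        format_probs(initial),
        "</initial_state_probabilities>",
    ]
    # One field per state: first all transition rows, then all emission rows.
    for field, table in (
        ("transition_probabilities", transitions),
        ("emission_probabilities", emissions),
    ):
        for state in states:
            lines += [
                f"<{field} state='{state}'>",
                format_probs(table[state]),
                f"</{field}>",
            ]
    lines.append("</model>")
    return "\n".join(lines)


def compress_submission(text):
    """Return the gzip-compressed bytes of the plain-text submission."""
    return gzip.compress(text.encode("utf-8"))


if __name__ == "__main__":
    # Illustrative values only. The emission tables here are deliberately
    # incomplete; a real model lists every symbol each state emits.
    model_text = render_model(
        states=["1", "2"],
        initial={"1": 0.9, "2": 0.1},
        transitions={"1": {"1": 0.99, "2": 0.01}, "2": {"1": 0.2, "2": 0.8}},
        emissions={"1": {"TTT": 0.15, "TCT": 0.15}, "2": {"TTT": 0.2, "TCT": 0.15}},
    )
    print(model_text)
    print(len(compress_submission(model_text)), "bytes after gzip")
```

Keeping the formatting in one helper means the states, initial, transition and emission fields all use the same label=probability layout, and the compressed bytes can be written to a single file for attachment.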