]> first line of the input fasta file This should give, for each dinucleotide occurring in the sequence (not including dinucleotides containing ambiguity codes) the dinucleotide, followed by an equals sign, followed by an integer giving the number of times the dinucleotide occurs in the sequence. Put a comma between the different dinucs. e.g. AA=50,AC=50,AG=50,AT=50 Like dinucleotide histogram, but giving fraction of times (to 4 decimal places) each dinucleotide occurs in the sequence. Again, ignore ambiguity-coded nucleotides. e.g. for the counts given above one would get AA=.2500,AC=.2500,AG=.2500,AT=.2500 Put the matrix of conditional nucleotide probabilities here, as a list (first_nuc,second_nuc)=count,... For example (A,A)=0.25,(A,C)=0.75, ... where the interpretation is that nucleotide A is followed 25% of the time by another A, and 75% of the time by C. Again, ignore occurrences of ambiguity-coded nucleotides. A list of positions in the input fasta file of D-segments where scores g.t.e. 100 occurred. These should be given in the form (start,stop) where start indicates position of the first C (in top strand, origin 1 co-ordinates), and stop indicates position of the last C. e.g. (15774,15779),(16007,16014),... This should be a list of the form (i,n) where i is an integer and n gives the number of times a D-segment scored g.t.e. i and l.t. i+1 occurred. Omit cases i if no score in that range was observed. e.g. (100,403),(101,250),(102,17),... The value of lambda determined in part 3. The frequencies of the scores +17 and -1 that you used to find lambda, to five decimal places. e.g. +17=0.00025, -1=0.99975 first line of the simulated fasta file. The filename for the simulated sequence should be as above, 'chr21_sim'. As above for chr21, but for the simulated sequence. As above for chr21, but for the simulated sequence. As above for chr21, but for the simulated sequence. As above for chr21, but for the simulated sequence. The table from part 6. It should be in the format ( s, h(s), h(s)/h(40), predicted h(s)/h(40) ). where s is the score, and h is as described in the assignment text. e.g. (40, 3500, 1, 1), (41, 3200, 0.97, 0.96), ... The written answer for part 7. Please write at most a paragraph or two. put comments about your code here file contents here