A file in fasta format is probably the most common way to store sequence information. The format is very simple: the name of each sequence is listed on a line beginning with a '>' (the sequence name is everything after the '>'), and the sequence associated with that name is listed on one or more lines after the name line. For a more detailed description, see http://www.ncbi.nlm.nih.gov/BLAST/fasta.shtml. For the examples below, program output will be shown for the following fasta-format file, called seqs. fa: human ORF Finding: Write a program find0rf s .py that takes the name of a single fasta-format file from the command line. Your program should read DNA sequences from the fasta file, translate them into protein sequences, and finally print out the protein sequence of all possible ORFs (open reading frames starting with a start codon and ending with one of the three stop codons, with no stop codons in between]. You should look at all three reading frames on the forward strand (do not worry about the three reading frames on the reverse complement). The ORFs your program will be able to find can range in length from 2 amino acids (a start and a stop) to the entire fasta sequence record (of any length). There may be more than one ORF in each fasta sequence record, possibly in different reading frames. You do not need to worry about outputting the ORFs in any specific order so long as your output includes all ORFs present in the sequence. To get you started, think about tackling the problem as follows: Read sequences from the fasta file For each sequence, translate the reading frames starting at the 1st, 2nd, and 3rd base in the sequence For each translated sequence, find any ORFs present in that sequence Example program usage: python findOrfs.py seqs.fa ORFs in seqA - human: ORFs in seqC - gorilla: MTR ORFs in that sequence