Name:

BIO 450/ CS 499: Assignment 2

For each of your unknown (i.e., query) sequences, answer the following questions. The points for each question will be assessed across all of the sequences as a whole. If one of the answers for one of the sequences is incomplete or inaccurate a part of the total points for that question will be deducted.

For example, you get 1 point for identifying the GenBank accession number. If several are missing you will lose the whole point. If only one or two is missing you will lose half a point.

Sequence 1

What is the length the query sequence? Where did you find this information, or how did you compute it? (4 points total)

What is the most likely function encoded by this sequence? What data supports this conclusion? (4 points total)

What organism was the most likely source of the sequence? What is the common name for this organism, if one exists? What data supports these answers? (4 points total)

What is the GenBank accession number for the best-matching sequence? (1 point total)

Estimate the number of sequences with an E value less than 0.01. Briefly summarize what other sequences these appear to hit (are they from the same or different organisms? Do the other sequence hits have seemingly similar functions?). (4 points total)

If available, list the names of three different organisms with sequences that achieve significant e-values. (3 points total)

Sequence 2

What is the length the query sequence? Where did you find this information, or how did you compute it? (4 points total)

What is the most likely function encoded by this sequence? What data supports this conclusion? (4 points total)

What organism was the most likely source of the sequence? What is the common name for this organism, if one exists? What data supports these answers? (4 points total)

What is the GenBank accession number for the best-matching sequence? (1 point total)

Estimate the number of sequences with an E value less than 0.01. Briefly summarize what other sequences these appear to hit (are they from the same or different organisms? Do the other sequence hits have seemingly similar functions?). (4 points total)

If available, list the names of three different organisms with sequences that achieve significant e-values. (3 points total)

Sequence 3

What is the length the query sequence? Where did you find this information, or how did you compute it? (4 points total)

What is the most likely function encoded by this sequence? What data supports this conclusion? (4 points total)

What organism was the most likely source of the sequence? What is the common name for this organism, if one exists? What data supports these answers? (4 points total)

What is the GenBank accession number for the best-matching sequence? (1 point total)

Estimate the number of sequences with an E value less than 0.01. Briefly summarize what other sequences these appear to hit (are they from the same or different organisms? Do the other sequence hits have seemingly similar functions?). (4 points total)

If available, list the names of three different organisms with sequences that achieve significant e-values. (3 points total)

Sequence 4

What is the length the query sequence? Where did you find this information, or how did you compute it? (4 points total)

What is the most likely function encoded by this sequence? What data supports this conclusion? (4 points total)

What organism was the most likely source of the sequence? What is the common name for this organism, if one exists? What data supports these answers? (4 points total)

What is the GenBank accession number for the best-matching sequence? (1 point total)

Estimate the number of sequences with an E value less than 0.01. Briefly summarize what other sequences these appear to hit (are they from the same or different organisms? Do the other sequence hits have seemingly similar functions?). (4 points total)

If available, list the names of three different organisms with sequences that achieve significant e-values. (3 points total)

Sequence 5

What is the length the query sequence? Where did you find this information, or how did you compute it? (4 points total)

What is the most likely function encoded by this sequence? What data supports this conclusion? (4 points total)

What organism was the most likely source of the sequence? What is the common name for this organism, if one exists? What data supports these answers? (4 points total)

What is the GenBank accession number for the best-matching sequence? (1 point total)

Estimate the number of sequences with an E value less than 0.01. Briefly summarize what other sequences these appear to hit (are they from the same or different organisms? Do the other sequence hits have seemingly similar functions?). (4 points total)

If available, list the names of three different organisms with sequences that achieve significant e-values. (3 points total)

Sequence 6

What is the length the query sequence? Where did you find this information, or how did you compute it? (4 points total)

What is the most likely function encoded by this sequence? What data supports this conclusion? (4 points total)

What organism was the most likely source of the sequence? What is the common name for this organism, if one exists? What data supports these answers? (4 points total)

What is the GenBank accession number for the best-matching sequence? (1 point total)

Estimate the number of sequences with an E value less than 0.01. Briefly summarize what other sequences these appear to hit (are they from the same or different organisms? Do the other sequence hits have seemingly similar functions?). (4 points total)

If available, list the names of three different organisms with sequences that achieve significant e-values. (3 points total)

Sequence 7

What is the length the query sequence? Where did you find this information, or how did you compute it? (4 points total)

What is the most likely function encoded by this sequence? What data supports this conclusion? (4 points total)

What organism was the most likely source of the sequence? What is the common name for this organism, if one exists? What data supports these answers? (4 points total)

What is the GenBank accession number for the best-matching sequence? (1 point total)

Estimate the number of sequences with an E value less than 0.01. Briefly summarize what other sequences these appear to hit (are they from the same or different organisms? Do the other sequence hits have seemingly similar functions?). (4 points total)

If available, list the names of three different organisms with sequences that achieve significant e-values. (3 points total)