about sitemap home home
Databases Data Formats Database Search Genome Browser RNA Secondary Structure Alignments Primer Design WebServices
BLAST FASTA HMMER e2g
Exercise BLAST Exercise BLAST statistics
Bielefeld University Center of Biotechnoloy Institute of Bioinformatics BiBiServ
 
Database Searches - BLAST Exercise 2
Statistics
The Expect value (E) is a parameter that describes the number of hits one can "expect" to see just by chance when searching a database of a particular size. It decreases exponentially with the Score (S) that is assigned to a match between two sequences. Essentially, the E value describes the random background noise that exists for matches between sequences. For example, an E value of 1 assigned to a hit can be interpreted as meaning that in a database of the current size one might expect to see 1 match with a similar score simply by chance. This means that the lower the E-value, or the closer it is to "0" the more "significant" the match is. (For more details see the calculations in NCBI's BLAST Course.)
Use the sequence from Exercise 1 again:
  1. Perform a blastx search for Homo sapiens seuqences against
    • nr
    • SwissProt
  2. Look for a hit that appears in both outputs and compare the E-values: do they change? Why?