next up previous
Next: Biological Background Up: Gene Finding Previous: Gene Finding

Motivation

GenBank is huge. There are more than 3 billions bases of human DNA sequences and complete DNA sequences for dozens of species available in GenBank. Not all the sequences are coding, namely are a template for a protein. In the human genome only 3%-5% of the sequences are coding. Due to the size of the database, manual searching of genes who do code for proteins is not practical. We need to find a way for automatic finding of genes.

Peer Itsik
2000-12-25