What Protein Are You?

Explore protein structure files in the public Protein Data Bank.


I. Introduction

The Protein Data Bank (PDB) stores all known three-dimensional (3D) structures of proteins in freely accessible datafiles. The datafile names for the more than 13,000 proteins in the PDB are all four characters long, beginning with a digit (usually 1) and usually followed by three letters. An example is 1EMB for the green fluorescent protein from jellyfish. The PDB is accessed at http://www.rcsb.org/pdb/.

II. Protein Name

Go to the PDB web site. The PDB homepage has a "Site Search" box for a protein identifier (ID). Protein IDs are usually the digit "1" followed by three letters. Enter the digit "1" followed by your three initials and click the Explore button. There is an excellent chance that the PDB has a 3D protein structure stored under that filename.

If no protein structure has that filename, try a simple variation. If your name has more initials, use a different 3-letter combination. If you have just two initials, like MN, add a third initial between them, like MAN, or following them, like MNA. If you have three initials like JKL but get no hit with filename 1JKL, try 1JLA, 1JAL, or some other 3-letter variation. Eventually you'll get a hit. As a last resort, try the initials of a friend or family member (not a classmate).

III. Protein Game

Once your search has found a protein examine the initial page. Note the protein ID at the top of the page, followed by more information. You can download or display the full annotation, including the atomic coordinates of all the protein atoms, by clicking on the small icons to the right of the PDB ID. You can view displays of the protein 3D structure at the right of the main page or in the "Display files" option at the left. Explore these different methods of displaying 3D structure. Another way to display the 3D structure is to download the PDB file (described above) to your local computer's desktop. Then display the structure with the 3D molecular viewing program RasMol on your computer. Chime and RasMol can be downloaded and installed from the Molecular Visualization Freeware page at http://www.umass.edu/microbio/rasmol/.

Read the file annotation and view the 3D structure to learn about your protein. When viewing, rotate the protein molecule, change display modes to highlight the polypeptide backbone and to distinguish subunits and ligands.  Detailed instructions for the use of these programs may be found in the tutorials and instruction manuals also found at http://www.umass.edu/microbio/rasmol

IV. Scoring

Score points for your protein, from features as seen in the PDB file. Score in each of these categories. If your protein has identical subunits, score the structural features of one subunit only. If a protein fits more than one description in a category, take the highest score (not the sum).

Add up the total from all ten categories. Generally speaking, the rarer, more unusual proteins in the PDB have the higher scores. Ask around among your friends and classmates for their protein names and scores.