1. What is the Protein DataBank (PDB) and its purpose?
The Protein DataBank (PDB), established in 1971, is a freely accessible, single global archive of experimentally determined structure data for biological macromolecules. Its aim is to provide researchers with a comprehensive resource for studying the three-dimensional structures of proteins, nucleic acids, and complex assemblies. The PDB offers valuable insights into the molecular mechanisms of biological processes and facilitates the development of new drugs and therapies. It serves as a crucial platform for scientists to share and analyze structural data, enabling collaborative research and advancements in the field of structural biology.
read more
2. What are the limitations of experimental structure determination techniques?
Experimental structure determination techniques, such as X-ray crystallography, Nuclear Magnetic Resonance (NMR), and Cryo-electron microscopy (cryo-EM), have their own limitations. X-ray crystallography provides high-resolution structures but requires crystallization of the protein, which may not always be possible. NMR is suitable for smaller proteins but has lower resolution compared to X-ray crystallography. Cryo-EM is more suitable for complexes but may have lower resolution for smaller proteins. It is important to be aware of these limitations when using experimental data to build computational models. Understanding the limitations helps researchers choose the most appropriate technique for their specific protein structure determination needs.
read more
3. What information does an ATOM line in a PDB file contain?
An ATOM line in a PDB file contains extensive information about a resolved atom in the crystal structure. The meaning of each column is explained below. The record name indicates the type of line, with ATOM being the most common. The atom serial number represents the number of the atom in the total structural complex, which is not renewed for the next molecule in the complex. The atom name is the abbreviation of the atom name, such as CA for alpha carbon. The alternate location indicator shows different locations for an atom, with a character (A, B, C, etc.) indicating the location. Residue refers to the amino-acid residue to which the atom belongs, in 3-letter notation. Chain indicates the molecule to which the atom belongs. Residue sequence number shows the position of the amino acid in the chain, with occasional jumps due to failed elucidation. Code for insertion of residues is rarely used and helps match important amino acids in different versions of a structure. X, Y, and Z coordinates represent the spatial coordinates of the atom in Angstroms.
read more
4. What additional descriptions and data can be accessed through the PDB's browser-based user interface?
The PDB's browser-based user interface provides additional descriptions, derived data, and relevant cross references. Users can inspect a structure's Ramachandran plot, structure validation reports, and detailed descriptions of experimental methods used to determine a structure. The feature viewer displays data derived by the PDB, such as secondary structure, disorder calculations, and hydrophobicity, alongside information from other databases like UniProt, Pfam, and Phosphosite. It also offers structural information on PDB entries from the PDBsum web server and access to homology models from the Structural Biology Knowledgebase (SBKB) and Protein Model Portal.
read more