Post-translational modification

In molecular biology, post-translational modification (PTM) is the covalent process of changing proteins following protein biosynthesis. PTMs may involve enzymes or occur spontaneously. Proteins are created by ribosomes, which translate mRNA into polypeptide chains, which may then change to form the mature protein product. PTMs are important components in cell signalling, as for example when prohormones are converted to hormones.

Post-translational modification of insulin. At the top, the ribosome translates a mRNA sequence into a protein, insulin, and passes the protein through the endoplasmic reticulum, where it is cut, folded, and held in shape by disulfide (-S-S-) bonds. Then the protein passes through the golgi apparatus, where it is packaged into a vesicle. In the vesicle, more parts are cut off, and it turns into mature insulin.

Post-translational modifications can occur on the amino acid side chains or at the protein's C- or N- termini.[1] They can expand the chemical set of the 22 amino acids by changing an existing functional group or adding a new one such as phosphate. Phosphorylation is highly effective for controlling the enzyme activity and is the most common change after translation. [2] Many eukaryotic and prokaryotic proteins also have carbohydrate molecules attached to them in a process called glycosylation, which can promote protein folding and improve stability as well as serving regulatory functions. Attachment of lipid molecules, known as lipidation, often targets a protein or part of a protein attached to the cell membrane.

Other forms of post-translational modification consist of cleaving peptide bonds, as in processing a propeptide to a mature form or removing the initiator methionine residue. The formation of disulfide bonds from cysteine residues may also be referred to as a post-translational modification.[3] For instance, the peptide hormone insulin is cut twice after disulfide bonds are formed, and a propeptide is removed from the middle of the chain; the resulting protein consists of two polypeptide chains connected by disulfide bonds.

Some types of post-translational modification are consequences of oxidative stress. Carbonylation is one example that targets the modified protein for degradation and can result in the formation of protein aggregates.[4][5] Specific amino acid modifications can be used as biomarkers indicating oxidative damage.[6]

Sites that often undergo post-translational modification are those that have a functional group that can serve as a nucleophile in the reaction: the hydroxyl groups of serine, threonine, and tyrosine; the amine forms of lysine, arginine, and histidine; the thiolate anion of cysteine; the carboxylates of aspartate and glutamate; and the N- and C-termini. In addition, although the amide of asparagine is a weak nucleophile, it can serve as an attachment point for glycans. Rarer modifications can occur at oxidized methionines and at some methylene groups in side chains.[7]

Post-translational modification of proteins can be experimentally detected by a variety of techniques, including mass spectrometry, Eastern blotting, and Western blotting. Additional methods are provided in the #External links section.

PTMs involving addition of functional groups edit

Addition by an enzyme in vivo edit

Hydrophobic groups for membrane localization edit

Cofactors for enhanced enzymatic activity edit

Modifications of translation factors edit

Smaller chemical groups edit

Non-enzymatic modifications in vivo edit

Examples of non-enzymatic PTMs are glycation, glycoxidation, nitrosylation, oxidation, succination, and lipoxidation.[15]

Non-enzymatic additions in vitro edit

  • biotinylation: covalent attachment of a biotin moiety using a biotinylation reagent, typically for the purpose of labeling a protein.
  • carbamylation: the addition of Isocyanic acid to a protein's N-terminus or the side-chain of Lys or Cys residues, typically resulting from exposure to urea solutions.[18]
  • oxidation: addition of one or more Oxygen atoms to a susceptible side-chain, principally of Met, Trp, His or Cys residues. Formation of disulfide bonds between Cys residues.
  • pegylation: covalent attachment of polyethylene glycol (PEG) using a pegylation reagent, typically to the N-terminus or the side-chains of Lys residues. Pegylation is used to improve the efficacy of protein pharmaceuticals.

Conjugation with other proteins or peptides edit

Chemical modification of amino acids edit

Structural changes edit

Statistics edit

Common PTMs by frequency edit

In 2011, statistics of each post-translational modification experimentally and putatively detected have been compiled using proteome-wide information from the Swiss-Prot database.[24] The 10 most common experimentally found modifications were as follows:[25]

FrequencyModification
58383Phosphorylation
6751Acetylation
5526N-linked glycosylation
2844Amidation
1619Hydroxylation
1523Methylation
1133O-linked glycosylation
878Ubiquitylation
826Pyrrolidone carboxylic acid
504Sulfation

Common PTMs by residue edit

Some common post-translational modifications to specific amino-acid residues are shown below. Modifications occur on the side-chain unless indicated otherwise.

Amino AcidAbbrev.Modification
AlanineAla or AN-acetylation (N-terminus)
ArginineArg or Rdeimination to citrulline, methylation
AsparagineAsn or Ndeamidation to Asp or iso(Asp), N-linked glycosylation, spontaneous isopeptide bond formation
Aspartic acidAsp or Disomerization to isoaspartic acid, spontaneous isopeptide bond formation
CysteineCys or Cdisulfide-bond formation, oxidation to sulfenic, sulfinic or sulfonic acid, palmitoylation, N-acetylation (N-terminus), S-nitrosylation
GlutamineGln or Qcyclization to pyroglutamic acid (N-terminus), deamidation to Glutamic acid or isopeptide bond formation to a lysine by a transglutaminase
Glutamic acidGlu or Ecyclization to Pyroglutamic acid (N-terminus), gamma-carboxylation
GlycineGly or GN-Myristoylation (N-terminus), N-acetylation (N-terminus)
HistidineHis or HPhosphorylation
IsoleucineIle or I
LeucineLeu or L
LysineLys or Kacetylation, ubiquitylation, SUMOylation, methylation, hydroxylation leading to allysine, spontaneous isopeptide bond formation
MethionineMet or MN-acetylation (N-terminus), N-linked Ubiquitination, oxidation to sulfoxide or sulfone
PhenylalaninePhe or F
ProlinePro or Phydroxylation
SerineSer or SPhosphorylation, O-linked glycosylation, N-acetylation (N-terminus)
ThreonineThr or TPhosphorylation, O-linked glycosylation, N-acetylation (N-terminus)
TryptophanTrp or Wmono- or di-oxidation, formation of kynurenine, tryptophan tryptophylquinone
TyrosineTyr or Ysulfation, phosphorylation
ValineVal or VN-acetylation (N-terminus)

Databases and tools edit

Flowchart of the process and the data sources to predict PTMs.[26]

Protein sequences contain sequence motifs that are recognized by modifying enzymes, and which can be documented or predicted in PTM databases. With the large number of different modifications being discovered, there is a need to document this sort of information in databases. PTM information can be collected through experimental means or predicted from high-quality, manually curated data. Numerous databases have been created, often with a focus on certain taxonomic groups (e.g. human proteins) or other features.

List of resources edit

  • PhosphoSitePlus[27] – A database of comprehensive information and tools for the study of mammalian protein post-translational modification
  • ProteomeScout[28] – A database of proteins and post-translational modifications experimentally
  • Human Protein Reference Database[28] – A database for different modifications and understand different proteins, their class, and function/process related to disease causing proteins
  • PROSITE[29] – A database of Consensus patterns for many types of PTM's including sites
  • RESID[30] – A database consisting of a collection of annotations and structures for PTMs.
  • iPTMnet [31]– A database that integrates PTM information from several knowledgbases and text mining results.
  • dbPTM[26] – A database that shows different PTM's and information regarding their chemical components/structures and a frequency for amino acid modified site
  • Uniprot has PTM information although that may be less comprehensive than in more specialized databases.
    Effect of PTMs on protein function and physiological processes.[32]
  • The O-GlcNAc Database[33][34] - A curated database for protein O-GlcNAcylation and referencing more than 14 000 protein entries and 10 000 O-GlcNAc sites.

Tools edit

List of software for visualization of proteins and their PTMs

  • PyMOL[35] – introduce a set of common PTM's into protein models
  • AWESOME[36] – Interactive tool to see the role of single nucleotide polymorphisms to PTM's
  • Chimera[37] – Interactive Database to visualize molecules

Case examples edit

See also edit

References edit

External links edit

(Wayback Machine copy)

(Wayback Machine copy)