Amino acid

organic compound containing both amino and acidic functional groups, important in biology

Amino acids are the building blocks of proteins. In eukaryotes, there are 20 "standard" amino acids out of which almost all proteins are made.

Amino acids are the end product of protein. The general structure of an α-amino acid, with the amino group on the left and the carboxyl group on the right.

In biochemistry, an amino acid is any molecule that has both amine (NH2+R) and carboxyl (C=O) functional groups. In biochemistry, this term refers to alpha-amino acids with the general formula H2NCHRCOOH, where R is one of many side groups (see diagram).

There are hundreds

Across all forms of life, about 500 amino acids are known.[1] The most important thing that amino acids do is to be part of proteins, which are long chains of amino acids. Every protein has its own sequence of amino acids, and that sequence makes the protein take different shapes, and have different functions. Amino acids are like the alphabet for proteins; even though you only have a few letters, if you connect them, you can make many different sentences.

Nine of the 20 standard amino acids are "essential" amino acids for humans. They cannot be built (synthesised) from other compounds by the human body. They must be taken in as food. Others may be essential for some ages or medical conditions. Essential amino acids may also differ between species. Herbivores have to get their essential amino acids from their diet, which for some is almost entirely grass. Ruminants such as cows get some amino acids via microbes in the first two stomach chambers.

Structure

An amino acid is an organic chemical. It consists of an α-carbon atom that is covalently bonded to four groups.[2]

  • a hydrogen atom
  • an amino group (-NH2)
  • a carboxyl group (-COOH)
  • a variable R group

Every amino acid has at least one amino group (-NH2) and one carboxyl group (-COOH), except proline.

Gene expression and biochemistry

These are the proteinogenic amino acids, which are the building blocks for proteins. They are produced by cellular machinery coded for in the genetic code of any organism.[3]

Amino AcidShortAbbrev.Codon(s)Occurrence
in human proteins
(%)
Essential in humans
AlanineAAlaGCU, GCC, GCA, GCG7.8No
CysteineCCysUGU, UGC1.9Conditionally
Aspartic acidDAspGAU, GAC5.3No
Glutamic acidEGluGAA, GAG6.3Conditionally
PhenylalanineFPheUUU, UUC3.9Yes
GlycineGGlyGGU, GGC, GGA, GGG7.2Conditionally
HistidineHHisCAU, CAC2.3Yes
IsoleucineIIleAUU, AUC, AUA5.3Yes
LysineKLysAAA, AAG5.9Yes
LeucineLLeuUUA, UUG, CUU, CUC, CUA, CUG9.1Yes
MethionineMMetAUG2.3Yes
AsparagineNAsnAAU, AAC4.3No
PyrrolysineOPylUAG*0No
ProlinePProCCU, CCC, CCA, CCG5.2No
GlutamineQGlnCAA, CAG4.2No
ArginineRArgCGU, CGC, CGA, CGG, AGA, AGG5.1Conditionally
SerineSSerUCU, UCC, UCA, UCG, AGU, AGC6.8No
ThreonineTThrACU, ACC, ACA, ACG5.9Yes
SelenocysteineUSecUGA**>0No
ValineVValGUU, GUC, GUA, GUG6.6Yes
TryptophanWTrpUGG1.4Yes
TyrosineYTyrUAU, UAC3.2Conditionally
Stop codon-TermUAA, UAG, UGA††--

UAG is normally the amber stop codon, but encodes pyrrolysine if a PYLIS element is present.
UGA is normally the opal (or umber) stop codon, but encodes selenocysteine if a SECIS element is present.
The stop codon is not an amino acid, but is included for completeness.
UAG and UGA do not always act as stop codons (see above).
An essential amino acid cannot be synthesized in humans. It must be supplied in the diet. Conditionally essential amino acids are not normally required in the diet, but must be supplied to populations which do not make enough of it.

To these α-amino acids further in biosynthesis processes appearing non-essential ones are structurally (here by using SMILES notation) related:

OC(=O)C(N)–

  • ├ H .. 🅖 Glycine
  • ├ C .. 🅐 Alanine
  • │├ C .. 2-Aminobutanoic acid
  • ││├ C .. Norvaline
  • │││├ –2H .. 🅟 Proline (Dehydronorvaline)
  • │││├ C .. Norleucine
  • ││││└ N .. 🅚 Lysine
  • ││││    └ C(=O)C1N=CCC1C .. 🅞 Pyrrolysine
  • │││└ NC(=N)N .. 🅡 Arginine
  • ││├ C(=O)N .. 🅠 Glutamine
  • ││├ C(=O)O .. 🅔 Glutamic acid
  • ││├ O .. Homoserine
  • ││└ S .. Homocysteine
  • ││    └ C .. 🅜 Methionine
  • │├ C(C)C .. 🅛 Leucine
  • │├ C(=O)N .. 🅝 Asparagine
  • │├ C(=O)O .. 🅓 Aspartic acid
  • │├ C1=CNC=N1 .. 🅗 Histidine
  • │├ c1ccccc1 .. 🅕 Phenylalanine
  • │├ c1ccc(O)cc1 .. 🅨 Tyrosine
  • │├ C1=CNc2ccccc12 .. 🅦 Tryptophan
  • │├ C1=CNc2ccc(O)cc12 .. Oxitriptan
  • │├ c(cc1I)cc(I)c1-O-c2cc(I)c(O)c(I)c2 .. Thyroxine
  • │├ O .. 🅢 Serine
  • │├ S .. 🅒 Cysteine
  • │└ [SeH] .. 🅤 Selenocysteine
  • ├ C(C)C .. 🅥 Valine
  • ├ C(C)O .. 🅣 Threonine
  • └ C(C)CC .. 🅘 Isoleucine

References