Difference between revisions of "Protein Domain"

From PhosphataseWiki
Jump to: navigation, search
(PEST sequence)
(Statistics)
 
(38 intermediate revisions by one user not shown)
Line 5: Line 5:
 
! Domain
 
! Domain
 
! Function
 
! Function
! Human protein number
+
! Human protein number (domain number)
! Human PP number
+
! Human protein phosphatase (PP) number
 
! PP subfamily
 
! PP subfamily
! Length
+
! Length (aa)
 +
|-
 +
| C1
 +
| Bind to secondary messenger [https://en.wikipedia.org/wiki/Diglyceride DAG]
 +
|
 +
|
 +
|
 +
| ~50
 +
|-
 +
| ADF:Gelsolin
 +
|
 +
|
 +
|
 +
|
 +
|
 +
|-
 +
| ADF:Coffilin_ADF
 +
|
 +
|
 +
|
 +
|
 +
|
 +
|-
 +
| EF hand
 +
|
 +
|
 +
|
 +
|
 +
|
 
|-
 
|-
 
| FERM
 
| FERM
Line 17: Line 45:
 
|
 
|
 
|-
 
|-
| PTB
+
| IQ motif
| Bind to pTyr
+
|
| 27
+
|  
 +
|
 
|
 
|
 
|
 
|
| ~130 aa
 
 
|-
 
|-
 
| PDZ
 
| PDZ
Line 28: Line 56:
 
| ~180 (~260)
 
| ~180 (~260)
 
|  
 
|  
|
+
|  
| 80-90 aa
+
| 80-90
 
|-
 
|-
 
| PEST
 
| PEST
Line 37: Line 65:
 
|
 
|
 
|  
 
|  
 +
|-
 +
| PTB
 +
| Bind to pTyr
 +
| 27
 +
|
 +
|
 +
| ~130
 
|-
 
|-
 
| SH2
 
| SH2
Line 42: Line 77:
 
| 115 (120)
 
| 115 (120)
 
|  
 
|  
|
+
| PTPN6
| ~100 aa
+
| ~100
 
|-
 
|-
 
| SH3
 
| SH3
Line 49: Line 84:
 
| (~300)
 
| (~300)
 
|
 
|
 +
| STS
 +
| ~60
 +
|-
 +
| THAP
 +
| Bind to DNA
 +
| 12 (12) <cite>Roussigne03</cite>
 +
| 0
 
|
 
|
| ~60 aa
+
| 80-90
 +
|-
 +
| TPR
 +
|
 +
|
 +
|
 +
|
 +
|
 +
|-
 +
| Zf:FYVE/PHD
 +
|
 +
|
 +
|
 +
|
 +
|
 
|}
 
|}
  
Line 57: Line 113:
 
=== Description ===
 
=== Description ===
  
===== [http://en.wikipedia.org/wiki/FERM_domain FERM] =====
+
===== [http://pfam.xfam.org/clan/ADF ADF]: actin cytoskeleton organization =====
 +
ADF stands for Actin Depolymerizing Factor, involved in the rapid recycling of their actin cytoskeleton to enable a dynamic change in their shape.
 +
====== [https://en.wikipedia.org/wiki/Gelsolin Gelsolin] ======
 +
====== [http://pfam.xfam.org/family/PF00241 Cofflin_ADF] ======
 +
 
 +
===== [https://en.wikipedia.org/wiki/C1_domain C1]: DAG binding =====
 +
C1 domain (also known as phorbol esters/diacylglycerol binding domain) binds an important secondary messenger diacylglycerol (DAG), as well as the analogous phorbol esters. It is characterised by a rich cysteine and histidine content. The C1 domain is originally found in the N-terminal region of conservation in protein kinase C domains. It is usually around 50 aa long.
 +
 
 +
===== [https://en.wikipedia.org/wiki/C2_domain C2]: membrane targeting =====
 +
"A C2 domain is a protein structural domain involved in targeting proteins to cell membranes.  The C2 domain in PTEN, brings the phosphatase domain into contact with the plasma membrane, where it can dephosphorylate its substrate, phosphatidylinositol (3,4,5)-trisphosphate (PIP3), without removing it from the membrane - which would be energetically very costly. The C2 domain is currently only known from eukaryotes. Over 17 distinct clades of C2 domains have been identified. Most C2 families can be traced back to basal eukaryotic species indicating an early diversification before the last eukaryotic common ancestor (LECA). Only the PKC-C2 domain family contains conserved calcium-binding residues, suggesting the typical calcium-dependent membrane interaction is a derived feature limited in PKC-C2 domains. "
 +
 
 +
===== [https://en.wikipedia.org/wiki/EF_hand EF hand]: Ca<sup>2+</sup> binding =====
 +
The EF hand is a helix-loop-helix structural domain or motif found in a large family of calcium-binding proteins. The EF-hand motif contains a helix-loop-helix topology, much like the spread thumb and forefinger of the human hand, in which the Ca2+ ions are coordinated by ligands within the loop. It usually has about 30 aa residues.
 +
 
 +
===== [http://en.wikipedia.org/wiki/FERM_domain FERM]: cytoskeletal-associated at the interface between plasma membrane and cytoskeleton =====
 
FERM domain (F for 4.1 protein, E for ezrin, R for radixin and M for moesin) is a widespread protein module involved in localising proteins to the plasma membrane. FERM domains are found in a number of cytoskeletal-associated proteins that associate with various proteins at the interface between the plasma membrane and the cytoskeleton. The FERM domain is located at the N terminus in the majority of proteins in which it is found.
 
FERM domain (F for 4.1 protein, E for ezrin, R for radixin and M for moesin) is a widespread protein module involved in localising proteins to the plasma membrane. FERM domains are found in a number of cytoskeletal-associated proteins that associate with various proteins at the interface between the plasma membrane and the cytoskeleton. The FERM domain is located at the N terminus in the majority of proteins in which it is found.
  
===== [http://en.wikipedia.org/wiki/Phosphotyrosine-binding_domain PTB] =====
+
===== [https://en.wikipedia.org/wiki/IQ_calmodulin-binding_motif IQ]: calmodulin (Ca<sup>2+</sup> sensor) binding =====
The PTB domain is a structural domain of ~130 aa (Pfam model). It usually binds to phosphorylated tyrosine. However, PTB domain of SHC can bind to NPLH sequence found in the carboxyl terminus of murine PTPN12/PTP-PEST <cite>Charest94</cite>. PTB domain is found at least in 27 genes in human genome.
+
The IQ  calmodulin-binding motif is an amino acid sequence motif containing the following sequence: [FILV]Qxxx[RK]Gxxx[RK]xx[FILVWY]. The term "IQ" refers to the first two amino acids of the motif: isoleucine (commonly) and glutamine (invariably). Calmodulin (CaM) is recognized as a major calcium (Ca2+) sensor and orchestrator of regulatory events through its interaction with a diverse group of cellular proteins.
  
===== [http://en.wikipedia.org/wiki/PDZ_domain PDZ] =====
+
===== [http://en.wikipedia.org/wiki/PDZ_domain PDZ]: binding to the C-terminal of specific proteins =====
 
The PDZ domain is a common structural domain of 80-90 amino-acids found in the signaling proteins of all three kingdoms of life. PDZ is an acronym combining the first letters of three proteins — post synaptic density protein (PSD95), Drosophila disc large tumor suppressor (Dlg1), and zonula occludens-1 protein (zo-1) — which were first discovered to share the domain. It has previously been referred to as DHR (Dlg homologous region) or GLGF (glycine-leucine-glycine-phenylalanine) domains.  
 
The PDZ domain is a common structural domain of 80-90 amino-acids found in the signaling proteins of all three kingdoms of life. PDZ is an acronym combining the first letters of three proteins — post synaptic density protein (PSD95), Drosophila disc large tumor suppressor (Dlg1), and zonula occludens-1 protein (zo-1) — which were first discovered to share the domain. It has previously been referred to as DHR (Dlg homologous region) or GLGF (glycine-leucine-glycine-phenylalanine) domains.  
  
 
Proteins with these domains help hold together and organize signaling complexes at cellular membranes. In general PDZ domains bind to a short region of the C-terminus of other specific proteins. These short regions bind to the PDZ domain by beta sheet augmentation. This means that the beta sheet in the PDZ domain is extended by the addition of a further beta strand from the tail of the binding partner protein. About 260 PDZ domains within 180 proteins in human have been found.
 
Proteins with these domains help hold together and organize signaling complexes at cellular membranes. In general PDZ domains bind to a short region of the C-terminus of other specific proteins. These short regions bind to the PDZ domain by beta sheet augmentation. This means that the beta sheet in the PDZ domain is extended by the addition of a further beta strand from the tail of the binding partner protein. About 260 PDZ domains within 180 proteins in human have been found.
  
===== [http://en.wikipedia.org/wiki/PEST_sequence PEST] =====
+
===== [http://en.wikipedia.org/wiki/PEST_sequence PEST]: rapid degradation signal =====
A PEST sequence is a peptide sequence that is rich in proline (P), glutamic acid (E), serine (S), and threonine (T). This sequence is associated with proteins that are rapidly degraded, possibly via the proteasome or calpain.
+
A PEST sequence is a peptide sequence that is rich in proline (P), glutamic acid (E), serine (S), and threonine (T). This sequence is associated with proteins that are rapidly degraded, possibly via the proteasome or cal pain. The computational tool [http://emboss.bioinformatics.nl/cgi-bin/emboss/epestfind ePESTfind] is used to predict PEST sequence.
  
The computational tool [http://emboss.bioinformatics.nl/cgi-bin/emboss/epestfind ePESTfind] is used to predict PEST sequence.
+
===== [[Protein_Domain_PH|PH]]: phosphatidylinositol lipids binding =====
 +
====== [[Protein_Domain_GRAM|GRAM]] ======
  
===== [http://en.wikipedia.org/wiki/SH2_domain SH2] =====
+
===== [http://en.wikipedia.org/wiki/Phosphotyrosine-binding_domain PTB] =====
 +
The PTB domain is a structural domain of ~130 aa (Pfam model). It usually binds to phosphorylated tyrosine. However, PTB domain of SHC can bind to NPLH sequence found in the carboxyl terminus of murine PTPN12/PTP-PEST <cite>Charest94</cite>. PTB domain is found at least in 27 genes in human genome.
 +
 
 +
===== [http://en.wikipedia.org/wiki/SH2_domain SH2]: pTyr binding =====
 
The SH2 (Src Homology 2) domain is a structurally conserved protein domain of ~100 aa long. SH2 typically binds to pTyr (phosphorylated tyrosine) which usually locates in peptide linear motif of 3-6 aa long. About 120 SH2 domains within 115 proteins in human have been found.
 
The SH2 (Src Homology 2) domain is a structurally conserved protein domain of ~100 aa long. SH2 typically binds to pTyr (phosphorylated tyrosine) which usually locates in peptide linear motif of 3-6 aa long. About 120 SH2 domains within 115 proteins in human have been found.
  
===== [http://en.wikipedia.org/wiki/SH3_domain SH3] =====
+
===== [http://en.wikipedia.org/wiki/SH3_domain SH3]: proline-rich region binding =====
 
The SRC Homology 3 Domain (or SH3 domain) is a small protein domain of about 60 aa long. SH3 typically via binding to proline-rich peptides in their respective binding partner. Approximately 300 SH3 domains are found in proteins in human genome.
 
The SRC Homology 3 Domain (or SH3 domain) is a small protein domain of about 60 aa long. SH3 typically via binding to proline-rich peptides in their respective binding partner. Approximately 300 SH3 domains are found in proteins in human genome.
 +
 +
===== THAP =====
 +
DNA-binding domain similar to P element transposase <cite>Roussigne03</cite>.
 +
 +
===== [https://en.wikipedia.org/wiki/Tetratricopeptide TPR]: protein-protein interacting structural motif =====
 +
TPR is short for tetratricopeptide repeat (TPR). It is a structural motif mediates protein-protein interactions. It consists in a degenerate 34 amino acid sequence motif identified in a wide variety of proteins. It is often but not always found in tandem arrays.
 +
 +
===== Zinc finger =====
 +
====== [http://pfam.xfam.org/family/PF01363.17 FYVE_PHD]: PI3P binding ======
 +
FYVE zinc finger domain is named after the four cysteine-rich proteins: Fab 1 (yeast orthologue of PIKfyve), YOTB, Vac 1 (vesicle transport protein), and EEA1, in which it has been found. FYVE domains bind Phosphatidylinositol 3-phosphate, in a way dependent on its metal ion coordination and basic amino acids. The FYVE domain inserts into cell membranes in a pH dependent manner. The FYVE domain has been connected to vacuolar protein sorting and endosome function.
  
 
=== References ===
 
=== References ===
Line 83: Line 167:
 
#Charest96 pmid=8626541
 
#Charest96 pmid=8626541
 
#Frame10 pmid=20966971
 
#Frame10 pmid=20966971
 +
#Roussigne03 pmid=12575992
 
</biblio>
 
</biblio>

Latest revision as of 17:51, 11 September 2015

Most domains can be computationally profiled by Hidden Markov Model (HMM), which can be found in Pfam, SMART, NCBI CDD, Gene3D, Superfamily and similar databases.

Statistics

Domain Function Human protein number (domain number) Human protein phosphatase (PP) number PP subfamily Length (aa)
C1 Bind to secondary messenger DAG ~50
ADF:Gelsolin
ADF:Coffilin_ADF
EF hand
FERM Target proteins to plasma membrane >30 (~50) [1]
IQ motif
PDZ Bind to specific C-terminal short region ~180 (~260) 80-90
PEST Rapid degradation signal
PTB Bind to pTyr 27 ~130
SH2 Bind to pTyr 115 (120) PTPN6 ~100
SH3 Bind to proline-rich peptide (~300) STS ~60
THAP Bind to DNA 12 (12) [2] 0 80-90
TPR
Zf:FYVE/PHD

Note: 1) PP is short for protein phosphatase. 2) Numbers in parenthesis are domain numbers.

Description

ADF: actin cytoskeleton organization

ADF stands for Actin Depolymerizing Factor, involved in the rapid recycling of their actin cytoskeleton to enable a dynamic change in their shape.

Gelsolin
Cofflin_ADF
C1: DAG binding

C1 domain (also known as phorbol esters/diacylglycerol binding domain) binds an important secondary messenger diacylglycerol (DAG), as well as the analogous phorbol esters. It is characterised by a rich cysteine and histidine content. The C1 domain is originally found in the N-terminal region of conservation in protein kinase C domains. It is usually around 50 aa long.

C2: membrane targeting

"A C2 domain is a protein structural domain involved in targeting proteins to cell membranes. The C2 domain in PTEN, brings the phosphatase domain into contact with the plasma membrane, where it can dephosphorylate its substrate, phosphatidylinositol (3,4,5)-trisphosphate (PIP3), without removing it from the membrane - which would be energetically very costly. The C2 domain is currently only known from eukaryotes. Over 17 distinct clades of C2 domains have been identified. Most C2 families can be traced back to basal eukaryotic species indicating an early diversification before the last eukaryotic common ancestor (LECA). Only the PKC-C2 domain family contains conserved calcium-binding residues, suggesting the typical calcium-dependent membrane interaction is a derived feature limited in PKC-C2 domains. "

EF hand: Ca2+ binding

The EF hand is a helix-loop-helix structural domain or motif found in a large family of calcium-binding proteins. The EF-hand motif contains a helix-loop-helix topology, much like the spread thumb and forefinger of the human hand, in which the Ca2+ ions are coordinated by ligands within the loop. It usually has about 30 aa residues.

FERM: cytoskeletal-associated at the interface between plasma membrane and cytoskeleton

FERM domain (F for 4.1 protein, E for ezrin, R for radixin and M for moesin) is a widespread protein module involved in localising proteins to the plasma membrane. FERM domains are found in a number of cytoskeletal-associated proteins that associate with various proteins at the interface between the plasma membrane and the cytoskeleton. The FERM domain is located at the N terminus in the majority of proteins in which it is found.

IQ: calmodulin (Ca2+ sensor) binding

The IQ calmodulin-binding motif is an amino acid sequence motif containing the following sequence: [FILV]Qxxx[RK]Gxxx[RK]xx[FILVWY]. The term "IQ" refers to the first two amino acids of the motif: isoleucine (commonly) and glutamine (invariably). Calmodulin (CaM) is recognized as a major calcium (Ca2+) sensor and orchestrator of regulatory events through its interaction with a diverse group of cellular proteins.

PDZ: binding to the C-terminal of specific proteins

The PDZ domain is a common structural domain of 80-90 amino-acids found in the signaling proteins of all three kingdoms of life. PDZ is an acronym combining the first letters of three proteins — post synaptic density protein (PSD95), Drosophila disc large tumor suppressor (Dlg1), and zonula occludens-1 protein (zo-1) — which were first discovered to share the domain. It has previously been referred to as DHR (Dlg homologous region) or GLGF (glycine-leucine-glycine-phenylalanine) domains.

Proteins with these domains help hold together and organize signaling complexes at cellular membranes. In general PDZ domains bind to a short region of the C-terminus of other specific proteins. These short regions bind to the PDZ domain by beta sheet augmentation. This means that the beta sheet in the PDZ domain is extended by the addition of a further beta strand from the tail of the binding partner protein. About 260 PDZ domains within 180 proteins in human have been found.

PEST: rapid degradation signal

A PEST sequence is a peptide sequence that is rich in proline (P), glutamic acid (E), serine (S), and threonine (T). This sequence is associated with proteins that are rapidly degraded, possibly via the proteasome or cal pain. The computational tool ePESTfind is used to predict PEST sequence.

PH: phosphatidylinositol lipids binding
GRAM
PTB

The PTB domain is a structural domain of ~130 aa (Pfam model). It usually binds to phosphorylated tyrosine. However, PTB domain of SHC can bind to NPLH sequence found in the carboxyl terminus of murine PTPN12/PTP-PEST [3]. PTB domain is found at least in 27 genes in human genome.

SH2: pTyr binding

The SH2 (Src Homology 2) domain is a structurally conserved protein domain of ~100 aa long. SH2 typically binds to pTyr (phosphorylated tyrosine) which usually locates in peptide linear motif of 3-6 aa long. About 120 SH2 domains within 115 proteins in human have been found.

SH3: proline-rich region binding

The SRC Homology 3 Domain (or SH3 domain) is a small protein domain of about 60 aa long. SH3 typically via binding to proline-rich peptides in their respective binding partner. Approximately 300 SH3 domains are found in proteins in human genome.

THAP

DNA-binding domain similar to P element transposase [2].

TPR: protein-protein interacting structural motif

TPR is short for tetratricopeptide repeat (TPR). It is a structural motif mediates protein-protein interactions. It consists in a degenerate 34 amino acid sequence motif identified in a wide variety of proteins. It is often but not always found in tandem arrays.

Zinc finger
FYVE_PHD: PI3P binding

FYVE zinc finger domain is named after the four cysteine-rich proteins: Fab 1 (yeast orthologue of PIKfyve), YOTB, Vac 1 (vesicle transport protein), and EEA1, in which it has been found. FYVE domains bind Phosphatidylinositol 3-phosphate, in a way dependent on its metal ion coordination and basic amino acids. The FYVE domain inserts into cell membranes in a pH dependent manner. The FYVE domain has been connected to vacuolar protein sorting and endosome function.

References

  1. Frame MC, Patel H, Serrels B, Lietha D, and Eck MJ. The FERM domain: organizing the structure and function of FAK. Nat Rev Mol Cell Biol. 2010 Nov;11(11):802-14. DOI:10.1038/nrm2996 | PubMed ID:20966971 | HubMed [Frame10]
  2. Roussigne M, Kossida S, Lavigne AC, Clouaire T, Ecochard V, Glories A, Amalric F, and Girard JP. The THAP domain: a novel protein motif with similarity to the DNA-binding domain of P element transposase. Trends Biochem Sci. 2003 Feb;28(2):66-9. DOI:10.1016/S0968-0004(02)00013-0 | PubMed ID:12575992 | HubMed [Roussigne03]
  3. Charest A, Wagner J, Jacob S, McGlade CJ, and Tremblay ML. Phosphotyrosine-independent binding of SHC to the NPLH sequence of murine protein-tyrosine phosphatase-PEST. Evidence for extended phosphotyrosine binding/phosphotyrosine interaction domain recognition specificity. J Biol Chem. 1996 Apr 5;271(14):8424-9. DOI:10.1074/jbc.271.14.8424 | PubMed ID:8626541 | HubMed [Charest96]
All Medline abstracts: PubMed | HubMed