Difference between revisions of "Phosphatase Subfamily FCP1 Tech Notes"

From PhosphataseWiki
Jump to: navigation, search
(PSI-BLAST)
Line 9: Line 9:
 
===== PSI-BLAST =====
 
===== PSI-BLAST =====
  
PSI-BLAST the FCP1_C region (716-961 as defined by CDD):
+
We first PSI-BLASTed the FCP1_C region (716-961 as defined by CDD) of human FCP1. We eyeballed the hits in the third round, which started to contain animal [http://www.ncbi.nlm.nih.gov/gene?cmd=retrieve&list_uids=23247 KIAA0556], but failed to hit any region in Drosophila melanogaster FCP1. We then stopped our search using human FCP1. The full sequence of human FCP1 is below:
  
 
MEVPAAGRVPAEGAPTAAVAEVRCPGPAPLRLLEWRVAAGAAVRIGSVLAVFEAAASAQSSGASQSRVASGGCVRPARPERRLRSERAGVVRELCAQPGQVVAPGAVLVRLEGCSHPVVMKGLCAECGQDLTQLQSKNGKQQVPLSTATVSMVHSVPELMVSSEQAEQLGREDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIFHFQLGRGEPMLHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFPCGDSMVCIIDDREDVWKFAPNLITVKKYVYFQGTGDMNAPPGSRESQTRKKVNHSRGTEVSEPSPPVRDPEGVTQAPGVEPSNGLEKPARELNGSEAATPRDSPRPGKPDERDIWPPAQAPTSSQELAGAPEPQGSCAQGGRVAPGQRPAQGATGTDLDFDLSSDSESSSESEGTKSSSSASDGESEGKRGRQKPKAAPEGAGALAQGSSLEPGRPAAPSLPGEAEPGAHAPDKEPELGGQEEGERDGLCGLGNGCADRKEAETESQNSELSGVTAGESLDQSMEEEEEEDTDEDDHLIYLEEILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTREHYHATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDWLWSCLERWDKVEEQLFPLRDDHTKAQRENSPAAFPDREGVPPTALFHPMPVLPKAQPGPEVRIYDSNTGKLIRTGARGPPAPSSSLPIRQEPSSFRAVPPPQPQMFGEELPDAQDGEQPGPSRRKRQPSMSETMPLYTLCKEDLESMDKEVDDILGEGSDDSDSEKRRPEEQEEEPQPRKPGTRRERTLGAPASSERSAAGGRGPRGHKRKLNEEDAASESSRESSNEDEGSSSEADEMAKALEAELNDLM
 
MEVPAAGRVPAEGAPTAAVAEVRCPGPAPLRLLEWRVAAGAAVRIGSVLAVFEAAASAQSSGASQSRVASGGCVRPARPERRLRSERAGVVRELCAQPGQVVAPGAVLVRLEGCSHPVVMKGLCAECGQDLTQLQSKNGKQQVPLSTATVSMVHSVPELMVSSEQAEQLGREDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIFHFQLGRGEPMLHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFPCGDSMVCIIDDREDVWKFAPNLITVKKYVYFQGTGDMNAPPGSRESQTRKKVNHSRGTEVSEPSPPVRDPEGVTQAPGVEPSNGLEKPARELNGSEAATPRDSPRPGKPDERDIWPPAQAPTSSQELAGAPEPQGSCAQGGRVAPGQRPAQGATGTDLDFDLSSDSESSSESEGTKSSSSASDGESEGKRGRQKPKAAPEGAGALAQGSSLEPGRPAAPSLPGEAEPGAHAPDKEPELGGQEEGERDGLCGLGNGCADRKEAETESQNSELSGVTAGESLDQSMEEEEEEDTDEDDHLIYLEEILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTREHYHATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDWLWSCLERWDKVEEQLFPLRDDHTKAQRENSPAAFPDREGVPPTALFHPMPVLPKAQPGPEVRIYDSNTGKLIRTGARGPPAPSSSLPIRQEPSSFRAVPPPQPQMFGEELPDAQDGEQPGPSRRKRQPSMSETMPLYTLCKEDLESMDKEVDDILGEGSDDSDSEKRRPEEQEEEPQPRKPGTRRERTLGAPASSERSAAGGRGPRGHKRKLNEEDAASESSRESSNEDEGSSSEADEMAKALEAELNDLM
 +
 +
 +
We then PSI-BLASTed the region right next to BRCT domain (668-880) of fruit fly FCP1. The full sequence of fruit fly FCP1 is below:
 +
 +
MQNIPDEEGATPSRAPGGVAASGAAEGDDGGGGSGAKNSASGNSNNNTSGGGIIVLRAALGENEARAVINKWRVREGQFVSAAQILFLYQPVGVDAKDAKDAGKPGGDCAIQRYKSQRAGVVKKRLRKEGELLTKGDAILELSECIHTTVIKDMCADCGADLRQNENGQTSEASVPMVHTMPDLKVTQKLAQKLGHDDTRRLLADRKLVLLVDLDQTVIHTTNDTVPDNIKGIYHFQLYGPHSPWYHTRLRPGTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSRDECFNATSKTDNLKALFPNGDSMVCIIDDREDVWNMASNLIQVKPYHFFQHTGDINAPPGLSKHELDGEGVDFKEITEKHGDKDKTESSSEVKPEDTDKGDNTVTSTSKDDDVNEESVDVFEIEGDAKDPEVSNASSATEAPKEPRDKLNGKTNAEDIVVIDDSSSGSPDAEKAASDGEDVVVIDDNSKESTKAEVPPTPAEKNEVVASSTTSPDEKRPSADADVATTSKTPSLRAPLEGQKQIEIEDPDDYLLYLEVILRNIHKRFYSIYDETTEIPDLKVIVPKIRSEVLRGKNLVFSGLVPTQMKLEQSRAYFIAKSLGAEVKPNIDKEITHLVAVNAGTYKVNAAKKEPAIKVVNANWLWTCAERWEHVEEKLFPLDRKVRNKGRQPPAHCHSPEHVVNYSERSEISPSSSKQQEEQSGNFRETLNPLLVFTNADIESMNKDYETFFESDSSSDEGPVNFENPPMDKKLLKRKREDDNSNRAHDFFTRSDDIMIGAPNLVEVDISSNEEADDNNEKEDDDDEMPSAKFRRGEDLPSDLELGSESNSEKEPEDEDDGEWNMMGAALEREFLGLEDFDM

Revision as of 18:43, 15 October 2015

Phosphatase Classification: Fold HAD: Superfamily HAD: Family FCP: Subfamily FCP1: Technical Notes

FCP1_C domain

Using Pfam profile

In order to find FCP1_C domain phylogenetic distribution among FCP1, we obtained the FCP1 from our internal orthology database, which contains 227 from 183 eukaryotic genomes. We then searched the FCP1_C domain by Pfam web server.

We also searched the FCP1_C domain region of human FCP1 defined by Pfam against NR dataset limiting to metazoa excluding deuterosomes through NCBI BLAST server. We found no hit has E-value lower than 0.1. We did not find FCP1_C in nematostella sequence in NR dataset, because our updated nematostella sequence is different from the one in NR.

PSI-BLAST

We first PSI-BLASTed the FCP1_C region (716-961 as defined by CDD) of human FCP1. We eyeballed the hits in the third round, which started to contain animal KIAA0556, but failed to hit any region in Drosophila melanogaster FCP1. We then stopped our search using human FCP1. The full sequence of human FCP1 is below:

MEVPAAGRVPAEGAPTAAVAEVRCPGPAPLRLLEWRVAAGAAVRIGSVLAVFEAAASAQSSGASQSRVASGGCVRPARPERRLRSERAGVVRELCAQPGQVVAPGAVLVRLEGCSHPVVMKGLCAECGQDLTQLQSKNGKQQVPLSTATVSMVHSVPELMVSSEQAEQLGREDQQRLHRNRKLVLMVDLDQTLIHTTEQHCQQMSNKGIFHFQLGRGEPMLHTRLRPHCKDFLEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLFSHRILSRDECIDPFSKTGNLRNLFPCGDSMVCIIDDREDVWKFAPNLITVKKYVYFQGTGDMNAPPGSRESQTRKKVNHSRGTEVSEPSPPVRDPEGVTQAPGVEPSNGLEKPARELNGSEAATPRDSPRPGKPDERDIWPPAQAPTSSQELAGAPEPQGSCAQGGRVAPGQRPAQGATGTDLDFDLSSDSESSSESEGTKSSSSASDGESEGKRGRQKPKAAPEGAGALAQGSSLEPGRPAAPSLPGEAEPGAHAPDKEPELGGQEEGERDGLCGLGNGCADRKEAETESQNSELSGVTAGESLDQSMEEEEEEDTDEDDHLIYLEEILVRVHTDYYAKYDRYLNKEIEEAPDIRKIVPELKSKVLADVAIIFSGLHPTNFPIEKTREHYHATALGAKILTRLVLSPDAPDRATHLIAARAGTEKVLQAQECGHLHVVNPDWLWSCLERWDKVEEQLFPLRDDHTKAQRENSPAAFPDREGVPPTALFHPMPVLPKAQPGPEVRIYDSNTGKLIRTGARGPPAPSSSLPIRQEPSSFRAVPPPQPQMFGEELPDAQDGEQPGPSRRKRQPSMSETMPLYTLCKEDLESMDKEVDDILGEGSDDSDSEKRRPEEQEEEPQPRKPGTRRERTLGAPASSERSAAGGRGPRGHKRKLNEEDAASESSRESSNEDEGSSSEADEMAKALEAELNDLM


We then PSI-BLASTed the region right next to BRCT domain (668-880) of fruit fly FCP1. The full sequence of fruit fly FCP1 is below:

MQNIPDEEGATPSRAPGGVAASGAAEGDDGGGGSGAKNSASGNSNNNTSGGGIIVLRAALGENEARAVINKWRVREGQFVSAAQILFLYQPVGVDAKDAKDAGKPGGDCAIQRYKSQRAGVVKKRLRKEGELLTKGDAILELSECIHTTVIKDMCADCGADLRQNENGQTSEASVPMVHTMPDLKVTQKLAQKLGHDDTRRLLADRKLVLLVDLDQTVIHTTNDTVPDNIKGIYHFQLYGPHSPWYHTRLRPGTAEFLERMSQLYELHICTFGARNYAHMIAQLLDPEGKFFSHRILSRDECFNATSKTDNLKALFPNGDSMVCIIDDREDVWNMASNLIQVKPYHFFQHTGDINAPPGLSKHELDGEGVDFKEITEKHGDKDKTESSSEVKPEDTDKGDNTVTSTSKDDDVNEESVDVFEIEGDAKDPEVSNASSATEAPKEPRDKLNGKTNAEDIVVIDDSSSGSPDAEKAASDGEDVVVIDDNSKESTKAEVPPTPAEKNEVVASSTTSPDEKRPSADADVATTSKTPSLRAPLEGQKQIEIEDPDDYLLYLEVILRNIHKRFYSIYDETTEIPDLKVIVPKIRSEVLRGKNLVFSGLVPTQMKLEQSRAYFIAKSLGAEVKPNIDKEITHLVAVNAGTYKVNAAKKEPAIKVVNANWLWTCAERWEHVEEKLFPLDRKVRNKGRQPPAHCHSPEHVVNYSERSEISPSSSKQQEEQSGNFRETLNPLLVFTNADIESMNKDYETFFESDSSSDEGPVNFENPPMDKKLLKRKREDDNSNRAHDFFTRSDDIMIGAPNLVEVDISSNEEADDNNEKEDDDDEMPSAKFRRGEDLPSDLELGSESNSEKEPEDEDDGEWNMMGAALEREFLGLEDFDM