Difference between revisions of "HMM PD0133"
From PhosphataseWiki
(→How the HMM is built) |
|||
Line 10: | Line 10: | ||
=== How the HMM is built === | === How the HMM is built === | ||
We PSI-BLASTed the sequence that containing the C1 domain in ''C. elegans'' mtm-5 (1540-1590 determined by searching the complete sequence against CDD and Pfam database). However, the search converged right after the 1st round. | We PSI-BLASTed the sequence that containing the C1 domain in ''C. elegans'' mtm-5 (1540-1590 determined by searching the complete sequence against CDD and Pfam database). However, the search converged right after the 1st round. | ||
+ | |||
+ | Below are the two full sequences used: | ||
+ | |||
+ | ''C. elegans'' sequence | ||
+ | MRDPDKVKSGPICDTVAVIVLEESDDENALPDVLHEVQSPHTSDNIPTSSIKKFARPRGWYNQSVSSPSEFFYQILTTERGTRRIAYVLSTWEEDEKTLNFKAVSIVLISQNFHPKAFKEILLEISNDLRTPEFSSSSELIRFLTYELVEEGSTIEIRTKTLHVELGFELIPISPVTGKDVAMLFKMLGFQNVIKIIHALLSDCRIVLASSSLMRLSRCQNAILSLLYPFEYVHSCVTILPDSLAEVLESPTPFLIGVLSEFVTSFGDENIVVYLDNGEVHVPDHAEIYKSDDYYYNSLHQRLRDVMFTTTSQEDLSIPNEERIEVDDFILDKKLRACFIYYFAELLYGYQYYILYTRIKGNFEKKLTTSLTFHVGAFRGFRKLTDMMSSSLLKSVYFQTFILTRALPRRKHDLFDEISCFKELDQLIFKQNSTSSESKKIIEHISCELIQKERYMEKCSARKQEIFTKIHWISGKELAQNNNSIIHTVKPKMRSNVILQAMLPVVNTHAEYHANQFEAYAHRIEALRNCLAAIFEGKVAFASKSLDAVKSSMRFAPLRIELCRLLNQKCSHDKLTDKQFEDIALLMNAALQAECEEDKDGVVRSLMYLSNVYSRKVAQGMQQYMYTAVQEHKVWKNQRFWTSCFYYEVHEMLFSEMLQKDRKITESLWCHTLRPCAMEMINTDDTDQEELVKQENEMIQAQAKHFANILISLQIPLSEEFFEHEDAHRSVLNEKCKWIVNTLDSILGVTGRINGLSLSRIQTYVEAHVESLRDVYVEMSTGEHLKKGNFDPVLAHGEFLISDPIDCYLLTSIEESEMSLNRLENLLPADGSLFLTNYRVIFKGKSVDINATNGTIVQTIPLYSMESFKKLTNKKLIPTQLIEKGVKIEHIISIRSSCASSIIIAFDEDEINNMAIEKFLEVIETNSHNSFAFYNTRKDMKVVENGSHKFGTLNSAIRGFTKKKTDTRRIRSHSSHRGSIQLSFDKMEELDYLKKNAHIRYAVIDYPRIGLNSKIVKLRMSHSNLDYTICPSYPGNFIVPSETNESELAKVAKGFVEHRLPVVVWMNENGALLVRASAFTSIDMVKKLKKVVNYRRNASKLTGSMTGSQQTLHSKASSNEESSSNIVAGAEIKSAEVQMNYIAKLSNSSQRAVSYALPTQYADKFSTFNDGCTLTQNNANGFPTTRIHRKALYVLLEKGHGVKIPIDSNAEAIMVRSVKESELRRSLQRARQICSSEFQVENRTSFLESWNASNWPQCVSRMIELSNSIVALMNLYNSSVAICLEAGRSITTILSSLSQLLSDPYYRTCDGFQVLVEKEWLAFGHYFHKDTETSSPSFICFLDCVYQISQQYPTAFEFSYFYISFLAYHSTAGYFRTFIDDCEEKRLQSDANEFYLPDNLATINVWEFIKLRNRVSAAFYNELYEQIGDIVIPSSSIPQIHMWPFLAETHLKYGSPYDIEPASHEQQLVDPDYEEEEDWSKLNNTDIDERHLNRRVRSPERDPANMDMIRLLQKSYLTELFDASDRKTTTNGESNGKETIHELTPFTVGARPVQCCYCTNILTRWSKAVHCKKCRIHVHEGCVNRNITIGNITHTWDAKPFEDIKMPSGAIQIGTPQAEKMLHSPNNTLTRESMSPPTANTIPPLCTGYLSKRGAKLKLWVPRFFVLYPDSPKVYYYEDFENWKTAEKPSGCIDLVDFKSFNLEQTGRRGLIELHMKNKTHRLLSENINEAIRWKECIEQVIRD | ||
+ | |||
+ | ''Drosophila Melanogaster'' sequence | ||
+ | MSRLADYFVIVGYDSDKEKTASNVGGQPTCGKIVQRFPEKDWPDTPFIEGIEWFCQPLGWSLSYEKQEPKFFVSVLTDIDANKHYCACLSFHETVAITQTRSVDDEDETIGSSRLLGATPSSMDGITTTSTPASITHHSVMYAPKCLVLISRLDCAETFKNCLGTIYTVYIENLAYGLETLIGNILGCIQVPPAGGPQVRFSIGAGDKQSLQPPQSSSLPTTGSGVHFLFKQLGIKNVLILLCSVMTENKILFLSKCYWHLTDSCRALVALMYPFRYTHVYIPILPAPLTEVLSTPTPFIMGIHSSLQTEITDLLDVIVVDLDGGLVTIPESLTPPVPILPSPLWEQTQDLLSMILFPNLAQADLAFPTLERPSAIAKTDAQIDKELRAIFMRLFAQLLQGYRSCLTIIRIHPKPVITFHKAGFLGARDLIESEFLFRVLDSMFFTTFVNERGPPWRSSDAWDELYSSMNELLKSEAQNRNLVGRTQRFKGYFNFTFPSYFQILTHIQELGRVLYENEGTLAHISYAQKVLRPPEGAFQRIHQPAFPRISSEKVELIIQEGIRKNGVPQRFHVTRNQHRIIPMGPRLPEALDVRPNVQNSARRLEVLRICVSYIFENRITDARKLLPAVMRTLMHRDARLILCREFFGYVHGNKAVLDHQQFELVVRFMNKALQKSSGIDEYTVAAALLPMSTIFCRKLSTGVVQFAYTEIQDHAIWKNLQFWESTFFQDVQGQIKALYLLHRRQNEHQKEANCVLDEVPLEEPTALEITAEQLRKSPNIEEEKKAELAKSEESTLYSQAIHFANRMVSLLIPLDVNVDAASKPKPAFRLEENQSVSNSIMGSHSLSEHSDEGFEENNALEIGVTVGKTISRFIDCVCTEGGVTSEHIRNLHDMVPGVVHMHIESLEPVYLEAKRHPHVQKPKIQTPCLLPGEDLVTDHLRCFLMPDGREDETQCLIPAEGALFLTNYRVIFKGSPCDPLFCEQVIVRTFPIASLLKEKKISVLYLAHLDQTLTEGLQLRSSSFQLIKVAFDPEVTPEQIESFRKILSKARHPFDEFEYFAFQSYGTMLQGVAPLKTKEKYSTLKGFAKKTLLRGAKKAGFKQKQQTKRKLVSDYDYGSADAQETQSIDDELEDGDEFETQNNAMPRLLTTKDVERMRERSYVQDWKRLGFDAESQRGFRISNANTSYATCRSYPAIIVAPVQCSDAAIMHLGRCFKGQRIPLPTWRHANGALLIRGGQPNSKSVIGMLKNTTGSTTNAHHDVTHYPEQDKYFLALINTMPKLTPLALNQYSGMNLSMSSLMGHSSSDDRQPLTPELSRKHKNNLDISDGNKSSQGGKGGTMKGNPKNSLAHPFRKMRLYALGEKSQAKSNMNVDFCADFIPVDYPDIRQSRPAFKKLIRACMPSHNTNEADGQSFAKMVEQSDWLQQISSLMQLSGAVVDLIDLQESSVMLSLEDGSDVTAQLSSIAQLCLDPYYRSLDGFRVLVEKEWLAFGHRFAHRSNLKPSHANTNIAFAPTFLQFLDVVHQLQRQFPMAFEFNDFYLRFLAYHSVSCRFRTFLFDCELERSDSGIAAMEDKRGSLNAKHMFGAGGMATNGSDDECSVYPLDIRSQRAPAPLNRIGHSIFDYIERQHNKTPIFYNFLYSGDKSVTLRPQNNVAALDLWCYYTNEELAQGAPYDLEVTTVDDEIDLSETKGKRMVITAGYDNMEKCNPSAYVCLLSEVKQAETERGHLPQKWLQVWNSLEVPQLEPVARNTSLGNIFVQTHQHKRSTLEIIMKGRLAGYQDKYFHPHRFEKHPYTTPTNCNHCTKLLWGPVGYRCMDCGNSYHEKCTEHSMKNCTKYKAIDGAVGPPNVNMSQGDTASIASSAATTARTSSHHFYNQFSSNVAENRTHEGHLYKRGALLKGWKQRWFVLDSIKHQLRYYDTSEDTAPKGIIELAEVQSVTAAQPAQIGAKGVDEKGFFDLKTSKRIYNFYAINANLAQEWIEKLQACLQ |
Revision as of 18:07, 11 September 2015
Back to List of HMMs
Symbol: MTMR5_C1
Name: MTMR5, C1 domain
Description
The MTMR5 subfamily has a C1 domain which is able to be detected by Pfam C1_1 profile in C1 clan. We try to build a HMM, which can detect the C1 domain of MTMR5.
How the HMM is built
We PSI-BLASTed the sequence that containing the C1 domain in C. elegans mtm-5 (1540-1590 determined by searching the complete sequence against CDD and Pfam database). However, the search converged right after the 1st round.
Below are the two full sequences used:
C. elegans sequence
MRDPDKVKSGPICDTVAVIVLEESDDENALPDVLHEVQSPHTSDNIPTSSIKKFARPRGWYNQSVSSPSEFFYQILTTERGTRRIAYVLSTWEEDEKTLNFKAVSIVLISQNFHPKAFKEILLEISNDLRTPEFSSSSELIRFLTYELVEEGSTIEIRTKTLHVELGFELIPISPVTGKDVAMLFKMLGFQNVIKIIHALLSDCRIVLASSSLMRLSRCQNAILSLLYPFEYVHSCVTILPDSLAEVLESPTPFLIGVLSEFVTSFGDENIVVYLDNGEVHVPDHAEIYKSDDYYYNSLHQRLRDVMFTTTSQEDLSIPNEERIEVDDFILDKKLRACFIYYFAELLYGYQYYILYTRIKGNFEKKLTTSLTFHVGAFRGFRKLTDMMSSSLLKSVYFQTFILTRALPRRKHDLFDEISCFKELDQLIFKQNSTSSESKKIIEHISCELIQKERYMEKCSARKQEIFTKIHWISGKELAQNNNSIIHTVKPKMRSNVILQAMLPVVNTHAEYHANQFEAYAHRIEALRNCLAAIFEGKVAFASKSLDAVKSSMRFAPLRIELCRLLNQKCSHDKLTDKQFEDIALLMNAALQAECEEDKDGVVRSLMYLSNVYSRKVAQGMQQYMYTAVQEHKVWKNQRFWTSCFYYEVHEMLFSEMLQKDRKITESLWCHTLRPCAMEMINTDDTDQEELVKQENEMIQAQAKHFANILISLQIPLSEEFFEHEDAHRSVLNEKCKWIVNTLDSILGVTGRINGLSLSRIQTYVEAHVESLRDVYVEMSTGEHLKKGNFDPVLAHGEFLISDPIDCYLLTSIEESEMSLNRLENLLPADGSLFLTNYRVIFKGKSVDINATNGTIVQTIPLYSMESFKKLTNKKLIPTQLIEKGVKIEHIISIRSSCASSIIIAFDEDEINNMAIEKFLEVIETNSHNSFAFYNTRKDMKVVENGSHKFGTLNSAIRGFTKKKTDTRRIRSHSSHRGSIQLSFDKMEELDYLKKNAHIRYAVIDYPRIGLNSKIVKLRMSHSNLDYTICPSYPGNFIVPSETNESELAKVAKGFVEHRLPVVVWMNENGALLVRASAFTSIDMVKKLKKVVNYRRNASKLTGSMTGSQQTLHSKASSNEESSSNIVAGAEIKSAEVQMNYIAKLSNSSQRAVSYALPTQYADKFSTFNDGCTLTQNNANGFPTTRIHRKALYVLLEKGHGVKIPIDSNAEAIMVRSVKESELRRSLQRARQICSSEFQVENRTSFLESWNASNWPQCVSRMIELSNSIVALMNLYNSSVAICLEAGRSITTILSSLSQLLSDPYYRTCDGFQVLVEKEWLAFGHYFHKDTETSSPSFICFLDCVYQISQQYPTAFEFSYFYISFLAYHSTAGYFRTFIDDCEEKRLQSDANEFYLPDNLATINVWEFIKLRNRVSAAFYNELYEQIGDIVIPSSSIPQIHMWPFLAETHLKYGSPYDIEPASHEQQLVDPDYEEEEDWSKLNNTDIDERHLNRRVRSPERDPANMDMIRLLQKSYLTELFDASDRKTTTNGESNGKETIHELTPFTVGARPVQCCYCTNILTRWSKAVHCKKCRIHVHEGCVNRNITIGNITHTWDAKPFEDIKMPSGAIQIGTPQAEKMLHSPNNTLTRESMSPPTANTIPPLCTGYLSKRGAKLKLWVPRFFVLYPDSPKVYYYEDFENWKTAEKPSGCIDLVDFKSFNLEQTGRRGLIELHMKNKTHRLLSENINEAIRWKECIEQVIRD
Drosophila Melanogaster sequence
MSRLADYFVIVGYDSDKEKTASNVGGQPTCGKIVQRFPEKDWPDTPFIEGIEWFCQPLGWSLSYEKQEPKFFVSVLTDIDANKHYCACLSFHETVAITQTRSVDDEDETIGSSRLLGATPSSMDGITTTSTPASITHHSVMYAPKCLVLISRLDCAETFKNCLGTIYTVYIENLAYGLETLIGNILGCIQVPPAGGPQVRFSIGAGDKQSLQPPQSSSLPTTGSGVHFLFKQLGIKNVLILLCSVMTENKILFLSKCYWHLTDSCRALVALMYPFRYTHVYIPILPAPLTEVLSTPTPFIMGIHSSLQTEITDLLDVIVVDLDGGLVTIPESLTPPVPILPSPLWEQTQDLLSMILFPNLAQADLAFPTLERPSAIAKTDAQIDKELRAIFMRLFAQLLQGYRSCLTIIRIHPKPVITFHKAGFLGARDLIESEFLFRVLDSMFFTTFVNERGPPWRSSDAWDELYSSMNELLKSEAQNRNLVGRTQRFKGYFNFTFPSYFQILTHIQELGRVLYENEGTLAHISYAQKVLRPPEGAFQRIHQPAFPRISSEKVELIIQEGIRKNGVPQRFHVTRNQHRIIPMGPRLPEALDVRPNVQNSARRLEVLRICVSYIFENRITDARKLLPAVMRTLMHRDARLILCREFFGYVHGNKAVLDHQQFELVVRFMNKALQKSSGIDEYTVAAALLPMSTIFCRKLSTGVVQFAYTEIQDHAIWKNLQFWESTFFQDVQGQIKALYLLHRRQNEHQKEANCVLDEVPLEEPTALEITAEQLRKSPNIEEEKKAELAKSEESTLYSQAIHFANRMVSLLIPLDVNVDAASKPKPAFRLEENQSVSNSIMGSHSLSEHSDEGFEENNALEIGVTVGKTISRFIDCVCTEGGVTSEHIRNLHDMVPGVVHMHIESLEPVYLEAKRHPHVQKPKIQTPCLLPGEDLVTDHLRCFLMPDGREDETQCLIPAEGALFLTNYRVIFKGSPCDPLFCEQVIVRTFPIASLLKEKKISVLYLAHLDQTLTEGLQLRSSSFQLIKVAFDPEVTPEQIESFRKILSKARHPFDEFEYFAFQSYGTMLQGVAPLKTKEKYSTLKGFAKKTLLRGAKKAGFKQKQQTKRKLVSDYDYGSADAQETQSIDDELEDGDEFETQNNAMPRLLTTKDVERMRERSYVQDWKRLGFDAESQRGFRISNANTSYATCRSYPAIIVAPVQCSDAAIMHLGRCFKGQRIPLPTWRHANGALLIRGGQPNSKSVIGMLKNTTGSTTNAHHDVTHYPEQDKYFLALINTMPKLTPLALNQYSGMNLSMSSLMGHSSSDDRQPLTPELSRKHKNNLDISDGNKSSQGGKGGTMKGNPKNSLAHPFRKMRLYALGEKSQAKSNMNVDFCADFIPVDYPDIRQSRPAFKKLIRACMPSHNTNEADGQSFAKMVEQSDWLQQISSLMQLSGAVVDLIDLQESSVMLSLEDGSDVTAQLSSIAQLCLDPYYRSLDGFRVLVEKEWLAFGHRFAHRSNLKPSHANTNIAFAPTFLQFLDVVHQLQRQFPMAFEFNDFYLRFLAYHSVSCRFRTFLFDCELERSDSGIAAMEDKRGSLNAKHMFGAGGMATNGSDDECSVYPLDIRSQRAPAPLNRIGHSIFDYIERQHNKTPIFYNFLYSGDKSVTLRPQNNVAALDLWCYYTNEELAQGAPYDLEVTTVDDEIDLSETKGKRMVITAGYDNMEKCNPSAYVCLLSEVKQAETERGHLPQKWLQVWNSLEVPQLEPVARNTSLGNIFVQTHQHKRSTLEIIMKGRLAGYQDKYFHPHRFEKHPYTTPTNCNHCTKLLWGPVGYRCMDCGNSYHEKCTEHSMKNCTKYKAIDGAVGPPNVNMSQGDTASIASSAATTARTSSHHFYNQFSSNVAENRTHEGHLYKRGALLKGWKQRWFVLDSIKHQLRYYDTSEDTAPKGIIELAEVQSVTAAQPAQIGAKGVDEKGFFDLKTSKRIYNFYAINANLAQEWIEKLQACLQ