Phosphatase GeneID SpurP050
We have the sequence as:
XRFSISWPRVLPKGTLAGGVNQAGLDYYTNLIDELLANDIEPVVTLYHWDLPQVLQDDYGGWENDSLTDLFNDYANLCFNEYASKVNLWITFNEPYVVTWLGYGIGVFAPGVYSPGYAPYRAAHTIIKAHAKAYNTYRANYFPQYGGKVSITLSTDFGMPEFPDKEVDVAAADRYMQFTAGWFAHPILKNGDYPDVMKWQVGNKSQEQGLSESRLPVFTEEEKQYIKGTGDFFGLNSYTTTVCRHRIEPAGDPNYEGDQMEVLGGCQEPLIPRNGIGRSGVFCALWSVMEKLRKDNTIDIFQAVKRLRSSRPNMVETLDQYQLCYDVMNQHLHQFEEYANLDIL
But, the beginning of this is clearly a non-phosphatase, with good homologs in many organisms, none of which have a phosphatase domain, and have EST coverage. When we trim that off, we get
>Trimmed RNGIGRSGVFCALWSVMEKLRKDNTIDIFQAVKRLRSSRPNMVETLDQYQLCYDVMNQHLHQFEEYANLDIL
This sequence is identical over 45 AA to SpurP102, so this is no longer a unique gene and can be deleted from the phosphatome. Since SpurP102 is at the end of a contig, our guess is that this contig overlaps with the P050 one that then extends a little, but it really is just one gene.