Csa1G043300 (gene) Cucumber (Chinese Long) v2

NameCsa1G043300
Typegene
OrganismCucumis. sativus (Cucumber (Chinese Long) v2)
DescriptionMetallophosphoesterase; contains IPR004843 (Metallophosphoesterase domain)
LocationChr1 : 4842595 .. 4848071 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTGGCTACGTTTGTTCCTTTTTGTTCGAACTTTACACTAAAACTCCGCCCTTCTAATCCCCCTTCGATCCTTGCACCTCTACGCAAAAATCAGAGAGGCGCGAAGTTGAAATGTTTCGGTGTTGCGAGGCCTCATATTCTGCCTTCCGCATTCGGAGAAGACGGTGGCCTGCGAGTCTTTGTGCTCTCTGATCTACATACGGATTACGACGAAAATATGAACTGGATTCACTCCTTGTCATTGGACAAATATAGAGACGATGTTCTTATTGTCCCCGGAGATGTAGCCGAGACGATAAGTAATTTTGTTTCGACAATGGCTATGTTGAAGGATAGATTTGAGCGCGTCTTCTTTGTGCCTGGGAATCATGATCTGTGGTGCCGTCGGGAGGAAGACAATTATGTAAGTGGTTGGAGGTTAGGTTTCGTGCTGTAAGCTCTCTCGTAATTTCTTAGTTTTTATATTTTTAGGATGACTTACCAGTTTTGAAGCATCTATACATTGTGATTTTCATTAGATTTTCCTTCTTAGCTATTTCATTACACGCTTTCTTTTTTTCAGCTCGATTCTATTGAGAAGATGAGTAAACTTCTTGATGCGTGTAGAGATCTTGGAGTTGATACCAATCCGGCAATCTTGAACGGATTGGGAATAGTTCCGTTGTTCTCTTGGTATCACGAGGTGGGTGATTGACTGTTACTTTCTCTTTGTAGCTTTCAGTTCTAGTGATATTAATGCTCATCCTTGTGCGTTATTTGATGTTTTTGCATCAGAGTTTTGACAGAGAAATGGACTTACAGGGTATCCGCATTCCATCTCTGGAGATGGTGATTCACTTTCTTTTCCCTAGTAAGAGTCTTTAGATCTAGTTTTAATATCAGGTTTTTTCTGTATCTATAACGCCTTTGGTATAGTTTCATGTTAATCTTTTTATTTATTATTCAAGGTTCTTGCAATGGTCTCATTGGATTTTTGTTTTCTTTTGAATAATACATATAGAAAATGGAAATGGAAATGAAATTAGTTCTTCGTATCATGGAGAACTAATTCCATTTCCAGAGCTCTAATTTGTTTGCCTTTTGTCCTCCATCCCTTTTAATTTTTGCATTTTGAATGTCTTTTTTGGTTCGTTTGATGTGGGCTGCATCTTTTCAACTTGTCTGTGTATTTATTTCAAGATTCCTGTTTCTTTGATTTCTCATGGAATGTTACCTTGAGCAATGATCAAAACTTGTGTTGGTGGAATAAGGAGTTTTCTCTCTTCTACAGTTGAGCTTTGAAATTTCAATGTTTTGTACCTATAGCTTGCTGGCAACTTCCATTTCGTTTCTCATTATAATATCTGCTGTGATAAAATTTTATTGAGTGAACAATTCTCCCAATTAGTTACTTTATTGTAGATGTTCACGCATGGTAGTTATATTATTTGTGTTTGAAGAAGGCTCGAGATTATTGTTGTGTCTTCGTACTGTTTTTTAATCCTTTTACACGAAATTTTATTTTGGTTATTTTTGCAACTTCATTTTCTTATTTGAATCCGTTGATTTGAAAATGAAAACTCGTGGAATTGGACCAATTGGTTTTAAATGGCTAATGGCAGGTATGTAAAGACTTCCATGCATGTAAATGGCCCGGAGATCTTTCAAATGAAGGTGCTTCACTAGCATTGTTCTTTGATGCAATGAATGAAAAGAATAATATAATGATCGAGAAGATTCGAAGAACCTGCAGTCAGATAATTACATTTTCTCACTTTGTTCCCAGGTCAGTAAATATACGTGCGCAAACACCCAACTCATTAATTGTGTTTTACAAACTGATTCTCATATCACGGTCTTCTATGGAGAGCATTTGGTGCATCTTTTATATGTTGTTAAATTATGGATCGTTTATGTCAATAGACAGTGTTCAATTTGCATGTCGTATGCTGTTGCCTTGGAAATGCAATTATTTAATCGAGGTATTTGGTCCACACTTTATTGTTGCATATAAAGTAGCAACAAGTGTGATGATTCATTTATGGCACTTAGATTCTTAAATGAAGTTGGATTTACATTCTTTCTCCTGAACCCAATCAAACACTGACTCAAACCAGAGGGCAAGTTTGGATTACTTCTTCTTTTTTAGAATTTTGATAACTGCCAATTATATTAAATAACAGGGGTATAGGAGGACCCCACTCCATGGATTATGTTCATTATGTATATGTTCATGTCCCATTCCAATTAACCTTTGGAAGAGAAAGTTCGCCGATCCAAAGCTAGCTTCTAGGGTTAAAATAATGAAGTTATTAAATCCACGATCTTGGAGCTTATCACTAAACATCCTATACTCTTACCACTAGGATCGCTTCTTAATCACGGGAAAGCATGGCTACTGAAACGTTTGACGAAGTCAAGCATATGATGATTTATGACTTTTTTATCAATAGCTTCTTTGTGTTTGTCATTTTTGTTAATATAATGATGTGCTTTGCTTGTTGTGTTCATAGGCCAGAGCTATGTCCAGAAAAGAGGATGCTATTCTACCCAAAACTCCCTAAAATCATTGGCTCAGATTATCTCGAGGATCGTATAAGATCAATACACGGGAGTAAAGAAACAACATCAGCATGTCATGTGTTTGGTCATACCCATTTCTGCTGGGATCTAGTACTTGATGGCATCAGGTAATGTTAAATCATTTCACTTCACTTGCAAAATGCAAAATTTTTCCTTGACCTTCGAGTTCAAATTTTAGAGCATGAAAAGGAGTAGAAGACAATGAATTCTTCTCATTGTGTGACCCTTTTTATACGCATGAACTTGTGCTTATATGTTTGATCATAAGAAGATTGCTGAGATGCTTTATTTTCAGGTATGTCCAGGCGCCATTGGCTTATCCCAGAGAACGCAAGAAAAGGATGAATGGCGGTGAAGATTGGCTTCCTTTTTGCATTTTTTCCAACGGAAGGTTCGCTCATAAACTCACCCCTTGTTACTGGTCTGATTATTATGCATCCAATCCAAGATCACCTCACAACACCCAACTTGCTCCTTGGGTTGCCAAATACTATAAAATGAAATGAAAGAACTGTTCAGGTTCAATCCTATTTTTGTATATTTTTTTTGCGTTTATTGTTTTAGTATACGTACTGTCATAAGTCATGTTGTTCTTTTGTACGTCGGTTTTGGTTTCATTTTCTCCTTGTTTATCCTTTTTTCCTTTCACAATTTCAGATATATATAATTCATTTTTTACAGAGATGACTATATTTTAGTTTATAATTCGTGTCAATAGTATTAAGGACCAAAATGAATATGGAGAGCAAAGACAAAAGATTGAAGGACCATTTTTGGTTTGTCTTGAAATTACAGGAAATGAAATGTCAACTTACGAGGACGACAAGGATATGGATTAAGGGCAAAACTTTGGACCTATTCATGGTATTTCCATAACCTAATCCAACTCTTTTGTCTCTGTTGTGAAATTTATTGAATTCTCCGTTTGAAAATGTCACCGTATTTGGATTTATAAAAAATATAGGGGGTAACCTTTAAGATCGAACCTTGACCTTATCACAAATGGATTTAAAAAGGAAAAGAAAAGAAAACATCACCTTAGAATTTAAGAAAAGATAAAGATGTAAGCTGTATGACATTTATTACCTTCATGAACATAAACACCACAAATAAACCCCAAACGGAGACAACAAACCACCCTTGGCCAACAAAAGGCTTCGAGTCTTTTCCAAAGAAGAAAAAAAAACCCTACAACATATAGAGCAGAAAAACAGTCTCACATCCAAAACACACATTCGAATATGTTTTTACAAATTAAAAAAAAAAAAAAAAAAAACTCAGTAATCCTCGGGAGCCAAGGGCTGGTGAGTATGTAGAGGAGGTTCAACGGGAATGACCAAGTACTCATATACAAGGGCCGCAAGCCCACCTCCGATTAATGGGCCAATCCAATAGATCCAATGATTGTCCCATCTCCAGCCCACTAAAGAAGGCCCAAATGCCCTTGCTGGGTTCATGCAGGCCCCATCGAAGGCCCCACCCACCAGAATATTGGCCCCGACTATTAACCCGATTGCCAGTGGTGCTATTGTTCCCAAACTGCCCCTCTTTGGGTCTATTGCTGTTGCGTAAACTGTGTAGACCAATGCAAATGTAAGGATTATCTCTAGTAGAAACCCGTGCAATTCCGATACTCCTGATGAAACAAAGAACCCCATAGGCCTCTGCAATTTGACAAAAAAAAAAAAAAAAAGAGATGATAAAATAAACAAAGGAAGTAAGGACGTTAAAGGAAAGTAAAAATTAGGGATGGATATCTTGAGCGCCTCTACTAGAAACAATAATCTAAAAGGGTTATGAAGCATGGCATGAGAGACTTAGATGGAAGCCTCTTAGTTTTTAAAAATTAAGCACATTTTCCTCAATTGCTTTGAATGCTTAGCCAAAATTTTGGAGCTCTTTTCTTACGTTTACAGAACTTAGATGGTGAAACCAGAGAGAGTCAGTTTGAGTATGCACGAAATTGTGGAATTAAAAGTTAAATGAAAAAAAAAATAAAAACATTTCTTTTTTCTTCCTTCTTAAAAGTTTGATTAGATACTTCGGAAACATACACAAAAGTTATATGTGTACTTTAAAATAATAATGAGTTCAAGATCTTTAGTATTTTGTTTTTTTTTTCTTAACCTTTTTTATTCTTGAATCTTCATATAATTGTGAAAAAAGAAAAAAAAAATTAAAGGCTACATTTTCTAAAGCATTAGTTATATGTGAGTGTTTTTCTTTTACCATGCCACCCGTTGCAAGTCTCAAGAGAAGTGAAGCGATAATGGCACCCAAAATCTGTGCAACCCAATAGAAGAATGCACGAATAAGAGATATCCTCCCTCCGATAAGGGCCCCAAAAGTTACTGCAGGATTCACATGCCCACCCGATATGTTGATGCTCGCTGCCACCGCCGAGAACAGCGCAAACGCGTGTGCTATCGCTATCACTACTAAATCGGACGCAGCTCTGCCTGTGTCGGTCCCCTTTCTACCATAACCGTGACCATAACTGTACCCTCCTCGGCCGTAACTTCCATGCCCATAACTTCCGTGACCATAACTTCCATAATCCGCTGGCCTGAAAATTTTATCTGCAAAAATGGAAAACCAAGAAATTAATAATCCATCCATTGATCTTGCATGCATGTACTTGAAAATGTTTGTAAAAAAATTTAAAAGAAAATCCTTTCAAATACAATATTGAATTTTTTAATAAGATCAAGTTTGGGCGGTTGCATGTTTGAAAAAAAAAAATCATTTGTCAAATCGTTGTTGATTATTTAAAATTAGTTTATTAGTCAATTTTTTACTTTCAAAATAACTGTAAAACAGTCATACCAAGAGCGAGAACGGAGCCTTCGCCAGCAAAGACGAAGATGAAAGTGGAAATGAACTCAGCTAAGGTAGCACGAATGGAGTCGGGGT

mRNA sequence

ATGGTGGCTACGTTTGTTCCTTTTTGTTCGAACTTTACACTAAAACTCCGCCCTTCTAATCCCCCTTCGATCCTTGCACCTCTACGCAAAAATCAGAGAGGCGCGAAGTTGAAATGTTTCGGTGTTGCGAGGCCTCATATTCTGCCTTCCGCATTCGGAGAAGACGGTGGCCTGCGAGTCTTTGTGCTCTCTGATCTACATACGGATTACGACGAAAATATGAACTGGATTCACTCCTTGTCATTGGACAAATATAGAGACGATGTTCTTATTGTCCCCGGAGATGTAGCCGAGACGATAAGTAATTTTGTTTCGACAATGGCTATGTTGAAGGATAGATTTGAGCGCGTCTTCTTTGTGCCTGGGAATCATGATCTGTGGTGCCGTCGGGAGGAAGACAATTATCTCGATTCTATTGAGAAGATGAGTAAACTTCTTGATGCGTGTAGAGATCTTGGAGTTGATACCAATCCGGCAATCTTGAACGGATTGGGAATAGTTCCGTTGTTCTCTTGGTATCACGAGAGTTTTGACAGAGAAATGGACTTACAGGGTATCCGCATTCCATCTCTGGAGATGGTATGTAAAGACTTCCATGCATGTAAATGGCCCGGAGATCTTTCAAATGAAGGTGCTTCACTAGCATTGTTCTTTGATGCAATGAATGAAAAGAATAATATAATGATCGAGAAGATTCGAAGAACCTGCAGTCAGATAATTACATTTTCTCACTTTGTTCCCAGGCCAGAGCTATGTCCAGAAAAGAGGATGCTATTCTACCCAAAACTCCCTAAAATCATTGGCTCAGATTATCTCGAGGATCGTATAAGATCAATACACGGGAGTAAAGAAACAACATCAGCATGTCATGTGTTTGGTCATACCCATTTCTGCTGGGATCTAGTACTTGATGGCATCAGGTATGTCCAGGCGCCATTGGCTTATCCCAGAGAACGCAAGAAAAGGATGAATGGCGGTGAAGATTGGCTTCCTTTTTGCATTTTTTCCAACGGAAGGAAATGA

Coding sequence (CDS)

ATGGTGGCTACGTTTGTTCCTTTTTGTTCGAACTTTACACTAAAACTCCGCCCTTCTAATCCCCCTTCGATCCTTGCACCTCTACGCAAAAATCAGAGAGGCGCGAAGTTGAAATGTTTCGGTGTTGCGAGGCCTCATATTCTGCCTTCCGCATTCGGAGAAGACGGTGGCCTGCGAGTCTTTGTGCTCTCTGATCTACATACGGATTACGACGAAAATATGAACTGGATTCACTCCTTGTCATTGGACAAATATAGAGACGATGTTCTTATTGTCCCCGGAGATGTAGCCGAGACGATAAGTAATTTTGTTTCGACAATGGCTATGTTGAAGGATAGATTTGAGCGCGTCTTCTTTGTGCCTGGGAATCATGATCTGTGGTGCCGTCGGGAGGAAGACAATTATCTCGATTCTATTGAGAAGATGAGTAAACTTCTTGATGCGTGTAGAGATCTTGGAGTTGATACCAATCCGGCAATCTTGAACGGATTGGGAATAGTTCCGTTGTTCTCTTGGTATCACGAGAGTTTTGACAGAGAAATGGACTTACAGGGTATCCGCATTCCATCTCTGGAGATGGTATGTAAAGACTTCCATGCATGTAAATGGCCCGGAGATCTTTCAAATGAAGGTGCTTCACTAGCATTGTTCTTTGATGCAATGAATGAAAAGAATAATATAATGATCGAGAAGATTCGAAGAACCTGCAGTCAGATAATTACATTTTCTCACTTTGTTCCCAGGCCAGAGCTATGTCCAGAAAAGAGGATGCTATTCTACCCAAAACTCCCTAAAATCATTGGCTCAGATTATCTCGAGGATCGTATAAGATCAATACACGGGAGTAAAGAAACAACATCAGCATGTCATGTGTTTGGTCATACCCATTTCTGCTGGGATCTAGTACTTGATGGCATCAGGTATGTCCAGGCGCCATTGGCTTATCCCAGAGAACGCAAGAAAAGGATGAATGGCGGTGAAGATTGGCTTCCTTTTTGCATTTTTTCCAACGGAAGGAAATGA

Protein sequence

MVATFVPFCSNFTLKLRPSNPPSILAPLRKNQRGAKLKCFGVARPHILPSAFGEDGGLRVFVLSDLHTDYDENMNWIHSLSLDKYRDDVLIVPGDVAETISNFVSTMAMLKDRFERVFFVPGNHDLWCRREEDNYLDSIEKMSKLLDACRDLGVDTNPAILNGLGIVPLFSWYHESFDREMDLQGIRIPSLEMVCKDFHACKWPGDLSNEGASLALFFDAMNEKNNIMIEKIRRTCSQIITFSHFVPRPELCPEKRMLFYPKLPKIIGSDYLEDRIRSIHGSKETTSACHVFGHTHFCWDLVLDGIRYVQAPLAYPRERKKRMNGGEDWLPFCIFSNGRK*
BLAST of Csa1G043300 vs. TrEMBL
Match: A0A0A0LR12_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G043300 PE=4 SV=1)

HSP 1 Score: 714.1 bits (1842), Expect = 7.8e-203
Identity = 340/340 (100.00%), Postives = 340/340 (100.00%), Query Frame = 1

Query: 1   MVATFVPFCSNFTLKLRPSNPPSILAPLRKNQRGAKLKCFGVARPHILPSAFGEDGGLRV 60
           MVATFVPFCSNFTLKLRPSNPPSILAPLRKNQRGAKLKCFGVARPHILPSAFGEDGGLRV
Sbjct: 1   MVATFVPFCSNFTLKLRPSNPPSILAPLRKNQRGAKLKCFGVARPHILPSAFGEDGGLRV 60

Query: 61  FVLSDLHTDYDENMNWIHSLSLDKYRDDVLIVPGDVAETISNFVSTMAMLKDRFERVFFV 120
           FVLSDLHTDYDENMNWIHSLSLDKYRDDVLIVPGDVAETISNFVSTMAMLKDRFERVFFV
Sbjct: 61  FVLSDLHTDYDENMNWIHSLSLDKYRDDVLIVPGDVAETISNFVSTMAMLKDRFERVFFV 120

Query: 121 PGNHDLWCRREEDNYLDSIEKMSKLLDACRDLGVDTNPAILNGLGIVPLFSWYHESFDRE 180
           PGNHDLWCRREEDNYLDSIEKMSKLLDACRDLGVDTNPAILNGLGIVPLFSWYHESFDRE
Sbjct: 121 PGNHDLWCRREEDNYLDSIEKMSKLLDACRDLGVDTNPAILNGLGIVPLFSWYHESFDRE 180

Query: 181 MDLQGIRIPSLEMVCKDFHACKWPGDLSNEGASLALFFDAMNEKNNIMIEKIRRTCSQII 240
           MDLQGIRIPSLEMVCKDFHACKWPGDLSNEGASLALFFDAMNEKNNIMIEKIRRTCSQII
Sbjct: 181 MDLQGIRIPSLEMVCKDFHACKWPGDLSNEGASLALFFDAMNEKNNIMIEKIRRTCSQII 240

Query: 241 TFSHFVPRPELCPEKRMLFYPKLPKIIGSDYLEDRIRSIHGSKETTSACHVFGHTHFCWD 300
           TFSHFVPRPELCPEKRMLFYPKLPKIIGSDYLEDRIRSIHGSKETTSACHVFGHTHFCWD
Sbjct: 241 TFSHFVPRPELCPEKRMLFYPKLPKIIGSDYLEDRIRSIHGSKETTSACHVFGHTHFCWD 300

Query: 301 LVLDGIRYVQAPLAYPRERKKRMNGGEDWLPFCIFSNGRK 341
           LVLDGIRYVQAPLAYPRERKKRMNGGEDWLPFCIFSNGRK
Sbjct: 301 LVLDGIRYVQAPLAYPRERKKRMNGGEDWLPFCIFSNGRK 340

BLAST of Csa1G043300 vs. TrEMBL
Match: U5FHK5_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0017s03550g PE=4 SV=1)

HSP 1 Score: 474.2 bits (1219), Expect = 1.4e-130
Identity = 229/350 (65.43%), Postives = 266/350 (76.00%), Query Frame = 1

Query: 1   MVATFVPFCSNFTLKLRPSNPPSILAPLRKNQRGAKLKC------FGVARPHILPSA-FG 60
           MV       S  TL   P  PP    P   N    + +C      + + RP ILP     
Sbjct: 1   MVMVMRSISSCLTLSQTPPPPPR---PRASNHLSTQKQCKRNSRNYCITRPQILPPPPLS 60

Query: 61  EDG-GLRVFVLSDLHTDYDENMNWIHSLSLDKYRDDVLIVPGDVAETISNFVSTMAMLKD 120
            D  G RVFVLSDLHTDY ENMNW+ SLS   Y++D+L++ GDVAET  NF STM++LKD
Sbjct: 61  RDALGFRVFVLSDLHTDYPENMNWVKSLSTKAYKNDLLLLAGDVAETYHNFYSTMSLLKD 120

Query: 121 RFERVFFVPGNHDLWCRREEDN---YLDSIEKMSKLLDACRDLGVDTNPAILNGLGIVPL 180
           RF+ VF+VPGNHDLWCR E +    YLDS++K++KLLDACR LGV T P +L GLGIVPL
Sbjct: 121 RFQHVFYVPGNHDLWCRSEPEGHPYYLDSLDKLNKLLDACRGLGVQTRPMVLYGLGIVPL 180

Query: 181 FSWYHESFDREMDLQGIRIPSLEMVCKDFHACKWPGDLSNEGASLALFFDAMNEKNNIMI 240
           FSWYHESFDREMD+ GIRIPSLEMVCKDFHACKWP ++SN  ASLA +FDAMNE+N   +
Sbjct: 181 FSWYHESFDREMDIAGIRIPSLEMVCKDFHACKWPREISNRSASLASYFDAMNEENEDAV 240

Query: 241 EKIRRTCSQIITFSHFVPRPELCPEKRMLFYPKLPKIIGSDYLEDRIRSIHGSKETTSAC 300
           + I+ TC+QIITFSHF+PR ELCPEKRMLFYP LPKIIGSD+LE RIRSIHGS+   SAC
Sbjct: 241 KLIKNTCTQIITFSHFLPRQELCPEKRMLFYPNLPKIIGSDFLEVRIRSIHGSEGNASAC 300

Query: 301 HVFGHTHFCWDLVLDGIRYVQAPLAYPRERKKRMNGGEDWLPFCIFSNGR 340
           HVFGHTHFCWD VLDGIRY+QAPLAYPRERK+RMNGGE WLPFC++S G+
Sbjct: 301 HVFGHTHFCWDSVLDGIRYIQAPLAYPRERKRRMNGGETWLPFCVYSGGK 347

BLAST of Csa1G043300 vs. TrEMBL
Match: A0A061FWD8_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_013436 PE=4 SV=1)

HSP 1 Score: 473.4 bits (1217), Expect = 2.3e-130
Identity = 219/310 (70.65%), Postives = 260/310 (83.87%), Query Frame = 1

Query: 29  RKNQRGAKLKCFGVARPHILPSAFGEDGGLRVFVLSDLHTDYDENMNWIHSLSLDKYRDD 88
           +K +RG K  C  + RP ILP+A  +  GLRVFVLSDLHTDY ENM W+ S    ++  D
Sbjct: 19  KKYRRGIKRGC--IKRPQILPTALRDGFGLRVFVLSDLHTDYPENMAWVRSFPTKRHNKD 78

Query: 89  VLIVPGDVAETISNFVSTMAMLKDRFERVFFVPGNHDLWCRREEDNYLDSIEKMSKLLDA 148
           VL+V GDVAE   NFV TM++L+DRFE VF+VPGNHDLWCR E D+ LDS++K++KLLDA
Sbjct: 79  VLLVAGDVAEMYDNFVLTMSLLRDRFEYVFYVPGNHDLWCRWERDD-LDSLQKLNKLLDA 138

Query: 149 CRDLGVDTNPAILNGLGIVPLFSWYHESFDREMDLQGIRIPSLEMVCKDFHACKWPGDLS 208
           CR LGV+TNP +++GLGIVPLFSWYHESFDRE D+ GIRIPSL+M CKDF ACKWPG+LS
Sbjct: 139 CRQLGVETNPVVIDGLGIVPLFSWYHESFDREEDITGIRIPSLDMACKDFRACKWPGNLS 198

Query: 209 NEGASLALFFDAMNEKNNIMIEKIRRTCSQIITFSHFVPRPELCPEKRMLFYPKLPKIIG 268
           N  +SLAL+FDAMNE N   +++I+ TCSQIITFSHFVPR ELCPEKRMLFYP LPKIIG
Sbjct: 199 NRDSSLALYFDAMNENNQDTVKQIQSTCSQIITFSHFVPRQELCPEKRMLFYPNLPKIIG 258

Query: 269 SDYLEDRIRSIHGSKETTSACHVFGHTHFCWDLVLDGIRYVQAPLAYPRERKKRMNGGED 328
           SD+LEDRIRSIHG + ++ ACHVFGHTHFCWD ++DGIRYVQAPLAYPRER++RMNGGE 
Sbjct: 259 SDWLEDRIRSIHGIEGSSFACHVFGHTHFCWDAIVDGIRYVQAPLAYPRERRRRMNGGET 318

Query: 329 WLPFCIFSNG 339
           WLPFCI+S+G
Sbjct: 319 WLPFCIYSDG 325

BLAST of Csa1G043300 vs. TrEMBL
Match: A0A0D2UBV7_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_008G293400 PE=4 SV=1)

HSP 1 Score: 469.9 bits (1208), Expect = 2.6e-129
Identity = 216/311 (69.45%), Postives = 260/311 (83.60%), Query Frame = 1

Query: 28  LRKNQRGAKLKCFGVARPHILPSAFGEDGGLRVFVLSDLHTDYDENMNWIHSLSLDKYRD 87
           L+K +R  K  C  + RP ++  A  +  GLRVFVLSDLHTDY ENM W+ SLS  ++  
Sbjct: 18  LKKYRRCVKRGC--IKRPQVVSGAKKDGFGLRVFVLSDLHTDYPENMAWVRSLSTKRHEK 77

Query: 88  DVLIVPGDVAETISNFVSTMAMLKDRFERVFFVPGNHDLWCRREEDNYLDSIEKMSKLLD 147
           DVL+V GDVAE   NF+ TM++LK+RFE VFFVPGNHDLWCR E +++ DS+EK++KLLD
Sbjct: 78  DVLLVAGDVAEMYDNFILTMSLLKERFEYVFFVPGNHDLWCRWETEDF-DSLEKLNKLLD 137

Query: 148 ACRDLGVDTNPAILNGLGIVPLFSWYHESFDREMDLQGIRIPSLEMVCKDFHACKWPGDL 207
           AC+ LGV+TNPA+++GLGI+PLFSWYHESFDRE D+ G+RIPSL+M CKDFHACKWPG+L
Sbjct: 138 ACKQLGVETNPAVIDGLGIIPLFSWYHESFDREDDIVGVRIPSLDMACKDFHACKWPGNL 197

Query: 208 SNEGASLALFFDAMNEKNNIMIEKIRRTCSQIITFSHFVPRPELCPEKRMLFYPKLPKII 267
           SN   SLAL+FD MNEKN   +++I+ TCSQIITFSHFVPR ELCPEKRMLFYP LPK+I
Sbjct: 198 SNRDTSLALYFDLMNEKNQNTVKRIQSTCSQIITFSHFVPRQELCPEKRMLFYPNLPKVI 257

Query: 268 GSDYLEDRIRSIHGSKETTSACHVFGHTHFCWDLVLDGIRYVQAPLAYPRERKKRMNGGE 327
           GSD+LEDRIRSIHG + ++ ACHVFGHTHFCWD V+DGIRYVQAPLAYPRERK+RMNGGE
Sbjct: 258 GSDWLEDRIRSIHGVESSSFACHVFGHTHFCWDAVVDGIRYVQAPLAYPRERKRRMNGGE 317

Query: 328 DWLPFCIFSNG 339
            WLPFCI+ +G
Sbjct: 318 TWLPFCIYLDG 325

BLAST of Csa1G043300 vs. TrEMBL
Match: F6HE74_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_16s0039g00160 PE=4 SV=1)

HSP 1 Score: 465.3 bits (1196), Expect = 6.3e-128
Identity = 220/340 (64.71%), Postives = 269/340 (79.12%), Query Frame = 1

Query: 1   MVATFVPFCSNFTLKLRPSNPPSILAPLRKNQRGAKLKCFGVARPHILPSA-FGEDGGLR 60
           MV   VP C   + KL  S   S     R+ +R  K  C  + RP I+PS+   E  G R
Sbjct: 1   MVLEIVPSCLGLSQKLSHSIHLSKQIASREYERDVKNSC--IIRPQIMPSSNHAEAVGPR 60

Query: 61  VFVLSDLHTDYDENMNWIHSLSLDKYRDDVLIVPGDVAETISNFVSTMAMLKDRFERVFF 120
           VF++SDLH DY ENM W+  LS  +++ DVL+V GDVAET  NFV TM++L D+FE VF+
Sbjct: 61  VFMISDLHADYSENMTWMKDLSTMRHKKDVLLVAGDVAETYHNFVLTMSLLTDKFEYVFY 120

Query: 121 VPGNHDLWCRREEDNYLDSIEKMSKLLDACRDLGVDTNPAILNGLGIVPLFSWYHESFDR 180
           VPGNHDLWCRREE++ L+S++K++KLLDAC+ LGV T+P I++GLGI+PLFSWYHESFD+
Sbjct: 121 VPGNHDLWCRREEEDSLNSLDKLNKLLDACKRLGVQTSPMIIDGLGIIPLFSWYHESFDK 180

Query: 181 EMDLQGIRIPSLEMVCKDFHACKWPGDLSNEGASLALFFDAMNEKNNIMIEKIRRTCSQI 240
           E D+  + IPSLEM CKDFHACKWP +LSN   SLAL+FDAMNEKN  +I++I+  CSQI
Sbjct: 181 EEDITEVFIPSLEMACKDFHACKWPEELSNRDTSLALYFDAMNEKNQDLIKEIQSECSQI 240

Query: 241 ITFSHFVPRPELCPEKRMLFYPKLPKIIGSDYLEDRIRSIHGSKETTSACHVFGHTHFCW 300
           ITFSHF+PR ELCPEKRMLFYP LPKIIGSD+LE R+RSIHG++ + SACHVFGHTHFCW
Sbjct: 241 ITFSHFLPRQELCPEKRMLFYPNLPKIIGSDFLEVRLRSIHGAEGSASACHVFGHTHFCW 300

Query: 301 DLVLDGIRYVQAPLAYPRERKKRMNGGEDWLPFCIFSNGR 340
           D +LDGIRYVQAPLAYPRERK+RMNGGEDWLPFCI+ +G+
Sbjct: 301 DSMLDGIRYVQAPLAYPRERKRRMNGGEDWLPFCIYCDGK 338

BLAST of Csa1G043300 vs. NCBI nr
Match: gi|778657558|ref|XP_011651098.1| (PREDICTED: uncharacterized protein LOC101212697 isoform X2 [Cucumis sativus])

HSP 1 Score: 714.1 bits (1842), Expect = 1.1e-202
Identity = 340/340 (100.00%), Postives = 340/340 (100.00%), Query Frame = 1

Query: 1   MVATFVPFCSNFTLKLRPSNPPSILAPLRKNQRGAKLKCFGVARPHILPSAFGEDGGLRV 60
           MVATFVPFCSNFTLKLRPSNPPSILAPLRKNQRGAKLKCFGVARPHILPSAFGEDGGLRV
Sbjct: 1   MVATFVPFCSNFTLKLRPSNPPSILAPLRKNQRGAKLKCFGVARPHILPSAFGEDGGLRV 60

Query: 61  FVLSDLHTDYDENMNWIHSLSLDKYRDDVLIVPGDVAETISNFVSTMAMLKDRFERVFFV 120
           FVLSDLHTDYDENMNWIHSLSLDKYRDDVLIVPGDVAETISNFVSTMAMLKDRFERVFFV
Sbjct: 61  FVLSDLHTDYDENMNWIHSLSLDKYRDDVLIVPGDVAETISNFVSTMAMLKDRFERVFFV 120

Query: 121 PGNHDLWCRREEDNYLDSIEKMSKLLDACRDLGVDTNPAILNGLGIVPLFSWYHESFDRE 180
           PGNHDLWCRREEDNYLDSIEKMSKLLDACRDLGVDTNPAILNGLGIVPLFSWYHESFDRE
Sbjct: 121 PGNHDLWCRREEDNYLDSIEKMSKLLDACRDLGVDTNPAILNGLGIVPLFSWYHESFDRE 180

Query: 181 MDLQGIRIPSLEMVCKDFHACKWPGDLSNEGASLALFFDAMNEKNNIMIEKIRRTCSQII 240
           MDLQGIRIPSLEMVCKDFHACKWPGDLSNEGASLALFFDAMNEKNNIMIEKIRRTCSQII
Sbjct: 181 MDLQGIRIPSLEMVCKDFHACKWPGDLSNEGASLALFFDAMNEKNNIMIEKIRRTCSQII 240

Query: 241 TFSHFVPRPELCPEKRMLFYPKLPKIIGSDYLEDRIRSIHGSKETTSACHVFGHTHFCWD 300
           TFSHFVPRPELCPEKRMLFYPKLPKIIGSDYLEDRIRSIHGSKETTSACHVFGHTHFCWD
Sbjct: 241 TFSHFVPRPELCPEKRMLFYPKLPKIIGSDYLEDRIRSIHGSKETTSACHVFGHTHFCWD 300

Query: 301 LVLDGIRYVQAPLAYPRERKKRMNGGEDWLPFCIFSNGRK 341
           LVLDGIRYVQAPLAYPRERKKRMNGGEDWLPFCIFSNGRK
Sbjct: 301 LVLDGIRYVQAPLAYPRERKKRMNGGEDWLPFCIFSNGRK 340

BLAST of Csa1G043300 vs. NCBI nr
Match: gi|449439733|ref|XP_004137640.1| (PREDICTED: uncharacterized protein LOC101212697 isoform X1 [Cucumis sativus])

HSP 1 Score: 712.2 bits (1837), Expect = 4.3e-202
Identity = 339/339 (100.00%), Postives = 339/339 (100.00%), Query Frame = 1

Query: 1   MVATFVPFCSNFTLKLRPSNPPSILAPLRKNQRGAKLKCFGVARPHILPSAFGEDGGLRV 60
           MVATFVPFCSNFTLKLRPSNPPSILAPLRKNQRGAKLKCFGVARPHILPSAFGEDGGLRV
Sbjct: 1   MVATFVPFCSNFTLKLRPSNPPSILAPLRKNQRGAKLKCFGVARPHILPSAFGEDGGLRV 60

Query: 61  FVLSDLHTDYDENMNWIHSLSLDKYRDDVLIVPGDVAETISNFVSTMAMLKDRFERVFFV 120
           FVLSDLHTDYDENMNWIHSLSLDKYRDDVLIVPGDVAETISNFVSTMAMLKDRFERVFFV
Sbjct: 61  FVLSDLHTDYDENMNWIHSLSLDKYRDDVLIVPGDVAETISNFVSTMAMLKDRFERVFFV 120

Query: 121 PGNHDLWCRREEDNYLDSIEKMSKLLDACRDLGVDTNPAILNGLGIVPLFSWYHESFDRE 180
           PGNHDLWCRREEDNYLDSIEKMSKLLDACRDLGVDTNPAILNGLGIVPLFSWYHESFDRE
Sbjct: 121 PGNHDLWCRREEDNYLDSIEKMSKLLDACRDLGVDTNPAILNGLGIVPLFSWYHESFDRE 180

Query: 181 MDLQGIRIPSLEMVCKDFHACKWPGDLSNEGASLALFFDAMNEKNNIMIEKIRRTCSQII 240
           MDLQGIRIPSLEMVCKDFHACKWPGDLSNEGASLALFFDAMNEKNNIMIEKIRRTCSQII
Sbjct: 181 MDLQGIRIPSLEMVCKDFHACKWPGDLSNEGASLALFFDAMNEKNNIMIEKIRRTCSQII 240

Query: 241 TFSHFVPRPELCPEKRMLFYPKLPKIIGSDYLEDRIRSIHGSKETTSACHVFGHTHFCWD 300
           TFSHFVPRPELCPEKRMLFYPKLPKIIGSDYLEDRIRSIHGSKETTSACHVFGHTHFCWD
Sbjct: 241 TFSHFVPRPELCPEKRMLFYPKLPKIIGSDYLEDRIRSIHGSKETTSACHVFGHTHFCWD 300

Query: 301 LVLDGIRYVQAPLAYPRERKKRMNGGEDWLPFCIFSNGR 340
           LVLDGIRYVQAPLAYPRERKKRMNGGEDWLPFCIFSNGR
Sbjct: 301 LVLDGIRYVQAPLAYPRERKKRMNGGEDWLPFCIFSNGR 339

BLAST of Csa1G043300 vs. NCBI nr
Match: gi|659067013|ref|XP_008437211.1| (PREDICTED: uncharacterized protein LOC103482705 [Cucumis melo])

HSP 1 Score: 670.6 bits (1729), Expect = 1.4e-189
Identity = 322/339 (94.99%), Postives = 327/339 (96.46%), Query Frame = 1

Query: 1   MVATFVPFCSNFTLKLRPSNPPSILAPLRKNQRGAKLKCFGVARPHILPSAFGEDGGLRV 60
           MVATFVPFCSNFTLKL  SNPP ILAPLRKNQRGAKLKC GVARP ILPSAFGED GLRV
Sbjct: 1   MVATFVPFCSNFTLKLCHSNPPLILAPLRKNQRGAKLKCCGVARPQILPSAFGEDDGLRV 60

Query: 61  FVLSDLHTDYDENMNWIHSLSLDKYRDDVLIVPGDVAETISNFVSTMAMLKDRFERVFFV 120
           FVLSDLHTDYDENMNWIHSLS DKYRDDVLIV GDVAETISNFVSTMA LKDRFERVFFV
Sbjct: 61  FVLSDLHTDYDENMNWIHSLSSDKYRDDVLIVAGDVAETISNFVSTMATLKDRFERVFFV 120

Query: 121 PGNHDLWCRREEDNYLDSIEKMSKLLDACRDLGVDTNPAILNGLGIVPLFSWYHESFDRE 180
           PGNHDLWCRRE DNYLDS+EKMSKLLDAC DLGVDTNPAILNGLGIVPLFSWYHESFDRE
Sbjct: 121 PGNHDLWCRREGDNYLDSLEKMSKLLDACIDLGVDTNPAILNGLGIVPLFSWYHESFDRE 180

Query: 181 MDLQGIRIPSLEMVCKDFHACKWPGDLSNEGASLALFFDAMNEKNNIMIEKIRRTCSQII 240
           MDLQGIRIPSLEMVCKDFHACKWPGDLSNEGASLALFFDAMNEKNNI+IEKIRRTCSQII
Sbjct: 181 MDLQGIRIPSLEMVCKDFHACKWPGDLSNEGASLALFFDAMNEKNNIIIEKIRRTCSQII 240

Query: 241 TFSHFVPRPELCPEKRMLFYPKLPKIIGSDYLEDRIRSIHGSKETTSACHVFGHTHFCWD 300
           TFSHFVPR ELCPEKRMLFYPKLPKIIGSDYLEDRIRSIHGSKETTSACHVFGHTHFCWD
Sbjct: 241 TFSHFVPRLELCPEKRMLFYPKLPKIIGSDYLEDRIRSIHGSKETTSACHVFGHTHFCWD 300

Query: 301 LVLDGIRYVQAPLAYPRERKKRMNGGEDWLPFCIFSNGR 340
           LVLDGIRYVQAPLAYPRERK+RMNGGE+WLPFCI+SNGR
Sbjct: 301 LVLDGIRYVQAPLAYPRERKRRMNGGENWLPFCIYSNGR 339

BLAST of Csa1G043300 vs. NCBI nr
Match: gi|566211149|ref|XP_006372648.1| (hypothetical protein POPTR_0017s03550g [Populus trichocarpa])

HSP 1 Score: 474.2 bits (1219), Expect = 2.0e-130
Identity = 229/350 (65.43%), Postives = 266/350 (76.00%), Query Frame = 1

Query: 1   MVATFVPFCSNFTLKLRPSNPPSILAPLRKNQRGAKLKC------FGVARPHILPSA-FG 60
           MV       S  TL   P  PP    P   N    + +C      + + RP ILP     
Sbjct: 1   MVMVMRSISSCLTLSQTPPPPPR---PRASNHLSTQKQCKRNSRNYCITRPQILPPPPLS 60

Query: 61  EDG-GLRVFVLSDLHTDYDENMNWIHSLSLDKYRDDVLIVPGDVAETISNFVSTMAMLKD 120
            D  G RVFVLSDLHTDY ENMNW+ SLS   Y++D+L++ GDVAET  NF STM++LKD
Sbjct: 61  RDALGFRVFVLSDLHTDYPENMNWVKSLSTKAYKNDLLLLAGDVAETYHNFYSTMSLLKD 120

Query: 121 RFERVFFVPGNHDLWCRREEDN---YLDSIEKMSKLLDACRDLGVDTNPAILNGLGIVPL 180
           RF+ VF+VPGNHDLWCR E +    YLDS++K++KLLDACR LGV T P +L GLGIVPL
Sbjct: 121 RFQHVFYVPGNHDLWCRSEPEGHPYYLDSLDKLNKLLDACRGLGVQTRPMVLYGLGIVPL 180

Query: 181 FSWYHESFDREMDLQGIRIPSLEMVCKDFHACKWPGDLSNEGASLALFFDAMNEKNNIMI 240
           FSWYHESFDREMD+ GIRIPSLEMVCKDFHACKWP ++SN  ASLA +FDAMNE+N   +
Sbjct: 181 FSWYHESFDREMDIAGIRIPSLEMVCKDFHACKWPREISNRSASLASYFDAMNEENEDAV 240

Query: 241 EKIRRTCSQIITFSHFVPRPELCPEKRMLFYPKLPKIIGSDYLEDRIRSIHGSKETTSAC 300
           + I+ TC+QIITFSHF+PR ELCPEKRMLFYP LPKIIGSD+LE RIRSIHGS+   SAC
Sbjct: 241 KLIKNTCTQIITFSHFLPRQELCPEKRMLFYPNLPKIIGSDFLEVRIRSIHGSEGNASAC 300

Query: 301 HVFGHTHFCWDLVLDGIRYVQAPLAYPRERKKRMNGGEDWLPFCIFSNGR 340
           HVFGHTHFCWD VLDGIRY+QAPLAYPRERK+RMNGGE WLPFC++S G+
Sbjct: 301 HVFGHTHFCWDSVLDGIRYIQAPLAYPRERKRRMNGGETWLPFCVYSGGK 347

BLAST of Csa1G043300 vs. NCBI nr
Match: gi|590666858|ref|XP_007037080.1| (Uncharacterized protein isoform 1 [Theobroma cacao])

HSP 1 Score: 473.4 bits (1217), Expect = 3.3e-130
Identity = 219/310 (70.65%), Postives = 260/310 (83.87%), Query Frame = 1

Query: 29  RKNQRGAKLKCFGVARPHILPSAFGEDGGLRVFVLSDLHTDYDENMNWIHSLSLDKYRDD 88
           +K +RG K  C  + RP ILP+A  +  GLRVFVLSDLHTDY ENM W+ S    ++  D
Sbjct: 19  KKYRRGIKRGC--IKRPQILPTALRDGFGLRVFVLSDLHTDYPENMAWVRSFPTKRHNKD 78

Query: 89  VLIVPGDVAETISNFVSTMAMLKDRFERVFFVPGNHDLWCRREEDNYLDSIEKMSKLLDA 148
           VL+V GDVAE   NFV TM++L+DRFE VF+VPGNHDLWCR E D+ LDS++K++KLLDA
Sbjct: 79  VLLVAGDVAEMYDNFVLTMSLLRDRFEYVFYVPGNHDLWCRWERDD-LDSLQKLNKLLDA 138

Query: 149 CRDLGVDTNPAILNGLGIVPLFSWYHESFDREMDLQGIRIPSLEMVCKDFHACKWPGDLS 208
           CR LGV+TNP +++GLGIVPLFSWYHESFDRE D+ GIRIPSL+M CKDF ACKWPG+LS
Sbjct: 139 CRQLGVETNPVVIDGLGIVPLFSWYHESFDREEDITGIRIPSLDMACKDFRACKWPGNLS 198

Query: 209 NEGASLALFFDAMNEKNNIMIEKIRRTCSQIITFSHFVPRPELCPEKRMLFYPKLPKIIG 268
           N  +SLAL+FDAMNE N   +++I+ TCSQIITFSHFVPR ELCPEKRMLFYP LPKIIG
Sbjct: 199 NRDSSLALYFDAMNENNQDTVKQIQSTCSQIITFSHFVPRQELCPEKRMLFYPNLPKIIG 258

Query: 269 SDYLEDRIRSIHGSKETTSACHVFGHTHFCWDLVLDGIRYVQAPLAYPRERKKRMNGGED 328
           SD+LEDRIRSIHG + ++ ACHVFGHTHFCWD ++DGIRYVQAPLAYPRER++RMNGGE 
Sbjct: 259 SDWLEDRIRSIHGIEGSSFACHVFGHTHFCWDAIVDGIRYVQAPLAYPRERRRRMNGGET 318

Query: 329 WLPFCIFSNG 339
           WLPFCI+S+G
Sbjct: 319 WLPFCIYSDG 325

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
A0A0A0LR12_CUCSA7.8e-203100.00Uncharacterized protein OS=Cucumis sativus GN=Csa_1G043300 PE=4 SV=1[more]
U5FHK5_POPTR1.4e-13065.43Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0017s03550g PE=4 SV=1[more]
A0A061FWD8_THECC2.3e-13070.65Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_013436 PE=4 SV=1[more]
A0A0D2UBV7_GOSRA2.6e-12969.45Uncharacterized protein OS=Gossypium raimondii GN=B456_008G293400 PE=4 SV=1[more]
F6HE74_VITVI6.3e-12864.71Putative uncharacterized protein OS=Vitis vinifera GN=VIT_16s0039g00160 PE=4 SV=... [more]
Match NameE-valueIdentityDescription
gi|778657558|ref|XP_011651098.1|1.1e-202100.00PREDICTED: uncharacterized protein LOC101212697 isoform X2 [Cucumis sativus][more]
gi|449439733|ref|XP_004137640.1|4.3e-202100.00PREDICTED: uncharacterized protein LOC101212697 isoform X1 [Cucumis sativus][more]
gi|659067013|ref|XP_008437211.1|1.4e-18994.99PREDICTED: uncharacterized protein LOC103482705 [Cucumis melo][more]
gi|566211149|ref|XP_006372648.1|2.0e-13065.43hypothetical protein POPTR_0017s03550g [Populus trichocarpa][more]
gi|590666858|ref|XP_007037080.1|3.3e-13070.65Uncharacterized protein isoform 1 [Theobroma cacao][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR004843Calcineurin-like_PHP_ApaH
Vocabulary: Molecular Function
TermDefinition
GO:0016787hydrolase activity
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008152 metabolic process
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0016787 hydrolase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Csa1G043300.1Csa1G043300.1mRNA


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR004843Calcineurin-like phosphoesterase domain, apaH typePFAMPF00149Metallophoscoord: 58..161
score: 1.
NoneNo IPR availablePANTHERPTHR36492FAMILY NOT NAMEDcoord: 1..339
score: 1.3E
NoneNo IPR availablePANTHERPTHR36492:SF2SUBFAMILY NOT NAMEDcoord: 1..339
score: 1.3E