Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
CTTCTTCATGTGCTTTCTTTGGTAAAAGAGTTTTGAGATATTAATCCCAATGGCTTCTATCAATAATAGTGAATACTATTACCCTTGGTTTCCTCTTCCACCTTATCATCCTTTTCAGCCACCACCACACAACCCAATTGCTCCACCATCTCCTCCCCTCTTTCCACCACACAAACCAATTTCTCCCCCGTCGCCTAAGCCAAGTACGCCACCACCACCGCCCCCACCGCCACCTCACAAACCCGTCACGCCTCCACCACCACCACCACGCAAGCCAATTGCGCCTCCCCCACCTCAAAAGCCAACTGCTCCCCCGCCACCACGCAAGCCACTTGTGCCTCCTCCTCCTCCTCAAAGGCCAACTGCTCCCCCGCCTCCACGAAAGCCAATTGCGCCTCCTCCTCCGCCACCTCAAAAGCCAACTGCTCCCCCGCCTCCACGCAAGCCAATTGTGCCTCCTCCTCCGCCACCTCAAAAGCCAACTGCTCCCCCACCTCCACGCAAGCCAATTGTGCCTCCTCCTCCGCCACCTCAAAAGCCAACTGCTCCCCCACCCCCACGCAAGCCAATTGGCCCTCCTCCGCCTCCTCAGAAGCCAACGGCTCCTCCCCCACCACGTAAGCCAATTGCGCCTCCCCCACCCAAAAAGCCAATTGCTCCTCCACCACCACGCAAGCCCATTGCGCCTCCACCACCGAAAAAACCAAATGCTCCTCCACCGCCACCGAAAAAACCAGTTGCTCCTCCACCACCACACAAGCCAATCGCTCCTCCGCCACCGTACAAGCCAATATCTCCGCCATCTCCAATGCTTCCGCCGCCACCGCCTCCTCCTCACCACCACCCGACAGTAATCATCATAGTGTTTGTGTCATTAGGTGGGCTTTGCCTGCTTGGATTCATGGCAGCTGCACTCTTCTGTTTTGTTAAAAAGAGAAAAGAGAAAAGCGTGGAAGAAACAGAGATCATCCACATTGACGAACACAGGAAGATCAAAGAAGCCATAGTTGAGGGACCTCATGGATCATGCCAAACTGTGGTTCTATCAGTTGAAGATGACATCCATGTTAATGAAGAAATCATAAGGTGTGAGAAGATTGGGGGGAAAAGGACGTTGCATAGTACAAATGAAGCAGGAGATCCAAGCAGCATAGAAGATCAGCCACAGCCACCAATACCATCTCTCACTCATCAAAAGCATTCATGA
mRNA sequence
CTTCTTCATGTGCTTTCTTTGGTAAAAGAGTTTTGAGATATTAATCCCAATGGCTTCTATCAATAATAGTGAATACTATTACCCTTGGTTTCCTCTTCCACCTTATCATCCTTTTCAGCCACCACCACACAACCCAATTGCTCCACCATCTCCTCCCCTCTTTCCACCACACAAACCAATTTCTCCCCCGTCGCCTAAGCCAAGTACGCCACCACCACCGCCCCCACCGCCACCTCACAAACCCGTCACGCCTCCACCACCACCACCACGCAAGCCAATTGCGCCTCCCCCACCTCAAAAGCCAACTGCTCCCCCGCCACCACGCAAGCCACTTGTGCCTCCTCCTCCTCCTCAAAGGCCAACTGCTCCCCCGCCTCCACGAAAGCCAATTGCGCCTCCTCCTCCGCCACCTCAAAAGCCAACTGCTCCCCCGCCTCCACGCAAGCCAATTGTGCCTCCTCCTCCGCCACCTCAAAAGCCAACTGCTCCCCCACCTCCACGCAAGCCAATTGTGCCTCCTCCTCCGCCACCTCAAAAGCCAACTGCTCCCCCACCCCCACGCAAGCCAATTGGCCCTCCTCCGCCTCCTCAGAAGCCAACGGCTCCTCCCCCACCACGTAAGCCAATTGCGCCTCCCCCACCCAAAAAGCCAATTGCTCCTCCACCACCACGCAAGCCCATTGCGCCTCCACCACCGAAAAAACCAAATGCTCCTCCACCGCCACCGAAAAAACCAGTTGCTCCTCCACCACCACACAAGCCAATCGCTCCTCCGCCACCGTACAAGCCAATATCTCCGCCATCTCCAATGCTTCCGCCGCCACCGCCTCCTCCTCACCACCACCCGACAGTAATCATCATAGTGTTTGTGTCATTAGGTGGGCTTTGCCTGCTTGGATTCATGGCAGCTGCACTCTTCTGTTTTGTTAAAAAGAGAAAAGAGAAAAGCGTGGAAGAAACAGAGATCATCCACATTGACGAACACAGGAAGATCAAAGAAGCCATAGTTGAGGGACCTCATGGATCATGCCAAACTGTGGTTCTATCAGTTGAAGATGACATCCATGTTAATGAAGAAATCATAAGGTGTGAGAAGATTGGGGGGAAAAGGACGTTGCATAGTACAAATGAAGCAGGAGATCCAAGCAGCATAGAAGATCAGCCACAGCCACCAATACCATCTCTCACTCATCAAAAGCATTCATGA
Coding sequence (CDS)
ATGGCTTCTATCAATAATAGTGAATACTATTACCCTTGGTTTCCTCTTCCACCTTATCATCCTTTTCAGCCACCACCACACAACCCAATTGCTCCACCATCTCCTCCCCTCTTTCCACCACACAAACCAATTTCTCCCCCGTCGCCTAAGCCAAGTACGCCACCACCACCGCCCCCACCGCCACCTCACAAACCCGTCACGCCTCCACCACCACCACCACGCAAGCCAATTGCGCCTCCCCCACCTCAAAAGCCAACTGCTCCCCCGCCACCACGCAAGCCACTTGTGCCTCCTCCTCCTCCTCAAAGGCCAACTGCTCCCCCGCCTCCACGAAAGCCAATTGCGCCTCCTCCTCCGCCACCTCAAAAGCCAACTGCTCCCCCGCCTCCACGCAAGCCAATTGTGCCTCCTCCTCCGCCACCTCAAAAGCCAACTGCTCCCCCACCTCCACGCAAGCCAATTGTGCCTCCTCCTCCGCCACCTCAAAAGCCAACTGCTCCCCCACCCCCACGCAAGCCAATTGGCCCTCCTCCGCCTCCTCAGAAGCCAACGGCTCCTCCCCCACCACGTAAGCCAATTGCGCCTCCCCCACCCAAAAAGCCAATTGCTCCTCCACCACCACGCAAGCCCATTGCGCCTCCACCACCGAAAAAACCAAATGCTCCTCCACCGCCACCGAAAAAACCAGTTGCTCCTCCACCACCACACAAGCCAATCGCTCCTCCGCCACCGTACAAGCCAATATCTCCGCCATCTCCAATGCTTCCGCCGCCACCGCCTCCTCCTCACCACCACCCGACAGTAATCATCATAGTGTTTGTGTCATTAGGTGGGCTTTGCCTGCTTGGATTCATGGCAGCTGCACTCTTCTGTTTTGTTAAAAAGAGAAAAGAGAAAAGCGTGGAAGAAACAGAGATCATCCACATTGACGAACACAGGAAGATCAAAGAAGCCATAGTTGAGGGACCTCATGGATCATGCCAAACTGTGGTTCTATCAGTTGAAGATGACATCCATGTTAATGAAGAAATCATAAGGTGTGAGAAGATTGGGGGGAAAAGGACGTTGCATAGTACAAATGAAGCAGGAGATCCAAGCAGCATAGAAGATCAGCCACAGCCACCAATACCATCTCTCACTCATCAAAAGCATTCATGA
Protein sequence
MASINNSEYYYPWFPLPPYHPFQPPPHNPIAPPSPPLFPPHKPISPPSPKPSTPPPPPPPPPHKPVTPPPPPPRKPIAPPPPQKPTAPPPPRKPLVPPPPPQRPTAPPPPRKPIAPPPPPPQKPTAPPPPRKPIVPPPPPPQKPTAPPPPRKPIVPPPPPPQKPTAPPPPRKPIGPPPPPQKPTAPPPPRKPIAPPPPKKPIAPPPPRKPIAPPPPKKPNAPPPPPKKPVAPPPPHKPIAPPPPYKPISPPSPMLPPPPPPPHHHPTVIIIVFVSLGGLCLLGFMAAALFCFVKKRKEKSVEETEIIHIDEHRKIKEAIVEGPHGSCQTVVLSVEDDIHVNEEIIRCEKIGGKRTLHSTNEAGDPSSIEDQPQPPIPSLTHQKHS*
Homology
BLAST of CSPI06G18120 vs. ExPASy Swiss-Prot
Match:
C1PGW1 (Protein TRACHEARY ELEMENT DIFFERENTIATION-RELATED 7A OS=Zinnia violacea OX=34245 GN=TED7 PE=2 SV=1)
HSP 1 Score: 63.5 bits (153), Expect = 5.9e-09
Identity = 149/350 (42.57%), Postives = 187/350 (53.43%), Query Frame = 0
Query: 28 NPIAPPSPPLFPPHKPISPPSPKPSTPPPPPP----PPPHKPVTPPPPPPRKPIAPPPPQ 87
+P++ P FPP P + P P P+TP PPP PPPH + PPP
Sbjct: 3 SPLSQSVFPHFPPPSPAATPPPAPTTPSTPPPHFISPPPH--------------SVPPPS 62
Query: 88 KPTAPPPPRKPLVPPPPPQRPTAPPPPRKPIAPPPPPPQKPTAPPPPRKPIVPPPPPPQK 147
P + PPP P VPPP P P +PPP PPP P P +PPP VPPP PP
Sbjct: 63 PPHSVPPPLHP-VPPPSPPHPVSPPPH----TVPPPSPPHPVSPPP---HTVPPPSPPH- 122
Query: 148 PTAPPPPRKPIVPPPPPPQKPTAPPPPRKPIGPPPPPQKPTAPPPPRKPIAPPPPKKPIA 207
P+ PPP T PPP P PPP P PPP P A PPP
Sbjct: 123 ---------PVFPPP-----HTVPPP--SPHFVPPP---PNMVPPPSPPHANPPP----- 182
Query: 208 PPPPRKPIAPPPPKKPNAPPPPPKKPVAPPPPHKPIAPPPPYKPISPPSPMLPPPPPPPH 267
PPPP PPPP PPPPP + PPP H ++PPPP+ ++PPPPP P
Sbjct: 183 PPPPHS--VPPPPH--TVPPPPPPPHIIPPPAH-ALSPPPPH--------IIPPPPPSPS 242
Query: 268 HHPTVIIIVFVSLGGLCLLGFMAAALFCFVKKRKEKSVEETEIIHIDEHRKIKEAIVEGP 327
+H T I+++FVS GG+ L F AAL+CF+KK+K+K V++ E IH DEHRK+ E I +GP
Sbjct: 243 NHSTTIVVIFVSCGGVFFLAFAMAALWCFLKKKKKKMVQKAENIHFDEHRKVTERIEQGP 291
Query: 328 HGSCQTVVLSVEDDIHVNEEIIRCEKIGGKRTLHSTN------EAGDPSS 368
HG+ +T +LSVEDDIH+ E+I + E ++ LH + G PSS
Sbjct: 303 HGT-ETAILSVEDDIHIEEDIKKSELENFRKGLHLNYGNTYNIDTGKPSS 291
BLAST of CSPI06G18120 vs. ExPASy TrEMBL
Match:
A0A0A0KFA4 (Carboxypeptidase OS=Cucumis sativus OX=3659 GN=Csa_6G338160 PE=3 SV=1)
HSP 1 Score: 548.1 bits (1411), Expect = 2.9e-152
Identity = 357/364 (98.08%), Postives = 358/364 (98.35%), Query Frame = 0
Query: 24 PPPHNPIAPPSPPLFPPHKPISPPSPKPSTPPPPPPPPPHKPVTPPPPPPRKPIAPPPPQ 83
PPPHNPIAPPSPPLFPPHKPISPPSPKPSTPPPPPPPPPHKPVTPPPPPPRKPIAPPPPQ
Sbjct: 443 PPPHNPIAPPSPPLFPPHKPISPPSPKPSTPPPPPPPPPHKPVTPPPPPPRKPIAPPPPQ 502
Query: 84 KPTAPPPPRKPLVPPPPPQRPTAPPPPRKPIAPPPPPPQKPTAPPPPRKPIVPPPPPPQK 143
KPTAPPPPRKPLVPPPPPQRPTAPPPPRKPI PPPPPPQKPTAPPPPRKPIVPPPPPPQK
Sbjct: 503 KPTAPPPPRKPLVPPPPPQRPTAPPPPRKPIVPPPPPPQKPTAPPPPRKPIVPPPPPPQK 562
Query: 144 PTAPPPPRKPIVPPPPPPQKPTAPPPPRKPI-GPPPPPQKPTAPPPPRKPIA-PPPPKKP 203
PTAPPPPRKPIVPPPPPPQKPTAPPPPRKPI PPPPPQKPTAPPPPRKPI PPPP+KP
Sbjct: 563 PTAPPPPRKPIVPPPPPPQKPTAPPPPRKPIVPPPPPPQKPTAPPPPRKPIGPPPPPQKP 622
Query: 204 IAPPPPRKPIAPPPPKKPNAPPPPPKKPVAPPPPHKPIAPPPPYKPISPPSPMLPPPPPP 263
APPPPRKPIAPPPPKKPNAPPPPPKKPVAPPPPHKPIAPPPPYKPISPPSPMLPPPPPP
Sbjct: 623 TAPPPPRKPIAPPPPKKPNAPPPPPKKPVAPPPPHKPIAPPPPYKPISPPSPMLPPPPPP 682
Query: 264 PHHHPTVIIIVFVSLGGLCLLGFMAAALFCFVKKRKEKSVEETEIIHIDEHRKIKEAIVE 323
PHHHPTVIIIVFVSLGGLCLLGFMAAALFCFVKKRKEKSVEETEIIHIDEHRKIKEAIVE
Sbjct: 683 PHHHPTVIIIVFVSLGGLCLLGFMAAALFCFVKKRKEKSVEETEIIHIDEHRKIKEAIVE 742
Query: 324 GPHGSCQTVVLSVEDDIHVNEEIIRCEKIGGKRTLHSTNEAGDPSSIEDQPQPPIPSLTH 383
GPHGSCQTVVLSVEDDIHVNEEIIRCEKIGGKRTLHSTNEAGDPSSIEDQPQPPIPSLTH
Sbjct: 743 GPHGSCQTVVLSVEDDIHVNEEIIRCEKIGGKRTLHSTNEAGDPSSIEDQPQPPIPSLTH 802
Query: 384 QKHS 386
QKHS
Sbjct: 803 QKHS 806
BLAST of CSPI06G18120 vs. ExPASy TrEMBL
Match:
A0A5A7USI7 (Basic salivary proline-rich protein 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1607G00550 PE=4 SV=1)
HSP 1 Score: 470.3 bits (1209), Expect = 7.7e-129
Identity = 325/387 (83.98%), Postives = 336/387 (86.82%), Query Frame = 0
Query: 1 MASINNSEYYYPWFPLPPYHPFQPPPHNPIAPPSPPLFPPHKPISPPSPKPSTPPPPPPP 60
MASINNSEYYYPWFP HPFQPPPH P APPSPPLFPPH PISPP KPSTPPPP
Sbjct: 1 MASINNSEYYYPWFP----HPFQPPPHYPTAPPSPPLFPPHSPISPPPRKPSTPPPP--- 60
Query: 61 PPHKPVTPPPPPPRKPIA-PPPPQKPTAPPPPRKPLVPPPPPQRPTAPPPPRKPIAPPPP 120
PPPPPPRKPI PPPPQKPTAPPPPRKP+VPPPPPQ+PTAPPPPRKPI PPP
Sbjct: 61 -------PPPPPPRKPIVPPPPPQKPTAPPPPRKPIVPPPPPQKPTAPPPPRKPIV-PPP 120
Query: 121 PPQKPTAPPPPRKPIVPPPPPPQKPTAPPPPRKPIVPPPPPPQKPTAPPPPRKPIGPPPP 180
PPQKPTAPPPPRKPI PPPPPQ+PTAPPPPRKPI PPPP+KP APPPPRKPI PPP
Sbjct: 121 PPQKPTAPPPPRKPI-GPPPPPQRPTAPPPPRKPIA--PPPPKKPLAPPPPRKPIA-PPP 180
Query: 181 PQKPTAPPPPRKPIAPPPPKKPIAPPPPRKPIAPPPPKKPNAPPPPPKKPVAPPPPHKPI 240
P+KP AP PPPPKKP+APPPP+KP APPPP K PPPP+KP+ PPPPHKPI
Sbjct: 181 PKKPNAP--------PPPPKKPVAPPPPKKPNAPPPPPKKPVAPPPPRKPITPPPPHKPI 240
Query: 241 APPPPYKPISPPSPML-PPPPPPPHHHPTVIIIVFVSLGGLCLLGFMAAALFCFVKKRKE 300
APPPPYKPISPPSPML PPPPPPPHHHPTVII+VFVSLGGLCLLGFMAAALFCFVKKRKE
Sbjct: 241 APPPPYKPISPPSPMLPPPPPPPPHHHPTVIIVVFVSLGGLCLLGFMAAALFCFVKKRKE 300
Query: 301 KSVEETEIIHIDEHRKIKEAIVEGPHGSCQTVVLSVEDDIHVNEEIIRCEKIGGKRTLHS 360
KSVEETEIIHIDEHRKIKEAIVEGPHGSCQTVVLSVEDDIHVNEEIIRCEKIG KRTLHS
Sbjct: 301 KSVEETEIIHIDEHRKIKEAIVEGPHGSCQTVVLSVEDDIHVNEEIIRCEKIGEKRTLHS 360
Query: 361 TNEAGDPSSIEDQPQPPIPSLTHQKHS 386
TNEAGDP +IEDQPQPPIPS THQKHS
Sbjct: 361 TNEAGDPGTIEDQPQPPIPSFTHQKHS 360
BLAST of CSPI06G18120 vs. ExPASy TrEMBL
Match:
A0A6J1K1F3 (basic salivary proline-rich protein 2-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111489761 PE=4 SV=1)
HSP 1 Score: 373.6 bits (958), Expect = 9.8e-100
Identity = 291/387 (75.19%), Postives = 317/387 (81.91%), Query Frame = 0
Query: 1 MASINNSEYYYPWFPLPPYHPFQPPPHNPIAPPSPPLFPPHKPISPPSPKPSTPPPPPPP 60
M S+N S+ +P+FPLPPYHPF P P+ PP P+ PP+KP+ PP P P PP
Sbjct: 1 MPSLNTSD--HPYFPLPPYHPFWP----PLPPPQNPIAPPYKPVRPPPPPRQ---PTAPP 60
Query: 61 PPHKPVTPPPPPPRKPIA--PPPPQKPTAPPPPRKPLVPPPPPQRPTAPPPPRKPIAPPP 120
PP +P+ PPPPPR+PIA PPPPQKP APPPPRKP+ PP P P PPPPRKPI PPP
Sbjct: 61 PPRQPIV-PPPPPRQPIAPSPPPPQKPNAPPPPRKPIAPPTAP--PPPPPPPRKPIVPPP 120
Query: 121 PPPQKPTAPPPPRKPIVPPPPPPQKPTAPPPPRKPIVPPPPPPQKPTAPPPPRKPIGPPP 180
PP +KPTAPPPPRKPIV P PPP KPTAPPPPRKPI P PPPQKPTAPPPPRKPI PPP
Sbjct: 121 PPSRKPTAPPPPRKPIV-PTPPPHKPTAPPPPRKPIA-PTPPPQKPTAPPPPRKPIVPPP 180
Query: 181 PPQKPTAPPPPRKPIAPPPPKKPIAPPPPRKPIAPPPPKKPNAPPPPPKKPVAPPPPHKP 240
PPQKPTAPPPPRKPI PPPP KP APPPPRKPIAPPPPKKP A PPPP+KP+ PPPPHKP
Sbjct: 181 PPQKPTAPPPPRKPIVPPPPHKPTAPPPPRKPIAPPPPKKPVA-PPPPRKPITPPPPHKP 240
Query: 241 IAPPPPYKPISPPSPMLPPPPPPPHHHPTVIIIVFVSLGGLCLLGFMAAALFCFVKKRKE 300
IAPP PYKPISPPSPMLPPPPPP +HHPTVIIIV VSLGGL LL FMAAA FCFVKKRKE
Sbjct: 241 IAPPTPYKPISPPSPMLPPPPPPSNHHPTVIIIVCVSLGGLFLLAFMAAAFFCFVKKRKE 300
Query: 301 KSVEETEIIHIDEHRKIKEAIVEGPHGSCQTVVLSVEDDIHVNEEIIRCEKIGGKRTLHS 360
K++EETEIIHIDEHRKIKEA V GPHGSC+T+VLSVEDDIH+ EEI+R EK+GGK LH
Sbjct: 301 KTIEETEIIHIDEHRKIKEAKVAGPHGSCETMVLSVEDDIHITEEIVRSEKVGGK-GLHG 360
Query: 361 TNEAGDPSSIEDQPQPPIPSLTHQKHS 386
T+EAGDPSSIE +PQP S +HQ HS
Sbjct: 361 THEAGDPSSIE-EPQP--ASSSHQNHS 368
BLAST of CSPI06G18120 vs. ExPASy TrEMBL
Match:
A0A6J1FHL1 (leucine-rich repeat extensin-like protein 3 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111445428 PE=4 SV=1)
HSP 1 Score: 369.8 bits (948), Expect = 1.4e-98
Identity = 288/391 (73.66%), Postives = 312/391 (79.80%), Query Frame = 0
Query: 1 MASINNSEYYYPWFPLPPYHPFQPPPHNPIAPPSPPLFPPHKPISPPSPKPSTPPPPPPP 60
MAS+N SE YP+FPLPPYHPF PPL PP+KP+ P PPPPPPP
Sbjct: 1 MASLNTSE--YPYFPLPPYHPFW-----------PPLPPPYKPVRP-------PPPPPPP 60
Query: 61 PPHKPVTPPPPPPRKPIA-PPPPQKPTAPPPPRKPLVPPPPP---QRPTAPPPPRKPIAP 120
PP +P PPPPRKPIA PPPPQKPTAPPPPRKP+VPPPPP ++PTAPPPPRKPIAP
Sbjct: 61 PPRQPTA--PPPPRKPIAPPPPPQKPTAPPPPRKPIVPPPPPPPSRKPTAPPPPRKPIAP 120
Query: 121 PPPPPQKPTAPPPPRKPIVPPPPPPQKPTAPPPPRKPIVPPPPPPQKPTAPPPPRKPIGP 180
PTAPPPPRKPIVPPP APPPP KPIV PPPP KPTAPPPPRKPI
Sbjct: 121 -------PTAPPPPRKPIVPPP-------APPPPGKPIV--PPPPHKPTAPPPPRKPIA- 180
Query: 181 PPPPQKPTAPPPPRKPIAPPPPKKPIAPPPPRKPIAPPPPKKPNAPPPPPKKPVA--PPP 240
PPPP+KP APPPPRKPIAPPPPKKP+APPPPRKPIAPPPPKKP A PPPP+KP+ PPP
Sbjct: 181 PPPPKKPVAPPPPRKPIAPPPPKKPVAPPPPRKPIAPPPPKKPVA-PPPPRKPITPPPPP 240
Query: 241 PHKPIAPPPPYKPISPPSPMLPPPPPPPHHHPTVIIIVFVSLGGLCLLGFMAAALFCFVK 300
PHKPIAPP PYKPISPPSPMLPPPPPP +HHPTVIIIV VS+GGL LL FMAAA FCFVK
Sbjct: 241 PHKPIAPPTPYKPISPPSPMLPPPPPPSNHHPTVIIIVCVSMGGLFLLAFMAAAFFCFVK 300
Query: 301 KRKEKSVEETEIIHIDEHRKIKEAIVEGPHGSCQTVVLSVEDDIHVNEEIIRCEKIGGKR 360
KRKEK++EETEIIHIDEHRKIKEAIV GPHGSC+T VLSVEDDIH+ EEI+R EK+ G++
Sbjct: 301 KRKEKTIEETEIIHIDEHRKIKEAIVAGPHGSCETTVLSVEDDIHITEEIVRSEKV-GEK 347
Query: 361 TLHSTNEAGDPSSIEDQPQPPIPSLTHQKHS 386
LH +EAGDPSSIE +PQP S THQ HS
Sbjct: 361 GLHGKHEAGDPSSIE-EPQP--ASSTHQNHS 347
BLAST of CSPI06G18120 vs. ExPASy TrEMBL
Match:
A0A6J1FLZ2 (formin-like protein 20 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111445428 PE=4 SV=1)
HSP 1 Score: 368.2 bits (944), Expect = 4.1e-98
Identity = 290/396 (73.23%), Postives = 314/396 (79.29%), Query Frame = 0
Query: 1 MASINNSEYYYPWFPLPPYHPFQPPPHNPIAPPSPPLFPPHKPISPPSPKPSTPP--PPP 60
MAS+N SE YP+FPLPPYHPF PPL PP+KP+ PP P P PP P
Sbjct: 1 MASLNTSE--YPYFPLPPYHPFW-----------PPLPPPYKPVRPPPPPPPPPPRQPTA 60
Query: 61 PPPPHKPVTPPPPPPRKPIAPPPP----QKPTAPPPPRKPLVPPPPP---QRPTAPPPPR 120
PPPP KP+ PPPPPRKPI PPPP QKPTAPPPPRKP+VPPPPP ++PTAPPPPR
Sbjct: 61 PPPPRKPIA-PPPPPRKPIVPPPPPPPSQKPTAPPPPRKPIVPPPPPPPSRKPTAPPPPR 120
Query: 121 KPIAPPPPPPQKPTAPPPPRKPIVPPPPPPQKPTAPPPPRKPIVPPPPPPQKPTAPPPPR 180
KPIAP PTAPPPPRKPIVPPP APPPP KPIV PPPP KPTAPPPPR
Sbjct: 121 KPIAP-------PTAPPPPRKPIVPPP-------APPPPGKPIV--PPPPHKPTAPPPPR 180
Query: 181 KPIGPPPPPQKPTAPPPPRKPIAPPPPKKPIAPPPPRKPIAPPPPKKPNAPPPPPKKPVA 240
KPI PPPP+KP APPPPRKPIAPPPPKKP+APPPPRKPIAPPPPKKP A PPPP+KP+
Sbjct: 181 KPIA-PPPPKKPVAPPPPRKPIAPPPPKKPVAPPPPRKPIAPPPPKKPVA-PPPPRKPIT 240
Query: 241 --PPPPHKPIAPPPPYKPISPPSPMLPPPPPPPHHHPTVIIIVFVSLGGLCLLGFMAAAL 300
PPPPHKPIAPP PYKPISPPSPMLPPPPPP +HHPTVIIIV VS+GGL LL FMAAA
Sbjct: 241 PPPPPPHKPIAPPTPYKPISPPSPMLPPPPPPSNHHPTVIIIVCVSMGGLFLLAFMAAAF 300
Query: 301 FCFVKKRKEKSVEETEIIHIDEHRKIKEAIVEGPHGSCQTVVLSVEDDIHVNEEIIRCEK 360
FCFVKKRKEK++EETEIIHIDEHRKIKEAIV GPHGSC+T VLSVEDDIH+ EEI+R EK
Sbjct: 301 FCFVKKRKEKTIEETEIIHIDEHRKIKEAIVAGPHGSCETTVLSVEDDIHITEEIVRSEK 360
Query: 361 IGGKRTLHSTNEAGDPSSIEDQPQPPIPSLTHQKHS 386
+ G++ LH +EAGDPSSIE +PQP S THQ HS
Sbjct: 361 V-GEKGLHGKHEAGDPSSIE-EPQP--ASSTHQNHS 360
BLAST of CSPI06G18120 vs. NCBI nr
Match:
KAA0058883.1 (basic salivary proline-rich protein 2 [Cucumis melo var. makuwa] >TYK23740.1 basic salivary proline-rich protein 2 [Cucumis melo var. makuwa])
HSP 1 Score: 470.3 bits (1209), Expect = 1.6e-128
Identity = 325/387 (83.98%), Postives = 336/387 (86.82%), Query Frame = 0
Query: 1 MASINNSEYYYPWFPLPPYHPFQPPPHNPIAPPSPPLFPPHKPISPPSPKPSTPPPPPPP 60
MASINNSEYYYPWFP HPFQPPPH P APPSPPLFPPH PISPP KPSTPPPP
Sbjct: 1 MASINNSEYYYPWFP----HPFQPPPHYPTAPPSPPLFPPHSPISPPPRKPSTPPPP--- 60
Query: 61 PPHKPVTPPPPPPRKPIA-PPPPQKPTAPPPPRKPLVPPPPPQRPTAPPPPRKPIAPPPP 120
PPPPPPRKPI PPPPQKPTAPPPPRKP+VPPPPPQ+PTAPPPPRKPI PPP
Sbjct: 61 -------PPPPPPRKPIVPPPPPQKPTAPPPPRKPIVPPPPPQKPTAPPPPRKPIV-PPP 120
Query: 121 PPQKPTAPPPPRKPIVPPPPPPQKPTAPPPPRKPIVPPPPPPQKPTAPPPPRKPIGPPPP 180
PPQKPTAPPPPRKPI PPPPPQ+PTAPPPPRKPI PPPP+KP APPPPRKPI PPP
Sbjct: 121 PPQKPTAPPPPRKPI-GPPPPPQRPTAPPPPRKPIA--PPPPKKPLAPPPPRKPIA-PPP 180
Query: 181 PQKPTAPPPPRKPIAPPPPKKPIAPPPPRKPIAPPPPKKPNAPPPPPKKPVAPPPPHKPI 240
P+KP AP PPPPKKP+APPPP+KP APPPP K PPPP+KP+ PPPPHKPI
Sbjct: 181 PKKPNAP--------PPPPKKPVAPPPPKKPNAPPPPPKKPVAPPPPRKPITPPPPHKPI 240
Query: 241 APPPPYKPISPPSPML-PPPPPPPHHHPTVIIIVFVSLGGLCLLGFMAAALFCFVKKRKE 300
APPPPYKPISPPSPML PPPPPPPHHHPTVII+VFVSLGGLCLLGFMAAALFCFVKKRKE
Sbjct: 241 APPPPYKPISPPSPMLPPPPPPPPHHHPTVIIVVFVSLGGLCLLGFMAAALFCFVKKRKE 300
Query: 301 KSVEETEIIHIDEHRKIKEAIVEGPHGSCQTVVLSVEDDIHVNEEIIRCEKIGGKRTLHS 360
KSVEETEIIHIDEHRKIKEAIVEGPHGSCQTVVLSVEDDIHVNEEIIRCEKIG KRTLHS
Sbjct: 301 KSVEETEIIHIDEHRKIKEAIVEGPHGSCQTVVLSVEDDIHVNEEIIRCEKIGEKRTLHS 360
Query: 361 TNEAGDPSSIEDQPQPPIPSLTHQKHS 386
TNEAGDP +IEDQPQPPIPS THQKHS
Sbjct: 361 TNEAGDPGTIEDQPQPPIPSFTHQKHS 360
BLAST of CSPI06G18120 vs. NCBI nr
Match:
XP_031742883.1 (basic salivary proline-rich protein 2 [Cucumis sativus])
HSP 1 Score: 454.1 bits (1167), Expect = 1.2e-123
Identity = 295/307 (96.09%), Postives = 296/307 (96.42%), Query Frame = 0
Query: 81 PPQKPTAPPPPRKPLVPPPPPQRPTAPPPPRKPIAPPPPPPQKPTAPPPPRKPIVPPPPP 140
P K PPPRKPLVPPPPPQRPTAPPPPRKPIAPPPPPPQKPTAPPPPRKPIVPPPPP
Sbjct: 37 PTSKANCSPPPRKPLVPPPPPQRPTAPPPPRKPIAPPPPPPQKPTAPPPPRKPIVPPPPP 96
Query: 141 PQKPTAPPPPRKPIVPPPPPPQKPTAPPPPRKPI-GPPPPPQKPTAPPPPRKPIA-PPPP 200
PQKPTAPPPPRKPIVPPPPPPQKPTAPPPPRKPI PPPPPQKPTAPPPPRKPI PPPP
Sbjct: 97 PQKPTAPPPPRKPIVPPPPPPQKPTAPPPPRKPIVPPPPPPQKPTAPPPPRKPIGPPPPP 156
Query: 201 KKPIAPPPPRKPIAPPPPKKPNAPPPPPKKPVAPPPPHKPIAPPPPYKPISPPSPMLPPP 260
+KP APPPPRKPIAPPPPKKPNAPPPPPKKPVAPPPPHKPIAPPPPYKPISPPSPMLPPP
Sbjct: 157 QKPTAPPPPRKPIAPPPPKKPNAPPPPPKKPVAPPPPHKPIAPPPPYKPISPPSPMLPPP 216
Query: 261 PPPPHHHPTVIIIVFVSLGGLCLLGFMAAALFCFVKKRKEKSVEETEIIHIDEHRKIKEA 320
PPPPHHHPTVIIIVFVSLGGLCLLGFMAAALFCFVKKRKEKSVEETEIIHIDEHRKIKEA
Sbjct: 217 PPPPHHHPTVIIIVFVSLGGLCLLGFMAAALFCFVKKRKEKSVEETEIIHIDEHRKIKEA 276
Query: 321 IVEGPHGSCQTVVLSVEDDIHVNEEIIRCEKIGGKRTLHSTNEAGDPSSIEDQPQPPIPS 380
IVEGPHGSCQTVVLSVEDDIHVNEEIIRCEKIGGKRTLHSTNEAGDPSSIEDQPQPPIPS
Sbjct: 277 IVEGPHGSCQTVVLSVEDDIHVNEEIIRCEKIGGKRTLHSTNEAGDPSSIEDQPQPPIPS 336
Query: 381 LTHQKHS 386
LTHQKHS
Sbjct: 337 LTHQKHS 343
BLAST of CSPI06G18120 vs. NCBI nr
Match:
XP_038884025.1 (basic salivary proline-rich protein 2-like [Benincasa hispida])
HSP 1 Score: 443.7 bits (1140), Expect = 1.6e-120
Identity = 317/388 (81.70%), Postives = 335/388 (86.34%), Query Frame = 0
Query: 1 MASINNSEYYYPWFPLPPYHPFQPPPHNPIAPPSPPLFPPHKPISPPSPKPSTPPPPPPP 60
MASINNS+YY+PWFP PPYHPFQPPPH PIAPPSPPLFPPHKPI+PP P+ P PPPP
Sbjct: 1 MASINNSDYYFPWFPTPPYHPFQPPPHKPIAPPSPPLFPPHKPITPPPPR--KPISPPPP 60
Query: 61 PPHKPVTPPPPPPRKPIAP-PPPQKPTAPPPPRKPLVPPPPPQRPTAPPPPRKPIAPPPP 120
P KP T PPPPRKPI P PPP KPTAPPPPRKP+V PPPQ+PTAPPPPRKPI P P
Sbjct: 61 SPQKPAT--PPPPRKPIVPSPPPLKPTAPPPPRKPIV--PPPQKPTAPPPPRKPIV-PSP 120
Query: 121 PPQKPTAPPPPRKPIVPPPPPPQKPTAPPPPRKPIVPPPPPPQKPTAPPPPRKPIGPPPP 180
PPQKP APPPPRKPIV PPPPPQKP APPPPRKPI PP P APPPPRKP+ PPPP
Sbjct: 121 PPQKPIAPPPPRKPIV-PPPPPQKPAAPPPPRKPIGPP------PIAPPPPRKPVRPPPP 180
Query: 181 PQKPTAPPPPRKPIAPPPPKKPIAPPPPRKPIAPPPPKKPNAPPPPPKKPVAPPPPHKPI 240
PQKPT PPPPRKPIAPPPP+KPIAPPPPRKPIAPPPPKKP A PPPP+KP PPPPHKPI
Sbjct: 181 PQKPTTPPPPRKPIAPPPPRKPIAPPPPRKPIAPPPPKKPIA-PPPPRKPFTPPPPHKPI 240
Query: 241 APPPPYKPISPPSPMLPPPPPPPHHHPTVIIIVFVSLGGLCLLGFMAAALFCFVKKRKEK 300
APPPPYKPISPPSPMLPPPPPPPHHHPTVIIIVFVSLGGL LL FMAAALFCFVKKR+EK
Sbjct: 241 APPPPYKPISPPSPMLPPPPPPPHHHPTVIIIVFVSLGGLFLLAFMAAALFCFVKKREEK 300
Query: 301 SVEETEIIHIDEHRKIKEAIVEGPHGSCQTVVLSVEDDIHVNEEIIRCEKIGGKRTLHST 360
+VEETEIIHIDEHRKIKEAIVEGPHGSCQTVVLSVEDDIHV EEI+R EKI G++ LH T
Sbjct: 301 TVEETEIIHIDEHRKIKEAIVEGPHGSCQTVVLSVEDDIHVKEEIMRTEKI-GEKALHRT 360
Query: 361 NEAGDPSSIEDQPQPPIPS--LTHQKHS 386
+EAGDPS+IE+ QPP + HQKHS
Sbjct: 361 HEAGDPSTIEEPVQPPSSTHHNLHQKHS 372
BLAST of CSPI06G18120 vs. NCBI nr
Match:
KAE8647185.1 (hypothetical protein Csa_019083 [Cucumis sativus])
HSP 1 Score: 439.5 bits (1129), Expect = 3.0e-119
Identity = 278/284 (97.89%), Postives = 279/284 (98.24%), Query Frame = 0
Query: 104 PTAPPPPRKPIAPPPPPPQKPTAPPPPRKPIVPPPPPPQKPTAPPPPRKPIVPPPPPPQK 163
PTAPPPPRKPIAPPPPPPQKPTAPPPPRKPIVPPPPPPQKPTAPPPPRKPIVPPPPPPQK
Sbjct: 14 PTAPPPPRKPIAPPPPPPQKPTAPPPPRKPIVPPPPPPQKPTAPPPPRKPIVPPPPPPQK 73
Query: 164 PTAPPPPRKPI-GPPPPPQKPTAPPPPRKPIA-PPPPKKPIAPPPPRKPIAPPPPKKPNA 223
PTAPPPPRKPI PPPPPQKPTAPPPPRKPI PPPP+KP APPPPRKPIAPPPPKKPNA
Sbjct: 74 PTAPPPPRKPIVPPPPPPQKPTAPPPPRKPIGPPPPPQKPTAPPPPRKPIAPPPPKKPNA 133
Query: 224 PPPPPKKPVAPPPPHKPIAPPPPYKPISPPSPMLPPPPPPPHHHPTVIIIVFVSLGGLCL 283
PPPPPKKPVAPPPPHKPIAPPPPYKPISPPSPMLPPPPPPPHHHPTVIIIVFVSLGGLCL
Sbjct: 134 PPPPPKKPVAPPPPHKPIAPPPPYKPISPPSPMLPPPPPPPHHHPTVIIIVFVSLGGLCL 193
Query: 284 LGFMAAALFCFVKKRKEKSVEETEIIHIDEHRKIKEAIVEGPHGSCQTVVLSVEDDIHVN 343
LGFMAAALFCFVKKRKEKSVEETEIIHIDEHRKIKEAIVEGPHGSCQTVVLSVEDDIHVN
Sbjct: 194 LGFMAAALFCFVKKRKEKSVEETEIIHIDEHRKIKEAIVEGPHGSCQTVVLSVEDDIHVN 253
Query: 344 EEIIRCEKIGGKRTLHSTNEAGDPSSIEDQPQPPIPSLTHQKHS 386
EEIIRCEKIGGKRTLHSTNEAGDPSSIEDQPQPPIPSLTHQKHS
Sbjct: 254 EEIIRCEKIGGKRTLHSTNEAGDPSSIEDQPQPPIPSLTHQKHS 297
BLAST of CSPI06G18120 vs. NCBI nr
Match:
XP_022993899.1 (basic salivary proline-rich protein 2-like isoform X1 [Cucurbita maxima])
HSP 1 Score: 373.6 bits (958), Expect = 2.0e-99
Identity = 291/387 (75.19%), Postives = 317/387 (81.91%), Query Frame = 0
Query: 1 MASINNSEYYYPWFPLPPYHPFQPPPHNPIAPPSPPLFPPHKPISPPSPKPSTPPPPPPP 60
M S+N S+ +P+FPLPPYHPF P P+ PP P+ PP+KP+ PP P P PP
Sbjct: 1 MPSLNTSD--HPYFPLPPYHPFWP----PLPPPQNPIAPPYKPVRPPPPPRQ---PTAPP 60
Query: 61 PPHKPVTPPPPPPRKPIA--PPPPQKPTAPPPPRKPLVPPPPPQRPTAPPPPRKPIAPPP 120
PP +P+ PPPPPR+PIA PPPPQKP APPPPRKP+ PP P P PPPPRKPI PPP
Sbjct: 61 PPRQPIV-PPPPPRQPIAPSPPPPQKPNAPPPPRKPIAPPTAP--PPPPPPPRKPIVPPP 120
Query: 121 PPPQKPTAPPPPRKPIVPPPPPPQKPTAPPPPRKPIVPPPPPPQKPTAPPPPRKPIGPPP 180
PP +KPTAPPPPRKPIV P PPP KPTAPPPPRKPI P PPPQKPTAPPPPRKPI PPP
Sbjct: 121 PPSRKPTAPPPPRKPIV-PTPPPHKPTAPPPPRKPIA-PTPPPQKPTAPPPPRKPIVPPP 180
Query: 181 PPQKPTAPPPPRKPIAPPPPKKPIAPPPPRKPIAPPPPKKPNAPPPPPKKPVAPPPPHKP 240
PPQKPTAPPPPRKPI PPPP KP APPPPRKPIAPPPPKKP A PPPP+KP+ PPPPHKP
Sbjct: 181 PPQKPTAPPPPRKPIVPPPPHKPTAPPPPRKPIAPPPPKKPVA-PPPPRKPITPPPPHKP 240
Query: 241 IAPPPPYKPISPPSPMLPPPPPPPHHHPTVIIIVFVSLGGLCLLGFMAAALFCFVKKRKE 300
IAPP PYKPISPPSPMLPPPPPP +HHPTVIIIV VSLGGL LL FMAAA FCFVKKRKE
Sbjct: 241 IAPPTPYKPISPPSPMLPPPPPPSNHHPTVIIIVCVSLGGLFLLAFMAAAFFCFVKKRKE 300
Query: 301 KSVEETEIIHIDEHRKIKEAIVEGPHGSCQTVVLSVEDDIHVNEEIIRCEKIGGKRTLHS 360
K++EETEIIHIDEHRKIKEA V GPHGSC+T+VLSVEDDIH+ EEI+R EK+GGK LH
Sbjct: 301 KTIEETEIIHIDEHRKIKEAKVAGPHGSCETMVLSVEDDIHITEEIVRSEKVGGK-GLHG 360
Query: 361 TNEAGDPSSIEDQPQPPIPSLTHQKHS 386
T+EAGDPSSIE +PQP S +HQ HS
Sbjct: 361 THEAGDPSSIE-EPQP--ASSSHQNHS 368
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
C1PGW1 | 5.9e-09 | 42.57 | Protein TRACHEARY ELEMENT DIFFERENTIATION-RELATED 7A OS=Zinnia violacea OX=34245... | [more] |
Match Name | E-value | Identity | Description | |
A0A0A0KFA4 | 2.9e-152 | 98.08 | Carboxypeptidase OS=Cucumis sativus OX=3659 GN=Csa_6G338160 PE=3 SV=1 | [more] |
A0A5A7USI7 | 7.7e-129 | 83.98 | Basic salivary proline-rich protein 2 OS=Cucumis melo var. makuwa OX=1194695 GN=... | [more] |
A0A6J1K1F3 | 9.8e-100 | 75.19 | basic salivary proline-rich protein 2-like isoform X1 OS=Cucurbita maxima OX=366... | [more] |
A0A6J1FHL1 | 1.4e-98 | 73.66 | leucine-rich repeat extensin-like protein 3 isoform X2 OS=Cucurbita moschata OX=... | [more] |
A0A6J1FLZ2 | 4.1e-98 | 73.23 | formin-like protein 20 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111445428 ... | [more] |
Match Name | E-value | Identity | Description | |
KAA0058883.1 | 1.6e-128 | 83.98 | basic salivary proline-rich protein 2 [Cucumis melo var. makuwa] >TYK23740.1 bas... | [more] |
XP_031742883.1 | 1.2e-123 | 96.09 | basic salivary proline-rich protein 2 [Cucumis sativus] | [more] |
XP_038884025.1 | 1.6e-120 | 81.70 | basic salivary proline-rich protein 2-like [Benincasa hispida] | [more] |
KAE8647185.1 | 3.0e-119 | 97.89 | hypothetical protein Csa_019083 [Cucumis sativus] | [more] |
XP_022993899.1 | 2.0e-99 | 75.19 | basic salivary proline-rich protein 2-like isoform X1 [Cucurbita maxima] | [more] |
Match Name | E-value | Identity | Description | |