Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGAAATGGCCACCAAGATCGAAAACAGAACGGAACGAAGAAGACGAATTATGGGCAGAGAAATTGATCGAATGGCTCTCATCACCGGCCGTTTACGTAATCTCCCTCCTTCCCCACCCCCTTCTCCATCTTCCCCTTCCCCTTTTCTTTACCATCAAACTCACCAACGCGGCCATTCTCACACCGGTATCTCCCCTTCCTTTTTCTCCAAGGACCTCCACACCAATCCTGATTCCCCTCCTCTTCCCAACGCCCAAGGTATGTTCATTCATACTTCTTCTTTTTTCTTTATGCATTAGAGATCCATTCTTTCTGTCTACATTGTATTGCTCAGGTGGATTCAAGCCGTGTTGTATATTTGATGATTATCGAGGCATCAGATTTCATTAGATGAGCTTCATAAAATGAATGCTAACTATGATTGTTAAGATTATATTTACTTTTTAAATTATGATAATTTTCCATCATACATTGTATGAGATATGCTCACAATTGGATGACATTTTTTAGAGAGAACTATATAATAACATAAGTTTCTGAGTCCTTGTATAAATTGCTATATCGAATCGGTTTACTTTGCTTGACATAAGAAGAAGAGGCTTCGTATGATCTCAATGTAGTTGGAAAGAGGTTATTTTATTGCAAGTGTGTGCTAATATTCATAAGGTTACGGTTTGCTATTGATTGTATGTATGTATATTCGAGCATCTATGAAGTTGGGAAGAACAATAATAAGAGATATCTTTTGATAATTGTGGAATTTTGAAAACCAATGTATTAGAATCATTAGTTTCTTCTCTTCGATCAAATCAATAACTATGTAATGTAATGGCTCTATCTTTGGCTATTGGTTCTTGTTTGAATCTTTTCTAACAATGTGTTGGATTATGATTTATGGTTCGTTTTCTTTATCGATGAAACCTGTGAGATGGTTTATGTTGGTTAATTTTTGTGATGAAAAAATCTTTTTTGATCATAAGTGATAACATACGAATGGGGATTATGCATTGATGGCTAAAAATATGTGTTACAGTTATCTTGCATTGATTATGAAACGAGTTGAATGTTGGTATGTAAATGGGAATGTGAGAACAATTTATCACTTTGTTTGGACAATAGATTATATTGAAACCATACAATCTGTTTGACTTTTAGTTTTATAGGAAAGAGTTCAATTTAGTTTCATATATTTGCAGGTATTCCCAAGCCTAAAGATGCAAAAGCTACTCCTTTACTGAAGCGTTTGTCCATGAGTGAAGCTCGAGAAGAAAAAATTGCTGCCACAGGATTCCAAATCAACCACAAAAAACTCGACCCCATAGGAGAAGTACACACTGAAACAGTATCAACTCCATCTGCCTCATCAATGGTTCAAAAAATGACCTCCATTGATAACGAGATACTCTTAAAAACACACCCTTCAAAGCCCAAACTCTTCACTTCAAAACGACTAAATGCCTCCATTTTAGCTTCTCAAACCACACGAGTTGTCTGTTCCCTAATCATTGCATCTTTGGCCGTTCTATCGCACGTCAATCACCCGCTTTCCATAATTTGGAAGATGGTGAGGTCAGAAAGCGTGGTGGCCTCGAAACCCCTCTACATTCTACTGCTTACCGATGCAACCATCGTTGTGGCGAGAATGTTGGCTGCAAGACAGAAAGACAGTGGAGAGGCAGAGGAAGAAAGCGAGAAAATGAAGGAAGATGGACATAATTGGGACTCAGCTGAGAAAGTGTTGGAGAGAGGTTTGGTTTTTTATCAAGCTTTTCGTGCGATATTCATTGATTTTAGTGTTTATGCAGTGGTGGTTATTTGTGGCATCTCTTTGCTGTAG
mRNA sequence
ATGGAAATGGCCACCAAGATCGAAAACAGAACGGAACGAAGAAGACGAATTATGGGCAGAGAAATTGATCGAATGGCTCTCATCACCGGCCGTTTACGTAATCTCCCTCCTTCCCCACCCCCTTCTCCATCTTCCCCTTCCCCTTTTCTTTACCATCAAACTCACCAACGCGGCCATTCTCACACCGGTATCTCCCCTTCCTTTTTCTCCAAGGACCTCCACACCAATCCTGATTCCCCTCCTCTTCCCAACGCCCAAGGTATTCCCAAGCCTAAAGATGCAAAAGCTACTCCTTTACTGAAGCGTTTGTCCATGAGTGAAGCTCGAGAAGAAAAAATTGCTGCCACAGGATTCCAAATCAACCACAAAAAACTCGACCCCATAGGAGAAGTACACACTGAAACAGTATCAACTCCATCTGCCTCATCAATGGTTCAAAAAATGACCTCCATTGATAACGAGATACTCTTAAAAACACACCCTTCAAAGCCCAAACTCTTCACTTCAAAACGACTAAATGCCTCCATTTTAGCTTCTCAAACCACACGAGTTGTCTGTTCCCTAATCATTGCATCTTTGGCCGTTCTATCGCACGTCAATCACCCGCTTTCCATAATTTGGAAGATGGTGAGGTCAGAAAGCGTGGTGGCCTCGAAACCCCTCTACATTCTACTGCTTACCGATGCAACCATCGTTGTGGCGAGAATGTTGGCTGCAAGACAGAAAGACAGTGGAGAGGCAGAGGAAGAAAGCGAGAAAATGAAGGAAGATGGACATAATTGGGACTCAGCTGAGAAAGTGTTGGAGAGAGGTTTGGTTTTTTATCAAGCTTTTCGTGCGATATTCATTGATTTTAGTGTTTATGCAGTGGTGGTTATTTGTGGCATCTCTTTGCTGTAG
Coding sequence (CDS)
ATGGAAATGGCCACCAAGATCGAAAACAGAACGGAACGAAGAAGACGAATTATGGGCAGAGAAATTGATCGAATGGCTCTCATCACCGGCCGTTTACGTAATCTCCCTCCTTCCCCACCCCCTTCTCCATCTTCCCCTTCCCCTTTTCTTTACCATCAAACTCACCAACGCGGCCATTCTCACACCGGTATCTCCCCTTCCTTTTTCTCCAAGGACCTCCACACCAATCCTGATTCCCCTCCTCTTCCCAACGCCCAAGGTATTCCCAAGCCTAAAGATGCAAAAGCTACTCCTTTACTGAAGCGTTTGTCCATGAGTGAAGCTCGAGAAGAAAAAATTGCTGCCACAGGATTCCAAATCAACCACAAAAAACTCGACCCCATAGGAGAAGTACACACTGAAACAGTATCAACTCCATCTGCCTCATCAATGGTTCAAAAAATGACCTCCATTGATAACGAGATACTCTTAAAAACACACCCTTCAAAGCCCAAACTCTTCACTTCAAAACGACTAAATGCCTCCATTTTAGCTTCTCAAACCACACGAGTTGTCTGTTCCCTAATCATTGCATCTTTGGCCGTTCTATCGCACGTCAATCACCCGCTTTCCATAATTTGGAAGATGGTGAGGTCAGAAAGCGTGGTGGCCTCGAAACCCCTCTACATTCTACTGCTTACCGATGCAACCATCGTTGTGGCGAGAATGTTGGCTGCAAGACAGAAAGACAGTGGAGAGGCAGAGGAAGAAAGCGAGAAAATGAAGGAAGATGGACATAATTGGGACTCAGCTGAGAAAGTGTTGGAGAGAGGTTTGGTTTTTTATCAAGCTTTTCGTGCGATATTCATTGATTTTAGTGTTTATGCAGTGGTGGTTATTTGTGGCATCTCTTTGCTGTAG
Protein sequence
MEMATKIENRTERRRRIMGREIDRMALITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHSHTGISPSFFSKDLHTNPDSPPLPNAQGIPKPKDAKATPLLKRLSMSEAREEKIAATGFQINHKKLDPIGEVHTETVSTPSASSMVQKMTSIDNEILLKTHPSKPKLFTSKRLNASILASQTTRVVCSLIIASLAVLSHVNHPLSIIWKMVRSESVVASKPLYILLLTDATIVVARMLAARQKDSGEAEEESEKMKEDGHNWDSAEKVLERGLVFYQAFRAIFIDFSVYAVVVICGISLL*
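The protein sequence above is expected to follow from the CDS by translation with the standard genetic code. The block below is a minimal sketch of that consistency check, not the pipeline the database used; the function name `translate` and the compact way the codon table is built are illustrative assumptions.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; stop codons become '*', unknown codons 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# The first 18 bases and the final 6 bases of the CDS listed above.
print(translate("ATGGAAATGGCCACCAAG"))
print(translate("CTGTAG"))
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence is a quick way to confirm that the two records agree.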
Homology
BLAST of Chy4G078640 vs. ExPASy TrEMBL
Match:
A0A0A0L815 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G481250 PE=4 SV=1)
HSP 1 Score: 538.5 bits (1386), Expect = 1.8e-149
Identity = 284/299 (94.98%), Positives = 290/299 (96.99%), Query Frame = 0
Query: 1 MEMATKIENRTERRRRIMGREIDRMALITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS 60
MEMATKIENRT+RRRRIM REIDRMALITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS
Sbjct: 1 MEMATKIENRTDRRRRIMSREIDRMALITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS 60
Query: 61 HTGISPSFFSKDLHTNPDSPPLPNAQGIPKPKDAKATPLLKRLSMSEAREEKIAATGFQI 120
HTGISPSFFSKD+H NPDSPPLPNAQG+PKPKDAKATPLLKRLSMSEAREEKIAA GFQI
Sbjct: 61 HTGISPSFFSKDIHANPDSPPLPNAQGVPKPKDAKATPLLKRLSMSEAREEKIAAIGFQI 120
Query: 121 NHKKLDPIGEVHTETVSTPSASSMVQKMTSIDNEILLKTHPSKPKLFTSKRLNASILASQ 180
NHKKLDPIGE+HTETVSTPSASSMVQK+TS DNEILLK HPSKPKLFTSKRLNASILASQ
Sbjct: 121 NHKKLDPIGEIHTETVSTPSASSMVQKVTSTDNEILLKAHPSKPKLFTSKRLNASILASQ 180
Query: 181 TTRVVCSLIIASLAVLSHVNHPLSIIWKMVRSESVVASKPLYILLLTDATIVVARMLAAR 240
TTRV CSLIIASLAVLSHVNHPLS+IWKMVRSE VVASKPLYILLLTDATIVVARMLAAR
Sbjct: 181 TTRVFCSLIIASLAVLSHVNHPLSMIWKMVRSERVVASKPLYILLLTDATIVVARMLAAR 240
Query: 241 QKDSGEAEEESEKMKEDGHNWDSAEKVLERGLVFYQAFRAIFIDFSVYAVVVICGISLL 300
QKDS EAEEESEKMKEDGHNWDSA KVLERGLVFYQAFRAIFIDFSVYAVVVICGISLL
Sbjct: 241 QKDSREAEEESEKMKEDGHNWDSAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGISLL 299
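The Identity figure in an HSP header is the number of aligned columns in which query and subject carry the same residue, divided by the alignment length. The block below is a minimal sketch of that arithmetic, assuming the two aligned strings have equal length and mark gaps with `-`; the helper names `identity_counts` and `percent_identity` are hypothetical and are not part of BLAST.

```python
from typing import Tuple


def identity_counts(query: str, subject: str) -> Tuple[int, int]:
    """Return (identical columns, alignment length) for two aligned strings."""
    if len(query) != len(subject):
        raise ValueError("aligned strings must have the same length")
    # A column counts as identical only when both residues match and
    # neither side is a gap character.
    matches = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    return matches, len(query)


def percent_identity(matches: int, length: int) -> float:
    """Percentage of identical columns, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


# First row of the alignment above (query and subject residues 1-60).
query_row = "MEMATKIENRTERRRRIMGREIDRMALITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS"
sbjct_row = "MEMATKIENRTDRRRRIMSREIDRMALITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS"
print(identity_counts(query_row, sbjct_row))

# Counts taken from the HSP header above: 284 identical columns out of 299.
print(percent_identity(284, 299))
```

Summing the per-row counts over all rows of an HSP gives the numerator and denominator reported in its Identity field.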
BLAST of Chy4G078640 vs. ExPASy TrEMBL
Match:
A0A1S3BH05 (uncharacterized protein LOC103489936 OS=Cucumis melo OX=3656 GN=LOC103489936 PE=4 SV=1)
HSP 1 Score: 505.4 bits (1300), Expect = 1.7e-139
Identity = 271/301 (90.03%), Positives = 281/301 (93.36%), Query Frame = 0
Query: 1 MEMATKIENRTERRRRIMGREIDRMALITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS 60
MEM TK +NRTERRRRI+ RE+DRMALITGRL NLPPSPPPSPSSPSPFL+HQTHQRGHS
Sbjct: 1 MEMPTKTDNRTERRRRIISREMDRMALITGRLPNLPPSPPPSPSSPSPFLFHQTHQRGHS 60
Query: 61 HTGISPSFFSKDL--HTNPDSPPLPNAQGIPKPKDAKATPLLKRLSMSEAREEKIAATGF 120
HTGISPSFFSKDL H NPDS P PNAQGIPKPKDAKATPLLKRLSMSEAREEKIAA GF
Sbjct: 61 HTGISPSFFSKDLHNHNNPDSLPFPNAQGIPKPKDAKATPLLKRLSMSEAREEKIAAIGF 120
Query: 121 QINHKKLDPIGEVHTETVSTPSASSMVQKMTSIDNEILLKTHPSKPKLFTSKRLNASILA 180
Q NHKKLDPIGEVHTETVSTPSASSMVQK+TSID++ILLKTHPSKPKLFTSKR+NASILA
Sbjct: 121 QFNHKKLDPIGEVHTETVSTPSASSMVQKITSIDDKILLKTHPSKPKLFTSKRINASILA 180
Query: 181 SQTTRVVCSLIIASLAVLSHVNHPLSIIWKMVRSESVVASKPLYILLLTDATIVVARMLA 240
SQTTRV CSLIIASL+VLSHVNHPLSIIW MVRSESVVASKPLYILLLTDATIV+ARMLA
Sbjct: 181 SQTTRVFCSLIIASLSVLSHVNHPLSIIWNMVRSESVVASKPLYILLLTDATIVLARMLA 240
Query: 241 ARQKDSGEAEEESEKMKEDGHNWDSAEKVLERGLVFYQAFRAIFIDFSVYAVVVICGISL 300
RQKD G AEEE EKMKEDG NWDSA KVLERGLVFYQAFRAIFIDFSVYAVVVICGI L
Sbjct: 241 ERQKDGGVAEEEIEKMKEDGRNWDSAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGICL 300
BLAST of Chy4G078640 vs. ExPASy TrEMBL
Match:
A0A5A7U8R0 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold323G001130 PE=4 SV=1)
HSP 1 Score: 477.2 bits (1227), Expect = 4.9e-131
Identity = 271/363 (74.66%), Positives = 281/363 (77.41%), Query Frame = 0
Query: 1 MEMATKIENRTERRRRIMGREIDRMALITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS 60
MEM TK +NRTERRRRI+ RE+DRMALITGRL NLPPSPPPSPSSPSPFL+HQTHQRGHS
Sbjct: 1 MEMPTKTDNRTERRRRIISREMDRMALITGRLPNLPPSPPPSPSSPSPFLFHQTHQRGHS 60
Query: 61 HTGISPSFFSKDL--HTNPDSPPLPNAQ-------------------------------- 120
HTGISPSFFSKDL H NPDS P PNAQ
Sbjct: 61 HTGISPSFFSKDLHNHNNPDSLPFPNAQGIFIHSFIHSFFFFFFFFILITFLSFSDIYSI 120
Query: 121 ------------------------------GIPKPKDAKATPLLKRLSMSEAREEKIAAT 180
GIPKPKDAKATPLLKRLSMSEAREEKIAA
Sbjct: 121 LSSFSLSRKSVRWFVLVNFFDEKIFFLIISGIPKPKDAKATPLLKRLSMSEAREEKIAAI 180
Query: 181 GFQINHKKLDPIGEVHTETVSTPSASSMVQKMTSIDNEILLKTHPSKPKLFTSKRLNASI 240
GFQ NHKKLDPIGEVHTETVSTPSASSMVQK+TSID++ILLKTHPSKPKLFTSKR+NASI
Sbjct: 181 GFQFNHKKLDPIGEVHTETVSTPSASSMVQKITSIDDKILLKTHPSKPKLFTSKRINASI 240
Query: 241 LASQTTRVVCSLIIASLAVLSHVNHPLSIIWKMVRSESVVASKPLYILLLTDATIVVARM 300
LASQTTRV CSLIIASL+VLSHVNHPLSIIW MVRSESVVASKPLYILLLTDATIV+ARM
Sbjct: 241 LASQTTRVFCSLIIASLSVLSHVNHPLSIIWNMVRSESVVASKPLYILLLTDATIVLARM 300
BLAST of Chy4G078640 vs. ExPASy TrEMBL
Match:
A0A6J1J0I6 (uncharacterized protein LOC111480128 OS=Cucurbita maxima OX=3661 GN=LOC111480128 PE=4 SV=1)
HSP 1 Score: 393.7 bits (1010), Expect = 7.1e-106
Identity = 222/299 (74.25%), Positives = 244/299 (81.61%), Query Frame = 0
Query: 1 MEMATKIENRTERRRRIMGREIDRMALITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS 60
MEMATK ++R ERR++I +E DRMALITGRLR LPPSPPPSPSSPSPFL +Q HQRGHS
Sbjct: 1 MEMATKTDSRAERRQKIRSKETDRMALITGRLRTLPPSPPPSPSSPSPFLQYQIHQRGHS 60
Query: 61 HTGISPSFFSKDLHTNPDSPPLPNAQGIPKPKDAKATPLLKRLSMSEAREEKIAATGFQI 120
HTGISPSF SK+L NPDS PL IPK KD A PL K + ++E +EEKI ATGFQI
Sbjct: 61 HTGISPSFLSKELQKNPDSLPLRPVHAIPKLKDGTAVPLPKHMPINEVQEEKITATGFQI 120
Query: 121 NHKKLDPIGEVHTETVSTPSASSMVQKMTSIDNEILLKTHPSKPKLFTSKRLNASILASQ 180
N KK+DPIGEV E +S PSA MVQK T I NE L K PSKP++FTSKRLN SILASQ
Sbjct: 121 NDKKIDPIGEVCKEMIS-PSALPMVQKAT-IVNEPLSKPQPSKPRIFTSKRLNVSILASQ 180
Query: 181 TTRVVCSLIIASLAVLSHVNHPLSIIWKMVRSESVVASKPLYILLLTDATIVVARMLAAR 240
T RV CSLIIASLA+LSHV+HPL I +V SESV+ASKPLYILLLT+ TIVVARMLA R
Sbjct: 181 TKRVFCSLIIASLAILSHVDHPLFTIRNIVSSESVMASKPLYILLLTNVTIVVARMLADR 240
Query: 241 QKDSGEAEEESEKMKEDGHNWDSAEKVLERGLVFYQAFRAIFIDFSVYAVVVICGISLL 300
QK GEAEEE EKMKEDG NW+SA KVLERGLVFYQAFRAIFIDFSVYAVVVICG+SLL
Sbjct: 241 QKHGGEAEEECEKMKEDGQNWESAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGLSLL 297
BLAST of Chy4G078640 vs. ExPASy TrEMBL
Match:
A0A6J1KF67 (uncharacterized protein LOC111493320 OS=Cucurbita maxima OX=3661 GN=LOC111493320 PE=4 SV=1)
HSP 1 Score: 391.7 bits (1005), Expect = 2.7e-105
Identity = 221/299 (73.91%), Positives = 245/299 (81.94%), Query Frame = 0
Query: 1 MEMATKIENRTERRRRIMGREIDRMALITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS 60
MEMATK + R ERRRRI RE DRMALITGRLRNLPPSPPPSPSSP PF +H THQRGHS
Sbjct: 1 MEMATKTDVRAERRRRISSREGDRMALITGRLRNLPPSPPPSPSSP-PFFHHYTHQRGHS 60
Query: 61 HTGISPSFFSKDLHTNPDSPPLPNAQGIPKPKDAKATPLLKRLSMSEAREEKIAATGFQI 120
HTGI+PSFF+KD H NPDS PLP + KPKD KA PLLK +S++E AA +Q
Sbjct: 61 HTGINPSFFAKDTHKNPDSGPLPQNHDVSKPKDEKAPPLLKHISINEVHNN--AAIEYQF 120
Query: 121 NHKKLDPIGEVHTETVSTPSASSMVQKMTSIDNEILLKTHPSKPKLFTSKRLNASILASQ 180
N KKLDPIGE TE + +PS+ +MVQK IDNE L KT PSKP+L TSKRLNASILASQ
Sbjct: 121 NPKKLDPIGEGSTELILSPSSVTMVQK-ACIDNEPLPKTKPSKPRLITSKRLNASILASQ 180
Query: 181 TTRVVCSLIIASLAVLSHVNHPLSIIWKMVRSESVVASKPLYILLLTDATIVVARMLAAR 240
TTRV CSLIIASLA+LS V+ PL+II VRSE+V+ASKPLYILLLT+ATIVVARMLA +
Sbjct: 181 TTRVFCSLIIASLAILSQVDIPLTIIRNTVRSETVMASKPLYILLLTNATIVVARMLAEK 240
Query: 241 QKDSGEAEEESEKMKEDGHNWDSAEKVLERGLVFYQAFRAIFIDFSVYAVVVICGISLL 300
QKD GEAEEE EKMKED NWDSA KVLERGLVFYQAFRA+FIDFSVYAVVVICG+S+L
Sbjct: 241 QKDRGEAEEECEKMKEDAQNWDSAVKVLERGLVFYQAFRAVFIDFSVYAVVVICGLSVL 295
BLAST of Chy4G078640 vs. NCBI nr
Match:
XP_004150355.1 (uncharacterized protein LOC101203675 [Cucumis sativus] >KGN58065.1 hypothetical protein Csa_023413 [Cucumis sativus])
HSP 1 Score: 539 bits (1388), Expect = 3.59e-192
Identity = 284/299 (94.98%), Positives = 290/299 (96.99%), Query Frame = 0
Query: 1 MEMATKIENRTERRRRIMGREIDRMALITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS 60
MEMATKIENRT+RRRRIM REIDRMALITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS
Sbjct: 1 MEMATKIENRTDRRRRIMSREIDRMALITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS 60
Query: 61 HTGISPSFFSKDLHTNPDSPPLPNAQGIPKPKDAKATPLLKRLSMSEAREEKIAATGFQI 120
HTGISPSFFSKD+H NPDSPPLPNAQG+PKPKDAKATPLLKRLSMSEAREEKIAA GFQI
Sbjct: 61 HTGISPSFFSKDIHANPDSPPLPNAQGVPKPKDAKATPLLKRLSMSEAREEKIAAIGFQI 120
Query: 121 NHKKLDPIGEVHTETVSTPSASSMVQKMTSIDNEILLKTHPSKPKLFTSKRLNASILASQ 180
NHKKLDPIGE+HTETVSTPSASSMVQK+TS DNEILLK HPSKPKLFTSKRLNASILASQ
Sbjct: 121 NHKKLDPIGEIHTETVSTPSASSMVQKVTSTDNEILLKAHPSKPKLFTSKRLNASILASQ 180
Query: 181 TTRVVCSLIIASLAVLSHVNHPLSIIWKMVRSESVVASKPLYILLLTDATIVVARMLAAR 240
TTRV CSLIIASLAVLSHVNHPLS+IWKMVRSE VVASKPLYILLLTDATIVVARMLAAR
Sbjct: 181 TTRVFCSLIIASLAVLSHVNHPLSMIWKMVRSERVVASKPLYILLLTDATIVVARMLAAR 240
Query: 241 QKDSGEAEEESEKMKEDGHNWDSAEKVLERGLVFYQAFRAIFIDFSVYAVVVICGISLL 299
QKDS EAEEESEKMKEDGHNWDSA KVLERGLVFYQAFRAIFIDFSVYAVVVICGISLL
Sbjct: 241 QKDSREAEEESEKMKEDGHNWDSAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGISLL 299
BLAST of Chy4G078640 vs. NCBI nr
Match:
XP_008447503.1 (PREDICTED: uncharacterized protein LOC103489936 [Cucumis melo])
HSP 1 Score: 506 bits (1302), Expect = 4.89e-179
Identity = 271/301 (90.03%), Positives = 281/301 (93.36%), Query Frame = 0
Query: 1 MEMATKIENRTERRRRIMGREIDRMALITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS 60
MEM TK +NRTERRRRI+ RE+DRMALITGRL NLPPSPPPSPSSPSPFL+HQTHQRGHS
Sbjct: 1 MEMPTKTDNRTERRRRIISREMDRMALITGRLPNLPPSPPPSPSSPSPFLFHQTHQRGHS 60
Query: 61 HTGISPSFFSKDLHT--NPDSPPLPNAQGIPKPKDAKATPLLKRLSMSEAREEKIAATGF 120
HTGISPSFFSKDLH NPDS P PNAQGIPKPKDAKATPLLKRLSMSEAREEKIAA GF
Sbjct: 61 HTGISPSFFSKDLHNHNNPDSLPFPNAQGIPKPKDAKATPLLKRLSMSEAREEKIAAIGF 120
Query: 121 QINHKKLDPIGEVHTETVSTPSASSMVQKMTSIDNEILLKTHPSKPKLFTSKRLNASILA 180
Q NHKKLDPIGEVHTETVSTPSASSMVQK+TSID++ILLKTHPSKPKLFTSKR+NASILA
Sbjct: 121 QFNHKKLDPIGEVHTETVSTPSASSMVQKITSIDDKILLKTHPSKPKLFTSKRINASILA 180
Query: 181 SQTTRVVCSLIIASLAVLSHVNHPLSIIWKMVRSESVVASKPLYILLLTDATIVVARMLA 240
SQTTRV CSLIIASL+VLSHVNHPLSIIW MVRSESVVASKPLYILLLTDATIV+ARMLA
Sbjct: 181 SQTTRVFCSLIIASLSVLSHVNHPLSIIWNMVRSESVVASKPLYILLLTDATIVLARMLA 240
Query: 241 ARQKDSGEAEEESEKMKEDGHNWDSAEKVLERGLVFYQAFRAIFIDFSVYAVVVICGISL 299
RQKD G AEEE EKMKEDG NWDSA KVLERGLVFYQAFRAIFIDFSVYAVVVICGI L
Sbjct: 241 ERQKDGGVAEEEIEKMKEDGRNWDSAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGICL 300
BLAST of Chy4G078640 vs. NCBI nr
Match:
KAA0050777.1 (uncharacterized protein E6C27_scaffold404G00270 [Cucumis melo var. makuwa] >TYK08569.1 uncharacterized protein E5676_scaffold323G001130 [Cucumis melo var. makuwa])
HSP 1 Score: 478 bits (1229), Expect = 5.94e-167
Identity = 271/363 (74.66%), Positives = 281/363 (77.41%), Query Frame = 0
Query: 1 MEMATKIENRTERRRRIMGREIDRMALITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS 60
MEM TK +NRTERRRRI+ RE+DRMALITGRL NLPPSPPPSPSSPSPFL+HQTHQRGHS
Sbjct: 1 MEMPTKTDNRTERRRRIISREMDRMALITGRLPNLPPSPPPSPSSPSPFLFHQTHQRGHS 60
Query: 61 HTGISPSFFSKDLHT--NPDSPPLPNAQGI------------------------------ 120
HTGISPSFFSKDLH NPDS P PNAQGI
Sbjct: 61 HTGISPSFFSKDLHNHNNPDSLPFPNAQGIFIHSFIHSFFFFFFFFILITFLSFSDIYSI 120
Query: 121 --------------------------------PKPKDAKATPLLKRLSMSEAREEKIAAT 180
PKPKDAKATPLLKRLSMSEAREEKIAA
Sbjct: 121 LSSFSLSRKSVRWFVLVNFFDEKIFFLIISGIPKPKDAKATPLLKRLSMSEAREEKIAAI 180
Query: 181 GFQINHKKLDPIGEVHTETVSTPSASSMVQKMTSIDNEILLKTHPSKPKLFTSKRLNASI 240
GFQ NHKKLDPIGEVHTETVSTPSASSMVQK+TSID++ILLKTHPSKPKLFTSKR+NASI
Sbjct: 181 GFQFNHKKLDPIGEVHTETVSTPSASSMVQKITSIDDKILLKTHPSKPKLFTSKRINASI 240
Query: 241 LASQTTRVVCSLIIASLAVLSHVNHPLSIIWKMVRSESVVASKPLYILLLTDATIVVARM 299
LASQTTRV CSLIIASL+VLSHVNHPLSIIW MVRSESVVASKPLYILLLTDATIV+ARM
Sbjct: 241 LASQTTRVFCSLIIASLSVLSHVNHPLSIIWNMVRSESVVASKPLYILLLTDATIVLARM 300
BLAST of Chy4G078640 vs. NCBI nr
Match:
XP_038890203.1 (uncharacterized protein LOC120079844 [Benincasa hispida])
HSP 1 Score: 459 bits (1180), Expect = 1.67e-160
Identity = 247/299 (82.61%), Positives = 269/299 (89.97%), Query Frame = 0
Query: 1 MEMATKIENRTERRRRIMGREIDRMALITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS 60
MEMATK ++R ERRRRI+ RE+DRMALITGRLRNLPPSPPPSPSSPSPFL+HQTHQRG+S
Sbjct: 1 MEMATKTDSRKERRRRIVSREVDRMALITGRLRNLPPSPPPSPSSPSPFLFHQTHQRGYS 60
Query: 61 HTGISPSFFSKDLHTNPDSPPLPNAQGIPKPKDAKATPLLKRLSMSEAREEKIAATGFQI 120
HTGISPSFFSK+LH NPDS PL + IPKP+D KATPLLK +SM E +EEKI+A G+Q+
Sbjct: 61 HTGISPSFFSKELHKNPDSIPLSHIHAIPKPEDGKATPLLKHMSMKEVQEEKISAIGYQM 120
Query: 121 NHKKLDPIGEVHTETVSTPSASSMVQKMTSIDNEILLKTHPSKPKLFTSKRLNASILASQ 180
+HKKLDPIGEVHTE VSTPSA SMVQK+ SIDNE KT PSKPKLFTSKRLNA ILASQ
Sbjct: 121 SHKKLDPIGEVHTEIVSTPSALSMVQKV-SIDNETRSKTQPSKPKLFTSKRLNACILASQ 180
Query: 181 TTRVVCSLIIASLAVLSHVNHPLSIIWKMVRSESVVASKPLYILLLTDATIVVARMLAAR 240
TTRV CSLI+ASLA+LS V+HPL II +VRSESVVASKPLYILLLT+ATIVVARMLA +
Sbjct: 181 TTRVFCSLILASLAILSQVDHPLFIIRNIVRSESVVASKPLYILLLTNATIVVARMLAEK 240
Query: 241 QKDSGEAEEESEKMKEDGHNWDSAEKVLERGLVFYQAFRAIFIDFSVYAVVVICGISLL 299
QKDSGEAEEE EKMKEDGHNWDSA KVLERGLVFYQAFRAIFIDFSVYAVVVICGISLL
Sbjct: 241 QKDSGEAEEELEKMKEDGHNWDSAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGISLL 298
BLAST of Chy4G078640 vs. NCBI nr
Match:
XP_022980863.1 (uncharacterized protein LOC111480128 [Cucurbita maxima])
HSP 1 Score: 394 bits (1012), Expect = 5.87e-135
Identity = 221/299 (73.91%), Positives = 244/299 (81.61%), Query Frame = 0
Query: 1 MEMATKIENRTERRRRIMGREIDRMALITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHS 60
MEMATK ++R ERR++I +E DRMALITGRLR LPPSPPPSPSSPSPFL +Q HQRGHS
Sbjct: 1 MEMATKTDSRAERRQKIRSKETDRMALITGRLRTLPPSPPPSPSSPSPFLQYQIHQRGHS 60
Query: 61 HTGISPSFFSKDLHTNPDSPPLPNAQGIPKPKDAKATPLLKRLSMSEAREEKIAATGFQI 120
HTGISPSF SK+L NPDS PL IPK KD A PL K + ++E +EEKI ATGFQI
Sbjct: 61 HTGISPSFLSKELQKNPDSLPLRPVHAIPKLKDGTAVPLPKHMPINEVQEEKITATGFQI 120
Query: 121 NHKKLDPIGEVHTETVSTPSASSMVQKMTSIDNEILLKTHPSKPKLFTSKRLNASILASQ 180
N KK+DPIGEV E +S PSA MVQK T + NE L K PSKP++FTSKRLN SILASQ
Sbjct: 121 NDKKIDPIGEVCKEMIS-PSALPMVQKATIV-NEPLSKPQPSKPRIFTSKRLNVSILASQ 180
Query: 181 TTRVVCSLIIASLAVLSHVNHPLSIIWKMVRSESVVASKPLYILLLTDATIVVARMLAAR 240
T RV CSLIIASLA+LSHV+HPL I +V SESV+ASKPLYILLLT+ TIVVARMLA R
Sbjct: 181 TKRVFCSLIIASLAILSHVDHPLFTIRNIVSSESVMASKPLYILLLTNVTIVVARMLADR 240
Query: 241 QKDSGEAEEESEKMKEDGHNWDSAEKVLERGLVFYQAFRAIFIDFSVYAVVVICGISLL 299
QK GEAEEE EKMKEDG NW+SA KVLERGLVFYQAFRAIFIDFSVYAVVVICG+SLL
Sbjct: 241 QKHGGEAEEECEKMKEDGQNWESAVKVLERGLVFYQAFRAIFIDFSVYAVVVICGLSLL 297
BLAST of Chy4G078640 vs. TAIR 10
Match:
AT1G52343.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G32680.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 99.0 bits (245), Expect = 6.9e-21
Identity = 104/298 (34.90%), Positives = 147/298 (49.33%), Query Frame = 0
Query: 7 IENRTERRRRIMGREIDRMALITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHSHTGISP 66
+ +R ERRRRIM R DR+ALITG+L NL PS P S SS S +H R +S + +
Sbjct: 2 VMDREERRRRIMERGSDRLALITGQLHNLDPSSPSSSSSSS-----ASHNRTYSESFMPQ 61
Query: 67 SFFSKDLHTNPDSPPLPNAQGIPKPKDAKATPLLKRLSMSEAREEKIAATGFQINHKKLD 126
+ D H +SP L K + + E K++ + HK L
Sbjct: 62 T--KSDHHQILESPSLKYQ--------------FKEEVKARSEEPKLST----VLHKPLK 121
Query: 127 PIGEVHTETVSTPSASSMVQKMTSIDNEILLKTHPSKPKLFTSKRLNASILASQTTRVVC 186
E + +T S S Q+ F+SK+LNASI++S+ TR +
Sbjct: 122 I--EPTKQEEATRSQKSQNQRPIC---------------FFSSKKLNASIISSERTRSLS 181
Query: 187 SLIIASLAV-LSHVNHPLSIIWKMVRSESVVASKPLYILLLTDATIVVARMLAARQKDSG 246
SL IA+ V L +N + S +++A +PL++L+LTD IV++ L G
Sbjct: 182 SLTIAAFVVLLPRLN--------ITSSNTILALRPLWLLILTDCAIVMSH-LTTEASGGG 241
Query: 247 EAEEESEKMK----EDGHNWDSAEKVLERGLVFYQAFRAIFIDFSVY-AVVVICGISL 299
+ E E K +G NW AE++LERG+V YQA R +FID S+Y VVVI G SL
Sbjct: 242 LSHEMEEDGKGRDGNNGENWSDAERLLERGVVVYQALRGMFIDCSLYMVVVVIFGASL 248
BLAST of Chy4G078640 vs. TAIR 10
Match:
AT4G32680.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G52343.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 72.0 bits (175), Expect = 9.1e-13
Identity = 83/302 (27.48%), Positives = 135/302 (44.70%), Query Frame = 0
Query: 9 NRTERRRRIMGREIDRMALITGRLRNLPPSPPPSPSSPSPFLYHQTHQRGHSHTGISPSF 68
+R RRR+I+ R DR+A ITG++ + PSPPPS S + S
Sbjct: 5 SREARRRKILDRGSDRLAFITGQINGV-PSPPPSDS--------------------TSSL 64
Query: 69 FSKDLHTNPDSP-PLPNAQGIPKPKDAKATPLLKRLS-----------MSEAREEKIAAT 128
DL T+ P +P I K ++ T +S + ++REE +
Sbjct: 65 SQSDLQTDQSLPDTIPPRDQILKAQEIAFTSHQDNISDAAMLENVDHIIHQSREEPLQPQ 124
Query: 129 GFQINHKKLDPIGEVHTETVSTPSASSMVQKMTSIDNEILLKTHP--SKPKLFTSKRLNA 188
+ T T+ P +S VQ + +D P S T K + A
Sbjct: 125 RHAETLAEASASDPRDTTTIQPPPTTSSVQNPSVVDLGASQAFIPVVSFVNAITPKHIGA 184
Query: 189 SILASQTTRVVCSLIIASLAVLSHVNHPLSIIWKMVRSESVVASKPLYILLLTDATIVVA 248
+I AS+ R+ +L IA + +LSH+ ++V+ +P+++L+LTDATIV+
Sbjct: 185 AIDASEYARMFTALAIALVVILSHLG--------FSSLGNIVSFRPVFLLVLTDATIVLG 244
Query: 249 RMLAARQKDSGEAEEESEKMKEDGHNWDSAEKVLERGLVFYQAFRAIFIDFSVYAVVVIC 297
R+L + + DS A S + D LE ++ + A+ +DFS+YAV++IC
Sbjct: 245 RVLLSHRGDSSSA---SGTVMSGQGIVDQVGNALETVMMVKKIMDALLMDFSLYAVILIC 274
The following BLAST results are available for this feature:
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A0A0L815 | 1.8e-149 | 94.98 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G481250 PE=4 SV=1
A0A1S3BH05 | 1.7e-139 | 90.03 | uncharacterized protein LOC103489936 OS=Cucumis melo OX=3656 GN=LOC103489936 PE=...
A0A5A7U8R0 | 4.9e-131 | 74.66 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A6J1J0I6 | 7.1e-106 | 74.25 | uncharacterized protein LOC111480128 OS=Cucurbita maxima OX=3661 GN=LOC111480128...
A0A6J1KF67 | 2.7e-105 | 73.91 | uncharacterized protein LOC111493320 OS=Cucurbita maxima OX=3661 GN=LOC111493320...
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_004150355.1 | 3.59e-192 | 94.98 | uncharacterized protein LOC101203675 [Cucumis sativus] >KGN58065.1 hypothetical ...
XP_008447503.1 | 4.89e-179 | 90.03 | PREDICTED: uncharacterized protein LOC103489936 [Cucumis melo]
KAA0050777.1 | 5.94e-167 | 74.66 | uncharacterized protein E6C27_scaffold404G00270 [Cucumis melo var. makuwa] >TYK0...
XP_038890203.1 | 1.67e-160 | 82.61 | uncharacterized protein LOC120079844 [Benincasa hispida]
XP_022980863.1 | 5.87e-135 | 73.91 | uncharacterized protein LOC111480128 [Cucurbita maxima]
TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G52343.1 | 6.9e-21 | 34.90 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G32680.1 | 9.1e-13 | 27.48 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...