Cla97C08G153990 (gene) Watermelon (97103) v2.5

Overview
NameCla97C08G153990
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionDUF4057 domain-containing protein
LocationCla97Chr08: 22152742 .. 22155997 (-)
RNA-Seq ExpressionCla97C08G153990
SyntenyCla97C08G153990
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAGAAAAGAAAAAAGAAAAAAGAAACTTCCAGATCCCGAGGGCATTTCGGTCATTTGAGTTGACAACATCTCTCTGCCTCCACGGCACATTCGCACTGTATCTCTCGCTTCCTTCTTCTTCTTCCTTCCTTCGTCTTCTCCAGAGTAAAGAAGAAGCCATTAACAAAATACCAAATCAAAGCTCCAGAGTGCTTTCCAAGCAAACCCTAACCCTAACTCTTAACTCCCATTTCTTCATTTCTTGTGTCTTCCTCTCTTCGCCATCTCCCAAACAATCTCCCAATTCCTTTCTTCTCCAATGGACAGAACCACTCCTGTTCGGAAGCCTCACACTTCCACTGCAGATCTCCTCACTTGGCCTGAACTTCCTCCTGCCGATTCCCCTGCTCTTCCTTCTTCTGCTTCTCGCTCTGCTCCCAGGTCTCATCAGGTACTTTCTACATTTCTATCTTTTTCTTTCTTTTCTTCTTCTTCTCTTCTTCTTCTTCTGCTTATTCCCTTTCTTTTCCCTCTCTTTAGCCCTCCGATGGGATTAGTAAGGTCGTCTTTGGAGGCCAGGTTACGGACGAGGAGGTGGAGAGCTTGAACAAAAGGTGAGATCTAACTCTGCTTCTCTTACCTCGAATCTTTGATTATTTCATTCCATTCTACTTCTTGTTCTTCTGGGAAATTTCTGCTTTGTTTCTTTTGGTGCTTACTGAATTCCATATAAAGAGAATTTTGTTATCATTCTCTTTGAGTTTTGTAGCCTTTGTTTTCTGTTCAATTTTATGCCACCGATCTGCTTGTTTTGTCTGTTAATGTCGCCAAGTAATTCAAATTTCACATCTCTCTCTTGTTTAAATGGCTCGTGATTGGATCTCAATTTTTCAATCCTTTCTGTTTTCCATACATTCAAGGGAGAAAAAAATTATCAAGTTCGCATTTCGGCTCTGCGGTAATGCATCAGATCTCCGGATTTGTGGAGTCTCTAGTTTGAGTTTTAAATGAGTCGCTTCAACTGTTTGATTTGCAAATTGCAGGAAACCCTGTTCTGGATATAAAATGAAGGAGATGACTGGCAGTGGAATTTTTGTTGGTAACGAAGGAGATGAAGAACTAGAATCTGGAAGCGCCAACCCTTCACAGAATAAAACAGGAATACGTATGTACCAGGTATCCATCGAACAAGATCATATTGGAGTTTTAACGGTCAACTGTGAGCTAGTTTGTGGTTTGGATTCTAAGGCAATTCCTTTCTAACTTCTATTCTTGACGAATTGGATCAAACCGCAGCAAACGTTGGCTGGAATTAGTCATATATCATTTGGTGAAGAAGGTAGTGTTTCTCCTAAAAAGCCTACTACTGTTCCTGAGGTTGCGAAGCAGCGTGAGTTGAGTGGGAACTTAGAAAGCGATGCCGATGCAAAGTTGAAGAAGCAGCTCTCTGATGCCAAATGCAAGGAGCTTAGTGGACATGACATATTCGCCCCTCCTCCTGAGATTTTGCCTCGACCAACAACTGCTCGTACTTTGGATTTAAAAGGAAGTATAGAGATTGGAGAGCCTGTTAATGTAAGTAGTACTCCCCTCAAATCTTTAGTTAAGTAGTTTTCAGGATGAAATCCTTGTCATTCTCATTCAAACACATCCATTGCACTCCCAAAAACGCCATTCACTCATTTTGCTTAGGATTTATATAATTTAGGTCCGGTTCGTTTGATAACCGTTTGGTTTTTGAAAATTATGTGTATGAGCATTACTTCCAGCGGTTGATTTCTTTGTTTCTTTATCTACCTTTTACATGTGTTATTGAAATCTAAGCTGAGTTTTGAAAATGTTGAAGTAGTTTCTAAAAACTTGTTTTTGGAATTTGGAGTTTGGTTAAGAAGTCAAATGTTTTGAAAGGTGACGACTATCGTAAAGAACGGTGAAAAAATAAGCACGACTTTTCAAAAACTGAAAATACAAAACTTAATAGTTGTCAATTGGGTCTTAGTGCAAGGTTCTTTACCATAGCTTGCTTCTTGTTGGTGCTTCATATAGCATGGATAGAAGGGTGGTGTATCATGTTAAGATTGACACAATTAACAATCCATCTGCTTAAGCTTTTGAGTTTAGTAGTGATTTAACATGGTATTAGAGCAAGAGATCTTGTGTTTGAACTCTTGTAATGAACATTACTCAGTCTGCATCATATTTTCTCGATGGGGTCAGAAACTCACAATGATGAAATCCTAGGATTTTAGTAGGTTATATACGTGCAACTAATCTATCACAAGTGTGGTGCATCCACTATTTTCTCTAATTTAATCGTTCATTTGCATCATATTTTTTTGATGGGGCCAAAAGCTCAGAATGATGAAATCCTAGGATTCCAGTTGGTTTAATATGCGTGCACAATCATCACTAATTTCTCTATGTGGGGTTACTTTTTCTGAATAATTAAGCATATGTCCCTATTTGATAGGTTGATGATATGGCTTCGGTTAGCTGGTAGTTGTTTGTTAGCTTTTATGCTTAGTTTTTTAGTTTACTTTTCTTTAGCTTTTATGCTTAGTTTTTTAGTTTACTTTTCTTCTAAGTGTTTTCTTCAATGCATATAATGTCTCATCTAATTAGTCTAGAGATCATCATCTTGAGCCTTTTTTTTAAACTCTTCCTTGTACCAATTAATCTAATTATCTCCTTTCTATTCTTGCACATGGGAAACCAAAACAGAGAAATGTGATCCCCGGAGAAGAACCTTCGGTAAAGACAGCAAAGAAGATTTACGACAAGAAGTTTTCGGAGTTATCAGGAAATGACATCTTCAAAGGCGACATTCCTCCATCTTCTGCAGAGAAACCGTTGAGCGTCGCAAAGTTGCGAGAGATGAGTGGCAACGACATATTTGCAGACGGGAAGGTAGAGACCCGAGACTACTTAGGCGGAGTACGCAAACCTCCGGGGGGAGAGAGCAGCATTGCGTTGGTTTAAATCATCTAGGTTTCTAACAAAACTCAATACTTTCTCTTGGCGGGGGAGATGATGGATAGATATGGTGTTATGGAAATTGTCATATGGTTTGGGTAAAATTATGTTATTAGGTTTTAACTTTGGTTAGTCAACAACAATGGAGTCTCTGCTAATTGGGGTTGTGTTCCATAATGTTTTTGTACTTTTGGTTGTGTTGTGTGTCTGTGTCTAGTAGTTCGGTTTTATTTTGGTTGTCTTCTGTCTATTTTACTTTCATCACTTGGGGTTTTCATTTATTTATCATTATATTGATTAATT

mRNA sequence

AAAGAAAAGAAAAAAGAAAAAAGAAACTTCCAGATCCCGAGGGCATTTCGGTCATTTGAGTTGACAACATCTCTCTGCCTCCACGGCACATTCGCACTGTATCTCTCGCTTCCTTCTTCTTCTTCCTTCCTTCGTCTTCTCCAGAGTAAAGAAGAAGCCATTAACAAAATACCAAATCAAAGCTCCAGAGTGCTTTCCAAGCAAACCCTAACCCTAACTCTTAACTCCCATTTCTTCATTTCTTGTGTCTTCCTCTCTTCGCCATCTCCCAAACAATCTCCCAATTCCTTTCTTCTCCAATGGACAGAACCACTCCTGTTCGGAAGCCTCACACTTCCACTGCAGATCTCCTCACTTGGCCTGAACTTCCTCCTGCCGATTCCCCTGCTCTTCCTTCTTCTGCTTCTCGCTCTGCTCCCAGGTCTCATCAGCCCTCCGATGGGATTAGTAAGGTCGTCTTTGGAGGCCAGGTTACGGACGAGGAGGTGGAGAGCTTGAACAAAAGGAAACCCTGTTCTGGATATAAAATGAAGGAGATGACTGGCAGTGGAATTTTTGTTGGTAACGAAGGAGATGAAGAACTAGAATCTGGAAGCGCCAACCCTTCACAGAATAAAACAGGAATACGTATGTACCAGGTATCCATCGAACAAGATCATATTGGAGTTTTAACGGTCAACTGTGAGCTACAAACGTTGGCTGGAATTAGTCATATATCATTTGGTGAAGAAGGTAGTGTTTCTCCTAAAAAGCCTACTACTGTTCCTGAGGTTGCGAAGCAGCGTGAGTTGAGTGGGAACTTAGAAAGCGATGCCGATGCAAAGTTGAAGAAGCAGCTCTCTGATGCCAAATGCAAGGAGCTTAGTGGACATGACATATTCGCCCCTCCTCCTGAGATTTTGCCTCGACCAACAACTGCTCGTACTTTGGATTTAAAAGGAAGTATAGAGATTGGAGAGCCTGTTAATAGAAATGTGATCCCCGGAGAAGAACCTTCGGTAAAGACAGCAAAGAAGATTTACGACAAGAAGTTTTCGGAGTTATCAGGAAATGACATCTTCAAAGGCGACATTCCTCCATCTTCTGCAGAGAAACCGTTGAGCGTCGCAAAGTTGCGAGAGATGAGTGGCAACGACATATTTGCAGACGGGAAGGTAGAGACCCGAGACTACTTAGGCGGAGTACGCAAACCTCCGGGGGGAGAGAGCAGCATTGCGTTGGTTTAAATCATCTAGGTTTCTAACAAAACTCAATACTTTCTCTTGGCGGGGGAGATGATGGATAGATATGGTGTTATGGAAATTGTCATATGGTTTGGGTAAAATTATGTTATTAGGTTTTAACTTTGGTTAGTCAACAACAATGGAGTCTCTGCTAATTGGGGTTGTGTTCCATAATGTTTTTGTACTTTTGGTTGTGTTGTGTGTCTGTGTCTAGTAGTTCGGTTTTATTTTGGTTGTCTTCTGTCTATTTTACTTTCATCACTTGGGGTTTTCATTTATTTATCATTATATTGATTAATT

Coding sequence (CDS)

ATGGACAGAACCACTCCTGTTCGGAAGCCTCACACTTCCACTGCAGATCTCCTCACTTGGCCTGAACTTCCTCCTGCCGATTCCCCTGCTCTTCCTTCTTCTGCTTCTCGCTCTGCTCCCAGGTCTCATCAGCCCTCCGATGGGATTAGTAAGGTCGTCTTTGGAGGCCAGGTTACGGACGAGGAGGTGGAGAGCTTGAACAAAAGGAAACCCTGTTCTGGATATAAAATGAAGGAGATGACTGGCAGTGGAATTTTTGTTGGTAACGAAGGAGATGAAGAACTAGAATCTGGAAGCGCCAACCCTTCACAGAATAAAACAGGAATACGTATGTACCAGGTATCCATCGAACAAGATCATATTGGAGTTTTAACGGTCAACTGTGAGCTACAAACGTTGGCTGGAATTAGTCATATATCATTTGGTGAAGAAGGTAGTGTTTCTCCTAAAAAGCCTACTACTGTTCCTGAGGTTGCGAAGCAGCGTGAGTTGAGTGGGAACTTAGAAAGCGATGCCGATGCAAAGTTGAAGAAGCAGCTCTCTGATGCCAAATGCAAGGAGCTTAGTGGACATGACATATTCGCCCCTCCTCCTGAGATTTTGCCTCGACCAACAACTGCTCGTACTTTGGATTTAAAAGGAAGTATAGAGATTGGAGAGCCTGTTAATAGAAATGTGATCCCCGGAGAAGAACCTTCGGTAAAGACAGCAAAGAAGATTTACGACAAGAAGTTTTCGGAGTTATCAGGAAATGACATCTTCAAAGGCGACATTCCTCCATCTTCTGCAGAGAAACCGTTGAGCGTCGCAAAGTTGCGAGAGATGAGTGGCAACGACATATTTGCAGACGGGAAGGTAGAGACCCGAGACTACTTAGGCGGAGTACGCAAACCTCCGGGGGGAGAGAGCAGCATTGCGTTGGTTTAA

Protein sequence

MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTDEEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDEELESGSANPSQNKTGIRMYQVSIEQDHIGVLTVNCELQTLAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDADAKLKKQLSDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEPVNRNVIPGEEPSVKTAKKIYDKKFSELSGNDIFKGDIPPSSAEKPLSVAKLREMSGNDIFADGKVETRDYLGGVRKPPGGESSIALV
Homology
BLAST of Cla97C08G153990 vs. NCBI nr
Match: XP_038884071.1 (uncharacterized protein LOC120075008 [Benincasa hispida])

HSP 1 Score: 557.4 bits (1435), Expect = 7.9e-155
Identity = 285/308 (92.53%), Postives = 288/308 (93.51%), Query Frame = 0

Query: 1   MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD 60
           MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD
Sbjct: 1   MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD 60

Query: 61  EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDEELESGSANPSQNKTGIRMYQVSIEQDH 120
           EEVESLNKRKPCSGYKMKEMTGSGIFV NEGDEELESGSANPSQNKTGIRMYQ       
Sbjct: 61  EEVESLNKRKPCSGYKMKEMTGSGIFVANEGDEELESGSANPSQNKTGIRMYQ------- 120

Query: 121 IGVLTVNCELQTLAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDADAKLKKQL 180
                     QTL GISHISFGEEGSVSPKKPT+VPEVAKQRELSGNLESDADAKLKKQL
Sbjct: 121 ----------QTLTGISHISFGEEGSVSPKKPTSVPEVAKQRELSGNLESDADAKLKKQL 180

Query: 181 SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEPVNRNVIPGEEPSVKTAKKI 240
           SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEP NRNVIPGEEPS+KTAKKI
Sbjct: 181 SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEPDNRNVIPGEEPSIKTAKKI 240

Query: 241 YDKKFSELSGNDIFKGDIPPSSAEKPLSVAKLREMSGNDIFADGKVETRDYLGGVRKPPG 300
           YDKKFSELSGNDIFKGD+PPSSAEKPLSVAKLREMSGNDIFADGKVETRDYLGGVRKPPG
Sbjct: 241 YDKKFSELSGNDIFKGDVPPSSAEKPLSVAKLREMSGNDIFADGKVETRDYLGGVRKPPG 291

Query: 301 GESSIALV 309
           GESSIALV
Sbjct: 301 GESSIALV 291

BLAST of Cla97C08G153990 vs. NCBI nr
Match: XP_004142997.1 (uncharacterized protein LOC101215119 [Cucumis sativus] >KGN62250.1 hypothetical protein Csa_018791 [Cucumis sativus])

HSP 1 Score: 557.0 bits (1434), Expect = 1.0e-154
Identity = 285/308 (92.53%), Postives = 288/308 (93.51%), Query Frame = 0

Query: 1   MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD 60
           MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD
Sbjct: 1   MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD 60

Query: 61  EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDEELESGSANPSQNKTGIRMYQVSIEQDH 120
           EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDEELESGSANPSQNKTGIRMYQ       
Sbjct: 61  EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDEELESGSANPSQNKTGIRMYQ------- 120

Query: 121 IGVLTVNCELQTLAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDADAKLKKQL 180
                     QTLAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDADAKLKKQL
Sbjct: 121 ----------QTLAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDADAKLKKQL 180

Query: 181 SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEPVNRNVIPGEEPSVKTAKKI 240
           SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEP +R +IPGEEPSVKTAKKI
Sbjct: 181 SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEPDSRGIIPGEEPSVKTAKKI 240

Query: 241 YDKKFSELSGNDIFKGDIPPSSAEKPLSVAKLREMSGNDIFADGKVETRDYLGGVRKPPG 300
           YDKKFSELSGNDIFKGD+PPSS EKPLSVAKLREMSGNDIFADGKVETRDYLGGVRKPPG
Sbjct: 241 YDKKFSELSGNDIFKGDVPPSSTEKPLSVAKLREMSGNDIFADGKVETRDYLGGVRKPPG 291

Query: 301 GESSIALV 309
           GESSIALV
Sbjct: 301 GESSIALV 291

BLAST of Cla97C08G153990 vs. NCBI nr
Match: XP_008445341.1 (PREDICTED: uncharacterized protein LOC103488402 [Cucumis melo] >KAA0064833.1 uncharacterized protein E6C27_scaffold82G001450 [Cucumis melo var. makuwa])

HSP 1 Score: 554.7 bits (1428), Expect = 5.1e-154
Identity = 284/308 (92.21%), Postives = 288/308 (93.51%), Query Frame = 0

Query: 1   MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD 60
           MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD
Sbjct: 1   MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD 60

Query: 61  EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDEELESGSANPSQNKTGIRMYQVSIEQDH 120
           EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDEELESGSANP QNKTGIRMYQ       
Sbjct: 61  EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDEELESGSANPLQNKTGIRMYQ------- 120

Query: 121 IGVLTVNCELQTLAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDADAKLKKQL 180
                     QTLAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDADAKLKKQL
Sbjct: 121 ----------QTLAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDADAKLKKQL 180

Query: 181 SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEPVNRNVIPGEEPSVKTAKKI 240
           SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEP +R++IPGEEPSVKTAKKI
Sbjct: 181 SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEPDSRSIIPGEEPSVKTAKKI 240

Query: 241 YDKKFSELSGNDIFKGDIPPSSAEKPLSVAKLREMSGNDIFADGKVETRDYLGGVRKPPG 300
           YDKKFSELSGNDIFKGD+PPSS EKPLSVAKLREMSGNDIFADGKVETRDYLGGVRKPPG
Sbjct: 241 YDKKFSELSGNDIFKGDVPPSSMEKPLSVAKLREMSGNDIFADGKVETRDYLGGVRKPPG 291

Query: 301 GESSIALV 309
           GESSIALV
Sbjct: 301 GESSIALV 291

BLAST of Cla97C08G153990 vs. NCBI nr
Match: XP_022132174.1 (uncharacterized protein LOC111005096 [Momordica charantia])

HSP 1 Score: 538.1 bits (1385), Expect = 4.9e-149
Identity = 276/309 (89.32%), Postives = 285/309 (92.23%), Query Frame = 0

Query: 1   MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD 60
           MDRTTPVRKPHTSTADLLTWPE+P ADSPALPSSA+RSAPRSHQPSDGISKVVFGGQVTD
Sbjct: 1   MDRTTPVRKPHTSTADLLTWPEVPAADSPALPSSATRSAPRSHQPSDGISKVVFGGQVTD 60

Query: 61  EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDEELESGSANPSQNKTGIRMYQVSIEQDH 120
           EEVE+LNKRKPCSGYKMKEMTGSGIFV NEGD+ELESGSANP+QNKTGIRMYQ       
Sbjct: 61  EEVETLNKRKPCSGYKMKEMTGSGIFVANEGDDELESGSANPAQNKTGIRMYQ------- 120

Query: 121 IGVLTVNCELQTLAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDADAKLKKQL 180
                     Q +AGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESD+DAKLKKQL
Sbjct: 121 ----------QAMAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDSDAKLKKQL 180

Query: 181 SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEPV-NRNVIPGEEPSVKTAKK 240
           SDAKCKELSGHDIFA PPEILPRPTTARTLDLKGSIEIGEP  NRN IPGEEPSVKTAKK
Sbjct: 181 SDAKCKELSGHDIFAAPPEILPRPTTARTLDLKGSIEIGEPAHNRNAIPGEEPSVKTAKK 240

Query: 241 IYDKKFSELSGNDIFKGDIPPSSAEKPLSVAKLREMSGNDIFADGKVETRDYLGGVRKPP 300
           IYDKKFSELSGNDIFKGD+PPSSAEKPLSVAKLREMSG+DIFADGKVETRDYLGGVRKPP
Sbjct: 241 IYDKKFSELSGNDIFKGDVPPSSAEKPLSVAKLREMSGSDIFADGKVETRDYLGGVRKPP 292

Query: 301 GGESSIALV 309
           GGESSIALV
Sbjct: 301 GGESSIALV 292

BLAST of Cla97C08G153990 vs. NCBI nr
Match: XP_022951840.1 (uncharacterized protein LOC111454570 [Cucurbita moschata] >XP_023002695.1 uncharacterized protein LOC111496478 [Cucurbita maxima] >XP_023537292.1 uncharacterized protein LOC111798406 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 534.3 bits (1375), Expect = 7.1e-148
Identity = 276/308 (89.61%), Postives = 281/308 (91.23%), Query Frame = 0

Query: 1   MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD 60
           MDR TPVRKPHTSTADLLTWPE+PPADS AL SSASRSAPRSHQPS GISKVVFGGQVTD
Sbjct: 1   MDRATPVRKPHTSTADLLTWPEVPPADSSALSSSASRSAPRSHQPSPGISKVVFGGQVTD 60

Query: 61  EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDEELESGSANPSQNKTGIRMYQVSIEQDH 120
           EEVESLNKRKPCSGYKMKEMTGSGIFV NE D+ELESGSANPSQNKTGIRMYQ       
Sbjct: 61  EEVESLNKRKPCSGYKMKEMTGSGIFVANEEDDELESGSANPSQNKTGIRMYQ------- 120

Query: 121 IGVLTVNCELQTLAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDADAKLKKQL 180
                     Q LAGISHISFGEEGSVSPKKPTT+PEVAKQRELSGNLESDADA LKKQL
Sbjct: 121 ----------QALAGISHISFGEEGSVSPKKPTTLPEVAKQRELSGNLESDADAMLKKQL 180

Query: 181 SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEPVNRNVIPGEEPSVKTAKKI 240
           SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEP N NVIPGEEPSVKTAKKI
Sbjct: 181 SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEPDNINVIPGEEPSVKTAKKI 240

Query: 241 YDKKFSELSGNDIFKGDIPPSSAEKPLSVAKLREMSGNDIFADGKVETRDYLGGVRKPPG 300
           YDKKFSELSGNDIFKGD+PPSSAEKPLSVAKLREMSG+DIFADGKVETRDYLGGVRKPPG
Sbjct: 241 YDKKFSELSGNDIFKGDVPPSSAEKPLSVAKLREMSGSDIFADGKVETRDYLGGVRKPPG 291

Query: 301 GESSIALV 309
           GESSIALV
Sbjct: 301 GESSIALV 291

BLAST of Cla97C08G153990 vs. ExPASy Swiss-Prot
Match: Q9SIE0 (DNA oxidative demethylase ALKBH2 OS=Arabidopsis thaliana OX=3702 GN=ALKBH2 PE=2 SV=2)

HSP 1 Score: 51.2 bits (121), Expect = 2.4e-05
Identity = 31/59 (52.54%), Postives = 40/59 (67.80%), Query Frame = 0

Query: 36 SRSAPRSHQP-SDGISKVVFGGQVTDEEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDE 94
          S +A RS+QP SDGIS     GQ+T+EE ESL  +K CSG+K+KE+T S  F  N  D+
Sbjct: 7  STAANRSNQPSSDGIS----DGQITNEEAESLINKKNCSGHKLKEVTDSDTFSDNGKDD 61

BLAST of Cla97C08G153990 vs. ExPASy TrEMBL
Match: A0A0A0LK08 (DUF4057 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G345910 PE=4 SV=1)

HSP 1 Score: 557.0 bits (1434), Expect = 5.0e-155
Identity = 285/308 (92.53%), Postives = 288/308 (93.51%), Query Frame = 0

Query: 1   MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD 60
           MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD
Sbjct: 1   MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD 60

Query: 61  EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDEELESGSANPSQNKTGIRMYQVSIEQDH 120
           EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDEELESGSANPSQNKTGIRMYQ       
Sbjct: 61  EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDEELESGSANPSQNKTGIRMYQ------- 120

Query: 121 IGVLTVNCELQTLAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDADAKLKKQL 180
                     QTLAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDADAKLKKQL
Sbjct: 121 ----------QTLAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDADAKLKKQL 180

Query: 181 SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEPVNRNVIPGEEPSVKTAKKI 240
           SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEP +R +IPGEEPSVKTAKKI
Sbjct: 181 SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEPDSRGIIPGEEPSVKTAKKI 240

Query: 241 YDKKFSELSGNDIFKGDIPPSSAEKPLSVAKLREMSGNDIFADGKVETRDYLGGVRKPPG 300
           YDKKFSELSGNDIFKGD+PPSS EKPLSVAKLREMSGNDIFADGKVETRDYLGGVRKPPG
Sbjct: 241 YDKKFSELSGNDIFKGDVPPSSTEKPLSVAKLREMSGNDIFADGKVETRDYLGGVRKPPG 291

Query: 301 GESSIALV 309
           GESSIALV
Sbjct: 301 GESSIALV 291

BLAST of Cla97C08G153990 vs. ExPASy TrEMBL
Match: A0A5A7V9J6 (DUF4057 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G001450 PE=4 SV=1)

HSP 1 Score: 554.7 bits (1428), Expect = 2.5e-154
Identity = 284/308 (92.21%), Postives = 288/308 (93.51%), Query Frame = 0

Query: 1   MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD 60
           MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD
Sbjct: 1   MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD 60

Query: 61  EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDEELESGSANPSQNKTGIRMYQVSIEQDH 120
           EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDEELESGSANP QNKTGIRMYQ       
Sbjct: 61  EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDEELESGSANPLQNKTGIRMYQ------- 120

Query: 121 IGVLTVNCELQTLAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDADAKLKKQL 180
                     QTLAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDADAKLKKQL
Sbjct: 121 ----------QTLAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDADAKLKKQL 180

Query: 181 SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEPVNRNVIPGEEPSVKTAKKI 240
           SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEP +R++IPGEEPSVKTAKKI
Sbjct: 181 SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEPDSRSIIPGEEPSVKTAKKI 240

Query: 241 YDKKFSELSGNDIFKGDIPPSSAEKPLSVAKLREMSGNDIFADGKVETRDYLGGVRKPPG 300
           YDKKFSELSGNDIFKGD+PPSS EKPLSVAKLREMSGNDIFADGKVETRDYLGGVRKPPG
Sbjct: 241 YDKKFSELSGNDIFKGDVPPSSMEKPLSVAKLREMSGNDIFADGKVETRDYLGGVRKPPG 291

Query: 301 GESSIALV 309
           GESSIALV
Sbjct: 301 GESSIALV 291

BLAST of Cla97C08G153990 vs. ExPASy TrEMBL
Match: A0A1S3BD79 (uncharacterized protein LOC103488402 OS=Cucumis melo OX=3656 GN=LOC103488402 PE=4 SV=1)

HSP 1 Score: 554.7 bits (1428), Expect = 2.5e-154
Identity = 284/308 (92.21%), Postives = 288/308 (93.51%), Query Frame = 0

Query: 1   MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD 60
           MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD
Sbjct: 1   MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD 60

Query: 61  EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDEELESGSANPSQNKTGIRMYQVSIEQDH 120
           EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDEELESGSANP QNKTGIRMYQ       
Sbjct: 61  EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDEELESGSANPLQNKTGIRMYQ------- 120

Query: 121 IGVLTVNCELQTLAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDADAKLKKQL 180
                     QTLAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDADAKLKKQL
Sbjct: 121 ----------QTLAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDADAKLKKQL 180

Query: 181 SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEPVNRNVIPGEEPSVKTAKKI 240
           SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEP +R++IPGEEPSVKTAKKI
Sbjct: 181 SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEPDSRSIIPGEEPSVKTAKKI 240

Query: 241 YDKKFSELSGNDIFKGDIPPSSAEKPLSVAKLREMSGNDIFADGKVETRDYLGGVRKPPG 300
           YDKKFSELSGNDIFKGD+PPSS EKPLSVAKLREMSGNDIFADGKVETRDYLGGVRKPPG
Sbjct: 241 YDKKFSELSGNDIFKGDVPPSSMEKPLSVAKLREMSGNDIFADGKVETRDYLGGVRKPPG 291

Query: 301 GESSIALV 309
           GESSIALV
Sbjct: 301 GESSIALV 291

BLAST of Cla97C08G153990 vs. ExPASy TrEMBL
Match: A0A6J1BSB8 (uncharacterized protein LOC111005096 OS=Momordica charantia OX=3673 GN=LOC111005096 PE=4 SV=1)

HSP 1 Score: 538.1 bits (1385), Expect = 2.4e-149
Identity = 276/309 (89.32%), Postives = 285/309 (92.23%), Query Frame = 0

Query: 1   MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD 60
           MDRTTPVRKPHTSTADLLTWPE+P ADSPALPSSA+RSAPRSHQPSDGISKVVFGGQVTD
Sbjct: 1   MDRTTPVRKPHTSTADLLTWPEVPAADSPALPSSATRSAPRSHQPSDGISKVVFGGQVTD 60

Query: 61  EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDEELESGSANPSQNKTGIRMYQVSIEQDH 120
           EEVE+LNKRKPCSGYKMKEMTGSGIFV NEGD+ELESGSANP+QNKTGIRMYQ       
Sbjct: 61  EEVETLNKRKPCSGYKMKEMTGSGIFVANEGDDELESGSANPAQNKTGIRMYQ------- 120

Query: 121 IGVLTVNCELQTLAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDADAKLKKQL 180
                     Q +AGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESD+DAKLKKQL
Sbjct: 121 ----------QAMAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDSDAKLKKQL 180

Query: 181 SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEPV-NRNVIPGEEPSVKTAKK 240
           SDAKCKELSGHDIFA PPEILPRPTTARTLDLKGSIEIGEP  NRN IPGEEPSVKTAKK
Sbjct: 181 SDAKCKELSGHDIFAAPPEILPRPTTARTLDLKGSIEIGEPAHNRNAIPGEEPSVKTAKK 240

Query: 241 IYDKKFSELSGNDIFKGDIPPSSAEKPLSVAKLREMSGNDIFADGKVETRDYLGGVRKPP 300
           IYDKKFSELSGNDIFKGD+PPSSAEKPLSVAKLREMSG+DIFADGKVETRDYLGGVRKPP
Sbjct: 241 IYDKKFSELSGNDIFKGDVPPSSAEKPLSVAKLREMSGSDIFADGKVETRDYLGGVRKPP 292

Query: 301 GGESSIALV 309
           GGESSIALV
Sbjct: 301 GGESSIALV 292

BLAST of Cla97C08G153990 vs. ExPASy TrEMBL
Match: A0A6J1KPP3 (uncharacterized protein LOC111496478 OS=Cucurbita maxima OX=3661 GN=LOC111496478 PE=4 SV=1)

HSP 1 Score: 534.3 bits (1375), Expect = 3.4e-148
Identity = 276/308 (89.61%), Postives = 281/308 (91.23%), Query Frame = 0

Query: 1   MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD 60
           MDR TPVRKPHTSTADLLTWPE+PPADS AL SSASRSAPRSHQPS GISKVVFGGQVTD
Sbjct: 1   MDRATPVRKPHTSTADLLTWPEVPPADSSALSSSASRSAPRSHQPSPGISKVVFGGQVTD 60

Query: 61  EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDEELESGSANPSQNKTGIRMYQVSIEQDH 120
           EEVESLNKRKPCSGYKMKEMTGSGIFV NE D+ELESGSANPSQNKTGIRMYQ       
Sbjct: 61  EEVESLNKRKPCSGYKMKEMTGSGIFVANEEDDELESGSANPSQNKTGIRMYQ------- 120

Query: 121 IGVLTVNCELQTLAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDADAKLKKQL 180
                     Q LAGISHISFGEEGSVSPKKPTT+PEVAKQRELSGNLESDADA LKKQL
Sbjct: 121 ----------QALAGISHISFGEEGSVSPKKPTTLPEVAKQRELSGNLESDADAMLKKQL 180

Query: 181 SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEPVNRNVIPGEEPSVKTAKKI 240
           SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEP N NVIPGEEPSVKTAKKI
Sbjct: 181 SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEPDNINVIPGEEPSVKTAKKI 240

Query: 241 YDKKFSELSGNDIFKGDIPPSSAEKPLSVAKLREMSGNDIFADGKVETRDYLGGVRKPPG 300
           YDKKFSELSGNDIFKGD+PPSSAEKPLSVAKLREMSG+DIFADGKVETRDYLGGVRKPPG
Sbjct: 241 YDKKFSELSGNDIFKGDVPPSSAEKPLSVAKLREMSGSDIFADGKVETRDYLGGVRKPPG 291

Query: 301 GESSIALV 309
           GESSIALV
Sbjct: 301 GESSIALV 291

BLAST of Cla97C08G153990 vs. TAIR 10
Match: AT1G78150.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G35780.1); Has 152 Blast hits to 146 proteins in 18 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 2; Plants - 149; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 361.3 bits (926), Expect = 7.7e-100
Identity = 200/308 (64.94%), Postives = 229/308 (74.35%), Query Frame = 0

Query: 1   MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD 60
           M+R+TPVRKPHTSTADLLTW E+PP DS   PSSASRSA RSHQPSDGISKVVFGGQVTD
Sbjct: 1   MERSTPVRKPHTSTADLLTWSEVPPPDS---PSSASRSAVRSHQPSDGISKVVFGGQVTD 60

Query: 61  EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDEELESGSANPSQNKTGIRMYQVSIEQDH 120
           EEVESLN+RKPCS +KMKE+TGSGIF  NE D+  E            + +YQ       
Sbjct: 61  EEVESLNRRKPCSEHKMKEITGSGIFSRNEKDDASEP-----------LPVYQ------- 120

Query: 121 IGVLTVNCELQTLAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDADAKLKKQL 180
                     Q + GIS ISFGEE ++SPKKP TVPEVAKQRELSG +E+++  KL+KQL
Sbjct: 121 ----------QAVNGISQISFGEEENLSPKKPATVPEVAKQRELSGTMENESANKLQKQL 180

Query: 181 SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEPVNRNVIPGEEPSVKTAKKI 240
           SDAK KE+SG +IFAPPPEI PR  T R L LK +  +G     +    E+ SVKTAKKI
Sbjct: 181 SDAKYKEISGQNIFAPPPEIKPRSGTNRALALKDNFNLGA---ESQTAEEDSSVKTAKKI 240

Query: 241 YDKKFSELSGNDIFKGDIPPSSAEKPLSVAKLREMSGNDIFADGKVETRDYLGGVRKPPG 300
           YDKKF+ELSGNDIFKGD   S+ EK LS AKL+E+ GN+IFADGKVE RDYLGGVRKPPG
Sbjct: 241 YDKKFAELSGNDIFKGDAASSNVEKHLSQAKLKEIGGNNIFADGKVEARDYLGGVRKPPG 274

Query: 301 GESSIALV 309
           GE+SIALV
Sbjct: 301 GETSIALV 274

BLAST of Cla97C08G153990 vs. TAIR 10
Match: AT1G78150.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G35780.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 361.3 bits (926), Expect = 7.7e-100
Identity = 200/308 (64.94%), Postives = 229/308 (74.35%), Query Frame = 0

Query: 1   MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD 60
           M+R+TPVRKPHTSTADLLTW E+PP DS   PSSASRSA RSHQPSDGISKVVFGGQVTD
Sbjct: 1   MERSTPVRKPHTSTADLLTWSEVPPPDS---PSSASRSAVRSHQPSDGISKVVFGGQVTD 60

Query: 61  EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDEELESGSANPSQNKTGIRMYQVSIEQDH 120
           EEVESLN+RKPCS +KMKE+TGSGIF  NE D+  E            + +YQ       
Sbjct: 61  EEVESLNRRKPCSEHKMKEITGSGIFSRNEKDDASEP-----------LPVYQ------- 120

Query: 121 IGVLTVNCELQTLAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDADAKLKKQL 180
                     Q + GIS ISFGEE ++SPKKP TVPEVAKQRELSG +E+++  KL+KQL
Sbjct: 121 ----------QAVNGISQISFGEEENLSPKKPATVPEVAKQRELSGTMENESANKLQKQL 180

Query: 181 SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEPVNRNVIPGEEPSVKTAKKI 240
           SDAK KE+SG +IFAPPPEI PR  T R L LK +  +G     +    E+ SVKTAKKI
Sbjct: 181 SDAKYKEISGQNIFAPPPEIKPRSGTNRALALKDNFNLGA---ESQTAEEDSSVKTAKKI 240

Query: 241 YDKKFSELSGNDIFKGDIPPSSAEKPLSVAKLREMSGNDIFADGKVETRDYLGGVRKPPG 300
           YDKKF+ELSGNDIFKGD   S+ EK LS AKL+E+ GN+IFADGKVE RDYLGGVRKPPG
Sbjct: 241 YDKKFAELSGNDIFKGDAASSNVEKHLSQAKLKEIGGNNIFADGKVEARDYLGGVRKPPG 274

Query: 301 GESSIALV 309
           GE+SIALV
Sbjct: 301 GETSIALV 274

BLAST of Cla97C08G153990 vs. TAIR 10
Match: AT1G78150.3 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G35780.1). )

HSP 1 Score: 345.9 bits (886), Expect = 3.3e-95
Identity = 200/337 (59.35%), Postives = 229/337 (67.95%), Query Frame = 0

Query: 1   MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD 60
           M+R+TPVRKPHTSTADLLTW E+PP DS   PSSASRSA RSHQPSDGISKVVFGGQVTD
Sbjct: 1   MERSTPVRKPHTSTADLLTWSEVPPPDS---PSSASRSAVRSHQPSDGISKVVFGGQVTD 60

Query: 61  EEVESLNK-----------------------------RKPCSGYKMKEMTGSGIFVGNEG 120
           EEVESLN+                             RKPCS +KMKE+TGSGIF  NE 
Sbjct: 61  EEVESLNRRILDDAFDSFMRLVIYTNVKTCENVYDVIRKPCSEHKMKEITGSGIFSRNEK 120

Query: 121 DEELESGSANPSQNKTGIRMYQVSIEQDHIGVLTVNCELQTLAGISHISFGEEGSVSPKK 180
           D+  E            + +YQ                 Q + GIS ISFGEE ++SPKK
Sbjct: 121 DDASEP-----------LPVYQ-----------------QAVNGISQISFGEEENLSPKK 180

Query: 181 PTTVPEVAKQRELSGNLESDADAKLKKQLSDAKCKELSGHDIFAPPPEILPRPTTARTLD 240
           P TVPEVAKQRELSG +E+++  KL+KQLSDAK KE+SG +IFAPPPEI PR  T R L 
Sbjct: 181 PATVPEVAKQRELSGTMENESANKLQKQLSDAKYKEISGQNIFAPPPEIKPRSGTNRALA 240

Query: 241 LKGSIEIGEPVNRNVIPGEEPSVKTAKKIYDKKFSELSGNDIFKGDIPPSSAEKPLSVAK 300
           LK +  +G     +    E+ SVKTAKKIYDKKF+ELSGNDIFKGD   S+ EK LS AK
Sbjct: 241 LKDNFNLGA---ESQTAEEDSSVKTAKKIYDKKFAELSGNDIFKGDAASSNVEKHLSQAK 300

Query: 301 LREMSGNDIFADGKVETRDYLGGVRKPPGGESSIALV 309
           L+E+ GN+IFADGKVE RDYLGGVRKPPGGE+SIALV
Sbjct: 301 LKEIGGNNIFADGKVEARDYLGGVRKPPGGETSIALV 303

BLAST of Cla97C08G153990 vs. TAIR 10
Match: AT1G35780.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G78150.2); Has 145 Blast hits to 144 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 145; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 342.8 bits (878), Expect = 2.8e-94
Identity = 195/311 (62.70%), Postives = 224/311 (72.03%), Query Frame = 0

Query: 1   MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD 60
           M++ TPVRKPH STADLLTWPE  P +SPA  S  SRSA RSHQPSDGISKVVFGGQVTD
Sbjct: 1   MEKNTPVRKPHMSTADLLTWPENQPFESPAAVS--SRSAARSHQPSDGISKVVFGGQVTD 60

Query: 61  EEVESLNKRKPCSGYKMKEMTGSGIFVGNEGDEELESGSANPSQNKTGIRMYQVSIEQDH 120
           EEVESLNKRKPCS YKMKE+TGSGIF   E +++ E  SAN + N       Q       
Sbjct: 61  EEVESLNKRKPCSNYKMKEITGSGIFSVYEENDDSELASANSATNGKSRTFQQ------- 120

Query: 121 IGVLTVNCELQTLAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDADAKLKKQL 180
                        A +SHISFGEE  V+PKKP TVPEVAKQRELSG LE  +DAKL KQ 
Sbjct: 121 ----------PPAAIMSHISFGEEEIVTPKKPATVPEVAKQRELSGTLEYQSDAKLNKQF 180

Query: 181 SDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEPVNRNVIPGEEPSVKTAKKI 240
           SDAKCKELSGH+IFAPPPEI  RP T R L  K + ++GE   +      +  +KTAKKI
Sbjct: 181 SDAKCKELSGHNIFAPPPEIKLRP-TVRALAYKDNFDLGESDTK-----PDGELKTAKKI 240

Query: 241 YDKKFSELSGNDIFKGDI-PPSS--AEKPLSVAKLREMSGNDIFADGKVETRDYLGGVRK 300
            D+KF++LSGN++FK D+  PSS  AE+ LS AKL+E+SGNDIFAD K ++RDY GGVRK
Sbjct: 241 ADRKFTDLSGNNVFKSDVSSPSSATAERLLSTAKLKEISGNDIFADAKAQSRDYFGGVRK 286

Query: 301 PPGGESSIALV 309
           PPGGESSIALV
Sbjct: 301 PPGGESSIALV 286

BLAST of Cla97C08G153990 vs. TAIR 10
Match: AT4G39860.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G22270.1); Has 152 Blast hits to 146 proteins in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 2; Plants - 146; Viruses - 0; Other Eukaryotes - 4 (source: NCBI BLink). )

HSP 1 Score: 335.1 bits (858), Expect = 5.9e-92
Identity = 188/329 (57.14%), Postives = 226/329 (68.69%), Query Frame = 0

Query: 1   MDRTTPVRKPHTSTADLLTWPELPPADSPALPSSASRSAPRSHQPSDGISKVVFGGQVTD 60
           M+R TPVR PHTSTADLL+W E PP      P  ++ SA RSHQPSDGISK++ GGQ+TD
Sbjct: 1   MERNTPVRNPHTSTADLLSWSETPPP-----PHHSTPSAARSHQPSDGISKILGGGQITD 60

Query: 61  EEVESLNKRKPCSGYKMKEMTGSGIF-----VGNEGDEELESGSANPSQNKTGIRMYQVS 120
           EE +SLNK K CSGYK+KEMTGSGIF     VG+E D      + +P   KTG+R YQ  
Sbjct: 61  EEAQSLNKLKNCSGYKLKEMTGSGIFTDKGKVGSESD-----ATTDP---KTGLRYYQ-- 120

Query: 121 IEQDHIGVLTVNCELQTLAGISHISFGEEGSVSPKKPTTVPEVAKQRELSGNLESDADAK 180
                          QTL G+S ISF  +G+VSPKKPTT+ EVAKQRELSGNL ++AD K
Sbjct: 121 ---------------QTLNGMSQISFSADGNVSPKKPTTLTEVAKQRELSGNLLTEADLK 180

Query: 181 LKKQLSDAKCKELSGHDIFAPPPEILPRPTTARTLDLKGSIEIGEPVNR----------- 240
             KQ+S AK +E+SGHDIFAPP EI PR   A   + +G+ ++GEP  R           
Sbjct: 181 SNKQISSAKIEEISGHDIFAPPSEIQPRSLVAAQQEARGNRDMGEPAPRNLRTSVKVSNP 240

Query: 241 -----NVIPGEEPSVKTAKKIYDKKFSELSGNDIFKGDIPPSSAEKPLSVAKLREMSGND 300
                N++  EEP VKT+KKI+++KF EL+GN IFKGD  P SA+K LS AKLREMSGN+
Sbjct: 241 AGGQSNILFSEEPVVKTSKKIHNQKFQELTGNGIFKGDESPGSADKQLSSAKLREMSGNN 299

Query: 301 IFADGKVETRDYLGGVRKPPGGESSIALV 309
           IFADGK E+RDY GGVRKPPGGESSI+LV
Sbjct: 301 IFADGKSESRDYFGGVRKPPGGESSISLV 299

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038884071.17.9e-15592.53uncharacterized protein LOC120075008 [Benincasa hispida][more]
XP_004142997.11.0e-15492.53uncharacterized protein LOC101215119 [Cucumis sativus] >KGN62250.1 hypothetical ... [more]
XP_008445341.15.1e-15492.21PREDICTED: uncharacterized protein LOC103488402 [Cucumis melo] >KAA0064833.1 unc... [more]
XP_022132174.14.9e-14989.32uncharacterized protein LOC111005096 [Momordica charantia][more]
XP_022951840.17.1e-14889.61uncharacterized protein LOC111454570 [Cucurbita moschata] >XP_023002695.1 unchar... [more]
Match NameE-valueIdentityDescription
Q9SIE02.4e-0552.54DNA oxidative demethylase ALKBH2 OS=Arabidopsis thaliana OX=3702 GN=ALKBH2 PE=2 ... [more]
Match NameE-valueIdentityDescription
A0A0A0LK085.0e-15592.53DUF4057 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G345910 PE=... [more]
A0A5A7V9J62.5e-15492.21DUF4057 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C2... [more]
A0A1S3BD792.5e-15492.21uncharacterized protein LOC103488402 OS=Cucumis melo OX=3656 GN=LOC103488402 PE=... [more]
A0A6J1BSB82.4e-14989.32uncharacterized protein LOC111005096 OS=Momordica charantia OX=3673 GN=LOC111005... [more]
A0A6J1KPP33.4e-14889.61uncharacterized protein LOC111496478 OS=Cucurbita maxima OX=3661 GN=LOC111496478... [more]
Match NameE-valueIdentityDescription
AT1G78150.17.7e-10064.94unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G78150.27.7e-10064.94unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G78150.33.3e-9559.35unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT1G35780.12.8e-9462.70unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G39860.15.9e-9257.14unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR025131Domain of unknown function DUF4057PFAMPF13266DUF4057coord: 3..306
e-value: 1.5E-138
score: 461.6
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..56
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 31..45
NoneNo IPR availablePANTHERPTHR31132N-LYSINE METHYLTRANSFERASEcoord: 1..308
NoneNo IPR availablePANTHERPTHR31132:SF13N-LYSINE METHYLTRANSFERASEcoord: 1..308

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C08G153990.2Cla97C08G153990.2mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0032259 methylation
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0008168 methyltransferase activity