Cla97C01G008000 (gene) Watermelon (97103) v2

NameCla97C01G008000
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionCytochrome c oxidase subunit 2
LocationCla97Chr01 : 8221036 .. 8226601 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCCTCTGTTTCTTCCTCATGCCTCTCTTCCCTCAATCCCATCATATCTTCTTCAAAACATTCCCTCTTCATCTCCAGAACCTCCAACATCCCACTTCCCTCCAAATCTTTGAAGTTCTCATCATCTCCAAACCCACCAAATCCTGAAACTCCTCCACCCAATTCACCTGAAATTGTCTCTGATGCCGCACCGTCTCCTGTCGATCCTGTCAAGCTCGCATTCCAGCGAGCCAAGGCCTACAAGAAATTTTCACAATCCGGCTCCAATTTGAATGTTGAGCTGAAACCAGGTGAGGGTTCTGAAGGCAACTCTGTCGGAACTGGTAAATCAGGTTTATTGAGTTTCGATGGGGCTGACGAGCAGAGGAAAATGCAGGGTGGGGTTGGAACTGCGATGGAAAATGCTAACGAAGTGAAGGGGGAAACGAGGGTTGTAACTGATGGCACGAAGGGTGCGGAAATTAACACCAATGAAGGTAAATTTTGTTAAAGTTGATTTGATTGAATGTCGAAATTACTCTGATATAGTTCAAAATTGTAATGCGGTATGGTTGCTACTTATGTATGGAGTGAGACAAGCTGTTTTTCTGAACTTGAAATTGGACGCTTTGTTAGAGAAATCTGATTTGTATAGTTTGTATATTATATATAGAGATGCAGGGAGGAATATGTTTCAAAAGTTTGGATTTATGTTGCCGGTTCGGGGATATCTGGTAAAATTGGAATGATACAAAGAAGATTAGCACGATGACAAGTACAAATCAAGAAATCGTCCAAATGTTTAAGTGCATATTTTGGAGTGTTTTTTGAAATTGTTAAAATAATTCTTGTCACGTTTAAAATTATTTTGAAACATGTCTTTAATCACTCAAAATTAATTTAATATTTAATTGTATAAATTTTTATATTGTAAAAAATGGTACTGAATGATTGAAAACTTGTTTTAAAGTGATTTTGAAAATTTAGAAGTGATTGTAACTATTTCAGAATTGCTTCCAAACATTCCCTAAAGAAAAGTTTGGATTTATGTTGTTATTTGAACATATATAGTTTGAGAGTTGTGTTTCCAATAGCTTTACTTTATATACCTGATGGGACTAGTAAATCTATACGGGGTTAATAAGTGCACGTAATTAGCAGACAAACACAACCCTGGTGCCTGAACTGAACCTTAGCCAACTTGTGATTCGTGAGCCTATCATCCCATTGATAGGTTGTGATTTTATTTTCATTTCTATGTAATTCAAGAGATAGACTATTTAGTTCTGAGTTGTTAATATTAGCTTGATTTATGTTCTCTATATGTTCTTATGATGGTCTACTATTCTTTTTCCTGTGCATTATTGTAAGGAATGAGTACTATAAGTAACTTGAGTTGATATAAATTGTTAAAGTGATCTTCAGGATTGAAGGGCAGAGAAGGGGAAAATTTGGGAAACAAACAAAAGGGTGATAAGAAGGGAGAGCTTTCTATCTCAAGCATTGATTTTATGGGGCTTGGCTTTGCGGATAAGAAAACGACCAGGGGACTGCCAGCTGGATTGGTCCCAATTGCAGACCCCTTTTCAGTAGCAGACTTGCCTGAAGTGGAAATAATTGTAGGTGATTCGAGCAAGTTTGACGATGCAACATCATCCGAAACTAAATCAACTCAGGAAGATGATTCTGATTTTTACAAACCCAAGGTTTCTACATGGGGGGTCTTTCCTAGACCAGGCAACATTTCAAAGACGGTAAAAGATCTACTTGTGCCTTTTCTAGTGTTTTGGGCAAGAAAGATTCCATTAGCCCCCTTTCTGTTAATTTTTTGTGGAAAATAATACGTGCTTTCATCCTTATTTTCAATAAATGATGTGTTAATCCAAGTAGGGAAGCCTAAATTCATGAACTAATTTTTCCATGCAAAACTTTTTGTCTTCTACATTTCTGAAACGTAAACTTGTATTGGGATACAAGGGTGATGACTATATAAAATCTGTGACAGATTGTCTATTGCATATCAGATGACTTCATATATCACTTGAAATTCCTGAACACATGTTAACCTTTTGACATCATAGATTGTCTTCACTAGTACTCTCACATGATGGAGAATGTGCTCTCTCCATCTCTAGATTCTTTTGGAATGTCATGAATATGATGGCTTTCACATCTTCAATATTGTGTGTGAAGGTGTGTTACACCTCAAGATGGAGTGTGCCCATAAGTTGGTTTATCAAGAAAAACAACAGCGTACCATTGTTCATAATTTGAAAATATTTACTAAAAGTACTGTATCATATATGCTGGAGGACTTGATAATGAGTGAATAATGTGGGTGCAGCATATGCATATACCATCACAATACGGAAAGTACTGTACATTCAACAATTCAACAAATACTAATAAAATTGTGACGACTTCTAAAGTGTTCATTGGCATATTTACTTCATCAATGATGACCAAAGCGAAAACTCCTTGAAAGGAAATTCTTCAATAATGGTGCAATTGGATTCTGATAGATTTCATTAATACATTTGTAGACGTATGGTATTCAATATCTTATCTAAGGCCTCATTTATGAAGGCACATATTAATTTCTTAATAACAAGAAAACACAAAATAGAACTATATTGCAATATCAATAGGCTTTTAAGAGGGCTTTCCCCCCTTGGACAATGTCCAAAATAATTTCCTCTCCTCTAAGATGCCTAAGTCTCTATTTATATTACCAAGTCACGCCTTGGTAAAAGCTACTACGCCCTTACTAATATTTCCTATAGATCCCCTATTAGTACTTTCGCAACACATTTTTTAAAATTCTTTTCTTTTCTATTCAGTTTGGAGGTGGAAGAACAATACGCCCTGGAGATGTGCTTGAAACAGATGAAGAAAAAGCTGCGAAGGAAGCACGCACAAAAGAACTGATTGCTGCATACAAGAAAAAATTTGGCTTGAGCATAGATGCGAAACTGAAGTCCGAATGTGAAGGGGTGAGCTTAATTAGAAACAAATATAATTAACTTTTATTTGTTATGTCTTTTTTCCATGACAAGAATAAACAAATATTATTAACTATTCTCCAAGCTTAAATATGCAATTTACAATGTAATTTGTTCGTTTTTAACATATGGAAGTTCTCTCCCTACTTCTAATAGTTAAACATATTCTTATCAGTTATCAGCATCTCATATTTACAGGCACTAGAGGAAGGCGATTCATTGATGAGTGTTGGAAAACTCAAGGAAGCCTTACCATATTATGAGACAATTATGGAAAAACTAAACTTCCAGGTAGAGTTTTTGTAATTTAAGTTTATGAACAGCTGAAAAGAAAAGATTTGGTAGAATGCAATATTGAATTCACTCAAATATCTTTACCCACCTTGCATAATGTGCGGTAATGAATTTTCTTCCTTTCCTTAATTTGAGAATAGGTATCGGGGACTTTTATTTATTTAAAACAGAGAGAAAAGAGTGCCTAGGCACCATACATAAGTTACCTAGCCAATTTGAGATGGCTAGTTATCTCCTACAAACATCTCTCTCAAGTCCCTCTTGACAAAATATATTCCATCCGGACCAACTAACTAACTAGTAACTACTGAGGAGTAAACATTTACGATAGGACCATCTAGCTGTCTTCCTAGATTCTCCTATCTAAATGAAGAACACCTTCAACAGGGGTCTCAATCTGGGGTGTTATTGCTTTATTGCAGATTATTAAGGTTTCTAAACTAAACCCTGAATAGATTAGATGTTTCTGATGTTAAAAATAAATATATGTATACAAATTAGATATAGTTATTCACATGCTTTGTTTCTTTCTTTTCTAACTATTTTTGTTTACGTGACAATAATGTTTGTGCAGAGTGAGCTTCATGGACTTGCTGCTTTGCAGTGGTCTATCTGTCAAGATTCCCTCAGTAGGTATGTTCATGGTCTTCGAATAAGCTTAAAAACTTCATAATGCTCGAAAAAAATAATCAATTCTTTCATGTATGGCTAAGCTTTAAGTAAAGATCAGCTCCGCATGAATATCTTTTTATGTTCTCTCTCTGATTTTGATCACGTCAATGTCCTTTGAAATGAATAATTGTTTATGTAATGGGGTTCATCAATTCTATATTTTGAATTGTTTTTCAAAAGTTTTTTTGAAAGGATATCTGAAAATGGAAGTTTGGAAATAAGGAAGTTTGGAAATAAGGGATTTAGTGATAATAGAGTTGTATTTCGTGTTTTCAAATATATGTTTGGTAGCAGGTTCGTCGGTGAATCTAAAAACTAGAAGCATAATCTAAATTTGCATATGCCAACATGCTTTAAATGTCTATTGGATTAATGAACTCACCAAATCAATAAAGAATGTGCCCTTCCAAAAAGTTCCAAACTTAGAATCTAGAGATATTTATTTGAGTGAAGATACGATTTGCTATCAATGAAACTGGTAATTTAAAGACTTATTGGTTGAATCTAAATTTTTGTGAGTTTATTATTTAATCCTTTGCTAAATTGAAAAAATAAAATCAAAGCTAACATGAATAAGAAGTTACTCAACGTAGTTCAATAATATATATATATATAATAGGAGTTAGATAAATGTGGTGATACTAAAGAAAGGAGGGGAACGATTATAAAACTGAAAAATTGAGAAAGAAACCGATTGTTATTATTATTATTTTCTGGTTTATGATATGACAACTACCATGCTTAACAGGCCAGATGAGGCTCGAGAAATGTATGAGAAGCTTCAGTCCCATCCAAATCCTCGAGTGAGCAAGAAAGCAAGGCAGTTCATGTTTAGTTTCCAGGTAATTAGCGACGGCGGTTTGTTGATAAACTTGTTTGGAATGACTTCAAATATAATACTATTTGTTTTTCAATGAATCAATGCTTGTACTGATCGATAAGGAAACTTTAAACTGGAAGGACATGGATGCTAATGCTTTAATGTTGTAATGACCACTGTGTAATCAAAACTCCTGTATTGTGTAATTGAGTGCTAAAATACAACTTAAGGCCTAGTATCTATATGGGCACAAGCTGGCCAACTAACCTAGAACTCTCAATAGATTACCCAACTTGGTTAAAGGATAACTAAAAAATAACCGAGAGTAAACTAAAGAAACAAGGAAGCAAACTTAAAGAGATCTAAAGAAAATACAAGATCTTTCAAGAAACTTAATAGGAGCCAAAAACATGAGAATGCTACACATGCTTATTCTAGCATCATAATTTGGGCGGTTTTCTTACTCTGACAACTTAATTAGTGATTTGAAGTTTGAAGTGTATCTTGTAACTTCATTCCTATTATTTCAAGGGATTCTTTCTGATGTTGTATGGATTTGGACCTTTTCCAATTCACAGGCAATGGAAATGATGAAGGTAACGACAAGATCGTCTTTCCTTTCCAATGACAGTAGCTACCGGAACTATTTTGAAGCTTTTCTTGAAAACAAGCTTAACTACTCTGCTGACGACTCCGGGATTGGGGAAGGTGTACTTAATCAATCTCTTCCTTATGTCATTTTTCTTCTTTCTCCTATTCTTCTAGTGCTACTTGCTGCTGTACAAAAGAGAATATAA

mRNA sequence

ATGGCCTCTGTTTCTTCCTCATGCCTCTCTTCCCTCAATCCCATCATATCTTCTTCAAAACATTCCCTCTTCATCTCCAGAACCTCCAACATCCCACTTCCCTCCAAATCTTTGAAGTTCTCATCATCTCCAAACCCACCAAATCCTGAAACTCCTCCACCCAATTCACCTGAAATTGTCTCTGATGCCGCACCGTCTCCTGTCGATCCTGTCAAGCTCGCATTCCAGCGAGCCAAGGCCTACAAGAAATTTTCACAATCCGGCTCCAATTTGAATGTTGAGCTGAAACCAGGTGAGGGTTCTGAAGGCAACTCTGTCGGAACTGGTAAATCAGGTTTATTGAGTTTCGATGGGGCTGACGAGCAGAGGAAAATGCAGGGTGGGGTTGGAACTGCGATGGAAAATGCTAACGAAGTGAAGGGGGAAACGAGGGTTGTAACTGATGGCACGAAGGGTGCGGAAATTAACACCAATGAAGGATTGAAGGGCAGAGAAGGGGAAAATTTGGGAAACAAACAAAAGGGTGATAAGAAGGGAGAGCTTTCTATCTCAAGCATTGATTTTATGGGGCTTGGCTTTGCGGATAAGAAAACGACCAGGGGACTGCCAGCTGGATTGGTCCCAATTGCAGACCCCTTTTCAGTAGCAGACTTGCCTGAAGTGGAAATAATTGTAGGTGATTCGAGCAAGTTTGACGATGCAACATCATCCGAAACTAAATCAACTCAGGAAGATGATTCTGATTTTTACAAACCCAAGGTTTCTACATGGGGGGTCTTTCCTAGACCAGGCAACATTTCAAAGACGTTTGGAGGTGGAAGAACAATACGCCCTGGAGATGTGCTTGAAACAGATGAAGAAAAAGCTGCGAAGGAAGCACGCACAAAAGAACTGATTGCTGCATACAAGAAAAAATTTGGCTTGAGCATAGATGCGAAACTGAAGTCCGAATGTGAAGGGGCACTAGAGGAAGGCGATTCATTGATGAGTGTTGGAAAACTCAAGGAAGCCTTACCATATTATGAGACAATTATGGAAAAACTAAACTTCCAGAGTGAGCTTCATGGACTTGCTGCTTTGCAGTGGTCTATCTGTCAAGATTCCCTCAGTAGGCCAGATGAGGCTCGAGAAATGTATGAGAAGCTTCAGTCCCATCCAAATCCTCGAGTGAGCAAGAAAGCAAGGCAGTTCATGTTTAGTTTCCAGGCAATGGAAATGATGAAGGTAACGACAAGATCGTCTTTCCTTTCCAATGACAGTAGCTACCGGAACTATTTTGAAGCTTTTCTTGAAAACAAGCTTAACTACTCTGCTGACGACTCCGGGATTGGGGAAGGTGTACTTAATCAATCTCTTCCTTATGTCATTTTTCTTCTTTCTCCTATTCTTCTAGTGCTACTTGCTGCTGTACAAAAGAGAATATAA

Coding sequence (CDS)

ATGGCCTCTGTTTCTTCCTCATGCCTCTCTTCCCTCAATCCCATCATATCTTCTTCAAAACATTCCCTCTTCATCTCCAGAACCTCCAACATCCCACTTCCCTCCAAATCTTTGAAGTTCTCATCATCTCCAAACCCACCAAATCCTGAAACTCCTCCACCCAATTCACCTGAAATTGTCTCTGATGCCGCACCGTCTCCTGTCGATCCTGTCAAGCTCGCATTCCAGCGAGCCAAGGCCTACAAGAAATTTTCACAATCCGGCTCCAATTTGAATGTTGAGCTGAAACCAGGTGAGGGTTCTGAAGGCAACTCTGTCGGAACTGGTAAATCAGGTTTATTGAGTTTCGATGGGGCTGACGAGCAGAGGAAAATGCAGGGTGGGGTTGGAACTGCGATGGAAAATGCTAACGAAGTGAAGGGGGAAACGAGGGTTGTAACTGATGGCACGAAGGGTGCGGAAATTAACACCAATGAAGGATTGAAGGGCAGAGAAGGGGAAAATTTGGGAAACAAACAAAAGGGTGATAAGAAGGGAGAGCTTTCTATCTCAAGCATTGATTTTATGGGGCTTGGCTTTGCGGATAAGAAAACGACCAGGGGACTGCCAGCTGGATTGGTCCCAATTGCAGACCCCTTTTCAGTAGCAGACTTGCCTGAAGTGGAAATAATTGTAGGTGATTCGAGCAAGTTTGACGATGCAACATCATCCGAAACTAAATCAACTCAGGAAGATGATTCTGATTTTTACAAACCCAAGGTTTCTACATGGGGGGTCTTTCCTAGACCAGGCAACATTTCAAAGACGTTTGGAGGTGGAAGAACAATACGCCCTGGAGATGTGCTTGAAACAGATGAAGAAAAAGCTGCGAAGGAAGCACGCACAAAAGAACTGATTGCTGCATACAAGAAAAAATTTGGCTTGAGCATAGATGCGAAACTGAAGTCCGAATGTGAAGGGGCACTAGAGGAAGGCGATTCATTGATGAGTGTTGGAAAACTCAAGGAAGCCTTACCATATTATGAGACAATTATGGAAAAACTAAACTTCCAGAGTGAGCTTCATGGACTTGCTGCTTTGCAGTGGTCTATCTGTCAAGATTCCCTCAGTAGGCCAGATGAGGCTCGAGAAATGTATGAGAAGCTTCAGTCCCATCCAAATCCTCGAGTGAGCAAGAAAGCAAGGCAGTTCATGTTTAGTTTCCAGGCAATGGAAATGATGAAGGTAACGACAAGATCGTCTTTCCTTTCCAATGACAGTAGCTACCGGAACTATTTTGAAGCTTTTCTTGAAAACAAGCTTAACTACTCTGCTGACGACTCCGGGATTGGGGAAGGTGTACTTAATCAATCTCTTCCTTATGTCATTTTTCTTCTTTCTCCTATTCTTCTAGTGCTACTTGCTGCTGTACAAAAGAGAATATAA

Protein sequence

MASVSSSCLSSLNPIISSSKHSLFISRTSNIPLPSKSLKFSSSPNPPNPETPPPNSPEIVSDAAPSPVDPVKLAFQRAKAYKKFSQSGSNLNVELKPGEGSEGNSVGTGKSGLLSFDGADEQRKMQGGVGTAMENANEVKGETRVVTDGTKGAEINTNEGLKGREGENLGNKQKGDKKGELSISSIDFMGLGFADKKTTRGLPAGLVPIADPFSVADLPEVEIIVGDSSKFDDATSSETKSTQEDDSDFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETDEEKAAKEARTKELIAAYKKKFGLSIDAKLKSECEGALEEGDSLMSVGKLKEALPYYETIMEKLNFQSELHGLAALQWSICQDSLSRPDEAREMYEKLQSHPNPRVSKKARQFMFSFQAMEMMKVTTRSSFLSNDSSYRNYFEAFLENKLNYSADDSGIGEGVLNQSLPYVIFLLSPILLVLLAAVQKRI
BLAST of Cla97C01G008000 vs. NCBI nr
Match: XP_008465728.1 (PREDICTED: uncharacterized protein LOC103503340 [Cucumis melo])

HSP 1 Score: 770.4 bits (1988), Expect = 3.6e-219
Identity = 418/474 (88.19%), Postives = 438/474 (92.41%), Query Frame = 0

Query: 1   MASVSSSCLSSLNPIISSSKHSLFISRTSNIPLPSKSLKFSSSPNPXXXXXXXXXXXEIV 60
           MASVS SCLSSLNPIISSSKHSL ISR S+ P PSKSLKFS SPNP     XXXXXX  +
Sbjct: 1   MASVSFSCLSSLNPIISSSKHSLLISRISDKPFPSKSLKFSLSPNPPNPETXXXXXXXXL 60

Query: 61  SDAAPSPVDPVKLAFQRAKAYKKFSQSGSNLNVELKPGEGSEGNSVGTGKSGLLSFDGAD 120
           SDAAP P+DPVKLAF+RAKAYKK S+SGSNLNVELKPG GSEGNSV TGK   LSFDGAD
Sbjct: 61  SDAAPPPLDPVKLAFERAKAYKKLSKSGSNLNVELKPGVGSEGNSVQTGK---LSFDGAD 120

Query: 121 EQRKMQGGVGTAMENANEVKGETRVVTDGTKGAEINTNEGLKGREGENLGNKQKGDKKGE 180
           EQRKMQGG+   +E A EVKGE +VVTDGTKG EINTNEGLK RE ENLGNKQKGDKKGE
Sbjct: 121 EQRKMQGGLRITVEGATEVKGEAKVVTDGTKGGEINTNEGLKDRERENLGNKQKGDKKGE 180

Query: 181 LSISSIDFMGLGFADKKTTRGLPAGLVPIADPFSVADLPEVEIIVGDSSKFDDATSSETK 240
           LSISSIDF+GLGFADK+ TRGLPAGLVPI+DPFSV DLPEVEIIVGDSSKFDDAT+S+ K
Sbjct: 181 LSISSIDFIGLGFADKRKTRGLPAGLVPISDPFSVEDLPEVEIIVGDSSKFDDATASKIK 240

Query: 241 STQEDDSDFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETDEEKAAKEARTKELIA 300
            TQEDDSD YKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETDEEKAAKEARTKELIA
Sbjct: 241 PTQEDDSDLYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETDEEKAAKEARTKELIA 300

Query: 301 AYKKKFGLSIDAKLKSECEGALEEGDSLMSVGKLKEALPYYETIMEKLNFQSELHGLAAL 360
           AYK+KFGL+IDAKLKSECE ALEEGDSLM+VGKLKEALPYYETIMEK+NFQSELHGLAAL
Sbjct: 301 AYKRKFGLTIDAKLKSECEVALEEGDSLMNVGKLKEALPYYETIMEKVNFQSELHGLAAL 360

Query: 361 QWSICQDSLSRPDEAREMYEKLQSHPNPRVSKKARQFMFSFQAMEMMKVTTRSSFLSNDS 420
           QWSICQDSLSRPDEAREMYEKLQSHPNPRVSKKARQF+FSFQAMEMMKVTTRSSFLSNDS
Sbjct: 361 QWSICQDSLSRPDEAREMYEKLQSHPNPRVSKKARQFVFSFQAMEMMKVTTRSSFLSNDS 420

Query: 421 SYRNYFEAFLENKLNYSADDSGIGEGVLNQSLPYVIFLLSPILLVLLAAVQKRI 475
           SY+NYFEAFLENKLNYSAD+SGIGEGVLNQSLPYVIFLLSPILLVL AAVQKRI
Sbjct: 421 SYQNYFEAFLENKLNYSADESGIGEGVLNQSLPYVIFLLSPILLVLFAAVQKRI 471

BLAST of Cla97C01G008000 vs. NCBI nr
Match: XP_004143815.1 (PREDICTED: uncharacterized protein LOC101215292 [Cucumis sativus] >KGN51107.1 hypothetical protein Csa_5G453150 [Cucumis sativus])

HSP 1 Score: 766.1 bits (1977), Expect = 6.7e-218
Identity = 422/475 (88.84%), Postives = 441/475 (92.84%), Query Frame = 0

Query: 1   MASVSSSCLSSLNPIISSSKHSLFISR-TSNIPLPSKSLKFSSSPNPXXXXXXXXXXXEI 60
           MASVSSSCLSSLNPIISS+KHSLFISR +SN P PSKSLKFSSSPNPXXXXXXXXXXX  
Sbjct: 1   MASVSSSCLSSLNPIISSTKHSLFISRISSNKPFPSKSLKFSSSPNPXXXXXXXXXXXXX 60

Query: 61  VSDAAPSPVDPVKLAFQRAKAYKKFSQSGSNLNVELKPGEGSEGNSVGTGKSGLLSFDGA 120
                  P+DPVKLAF+RAKAYKK S+SGSNLNVELKPG GSEGNSV TGKSG+LSFDGA
Sbjct: 61  XXXXXXPPLDPVKLAFERAKAYKKLSKSGSNLNVELKPGVGSEGNSVQTGKSGVLSFDGA 120

Query: 121 DEQRKMQGGVGTAMENANEVKGETRVVTDGTKGAEINTNEGLKGREGENLGNKQKGDKKG 180
           DEQRKMQGGV  A+E+ANEVKGE +VVTDGTKG  INTNEGL  R+G NLGNKQKGDKKG
Sbjct: 121 DEQRKMQGGVRVAVESANEVKGEAKVVTDGTKGGVINTNEGLNDRDGGNLGNKQKGDKKG 180

Query: 181 ELSISSIDFMGLGFADKKTTRGLPAGLVPIADPFSVADLPEVEIIVGDSSKFDDATSSET 240
           ELSISSIDF+GLGFADKK +RGLPAGLVPI+DPFSV DLPEVEIIVGDSSKFDDAT SE 
Sbjct: 181 ELSISSIDFIGLGFADKKKSRGLPAGLVPISDPFSVEDLPEVEIIVGDSSKFDDATVSEI 240

Query: 241 KSTQEDDSDFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETDEEKAAKEARTKELI 300
           K TQEDDSDFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETDEEKA KEARTKELI
Sbjct: 241 KPTQEDDSDFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETDEEKAVKEARTKELI 300

Query: 301 AAYKKKFGLSIDAKLKSECEGALEEGDSLMSVGKLKEALPYYETIMEKLNFQSELHGLAA 360
           AAYKKKFGL+IDAKLKSECE ALEEGDSLM+ GKLKEALPYYETIMEK+NFQSELHGLAA
Sbjct: 301 AAYKKKFGLTIDAKLKSECEMALEEGDSLMNDGKLKEALPYYETIMEKVNFQSELHGLAA 360

Query: 361 LQWSICQDSLSRPDEAREMYEKLQSHPNPRVSKKARQFMFSFQAMEMMKVTTRSSFLSND 420
           LQWSICQDSLSRPD AREMYEKL+SHPNPRVSKKARQFMFSFQAMEMMKVTT SSFLSND
Sbjct: 361 LQWSICQDSLSRPDVAREMYEKLKSHPNPRVSKKARQFMFSFQAMEMMKVTTSSSFLSND 420

Query: 421 SSYRNYFEAFLENKLNYSADDSGIGEGVLNQSLPYVIFLLSPILLVLLAAVQKRI 475
           SSYRNYFEAFL+NKLNYSAD+SGIGEGVLNQSLPYVIFLLSPILLVL AAVQKRI
Sbjct: 421 SSYRNYFEAFLDNKLNYSADESGIGEGVLNQSLPYVIFLLSPILLVLFAAVQKRI 475

BLAST of Cla97C01G008000 vs. NCBI nr
Match: XP_022157899.1 (uncharacterized protein LOC111024506 [Momordica charantia])

HSP 1 Score: 707.2 bits (1824), Expect = 3.7e-200
Identity = 389/474 (82.07%), Postives = 419/474 (88.40%), Query Frame = 0

Query: 1   MASVSSSCLSSLNPIISSSKHSLFISRTSNIPLPSKSLKFSSSPNPXXXXXXXXXXXEIV 60
           MASVS SCLSSL   IS SKHSLFI R+S  P PSKSLKFSSSPNPXXXXXXXXXXX IV
Sbjct: 1   MASVSPSCLSSLK-TISPSKHSLFIFRSSKQPFPSKSLKFSSSPNPXXXXXXXXXXXXIV 60

Query: 61  SDAAPSPVDPVKLAFQRAKAYKKFSQSGSNLNVELKPGEGSEGNSVGTGKSGLLSFDGAD 120
           SDAAP P+DPVKLAF+RAKAY+K +Q  SNL  E +PGEGSEGNSVGT +SGL +FDGAD
Sbjct: 61  SDAAPPPIDPVKLAFERAKAYRKLAQPNSNLKFEQEPGEGSEGNSVGTAQSGLSNFDGAD 120

Query: 121 EQRKMQGGVGTAMENANEVKGETRVVTDGTKGAEINTNEGLKGREGENLGNKQKGDKKGE 180
           EQRKMQGGV  AMENA+E KGETR   DG    E+ TN GLKGREG+ LGNK K DKKGE
Sbjct: 121 EQRKMQGGVEIAMENADEYKGETRDAIDGKNSGEVYTNPGLKGREGQKLGNKHKVDKKGE 180

Query: 181 LSISSIDFMGLGFADKKTTRGLPAGLVPIADPFSVADLPEVEIIVGDSSKFDDATSSETK 240
           LSIS+IDFMGLGFADKK TRG+PAGLVP+ADPFS  DLPEVEIIVGD S FDD  + ETK
Sbjct: 181 LSISNIDFMGLGFADKKKTRGMPAGLVPVADPFSTEDLPEVEIIVGDMSNFDDEPAMETK 240

Query: 241 STQEDDSDFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETDEEKAAKEARTKELIA 300
             QEDDSD YKPKVSTWGVFPRPGNISKTFGGGRTIRPG+VLETDEEK AKEART+ELIA
Sbjct: 241 LIQEDDSDLYKPKVSTWGVFPRPGNISKTFGGGRTIRPGEVLETDEEKTAKEARTRELIA 300

Query: 301 AYKKKFGLSIDAKLKSECEGALEEGDSLMSVGKLKEALPYYETIMEKLNFQSELHGLAAL 360
           AYKKKFGLSIDAKLKSECE AL+EGD LMSVGKL +ALPYYETIM+KLNFQSELHG+AA+
Sbjct: 301 AYKKKFGLSIDAKLKSECEEALKEGDLLMSVGKLSDALPYYETIMDKLNFQSELHGIAAM 360

Query: 361 QWSICQDSLSRPDEAREMYEKLQSHPNPRVSKKARQFMFSFQAMEMMKVTTRSSFLSNDS 420
           QWSICQDSL+R +EAREMYEKLQSHPNPRVSKKARQFMFSFQAMEMMKVTT SSFLSN+S
Sbjct: 361 QWSICQDSLNRSNEAREMYEKLQSHPNPRVSKKARQFMFSFQAMEMMKVTTSSSFLSNNS 420

Query: 421 SYRNYFEAFLENKLNYSADDSGIGEGVLNQSLPYVIFLLSPILLVLLAAVQKRI 475
            Y++YFEAFLENKL+ S  D+GIGEGVLNQSLPY+IFLLSPILLVLLAAVQKRI
Sbjct: 421 GYQSYFEAFLENKLDNSIKDTGIGEGVLNQSLPYIIFLLSPILLVLLAAVQKRI 473

BLAST of Cla97C01G008000 vs. NCBI nr
Match: XP_022990666.1 (uncharacterized protein LOC111487485 [Cucurbita maxima])

HSP 1 Score: 689.9 bits (1779), Expect = 6.1e-195
Identity = 375/474 (79.11%), Postives = 409/474 (86.29%), Query Frame = 0

Query: 1   MASVSSSCLSSLNPIISSSKHSLFISRTSNIPLPSKSLKFSSSPNPXXXXXXXXXXXEIV 60
           MASVSSSCL+SLNP ISSSKHSLFISR S    PS+SLKFSSSPNP           E V
Sbjct: 1   MASVSSSCLASLNP-ISSSKHSLFISRISTKKFPSRSLKFSSSPNPPSPDTPRSNSPETV 60

Query: 61  SDAAPSPVDPVKLAFQRAKAYKKFSQSGSNLNVELKPGEGSEGNSVGTGKSGLLSFDGAD 120
           SDAA  PVDPVK AF++A AYKK  QS SNL       EGSEGNSVG GK GL SFDG D
Sbjct: 61  SDAAQPPVDPVKAAFEQAMAYKKLKQSDSNLT----KVEGSEGNSVGAGKLGLSSFDGDD 120

Query: 121 EQRKMQGGVGTAMENANEVKGETRVVTDGTKGAEINTNEGLKGREGENLGNKQKGDKKGE 180
           EQR+MQGGV   MENANEVK ETR V DGT   EINT+ GLKG+E ENLGNKQKGDKKG 
Sbjct: 121 EQRRMQGGVMIVMENANEVKEETRGVIDGTNSEEINTSAGLKGKESENLGNKQKGDKKGG 180

Query: 181 LSISSIDFMGLGFADKKTTRGLPAGLVPIADPFSVADLPEVEIIVGDSSKFDDATSSETK 240
           LSISSIDF+GLGFADKK TRGLPAGLVP+ADPFS  DLPEVEIIVGD+SKF+ AT+SE+K
Sbjct: 181 LSISSIDFVGLGFADKKKTRGLPAGLVPMADPFSGEDLPEVEIIVGDTSKFNAATASESK 240

Query: 241 STQEDDSDFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETDEEKAAKEARTKELIA 300
            TQEDDSD YKPKVSTWGVFPRPGNISKTFGGGRTIRPG++LETDEEKAAKEAR++ELIA
Sbjct: 241 PTQEDDSDIYKPKVSTWGVFPRPGNISKTFGGGRTIRPGELLETDEEKAAKEARSRELIA 300

Query: 301 AYKKKFGLSIDAKLKSECEGALEEGDSLMSVGKLKEALPYYETIMEKLNFQSELHGLAAL 360
           AYKKKFGL+ID KLKSECE AL+EGDSLM++G+L+EALPYYE+IM+KL FQSELHG+AAL
Sbjct: 301 AYKKKFGLTIDPKLKSECEVALKEGDSLMNIGRLEEALPYYESIMDKLIFQSELHGVAAL 360

Query: 361 QWSICQDSLSRPDEAREMYEKLQSHPNPRVSKKARQFMFSFQAMEMMKVTTRSSFLSNDS 420
           QWSICQDSL R DEAREMYEKLQSHP PRVSKKARQF+FSFQAMEMMKVTT S F+SNDS
Sbjct: 361 QWSICQDSLRRSDEAREMYEKLQSHPTPRVSKKARQFVFSFQAMEMMKVTTSSYFVSNDS 420

Query: 421 SYRNYFEAFLENKLNYSADDSGIGEGVLNQSLPYVIFLLSPILLVLLAAVQKRI 475
           +Y+NYFEAFL+NK  YS ++SGIGEGVLNQSLPYVIFLLSPILLVLLAA+QKRI
Sbjct: 421 NYQNYFEAFLDNKPTYSTEESGIGEGVLNQSLPYVIFLLSPILLVLLAALQKRI 469

BLAST of Cla97C01G008000 vs. NCBI nr
Match: XP_022923269.1 (uncharacterized protein LOC111431010 isoform X1 [Cucurbita moschata])

HSP 1 Score: 680.2 bits (1754), Expect = 4.9e-192
Identity = 371/474 (78.27%), Postives = 404/474 (85.23%), Query Frame = 0

Query: 1   MASVSSSCLSSLNPIISSSKHSLFISRTSNIPLPSKSLKFSSSPNPXXXXXXXXXXXEIV 60
           MASVSSSCL+SLNP ISSSK SLFISR S    PS+SLKFS SPNP           E V
Sbjct: 1   MASVSSSCLASLNP-ISSSKRSLFISRISTKKFPSRSLKFSLSPNPPNPETPRSNSPETV 60

Query: 61  SDAAPSPVDPVKLAFQRAKAYKKFSQSGSNLNVELKPGEGSEGNSVGTGKSGLLSFDGAD 120
           SDA  SPVDPVK AF+RA AYK+  QS SNL       EG EGNSV  GK GL SFDG  
Sbjct: 61  SDAVQSPVDPVKAAFERAMAYKELKQSDSNLT----KVEGCEGNSVEAGKLGLSSFDGDG 120

Query: 121 EQRKMQGGVGTAMENANEVKGETRVVTDGTKGAEINTNEGLKGREGENLGNKQKGDKKGE 180
           EQR+MQGGV   MENANEVK ETR V DGT   EINT+ GLK +E ENLG KQKGDKKG 
Sbjct: 121 EQRRMQGGVMIVMENANEVKEETRGVIDGTNSEEINTSAGLKSKESENLGKKQKGDKKGG 180

Query: 181 LSISSIDFMGLGFADKKTTRGLPAGLVPIADPFSVADLPEVEIIVGDSSKFDDATSSETK 240
           LSISSIDF+GLGFADKK TRGLPAGLVP+ADPFS  DLPEVEIIVGD+SKFD AT+SE+K
Sbjct: 181 LSISSIDFVGLGFADKKKTRGLPAGLVPMADPFSGDDLPEVEIIVGDTSKFDAATASESK 240

Query: 241 STQEDDSDFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETDEEKAAKEARTKELIA 300
            TQEDDSD YKPKVSTWGVFPRPGNISKTFGGGRTIRPG++LETDEEKAA+EAR++ELIA
Sbjct: 241 PTQEDDSDIYKPKVSTWGVFPRPGNISKTFGGGRTIRPGELLETDEEKAAREARSRELIA 300

Query: 301 AYKKKFGLSIDAKLKSECEGALEEGDSLMSVGKLKEALPYYETIMEKLNFQSELHGLAAL 360
           AYKKKFGL+ID KLKSECE AL+EGDSLM+VG+LKEALPYYE++M+KLNFQSELHG+AAL
Sbjct: 301 AYKKKFGLTIDPKLKSECEVALKEGDSLMNVGRLKEALPYYESVMDKLNFQSELHGVAAL 360

Query: 361 QWSICQDSLSRPDEAREMYEKLQSHPNPRVSKKARQFMFSFQAMEMMKVTTRSSFLSNDS 420
           QWSICQDSL R DEAREMYEKLQSHP PRVSKKARQF+FSFQAMEMMKVTT SSF+SNDS
Sbjct: 361 QWSICQDSLRRSDEAREMYEKLQSHPTPRVSKKARQFVFSFQAMEMMKVTTSSSFVSNDS 420

Query: 421 SYRNYFEAFLENKLNYSADDSGIGEGVLNQSLPYVIFLLSPILLVLLAAVQKRI 475
           +Y+NYFEAF+ENK  YS ++SGIGEGVLNQSLPYVIFLLSPILLVLLAA+QKRI
Sbjct: 421 NYQNYFEAFVENKPTYSTEESGIGEGVLNQSLPYVIFLLSPILLVLLAALQKRI 469

BLAST of Cla97C01G008000 vs. TrEMBL
Match: tr|A0A1S3CPJ2|A0A1S3CPJ2_CUCME (uncharacterized protein LOC103503340 OS=Cucumis melo OX=3656 GN=LOC103503340 PE=4 SV=1)

HSP 1 Score: 770.4 bits (1988), Expect = 2.4e-219
Identity = 418/474 (88.19%), Postives = 438/474 (92.41%), Query Frame = 0

Query: 1   MASVSSSCLSSLNPIISSSKHSLFISRTSNIPLPSKSLKFSSSPNPXXXXXXXXXXXEIV 60
           MASVS SCLSSLNPIISSSKHSL ISR S+ P PSKSLKFS SPNP     XXXXXX  +
Sbjct: 1   MASVSFSCLSSLNPIISSSKHSLLISRISDKPFPSKSLKFSLSPNPPNPETXXXXXXXXL 60

Query: 61  SDAAPSPVDPVKLAFQRAKAYKKFSQSGSNLNVELKPGEGSEGNSVGTGKSGLLSFDGAD 120
           SDAAP P+DPVKLAF+RAKAYKK S+SGSNLNVELKPG GSEGNSV TGK   LSFDGAD
Sbjct: 61  SDAAPPPLDPVKLAFERAKAYKKLSKSGSNLNVELKPGVGSEGNSVQTGK---LSFDGAD 120

Query: 121 EQRKMQGGVGTAMENANEVKGETRVVTDGTKGAEINTNEGLKGREGENLGNKQKGDKKGE 180
           EQRKMQGG+   +E A EVKGE +VVTDGTKG EINTNEGLK RE ENLGNKQKGDKKGE
Sbjct: 121 EQRKMQGGLRITVEGATEVKGEAKVVTDGTKGGEINTNEGLKDRERENLGNKQKGDKKGE 180

Query: 181 LSISSIDFMGLGFADKKTTRGLPAGLVPIADPFSVADLPEVEIIVGDSSKFDDATSSETK 240
           LSISSIDF+GLGFADK+ TRGLPAGLVPI+DPFSV DLPEVEIIVGDSSKFDDAT+S+ K
Sbjct: 181 LSISSIDFIGLGFADKRKTRGLPAGLVPISDPFSVEDLPEVEIIVGDSSKFDDATASKIK 240

Query: 241 STQEDDSDFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETDEEKAAKEARTKELIA 300
            TQEDDSD YKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETDEEKAAKEARTKELIA
Sbjct: 241 PTQEDDSDLYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETDEEKAAKEARTKELIA 300

Query: 301 AYKKKFGLSIDAKLKSECEGALEEGDSLMSVGKLKEALPYYETIMEKLNFQSELHGLAAL 360
           AYK+KFGL+IDAKLKSECE ALEEGDSLM+VGKLKEALPYYETIMEK+NFQSELHGLAAL
Sbjct: 301 AYKRKFGLTIDAKLKSECEVALEEGDSLMNVGKLKEALPYYETIMEKVNFQSELHGLAAL 360

Query: 361 QWSICQDSLSRPDEAREMYEKLQSHPNPRVSKKARQFMFSFQAMEMMKVTTRSSFLSNDS 420
           QWSICQDSLSRPDEAREMYEKLQSHPNPRVSKKARQF+FSFQAMEMMKVTTRSSFLSNDS
Sbjct: 361 QWSICQDSLSRPDEAREMYEKLQSHPNPRVSKKARQFVFSFQAMEMMKVTTRSSFLSNDS 420

Query: 421 SYRNYFEAFLENKLNYSADDSGIGEGVLNQSLPYVIFLLSPILLVLLAAVQKRI 475
           SY+NYFEAFLENKLNYSAD+SGIGEGVLNQSLPYVIFLLSPILLVL AAVQKRI
Sbjct: 421 SYQNYFEAFLENKLNYSADESGIGEGVLNQSLPYVIFLLSPILLVLFAAVQKRI 471

BLAST of Cla97C01G008000 vs. TrEMBL
Match: tr|A0A0A0KRT2|A0A0A0KRT2_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G453150 PE=4 SV=1)

HSP 1 Score: 766.1 bits (1977), Expect = 4.5e-218
Identity = 422/475 (88.84%), Postives = 441/475 (92.84%), Query Frame = 0

Query: 1   MASVSSSCLSSLNPIISSSKHSLFISR-TSNIPLPSKSLKFSSSPNPXXXXXXXXXXXEI 60
           MASVSSSCLSSLNPIISS+KHSLFISR +SN P PSKSLKFSSSPNPXXXXXXXXXXX  
Sbjct: 1   MASVSSSCLSSLNPIISSTKHSLFISRISSNKPFPSKSLKFSSSPNPXXXXXXXXXXXXX 60

Query: 61  VSDAAPSPVDPVKLAFQRAKAYKKFSQSGSNLNVELKPGEGSEGNSVGTGKSGLLSFDGA 120
                  P+DPVKLAF+RAKAYKK S+SGSNLNVELKPG GSEGNSV TGKSG+LSFDGA
Sbjct: 61  XXXXXXPPLDPVKLAFERAKAYKKLSKSGSNLNVELKPGVGSEGNSVQTGKSGVLSFDGA 120

Query: 121 DEQRKMQGGVGTAMENANEVKGETRVVTDGTKGAEINTNEGLKGREGENLGNKQKGDKKG 180
           DEQRKMQGGV  A+E+ANEVKGE +VVTDGTKG  INTNEGL  R+G NLGNKQKGDKKG
Sbjct: 121 DEQRKMQGGVRVAVESANEVKGEAKVVTDGTKGGVINTNEGLNDRDGGNLGNKQKGDKKG 180

Query: 181 ELSISSIDFMGLGFADKKTTRGLPAGLVPIADPFSVADLPEVEIIVGDSSKFDDATSSET 240
           ELSISSIDF+GLGFADKK +RGLPAGLVPI+DPFSV DLPEVEIIVGDSSKFDDAT SE 
Sbjct: 181 ELSISSIDFIGLGFADKKKSRGLPAGLVPISDPFSVEDLPEVEIIVGDSSKFDDATVSEI 240

Query: 241 KSTQEDDSDFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETDEEKAAKEARTKELI 300
           K TQEDDSDFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETDEEKA KEARTKELI
Sbjct: 241 KPTQEDDSDFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETDEEKAVKEARTKELI 300

Query: 301 AAYKKKFGLSIDAKLKSECEGALEEGDSLMSVGKLKEALPYYETIMEKLNFQSELHGLAA 360
           AAYKKKFGL+IDAKLKSECE ALEEGDSLM+ GKLKEALPYYETIMEK+NFQSELHGLAA
Sbjct: 301 AAYKKKFGLTIDAKLKSECEMALEEGDSLMNDGKLKEALPYYETIMEKVNFQSELHGLAA 360

Query: 361 LQWSICQDSLSRPDEAREMYEKLQSHPNPRVSKKARQFMFSFQAMEMMKVTTRSSFLSND 420
           LQWSICQDSLSRPD AREMYEKL+SHPNPRVSKKARQFMFSFQAMEMMKVTT SSFLSND
Sbjct: 361 LQWSICQDSLSRPDVAREMYEKLKSHPNPRVSKKARQFMFSFQAMEMMKVTTSSSFLSND 420

Query: 421 SSYRNYFEAFLENKLNYSADDSGIGEGVLNQSLPYVIFLLSPILLVLLAAVQKRI 475
           SSYRNYFEAFL+NKLNYSAD+SGIGEGVLNQSLPYVIFLLSPILLVL AAVQKRI
Sbjct: 421 SSYRNYFEAFLDNKLNYSADESGIGEGVLNQSLPYVIFLLSPILLVLFAAVQKRI 475

BLAST of Cla97C01G008000 vs. TrEMBL
Match: tr|A0A2I4DKZ7|A0A2I4DKZ7_9ROSI (uncharacterized protein LOC108981190 isoform X1 OS=Juglans regia OX=51240 GN=LOC108981190 PE=4 SV=1)

HSP 1 Score: 462.6 bits (1189), Expect = 1.1e-126
Identity = 289/490 (58.98%), Postives = 344/490 (70.20%), Query Frame = 0

Query: 1   MASVSSSCLSSLNPI--ISSSKHSLFISRTSNIPLPS-KSLK--FS-SSPNPXXXXXXXX 60
           M  +  S LS+LN +  IS SK +LF   T+ I   S K LK  FS +S N XXXXXX  
Sbjct: 1   MGCLQPSWLSALNTMNTISPSKPTLFPPSTNFIISHSFKPLKQPFSLNSSNSXXXXXXPP 60

Query: 61  XXXEIVSDAAPSPVDPVKLAFQRAKAYKKFSQSGSNLNVELKPGEGSEGNSVGTGKSGLL 120
              E   DA    VDPVKLAF++AKAYKK  +S     VE  P E S       G  G+ 
Sbjct: 61  NSPESTPDAKLGAVDPVKLAFEKAKAYKKEVKSKLVSEVEQNPVEDS-------GVGGV- 120

Query: 121 SFDGADEQRKMQGGVGTAMENANEVKGETRVVTDGTKGAE-------INTNEGLKGREGE 180
                   R++   V  AME A E      VV+ GTKG E       I  + GL G +G 
Sbjct: 121 -------TREVPVSVKVAMEKAKEYNKSKGVVSSGTKGGEGDAIPSSIFESRGLNGGKGG 180

Query: 181 NLGN---KQKGDKKGELSISSIDFMGLGFADKKTTRGLPAGLVPIADPFSVADLPEVEII 240
           +LGN   ++  +KKGELS+SSIDFMGL FADKK++RGLP GLVP++D F   DL +VE I
Sbjct: 181 SLGNGIVEKTVNKKGELSVSSIDFMGLNFADKKSSRGLPPGLVPVSDSFPEEDLTDVEFI 240

Query: 241 VGDSSKFDDATSSETKSTQEDDSDFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLET 300
           VGD+SKF+D T+S+ +  QE DS+ YKPKVS+WGVFPRPGNIS+TFGGGR IRPG+VLET
Sbjct: 241 VGDTSKFEDTTASQLEQPQEHDSNLYKPKVSSWGVFPRPGNISRTFGGGRVIRPGEVLET 300

Query: 301 DEEKAAKEARTKELIAAYKKKFGLSIDAKLKSECEGALEEGDSLMSVGKLKEALPYYETI 360
            EEKAAKEART+EL+AAYK K GL ID KLKSEC+ AL++GDSLM++GKLKEALPYY+ +
Sbjct: 301 AEEKAAKEARTRELLAAYKSKTGLKIDPKLKSECQEALKDGDSLMNLGKLKEALPYYKKV 360

Query: 361 MEKLNFQSELHGLAALQWSICQDSLSRPDEAREMYEKLQSHPNPRVSKKARQFMFSFQAM 420
           MEKL FQSELHGLAALQWSICQDSLSRP+EAR MYEKLQSHPN +VSKKARQFMFSFQAM
Sbjct: 361 MEKLTFQSELHGLAALQWSICQDSLSRPNEARSMYEKLQSHPNVQVSKKARQFMFSFQAM 420

Query: 421 EMMKVTTRSSFLSNDSSYRNYFEAFLENKLNYSADDSGIGEGVLNQSLPYVIFLLSPILL 475
           EMMK TT + F   ++ Y+N+FEAF+E K NY        EG LNQ LPY+IFL+SPI  
Sbjct: 421 EMMKFTTSTPFYLKNAGYQNFFEAFIEKKSNYPLKKVEFEEGGLNQVLPYIIFLVSPIFA 475

BLAST of Cla97C01G008000 vs. TrEMBL
Match: tr|A0A251QA42|A0A251QA42_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_2G029800 PE=4 SV=1)

HSP 1 Score: 453.8 bits (1166), Expect = 4.9e-124
Identity = 282/478 (59.00%), Postives = 344/478 (71.97%), Query Frame = 0

Query: 1   MASVSSSCLSSLNPIISSSKHSLFISRTSNIPLPSKSLKFSSS---PNPXXXXXXXXXXX 60
           MAS+  S LSSLN I S++K +LF S   N P P K  K S S   PN XXXXXXXXXXX
Sbjct: 1   MASLQPSWLSSLNSISSTTKPTLFPSTNLNKPHPLKPFKLSFSLNPPNSXXXXXXXXXXX 60

Query: 61  EIVSDAAPSPVDPVKLAFQRAKAYKKFSQSGSNLNVELKPGEGSEGNSVGTGKSGLLSFD 120
               +A P P DPVKLA + AKAYKK  Q    L +E  P +  +G   G G+SG    D
Sbjct: 61  XXXXEAQPGPTDPVKLALENAKAYKKSVQMNKKLKIEKNPVKDGDG-IAGNGESGP---D 120

Query: 121 GADEQRK-MQGGVGTAMENANEVKGETRVVTDGTKGAEINTNEGLKGREGENLGNKQKGD 180
           GA   +K +   V  AME A E K    +V       E +   GL+   G NLGN +  D
Sbjct: 121 GAGGGKKEVPAAVKIAMEKAKEYKKSKGIVGGDINAGESDKISGLEESNGGNLGN-EIVD 180

Query: 181 KKGELSISSIDFMGLGFADKKTTRGLPAGLVPIADPFSVADLPEVEIIVGDSSKFDDATS 240
           KKG+LS+SSIDF+GLGFADKK  RGLPAGLVPIAD F   + P+VEIIVGD+  F DA +
Sbjct: 181 KKGKLSVSSIDFVGLGFADKKEGRGLPAGLVPIADYFPEGNSPDVEIIVGDARNF-DAVA 240

Query: 241 SETKSTQEDDSDFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETDEEKAAKEARTK 300
            + + TQ D+SD YKPKVS+WGVFPRP +ISKTFGGGR I+PG+VLET EEKAAKEART+
Sbjct: 241 RKPEQTQGDNSDLYKPKVSSWGVFPRPNDISKTFGGGRVIQPGEVLETAEEKAAKEARTR 300

Query: 301 ELIAAYKKKFGLSIDAKLKSECEGALEEGDSLMSVGKLKEALPYYETIMEKLNFQSELHG 360
           +L+AAYK K G++ID KL+SECE AL++GD+LM VG+LKEAL YYE +M+KL F+SELHG
Sbjct: 301 QLVAAYKSKMGMNIDPKLRSECEKALKDGDTLMDVGELKEALIYYEQVMDKLPFKSELHG 360

Query: 361 LAALQWSICQDSLSRPDEAREMYEKLQSHPNPRVSKKARQFMFSFQAMEMMKVTTRSSFL 420
           LAALQWSICQDSLSR  EA+ MYEKLQSHP  +VSKKARQF+FSFQAMEMMK+T  S + 
Sbjct: 361 LAALQWSICQDSLSRSQEAQVMYEKLQSHPTAKVSKKARQFVFSFQAMEMMKLTGSSPW- 420

Query: 421 SNDSSYRNYFEAFLENKLNYSADDSGIGEGVLNQSLPYVIFLLSPILLVLLAAVQKRI 475
             ++ ++NYFEAF+ENK +Y   ++    G L+Q+LPY+IFL+SPI +VLL A+QKRI
Sbjct: 421 -KNTGFQNYFEAFIENKSDYVLKEAESEVGTLSQTLPYIIFLVSPIFVVLLIALQKRI 470

BLAST of Cla97C01G008000 vs. TrEMBL
Match: tr|A0A1S2YNL5|A0A1S2YNL5_CICAR (uncharacterized protein LOC101489337 OS=Cicer arietinum OX=3827 GN=LOC101489337 PE=4 SV=1)

HSP 1 Score: 452.2 bits (1162), Expect = 1.4e-123
Identity = 254/415 (61.20%), Postives = 308/415 (74.22%), Query Frame = 0

Query: 67  PVDPVKLAFQRAKAYKKFSQSGSNLNVELKPGEGSEGNSVGTGKSGLLSFDGADEQRKMQ 126
           P DP+KLAF +AKAYK+  +S S+L +E    E  + NSV          DG   ++ + 
Sbjct: 60  PADPIKLAFSKAKAYKESIKSKSSLGIEQSGAE--KDNSVEVNN----VVDGG--RKDVP 119

Query: 127 GGVGTAMENAN---EVKGETRVVTDGTKGAEINTNEGLKGREGENLGNKQKGD---KKGE 186
             V  AME AN   ++KG          GA   T++GL+G     LG    G+   KKGE
Sbjct: 120 VSVKIAMEKANKYKQIKG----------GAVSETDQGLQGGSDSTLGEDVNGNSVGKKGE 179

Query: 187 LSISSIDFMGLGFADKKTTRGLPAGLVPIADPFSVADLPEVEIIVGDSSKFDDATSSETK 246
           LS+S +DF+GL FADKK TRGLP GLVPI+D FS  DLPEVE IVGD+++FDDATS++ +
Sbjct: 180 LSVSRMDFVGLEFADKKKTRGLPPGLVPISDSFSDDDLPEVEFIVGDANRFDDATSTQPE 239

Query: 247 STQEDDSDFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETDEEKAAKEARTKELIA 306
            T ED+S+ YKPKVSTWGVFPRPGNISKTFGGGR I PG++LET+EEKA KEARTK+++A
Sbjct: 240 QTNEDESELYKPKVSTWGVFPRPGNISKTFGGGRVINPGEILETEEEKAKKEARTKQMLA 299

Query: 307 AYKKKFGLSIDAKLKSECEGALEEGDSLMSVGKLKEALPYYETIMEKLNFQSELHGLAAL 366
           AYKKKFGL+ID KLKSECE  L++GD LM+ GKLKEALPYYET+M+KL  +SELHGLAAL
Sbjct: 300 AYKKKFGLNIDPKLKSECEEVLKDGDLLMNAGKLKEALPYYETVMDKLPLKSELHGLAAL 359

Query: 367 QWSICQDSLSRPDEAREMYEKLQSHPNPRVSKKARQFMFSFQAMEMMKVTTRSSFLSNDS 426
           QWSICQDSLSR +EAR MYEKLQSHP+P+V KKARQFM+SFQAMEMMKV T SS  S ++
Sbjct: 360 QWSICQDSLSRHNEARSMYEKLQSHPSPKVGKKARQFMYSFQAMEMMKVRTGSSRYSKNT 419

Query: 427 SYRNYFEAFLENKLNYS-ADDSGIGEGVLNQSLPYVIFLLSPILLVLLAAVQKRI 475
            Y+NYF+AF+E K NY   DD    E  +NQ L Y+IFL+SPI +VLL AVQKRI
Sbjct: 420 FYQNYFDAFIEKKSNYPLKDDVAAQESAMNQVLLYIIFLISPIFVVLLLAVQKRI 456

BLAST of Cla97C01G008000 vs. TAIR10
Match: AT2G38780.1 (unknown protein)

HSP 1 Score: 369.8 bits (948), Expect = 2.6e-102
Identity = 197/308 (63.96%), Postives = 238/308 (77.27%), Query Frame = 0

Query: 169 LGNKQKGD---KKGELSISSIDFMGLGFADKKTTRGLPAGLVPIADPFSVADLPEVEIIV 228
           L NK   D   KK EL +SSIDFMGLGFADKK+TRGLPAGLVP+ D     DLPEVE IV
Sbjct: 156 LANKVVEDNDVKKKELKVSSIDFMGLGFADKKSTRGLPAGLVPVVDYLPEGDLPEVEFIV 215

Query: 229 GDSSKFDDATSSETKSTQEDDSDFYKPKVSTWGVFPRPGNISKTFGGGRTIRPGDVLETD 288
           GD ++F +    E +   + +SD YKPKVSTWGVFPRP NISKTFGGGRT+RPGD +ET 
Sbjct: 216 GDKTRFAEKV-KEVEQEGDGNSDVYKPKVSTWGVFPRPSNISKTFGGGRTLRPGDSVETA 275

Query: 289 EEKAAKEARTKELIAAYKKKFGLSIDAKLKSECEGALEEGDSLMSVGKLKEALPYYETIM 348
           EE+  +E +TK+L+ AYK+  GL+ID KLK ECE A++EG+SLM  GKLKEALPYYE +M
Sbjct: 276 EERIVREEKTKKLLIAYKESLGLNIDPKLKLECEKAIDEGNSLMDSGKLKEALPYYEKVM 335

Query: 349 EKLNFQSELHGLAALQWSICQDSLSRPDEAREMYEKLQSHPNPRVSKKARQFMFSFQAME 408
           EK+ F+SELHGLAALQWSICQDSL + D+AR MYEKL SHPNP VSKKARQFMFSFQAME
Sbjct: 336 EKIVFKSELHGLAALQWSICQDSLRKTDKARRMYEKLISHPNPGVSKKARQFMFSFQAME 395

Query: 409 MMKVTTRSSFLSNDSSYRNYFEAFLENKLNYSADDSGIGEGV-LNQSLPYVIFLLSPILL 468
           M+KV   SSF   ++ Y++YFEAF+E+K NY A +   GE + +N++L YVI L SPIL+
Sbjct: 396 MLKV-KGSSFAEGNTGYQDYFEAFVEDKTNYKAQEEKEGEEMGINETLLYVILLASPILM 455

Query: 469 VLLAAVQK 473
           V + A Q+
Sbjct: 456 VFIVAAQR 461

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008465728.13.6e-21988.19PREDICTED: uncharacterized protein LOC103503340 [Cucumis melo][more]
XP_004143815.16.7e-21888.84PREDICTED: uncharacterized protein LOC101215292 [Cucumis sativus] >KGN51107.1 hy... [more]
XP_022157899.13.7e-20082.07uncharacterized protein LOC111024506 [Momordica charantia][more]
XP_022990666.16.1e-19579.11uncharacterized protein LOC111487485 [Cucurbita maxima][more]
XP_022923269.14.9e-19278.27uncharacterized protein LOC111431010 isoform X1 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
tr|A0A1S3CPJ2|A0A1S3CPJ2_CUCME2.4e-21988.19uncharacterized protein LOC103503340 OS=Cucumis melo OX=3656 GN=LOC103503340 PE=... [more]
tr|A0A0A0KRT2|A0A0A0KRT2_CUCSA4.5e-21888.84Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G453150 PE=4 SV=1[more]
tr|A0A2I4DKZ7|A0A2I4DKZ7_9ROSI1.1e-12658.98uncharacterized protein LOC108981190 isoform X1 OS=Juglans regia OX=51240 GN=LOC... [more]
tr|A0A251QA42|A0A251QA42_PRUPE4.9e-12459.00Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_2G029800 PE=4 SV=1[more]
tr|A0A1S2YNL5|A0A1S2YNL5_CICAR1.4e-12361.20uncharacterized protein LOC101489337 OS=Cicer arietinum OX=3827 GN=LOC101489337 ... [more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
AT2G38780.12.6e-10263.96unknown protein[more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0022900 electron transport chain
biological_process GO:1902600 hydrogen ion transmembrane transport
biological_process GO:0055114 oxidation-reduction process
biological_process GO:0044763 single-organism cellular process
biological_process GO:0008150 biological_process
biological_process GO:0044699 single-organism process
biological_process GO:0008152 metabolic process
cellular_component GO:0005575 cellular_component
cellular_component GO:0044425 membrane part
cellular_component GO:0070469 respiratory chain
cellular_component GO:0005739 mitochondrion
cellular_component GO:0005743 mitochondrial inner membrane
cellular_component GO:0016020 membrane
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0004129 cytochrome-c oxidase activity
molecular_function GO:0005507 copper ion binding
molecular_function GO:0003824 catalytic activity
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function
molecular_function GO:0046872 metal ion binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C01G008000.1Cla97C01G008000.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 286..402
e-value: 7.8E-5
score: 25.0
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 158..178
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 82..178
NoneNo IPR availablePANTHERPTHR35482FAMILY NOT NAMEDcoord: 6..472

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cla97C01G008000Silver-seed gourdcarwmbB0858
Cla97C01G008000Cucurbita maxima (Rimu)cmawmbB289
Cla97C01G008000Cucurbita moschata (Rifu)cmowmbB271
Cla97C01G008000Wax gourdwgowmbB054