Cla97C06G119440 (gene) Watermelon (97103) v2

NameCla97C06G119440
Typegene
OrganismCitrullus lanatus (Watermelon (97103) v2)
DescriptionPentatricopeptide repeat-containing protein, putative
LocationCla97Chr06 : 19361682 .. 19363121 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTCTCTTTCGAATCTCTTACCTCCGATCATCATCATTTCGTCTCAAGATCCCCGCGTTATCTACCTTGCAGCTAAGTACAGTCTCTTCTGCCGATTTATTCTATGGCCATCTGCAGAAAAACAACGGTAATGTGGAGAAAACCCTTGCTACTGTAAAGACCAAGTTGGATTCTGGATGTGTCAACCAAGTATTACATAAATGTTCCTTCGAACTATCTCAAATGGGTCTTAGATTTTTTATATGGGCTGGTCGACAGCCTAATTATAGGCATAGTTCTTTTATGTACAGTAGAGCTTGTGAAATGATTGGAATTCATAGAAGGCCAGGTTTGCTTTTTAATGTTATTGAAGATTATAGAAGGGAGGGTTGCCTTGTTGATATTAGGATTTTTAAGATTATGTTAAACTTGTGTAAAGAAGCTAAGCTTGCCAAAGAGGCTTTGTCCATTTTAGGGAAAATGCCCGAATTTCATTTGCGTGCTGACACTACAATGTATAATCTGGTTATAAGGTTATTTACTGAGAAGGGTGAGATGGATATGGCGATGGAGTTGATGAAAGAGATGGATTCAGTTGATATACATCCTAACATGATTACTTATATTTCTTTGCTTAAGGGATTCTGTGATGTGGGTCGTTGGGCGGATGCTTATGGGGTATTTGAGGCTATGAAGGAAAATGGATGTGAACCCAATACAGTGGTTTACTCCGTGCTACTTAATGCTGCCAGTCGGCATGGGACTATGGAAAAGCTAATGGAAGTGTTGGAGGAGATGGAAAAACAAGGGGGAACATGTGGTCCAAATACTGTCACATACACTTCCATAATCCAGCGTCTATGTGAACTAGGCCAGCCTCTGGAAGCATTGAAGATATTGGACAGAATGGAAGAGTATGGTTGTGCTCCAAATCGTGTTACAGTTAGCTCTTTAATCAAGGAATTTTGTAAAAATGGTCACGTGGAGGAGGCATATAAGTTGATTGATAGAGCTGTTGCAAGAGGTGGTGCTTCGTATGGTGATTGCTATAGCTCACTTGTGGTATCTTTGGTTAAGATGAAAAAGATTGCAGAAGCAGAGGAGCTATTTAGAAACATGTTAGCCAATGGGGTGAAGCCAGATGGTGTGGCTTGTACTCTCGTGATCAAGGAATTGTGCTTAGAGGAGCGAGTGCTAGATGGTTTTAACTTATGCAACGAAGTCGATAGGAATGGATATTTAACTTCCATTGACTCTGATGTTTATTCTATTCTTTTAGTTGGACTTTGTGAGCATGACCACTCTGTGGATGCTGCAAAACTTGCAAGGTTGATGCTTAAAAAAGGGATTCGTTTAAAGCCTTACTTTGCTGAAAGTATCATCAAACATCTAAAGAAATTTGGAGATCAAGAGTTAGTTATGAATTTGGGTGGAATAAGAAATGACAAGCAAAACTAA

mRNA sequence

ATGGCTCTCTTTCGAATCTCTTACCTCCGATCATCATCATTTCGTCTCAAGATCCCCGCGTTATCTACCTTGCAGCTAAGTACAGTCTCTTCTGCCGATTTATTCTATGGCCATCTGCAGAAAAACAACGGTAATGTGGAGAAAACCCTTGCTACTGTAAAGACCAAGTTGGATTCTGGATGTGTCAACCAAGTATTACATAAATGTTCCTTCGAACTATCTCAAATGGGTCTTAGATTTTTTATATGGGCTGGTCGACAGCCTAATTATAGGCATAGTTCTTTTATGTACAGTAGAGCTTGTGAAATGATTGGAATTCATAGAAGGCCAGGTTTGCTTTTTAATGTTATTGAAGATTATAGAAGGGAGGGTTGCCTTGTTGATATTAGGATTTTTAAGATTATGTTAAACTTGTGTAAAGAAGCTAAGCTTGCCAAAGAGGCTTTGTCCATTTTAGGGAAAATGCCCGAATTTCATTTGCGTGCTGACACTACAATGTATAATCTGGTTATAAGGTTATTTACTGAGAAGGGTGAGATGGATATGGCGATGGAGTTGATGAAAGAGATGGATTCAGTTGATATACATCCTAACATGATTACTTATATTTCTTTGCTTAAGGGATTCTGTGATGTGGGTCGTTGGGCGGATGCTTATGGGGTATTTGAGGCTATGAAGGAAAATGGATGTGAACCCAATACAGTGGTTTACTCCGTGCTACTTAATGCTGCCAGTCGGCATGGGACTATGGAAAAGCTAATGGAAGTGTTGGAGGAGATGGAAAAACAAGGGGGAACATGTGGTCCAAATACTGTCACATACACTTCCATAATCCAGCGTCTATGTGAACTAGGCCAGCCTCTGGAAGCATTGAAGATATTGGACAGAATGGAAGAGTATGGTTGTGCTCCAAATCGTGTTACAGTTAGCTCTTTAATCAAGGAATTTTGTAAAAATGGTCACGTGGAGGAGGCATATAAGTTGATTGATAGAGCTGTTGCAAGAGGTGGTGCTTCGTATGGTGATTGCTATAGCTCACTTGTGGTATCTTTGGTTAAGATGAAAAAGATTGCAGAAGCAGAGGAGCTATTTAGAAACATGTTAGCCAATGGGGTGAAGCCAGATGGTGTGGCTTGTACTCTCGTGATCAAGGAATTGTGCTTAGAGGAGCGAGTGCTAGATGGTTTTAACTTATGCAACGAAGTCGATAGGAATGGATATTTAACTTCCATTGACTCTGATGTTTATTCTATTCTTTTAGTTGGACTTTGTGAGCATGACCACTCTGTGGATGCTGCAAAACTTGCAAGGTTGATGCTTAAAAAAGGGATTCGTTTAAAGCCTTACTTTGCTGAAAGTATCATCAAACATCTAAAGAAATTTGGAGATCAAGAGTTAGTTATGAATTTGGGTGGAATAAGAAATGACAAGCAAAACTAA

Coding sequence (CDS)

ATGGCTCTCTTTCGAATCTCTTACCTCCGATCATCATCATTTCGTCTCAAGATCCCCGCGTTATCTACCTTGCAGCTAAGTACAGTCTCTTCTGCCGATTTATTCTATGGCCATCTGCAGAAAAACAACGGTAATGTGGAGAAAACCCTTGCTACTGTAAAGACCAAGTTGGATTCTGGATGTGTCAACCAAGTATTACATAAATGTTCCTTCGAACTATCTCAAATGGGTCTTAGATTTTTTATATGGGCTGGTCGACAGCCTAATTATAGGCATAGTTCTTTTATGTACAGTAGAGCTTGTGAAATGATTGGAATTCATAGAAGGCCAGGTTTGCTTTTTAATGTTATTGAAGATTATAGAAGGGAGGGTTGCCTTGTTGATATTAGGATTTTTAAGATTATGTTAAACTTGTGTAAAGAAGCTAAGCTTGCCAAAGAGGCTTTGTCCATTTTAGGGAAAATGCCCGAATTTCATTTGCGTGCTGACACTACAATGTATAATCTGGTTATAAGGTTATTTACTGAGAAGGGTGAGATGGATATGGCGATGGAGTTGATGAAAGAGATGGATTCAGTTGATATACATCCTAACATGATTACTTATATTTCTTTGCTTAAGGGATTCTGTGATGTGGGTCGTTGGGCGGATGCTTATGGGGTATTTGAGGCTATGAAGGAAAATGGATGTGAACCCAATACAGTGGTTTACTCCGTGCTACTTAATGCTGCCAGTCGGCATGGGACTATGGAAAAGCTAATGGAAGTGTTGGAGGAGATGGAAAAACAAGGGGGAACATGTGGTCCAAATACTGTCACATACACTTCCATAATCCAGCGTCTATGTGAACTAGGCCAGCCTCTGGAAGCATTGAAGATATTGGACAGAATGGAAGAGTATGGTTGTGCTCCAAATCGTGTTACAGTTAGCTCTTTAATCAAGGAATTTTGTAAAAATGGTCACGTGGAGGAGGCATATAAGTTGATTGATAGAGCTGTTGCAAGAGGTGGTGCTTCGTATGGTGATTGCTATAGCTCACTTGTGGTATCTTTGGTTAAGATGAAAAAGATTGCAGAAGCAGAGGAGCTATTTAGAAACATGTTAGCCAATGGGGTGAAGCCAGATGGTGTGGCTTGTACTCTCGTGATCAAGGAATTGTGCTTAGAGGAGCGAGTGCTAGATGGTTTTAACTTATGCAACGAAGTCGATAGGAATGGATATTTAACTTCCATTGACTCTGATGTTTATTCTATTCTTTTAGTTGGACTTTGTGAGCATGACCACTCTGTGGATGCTGCAAAACTTGCAAGGTTGATGCTTAAAAAAGGGATTCGTTTAAAGCCTTACTTTGCTGAAAGTATCATCAAACATCTAAAGAAATTTGGAGATCAAGAGTTAGTTATGAATTTGGGTGGAATAAGAAATGACAAGCAAAACTAA

Protein sequence

MALFRISYLRSSSFRLKIPALSTLQLSTVSSADLFYGHLQKNNGNVEKTLATVKTKLDSGCVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDYRREGCLVDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLRADTTMYNLVIRLFTEKGEMDMAMELMKEMDSVDIHPNMITYISLLKGFCDVGRWADAYGVFEAMKENGCEPNTVVYSVLLNAASRHGTMEKLMEVLEEMEKQGGTCGPNTVTYTSIIQRLCELGQPLEALKILDRMEEYGCAPNRVTVSSLIKEFCKNGHVEEAYKLIDRAVARGGASYGDCYSSLVVSLVKMKKIAEAEELFRNMLANGVKPDGVACTLVIKELCLEERVLDGFNLCNEVDRNGYLTSIDSDVYSILLVGLCEHDHSVDAAKLARLMLKKGIRLKPYFAESIIKHLKKFGDQELVMNLGGIRNDKQN
BLAST of Cla97C06G119440 vs. NCBI nr
Match: XP_008441677.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g47360 [Cucumis melo] >XP_008441678.1 PREDICTED: pentatricopeptide repeat-containing protein At5g47360 [Cucumis melo])

HSP 1 Score: 332.8 bits (852), Expect = 1.9e-87
Identity = 380/474 (80.17%), Postives = 398/474 (83.97%), Query Frame = 0

Query: 1   MALFRISYLRSSSFRLKIPALSTLQLSTVSSADLFYGHLQKNNGNVEKTLATVKTKLDSG 60
           MALFRISY RSSS  L I  LST  LST+SS+DLFY HL+KNNGNVEKTLATVKTKLDS 
Sbjct: 1   MALFRISYPRSSSILLNISTLSTFHLSTLSSSDLFYDHLEKNNGNVEKTLATVKTKLDSR 60

Query: 61  CVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDY 120
           CVN+VL+KCS ELSQMGLRFFIWAGRQPNYRH+SFMYSRACE+IGI+  P LLFNVIEDY
Sbjct: 61  CVNEVLYKCSSELSQMGLRFFIWAGRQPNYRHTSFMYSRACELIGINVSPCLLFNVIEDY 120

Query: 121 RREGCLVDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLRADTTXXXXXXXXXXXXXXX 180
           RREGCLVDIRIF+I+LNLCKEAKL KEALSIL KM EFHLR    XXXXXXXXXXXXXXX
Sbjct: 121 RREGCLVDIRIFQIILNLCKEAKLTKEALSILRKMSEFHLRXXXXXXXXXXXXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXXXXEMEKQGGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXY 300
           XXXXX             EMEKQGGT                                 Y
Sbjct: 241 XXXXXRLRIMDKLMEMLEEMEKQGGTCRPNTVTYTSIIQSLCEQGFLLEALKVLDRMEEY 300

Query: 301 GXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           G  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 GHAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDGFNLCNEVDRNGYLTSIDSDVYSILL 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX GF+LC EVDRNGYL  ID+DVYS+LL
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGFSLCYEVDRNGYLCYIDADVYSLLL 420

Query: 421 VGLCEHDHSVDAAKLARLMLKKGIRLKPYFAESIIKHLKKFGDQELVMNLGGIR 475
           VGL +HDHSVDAA LARLMLKKGIRLKP++AESIIKHLKKF DQEL+M+LGGIR
Sbjct: 421 VGLYQHDHSVDAAILARLMLKKGIRLKPHYAESIIKHLKKFEDQELIMHLGGIR 474

BLAST of Cla97C06G119440 vs. NCBI nr
Match: XP_004139002.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g47360 [Cucumis sativus] >XP_011649081.1 PREDICTED: pentatricopeptide repeat-containing protein At5g47360 [Cucumis sativus] >KGN61448.1 hypothetical protein Csa_2G123590 [Cucumis sativus])

HSP 1 Score: 330.9 bits (847), Expect = 7.3e-87
Identity = 389/474 (82.07%), Postives = 410/474 (86.50%), Query Frame = 0

Query: 1   MALFRISYLRSSSFRLKIPALSTLQLSTVSSADLFYGHLQKNNGNVEKTLATVKTKLDSG 60
           MALFRIS  RSSSF L I  LST  L+T+SS+DLFY HL+K+NGN++KTLAT+KTKLDS 
Sbjct: 1   MALFRISCPRSSSFLLNISTLSTFHLNTLSSSDLFYDHLEKSNGNLDKTLATLKTKLDSR 60

Query: 61  CVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDY 120
           CVN+VL+KCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACE+IGI+  P LLFNVIEDY
Sbjct: 61  CVNEVLYKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINVSPCLLFNVIEDY 120

Query: 121 RREGCLVDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLRADTTXXXXXXXXXXXXXXX 180
           RREGCLVDIR+FKI+LNLCKEAKLAKEALSIL KM EFHLR    XXXXXXXXXXXXXXX
Sbjct: 121 RREGCLVDIRMFKIILNLCKEAKLAKEALSILRKMSEFHLRXXXXXXXXXXXXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXXXXEMEKQGGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXY 300
           XXXXXXXXXXXXXXXXXX      GT                                 Y
Sbjct: 241 XXXXXXXXXXXXXXXXXXXXXXXXGTCSPNTVTYTSIIQSLCEEGHPLEALKVLDRMEEY 300

Query: 301 GXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           G XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 GYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDGFNLCNEVDRNGYLTSIDSDVYSILL 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  DGFNLC EVDRNGYL SID+D+YS+LL
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVLDGFNLCYEVDRNGYLCSIDADIYSLLL 420

Query: 421 VGLCEHDHSVDAAKLARLMLKKGIRLKPYFAESIIKHLKKFGDQELVMNLGGIR 475
           VGLCEHDHSVDAAKLARLMLKKGIRLKP++AESIIKHLKKF D+ELVM+LGGIR
Sbjct: 421 VGLCEHDHSVDAAKLARLMLKKGIRLKPHYAESIIKHLKKFEDRELVMHLGGIR 474

BLAST of Cla97C06G119440 vs. NCBI nr
Match: XP_022155853.1 (pentatricopeptide repeat-containing protein At5g47360 [Momordica charantia] >XP_022155854.1 pentatricopeptide repeat-containing protein At5g47360 [Momordica charantia] >XP_022155855.1 pentatricopeptide repeat-containing protein At5g47360 [Momordica charantia] >XP_022155856.1 pentatricopeptide repeat-containing protein At5g47360 [Momordica charantia])

HSP 1 Score: 327.8 bits (839), Expect = 6.2e-86
Identity = 415/474 (87.55%), Postives = 428/474 (90.30%), Query Frame = 0

Query: 1   MALFRISYLRSSSFRLKIPALSTLQLSTVSSADLFYGHLQKNNGNVEKTLATVKTKLDSG 60
           MALF I   RS SF LKI  LS L LSTVSSADLFY HLQKNNGNVEK LATVKT LDS 
Sbjct: 1   MALFGIFSFRSFSFGLKISKLSALHLSTVSSADLFYDHLQKNNGNVEKILATVKTTLDSR 60

Query: 61  CVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDY 120
           CVNQVLHKCSFELS MGLRFFIWAGRQPNYRHSSFMYSRACE+IGI R P LL NVIEDY
Sbjct: 61  CVNQVLHKCSFELSLMGLRFFIWAGRQPNYRHSSFMYSRACELIGIDRSPCLLLNVIEDY 120

Query: 121 RREGCLVDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLRADTTXXXXXXXXXXXXXXX 180
           RREGC+VDIR+FK+MLNLCKEAKLA EAL ILGKMPEFHLRADT XXXXXXXXXXXXXXX
Sbjct: 121 RREGCVVDIRMFKVMLNLCKEAKLANEALLILGKMPEFHLRADTXXXXXXXXXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXXXXEMEKQGGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXY 300
           XXXXXXXXXXXXXXXXXX        XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
Sbjct: 241 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300

Query: 301 GXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
            XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDGFNLCNEVDRNGYLTSIDSDVYSILL 420
           XXXXXXXXXXXXXXXXXXXXXXXX         DG+NLCNEVDRNGYL+SIDSD+YS+LL
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXELCLEERVLDGYNLCNEVDRNGYLSSIDSDIYSLLL 420

Query: 421 VGLCEHDHSVDAAKLARLMLKKGIRLKPYFAESIIKHLKKFGDQELVMNLGGIR 475
           VGLCEHDH +DA KLARLMLKKGIRLKP++A+ +IKHL KFGDQELVM LGGIR
Sbjct: 421 VGLCEHDHPMDAEKLARLMLKKGIRLKPHYADHVIKHLNKFGDQELVMQLGGIR 474

BLAST of Cla97C06G119440 vs. NCBI nr
Match: XP_022928928.1 (pentatricopeptide repeat-containing protein At5g47360 [Cucurbita moschata])

HSP 1 Score: 324.3 bits (830), Expect = 6.9e-85
Identity = 422/474 (89.03%), Postives = 438/474 (92.41%), Query Frame = 0

Query: 1   MALFRISYLRSSSFRLKIPALSTLQLSTVSSADLFYGHLQKNNGNVEKTLATVKTKLDSG 60
           MALFRI Y R SSFR KI  LSTLQLSTVSSADLFY HLQK NGNVEKTLATVKTKLDS 
Sbjct: 1   MALFRIFYPRPSSFRFKISTLSTLQLSTVSSADLFYDHLQKKNGNVEKTLATVKTKLDSR 60

Query: 61  CVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDY 120
           CVNQVLHKCS ELSQMGLRFFIWAGRQPNYRHSSFMY+RACE+IG++R P LLFNVIEDY
Sbjct: 61  CVNQVLHKCSLELSQMGLRFFIWAGRQPNYRHSSFMYTRACELIGLNRSPCLLFNVIEDY 120

Query: 121 RREGCLVDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLRADTTXXXXXXXXXXXXXXX 180
           RREGCLVDI +FK++LNLCKE KLAKEALSILGKM EFHLR    XXXXXXXXXXXXXXX
Sbjct: 121 RREGCLVDIGMFKVILNLCKEGKLAKEALSILGKMAEFHLRXXXXXXXXXXXXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXXXXEMEKQGGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXY 300
           XXXXXXXXXXXXXXXXXX      GTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX +
Sbjct: 241 XXXXXXXXXXXXXXXXXXXXXXXXGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDF 300

Query: 301 GXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
            XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDGFNLCNEVDRNGYLTSIDSDVYSILL 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX        VDRNGYL+SIDSD+YS+LL
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVDRNGYLSSIDSDIYSLLL 420

Query: 421 VGLCEHDHSVDAAKLARLMLKKGIRLKPYFAESIIKHLKKFGDQELVMNLGGIR 475
           VGLCEHDHSVDAAKLARLML+KGIRLKP++AESIIKH+KKFGDQ LVM+LGGIR
Sbjct: 421 VGLCEHDHSVDAAKLARLMLQKGIRLKPHYAESIIKHVKKFGDQNLVMHLGGIR 474

BLAST of Cla97C06G119440 vs. NCBI nr
Match: XP_023551479.1 (pentatricopeptide repeat-containing protein At5g47360 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 323.6 bits (828), Expect = 1.2e-84
Identity = 420/474 (88.61%), Postives = 439/474 (92.62%), Query Frame = 0

Query: 1   MALFRISYLRSSSFRLKIPALSTLQLSTVSSADLFYGHLQKNNGNVEKTLATVKTKLDSG 60
           MALFRI Y R SSFR KI  LSTLQLSTVSSADLFY HLQKNNGNVEKTL TVKTKLDS 
Sbjct: 1   MALFRIFYPRPSSFRFKISTLSTLQLSTVSSADLFYDHLQKNNGNVEKTLTTVKTKLDSR 60

Query: 61  CVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDY 120
           CVN+VLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMY+RACE+IG++R P L+FNVIEDY
Sbjct: 61  CVNEVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYTRACELIGLNRSPCLVFNVIEDY 120

Query: 121 RREGCLVDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLRADTTXXXXXXXXXXXXXXX 180
           RREGCLVDI +FK++LNLCKE KLAKEALSILG+M EFHLR    XXXXXXXXXXXXXXX
Sbjct: 121 RREGCLVDIGMFKVILNLCKEGKLAKEALSILGEMAEFHLRXXXXXXXXXXXXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXXXXEMEKQGGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXY 300
           XXXXXXXXXXXXXXXXXX      GTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX +
Sbjct: 241 XXXXXXXXXXXXXXXXXXXXXXXXGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDF 300

Query: 301 GXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
            XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDGFNLCNEVDRNGYLTSIDSDVYSILL 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX        VDRNGYL+SIDSD+YS+LL
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVDRNGYLSSIDSDIYSLLL 420

Query: 421 VGLCEHDHSVDAAKLARLMLKKGIRLKPYFAESIIKHLKKFGDQELVMNLGGIR 475
           VGLCEHDHSVDAAKLARLML+KGIRLKP++AESIIKH+KKFGDQ LVM+LGGIR
Sbjct: 421 VGLCEHDHSVDAAKLARLMLQKGIRLKPHYAESIIKHVKKFGDQNLVMHLGGIR 474

BLAST of Cla97C06G119440 vs. TrEMBL
Match: tr|A0A1S3B4L9|A0A1S3B4L9_CUCME (pentatricopeptide repeat-containing protein At5g47360 OS=Cucumis melo OX=3656 GN=LOC103485755 PE=4 SV=1)

HSP 1 Score: 332.8 bits (852), Expect = 1.3e-87
Identity = 380/474 (80.17%), Postives = 398/474 (83.97%), Query Frame = 0

Query: 1   MALFRISYLRSSSFRLKIPALSTLQLSTVSSADLFYGHLQKNNGNVEKTLATVKTKLDSG 60
           MALFRISY RSSS  L I  LST  LST+SS+DLFY HL+KNNGNVEKTLATVKTKLDS 
Sbjct: 1   MALFRISYPRSSSILLNISTLSTFHLSTLSSSDLFYDHLEKNNGNVEKTLATVKTKLDSR 60

Query: 61  CVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDY 120
           CVN+VL+KCS ELSQMGLRFFIWAGRQPNYRH+SFMYSRACE+IGI+  P LLFNVIEDY
Sbjct: 61  CVNEVLYKCSSELSQMGLRFFIWAGRQPNYRHTSFMYSRACELIGINVSPCLLFNVIEDY 120

Query: 121 RREGCLVDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLRADTTXXXXXXXXXXXXXXX 180
           RREGCLVDIRIF+I+LNLCKEAKL KEALSIL KM EFHLR    XXXXXXXXXXXXXXX
Sbjct: 121 RREGCLVDIRIFQIILNLCKEAKLTKEALSILRKMSEFHLRXXXXXXXXXXXXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXXXXEMEKQGGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXY 300
           XXXXX             EMEKQGGT                                 Y
Sbjct: 241 XXXXXRLRIMDKLMEMLEEMEKQGGTCRPNTVTYTSIIQSLCEQGFLLEALKVLDRMEEY 300

Query: 301 GXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           G  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 GHAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDGFNLCNEVDRNGYLTSIDSDVYSILL 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX GF+LC EVDRNGYL  ID+DVYS+LL
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGFSLCYEVDRNGYLCYIDADVYSLLL 420

Query: 421 VGLCEHDHSVDAAKLARLMLKKGIRLKPYFAESIIKHLKKFGDQELVMNLGGIR 475
           VGL +HDHSVDAA LARLMLKKGIRLKP++AESIIKHLKKF DQEL+M+LGGIR
Sbjct: 421 VGLYQHDHSVDAAILARLMLKKGIRLKPHYAESIIKHLKKFEDQELIMHLGGIR 474

BLAST of Cla97C06G119440 vs. TrEMBL
Match: tr|A0A0A0LI44|A0A0A0LI44_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G123590 PE=4 SV=1)

HSP 1 Score: 330.9 bits (847), Expect = 4.8e-87
Identity = 389/474 (82.07%), Postives = 410/474 (86.50%), Query Frame = 0

Query: 1   MALFRISYLRSSSFRLKIPALSTLQLSTVSSADLFYGHLQKNNGNVEKTLATVKTKLDSG 60
           MALFRIS  RSSSF L I  LST  L+T+SS+DLFY HL+K+NGN++KTLAT+KTKLDS 
Sbjct: 1   MALFRISCPRSSSFLLNISTLSTFHLNTLSSSDLFYDHLEKSNGNLDKTLATLKTKLDSR 60

Query: 61  CVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDY 120
           CVN+VL+KCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACE+IGI+  P LLFNVIEDY
Sbjct: 61  CVNEVLYKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINVSPCLLFNVIEDY 120

Query: 121 RREGCLVDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLRADTTXXXXXXXXXXXXXXX 180
           RREGCLVDIR+FKI+LNLCKEAKLAKEALSIL KM EFHLR    XXXXXXXXXXXXXXX
Sbjct: 121 RREGCLVDIRMFKIILNLCKEAKLAKEALSILRKMSEFHLRXXXXXXXXXXXXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXXXXEMEKQGGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXY 300
           XXXXXXXXXXXXXXXXXX      GT                                 Y
Sbjct: 241 XXXXXXXXXXXXXXXXXXXXXXXXGTCSPNTVTYTSIIQSLCEEGHPLEALKVLDRMEEY 300

Query: 301 GXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           G XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 GYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDGFNLCNEVDRNGYLTSIDSDVYSILL 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  DGFNLC EVDRNGYL SID+D+YS+LL
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVLDGFNLCYEVDRNGYLCSIDADIYSLLL 420

Query: 421 VGLCEHDHSVDAAKLARLMLKKGIRLKPYFAESIIKHLKKFGDQELVMNLGGIR 475
           VGLCEHDHSVDAAKLARLMLKKGIRLKP++AESIIKHLKKF D+ELVM+LGGIR
Sbjct: 421 VGLCEHDHSVDAAKLARLMLKKGIRLKPHYAESIIKHLKKFEDRELVMHLGGIR 474

BLAST of Cla97C06G119440 vs. TrEMBL
Match: tr|A0A2P5FIU2|A0A2P5FIU2_9ROSA (Pentatricopeptide repeat OS=Trema orientalis OX=63057 GN=TorRG33x02_066350 PE=4 SV=1)

HSP 1 Score: 207.2 bits (526), Expect = 8.1e-50
Identity = 310/473 (65.54%), Postives = 349/473 (73.78%), Query Frame = 0

Query: 1   MALFRISYLRSSSFRLKIPALSTLQLSTVSSADLFYGHLQKNNGNVEKTLATVKTKLDSG 60
           MAL  IS L SSS RL+ P  ST++ +  SSAD+F+ HLQKN GN+EKTL TVK ++DS 
Sbjct: 25  MALCSISRLLSSSIRLQNPKFSTVRSTAASSADIFFNHLQKNGGNIEKTLVTVKAQVDSK 84

Query: 61  CVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDY 120
           CV+ VL++C    SQMGLRFFIWAG Q +YRH+S+MYS+AC    I + P LL++VIE Y
Sbjct: 85  CVSGVLYRCYPSQSQMGLRFFIWAGLQSDYRHTSYMYSKACNFYKITQNPKLLYDVIEAY 144

Query: 121 RREGCLVDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLRADTTXXXXXXXXXXXXXXX 180
           R E C V ++ FK++LNL KEAKLA EAL +L KMPEF LRADTT  XXXXXXXXXXXXX
Sbjct: 145 RAERCSVTVKTFKVILNLYKEAKLADEALWVLRKMPEFGLRADTTMYXXXXXXXXXXXXX 204

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 205 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 264

Query: 241 XXXXXXXXXXXXXXXXXXEMEKQGGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXY 300
           XXXXXXXXXXXXXXXXXX      G+XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXY
Sbjct: 265 XXXXXXXXXXXXXXXXXXXXXXXXGSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXY 324

Query: 301 GXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
            XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX                        
Sbjct: 325 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGVSYGECYSSLVVCLKRSRNTEEA 384

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDGFNLCNEVDRNGYLTSIDSDVYSILL 420
                                            DG+ L +E+++ GYL SIDSDVYS+LL
Sbjct: 385 EKVFRKMLTSGLKPDGLACSIMIKELCLVGRVLDGYQLFDEIEKIGYLISIDSDVYSLLL 444

Query: 421 VGLCEHDHSVDAAKLARLMLKKGIRLKPYFAESIIKHLKKFGDQELVMNLGGI 474
           VGLCE  HSV+A  LARLMLKK IRLK  +++SI + LKK G++ELV +L  I
Sbjct: 445 VGLCEQSHSVEAKTLARLMLKKRIRLKAPYSDSIGEILKKSGEEELVNHLTAI 497

BLAST of Cla97C06G119440 vs. TrEMBL
Match: tr|A0A061G4E0|A0A061G4E0_THECC (Tetratricopeptide repeat-like superfamily protein, putative isoform 1 OS=Theobroma cacao OX=3641 GN=TCM_015917 PE=4 SV=1)

HSP 1 Score: 202.2 bits (513), Expect = 2.6e-48
Identity = 352/473 (74.42%), Postives = 392/473 (82.88%), Query Frame = 0

Query: 1   MALFRISYLRSSSFRLKIPALSTLQLSTVSSADLFYGHLQKNNGNVEKTLATVKTKLDSG 60
           M +  +S   S S   K   + T   ST SSAD F+ HLQK   N+EKTLA V +KLDS 
Sbjct: 1   MPILSLSRFISISISSKPNKIFTFLFSTASSADKFFTHLQKKQSNIEKTLALVNSKLDSN 60

Query: 61  CVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDY 120
           CV +VL +C F+ SQMGLRFFIWAG Q NYRHSS+MYS+ACE + I + P L+ +VIE Y
Sbjct: 61  CVCEVLERCCFDKSQMGLRFFIWAGLQSNYRHSSYMYSKACEFLKIKQNPFLVLDVIEAY 120

Query: 121 RREGCLVDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLRADTTXXXXXXXXXXXXXXX 180
           + E CLV++++FK++LNLC+EA++  EAL +L KMPEF+LR DTTXXXXXXXXXXXXXXX
Sbjct: 121 KVEKCLVNVKMFKVVLNLCREARITDEALLVLRKMPEFNLRPDTTXXXXXXXXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXXXXEMEKQGGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXY 300
           XXXXXXXXXXXXXXXXXX  EK+G  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   
Sbjct: 241 XXXXXXXXXXXXXXXXXXXXEKEGDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTC 300

Query: 301 GXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
            XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDGFNLCNEVDRNGYLTSIDSDVYSILL 420
           XXXXXXXXXXXXXXXXXXXXXXXXX        DGF L  E++R  YL+SID+D+YSILL
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXICQEGRVLDGFYLYEEIERMRYLSSIDADIYSILL 420

Query: 421 VGLCEHDHSVDAAKLARLMLKKGIRLKPYFAESIIKHLKKFGDQELVMNLGGI 474
           VGLC   HSV+AAKLAR ML+K IRLK  + + II+HLK  GD++LV  LG I
Sbjct: 421 VGLCRQSHSVEAAKLARSMLEKRIRLKAPYVDKIIEHLKNCGDKQLVTELGRI 473

BLAST of Cla97C06G119440 vs. TrEMBL
Match: tr|A0A2P5B2Y8|A0A2P5B2Y8_PARAD (Pentatricopeptide repeat OS=Parasponia andersonii OX=3476 GN=PanWU01x14_276160 PE=4 SV=1)

HSP 1 Score: 198.4 bits (503), Expect = 3.8e-47
Identity = 304/473 (64.27%), Postives = 345/473 (72.94%), Query Frame = 0

Query: 1   MALFRISYLRSSSFRLKIPALSTLQLSTVSSADLFYGHLQKNNGNVEKTLATVKTKLDSG 60
           MAL  IS L S   R + P  ST++ +  SSAD+F+ HLQK  GN+EKTLATVK +LDS 
Sbjct: 1   MALCSISRLLSYPIRFQNPKFSTVRSTAASSADIFFNHLQKKGGNIEKTLATVKAQLDSK 60

Query: 61  CVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDY 120
           CV+ VL++C    SQMGLRFFIWAG Q +YRH+S+MYS+AC +  I + P LL++VIE Y
Sbjct: 61  CVSGVLYRCYPSHSQMGLRFFIWAGLQSDYRHTSYMYSKACNLYKITQNPKLLYDVIEAY 120

Query: 121 RREGCLVDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLRADTTXXXXXXXXXXXXXXX 180
           R E C V ++ FK++LNL KEAKLA EAL +L KMPEF LRADTT  XXXXXXXXXXXXX
Sbjct: 121 RAERCSVTVKTFKVILNLYKEAKLADEALWVLRKMPEFGLRADTTMYXXXXXXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXXXXEMEKQGGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXY 300
           XXXXXXXXXXXXXXXXXX      G XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  
Sbjct: 241 XXXXXXXXXXXXXXXXXXXXXXXXGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAC 300

Query: 301 GXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
            XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX                        
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGVSYGECYSSLVVCLKRSRNTEEA 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDGFNLCNEVDRNGYLTSIDSDVYSILL 420
                                            DG+ L +E+++ GYL SIDSD+YS+LL
Sbjct: 361 EKVFRKMLTSGLKPDGLACSIMIKELCLVGRVLDGYQLFDEIEKIGYLISIDSDIYSLLL 420

Query: 421 VGLCEHDHSVDAAKLARLMLKKGIRLKPYFAESIIKHLKKFGDQELVMNLGGI 474
           +GLCE  HSV+A  LARLMLKK IRLK  +++SI + LKK G++EL+ +L GI
Sbjct: 421 LGLCEQSHSVEAKTLARLMLKKRIRLKAPYSDSIGEILKKSGEEELINHLTGI 473

BLAST of Cla97C06G119440 vs. Swiss-Prot
Match: sp|Q9LVS3|PP422_ARATH (Pentatricopeptide repeat-containing protein At5g47360 OS=Arabidopsis thaliana OX=3702 GN=At5g47360 PE=2 SV=1)

HSP 1 Score: 151.0 bits (380), Expect = 3.4e-35
Identity = 76/160 (47.50%), Postives = 109/160 (68.12%), Query Frame = 0

Query: 6   ISYLRSSSFRLKIPALSTLQ-LSTVSSADLFYGHLQKNNGNVEKTLATVKTKLDSGCVNQ 65
           IS L S S R +   +S L+ L+TVS+A+  YG LQ    N+EK LA+   +LDS C+N+
Sbjct: 6   ISRLVSPSLRSQPSKISALRFLTTVSAAERLYGQLQGCTSNLEKELASANVQLDSSCINE 65

Query: 66  VLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDYRREG 125
           VL +C     Q GLRFFIWAG   ++RHS++MY++AC+++ I  +P L+  VIE YR+E 
Sbjct: 66  VLRRCDPNQFQSGLRFFIWAGTLSSHRHSAYMYTKACDILKIRAKPDLIKYVIESYRKEE 125

Query: 126 CLVDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLRADT 165
           C V+++  +I+L LC +A LA EAL +L K PEF++ ADT
Sbjct: 126 CFVNVKTMRIVLTLCNQANLADEALWVLRKFPEFNVCADT 165

BLAST of Cla97C06G119440 vs. Swiss-Prot
Match: sp|Q9FH87|PP447_ARATH (Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis thaliana OX=3702 GN=At5g65820 PE=3 SV=1)

HSP 1 Score: 72.0 bits (175), Expect = 2.0e-11
Identity = 40/138 (28.99%), Postives = 74/138 (53.62%), Query Frame = 0

Query: 30  SSADLFYGHLQKNNGNVEK---TLATVKTKLDSGCVNQVLHKCSFELSQMGLRFFIWAGR 89
           S  +  Y  L+K +  V K    L     +L  G + +VL++C  +   +G RFF+WA +
Sbjct: 81  SDVEKSYRILRKFHSRVPKLELALNESGVELRPGLIERVLNRCG-DAGNLGYRFFVWAAK 140

Query: 90  QPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDYRREG-CLVDIRIFKIMLNLCKEAKLA 149
           QP Y HS  +Y    +++   R+ G ++ +IE+ R+E   L++  +F +++     A + 
Sbjct: 141 QPRYCHSIEVYKSMVKILSKMRQFGAVWGLIEEMRKENPQLIEPELFVVLVQRFASADMV 200

Query: 150 KEALSILGKMPEFHLRAD 164
           K+A+ +L +MP+F    D
Sbjct: 201 KKAIEVLDEMPKFGFEPD 217

BLAST of Cla97C06G119440 vs. Swiss-Prot
Match: sp|P0C8A0|PP275_ARATH (Pentatricopeptide repeat-containing protein At3g49730 OS=Arabidopsis thaliana OX=3702 GN=At3g49730 PE=2 SV=1)

HSP 1 Score: 62.0 bits (149), Expect = 2.1e-08
Identity = 37/132 (28.03%), Postives = 72/132 (54.55%), Query Frame = 0

Query: 36  YGHLQKNNGNVEK-TLATVKTKLD--SGCVNQVLHKCSFELSQMGLRFFIWAGRQPNYRH 95
           Y  L+ ++  V K  LA  ++ +D   G + +VL +C  +   +G RFF+WA +QP Y H
Sbjct: 71  YRILRNHHSRVPKLELALNESGIDLRPGLIIRVLSRCG-DAGNLGYRFFLWATKQPGYFH 130

Query: 96  SSFMYSRACEMIGIHRRPGLLFNVIEDYRREGC-LVDIRIFKIMLNLCKEAKLAKEALSI 155
           S  +      ++   R+ G ++ +IE+ R+    L++  +F +++     A + K+A+ +
Sbjct: 131 SYEVCKSMVMILSKMRQFGAVWGLIEEMRKTNPELIEPELFVVLMRRFASANMVKKAVEV 190

Query: 156 LGKMPEFHLRAD 164
           L +MP++ L  D
Sbjct: 191 LDEMPKYGLEPD 201

BLAST of Cla97C06G119440 vs. Swiss-Prot
Match: sp|Q9SSR6|PPR78_ARATH (Pentatricopeptide repeat-containing protein At1g52640, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g52640 PE=2 SV=1)

HSP 1 Score: 56.6 bits (135), Expect = 8.7e-07
Identity = 34/118 (28.81%), Postives = 57/118 (48.31%), Query Frame = 0

Query: 45  NVEKTLATVKTKLDSGCVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMI 104
           ++E TL     ++ S  V QVL +C   L     RFF+WA R P++ HS   Y    E++
Sbjct: 54  DLEHTLVAYSPRVSSNLVEQVLKRCK-NLGFPAHRFFLWARRIPDFAHSLESYHILVEIL 113

Query: 105 GIHRRPGLLFNVIEDYRREGCL-VDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLR 162
           G  ++  LL++ + + R      +  ++F I+      A L  EA     +M EF ++
Sbjct: 114 GSSKQFALLWDFLIEAREYNYFEISSKVFWIVFRAYSRANLPSEACRAFNRMVEFGIK 170

BLAST of Cla97C06G119440 vs. TAIR10
Match: AT5G47360.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 151.0 bits (380), Expect = 1.9e-36
Identity = 76/160 (47.50%), Postives = 109/160 (68.12%), Query Frame = 0

Query: 6   ISYLRSSSFRLKIPALSTLQ-LSTVSSADLFYGHLQKNNGNVEKTLATVKTKLDSGCVNQ 65
           IS L S S R +   +S L+ L+TVS+A+  YG LQ    N+EK LA+   +LDS C+N+
Sbjct: 6   ISRLVSPSLRSQPSKISALRFLTTVSAAERLYGQLQGCTSNLEKELASANVQLDSSCINE 65

Query: 66  VLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDYRREG 125
           VL +C     Q GLRFFIWAG   ++RHS++MY++AC+++ I  +P L+  VIE YR+E 
Sbjct: 66  VLRRCDPNQFQSGLRFFIWAGTLSSHRHSAYMYTKACDILKIRAKPDLIKYVIESYRKEE 125

Query: 126 CLVDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLRADT 165
           C V+++  +I+L LC +A LA EAL +L K PEF++ ADT
Sbjct: 126 CFVNVKTMRIVLTLCNQANLADEALWVLRKFPEFNVCADT 165

BLAST of Cla97C06G119440 vs. TAIR10
Match: AT5G65820.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 72.0 bits (175), Expect = 1.1e-12
Identity = 40/138 (28.99%), Postives = 74/138 (53.62%), Query Frame = 0

Query: 30  SSADLFYGHLQKNNGNVEK---TLATVKTKLDSGCVNQVLHKCSFELSQMGLRFFIWAGR 89
           S  +  Y  L+K +  V K    L     +L  G + +VL++C  +   +G RFF+WA +
Sbjct: 81  SDVEKSYRILRKFHSRVPKLELALNESGVELRPGLIERVLNRCG-DAGNLGYRFFVWAAK 140

Query: 90  QPNYRHSSFMYSRACEMIGIHRRPGLLFNVIEDYRREG-CLVDIRIFKIMLNLCKEAKLA 149
           QP Y HS  +Y    +++   R+ G ++ +IE+ R+E   L++  +F +++     A + 
Sbjct: 141 QPRYCHSIEVYKSMVKILSKMRQFGAVWGLIEEMRKENPQLIEPELFVVLVQRFASADMV 200

Query: 150 KEALSILGKMPEFHLRAD 164
           K+A+ +L +MP+F    D
Sbjct: 201 KKAIEVLDEMPKFGFEPD 217

BLAST of Cla97C06G119440 vs. TAIR10
Match: AT3G49730.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 62.0 bits (149), Expect = 1.2e-09
Identity = 37/132 (28.03%), Postives = 72/132 (54.55%), Query Frame = 0

Query: 36  YGHLQKNNGNVEK-TLATVKTKLD--SGCVNQVLHKCSFELSQMGLRFFIWAGRQPNYRH 95
           Y  L+ ++  V K  LA  ++ +D   G + +VL +C  +   +G RFF+WA +QP Y H
Sbjct: 71  YRILRNHHSRVPKLELALNESGIDLRPGLIIRVLSRCG-DAGNLGYRFFLWATKQPGYFH 130

Query: 96  SSFMYSRACEMIGIHRRPGLLFNVIEDYRREGC-LVDIRIFKIMLNLCKEAKLAKEALSI 155
           S  +      ++   R+ G ++ +IE+ R+    L++  +F +++     A + K+A+ +
Sbjct: 131 SYEVCKSMVMILSKMRQFGAVWGLIEEMRKTNPELIEPELFVVLMRRFASANMVKKAVEV 190

Query: 156 LGKMPEFHLRAD 164
           L +MP++ L  D
Sbjct: 191 LDEMPKYGLEPD 201

BLAST of Cla97C06G119440 vs. TAIR10
Match: AT1G52640.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 56.6 bits (135), Expect = 4.8e-08
Identity = 34/118 (28.81%), Postives = 57/118 (48.31%), Query Frame = 0

Query: 45  NVEKTLATVKTKLDSGCVNQVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACEMI 104
           ++E TL     ++ S  V QVL +C   L     RFF+WA R P++ HS   Y    E++
Sbjct: 54  DLEHTLVAYSPRVSSNLVEQVLKRCK-NLGFPAHRFFLWARRIPDFAHSLESYHILVEIL 113

Query: 105 GIHRRPGLLFNVIEDYRREGCL-VDIRIFKIMLNLCKEAKLAKEALSILGKMPEFHLR 162
           G  ++  LL++ + + R      +  ++F I+      A L  EA     +M EF ++
Sbjct: 114 GSSKQFALLWDFLIEAREYNYFEISSKVFWIVFRAYSRANLPSEACRAFNRMVEFGIK 170

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_008441677.11.9e-8780.17PREDICTED: pentatricopeptide repeat-containing protein At5g47360 [Cucumis melo] ... [more]
XP_004139002.17.3e-8782.07PREDICTED: pentatricopeptide repeat-containing protein At5g47360 [Cucumis sativu... [more]
XP_022155853.16.2e-8687.55pentatricopeptide repeat-containing protein At5g47360 [Momordica charantia] >XP_... [more]
XP_022928928.16.9e-8589.03pentatricopeptide repeat-containing protein At5g47360 [Cucurbita moschata][more]
XP_023551479.11.2e-8488.61pentatricopeptide repeat-containing protein At5g47360 [Cucurbita pepo subsp. pep... [more]
Match NameE-valueIdentityDescription
tr|A0A1S3B4L9|A0A1S3B4L9_CUCME1.3e-8780.17pentatricopeptide repeat-containing protein At5g47360 OS=Cucumis melo OX=3656 GN... [more]
tr|A0A0A0LI44|A0A0A0LI44_CUCSA4.8e-8782.07Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G123590 PE=4 SV=1[more]
tr|A0A2P5FIU2|A0A2P5FIU2_9ROSA8.1e-5065.54Pentatricopeptide repeat OS=Trema orientalis OX=63057 GN=TorRG33x02_066350 PE=4 ... [more]
tr|A0A061G4E0|A0A061G4E0_THECC2.6e-4874.42Tetratricopeptide repeat-like superfamily protein, putative isoform 1 OS=Theobro... [more]
tr|A0A2P5B2Y8|A0A2P5B2Y8_PARAD3.8e-4764.27Pentatricopeptide repeat OS=Parasponia andersonii OX=3476 GN=PanWU01x14_276160 P... [more]
Match NameE-valueIdentityDescription
sp|Q9LVS3|PP422_ARATH3.4e-3547.50Pentatricopeptide repeat-containing protein At5g47360 OS=Arabidopsis thaliana OX... [more]
sp|Q9FH87|PP447_ARATH2.0e-1128.99Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis th... [more]
sp|P0C8A0|PP275_ARATH2.1e-0828.03Pentatricopeptide repeat-containing protein At3g49730 OS=Arabidopsis thaliana OX... [more]
sp|Q9SSR6|PPR78_ARATH8.7e-0728.81Pentatricopeptide repeat-containing protein At1g52640, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
AT5G47360.11.9e-3647.50Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G65820.11.1e-1228.99Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G49730.11.2e-0928.03Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT1G52640.14.8e-0828.81Pentatricopeptide repeat (PPR) superfamily protein[more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR033443PPR_long
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C06G119440.1Cla97C06G119440.1mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 272..305
e-value: 1.9E-8
score: 31.9
coord: 166..198
e-value: 5.4E-5
score: 21.1
coord: 200..234
e-value: 8.7E-9
score: 33.0
coord: 235..264
e-value: 0.0026
score: 15.8
coord: 343..375
e-value: 1.5E-6
score: 26.0
coord: 307..336
e-value: 0.0011
score: 17.0
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 269..318
e-value: 1.2E-14
score: 54.1
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 343..372
e-value: 0.0027
score: 17.8
coord: 415..444
e-value: 1.2
score: 9.5
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 128..162
score: 7.344
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 198..232
score: 13.055
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 375..409
score: 6.084
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 340..374
score: 9.898
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 305..339
score: 9.657
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 412..446
score: 8.934
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 270..304
score: 12.617
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 233..267
score: 9.701
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 163..197
score: 10.665
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 338..475
e-value: 5.7E-18
score: 67.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 102..264
e-value: 6.6E-38
score: 132.8
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 265..334
e-value: 9.1E-20
score: 72.9
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILYSSF48452TPR-likecoord: 39..381
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILYSSF48452TPR-likecoord: 271..369
IPR033443Pentacotripeptide-repeat region of PRORPPFAMPF17177PPR_longcoord: 118..262
e-value: 1.5E-16
score: 60.3
NoneNo IPR availablePANTHERPTHR24015:SF719SUBFAMILY NOT NAMEDcoord: 1..473
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 1..473

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Cla97C06G119440ClCG06G010030Watermelon (Charleston Gray)wcgwmbB273
Cla97C06G119440CmaCh15G014360Cucurbita maxima (Rimu)cmawmbB312
Cla97C06G119440CmoCh15G015130Cucurbita moschata (Rifu)cmowmbB294
Cla97C06G119440Carg20742Silver-seed gourdcarwmbB0775
Cla97C06G119440Bhi12G001204Wax gourdwgowmbB091
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cla97C06G119440Watermelon (97103) v1wmwmbB406
Cla97C06G119440Watermelon (97103) v1wmwmbB410