Cla97C05G092010 (gene) Watermelon (97103) v2

Name: Cla97C05G092010
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: Pentatricopeptide repeat
Location: Cla97Chr05 : 10092859 .. 10094898 (-)
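
As a quick sanity check on the location, the length of the feature can be derived from its coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is the usual convention for GFF-style annotation but is not stated in this record; the function name `span_length` is illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a genomic interval, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must be greater than or equal to start")
    return end - start + 1


# Coordinates taken from the Location field of this record.
gene_length = span_length(10092859, 10094898)
print(gene_length)
```

Under that assumption, the result can be compared with the length of the gene sequence listed below; the strand sign (-) does not affect the length.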
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCACAATGGCATTCGGTTATCCATATCCATCCCAAACCCTAATCACCTTCTCTTTCGAATCCTCCATTCTTATTCTGGTTCCCCTCACATTGACATTGCCCATCCACCATCATCCCCACCATTCAAATTCTCAATCTCTCCCCTTTCTCTCTCTGTGACTCTCCGCGACCTCCTGCAGCCGCTCTCTGCGCCGGACCCACCTCCGATTCTATCTTATGCCCCCGTCTTCCAGTTCCTTACTGGCCTAAACCTGTTGAAATTGGGCCACCAAGTTCATGCCCACATGTTTCTCCGTGGCCTTCAGCCCACCGCGCTAGTTGGCTCCAAGATGGTTGCGTTTTATGCCAGTTCCGGTGATATTGATTCCTCTGTTTCGGTCTTCAATCAGATTAGTGAGCCTTCTTCTCTGTTGTTTAATTCCATGATTCGAGCCTATGCCCGATATGGGTTTGCGGAGAGAACTGTTTCCACTTATTTTTCTATGCATTCATGGGGCTTTACTGGGGATTACTTTACTTTCCCTTTTGTTCTTAAGTCTTCTGTGGATTTGTTGAGTGCTTGGATGGGGAAATGTGTTCATGGACTGATTTTGAGAGTTGGGTTGCAGTTTGATTTGTATGTGGCTACTTCTTTGATTGATATGTATGGGAAATGTGGTGAAATAAATGATGCGGGTAAGGTGTTTGATAATATGACTATTAGAGATGTTTCAGCTTGGAATGCTTTACTTACTGGTTATATGAAGAGTGGGTGTATTGATGCCGCAGTGGCGATTTTCGAGAGAATGCCATGGAGGAACATTGTCTCTTGGACGACTATGATTTCTGGATACTCACAGAGCGGCTTGGCACAGCATGCATTGAGTTTGTTTGATGAAATGGTGAAAGAAGATTCAGGAGTAAGACCCAACTGGGTGACTATAATGAGCGTCCTCCCAGCTTGTGCACAATCATCGGCACTGGAACGCGGAAGGCAGATTCATGAGTTGGCTTGTCGGATGGGTTTGAATTCAAATCCTTCTGTGCTGATTGCCCTTACTGCAATGTATGCAAAATGTGGAAGCTTAGCCGATGCTCGCAACTGTTTCGACAGGCTTAATAGAAGTGAAAAGAATTTGGTTGCTTGGAATACCATGATAACTGCTTATGCTTCCTATGGACATGGGCTTGAAGCAGTGTCAACCTTTCGGGAGATGATCCAAGCAGGCATTCAGCCAGATGACATTACATTCACAGGATTGTTATCCGGTTGCAGCCATTCAGGTCTTGTTGATGTTGGTTTAAAGTACTTCAACTACATGAGCACCACATATTCGATCAATCCCAGAGCTGAGCATTATGCTTGTGTTGTCGATCTCTTAGGTCGAGCAGGGAGATTAGCTGAAGCAAGTAAACTTGTAGACGAAATGCCAATGCCAGCAGGACCGAGCATTTGGGGTTCACTATTAGCTGCCTGCCGAAAATACCGCAATCTGGAAATGGCAGAAACTGCAGCAAGAAAGCTTTTTGTCCTAGAACCAGAAAACACCGGCAACTACGTTCTGCTTTCAAACATGTATGCTGAAGCTGGAAGGTGGCAGGAAGTTGACAAATTGAGAGCAATTATGAAATCCCAGGGGACAAAGAAAAGTCCAGGTTGCAGTTGGATCGAGATAAATGGCAAAGCACACATGTTTCTCGGTGGTGATACGTCTCACCCTCAAGCCAAGGAAATCTACATGTTCTTGGAGGCATTGCCAGAGAAGATGAAGGCAGCTGGCTATGTTCCTGATACGAGCTATGTGTTGCACGACATCAGCGAGGAAGAGAAAGAATTCAACCTCCTTGCACACAGTGAGAAGCTCGCTGTCGCATTTGGGATTCTCAACACTTCTGTTGAAACCGTTCTCCGGGTGACAAAGAACTTGAGAATCTGTGGGGACTGCCACACTGCAATGGTGTTCATATCAGAGATATATGGGCGGGAAGTAGTTGTTAGAGATGTGAATCGGTTCCATCA
CTTTAAAGGGGGTTCTTGTTCTTGTGGAGATTACTGGTGA

mRNA sequence

ATGCACAATGGCATTCGGTTATCCATATCCATCCCAAACCCTAATCACCTTCTCTTTCGAATCCTCCATTCTTATTCTGGTTCCCCTCACATTGACATTGCCCATCCACCATCATCCCCACCATTCAAATTCTCAATCTCTCCCCTTTCTCTCTCTGTGACTCTCCGCGACCTCCTGCAGCCGCTCTCTGCGCCGGACCCACCTCCGATTCTATCTTATGCCCCCGTCTTCCAGTTCCTTACTGGCCTAAACCTGTTGAAATTGGGCCACCAAGTTCATGCCCACATGTTTCTCCGTGGCCTTCAGCCCACCGCGCTAGTTGGCTCCAAGATGGTTGCGTTTTATGCCAGTTCCGGTGATATTGATTCCTCTGTTTCGGTCTTCAATCAGATTAGTGAGCCTTCTTCTCTGTTGTTTAATTCCATGATTCGAGCCTATGCCCGATATGGGTTTGCGGAGAGAACTGTTTCCACTTATTTTTCTATGCATTCATGGGGCTTTACTGGGGATTACTTTACTTTCCCTTTTGTTCTTAAGTCTTCTGTGGATTTGTTGAGTGCTTGGATGGGGAAATGTGTTCATGGACTGATTTTGAGAGTTGGGTTGCAGTTTGATTTGTATGTGGCTACTTCTTTGATTGATATGTATGGGAAATGTGGTGAAATAAATGATGCGGGTAAGGTGTTTGATAATATGACTATTAGAGATGTTTCAGCTTGGAATGCTTTACTTACTGGTTATATGAAGAGTGGGTGTATTGATGCCGCAGTGGCGATTTTCGAGAGAATGCCATGGAGGAACATTGTCTCTTGGACGACTATGATTTCTGGATACTCACAGAGCGGCTTGGCACAGCATGCATTGAGTTTGTTTGATGAAATGGTGAAAGAAGATTCAGGAGTAAGACCCAACTGGGTGACTATAATGAGCGTCCTCCCAGCTTGTGCACAATCATCGGCACTGGAACGCGGAAGGCAGATTCATGAGTTGGCTTGTCGGATGGGTTTGAATTCAAATCCTTCTGTGCTGATTGCCCTTACTGCAATGTATGCAAAATGTGGAAGCTTAGCCGATGCTCGCAACTGTTTCGACAGGCTTAATAGAAGTGAAAAGAATTTGGTTGCTTGGAATACCATGATAACTGCTTATGCTTCCTATGGACATGGGCTTGAAGCAGTGTCAACCTTTCGGGAGATGATCCAAGCAGGCATTCAGCCAGATGACATTACATTCACAGGATTGTTATCCGGTTGCAGCCATTCAGGTCGAGCAGGGAGATTAGCTGAAGCAAGTAAACTTGTAGACGAAATGCCAATGCCAGCAGGACCGAGCATTTGGGGTTCACTATTAGCTGCCTGCCGAAAATACCGCAATCTGGAAATGGCAGAAACTGCAGCAAGAAAGCTTTTTGTCCTAGAACCAGAAAACACCGGCAACTACGTTCTGCTTTCAAACATGTATGCTGAAGCTGGAAGGTGGCAGGAAGTTGACAAATTGAGAGCAATTATGAAATCCCAGGGGACAAAGAAAAGTCCAGGTTGCAGTTGGATCGAGATAAATGGCAAAGCACACATGTTTCTCGGTGGTGATACGTCTCACCCTCAAGCCAAGGAAATCTACATGTTCTTGGAGGCATTGCCAGAGAAGATGAAGGCAGCTGGCTATGTTCCTGATACGAGCTATGTGTTGCACGACATCAGCGAGGAAGAGAAAGAATTCAACCTCCTTGCACACAGTGAGAAGCTCGCTGTCGCATTTGGGATTCTCAACACTTCTGTTGAAACCGTTCTCCGGGTGACAAAGAACTTGAGAATCTGTGGGGACTGCCACACTGCAATGGTGTTCATATCAGAGATATATGGGCGGGAAGTAGTTGTTAGAGATGTGAATCGGTTCCATCACTTTAAAGGGGGTTCTTGTTCTTGTGGAGATTACTGGTGA

Coding sequence (CDS)

ATGCACAATGGCATTCGGTTATCCATATCCATCCCAAACCCTAATCACCTTCTCTTTCGAATCCTCCATTCTTATTCTGGTTCCCCTCACATTGACATTGCCCATCCACCATCATCCCCACCATTCAAATTCTCAATCTCTCCCCTTTCTCTCTCTGTGACTCTCCGCGACCTCCTGCAGCCGCTCTCTGCGCCGGACCCACCTCCGATTCTATCTTATGCCCCCGTCTTCCAGTTCCTTACTGGCCTAAACCTGTTGAAATTGGGCCACCAAGTTCATGCCCACATGTTTCTCCGTGGCCTTCAGCCCACCGCGCTAGTTGGCTCCAAGATGGTTGCGTTTTATGCCAGTTCCGGTGATATTGATTCCTCTGTTTCGGTCTTCAATCAGATTAGTGAGCCTTCTTCTCTGTTGTTTAATTCCATGATTCGAGCCTATGCCCGATATGGGTTTGCGGAGAGAACTGTTTCCACTTATTTTTCTATGCATTCATGGGGCTTTACTGGGGATTACTTTACTTTCCCTTTTGTTCTTAAGTCTTCTGTGGATTTGTTGAGTGCTTGGATGGGGAAATGTGTTCATGGACTGATTTTGAGAGTTGGGTTGCAGTTTGATTTGTATGTGGCTACTTCTTTGATTGATATGTATGGGAAATGTGGTGAAATAAATGATGCGGGTAAGGTGTTTGATAATATGACTATTAGAGATGTTTCAGCTTGGAATGCTTTACTTACTGGTTATATGAAGAGTGGGTGTATTGATGCCGCAGTGGCGATTTTCGAGAGAATGCCATGGAGGAACATTGTCTCTTGGACGACTATGATTTCTGGATACTCACAGAGCGGCTTGGCACAGCATGCATTGAGTTTGTTTGATGAAATGGTGAAAGAAGATTCAGGAGTAAGACCCAACTGGGTGACTATAATGAGCGTCCTCCCAGCTTGTGCACAATCATCGGCACTGGAACGCGGAAGGCAGATTCATGAGTTGGCTTGTCGGATGGGTTTGAATTCAAATCCTTCTGTGCTGATTGCCCTTACTGCAATGTATGCAAAATGTGGAAGCTTAGCCGATGCTCGCAACTGTTTCGACAGGCTTAATAGAAGTGAAAAGAATTTGGTTGCTTGGAATACCATGATAACTGCTTATGCTTCCTATGGACATGGGCTTGAAGCAGTGTCAACCTTTCGGGAGATGATCCAAGCAGGCATTCAGCCAGATGACATTACATTCACAGGATTGTTATCCGGTTGCAGCCATTCAGGTCGAGCAGGGAGATTAGCTGAAGCAAGTAAACTTGTAGACGAAATGCCAATGCCAGCAGGACCGAGCATTTGGGGTTCACTATTAGCTGCCTGCCGAAAATACCGCAATCTGGAAATGGCAGAAACTGCAGCAAGAAAGCTTTTTGTCCTAGAACCAGAAAACACCGGCAACTACGTTCTGCTTTCAAACATGTATGCTGAAGCTGGAAGGTGGCAGGAAGTTGACAAATTGAGAGCAATTATGAAATCCCAGGGGACAAAGAAAAGTCCAGGTTGCAGTTGGATCGAGATAAATGGCAAAGCACACATGTTTCTCGGTGGTGATACGTCTCACCCTCAAGCCAAGGAAATCTACATGTTCTTGGAGGCATTGCCAGAGAAGATGAAGGCAGCTGGCTATGTTCCTGATACGAGCTATGTGTTGCACGACATCAGCGAGGAAGAGAAAGAATTCAACCTCCTTGCACACAGTGAGAAGCTCGCTGTCGCATTTGGGATTCTCAACACTTCTGTTGAAACCGTTCTCCGGGTGACAAAGAACTTGAGAATCTGTGGGGACTGCCACACTGCAATGGTGTTCATATCAGAGATATATGGGCGGGAAGTAGTTGTTAGAGATGTGAATCGGTTCCATCACTTTAAAGGGGGTTCTTGTTCTTGTGGAGATTACTGGTGA

Protein sequence

MHNGIRLSISIPNPNHLLFRILHSYSGSPHIDIAHPPSSPPFKFSISPLSLSVTLRDLLQPLSAPDPPPILSYAPVFQFLTGLNLLKLGHQVHAHMFLRGLQPTALVGSKMVAFYASSGDIDSSVSVFNQISEPSSLLFNSMIRAYARYGFAERTVSTYFSMHSWGFTGDYFTFPFVLKSSVDLLSAWMGKCVHGLILRVGLQFDLYVATSLIDMYGKCGEINDAGKVFDNMTIRDVSAWNALLTGYMKSGCIDAAVAIFERMPWRNIVSWTTMISGYSQSGLAQHALSLFDEMVKEDSGVRPNWVTIMSVLPACAQSSALERGRQIHELACRMGLNSNPSVLIALTAMYAKCGSLADARNCFDRLNRSEKNLVAWNTMITAYASYGHGLEAVSTFREMIQAGIQPDDITFTGLLSGCSHSGRAGRLAEASKLVDEMPMPAGPSIWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIMKSQGTKKSPGCSWIEINGKAHMFLGGDTSHPQAKEIYMFLEALPEKMKAAGYVPDTSYVLHDISEEEKEFNLLAHSEKLAVAFGILNTSVETVLRVTKNLRICGDCHTAMVFISEIYGREVVVRDVNRFHHFKGGSCSCGDYW
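
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code; `translate` and `CODON_TABLE` are illustrative names and are not part of the database. Unknown codons are reported as `X`, and translation stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First four codons of the CDS listed above.
print(translate("ATGCACAATGGC"))
```

Applied to the first twelve bases of the CDS, this yields the first four residues of the protein sequence (MHNG); the last four codons, GATTACTGGTGA, yield DYW followed by a stop.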
BLAST of Cla97C05G092010 vs. NCBI nr
Match: XP_004140278.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g62890-like [Cucumis sativus] >KGN48088.1 hypothetical protein Csa_6G430650 [Cucumis sativus])

HSP 1 Score: 1094.0 bits (2828), Expect = 0.0e+00
Identity = 602/679 (88.66%), Positives = 622/679 (91.61%), Query Frame = 0

Query: 1   MHNGIRLSISIPNPNHLLFRILHSYSGSPHIDIAHPPSSPPFKFSISPLSLSVTLRDLLQ 60
           MHNGIRLSISIP P+HLLFRILHSYSGS HID   PPSSPPFK SISPL++S TL++LLQ
Sbjct: 1   MHNGIRLSISIPTPSHLLFRILHSYSGSAHIDTVPPPSSPPFKCSISPLTISATLQNLLQ 60

Query: 61  PLSAPDPPPILSYAPVFQFLTGLNLLKLGHQVHAHMFLRGLQPTALVGSKMVAFYASSGD 120
           PLSAP PPPILSYAPVFQFLTGLN+LKLGHQVHAHM LRGLQPTALVGSKMVAFYASSGD
Sbjct: 61  PLSAPGPPPILSYAPVFQFLTGLNMLKLGHQVHAHMLLRGLQPTALVGSKMVAFYASSGD 120

Query: 121 IDSSVSVFNQISEPSSLLFNSMIRAYARYGFAERTVSTYFSMHSWGFTGDYFTFPFVLKS 180
           IDSSVSVFN I EPSSLLFNSMIRAYARYGFAERTV+TYFSMHSWGFTGDYFTFPFVLKS
Sbjct: 121 IDSSVSVFNGIGEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKS 180

Query: 181 SVDLLSAWMGKCVHGLILRVGLQFDLYVATSLIDMYGKCGEINDAGKVFDNMTIRDVSAW 240
           SV+LLS WMGKCVHGLILR+GLQFDLYVATSLI +YGKCGEINDAGKVFDNMTIRDVS+W
Sbjct: 181 SVELLSVWMGKCVHGLILRIGLQFDLYVATSLIILYGKCGEINDAGKVFDNMTIRDVSSW 240

Query: 241 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVKEDSG 300
            XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVKEDSG
Sbjct: 241 NXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVKEDSG 300

Query: 301 VRPNWVTIMSVLPACAQSSALERGRQIHELACRMGLNSNPSVLIALTAMYAKCGSLADAR 360
           VRPNWVTIMSVLPACAQ S LERGRQIHELACRMGLNSN SVLIALTAMYAKCGSL DAR
Sbjct: 301 VRPNWVTIMSVLPACAQLSTLERGRQIHELACRMGLNSNASVLIALTAMYAKCGSLVDAR 360

Query: 361 NCFDRLNRSEKNLVAWNTMITAYASYGHGLEAVSTFREMIQAGIQPDDITFTGLLSGCSH 420
           NCFD+LNR+EKNL+AWNTMITAYASYGHGL+AVSTFREMIQAGIQPDDITFTGLLSGCSH
Sbjct: 361 NCFDKLNRNEKNLIAWNTMITAYASYGHGLQAVSTFREMIQAGIQPDDITFTGLLSGCSH 420

Query: 421 S---------------------------------GRAGRLAEASKLVDEMPMPAGPSIWG 480
           S                                 GRAGRLAEASKLV EMPMPAGPSIWG
Sbjct: 421 SGLVDVGLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLVGEMPMPAGPSIWG 480

Query: 481 SLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIMKSQG 540
           SLLAACRK+RNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAI+KSQG
Sbjct: 481 SLLAACRKHRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIVKSQG 540

Query: 541 TKKSPGCSWIEINGKAHMFLGGDTSHPQAKEIYMFLEALPEKMKAAGYVPDTSYVLHDIS 600
           TKKSPGCSWIEINGKAHMFLGGDTSHPQ KEIYMFLEALPEKMKAAGY PDTSYVLHDIS
Sbjct: 541 TKKSPGCSWIEINGKAHMFLGGDTSHPQGKEIYMFLEALPEKMKAAGYFPDTSYVLHDIS 600

Query: 601 EEEKEFNLLAHSEKLAVAFGILNTSVETVLRVTKNLRICGDCHTAMVFISEIYGREVVVR 647
           EEEKEFNL+AHSEKLAVAFGILNT  ETVLRVTKNLRICGDCHTAMVFISEIYGREV+VR
Sbjct: 601 EEEKEFNLIAHSEKLAVAFGILNTPAETVLRVTKNLRICGDCHTAMVFISEIYGREVIVR 660
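
The percentages reported for each HSP are the number of matching alignment columns divided by the alignment length. A minimal sketch of that arithmetic, using the counts from the hit above (the function name `percent` is illustrative):

```python
def percent(matches: int, aligned_length: int) -> float:
    """Share of alignment columns that match, as a percentage with two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts from the Cucumis sativus hit: 602 identical and 622 similar of 679 columns.
print(percent(602, 679))
print(percent(622, 679))
```

The alignment length (679) exceeds the query protein length because it counts gap columns as well as aligned residues.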

BLAST of Cla97C05G092010 vs. NCBI nr
Match: XP_008456075.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g62890-like [Cucumis melo])

HSP 1 Score: 1080.9 bits (2794), Expect = 0.0e+00
Identity = 598/679 (88.07%), Positives = 618/679 (91.02%), Query Frame = 0

Query: 1   MHNGIRLSISIPNPNHLLFRILHSYSGSPHIDIAHPPSSPPFKFSISPLSLSVTLRDLLQ 60
           MHNGIRLSISIP P  LLFRILHSYSGS HI+   PPSSP FK SISPL++S TL++LLQ
Sbjct: 1   MHNGIRLSISIPTPTLLLFRILHSYSGSAHIETVPPPSSPLFKCSISPLTISATLQNLLQ 60

Query: 61  PLSAPDPPPILSYAPVFQFLTGLNLLKLGHQVHAHMFLRGLQPTALVGSKMVAFYASSGD 120
           PLSAP PPPILSYAPVFQFLTGLN+LKLGHQVHAHM LRGLQPTALVGSKMVAFYASSGD
Sbjct: 61  PLSAPGPPPILSYAPVFQFLTGLNMLKLGHQVHAHMLLRGLQPTALVGSKMVAFYASSGD 120

Query: 121 IDSSVSVFNQISEPSSLLFNSMIRAYARYGFAERTVSTYFSMHSWGFTGDYFTFPFVLKS 180
           IDSSVSVFN I EPSSLLFNSMIRAYARYGFAERTV+TYFSMHSWGFTGDYFTFPFVLKS
Sbjct: 121 IDSSVSVFNGIGEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKS 180

Query: 181 SVDLLSAWMGKCVHGLILRVGLQFDLYVATSLIDMYGKCGEINDAGKVFDNMTIRDVSAW 240
           S DLLS WMGKCVHGLILR+GL  DLYVATSLID+YGKCGEIN+AGKVFDNMTIRDVS+W
Sbjct: 181 SADLLSVWMGKCVHGLILRIGLHCDLYVATSLIDLYGKCGEINEAGKVFDNMTIRDVSSW 240

Query: 241 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVKEDSG 300
            XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX+KEDSG
Sbjct: 241 NXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMKEDSG 300

Query: 301 VRPNWVTIMSVLPACAQSSALERGRQIHELACRMGLNSNPSVLIALTAMYAKCGSLADAR 360
           VRPNWVTIMSVLPACAQ S LERG QIHELACRMGLNSN SVLIALTAMYAKCGSL DAR
Sbjct: 301 VRPNWVTIMSVLPACAQLSTLERGTQIHELACRMGLNSNASVLIALTAMYAKCGSLVDAR 360

Query: 361 NCFDRLNRSEKNLVAWNTMITAYASYGHGLEAVSTFREMIQAGIQPDDITFTGLLSGCSH 420
           NCFD+LNRSEKNL+AWNTMITAYASYGHGLEAVSTFREMIQAGIQPDDITFTGLLSGCSH
Sbjct: 361 NCFDKLNRSEKNLIAWNTMITAYASYGHGLEAVSTFREMIQAGIQPDDITFTGLLSGCSH 420

Query: 421 S---------------------------------GRAGRLAEASKLVDEMPMPAGPSIWG 480
           S                                 GRAGRLAEASKLVDEMPMPAG SIWG
Sbjct: 421 SGLVDVGLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLVDEMPMPAGASIWG 480

Query: 481 SLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIMKSQG 540
           SLLAACRK+RNLEMAE AARKLFVLEPEN+GNYVLLSNMYAEAGRWQEVDKLRAI+KSQG
Sbjct: 481 SLLAACRKHRNLEMAEIAARKLFVLEPENSGNYVLLSNMYAEAGRWQEVDKLRAIVKSQG 540

Query: 541 TKKSPGCSWIEINGKAHMFLGGDTSHPQAKEIYMFLEALPEKMKAAGYVPDTSYVLHDIS 600
           TKKSPGCSWIEINGKAHMFLGGDTSHPQAKEIYMFLEALPEKMKAAGYVPDTSYVLHDIS
Sbjct: 541 TKKSPGCSWIEINGKAHMFLGGDTSHPQAKEIYMFLEALPEKMKAAGYVPDTSYVLHDIS 600

Query: 601 EEEKEFNLLAHSEKLAVAFGILNTSVETVLRVTKNLRICGDCHTAMVFISEIYGREVVVR 647
           EEEKEFNL+AHSEKLAVAFGILNT  ETVLRVTKNLRICGDCHTAMVFISEIYGREV+VR
Sbjct: 601 EEEKEFNLIAHSEKLAVAFGILNTPAETVLRVTKNLRICGDCHTAMVFISEIYGREVIVR 660

BLAST of Cla97C05G092010 vs. NCBI nr
Match: XP_022149333.1 (pentatricopeptide repeat-containing protein At3g62890-like [Momordica charantia])

HSP 1 Score: 1068.5 bits (2762), Expect = 8.6e-309
Identity = 589/681 (86.49%), Positives = 623/681 (91.48%), Query Frame = 0

Query: 1   MHNGIRLSISIPNPNHLLFRILHSYSGSPHIDIAHPP-SSPPFKFSISPLSLSVTLRDLL 60
           M NG+RLS+S P PNH LFR+LHSYS SP IDIA PP SSPPFK SISPLSLS TLR+LL
Sbjct: 1   MQNGVRLSLSNPIPNHSLFRLLHSYSDSPQIDIAPPPSSSPPFKCSISPLSLSTTLRNLL 60

Query: 61  QPLSAPDPPPILSYAPVFQFLTGLNLLKLGHQVHAHMFLRGLQPTALVGSKMVAFYASSG 120
           +PLSAP PPP++SYAP+FQFLTGLNLLKLG Q+HAHM LRG+ PTALVGSK+VAFYASSG
Sbjct: 61  RPLSAPHPPPVVSYAPLFQFLTGLNLLKLGRQIHAHMLLRGIDPTALVGSKLVAFYASSG 120

Query: 121 DIDSSVSVFNQISE-PSSLLFNSMIRAYARYGFAERTVSTYFSMHSWGFTGDYFTFPFVL 180
           DIDSSVSVFN+ S+ PSSLLFNSMIRA++R+GFAERTV+TYF MHSWGFTGDYFTFPFVL
Sbjct: 121 DIDSSVSVFNRFSDGPSSLLFNSMIRAFSRFGFAERTVATYFEMHSWGFTGDYFTFPFVL 180

Query: 181 KSSVDLLSAWMGKCVHGLILRVGLQFDLYVATSLIDMYGKCGEINDAGKVFDNMTIRDVS 240
           KSSVDLL  WMG+CVHG I+R+GLQFDLYVATSLIDMYGKCGEINDAGKVFDNMT+RDVS
Sbjct: 181 KSSVDLLCVWMGRCVHGQIVRLGLQFDLYVATSLIDMYGKCGEINDAGKVFDNMTVRDVS 240

Query: 241 AWXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVKED 300
           +W XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX +KED
Sbjct: 241 SWNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMLKED 300

Query: 301 SGVRPNWVTIMSVLPACAQSSALERGRQIHELACRMGLNSNPSVLIALTAMYAKCGSLAD 360
           SGVRPNWVTIMSVLPACAQSSAL+RGRQIHELACRMGLNSN SVLIALTAMYAKCGSLAD
Sbjct: 301 SGVRPNWVTIMSVLPACAQSSALDRGRQIHELACRMGLNSNASVLIALTAMYAKCGSLAD 360

Query: 361 ARNCFDRLNRSEKNLVAWNTMITAYASYGHGLEAVSTFREMIQAGIQPDDITFTGLLSGC 420
           A+NCF+RLNRSE+NLVAWNTMITAYASYGHGLEAVSTF+EMIQAGIQPDDITFTGLLSGC
Sbjct: 361 AQNCFNRLNRSERNLVAWNTMITAYASYGHGLEAVSTFQEMIQAGIQPDDITFTGLLSGC 420

Query: 421 SHS---------------------------------GRAGRLAEASKLVDEMPMPAGPSI 480
           SHS                                 GRAGRLAEASKLVDEMPMPAGPSI
Sbjct: 421 SHSGLVDVGLKYFNCMSTTYSIKPRAEHYACVVDLLGRAGRLAEASKLVDEMPMPAGPSI 480

Query: 481 WGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIMKS 540
           WGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEV+KLRAI+KS
Sbjct: 481 WGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVNKLRAILKS 540

Query: 541 QGTKKSPGCSWIEINGKAHMFLGGDTSHPQAKEIYMFLEALPEKMKAAGYVPDTSYVLHD 600
           QGTKKSPGCSWIE+NGKAHMFLGGDTSHPQAKEIYMFLEALPEKMKAAGY+PDTSYVLHD
Sbjct: 541 QGTKKSPGCSWIEVNGKAHMFLGGDTSHPQAKEIYMFLEALPEKMKAAGYMPDTSYVLHD 600

Query: 601 ISEEEKEFNLLAHSEKLAVAFGILNTSVETVLRVTKNLRICGDCHTAMVFISEIYGREVV 647
           ISEEEKE NL+AHSEKLAVAFGILNTS ETVLRVTKNLRICGDCHTAMVFISEIYGRE+V
Sbjct: 601 ISEEEKESNLIAHSEKLAVAFGILNTSAETVLRVTKNLRICGDCHTAMVFISEIYGREIV 660

BLAST of Cla97C05G092010 vs. NCBI nr
Match: XP_022922754.1 (pentatricopeptide repeat-containing protein At3g62890-like [Cucurbita moschata])

HSP 1 Score: 1040.8 bits (2690), Expect = 1.9e-300
Identity = 583/679 (85.86%), Positives = 603/679 (88.81%), Query Frame = 0

Query: 1   MHNGIRLSISIPNPNHLLFRILHSYSGSPHIDIAHPPSSPPFKFSISPLSLSVTLRDLLQ 60
           M NGIRLSI IPNPN LLFRILHSY GS HIDIA PPSSPPFK SISP SLS TLR+LLQ
Sbjct: 1   MLNGIRLSIFIPNPNRLLFRILHSYLGSSHIDIAPPPSSPPFKCSISPRSLSATLRNLLQ 60

Query: 61  PLSAPDPPPILSYAPVFQFLTGLNLLKLGHQVHAHMFLRGLQPTALVGSKMVAFYASSGD 120
           PLSAPDPPPILSYA VFQFLTG NLLKLG QVHAHM LRGL+PTALVGSKMVAFYASSGD
Sbjct: 61  PLSAPDPPPILSYASVFQFLTGQNLLKLGQQVHAHMLLRGLEPTALVGSKMVAFYASSGD 120

Query: 121 IDSSVSVFNQISEPSSLLFNSMIRAYARYGFAERTVSTYFSMHSWGFTGDYFTFPFVLKS 180
           IDSSV+VFN+ISE               YGFAERTV+TYFSMHSWGFTGDYFTFPFVLKS
Sbjct: 121 IDSSVAVFNRISEXXXXXXXXXXXXXXXYGFAERTVATYFSMHSWGFTGDYFTFPFVLKS 180

Query: 181 SVDLLSAWMGKCVHGLILRVGLQFDLYVATSLIDMYGKCGEINDAGKVFDNMTIRDVSAW 240
           SVDLLS WMGKCVHGL+LR GL+FDLYVATSLIDMYGKCGEINDA KVFD MT+RDVS+W
Sbjct: 181 SVDLLSVWMGKCVHGLVLRAGLEFDLYVATSLIDMYGKCGEINDARKVFDKMTVRDVSSW 240

Query: 241 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVKEDSG 300
            XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX+KEDSG
Sbjct: 241 NXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLKEDSG 300

Query: 301 VRPNWVTIMSVLPACAQSSALERGRQIHELACRMGLNSNPSVLIALTAMYAKCGSLADAR 360
           VRPNWVTIMSVLPACAQSSALERGR+IHELACRMGLNSN SVLIALTAMYAKCGSLADAR
Sbjct: 301 VRPNWVTIMSVLPACAQSSALERGRRIHELACRMGLNSNASVLIALTAMYAKCGSLADAR 360

Query: 361 NCFDRLNRSEKNLVAWNTMITAYASYGHGLEAVSTFREMIQAGIQPDDITFTGLLSGCSH 420
           NCF+RLNRSEK+LVAWNTMITAYASYGHG EAVSTF+EMI+AGI+PDDITFTGLLS CSH
Sbjct: 361 NCFNRLNRSEKSLVAWNTMITAYASYGHGREAVSTFQEMIEAGIRPDDITFTGLLSACSH 420

Query: 421 S---------------------------------GRAGRLAEASKLVDEMPMPAGPSIWG 480
           S                                 GRAGRLAEASKLVDEMPMPAGPSIWG
Sbjct: 421 SGLVDIGLNYFNYMSTTYSTNPRAEHYACVVDLLGRAGRLAEASKLVDEMPMPAGPSIWG 480

Query: 481 SLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIMKSQG 540
           SLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAI+ SQG
Sbjct: 481 SLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAILISQG 540

Query: 541 TKKSPGCSWIEINGKAHMFLGGDTSHPQAKEIYMFLEALPEKMKAAGYVPDTSYVLHDIS 600
           TKKSPGCSWIE+NG AHMFLGGDTSHPQ KEIYMFLEALPEKMKAAGY PDTS+VLHDIS
Sbjct: 541 TKKSPGCSWIEVNGIAHMFLGGDTSHPQTKEIYMFLEALPEKMKAAGYTPDTSFVLHDIS 600

Query: 601 EEEKEFNLLAHSEKLAVAFGILNTSVETVLRVTKNLRICGDCHTAMVFISEIYGREVVVR 647
           EEEKEFNL+AHSEKLAVAFGILNT  ETVLRVTKNLRICGDCHTAMVFISEIYGREVVVR
Sbjct: 601 EEEKEFNLIAHSEKLAVAFGILNTPSETVLRVTKNLRICGDCHTAMVFISEIYGREVVVR 660

BLAST of Cla97C05G092010 vs. NCBI nr
Match: XP_023552018.1 (pentatricopeptide repeat-containing protein At3g62890-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1037.7 bits (2682), Expect = 1.6e-299
Identity = 582/679 (85.71%), Positives = 602/679 (88.66%), Query Frame = 0

Query: 1   MHNGIRLSISIPNPNHLLFRILHSYSGSPHIDIAHPPSSPPFKFSISPLSLSVTLRDLLQ 60
           M NGIRLSI IPNPN LLFRILHSY GS HIDIA PPSSPPFK SISP SLS TLR+LLQ
Sbjct: 1   MLNGIRLSIFIPNPNRLLFRILHSYLGSSHIDIAPPPSSPPFKCSISPRSLSATLRNLLQ 60

Query: 61  PLSAPDPPPILSYAPVFQFLTGLNLLKLGHQVHAHMFLRGLQPTALVGSKMVAFYASSGD 120
           PLSAPDPPPILSYA VFQFLTG NLLKLG QVHAHM LRGL+PTALVGSKMVAFYASSGD
Sbjct: 61  PLSAPDPPPILSYASVFQFLTGQNLLKLGQQVHAHMLLRGLEPTALVGSKMVAFYASSGD 120

Query: 121 IDSSVSVFNQISEPSSLLFNSMIRAYARYGFAERTVSTYFSMHSWGFTGDYFTFPFVLKS 180
           IDSSV+VFN+ISE               YGFAERTV+TYFSMHSWGFTGDYFTFPFVLKS
Sbjct: 121 IDSSVAVFNRISEXXXXXXXXXXXXXXXYGFAERTVATYFSMHSWGFTGDYFTFPFVLKS 180

Query: 181 SVDLLSAWMGKCVHGLILRVGLQFDLYVATSLIDMYGKCGEINDAGKVFDNMTIRDVSAW 240
           SVDLLS WMGKCVHGL+LR GLQFDLYVATSLID+YGKCGEI DA KVFD MT+RDVSAW
Sbjct: 181 SVDLLSVWMGKCVHGLVLRAGLQFDLYVATSLIDLYGKCGEIKDARKVFDKMTVRDVSAW 240

Query: 241 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVKEDSG 300
            XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX+KEDSG
Sbjct: 241 NXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLKEDSG 300

Query: 301 VRPNWVTIMSVLPACAQSSALERGRQIHELACRMGLNSNPSVLIALTAMYAKCGSLADAR 360
           VRPNWVTIMSVLPACAQSSALERGR+IHELACRMGLNSN SVLIALTAMYAKCGSLADAR
Sbjct: 301 VRPNWVTIMSVLPACAQSSALERGRRIHELACRMGLNSNASVLIALTAMYAKCGSLADAR 360

Query: 361 NCFDRLNRSEKNLVAWNTMITAYASYGHGLEAVSTFREMIQAGIQPDDITFTGLLSGCSH 420
           NCF+RL+RSEK+LVAWNTMITAYASYGHG EAVSTF+EMI+AGI+PDDITFTGLLS CSH
Sbjct: 361 NCFNRLSRSEKSLVAWNTMITAYASYGHGREAVSTFQEMIEAGIRPDDITFTGLLSACSH 420

Query: 421 S---------------------------------GRAGRLAEASKLVDEMPMPAGPSIWG 480
           S                                 GRAGRLAEASKLVDEMPMPAGPSIWG
Sbjct: 421 SGLVDIGLNYFNYMSTTYSTNPRAEHYACVVDLLGRAGRLAEASKLVDEMPMPAGPSIWG 480

Query: 481 SLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIMKSQG 540
           SLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAI+ SQG
Sbjct: 481 SLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAILISQG 540

Query: 541 TKKSPGCSWIEINGKAHMFLGGDTSHPQAKEIYMFLEALPEKMKAAGYVPDTSYVLHDIS 600
           TKKSPGCSWIE+NG AHMFLGGDTSHPQ KEIYMFLEALPEKMKAAGY PDTS+VLHDIS
Sbjct: 541 TKKSPGCSWIEVNGIAHMFLGGDTSHPQTKEIYMFLEALPEKMKAAGYTPDTSFVLHDIS 600

Query: 601 EEEKEFNLLAHSEKLAVAFGILNTSVETVLRVTKNLRICGDCHTAMVFISEIYGREVVVR 647
           EEEKEFNL+AHSEKLAVAFGILNT  ETVLRVTKNLRICGDCHTAMVFISEIYGREVVVR
Sbjct: 601 EEEKEFNLIAHSEKLAVAFGILNTPSETVLRVTKNLRICGDCHTAMVFISEIYGREVVVR 660

BLAST of Cla97C05G092010 vs. TrEMBL
Match: tr|A0A0A0KEZ1|A0A0A0KEZ1_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G430650 PE=4 SV=1)

HSP 1 Score: 1094.0 bits (2828), Expect = 0.0e+00
Identity = 602/679 (88.66%), Positives = 622/679 (91.61%), Query Frame = 0

Query: 1   MHNGIRLSISIPNPNHLLFRILHSYSGSPHIDIAHPPSSPPFKFSISPLSLSVTLRDLLQ 60
           MHNGIRLSISIP P+HLLFRILHSYSGS HID   PPSSPPFK SISPL++S TL++LLQ
Sbjct: 1   MHNGIRLSISIPTPSHLLFRILHSYSGSAHIDTVPPPSSPPFKCSISPLTISATLQNLLQ 60

Query: 61  PLSAPDPPPILSYAPVFQFLTGLNLLKLGHQVHAHMFLRGLQPTALVGSKMVAFYASSGD 120
           PLSAP PPPILSYAPVFQFLTGLN+LKLGHQVHAHM LRGLQPTALVGSKMVAFYASSGD
Sbjct: 61  PLSAPGPPPILSYAPVFQFLTGLNMLKLGHQVHAHMLLRGLQPTALVGSKMVAFYASSGD 120

Query: 121 IDSSVSVFNQISEPSSLLFNSMIRAYARYGFAERTVSTYFSMHSWGFTGDYFTFPFVLKS 180
           IDSSVSVFN I EPSSLLFNSMIRAYARYGFAERTV+TYFSMHSWGFTGDYFTFPFVLKS
Sbjct: 121 IDSSVSVFNGIGEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKS 180

Query: 181 SVDLLSAWMGKCVHGLILRVGLQFDLYVATSLIDMYGKCGEINDAGKVFDNMTIRDVSAW 240
           SV+LLS WMGKCVHGLILR+GLQFDLYVATSLI +YGKCGEINDAGKVFDNMTIRDVS+W
Sbjct: 181 SVELLSVWMGKCVHGLILRIGLQFDLYVATSLIILYGKCGEINDAGKVFDNMTIRDVSSW 240

Query: 241 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVKEDSG 300
            XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVKEDSG
Sbjct: 241 NXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVKEDSG 300

Query: 301 VRPNWVTIMSVLPACAQSSALERGRQIHELACRMGLNSNPSVLIALTAMYAKCGSLADAR 360
           VRPNWVTIMSVLPACAQ S LERGRQIHELACRMGLNSN SVLIALTAMYAKCGSL DAR
Sbjct: 301 VRPNWVTIMSVLPACAQLSTLERGRQIHELACRMGLNSNASVLIALTAMYAKCGSLVDAR 360

Query: 361 NCFDRLNRSEKNLVAWNTMITAYASYGHGLEAVSTFREMIQAGIQPDDITFTGLLSGCSH 420
           NCFD+LNR+EKNL+AWNTMITAYASYGHGL+AVSTFREMIQAGIQPDDITFTGLLSGCSH
Sbjct: 361 NCFDKLNRNEKNLIAWNTMITAYASYGHGLQAVSTFREMIQAGIQPDDITFTGLLSGCSH 420

Query: 421 S---------------------------------GRAGRLAEASKLVDEMPMPAGPSIWG 480
           S                                 GRAGRLAEASKLV EMPMPAGPSIWG
Sbjct: 421 SGLVDVGLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLVGEMPMPAGPSIWG 480

Query: 481 SLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIMKSQG 540
           SLLAACRK+RNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAI+KSQG
Sbjct: 481 SLLAACRKHRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIVKSQG 540

Query: 541 TKKSPGCSWIEINGKAHMFLGGDTSHPQAKEIYMFLEALPEKMKAAGYVPDTSYVLHDIS 600
           TKKSPGCSWIEINGKAHMFLGGDTSHPQ KEIYMFLEALPEKMKAAGY PDTSYVLHDIS
Sbjct: 541 TKKSPGCSWIEINGKAHMFLGGDTSHPQGKEIYMFLEALPEKMKAAGYFPDTSYVLHDIS 600

Query: 601 EEEKEFNLLAHSEKLAVAFGILNTSVETVLRVTKNLRICGDCHTAMVFISEIYGREVVVR 647
           EEEKEFNL+AHSEKLAVAFGILNT  ETVLRVTKNLRICGDCHTAMVFISEIYGREV+VR
Sbjct: 601 EEEKEFNLIAHSEKLAVAFGILNTPAETVLRVTKNLRICGDCHTAMVFISEIYGREVIVR 660

BLAST of Cla97C05G092010 vs. TrEMBL
Match: tr|A0A1S3C2H6|A0A1S3C2H6_CUCME (pentatricopeptide repeat-containing protein At3g62890-like OS=Cucumis melo OX=3656 GN=LOC103496117 PE=4 SV=1)

HSP 1 Score: 1080.9 bits (2794), Expect = 0.0e+00
Identity = 598/679 (88.07%), Positives = 618/679 (91.02%), Query Frame = 0

Query: 1   MHNGIRLSISIPNPNHLLFRILHSYSGSPHIDIAHPPSSPPFKFSISPLSLSVTLRDLLQ 60
           MHNGIRLSISIP P  LLFRILHSYSGS HI+   PPSSP FK SISPL++S TL++LLQ
Sbjct: 1   MHNGIRLSISIPTPTLLLFRILHSYSGSAHIETVPPPSSPLFKCSISPLTISATLQNLLQ 60

Query: 61  PLSAPDPPPILSYAPVFQFLTGLNLLKLGHQVHAHMFLRGLQPTALVGSKMVAFYASSGD 120
           PLSAP PPPILSYAPVFQFLTGLN+LKLGHQVHAHM LRGLQPTALVGSKMVAFYASSGD
Sbjct: 61  PLSAPGPPPILSYAPVFQFLTGLNMLKLGHQVHAHMLLRGLQPTALVGSKMVAFYASSGD 120

Query: 121 IDSSVSVFNQISEPSSLLFNSMIRAYARYGFAERTVSTYFSMHSWGFTGDYFTFPFVLKS 180
           IDSSVSVFN I EPSSLLFNSMIRAYARYGFAERTV+TYFSMHSWGFTGDYFTFPFVLKS
Sbjct: 121 IDSSVSVFNGIGEPSSLLFNSMIRAYARYGFAERTVATYFSMHSWGFTGDYFTFPFVLKS 180

Query: 181 SVDLLSAWMGKCVHGLILRVGLQFDLYVATSLIDMYGKCGEINDAGKVFDNMTIRDVSAW 240
           S DLLS WMGKCVHGLILR+GL  DLYVATSLID+YGKCGEIN+AGKVFDNMTIRDVS+W
Sbjct: 181 SADLLSVWMGKCVHGLILRIGLHCDLYVATSLIDLYGKCGEINEAGKVFDNMTIRDVSSW 240

Query: 241 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVKEDSG 300
            XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX+KEDSG
Sbjct: 241 NXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMKEDSG 300

Query: 301 VRPNWVTIMSVLPACAQSSALERGRQIHELACRMGLNSNPSVLIALTAMYAKCGSLADAR 360
           VRPNWVTIMSVLPACAQ S LERG QIHELACRMGLNSN SVLIALTAMYAKCGSL DAR
Sbjct: 301 VRPNWVTIMSVLPACAQLSTLERGTQIHELACRMGLNSNASVLIALTAMYAKCGSLVDAR 360

Query: 361 NCFDRLNRSEKNLVAWNTMITAYASYGHGLEAVSTFREMIQAGIQPDDITFTGLLSGCSH 420
           NCFD+LNRSEKNL+AWNTMITAYASYGHGLEAVSTFREMIQAGIQPDDITFTGLLSGCSH
Sbjct: 361 NCFDKLNRSEKNLIAWNTMITAYASYGHGLEAVSTFREMIQAGIQPDDITFTGLLSGCSH 420

Query: 421 S---------------------------------GRAGRLAEASKLVDEMPMPAGPSIWG 480
           S                                 GRAGRLAEASKLVDEMPMPAG SIWG
Sbjct: 421 SGLVDVGLKYFNHMSTTYSINPRVEHYACVADLLGRAGRLAEASKLVDEMPMPAGASIWG 480

Query: 481 SLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIMKSQG 540
           SLLAACRK+RNLEMAE AARKLFVLEPEN+GNYVLLSNMYAEAGRWQEVDKLRAI+KSQG
Sbjct: 481 SLLAACRKHRNLEMAEIAARKLFVLEPENSGNYVLLSNMYAEAGRWQEVDKLRAIVKSQG 540

Query: 541 TKKSPGCSWIEINGKAHMFLGGDTSHPQAKEIYMFLEALPEKMKAAGYVPDTSYVLHDIS 600
           TKKSPGCSWIEINGKAHMFLGGDTSHPQAKEIYMFLEALPEKMKAAGYVPDTSYVLHDIS
Sbjct: 541 TKKSPGCSWIEINGKAHMFLGGDTSHPQAKEIYMFLEALPEKMKAAGYVPDTSYVLHDIS 600

Query: 601 EEEKEFNLLAHSEKLAVAFGILNTSVETVLRVTKNLRICGDCHTAMVFISEIYGREVVVR 647
           EEEKEFNL+AHSEKLAVAFGILNT  ETVLRVTKNLRICGDCHTAMVFISEIYGREV+VR
Sbjct: 601 EEEKEFNLIAHSEKLAVAFGILNTPAETVLRVTKNLRICGDCHTAMVFISEIYGREVIVR 660

BLAST of Cla97C05G092010 vs. TrEMBL
Match: tr|A0A2P5FRB8|A0A2P5FRB8_9ROSA (DYW domain containing protein OS=Trema orientalis OX=63057 GN=TorRG33x02_040890 PE=4 SV=1)

HSP 1 Score: 787.3 bits (2032), Expect = 2.6e-224
Identity = 442/629 (70.27%), Positives = 511/629 (81.24%), Query Frame = 0

Query: 51  LSVTLRDLLQPLSAPDPPPILSYAPVFQFLTGLNLLKLGHQVHAHMFLRGLQPTALVGSK 110
           L  TL  L +PL   DPP   SYA VFQFLTG +LL+LG QVHAHM LR L+P A VG+K
Sbjct: 43  LHSTLSSLAEPLLRRDPPENSSYAAVFQFLTGRSLLRLGRQVHAHMALRSLEPDAFVGAK 102

Query: 111 MVAFYASSGDIDSSVSVFNQISEPSSLLFNSMIRAYARYGFAERTVSTYFSMHSWGFTGD 170
           M+A YASSGD+DS+V+ FN+I+ PSSL +NS+IRAY  YGF ++T+  YF M S G  GD
Sbjct: 103 MIAMYASSGDLDSAVTTFNRINNPSSLSYNSIIRAYTLYGFPQKTIGIYFRMRSLGLKGD 162

Query: 171 YFTFPFVLKSSVDLLSAWMGKCVHGLILRVGLQFDLYVATSLIDMYGKCGEINDAGKVFD 230
            FT+PFVLK   DL + WMG CVHG+ LR+GL+ D+YV TSLIDMY KCGEI+DA K+FD
Sbjct: 163 NFTYPFVLKCCADLSNVWMGICVHGISLRIGLEIDMYVGTSLIDMYVKCGEISDAHKLFD 222

Query: 231 NMTIRDVSAWXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 290
            M +RD+S+WXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 223 RMIVRDISSWXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 282

Query: 291 XXXXVKEDSGVRPNWVTIMSVLPACAQSSALERGRQIHELACRMGLNSNPSVLIALTAMY 350
           XXXX++EDSGV+PNWVTIMSVLPACA S+ALERG QIH+ A ++GL+SN SV  AL AMY
Sbjct: 283 XXXXLEEDSGVKPNWVTIMSVLPACAHSAALERGSQIHKFASKIGLDSNVSVQTALVAMY 342

Query: 351 AKCGSLADARNCFDRLNRSEKNLVAWNTMITAYASYGHGLEAVSTFREMIQAGIQPDDIT 410
           AKCG LA+A  CFDR+   +KNLVAWNTMITAY+S+G GLE++STF EMI+AG+QPD IT
Sbjct: 343 AKCGGLAEASQCFDRIPLEKKNLVAWNTMITAYSSHGRGLESISTFEEMIKAGVQPDTIT 402

Query: 411 FTGLLSGCSHS---------------------------------GRAGRLAEASKLVDEM 470
           FTGLLSGCSHS                                 GRAGRL EA+KL++ M
Sbjct: 403 FTGLLSGCSHSGLADMGLKYFSCMTKAYSVEPEVQHYACVVDLLGRAGRLVEANKLINNM 462

Query: 471 PMPAGPSIWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVD 530
           PM AGPS+WG+LLAACRK+RNLE+AE AA+KLF+LEP+N+GNYVLLSNMYAE G W+EVD
Sbjct: 463 PMQAGPSVWGALLAACRKHRNLEIAEVAAKKLFILEPDNSGNYVLLSNMYAEVGMWKEVD 522

Query: 531 KLRAIMKSQGTKKSPGCSWIEINGKAHMFLGGDTSHPQAKEIYMFLEALPEKMKAAGYVP 590
            LRA++KSQG +K+PGCSWIE+NGKAHMFLGGDTSHPQAKEI++FLEALPEK+K AGYVP
Sbjct: 523 NLRAMLKSQGMRKTPGCSWIEVNGKAHMFLGGDTSHPQAKEIHVFLEALPEKIKEAGYVP 582

Query: 591 DTSYVLHDISEEEKEFNLLAHSEKLAVAFGILNTSVETVLRVTKNLRICGDCHTAMVFIS 647
           DTS VLHD+SEEEKE NL  HSEKLA+AFG+LNTS   VLRVTKNLRIC DCHTA  FIS
Sbjct: 583 DTSLVLHDVSEEEKEHNLTTHSEKLAIAFGLLNTSPGVVLRVTKNLRICVDCHTATKFIS 642

BLAST of Cla97C05G092010 vs. TrEMBL
Match: tr|A0A1Q3AZH1|A0A1Q3AZH1_CEPFO (PPR domain-containing protein/PPR_2 domain-containing protein/PPR_3 domain-containing protein/DYW_deaminase domain-containing protein OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_04665 PE=4 SV=1)

HSP 1 Score: 759.6 bits (1960), Expect = 5.7e-216
Identity = 435/649 (67.03%), Positives = 515/649 (79.35%), Query Frame = 0

Query: 33  IAHPPSSPPFKFSISPLSLSVTLRDLLQPLSAPDPPPILS-YAPVFQFLTGLNLLKLGHQ 92
           I  PP++     S S +  S  L+ LL+P++  +PP +L  Y P+FQFLT  N LKLG Q
Sbjct: 78  IKQPPTTTTCSTSKSSI-YSTPLQILLEPINTQNPPQLLPYYTPIFQFLTTHNFLKLGQQ 137

Query: 93  VHAHMFLRGLQPTALVGSKMVAFYASSGDIDSSVSVFNQISEP-SSLLFNSMIRAYARYG 152
           VHAHM LRGL+P A + +KMVA YASSGD+ S+ +VF++I  P SSLL+NS+IRAY ++G
Sbjct: 138 VHAHMALRGLEPNAFLAAKMVAMYASSGDLSSADAVFSRIINPSSSLLYNSIIRAYTKHG 197

Query: 153 FAERTVSTYFSMHSWGFTGDYFTFPFVLKSSVDLLSAWMGKCVHGLILRVGLQFDLYVAT 212
            A++ +  +  MHS G  GDYFTFPFVLKS VDL    MGKCVHG ILR+GL+FDLYV T
Sbjct: 198 HAKKCIDMFVKMHSRGLEGDYFTFPFVLKSCVDLSCILMGKCVHGQILRIGLEFDLYVGT 257

Query: 213 SLIDMYGKCGEINDAGKVFDNMTIRDVSAWXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 272
           SLIDMY KCG++ +A K+FD MT++DVS+WXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 258 SLIDMYVKCGQLAEARKLFDKMTVKDVSSWXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 317

Query: 273 XXXXXXXXXXXXXXXXXXXXXXXXVKEDSGVRPNWVTIMSVLPACAQSSALERGRQIHEL 332
           XXXXXXXXXXXXXXXXXXXXXXXX KE+S V+PNWVTIMSVLPAC  S+AL+RGR+IH+ 
Sbjct: 318 XXXXXXXXXXXXXXXXXXXXXXXXXKEESEVKPNWVTIMSVLPACGHSAALQRGRRIHQF 377

Query: 333 ACRMGLNSNPSVLIALTAMYAKCGSLADARNCFDRLNRSEKNLVAWNTMITAYASYGHGL 392
           A ++GL+SN SV  AL +MYAKCGSL DA  CF R++  +KNLVAWNTMI AYAS+GHG+
Sbjct: 378 ARKIGLDSNSSVQTALVSMYAKCGSLIDAELCFSRIHPDDKNLVAWNTMIAAYASHGHGM 437

Query: 393 EAVSTFREMIQAGIQPDDITFTGLLSGCSHS----------------------------- 452
           E VSTF  MI+AG+QPD ITFTGL SGCSHS                             
Sbjct: 438 ETVSTFENMIRAGVQPDSITFTGLFSGCSHSGLSDVGLNYYNSMRTVFYVEPRIEHYASI 497

Query: 453 ----GRAGRLAEASKLVDEMPMPAGPSIWGSLLAACRKYRNLEMAETAARKLFVLEPENT 512
               GRAGRL EA +L+D+MPM AGPSIWG+LLAACR  RNL+ AE AA+KLFVLEP+N+
Sbjct: 498 VDLLGRAGRLLEAKELIDQMPMEAGPSIWGALLAACRNQRNLKFAEMAAKKLFVLEPDNS 557

Query: 513 GNYVLLSNMYAEAGRWQEVDKLRAIMKSQGTKKSPGCSWIEINGKAHMFLGGDTSHPQAK 572
           GNYVLLSNMYA+AG W+EV+ LR++++ QG KKSPGCSWIE+NGKAH+F GGD SHPQAK
Sbjct: 558 GNYVLLSNMYAQAGMWKEVNNLRSLLRRQGMKKSPGCSWIEVNGKAHLFRGGDRSHPQAK 617

Query: 573 EIYMFLEALPEKMKAAGYVPDTSYVLHDISEEEKEFNLLAHSEKLAVAFGILNTSVETVL 632
           EIYM LEALPEK+KAAGY+PDTS+VLHDISEEEKE NL  HSEKLA+AFG+LNT+   VL
Sbjct: 618 EIYMLLEALPEKIKAAGYIPDTSFVLHDISEEEKECNLQTHSEKLAIAFGLLNTNPGVVL 677

Query: 633 RVTKNLRICGDCHTAMVFISEIYGREVVVRDVNRFHHFKGGSCSCGDYW 647
           RVTKNLRICGDCH+A  F+S+IYGRE++VRDVNRFHHFK G CSCGDYW
Sbjct: 678 RVTKNLRICGDCHSATKFVSKIYGREIIVRDVNRFHHFKVGFCSCGDYW 725

BLAST of Cla97C05G092010 vs. TrEMBL
Match: tr|A0A2P5SUJ2|A0A2P5SUJ2_GOSBA (Uncharacterized protein OS=Gossypium barbadense OX=3634 GN=GOBAR_DD00255 PE=4 SV=1)

HSP 1 Score: 757.7 bits (1955), Expect = 2.2e-215
Identity = 431/651 (66.21%), Positives = 506/651 (77.73%), Query Frame = 0

Query: 29  PHIDIAHPPSSPPFKFSISPLSLSVTLRDLLQPLSAPDPPPILSYAPVFQFLTGLNLLKL 88
           PHID +    + P      P   + TL  LLQP+S  +PPP LSYAP+FQFLTG N LKL
Sbjct: 28  PHIDPSQTKCTTP-----KPFPYTSTLPTLLQPISDQNPPPHLSYAPLFQFLTGQNFLKL 87

Query: 89  GHQVHAHMFLRGLQPTALVGSKMVAFYASSGDIDSSVSVFNQISEPSSLLFNSMIRAYAR 148
           G Q+HAHM L GLQP A +G+KMVA YASSGD++S+V+VF +I +P+SLL+NS+IRAY  
Sbjct: 88  GQQIHAHMTLHGLQPNAFLGAKMVAMYASSGDLESAVTVFRKIKDPTSLLYNSIIRAYTN 147

Query: 149 YGFAERTVSTYFSMHSWGFTGDYFTFPFVLKSSVDLLSAWMGKCVHGLILRVGLQFDLYV 208
            G+  +T+  Y  MHS    GD FTFPFVLKS  ++L  WMG+CVHG  LR GL+ + YV
Sbjct: 148 NGYPLKTIDIYREMHSLRLKGDNFTFPFVLKSCANVLDVWMGECVHGQSLRFGLELNAYV 207

Query: 209 ATSLIDMYGKCGEINDAGKVFDNMTIRDVSAWXXXXXXXXXXXXXXXXXXXXXXXXXXXX 268
            TSLID Y K GE+ DA KVFD MT+R VS+W XXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 208 GTSLIDFYVKVGELRDANKVFDLMTVRAVSSWNXXXXXXXXXXXXXXXXXXXXXXXXXXX 267

Query: 269 XXXXXXXXXXXXXXXXXXXXXXXXXXVKEDSGVRPNWVTIMSVLPACAQSSALERGRQIH 328
           XXXXXXXXXXXXXXXXXXXXXXX   +KEDS V+PNWVTIMSVLPACA S++ ERGR+I+
Sbjct: 268 XXXXXXXXXXXXXXXXXXXXXXXDEILKEDSEVKPNWVTIMSVLPACAHSASFERGRRIN 327

Query: 329 ELACRMGLNSNPSVLIALTAMYAKCGSLADARNCFDRLNRSEKNLVAWNTMITAYASYGH 388
           E   R+GL SNPSV  AL AMYAKCGSL +AR CFDR+  +EKNL AWNTMITAYAS+G 
Sbjct: 328 EYVNRIGLESNPSVQTALIAMYAKCGSLVNARCCFDRILENEKNLCAWNTMITAYASHGQ 387

Query: 389 GLEAVSTFREMIQAGIQPDDITFTGLLSGCSHSG-------------------------- 448
           GLE+V TF  M++AG+ PD ITFTGLLSGCSHSG                          
Sbjct: 388 GLESVLTFENMVRAGVYPDAITFTGLLSGCSHSGIVEFGLRYFNSMQTKYCVEPRHEHYA 447

Query: 449 -------RAGRLAEASKLVDEMPMPAGPSIWGSLLAACRKYRNLEMAETAARKLFVLEPE 508
                  RAGRLAEA + + ++PM  GPSIWG+LLAACRK RNLE+AE AA++LFVLEPE
Sbjct: 448 SVVDLLARAGRLAEAKEFIKKIPMQPGPSIWGALLAACRKSRNLEIAEIAAKELFVLEPE 507

Query: 509 NTGNYVLLSNMYAEAGRWQEVDKLRAIMKSQGTKKSPGCSWIEINGKAHMFLGGDTSHPQ 568
           N+ NY+LLSNMYAEAG W+EVDKLRA +K +G KK+PGCSWIEI GKAH+FL GD SHPQ
Sbjct: 508 NSCNYILLSNMYAEAGMWKEVDKLRARLKCEGIKKNPGCSWIEIKGKAHLFLSGDLSHPQ 567

Query: 569 AKEIYMFLEALPEKMKAAGYVPDTSYVLHDISEEEKEFNLLAHSEKLAVAFGILNTSVET 628
           +KEIY  LEALPEK+KAAGY+P+TS+VLHDISEEEKE NL+ HSEKLA+AFG+LNT+ E 
Sbjct: 568 SKEIYNLLEALPEKIKAAGYIPNTSFVLHDISEEEKEQNLIIHSEKLAIAFGLLNTNPEV 627

Query: 629 VLRVTKNLRICGDCHTAMVFISEIYGREVVVRDVNRFHHFKGGSCSCGDYW 647
           V+R+TKNLRICGDCHT + FIS+IY RE+VVRDVNRFHHF+ G+CSCGDYW
Sbjct: 628 VIRITKNLRICGDCHTVIKFISKIYEREIVVRDVNRFHHFRDGACSCGDYW 673
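
The percentages in an HSP header are simply the match counts divided by the alignment length. A minimal sketch of that arithmetic, assuming the header line has the layout shown for the Gossypium barbadense hit above; the names `HSP_STATS`, `parse_hsp_stats`, and `percent` are hypothetical helpers for illustration, not part of any BLAST tool:

```python
import re

# Assumed layout of the statistics line, as printed in the HSP header above.
# The optional "i" lets the pattern accept both "Positives" and the
# "Postives" spelling that some exports of this page use.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, length):
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


def parse_hsp_stats(line):
    """Extract identity/positive counts and the reported percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("line does not look like an HSP statistics line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 431/651 (66.21%), Positives = 506/651 (77.73%), Query Frame = 0"
)
identity_pct = percent(stats["identities"], stats["alignment_length"])
positive_pct = percent(stats["positives"], stats["alignment_length"])
print(identity_pct, positive_pct)
```

Recomputing the values this way is a quick consistency check when the same hit is quoted in both an alignment header and a summary table.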

BLAST of Cla97C05G092010 vs. Swiss-Prot
Match: sp|P0C899|PP271_ARATH (Putative pentatricopeptide repeat-containing protein At3g49142 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H77 PE=3 SV=1)

HSP 1 Score: 403.7 bits (1036), Expect = 3.9e-111
Identity = 231/628 (36.78%), Positives = 321/628 (51.11%), Query Frame = 0

Query: 92  VHAHMFLRGLQPTALVGSKMVAFYASSGDIDSSVSVFNQISEPSSLLFNSMIRAYARYGF 151
           VH+ + L  L+  + +G K++  YAS  D+ S+  VF++I E + ++ N MIR+Y   GF
Sbjct: 61  VHSRIILEDLRCNSSLGVKLMRAYASLKDVASARKVFDEIPERNVIIINVMIRSYVNNGF 120

Query: 152 AERTVSTYFSMHSWGFTGDYFTFPFVLKSSVDLLSAWMGKCVHGLILRVGLQFDLYVATS 211
               V  + +M       D++TFP VLK+     +  +G+ +HG   +VGL   L+V   
Sbjct: 121 YGEGVKVFGTMCGCNVRPDHYTFPCVLKACSCSGTIVIGRKIHGSATKVGLSSTLFVGNG 180

Query: 212 LIDMYGKCGEINDAGKVFDNMTIRDVSAW------------------------------- 271
           L+ MYGKCG +++A  V D M+ RDV +W                               
Sbjct: 181 LVSMYGKCGFLSEARLVLDEMSRRDVVSWNSLVVGYAQNQRFDDALEVCREMESVKISHD 240

Query: 272 ----XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVK 331
                                                                       
Sbjct: 241 AGTMASLLPAVSNTTTENVMYVKDMFFKMGKKSLVSWNVMIGVYMKNAMPVEAVELYSRM 300

Query: 332 EDSGVRPNWVTIMSVLPACAQSSALERGRQIHELACRMGLNSNPSVLIALTAMYAKCGSL 391
           E  G  P+ V+I SVLPAC  +SAL  G++IH    R  L  N  +  AL  MYAKCG L
Sbjct: 301 EADGFEPDAVSITSVLPACGDTSALSLGKKIHGYIERKKLIPNLLLENALIDMYAKCGCL 360

Query: 392 ADARNCFDRLNRSEKNLVAWNTMITAYASYGHGLEAVSTFREMIQAGIQPDDITFTGLLS 451
             AR+ F+  N   +++V+W  MI+AY   G G +AV+ F ++  +G+ PD I F   L+
Sbjct: 361 EKARDVFE--NMKSRDVVSWTAMISAYGFSGRGCDAVALFSKLQDSGLVPDSIAFVTTLA 420

Query: 452 GCSHS---------------------------------GRAGRLAEASKLVDEMPMPAGP 511
            CSH+                                 GRAG++ EA + + +M M    
Sbjct: 421 ACSHAGLLEEGRSCFKLMTDHYKITPRLEHLACMVDLLGRAGKVKEAYRFIQDMSMEPNE 480

Query: 512 SIWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIM 571
            +WG+LL ACR + + ++   AA KLF L PE +G YVLLSN+YA+AGRW+EV  +R IM
Sbjct: 481 RVWGALLGACRVHSDTDIGLLAADKLFQLAPEQSGYYVLLSNIYAKAGRWEEVTNIRNIM 540

Query: 572 KSQGTKKSPGCSWIEINGKAHMFLGGDTSHPQAKEIYMFLEALPEKMKAAGYVPDTSYVL 631
           KS+G KK+PG S +E+N   H FL GD SHPQ+ EIY  L+ L +KMK  GYVPD+   L
Sbjct: 541 KSKGLKKNPGASNVEVNRIIHTFLVGDRSHPQSDEIYRELDVLVKKMKELGYVPDSESAL 600

Query: 632 HDISEEEKEFNLLAHSEKLAVAFGILNTSVE-----TVLRVTKNLRICGDCHTAMVFISE 647
           HD+ EE+KE +L  HSEKLA+ F ++NT  E       +R+TKNLRICGDCH A   IS+
Sbjct: 601 HDVEEEDKETHLAVHSEKLAIVFALMNTKEEEEDSNNTIRITKNLRICGDCHVAAKLISQ 660

BLAST of Cla97C05G092010 vs. Swiss-Prot
Match: sp|Q9STF3|PP265_ARATH (Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CRR2 PE=2 SV=1)

HSP 1 Score: 400.2 bits (1027), Expect = 4.3e-110
Identity = 217/629 (34.50%), Positives = 333/629 (52.94%), Query Frame = 0

Query: 55  LRDLLQPLSAPDPPPILSYAPVFQFLTGLNLLKLGHQVHAHMFLRGLQPTALVGSKMVAF 114
           L+  ++ LS    P   +Y  +       + L    +VH H+   G      + +K++  
Sbjct: 62  LKQAIRVLSQESSPSQQTYELLILCCGHRSSLSDALRVHRHILDNGSDQDPFLATKLIGM 121

Query: 115 YASSGDIDSSVSVFNQISEPSSLLFNSMIRAYARYGFAERTVSTYFSMHSWGFTGDYFTF 174
           Y+  G +D +  VF++  + +  ++N++ RA    G  E  +  Y+ M+  G   D FT+
Sbjct: 122 YSDLGSVDYARKVFDKTRKRTIYVWNALFRALTLAGHGEEVLGLYWKMNRIGVESDRFTY 181

Query: 175 PFVLKSSV----DLLSAWMGKCVHGLILRVGLQFDLYVATSLIDMYGKCGEINDAGKVFD 234
            +VLK+ V     +     GK +H  + R G    +Y+ T+L+DMY + G ++ A  VF 
Sbjct: 182 TYVLKACVASECTVNHLMKGKEIHAHLTRRGYSSHVYIMTTLVDMYARFGCVDYASYVFG 241

Query: 235 NMTIRDVSAWXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 294
            M +R+V +W                                                  
Sbjct: 242 GMPVRNVVSW-------------------------------SAMIACYAKNGKAFEALRT 301

Query: 295 XXXXVKEDSGVRPNWVTIMSVLPACAQSSALERGRQIHELACRMGLNSNPSVLIALTAMY 354
               ++E     PN VT++SVL ACA  +ALE+G+ IH    R GL+S   V+ AL  MY
Sbjct: 302 FREMMRETKDSSPNSVTMVSVLQACASLAALEQGKLIHGYILRRGLDSILPVISALVTMY 361

Query: 355 AKCGSLADARNCFDRLNRSEKNLVAWNTMITAYASYGHGLEAVSTFREMIQAGIQPDDIT 414
            +CG L   +  FDR++  ++++V+WN++I++Y  +G+G +A+  F EM+  G  P  +T
Sbjct: 362 GRCGKLEVGQRVFDRMH--DRDVVSWNSLISSYGVHGYGKKAIQIFEEMLANGASPTPVT 421

Query: 415 FTGLLSGCSHS---------------------------------GRAGRLAEASKLVDEM 474
           F  +L  CSH                                  GRA RL EA+K+V +M
Sbjct: 422 FVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYACMVDLLGRANRLDEAAKMVQDM 481

Query: 475 PMPAGPSIWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVD 534
               GP +WGSLL +CR + N+E+AE A+R+LF LEP+N GNYVLL+++YAEA  W EV 
Sbjct: 482 RTEPGPKVWGSLLGSCRIHGNVELAERASRRLFALEPKNAGNYVLLADIYAEAQMWDEVK 541

Query: 535 KLRAIMKSQGTKKSPGCSWIEINGKAHMFLGGDTSHPQAKEIYMFLEALPEKMKAAGYVP 594
           +++ +++ +G +K PG  W+E+  K + F+  D  +P  ++I+ FL  L E MK  GY+P
Sbjct: 542 RVKKLLEHRGLQKLPGRCWMEVRRKMYSFVSVDEFNPLMEQIHAFLVKLAEDMKEKGYIP 601

Query: 595 DTSYVLHDISEEEKEFNLLAHSEKLAVAFGILNTSVETVLRVTKNLRICGDCHTAMVFIS 647
            T  VL+++  EEKE  +L HSEKLA+AFG++NTS    +R+TKNLR+C DCH    FIS
Sbjct: 602 QTKGVLYELETEEKERIVLGHSEKLALAFGLINTSKGEPIRITKNLRLCEDCHLFTKFIS 657

BLAST of Cla97C05G092010 vs. Swiss-Prot
Match: sp|Q7Y211|PP285_ARATH (Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H81 PE=2 SV=2)

HSP 1 Score: 399.1 bits (1024), Expect = 9.7e-110
Identity = 225/628 (35.83%), Positives = 336/628 (53.50%), Query Frame = 0

Query: 55  LRDLLQPLSAPDPPPILSYAPVFQFLTGLNLLKLGHQVHAHMFLRG-LQPTALVGSKMVA 114
           LR+++     PD   I S  P     + L +L+ G ++HA+    G L   + VGS +V 
Sbjct: 290 LREMVLEGVEPDEFTISSVLPA---CSHLEMLRTGKELHAYALKNGSLDENSFVGSALVD 349

Query: 115 FYASSGDIDSSVSVFNQISEPSSLLFNSMIRAYARYGFAERTVSTYFSM-HSWGFTGDYF 174
            Y +   + S   VF+ + +    L+N+MI  Y++    +  +  +  M  S G   +  
Sbjct: 350 MYCNCKQVLSGRRVFDGMFDRKIGLWNAMIAGYSQNEHDKEALLLFIGMEESAGLLANST 409

Query: 175 TFPFVLKSSVDLLSAWMGKCVHGLILRVGLQFDLYVATSLIDMYGKCGEINDAGKVFDNM 234
           T   V+ + V   +    + +HG +++ GL  D +V  +L+DMY + G+I+ A ++F  M
Sbjct: 410 TMAGVVPACVRSGAFSRKEAIHGFVVKRGLDRDRFVQNTLMDMYSRLGKIDIAMRIFGKM 469

Query: 235 TIRDVSAWXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 294
             RD+  W                                                    
Sbjct: 470 EDRDLVTWNTMITGYVFSEHHEDALLLLHKMQNLERKVSKGASRV--------------- 529

Query: 295 XXVKEDSGVRPNWVTIMSVLPACAQSSALERGRQIHELACRMGLNSNPSVLIALTAMYAK 354
                   ++PN +T+M++LP+CA  SAL +G++IH  A +  L ++ +V  AL  MYAK
Sbjct: 530 -------SLKPNSITLMTILPSCAALSALAKGKEIHAYAIKNNLATDVAVGSALVDMYAK 589

Query: 355 CGSLADARNCFDRLNRSEKNLVAWNTMITAYASYGHGLEAVSTFREMIQAGIQPDDITFT 414
           CG L  +R  FD++   +KN++ WN +I AY  +G+G EA+   R M+  G++P+++TF 
Sbjct: 590 CGCLQMSRKVFDQI--PQKNVITWNVIIMAYGMHGNGQEAIDLLRMMMVQGVKPNEVTFI 649

Query: 415 GLLSGCSHS---------------------------------GRAGRLAEASKLVDEMPM 474
            + + CSHS                                 GRAGR+ EA +L++ MP 
Sbjct: 650 SVFAACSHSGMVDEGLRIFYVMKPDYGVEPSSDHYACVVDLLGRAGRIKEAYQLMNMMPR 709

Query: 475 PAGPS-IWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDK 534
               +  W SLL A R + NLE+ E AA+ L  LEP    +YVLL+N+Y+ AG W +  +
Sbjct: 710 DFNKAGAWSSLLGASRIHNNLEIGEIAAQNLIQLEPNVASHYVLLANIYSSAGLWDKATE 769

Query: 535 LRAIMKSQGTKKSPGCSWIEINGKAHMFLGGDTSHPQAKEIYMFLEALPEKMKAAGYVPD 594
           +R  MK QG +K PGCSWIE   + H F+ GD+SHPQ++++  +LE L E+M+  GYVPD
Sbjct: 770 VRRNMKEQGVRKEPGCSWIEHGDEVHKFVAGDSSHPQSEKLSGYLETLWERMRKEGYVPD 829

Query: 595 TSYVLHDISEEEKEFNLLAHSEKLAVAFGILNTSVETVLRVTKNLRICGDCHTAMVFISE 647
           TS VLH++ E+EKE  L  HSEKLA+AFGILNTS  T++RV KNLR+C DCH A  FIS+
Sbjct: 830 TSCVLHNVEEDEKEILLCGHSEKLAIAFGILNTSPGTIIRVAKNLRVCNDCHLATKFISK 889

BLAST of Cla97C05G092010 vs. Swiss-Prot
Match: sp|Q9LW63|PP251_ARATH (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 393.7 bits (1010), Expect = 4.1e-108
Identity = 211/559 (37.75%), Positives = 295/559 (52.77%), Query Frame = 0

Query: 121 IDSSVSVFNQISEPSSLLFNSMIRAYARYGFAERTVSTYFSMHSWGFTGDYFTFPFVLKS 180
           IDS   VF  +     + +N++I  YA+ G  E  +     M +     D FT   VL  
Sbjct: 192 IDSVRRVFEVMPRKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFTLSSVLPI 251

Query: 181 SVDLLSAWMGKCVHGLILRVGLQFDLYVATSLIDMYGKCGEINDAGKVFDNMTIRDVSAW 240
             + +    GK +HG ++R G+  D+Y+ +SL+DMY K   I D+ +VF  +  RD  +W
Sbjct: 252 FSEYVDVIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRLYCRDGISW 311

Query: 241 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVKEDSG 300
                                                                     + 
Sbjct: 312 NSLVAGYVQNGRYNEALRLFRQMV---------------------------------TAK 371

Query: 301 VRPNWVTIMSVLPACAQSSALERGRQIHELACRMGLNSNPSVLIALTAMYAKCGSLADAR 360
           V+P  V   SV+PACA  + L  G+Q+H    R G  SN  +  AL  MY+KCG++  AR
Sbjct: 372 VKPGAVAFSSVIPACAHLATLHLGKQLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAAR 431

Query: 361 NCFDRLNRSEKNLVAWNTMITAYASYGHGLEAVSTFREMIQAGIQPDDITFTGLLSGCSH 420
             FDR+N  ++  V+W  +I  +A +GHG EAVS F EM + G++P+ + F  +L+ CSH
Sbjct: 432 KIFDRMNVLDE--VSWTAIIMGHALHGHGHEAVSLFEEMKRQGVKPNQVAFVAVLTACSH 491

Query: 421 ---------------------------------SGRAGRLAEASKLVDEMPMPAGPSIWG 480
                                             GRAG+L EA   + +M +    S+W 
Sbjct: 492 VGLVDEAWGYFNSMTKVYGLNQELEHYAAVADLLGRAGKLEEAYNFISKMCVEPTGSVWS 551

Query: 481 SLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIMKSQG 540
           +LL++C  ++NLE+AE  A K+F ++ EN G YVL+ NMYA  GRW+E+ KLR  M+ +G
Sbjct: 552 TLLSSCSVHKNLELAEKVAEKIFTVDSENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKG 611

Query: 541 TKKSPGCSWIEINGKAHMFLGGDTSHPQAKEIYMFLEALPEKMKAAGYVPDTSYVLHDIS 600
            +K P CSWIE+  K H F+ GD SHP   +I  FL+A+ E+M+  GYV DTS VLHD+ 
Sbjct: 612 LRKKPACSWIEMKNKTHGFVSGDRSHPSMDKINEFLKAVMEQMEKEGYVADTSGVLHDVD 671

Query: 601 EEEKEFNLLAHSEKLAVAFGILNTSVETVLRVTKNLRICGDCHTAMVFISEIYGREVVVR 647
           EE K   L  HSE+LAVAFGI+NT   T +RVTKN+RIC DCH A+ FIS+I  RE++VR
Sbjct: 672 EEHKRELLFGHSERLAVAFGIINTEPGTTIRVTKNIRICTDCHVAIKFISKITEREIIVR 715

BLAST of Cla97C05G092010 vs. Swiss-Prot
Match: sp|O81767|PP348_ARATH (Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana OX=3702 GN=EMB2758 PE=3 SV=2)

HSP 1 Score: 368.6 bits (945), Expect = 1.4e-100
Identity = 206/592 (34.80%), Positives = 309/592 (52.20%), Query Frame = 0

Query: 89  GHQVHAHMFLRGLQPTALVGSKMVAFYASSGDIDSSVSVFNQISEPSSLLFNSMIRAYAR 148
           G  +H++    GL+    V +K++  YA  G +     VF+++     + +NS+I+AY  
Sbjct: 266 GVTIHSYSIKHGLESELFVSNKLIDLYAEFGRLRDCQKVFDRMYVRDLISWNSIIKAYEL 325

Query: 149 YGFAERTVSTYFSMHSWGFTGDYFTFPFVLKSSVDLLSAWMGKCVHGLILRVG-LQFDLY 208
                R +S +  M       D  T   +      L      + V G  LR G    D+ 
Sbjct: 326 NEQPLRAISLFQEMRLSRIQPDCLTLISLASILSQLGDIRACRSVQGFTLRKGWFLEDIT 385

Query: 209 VATSLIDMYGKCGEINDAGKVFDNMTIRDVSAWXXXXXXXXXXXXXXXXXXXXXXXXXXX 268
           +  +++ MY K G ++ A  VF+ +   DV +W                           
Sbjct: 386 IGNAVVVMYAKLGLVDSARAVFNWLPNTDVISW--------------------------- 445

Query: 269 XXXXXXXXXXXXXXXXXXXXXXXXXXXVKEDSGVRPNWVTIMSVLPACAQSSALERGRQI 328
                                      ++E+  +  N  T +SVLPAC+Q+ AL +G ++
Sbjct: 446 -----NTIISGYAQNGFASEAIEMYNIMEEEGEIAANQGTWVSVLPACSQAGALRQGMKL 505

Query: 329 HELACRMGLNSNPSVLIALTAMYAKCGSLADARNCFDRLNRSEKNLVAWNTMITAYASYG 388
           H    + GL  +  V+ +L  MY KCG L DA + F ++ R   N V WNT+I  +  +G
Sbjct: 506 HGRLLKNGLYLDVFVVTSLADMYGKCGRLEDALSLFYQIPR--VNSVPWNTLIACHGFHG 565

Query: 389 HGLEAVSTFREMIQAGIQPDDITFTGLLSGCSHS-------------------------- 448
           HG +AV  F+EM+  G++PD ITF  LLS CSHS                          
Sbjct: 566 HGEKAVMLFKEMLDEGVKPDHITFVTLLSACSHSGLVDEGQWCFEMMQTDYGITPSLKHY 625

Query: 449 -------GRAGRLAEASKLVDEMPMPAGPSIWGSLLAACRKYRNLEMAETAARKLFVLEP 508
                  GRAG+L  A K +  M +    SIWG+LL+ACR + N+++ + A+  LF +EP
Sbjct: 626 GCMVDMYGRAGQLETALKFIKSMSLQPDASIWGALLSACRVHGNVDLGKIASEHLFEVEP 685

Query: 509 ENTGNYVLLSNMYAEAGRWQEVDKLRAIMKSQGTKKSPGCSWIEINGKAHMFLGGDTSHP 568
           E+ G +VLLSNMYA AG+W+ VD++R+I   +G +K+PG S +E++ K  +F  G+ +HP
Sbjct: 686 EHVGYHVLLSNMYASAGKWEGVDEIRSIAHGKGLRKTPGWSSMEVDNKVEVFYTGNQTHP 745

Query: 569 QAKEIYMFLEALPEKMKAAGYVPDTSYVLHDISEEEKEFNLLAHSEKLAVAFGILNTSVE 628
             +E+Y  L AL  K+K  GYVPD  +VL D+ ++EKE  L++HSE+LA+AF ++ T  +
Sbjct: 746 MYEEMYRELTALQAKLKMIGYVPDHRFVLQDVEDDEKEHILMSHSERLAIAFALIATPAK 805

Query: 629 TVLRVTKNLRICGDCHTAMVFISEIYGREVVVRDVNRFHHFKGGSCSCGDYW 647
           T +R+ KNLR+CGDCH+   FIS+I  RE++VRD NRFHHFK G CSCGDYW
Sbjct: 806 TTIRIFKNLRVCGDCHSVTKFISKITEREIIVRDSNRFHHFKNGVCSCGDYW 823

BLAST of Cla97C05G092010 vs. TAIR10
Match: AT3G49142.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 403.7 bits (1036), Expect = 2.2e-112
Identity = 231/628 (36.78%), Positives = 321/628 (51.11%), Query Frame = 0

Query: 92  VHAHMFLRGLQPTALVGSKMVAFYASSGDIDSSVSVFNQISEPSSLLFNSMIRAYARYGF 151
           VH+ + L  L+  + +G K++  YAS  D+ S+  VF++I E + ++ N MIR+Y   GF
Sbjct: 61  VHSRIILEDLRCNSSLGVKLMRAYASLKDVASARKVFDEIPERNVIIINVMIRSYVNNGF 120

Query: 152 AERTVSTYFSMHSWGFTGDYFTFPFVLKSSVDLLSAWMGKCVHGLILRVGLQFDLYVATS 211
               V  + +M       D++TFP VLK+     +  +G+ +HG   +VGL   L+V   
Sbjct: 121 YGEGVKVFGTMCGCNVRPDHYTFPCVLKACSCSGTIVIGRKIHGSATKVGLSSTLFVGNG 180

Query: 212 LIDMYGKCGEINDAGKVFDNMTIRDVSAW------------------------------- 271
           L+ MYGKCG +++A  V D M+ RDV +W                               
Sbjct: 181 LVSMYGKCGFLSEARLVLDEMSRRDVVSWNSLVVGYAQNQRFDDALEVCREMESVKISHD 240

Query: 272 ----XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVK 331
                                                                       
Sbjct: 241 AGTMASLLPAVSNTTTENVMYVKDMFFKMGKKSLVSWNVMIGVYMKNAMPVEAVELYSRM 300

Query: 332 EDSGVRPNWVTIMSVLPACAQSSALERGRQIHELACRMGLNSNPSVLIALTAMYAKCGSL 391
           E  G  P+ V+I SVLPAC  +SAL  G++IH    R  L  N  +  AL  MYAKCG L
Sbjct: 301 EADGFEPDAVSITSVLPACGDTSALSLGKKIHGYIERKKLIPNLLLENALIDMYAKCGCL 360

Query: 392 ADARNCFDRLNRSEKNLVAWNTMITAYASYGHGLEAVSTFREMIQAGIQPDDITFTGLLS 451
             AR+ F+  N   +++V+W  MI+AY   G G +AV+ F ++  +G+ PD I F   L+
Sbjct: 361 EKARDVFE--NMKSRDVVSWTAMISAYGFSGRGCDAVALFSKLQDSGLVPDSIAFVTTLA 420

Query: 452 GCSHS---------------------------------GRAGRLAEASKLVDEMPMPAGP 511
            CSH+                                 GRAG++ EA + + +M M    
Sbjct: 421 ACSHAGLLEEGRSCFKLMTDHYKITPRLEHLACMVDLLGRAGKVKEAYRFIQDMSMEPNE 480

Query: 512 SIWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIM 571
            +WG+LL ACR + + ++   AA KLF L PE +G YVLLSN+YA+AGRW+EV  +R IM
Sbjct: 481 RVWGALLGACRVHSDTDIGLLAADKLFQLAPEQSGYYVLLSNIYAKAGRWEEVTNIRNIM 540

Query: 572 KSQGTKKSPGCSWIEINGKAHMFLGGDTSHPQAKEIYMFLEALPEKMKAAGYVPDTSYVL 631
           KS+G KK+PG S +E+N   H FL GD SHPQ+ EIY  L+ L +KMK  GYVPD+   L
Sbjct: 541 KSKGLKKNPGASNVEVNRIIHTFLVGDRSHPQSDEIYRELDVLVKKMKELGYVPDSESAL 600

Query: 632 HDISEEEKEFNLLAHSEKLAVAFGILNTSVE-----TVLRVTKNLRICGDCHTAMVFISE 647
           HD+ EE+KE +L  HSEKLA+ F ++NT  E       +R+TKNLRICGDCH A   IS+
Sbjct: 601 HDVEEEDKETHLAVHSEKLAIVFALMNTKEEEEDSNNTIRITKNLRICGDCHVAAKLISQ 660

BLAST of Cla97C05G092010 vs. TAIR10
Match: AT3G46790.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 400.2 bits (1027), Expect = 2.4e-111
Identity = 217/629 (34.50%), Positives = 333/629 (52.94%), Query Frame = 0

Query: 55  LRDLLQPLSAPDPPPILSYAPVFQFLTGLNLLKLGHQVHAHMFLRGLQPTALVGSKMVAF 114
           L+  ++ LS    P   +Y  +       + L    +VH H+   G      + +K++  
Sbjct: 62  LKQAIRVLSQESSPSQQTYELLILCCGHRSSLSDALRVHRHILDNGSDQDPFLATKLIGM 121

Query: 115 YASSGDIDSSVSVFNQISEPSSLLFNSMIRAYARYGFAERTVSTYFSMHSWGFTGDYFTF 174
           Y+  G +D +  VF++  + +  ++N++ RA    G  E  +  Y+ M+  G   D FT+
Sbjct: 122 YSDLGSVDYARKVFDKTRKRTIYVWNALFRALTLAGHGEEVLGLYWKMNRIGVESDRFTY 181

Query: 175 PFVLKSSV----DLLSAWMGKCVHGLILRVGLQFDLYVATSLIDMYGKCGEINDAGKVFD 234
            +VLK+ V     +     GK +H  + R G    +Y+ T+L+DMY + G ++ A  VF 
Sbjct: 182 TYVLKACVASECTVNHLMKGKEIHAHLTRRGYSSHVYIMTTLVDMYARFGCVDYASYVFG 241

Query: 235 NMTIRDVSAWXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 294
            M +R+V +W                                                  
Sbjct: 242 GMPVRNVVSW-------------------------------SAMIACYAKNGKAFEALRT 301

Query: 295 XXXXVKEDSGVRPNWVTIMSVLPACAQSSALERGRQIHELACRMGLNSNPSVLIALTAMY 354
               ++E     PN VT++SVL ACA  +ALE+G+ IH    R GL+S   V+ AL  MY
Sbjct: 302 FREMMRETKDSSPNSVTMVSVLQACASLAALEQGKLIHGYILRRGLDSILPVISALVTMY 361

Query: 355 AKCGSLADARNCFDRLNRSEKNLVAWNTMITAYASYGHGLEAVSTFREMIQAGIQPDDIT 414
            +CG L   +  FDR++  ++++V+WN++I++Y  +G+G +A+  F EM+  G  P  +T
Sbjct: 362 GRCGKLEVGQRVFDRMH--DRDVVSWNSLISSYGVHGYGKKAIQIFEEMLANGASPTPVT 421

Query: 415 FTGLLSGCSHS---------------------------------GRAGRLAEASKLVDEM 474
           F  +L  CSH                                  GRA RL EA+K+V +M
Sbjct: 422 FVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYACMVDLLGRANRLDEAAKMVQDM 481

Query: 475 PMPAGPSIWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVD 534
               GP +WGSLL +CR + N+E+AE A+R+LF LEP+N GNYVLL+++YAEA  W EV 
Sbjct: 482 RTEPGPKVWGSLLGSCRIHGNVELAERASRRLFALEPKNAGNYVLLADIYAEAQMWDEVK 541

Query: 535 KLRAIMKSQGTKKSPGCSWIEINGKAHMFLGGDTSHPQAKEIYMFLEALPEKMKAAGYVP 594
           +++ +++ +G +K PG  W+E+  K + F+  D  +P  ++I+ FL  L E MK  GY+P
Sbjct: 542 RVKKLLEHRGLQKLPGRCWMEVRRKMYSFVSVDEFNPLMEQIHAFLVKLAEDMKEKGYIP 601

Query: 595 DTSYVLHDISEEEKEFNLLAHSEKLAVAFGILNTSVETVLRVTKNLRICGDCHTAMVFIS 647
            T  VL+++  EEKE  +L HSEKLA+AFG++NTS    +R+TKNLR+C DCH    FIS
Sbjct: 602 QTKGVLYELETEEKERIVLGHSEKLALAFGLINTSKGEPIRITKNLRLCEDCHLFTKFIS 657

BLAST of Cla97C05G092010 vs. TAIR10
Match: AT3G57430.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 399.1 bits (1024), Expect = 5.4e-111
Identity = 225/628 (35.83%), Positives = 336/628 (53.50%), Query Frame = 0

Query: 55  LRDLLQPLSAPDPPPILSYAPVFQFLTGLNLLKLGHQVHAHMFLRG-LQPTALVGSKMVA 114
           LR+++     PD   I S  P     + L +L+ G ++HA+    G L   + VGS +V 
Sbjct: 290 LREMVLEGVEPDEFTISSVLPA---CSHLEMLRTGKELHAYALKNGSLDENSFVGSALVD 349

Query: 115 FYASSGDIDSSVSVFNQISEPSSLLFNSMIRAYARYGFAERTVSTYFSM-HSWGFTGDYF 174
            Y +   + S   VF+ + +    L+N+MI  Y++    +  +  +  M  S G   +  
Sbjct: 350 MYCNCKQVLSGRRVFDGMFDRKIGLWNAMIAGYSQNEHDKEALLLFIGMEESAGLLANST 409

Query: 175 TFPFVLKSSVDLLSAWMGKCVHGLILRVGLQFDLYVATSLIDMYGKCGEINDAGKVFDNM 234
           T   V+ + V   +    + +HG +++ GL  D +V  +L+DMY + G+I+ A ++F  M
Sbjct: 410 TMAGVVPACVRSGAFSRKEAIHGFVVKRGLDRDRFVQNTLMDMYSRLGKIDIAMRIFGKM 469

Query: 235 TIRDVSAWXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 294
             RD+  W                                                    
Sbjct: 470 EDRDLVTWNTMITGYVFSEHHEDALLLLHKMQNLERKVSKGASRV--------------- 529

Query: 295 XXVKEDSGVRPNWVTIMSVLPACAQSSALERGRQIHELACRMGLNSNPSVLIALTAMYAK 354
                   ++PN +T+M++LP+CA  SAL +G++IH  A +  L ++ +V  AL  MYAK
Sbjct: 530 -------SLKPNSITLMTILPSCAALSALAKGKEIHAYAIKNNLATDVAVGSALVDMYAK 589

Query: 355 CGSLADARNCFDRLNRSEKNLVAWNTMITAYASYGHGLEAVSTFREMIQAGIQPDDITFT 414
           CG L  +R  FD++   +KN++ WN +I AY  +G+G EA+   R M+  G++P+++TF 
Sbjct: 590 CGCLQMSRKVFDQI--PQKNVITWNVIIMAYGMHGNGQEAIDLLRMMMVQGVKPNEVTFI 649

Query: 415 GLLSGCSHS---------------------------------GRAGRLAEASKLVDEMPM 474
            + + CSHS                                 GRAGR+ EA +L++ MP 
Sbjct: 650 SVFAACSHSGMVDEGLRIFYVMKPDYGVEPSSDHYACVVDLLGRAGRIKEAYQLMNMMPR 709

Query: 475 PAGPS-IWGSLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDK 534
               +  W SLL A R + NLE+ E AA+ L  LEP    +YVLL+N+Y+ AG W +  +
Sbjct: 710 DFNKAGAWSSLLGASRIHNNLEIGEIAAQNLIQLEPNVASHYVLLANIYSSAGLWDKATE 769

Query: 535 LRAIMKSQGTKKSPGCSWIEINGKAHMFLGGDTSHPQAKEIYMFLEALPEKMKAAGYVPD 594
           +R  MK QG +K PGCSWIE   + H F+ GD+SHPQ++++  +LE L E+M+  GYVPD
Sbjct: 770 VRRNMKEQGVRKEPGCSWIEHGDEVHKFVAGDSSHPQSEKLSGYLETLWERMRKEGYVPD 829

Query: 595 TSYVLHDISEEEKEFNLLAHSEKLAVAFGILNTSVETVLRVTKNLRICGDCHTAMVFISE 647
           TS VLH++ E+EKE  L  HSEKLA+AFGILNTS  T++RV KNLR+C DCH A  FIS+
Sbjct: 830 TSCVLHNVEEDEKEILLCGHSEKLAIAFGILNTSPGTIIRVAKNLRVCNDCHLATKFISK 889

BLAST of Cla97C05G092010 vs. TAIR10
Match: AT3G23330.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 393.7 bits (1010), Expect = 2.3e-109
Identity = 211/559 (37.75%), Positives = 295/559 (52.77%), Query Frame = 0

Query: 121 IDSSVSVFNQISEPSSLLFNSMIRAYARYGFAERTVSTYFSMHSWGFTGDYFTFPFVLKS 180
           IDS   VF  +     + +N++I  YA+ G  E  +     M +     D FT   VL  
Sbjct: 192 IDSVRRVFEVMPRKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFTLSSVLPI 251

Query: 181 SVDLLSAWMGKCVHGLILRVGLQFDLYVATSLIDMYGKCGEINDAGKVFDNMTIRDVSAW 240
             + +    GK +HG ++R G+  D+Y+ +SL+DMY K   I D+ +VF  +  RD  +W
Sbjct: 252 FSEYVDVIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRLYCRDGISW 311

Query: 241 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVKEDSG 300
                                                                     + 
Sbjct: 312 NSLVAGYVQNGRYNEALRLFRQMV---------------------------------TAK 371

Query: 301 VRPNWVTIMSVLPACAQSSALERGRQIHELACRMGLNSNPSVLIALTAMYAKCGSLADAR 360
           V+P  V   SV+PACA  + L  G+Q+H    R G  SN  +  AL  MY+KCG++  AR
Sbjct: 372 VKPGAVAFSSVIPACAHLATLHLGKQLHGYVLRGGFGSNIFIASALVDMYSKCGNIKAAR 431

Query: 361 NCFDRLNRSEKNLVAWNTMITAYASYGHGLEAVSTFREMIQAGIQPDDITFTGLLSGCSH 420
             FDR+N  ++  V+W  +I  +A +GHG EAVS F EM + G++P+ + F  +L+ CSH
Sbjct: 432 KIFDRMNVLDE--VSWTAIIMGHALHGHGHEAVSLFEEMKRQGVKPNQVAFVAVLTACSH 491

Query: 421 ---------------------------------SGRAGRLAEASKLVDEMPMPAGPSIWG 480
                                             GRAG+L EA   + +M +    S+W 
Sbjct: 492 VGLVDEAWGYFNSMTKVYGLNQELEHYAAVADLLGRAGKLEEAYNFISKMCVEPTGSVWS 551

Query: 481 SLLAACRKYRNLEMAETAARKLFVLEPENTGNYVLLSNMYAEAGRWQEVDKLRAIMKSQG 540
           +LL++C  ++NLE+AE  A K+F ++ EN G YVL+ NMYA  GRW+E+ KLR  M+ +G
Sbjct: 552 TLLSSCSVHKNLELAEKVAEKIFTVDSENMGAYVLMCNMYASNGRWKEMAKLRLRMRKKG 611

Query: 541 TKKSPGCSWIEINGKAHMFLGGDTSHPQAKEIYMFLEALPEKMKAAGYVPDTSYVLHDIS 600
            +K P CSWIE+  K H F+ GD SHP   +I  FL+A+ E+M+  GYV DTS VLHD+ 
Sbjct: 612 LRKKPACSWIEMKNKTHGFVSGDRSHPSMDKINEFLKAVMEQMEKEGYVADTSGVLHDVD 671

Query: 601 EEEKEFNLLAHSEKLAVAFGILNTSVETVLRVTKNLRICGDCHTAMVFISEIYGREVVVR 647
           EE K   L  HSE+LAVAFGI+NT   T +RVTKN+RIC DCH A+ FIS+I  RE++VR
Sbjct: 672 EEHKRELLFGHSERLAVAFGIINTEPGTTIRVTKNIRICTDCHVAIKFISKITEREIIVR 715

BLAST of Cla97C05G092010 vs. TAIR10
Match: AT4G33990.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 368.6 bits (945), Expect = 7.8e-102
Identity = 206/592 (34.80%), Positives = 309/592 (52.20%), Query Frame = 0

Query: 89  GHQVHAHMFLRGLQPTALVGSKMVAFYASSGDIDSSVSVFNQISEPSSLLFNSMIRAYAR 148
           G  +H++    GL+    V +K++  YA  G +     VF+++     + +NS+I+AY  
Sbjct: 266 GVTIHSYSIKHGLESELFVSNKLIDLYAEFGRLRDCQKVFDRMYVRDLISWNSIIKAYEL 325

Query: 149 YGFAERTVSTYFSMHSWGFTGDYFTFPFVLKSSVDLLSAWMGKCVHGLILRVG-LQFDLY 208
                R +S +  M       D  T   +      L      + V G  LR G    D+ 
Sbjct: 326 NEQPLRAISLFQEMRLSRIQPDCLTLISLASILSQLGDIRACRSVQGFTLRKGWFLEDIT 385

Query: 209 VATSLIDMYGKCGEINDAGKVFDNMTIRDVSAWXXXXXXXXXXXXXXXXXXXXXXXXXXX 268
           +  +++ MY K G ++ A  VF+ +   DV +W                           
Sbjct: 386 IGNAVVVMYAKLGLVDSARAVFNWLPNTDVISW--------------------------- 445

Query: 269 XXXXXXXXXXXXXXXXXXXXXXXXXXXVKEDSGVRPNWVTIMSVLPACAQSSALERGRQI 328
                                      ++E+  +  N  T +SVLPAC+Q+ AL +G ++
Sbjct: 446 -----NTIISGYAQNGFASEAIEMYNIMEEEGEIAANQGTWVSVLPACSQAGALRQGMKL 505

Query: 329 HELACRMGLNSNPSVLIALTAMYAKCGSLADARNCFDRLNRSEKNLVAWNTMITAYASYG 388
           H    + GL  +  V+ +L  MY KCG L DA + F ++ R   N V WNT+I  +  +G
Sbjct: 506 HGRLLKNGLYLDVFVVTSLADMYGKCGRLEDALSLFYQIPR--VNSVPWNTLIACHGFHG 565

Query: 389 HGLEAVSTFREMIQAGIQPDDITFTGLLSGCSHS-------------------------- 448
           HG +AV  F+EM+  G++PD ITF  LLS CSHS                          
Sbjct: 566 HGEKAVMLFKEMLDEGVKPDHITFVTLLSACSHSGLVDEGQWCFEMMQTDYGITPSLKHY 625

Query: 449 -------GRAGRLAEASKLVDEMPMPAGPSIWGSLLAACRKYRNLEMAETAARKLFVLEP 508
                  GRAG+L  A K +  M +    SIWG+LL+ACR + N+++ + A+  LF +EP
Sbjct: 626 GCMVDMYGRAGQLETALKFIKSMSLQPDASIWGALLSACRVHGNVDLGKIASEHLFEVEP 685

Query: 509 ENTGNYVLLSNMYAEAGRWQEVDKLRAIMKSQGTKKSPGCSWIEINGKAHMFLGGDTSHP 568
           E+ G +VLLSNMYA AG+W+ VD++R+I   +G +K+PG S +E++ K  +F  G+ +HP
Sbjct: 686 EHVGYHVLLSNMYASAGKWEGVDEIRSIAHGKGLRKTPGWSSMEVDNKVEVFYTGNQTHP 745

Query: 569 QAKEIYMFLEALPEKMKAAGYVPDTSYVLHDISEEEKEFNLLAHSEKLAVAFGILNTSVE 628
             +E+Y  L AL  K+K  GYVPD  +VL D+ ++EKE  L++HSE+LA+AF ++ T  +
Sbjct: 746 MYEEMYRELTALQAKLKMIGYVPDHRFVLQDVEDDEKEHILMSHSERLAIAFALIATPAK 805

Query: 629 TVLRVTKNLRICGDCHTAMVFISEIYGREVVVRDVNRFHHFKGGSCSCGDYW 647
           T +R+ KNLR+CGDCH+   FIS+I  RE++VRD NRFHHFK G CSCGDYW
Sbjct: 806 TTIRIFKNLRVCGDCHSVTKFISKITEREIIVRDSNRFHHFKNGVCSCGDYW 823

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_004140278.1	0.0e+00	88.66	PREDICTED: pentatricopeptide repeat-containing protein At3g62890-like [Cucumis s...
XP_008456075.1	0.0e+00	88.07	PREDICTED: pentatricopeptide repeat-containing protein At3g62890-like [Cucumis m...
XP_022149333.1	8.6e-309	86.49	pentatricopeptide repeat-containing protein At3g62890-like [Momordica charantia]
XP_022922754.1	1.9e-300	85.86	pentatricopeptide repeat-containing protein At3g62890-like [Cucurbita moschata]
XP_023552018.1	1.6e-299	85.71	pentatricopeptide repeat-containing protein At3g62890-like [Cucurbita pepo subsp...

Match Name	E-value	Identity	Description
tr|A0A0A0KEZ1|A0A0A0KEZ1_CUCSA	0.0e+00	88.66	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G430650 PE=4 SV=1
tr|A0A1S3C2H6|A0A1S3C2H6_CUCME	0.0e+00	88.07	pentatricopeptide repeat-containing protein At3g62890-like OS=Cucumis melo OX=36...
tr|A0A2P5FRB8|A0A2P5FRB8_9ROSA	2.6e-224	70.27	DYW domain containing protein OS=Trema orientalis OX=63057 GN=TorRG33x02_040890 ...
tr|A0A1Q3AZH1|A0A1Q3AZH1_CEPFO	5.7e-216	67.03	PPR domain-containing protein/PPR_2 domain-containing protein/PPR_3 domain-conta...
tr|A0A2P5SUJ2|A0A2P5SUJ2_GOSBA	2.2e-215	66.21	Uncharacterized protein OS=Gossypium barbadense OX=3634 GN=GOBAR_DD00255 PE=4 SV...

Match Name	E-value	Identity	Description
sp|P0C899|PP271_ARATH	3.9e-111	36.78	Putative pentatricopeptide repeat-containing protein At3g49142 OS=Arabidopsis th...
sp|Q9STF3|PP265_ARATH	4.3e-110	34.50	Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidop...
sp|Q7Y211|PP285_ARATH	9.7e-110	35.83	Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidop...
sp|Q9LW63|PP251_ARATH	4.1e-108	37.75	Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th...
sp|O81767|PP348_ARATH	1.4e-100	34.80	Pentatricopeptide repeat-containing protein At4g33990 OS=Arabidopsis thaliana OX...

Match Name	E-value	Identity	Description
AT3G49142.1	2.2e-112	36.78	Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G46790.1	2.4e-111	34.50	Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G57430.1	5.4e-111	35.83	Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G23330.1	2.3e-109	37.75	Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G33990.1	7.8e-102	34.80	Tetratricopeptide repeat (TPR)-like superfamily protein

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term	Definition
GO:0005515	protein binding
GO:0008270	zinc ion binding
Vocabulary: INTERPRO
Term	Definition
IPR011990	TPR-like_helical_dom_sf
IPR032867	DYW_dom
IPR002885	Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0008270 zinc ion binding
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cla97C05G092010.1	Cla97C05G092010.1	mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 267..316; e-value: 1.8E-10; score: 40.8
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 371..419; e-value: 1.7E-12; score: 47.2
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 210..233; e-value: 0.0078; score: 16.3
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 139..167; e-value: 0.025; score: 14.7
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 239..263; e-value: 2.0E-4; score: 19.3
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 139..168; e-value: 0.0025; score: 15.9
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 374..407; e-value: 2.9E-7; score: 28.3
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 269..304; e-value: 3.9E-7; score: 27.8
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 476..510; score: 7.761
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 372..406; score: 11.718
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 267..301; score: 10.72
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 135..169; score: 8.144
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 236..266; score: 9.58
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 304..338; score: 5.821
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 407..444; score: 6.851
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 69..103; score: 5.437
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 339..369; score: 6.314
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 205..235; score: 7.903
IPR032867	DYW domain	PFAM	PF14432	DYW_deaminase	coord: 512..636; e-value: 2.8E-42; score: 143.5
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10		coord: 65..188; e-value: 1.0E-10; score: 43.5
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10		coord: 267..370; e-value: 2.7E-20; score: 74.9
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10		coord: 189..266; e-value: 5.7E-14; score: 54.2
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10		coord: 371..527; e-value: 1.4E-23; score: 85.8
IPR011990	Tetratricopeptide-like helical domain superfamily	SUPERFAMILY	SSF48452	TPR-like	coord: 346..406
IPR011990	Tetratricopeptide-like helical domain superfamily	SUPERFAMILY	SSF48452	TPR-like	coord: 441..496
IPR011990	Tetratricopeptide-like helical domain superfamily	SUPERFAMILY	SSF48452	TPR-like	coord: 270..307
None	No IPR available	PANTHER	PTHR24015:SF728	SUBFAMILY NOT NAMED	coord: 255..422
None	No IPR available	PANTHER	PTHR24015:SF728	SUBFAMILY NOT NAMED	coord: 422..555
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	coord: 255..422
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	coord: 72..253
None	No IPR available	PANTHER	PTHR24015:SF728	SUBFAMILY NOT NAMED	coord: 72..253
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	coord: 422..555
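
The PROSITE PS51375 (PPR) coordinates listed in the InterPro annotations above can be ordered along the protein to see how the repeats tile the sequence. A minimal sketch, with the coordinate pairs copied by hand from those rows; `PPR_HITS` and `summarize_hits` are hypothetical names used only for illustration:

```python
# PROSITE PS51375 (PPR) hits from the InterPro annotations above,
# as (start, end) protein coordinates, both ends inclusive.
PPR_HITS = [
    (476, 510), (372, 406), (267, 301), (135, 169), (236, 266),
    (304, 338), (407, 444), (69, 103), (339, 369), (205, 235),
]


def summarize_hits(hits):
    """Sort hits by start coordinate and report simple coverage figures."""
    ordered = sorted(hits)
    # Adjacent pairs where the next hit starts at or before the previous end.
    overlaps = [
        (prev, nxt)
        for prev, nxt in zip(ordered, ordered[1:])
        if nxt[0] <= prev[1]
    ]
    # Plain sum of hit lengths; equals true coverage only if nothing overlaps.
    residues = sum(end - start + 1 for start, end in ordered)
    return {
        "ordered": ordered,
        "count": len(ordered),
        "first_start": ordered[0][0],
        "last_end": max(end for _, end in ordered),
        "residues_in_hits": residues,
        "overlaps": overlaps,
    }


summary = summarize_hits(PPR_HITS)
for start, end in summary["ordered"]:
    print(f"PPR motif {start}..{end} ({end - start + 1} aa)")
print(summary["count"], "motifs,", summary["residues_in_hits"], "residues")
```

The same approach should work for any of the other sources in the table, provided their coordinates are copied in the same inclusive (start, end) form.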

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene	Organism	Block
Cla97C05G092010	Silver-seed gourd	carwmbB0062
Cla97C05G092010	Cucumber (Gy14) v2	cgybwmbB206
Cla97C05G092010	Cucumber (Gy14) v1	cgywmbB003
Cla97C05G092010	Cucurbita maxima (Rimu)	cmawmbB763
Cla97C05G092010	Cucurbita moschata (Rifu)	cmowmbB735
Cla97C05G092010	Wild cucumber (PI 183967)	cpiwmbB221
Cla97C05G092010	Cucumber (Chinese Long) v3	cucwmbB217
Cla97C05G092010	Cucumber (Chinese Long) v2	cuwmbB215
Cla97C05G092010	Wax gourd	wgowmbB179