Cla97C11G211850 (gene) Watermelon (97103) v2

Name: Cla97C11G211850
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: Pentatricopeptide repeat
Location: Cla97Chr11 : 5197578 .. 5199866 (+)
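
The location above can be turned into a feature span with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for genome browser displays, but not stated in this record). The function names `parse_location` and `span_length` are hypothetical helpers introduced for illustration only.

```python
import re


def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    match = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cla97Chr11 : 5197578 .. 5199866 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 2,289 bp on the plus strand of Cla97Chr11.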
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGAGGAGCGCTAGGAAATTCATCTCCTACTCTAATCTTGTTGAATGAGTTTAATTACCAACATGATTCATATTACCCTTTTAGTTGGGCGGATAAGCGTTTGCGTCAGTGTATAAATGTTAATCCAGTGTTGAAATCATGTATAAGATGTAGGATTACGTATGTGGGTAATGTAGTTTCAATGTTACCAATGAATACTCCACAGTTGAATTTGGTGGTTCAGTCCACTAGGGGCATGAAATTTAGGACTTGTGTTGGGACCCTTTTGAATTGCGAAGAGGATGAGGCAATCGAATTGGTCATTGATGAAGGAGTTGAAGAATCCTCTGGTGAGTGGAAATTGCCTCCTTGGGGAGACATGGCACATCAGGATGAGCCAACCTTTCAATCTGAAGATGCAAACGAATCCAAAATCTTAGAAAGGGAGGCTTTGGAAAATGATAGCAAGGTTCATTTTCTTGAGGAAACTGATAATGTTATGCTATCAAAGCGTATTTTAATTCTCAGTAGAAAAAATAAGGTTAGAAGTGCATTGGAATTATTCAGGTCCATGCAATTAGCTGGTCTTCTGCCAAGTTTGCATGCTTTAAATTCACTTTTAGCTTGTCTTTTGAGGAATGGGCTATTTGATGATGGCTTAAGAATCTTCGAGTTTATGAAGTCAAATAAGTTATCAACAGGACACACTTATAGCCTTGTACTCAAAGCAGTTGCAAATGCTCATGGATTTCTTTCTGCTCTTGAGATGTTTAAGGCATGGGAGCATAAATATGACTTAACACAGTTTGATGCAATTGTTTACAACACAATGATATCGATATGTGGAAAAGATAATAATTGGGTTGAAGCTGAGAGAACATGGAGACTAATGGAGGAAAATGGCTGTAGTGCAACACGCATAACTTATTCTCTATTGGTGAGCACTTTCGTCCGATGTAACCAGAATGAACTTGCAATTGACACTTATGTAAAGATGGTTCAAAATTCTTTTAAACCAGGTTACGATACAATGCAAGCTATTATTGGCGCATCTTCAAAGGAAGGGAGGTGGGATATTGCTTTAAGAGTCTGTCAAGACATGTTGAAATGTGGACTTCAACCTAATTCTGTTGCATTCAATGCCTTGATCAATGCTTTAGGAAAAGCTAAGGAGGTCACTTTAGCATTCAGTATATACAATGTGATGAAATCTATGGGTCATTCACCTGATGTTTATACATGGAATGCACTACTTGGTGCTCTTTACAATGCAAACCGCTACAATGATGCCATTCATCTCTTTGAGTTTGTGAAAAGAGAGGAGAAGGCCCAATTGAATATACATATTTACAATACCATTCTAATGTCTTGTTCAAAGCTTGCGTTATGGGATAGGGCGCTCCAAATTTTATGGGAAATGGAGGCTTCCGGTCTCTCAATTTCGGCATCATCGTATAACATTGTTATTACTACATGTGAGATGGCTAGGAAGCCTGAAATTGCATTACAAGTTTATGAACGCATGATTCACCAAGAGCACACTCCTGATACATTCACTCATTTGTCACTTATCAGATGCTGCATTTGGGGATCTTTATGGGATGAAGTGGAACTACTTCTAAATGTAAGTGCTTACCTATATATAACTGTGTTGTCAGATGATATGACACTAAATTTACCTTCACCCATCAACTTAAGCTTTGGGTCAATCGGTGATTTAAGATATGTTAGTCATGTGTCTTTGTATTGTTTTAAAGAGACATATATGATATCTTCCCCAATAGAAATTTAGTTCAAACTTTTCATCCCTCTAGTTCTCCTTCCATCTTTGGTGAATTTATGTCTTCCATGCTAATTCATTCAATGATGATAGTTTATAAACTCTATTTTCCAAGAATGAGTTGAAAAACATCATTGAAATCCATGTAATTGAATTGGAAAAGGAGGCTTCCAATCAGACCACAGTATGGTTTTTGAGTACTTTGAACATTGTTGTTTTTCCTGATGATCTCGTTTAGC
ATGTCCTTTTTTCACTAAGAATTGTGAGGTTTTGTATCTGATAGTTATTCTTCTGCAGAAGTCTGGACCTGATGTATCTCTATACAATGCTGTCATCCAAGGAATGTGCTTAAGAGGCAAGACTGATTTAGCGAAAAAGCTTTACATGAAGATGCGCAAAAACGGTATCCAACCGGATGGAAAAACACGAGCTTTAATGCTTCAGAACTTGCCAAAGGATCCTGCTAGATTGAAGAACAGGTGGGCTTCTGGTTTCAAGAAAAGACACAGACACTATCATCATAGGTAA

mRNA sequence

ATGAGAGGAGCGCTAGGAAATTCATCTCCTACTCTAATCTTGTTGAATGAGTTTAATTACCAACATGATTCATATTACCCTTTTAGTTGGGCGGATAAGCGTTTGCGTCAGTGTATAAATGTTAATCCAGTGTTGAAATCATGTATAAGATGTAGGATTACGTATGTGGGTAATGTAGTTTCAATGTTACCAATGAATACTCCACAGTTGAATTTGGTGGTTCAGTCCACTAGGGGCATGAAATTTAGGACTTGTGTTGGGACCCTTTTGAATTGCGAAGAGGATGAGGCAATCGAATTGGTCATTGATGAAGGAGTTGAAGAATCCTCTGGTGAGTGGAAATTGCCTCCTTGGGGAGACATGGCACATCAGGATGAGCCAACCTTTCAATCTGAAGATGCAAACGAATCCAAAATCTTAGAAAGGGAGGCTTTGGAAAATGATAGCAAGGTTCATTTTCTTGAGGAAACTGATAATGTTATGCTATCAAAGCGTATTTTAATTCTCAGTAGAAAAAATAAGGTTAGAAGTGCATTGGAATTATTCAGGTCCATGCAATTAGCTGGTCTTCTGCCAAGTTTGCATGCTTTAAATTCACTTTTAGCTTGTCTTTTGAGGAATGGGCTATTTGATGATGGCTTAAGAATCTTCGAGTTTATGAAGTCAAATAAGTTATCAACAGGACACACTTATAGCCTTGTACTCAAAGCAGTTGCAAATGCTCATGGATTTCTTTCTGCTCTTGAGATGTTTAAGGCATGGGAGCATAAATATGACTTAACACAGTTTGATGCAATTGTTTACAACACAATGATATCGATATGTGGAAAAGATAATAATTGGGTTGAAGCTGAGAGAACATGGAGACTAATGGAGGAAAATGGCTGTAGTGCAACACGCATAACTTATTCTCTATTGGTGAGCACTTTCGTCCGATGTAACCAGAATGAACTTGCAATTGACACTTATGTAAAGATGGTTCAAAATTCTTTTAAACCAGGTTACGATACAATGCAAGCTATTATTGGCGCATCTTCAAAGGAAGGGAGGTGGGATATTGCTTTAAGAGTCTGTCAAGACATGTTGAAATGTGGACTTCAACCTAATTCTGTTGCATTCAATGCCTTGATCAATGCTTTAGGAAAAGCTAAGGAGGTCACTTTAGCATTCAGTATATACAATGTGATGAAATCTATGGGTCATTCACCTGATGTTTATACATGGAATGCACTACTTGGTGCTCTTTACAATGCAAACCGCTACAATGATGCCATTCATCTCTTTGAGTTTGTGAAAAGAGAGGAGAAGGCCCAATTGAATATACATATTTACAATACCATTCTAATGTCTTGTTCAAAGCTTGCGTTATGGGATAGGGCGCTCCAAATTTTATGGGAAATGGAGGCTTCCGGTCTCTCAATTTCGGCATCATCGTATAACATTGTTATTACTACATGTGAGATGGCTAGGAAGCCTGAAATTGCATTACAAGTTTATGAACGCATGATTCACCAAGAGCACACTCCTGATACATTCACTCATTTGTCACTTATCAGATGCTGCATTTGGGGATCTTTATGGGATGAAGTGGAACTACTTCTAAATAAGTCTGGACCTGATGTATCTCTATACAATGCTGTCATCCAAGGAATGTGCTTAAGAGGCAAGACTGATTTAGCGAAAAAGCTTTACATGAAGATGCGCAAAAACGGTATCCAACCGGATGGAAAAACACGAGCTTTAATGCTTCAGAACTTGCCAAAGGATCCTGCTAGATTGAAGAACAGGTGGGCTTCTGGTTTCAAGAAAAGACACAGACACTATCATCATAGGTAA

Coding sequence (CDS)

ATGAGAGGAGCGCTAGGAAATTCATCTCCTACTCTAATCTTGTTGAATGAGTTTAATTACCAACATGATTCATATTACCCTTTTAGTTGGGCGGATAAGCGTTTGCGTCAGTGTATAAATGTTAATCCAGTGTTGAAATCATGTATAAGATGTAGGATTACGTATGTGGGTAATGTAGTTTCAATGTTACCAATGAATACTCCACAGTTGAATTTGGTGGTTCAGTCCACTAGGGGCATGAAATTTAGGACTTGTGTTGGGACCCTTTTGAATTGCGAAGAGGATGAGGCAATCGAATTGGTCATTGATGAAGGAGTTGAAGAATCCTCTGGTGAGTGGAAATTGCCTCCTTGGGGAGACATGGCACATCAGGATGAGCCAACCTTTCAATCTGAAGATGCAAACGAATCCAAAATCTTAGAAAGGGAGGCTTTGGAAAATGATAGCAAGGTTCATTTTCTTGAGGAAACTGATAATGTTATGCTATCAAAGCGTATTTTAATTCTCAGTAGAAAAAATAAGGTTAGAAGTGCATTGGAATTATTCAGGTCCATGCAATTAGCTGGTCTTCTGCCAAGTTTGCATGCTTTAAATTCACTTTTAGCTTGTCTTTTGAGGAATGGGCTATTTGATGATGGCTTAAGAATCTTCGAGTTTATGAAGTCAAATAAGTTATCAACAGGACACACTTATAGCCTTGTACTCAAAGCAGTTGCAAATGCTCATGGATTTCTTTCTGCTCTTGAGATGTTTAAGGCATGGGAGCATAAATATGACTTAACACAGTTTGATGCAATTGTTTACAACACAATGATATCGATATGTGGAAAAGATAATAATTGGGTTGAAGCTGAGAGAACATGGAGACTAATGGAGGAAAATGGCTGTAGTGCAACACGCATAACTTATTCTCTATTGGTGAGCACTTTCGTCCGATGTAACCAGAATGAACTTGCAATTGACACTTATGTAAAGATGGTTCAAAATTCTTTTAAACCAGGTTACGATACAATGCAAGCTATTATTGGCGCATCTTCAAAGGAAGGGAGGTGGGATATTGCTTTAAGAGTCTGTCAAGACATGTTGAAATGTGGACTTCAACCTAATTCTGTTGCATTCAATGCCTTGATCAATGCTTTAGGAAAAGCTAAGGAGGTCACTTTAGCATTCAGTATATACAATGTGATGAAATCTATGGGTCATTCACCTGATGTTTATACATGGAATGCACTACTTGGTGCTCTTTACAATGCAAACCGCTACAATGATGCCATTCATCTCTTTGAGTTTGTGAAAAGAGAGGAGAAGGCCCAATTGAATATACATATTTACAATACCATTCTAATGTCTTGTTCAAAGCTTGCGTTATGGGATAGGGCGCTCCAAATTTTATGGGAAATGGAGGCTTCCGGTCTCTCAATTTCGGCATCATCGTATAACATTGTTATTACTACATGTGAGATGGCTAGGAAGCCTGAAATTGCATTACAAGTTTATGAACGCATGATTCACCAAGAGCACACTCCTGATACATTCACTCATTTGTCACTTATCAGATGCTGCATTTGGGGATCTTTATGGGATGAAGTGGAACTACTTCTAAATAAGTCTGGACCTGATGTATCTCTATACAATGCTGTCATCCAAGGAATGTGCTTAAGAGGCAAGACTGATTTAGCGAAAAAGCTTTACATGAAGATGCGCAAAAACGGTATCCAACCGGATGGAAAAACACGAGCTTTAATGCTTCAGAACTTGCCAAAGGATCCTGCTAGATTGAAGAACAGGTGGGCTTCTGGTTTCAAGAAAAGACACAGACACTATCATCATAGGTAA

Protein sequence

MRGALGNSSPTLILLNEFNYQHDSYYPFSWADKRLRQCINVNPVLKSCIRCRITYVGNVVSMLPMNTPQLNLVVQSTRGMKFRTCVGTLLNCEEDEAIELVIDEGVEESSGEWKLPPWGDMAHQDEPTFQSEDANESKILEREALENDSKVHFLEETDNVMLSKRILILSRKNKVRSALELFRSMQLAGLLPSLHALNSLLACLLRNGLFDDGLRIFEFMKSNKLSTGHTYSLVLKAVANAHGFLSALEMFKAWEHKYDLTQFDAIVYNTMISICGKDNNWVEAERTWRLMEENGCSATRITYSLLVSTFVRCNQNELAIDTYVKMVQNSFKPGYDTMQAIIGASSKEGRWDIALRVCQDMLKCGLQPNSVAFNALINALGKAKEVTLAFSIYNVMKSMGHSPDVYTWNALLGALYNANRYNDAIHLFEFVKREEKAQLNIHIYNTILMSCSKLALWDRALQILWEMEASGLSISASSYNIVITTCEMARKPEIALQVYERMIHQEHTPDTFTHLSLIRCCIWGSLWDEVELLLNKSGPDVSLYNAVIQGMCLRGKTDLAKKLYMKMRKNGIQPDGKTRALMLQNLPKDPARLKNRWASGFKKRHRHYHHR
BLAST of Cla97C11G211850 vs. NCBI nr
Match: XP_008459413.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g29290 isoform X1 [Cucumis melo])

HSP 1 Score: 870.9 bits (2249), Expect = 2.5e-249
Identity = 532/611 (87.07%), Positives = 551/611 (90.18%), Query Frame = 0

Query: 1   MRGALGNSSPTLILLNEFNYQHDSYYPFSWADKRLRQCINVNPVLKSCIRCRITYVGNVV 60
           MRG LGNSSPTLILLNEFNYQHDSYYPF   DK LRQCINVNP+LKSC+RC I Y GN V
Sbjct: 1   MRGVLGNSSPTLILLNEFNYQHDSYYPFRREDKHLRQCINVNPMLKSCMRCTIMYDGNAV 60

Query: 61  SMLPMNTPQLNLVVQSTRGMKFRTCVGTLLNCEEDEAIELVIDEGVEESSGEWKLPPWGD 120
           SMLPM+TP+LNLVVQS RGM+FRT VGTLLNC EDEAIELVIDE   ESS EWKLPPWGD
Sbjct: 61  SMLPMSTPRLNLVVQSIRGMQFRTGVGTLLNCGEDEAIELVIDEEGVESSREWKLPPWGD 120

Query: 121 MAHQDEPTFQSEDANESKILEREALENDSKVHFLEETDNVMLSKRILILSRKNKVRSALE 180
           M HQDE  FQSED N  KILE EALEN+SKVHFLEETD V+LSKRILILSRKNKVRSALE
Sbjct: 121 MTHQDEAAFQSEDVNLPKILEGEALENESKVHFLEETDKVLLSKRILILSRKNKVRSALE 180

Query: 181 LFRSMQLAGLLPSLHALNSLLACLLRNGLFDDGLRIFEFMKSNKLSTGHTYSLVLKAVAN 240
           LFRSMQLAG+LP+LHALNSLLACLLRNGLF DGLRIFEFMK N+LSTGHTYSLVLKAVAN
Sbjct: 181 LFRSMQLAGVLPNLHALNSLLACLLRNGLFADGLRIFEFMKLNELSTGHTYSLVLKAVAN 240

Query: 241 AHGFLSALEMFKAWEHKYDLTQFDAIVYNTMISICGKDNNWVEAERTWRLMEENGCSATR 300
           AHGFLSALEMFKAWEHKY LTQFDAIVYNTMISICGKDNNWVEAERTWRLME+NGC+AT 
Sbjct: 241 AHGFLSALEMFKAWEHKYVLTQFDAIVYNTMISICGKDNNWVEAERTWRLMEKNGCTATH 300

Query: 301 ITYSLLVSTFVRCNQNELAIDTYVKMXXXSFKPGYDTXXXXXXXXXXXXXXXXXXXXXXX 360
           ITYSLLVSTFVRCNQNELAID YVKM   SFKPG DT             XXXXXXXXXX
Sbjct: 301 ITYSLLVSTFVRCNQNELAIDAYVKMVQSSFKPGNDTMQAIIGASSKEGKXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420

Query: 421 XXXXXXXXXXXKREEKAQLNIHIYNTILMSCSKLALWDRALQILWEMEASGLSISASSYN 480
           XXXXXXXXXXX REEKAQLNIHIYNTILM CSKL LW+RALQILWEME SGL IS +SYN
Sbjct: 421 XXXXXXXXXXXXREEKAQLNIHIYNTILMCCSKLGLWERALQILWEMEVSGLLISTTSYN 480

Query: 481 IVITTCEMARKPEIALQVYERMIHQEHTPDTFTHLSLIRCCIWGSLWDEVELLLNKSGPD 540
           IV+T CE ARKPEIALQVYERM+HQ+HTPDTFTHLSLIRCCIWGSLWDEVELLLNKSGPD
Sbjct: 481 IVLTACETARKPEIALQVYERMVHQKHTPDTFTHLSLIRCCIWGSLWDEVELLLNKSGPD 540

Query: 541 VSLYNAVIQGMCLRGKTDLAKKLYMKMRKNGIQPDGKTRALMLQNLPKDPARLKNRWASG 600
           VS+YN VIQGMCLRGKTDLAKKLY KMR+N IQ DGKTRALMLQNLPKDPARLKNRWASG
Sbjct: 541 VSVYNVVIQGMCLRGKTDLAKKLYTKMRENSIQSDGKTRALMLQNLPKDPARLKNRWASG 600

Query: 601 FKKRHRHYHHR 612
           FKKR R YHHR
Sbjct: 601 FKKRRRRYHHR 611
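
The identity and positives percentages in each HSP header are the reported counts divided by the alignment length. The sketch below recomputes them from a header line; `hsp_percentages` is a hypothetical helper, and the regular expression is an assumption fitted to the header layout used in this record rather than a general BLAST parser.

```python
import re

# Matches 'Identity = A/B (..%), Positives = C/D (..%)'.
# The 'i' is optional so the misspelling 'Postives' is accepted as well.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )


header = "Identity = 532/611 (87.07%), Positives = 551/611 (90.18%), Query Frame = 0"
print(hsp_percentages(header))
```

For the hit above this reproduces 87.07% identity and 90.18% positives over the 611 aligned positions.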

BLAST of Cla97C11G211850 vs. NCBI nr
Match: XP_016902392.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g29290 isoform X2 [Cucumis melo])

HSP 1 Score: 864.4 bits (2232), Expect = 2.3e-247
Identity = 450/611 (73.65%), Positives = 469/611 (76.76%), Query Frame = 0

Query: 1   MRGALGNSSPTLILLNEFNYQHDSYYPFSWADKRLRQCINVNPVLKSCIRCRITYVGNVV 60
           MRG LGNSSPTLILLNEFNYQHDSYYPF   DK LRQCINVNP+LKSC+RC I Y GN V
Sbjct: 1   MRGVLGNSSPTLILLNEFNYQHDSYYPFRREDKHLRQCINVNPMLKSCMRCTIMYDGNAV 60

Query: 61  SMLPMNTPQLNLVVQSTRGMKFRTCVGTLLNCEEDEAIELVIDEGVEESSGEWKLPPWGD 120
           SMLPM+TP+LNLVVQS RGM+FRT VGTLLNC EDEAIELVIDE   ESS EWKLPPWGD
Sbjct: 61  SMLPMSTPRLNLVVQSIRGMQFRTGVGTLLNCGEDEAIELVIDEEGVESSREWKLPPWGD 120

Query: 121 MAHQDEPTFQSEDANESKILEREALENDSKVHFLEETDNVMLSKRILILSRKNKVRSALE 180
           M HQDE  FQSED N  KILE EALEN+SKVHFLEETD V+LSKRILILSRKNKVRSALE
Sbjct: 121 MTHQDEAAFQSEDVNLPKILEGEALENESKVHFLEETDKVLLSKRILILSRKNKVRSALE 180

Query: 181 LFRSMQLAGLLPSLHALNSLLACLLRNGLFDDGLRIFEFMKSNKLSTGHTYSLVLKAVAN 240
           LFRSMQLAG+LP+LHALNSLLACLLRNGLF DGLRIFEFMK N+LSTGHTYSLVLKAVAN
Sbjct: 181 LFRSMQLAGVLPNLHALNSLLACLLRNGLFADGLRIFEFMKLNELSTGHTYSLVLKAVAN 240

Query: 241 AHGFLSALEMFKAWEHKYDLTQFDAIVYNTMISICGKDNNWVEAERTWRLMEENGCSATR 300
           AHGFLSALEMFKAWEHKY LTQFDAIVYNTMISICGKDNNWVEAERTWRLME+NGC+AT 
Sbjct: 241 AHGFLSALEMFKAWEHKYVLTQFDAIVYNTMISICGKDNNWVEAERTWRLMEKNGCTATH 300

Query: 301 ITYSLLVSTFVRCNQNELAIDTYVKMXXXSFKPGYDTXXXXXXXXXXXXXXXXXXXXXXX 360
           ITYSLLVSTFVRCNQNELAID YVKM   SFKPG                          
Sbjct: 301 ITYSLLVSTFVRCNQNELAIDAYVKMVQSSFKPG-------------------------- 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420
                                                                       
Sbjct: 361 ---------------------KAKEVTLAFSIYNVMKSMGHSPDVYTWNALLGALYKANR 420

Query: 421 XXXXXXXXXXXKREEKAQLNIHIYNTILMSCSKLALWDRALQILWEMEASGLSISASSYN 480
                      KREEKAQLNIHIYNTILM CSKL LW+RALQILWEME SGL IS +SYN
Sbjct: 421 YNDAIHLFGFVKREEKAQLNIHIYNTILMCCSKLGLWERALQILWEMEVSGLLISTTSYN 480

Query: 481 IVITTCEMARKPEIALQVYERMIHQEHTPDTFTHLSLIRCCIWGSLWDEVELLLNKSGPD 540
           IV+T CE ARKPEIALQVYERM+HQ+HTPDTFTHLSLIRCCIWGSLWDEVELLLNKSGPD
Sbjct: 481 IVLTACETARKPEIALQVYERMVHQKHTPDTFTHLSLIRCCIWGSLWDEVELLLNKSGPD 540

Query: 541 VSLYNAVIQGMCLRGKTDLAKKLYMKMRKNGIQPDGKTRALMLQNLPKDPARLKNRWASG 600
           VS+YN VIQGMCLRGKTDLAKKLY KMR+N IQ DGKTRALMLQNLPKDPARLKNRWASG
Sbjct: 541 VSVYNVVIQGMCLRGKTDLAKKLYTKMRENSIQSDGKTRALMLQNLPKDPARLKNRWASG 564

Query: 601 FKKRHRHYHHR 612
           FKKR R YHHR
Sbjct: 601 FKKRRRRYHHR 564

BLAST of Cla97C11G211850 vs. NCBI nr
Match: XP_022134520.1 (pentatricopeptide repeat-containing protein At3g29290 isoform X1 [Momordica charantia] >XP_022134521.1 pentatricopeptide repeat-containing protein At3g29290 isoform X1 [Momordica charantia] >XP_022134522.1 pentatricopeptide repeat-containing protein At3g29290 isoform X1 [Momordica charantia])

HSP 1 Score: 780.8 bits (2015), Expect = 3.4e-222
Identity = 485/611 (79.38%), Positives = 525/611 (85.92%), Query Frame = 0

Query: 1   MRGALGNSSPTLILLNEFNYQHDSYYPFSWADKRLRQCINVNPVLKSCIRCRITYVGNVV 60
           MRG L N SPTL+L NE NYQ DSYYP ++A + L QC NVN  LKS +RC I Y GNV+
Sbjct: 1   MRGVLINPSPTLLLSNELNYQQDSYYPVTYAHRLLHQCRNVNSFLKSRMRCSIIYRGNVI 60

Query: 61  SMLPMNTPQLNLVVQSTRGMKFRTCVGTLLNCEEDEAIELVID-EGVEESSGEWKLPPWG 120
           SML M+ P+LNLV++STR M+F T VGTLLNCEEDEAIELV D EG+E  S EWK PPWG
Sbjct: 61  SMLSMSVPRLNLVIRSTRAMEFSTGVGTLLNCEEDEAIELVTDEEGLE--SFEWKSPPWG 120

Query: 121 DMAHQDEPTFQSEDANESKILEREALENDSKVHFLEETDNVMLSKRILILSRKNKVRSAL 180
           D+   DE +FQSED N+ ++LE EA  N SKVHFLEETD VMLSKRILILSRKNKVRSA+
Sbjct: 121 DIVKHDESSFQSEDTNQPRMLEAEAFINGSKVHFLEETDEVMLSKRILILSRKNKVRSAM 180

Query: 181 ELFRSMQLAGLLPSLHALNSLLACLLRNGLFDDGLRIFEFMKSNKLSTGHTYSLVLKAVA 240
           ELFRSMQLAGLLPSLHALNSLLACLLRN LFDDGLRIF  MK+NKLSTGHTYSL+LKAVA
Sbjct: 181 ELFRSMQLAGLLPSLHALNSLLACLLRNELFDDGLRIFALMKTNKLSTGHTYSLILKAVA 240

Query: 241 NAHGFLSALEMFKAWEHKYDLTQFDAIVYNTMISICGKDNNWVEAERTWRLMEENGCSAT 300
           +A GFLSALEMFKAWE +YD+  FD IVYNTMIS+CGK+NNWVEAERTWRLME NGC AT
Sbjct: 241 DARGFLSALEMFKAWEQEYDVKHFDPIVYNTMISVCGKENNWVEAERTWRLMEANGCGAT 300

Query: 301 RITYSLLVSTFVRCNQNELAIDTYVKMXXXSFKPGYDTXXXXXXXXXXXXXXXXXXXXXX 360
           RITY LLVSTFVRC QNELAIDTYVKM    FKPG DT               XXXXXXX
Sbjct: 301 RITYCLLVSTFVRCKQNELAIDTYVKMVQNHFKPGNDTMQTIIGASLKEGKWDXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420

Query: 421 XXXXXXXXXXXXKREEKAQLNIHIYNTILMSCSKLALWDRALQILWEMEASGLSISASSY 480
           XXXXXXXXXXXX   +K QLNIH+YNTILMSCSKL LWDR +QILWEMEASGL IS +SY
Sbjct: 421 XXXXXXXXXXXXXXXDKPQLNIHVYNTILMSCSKLGLWDRTIQILWEMEASGLLISTASY 480

Query: 481 NIVITTCEMARKPEIALQVYERMIHQEHTPDTFTHLSLIRCCIWGSLWDEVELLLNKSGP 540
           NIVI+ CEMARKP+IALQVYERMIHQ+HTPDTFTHLSL+RCCIWGSLWDEVE+LLNKS P
Sbjct: 481 NIVISACEMARKPKIALQVYERMIHQKHTPDTFTHLSLVRCCIWGSLWDEVEVLLNKSAP 540

Query: 541 DVSLYNAVIQGMCLRGKTDLAKKLYMKMRKNGIQPDGKTRALMLQNLPKDPARLKNRWAS 600
           +VS+YNAVIQGMCLRGKTDLAK+LY KMR+NGIQ DGKTRALMLQNLPKDPAR  NRWAS
Sbjct: 541 NVSVYNAVIQGMCLRGKTDLAKRLYSKMRENGIQADGKTRALMLQNLPKDPARFNNRWAS 600

Query: 601 GFKKRHRHYHH 611
            F+KR+RHYHH
Sbjct: 601 RFRKRNRHYHH 609

BLAST of Cla97C11G211850 vs. NCBI nr
Match: XP_022134523.1 (pentatricopeptide repeat-containing protein At3g29290 isoform X2 [Momordica charantia])

HSP 1 Score: 780.8 bits (2015), Expect = 3.4e-222
Identity = 485/611 (79.38%), Positives = 525/611 (85.92%), Query Frame = 0

Query: 1   MRGALGNSSPTLILLNEFNYQHDSYYPFSWADKRLRQCINVNPVLKSCIRCRITYVGNVV 60
           MRG L N SPTL+L NE NYQ DSYYP ++A + L QC NVN  LKS +RC I Y GNV+
Sbjct: 1   MRGVLINPSPTLLLSNELNYQQDSYYPVTYAHRLLHQCRNVNSFLKSRMRCSIIYRGNVI 60

Query: 61  SMLPMNTPQLNLVVQSTRGMKFRTCVGTLLNCEEDEAIELVID-EGVEESSGEWKLPPWG 120
           SML M+ P+LNLV++STR M+F T VGTLLNCEEDEAIELV D EG+E  S EWK PPWG
Sbjct: 61  SMLSMSVPRLNLVIRSTRAMEFSTGVGTLLNCEEDEAIELVTDEEGLE--SFEWKSPPWG 120

Query: 121 DMAHQDEPTFQSEDANESKILEREALENDSKVHFLEETDNVMLSKRILILSRKNKVRSAL 180
           D+   DE +FQSED N+ ++LE EA  N SKVHFLEETD VMLSKRILILSRKNKVRSA+
Sbjct: 121 DIVKHDESSFQSEDTNQPRMLEAEAFINGSKVHFLEETDEVMLSKRILILSRKNKVRSAM 180

Query: 181 ELFRSMQLAGLLPSLHALNSLLACLLRNGLFDDGLRIFEFMKSNKLSTGHTYSLVLKAVA 240
           ELFRSMQLAGLLPSLHALNSLLACLLRN LFDDGLRIF  MK+NKLSTGHTYSL+LKAVA
Sbjct: 181 ELFRSMQLAGLLPSLHALNSLLACLLRNELFDDGLRIFALMKTNKLSTGHTYSLILKAVA 240

Query: 241 NAHGFLSALEMFKAWEHKYDLTQFDAIVYNTMISICGKDNNWVEAERTWRLMEENGCSAT 300
           +A GFLSALEMFKAWE +YD+  FD IVYNTMIS+CGK+NNWVEAERTWRLME NGC AT
Sbjct: 241 DARGFLSALEMFKAWEQEYDVKHFDPIVYNTMISVCGKENNWVEAERTWRLMEANGCGAT 300

Query: 301 RITYSLLVSTFVRCNQNELAIDTYVKMXXXSFKPGYDTXXXXXXXXXXXXXXXXXXXXXX 360
           RITY LLVSTFVRC QNELAIDTYVKM    FKPG DT               XXXXXXX
Sbjct: 301 RITYCLLVSTFVRCKQNELAIDTYVKMVQNHFKPGNDTMQTIIGASLKEGKWDXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420

Query: 421 XXXXXXXXXXXXKREEKAQLNIHIYNTILMSCSKLALWDRALQILWEMEASGLSISASSY 480
           XXXXXXXXXXXX   +K QLNIH+YNTILMSCSKL LWDR +QILWEMEASGL IS +SY
Sbjct: 421 XXXXXXXXXXXXXXXDKPQLNIHVYNTILMSCSKLGLWDRTIQILWEMEASGLLISTASY 480

Query: 481 NIVITTCEMARKPEIALQVYERMIHQEHTPDTFTHLSLIRCCIWGSLWDEVELLLNKSGP 540
           NIVI+ CEMARKP+IALQVYERMIHQ+HTPDTFTHLSL+RCCIWGSLWDEVE+LLNKS P
Sbjct: 481 NIVISACEMARKPKIALQVYERMIHQKHTPDTFTHLSLVRCCIWGSLWDEVEVLLNKSAP 540

Query: 541 DVSLYNAVIQGMCLRGKTDLAKKLYMKMRKNGIQPDGKTRALMLQNLPKDPARLKNRWAS 600
           +VS+YNAVIQGMCLRGKTDLAK+LY KMR+NGIQ DGKTRALMLQNLPKDPAR  NRWAS
Sbjct: 541 NVSVYNAVIQGMCLRGKTDLAKRLYSKMRENGIQADGKTRALMLQNLPKDPARFNNRWAS 600

Query: 601 GFKKRHRHYHH 611
            F+KR+RHYHH
Sbjct: 601 RFRKRNRHYHH 609

BLAST of Cla97C11G211850 vs. NCBI nr
Match: XP_008459414.1 (PREDICTED: pentatricopeptide repeat-containing protein At3g29290 isoform X3 [Cucumis melo])

HSP 1 Score: 766.1 bits (1977), Expect = 8.7e-218
Identity = 482/550 (87.64%), Positives = 499/550 (90.73%), Query Frame = 0

Query: 62  MLPMNTPQLNLVVQSTRGMKFRTCVGTLLNCEEDEAIELVIDEGVEESSGEWKLPPWGDM 121
           MLPM+TP+LNLVVQS RGM+FRT VGTLLNC EDEAIELVIDE   ESS EWKLPPWGDM
Sbjct: 1   MLPMSTPRLNLVVQSIRGMQFRTGVGTLLNCGEDEAIELVIDEEGVESSREWKLPPWGDM 60

Query: 122 AHQDEPTFQSEDANESKILEREALENDSKVHFLEETDNVMLSKRILILSRKNKVRSALEL 181
            HQDE  FQSED N  KILE EALEN+SKVHFLEETD V+LSKRILILSRKNKVRSALEL
Sbjct: 61  THQDEAAFQSEDVNLPKILEGEALENESKVHFLEETDKVLLSKRILILSRKNKVRSALEL 120

Query: 182 FRSMQLAGLLPSLHALNSLLACLLRNGLFDDGLRIFEFMKSNKLSTGHTYSLVLKAVANA 241
           FRSMQLAG+LP+LHALNSLLACLLRNGLF DGLRIFEFMK N+LSTGHTYSLVLKAVANA
Sbjct: 121 FRSMQLAGVLPNLHALNSLLACLLRNGLFADGLRIFEFMKLNELSTGHTYSLVLKAVANA 180

Query: 242 HGFLSALEMFKAWEHKYDLTQFDAIVYNTMISICGKDNNWVEAERTWRLMEENGCSATRI 301
           HGFLSALEMFKAWEHKY LTQFDAIVYNTMISICGKDNNWVEAERTWRLME+NGC+AT I
Sbjct: 181 HGFLSALEMFKAWEHKYVLTQFDAIVYNTMISICGKDNNWVEAERTWRLMEKNGCTATHI 240

Query: 302 TYSLLVSTFVRCNQNELAIDTYVKMXXXSFKPGYDTXXXXXXXXXXXXXXXXXXXXXXXX 361
           TYSLLVSTFVRCNQNELAID YVKM   SFKPG DT             XXXXXXXXXXX
Sbjct: 241 TYSLLVSTFVRCNQNELAIDAYVKMVQSSFKPGNDTMQAIIGASSKEGKXXXXXXXXXXX 300

Query: 362 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 421
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 422 XXXXXXXXXXKREEKAQLNIHIYNTILMSCSKLALWDRALQILWEMEASGLSISASSYNI 481
           XXXXXXXXXX REEKAQLNIHIYNTILM CSKL LW+RALQILWEME SGL IS +SYNI
Sbjct: 361 XXXXXXXXXXXREEKAQLNIHIYNTILMCCSKLGLWERALQILWEMEVSGLLISTTSYNI 420

Query: 482 VITTCEMARKPEIALQVYERMIHQEHTPDTFTHLSLIRCCIWGSLWDEVELLLNKSGPDV 541
           V+T CE ARKPEIALQVYERM+HQ+HTPDTFTHLSLIRCCIWGSLWDEVELLLNKSGPDV
Sbjct: 421 VLTACETARKPEIALQVYERMVHQKHTPDTFTHLSLIRCCIWGSLWDEVELLLNKSGPDV 480

Query: 542 SLYNAVIQGMCLRGKTDLAKKLYMKMRKNGIQPDGKTRALMLQNLPKDPARLKNRWASGF 601
           S+YN VIQGMCLRGKTDLAKKLY KMR+N IQ DGKTRALMLQNLPKDPARLKNRWASGF
Sbjct: 481 SVYNVVIQGMCLRGKTDLAKKLYTKMRENSIQSDGKTRALMLQNLPKDPARLKNRWASGF 540

Query: 602 KKRHRHYHHR 612
           KKR R YHHR
Sbjct: 541 KKRRRRYHHR 550

BLAST of Cla97C11G211850 vs. TrEMBL
Match: tr|A0A1S3C9M1|A0A1S3C9M1_CUCME (pentatricopeptide repeat-containing protein At3g29290 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103498557 PE=4 SV=1)

HSP 1 Score: 870.9 bits (2249), Expect = 1.7e-249
Identity = 532/611 (87.07%), Positives = 551/611 (90.18%), Query Frame = 0

Query: 1   MRGALGNSSPTLILLNEFNYQHDSYYPFSWADKRLRQCINVNPVLKSCIRCRITYVGNVV 60
           MRG LGNSSPTLILLNEFNYQHDSYYPF   DK LRQCINVNP+LKSC+RC I Y GN V
Sbjct: 1   MRGVLGNSSPTLILLNEFNYQHDSYYPFRREDKHLRQCINVNPMLKSCMRCTIMYDGNAV 60

Query: 61  SMLPMNTPQLNLVVQSTRGMKFRTCVGTLLNCEEDEAIELVIDEGVEESSGEWKLPPWGD 120
           SMLPM+TP+LNLVVQS RGM+FRT VGTLLNC EDEAIELVIDE   ESS EWKLPPWGD
Sbjct: 61  SMLPMSTPRLNLVVQSIRGMQFRTGVGTLLNCGEDEAIELVIDEEGVESSREWKLPPWGD 120

Query: 121 MAHQDEPTFQSEDANESKILEREALENDSKVHFLEETDNVMLSKRILILSRKNKVRSALE 180
           M HQDE  FQSED N  KILE EALEN+SKVHFLEETD V+LSKRILILSRKNKVRSALE
Sbjct: 121 MTHQDEAAFQSEDVNLPKILEGEALENESKVHFLEETDKVLLSKRILILSRKNKVRSALE 180

Query: 181 LFRSMQLAGLLPSLHALNSLLACLLRNGLFDDGLRIFEFMKSNKLSTGHTYSLVLKAVAN 240
           LFRSMQLAG+LP+LHALNSLLACLLRNGLF DGLRIFEFMK N+LSTGHTYSLVLKAVAN
Sbjct: 181 LFRSMQLAGVLPNLHALNSLLACLLRNGLFADGLRIFEFMKLNELSTGHTYSLVLKAVAN 240

Query: 241 AHGFLSALEMFKAWEHKYDLTQFDAIVYNTMISICGKDNNWVEAERTWRLMEENGCSATR 300
           AHGFLSALEMFKAWEHKY LTQFDAIVYNTMISICGKDNNWVEAERTWRLME+NGC+AT 
Sbjct: 241 AHGFLSALEMFKAWEHKYVLTQFDAIVYNTMISICGKDNNWVEAERTWRLMEKNGCTATH 300

Query: 301 ITYSLLVSTFVRCNQNELAIDTYVKMXXXSFKPGYDTXXXXXXXXXXXXXXXXXXXXXXX 360
           ITYSLLVSTFVRCNQNELAID YVKM   SFKPG DT             XXXXXXXXXX
Sbjct: 301 ITYSLLVSTFVRCNQNELAIDAYVKMVQSSFKPGNDTMQAIIGASSKEGKXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420

Query: 421 XXXXXXXXXXXKREEKAQLNIHIYNTILMSCSKLALWDRALQILWEMEASGLSISASSYN 480
           XXXXXXXXXXX REEKAQLNIHIYNTILM CSKL LW+RALQILWEME SGL IS +SYN
Sbjct: 421 XXXXXXXXXXXXREEKAQLNIHIYNTILMCCSKLGLWERALQILWEMEVSGLLISTTSYN 480

Query: 481 IVITTCEMARKPEIALQVYERMIHQEHTPDTFTHLSLIRCCIWGSLWDEVELLLNKSGPD 540
           IV+T CE ARKPEIALQVYERM+HQ+HTPDTFTHLSLIRCCIWGSLWDEVELLLNKSGPD
Sbjct: 481 IVLTACETARKPEIALQVYERMVHQKHTPDTFTHLSLIRCCIWGSLWDEVELLLNKSGPD 540

Query: 541 VSLYNAVIQGMCLRGKTDLAKKLYMKMRKNGIQPDGKTRALMLQNLPKDPARLKNRWASG 600
           VS+YN VIQGMCLRGKTDLAKKLY KMR+N IQ DGKTRALMLQNLPKDPARLKNRWASG
Sbjct: 541 VSVYNVVIQGMCLRGKTDLAKKLYTKMRENSIQSDGKTRALMLQNLPKDPARLKNRWASG 600

Query: 601 FKKRHRHYHHR 612
           FKKR R YHHR
Sbjct: 601 FKKRRRRYHHR 611

BLAST of Cla97C11G211850 vs. TrEMBL
Match: tr|A0A1S4E2E0|A0A1S4E2E0_CUCME (pentatricopeptide repeat-containing protein At3g29290 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103498557 PE=4 SV=1)

HSP 1 Score: 864.4 bits (2232), Expect = 1.6e-247
Identity = 450/611 (73.65%), Positives = 469/611 (76.76%), Query Frame = 0

Query: 1   MRGALGNSSPTLILLNEFNYQHDSYYPFSWADKRLRQCINVNPVLKSCIRCRITYVGNVV 60
           MRG LGNSSPTLILLNEFNYQHDSYYPF   DK LRQCINVNP+LKSC+RC I Y GN V
Sbjct: 1   MRGVLGNSSPTLILLNEFNYQHDSYYPFRREDKHLRQCINVNPMLKSCMRCTIMYDGNAV 60

Query: 61  SMLPMNTPQLNLVVQSTRGMKFRTCVGTLLNCEEDEAIELVIDEGVEESSGEWKLPPWGD 120
           SMLPM+TP+LNLVVQS RGM+FRT VGTLLNC EDEAIELVIDE   ESS EWKLPPWGD
Sbjct: 61  SMLPMSTPRLNLVVQSIRGMQFRTGVGTLLNCGEDEAIELVIDEEGVESSREWKLPPWGD 120

Query: 121 MAHQDEPTFQSEDANESKILEREALENDSKVHFLEETDNVMLSKRILILSRKNKVRSALE 180
           M HQDE  FQSED N  KILE EALEN+SKVHFLEETD V+LSKRILILSRKNKVRSALE
Sbjct: 121 MTHQDEAAFQSEDVNLPKILEGEALENESKVHFLEETDKVLLSKRILILSRKNKVRSALE 180

Query: 181 LFRSMQLAGLLPSLHALNSLLACLLRNGLFDDGLRIFEFMKSNKLSTGHTYSLVLKAVAN 240
           LFRSMQLAG+LP+LHALNSLLACLLRNGLF DGLRIFEFMK N+LSTGHTYSLVLKAVAN
Sbjct: 181 LFRSMQLAGVLPNLHALNSLLACLLRNGLFADGLRIFEFMKLNELSTGHTYSLVLKAVAN 240

Query: 241 AHGFLSALEMFKAWEHKYDLTQFDAIVYNTMISICGKDNNWVEAERTWRLMEENGCSATR 300
           AHGFLSALEMFKAWEHKY LTQFDAIVYNTMISICGKDNNWVEAERTWRLME+NGC+AT 
Sbjct: 241 AHGFLSALEMFKAWEHKYVLTQFDAIVYNTMISICGKDNNWVEAERTWRLMEKNGCTATH 300

Query: 301 ITYSLLVSTFVRCNQNELAIDTYVKMXXXSFKPGYDTXXXXXXXXXXXXXXXXXXXXXXX 360
           ITYSLLVSTFVRCNQNELAID YVKM   SFKPG                          
Sbjct: 301 ITYSLLVSTFVRCNQNELAIDAYVKMVQSSFKPG-------------------------- 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420
                                                                       
Sbjct: 361 ---------------------KAKEVTLAFSIYNVMKSMGHSPDVYTWNALLGALYKANR 420

Query: 421 XXXXXXXXXXXKREEKAQLNIHIYNTILMSCSKLALWDRALQILWEMEASGLSISASSYN 480
                      KREEKAQLNIHIYNTILM CSKL LW+RALQILWEME SGL IS +SYN
Sbjct: 421 YNDAIHLFGFVKREEKAQLNIHIYNTILMCCSKLGLWERALQILWEMEVSGLLISTTSYN 480

Query: 481 IVITTCEMARKPEIALQVYERMIHQEHTPDTFTHLSLIRCCIWGSLWDEVELLLNKSGPD 540
           IV+T CE ARKPEIALQVYERM+HQ+HTPDTFTHLSLIRCCIWGSLWDEVELLLNKSGPD
Sbjct: 481 IVLTACETARKPEIALQVYERMVHQKHTPDTFTHLSLIRCCIWGSLWDEVELLLNKSGPD 540

Query: 541 VSLYNAVIQGMCLRGKTDLAKKLYMKMRKNGIQPDGKTRALMLQNLPKDPARLKNRWASG 600
           VS+YN VIQGMCLRGKTDLAKKLY KMR+N IQ DGKTRALMLQNLPKDPARLKNRWASG
Sbjct: 541 VSVYNVVIQGMCLRGKTDLAKKLYTKMRENSIQSDGKTRALMLQNLPKDPARLKNRWASG 564

Query: 601 FKKRHRHYHHR 612
           FKKR R YHHR
Sbjct: 601 FKKRRRRYHHR 564

BLAST of Cla97C11G211850 vs. TrEMBL
Match: tr|A0A1S3CA80|A0A1S3CA80_CUCME (pentatricopeptide repeat-containing protein At3g29290 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103498557 PE=4 SV=1)

HSP 1 Score: 766.1 bits (1977), Expect = 5.8e-218
Identity = 482/550 (87.64%), Positives = 499/550 (90.73%), Query Frame = 0

Query: 62  MLPMNTPQLNLVVQSTRGMKFRTCVGTLLNCEEDEAIELVIDEGVEESSGEWKLPPWGDM 121
           MLPM+TP+LNLVVQS RGM+FRT VGTLLNC EDEAIELVIDE   ESS EWKLPPWGDM
Sbjct: 1   MLPMSTPRLNLVVQSIRGMQFRTGVGTLLNCGEDEAIELVIDEEGVESSREWKLPPWGDM 60

Query: 122 AHQDEPTFQSEDANESKILEREALENDSKVHFLEETDNVMLSKRILILSRKNKVRSALEL 181
            HQDE  FQSED N  KILE EALEN+SKVHFLEETD V+LSKRILILSRKNKVRSALEL
Sbjct: 61  THQDEAAFQSEDVNLPKILEGEALENESKVHFLEETDKVLLSKRILILSRKNKVRSALEL 120

Query: 182 FRSMQLAGLLPSLHALNSLLACLLRNGLFDDGLRIFEFMKSNKLSTGHTYSLVLKAVANA 241
           FRSMQLAG+LP+LHALNSLLACLLRNGLF DGLRIFEFMK N+LSTGHTYSLVLKAVANA
Sbjct: 121 FRSMQLAGVLPNLHALNSLLACLLRNGLFADGLRIFEFMKLNELSTGHTYSLVLKAVANA 180

Query: 242 HGFLSALEMFKAWEHKYDLTQFDAIVYNTMISICGKDNNWVEAERTWRLMEENGCSATRI 301
           HGFLSALEMFKAWEHKY LTQFDAIVYNTMISICGKDNNWVEAERTWRLME+NGC+AT I
Sbjct: 181 HGFLSALEMFKAWEHKYVLTQFDAIVYNTMISICGKDNNWVEAERTWRLMEKNGCTATHI 240

Query: 302 TYSLLVSTFVRCNQNELAIDTYVKMXXXSFKPGYDTXXXXXXXXXXXXXXXXXXXXXXXX 361
           TYSLLVSTFVRCNQNELAID YVKM   SFKPG DT             XXXXXXXXXXX
Sbjct: 241 TYSLLVSTFVRCNQNELAIDAYVKMVQSSFKPGNDTMQAIIGASSKEGKXXXXXXXXXXX 300

Query: 362 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 421
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 422 XXXXXXXXXXKREEKAQLNIHIYNTILMSCSKLALWDRALQILWEMEASGLSISASSYNI 481
           XXXXXXXXXX REEKAQLNIHIYNTILM CSKL LW+RALQILWEME SGL IS +SYNI
Sbjct: 361 XXXXXXXXXXXREEKAQLNIHIYNTILMCCSKLGLWERALQILWEMEVSGLLISTTSYNI 420

Query: 482 VITTCEMARKPEIALQVYERMIHQEHTPDTFTHLSLIRCCIWGSLWDEVELLLNKSGPDV 541
           V+T CE ARKPEIALQVYERM+HQ+HTPDTFTHLSLIRCCIWGSLWDEVELLLNKSGPDV
Sbjct: 421 VLTACETARKPEIALQVYERMVHQKHTPDTFTHLSLIRCCIWGSLWDEVELLLNKSGPDV 480

Query: 542 SLYNAVIQGMCLRGKTDLAKKLYMKMRKNGIQPDGKTRALMLQNLPKDPARLKNRWASGF 601
           S+YN VIQGMCLRGKTDLAKKLY KMR+N IQ DGKTRALMLQNLPKDPARLKNRWASGF
Sbjct: 481 SVYNVVIQGMCLRGKTDLAKKLYTKMRENSIQSDGKTRALMLQNLPKDPARLKNRWASGF 540

Query: 602 KKRHRHYHHR 612
           KKR R YHHR
Sbjct: 541 KKRRRRYHHR 550

BLAST of Cla97C11G211850 vs. TrEMBL
Match: tr|A0A0A0KVV0|A0A0A0KVV0_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G642190 PE=4 SV=1)

HSP 1 Score: 664.8 bits (1714), Expect = 1.8e-187
Identity = 465/611 (76.10%), Positives = 486/611 (79.54%), Query Frame = 0

Query: 1   MRGALGNSSPTLILLNEFNYQHDSYYPFSWADKRLRQCINVNPVLKSCIRCRITYVGNVV 60
           MRG LGNSSPTL+LLNEFNYQHDS+YPF   DKRLRQCINVNP+LKS +RC I Y GN V
Sbjct: 1   MRGVLGNSSPTLVLLNEFNYQHDSHYPFRREDKRLRQCINVNPMLKSWMRCTIMYDGNAV 60

Query: 61  SMLPMNTPQLNLVVQSTRGMKFRTCVGTLLNCEEDEAIELVIDEGVEESSGEWKLPPWGD 120
           S+LP +TP+LNLVVQSTRG          +NC EDEAIELVIDEGVEESS EWKLPPWGD
Sbjct: 61  SVLPRSTPRLNLVVQSTRG----------VNCGEDEAIELVIDEGVEESSREWKLPPWGD 120

Query: 121 MAHQDEPTFQSEDANESKILEREALENDSKVHFLEETDNVMLSKRILILSRKNKVRSALE 180
           +AHQDE TFQSED N+ KILE + LEN+SK+HFLEETD VMLSKRILILSRKNKVRSALE
Sbjct: 121 IAHQDEATFQSEDVNQPKILEGKVLENESKLHFLEETDKVMLSKRILILSRKNKVRSALE 180

Query: 181 LFRSMQLAGLLPSLHALNSLLACLLRNGLFDDGLRIFEFMKSNKLSTGHTYSLVLKAVAN 240
           L RSMQLAGLLPSLHALNSLLACLLRN LF DGLRIFEFMK N+LSTGHTYSLVLKAVAN
Sbjct: 181 LLRSMQLAGLLPSLHALNSLLACLLRNELFADGLRIFEFMKLNELSTGHTYSLVLKAVAN 240

Query: 241 AHGFLSALEMFKAWEHKYDLTQFDAIVYNTMISICGKDNNWVEAERTWRLMEENGCSATR 300
           AHGFLSALEMFKAWEH+  L QFDAIVYNTMISICGKDNNWVEAERTWRLME+NGCSATR
Sbjct: 241 AHGFLSALEMFKAWEHQCVLAQFDAIVYNTMISICGKDNNWVEAERTWRLMEKNGCSATR 300

Query: 301 ITYSLLVSTFVRCNQNELAIDTYVKMXXXSFKPGYDTXXXXXXXXXXXXXXXXXXXXXXX 360
           ITYSLLVSTFVRCNQNEL        XXX        XXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 ITYSLLVSTFVRCNQNEL-XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 420

Query: 421 XXXXXXXXXXXKREEKAQLNIHIYNTILMSCSKLALWDRALQILWEMEASGLSISASSYN 480
           XXXXXXXXXXX                                                 
Sbjct: 421 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 480

Query: 481 IVITTCEMARKPEIALQVYERMIHQEHTPDTFTHLSLIRCCIWGSLWDEVELLLNKSGPD 540
                               RM+HQ+HTPDTFTHLSLIRCCIWGSLWDEVELLLNKS PD
Sbjct: 481 XXXXXXXXXXXXXXXXXXXXRMVHQKHTPDTFTHLSLIRCCIWGSLWDEVELLLNKSAPD 540

Query: 541 VSLYNAVIQGMCLRGKTDLAKKLYMKMRKNGIQPDGKTRALMLQNLPKDPARLKNRWASG 600
           VS+YN VIQGMCLRGK+DLAKKLY KMR+NGIQPDGKTRALMLQNLPKDPAR KNRWASG
Sbjct: 541 VSVYNVVIQGMCLRGKSDLAKKLYTKMRENGIQPDGKTRALMLQNLPKDPARRKNRWASG 600

Query: 601 FKKRHRHYHHR 612
           FKKR RHYHHR
Sbjct: 601 FKKRQRHYHHR 600

BLAST of Cla97C11G211850 vs. TrEMBL
Match: tr|A0A2P4N501|A0A2P4N501_QUESU (Pentatricopeptide repeat-containing protein OS=Quercus suber OX=58331 GN=CFP56_44533 PE=4 SV=1)

HSP 1 Score: 486.9 bits (1252), Expect = 6.7e-134
Identity = 349/556 (62.77%), Positives = 426/556 (76.62%), Query Frame = 0

Query: 59  VVSMLPMNTPQLNL--VVQSTRGMKFRTCVGTLLNCEEDEAIELVI-DEGVEESSGEWKL 118
           ++SM     P ++L  V +S   +     V   ++CE++E  +L+  DEG  E S E   
Sbjct: 6   IISMWQPKMPNISLLRVDKSRIVVNMVASVDVGVDCEKEEENKLIEGDEGFGEYSVEQIW 65

Query: 119 PPWGDMA-HQD---EPTFQSEDANESKILEREALENDSKVHFLEETDNVMLSKRILILSR 178
           PPWG++A H+D   EP+       ESK   R  + N+S+VHFLEET+  +LS+R+L+LSR
Sbjct: 66  PPWGNVANHKDLDIEPSPTPVSQTESKFPNRMTIANESRVHFLEETNEELLSERLLVLSR 125

Query: 179 KNKVRSALELFRSMQLAGLLPSLHALNSLLACLLRNGLFDDGLRIFEFMKSNKLSTGHTY 238
            NKVRSALELFRSM+L+GL PSLHA NSLL+CL RNG  DDGLR+FEF+K+ K++TGHTY
Sbjct: 126 TNKVRSALELFRSMELSGLCPSLHACNSLLSCLFRNGQPDDGLRVFEFLKTKKITTGHTY 185

Query: 239 SLVLKAVANAHGFLSALEMFKAWEHKYDLTQ-FDAIVYNTMISICGKDNNWVEAERTWRL 298
           SLVLKAVAN+ G  +ALEM      + ++ + FDAI YNTMI++CGK NN +E ER WR 
Sbjct: 186 SLVLKAVANSQGSDTALEMLMEMLGECEVKKDFDAIAYNTMITVCGKVNNGIETERIWRS 245

Query: 299 MEENGCSATRITYSLLVSTFVRCNQNELAIDTYVKMXXXSFKPGYDTXXXXXXXXXXXXX 358
           M+ NGC  TR+TY LL+S FV C+QNELA+D Y +M    F+ G DT            X
Sbjct: 246 MKANGCCPTRVTYCLLISNFVHCSQNELALDAYSEMVQNGFEAGNDTKQAIISVCTKEGX 305

Query: 359 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 418
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 306 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 365

Query: 419 XXXXXXXXXXXXXXXXXXXXXKREEKAQLNIHIYNTILMSCSKLALWDRALQILWEMEAS 478
           XXXXXXXXXXXXXXXXXXX  K+++ +QLN+H+YNT LMSCSKL  WDRALQ+LW+MEA 
Sbjct: 366 XXXXXXXXXXXXXXXXXXXSIKKDQGSQLNLHLYNTALMSCSKLGSWDRALQLLWQMEAF 425

Query: 479 GLSISASSYNIVITTCEMARKPEIALQVYERMIHQEHTPDTFTHLSLIRCCIWGSLWDEV 538
           GLSI  +SYN+VI+ CE+ARKP++ALQVYE M+HQ+  PDTFTHLSL+R CIWGSLWDEV
Sbjct: 426 GLSIPTTSYNLVISACEIARKPKVALQVYEHMVHQKCNPDTFTHLSLLRACIWGSLWDEV 485

Query: 539 ELLLNKSGPDVSLYNAVIQGMCLRGKTDLAKKLYMKMRKNGIQPDGKTRALMLQNLPKDP 598
           E +LN + P+VSLYNAVIQGM LRGK D AK LY KMR++G+QPDGKTRA+MLQNL K+ 
Sbjct: 486 EEILNWAAPNVSLYNAVIQGMYLRGKIDSAKNLYTKMREHGLQPDGKTRAMMLQNLSKNS 545

Query: 599 ARLKNRWASGFKKRHR 607
           AR +  W+ G++ RHR
Sbjct: 546 ARHRRSWSLGYRGRHR 561
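
The percentage on each "Identity" line is the matched-residue count divided by the alignment length (for the hit above, 349/556). A minimal Python sketch for pulling these values out of an HSP header, assuming the two-line layout shown on this page; the function name `parse_hsp` and the returned dictionary keys are illustrative, not part of any BLAST tool:

```python
import re

# Patterns for the two HSP header lines as they appear on this page.
SCORE_RE = re.compile(
    r"HSP\s+(\d+)\s+Score:\s+([\d.]+)\s+bits\s+\((\d+)\),\s+Expect\s+=\s+(\S+)"
)
IDENTITY_RE = re.compile(r"Identity\s+=\s+(\d+)/(\d+)\s+\(([\d.]+)%\)")


def parse_hsp(score_line, identity_line):
    """Parse one HSP header (hypothetical helper) into a dict of numbers."""
    score = SCORE_RE.search(score_line)
    ident = IDENTITY_RE.search(identity_line)
    if score is None or ident is None:
        raise ValueError("line does not look like an HSP header")
    matches, length = int(ident.group(1)), int(ident.group(2))
    return {
        "hsp": int(score.group(1)),
        "bit_score": float(score.group(2)),
        "raw_score": int(score.group(3)),
        "expect": float(score.group(4)),
        "matches": matches,
        "align_length": length,
        # Recomputed from the counts rather than copied from the text.
        "percent_identity": round(100.0 * matches / length, 2),
    }


hsp = parse_hsp(
    "HSP 1 Score: 486.9 bits (1252), Expect = 6.7e-134",
    "Identity = 349/556 (62.77%)",
)
print(hsp["percent_identity"])
```

Recomputing the percentage from the counts gives a quick consistency check against the value printed in parentheses.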

BLAST of Cla97C11G211850 vs. Swiss-Prot
Match: sp|Q84J46|PP262_ARATH (Pentatricopeptide repeat-containing protein At3g29290 OS=Arabidopsis thaliana OX=3702 GN=EMB2076 PE=2 SV=1)

HSP 1 Score: 340.1 bits (871), Expect = 5.0e-92
Identity = 280/464 (60.34%), Positives = 341/464 (73.49%), Query Frame = 0

Query: 126 EPTFQSEDANESKILEREALENDSKVHFLEETDNVMLSKRILILSRKNKVRSALELFRSM 185
           + +F  E+      LE +   + +++HFLEE +   LSKR+  LSR +KVRSALELF SM
Sbjct: 74  DSSFNGENVVCGLELEEKTAGDRNRIHFLEERNEETLSKRLRKLSRLDKVRSALELFDSM 133

Query: 186 QLAGLLPSLHALNSLLACLLRNGLFDDGLRIFEFMKSNKLSTGHTYSLVLKAVANAHGFL 245
           +  GL P+ HA NS L+CLLRNG       +FEFM+  +  TGHTYSL+LKAVA   G  
Sbjct: 134 RFLGLQPNAHACNSFLSCLLRNGDIQKAFTVFEFMRKKENVTGHTYSLMLKAVAEVKGCE 193

Query: 246 SALEMFKAWEHKYDLTQ-FDAIVYNTMISICGKDNNWVEAERTWRLMEENGCSATRITYS 305
           SAL MF+  E +      FD ++YNT IS+CG+ NN  E ER WR+M+ +G   T ITYS
Sbjct: 194 SALRMFRELEREPKRRSCFDVVLYNTAISLCGRINNVYETERIWRVMKGDGHIGTEITYS 253

Query: 306 LLVSTFVRCNQNELAIDTYVKMXXXSFKPGYDTXXXXXXXXXXXXXXXXXXXXXXXXXXX 365
           LLVS FVRC ++ELA+D Y +M         D         XXXXXXXXXXXXXXXXXXX
Sbjct: 254 LLVSIFVRCGRSELALDVYDEMVNNKISLREDAMYAMISACXXXXXXXXXXXXXXXXXXX 313

Query: 366 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 425
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 314 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 373

Query: 426 XXXXXXXKREEKAQLNIHIYNTILMSCSKLALWDRALQILWEMEASGLSISASSYNIVIT 485
           XXXXXXX  E    LN ++YNT ++SC KL  W++A+++L+EME SGL++S SSYN+VI+
Sbjct: 374 XXXXXXXXXENLCCLNEYLYNTAMVSCQKLGYWEKAVKLLYEMEGSGLTVSTSSYNLVIS 433

Query: 486 TCEMARKPEIALQVYERMIHQEHTPDTFTHLSLIRCCIWGSLWDEVELLLNKSGPDVSLY 545
            CE +RK ++AL VYE M  ++  P+TFT+LSL+R CIWGSLWDEVE +L K  PDVSLY
Sbjct: 434 ACEKSRKSKVALLVYEHMAQRDCKPNTFTYLSLVRSCIWGSLWDEVEDILKKVEPDVSLY 493

Query: 546 NAVIQGMCLRGKTDLAKKLYMKMRKNGIQPDGKTRALMLQNLPK 589
           NA I GMCLR +   AK+LY+KMR+ G++PDGKTRA+MLQNL K
Sbjct: 494 NAAIHGMCLRREFKFAKELYVKMREMGLEPDGKTRAMMLQNLKK 537

BLAST of Cla97C11G211850 vs. Swiss-Prot
Match: sp|Q8GZ63|PP397_ARATH (Pentatricopeptide repeat-containing protein At5g25630 OS=Arabidopsis thaliana OX=3702 GN=At5g25630 PE=2 SV=2)

HSP 1 Score: 48.1 bits (113), Expect = 4.0e-04
Identity = 37/143 (25.87%), Positives = 69/143 (48.25%), Query Frame = 0

Query: 444 YNTILMSCSKLALWDRALQILWEMEASGLSISASSYNIVITTCEMARKPEIALQVYERMI 503
           Y T+L + +    +     I+ E+E SG  + +  +N VI     +   E A+Q   +M 
Sbjct: 83  YTTLLAAMTVQKQYGSISSIVSEVEQSGTKLDSIFFNAVINAFSESGNMEDAVQALLKMK 142

Query: 504 HQEHTPDTFTHLSLIR-CCIWGS---LWDEVELLLNKS----GPDVSLYNAVIQGMCLRG 563
                P T T+ +LI+   I G      + ++L+L +     GP++  +N ++Q  C + 
Sbjct: 143 ELGLNPTTSTYNTLIKGYGIAGKPERSSELLDLMLEEGNVDVGPNIRTFNVLVQAWCKKK 202

Query: 564 KTDLAKKLYMKMRKNGIQPDGKT 579
           K + A ++  KM + G++PD  T
Sbjct: 203 KVEEAWEVVKKMEECGVRPDTVT 225

BLAST of Cla97C11G211850 vs. Swiss-Prot
Match: sp|Q9SF38|PP222_ARATH (Pentatricopeptide repeat-containing protein At3g09650, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=HCF152 PE=2 SV=1)

HSP 1 Score: 47.4 bits (111), Expect = 6.7e-04
Identity = 38/145 (26.21%), Positives = 68/145 (46.90%), Query Frame = 0

Query: 444 YNTILMSCSKLALWDRALQILWEM-EASGLSISASSYNIVITTCEMARKPEIALQVYERM 503
           YN +L    K    DRA  +L EM E +G+     SYNI+I  C +      AL  +  M
Sbjct: 490 YNVLLKGYCKQLQIDRAEDLLREMTEDAGIEPDVVSYNIIIDGCILIDDSAGALAFFNEM 549

Query: 504 IHQEHTPDTFTHLSLIRCC-------IWGSLWDEVELLLNKSGPDVSL--YNAVIQGMCL 563
             +   P   ++ +L++         +   ++DE   ++N     V L  +N +++G C 
Sbjct: 550 RTRGIAPTKISYTTLMKAFAMSGQPKLANRVFDE---MMNDPRVKVDLIAWNMLVEGYCR 609

Query: 564 RGKTDLAKKLYMKMRKNGIQPDGKT 579
            G  + A+++  +M++NG  P+  T
Sbjct: 610 LGLIEDAQRVVSRMKENGFYPNVAT 631

BLAST of Cla97C11G211850 vs. TAIR10
Match: AT3G29290.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 340.1 bits (871), Expect = 2.8e-93
Identity = 280/464 (60.34%), Positives = 341/464 (73.49%), Query Frame = 0

Query: 126 EPTFQSEDANESKILEREALENDSKVHFLEETDNVMLSKRILILSRKNKVRSALELFRSM 185
           + +F  E+      LE +   + +++HFLEE +   LSKR+  LSR +KVRSALELF SM
Sbjct: 74  DSSFNGENVVCGLELEEKTAGDRNRIHFLEERNEETLSKRLRKLSRLDKVRSALELFDSM 133

Query: 186 QLAGLLPSLHALNSLLACLLRNGLFDDGLRIFEFMKSNKLSTGHTYSLVLKAVANAHGFL 245
           +  GL P+ HA NS L+CLLRNG       +FEFM+  +  TGHTYSL+LKAVA   G  
Sbjct: 134 RFLGLQPNAHACNSFLSCLLRNGDIQKAFTVFEFMRKKENVTGHTYSLMLKAVAEVKGCE 193

Query: 246 SALEMFKAWEHKYDLTQ-FDAIVYNTMISICGKDNNWVEAERTWRLMEENGCSATRITYS 305
           SAL MF+  E +      FD ++YNT IS+CG+ NN  E ER WR+M+ +G   T ITYS
Sbjct: 194 SALRMFRELEREPKRRSCFDVVLYNTAISLCGRINNVYETERIWRVMKGDGHIGTEITYS 253

Query: 306 LLVSTFVRCNQNELAIDTYVKMXXXSFKPGYDTXXXXXXXXXXXXXXXXXXXXXXXXXXX 365
           LLVS FVRC ++ELA+D Y +M         D         XXXXXXXXXXXXXXXXXXX
Sbjct: 254 LLVSIFVRCGRSELALDVYDEMVNNKISLREDAMYAMISACXXXXXXXXXXXXXXXXXXX 313

Query: 366 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 425
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 314 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 373

Query: 426 XXXXXXXKREEKAQLNIHIYNTILMSCSKLALWDRALQILWEMEASGLSISASSYNIVIT 485
           XXXXXXX  E    LN ++YNT ++SC KL  W++A+++L+EME SGL++S SSYN+VI+
Sbjct: 374 XXXXXXXXXENLCCLNEYLYNTAMVSCQKLGYWEKAVKLLYEMEGSGLTVSTSSYNLVIS 433

Query: 486 TCEMARKPEIALQVYERMIHQEHTPDTFTHLSLIRCCIWGSLWDEVELLLNKSGPDVSLY 545
            CE +RK ++AL VYE M  ++  P+TFT+LSL+R CIWGSLWDEVE +L K  PDVSLY
Sbjct: 434 ACEKSRKSKVALLVYEHMAQRDCKPNTFTYLSLVRSCIWGSLWDEVEDILKKVEPDVSLY 493

Query: 546 NAVIQGMCLRGKTDLAKKLYMKMRKNGIQPDGKTRALMLQNLPK 589
           NA I GMCLR +   AK+LY+KMR+ G++PDGKTRA+MLQNL K
Sbjct: 494 NAAIHGMCLRREFKFAKELYVKMREMGLEPDGKTRAMMLQNLKK 537
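
The At3g29290 alignment is reported with the same bit score (340.1) against Swiss-Prot and TAIR10 but with different E-values (5.0e-92 and 2.8e-93). Under the standard BLAST relation E = m * n * 2^(-S'), where S' is the bit score and m * n is the effective search space, an identical bit score gives an E-value that scales with database size. A rough Python sketch under that assumption; the function name is hypothetical and the inputs are the rounded values printed on this page, so the result is only approximate:

```python
import math


def implied_search_space(bit_score, e_value):
    """Effective search space m*n implied by E = m*n * 2**(-bit_score)."""
    return e_value * 2.0 ** bit_score


# Same HSP (340.1 bits) reported against two databases.
swissprot = implied_search_space(340.1, 5.0e-92)
tair10 = implied_search_space(340.1, 2.8e-93)

# With equal bit scores the ratio reduces to the ratio of the E-values.
ratio = swissprot / tair10
print(f"Swiss-Prot search space is about {ratio:.1f}x that of TAIR10")
assert math.isclose(ratio, 5.0e-92 / 2.8e-93, rel_tol=1e-9)
```

This is why E-values from different databases on this page should not be compared directly, while bit scores can be.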

BLAST of Cla97C11G211850 vs. TAIR10
Match: AT5G25630.2 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 48.1 bits (113), Expect = 2.2e-05
Identity = 37/143 (25.87%), Positives = 69/143 (48.25%), Query Frame = 0

Query: 444 YNTILMSCSKLALWDRALQILWEMEASGLSISASSYNIVITTCEMARKPEIALQVYERMI 503
           Y T+L + +    +     I+ E+E SG  + +  +N VI     +   E A+Q   +M 
Sbjct: 83  YTTLLAAMTVQKQYGSISSIVSEVEQSGTKLDSIFFNAVINAFSESGNMEDAVQALLKMK 142

Query: 504 HQEHTPDTFTHLSLIR-CCIWGS---LWDEVELLLNKS----GPDVSLYNAVIQGMCLRG 563
                P T T+ +LI+   I G      + ++L+L +     GP++  +N ++Q  C + 
Sbjct: 143 ELGLNPTTSTYNTLIKGYGIAGKPERSSELLDLMLEEGNVDVGPNIRTFNVLVQAWCKKK 202

Query: 564 KTDLAKKLYMKMRKNGIQPDGKT 579
           K + A ++  KM + G++PD  T
Sbjct: 203 KVEEAWEVVKKMEECGVRPDTVT 225

BLAST of Cla97C11G211850 vs. TAIR10
Match: AT3G09650.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 47.4 bits (111), Expect = 3.7e-05
Identity = 38/145 (26.21%), Positives = 68/145 (46.90%), Query Frame = 0

Query: 444 YNTILMSCSKLALWDRALQILWEM-EASGLSISASSYNIVITTCEMARKPEIALQVYERM 503
           YN +L    K    DRA  +L EM E +G+     SYNI+I  C +      AL  +  M
Sbjct: 490 YNVLLKGYCKQLQIDRAEDLLREMTEDAGIEPDVVSYNIIIDGCILIDDSAGALAFFNEM 549

Query: 504 IHQEHTPDTFTHLSLIRCC-------IWGSLWDEVELLLNKSGPDVSL--YNAVIQGMCL 563
             +   P   ++ +L++         +   ++DE   ++N     V L  +N +++G C 
Sbjct: 550 RTRGIAPTKISYTTLMKAFAMSGQPKLANRVFDE---MMNDPRVKVDLIAWNMLVEGYCR 609

Query: 564 RGKTDLAKKLYMKMRKNGIQPDGKT 579
            G  + A+++  +M++NG  P+  T
Sbjct: 610 LGLIEDAQRVVSRMKENGFYPNVAT 631

BLAST of Cla97C11G211850 vs. TAIR10
Match: AT1G23450.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 45.1 bits (105), Expect = 1.9e-04
Identity = 34/157 (21.66%), Positives = 74/157 (47.13%), Query Frame = 0

Query: 434 EEKAQLNIHIYNTILMSCSKLALWDRALQILWEMEASGLSISASSYNIVITTCEMARKPE 493
           +E +  ++  YN ++   S+     RA+++  EM + GL  SAS++  V++ C       
Sbjct: 70  DEMSVRDVVTYNLLISGNSRYGCSLRAIELYAEMVSCGLRESASTFPSVLSVCSDELFCR 129

Query: 494 IALQVYERMIHQEHTPDTFTHLSLIRCCIWGSLWD-EVELLLNKSGPDVSLYNAVIQGMC 553
             +QV+ R+I      + F   +L+       L D  ++L       ++++ N +++  C
Sbjct: 130 EGIQVHCRVISLGFGCNMFVRSALVGLYACLRLVDVALKLFDEMLDRNLAVCNLLLRCFC 189

Query: 554 LRGKTDLAKKLYMKMRKNGIQPDGKTRALMLQNLPKD 590
             G++    ++Y++M   G+  +G T   M++    D
Sbjct: 190 QTGESKRLFEVYLRMELEGVAKNGLTYCYMIRGCSHD 226

BLAST of Cla97C11G211850 vs. TAIR10
Match: AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 43.5 bits (101), Expect = 5.4e-04
Identity = 25/78 (32.05%), Positives = 43/78 (55.13%), Query Frame = 0

Query: 443 IYNTILMSCSKLALWDRALQILWEMEASGLSISASSYNIVI-TTCEMARKPEIALQVYER 502
           +++ ++ S S+L+L D+AL I+   +A G      SYN V+  T    R    A  V++ 
Sbjct: 136 VFDLVVKSYSRLSLIDKALSIVHLAQAHGFMPGVLSYNAVLDATIRSKRNISFAENVFKE 195

Query: 503 MIHQEHTPDTFTHLSLIR 520
           M+  + +P+ FT+  LIR
Sbjct: 196 MLESQVSPNVFTYNILIR 213

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_008459413.1	2.5e-249	87.07	PREDICTED: pentatricopeptide repeat-containing protein At3g29290 isoform X1 [Cuc...
XP_016902392.1	2.3e-247	73.65	PREDICTED: pentatricopeptide repeat-containing protein At3g29290 isoform X2 [Cuc...
XP_022134520.1	3.4e-222	79.38	pentatricopeptide repeat-containing protein At3g29290 isoform X1 [Momordica char...
XP_022134523.1	3.4e-222	79.38	pentatricopeptide repeat-containing protein At3g29290 isoform X2 [Momordica char...
XP_008459414.1	8.7e-218	87.64	PREDICTED: pentatricopeptide repeat-containing protein At3g29290 isoform X3 [Cuc...

Match Name	E-value	Identity	Description
tr|A0A1S3C9M1|A0A1S3C9M1_CUCME	1.7e-249	87.07	pentatricopeptide repeat-containing protein At3g29290 isoform X1 OS=Cucumis melo...
tr|A0A1S4E2E0|A0A1S4E2E0_CUCME	1.6e-247	73.65	pentatricopeptide repeat-containing protein At3g29290 isoform X2 OS=Cucumis melo...
tr|A0A1S3CA80|A0A1S3CA80_CUCME	5.8e-218	87.64	pentatricopeptide repeat-containing protein At3g29290 isoform X3 OS=Cucumis melo...
tr|A0A0A0KVV0|A0A0A0KVV0_CUCSA	1.8e-187	76.10	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G642190 PE=4 SV=1
tr|A0A2P4N501|A0A2P4N501_QUESU	6.7e-134	62.77	Pentatricopeptide repeat-containing protein OS=Quercus suber OX=58331 GN=CFP56_4...

Match Name	E-value	Identity	Description
sp|Q84J46|PP262_ARATH	5.0e-92	60.34	Pentatricopeptide repeat-containing protein At3g29290 OS=Arabidopsis thaliana OX...
sp|Q8GZ63|PP397_ARATH	4.0e-04	25.87	Pentatricopeptide repeat-containing protein At5g25630 OS=Arabidopsis thaliana OX...
sp|Q9SF38|PP222_ARATH	6.7e-04	26.21	Pentatricopeptide repeat-containing protein At3g09650, chloroplastic OS=Arabidop...

Match Name	E-value	Identity	Description
AT3G29290.1	2.8e-93	60.34	Pentatricopeptide repeat (PPR) superfamily protein
AT5G25630.2	2.2e-05	25.87	Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G09650.1	3.7e-05	26.21	Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G23450.1	1.9e-04	21.66	Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G39710.1	5.4e-04	32.05	Tetratricopeptide repeat (TPR)-like superfamily protein

The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term	Definition
GO:0005515	protein binding
Vocabulary: INTERPRO
Term	Definition
IPR002885	Pentatricopeptide_repeat
IPR011990	TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0006487 protein N-linked glycosylation
cellular_component GO:0005575 cellular_component
cellular_component GO:0005886 plasma membrane
molecular_function GO:0005515 protein binding
molecular_function GO:0003674 molecular_function
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0008568 microtubule-severing ATPase activity

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cla97C11G211850.1	Cla97C11G211850.1	mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10	-	coord: 153..262; e-value: 6.9E-14; score: 53.9
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10	-	coord: 524..608; e-value: 6.6E-9; score: 37.3
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10	-	coord: 420..523; e-value: 2.3E-22; score: 81.2
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	G3DSA:1.25.40.10	-	coord: 328..419; e-value: 7.9E-23; score: 82.7
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 478..511; e-value: 2.9E-4; score: 18.8
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 197..225; e-value: 2.6E-4; score: 18.9
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 543..575; e-value: 1.1E-7; score: 29.6
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 266..298; e-value: 1.4E-8; score: 32.4
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 371..405; e-value: 5.3E-8; score: 30.6
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	coord: 340..369; e-value: 0.0012; score: 16.9
IPR002885	Pentatricopeptide repeat	PFAM	PF13812	PPR_3	coord: 463..521; e-value: 7.6E-4; score: 19.4
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 340..366; e-value: 0.17; score: 12.1
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 197..224; e-value: 0.0038; score: 17.3
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	coord: 169..189; e-value: 1.1; score: 9.6
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 539..586; e-value: 2.1E-11; score: 43.7
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 264..310; e-value: 7.0E-9; score: 35.6
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	coord: 368..416; e-value: 1.4E-12; score: 47.4
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 264..298; score: 11.148
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 334..368; score: 8.966
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 193..227; score: 9.208
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 299..333; score: 9.109
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 540..574; score: 12.375
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 475..509; score: 9.35
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 369..403; score: 10.885
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 158..192; score: 8.199
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 404..434; score: 8.802
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	coord: 440..474; score: 9.898
None	No IPR available	PANTHER	PTHR24015	FAMILY NOT NAMED	coord: 118..577
None	No IPR available	PANTHER	PTHR24015:SF1076	SUBFAMILY NOT NAMED	coord: 118..577
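
The PROSITE PS51375 hits above are listed out of positional order, which hides how the PPR motifs sit next to each other along the protein. A small Python sketch that sorts the coordinates and merges directly adjacent hits into contiguous blocks; the function name `merge_adjacent` and the `max_gap` parameter are illustrative assumptions, and the coordinates are copied from the table above:

```python
def merge_adjacent(hits, max_gap=0):
    """Merge (start, end) intervals that touch or lie within max_gap residues."""
    merged = []
    for start, end in sorted(hits):
        # A hit extends the current block if it starts no later than
        # one residue (plus the allowed gap) past the block's end.
        if merged and start <= merged[-1][1] + 1 + max_gap:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(block) for block in merged]


# PROSITE PS51375 (PPR) coordinates as reported for Cla97C11G211850.
PROSITE_PPR = [
    (264, 298), (334, 368), (193, 227), (299, 333), (540, 574),
    (475, 509), (369, 403), (158, 192), (404, 434), (440, 474),
]

for start, end in merge_adjacent(PROSITE_PPR):
    print(f"{start}..{end} ({end - start + 1} residues)")
```

With `max_gap=0` the ten hits collapse into four blocks: 158..227, 264..434, 440..509 and 540..574. Raising `max_gap` would join blocks separated by short linkers, which is a modelling choice rather than something the annotation itself states.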