MC08g0997 (gene) Bitter gourd (Dali-11) v1

Overview
Name: MC08g0997
Type: gene
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Pentatricopeptide repeat-containing protein
Location: MC08: 8099994 .. 8105528 (+)
RNA-Seq Expression: MC08g0997
Synteny: MC08g0997
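
The Location field above gives a sequence ID, a start and end coordinate, and a strand. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. It assumes the coordinates are 1-based and inclusive (the record itself does not state this), and the function names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'SEQID: START .. END (STRAND)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

seqid, start, end, strand = parse_location("MC08: 8099994 .. 8105528 (+)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, this locus spans 5535 bp on the plus strand of MC08.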
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATCATAATGCTATATGGTAAGGCTGAGATGATCAAGCATGCTGTTGATACTTTTTATGATATGCACTTATATGGGTGCCGTAGGACTGTGAAATCTTTTAATGCCGTGCTTAAGGTTTTGATGAAGACCCGTGATTTGGGAGCCATTGAGGCATTTTTGAGTGAAGCTCCTGAAAAATTCGATATTGAGTTGGATATTATTTCTGTTAACATTGTTGTTAAGGCTTTTTGTGATATAGGTATTCTAAGTAGAGCTTATCTTCTCATGTTAGAGATGGAGAAAGTGGGAATAAGACCTGATGTGGTTACCTATACGACGTTAATTTCAGCTTTTTACAAGGATAATCGATGCGAAATTAGTAATGGACTGTGGAATCTAATGGTTTTGAGGGGTTGTTTGCCCAATCTTGCTTCTTTCAATGTGAGGATTCAATATTTGGTTGACAGGAGACGAGCGTGGGAGGCTAATAAATTGATGAATGTGATGCGGAATATCGGTATTGTTCCCGATGAGGTTACTTACAATCTGGTAATAAAAGGCTTTTGCCAAGCCGGTTTTTTTGATATGGCCAAAAGGGTTTATTCTGCTCTGCAAGGGAGCGGGTATAAACCTAACGTCAAAATTTACCAAACCATGATTCATTACCTGTGCAGAAGTGGAGATTTCAACCTGGCATATACAATGTGCAAAGATGCCATGAACAGGAATTGGTTTCCAAATATTGATACAATTCATTCATTAATTAAAGGCCTGAAGAAGATGGGACAGCTTGGGAAGGCTCAGTTGATCCTAACATTGGCTAGGAAAAGGGTCCCTCCTTTCTCTCTAACTCAGTTGGAAGCTTTGAATACCACACTGGCCAAGAGTTGAAAAAGAGATTTGTATACTGATGCTCGATAAAATGTAACAGGAAGCTCAAAAGGAGCTCATATTCTTTATTTGGTTCTATTTGTAAATTACAAAGTTAATTGGACACTGATTTTTTATGGGGTTACATCTTCAAAAAGGTTTCTCTTTTTCTCAGTTAATTTATTCTACCACCTCAATAAAGTCTGTTACAAGATGAGAGTTCCGAATGGAGTGGCCGACGTTCACCATATTCTCTGTGAGTTATTGAGCAAGTCTCTTTCTCTCGCTGTCACTGTCATATATACACATATTTAGGCAGCAAACATGCATTTCCACGTCCAAAAAATTCCATCCTCAACTTGATTTGCAATGTTGCCAACTGTTTGTACAAGTAATTCTACATATACAATACACACTTTCTCATTTTCTTATCTATGATGTCTCTTGGAAACCAATATGAATTTAATATTTTCTTCAAGGGGAATTTGTAGAGTTTCTATTATGCATGGACTACAGCCAAAAAAATCTCATAAATATCTGATGCCATGGTTTTGAAGATTCAGCTTATGGCTATTTGCTATCGCTTACTTCTTTTGTTTAGAAGGTGGATTCTTTGTTCTTCTTCTTGTCCTGTTTATGATATCACAGAGATGTTGCAACCGTGTTACAAGTCATGTACGCATCACTTTTTATTTTGTGATGACGACTTTCTGAGTTTTAATCATGTTTGGTCATTCCAGCATCGGTGCTCACCAACGCAATGGCAATAGGAATAAGGGCAAAACCTTTCCTGTTAGATTTTTTTGAATGCTGCATAGGTAATTTCAAATTTTTGATCATCATGAAGTATAGAATATTATTTTTTAAACATGTGAATTTATGAATAACCATTTCTTTCGTAGAACTAAAGCATTAGACCATATTTTGTATTAACCTGTTTGGAATTCTTTCAGTATTGCAACCTTTTTATCTGGTAGTCAACTTATCATAACTTAACTAGTTAAGACATATATTCTTGATGAAAAAGTCGGAAGTTTAAATCCCTACCCCCATTTGTTGTTGAACTAAAAAAAATTCACATTTCTATCTGGTATATAGACGAGAAGCTTAATCTAAGGGCATGGATAAAAGTAGATGGTTTATGAAAATGT
GAAGTACATTTATTTCTCTGCTGTTATGTTGATTATGAATCTGTTGTCCTTATGCCTCCAGGAAGTCAATGCCAGATTGATGATTTCTCAACTTCCCGTGTGTCTAAATCTATCATTTGATTTGGTCGAGAGAACATAAAATAGTAACTGTTCTATACAACTCAGCATGTTGGTGCCTTTCCTGTTTTCCCTCTTTGTGAGCTATCGAGCATCATCGATATGTATAGGGACGGTTCCTTCTGCCTGCTCTGTTGGTCTGACAAGATTTCTTAAGGTGAGAGGCTGTATTTGAGTTTTCGAGCAAGAAACTCGACATCACCTCTCATTCTTCTCTGATTCGATTTATGTTACTTGCAGCTGTTCATGGGCTTATGATCATTGTCATAACACTCAGCGAAAGCTTGTAATCAGCATGCATCTCAAAAGGTCGTTGGTCGATCTCTCAAGCCTCAACTTTCAACCATATGTCCAGGGCATTATACAGACAAGGAGAGAGAGACATCATCTAAAGTTAAATGGGGCTGAAGAACATTAAGGAACAGTGGCTGTCTGTTCTTCAACCTGAATGTTAGATCCTTTATTTCTCACTTGGGATCTCCTGTTAATTTCAGGCGCCTGTGTTTCTCTTGCCATGGATTCTGTTCTTGTCTGAGAGTTTTCCTTTGTATTTGAATGTCACTGGGTAGAGGGAAATTGTTCAATTTGGATTACAGCAAGAGTCTGCCATTTTTTGTAGGATGGAAGTTACCACAAAGTTGAATGTAAATATTTACAGGGGGAAAGTTTAAAGACTGAATCTTTTATTACTGTTCATACCTGATAATAAGAATTAATTAGTAAAGATTTCGGTAAATAATTTTTTTTTAATTAAGTTTTGTTCATATTTTGATTTATATGTTAATTGTTAACCATTAACTTTTCCTGTAATTAATTTTTTTTTCAAACAAAAATCAAGGATATTTCATAGGAGGTAATAATTCAGTTTAGATGGTTATTTTACTTTATTTTTGTTGGTTAAAAATTTTTCATGGTCCTCTGTGTTTTAGTTTGTAACTACATCTAAAAATTTAGATGTAACAATTTTATTAATGTAATTTAAAATTTATATAGGTTGATTCTGTGTTTATCAATTATAAAAACTAAATTTTTATGTGTTAAAATTAGTACTTAGCTTTATAAGATTTTTTACTTGTTATAATTCGAATTATAACAATTAAATTGTTACATATTAAAATTATTAAATTGTTACGTATTAAAATTTAGGGAGTTAAATTACTGACCTCGACTAAATTAAAATTCGAATCGACATGGAATGAAATCGATAGGTTTTTTTGGGAATTTGAACCAGATATATTTGCCGTTGAAATCGATGGAATTGAATACGTTGATTATTTTTCAGTTGTGAATTGTATAAAACTACAATGGAAGGCATCTTGAATCTTCCCGCTAAGAGCTTTACACAAGCCTCTAGCGCGAATAGCTCTCAAGTTGGCTTTCTTCTTCTGCACCAACAGGAGCACAAGCATTTTCCTGAGCGGGGAAACTCCATTTTTGTGCAGAGTGATCGCTTTCAGACCCACAAATCTCATCCTTCAGAATCATCGATTCCTCTTACATGTCTAAAGCCCTTCTCTCGCGAATCAAGCCCCTTCGCAACCTCAAACCGAAACCATCTTCCCCTTTCTCCTTTCCTCTCAAATGTGACATCAAGAAGCTTGTCAATGACACCATTAAAATTCTCAAGTCCCAGGAGAAGTGGGAGCAATCCCTTGAAACCCAGTTCAATGAATCCGATATACCCGTCATCGACATTAGTCATTTCGTTTTGGACCGAATTGATGATGTAGAACTGGGTTTGAAGTTCTTCGATTGGGCGTCAAAGAATTCGAGCTCCTGTTCTTTGAATGGGAGTGCCTACTCATCGCTGTTGAAACTTCTATCGAGGTTTAGAGTGTTTCCGGAGATCGAATTCACACTCGAAGATATGAGAACTAAGGAAA
TTGTCCCGACCCGTGACGCACTGAGTAATGTGCTCTGTGCATATGCGGATTTGGGGTTCGTTGATAAGGCTCTTGTGTTCTATCATGGCGTCGTTAAGTTGCACAACAGTCTTCCAAGTACGTATGCTTGTAATTCCCTGCTCAATTTGCTTGTTAAACACCGTAGGCTTGGAACTGCACACCAACTGTATGATGAAATGGTCAAAAGAGATAACGGCGACGATAAATGTACGGATAACTATACTACTTGTATTATGGTGAGGGGCTTATGTTTGGAAGGTAGAACCGAGGATGGAAGGAAGCTGATTGAATCCAGATGGGGGAAAGGCTGTGTACCGAACATTGTGTTTTACAATACACTCATTGATGGATATTGCAAGAAAGGTGAGGTTGGAAGTGCTTATGAACTTTTTATTGAATTGAAGCTGAAAGGATTTGTACCTACATTAGAAACTTTTGGTTCCATGGTAAATGGCTTTTGCAAGACGGGAAACTTTGAAGCTATTGATCTTCTTTTGATGGAAATGAAAGATAGGGGCTTGAGTGTTAGTGTTCAAGTGTATAATAACATTATTGATGCTCAATATAAGCTTGGTTGTGACATTAGAGCAAAGGATATACTTAAAGAAACGGCTGAGAATTGCTGTGAACCAGATCTTGTGACTTATAATACTCTAATCAACTATTTATGCAGAGGTGGGGAGGTCATGGAAGCTGAGAAGATCTTGGAACAAGCAATAAAGAGAGGAATGGTGCCGAATAAGTTCACTTATACTCCGCTTGTTCATGCCTATTGTAAACAAGGGGAATATTATAGGGCCTCAGATTTACTTATTGAGATGTCAAAAAAAGGACATAAAGTTGATATGGTTTCGTATGGAGCTTTAATTCATGGACTTGTAGTTGCAGGGGAAGTCGATAATGCTATGACTATCCGGGACAGAATGATGGAAAGAGGGGTTTTACCTGATGCCAATATCTACAATGTTTTGATGAATGGACTTTTCAAGAAAGGGAATCTTTCCATGGCCAAGGTGATGCTTTCTGAGATGCTTGACCAAAATATAGCACCTGATGCATTTATTTATGCAACTTTAGTGGATGGGTTCATTAGGCATGACAACCTTGATGAGGCCAAGAAACTCTTTCAGCTCACTATAGAAAAGGGTATAGACCCAGGTGTTGTGGGATATAATTCCATGATCAAAGGTTTCTGTAAATTCGGGATGATGGAAGATGCAGTTTTGTGCATTGATAGAATGAGGAGTGCCCGTCATGTTCCTGATGTCTTTACTTTCTCCACCATAATTGATGGGTATGTAAAACAATGTGACTTGTACGCTGCACTGAAGATCTTTGGACTGATGTTGAAGCAGAGTTGCAAACCAAATGTTGTCACTTACACATCTTTGATCAATGGATATTGCCATAAGGGAGAATTGAAGATAGCTGAAAAACTTTTTAGCTTAATGCAATCTCATGGTTTGGAGCCTAGTGTCGTCACATACGGTGTACTTATACGGAGCCTTTGCAAA

mRNA sequence

ATCATAATGCTATATGGTAAGGCTGAGATGATCAAGCATGCTGTTGATACTTTTTATGATATGCACTTATATGGGTGCCGTAGGACTGTGAAATCTTTTAATGCCGTGCTTAAGGTTTTGATGAAGACCCGTGATTTGGGAGCCATTGAGGCATTTTTGAGTGAAGCTCCTGAAAAATTCGATATTGAGTTGGATATTATTTCTGTTAACATTGTTGTTAAGGCTTTTTGTGATATAGGTATTCTAAGTAGAGCTTATCTTCTCATGTTAGAGATGGAGAAAGTGGGAATAAGACCTGATGTGGTTACCTATACGACGTTAATTTCAGCTTTTTACAAGGATAATCGATGCGAAATTAGTAATGGACTGTGGAATCTAATGGTTTTGAGGGGTTGTTTGCCCAATCTTGCTTCTTTCAATGTGAGGATTCAATATTTGGTTGACAGGAGACGAGCGTGGGAGGCTAATAAATTGATGAATGTGATGCGGAATATCGGTATTGTTCCCGATGAGGTTACTTACAATCTGGTAATAAAAGGCTTTTGCCAAGCCGGTTTTTTTGATATGGCCAAAAGGGTTTATTCTGCTCTGCAAGGGAGCGGGTATAAACCTAACGTCAAAATTTACCAAACCATGATTCATTACCTGTGCAGAAGTGGAGATTTCAACCTGGCATATACAATGTGCAAAGATGCCATGAACAGGAATTGGTTTCCAAATATTGATACAATTCATTCATTAATTAAAGGCCTGAAGAAGATGGGACAGCTTGGGAAGGCTCAAGATAACGGCGACGATAAATGTACGGATAACTATACTACTTGTATTATGGTGAGGGGCTTATGTTTGGAAGGTAGAACCGAGGATGGAAGGAAGCTGATTGAATCCAGATGGGGGAAAGGCTGTGTACCGAACATTGTGTTTTACAATACACTCATTGATGGATATTGCAAGAAAGGTGAGGTTGGAAGTGCTTATGAACTTTTTATTGAATTGAAGCTGAAAGGATTTGTACCTACATTAGAAACTTTTGGTTCCATGGTAAATGGCTTTTGCAAGACGGGAAACTTTGAAGCTATTGATCTTCTTTTGATGGAAATGAAAGATAGGGGCTTGAGTGTTAGTGTTCAAGTGTATAATAACATTATTGATGCTCAATATAAGCTTGGTTGTGACATTAGAGCAAAGGATATACTTAAAGAAACGGCTGAGAATTGCTGTGAACCAGATCTTGTGACTTATAATACTCTAATCAACTATTTATGCAGAGGTGGGGAGGTCATGGAAGCTGAGAAGATCTTGGAACAAGCAATAAAGAGAGGAATGGTGCCGAATAAGTTCACTTATACTCCGCTTGTTCATGCCTATTGTAAACAAGGGGAATATTATAGGGCCTCAGATTTACTTATTGAGATGTCAAAAAAAGGACATAAAGTTGATATGGTTTCGTATGGAGCTTTAATTCATGGACTTGTAGTTGCAGGGGAAGTCGATAATGCTATGACTATCCGGGACAGAATGATGGAAAGAGGGGTTTTACCTGATGCCAATATCTACAATGTTTTGATGAATGGACTTTTCAAGAAAGGGAATCTTTCCATGGCCAAGGTGATGCTTTCTGAGATGCTTGACCAAAATATAGCACCTGATGCATTTATTTATGCAACTTTAGTGGATGGGTTCATTAGGCATGACAACCTTGATGAGGCCAAGAAACTCTTTCAGCTCACTATAGAAAAGGGTATAGACCCAGGTGTTGTGGGATATAATTCCATGATCAAAGGTTTCTGTAAATTCGGGATGATGGAAGATGCAGTTTTGTGCATTGATAGAATGAGGAGTGCCCGTCATGTTCCTGATGTCTTTACTTTCTCCACCATAATTGATGGGTATGTAAAACAATGTGACTTGTACGCTGCACTGAAGATCTTTGGACTGATGTTGAAGCAGAGTTGCAAACCAAATGTTGTCACTTACACATCTTTGATCAATGGATATTG
CCATAAGGGAGAATTGAAGATAGCTGAAAAACTTTTTAGCTTAATGCAATCTCATGGTTTGGAGCCTAGTGTCGTCACATACGGTGTACTTATACGGAGCCTTTGCAAA

Coding sequence (CDS)

ATCATAATGCTATATGGTAAGGCTGAGATGATCAAGCATGCTGTTGATACTTTTTATGATATGCACTTATATGGGTGCCGTAGGACTGTGAAATCTTTTAATGCCGTGCTTAAGGTTTTGATGAAGACCCGTGATTTGGGAGCCATTGAGGCATTTTTGAGTGAAGCTCCTGAAAAATTCGATATTGAGTTGGATATTATTTCTGTTAACATTGTTGTTAAGGCTTTTTGTGATATAGGTATTCTAAGTAGAGCTTATCTTCTCATGTTAGAGATGGAGAAAGTGGGAATAAGACCTGATGTGGTTACCTATACGACGTTAATTTCAGCTTTTTACAAGGATAATCGATGCGAAATTAGTAATGGACTGTGGAATCTAATGGTTTTGAGGGGTTGTTTGCCCAATCTTGCTTCTTTCAATGTGAGGATTCAATATTTGGTTGACAGGAGACGAGCGTGGGAGGCTAATAAATTGATGAATGTGATGCGGAATATCGGTATTGTTCCCGATGAGGTTACTTACAATCTGGTAATAAAAGGCTTTTGCCAAGCCGGTTTTTTTGATATGGCCAAAAGGGTTTATTCTGCTCTGCAAGGGAGCGGGTATAAACCTAACGTCAAAATTTACCAAACCATGATTCATTACCTGTGCAGAAGTGGAGATTTCAACCTGGCATATACAATGTGCAAAGATGCCATGAACAGGAATTGGTTTCCAAATATTGATACAATTCATTCATTAATTAAAGGCCTGAAGAAGATGGGACAGCTTGGGAAGGCTCAAGATAACGGCGACGATAAATGTACGGATAACTATACTACTTGTATTATGGTGAGGGGCTTATGTTTGGAAGGTAGAACCGAGGATGGAAGGAAGCTGATTGAATCCAGATGGGGGAAAGGCTGTGTACCGAACATTGTGTTTTACAATACACTCATTGATGGATATTGCAAGAAAGGTGAGGTTGGAAGTGCTTATGAACTTTTTATTGAATTGAAGCTGAAAGGATTTGTACCTACATTAGAAACTTTTGGTTCCATGGTAAATGGCTTTTGCAAGACGGGAAACTTTGAAGCTATTGATCTTCTTTTGATGGAAATGAAAGATAGGGGCTTGAGTGTTAGTGTTCAAGTGTATAATAACATTATTGATGCTCAATATAAGCTTGGTTGTGACATTAGAGCAAAGGATATACTTAAAGAAACGGCTGAGAATTGCTGTGAACCAGATCTTGTGACTTATAATACTCTAATCAACTATTTATGCAGAGGTGGGGAGGTCATGGAAGCTGAGAAGATCTTGGAACAAGCAATAAAGAGAGGAATGGTGCCGAATAAGTTCACTTATACTCCGCTTGTTCATGCCTATTGTAAACAAGGGGAATATTATAGGGCCTCAGATTTACTTATTGAGATGTCAAAAAAAGGACATAAAGTTGATATGGTTTCGTATGGAGCTTTAATTCATGGACTTGTAGTTGCAGGGGAAGTCGATAATGCTATGACTATCCGGGACAGAATGATGGAAAGAGGGGTTTTACCTGATGCCAATATCTACAATGTTTTGATGAATGGACTTTTCAAGAAAGGGAATCTTTCCATGGCCAAGGTGATGCTTTCTGAGATGCTTGACCAAAATATAGCACCTGATGCATTTATTTATGCAACTTTAGTGGATGGGTTCATTAGGCATGACAACCTTGATGAGGCCAAGAAACTCTTTCAGCTCACTATAGAAAAGGGTATAGACCCAGGTGTTGTGGGATATAATTCCATGATCAAAGGTTTCTGTAAATTCGGGATGATGGAAGATGCAGTTTTGTGCATTGATAGAATGAGGAGTGCCCGTCATGTTCCTGATGTCTTTACTTTCTCCACCATAATTGATGGGTATGTAAAACAATGTGACTTGTACGCTGCACTGAAGATCTTTGGACTGATGTTGAAGCAGAGTTGCAAACCAAATGTTGTCACTTACACATCTTTGATCAATGGATATTG
CCATAAGGGAGAATTGAAGATAGCTGAAAAACTTTTTAGCTTAATGCAATCTCATGGTTTGGAGCCTAGTGTCGTCACATACGGTGTACTTATACGGAGCCTTTGCAAA

Protein sequence

IIMLYGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKFDIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPDVVTYTTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKAQDNGDDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDILKETAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLCK
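
The protein sequence above corresponds to a codon-by-codon translation of the coding sequence. The sketch below shows one way to check this on a short prefix. It assumes the standard nuclear genetic code and reading frame 0 from the first base of the CDS; `translate` is a hypothetical helper written for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))

# First 30 bases of the CDS listed above.
print(translate("ATCATAATGCTATATGGTAAGGCTGAGATG"))
```

The output for this prefix is `IIMLYGKAEM`, which matches the first ten residues of the protein sequence.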
Homology
BLAST of MC08g0997 vs. ExPASy Swiss-Prot
Match: Q9SSR4 (Pentatricopeptide repeat-containing protein At1g52620 OS=Arabidopsis thaliana OX=3702 GN=At1g52620 PE=2 SV=1)

HSP 1 Score: 548.5 bits (1412), Expect = 1.1e-154
Identity = 267/523 (51.05%), Positives = 366/523 (69.98%), Query Frame = 0

Query: 187 FDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMN-RNWFPNIDTIH 246
           F+  + V   L+    K   +    ++H    SG  + A  +    +   +  P++   +
Sbjct: 115 FNEIEDVLGNLRNENVKLTHEALSHVLHAYAESGSLSKAVEIYDYVVELYDSVPDVIACN 174

Query: 247 SLIKGLKKMGQLGKAQDNGDDKC-----TDNYTTCIMVRGLCLEGRTEDGRKLIESRWGK 306
           SL+  L K  +LG A+   D+ C      DNY+TCI+V+G+C EG+ E GRKLIE RWGK
Sbjct: 175 SLLSLLVKSRRLGDARKVYDEMCDRGDSVDNYSTCILVKGMCNEGKVEVGRKLIEGRWGK 234

Query: 307 GCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAI 366
           GC+PNIVFYNT+I GYCK G++ +AY +F ELKLKGF+PTLETFG+M+NGFCK G+F A 
Sbjct: 235 GCIPNIVFYNTIIGGYCKLGDIENAYLVFKELKLKGFMPTLETFGTMINGFCKEGDFVAS 294

Query: 367 DLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDILKETAENCCEPDLVTYNTLINY 426
           D LL E+K+RGL VSV   NNIIDA+Y+ G  +   + +     N C+PD+ TYN LIN 
Sbjct: 295 DRLLSEVKERGLRVSVWFLNNIIDAKYRHGYKVDPAESIGWIIANDCKPDVATYNILINR 354

Query: 427 LCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVD 486
           LC+ G+   A   L++A K+G++PN  +Y PL+ AYCK  EY  AS LL++M+++G K D
Sbjct: 355 LCKEGKKEVAVGFLDEASKKGLIPNNLSYAPLIQAYCKSKEYDIASKLLLQMAERGCKPD 414

Query: 487 MVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGLFKKGNLSMAKVMLS 546
           +V+YG LIHGLVV+G +D+A+ ++ ++++RGV PDA IYN+LM+GL K G    AK++ S
Sbjct: 415 IVTYGILIHGLVVSGHMDDAVNMKVKLIDRGVSPDAAIYNMLMSGLCKTGRFLPAKLLFS 474

Query: 547 EMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFG 606
           EMLD+NI PDA++YATL+DGFIR  + DEA+K+F L++EKG+   VV +N+MIKGFC+ G
Sbjct: 475 EMLDRNILPDAYVYATLIDGFIRSGDFDEARKVFSLSVEKGVKVDVVHHNAMIKGFCRSG 534

Query: 607 MMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYT 666
           M+++A+ C++RM     VPD FT+STIIDGYVKQ D+  A+KIF  M K  CKPNVVTYT
Sbjct: 535 MLDEALACMNRMNEEHLVPDKFTYSTIIDGYVKQQDMATAIKIFRYMEKNKCKPNVVTYT 594

Query: 667 SLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLCK 704
           SLING+C +G+ K+AE+ F  MQ   L P+VVTY  LIRSL K
Sbjct: 595 SLINGFCCQGDFKMAEETFKEMQLRDLVPNVVTYTTLIRSLAK 637
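
Each HSP summary line reports identity and positive counts as a fraction of the alignment length, with the percentage in parentheses. As a rough sketch, the snippet below parses such a line and recomputes both percentages from the raw counts. The regular expression and the name `recompute_percentages` are illustrative assumptions; the pattern accepts either spelling of the positives label.

```python
import re

# Matches e.g. "Identity = 267/523 (51.05%), Positives = 366/523 (69.98%)"
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def recompute_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return (
        round(100 * int(ident) / int(ident_len), 2),
        round(100 * int(pos) / int(pos_len), 2),
    )

line = "Identity = 267/523 (51.05%), Positives = 366/523 (69.98%), Query Frame = 0"
print(recompute_percentages(line))
```

For the first Swiss-Prot hit (Q9SSR4), the recomputed values agree with the reported 51.05% identity and 69.98% positives.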

BLAST of MC08g0997 vs. ExPASy Swiss-Prot
Match: Q8GW57 (Pentatricopeptide repeat-containing protein At1g80150, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g80150 PE=2 SV=2)

HSP 1 Score: 357.8 bits (917), Expect = 2.7e-97
Identity = 165/261 (63.22%), Positives = 208/261 (79.69%), Query Frame = 0

Query: 1   IIMLYGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKF 60
           IIMLYGKA M K A+DTF++M LYGC+R+VKSFNA L+VL    DL  I  FL +AP K+
Sbjct: 112 IIMLYGKAGMTKQALDTFFNMDLYGCKRSVKSFNAALQVLSFNPDLHTIWEFLHDAPSKY 171

Query: 61  DIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPDVVTYTTLISAFYKDNRCEIS 120
            I++D +S NI +K+FC++GIL  AY+ M EMEK G+ PDVVTYTTLISA YK  RC I 
Sbjct: 172 GIDIDAVSFNIAIKSFCELGILDGAYMAMREMEKSGLTPDVVTYTTLISALYKHERCVIG 231

Query: 121 NGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKG 180
           NGLWNLMVL+GC PNL +FNVRIQ+LV+RRRAW+AN L+ +M  + + PD +TYN+VIKG
Sbjct: 232 NGLWNLMVLKGCKPNLTTFNVRIQFLVNRRRAWDANDLLLLMPKLQVEPDSITYNMVIKG 291

Query: 181 FCQAGFFDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPN 240
           F  A F DMA+RVY+A+ G GYKPN+KIYQTMIHYLC++G+F+LAYTMCKD M + W+PN
Sbjct: 292 FFLARFPDMAERVYTAMHGKGYKPNLKIYQTMIHYLCKAGNFDLAYTMCKDCMRKKWYPN 351

Query: 241 IDTIHSLIKGLKKMGQLGKAQ 262
           +DT+  L+KGL K GQL +A+
Sbjct: 352 LDTVEMLLKGLVKKGQLDQAK 372

BLAST of MC08g0997 vs. ExPASy Swiss-Prot
Match: Q9CA58 (Putative pentatricopeptide repeat-containing protein At1g74580 OS=Arabidopsis thaliana OX=3702 GN=At1g74580 PE=3 SV=1)

HSP 1 Score: 311.2 bits (796), Expect = 2.9e-83
Identity = 200/659 (30.35%), Positives = 319/659 (48.41%), Query Frame = 0

Query: 5   YGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKFDIEL 64
           YG+   ++ AV+ F  M  Y C  TV S+NA++ VL+ +              ++  I  
Sbjct: 86  YGRKGKVQEAVNVFERMDFYDCEPTVFSYNAIMSVLVDSGYFDQAHKVYMRMRDR-GITP 145

Query: 65  DIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPDVVTYTTLISAFYKDNRCEISNGLW 124
           D+ S  I +K+FC       A  L+  M   G   +VV Y T++  FY++N       L+
Sbjct: 146 DVYSFTIRMKSFCKTSRPHAALRLLNNMSSQGCEMNVVAYCTVVGGFYEENFKAEGYELF 205

Query: 125 NLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQA 184
             M+  G    L++FN  ++ L  +    E  KL++ +   G++P+  TYNL I+G CQ 
Sbjct: 206 GKMLASGVSLCLSTFNKLLRVLCKKGDVKECEKLLDKVIKRGVLPNLFTYNLFIQGLCQR 265

Query: 185 GFFDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTI 244
           G  D A R+   L   G KP+V  Y  +I+ LC++  F  A       +N    P+  T 
Sbjct: 266 GELDGAVRMVGCLIEQGPKPDVITYNNLIYGLCKNSKFQEAEVYLGKMVNEGLEPDSYTY 325

Query: 245 HSLIKGLKKMGQ-------LGKAQDNGDDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESR 304
           ++LI G  K G        +G A  NG     D +T   ++ GLC EG T     L    
Sbjct: 326 NTLIAGYCKGGMVQLAERIVGDAVFNG--FVPDQFTYRSLIDGLCHEGETNRALALFNEA 385

Query: 305 WGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNF 364
            GKG  PN++ YNTLI G   +G +  A +L  E+  KG +P ++TF  +VNG CK G  
Sbjct: 386 LGKGIKPNVILYNTLIKGLSNQGMILEAAQLANEMSEKGLIPEVQTFNILVNGLCKMGCV 445

Query: 365 EAIDLLLMEMKDRGLSVSVQVYNNII---DAQYKLGCDIRAKDILKETAENCCEPDLVTY 424
              D L+  M  +G    +  +N +I     Q K+     A +IL    +N  +PD+ TY
Sbjct: 446 SDADGLVKVMISKGYFPDIFTFNILIHGYSTQLKME---NALEILDVMLDNGVDPDVYTY 505

Query: 425 NTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSK 484
           N+L+N LC+  +  +  +  +  +++G  PN FT+  L+ + C+  +   A  LL EM  
Sbjct: 506 NSLLNGLCKTSKFEDVMETYKTMVEKGCAPNLFTFNILLESLCRYRKLDEALGLLEEMKN 565

Query: 485 KGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMER-GVLPDANIYNVLMNGLFKKGNLS 544
           K    D V++G LI G    G++D A T+  +M E   V      YN++++   +K N++
Sbjct: 566 KSVNPDAVTFGTLIDGFCKNGDLDGAYTLFRKMEEAYKVSSSTPTYNIIIHAFTEKLNVT 625

Query: 545 MAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMI 604
           MA+ +  EM+D+ + PD + Y  +VDGF +  N++   K     +E G  P +     +I
Sbjct: 626 MAEKLFQEMVDRCLGPDGYTYRLMVDGFCKTGNVNLGYKFLLEMMENGFIPSLTTLGRVI 685

Query: 605 KGFCKFGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSC 653
              C    + +A   I RM     VP+    +TI D  V + ++ A   +   +LK+SC
Sbjct: 686 NCLCVEDRVYEAAGIIHRMVQKGLVPE--AVNTICD--VDKKEVAAPKLVLEDLLKKSC 734

BLAST of MC08g0997 vs. ExPASy Swiss-Prot
Match: Q9FJE6 (Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis thaliana OX=3702 GN=At5g59900 PE=3 SV=1)

HSP 1 Score: 295.0 bits (754), Expect = 2.2e-78
Identity = 165/622 (26.53%), Positives = 305/622 (49.04%), Query Frame = 0

Query: 88  LMLEMEKVGIRPDVVTYTTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLV 147
           L  +M  VGIRPDV  YT +I +  +      +  +   M   GC  N+  +NV I  L 
Sbjct: 214 LFNDMVSVGIRPDVYIYTGVIRSLCELKDLSRAKEMIAHMEATGCDVNIVPYNVLIDGLC 273

Query: 148 DRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGSGYKPNVK 207
            +++ WEA  +   +    + PD VTY  ++ G C+   F++   +   +    + P+  
Sbjct: 274 KKQKVWEAVGIKKDLAGKDLKPDVVTYCTLVYGLCKVQEFEIGLEMMDEMLCLRFSPSEA 333

Query: 208 IYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKAQDNGD-- 267
              +++  L + G    A  + K  ++    PN+   ++LI  L K  +  +A+   D  
Sbjct: 334 AVSSLVEGLRKRGKIEEALNLVKRVVDFGVSPNLFVYNALIDSLCKGRKFHEAELLFDRM 393

Query: 268 ---DKCTDNYTTCIMVRGLCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEV 327
                  ++ T  I++   C  G+ +     +      G   ++  YN+LI+G+CK G++
Sbjct: 394 GKIGLRPNDVTYSILIDMFCRRGKLDTALSFLGEMVDTGLKLSVYPYNSLINGHCKFGDI 453

Query: 328 GSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNI 387
            +A     E+  K   PT+ T+ S++ G+C  G       L  EM  +G++ S+  +  +
Sbjct: 454 SAAEGFMAEMINKKLEPTVVTYTSLMGGYCSKGKINKALRLYHEMTGKGIAPSIYTFTTL 513

Query: 388 IDAQYKLGCDIRAKDILKETAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGM 447
           +   ++ G    A  +  E AE   +P+ VTYN +I   C  G++ +A + L++  ++G+
Sbjct: 514 LSGLFRAGLIRDAVKLFNEMAEWNVKPNRVTYNVMIEGYCEEGDMSKAFEFLKEMTEKGI 573

Query: 448 VPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMT 507
           VP+ ++Y PL+H  C  G+   A   +  + K   +++ + Y  L+HG    G+++ A++
Sbjct: 574 VPDTYSYRPLIHGLCLTGQASEAKVFVDGLHKGNCELNEICYTGLLHGFCREGKLEEALS 633

Query: 508 IRDRMMERGVLPDANIYNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFI 567
           +   M++RGV  D   Y VL++G  K  +  +   +L EM D+ + PD  IY +++D   
Sbjct: 634 VCQEMVQRGVDLDLVCYGVLIDGSLKHKDRKLFFGLLKEMHDRGLKPDDVIYTSMIDAKS 693

Query: 568 RHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVF 627
           +  +  EA  ++ L I +G  P  V Y ++I G CK G + +A +   +M+    VP+  
Sbjct: 694 KTGDFKEAFGIWDLMINEGCVPNEVTYTAVINGLCKAGFVNEAEVLCSKMQPVSSVPNQV 753

Query: 628 TFSTIIDGYVK-QCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSL 687
           T+   +D   K + D+  A+++   +LK     N  TY  LI G+C +G ++ A +L + 
Sbjct: 754 TYGCFLDILTKGEVDMQKAVELHNAILK-GLLANTATYNMLIRGFCRQGRIEEASELITR 813

Query: 688 MQSHGLEPSVVTYGVLIRSLCK 704
           M   G+ P  +TY  +I  LC+
Sbjct: 814 MIGDGVSPDCITYTTMINELCR 834

BLAST of MC08g0997 vs. ExPASy Swiss-Prot
Match: Q9LSL9 (Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana OX=3702 GN=At5g65560 PE=2 SV=1)

HSP 1 Score: 282.3 bits (721), Expect = 1.5e-74
Identity = 171/624 (27.40%), Positives = 299/624 (47.92%), Query Frame = 0

Query: 104 YTTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMR 163
           Y TL+++  +    +    ++  M+     PN+ ++N  +          EAN+ ++ + 
Sbjct: 186 YNTLLNSLARFGLVDEMKQVYMEMLEDKVCPNIYTYNKMVNGYCKLGNVEEANQYVSKIV 245

Query: 164 NIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFN 223
             G+ PD  TY  +I G+CQ    D A +V++ +   G + N   Y  +IH LC +   +
Sbjct: 246 EAGLDPDFFTYTSLIMGYCQRKDLDSAFKVFNEMPLKGCRRNEVAYTHLIHGLCVARRID 305

Query: 224 LAYTMCKDAMNRNWFPNIDTIHSLIKGL----KKMGQLGKAQDNGDDKCTDN-YTTCIMV 283
            A  +     +   FP + T   LIK L    +K   L   ++  +     N +T  +++
Sbjct: 306 EAMDLFVKMKDDECFPTVRTYTVLIKSLCGSERKSEALNLVKEMEETGIKPNIHTYTVLI 365

Query: 284 RGLCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFV 343
             LC + + E  R+L+     KG +PN++ YN LI+GYCK+G +  A ++   ++ +   
Sbjct: 366 DSLCSQCKFEKARELLGQMLEKGLMPNVITYNALINGYCKRGMIEDAVDVVELMESRKLS 425

Query: 344 PTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDI 403
           P   T+  ++ G+CK+   +A+  +L +M +R +   V  YN++ID Q + G    A  +
Sbjct: 426 PNTRTYNELIKGYCKSNVHKAMG-VLNKMLERKVLPDVVTYNSLIDGQCRSGNFDSAYRL 485

Query: 404 LKETAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCK 463
           L    +    PD  TY ++I+ LC+   V EA  + +   ++G+ PN   YT L+  YCK
Sbjct: 486 LSLMNDRGLVPDQWTYTSMIDSLCKSKRVEEACDLFDSLEQKGVNPNVVMYTALIDGYCK 545

Query: 464 QGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANI 523
            G+   A  +L +M  K    + +++ ALIHGL   G++  A  + ++M++ G+ P  + 
Sbjct: 546 AGKVDEAHLMLEKMLSKNCLPNSLTFNALIHGLCADGKLKEATLLEEKMVKIGLQPTVST 605

Query: 524 YNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTI 583
             +L++ L K G+   A     +ML     PDA  Y T +  + R   L +A+ +     
Sbjct: 606 DTILIHRLLKDGDFDHAYSRFQQMLSSGTKPDAHTYTTFIQTYCREGRLLDAEDMMAKMR 665

Query: 584 EKGIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFSTIID-----GYVK 643
           E G+ P +  Y+S+IKG+   G    A   + RMR     P   TF ++I       Y K
Sbjct: 666 ENGVSPDLFTYSSLIKGYGDLGQTNFAFDVLKRMRDTGCEPSQHTFLSLIKHLLEMKYGK 725

Query: 644 Q-------------CDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFS 703
           Q              +    +++   M++ S  PN  +Y  LI G C  G L++AEK+F 
Sbjct: 726 QKGSEPELCAMSNMMEFDTVVELLEKMVEHSVTPNAKSYEKLILGICEVGNLRVAEKVFD 785

BLAST of MC08g0997 vs. NCBI nr
Match: GAV66013.1 (PPR domain-containing protein/PPR_1 domain-containing protein/PPR_2 domain-containing protein/PPR_3 domain-containing protein, partial [Cephalotus follicularis])

HSP 1 Score: 963 bits (2489), Expect = 0.0
Identity = 454/703 (64.58%), Positives = 566/703 (80.51%), Query Frame = 0

Query: 1   IIMLYGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKF 60
           IIMLYGK  M KHA+DTFY+MHLYG +RTVKS NA LKVL ++RDLGAIE FL E P+KF
Sbjct: 1   IIMLYGKVGMTKHAIDTFYNMHLYGSKRTVKSLNAALKVLTQSRDLGAIEEFLQEVPQKF 60

Query: 61  DIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPDVVTYTTLISAFYKDNRCEIS 120
           DI LDI SVNIV+K  C++GIL +AYL+MLEMEKVG+RPDV+TYTTLISA  K NR EI 
Sbjct: 61  DIALDIFSVNIVIKGLCEMGILDKAYLVMLEMEKVGLRPDVITYTTLISASCKSNRWEIG 120

Query: 121 NGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKG 180
           NGLWNLMV +GC PNL +FNVRI+YLV RRRAW+AN L+ +M+N+G++PDE+TYNLVIKG
Sbjct: 121 NGLWNLMVRKGCFPNLITFNVRIEYLVSRRRAWQANSLLGLMQNLGMMPDEITYNLVIKG 180

Query: 181 FCQAGFFDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPN 240
           FCQAG+F+MAKRVYSA+   GY+PN+KIY+TMIHYLC+ GDFNLAYTM KD M++NWFPN
Sbjct: 181 FCQAGYFEMAKRVYSAMLFKGYRPNLKIYETMIHYLCKGGDFNLAYTMSKDCMSKNWFPN 240

Query: 241 IDTIHSLIKGLKKMGQLGKAQDNGDDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESRWGK 300
           +DTIH+L++                            VRGLC E + E+G+KL+E+RWG 
Sbjct: 241 LDTIHTLLE----------------------------VRGLCKEEKVEEGKKLVENRWGV 300

Query: 301 GCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAI 360
           GCVPN VFYN LIDGYCKK E+ SA +L  +LK+KGF+PTLET+G+M+NGFCK G+F+A+
Sbjct: 301 GCVPNKVFYNVLIDGYCKKSEIDSAKQLLKKLKMKGFLPTLETYGAMINGFCKGGDFKAV 360

Query: 361 DLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDILKETAENCCEPDLVTYNTLINY 420
           D LL+EMK+RG++VSV+VYNNIIDA+YK  C ++A + +K   E+ CEPD+ TYNTLI+ 
Sbjct: 361 DRLLVEMKERGINVSVRVYNNIIDARYKHECKVKAVETVKLMIESGCEPDIATYNTLISG 420

Query: 421 LCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVD 480
            CR G+V +A ++LEQ  +RG++PNKFTYTPL+H YC  G++ RAS+ LIEM ++GH+ D
Sbjct: 421 ACRDGKVEDACQLLEQVKERGLLPNKFTYTPLIHVYCNHGDHVRASEFLIEMMERGHRPD 480

Query: 481 MVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGLFKKGNLSMAKVMLS 540
           +V+YGAL++GLV AGEVD A+T+R +M+ERGVLPDA IYNVLM+GL KK  LS AK++L+
Sbjct: 481 LVAYGALVNGLVAAGEVDTALTVRHKMVERGVLPDAAIYNVLMSGLCKKRKLSAAKMLLA 540

Query: 541 EMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFG 600
           EMLD N+ PDAF+YATLVDGFIR+  LDEAKKLF+LTIEKGIDPGVVGYN+MIKG+CK G
Sbjct: 541 EMLDHNVLPDAFVYATLVDGFIRNGELDEAKKLFELTIEKGIDPGVVGYNAMIKGYCKSG 600

Query: 601 MMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYT 660
           MM+DA+ C+++M +  H PD F++STIIDGYVK  DL  AL+IF  M KQ+CKPNVVTYT
Sbjct: 601 MMKDALSCVNKMIAGHHAPDEFSYSTIIDGYVKLHDLDGALRIFAQMKKQNCKPNVVTYT 660

Query: 661 SLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLCK 703
            LING+C KG+   AEK F  MQS GL P+VVTY +LI   CK
Sbjct: 661 CLINGFCSKGDSSKAEKSFEEMQSCGLVPNVVTYSILIGGFCK 675

BLAST of MC08g0997 vs. NCBI nr
Match: XP_022153568.1 (pentatricopeptide repeat-containing protein At1g52620 [Momordica charantia])

HSP 1 Score: 918 bits (2372), Expect = 0.0
Identity = 476/601 (79.20%), Positives = 497/601 (82.70%), Query Frame = 0

Query: 108 ISAFYKDNRCEISNGL----WNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMR 167
           IS F  D   ++  GL    W       C  N ++++  ++ L   R   E    +  MR
Sbjct: 66  ISHFVLDRIDDVELGLKFFDWASKNSSSCSLNGSAYSSLLKLLSRFRVFPEIEFTLEDMR 125

Query: 168 NIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGSGYKPNVKIYQTMIH-YLCRSGDF 227
              IVP     + V+  +   GF D A   Y  +        VK++ ++   Y C S   
Sbjct: 126 TKEIVPTRDALSNVLCAYADLGFVDKALVFYHGV--------VKLHNSLPSTYACNS--- 185

Query: 228 NLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKAQDNGDDKCTDNYTTCIMVRGLC 287
            L   + K          + T H L   + K       +DNGDDKCTDNYTTCIMVRGLC
Sbjct: 186 -LLNLLVKHR-------RLGTAHQLYDEMVK-------RDNGDDKCTDNYTTCIMVRGLC 245

Query: 288 LEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLE 347
           LEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLE
Sbjct: 246 LEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLE 305

Query: 348 TFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDILKET 407
           TFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDILKET
Sbjct: 306 TFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDILKET 365

Query: 408 AENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEY 467
           AENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEY
Sbjct: 366 AENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEY 425

Query: 468 YRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVL 527
           YRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVL
Sbjct: 426 YRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVL 485

Query: 528 MNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGI 587
           MNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGI
Sbjct: 486 MNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGI 545

Query: 588 DPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALK 647
           DPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALK
Sbjct: 546 DPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALK 605

Query: 648 IFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLC 703
           IFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLC
Sbjct: 606 IFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLC 640

BLAST of MC08g0997 vs. NCBI nr
Match: RXH68979.1 (hypothetical protein DVH24_031312 [Malus domestica])

HSP 1 Score: 912 bits (2356), Expect = 0.0
Identity = 489/1005 (48.66%), Positives = 593/1005 (59.00%), Query Frame = 0

Query: 1    IIMLYGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKF 60
            II LYGKA M KHA+DTF DMHLYGC RTVKSFNA LKVL +TRDLGA+EAFLSE PEKF
Sbjct: 111  IITLYGKAGMTKHAIDTFCDMHLYGCSRTVKSFNAALKVLTQTRDLGALEAFLSEIPEKF 170

Query: 61   DIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPDVVTYTTLISAFYKDNRCEIS 120
            DIELDI SVNIV+KAFC++GIL +AY +M++MEK+GI+PDV+TYTTL+SAFYKDNR EI 
Sbjct: 171  DIELDIYSVNIVIKAFCEMGILVKAYQIMVQMEKLGIKPDVITYTTLMSAFYKDNRWEIG 230

Query: 121  NGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKG 180
            NGLWNLM+L+GCLPNLA+FNVRIQYLV RRRAWEAN+LM +M+NI I PDEVTYNLVIKG
Sbjct: 231  NGLWNLMILKGCLPNLATFNVRIQYLVYRRRAWEANRLMGLMQNIEITPDEVTYNLVIKG 290

Query: 181  FCQAGFFDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPN 240
            FCQAG+ +MAKRVYSAL G GYKPNVKIYQTMIHYLC+ GDF+LAYTMCKD M +NWFPN
Sbjct: 291  FCQAGYLEMAKRVYSALHGKGYKPNVKIYQTMIHYLCKGGDFDLAYTMCKDCMQKNWFPN 350

Query: 241  IDTIHSLIKGLKKMGQLGKAQ-------DNGD---------------------------- 300
            +DTI +L++GLKK  QLGKA+       +NG                             
Sbjct: 351  VDTIRTLLEGLKKANQLGKAKAIMIMVRENGKVIPIRCHKMLVWKIQSSATLPLEVHNSF 410

Query: 301  ------------------------------------------------------------ 360
                                                                        
Sbjct: 411  FGQQGGTEPSRQGVDSLHLIRAFLLKRKRAATVIPIQELRHLQTSENSLTESLKKSKTSS 470

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 471  HSHMSKTLLSRIKPLQNRKPTSSSSSSSSSPPPPHVKRLVNDTIQILRTQHHWEQSLETQ 530

Query: 421  ------------------------------------------------------------ 480
                                                                        
Sbjct: 531  FSETEMIVSDVAHFVLDRVHDVELGLKFFDWAFKRSYCCSPDGSAYSSLLKLLARFRVFS 590

Query: 481  ------DK---------------------------------------------------- 540
                  DK                                                    
Sbjct: 591  EIDLVMDKVKLEEVKPTHDALSFVIRAYADSGMVGKALDLYDVVVKVYGVVPSVFACNSL 650

Query: 541  -----------------------------CTDNYTTCIMVRGLCLEGRTEDGRKLIESRW 600
                                         C DNY+TCIMV+GLC EGR E+GRKLI  RW
Sbjct: 651  LNVLVKSRRVDVARRVYDEMAERGGREHLCMDNYSTCIMVKGLCKEGRVEEGRKLIVDRW 710

Query: 601  GKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFE 660
            GK CVPN+VFYNTLIDGYCKKG+V SA  +F ELK KGF+PTLET+G+M+NG+CK G F+
Sbjct: 711  GKSCVPNVVFYNTLIDGYCKKGDVESANVIFKELKSKGFLPTLETYGAMINGYCKEGKFK 770

Query: 661  AIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDILKETAENCCEPDLVTYNTLI 703
            AID L MEMK+RGL ++VQV NNI+DA+ K G  ++  + +K+  E+ CEPD+ TYN LI
Sbjct: 771  AIDRLFMEMKERGLHINVQVRNNIVDARCKHGSLVKGVETVKQMIESGCEPDITTYNILI 830

BLAST of MC08g0997 vs. NCBI nr
Match: XP_024956279.1 (pentatricopeptide repeat-containing protein At1g52620 [Citrus sinensis])

HSP 1 Score: 861 bits (2224), Expect = 9.95e-298
Identity = 460/986 (46.65%), Positives = 584/986 (59.23%), Query Frame = 0

Query: 1    IIMLYGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKF 60
            I+MLYGKA MIKHA+DTFYDMHLYGC+RTVKS NA LKVL ++RDL AI+AFL E PEKF
Sbjct: 121  IMMLYGKAGMIKHAMDTFYDMHLYGCKRTVKSLNAALKVLTESRDLKAIQAFLMEVPEKF 180

Query: 61   DIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPDVVTYTTLISAFYKDNRCEIS 120
             I+ DI S NIV+KAFC++GIL +AYL+M+EM+K+G++PDV+TYTTLISAFYKDNR EI 
Sbjct: 181  HIQFDIFSFNIVIKAFCEMGILDKAYLVMVEMQKLGVKPDVITYTTLISAFYKDNRPEIG 240

Query: 121  NGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKG 180
            NGLWNLMV +GC PNLA+FNVRIQ+LV++RR+W+ANKLM +M+  GI PDEVTYNLVIKG
Sbjct: 241  NGLWNLMVRKGCFPNLATFNVRIQHLVNKRRSWQANKLMGLMQRFGIEPDEVTYNLVIKG 300

Query: 181  FCQAGFFDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPN 240
            FC++G  DMAK+VYSA+ G    PN KIYQTMIHYLC+ GDFNLAY MCKD+M +NW P+
Sbjct: 301  FCRSGHLDMAKKVYSAMLGRRLMPNRKIYQTMIHYLCQEGDFNLAYIMCKDSMKKNWVPS 360

Query: 241  IDTIHSLIKGLKKMGQLGKA---------------------------------------- 300
            +DTI +L++GLKK  Q  KA                                        
Sbjct: 361  VDTISALLEGLKKNNQPCKANTIMALVQRRVPHFSSNQLSAFKSILSKSDRCFNSTSALP 420

Query: 301  ------------------------------------QD---------------------- 360
                                                QD                      
Sbjct: 421  GKSSTEKAAPRKEHCTYVPSLAGLLLQSSVSIDYISQDPQSFVSSSLFLMSKTLLSRIKP 480

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 481  LQNQKPSSSSSSPFPLAPHIKKLVNETIEILKTHPQYDQSLEIRFSDEETYVSEIAHHVF 540

Query: 421  --------------------------NG-------------------------------- 480
                                      NG                                
Sbjct: 541  DRIRELELGLKFFDWLSRQQPKNSFSNGYACSSFLKLLARFRVFSEIELVLKNLKIDGIK 600

Query: 481  ------------------------------------------------------------ 540
                                                                        
Sbjct: 601  PTHEALSVIIRAYAESGLVDKAIDLYNNLFVPYNSVPDVFTCNSLLNLLVKCKRIEMARK 660

Query: 541  --DDKCT-----DNYTTCIMVRGLCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYC 600
              D+ C      DNY+TCIMVRGLC EG+ E+G+ LIE R+GKGC+PNIVFYNTLIDGYC
Sbjct: 661  LYDEMCKTDDGLDNYSTCIMVRGLCKEGKVEEGKNLIEDRFGKGCIPNIVFYNTLIDGYC 720

Query: 601  KKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQ 660
            KKG+V +A +LF ELK+KGF+PTLET+G++++GFCK G+F+ ID L+MEMK R L+V+V+
Sbjct: 721  KKGDVENARKLFKELKMKGFLPTLETYGAIISGFCKKGSFKGIDGLMMEMKQRNLNVNVR 780

Query: 661  VYNNIIDAQYKLGCDIRAKDILKETAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQA 703
            VYN+IID +YK G  + A + ++   EN CEPD+VTYN LI+  CR G+V EA ++LEQ 
Sbjct: 781  VYNSIIDGKYKHGFKVEALETVRLMIENRCEPDIVTYNILISGACRDGKVNEACELLEQV 840

BLAST of MC08g0997 vs. NCBI nr
Match: XP_038894903.1 (pentatricopeptide repeat-containing protein At1g52620 [Benincasa hispida])

HSP 1 Score: 776 bits (2005), Expect = 5.07e-271
Identity = 377/477 (79.04%), Positives = 416/477 (87.21%), Query Frame = 0

Query: 236 NWFPNIDTIHSLIKGLKKMGQLGKA---------QDNGDDKCTDNYTTCIMVRGLCLEGR 295
           N  P++   +SL+  L K  +L  A         +DNGDD C DNYTTCIMVRGLCLEGR
Sbjct: 164 NSLPSMYACNSLLNLLVKHRRLETAHQLYDEMVSRDNGDDICMDNYTTCIMVRGLCLEGR 223

Query: 296 TEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGS 355
            EDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEV SAY+LF ELK+KGF+PTLETFGS
Sbjct: 224 IEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVESAYKLFKELKMKGFIPTLETFGS 283

Query: 356 MVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDILKETAENC 415
           +VNGFCK G FEAIDLLL+EMK+RGLSV+VQ+YN IIDA+YKLG D +AKD LKE  ENC
Sbjct: 284 LVNGFCKVGIFEAIDLLLVEMKERGLSVNVQIYNTIIDARYKLGYDTKAKDTLKEMTENC 343

Query: 416 CEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRAS 475
           C PDLVTYNTLINYLC  GEV EAEK+LEQ I+RG+ P+KF YTPLVH Y KQGEY RAS
Sbjct: 344 CTPDLVTYNTLINYLCSRGEVKEAEKLLEQTIRRGLAPDKFAYTPLVHGYYKQGEYIRAS 403

Query: 476 DLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGL 535
           DL+IEMS +GH+VD VSYGA+IHGLVVAGEVD A+TIRDRMMERGVLPDANIYNVLMNGL
Sbjct: 404 DLVIEMSTRGHEVDRVSYGAIIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGL 463

Query: 536 FKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGV 595
           FKKG LSMAK+ML+EMLDQ+IAPDAFIYATLVDGFIRH NLDEA K+FQLTIEKGIDPGV
Sbjct: 464 FKKGKLSMAKMMLTEMLDQHIAPDAFIYATLVDGFIRHGNLDEAMKIFQLTIEKGIDPGV 523

Query: 596 VGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGL 655
           VGYN MIKGF KFGMM DA+LCIDRMRSA H PDVFTFSTIIDGYVKQ D+YA LK+FGL
Sbjct: 524 VGYNVMIKGFSKFGMMNDAILCIDRMRSAHHAPDVFTFSTIIDGYVKQHDMYAVLKVFGL 583

Query: 656 MLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLCK 703
           M+KQ+CKPNV+TYTSLINGYC KGE+K+AEK FS+MQSHGLEPSVVTY +LIRS CK
Sbjct: 584 MVKQNCKPNVITYTSLINGYCRKGEIKMAEKHFSMMQSHGLEPSVVTYSILIRSFCK 640

BLAST of MC08g0997 vs. ExPASy TrEMBL
Match: A0A1Q3BDZ3 (PPR domain-containing protein/PPR_1 domain-containing protein/PPR_2 domain-containing protein/PPR_3 domain-containing protein (Fragment) OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_09524 PE=4 SV=1)

HSP 1 Score: 963 bits (2489), Expect = 0.0
Identity = 454/703 (64.58%), Positives = 566/703 (80.51%), Query Frame = 0

Query: 1   IIMLYGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKF 60
           IIMLYGK  M KHA+DTFY+MHLYG +RTVKS NA LKVL ++RDLGAIE FL E P+KF
Sbjct: 1   IIMLYGKVGMTKHAIDTFYNMHLYGSKRTVKSLNAALKVLTQSRDLGAIEEFLQEVPQKF 60

Query: 61  DIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPDVVTYTTLISAFYKDNRCEIS 120
           DI LDI SVNIV+K  C++GIL +AYL+MLEMEKVG+RPDV+TYTTLISA  K NR EI 
Sbjct: 61  DIALDIFSVNIVIKGLCEMGILDKAYLVMLEMEKVGLRPDVITYTTLISASCKSNRWEIG 120

Query: 121 NGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKG 180
           NGLWNLMV +GC PNL +FNVRI+YLV RRRAW+AN L+ +M+N+G++PDE+TYNLVIKG
Sbjct: 121 NGLWNLMVRKGCFPNLITFNVRIEYLVSRRRAWQANSLLGLMQNLGMMPDEITYNLVIKG 180

Query: 181 FCQAGFFDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPN 240
           FCQAG+F+MAKRVYSA+   GY+PN+KIY+TMIHYLC+ GDFNLAYTM KD M++NWFPN
Sbjct: 181 FCQAGYFEMAKRVYSAMLFKGYRPNLKIYETMIHYLCKGGDFNLAYTMSKDCMSKNWFPN 240

Query: 241 IDTIHSLIKGLKKMGQLGKAQDNGDDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESRWGK 300
           +DTIH+L++                            VRGLC E + E+G+KL+E+RWG 
Sbjct: 241 LDTIHTLLE----------------------------VRGLCKEEKVEEGKKLVENRWGV 300

Query: 301 GCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAI 360
           GCVPN VFYN LIDGYCKK E+ SA +L  +LK+KGF+PTLET+G+M+NGFCK G+F+A+
Sbjct: 301 GCVPNKVFYNVLIDGYCKKSEIDSAKQLLKKLKMKGFLPTLETYGAMINGFCKGGDFKAV 360

Query: 361 DLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDILKETAENCCEPDLVTYNTLINY 420
           D LL+EMK+RG++VSV+VYNNIIDA+YK  C ++A + +K   E+ CEPD+ TYNTLI+ 
Sbjct: 361 DRLLVEMKERGINVSVRVYNNIIDARYKHECKVKAVETVKLMIESGCEPDIATYNTLISG 420

Query: 421 LCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVD 480
            CR G+V +A ++LEQ  +RG++PNKFTYTPL+H YC  G++ RAS+ LIEM ++GH+ D
Sbjct: 421 ACRDGKVEDACQLLEQVKERGLLPNKFTYTPLIHVYCNHGDHVRASEFLIEMMERGHRPD 480

Query: 481 MVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGLFKKGNLSMAKVMLS 540
           +V+YGAL++GLV AGEVD A+T+R +M+ERGVLPDA IYNVLM+GL KK  LS AK++L+
Sbjct: 481 LVAYGALVNGLVAAGEVDTALTVRHKMVERGVLPDAAIYNVLMSGLCKKRKLSAAKMLLA 540

Query: 541 EMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFG 600
           EMLD N+ PDAF+YATLVDGFIR+  LDEAKKLF+LTIEKGIDPGVVGYN+MIKG+CK G
Sbjct: 541 EMLDHNVLPDAFVYATLVDGFIRNGELDEAKKLFELTIEKGIDPGVVGYNAMIKGYCKSG 600

Query: 601 MMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYT 660
           MM+DA+ C+++M +  H PD F++STIIDGYVK  DL  AL+IF  M KQ+CKPNVVTYT
Sbjct: 601 MMKDALSCVNKMIAGHHAPDEFSYSTIIDGYVKLHDLDGALRIFAQMKKQNCKPNVVTYT 660

Query: 661 SLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLCK 703
            LING+C KG+   AEK F  MQS GL P+VVTY +LI   CK
Sbjct: 661 CLINGFCSKGDSSKAEKSFEEMQSCGLVPNVVTYSILIGGFCK 675

BLAST of MC08g0997 vs. ExPASy TrEMBL
Match: A0A6J1DHT9 (pentatricopeptide repeat-containing protein At1g52620 OS=Momordica charantia OX=3673 GN=LOC111021040 PE=4 SV=1)

HSP 1 Score: 918 bits (2372), Expect = 0.0
Identity = 476/601 (79.20%), Positives = 497/601 (82.70%), Query Frame = 0

Query: 108 ISAFYKDNRCEISNGL----WNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMR 167
           IS F  D   ++  GL    W       C  N ++++  ++ L   R   E    +  MR
Sbjct: 66  ISHFVLDRIDDVELGLKFFDWASKNSSSCSLNGSAYSSLLKLLSRFRVFPEIEFTLEDMR 125

Query: 168 NIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGSGYKPNVKIYQTMIH-YLCRSGDF 227
              IVP     + V+  +   GF D A   Y  +        VK++ ++   Y C S   
Sbjct: 126 TKEIVPTRDALSNVLCAYADLGFVDKALVFYHGV--------VKLHNSLPSTYACNS--- 185

Query: 228 NLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKAQDNGDDKCTDNYTTCIMVRGLC 287
            L   + K          + T H L   + K       +DNGDDKCTDNYTTCIMVRGLC
Sbjct: 186 -LLNLLVKHR-------RLGTAHQLYDEMVK-------RDNGDDKCTDNYTTCIMVRGLC 245

Query: 288 LEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLE 347
           LEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLE
Sbjct: 246 LEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLE 305

Query: 348 TFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDILKET 407
           TFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDILKET
Sbjct: 306 TFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDILKET 365

Query: 408 AENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEY 467
           AENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEY
Sbjct: 366 AENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEY 425

Query: 468 YRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVL 527
           YRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVL
Sbjct: 426 YRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVL 485

Query: 528 MNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGI 587
           MNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGI
Sbjct: 486 MNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGI 545

Query: 588 DPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALK 647
           DPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALK
Sbjct: 546 DPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALK 605

Query: 648 IFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLC 703
           IFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLC
Sbjct: 606 IFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLC 640

BLAST of MC08g0997 vs. ExPASy TrEMBL
Match: A0A498HD54 (Uncharacterized protein OS=Malus domestica OX=3750 GN=DVH24_031312 PE=4 SV=1)

HSP 1 Score: 912 bits (2356), Expect = 0.0
Identity = 489/1005 (48.66%), Positives = 593/1005 (59.00%), Query Frame = 0

Query: 1    IIMLYGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKF 60
            II LYGKA M KHA+DTF DMHLYGC RTVKSFNA LKVL +TRDLGA+EAFLSE PEKF
Sbjct: 111  IITLYGKAGMTKHAIDTFCDMHLYGCSRTVKSFNAALKVLTQTRDLGALEAFLSEIPEKF 170

Query: 61   DIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPDVVTYTTLISAFYKDNRCEIS 120
            DIELDI SVNIV+KAFC++GIL +AY +M++MEK+GI+PDV+TYTTL+SAFYKDNR EI 
Sbjct: 171  DIELDIYSVNIVIKAFCEMGILVKAYQIMVQMEKLGIKPDVITYTTLMSAFYKDNRWEIG 230

Query: 121  NGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKG 180
            NGLWNLM+L+GCLPNLA+FNVRIQYLV RRRAWEAN+LM +M+NI I PDEVTYNLVIKG
Sbjct: 231  NGLWNLMILKGCLPNLATFNVRIQYLVYRRRAWEANRLMGLMQNIEITPDEVTYNLVIKG 290

Query: 181  FCQAGFFDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPN 240
            FCQAG+ +MAKRVYSAL G GYKPNVKIYQTMIHYLC+ GDF+LAYTMCKD M +NWFPN
Sbjct: 291  FCQAGYLEMAKRVYSALHGKGYKPNVKIYQTMIHYLCKGGDFDLAYTMCKDCMQKNWFPN 350

Query: 241  IDTIHSLIKGLKKMGQLGKAQ-------DNGD---------------------------- 300
            +DTI +L++GLKK  QLGKA+       +NG                             
Sbjct: 351  VDTIRTLLEGLKKANQLGKAKAIMIMVRENGKVIPIRCHKMLVWKIQSSATLPLEVHNSF 410

Query: 301  ------------------------------------------------------------ 360
                                                                        
Sbjct: 411  FGQQGGTEPSRQGVDSLHLIRAFLLKRKRAATVIPIQELRHLQTSENSLTESLKKSKTSS 470

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 471  HSHMSKTLLSRIKPLQNRKPTSSSSSSSSSPPPPHVKRLVNDTIQILRTQHHWEQSLETQ 530

Query: 421  ------------------------------------------------------------ 480
                                                                        
Sbjct: 531  FSETEMIVSDVAHFVLDRVHDVELGLKFFDWAFKRSYCCSPDGSAYSSLLKLLARFRVFS 590

Query: 481  ------DK---------------------------------------------------- 540
                  DK                                                    
Sbjct: 591  EIDLVMDKVKLEEVKPTHDALSFVIRAYADSGMVGKALDLYDVVVKVYGVVPSVFACNSL 650

Query: 541  -----------------------------CTDNYTTCIMVRGLCLEGRTEDGRKLIESRW 600
                                         C DNY+TCIMV+GLC EGR E+GRKLI  RW
Sbjct: 651  LNVLVKSRRVDVARRVYDEMAERGGREHLCMDNYSTCIMVKGLCKEGRVEEGRKLIVDRW 710

Query: 601  GKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFE 660
            GK CVPN+VFYNTLIDGYCKKG+V SA  +F ELK KGF+PTLET+G+M+NG+CK G F+
Sbjct: 711  GKSCVPNVVFYNTLIDGYCKKGDVESANVIFKELKSKGFLPTLETYGAMINGYCKEGKFK 770

Query: 661  AIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDILKETAENCCEPDLVTYNTLI 703
            AID L MEMK+RGL ++VQV NNI+DA+ K G  ++  + +K+  E+ CEPD+ TYN LI
Sbjct: 771  AIDRLFMEMKERGLHINVQVRNNIVDARCKHGSLVKGVETVKQMIESGCEPDITTYNILI 830

BLAST of MC08g0997 vs. ExPASy TrEMBL
Match: A0A371FHY8 (Signal peptidase I (Fragment) OS=Mucuna pruriens OX=157652 GN=CR513_41872 PE=3 SV=1)

HSP 1 Score: 811 bits (2094), Expect = 5.28e-270
Identity = 418/897 (46.60%), Positives = 550/897 (61.32%), Query Frame = 0

Query: 1    IIMLYGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKF 60
            II LYG A M  HA+  F   H    RRTVKSFNA L+VL  +R+  +   FL+  P ++
Sbjct: 934  IISLYGIAGMTSHALHAF---HHTRSRRTVKSFNATLRVLALSRNFPSFLDFLTLVPLRY 993

Query: 61   DIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPDVVTYTTLISAFYKDNRCEIS 120
             I+LDI SVNI VKAFCD+G L +AYL MLE E  GI+PD VTYTTL+SAFYK+ R EI 
Sbjct: 994  HIQLDIFSVNIAVKAFCDLGKLHQAYLFMLECEIKGIQPDAVTYTTLLSAFYKNKRWEIG 1053

Query: 121  NGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRN-IGIVPDEVTYNLVIK 180
            NGLWN M+L+GC P+LA+FNVRIQ+LV  RRAW+AN LM++M+N +GIVPD+VT+NLVIK
Sbjct: 1054 NGLWNRMLLKGCTPSLATFNVRIQFLVTMRRAWDANNLMDLMQNQLGIVPDQVTFNLVIK 1113

Query: 181  GFCQAGFFDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFP 240
            GF  AG+ DMAKRVYS+L G G+KPN KIYQTMIHYLC+SGDF++AYTMCKD+M +NWFP
Sbjct: 1114 GFFVAGYVDMAKRVYSSLHGKGFKPNAKIYQTMIHYLCKSGDFDVAYTMCKDSMTKNWFP 1173

Query: 241  NIDTIHSLIKGLKKMGQLGKA--------------------------------------- 300
            +++TI  L++GLK  GQ+ K                                        
Sbjct: 1174 DVNTICMLLEGLKGSGQIAKGIAIMTLAILSGIKPRHRPKGSHLLPPHINHLVSEVIRIL 1233

Query: 301  ------------------------------------------------------------ 360
                                                                        
Sbjct: 1234 KTRQWQHSLESRFAESQVLVSDVAHLVIERVHDAELGLMFFDWASTRPFSCSLDAVAHSS 1293

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 1294 LLKLLARFRVFSEIEPVLENMKTQGLNPTREALTALILAYGESGSLHRALQLFHTLREMH 1353

Query: 421  ----------------------------------QDNGDDKCTDNYTTCIMVRGLCLEGR 480
                                               + G     DNY+T I+++GLC  G+
Sbjct: 1354 DCFPSVVASNSLLNGLVQSGKVDVALQLYDKMLQTEGGAVAVVDNYSTSIVMKGLCKLGK 1413

Query: 481  TEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGS 540
             E+ R+LI  RWGKGCVP++VFYN +IDGYCKKG++ SA  +  ELKLKGF+PT+ET+G+
Sbjct: 1414 VEEARRLINDRWGKGCVPHVVFYNMIIDGYCKKGDLLSATRVLKELKLKGFLPTVETYGA 1473

Query: 541  MVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDILKETAENC 600
            ++NGFCK G FEA+D LL EM  RGL+++V+V+NNIIDA+YK G   +A + +K  AE  
Sbjct: 1474 LINGFCKAGEFEAVDQLLTEMAARGLNMNVKVFNNIIDAEYKHGLVAKAAETMKRMAEMG 1533

Query: 601  CEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRAS 660
             EPD+ TYN +IN+ CRGG + EA++++E A +R ++PNKF+YTPL+HAYCKQG+Y +AS
Sbjct: 1534 VEPDITTYNVMINFSCRGGRIKEADELIEMAKERRLLPNKFSYTPLMHAYCKQGDYVKAS 1593

Query: 661  DLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGL 703
            ++L ++++ G K D+VSYGA IHG+VVAGE+D A+ +R++MME+GV PDA IYNVLM+GL
Sbjct: 1594 NMLFKIAEIGGKPDLVSYGAFIHGVVVAGEIDVALMVREKMMEKGVFPDAQIYNVLMSGL 1653

BLAST of MC08g0997 vs. ExPASy TrEMBL
Match: A0A5A7TLP1 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold655G001660 PE=4 SV=1)

HSP 1 Score: 758 bits (1957), Expect = 5.35e-264
Identity = 371/481 (77.13%), Positives = 414/481 (86.07%), Query Frame = 0

Query: 232 AMNRNWFPNIDTIHSLIKGLKKMGQLGKA---------QDNGDDKCTDNYTTCIMVRGLC 291
           A   N  P++   +SL+  L K  +   A         +DNGD    D YTTCIMVRGLC
Sbjct: 160 AKLHNSLPSLYACNSLLNLLVKHRRFETAHQLYDEMVDRDNGDGIHVDYYTTCIMVRGLC 219

Query: 292 LEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLE 351
           LEGR EDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEV SAYELF ELK KGF+PTL+
Sbjct: 220 LEGRIEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVESAYELFKELKTKGFIPTLQ 279

Query: 352 TFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDILKET 411
           TFGS+VNGFCK G FEAIDLLL+EMKDRG SV+VQ+YNNIIDAQYKLGCDI+AKD LKE 
Sbjct: 280 TFGSLVNGFCKMGMFEAIDLLLLEMKDRGFSVNVQIYNNIIDAQYKLGCDIKAKDTLKEM 339

Query: 412 AENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEY 471
           +EN C PDLVTYNTLINYLC  GEV EAEK+LEQ I+RG+ PN+FTYTPLVH YCK+GEY
Sbjct: 340 SENSCVPDLVTYNTLINYLCSRGEVKEAEKLLEQTIRRGLAPNEFTYTPLVHGYCKRGEY 399

Query: 472 YRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVL 531
            RA+DLLIEMS +G ++DM+SYGALIHGLVVAGEVD A+TIRDRMM +G+LPDANIYNVL
Sbjct: 400 TRATDLLIEMSTRGLEIDMISYGALIHGLVVAGEVDIALTIRDRMMNQGILPDANIYNVL 459

Query: 532 MNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGI 591
           MNGLFKKG LSMAKV+LSEMLDQNIAPDAF+YATLVDGFIR  NLDEAKKLFQL IEKG+
Sbjct: 460 MNGLFKKGKLSMAKVVLSEMLDQNIAPDAFVYATLVDGFIRLGNLDEAKKLFQLIIEKGL 519

Query: 592 DPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALK 651
           DPGVVGYN MIKGF KFGMM++A+LCIDRMRSA HVPDVFTFSTIIDGYVKQ ++ A LK
Sbjct: 520 DPGVVGYNVMIKGFSKFGMMDNAILCIDRMRSAHHVPDVFTFSTIIDGYVKQHNMNAVLK 579

Query: 652 IFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLC 703
           IFGLM+KQ+CKPNVVTYTSLINGYC KGE ++AEKLFS+M+SHGLEPSVVTY +LI + C
Sbjct: 580 IFGLMVKQNCKPNVVTYTSLINGYCRKGETEMAEKLFSMMRSHGLEPSVVTYTILIGNFC 639

BLAST of MC08g0997 vs. TAIR 10
Match: AT1G52620.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 548.5 bits (1412), Expect = 7.8e-156
Identity = 267/523 (51.05%), Positives = 366/523 (69.98%), Query Frame = 0

Query: 187 FDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMN-RNWFPNIDTIH 246
           F+  + V   L+    K   +    ++H    SG  + A  +    +   +  P++   +
Sbjct: 115 FNEIEDVLGNLRNENVKLTHEALSHVLHAYAESGSLSKAVEIYDYVVELYDSVPDVIACN 174

Query: 247 SLIKGLKKMGQLGKAQDNGDDKC-----TDNYTTCIMVRGLCLEGRTEDGRKLIESRWGK 306
           SL+  L K  +LG A+   D+ C      DNY+TCI+V+G+C EG+ E GRKLIE RWGK
Sbjct: 175 SLLSLLVKSRRLGDARKVYDEMCDRGDSVDNYSTCILVKGMCNEGKVEVGRKLIEGRWGK 234

Query: 307 GCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAI 366
           GC+PNIVFYNT+I GYCK G++ +AY +F ELKLKGF+PTLETFG+M+NGFCK G+F A 
Sbjct: 235 GCIPNIVFYNTIIGGYCKLGDIENAYLVFKELKLKGFMPTLETFGTMINGFCKEGDFVAS 294

Query: 367 DLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDILKETAENCCEPDLVTYNTLINY 426
           D LL E+K+RGL VSV   NNIIDA+Y+ G  +   + +     N C+PD+ TYN LIN 
Sbjct: 295 DRLLSEVKERGLRVSVWFLNNIIDAKYRHGYKVDPAESIGWIIANDCKPDVATYNILINR 354

Query: 427 LCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVD 486
           LC+ G+   A   L++A K+G++PN  +Y PL+ AYCK  EY  AS LL++M+++G K D
Sbjct: 355 LCKEGKKEVAVGFLDEASKKGLIPNNLSYAPLIQAYCKSKEYDIASKLLLQMAERGCKPD 414

Query: 487 MVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGLFKKGNLSMAKVMLS 546
           +V+YG LIHGLVV+G +D+A+ ++ ++++RGV PDA IYN+LM+GL K G    AK++ S
Sbjct: 415 IVTYGILIHGLVVSGHMDDAVNMKVKLIDRGVSPDAAIYNMLMSGLCKTGRFLPAKLLFS 474

Query: 547 EMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFG 606
           EMLD+NI PDA++YATL+DGFIR  + DEA+K+F L++EKG+   VV +N+MIKGFC+ G
Sbjct: 475 EMLDRNILPDAYVYATLIDGFIRSGDFDEARKVFSLSVEKGVKVDVVHHNAMIKGFCRSG 534

Query: 607 MMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYT 666
           M+++A+ C++RM     VPD FT+STIIDGYVKQ D+  A+KIF  M K  CKPNVVTYT
Sbjct: 535 MLDEALACMNRMNEEHLVPDKFTYSTIIDGYVKQQDMATAIKIFRYMEKNKCKPNVVTYT 594

Query: 667 SLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLCK 704
           SLING+C +G+ K+AE+ F  MQ   L P+VVTY  LIRSL K
Sbjct: 595 SLINGFCCQGDFKMAEETFKEMQLRDLVPNVVTYTTLIRSLAK 637

BLAST of MC08g0997 vs. TAIR 10
Match: AT1G80150.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 357.8 bits (917), Expect = 1.9e-98
Identity = 165/261 (63.22%), Positives = 208/261 (79.69%), Query Frame = 0

Query: 1   IIMLYGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKF 60
           IIMLYGKA M K A+DTF++M LYGC+R+VKSFNA L+VL    DL  I  FL +AP K+
Sbjct: 112 IIMLYGKAGMTKQALDTFFNMDLYGCKRSVKSFNAALQVLSFNPDLHTIWEFLHDAPSKY 171

Query: 61  DIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPDVVTYTTLISAFYKDNRCEIS 120
            I++D +S NI +K+FC++GIL  AY+ M EMEK G+ PDVVTYTTLISA YK  RC I 
Sbjct: 172 GIDIDAVSFNIAIKSFCELGILDGAYMAMREMEKSGLTPDVVTYTTLISALYKHERCVIG 231

Query: 121 NGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKG 180
           NGLWNLMVL+GC PNL +FNVRIQ+LV+RRRAW+AN L+ +M  + + PD +TYN+VIKG
Sbjct: 232 NGLWNLMVLKGCKPNLTTFNVRIQFLVNRRRAWDANDLLLLMPKLQVEPDSITYNMVIKG 291

Query: 181 FCQAGFFDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPN 240
           F  A F DMA+RVY+A+ G GYKPN+KIYQTMIHYLC++G+F+LAYTMCKD M + W+PN
Sbjct: 292 FFLARFPDMAERVYTAMHGKGYKPNLKIYQTMIHYLCKAGNFDLAYTMCKDCMRKKWYPN 351

Query: 241 IDTIHSLIKGLKKMGQLGKAQ 262
           +DT+  L+KGL K GQL +A+
Sbjct: 352 LDTVEMLLKGLVKKGQLDQAK 372

BLAST of MC08g0997 vs. TAIR 10
Match: AT1G74580.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 311.2 bits (796), Expect = 2.1e-84
Identity = 200/659 (30.35%), Positives = 319/659 (48.41%), Query Frame = 0

Query: 5   YGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKFDIEL 64
           YG+   ++ AV+ F  M  Y C  TV S+NA++ VL+ +              ++  I  
Sbjct: 86  YGRKGKVQEAVNVFERMDFYDCEPTVFSYNAIMSVLVDSGYFDQAHKVYMRMRDR-GITP 145

Query: 65  DIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPDVVTYTTLISAFYKDNRCEISNGLW 124
           D+ S  I +K+FC       A  L+  M   G   +VV Y T++  FY++N       L+
Sbjct: 146 DVYSFTIRMKSFCKTSRPHAALRLLNNMSSQGCEMNVVAYCTVVGGFYEENFKAEGYELF 205

Query: 125 NLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQA 184
             M+  G    L++FN  ++ L  +    E  KL++ +   G++P+  TYNL I+G CQ 
Sbjct: 206 GKMLASGVSLCLSTFNKLLRVLCKKGDVKECEKLLDKVIKRGVLPNLFTYNLFIQGLCQR 265

Query: 185 GFFDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTI 244
           G  D A R+   L   G KP+V  Y  +I+ LC++  F  A       +N    P+  T 
Sbjct: 266 GELDGAVRMVGCLIEQGPKPDVITYNNLIYGLCKNSKFQEAEVYLGKMVNEGLEPDSYTY 325

Query: 245 HSLIKGLKKMGQ-------LGKAQDNGDDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESR 304
           ++LI G  K G        +G A  NG     D +T   ++ GLC EG T     L    
Sbjct: 326 NTLIAGYCKGGMVQLAERIVGDAVFNG--FVPDQFTYRSLIDGLCHEGETNRALALFNEA 385

Query: 305 WGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNF 364
            GKG  PN++ YNTLI G   +G +  A +L  E+  KG +P ++TF  +VNG CK G  
Sbjct: 386 LGKGIKPNVILYNTLIKGLSNQGMILEAAQLANEMSEKGLIPEVQTFNILVNGLCKMGCV 445

Query: 365 EAIDLLLMEMKDRGLSVSVQVYNNII---DAQYKLGCDIRAKDILKETAENCCEPDLVTY 424
              D L+  M  +G    +  +N +I     Q K+     A +IL    +N  +PD+ TY
Sbjct: 446 SDADGLVKVMISKGYFPDIFTFNILIHGYSTQLKME---NALEILDVMLDNGVDPDVYTY 505

Query: 425 NTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSK 484
           N+L+N LC+  +  +  +  +  +++G  PN FT+  L+ + C+  +   A  LL EM  
Sbjct: 506 NSLLNGLCKTSKFEDVMETYKTMVEKGCAPNLFTFNILLESLCRYRKLDEALGLLEEMKN 565

Query: 485 KGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMER-GVLPDANIYNVLMNGLFKKGNLS 544
           K    D V++G LI G    G++D A T+  +M E   V      YN++++   +K N++
Sbjct: 566 KSVNPDAVTFGTLIDGFCKNGDLDGAYTLFRKMEEAYKVSSSTPTYNIIIHAFTEKLNVT 625

Query: 545 MAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMI 604
           MA+ +  EM+D+ + PD + Y  +VDGF +  N++   K     +E G  P +     +I
Sbjct: 626 MAEKLFQEMVDRCLGPDGYTYRLMVDGFCKTGNVNLGYKFLLEMMENGFIPSLTTLGRVI 685

Query: 605 KGFCKFGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSC 653
              C    + +A   I RM     VP+    +TI D  V + ++ A   +   +LK+SC
Sbjct: 686 NCLCVEDRVYEAAGIIHRMVQKGLVPE--AVNTICD--VDKKEVAAPKLVLEDLLKKSC 734

BLAST of MC08g0997 vs. TAIR 10
Match: AT5G59900.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 295.0 bits (754), Expect = 1.5e-79
Identity = 165/622 (26.53%), Positives = 305/622 (49.04%), Query Frame = 0

Query: 88  LMLEMEKVGIRPDVVTYTTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLV 147
           L  +M  VGIRPDV  YT +I +  +      +  +   M   GC  N+  +NV I  L 
Sbjct: 214 LFNDMVSVGIRPDVYIYTGVIRSLCELKDLSRAKEMIAHMEATGCDVNIVPYNVLIDGLC 273

Query: 148 DRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGSGYKPNVK 207
            +++ WEA  +   +    + PD VTY  ++ G C+   F++   +   +    + P+  
Sbjct: 274 KKQKVWEAVGIKKDLAGKDLKPDVVTYCTLVYGLCKVQEFEIGLEMMDEMLCLRFSPSEA 333

Query: 208 IYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKAQDNGD-- 267
              +++  L + G    A  + K  ++    PN+   ++LI  L K  +  +A+   D  
Sbjct: 334 AVSSLVEGLRKRGKIEEALNLVKRVVDFGVSPNLFVYNALIDSLCKGRKFHEAELLFDRM 393

Query: 268 ---DKCTDNYTTCIMVRGLCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEV 327
                  ++ T  I++   C  G+ +     +      G   ++  YN+LI+G+CK G++
Sbjct: 394 GKIGLRPNDVTYSILIDMFCRRGKLDTALSFLGEMVDTGLKLSVYPYNSLINGHCKFGDI 453

Query: 328 GSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNI 387
            +A     E+  K   PT+ T+ S++ G+C  G       L  EM  +G++ S+  +  +
Sbjct: 454 SAAEGFMAEMINKKLEPTVVTYTSLMGGYCSKGKINKALRLYHEMTGKGIAPSIYTFTTL 513

Query: 388 IDAQYKLGCDIRAKDILKETAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGM 447
           +   ++ G    A  +  E AE   +P+ VTYN +I   C  G++ +A + L++  ++G+
Sbjct: 514 LSGLFRAGLIRDAVKLFNEMAEWNVKPNRVTYNVMIEGYCEEGDMSKAFEFLKEMTEKGI 573

Query: 448 VPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMT 507
           VP+ ++Y PL+H  C  G+   A   +  + K   +++ + Y  L+HG    G+++ A++
Sbjct: 574 VPDTYSYRPLIHGLCLTGQASEAKVFVDGLHKGNCELNEICYTGLLHGFCREGKLEEALS 633

Query: 508 IRDRMMERGVLPDANIYNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFI 567
           +   M++RGV  D   Y VL++G  K  +  +   +L EM D+ + PD  IY +++D   
Sbjct: 634 VCQEMVQRGVDLDLVCYGVLIDGSLKHKDRKLFFGLLKEMHDRGLKPDDVIYTSMIDAKS 693

Query: 568 RHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVF 627
           +  +  EA  ++ L I +G  P  V Y ++I G CK G + +A +   +M+    VP+  
Sbjct: 694 KTGDFKEAFGIWDLMINEGCVPNEVTYTAVINGLCKAGFVNEAEVLCSKMQPVSSVPNQV 753

Query: 628 TFSTIIDGYVK-QCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSL 687
           T+   +D   K + D+  A+++   +LK     N  TY  LI G+C +G ++ A +L + 
Sbjct: 754 TYGCFLDILTKGEVDMQKAVELHNAILK-GLLANTATYNMLIRGFCRQGRIEEASELITR 813

Query: 688 MQSHGLEPSVVTYGVLIRSLCK 704
           M   G+ P  +TY  +I  LC+
Sbjct: 814 MIGDGVSPDCITYTTMINELCR 834

BLAST of MC08g0997 vs. TAIR 10
Match: AT5G65560.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 282.3 bits (721), Expect = 1.0e-75
Identity = 171/624 (27.40%), Positives = 299/624 (47.92%), Query Frame = 0

Query: 104 YTTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMR 163
           Y TL+++  +    +    ++  M+     PN+ ++N  +          EAN+ ++ + 
Sbjct: 186 YNTLLNSLARFGLVDEMKQVYMEMLEDKVCPNIYTYNKMVNGYCKLGNVEEANQYVSKIV 245

Query: 164 NIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFN 223
             G+ PD  TY  +I G+CQ    D A +V++ +   G + N   Y  +IH LC +   +
Sbjct: 246 EAGLDPDFFTYTSLIMGYCQRKDLDSAFKVFNEMPLKGCRRNEVAYTHLIHGLCVARRID 305

Query: 224 LAYTMCKDAMNRNWFPNIDTIHSLIKGL----KKMGQLGKAQDNGDDKCTDN-YTTCIMV 283
            A  +     +   FP + T   LIK L    +K   L   ++  +     N +T  +++
Sbjct: 306 EAMDLFVKMKDDECFPTVRTYTVLIKSLCGSERKSEALNLVKEMEETGIKPNIHTYTVLI 365

Query: 284 RGLCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFV 343
             LC + + E  R+L+     KG +PN++ YN LI+GYCK+G +  A ++   ++ +   
Sbjct: 366 DSLCSQCKFEKARELLGQMLEKGLMPNVITYNALINGYCKRGMIEDAVDVVELMESRKLS 425

Query: 344 PTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDI 403
           P   T+  ++ G+CK+   +A+  +L +M +R +   V  YN++ID Q + G    A  +
Sbjct: 426 PNTRTYNELIKGYCKSNVHKAMG-VLNKMLERKVLPDVVTYNSLIDGQCRSGNFDSAYRL 485

Query: 404 LKETAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCK 463
           L    +    PD  TY ++I+ LC+   V EA  + +   ++G+ PN   YT L+  YCK
Sbjct: 486 LSLMNDRGLVPDQWTYTSMIDSLCKSKRVEEACDLFDSLEQKGVNPNVVMYTALIDGYCK 545

Query: 464 QGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANI 523
            G+   A  +L +M  K    + +++ ALIHGL   G++  A  + ++M++ G+ P  + 
Sbjct: 546 AGKVDEAHLMLEKMLSKNCLPNSLTFNALIHGLCADGKLKEATLLEEKMVKIGLQPTVST 605

Query: 524 YNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTI 583
             +L++ L K G+   A     +ML     PDA  Y T +  + R   L +A+ +     
Sbjct: 606 DTILIHRLLKDGDFDHAYSRFQQMLSSGTKPDAHTYTTFIQTYCREGRLLDAEDMMAKMR 665

Query: 584 EKGIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFSTIID-----GYVK 643
           E G+ P +  Y+S+IKG+   G    A   + RMR     P   TF ++I       Y K
Sbjct: 666 ENGVSPDLFTYSSLIKGYGDLGQTNFAFDVLKRMRDTGCEPSQHTFLSLIKHLLEMKYGK 725

Query: 644 Q-------------CDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFS 703
           Q              +    +++   M++ S  PN  +Y  LI G C  G L++AEK+F 
Sbjct: 726 QKGSEPELCAMSNMMEFDTVVELLEKMVEHSVTPNAKSYEKLILGICEVGNLRVAEKVFD 785
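
Each HSP header above reports identity and positives as counts over the alignment length, with the percentage in parentheses. As a minimal sketch (not part of the database output), the Python below parses one such header and recomputes the percentages. The function name `parse_hsp` and the regular expression are illustrative assumptions; the sample text is the AT1G52620.1 header from the TAIR 10 results.

```python
import re

# Pattern for the two-line HSP header as printed on this page.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling.
HSP_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s*"
    r"Identity\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\([\d.]+%\),\s*"
    r"Posi?tives\s*=\s*(?P<pos>\d+)/\d+"
)


def parse_hsp(text):
    """Return bit score, E-value and recomputed percentages for one HSP header."""
    m = HSP_RE.search(text)
    if m is None:
        raise ValueError("no HSP header found")
    length = int(m["length"])
    return {
        "bits": float(m["bits"]),
        "evalue": float(m["evalue"]),
        # Percentages are counts divided by the alignment length.
        "identity_pct": round(100 * int(m["ident"]) / length, 2),
        "positives_pct": round(100 * int(m["pos"]) / length, 2),
    }


example = (
    "HSP 1 Score: 548.5 bits (1412), Expect = 7.8e-156\n"
    "Identity = 267/523 (51.05%), Positives = 366/523 (69.98%), Query Frame = 0"
)
print(parse_hsp(example))
```

The recomputed values can be compared against the percentages printed in the header as a quick consistency check.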

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q9SSR4 | 1.1e-154 | 51.05 | Pentatricopeptide repeat-containing protein At1g52620 OS=Arabidopsis thaliana OX...
Q8GW57 | 2.7e-97 | 63.22 | Pentatricopeptide repeat-containing protein At1g80150, mitochondrial OS=Arabidop...
Q9CA58 | 2.9e-83 | 30.35 | Putative pentatricopeptide repeat-containing protein At1g74580 OS=Arabidopsis th...
Q9FJE6 | 2.2e-78 | 26.53 | Putative pentatricopeptide repeat-containing protein At5g59900 OS=Arabidopsis th...
Q9LSL9 | 1.5e-74 | 27.40 | Pentatricopeptide repeat-containing protein At5g65560 OS=Arabidopsis thaliana OX...
Match Name | E-value | Identity (%) | Description
GAV66013.1 | 0.0 | 64.58 | PPR domain-containing protein/PPR_1 domain-containing protein/PPR_2 domain-conta...
XP_022153568.1 | 0.0 | 79.20 | pentatricopeptide repeat-containing protein At1g52620 [Momordica charantia]
RXH68979.1 | 0.0 | 48.66 | hypothetical protein DVH24_031312 [Malus domestica]
XP_024956279.1 | 9.95e-298 | 46.65 | pentatricopeptide repeat-containing protein At1g52620 [Citrus sinensis]
XP_038894903.1 | 5.07e-271 | 79.04 | pentatricopeptide repeat-containing protein At1g52620 [Benincasa hispida]
Match Name | E-value | Identity (%) | Description
A0A1Q3BDZ3 | 0.0 | 64.58 | PPR domain-containing protein/PPR_1 domain-containing protein/PPR_2 domain-conta...
A0A6J1DHT9 | 0.0 | 79.20 | pentatricopeptide repeat-containing protein At1g52620 OS=Momordica charantia OX=...
A0A498HD54 | 0.0 | 48.66 | Uncharacterized protein OS=Malus domestica OX=3750 GN=DVH24_031312 PE=4 SV=1
A0A371FHY8 | 5.28e-270 | 46.60 | Signal peptidase I (Fragment) OS=Mucuna pruriens OX=157652 GN=CR513_41872 PE=3 S...
A0A5A7TLP1 | 5.35e-264 | 77.13 | Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946...
Match Name | E-value | Identity (%) | Description
AT1G52620.1 | 7.8e-156 | 51.05 | Pentatricopeptide repeat (PPR) superfamily protein
AT1G80150.1 | 1.9e-98 | 63.22 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G74580.1 | 2.1e-84 | 30.35 | Pentatricopeptide repeat (PPR) superfamily protein
AT5G59900.1 | 1.5e-79 | 26.53 | Pentatricopeptide repeat (PPR) superfamily protein
AT5G65560.1 | 1.0e-75 | 27.40 | Pentatricopeptide repeat (PPR) superfamily protein
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
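IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 648..703; e-value: 1.1E-18; score: 69.4
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 507..576; e-value: 2.7E-15; score: 58.4
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 577..647; e-value: 1.8E-15; score: 59.0
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 387..506; e-value: 2.8E-30; score: 107.8
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 2..123; e-value: 8.2E-20; score: 73.4
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 124..266; e-value: 1.3E-28; score: 102.3
IPR011990 | Tetratricopeptide-like helical domain superfamily | GENE3D | 1.25.40.10 | Tetratricopeptide repeat domain | coord: 267..386; e-value: 4.4E-24; score: 86.8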
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 65..113
e-value: 4.4E-9
score: 36.4
coord: 514..562
e-value: 2.9E-9
score: 37.0
coord: 654..703
e-value: 1.8E-18
score: 66.5
coord: 409..458
e-value: 1.2E-13
score: 51.0
coord: 134..183
e-value: 1.6E-9
score: 37.8
coord: 584..633
e-value: 3.1E-13
score: 49.7
coord: 304..353
e-value: 1.1E-14
score: 54.4
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 657..691
e-value: 4.7E-9
score: 33.9
coord: 553..585
e-value: 9.7E-5
score: 20.3
coord: 102..135
e-value: 3.1E-4
score: 18.7
coord: 622..656
e-value: 3.7E-8
score: 31.0
coord: 308..340
e-value: 1.3E-7
score: 29.3
coord: 482..516
e-value: 6.5E-7
score: 27.1
coord: 343..375
e-value: 1.0E-5
score: 23.4
coord: 589..621
e-value: 3.1E-8
score: 31.3
coord: 273..306
e-value: 3.1E-4
score: 18.7
coord: 172..206
e-value: 3.5E-7
score: 28.0
coord: 447..480
e-value: 7.5E-6
score: 23.8
coord: 518..551
e-value: 3.9E-7
score: 27.8
coord: 208..240
e-value: 4.0E-5
score: 21.5
coord: 412..445
e-value: 5.7E-9
score: 33.6
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 1..26
e-value: 1.2
score: 9.5
coord: 482..512
e-value: 1.6E-4
score: 21.7
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 340..374
score: 10.215989
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 480..514
score: 11.41077
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 100..134
score: 9.876189
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 515..549
score: 10.873667
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 305..339
score: 12.682281
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 205..239
score: 8.999285
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 550..584
score: 11.114816
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 410..444
score: 12.75901
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 585..619
score: 10.862706
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 170..204
score: 12.495939
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 65..99
score: 9.843305
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 270..304
score: 8.845827
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 620..654
score: 10.785976
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 655..689
score: 13.493418
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 445..479
score: 10.950397
NoneNo IPR availablePANTHERPTHR47938RESPIRATORY COMPLEX I CHAPERONE (CIA84), PUTATIVE (AFU_ORTHOLOGUE AFUA_2G06020)-RELATEDcoord: 5..260
coord: 227..703
NoneNo IPR availablePANTHERPTHR47938:SF15PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 227..703
NoneNo IPR availablePANTHERPTHR47938:SF15PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 5..260
NoneNo IPR availableSUPERFAMILY81901HCP-likecoord: 455..613

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
MC08g0997.1 | MC08g0997.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0032981 | mitochondrial respiratory chain complex I assembly
biological_process | GO:0000963 | mitochondrial RNA processing
biological_process | GO:0008380 | RNA splicing
cellular_component | GO:0005739 | mitochondrion
molecular_function | GO:0016787 | hydrolase activity
molecular_function | GO:0005515 | protein binding
molecular_function | GO:1990825 | sequence-specific mRNA binding