MC06g_new0040 (gene) Bitter gourd (Dali-11) v1

Overview
NameMC06g_new0040
Typegene
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionPentatricopeptide repeat-containing protein
LocationMC06: 1564228 .. 1566311 (+)
RNA-Seq ExpressionMC06g_new0040
SyntenyMC06g_new0040
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAAGTTGTCTTCCGCCAATTTCTTCTCCGCCGTTCTCTGAATCCGCAGCATCATATTTCTCTTCCTCGCTTCGCCTCCACCGCTTCACTCCTTCACTCTCCCAAATCTCTGATTTCCTCCAAATTTCGTCAACATAACTCAACAACTCGCTTCTCAAATGCTCCAATCGAGAGAGAGCCTCTGATTTCTCTCATAAAATCATGCACCCACAAACCACAATTGCTCCAAATCCATGCCCACATTATCCGCACGTCCTCCATCAAAGATCCCATCATTGCCCTCCGCTTCTTGACTCGTGCCGCCACCGCGCCTTTTCGCGAATTGGACTATTCTCGACGATTTTTCTCCCAGCTAACAAACCCATCTGTTTCTCATTACAATGCAATGTTAAGAGCGTATTCTTTGAGCCGCTCACCTCAGGACGGATTGTACGTGTATAGAGATATGGAGAGGCAGGGAATTCGTGCCGATCCATTGTCGTCTTCCTTTGCCATTAAGTCCTGTATAAGGATATTTTCATTACTAAGTGGGGTTCAGATTCACGCGAGGATTTTTAGAAATGGGCATCAGTCGGATAGTCTTTTGCTCACCACTATGATGGACCTGTACTCTCATTGTGGCAAACTTGAGGAAGCATGCAAATTGTTCGACGAAATTCCTCAAAGAGACGTTGTTGCTTGGAACGTTCTGATTTCGTGTCTGACTCGCAATAAACGAACTAGAGATGCTTTGGGTTTATTTGAGATCATGCAGAGTCCAACATATCTCTGCCAACCTGATAAAGTAACTTGTTTGCTTCTCCTCCAAGCGTGTGCAGACTTGAATGCATTGGAATTCGGTGAAAGGATTCATAGTTATATTCAAGAGCGCGGTTATGATACTGAGAGCAATTTGTGTAATTCCCTGATATCTATGTATTCGCGTTGTGGGCGTGTGGATAAGGCATATGAAGTGTTTGACAAAATGCCAGAGAAAAATGTAGTTTCATGGAGTGCAATAATTTCCGGGTTATCAATGAATGGGCACGGGAGAGAAGCTATTGAAGCGTTTTGGGAGATGCAAAAGATGGGTATAGAGCCTGATGATCGTACTTTTACTGGAGTTCTTTCTGCTTGTAGCCACTGTGGTCTGGTCGATGAAGGCATGGCATTCTTTGATCGTATGAGAGAGTTCAAGATAGCTCCTAATGTCCATCACTATGGATGCATGGTTGATCTCTTGGGTCGTGCTGGAATGCTCGACCAAGCCTATCAGCTCGCAATGTCGATGGAGATGAACCCAGATGCGACGTTATGGAGGACCCTTCTTGGAGCTTGTAAAATTCACGGCCATGTAAACCTTGGGGAGCACATAATTGGACATTTGATTGAAGCCAAATCTCAAGAAGCAGGAGATTACGTTCTTTTGCTGAACATTTATTCCTCCGCTGGCAACTGGGACAAGGTAACTGAATTGAGGAAGTTTATGAAGGAGAATGGTATTTATACTACACCTAGCTGCACCACAATAGAACTGAATGGGGTGGTGCATGAGTTTGCTGTGGATGATGTTTCGCATCCGATGAAGGACGAGATTTATGAGCAGCTGGATGAGATCAACAAGCAGCTAAAGATTGCTGGCTATGAATCTGAAATATCATCTGAATTGCACAACTTGAAGGCAGAGGAAAAGGGGTATGCACTTTCTTGCCATAGCGAGAAACTGGCCATAGCTTTTGGGGTTCTTGCAACTCCGCCGGGAAGAACCATCAGAGTGGCTAATAACATTCGTACTTGTGTAGATTGTCATAACTTCGCAAAGTATGTCTCCAGTGTTTATAACAGAAAAGTGGTTGTTAGAGACCGAAGTCGGTTCCACCATTTCCGAGAGGGTCGGTGTTCCTGCAACGATTACTGGTAGCAGCAGATATGCTCTGGACACCATTGATTCTCAGGATATACACCAACCCACTAGGAAGAATTCCCCAAAGTGTGCAGCTTAAACTCATGGCCGTGGGACGTTTCTGAAGATGGGAGAGTTGTTCTGCTTTCACAGGCGAGAATATTATCACAACAGCAGCCGGAGAAAAGGTGGATGTCTAA

mRNA sequence

ATGAAAGTTGTCTTCCGCCAATTTCTTCTCCGCCGTTCTCTGAATCCGCAGCATCATATTTCTCTTCCTCGCTTCGCCTCCACCGCTTCACTCCTTCACTCTCCCAAATCTCTGATTTCCTCCAAATTTCGTCAACATAACTCAACAACTCGCTTCTCAAATGCTCCAATCGAGAGAGAGCCTCTGATTTCTCTCATAAAATCATGCACCCACAAACCACAATTGCTCCAAATCCATGCCCACATTATCCGCACGTCCTCCATCAAAGATCCCATCATTGCCCTCCGCTTCTTGACTCGTGCCGCCACCGCGCCTTTTCGCGAATTGGACTATTCTCGACGATTTTTCTCCCAGCTAACAAACCCATCTGTTTCTCATTACAATGCAATGTTAAGAGCGTATTCTTTGAGCCGCTCACCTCAGGACGGATTGTACGTGTATAGAGATATGGAGAGGCAGGGAATTCGTGCCGATCCATTGTCGTCTTCCTTTGCCATTAAGTCCTGTATAAGGATATTTTCATTACTAAGTGGGGTTCAGATTCACGCGAGGATTTTTAGAAATGGGCATCAGTCGGATAGTCTTTTGCTCACCACTATGATGGACCTGTACTCTCATTGTGGCAAACTTGAGGAAGCATGCAAATTGTTCGACGAAATTCCTCAAAGAGACGTTGTTGCTTGGAACGTTCTGATTTCGTGTCTGACTCGCAATAAACGAACTAGAGATGCTTTGGGTTTATTTGAGATCATGCAGAGTCCAACATATCTCTGCCAACCTGATAAAGTAACTTGTTTGCTTCTCCTCCAAGCGTGTGCAGACTTGAATGCATTGGAATTCGGTGAAAGGATTCATAGTTATATTCAAGAGCGCGGTTATGATACTGAGAGCAATTTGTGTAATTCCCTGATATCTATGTATTCGCGTTGTGGGCGTGTGGATAAGGCATATGAAGTGTTTGACAAAATGCCAGAGAAAAATGTAGTTTCATGGAGTGCAATAATTTCCGGGTTATCAATGAATGGGCACGGGAGAGAAGCTATTGAAGCGTTTTGGGAGATGCAAAAGATGGGTATAGAGCCTGATGATCGTACTTTTACTGGAGTTCTTTCTGCTTGTAGCCACTGTGGTCTGGTCGATGAAGGCATGGCATTCTTTGATCGTATGAGAGAGTTCAAGATAGCTCCTAATGTCCATCACTATGGATGCATGGTTGATCTCTTGGGTCGTGCTGGAATGCTCGACCAAGCCTATCAGCTCGCAATGTCGATGGAGATGAACCCAGATGCGACGTTATGGAGGACCCTTCTTGGAGCTTGTAAAATTCACGGCCATGTAAACCTTGGGGAGCACATAATTGGACATTTGATTGAAGCCAAATCTCAAGAAGCAGGAGATTACGTTCTTTTGCTGAACATTTATTCCTCCGCTGGCAACTGGGACAAGGTAACTGAATTGAGGAAGTTTATGAAGGAGAATGGCGAGAATATTATCACAACAGCAGCCGGAGAAAAGGTGGATGTCTAA

Coding sequence (CDS)

ATGAAAGTTGTCTTCCGCCAATTTCTTCTCCGCCGTTCTCTGAATCCGCAGCATCATATTTCTCTTCCTCGCTTCGCCTCCACCGCTTCACTCCTTCACTCTCCCAAATCTCTGATTTCCTCCAAATTTCGTCAACATAACTCAACAACTCGCTTCTCAAATGCTCCAATCGAGAGAGAGCCTCTGATTTCTCTCATAAAATCATGCACCCACAAACCACAATTGCTCCAAATCCATGCCCACATTATCCGCACGTCCTCCATCAAAGATCCCATCATTGCCCTCCGCTTCTTGACTCGTGCCGCCACCGCGCCTTTTCGCGAATTGGACTATTCTCGACGATTTTTCTCCCAGCTAACAAACCCATCTGTTTCTCATTACAATGCAATGTTAAGAGCGTATTCTTTGAGCCGCTCACCTCAGGACGGATTGTACGTGTATAGAGATATGGAGAGGCAGGGAATTCGTGCCGATCCATTGTCGTCTTCCTTTGCCATTAAGTCCTGTATAAGGATATTTTCATTACTAAGTGGGGTTCAGATTCACGCGAGGATTTTTAGAAATGGGCATCAGTCGGATAGTCTTTTGCTCACCACTATGATGGACCTGTACTCTCATTGTGGCAAACTTGAGGAAGCATGCAAATTGTTCGACGAAATTCCTCAAAGAGACGTTGTTGCTTGGAACGTTCTGATTTCGTGTCTGACTCGCAATAAACGAACTAGAGATGCTTTGGGTTTATTTGAGATCATGCAGAGTCCAACATATCTCTGCCAACCTGATAAAGTAACTTGTTTGCTTCTCCTCCAAGCGTGTGCAGACTTGAATGCATTGGAATTCGGTGAAAGGATTCATAGTTATATTCAAGAGCGCGGTTATGATACTGAGAGCAATTTGTGTAATTCCCTGATATCTATGTATTCGCGTTGTGGGCGTGTGGATAAGGCATATGAAGTGTTTGACAAAATGCCAGAGAAAAATGTAGTTTCATGGAGTGCAATAATTTCCGGGTTATCAATGAATGGGCACGGGAGAGAAGCTATTGAAGCGTTTTGGGAGATGCAAAAGATGGGTATAGAGCCTGATGATCGTACTTTTACTGGAGTTCTTTCTGCTTGTAGCCACTGTGGTCTGGTCGATGAAGGCATGGCATTCTTTGATCGTATGAGAGAGTTCAAGATAGCTCCTAATGTCCATCACTATGGATGCATGGTTGATCTCTTGGGTCGTGCTGGAATGCTCGACCAAGCCTATCAGCTCGCAATGTCGATGGAGATGAACCCAGATGCGACGTTATGGAGGACCCTTCTTGGAGCTTGTAAAATTCACGGCCATGTAAACCTTGGGGAGCACATAATTGGACATTTGATTGAAGCCAAATCTCAAGAAGCAGGAGATTACGTTCTTTTGCTGAACATTTATTCCTCCGCTGGCAACTGGGACAAGGTAACTGAATTGAGGAAGTTTATGAAGGAGAATGGCGAGAATATTATCACAACAGCAGCCGGAGAAAAGGTGGATGTCTAA

Protein sequence

MKVVFRQFLLRRSLNPQHHISLPRFASTASLLHSPKSLISSKFRQHNSTTRFSNAPIEREPLISLIKSCTHKPQLLQIHAHIIRTSSIKDPIIALRFLTRAATAPFRELDYSRRFFSQLTNPSVSHYNAMLRAYSLSRSPQDGLYVYRDMERQGIRADPLSSSFAIKSCIRIFSLLSGVQIHARIFRNGHQSDSLLLTTMMDLYSHCGKLEEACKLFDEIPQRDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHSYIQERGYDTESNLCNSLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAIISGLSMNGHGREAIEAFWEMQKMGIEPDDRTFTGVLSACSHCGLVDEGMAFFDRMREFKIAPNVHHYGCMVDLLGRAGMLDQAYQLAMSMEMNPDATLWRTLLGACKIHGHVNLGEHIIGHLIEAKSQEAGDYVLLLNIYSSAGNWDKVTELRKFMKENGENIITTAAGEKVDV
Homology
BLAST of MC06g_new0040 vs. ExPASy Swiss-Prot
Match: Q9SN85 (Pentatricopeptide repeat-containing protein At3g47530 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H76 PE=2 SV=1)

HSP 1 Score: 532.7 bits (1371), Expect = 4.5e-150
Identity = 264/436 (60.55%), Postives = 334/436 (76.61%), Query Frame = 0

Query: 62  LISLIKSCTHKPQLLQIHAHIIRTSSIKDPIIALRFLTRAATAPF-RELDYSRRFFSQLT 121
           L+SLI S T K  L QIHA ++RTS I++  +   FL+R A +   R+++YS R FSQ  
Sbjct: 14  LLSLIVSSTGKLHLRQIHALLLRTSLIRNSDVFHHFLSRLALSLIPRDINYSCRVFSQRL 73

Query: 122 NPSVSHYNAMLRAYSLSRSPQDGLYVYRDMER-QGIRADPLSSSFAIKSCIRIFSLLSGV 181
           NP++SH N M+RA+SLS++P +G  ++R + R   + A+PLSSSFA+K CI+   LL G+
Sbjct: 74  NPTLSHCNTMIRAFSLSQTPCEGFRLFRSLRRNSSLPANPLSSSFALKCCIKSGDLLGGL 133

Query: 182 QIHARIFRNGHQSDSLLLTTMMDLYSHCGKLEEACKLFDEIPQRDVVAWNVLISCLTRNK 241
           QIH +IF +G  SDSLL+TT+MDLYS C    +ACK+FDEIP+RD V+WNVL SC  RNK
Sbjct: 134 QIHGKIFSDGFLSDSLLMTTLMDLYSTCENSTDACKVFDEIPKRDTVSWNVLFSCYLRNK 193

Query: 242 RTRDALGLFEIMQSPTYLC-QPDKVTCLLLLQACADLNALEFGERIHSYIQERGYDTESN 301
           RTRD L LF+ M++    C +PD VTCLL LQACA+L AL+FG+++H +I E G     N
Sbjct: 194 RTRDVLVLFDKMKNDVDGCVKPDGVTCLLALQACANLGALDFGKQVHDFIDENGLSGALN 253

Query: 302 LCNSLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAIISGLSMNGHGREAIEAFWEMQKMG 361
           L N+L+SMYSRCG +DKAY+VF  M E+NVVSW+A+ISGL+MNG G+EAIEAF EM K G
Sbjct: 254 LSNTLVSMYSRCGSMDKAYQVFYGMRERNVVSWTALISGLAMNGFGKEAIEAFNEMLKFG 313

Query: 362 IEPDDRTFTGVLSACSHCGLVDEGMAFFDRMR--EFKIAPNVHHYGCMVDLLGRAGMLDQ 421
           I P+++T TG+LSACSH GLV EGM FFDRMR  EFKI PN+HHYGC+VDLLGRA +LD+
Sbjct: 314 ISPEEQTLTGLLSACSHSGLVAEGMMFFDRMRSGEFKIKPNLHHYGCVVDLLGRARLLDK 373

Query: 422 AYQLAMSMEMNPDATLWRTLLGACKIHGHVNLGEHIIGHLIEAKSQEAGDYVLLLNIYSS 481
           AY L  SMEM PD+T+WRTLLGAC++HG V LGE +I HLIE K++EAGDYVLLLN YS+
Sbjct: 374 AYSLIKSMEMKPDSTIWRTLLGACRVHGDVELGERVISHLIELKAEEAGDYVLLLNTYST 433

Query: 482 AGNWDKVTELRKFMKE 493
            G W+KVTELR  MKE
Sbjct: 434 VGKWEKVTELRSLMKE 449

BLAST of MC06g_new0040 vs. ExPASy Swiss-Prot
Match: Q9STF3 (Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CRR2 PE=2 SV=1)

HSP 1 Score: 327.0 bits (837), Expect = 3.7e-88
Identity = 178/503 (35.39%), Postives = 287/503 (57.06%), Query Frame = 0

Query: 8   FLLRRSLNPQH---HISLPRFASTASLLHSPKSLISS-----KFRQHNSTTRFSNAPIER 67
           FL R  L P      ++ P  +S A    S   LI S     K +Q        ++P  +
Sbjct: 19  FLPRSPLKPPSCSVALNNPSISSGAGAKISNNQLIQSLCKEGKLKQAIRVLSQESSP-SQ 78

Query: 68  EPLISLIKSCTHKPQL---LQIHAHIIRTSSIKDPIIALRFLTRAATAPFRELDYSRRFF 127
           +    LI  C H+  L   L++H HI+   S +DP +A + +     +    +DY+R+ F
Sbjct: 79  QTYELLILCCGHRSSLSDALRVHRHILDNGSDQDPFLATKLI--GMYSDLGSVDYARKVF 138

Query: 128 SQLTNPSVSHYNAMLRAYSLSRSPQDGLYVYRDMERQGIRADPLSSSFAIKSCI----RI 187
            +    ++  +NA+ RA +L+   ++ L +Y  M R G+ +D  + ++ +K+C+     +
Sbjct: 139 DKTRKRTIYVWNALFRALTLAGHGEEVLGLYWKMNRIGVESDRFTYTYVLKACVASECTV 198

Query: 188 FSLLSGVQIHARIFRNGHQSDSLLLTTMMDLYSHCGKLEEACKLFDEIPQRDVVAWNVLI 247
             L+ G +IHA + R G+ S   ++TT++D+Y+  G ++ A  +F  +P R+VV+W+ +I
Sbjct: 199 NHLMKGKEIHAHLTRRGYSSHVYIMTTLVDMYARFGCVDYASYVFGGMPVRNVVSWSAMI 258

Query: 248 SCLTRNKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHSYIQERG 307
           +C  +N +  +AL  F  M   T    P+ VT + +LQACA L ALE G+ IH YI  RG
Sbjct: 259 ACYAKNGKAFEALRTFREMMRETKDSSPNSVTMVSVLQACASLAALEQGKLIHGYILRRG 318

Query: 308 YDTESNLCNSLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAIISGLSMNGHGREAIEAFW 367
            D+   + ++L++MY RCG+++    VFD+M +++VVSW+++IS   ++G+G++AI+ F 
Sbjct: 319 LDSILPVISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISSYGVHGYGKKAIQIFE 378

Query: 368 EMQKMGIEPDDRTFTGVLSACSHCGLVDEGMAFFDRM-REFKIAPNVHHYGCMVDLLGRA 427
           EM   G  P   TF  VL ACSH GLV+EG   F+ M R+  I P + HY CMVDLLGRA
Sbjct: 379 EMLANGASPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYACMVDLLGRA 438

Query: 428 GMLDQAYQLAMSMEMNPDATLWRTLLGACKIHGHVNLGEHIIGHLIEAKSQEAGDYVLLL 487
             LD+A ++   M   P   +W +LLG+C+IHG+V L E     L   + + AG+YVLL 
Sbjct: 439 NRLDEAAKMVQDMRTEPGPKVWGSLLGSCRIHGNVELAERASRRLFALEPKNAGNYVLLA 498

Query: 488 NIYSSAGNWDKVTELRKFMKENG 495
           +IY+ A  WD+V  ++K ++  G
Sbjct: 499 DIYAEAQMWDEVKRVKKLLEHRG 518

BLAST of MC06g_new0040 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 312.8 bits (800), Expect = 7.3e-84
Identity = 152/403 (37.72%), Postives = 251/403 (62.28%), Query Frame = 0

Query: 109 LDYSRRFFSQLTNPSVSHYNAMLRAYSLSRSPQDGLYVYRDMERQGIRADPLSSSFAIKS 168
           ++ +++ F ++    V  +NAM+  Y+ + + ++ L +++DM +  +R D  +    + +
Sbjct: 216 IENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSA 275

Query: 169 CIRIFSLLSGVQIHARIFRNGHQSDSLLLTTMMDLYSHCGKLEEACKLFDEIPQRDVVAW 228
           C +  S+  G Q+H  I  +G  S+  ++  ++DLYS CG+LE AC LF+ +P +DV++W
Sbjct: 276 CAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISW 335

Query: 229 NVLISCLTRNKRTRDALGLF-EIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHSY 288
           N LI   T     ++AL LF E+++S      P+ VT L +L ACA L A++ G  IH Y
Sbjct: 336 NTLIGGYTHMNLYKEALLLFQEMLRSGE---TPNDVTMLSILPACAHLGAIDIGRWIHVY 395

Query: 289 IQER--GYDTESNLCNSLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAIISGLSMNGHGR 348
           I +R  G    S+L  SLI MY++CG ++ A++VF+ +  K++ SW+A+I G +M+G   
Sbjct: 396 IDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRAD 455

Query: 349 EAIEAFWEMQKMGIEPDDRTFTGVLSACSHCGLVDEGMAFFDRM-REFKIAPNVHHYGCM 408
            + + F  M+K+GI+PDD TF G+LSACSH G++D G   F  M +++K+ P + HYGCM
Sbjct: 456 ASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCM 515

Query: 409 VDLLGRAGMLDQAYQLAMSMEMNPDATLWRTLLGACKIHGHVNLGEHIIGHLIEAKSQEA 468
           +DLLG +G+  +A ++   MEM PD  +W +LL ACK+HG+V LGE    +LI+ + +  
Sbjct: 516 IDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENP 575

Query: 469 GDYVLLLNIYSSAGNWDKVTELRKFMKENGENIITTAAGEKVD 508
           G YVLL NIY+SAG W++V + R  + + G   +   +  ++D
Sbjct: 576 GSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEID 615

BLAST of MC06g_new0040 vs. ExPASy Swiss-Prot
Match: A8MQA3 (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 312.0 bits (798), Expect = 1.2e-83
Identity = 165/427 (38.64%), Postives = 256/427 (59.95%), Query Frame = 0

Query: 74  QLLQIHAHIIRTS-SIKDPIIALRFLTRAATAPF-RELDYSRRFFSQLTNP-SVSHYNAM 133
           +L QIHA  IR   SI D  +    +    + P    + Y+ + FS++  P +V  +N +
Sbjct: 32  KLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSKIEKPINVFIWNTL 91

Query: 134 LRAYSLSRSPQDGLYVYRDMERQG-IRADPLSSSFAIKSCIRIFSLLSGVQIHARIFRNG 193
           +R Y+   +      +YR+M   G +  D  +  F IK+   +  +  G  IH+ + R+G
Sbjct: 92  IRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVRLGETIHSVVIRSG 151

Query: 194 HQSDSLLLTTMMDLYSHCGKLEEACKLFDEIPQRDVVAWNVLISCLTRNKRTRDALGLFE 253
             S   +  +++ LY++CG +  A K+FD++P++D+VAWN +I+    N +  +AL L+ 
Sbjct: 152 FGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFAENGKPEEALALYT 211

Query: 254 IMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHSYIQERGYDTESNLCNSLISMYSR 313
            M S     +PD  T + LL ACA + AL  G+R+H Y+ + G     +  N L+ +Y+R
Sbjct: 212 EMNSKG--IKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSSNVLLDLYAR 271

Query: 314 CGRVDKAYEVFDKMPEKNVVSWSAIISGLSMNGHGREAIEAFWEMQKM-GIEPDDRTFTG 373
           CGRV++A  +FD+M +KN VSW+++I GL++NG G+EAIE F  M+   G+ P + TF G
Sbjct: 272 CGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGLLPCEITFVG 331

Query: 374 VLSACSHCGLVDEGMAFFDRMR-EFKIAPNVHHYGCMVDLLGRAGMLDQAYQLAMSMEMN 433
           +L ACSHCG+V EG  +F RMR E+KI P + H+GCMVDLL RAG + +AY+   SM M 
Sbjct: 332 ILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKKAYEYIKSMPMQ 391

Query: 434 PDATLWRTLLGACKIHGHVNLGEHIIGHLIEAKSQEAGDYVLLLNIYSSAGNWDKVTELR 493
           P+  +WRTLLGAC +HG  +L E     +++ +   +GDYVLL N+Y+S   W  V ++R
Sbjct: 392 PNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYASEQRWSDVQKIR 451

Query: 494 KFMKENG 495
           K M  +G
Sbjct: 452 KQMLRDG 456

BLAST of MC06g_new0040 vs. ExPASy Swiss-Prot
Match: Q9SIL5 (Pentatricopeptide repeat-containing protein At2g20540 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E78 PE=2 SV=1)

HSP 1 Score: 311.6 bits (797), Expect = 1.6e-83
Identity = 151/432 (34.95%), Postives = 259/432 (59.95%), Query Frame = 0

Query: 108 ELDYSRRFFSQLTNPSVSHYNAMLRAYSLSRSPQDGLYVYRDMERQGIR-ADPLSSSFAI 167
           ++DY+ R F+Q++NP+V  YN+++RAY+ +    D + +Y+ + R+     D  +  F  
Sbjct: 57  DMDYATRLFNQVSNPNVFLYNSIIRAYTHNSLYCDVIRIYKQLLRKSFELPDRFTFPFMF 116

Query: 168 KSCIRIFSLLSGVQIHARIFRNGHQSDSLLLTTMMDLYSHCGKLEEACKLFDEIPQRDVV 227
           KSC  + S   G Q+H  + + G +   +    ++D+Y     L +A K+FDE+ +RDV+
Sbjct: 117 KSCASLGSCYLGKQVHGHLCKFGPRFHVVTENALIDMYMKFDDLVDAHKVFDEMYERDVI 176

Query: 228 AWNVLISCLTRNKRTRDALGLFEIMQSPTYL----------------------------- 287
           +WN L+S   R  + + A GLF +M   T +                             
Sbjct: 177 SWNSLLSGYARLGQMKKAKGLFHLMLDKTIVSWTAMISGYTGIGCYVEAMDFFREMQLAG 236

Query: 288 CQPDKVTCLLLLQACADLNALEFGERIHSYIQERGYDTESNLCNSLISMYSRCGRVDKAY 347
            +PD+++ + +L +CA L +LE G+ IH Y + RG+  ++ +CN+LI MYS+CG + +A 
Sbjct: 237 IEPDEISLISVLPSCAQLGSLELGKWIHLYAERRGFLKQTGVCNALIEMYSKCGVISQAI 296

Query: 348 EVFDKMPEKNVVSWSAIISGLSMNGHGREAIEAFWEMQKMGIEPDDRTFTGVLSACSHCG 407
           ++F +M  K+V+SWS +ISG + +G+   AIE F EMQ+  ++P+  TF G+LSACSH G
Sbjct: 297 QLFGQMEGKDVISWSTMISGYAYHGNAHGAIETFNEMQRAKVKPNGITFLGLLSACSHVG 356

Query: 408 LVDEGMAFFDRMR-EFKIAPNVHHYGCMVDLLGRAGMLDQAYQLAMSMEMNPDATLWRTL 467
           +  EG+ +FD MR +++I P + HYGC++D+L RAG L++A ++  +M M PD+ +W +L
Sbjct: 357 MWQEGLRYFDMMRQDYQIEPKIEHYGCLIDVLARAGKLERAVEITKTMPMKPDSKIWGSL 416

Query: 468 LGACKIHGHVNLGEHIIGHLIEAKSQEAGDYVLLLNIYSSAGNWDKVTELRKFMKENGEN 509
           L +C+  G++++    + HL+E + ++ G+YVLL NIY+  G W+ V+ LRK ++   EN
Sbjct: 417 LSSCRTPGNLDVALVAMDHLVELEPEDMGNYVLLANIYADLGKWEDVSRLRKMIR--NEN 476

BLAST of MC06g_new0040 vs. NCBI nr
Match: XP_022135228.1 (pentatricopeptide repeat-containing protein At3g47530 [Momordica charantia])

HSP 1 Score: 998 bits (2580), Expect = 0.0
Identity = 497/508 (97.83%), Postives = 501/508 (98.62%), Query Frame = 0

Query: 1   MKVVFRQFLLRRSLNPQHHISLPRFASTASLLHSPKSLISSKFRQHNSTTRFSNAPIERE 60
           MKVVFRQFLLRRSLNPQHHISLPRFASTASLLHSPKSLISSKFRQHNSTTRFSNAPIERE
Sbjct: 1   MKVVFRQFLLRRSLNPQHHISLPRFASTASLLHSPKSLISSKFRQHNSTTRFSNAPIERE 60

Query: 61  PLISLIKSCTHKPQLLQIHAHIIRTSSIKDPIIALRFLTRAATAPFRELDYSRRFFSQLT 120
           PLISLIKSCTHKPQLLQIHAHIIRTSSIKDPIIALRFLTRAATAPFRELDYSRRFFSQLT
Sbjct: 61  PLISLIKSCTHKPQLLQIHAHIIRTSSIKDPIIALRFLTRAATAPFRELDYSRRFFSQLT 120

Query: 121 NPSVSHYNAMLRAYSLSRSPQDGLYVYRDMERQGIRADPLSSSFAIKSCIRIFSLLSGVQ 180
           NPSVSHYNAMLRAYSLSRSPQDGLYVYRDMERQGIRADPLSSSFAIKSCIRIFSLLSGVQ
Sbjct: 121 NPSVSHYNAMLRAYSLSRSPQDGLYVYRDMERQGIRADPLSSSFAIKSCIRIFSLLSGVQ 180

Query: 181 IHARIFRNGHQSDSLLLTTMMDLYSHCGKLEEACKLFDEIPQRDVVAWNVLISCLTRNKR 240
           IHARIFRNGHQSDSLLLTTMMDLYSHCGKLEEACKLFDEIPQRDVVAWNVLISCLTRNKR
Sbjct: 181 IHARIFRNGHQSDSLLLTTMMDLYSHCGKLEEACKLFDEIPQRDVVAWNVLISCLTRNKR 240

Query: 241 TRDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHSYIQERGYDTESNLC 300
           TRDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHSYIQERGYDTESNLC
Sbjct: 241 TRDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHSYIQERGYDTESNLC 300

Query: 301 NSLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAIISGLSMNGHGREAIEAFWEMQKMGIE 360
           NSLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAIISGLSMNGHGREAIEAFWEMQKMGIE
Sbjct: 301 NSLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAIISGLSMNGHGREAIEAFWEMQKMGIE 360

Query: 361 PDDRTFTGVLSACSHCGLVDEGMAFFDRMREFKIAPNVHHYGCMVDLLGRAGMLDQAYQL 420
           PDDRTFTGVLSACSHCGLVDEGMAFFDRMREFKIAPNVHHYGCMVDLLGRAGMLDQAYQL
Sbjct: 361 PDDRTFTGVLSACSHCGLVDEGMAFFDRMREFKIAPNVHHYGCMVDLLGRAGMLDQAYQL 420

Query: 421 AMSMEMNPDATLWRTLLGACKIHGHVNLGEHIIGHLIEAKSQEAGDYVLLLNIYSSAGNW 480
           AMSMEMNPDATLWRTLLGACKIHGHVNLGEHIIGHLIEAKSQEAGDYVLLLNIYSSAGNW
Sbjct: 421 AMSMEMNPDATLWRTLLGACKIHGHVNLGEHIIGHLIEAKSQEAGDYVLLLNIYSSAGNW 480

Query: 481 DKVTELRKFMKENGENIITTAAGEKVDV 508
           DKVTELRKFMKENG  I TT +   +++
Sbjct: 481 DKVTELRKFMKENG--IYTTPSCTTIEL 506

BLAST of MC06g_new0040 vs. NCBI nr
Match: XP_023515406.1 (pentatricopeptide repeat-containing protein At3g47530 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 846 bits (2186), Expect = 8.03e-305
Identity = 415/478 (86.82%), Postives = 441/478 (92.26%), Query Frame = 0

Query: 18  HHISLPRFASTASLLHSPKSLISSKFRQHNSTTRFSNAPIEREPLISLIKSCTHKPQLLQ 77
           H + LPRFASTASLLHSP SL+SSKFR+ NST RF     +REPLISLIKSCTHK QLLQ
Sbjct: 17  HSLRLPRFASTASLLHSPISLLSSKFREQNSTLRF-----DREPLISLIKSCTHKSQLLQ 76

Query: 78  IHAHIIRTSSIKDPIIALRFLTRAATAPFRELDYSRRFFSQLTNPSVSHYNAMLRAYSLS 137
           IHAH+IRTS I+DPI++LRFLTR  +APFREL YSRRFFSQLTNP VSHYN +LRAYSLS
Sbjct: 77  IHAHMIRTSFIQDPIVSLRFLTRIVSAPFRELGYSRRFFSQLTNPFVSHYNTLLRAYSLS 136

Query: 138 RSPQDGLYVYRDMERQGIRADPLSSSFAIKSCIRIFSLLSGVQIHARIFRNGHQSDSLLL 197
           RSP +GLY+YRDMERQG+ ADPLSSSFA+KSCIR+ SL SG+QIHARIFRNGHQSDSLLL
Sbjct: 137 RSPLEGLYMYRDMERQGVHADPLSSSFAVKSCIRMLSLFSGIQIHARIFRNGHQSDSLLL 196

Query: 198 TTMMDLYSHCGKLEEACKLFDEIPQRDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYL 257
           T+MMDLYSHCGKLE+ACKLFDEIPQRDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYL
Sbjct: 197 TSMMDLYSHCGKLEDACKLFDEIPQRDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYL 256

Query: 258 CQPDKVTCLLLLQACADLNALEFGERIHSYIQERGYDTESNLCNSLISMYSRCGRVDKAY 317
           C+PDKVTCLLLLQACADLNALEFGERIHSYIQ+  Y+TESNLCNSLISMYSRCGRVDKAY
Sbjct: 257 CKPDKVTCLLLLQACADLNALEFGERIHSYIQQNDYNTESNLCNSLISMYSRCGRVDKAY 316

Query: 318 EVFDKMPEKNVVSWSAIISGLSMNGHGREAIEAFWEMQKMGIEPDDRTFTGVLSACSHCG 377
           EVFDKMPEKNVVSWSA+ISGLSMNGHGREAIEAFW MQK G+EPDD TFT VLSACSHCG
Sbjct: 317 EVFDKMPEKNVVSWSAMISGLSMNGHGREAIEAFWAMQKKGVEPDDHTFTAVLSACSHCG 376

Query: 378 LVDEGMAFFDRMR-EFKIAPNVHHYGCMVDLLGRAGMLDQAYQLAMSMEMNPDATLWRTL 437
           LVDEGMAFFDRMR EF I P VHHYGCMVDLLGRAGMLDQAYQL MSME+NPDAT+WRTL
Sbjct: 377 LVDEGMAFFDRMRQEFMIVPTVHHYGCMVDLLGRAGMLDQAYQLVMSMEVNPDATMWRTL 436

Query: 438 LGACKIHGHVNLGEHIIGHLIEAKSQEAGDYVLLLNIYSSAGNWDKVTELRKFMKENG 494
           LGAC+IHGH NLGE II HLIE KSQEAGDYVLLLNIYSSAGNWDKVTELRKFMKE G
Sbjct: 437 LGACRIHGHANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKFMKERG 489

BLAST of MC06g_new0040 vs. NCBI nr
Match: XP_022987181.1 (pentatricopeptide repeat-containing protein At3g47530 [Cucurbita maxima])

HSP 1 Score: 846 bits (2186), Expect = 8.03e-305
Identity = 417/495 (84.24%), Postives = 449/495 (90.71%), Query Frame = 0

Query: 1   MKVVFRQFLLRRSLNPQHHISLPRFASTASLLHSPKSLISSKFRQHNSTTRFSNAPIERE 60
           M V+FR+       +P H + LPRFASTASLLHSP SL+SSKFRQ NST  F     +RE
Sbjct: 1   MTVIFRRCRCSAYRHP-HSLRLPRFASTASLLHSPISLLSSKFRQQNSTLHF-----DRE 60

Query: 61  PLISLIKSCTHKPQLLQIHAHIIRTSSIKDPIIALRFLTRAATAPFRELDYSRRFFSQLT 120
           PLISLIKSCTHK QLLQIHAH+IRTS I+DPI++LRFLTR  +APFREL YSRRFFSQLT
Sbjct: 61  PLISLIKSCTHKSQLLQIHAHMIRTSFIQDPIVSLRFLTRIVSAPFRELGYSRRFFSQLT 120

Query: 121 NPSVSHYNAMLRAYSLSRSPQDGLYVYRDMERQGIRADPLSSSFAIKSCIRIFSLLSGVQ 180
           NP VSHYN +LRAYSLSRSP +GLY+YRDMERQG+ ADPLSSSFA+KSCIR+ SL SG+Q
Sbjct: 121 NPFVSHYNTLLRAYSLSRSPLEGLYMYRDMERQGVHADPLSSSFALKSCIRMLSLFSGIQ 180

Query: 181 IHARIFRNGHQSDSLLLTTMMDLYSHCGKLEEACKLFDEIPQRDVVAWNVLISCLTRNKR 240
           IHARIFRNGHQSDSLLLT+MMDLYSHCGKL++ACKLFDEIPQRDVVAWNVLISCLTRNKR
Sbjct: 181 IHARIFRNGHQSDSLLLTSMMDLYSHCGKLKDACKLFDEIPQRDVVAWNVLISCLTRNKR 240

Query: 241 TRDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHSYIQERGYDTESNLC 300
           TRDALGLFEIMQSPTYLC+PDKVTCLLLLQACADLNALEFGERIHS+IQ+ GY+TESNLC
Sbjct: 241 TRDALGLFEIMQSPTYLCKPDKVTCLLLLQACADLNALEFGERIHSHIQQHGYNTESNLC 300

Query: 301 NSLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAIISGLSMNGHGREAIEAFWEMQKMGIE 360
           NSLISMYSRCGRVDKAYEVFDKMPEKNVVSWSA+ISGLSMNGHGREAIEAFW MQK G+E
Sbjct: 301 NSLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAMISGLSMNGHGREAIEAFWAMQKKGVE 360

Query: 361 PDDRTFTGVLSACSHCGLVDEGMAFFDRMR-EFKIAPNVHHYGCMVDLLGRAGMLDQAYQ 420
           PDD TFT VLSACSHCGLVDEGMAFFDRMR EF I P VHHYGCMVDLLGRAGMLDQAYQ
Sbjct: 361 PDDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFMIVPTVHHYGCMVDLLGRAGMLDQAYQ 420

Query: 421 LAMSMEMNPDATLWRTLLGACKIHGHVNLGEHIIGHLIEAKSQEAGDYVLLLNIYSSAGN 480
           L MSME+NPDAT+WRTLLGAC+IHGH NLGE +I HL+E KSQEAGDYVLLLNIYSSAGN
Sbjct: 421 LVMSMEVNPDATMWRTLLGACRIHGHANLGERVIEHLVELKSQEAGDYVLLLNIYSSAGN 480

Query: 481 WDKVTELRKFMKENG 494
           WDKVTELRKFMKE G
Sbjct: 481 WDKVTELRKFMKERG 489

BLAST of MC06g_new0040 vs. NCBI nr
Match: KAG6589508.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] >KAG7023195.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 846 bits (2185), Expect = 1.14e-304
Identity = 415/478 (86.82%), Postives = 442/478 (92.47%), Query Frame = 0

Query: 18  HHISLPRFASTASLLHSPKSLISSKFRQHNSTTRFSNAPIEREPLISLIKSCTHKPQLLQ 77
           H + LPRFASTASLLHSP SL+SSKFR+ NST RF     +REPLISLIKSCTHK QLLQ
Sbjct: 17  HSLRLPRFASTASLLHSPISLLSSKFREQNSTLRF-----DREPLISLIKSCTHKSQLLQ 76

Query: 78  IHAHIIRTSSIKDPIIALRFLTRAATAPFRELDYSRRFFSQLTNPSVSHYNAMLRAYSLS 137
           IHAH+IRTS I+DPI++LRFLTR  +APFREL YSRRFFSQLTNP VSHYN +LRAYSLS
Sbjct: 77  IHAHMIRTSFIQDPIVSLRFLTRIVSAPFRELGYSRRFFSQLTNPFVSHYNTLLRAYSLS 136

Query: 138 RSPQDGLYVYRDMERQGIRADPLSSSFAIKSCIRIFSLLSGVQIHARIFRNGHQSDSLLL 197
           RSP +GLY+YRDMER+G+ ADPLSSSFA+KSCIR+ SL SGVQIHARIFRNGHQSDSLLL
Sbjct: 137 RSPLEGLYMYRDMERRGVHADPLSSSFAVKSCIRMLSLFSGVQIHARIFRNGHQSDSLLL 196

Query: 198 TTMMDLYSHCGKLEEACKLFDEIPQRDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYL 257
           T+MMDLYSHCGKLE+ACKLFDEIPQRDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYL
Sbjct: 197 TSMMDLYSHCGKLEDACKLFDEIPQRDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYL 256

Query: 258 CQPDKVTCLLLLQACADLNALEFGERIHSYIQERGYDTESNLCNSLISMYSRCGRVDKAY 317
           C+PDKVTCLLLLQACADLNALEFGERIHS+IQ+ GY+TESNLCNSLISMYSRCGRVDKAY
Sbjct: 257 CKPDKVTCLLLLQACADLNALEFGERIHSHIQQHGYNTESNLCNSLISMYSRCGRVDKAY 316

Query: 318 EVFDKMPEKNVVSWSAIISGLSMNGHGREAIEAFWEMQKMGIEPDDRTFTGVLSACSHCG 377
           EVFDKMPEKNVVSWSA+ISGLSMNGHGREAIEAFW MQK G+EPDD TFT VLSACSHCG
Sbjct: 317 EVFDKMPEKNVVSWSAMISGLSMNGHGREAIEAFWAMQKKGVEPDDHTFTAVLSACSHCG 376

Query: 378 LVDEGMAFFDRMR-EFKIAPNVHHYGCMVDLLGRAGMLDQAYQLAMSMEMNPDATLWRTL 437
           LVDEGMAFFDRMR EF I P VHHYGCMVDLLGRAGMLDQAYQL MSME+NPDAT+WRTL
Sbjct: 377 LVDEGMAFFDRMRQEFMIVPTVHHYGCMVDLLGRAGMLDQAYQLVMSMEVNPDATMWRTL 436

Query: 438 LGACKIHGHVNLGEHIIGHLIEAKSQEAGDYVLLLNIYSSAGNWDKVTELRKFMKENG 494
           LGAC+IHGH NLGE II HLIE KSQEAGDYVLLLNIYSSAGNWDKVTELRKFMKE G
Sbjct: 437 LGACRIHGHANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKFMKERG 489

BLAST of MC06g_new0040 vs. NCBI nr
Match: XP_022921651.1 (pentatricopeptide repeat-containing protein At3g47530 [Cucurbita moschata])

HSP 1 Score: 840 bits (2171), Expect = 1.54e-302
Identity = 413/478 (86.40%), Postives = 440/478 (92.05%), Query Frame = 0

Query: 18  HHISLPRFASTASLLHSPKSLISSKFRQHNSTTRFSNAPIEREPLISLIKSCTHKPQLLQ 77
           H + LP FASTASLLHSP SL+SSKFR+ NST RF     +REPLISLIKSCTHK QLLQ
Sbjct: 17  HSLRLPHFASTASLLHSPISLLSSKFREQNSTLRF-----DREPLISLIKSCTHKSQLLQ 76

Query: 78  IHAHIIRTSSIKDPIIALRFLTRAATAPFRELDYSRRFFSQLTNPSVSHYNAMLRAYSLS 137
           IHAH+IRTS I+DPI++LRFLTR  +APFREL YSRRFFSQLTNP VSHYN +LRAYSLS
Sbjct: 77  IHAHMIRTSFIQDPIVSLRFLTRIVSAPFRELGYSRRFFSQLTNPFVSHYNTLLRAYSLS 136

Query: 138 RSPQDGLYVYRDMERQGIRADPLSSSFAIKSCIRIFSLLSGVQIHARIFRNGHQSDSLLL 197
           RSP +GLY+YRDMER+G+ ADPLSSSFA+KSCIR+ SL SGVQIHARIFRNGHQSDSLLL
Sbjct: 137 RSPLEGLYMYRDMERRGVHADPLSSSFAVKSCIRMLSLFSGVQIHARIFRNGHQSDSLLL 196

Query: 198 TTMMDLYSHCGKLEEACKLFDEIPQRDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYL 257
           T+MMDLYSHCGKLE+ACKLFDEIPQRDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYL
Sbjct: 197 TSMMDLYSHCGKLEDACKLFDEIPQRDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYL 256

Query: 258 CQPDKVTCLLLLQACADLNALEFGERIHSYIQERGYDTESNLCNSLISMYSRCGRVDKAY 317
           C+PDKVTCLLLLQACADLNALEFGERIHS+IQ+ GY+TESNLCNSLISMYSRCGRVDKAY
Sbjct: 257 CKPDKVTCLLLLQACADLNALEFGERIHSHIQQHGYNTESNLCNSLISMYSRCGRVDKAY 316

Query: 318 EVFDKMPEKNVVSWSAIISGLSMNGHGREAIEAFWEMQKMGIEPDDRTFTGVLSACSHCG 377
           EVFDKMPEKNVVSWSA+ISGLSMNGHGREAIEAFW MQK G+EPDD TFT VLSACSHCG
Sbjct: 317 EVFDKMPEKNVVSWSAMISGLSMNGHGREAIEAFWAMQKKGVEPDDHTFTAVLSACSHCG 376

Query: 378 LVDEGMAFFDRMR-EFKIAPNVHHYGCMVDLLGRAGMLDQAYQLAMSMEMNPDATLWRTL 437
           LVDEGMAFFDRMR EF I P VHHYGCMVDLLGRAGMLDQAYQL MSME+NPDAT+WRTL
Sbjct: 377 LVDEGMAFFDRMRQEFMIVPTVHHYGCMVDLLGRAGMLDQAYQLVMSMEVNPDATMWRTL 436

Query: 438 LGACKIHGHVNLGEHIIGHLIEAKSQEAGDYVLLLNIYSSAGNWDKVTELRKFMKENG 494
           LGAC+IHGH NLGE II HLIE KSQEAGDYVLLLNIYSSAGNW KVTELRKFMKE G
Sbjct: 437 LGACRIHGHANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWVKVTELRKFMKERG 489

BLAST of MC06g_new0040 vs. ExPASy TrEMBL
Match: A0A6J1C487 (pentatricopeptide repeat-containing protein At3g47530 OS=Momordica charantia OX=3673 GN=LOC111007242 PE=3 SV=1)

HSP 1 Score: 998 bits (2580), Expect = 0.0
Identity = 497/508 (97.83%), Postives = 501/508 (98.62%), Query Frame = 0

Query: 1   MKVVFRQFLLRRSLNPQHHISLPRFASTASLLHSPKSLISSKFRQHNSTTRFSNAPIERE 60
           MKVVFRQFLLRRSLNPQHHISLPRFASTASLLHSPKSLISSKFRQHNSTTRFSNAPIERE
Sbjct: 1   MKVVFRQFLLRRSLNPQHHISLPRFASTASLLHSPKSLISSKFRQHNSTTRFSNAPIERE 60

Query: 61  PLISLIKSCTHKPQLLQIHAHIIRTSSIKDPIIALRFLTRAATAPFRELDYSRRFFSQLT 120
           PLISLIKSCTHKPQLLQIHAHIIRTSSIKDPIIALRFLTRAATAPFRELDYSRRFFSQLT
Sbjct: 61  PLISLIKSCTHKPQLLQIHAHIIRTSSIKDPIIALRFLTRAATAPFRELDYSRRFFSQLT 120

Query: 121 NPSVSHYNAMLRAYSLSRSPQDGLYVYRDMERQGIRADPLSSSFAIKSCIRIFSLLSGVQ 180
           NPSVSHYNAMLRAYSLSRSPQDGLYVYRDMERQGIRADPLSSSFAIKSCIRIFSLLSGVQ
Sbjct: 121 NPSVSHYNAMLRAYSLSRSPQDGLYVYRDMERQGIRADPLSSSFAIKSCIRIFSLLSGVQ 180

Query: 181 IHARIFRNGHQSDSLLLTTMMDLYSHCGKLEEACKLFDEIPQRDVVAWNVLISCLTRNKR 240
           IHARIFRNGHQSDSLLLTTMMDLYSHCGKLEEACKLFDEIPQRDVVAWNVLISCLTRNKR
Sbjct: 181 IHARIFRNGHQSDSLLLTTMMDLYSHCGKLEEACKLFDEIPQRDVVAWNVLISCLTRNKR 240

Query: 241 TRDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHSYIQERGYDTESNLC 300
           TRDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHSYIQERGYDTESNLC
Sbjct: 241 TRDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHSYIQERGYDTESNLC 300

Query: 301 NSLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAIISGLSMNGHGREAIEAFWEMQKMGIE 360
           NSLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAIISGLSMNGHGREAIEAFWEMQKMGIE
Sbjct: 301 NSLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAIISGLSMNGHGREAIEAFWEMQKMGIE 360

Query: 361 PDDRTFTGVLSACSHCGLVDEGMAFFDRMREFKIAPNVHHYGCMVDLLGRAGMLDQAYQL 420
           PDDRTFTGVLSACSHCGLVDEGMAFFDRMREFKIAPNVHHYGCMVDLLGRAGMLDQAYQL
Sbjct: 361 PDDRTFTGVLSACSHCGLVDEGMAFFDRMREFKIAPNVHHYGCMVDLLGRAGMLDQAYQL 420

Query: 421 AMSMEMNPDATLWRTLLGACKIHGHVNLGEHIIGHLIEAKSQEAGDYVLLLNIYSSAGNW 480
           AMSMEMNPDATLWRTLLGACKIHGHVNLGEHIIGHLIEAKSQEAGDYVLLLNIYSSAGNW
Sbjct: 421 AMSMEMNPDATLWRTLLGACKIHGHVNLGEHIIGHLIEAKSQEAGDYVLLLNIYSSAGNW 480

Query: 481 DKVTELRKFMKENGENIITTAAGEKVDV 508
           DKVTELRKFMKENG  I TT +   +++
Sbjct: 481 DKVTELRKFMKENG--IYTTPSCTTIEL 506

BLAST of MC06g_new0040 vs. ExPASy TrEMBL
Match: A0A6J1JDE8 (pentatricopeptide repeat-containing protein At3g47530 OS=Cucurbita maxima OX=3661 GN=LOC111484807 PE=3 SV=1)

HSP 1 Score: 846 bits (2186), Expect = 3.89e-305
Identity = 417/495 (84.24%), Postives = 449/495 (90.71%), Query Frame = 0

Query: 1   MKVVFRQFLLRRSLNPQHHISLPRFASTASLLHSPKSLISSKFRQHNSTTRFSNAPIERE 60
           M V+FR+       +P H + LPRFASTASLLHSP SL+SSKFRQ NST  F     +RE
Sbjct: 1   MTVIFRRCRCSAYRHP-HSLRLPRFASTASLLHSPISLLSSKFRQQNSTLHF-----DRE 60

Query: 61  PLISLIKSCTHKPQLLQIHAHIIRTSSIKDPIIALRFLTRAATAPFRELDYSRRFFSQLT 120
           PLISLIKSCTHK QLLQIHAH+IRTS I+DPI++LRFLTR  +APFREL YSRRFFSQLT
Sbjct: 61  PLISLIKSCTHKSQLLQIHAHMIRTSFIQDPIVSLRFLTRIVSAPFRELGYSRRFFSQLT 120

Query: 121 NPSVSHYNAMLRAYSLSRSPQDGLYVYRDMERQGIRADPLSSSFAIKSCIRIFSLLSGVQ 180
           NP VSHYN +LRAYSLSRSP +GLY+YRDMERQG+ ADPLSSSFA+KSCIR+ SL SG+Q
Sbjct: 121 NPFVSHYNTLLRAYSLSRSPLEGLYMYRDMERQGVHADPLSSSFALKSCIRMLSLFSGIQ 180

Query: 181 IHARIFRNGHQSDSLLLTTMMDLYSHCGKLEEACKLFDEIPQRDVVAWNVLISCLTRNKR 240
           IHARIFRNGHQSDSLLLT+MMDLYSHCGKL++ACKLFDEIPQRDVVAWNVLISCLTRNKR
Sbjct: 181 IHARIFRNGHQSDSLLLTSMMDLYSHCGKLKDACKLFDEIPQRDVVAWNVLISCLTRNKR 240

Query: 241 TRDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHSYIQERGYDTESNLC 300
           TRDALGLFEIMQSPTYLC+PDKVTCLLLLQACADLNALEFGERIHS+IQ+ GY+TESNLC
Sbjct: 241 TRDALGLFEIMQSPTYLCKPDKVTCLLLLQACADLNALEFGERIHSHIQQHGYNTESNLC 300

Query: 301 NSLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAIISGLSMNGHGREAIEAFWEMQKMGIE 360
           NSLISMYSRCGRVDKAYEVFDKMPEKNVVSWSA+ISGLSMNGHGREAIEAFW MQK G+E
Sbjct: 301 NSLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAMISGLSMNGHGREAIEAFWAMQKKGVE 360

Query: 361 PDDRTFTGVLSACSHCGLVDEGMAFFDRMR-EFKIAPNVHHYGCMVDLLGRAGMLDQAYQ 420
           PDD TFT VLSACSHCGLVDEGMAFFDRMR EF I P VHHYGCMVDLLGRAGMLDQAYQ
Sbjct: 361 PDDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFMIVPTVHHYGCMVDLLGRAGMLDQAYQ 420

Query: 421 LAMSMEMNPDATLWRTLLGACKIHGHVNLGEHIIGHLIEAKSQEAGDYVLLLNIYSSAGN 480
           L MSME+NPDAT+WRTLLGAC+IHGH NLGE +I HL+E KSQEAGDYVLLLNIYSSAGN
Sbjct: 421 LVMSMEVNPDATMWRTLLGACRIHGHANLGERVIEHLVELKSQEAGDYVLLLNIYSSAGN 480

Query: 481 WDKVTELRKFMKENG 494
           WDKVTELRKFMKE G
Sbjct: 481 WDKVTELRKFMKERG 489

BLAST of MC06g_new0040 vs. ExPASy TrEMBL
Match: A0A6J1E1Z3 (pentatricopeptide repeat-containing protein At3g47530 OS=Cucurbita moschata OX=3662 GN=LOC111429841 PE=3 SV=1)

HSP 1 Score: 840 bits (2171), Expect = 7.44e-303
Identity = 413/478 (86.40%), Postives = 440/478 (92.05%), Query Frame = 0

Query: 18  HHISLPRFASTASLLHSPKSLISSKFRQHNSTTRFSNAPIEREPLISLIKSCTHKPQLLQ 77
           H + LP FASTASLLHSP SL+SSKFR+ NST RF     +REPLISLIKSCTHK QLLQ
Sbjct: 17  HSLRLPHFASTASLLHSPISLLSSKFREQNSTLRF-----DREPLISLIKSCTHKSQLLQ 76

Query: 78  IHAHIIRTSSIKDPIIALRFLTRAATAPFRELDYSRRFFSQLTNPSVSHYNAMLRAYSLS 137
           IHAH+IRTS I+DPI++LRFLTR  +APFREL YSRRFFSQLTNP VSHYN +LRAYSLS
Sbjct: 77  IHAHMIRTSFIQDPIVSLRFLTRIVSAPFRELGYSRRFFSQLTNPFVSHYNTLLRAYSLS 136

Query: 138 RSPQDGLYVYRDMERQGIRADPLSSSFAIKSCIRIFSLLSGVQIHARIFRNGHQSDSLLL 197
           RSP +GLY+YRDMER+G+ ADPLSSSFA+KSCIR+ SL SGVQIHARIFRNGHQSDSLLL
Sbjct: 137 RSPLEGLYMYRDMERRGVHADPLSSSFAVKSCIRMLSLFSGVQIHARIFRNGHQSDSLLL 196

Query: 198 TTMMDLYSHCGKLEEACKLFDEIPQRDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYL 257
           T+MMDLYSHCGKLE+ACKLFDEIPQRDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYL
Sbjct: 197 TSMMDLYSHCGKLEDACKLFDEIPQRDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYL 256

Query: 258 CQPDKVTCLLLLQACADLNALEFGERIHSYIQERGYDTESNLCNSLISMYSRCGRVDKAY 317
           C+PDKVTCLLLLQACADLNALEFGERIHS+IQ+ GY+TESNLCNSLISMYSRCGRVDKAY
Sbjct: 257 CKPDKVTCLLLLQACADLNALEFGERIHSHIQQHGYNTESNLCNSLISMYSRCGRVDKAY 316

Query: 318 EVFDKMPEKNVVSWSAIISGLSMNGHGREAIEAFWEMQKMGIEPDDRTFTGVLSACSHCG 377
           EVFDKMPEKNVVSWSA+ISGLSMNGHGREAIEAFW MQK G+EPDD TFT VLSACSHCG
Sbjct: 317 EVFDKMPEKNVVSWSAMISGLSMNGHGREAIEAFWAMQKKGVEPDDHTFTAVLSACSHCG 376

Query: 378 LVDEGMAFFDRMR-EFKIAPNVHHYGCMVDLLGRAGMLDQAYQLAMSMEMNPDATLWRTL 437
           LVDEGMAFFDRMR EF I P VHHYGCMVDLLGRAGMLDQAYQL MSME+NPDAT+WRTL
Sbjct: 377 LVDEGMAFFDRMRQEFMIVPTVHHYGCMVDLLGRAGMLDQAYQLVMSMEVNPDATMWRTL 436

Query: 438 LGACKIHGHVNLGEHIIGHLIEAKSQEAGDYVLLLNIYSSAGNWDKVTELRKFMKENG 494
           LGAC+IHGH NLGE II HLIE KSQEAGDYVLLLNIYSSAGNW KVTELRKFMKE G
Sbjct: 437 LGACRIHGHANLGERIIEHLIELKSQEAGDYVLLLNIYSSAGNWVKVTELRKFMKERG 489

BLAST of MC06g_new0040 vs. ExPASy TrEMBL
Match: A0A0A0LUH9 (DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G009750 PE=3 SV=1)

HSP 1 Score: 790 bits (2041), Expect = 1.71e-283
Identity = 386/459 (84.10%), Postives = 418/459 (91.07%), Query Frame = 0

Query: 37  SLISSKFRQHNSTTRFSNAPIEREPLISLIKSCTHKPQLLQIHAHIIRTSSIKDPIIALR 96
           S++S K+  H+     S +  EREPLISLIKSCTHK QLLQIHAHII TSSI+DPI++LR
Sbjct: 9   SILSLKYHHHS----ISFSHFEREPLISLIKSCTHKSQLLQIHAHIITTSSIQDPIVSLR 68

Query: 97  FLTRAATAPFRELDYSRRFFSQLTNPSVSHYNAMLRAYSLSRSPQDGLYVYRDMERQGIR 156
           FLTR A+APFR+L YSRR F  LTNP VSHYNAMLRAYSLSRSP +GLY+YRDMERQG+R
Sbjct: 69  FLTRTASAPFRDLGYSRRLFDLLTNPFVSHYNAMLRAYSLSRSPLEGLYMYRDMERQGVR 128

Query: 157 ADPLSSSFAIKSCIRIFSLLSGVQIHARIFRNGHQSDSLLLTTMMDLYSHCGKLEEACKL 216
           ADPLSSSFA+KSCI++ SLL G+QIHARIF NGHQ+DSLLLT+MMDLYSHCGK EEACKL
Sbjct: 129 ADPLSSSFAVKSCIKLLSLLFGIQIHARIFINGHQADSLLLTSMMDLYSHCGKPEEACKL 188

Query: 217 FDEIPQRDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLN 276
           FDE+PQ+DVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLN
Sbjct: 189 FDEVPQKDVVAWNVLISCLTRNKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLN 248

Query: 277 ALEFGERIHSYIQERGYDTESNLCNSLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAIIS 336
           ALEFGERIH YIQ+ GY+TESNLCNSLISMYSRCGR+DKAYEVFDKM EKNVVSWSA+IS
Sbjct: 249 ALEFGERIHGYIQQHGYNTESNLCNSLISMYSRCGRMDKAYEVFDKMTEKNVVSWSAMIS 308

Query: 337 GLSMNGHGREAIEAFWEMQKMGIEPDDRTFTGVLSACSHCGLVDEGMAFFDRMR-EFKIA 396
           GLSMNGHGREAIEAFWEMQK G+EP D TFT VLSACSHCGLVDEGMAFFDRMR EF IA
Sbjct: 309 GLSMNGHGREAIEAFWEMQKNGVEPGDHTFTAVLSACSHCGLVDEGMAFFDRMRQEFMIA 368

Query: 397 PNVHHYGCMVDLLGRAGMLDQAYQLAMSMEMNPDATLWRTLLGACKIHGHVNLGEHIIGH 456
           PNVHHYGC+VDLLGRAGMLDQAY+L MSME+ PDAT+WRTLLGAC+IHGH NLGE I+ H
Sbjct: 369 PNVHHYGCIVDLLGRAGMLDQAYELIMSMEVRPDATMWRTLLGACRIHGHGNLGERIVEH 428

Query: 457 LIEAKSQEAGDYVLLLNIYSSAGNWDKVTELRKFMKENG 494
           LIE KSQEAGDYVLLLNIYSSAGNWDKVTELRK MKE G
Sbjct: 429 LIELKSQEAGDYVLLLNIYSSAGNWDKVTELRKLMKEKG 463

BLAST of MC06g_new0040 vs. ExPASy TrEMBL
Match: A0A1S3BV40 (pentatricopeptide repeat-containing protein At3g47530 OS=Cucumis melo OX=3656 GN=LOC103493993 PE=3 SV=1)

HSP 1 Score: 778 bits (2008), Expect = 9.64e-279
Identity = 377/434 (86.87%), Postives = 403/434 (92.86%), Query Frame = 0

Query: 62  LISLIKSCTHKPQLLQIHAHIIRTSSIKDPIIALRFLTRAATAPFRELDYSRRFFSQLTN 121
           LISLIKSCTHK QLLQIHAHIIRTSSI+DPI++LRFLTR A+APFR+L YSRRF   LTN
Sbjct: 13  LISLIKSCTHKSQLLQIHAHIIRTSSIQDPIVSLRFLTRTASAPFRDLGYSRRFLDLLTN 72

Query: 122 PSVSHYNAMLRAYSLSRSPQDGLYVYRDMERQGIRADPLSSSFAIKSCIRIFSLLSGVQI 181
           P VSHYNAMLRAYS+SRSP +GLYVYRDMERQG+RADPLSSSFA+KSCI++ SLL G+QI
Sbjct: 73  PLVSHYNAMLRAYSVSRSPLEGLYVYRDMERQGVRADPLSSSFAVKSCIKLLSLLFGIQI 132

Query: 182 HARIFRNGHQSDSLLLTTMMDLYSHCGKLEEACKLFDEIPQRDVVAWNVLISCLTRNKRT 241
           HARIF  GHQ+DSLLLT+MMDLYSHCGK EEACKLFDE+PQ+DVVAWNVLISCLTRNKRT
Sbjct: 133 HARIFIYGHQADSLLLTSMMDLYSHCGKPEEACKLFDEVPQKDVVAWNVLISCLTRNKRT 192

Query: 242 RDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHSYIQERGYDTESNLCN 301
           RDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIH YIQ+  Y+TESNLCN
Sbjct: 193 RDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHGYIQQHCYNTESNLCN 252

Query: 302 SLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAIISGLSMNGHGREAIEAFWEMQKMGIEP 361
           SLISMYSRCGRVDKAYEVFDKMPEKNVVSWSA+ISGLSMNGHGREAIEAFWEMQK G+EP
Sbjct: 253 SLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAMISGLSMNGHGREAIEAFWEMQKNGVEP 312

Query: 362 DDRTFTGVLSACSHCGLVDEGMAFFDRMR-EFKIAPNVHHYGCMVDLLGRAGMLDQAYQL 421
           DD TFT VLSACSHCGLVDEGMAFFDRMR E  IAPNVHHYGC+VDLLGRAGMLDQAY+L
Sbjct: 313 DDHTFTAVLSACSHCGLVDEGMAFFDRMRQELMIAPNVHHYGCIVDLLGRAGMLDQAYEL 372

Query: 422 AMSMEMNPDATLWRTLLGACKIHGHVNLGEHIIGHLIEAKSQEAGDYVLLLNIYSSAGNW 481
            MSME+ PDAT+WRTLLGAC+IHGH NLGE I+ HLIE KSQEAGDYVLLLNIYSSAG W
Sbjct: 373 IMSMEVRPDATMWRTLLGACRIHGHANLGERIVEHLIELKSQEAGDYVLLLNIYSSAGKW 432

Query: 482 DKVTELRKFMKENG 494
           DKVTELRK MKE G
Sbjct: 433 DKVTELRKLMKEKG 446

BLAST of MC06g_new0040 vs. TAIR 10
Match: AT3G47530.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 532.7 bits (1371), Expect = 3.2e-151
Identity = 264/436 (60.55%), Postives = 334/436 (76.61%), Query Frame = 0

Query: 62  LISLIKSCTHKPQLLQIHAHIIRTSSIKDPIIALRFLTRAATAPF-RELDYSRRFFSQLT 121
           L+SLI S T K  L QIHA ++RTS I++  +   FL+R A +   R+++YS R FSQ  
Sbjct: 14  LLSLIVSSTGKLHLRQIHALLLRTSLIRNSDVFHHFLSRLALSLIPRDINYSCRVFSQRL 73

Query: 122 NPSVSHYNAMLRAYSLSRSPQDGLYVYRDMER-QGIRADPLSSSFAIKSCIRIFSLLSGV 181
           NP++SH N M+RA+SLS++P +G  ++R + R   + A+PLSSSFA+K CI+   LL G+
Sbjct: 74  NPTLSHCNTMIRAFSLSQTPCEGFRLFRSLRRNSSLPANPLSSSFALKCCIKSGDLLGGL 133

Query: 182 QIHARIFRNGHQSDSLLLTTMMDLYSHCGKLEEACKLFDEIPQRDVVAWNVLISCLTRNK 241
           QIH +IF +G  SDSLL+TT+MDLYS C    +ACK+FDEIP+RD V+WNVL SC  RNK
Sbjct: 134 QIHGKIFSDGFLSDSLLMTTLMDLYSTCENSTDACKVFDEIPKRDTVSWNVLFSCYLRNK 193

Query: 242 RTRDALGLFEIMQSPTYLC-QPDKVTCLLLLQACADLNALEFGERIHSYIQERGYDTESN 301
           RTRD L LF+ M++    C +PD VTCLL LQACA+L AL+FG+++H +I E G     N
Sbjct: 194 RTRDVLVLFDKMKNDVDGCVKPDGVTCLLALQACANLGALDFGKQVHDFIDENGLSGALN 253

Query: 302 LCNSLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAIISGLSMNGHGREAIEAFWEMQKMG 361
           L N+L+SMYSRCG +DKAY+VF  M E+NVVSW+A+ISGL+MNG G+EAIEAF EM K G
Sbjct: 254 LSNTLVSMYSRCGSMDKAYQVFYGMRERNVVSWTALISGLAMNGFGKEAIEAFNEMLKFG 313

Query: 362 IEPDDRTFTGVLSACSHCGLVDEGMAFFDRMR--EFKIAPNVHHYGCMVDLLGRAGMLDQ 421
           I P+++T TG+LSACSH GLV EGM FFDRMR  EFKI PN+HHYGC+VDLLGRA +LD+
Sbjct: 314 ISPEEQTLTGLLSACSHSGLVAEGMMFFDRMRSGEFKIKPNLHHYGCVVDLLGRARLLDK 373

Query: 422 AYQLAMSMEMNPDATLWRTLLGACKIHGHVNLGEHIIGHLIEAKSQEAGDYVLLLNIYSS 481
           AY L  SMEM PD+T+WRTLLGAC++HG V LGE +I HLIE K++EAGDYVLLLN YS+
Sbjct: 374 AYSLIKSMEMKPDSTIWRTLLGACRVHGDVELGERVISHLIELKAEEAGDYVLLLNTYST 433

Query: 482 AGNWDKVTELRKFMKE 493
            G W+KVTELR  MKE
Sbjct: 434 VGKWEKVTELRSLMKE 449

BLAST of MC06g_new0040 vs. TAIR 10
Match: AT3G46790.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 327.0 bits (837), Expect = 2.7e-89
Identity = 178/503 (35.39%), Postives = 287/503 (57.06%), Query Frame = 0

Query: 8   FLLRRSLNPQH---HISLPRFASTASLLHSPKSLISS-----KFRQHNSTTRFSNAPIER 67
           FL R  L P      ++ P  +S A    S   LI S     K +Q        ++P  +
Sbjct: 19  FLPRSPLKPPSCSVALNNPSISSGAGAKISNNQLIQSLCKEGKLKQAIRVLSQESSP-SQ 78

Query: 68  EPLISLIKSCTHKPQL---LQIHAHIIRTSSIKDPIIALRFLTRAATAPFRELDYSRRFF 127
           +    LI  C H+  L   L++H HI+   S +DP +A + +     +    +DY+R+ F
Sbjct: 79  QTYELLILCCGHRSSLSDALRVHRHILDNGSDQDPFLATKLI--GMYSDLGSVDYARKVF 138

Query: 128 SQLTNPSVSHYNAMLRAYSLSRSPQDGLYVYRDMERQGIRADPLSSSFAIKSCI----RI 187
            +    ++  +NA+ RA +L+   ++ L +Y  M R G+ +D  + ++ +K+C+     +
Sbjct: 139 DKTRKRTIYVWNALFRALTLAGHGEEVLGLYWKMNRIGVESDRFTYTYVLKACVASECTV 198

Query: 188 FSLLSGVQIHARIFRNGHQSDSLLLTTMMDLYSHCGKLEEACKLFDEIPQRDVVAWNVLI 247
             L+ G +IHA + R G+ S   ++TT++D+Y+  G ++ A  +F  +P R+VV+W+ +I
Sbjct: 199 NHLMKGKEIHAHLTRRGYSSHVYIMTTLVDMYARFGCVDYASYVFGGMPVRNVVSWSAMI 258

Query: 248 SCLTRNKRTRDALGLFEIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHSYIQERG 307
           +C  +N +  +AL  F  M   T    P+ VT + +LQACA L ALE G+ IH YI  RG
Sbjct: 259 ACYAKNGKAFEALRTFREMMRETKDSSPNSVTMVSVLQACASLAALEQGKLIHGYILRRG 318

Query: 308 YDTESNLCNSLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAIISGLSMNGHGREAIEAFW 367
            D+   + ++L++MY RCG+++    VFD+M +++VVSW+++IS   ++G+G++AI+ F 
Sbjct: 319 LDSILPVISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISSYGVHGYGKKAIQIFE 378

Query: 368 EMQKMGIEPDDRTFTGVLSACSHCGLVDEGMAFFDRM-REFKIAPNVHHYGCMVDLLGRA 427
           EM   G  P   TF  VL ACSH GLV+EG   F+ M R+  I P + HY CMVDLLGRA
Sbjct: 379 EMLANGASPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYACMVDLLGRA 438

Query: 428 GMLDQAYQLAMSMEMNPDATLWRTLLGACKIHGHVNLGEHIIGHLIEAKSQEAGDYVLLL 487
             LD+A ++   M   P   +W +LLG+C+IHG+V L E     L   + + AG+YVLL 
Sbjct: 439 NRLDEAAKMVQDMRTEPGPKVWGSLLGSCRIHGNVELAERASRRLFALEPKNAGNYVLLA 498

Query: 488 NIYSSAGNWDKVTELRKFMKENG 495
           +IY+ A  WD+V  ++K ++  G
Sbjct: 499 DIYAEAQMWDEVKRVKKLLEHRG 518

BLAST of MC06g_new0040 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 312.8 bits (800), Expect = 5.2e-85
Identity = 152/403 (37.72%), Postives = 251/403 (62.28%), Query Frame = 0

Query: 109 LDYSRRFFSQLTNPSVSHYNAMLRAYSLSRSPQDGLYVYRDMERQGIRADPLSSSFAIKS 168
           ++ +++ F ++    V  +NAM+  Y+ + + ++ L +++DM +  +R D  +    + +
Sbjct: 216 IENAQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSA 275

Query: 169 CIRIFSLLSGVQIHARIFRNGHQSDSLLLTTMMDLYSHCGKLEEACKLFDEIPQRDVVAW 228
           C +  S+  G Q+H  I  +G  S+  ++  ++DLYS CG+LE AC LF+ +P +DV++W
Sbjct: 276 CAQSGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISW 335

Query: 229 NVLISCLTRNKRTRDALGLF-EIMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHSY 288
           N LI   T     ++AL LF E+++S      P+ VT L +L ACA L A++ G  IH Y
Sbjct: 336 NTLIGGYTHMNLYKEALLLFQEMLRSGE---TPNDVTMLSILPACAHLGAIDIGRWIHVY 395

Query: 289 IQER--GYDTESNLCNSLISMYSRCGRVDKAYEVFDKMPEKNVVSWSAIISGLSMNGHGR 348
           I +R  G    S+L  SLI MY++CG ++ A++VF+ +  K++ SW+A+I G +M+G   
Sbjct: 396 IDKRLKGVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRAD 455

Query: 349 EAIEAFWEMQKMGIEPDDRTFTGVLSACSHCGLVDEGMAFFDRM-REFKIAPNVHHYGCM 408
            + + F  M+K+GI+PDD TF G+LSACSH G++D G   F  M +++K+ P + HYGCM
Sbjct: 456 ASFDLFSRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCM 515

Query: 409 VDLLGRAGMLDQAYQLAMSMEMNPDATLWRTLLGACKIHGHVNLGEHIIGHLIEAKSQEA 468
           +DLLG +G+  +A ++   MEM PD  +W +LL ACK+HG+V LGE    +LI+ + +  
Sbjct: 516 IDLLGHSGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENP 575

Query: 469 GDYVLLLNIYSSAGNWDKVTELRKFMKENGENIITTAAGEKVD 508
           G YVLL NIY+SAG W++V + R  + + G   +   +  ++D
Sbjct: 576 GSYVLLSNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEID 615

BLAST of MC06g_new0040 vs. TAIR 10
Match: AT4G21065.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 312.0 bits (798), Expect = 8.8e-85
Identity = 165/427 (38.64%), Postives = 256/427 (59.95%), Query Frame = 0

Query: 74  QLLQIHAHIIRTS-SIKDPIIALRFLTRAATAPF-RELDYSRRFFSQLTNP-SVSHYNAM 133
           +L QIHA  IR   SI D  +    +    + P    + Y+ + FS++  P +V  +N +
Sbjct: 32  KLRQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSKIEKPINVFIWNTL 91

Query: 134 LRAYSLSRSPQDGLYVYRDMERQG-IRADPLSSSFAIKSCIRIFSLLSGVQIHARIFRNG 193
           +R Y+   +      +YR+M   G +  D  +  F IK+   +  +  G  IH+ + R+G
Sbjct: 92  IRGYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVRLGETIHSVVIRSG 151

Query: 194 HQSDSLLLTTMMDLYSHCGKLEEACKLFDEIPQRDVVAWNVLISCLTRNKRTRDALGLFE 253
             S   +  +++ LY++CG +  A K+FD++P++D+VAWN +I+    N +  +AL L+ 
Sbjct: 152 FGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFAENGKPEEALALYT 211

Query: 254 IMQSPTYLCQPDKVTCLLLLQACADLNALEFGERIHSYIQERGYDTESNLCNSLISMYSR 313
            M S     +PD  T + LL ACA + AL  G+R+H Y+ + G     +  N L+ +Y+R
Sbjct: 212 EMNSKG--IKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSSNVLLDLYAR 271

Query: 314 CGRVDKAYEVFDKMPEKNVVSWSAIISGLSMNGHGREAIEAFWEMQKM-GIEPDDRTFTG 373
           CGRV++A  +FD+M +KN VSW+++I GL++NG G+EAIE F  M+   G+ P + TF G
Sbjct: 272 CGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGLLPCEITFVG 331

Query: 374 VLSACSHCGLVDEGMAFFDRMR-EFKIAPNVHHYGCMVDLLGRAGMLDQAYQLAMSMEMN 433
           +L ACSHCG+V EG  +F RMR E+KI P + H+GCMVDLL RAG + +AY+   SM M 
Sbjct: 332 ILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKKAYEYIKSMPMQ 391

Query: 434 PDATLWRTLLGACKIHGHVNLGEHIIGHLIEAKSQEAGDYVLLLNIYSSAGNWDKVTELR 493
           P+  +WRTLLGAC +HG  +L E     +++ +   +GDYVLL N+Y+S   W  V ++R
Sbjct: 392 PNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYASEQRWSDVQKIR 451

Query: 494 KFMKENG 495
           K M  +G
Sbjct: 452 KQMLRDG 456

BLAST of MC06g_new0040 vs. TAIR 10
Match: AT2G20540.1 (mitochondrial editing factor 21 )

HSP 1 Score: 311.6 bits (797), Expect = 1.2e-84
Identity = 151/432 (34.95%), Postives = 259/432 (59.95%), Query Frame = 0

Query: 108 ELDYSRRFFSQLTNPSVSHYNAMLRAYSLSRSPQDGLYVYRDMERQGIR-ADPLSSSFAI 167
           ++DY+ R F+Q++NP+V  YN+++RAY+ +    D + +Y+ + R+     D  +  F  
Sbjct: 57  DMDYATRLFNQVSNPNVFLYNSIIRAYTHNSLYCDVIRIYKQLLRKSFELPDRFTFPFMF 116

Query: 168 KSCIRIFSLLSGVQIHARIFRNGHQSDSLLLTTMMDLYSHCGKLEEACKLFDEIPQRDVV 227
           KSC  + S   G Q+H  + + G +   +    ++D+Y     L +A K+FDE+ +RDV+
Sbjct: 117 KSCASLGSCYLGKQVHGHLCKFGPRFHVVTENALIDMYMKFDDLVDAHKVFDEMYERDVI 176

Query: 228 AWNVLISCLTRNKRTRDALGLFEIMQSPTYL----------------------------- 287
           +WN L+S   R  + + A GLF +M   T +                             
Sbjct: 177 SWNSLLSGYARLGQMKKAKGLFHLMLDKTIVSWTAMISGYTGIGCYVEAMDFFREMQLAG 236

Query: 288 CQPDKVTCLLLLQACADLNALEFGERIHSYIQERGYDTESNLCNSLISMYSRCGRVDKAY 347
            +PD+++ + +L +CA L +LE G+ IH Y + RG+  ++ +CN+LI MYS+CG + +A 
Sbjct: 237 IEPDEISLISVLPSCAQLGSLELGKWIHLYAERRGFLKQTGVCNALIEMYSKCGVISQAI 296

Query: 348 EVFDKMPEKNVVSWSAIISGLSMNGHGREAIEAFWEMQKMGIEPDDRTFTGVLSACSHCG 407
           ++F +M  K+V+SWS +ISG + +G+   AIE F EMQ+  ++P+  TF G+LSACSH G
Sbjct: 297 QLFGQMEGKDVISWSTMISGYAYHGNAHGAIETFNEMQRAKVKPNGITFLGLLSACSHVG 356

Query: 408 LVDEGMAFFDRMR-EFKIAPNVHHYGCMVDLLGRAGMLDQAYQLAMSMEMNPDATLWRTL 467
           +  EG+ +FD MR +++I P + HYGC++D+L RAG L++A ++  +M M PD+ +W +L
Sbjct: 357 MWQEGLRYFDMMRQDYQIEPKIEHYGCLIDVLARAGKLERAVEITKTMPMKPDSKIWGSL 416

Query: 468 LGACKIHGHVNLGEHIIGHLIEAKSQEAGDYVLLLNIYSSAGNWDKVTELRKFMKENGEN 509
           L +C+  G++++    + HL+E + ++ G+YVLL NIY+  G W+ V+ LRK ++   EN
Sbjct: 417 LSSCRTPGNLDVALVAMDHLVELEPEDMGNYVLLANIYADLGKWEDVSRLRKMIR--NEN 476

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9SN854.5e-15060.55Pentatricopeptide repeat-containing protein At3g47530 OS=Arabidopsis thaliana OX... [more]
Q9STF33.7e-8835.39Pentatricopeptide repeat-containing protein At3g46790, chloroplastic OS=Arabidop... [more]
Q9LN017.3e-8437.72Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
A8MQA31.2e-8338.64Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX... [more]
Q9SIL51.6e-8334.95Pentatricopeptide repeat-containing protein At2g20540 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
XP_022135228.10.097.83pentatricopeptide repeat-containing protein At3g47530 [Momordica charantia][more]
XP_023515406.18.03e-30586.82pentatricopeptide repeat-containing protein At3g47530 [Cucurbita pepo subsp. pep... [more]
XP_022987181.18.03e-30584.24pentatricopeptide repeat-containing protein At3g47530 [Cucurbita maxima][more]
KAG6589508.11.14e-30486.82Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
XP_022921651.11.54e-30286.40pentatricopeptide repeat-containing protein At3g47530 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
A0A6J1C4870.097.83pentatricopeptide repeat-containing protein At3g47530 OS=Momordica charantia OX=... [more]
A0A6J1JDE83.89e-30584.24pentatricopeptide repeat-containing protein At3g47530 OS=Cucurbita maxima OX=366... [more]
A0A6J1E1Z37.44e-30386.40pentatricopeptide repeat-containing protein At3g47530 OS=Cucurbita moschata OX=3... [more]
A0A0A0LUH91.71e-28384.10DYW_deaminase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G0097... [more]
A0A1S3BV409.64e-27986.87pentatricopeptide repeat-containing protein At3g47530 OS=Cucumis melo OX=3656 GN... [more]
Match NameE-valueIdentityDescription
AT3G47530.13.2e-15160.55Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G46790.12.7e-8935.39Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G08070.15.2e-8537.72Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G21065.18.8e-8538.64Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G20540.11.2e-8434.95mitochondrial editing factor 21 [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 278..375
e-value: 1.3E-26
score: 95.1
coord: 174..277
e-value: 5.5E-20
score: 73.5
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 62..173
e-value: 3.3E-6
score: 28.7
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 378..506
e-value: 6.1E-19
score: 70.6
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 223..273
e-value: 5.9E-8
score: 32.8
coord: 326..374
e-value: 1.4E-12
score: 47.6
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 126..155
e-value: 0.0031
score: 17.6
coord: 400..425
e-value: 0.78
score: 10.1
coord: 467..494
e-value: 0.021
score: 15.0
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 198..226
e-value: 3.0E-4
score: 18.8
coord: 226..253
e-value: 7.5E-5
score: 20.6
coord: 299..328
e-value: 2.3E-7
score: 28.6
coord: 365..398
e-value: 2.5E-6
score: 25.3
coord: 127..158
e-value: 3.4E-6
score: 24.9
coord: 329..362
e-value: 4.2E-6
score: 24.6
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 301..323
e-value: 3.0E-6
score: 26.9
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 362..396
score: 10.796938
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 193..227
score: 9.843305
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 296..326
score: 10.369448
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 123..157
score: 10.018685
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 327..361
score: 11.695765
NoneNo IPR availablePANTHERPTHR47928:SF63OS08G0434000 PROTEINcoord: 54..500
NoneNo IPR availablePANTHERPTHR47928REPEAT-CONTAINING PROTEIN, PUTATIVE-RELATEDcoord: 54..500

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
MC06g_new0040.1MC06g_new0040.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:1900865 chloroplast RNA modification
cellular_component GO:0009507 chloroplast
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding