MC04g1418 (gene) Bitter gourd (Dali-11) v1

Overview
Name: MC04g1418
Type: gene
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Pentatricopeptide repeat-containing protein
Location: MC04: 22126688 .. 22130127 (-)
RNA-Seq Expression: MC04g1418
Synteny: MC04g1418
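
The location value packs the chromosome, start, end and strand into a single string. As a minimal sketch, it could be split apart as follows. The helper name `parse_location` is hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re

def parse_location(text):
    """Split a 'CHR: START .. END (STRAND)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }

loc = parse_location("MC04: 22126688 .. 22130127 (-)")
print(loc)
```

The `(-)` strand means the gene is annotated on the reverse strand of MC04, so the displayed sequences would be the reverse complement of the reference at those coordinates.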
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AGAAAATGGATAAAAATGATGAATTATCCATCTTCGAATCGAGCTGATTATTAACGATCCATTTCCCTGATCCATTATTAACGATCCATTTTGCAAAGGTTCAAACTGTGGACTCTTTGAAGAGTGCTTATTCTTTTTAAAACCAGAGCAGAAAAAGATTCAAACAACAGTTCTTCATCTACACAGTAATTTGTTGTATCGATAGCTCTGGGGCAGCTGCCTGAGATGTTCTGTTCTTCGGAGCAATTTAATGGGATTTCTCCATGTTCGCAATTTCGCAGCAAATTCAGGAAGAAACAGAGAGAGGCGGCTCTTTTTACTTCTCAGAGTTCGTCTTCGCACCTGCCAATTTTCTGCAGCATCATCATTCATTGTTGAATCTGGGAAAGGAACCTCAGAGGATTTCTACGCTACCCATTTCTCGTATGTGCACGAGCTACTGAAATTATGTGCCGAAAGAAGATTGCCCATACAAGGGAAGGCTTGCCATGCCCAAATTCTGCTTATGGGGTTGCAGAAAGACACTTCAACGTCAAACATTCTTATCAACATGTACTCGAAATGTGGGTTAGTTGACTTTGCCCGCAAGGTGTTTGATGAAATGCCCAACCGAAATTTGGTATCGTGGAGCACCATGATTGGGTCGCTTACACAGAACGGAGAGGAGAATCAGGCTCTTGGTCTTTTGCTTCAGATGAAAAGAGAAGGAATCCCTTTTAGTGAGTTCACCATTTCGAGTGTTCTTTGTGCCTGTGCGGCAAAATGTGCTCTCTATGAATGCCAGCTGCTCCATGCCTTTGCTATTAAGGCCGCGATGAATCTGAATGTTTTTGTTGCAACTGCATTGCTTGATGTTTATGCAAAATGTGGTTTGATGAACGATGCGGCTCGTGTTTTCGAGTCCATGCCTGAGAGGAGTGCTGTCACGTGGAGTTCGATGGCGGCGGGGTACGTGCAGAATGAGCTATATGAGGAAGCTTTGGCATTGTTTCGCAAAGCTCAGGGAATCGGGTTGAGACAGGACCAGTTTTTTATGTCATCTTTGGTTTGTGCTTGTGCTAGATTGGCAGCCATGATTGAAGGGAACCAGGTGAATGCTTTGCTATCTAAATCTGGTTTTTGTTCAAATATTTTTGTTGCTTCTTCTCTTATTGATATGTATGCAAAATGTGGTGGCATTGAGGAAGCTTATAAAGTGTTTCGAGATGTAGAAGAGAGAAATGTTGTTTTGTGGAATGCTATGATATCTGGCCTATCAAGACACGCTCGTTCGCTTGAGGTGATGATTTTATTCGAGAAAATGCAGCAGATGGGCTTGAGTCCAAATGATGTAACTTTTGTTTCTGTCTTGTCTGCTTGTGGTCATATGGGTTTGGTTGAAAAAGGACAGAAATATTTTGACCTGATGATAAAAGAGCATCATTTGTCACCAAATGTCCTTCACTATTCTTGCATGGTGGACACTCTTAGTCGGGCGGGGCAGACCTTCAAGGCTTATGATTTGATGAGTAATATGGCCTTTAATGCCACTGCTTCTATGTGGGGTTCCCTTTTGGCTTCTTGTAGGACCCATGGGAATCTTGAACTTGCTGAGGTTGCTGCAAGAAATTTGTTTGAGATTGAACCTCACAATGCGGGAAACTATTTGTTGCTGTCGAACATGTATGCAGCACACGGGAAGTGGGACGAAGTGGCAAAGGCAAGGAAGCTCCTTAAAGAAAGTGATGTGAAGAAAGAGAGGGGCAAGAGTTGGATTGAGATCAAGGACAAGGTTCACTCGTTTATGGTTGGAGAGAGGAATCATCCTAAGATTTCTGAAATTTACTCAAAATTGAATGAGTTGATTGAAGAGTTACAGAAACTTGGTTACAAGGCGGAGACCGAGCACGACCTTCATCAAGTGGGAGAGAGTAGAAAACAAGTACTTCTGAGGCACCACAGCGAGAAGCTTGCTCTTACTATGGGGTTACTGTTTTTATCTCCCAATGCACCTATTAGGA
TTATGAAAAACCTTAGAATATGTGGAGACTGCCACTCTTTTATGAAGCTTGCATCGAGATTTGTTCGGAGGGATGTCATAGTCAGGGACACCAACCGATTTCACCATTTTAAGAATGGGAGTTGTTCTTGCGGGGATTTTTGGTGAATTCAGATTCTTTTGAAGTGCACCTATTCCTCTTTTAGTTTGTCAAACGTGCAAGTGCAGTAAAAAGCTTTGGAAGTTTAGAGTTTAAACTCTCTCTCCCTAATGTCAACCTCAATGCATTTGATCATCCAAACTCATCCTTGAAGATTGACCCCTGCCACCAGTTTCTTGCTCATAGCACGACGGAATTCTCAGATTGTGGTCCGACATGCTGCGTCGATGGCTCGATATGACCAGCTCTACTAACTTTTATGTACCATCCAGTTTGCAACCCCTACTACAACCACTCGCAAGACCTTCGTAACTAAATTTATTTTTATCATTCAAGCCTTATGTTTAAGCTACTGCAGGATTGATAAGATGTGGAAATGGTGATAAATCCTACAAGATTCTTTCCAGGTAAGTTGATCCCATTTTATAGTTGATTAGTTTCTTTGATATCACAAGCTAGAGTAAGTCGAGTTGCGCCCCAAAGGTTGTGCGGGAGCGTGATTTAATATTTTGGGGGATGCTCTTATCAAAATATTGCTGATTCATGAGGTACGCAATTGGTTTAGCGCCAATTGCTAGATTCTATTACTATCCCAGAAATATAGGAGCCATTCCATAGATTTTGGACTGAATCTTTGAGGTTGTACCGAGCAAAAAATCTTGGTATAAACCAAGAAAACTTGCAACTTAAAAGATACTGTTGTTCTGTTATGAAGAGTGCTTTTTTCATTTTCAGAGCATCTGACTACTTGATCTGCACTGGGTTTTAAGCTCACCAGATCCCACTTTCAGTCAAGACTGGTTGGGGCTCTAGAAAGGAAGAATTTACATCGATCACGATGCGAGCACAAGGACATATTGTCATTTGTAAATGCACTAAGAAGAAGAAATGAAGCCAAGCCAAACTTTTCCGACTATTCCCCATCAAGTACTTGTTGTTCAATATCTTGGCTAGTTCCATCAAGTACTTGTTGTTCAATATCTTGGCTATCAAAAAGTTGAACTAGCAGCTCCTACCCAACCCAAAGTGGATTGATGGAGAAGGGAAGGGCAGGTAGAGCCTCACCTGACTGAGAGAAAGCGAATACGAATAGCACACCTTGTCTAGTATATGTTAGGCTCCGCTTGAAGAGGTGAGTCCAACGTAGAGAAAGACATTACGTGCTAGCTTAGTTAGCTAGCATTTTTCAACTGGTTGTTTTGTTAGTTGTAGTGGACATAAAATGTTCATCTATGAAAATATTACTTCTCTTACACAAAAGTCCTTTTAAGTTTCACTTATTTAACTTGACGCATATACGGG

mRNA sequence

AGAAAATGGATAAAAATGATGAATTATCCATCTTCGAATCGAGCTGATTATTAACGATCCATTTCCCTGATCCATTATTAACGATCCATTTTGCAAAGGTTCAAACTGTGGACTCTTTGAAGAGTGCTTATTCTTTTTAAAACCAGAGCAGAAAAAGATTCAAACAACAGTTCTTCATCTACACAGTAATTTGTTGTATCGATAGCTCTGGGGCAGCTGCCTGAGATGTTCTGTTCTTCGGAGCAATTTAATGGGATTTCTCCATGTTCGCAATTTCGCAGCAAATTCAGGAAGAAACAGAGAGAGGCGGCTCTTTTTACTTCTCAGAGTTCGTCTTCGCACCTGCCAATTTTCTGCAGCATCATCATTCATTGTTGAATCTGGGAAAGGAACCTCAGAGGATTTCTACGCTACCCATTTCTCGTATGTGCACGAGCTACTGAAATTATGTGCCGAAAGAAGATTGCCCATACAAGGGAAGGCTTGCCATGCCCAAATTCTGCTTATGGGGTTGCAGAAAGACACTTCAACGTCAAACATTCTTATCAACATGTACTCGAAATGTGGGTTAGTTGACTTTGCCCGCAAGGTGTTTGATGAAATGCCCAACCGAAATTTGGTATCGTGGAGCACCATGATTGGGTCGCTTACACAGAACGGAGAGGAGAATCAGGCTCTTGGTCTTTTGCTTCAGATGAAAAGAGAAGGAATCCCTTTTAGTGAGTTCACCATTTCGAGTGTTCTTTGTGCCTGTGCGGCAAAATGTGCTCTCTATGAATGCCAGCTGCTCCATGCCTTTGCTATTAAGGCCGCGATGAATCTGAATGTTTTTGTTGCAACTGCATTGCTTGATGTTTATGCAAAATGTGGTTTGATGAACGATGCGGCTCGTGTTTTCGAGTCCATGCCTGAGAGGAGTGCTGTCACGTGGAGTTCGATGGCGGCGGGGTACGTGCAGAATGAGCTATATGAGGAAGCTTTGGCATTGTTTCGCAAAGCTCAGGGAATCGGGTTGAGACAGGACCAGTTTTTTATGTCATCTTTGGTTTGTGCTTGTGCTAGATTGGCAGCCATGATTGAAGGGAACCAGGTGAATGCTTTGCTATCTAAATCTGGTTTTTGTTCAAATATTTTTGTTGCTTCTTCTCTTATTGATATGTATGCAAAATGTGGTGGCATTGAGGAAGCTTATAAAGTGTTTCGAGATGTAGAAGAGAGAAATGTTGTTTTGTGGAATGCTATGATATCTGGCCTATCAAGACACGCTCGTTCGCTTGAGGTGATGATTTTATTCGAGAAAATGCAGCAGATGGGCTTGAGTCCAAATGATGTAACTTTTGTTTCTGTCTTGTCTGCTTGTGGTCATATGGGTTTGGTTGAAAAAGGACAGAAATATTTTGACCTGATGATAAAAGAGCATCATTTGTCACCAAATGTCCTTCACTATTCTTGCATGGTGGACACTCTTAGTCGGGCGGGGCAGACCTTCAAGGCTTATGATTTGATGAGTAATATGGCCTTTAATGCCACTGCTTCTATGTGGGGTTCCCTTTTGGCTTCTTGTAGGACCCATGGGAATCTTGAACTTGCTGAGGTTGCTGCAAGAAATTTGTTTGAGATTGAACCTCACAATGCGGGAAACTATTTGTTGCTGTCGAACATGTATGCAGCACACGGGAAGTGGGACGAAGTGGCAAAGGCAAGGAAGCTCCTTAAAGAAAGTGATGTGAAGAAAGAGAGGGGCAAGAGTTGGATTGAGATCAAGGACAAGGTTCACTCGTTTATGGTTGGAGAGAGGAATCATCCTAAGATTTCTGAAATTTACTCAAAATTGAATGAGTTGATTGAAGAGTTACAGAAACTTGGTTACAAGGCGGAGACCGAGCACGACCTTCATCAAGTGGGAGAGAGTAGAAAACAAGTACTTCTGAGGCACCACAGCGAGAAGCTTGCTCTTACTATGGGGTTACTGTTTTTATCTCCCAATGCACCTATTAGGA
TTATGAAAAACCTTAGAATATGTGGAGACTGCCACTCTTTTATGAAGCTTGCATCGAGATTTGTTCGGAGGGATGTCATAGTCAGGGACACCAACCGATTTCACCATTTTAAGAATGGGAGTTGTTCTTGCGGGGATTTTTGGTGAATTCAGATTCTTTTGAAGTGCACCTATTCCTCTTTTAGTTTGTCAAACGTGCAAGTGCAGTAAAAAGCTTTGGAAGTTTAGAGTTTAAACTCTCTCTCCCTAATGTCAACCTCAATGCATTTGATCATCCAAACTCATCCTTGAAGATTGACCCCTGCCACCAGTTTCTTGCTCATAGCACGACGGAATTCTCAGATTGTGGTCCGACATGCTGCGTCGATGGCTCGATATGACCAGCTCTACTAACTTTTATGTACCATCCAGTTTGCAACCCCTACTACAACCACTCGCAAGACCTTCGTAACTAAATTTATTTTTATCATTCAAGCCTTATGTTTAAGCTACTGCAGGATTGATAAGATGTGGAAATGGTGATAAATCCTACAAGATTCTTTCCAGAGCATCTGACTACTTGATCTGCACTGGGTTTTAAGCTCACCAGATCCCACTTTCAGTCAAGACTGGTTGGGGCTCTAGAAAGGAAGAATTTACATCGATCACGATGCGAGCACAAGGACATATTGTCATTTGTAAATGCACTAAGAAGAAGAAATGAAGCCAAGCCAAACTTTTCCGACTATTCCCCATCAAGTACTTGTTGTTCAATATCTTGGCTAGTTCCATCAAGTACTTGTTGTTCAATATCTTGGCTATCAAAAAGTTGAACTAGCAGCTCCTACCCAACCCAAAGTGGATTGATGGAGAAGGGAAGGGCAGGTAGAGCCTCACCTGACTGAGAGAAAGCGAATACGAATAGCACACCTTGTCTAGTATATGTTAGGCTCCGCTTGAAGAGGTGAGTCCAACGTAGAGAAAGACATTACGTGCTAGCTTAGTTAGCTAGCATTTTTCAACTGGTTGTTTTGTTAGTTGTAGTGGACATAAAATGTTCATCTATGAAAATATTACTTCTCTTACACAAAAGTCCTTTTAAGTTTCACTTATTTAACTTGACGCATATACGGG

Coding sequence (CDS)

ATGGGATTTCTCCATGTTCGCAATTTCGCAGCAAATTCAGGAAGAAACAGAGAGAGGCGGCTCTTTTTACTTCTCAGAGTTCGTCTTCGCACCTGCCAATTTTCTGCAGCATCATCATTCATTGTTGAATCTGGGAAAGGAACCTCAGAGGATTTCTACGCTACCCATTTCTCGTATGTGCACGAGCTACTGAAATTATGTGCCGAAAGAAGATTGCCCATACAAGGGAAGGCTTGCCATGCCCAAATTCTGCTTATGGGGTTGCAGAAAGACACTTCAACGTCAAACATTCTTATCAACATGTACTCGAAATGTGGGTTAGTTGACTTTGCCCGCAAGGTGTTTGATGAAATGCCCAACCGAAATTTGGTATCGTGGAGCACCATGATTGGGTCGCTTACACAGAACGGAGAGGAGAATCAGGCTCTTGGTCTTTTGCTTCAGATGAAAAGAGAAGGAATCCCTTTTAGTGAGTTCACCATTTCGAGTGTTCTTTGTGCCTGTGCGGCAAAATGTGCTCTCTATGAATGCCAGCTGCTCCATGCCTTTGCTATTAAGGCCGCGATGAATCTGAATGTTTTTGTTGCAACTGCATTGCTTGATGTTTATGCAAAATGTGGTTTGATGAACGATGCGGCTCGTGTTTTCGAGTCCATGCCTGAGAGGAGTGCTGTCACGTGGAGTTCGATGGCGGCGGGGTACGTGCAGAATGAGCTATATGAGGAAGCTTTGGCATTGTTTCGCAAAGCTCAGGGAATCGGGTTGAGACAGGACCAGTTTTTTATGTCATCTTTGGTTTGTGCTTGTGCTAGATTGGCAGCCATGATTGAAGGGAACCAGGTGAATGCTTTGCTATCTAAATCTGGTTTTTGTTCAAATATTTTTGTTGCTTCTTCTCTTATTGATATGTATGCAAAATGTGGTGGCATTGAGGAAGCTTATAAAGTGTTTCGAGATGTAGAAGAGAGAAATGTTGTTTTGTGGAATGCTATGATATCTGGCCTATCAAGACACGCTCGTTCGCTTGAGGTGATGATTTTATTCGAGAAAATGCAGCAGATGGGCTTGAGTCCAAATGATGTAACTTTTGTTTCTGTCTTGTCTGCTTGTGGTCATATGGGTTTGGTTGAAAAAGGACAGAAATATTTTGACCTGATGATAAAAGAGCATCATTTGTCACCAAATGTCCTTCACTATTCTTGCATGGTGGACACTCTTAGTCGGGCGGGGCAGACCTTCAAGGCTTATGATTTGATGAGTAATATGGCCTTTAATGCCACTGCTTCTATGTGGGGTTCCCTTTTGGCTTCTTGTAGGACCCATGGGAATCTTGAACTTGCTGAGGTTGCTGCAAGAAATTTGTTTGAGATTGAACCTCACAATGCGGGAAACTATTTGTTGCTGTCGAACATGTATGCAGCACACGGGAAGTGGGACGAAGTGGCAAAGGCAAGGAAGCTCCTTAAAGAAAGTGATGTGAAGAAAGAGAGGGGCAAGAGTTGGATTGAGATCAAGGACAAGGTTCACTCGTTTATGGTTGGAGAGAGGAATCATCCTAAGATTTCTGAAATTTACTCAAAATTGAATGAGTTGATTGAAGAGTTACAGAAACTTGGTTACAAGGCGGAGACCGAGCACGACCTTCATCAAGTGGGAGAGAGTAGAAAACAAGTACTTCTGAGGCACCACAGCGAGAAGCTTGCTCTTACTATGGGGTTACTGTTTTTATCTCCCAATGCACCTATTAGGATTATGAAAAACCTTAGAATATGTGGAGACTGCCACTCTTTTATGAAGCTTGCATCGAGATTTGTTCGGAGGGATGTCATAGTCAGGGACACCAACCGATTTCACCATTTTAAGAATGGGAGTTGTTCTTGCGGGGATTTTTGGTGA

Protein sequence

MGFLHVRNFAANSGRNRERRLFLLLRVRLRTCQFSAASSFIVESGKGTSEDFYATHFSYVHELLKLCAERRLPIQGKACHAQILLMGLQKDTSTSNILINMYSKCGLVDFARKVFDEMPNRNLVSWSTMIGSLTQNGEENQALGLLLQMKREGIPFSEFTISSVLCACAAKCALYECQLLHAFAIKAAMNLNVFVATALLDVYAKCGLMNDAARVFESMPERSAVTWSSMAAGYVQNELYEEALALFRKAQGIGLRQDQFFMSSLVCACARLAAMIEGNQVNALLSKSGFCSNIFVASSLIDMYAKCGGIEEAYKVFRDVEERNVVLWNAMISGLSRHARSLEVMILFEKMQQMGLSPNDVTFVSVLSACGHMGLVEKGQKYFDLMIKEHHLSPNVLHYSCMVDTLSRAGQTFKAYDLMSNMAFNATASMWGSLLASCRTHGNLELAEVAARNLFEIEPHNAGNYLLLSNMYAAHGKWDEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELIEELQKLGYKAETEHDLHQVGESRKQVLLRHHSEKLALTMGLLFLSPNAPIRIMKNLRICGDCHSFMKLASRFVRRDVIVRDTNRFHHFKNGSCSCGDFW
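
The protein sequence is the conceptual translation of the CDS. The sketch below illustrates that step, assuming the standard genetic code (NCBI translation table 1). The helper `translate` is a hypothetical name, and for brevity only the first and last few codons of the CDS are used.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGGATTTCTCCATGTTCGC"   # first 7 codons of the CDS above
cds_end = "TGCGGGGATTTTTGGTGA"        # last 6 codons, including the stop
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MGFLHVR` and `CGDFW`, which match the beginning and end of the protein sequence listed above.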
Homology
BLAST of MC04g1418 vs. ExPASy Swiss-Prot
Match: Q9LZ19 (Pentatricopeptide repeat-containing protein At5g04780, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H16 PE=2 SV=2)

HSP 1 Score: 718.8 bits (1854), Expect = 5.5e-206
Identity = 345/582 (59.28%), Positives = 450/582 (77.32%), Query Frame = 0

Query: 53  YATHFS---YVHELLKLCAERRLPIQGKACHAQILLMGLQKDTSTSNILINMYSKCGLVD 112
           Y+  FS    VHE+L+LCA     ++ KACH +I+ + L+ D +  N+LIN YSKCG V+
Sbjct: 54  YSNEFSNRNLVHEILQLCARNGAVMEAKACHGKIIRIDLEGDVTLLNVLINAYSKCGFVE 113

Query: 113 FARKVFDEMPNRNLVSWSTMIGSLTQNGEENQALGLLLQMKREGIPFSEFTISSVLCACA 172
            AR+VFD M  R+LVSW+TMIG  T+N  E++AL + L+M+ EG  FSEFTISSVL AC 
Sbjct: 114 LARQVFDGMLERSLVSWNTMIGLYTRNRMESEALDIFLEMRNEGFKFSEFTISSVLSACG 173

Query: 173 AKCALYECQLLHAFAIKAAMNLNVFVATALLDVYAKCGLMNDAARVFESMPERSAVTWSS 232
             C   EC+ LH  ++K  ++LN++V TALLD+YAKCG++ DA +VFESM ++S+VTWSS
Sbjct: 174 VNCDALECKKLHCLSVKTCIDLNLYVGTALLDLYAKCGMIKDAVQVFESMQDKSSVTWSS 233

Query: 233 MAAGYVQNELYEEALALFRKAQGIGLRQDQFFMSSLVCACARLAAMIEGNQVNALLSKSG 292
           M AGYVQN+ YEEAL L+R+AQ + L Q+QF +SS++CAC+ LAA+IEG Q++A++ KSG
Sbjct: 234 MVAGYVQNKNYEEALLLYRRAQRMSLEQNQFTLSSVICACSNLAALIEGKQMHAVICKSG 293

Query: 293 FCSNIFVASSLIDMYAKCGGIEEAYKVFRDVEERNVVLWNAMISGLSRHARSLEVMILFE 352
           F SN+FVASS +DMYAKCG + E+Y +F +V+E+N+ LWN +ISG ++HAR  EVMILFE
Sbjct: 294 FGSNVFVASSAVDMYAKCGSLRESYIIFSEVQEKNLELWNTIISGFAKHARPKEVMILFE 353

Query: 353 KMQQMGLSPNDVTFVSVLSACGHMGLVEKGQKYFDLMIKEHHLSPNVLHYSCMVDTLSRA 412
           KMQQ G+ PN+VTF S+LS CGH GLVE+G+++F LM   + LSPNV+HYSCMVD L RA
Sbjct: 354 KMQQDGMHPNEVTFSSLLSVCGHTGLVEEGRRFFKLMRTTYGLSPNVVHYSCMVDILGRA 413

Query: 413 GQTFKAYDLMSNMAFNATASMWGSLLASCRTHGNLELAEVAARNLFEIEPHNAGNYLLLS 472
           G   +AY+L+ ++ F+ TAS+WGSLLASCR + NLELAEVAA  LFE+EP NAGN++LLS
Sbjct: 414 GLLSEAYELIKSIPFDPTASIWGSLLASCRVYKNLELAEVAAEKLFELEPENAGNHVLLS 473

Query: 473 NMYAAHGKWDEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLN 532
           N+YAA+ +W+E+AK+RKLL++ DVKK RGKSWI+IKDKVH+F VGE  HP+I EI S L+
Sbjct: 474 NIYAANKQWEEIAKSRKLLRDCDVKKVRGKSWIDIKDKVHTFSVGESGHPRIREICSTLD 533

Query: 533 ELIEELQKLGYKAETEHDLHQVGESRKQVLLRHHSEKLALTMGLLFLSPNAPIRIMKNLR 592
            L+ + +K GYK   EH+LH V   +K+ LL  HSEKLAL  GL+ L  ++P+RIMKNLR
Sbjct: 534 NLVIKFRKFGYKPSVEHELHDVEIGKKEELLMQHSEKLALVFGLMCLPESSPVRIMKNLR 593

Query: 593 ICGDCHSFMKLASRFVRRDVIVRDTNRFHHFKNGSCSCGDFW 632
           IC DCH FMK AS   RR +IVRD NRFHHF +G CSCGDFW
Sbjct: 594 ICVDCHEFMKAASMATRRFIIVRDVNRFHHFSDGHCSCGDFW 635
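
The percentages on each HSP summary line are the identity and positive counts divided by the alignment length. As a minimal sketch, the summary line could be parsed and the percentages recomputed as below. The names `HSP_STATS` and `parse_hsp_stats` are hypothetical, and the pattern deliberately accepts the "Postives" spelling that some BLAST report renderers emit.

```python
import re

# Matches e.g. "Identity = 345/582 (59.28%), Positives = 450/582 (77.32%)"
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Extract counts from an HSP summary line and recompute the percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }

stats = parse_hsp_stats(
    "Identity = 345/582 (59.28%), Positives = 450/582 (77.32%), Query Frame = 0"
)
print(stats)
```

Comparing the recomputed and reported percentages is a cheap sanity check when scraping many hits from pages like this one.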

BLAST of MC04g1418 vs. ExPASy Swiss-Prot
Match: Q9LIQ7 (Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H87 PE=3 SV=1)

HSP 1 Score: 470.7 bits (1210), Expect = 2.6e-131
Identity = 237/573 (41.36%), Positives = 368/573 (64.22%), Query Frame = 0

Query: 59  YVHELLKLCAERRLPIQGKACHAQILLMGLQKDTSTSNILINMYSKCGLVDFARKVFDEM 118
           + + LLK C   +L IQG+  HA IL    + D    N L+NMY+KCG ++ ARKVF++M
Sbjct: 62  FYNTLLKKCTVFKLLIQGRIVHAHILQSIFRHDIVMGNTLLNMYAKCGSLEEARKVFEKM 121

Query: 119 PNRNLVSWSTMIGSLTQNGEENQALGLLLQMKREGIPFSEFTISSVLCACAAKCALYECQ 178
           P R+ V+W+T+I   +Q+     AL    QM R G   +EFT+SSV+ A AA+       
Sbjct: 122 PQRDFVTWTTLISGYSQHDRPCDALLFFNQMLRFGYSPNEFTLSSVIKAAAAERRGCCGH 181

Query: 179 LLHAFAIKAAMNLNVFVATALLDVYAKCGLMNDAARVFESMPERSAVTWSSMAAGYVQNE 238
            LH F +K   + NV V +ALLD+Y + GLM+DA  VF+++  R+ V+W+++ AG+ +  
Sbjct: 182 QLHGFCVKCGFDSNVHVGSALLDLYTRYGLMDDAQLVFDALESRNDVSWNALIAGHARRS 241

Query: 239 LYEEALALFRKAQGIGLRQDQFFMSSLVCACARLAAMIEGNQVNALLSKSGFCSNIFVAS 298
             E+AL LF+     G R   F  +SL  AC+    + +G  V+A + KSG     F  +
Sbjct: 242 GTEKALELFQGMLRDGFRPSHFSYASLFGACSSTGFLEQGKWVHAYMIKSGEKLVAFAGN 301

Query: 299 SLIDMYAKCGGIEEAYKVFRDVEERNVVLWNAMISGLSRHARSLEVMILFEKMQQMGLSP 358
           +L+DMYAK G I +A K+F  + +R+VV WN++++  ++H    E +  FE+M+++G+ P
Sbjct: 302 TLLDMYAKSGSIHDARKIFDRLAKRDVVSWNSLLTAYAQHGFGKEAVWWFEEMRRVGIRP 361

Query: 359 NDVTFVSVLSACGHMGLVEKGQKYFDLMIKEHHLSPNVLHYSCMVDTLSRAGQTFKAYDL 418
           N+++F+SVL+AC H GL+++G  Y++LM K+  + P   HY  +VD L RAG   +A   
Sbjct: 362 NEISFLSVLTACSHSGLLDEGWHYYELM-KKDGIVPEAWHYVTVVDLLGRAGDLNRALRF 421

Query: 419 MSNMAFNATASMWGSLLASCRTHGNLELAEVAARNLFEIEPHNAGNYLLLSNMYAAHGKW 478
           +  M    TA++W +LL +CR H N EL   AA ++FE++P + G +++L N+YA+ G+W
Sbjct: 422 IEEMPIEPTAAIWKALLNACRMHKNTELGAYAAEHVFELDPDDPGPHVILYNIYASGGRW 481

Query: 479 DEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELIEELQKL 538
           ++ A+ RK +KES VKKE   SW+EI++ +H F+  +  HP+  EI  K  E++ ++++L
Sbjct: 482 NDAARVRKKMKESGVKKEPACSWVEIENAIHMFVANDERHPQREEIARKWEEVLAKIKEL 541

Query: 539 GYKAETEHDLHQVGESRKQVLLRHHSEKLALTMGLLFLSPNAPIRIMKNLRICGDCHSFM 598
           GY  +T H +  V +  ++V L++HSEK+AL   LL   P + I I KN+R+CGDCH+ +
Sbjct: 542 GYVPDTSHVIVHVDQQEREVNLQYHSEKIALAFALLNTPPGSTIHIKKNIRVCGDCHTAI 601

Query: 599 KLASRFVRRDVIVRDTNRFHHFKNGSCSCGDFW 632
           KLAS+ V R++IVRDTNRFHHFK+G+CSC D+W
Sbjct: 602 KLASKVVGREIIVRDTNRFHHFKDGNCSCKDYW 633

BLAST of MC04g1418 vs. ExPASy Swiss-Prot
Match: Q9LTF4 (Putative pentatricopeptide repeat-containing protein At5g52630 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H52 PE=3 SV=1)

HSP 1 Score: 463.4 bits (1191), Expect = 4.1e-129
Identity = 238/576 (41.32%), Positives = 361/576 (62.67%), Query Frame = 0

Query: 56  HFSYVHELLKLCAERRLPIQGKACHAQILLMGLQKDTSTSNILINMYSKCGLVDFARKVF 115
           +++ + +LL   A  R  I+G   H  ++  GL      +N LIN YSK  L   +R+ F
Sbjct: 14  NYNQICDLLLSSARTRSTIKGLQLHGYVVKSGLSLIPLVANNLINFYSKSQLPFDSRRAF 73

Query: 116 DEMPNRNLVSWSTMIGSLTQNGEENQALGLLLQMKREGIPFSEFTISSVLCACAAKCALY 175
           ++ P ++  +WS++I    QN     +L  L +M    +   +  + S   +CA      
Sbjct: 74  EDSPQKSSTTWSSIISCFAQNELPWMSLEFLKKMMAGNLRPDDHVLPSATKSCAILSRCD 133

Query: 176 ECQLLHAFAIKAAMNLNVFVATALLDVYAKCGLMNDAARVFESMPERSAVTWSSMAAGYV 235
             + +H  ++K   + +VFV ++L+D+YAKCG +  A ++F+ MP+R+ VTWS M  GY 
Sbjct: 134 IGRSVHCLSMKTGYDADVFVGSSLVDMYAKCGEIVYARKMFDEMPQRNVVTWSGMMYGYA 193

Query: 236 QNELYEEALALFRKAQGIGLRQDQFFMSSLVCACARLAAMIEGNQVNALLSKSGFCSNIF 295
           Q    EEAL LF++A    L  + +  SS++  CA    +  G Q++ L  KS F S+ F
Sbjct: 194 QMGENEEALWLFKEALFENLAVNDYSFSSVISVCANSTLLELGRQIHGLSIKSSFDSSSF 253

Query: 296 VASSLIDMYAKCGGIEEAYKVFRDVEERNVVLWNAMISGLSRHARSLEVMILFEKMQQMG 355
           V SSL+ +Y+KCG  E AY+VF +V  +N+ +WNAM+   ++H+ + +V+ LF++M+  G
Sbjct: 254 VGSSLVSLYSKCGVPEGAYQVFNEVPVKNLGIWNAMLKAYAQHSHTQKVIELFKRMKLSG 313

Query: 356 LSPNDVTFVSVLSACGHMGLVEKGQKYFDLMIKEHHLSPNVLHYSCMVDTLSRAGQTFKA 415
           + PN +TF++VL+AC H GLV++G+ YFD M KE  + P   HY+ +VD L RAG+  +A
Sbjct: 314 MKPNFITFLNVLNACSHAGLVDEGRYYFDQM-KESRIEPTDKHYASLVDMLGRAGRLQEA 373

Query: 416 YDLMSNMAFNATASMWGSLLASCRTHGNLELAEVAARNLFEIEPHNAGNYLLLSNMYAAH 475
            ++++NM  + T S+WG+LL SC  H N ELA  AA  +FE+ P ++G ++ LSN YAA 
Sbjct: 374 LEVITNMPIDPTESVWGALLTSCTVHKNTELAAFAADKVFELGPVSSGMHISLSNAYAAD 433

Query: 476 GKWDEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELIEEL 535
           G++++ AKARKLL++   KKE G SW+E ++KVH+F  GER H K  EIY KL EL EE+
Sbjct: 434 GRFEDAAKARKLLRDRGEKKETGLSWVEERNKVHTFAAGERRHEKSKEIYEKLAELGEEM 493

Query: 536 QKLGYKAETEHDLHQVGESRKQVLLRHHSEKLALTMGLLFLSPNAPIRIMKNLRICGDCH 595
           +K GY A+T + L +V    K   +R+HSE+LA+  GL+    + PIR+MKNLR+CGDCH
Sbjct: 494 EKAGYIADTSYVLREVDGDEKNQTIRYHSERLAIAFGLITFPADRPIRVMKNLRVCGDCH 553

Query: 596 SFMKLASRFVRRDVIVRDTNRFHHFKNGSCSCGDFW 632
           + +K  S   RR +IVRD NRFH F++G CSC D+W
Sbjct: 554 NAIKFMSVCTRRVIIVRDNNRFHRFEDGKCSCNDYW 588

BLAST of MC04g1418 vs. ExPASy Swiss-Prot
Match: Q9SI53 (Pentatricopeptide repeat-containing protein At2g03880, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H44 PE=2 SV=1)

HSP 1 Score: 449.5 bits (1155), Expect = 6.2e-125
Identity = 232/570 (40.70%), Positives = 349/570 (61.23%), Query Frame = 0

Query: 62  ELLKLCAERRLPIQGKACHAQILLMGLQKDTSTSNILINMYSKCGLVDFARKVFDEMPNR 121
           EL+K C   R   +G      +   G +      N+LINMY K  L++ A ++FD+MP R
Sbjct: 66  ELIKCCISNRAVHEGNLICRHLYFNGHRPMMFLVNVLINMYVKFNLLNDAHQLFDQMPQR 125

Query: 122 NLVSWSTMIGSLTQNGEENQALGLLLQMKREGIPFSEFTISSVLCACAAKCALYECQLLH 181
           N++SW+TMI + ++     +AL LL+ M R+ +  + +T SSVL +C     + + ++LH
Sbjct: 126 NVISWTTMISAYSKCKIHQKALELLVLMLRDNVRPNVYTYSSVLRSCN---GMSDVRMLH 185

Query: 182 AFAIKAAMNLNVFVATALLDVYAKCGLMNDAARVFESMPERSAVTWSSMAAGYVQNELYE 241
              IK  +  +VFV +AL+DV+AK G   DA  VF+ M    A+ W+S+  G+ QN   +
Sbjct: 186 CGIIKEGLESDVFVRSALIDVFAKLGEPEDALSVFDEMVTGDAIVWNSIIGGFAQNSRSD 245

Query: 242 EALALFRKAQGIGLRQDQFFMSSLVCACARLAAMIEGNQVNALLSKSGFCSNIFVASSLI 301
            AL LF++ +  G   +Q  ++S++ AC  LA +  G Q +  + K  +  ++ + ++L+
Sbjct: 246 VALELFKRMKRAGFIAEQATLTSVLRACTGLALLELGMQAHVHIVK--YDQDLILNNALV 305

Query: 302 DMYAKCGGIEEAYKVFRDVEERNVVLWNAMISGLSRHARSLEVMILFEKMQQMGLSPNDV 361
           DMY KCG +E+A +VF  ++ER+V+ W+ MISGL+++  S E + LFE+M+  G  PN +
Sbjct: 306 DMYCKCGSLEDALRVFNQMKERDVITWSTMISGLAQNGYSQEALKLFERMKSSGTKPNYI 365

Query: 362 TFVSVLSACGHMGLVEKGQKYFDLMIKEHHLSPNVLHYSCMVDTLSRAGQTFKAYDLMSN 421
           T V VL AC H GL+E G  YF  M K + + P   HY CM+D L +AG+   A  L++ 
Sbjct: 366 TIVGVLFACSHAGLLEDGWYYFRSMKKLYGIDPVREHYGCMIDLLGKAGKLDDAVKLLNE 425

Query: 422 MAFNATASMWGSLLASCRTHGNLELAEVAARNLFEIEPHNAGNYLLLSNMYAAHGKWDEV 481
           M     A  W +LL +CR   N+ LAE AA+ +  ++P +AG Y LLSN+YA   KWD V
Sbjct: 426 MECEPDAVTWRTLLGACRVQRNMVLAEYAAKKVIALDPEDAGTYTLLSNIYANSQKWDSV 485

Query: 482 AKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELIEELQKLGYK 541
            + R  +++  +KKE G SWIE+  ++H+F++G+ +HP+I E+  KLN+LI  L  +GY 
Sbjct: 486 EEIRTRMRDRGIKKEPGCSWIEVNKQIHAFIIGDNSHPQIVEVSKKLNQLIHRLTGIGYV 545

Query: 542 AETEHDLHQVGESRKQVLLRHHSEKLALTMGLLFLSPNAPIRIMKNLRICGDCHSFMKLA 601
            ET   L  +   + +  LRHHSEKLAL  GL+ L     IRI KNLRICGDCH F KLA
Sbjct: 546 PETNFVLQDLEGEQMEDSLRHHSEKLALAFGLMTLPIEKVIRIRKNLRICGDCHVFCKLA 605

Query: 602 SRFVRRDVIVRDTNRFHHFKNGSCSCGDFW 632
           S+   R +++RD  R+HHF++G CSCGD+W
Sbjct: 606 SKLEIRSIVIRDPIRYHHFQDGKCSCGDYW 630

BLAST of MC04g1418 vs. ExPASy Swiss-Prot
Match: Q9ZUW3 (Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H60 PE=2 SV=1)

HSP 1 Score: 444.9 bits (1143), Expect = 1.5e-123
Identity = 233/590 (39.49%), Positives = 363/590 (61.53%), Query Frame = 0

Query: 52  FYATHFSYVH-------ELLKLCAERRLPIQGKACHAQILLMGLQKDTSTSNILINMYSK 111
           FY+   +YV         ++KLCA  +     +  H  ++  G   D +    L+  YSK
Sbjct: 283 FYSMRLNYVRLSESSFASVIKLCANLKELRFTEQLHCSVVKYGFLFDQNIRTALMVAYSK 342

Query: 112 CGLVDFARKVFDEMP-NRNLVSWSTMIGSLTQNGEENQALGLLLQMKREGIPFSEFTISS 171
           C  +  A ++F E+    N+VSW+ MI    QN  + +A+ L  +MKR+G+  +EFT S 
Sbjct: 343 CTAMLDALRLFKEIGCVGNVVSWTAMISGFLQNDGKEEAVDLFSEMKRKGVRPNEFTYSV 402

Query: 172 VLCACAAKCALYECQLLHAFAIKAAMNLNVFVATALLDVYAKCGLMNDAARVFESMPERS 231
           +L A      +     +HA  +K     +  V TALLD Y K G + +AA+VF  + ++ 
Sbjct: 403 ILTA----LPVISPSEVHAQVVKTNYERSSTVGTALLDAYVKLGKVEEAAKVFSGIDDKD 462

Query: 232 AVTWSSMAAGYVQNELYEEALALFRKAQGIGLRQDQFFMSSLVCACARL-AAMIEGNQVN 291
            V WS+M AGY Q    E A+ +F +    G++ ++F  SS++  CA   A+M +G Q +
Sbjct: 463 IVAWSAMLAGYAQTGETEAAIKMFGELTKGGIKPNEFTFSSILNVCAATNASMGQGKQFH 522

Query: 292 ALLSKSGFCSNIFVASSLIDMYAKCGGIEEAYKVFRDVEERNVVLWNAMISGLSRHARSL 351
               KS   S++ V+S+L+ MYAK G IE A +VF+   E+++V WN+MISG ++H +++
Sbjct: 523 GFAIKSRLDSSLCVSSALLTMYAKKGNIESAEEVFKRQREKDLVSWNSMISGYAQHGQAM 582

Query: 352 EVMILFEKMQQMGLSPNDVTFVSVLSACGHMGLVEKGQKYFDLMIKEHHLSPNVLHYSCM 411
           + + +F++M++  +  + VTF+ V +AC H GLVE+G+KYFD+M+++  ++P   H SCM
Sbjct: 583 KALDVFKEMKKRKVKMDGVTFIGVFAACTHAGLVEEGEKYFDIMVRDCKIAPTKEHNSCM 642

Query: 412 VDTLSRAGQTFKAYDLMSNMAFNATASMWGSLLASCRTHGNLELAEVAARNLFEIEPHNA 471
           VD  SRAGQ  KA  ++ NM   A +++W ++LA+CR H   EL  +AA  +  ++P ++
Sbjct: 643 VDLYSRAGQLEKAMKVIENMPNPAGSTIWRTILAACRVHKKTELGRLAAEKIIAMKPEDS 702

Query: 472 GNYLLLSNMYAAHGKWDEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKIS 531
             Y+LLSNMYA  G W E AK RKL+ E +VKKE G SWIE+K+K +SF+ G+R+HP   
Sbjct: 703 AAYVLLSNMYAESGDWQERAKVRKLMNERNVKKEPGYSWIEVKNKTYSFLAGDRSHPLKD 762

Query: 532 EIYSKLNELIEELQKLGYKAETEHDLHQVGESRKQVLLRHHSEKLALTMGLLFLSPNAPI 591
           +IY KL +L   L+ LGY+ +T + L  + +  K+ +L  HSE+LA+  GL+     +P+
Sbjct: 763 QIYMKLEDLSTRLKDLGYEPDTSYVLQDIDDEHKEAVLAQHSERLAIAFGLIATPKGSPL 822

Query: 592 RIMKNLRICGDCHSFMKLASRFVRRDVIVRDTNRFHHF-KNGSCSCGDFW 632
            I+KNLR+CGDCH  +KL ++   R+++VRD+NRFHHF  +G CSCGDFW
Sbjct: 823 LIIKNLRVCGDCHLVIKLIAKIEEREIVVRDSNRFHHFSSDGVCSCGDFW 868

BLAST of MC04g1418 vs. NCBI nr
Match: XP_022134327.1 (pentatricopeptide repeat-containing protein At5g04780 [Momordica charantia])

HSP 1 Score: 1251 bits (3236), Expect = 0.0
Identity = 630/631 (99.84%), Positives = 631/631 (100.00%), Query Frame = 0

Query: 1   MGFLHVRNFAANSGRNRERRLFLLLRVRLRTCQFSAASSFIVESGKGTSEDFYATHFSYV 60
           MGFLHVRNFAANSGRNRERRLFLLLRVRLRTCQFSAASSFIVESGKGTSEDFYATHFSYV
Sbjct: 1   MGFLHVRNFAANSGRNRERRLFLLLRVRLRTCQFSAASSFIVESGKGTSEDFYATHFSYV 60

Query: 61  HELLKLCAERRLPIQGKACHAQILLMGLQKDTSTSNILINMYSKCGLVDFARKVFDEMPN 120
           HELLKLCAERRLPIQGKACHAQILLMGLQKDTSTSNILINMYSKCGLVDFARKVFDEMPN
Sbjct: 61  HELLKLCAERRLPIQGKACHAQILLMGLQKDTSTSNILINMYSKCGLVDFARKVFDEMPN 120

Query: 121 RNLVSWSTMIGSLTQNGEENQALGLLLQMKREGIPFSEFTISSVLCACAAKCALYECQLL 180
           RNLVSWSTMIGSLTQNGEENQALGLL+QMKREGIPFSEFTISSVLCACAAKCALYECQLL
Sbjct: 121 RNLVSWSTMIGSLTQNGEENQALGLLVQMKREGIPFSEFTISSVLCACAAKCALYECQLL 180

Query: 181 HAFAIKAAMNLNVFVATALLDVYAKCGLMNDAARVFESMPERSAVTWSSMAAGYVQNELY 240
           HAFAIKAAMNLNVFVATALLDVYAKCGLMNDAARVFESMPERSAVTWSSMAAGYVQNELY
Sbjct: 181 HAFAIKAAMNLNVFVATALLDVYAKCGLMNDAARVFESMPERSAVTWSSMAAGYVQNELY 240

Query: 241 EEALALFRKAQGIGLRQDQFFMSSLVCACARLAAMIEGNQVNALLSKSGFCSNIFVASSL 300
           EEALALFRKAQGIGLRQDQFFMSSLVCACARLAAMIEGNQVNALLSKSGFCSNIFVASSL
Sbjct: 241 EEALALFRKAQGIGLRQDQFFMSSLVCACARLAAMIEGNQVNALLSKSGFCSNIFVASSL 300

Query: 301 IDMYAKCGGIEEAYKVFRDVEERNVVLWNAMISGLSRHARSLEVMILFEKMQQMGLSPND 360
           IDMYAKCGGIEEAYKVFRDVEERNVVLWNAMISGLSRHARSLEVMILFEKMQQMGLSPND
Sbjct: 301 IDMYAKCGGIEEAYKVFRDVEERNVVLWNAMISGLSRHARSLEVMILFEKMQQMGLSPND 360

Query: 361 VTFVSVLSACGHMGLVEKGQKYFDLMIKEHHLSPNVLHYSCMVDTLSRAGQTFKAYDLMS 420
           VTFVSVLSACGHMGLVEKGQKYFDLMIKEHHLSPNVLHYSCMVDTLSRAGQTFKAYDLMS
Sbjct: 361 VTFVSVLSACGHMGLVEKGQKYFDLMIKEHHLSPNVLHYSCMVDTLSRAGQTFKAYDLMS 420

Query: 421 NMAFNATASMWGSLLASCRTHGNLELAEVAARNLFEIEPHNAGNYLLLSNMYAAHGKWDE 480
           NMAFNATASMWGSLLASCRTHGNLELAEVAARNLFEIEPHNAGNYLLLSNMYAAHGKWDE
Sbjct: 421 NMAFNATASMWGSLLASCRTHGNLELAEVAARNLFEIEPHNAGNYLLLSNMYAAHGKWDE 480

Query: 481 VAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELIEELQKLGY 540
           VAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELIEELQKLGY
Sbjct: 481 VAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELIEELQKLGY 540

Query: 541 KAETEHDLHQVGESRKQVLLRHHSEKLALTMGLLFLSPNAPIRIMKNLRICGDCHSFMKL 600
           KAETEHDLHQVGESRKQVLLRHHSEKLALTMGLLFLSPNAPIRIMKNLRICGDCHSFMKL
Sbjct: 541 KAETEHDLHQVGESRKQVLLRHHSEKLALTMGLLFLSPNAPIRIMKNLRICGDCHSFMKL 600

Query: 601 ASRFVRRDVIVRDTNRFHHFKNGSCSCGDFW 631
           ASRFVRRDVIVRDTNRFHHFKNGSCSCGDFW
Sbjct: 601 ASRFVRRDVIVRDTNRFHHFKNGSCSCGDFW 631

BLAST of MC04g1418 vs. NCBI nr
Match: KAG7016352.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1064 bits (2751), Expect = 0.0
Identity = 536/631 (84.94%), Positives = 578/631 (91.60%), Query Frame = 0

Query: 1   MGFLHVRNFAANSGRNRERRLFLLLRVRLRTCQFSAASSFIVESGKGTSEDFYATHFSYV 60
           MGFLHV +FA NSG+     +F  L +R+   +F A+SS I+E  KGT+EDF +TH SYV
Sbjct: 1   MGFLHVCHFATNSGK-----IFQFLSLRVCISRFFASSSCIIECEKGTTEDFCSTHVSYV 60

Query: 61  HELLKLCAERRLPIQGKACHAQILLMGLQKDTSTSNILINMYSKCGLVDFARKVFDEMPN 120
            ELLKLCA+RRL +QGKACHA+ILLMGLQ +TSTSNILINMYSKCG VD AR+VFDEMP+
Sbjct: 61  LELLKLCAKRRLFLQGKACHARILLMGLQAETSTSNILINMYSKCGSVDVARQVFDEMPS 120

Query: 121 RNLVSWSTMIGSLTQNGEENQALGLLLQMKREGIPFSEFTISSVLCACAAKCALYECQLL 180
           R+LVSWSTMIGSLTQNGEEN+ALGLLLQM+REG PFSEFTISSVLCACAAKCAL ECQLL
Sbjct: 121 RSLVSWSTMIGSLTQNGEENEALGLLLQMQREGTPFSEFTISSVLCACAAKCALSECQLL 180

Query: 181 HAFAIKAAMNLNVFVATALLDVYAKCGLMNDAARVFESMPERSAVTWSSMAAGYVQNELY 240
           HAFA+KAAMNLNVFVATALLDVYAKCGLMNDAA VFESM ERS VTWSSMAAGYVQN +Y
Sbjct: 181 HAFAVKAAMNLNVFVATALLDVYAKCGLMNDAANVFESMSERSVVTWSSMAAGYVQNAMY 240

Query: 241 EEALALFRKAQGIGLRQDQFFMSSLVCACARLAAMIEGNQVNALLSKSGFCSNIFVASSL 300
           EEALALFRKA   GL+ DQF MSS++CACA LAAMIEGNQVNALLSKSGFCSNIFVASSL
Sbjct: 241 EEALALFRKAWETGLKHDQFLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSL 300

Query: 301 IDMYAKCGGIEEAYKVFRDVEERNVVLWNAMISGLSRHARSLEVMILFEKMQQMGLSPND 360
           IDMYAKCGGIEEAYKVFRDVE RNVVLWNAMISGLSRHARSLEVMILFEKMQQ+GL+PND
Sbjct: 301 IDMYAKCGGIEEAYKVFRDVEYRNVVLWNAMISGLSRHARSLEVMILFEKMQQIGLNPND 360

Query: 361 VTFVSVLSACGHMGLVEKGQKYFDLMIKEHHLSPNVLHYSCMVDTLSRAGQTFKAYDLMS 420
           VTFVSVLSACGHMGLVEKGQKYFDLMIKE+HL+PNV HYSCMVD LSRAG+T  AYDL+ 
Sbjct: 361 VTFVSVLSACGHMGLVEKGQKYFDLMIKEYHLTPNVFHYSCMVDALSRAGRTSDAYDLIC 420

Query: 421 NMAFNATASMWGSLLASCRTHGNLELAEVAARNLFEIEPHNAGNYLLLSNMYAAHGKWDE 480
            M F A+ASMWGSLLASCRTHGNLELAEVAA+NLF+IEPHNAGNYLLLSNMYAA+GKWDE
Sbjct: 421 KMPFRASASMWGSLLASCRTHGNLELAEVAAKNLFDIEPHNAGNYLLLSNMYAANGKWDE 480

Query: 481 VAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELIEELQKLGY 540
           VAKARKLLKESDVKKERGKSWIEIK++VHSFMVGERNHPKI+EIYSKLNELIEELQKLGY
Sbjct: 481 VAKARKLLKESDVKKERGKSWIEIKNEVHSFMVGERNHPKIAEIYSKLNELIEELQKLGY 540

Query: 541 KAETEHDLHQVGESRKQVLLRHHSEKLALTMGLLFLSPNAPIRIMKNLRICGDCHSFMKL 600
           + ET+HDLHQV ESRKQ LLRHHSEKLA TMGLLFL PNAP+RIMKNLRICGDCHSFMKL
Sbjct: 541 QVETQHDLHQVEESRKQELLRHHSEKLAFTMGLLFLPPNAPLRIMKNLRICGDCHSFMKL 600

Query: 601 ASRFVRRDVIVRDTNRFHHFKNGSCSCGDFW 631
           AS+ VRRDV+VRDTNRFHHF NG CSCGDFW
Sbjct: 601 ASKLVRRDVVVRDTNRFHHFTNGHCSCGDFW 626

BLAST of MC04g1418 vs. NCBI nr
Match: XP_023550933.1 (pentatricopeptide repeat-containing protein At5g04780, mitochondrial isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1060 bits (2740), Expect = 0.0
Identity = 534/631 (84.63%), Positives = 576/631 (91.28%), Query Frame = 0

Query: 1   MGFLHVRNFAANSGRNRERRLFLLLRVRLRTCQFSAASSFIVESGKGTSEDFYATHFSYV 60
           MGFLHV +FA NSG+     +F  L +R+   +F A+SS I+E  KGT+EDF  TH SYV
Sbjct: 1   MGFLHVCHFATNSGK-----IFQFLSLRVCINRFFASSSCIIECEKGTTEDFCRTHVSYV 60

Query: 61  HELLKLCAERRLPIQGKACHAQILLMGLQKDTSTSNILINMYSKCGLVDFARKVFDEMPN 120
            ELLKLCA+RRL +QGKACHA+ILLMGLQ +T TSNILINMYSKCG VD AR+VFDEMP+
Sbjct: 61  LELLKLCAKRRLFLQGKACHARILLMGLQAETLTSNILINMYSKCGSVDVARQVFDEMPS 120

Query: 121 RNLVSWSTMIGSLTQNGEENQALGLLLQMKREGIPFSEFTISSVLCACAAKCALYECQLL 180
           R+LVSWSTMIGSLTQNGEEN+ALGLLLQM+REG PFSEFTISSVLCACAAKCAL ECQLL
Sbjct: 121 RSLVSWSTMIGSLTQNGEENEALGLLLQMQREGTPFSEFTISSVLCACAAKCALSECQLL 180

Query: 181 HAFAIKAAMNLNVFVATALLDVYAKCGLMNDAARVFESMPERSAVTWSSMAAGYVQNELY 240
           HAFA+KAAMNLNVFVATALLDVYAKCGLMNDAA VFESM ERS VTWSSMAAGYVQN +Y
Sbjct: 181 HAFAVKAAMNLNVFVATALLDVYAKCGLMNDAANVFESMSERSVVTWSSMAAGYVQNAMY 240

Query: 241 EEALALFRKAQGIGLRQDQFFMSSLVCACARLAAMIEGNQVNALLSKSGFCSNIFVASSL 300
           EEALALFRKA   GL+ DQF MSS++CACA LAAMIEGNQVNALLSKSGFCSNIFVASSL
Sbjct: 241 EEALALFRKAWETGLKHDQFLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSL 300

Query: 301 IDMYAKCGGIEEAYKVFRDVEERNVVLWNAMISGLSRHARSLEVMILFEKMQQMGLSPND 360
           IDMYAKCGGIEEAYKVFRDVE+RNVVLWNAMISGLSRHARSLEVMILFEKMQQ+GL+PND
Sbjct: 301 IDMYAKCGGIEEAYKVFRDVEDRNVVLWNAMISGLSRHARSLEVMILFEKMQQIGLNPND 360

Query: 361 VTFVSVLSACGHMGLVEKGQKYFDLMIKEHHLSPNVLHYSCMVDTLSRAGQTFKAYDLMS 420
           VTFVSVLSACGHMGLVEKGQKYFDLMIKE+HL+PNV HYSCMVD LSRAG+T  AYDL+ 
Sbjct: 361 VTFVSVLSACGHMGLVEKGQKYFDLMIKEYHLAPNVYHYSCMVDALSRAGRTSDAYDLIC 420

Query: 421 NMAFNATASMWGSLLASCRTHGNLELAEVAARNLFEIEPHNAGNYLLLSNMYAAHGKWDE 480
            M F A+ASMWGSLLASCRTHGNLELAEVAA+NLF+IEPHNAGNYLLLSNMYAA+GKWDE
Sbjct: 421 KMPFRASASMWGSLLASCRTHGNLELAEVAAKNLFDIEPHNAGNYLLLSNMYAANGKWDE 480

Query: 481 VAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELIEELQKLGY 540
           VAKARKLLKESDVKKERGKSWIEIK++VHSFMVGERNHPKI+EIYSKLNELIEELQKLGY
Sbjct: 481 VAKARKLLKESDVKKERGKSWIEIKNEVHSFMVGERNHPKIAEIYSKLNELIEELQKLGY 540

Query: 541 KAETEHDLHQVGESRKQVLLRHHSEKLALTMGLLFLSPNAPIRIMKNLRICGDCHSFMKL 600
           + ET+HDLHQV ESRKQ LLRHHSEKLA TMGLLFL PNAP+RIMKNLRICGDCHSFMKL
Sbjct: 541 QVETQHDLHQVEESRKQELLRHHSEKLAFTMGLLFLPPNAPLRIMKNLRICGDCHSFMKL 600

Query: 601 ASRFVRRDVIVRDTNRFHHFKNGSCSCGDFW 631
            S+ VRRDV+VRDTNRFHHF NG CSCGDFW
Sbjct: 601 VSKLVRRDVVVRDTNRFHHFTNGHCSCGDFW 626

BLAST of MC04g1418 vs. NCBI nr
Match: XP_004140992.1 (pentatricopeptide repeat-containing protein At5g04780, mitochondrial [Cucumis sativus] >KAE8646637.1 hypothetical protein Csa_005652 [Cucumis sativus])

HSP 1 Score: 1059 bits (2738), Expect = 0.0
Identity = 535/638 (83.86%), Positives = 580/638 (90.91%), Query Frame = 0

Query: 1   MGFLHVRNFAANSGRNRER----RLFLLLRVRLRTCQFSAA---SSFIVESGKGTSEDFY 60
           MGFLHV +FA+NSGR RE+    R+F  L +R+ T QF A+   SS IVE  K T++DF 
Sbjct: 1   MGFLHVCHFASNSGRYREKGKGKRIFQFLSLRVCTTQFFASLSSSSCIVECEKPTTKDFN 60

Query: 61  ATHFSYVHELLKLCAERRLPIQGKACHAQILLMGLQKDTSTSNILINMYSKCGLVDFARK 120
           ATH S+VHE+LKLCA+R+L +QGKACHAQILLMGL+ D  TSNILINMYSKCG VDFAR+
Sbjct: 61  ATHVSFVHEILKLCAKRKLLLQGKACHAQILLMGLKTDLLTSNILINMYSKCGSVDFARQ 120

Query: 121 VFDEMPNRNLVSWSTMIGSLTQNGEENQALGLLLQMKREGIPFSEFTISSVLCACAAKCA 180
           VFDEMP+R+LVSW+TMIGSLTQNGEEN+AL LLLQM+REG PFSEFTISSVLCACAAKCA
Sbjct: 121 VFDEMPSRSLVSWNTMIGSLTQNGEENEALDLLLQMQREGTPFSEFTISSVLCACAAKCA 180

Query: 181 LYECQLLHAFAIKAAMNLNVFVATALLDVYAKCGLMNDAARVFESMPERSAVTWSSMAAG 240
           L ECQLLHAFAIKAAM+LNVFVATALLDVYAKCGLM DA  VFESMP+RS VTWSSMAAG
Sbjct: 181 LSECQLLHAFAIKAAMDLNVFVATALLDVYAKCGLMKDAVCVFESMPDRSVVTWSSMAAG 240

Query: 241 YVQNELYEEALALFRKAQGIGLRQDQFFMSSLVCACARLAAMIEGNQVNALLSKSGFCSN 300
           YVQNE+YE+ALALFRKA   GL+ DQF MSS++CACA LAAMIEG Q+NALLSKSGFCSN
Sbjct: 241 YVQNEMYEQALALFRKAWETGLKHDQFLMSSVICACAGLAAMIEGKQMNALLSKSGFCSN 300

Query: 301 IFVASSLIDMYAKCGGIEEAYKVFRDVEERNVVLWNAMISGLSRHARSLEVMILFEKMQQ 360
           IFVASSLIDMYAKCGGIEE+YKVFRDVE+RNVVLWNAMISGLSRHARSLEVMILFEKMQQ
Sbjct: 301 IFVASSLIDMYAKCGGIEESYKVFRDVEKRNVVLWNAMISGLSRHARSLEVMILFEKMQQ 360

Query: 361 MGLSPNDVTFVSVLSACGHMGLVEKGQKYFDLMIKEHHLSPNVLHYSCMVDTLSRAGQTF 420
           MGLSPNDVTFVSVLSACGHMGLV KGQKYFDLM KEHHL+PNV HYSCMVDTLSRAGQ F
Sbjct: 361 MGLSPNDVTFVSVLSACGHMGLVRKGQKYFDLMTKEHHLAPNVFHYSCMVDTLSRAGQIF 420

Query: 421 KAYDLMSNMAFNATASMWGSLLASCRTHGNLELAEVAARNLFEIEPHNAGNYLLLSNMYA 480
           +AYDL+S + FNA+ASMWGSLLASCRTHGNLELAEVAA+ LF+IEPHN+GNYLLLSNMYA
Sbjct: 421 EAYDLISKLPFNASASMWGSLLASCRTHGNLELAEVAAKKLFDIEPHNSGNYLLLSNMYA 480

Query: 481 AHGKWDEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELIE 540
           A+GKWDEVAK RKLLKESDVKKERGKSWIEIKDKVH FMVGERNHPKI EIYSKLNE+++
Sbjct: 481 ANGKWDEVAKMRKLLKESDVKKERGKSWIEIKDKVHLFMVGERNHPKIVEIYSKLNEVMD 540

Query: 541 ELQKLGYKAETEHDLHQVGESRKQVLLRHHSEKLALTMGLLFLSPNAPIRIMKNLRICGD 600
           ELQKLGYK ET+HDLHQVGES KQ LLRHHSEKLA TMGLLFL PNAPIRIMKNLRICGD
Sbjct: 541 ELQKLGYKVETQHDLHQVGESIKQELLRHHSEKLAFTMGLLFLPPNAPIRIMKNLRICGD 600

Query: 601 CHSFMKLASRFVRRDVIVRDTNRFHHFKNGSCSCGDFW 631
           CHSFMKLAS+F  RDVIVRDTNRFHHFKNG CSCGDFW
Sbjct: 601 CHSFMKLASKFFCRDVIVRDTNRFHHFKNGCCSCGDFW 638
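
The percentages on each HSP summary line are the match count divided by the alignment length. A minimal sketch of that arithmetic, using the figures from the XP_004140992.1 hit above (the helper name `percent` is made up for illustration and is not part of any BLAST tool):

```python
def percent(matches: int, aligned_length: int) -> float:
    """Share of aligned columns that match, as a percentage with two decimals."""
    return round(100 * matches / aligned_length, 2)

# Counts copied from the XP_004140992.1 summary line: 535 identical and
# 580 positive (identical or conservatively substituted) columns out of 638.
identity_pct = percent(535, 638)
positives_pct = percent(580, 638)

print(identity_pct, positives_pct)
```

The denominator is the alignment length including gap columns (638 here), not the 631-residue query length, which is why the same query yields different denominators against different subjects.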

BLAST of MC04g1418 vs. NCBI nr
Match: XP_008456610.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g04780 [Cucumis melo])

HSP 1 Score: 1058 bits (2736), Expect = 0.0
Identity = 537/642 (83.64%), Positives = 580/642 (90.34%), Query Frame = 0

Query: 1   MGFLHVRNFAANSGRNRERR--------LFLLLRVRLRTCQF---SAASSFIVESGKGTS 60
           MGFLHV +FA+NSGR RE+         +F  L +RL T QF   S+ SS IVE  K TS
Sbjct: 1   MGFLHVCHFASNSGRYREKGKGKGKGKGIFQFLSLRLCTTQFFASSSPSSCIVECEKPTS 60

Query: 61  EDFYATHFSYVHELLKLCAERRLPIQGKACHAQILLMGLQKDTSTSNILINMYSKCGLVD 120
           +DF ATH SYVHE+LKLCA+R+L +QGKACHAQILLMGL+ D  TSNILIN YSKCG VD
Sbjct: 61  KDFNATHVSYVHEILKLCAKRKLFLQGKACHAQILLMGLKTDLLTSNILINTYSKCGSVD 120

Query: 121 FARKVFDEMPNRNLVSWSTMIGSLTQNGEENQALGLLLQMKREGIPFSEFTISSVLCACA 180
            AR+VFDEMP+R+LVSW+TMIGSLTQNG+EN+ALGLLLQM+REG PFSEFTISSVLCACA
Sbjct: 121 SARQVFDEMPSRSLVSWNTMIGSLTQNGQENEALGLLLQMQREGTPFSEFTISSVLCACA 180

Query: 181 AKCALYECQLLHAFAIKAAMNLNVFVATALLDVYAKCGLMNDAARVFESMPERSAVTWSS 240
           AKCAL ECQLLHAF IKAAM+LNVFVATALLDVYAKCGLM DA  VFESMP+RS VTWSS
Sbjct: 181 AKCALSECQLLHAFVIKAAMDLNVFVATALLDVYAKCGLMKDAVSVFESMPDRSVVTWSS 240

Query: 241 MAAGYVQNELYEEALALFRKAQGIGLRQDQFFMSSLVCACARLAAMIEGNQVNALLSKSG 300
           MAAGYVQNE+YEEALALFRKA   GL+ DQF MSS++CACA LAAMIEG QVNALLSKSG
Sbjct: 241 MAAGYVQNEMYEEALALFRKAWETGLKHDQFLMSSVICACAGLAAMIEGKQVNALLSKSG 300

Query: 301 FCSNIFVASSLIDMYAKCGGIEEAYKVFRDVEERNVVLWNAMISGLSRHARSLEVMILFE 360
           FCSNIFVASSLIDMYAKCGGIEE+YKVF+DVE RNVVLWNAMISGLSRHARSLEVMILFE
Sbjct: 301 FCSNIFVASSLIDMYAKCGGIEESYKVFQDVERRNVVLWNAMISGLSRHARSLEVMILFE 360

Query: 361 KMQQMGLSPNDVTFVSVLSACGHMGLVEKGQKYFDLMIKEHHLSPNVLHYSCMVDTLSRA 420
           KMQQMGLSPNDVTFVSVLSACGHMGLV+KGQKYFDLMIKEHHL+PNV+HYSCMVDTLSRA
Sbjct: 361 KMQQMGLSPNDVTFVSVLSACGHMGLVKKGQKYFDLMIKEHHLAPNVIHYSCMVDTLSRA 420

Query: 421 GQTFKAYDLMSNMAFNATASMWGSLLASCRTHGNLELAEVAARNLFEIEPHNAGNYLLLS 480
           GQTF+AYDL+S M FNA+ASMWGSLLASCRTHGNLELAE AA+ LF+IEPHN+GNYLLLS
Sbjct: 421 GQTFEAYDLISKMPFNASASMWGSLLASCRTHGNLELAEFAAKKLFDIEPHNSGNYLLLS 480

Query: 481 NMYAAHGKWDEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLN 540
           NMYAA+GKWDEVAK RKLLKESDVKKERGKSWIEIKDKVH FMVGERNHPKI EIYSKLN
Sbjct: 481 NMYAANGKWDEVAKMRKLLKESDVKKERGKSWIEIKDKVHLFMVGERNHPKIVEIYSKLN 540

Query: 541 ELIEELQKLGYKAETEHDLHQVGESRKQVLLRHHSEKLALTMGLLFLSPNAPIRIMKNLR 600
           E+++ELQKLGYKAET+HDLHQVGES KQ LLRHHSEKLA  MGLLFL P+APIRIMKNLR
Sbjct: 541 EVMDELQKLGYKAETQHDLHQVGESIKQELLRHHSEKLAFIMGLLFLPPSAPIRIMKNLR 600

Query: 601 ICGDCHSFMKLASRFVRRDVIVRDTNRFHHFKNGSCSCGDFW 631
           ICGDCHSFMKLAS+FV RDVIVRDTNRFHHFKNG CSCGDFW
Sbjct: 601 ICGDCHSFMKLASKFVCRDVIVRDTNRFHHFKNGCCSCGDFW 642

BLAST of MC04g1418 vs. ExPASy TrEMBL
Match: A0A6J1BZB9 (pentatricopeptide repeat-containing protein At5g04780 OS=Momordica charantia OX=3673 GN=LOC111006613 PE=3 SV=1)

HSP 1 Score: 1251 bits (3236), Expect = 0.0
Identity = 630/631 (99.84%), Positives = 631/631 (100.00%), Query Frame = 0

Query: 1   MGFLHVRNFAANSGRNRERRLFLLLRVRLRTCQFSAASSFIVESGKGTSEDFYATHFSYV 60
           MGFLHVRNFAANSGRNRERRLFLLLRVRLRTCQFSAASSFIVESGKGTSEDFYATHFSYV
Sbjct: 1   MGFLHVRNFAANSGRNRERRLFLLLRVRLRTCQFSAASSFIVESGKGTSEDFYATHFSYV 60

Query: 61  HELLKLCAERRLPIQGKACHAQILLMGLQKDTSTSNILINMYSKCGLVDFARKVFDEMPN 120
           HELLKLCAERRLPIQGKACHAQILLMGLQKDTSTSNILINMYSKCGLVDFARKVFDEMPN
Sbjct: 61  HELLKLCAERRLPIQGKACHAQILLMGLQKDTSTSNILINMYSKCGLVDFARKVFDEMPN 120

Query: 121 RNLVSWSTMIGSLTQNGEENQALGLLLQMKREGIPFSEFTISSVLCACAAKCALYECQLL 180
           RNLVSWSTMIGSLTQNGEENQALGLL+QMKREGIPFSEFTISSVLCACAAKCALYECQLL
Sbjct: 121 RNLVSWSTMIGSLTQNGEENQALGLLVQMKREGIPFSEFTISSVLCACAAKCALYECQLL 180

Query: 181 HAFAIKAAMNLNVFVATALLDVYAKCGLMNDAARVFESMPERSAVTWSSMAAGYVQNELY 240
           HAFAIKAAMNLNVFVATALLDVYAKCGLMNDAARVFESMPERSAVTWSSMAAGYVQNELY
Sbjct: 181 HAFAIKAAMNLNVFVATALLDVYAKCGLMNDAARVFESMPERSAVTWSSMAAGYVQNELY 240

Query: 241 EEALALFRKAQGIGLRQDQFFMSSLVCACARLAAMIEGNQVNALLSKSGFCSNIFVASSL 300
           EEALALFRKAQGIGLRQDQFFMSSLVCACARLAAMIEGNQVNALLSKSGFCSNIFVASSL
Sbjct: 241 EEALALFRKAQGIGLRQDQFFMSSLVCACARLAAMIEGNQVNALLSKSGFCSNIFVASSL 300

Query: 301 IDMYAKCGGIEEAYKVFRDVEERNVVLWNAMISGLSRHARSLEVMILFEKMQQMGLSPND 360
           IDMYAKCGGIEEAYKVFRDVEERNVVLWNAMISGLSRHARSLEVMILFEKMQQMGLSPND
Sbjct: 301 IDMYAKCGGIEEAYKVFRDVEERNVVLWNAMISGLSRHARSLEVMILFEKMQQMGLSPND 360

Query: 361 VTFVSVLSACGHMGLVEKGQKYFDLMIKEHHLSPNVLHYSCMVDTLSRAGQTFKAYDLMS 420
           VTFVSVLSACGHMGLVEKGQKYFDLMIKEHHLSPNVLHYSCMVDTLSRAGQTFKAYDLMS
Sbjct: 361 VTFVSVLSACGHMGLVEKGQKYFDLMIKEHHLSPNVLHYSCMVDTLSRAGQTFKAYDLMS 420

Query: 421 NMAFNATASMWGSLLASCRTHGNLELAEVAARNLFEIEPHNAGNYLLLSNMYAAHGKWDE 480
           NMAFNATASMWGSLLASCRTHGNLELAEVAARNLFEIEPHNAGNYLLLSNMYAAHGKWDE
Sbjct: 421 NMAFNATASMWGSLLASCRTHGNLELAEVAARNLFEIEPHNAGNYLLLSNMYAAHGKWDE 480

Query: 481 VAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELIEELQKLGY 540
           VAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELIEELQKLGY
Sbjct: 481 VAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELIEELQKLGY 540

Query: 541 KAETEHDLHQVGESRKQVLLRHHSEKLALTMGLLFLSPNAPIRIMKNLRICGDCHSFMKL 600
           KAETEHDLHQVGESRKQVLLRHHSEKLALTMGLLFLSPNAPIRIMKNLRICGDCHSFMKL
Sbjct: 541 KAETEHDLHQVGESRKQVLLRHHSEKLALTMGLLFLSPNAPIRIMKNLRICGDCHSFMKL 600

Query: 601 ASRFVRRDVIVRDTNRFHHFKNGSCSCGDFW 631
           ASRFVRRDVIVRDTNRFHHFKNGSCSCGDFW
Sbjct: 601 ASRFVRRDVIVRDTNRFHHFKNGSCSCGDFW 631

BLAST of MC04g1418 vs. ExPASy TrEMBL
Match: A0A1S3C4B6 (pentatricopeptide repeat-containing protein At5g04780 OS=Cucumis melo OX=3656 GN=LOC103496519 PE=3 SV=1)

HSP 1 Score: 1058 bits (2736), Expect = 0.0
Identity = 537/642 (83.64%), Positives = 580/642 (90.34%), Query Frame = 0

Query: 1   MGFLHVRNFAANSGRNRERR--------LFLLLRVRLRTCQF---SAASSFIVESGKGTS 60
           MGFLHV +FA+NSGR RE+         +F  L +RL T QF   S+ SS IVE  K TS
Sbjct: 1   MGFLHVCHFASNSGRYREKGKGKGKGKGIFQFLSLRLCTTQFFASSSPSSCIVECEKPTS 60

Query: 61  EDFYATHFSYVHELLKLCAERRLPIQGKACHAQILLMGLQKDTSTSNILINMYSKCGLVD 120
           +DF ATH SYVHE+LKLCA+R+L +QGKACHAQILLMGL+ D  TSNILIN YSKCG VD
Sbjct: 61  KDFNATHVSYVHEILKLCAKRKLFLQGKACHAQILLMGLKTDLLTSNILINTYSKCGSVD 120

Query: 121 FARKVFDEMPNRNLVSWSTMIGSLTQNGEENQALGLLLQMKREGIPFSEFTISSVLCACA 180
            AR+VFDEMP+R+LVSW+TMIGSLTQNG+EN+ALGLLLQM+REG PFSEFTISSVLCACA
Sbjct: 121 SARQVFDEMPSRSLVSWNTMIGSLTQNGQENEALGLLLQMQREGTPFSEFTISSVLCACA 180

Query: 181 AKCALYECQLLHAFAIKAAMNLNVFVATALLDVYAKCGLMNDAARVFESMPERSAVTWSS 240
           AKCAL ECQLLHAF IKAAM+LNVFVATALLDVYAKCGLM DA  VFESMP+RS VTWSS
Sbjct: 181 AKCALSECQLLHAFVIKAAMDLNVFVATALLDVYAKCGLMKDAVSVFESMPDRSVVTWSS 240

Query: 241 MAAGYVQNELYEEALALFRKAQGIGLRQDQFFMSSLVCACARLAAMIEGNQVNALLSKSG 300
           MAAGYVQNE+YEEALALFRKA   GL+ DQF MSS++CACA LAAMIEG QVNALLSKSG
Sbjct: 241 MAAGYVQNEMYEEALALFRKAWETGLKHDQFLMSSVICACAGLAAMIEGKQVNALLSKSG 300

Query: 301 FCSNIFVASSLIDMYAKCGGIEEAYKVFRDVEERNVVLWNAMISGLSRHARSLEVMILFE 360
           FCSNIFVASSLIDMYAKCGGIEE+YKVF+DVE RNVVLWNAMISGLSRHARSLEVMILFE
Sbjct: 301 FCSNIFVASSLIDMYAKCGGIEESYKVFQDVERRNVVLWNAMISGLSRHARSLEVMILFE 360

Query: 361 KMQQMGLSPNDVTFVSVLSACGHMGLVEKGQKYFDLMIKEHHLSPNVLHYSCMVDTLSRA 420
           KMQQMGLSPNDVTFVSVLSACGHMGLV+KGQKYFDLMIKEHHL+PNV+HYSCMVDTLSRA
Sbjct: 361 KMQQMGLSPNDVTFVSVLSACGHMGLVKKGQKYFDLMIKEHHLAPNVIHYSCMVDTLSRA 420

Query: 421 GQTFKAYDLMSNMAFNATASMWGSLLASCRTHGNLELAEVAARNLFEIEPHNAGNYLLLS 480
           GQTF+AYDL+S M FNA+ASMWGSLLASCRTHGNLELAE AA+ LF+IEPHN+GNYLLLS
Sbjct: 421 GQTFEAYDLISKMPFNASASMWGSLLASCRTHGNLELAEFAAKKLFDIEPHNSGNYLLLS 480

Query: 481 NMYAAHGKWDEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLN 540
           NMYAA+GKWDEVAK RKLLKESDVKKERGKSWIEIKDKVH FMVGERNHPKI EIYSKLN
Sbjct: 481 NMYAANGKWDEVAKMRKLLKESDVKKERGKSWIEIKDKVHLFMVGERNHPKIVEIYSKLN 540

Query: 541 ELIEELQKLGYKAETEHDLHQVGESRKQVLLRHHSEKLALTMGLLFLSPNAPIRIMKNLR 600
           E+++ELQKLGYKAET+HDLHQVGES KQ LLRHHSEKLA  MGLLFL P+APIRIMKNLR
Sbjct: 541 EVMDELQKLGYKAETQHDLHQVGESIKQELLRHHSEKLAFIMGLLFLPPSAPIRIMKNLR 600

Query: 601 ICGDCHSFMKLASRFVRRDVIVRDTNRFHHFKNGSCSCGDFW 631
           ICGDCHSFMKLAS+FV RDVIVRDTNRFHHFKNG CSCGDFW
Sbjct: 601 ICGDCHSFMKLASKFVCRDVIVRDTNRFHHFKNGCCSCGDFW 642

BLAST of MC04g1418 vs. ExPASy TrEMBL
Match: A0A6J1JWF9 (pentatricopeptide repeat-containing protein At5g04780, mitochondrial isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111488964 PE=3 SV=1)

HSP 1 Score: 1056 bits (2732), Expect = 0.0
Identity = 533/631 (84.47%), Positives = 575/631 (91.13%), Query Frame = 0

Query: 1   MGFLHVRNFAANSGRNRERRLFLLLRVRLRTCQFSAASSFIVESGKGTSEDFYATHFSYV 60
           MGFLHV +FA NSG+     +F  L +R+   +F A+SS I+E  KGT+EDF +TH SYV
Sbjct: 1   MGFLHVCHFATNSGK-----IFQFLSLRVCVSRFFASSSCIIECEKGTTEDFCSTHVSYV 60

Query: 61  HELLKLCAERRLPIQGKACHAQILLMGLQKDTSTSNILINMYSKCGLVDFARKVFDEMPN 120
            ELLKLCA+RRL +QGKACHA+ILLMGLQ +T TSNILINMYSKCG VD AR+VFDEMP+
Sbjct: 61  LELLKLCAKRRLFLQGKACHARILLMGLQAETLTSNILINMYSKCGSVDVARQVFDEMPS 120

Query: 121 RNLVSWSTMIGSLTQNGEENQALGLLLQMKREGIPFSEFTISSVLCACAAKCALYECQLL 180
           R+LVSW+TMIGSLTQNGEEN+AL LLLQM+R G PFSEFTISSVLCACAAKCAL ECQLL
Sbjct: 121 RSLVSWNTMIGSLTQNGEENEALSLLLQMQRAGTPFSEFTISSVLCACAAKCALSECQLL 180

Query: 181 HAFAIKAAMNLNVFVATALLDVYAKCGLMNDAARVFESMPERSAVTWSSMAAGYVQNELY 240
           HAFAIKAAMNLNVFVATALLDVYAKCGLMNDAA VFESM ERS VTWSSMAAGYVQN +Y
Sbjct: 181 HAFAIKAAMNLNVFVATALLDVYAKCGLMNDAAHVFESMSERSVVTWSSMAAGYVQNAMY 240

Query: 241 EEALALFRKAQGIGLRQDQFFMSSLVCACARLAAMIEGNQVNALLSKSGFCSNIFVASSL 300
           EEALALFRKA   GL+ DQF MSS++CACA LAAMIEGNQVNALLSKSGFCSNIFVASSL
Sbjct: 241 EEALALFRKAWETGLKHDQFLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSL 300

Query: 301 IDMYAKCGGIEEAYKVFRDVEERNVVLWNAMISGLSRHARSLEVMILFEKMQQMGLSPND 360
           IDMYAKCGGIEEAYKVFRDVE RNVVLWNAMISGLSRHARSLEVMILFEKMQQ+GL+PND
Sbjct: 301 IDMYAKCGGIEEAYKVFRDVEYRNVVLWNAMISGLSRHARSLEVMILFEKMQQIGLNPND 360

Query: 361 VTFVSVLSACGHMGLVEKGQKYFDLMIKEHHLSPNVLHYSCMVDTLSRAGQTFKAYDLMS 420
           VTFVSVLSACGHMGLVEKGQKYFDLMIKE+HL+PNV HYSCMVD LSRAG+T  AYDL+ 
Sbjct: 361 VTFVSVLSACGHMGLVEKGQKYFDLMIKEYHLAPNVFHYSCMVDALSRAGRTSDAYDLIC 420

Query: 421 NMAFNATASMWGSLLASCRTHGNLELAEVAARNLFEIEPHNAGNYLLLSNMYAAHGKWDE 480
            M F A+ASMWGSLLASCRTHGNLELAEVAA+NLF+IEP NAGNYLLLSNMYAA+GKWDE
Sbjct: 421 KMPFRASASMWGSLLASCRTHGNLELAEVAAKNLFDIEPQNAGNYLLLSNMYAANGKWDE 480

Query: 481 VAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELIEELQKLGY 540
           VAKARKLLKESDVKKERGKSWIEIK++VHSFMVGERNHPKI+EIYSKLNELIEELQKLGY
Sbjct: 481 VAKARKLLKESDVKKERGKSWIEIKNEVHSFMVGERNHPKIAEIYSKLNELIEELQKLGY 540

Query: 541 KAETEHDLHQVGESRKQVLLRHHSEKLALTMGLLFLSPNAPIRIMKNLRICGDCHSFMKL 600
           + ET+HDLHQVGESRKQ LLRHHSEKLA TMGLLFL PNAP+RIMKNLRICGDCHSFMKL
Sbjct: 541 QVETQHDLHQVGESRKQELLRHHSEKLAFTMGLLFLPPNAPLRIMKNLRICGDCHSFMKL 600

Query: 601 ASRFVRRDVIVRDTNRFHHFKNGSCSCGDFW 631
           AS+ VRRDV+VRDTNRFHHF NG CSCGDFW
Sbjct: 601 ASKLVRRDVVVRDTNRFHHFTNGHCSCGDFW 626

BLAST of MC04g1418 vs. ExPASy TrEMBL
Match: A0A6J1FDG7 (pentatricopeptide repeat-containing protein At5g04780, mitochondrial isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111444755 PE=3 SV=1)

HSP 1 Score: 1054 bits (2725), Expect = 0.0
Identity = 531/631 (84.15%), Positives = 575/631 (91.13%), Query Frame = 0

Query: 1   MGFLHVRNFAANSGRNRERRLFLLLRVRLRTCQFSAASSFIVESGKGTSEDFYATHFSYV 60
           MGFLHV +FA NSG+     +F  L +R+   +F A+SS I+E  KGT+EDF +TH SYV
Sbjct: 1   MGFLHVCHFATNSGK-----IFQFLSLRVCISRFFASSSCIIECEKGTTEDFCSTHVSYV 60

Query: 61  HELLKLCAERRLPIQGKACHAQILLMGLQKDTSTSNILINMYSKCGLVDFARKVFDEMPN 120
            ELLKLCA+RRL +QGKACHA+ILLMGLQ +TSTSNILINMY+KCG VD AR+VFDEMP+
Sbjct: 61  LELLKLCAKRRLFLQGKACHARILLMGLQAETSTSNILINMYAKCGSVDVARQVFDEMPS 120

Query: 121 RNLVSWSTMIGSLTQNGEENQALGLLLQMKREGIPFSEFTISSVLCACAAKCALYECQLL 180
           R+LVSWSTMIGSLTQNGEEN+ALGL LQM+REG PFSEFTISSVLCACAAKCAL ECQLL
Sbjct: 121 RSLVSWSTMIGSLTQNGEENEALGLFLQMQREGTPFSEFTISSVLCACAAKCALSECQLL 180

Query: 181 HAFAIKAAMNLNVFVATALLDVYAKCGLMNDAARVFESMPERSAVTWSSMAAGYVQNELY 240
           HAFA+KAAMNLNVFVATALLDVYAKCGLMNDAA VFESM ERS VTWSSMAAGYVQN +Y
Sbjct: 181 HAFAVKAAMNLNVFVATALLDVYAKCGLMNDAANVFESMSERSVVTWSSMAAGYVQNAMY 240

Query: 241 EEALALFRKAQGIGLRQDQFFMSSLVCACARLAAMIEGNQVNALLSKSGFCSNIFVASSL 300
           EEALALFRKA   GL+ DQF MSS++CACA LAAMIEGNQVNALLSKSGFCSNIFVASSL
Sbjct: 241 EEALALFRKAWETGLKHDQFLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSL 300

Query: 301 IDMYAKCGGIEEAYKVFRDVEERNVVLWNAMISGLSRHARSLEVMILFEKMQQMGLSPND 360
           IDMYAKCGGIEEAYKVFRDVE RNVVLWNAMISGLSRHARSLEVMILFEKMQQ+GL+PND
Sbjct: 301 IDMYAKCGGIEEAYKVFRDVEYRNVVLWNAMISGLSRHARSLEVMILFEKMQQIGLNPND 360

Query: 361 VTFVSVLSACGHMGLVEKGQKYFDLMIKEHHLSPNVLHYSCMVDTLSRAGQTFKAYDLMS 420
           VTFVSVLSACGHMGLVEKGQKYFDLMIKE+HL+PNV HYSCMVD LSRAG+T  AY L+ 
Sbjct: 361 VTFVSVLSACGHMGLVEKGQKYFDLMIKEYHLAPNVFHYSCMVDALSRAGRTSDAYVLIC 420

Query: 421 NMAFNATASMWGSLLASCRTHGNLELAEVAARNLFEIEPHNAGNYLLLSNMYAAHGKWDE 480
            M F A+AS+WGSLLASCRTHGNLELAEVAA+NLF+IEPHNAGNYLLLSNMYAA+GKWDE
Sbjct: 421 KMPFRASASIWGSLLASCRTHGNLELAEVAAKNLFDIEPHNAGNYLLLSNMYAANGKWDE 480

Query: 481 VAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELIEELQKLGY 540
           VAKARKLLKESDVKKERGKSWIEIK++VH FMVGERNHPKI+EIYSKLNELIEELQKLGY
Sbjct: 481 VAKARKLLKESDVKKERGKSWIEIKNEVHLFMVGERNHPKIAEIYSKLNELIEELQKLGY 540

Query: 541 KAETEHDLHQVGESRKQVLLRHHSEKLALTMGLLFLSPNAPIRIMKNLRICGDCHSFMKL 600
           + ET+HDLHQV ESRKQ LLRHHSEKLA TMGLLFL PNAP+RIMKNLRICGDCHSFMKL
Sbjct: 541 QVETQHDLHQVEESRKQELLRHHSEKLAFTMGLLFLPPNAPLRIMKNLRICGDCHSFMKL 600

Query: 601 ASRFVRRDVIVRDTNRFHHFKNGSCSCGDFW 631
           AS+ VRRDV+VRDTNRFHHF NG CSCGDFW
Sbjct: 601 ASKLVRRDVVVRDTNRFHHFTNGHCSCGDFW 626

BLAST of MC04g1418 vs. ExPASy TrEMBL
Match: A0A6J1JUA1 (pentatricopeptide repeat-containing protein At5g04780, mitochondrial isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111488964 PE=3 SV=1)

HSP 1 Score: 986 bits (2548), Expect = 0.0
Identity = 506/631 (80.19%), Positives = 546/631 (86.53%), Query Frame = 0

Query: 1   MGFLHVRNFAANSGRNRERRLFLLLRVRLRTCQFSAASSFIVESGKGTSEDFYATHFSYV 60
           MGFLHV +FA NSG+     +F  L +R+   +F A+SS I+E  KGT+EDF +TH SYV
Sbjct: 1   MGFLHVCHFATNSGK-----IFQFLSLRVCVSRFFASSSCIIECEKGTTEDFCSTHVSYV 60

Query: 61  HELLKLCAERRLPIQGKACHAQILLMGLQKDTSTSNILINMYSKCGLVDFARKVFDEMPN 120
            ELLKLCA+RRL +Q                                VD AR+VFDEMP+
Sbjct: 61  LELLKLCAKRRLFLQ--------------------------------VDVARQVFDEMPS 120

Query: 121 RNLVSWSTMIGSLTQNGEENQALGLLLQMKREGIPFSEFTISSVLCACAAKCALYECQLL 180
           R+LVSW+TMIGSLTQNGEEN+AL LLLQM+R G PFSEFTISSVLCACAAKCAL ECQLL
Sbjct: 121 RSLVSWNTMIGSLTQNGEENEALSLLLQMQRAGTPFSEFTISSVLCACAAKCALSECQLL 180

Query: 181 HAFAIKAAMNLNVFVATALLDVYAKCGLMNDAARVFESMPERSAVTWSSMAAGYVQNELY 240
           HAFAIKAAMNLNVFVATALLDVYAKCGLMNDAA VFESM ERS VTWSSMAAGYVQN +Y
Sbjct: 181 HAFAIKAAMNLNVFVATALLDVYAKCGLMNDAAHVFESMSERSVVTWSSMAAGYVQNAMY 240

Query: 241 EEALALFRKAQGIGLRQDQFFMSSLVCACARLAAMIEGNQVNALLSKSGFCSNIFVASSL 300
           EEALALFRKA   GL+ DQF MSS++CACA LAAMIEGNQVNALLSKSGFCSNIFVASSL
Sbjct: 241 EEALALFRKAWETGLKHDQFLMSSVICACAGLAAMIEGNQVNALLSKSGFCSNIFVASSL 300

Query: 301 IDMYAKCGGIEEAYKVFRDVEERNVVLWNAMISGLSRHARSLEVMILFEKMQQMGLSPND 360
           IDMYAKCGGIEEAYKVFRDVE RNVVLWNAMISGLSRHARSLEVMILFEKMQQ+GL+PND
Sbjct: 301 IDMYAKCGGIEEAYKVFRDVEYRNVVLWNAMISGLSRHARSLEVMILFEKMQQIGLNPND 360

Query: 361 VTFVSVLSACGHMGLVEKGQKYFDLMIKEHHLSPNVLHYSCMVDTLSRAGQTFKAYDLMS 420
           VTFVSVLSACGHMGLVEKGQKYFDLMIKE+HL+PNV HYSCMVD LSRAG+T  AYDL+ 
Sbjct: 361 VTFVSVLSACGHMGLVEKGQKYFDLMIKEYHLAPNVFHYSCMVDALSRAGRTSDAYDLIC 420

Query: 421 NMAFNATASMWGSLLASCRTHGNLELAEVAARNLFEIEPHNAGNYLLLSNMYAAHGKWDE 480
            M F A+ASMWGSLLASCRTHGNLELAEVAA+NLF+IEP NAGNYLLLSNMYAA+GKWDE
Sbjct: 421 KMPFRASASMWGSLLASCRTHGNLELAEVAAKNLFDIEPQNAGNYLLLSNMYAANGKWDE 480

Query: 481 VAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELIEELQKLGY 540
           VAKARKLLKESDVKKERGKSWIEIK++VHSFMVGERNHPKI+EIYSKLNELIEELQKLGY
Sbjct: 481 VAKARKLLKESDVKKERGKSWIEIKNEVHSFMVGERNHPKIAEIYSKLNELIEELQKLGY 540

Query: 541 KAETEHDLHQVGESRKQVLLRHHSEKLALTMGLLFLSPNAPIRIMKNLRICGDCHSFMKL 600
           + ET+HDLHQVGESRKQ LLRHHSEKLA TMGLLFL PNAP+RIMKNLRICGDCHSFMKL
Sbjct: 541 QVETQHDLHQVGESRKQELLRHHSEKLAFTMGLLFLPPNAPLRIMKNLRICGDCHSFMKL 594

Query: 601 ASRFVRRDVIVRDTNRFHHFKNGSCSCGDFW 631
           AS+ VRRDV+VRDTNRFHHF NG CSCGDFW
Sbjct: 601 ASKLVRRDVVVRDTNRFHHFTNGHCSCGDFW 594

BLAST of MC04g1418 vs. TAIR 10
Match: AT5G04780.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 718.8 bits (1854), Expect = 3.9e-207
Identity = 345/582 (59.28%), Positives = 450/582 (77.32%), Query Frame = 0

Query: 53  YATHFS---YVHELLKLCAERRLPIQGKACHAQILLMGLQKDTSTSNILINMYSKCGLVD 112
           Y+  FS    VHE+L+LCA     ++ KACH +I+ + L+ D +  N+LIN YSKCG V+
Sbjct: 54  YSNEFSNRNLVHEILQLCARNGAVMEAKACHGKIIRIDLEGDVTLLNVLINAYSKCGFVE 113

Query: 113 FARKVFDEMPNRNLVSWSTMIGSLTQNGEENQALGLLLQMKREGIPFSEFTISSVLCACA 172
            AR+VFD M  R+LVSW+TMIG  T+N  E++AL + L+M+ EG  FSEFTISSVL AC 
Sbjct: 114 LARQVFDGMLERSLVSWNTMIGLYTRNRMESEALDIFLEMRNEGFKFSEFTISSVLSACG 173

Query: 173 AKCALYECQLLHAFAIKAAMNLNVFVATALLDVYAKCGLMNDAARVFESMPERSAVTWSS 232
             C   EC+ LH  ++K  ++LN++V TALLD+YAKCG++ DA +VFESM ++S+VTWSS
Sbjct: 174 VNCDALECKKLHCLSVKTCIDLNLYVGTALLDLYAKCGMIKDAVQVFESMQDKSSVTWSS 233

Query: 233 MAAGYVQNELYEEALALFRKAQGIGLRQDQFFMSSLVCACARLAAMIEGNQVNALLSKSG 292
           M AGYVQN+ YEEAL L+R+AQ + L Q+QF +SS++CAC+ LAA+IEG Q++A++ KSG
Sbjct: 234 MVAGYVQNKNYEEALLLYRRAQRMSLEQNQFTLSSVICACSNLAALIEGKQMHAVICKSG 293

Query: 293 FCSNIFVASSLIDMYAKCGGIEEAYKVFRDVEERNVVLWNAMISGLSRHARSLEVMILFE 352
           F SN+FVASS +DMYAKCG + E+Y +F +V+E+N+ LWN +ISG ++HAR  EVMILFE
Sbjct: 294 FGSNVFVASSAVDMYAKCGSLRESYIIFSEVQEKNLELWNTIISGFAKHARPKEVMILFE 353

Query: 353 KMQQMGLSPNDVTFVSVLSACGHMGLVEKGQKYFDLMIKEHHLSPNVLHYSCMVDTLSRA 412
           KMQQ G+ PN+VTF S+LS CGH GLVE+G+++F LM   + LSPNV+HYSCMVD L RA
Sbjct: 354 KMQQDGMHPNEVTFSSLLSVCGHTGLVEEGRRFFKLMRTTYGLSPNVVHYSCMVDILGRA 413

Query: 413 GQTFKAYDLMSNMAFNATASMWGSLLASCRTHGNLELAEVAARNLFEIEPHNAGNYLLLS 472
           G   +AY+L+ ++ F+ TAS+WGSLLASCR + NLELAEVAA  LFE+EP NAGN++LLS
Sbjct: 414 GLLSEAYELIKSIPFDPTASIWGSLLASCRVYKNLELAEVAAEKLFELEPENAGNHVLLS 473

Query: 473 NMYAAHGKWDEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLN 532
           N+YAA+ +W+E+AK+RKLL++ DVKK RGKSWI+IKDKVH+F VGE  HP+I EI S L+
Sbjct: 474 NIYAANKQWEEIAKSRKLLRDCDVKKVRGKSWIDIKDKVHTFSVGESGHPRIREICSTLD 533

Query: 533 ELIEELQKLGYKAETEHDLHQVGESRKQVLLRHHSEKLALTMGLLFLSPNAPIRIMKNLR 592
            L+ + +K GYK   EH+LH V   +K+ LL  HSEKLAL  GL+ L  ++P+RIMKNLR
Sbjct: 534 NLVIKFRKFGYKPSVEHELHDVEIGKKEELLMQHSEKLALVFGLMCLPESSPVRIMKNLR 593

Query: 593 ICGDCHSFMKLASRFVRRDVIVRDTNRFHHFKNGSCSCGDFW 632
           IC DCH FMK AS   RR +IVRD NRFHHF +G CSCGDFW
Sbjct: 594 ICVDCHEFMKAASMATRRFIIVRDVNRFHHFSDGHCSCGDFW 635

BLAST of MC04g1418 vs. TAIR 10
Match: AT5G52630.1 (mitochondrial RNAediting factor 1 )

HSP 1 Score: 463.4 bits (1191), Expect = 2.9e-130
Identity = 238/576 (41.32%), Positives = 361/576 (62.67%), Query Frame = 0

Query: 56  HFSYVHELLKLCAERRLPIQGKACHAQILLMGLQKDTSTSNILINMYSKCGLVDFARKVF 115
           +++ + +LL   A  R  I+G   H  ++  GL      +N LIN YSK  L   +R+ F
Sbjct: 14  NYNQICDLLLSSARTRSTIKGLQLHGYVVKSGLSLIPLVANNLINFYSKSQLPFDSRRAF 73

Query: 116 DEMPNRNLVSWSTMIGSLTQNGEENQALGLLLQMKREGIPFSEFTISSVLCACAAKCALY 175
           ++ P ++  +WS++I    QN     +L  L +M    +   +  + S   +CA      
Sbjct: 74  EDSPQKSSTTWSSIISCFAQNELPWMSLEFLKKMMAGNLRPDDHVLPSATKSCAILSRCD 133

Query: 176 ECQLLHAFAIKAAMNLNVFVATALLDVYAKCGLMNDAARVFESMPERSAVTWSSMAAGYV 235
             + +H  ++K   + +VFV ++L+D+YAKCG +  A ++F+ MP+R+ VTWS M  GY 
Sbjct: 134 IGRSVHCLSMKTGYDADVFVGSSLVDMYAKCGEIVYARKMFDEMPQRNVVTWSGMMYGYA 193

Query: 236 QNELYEEALALFRKAQGIGLRQDQFFMSSLVCACARLAAMIEGNQVNALLSKSGFCSNIF 295
           Q    EEAL LF++A    L  + +  SS++  CA    +  G Q++ L  KS F S+ F
Sbjct: 194 QMGENEEALWLFKEALFENLAVNDYSFSSVISVCANSTLLELGRQIHGLSIKSSFDSSSF 253

Query: 296 VASSLIDMYAKCGGIEEAYKVFRDVEERNVVLWNAMISGLSRHARSLEVMILFEKMQQMG 355
           V SSL+ +Y+KCG  E AY+VF +V  +N+ +WNAM+   ++H+ + +V+ LF++M+  G
Sbjct: 254 VGSSLVSLYSKCGVPEGAYQVFNEVPVKNLGIWNAMLKAYAQHSHTQKVIELFKRMKLSG 313

Query: 356 LSPNDVTFVSVLSACGHMGLVEKGQKYFDLMIKEHHLSPNVLHYSCMVDTLSRAGQTFKA 415
           + PN +TF++VL+AC H GLV++G+ YFD M KE  + P   HY+ +VD L RAG+  +A
Sbjct: 314 MKPNFITFLNVLNACSHAGLVDEGRYYFDQM-KESRIEPTDKHYASLVDMLGRAGRLQEA 373

Query: 416 YDLMSNMAFNATASMWGSLLASCRTHGNLELAEVAARNLFEIEPHNAGNYLLLSNMYAAH 475
            ++++NM  + T S+WG+LL SC  H N ELA  AA  +FE+ P ++G ++ LSN YAA 
Sbjct: 374 LEVITNMPIDPTESVWGALLTSCTVHKNTELAAFAADKVFELGPVSSGMHISLSNAYAAD 433

Query: 476 GKWDEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELIEEL 535
           G++++ AKARKLL++   KKE G SW+E ++KVH+F  GER H K  EIY KL EL EE+
Sbjct: 434 GRFEDAAKARKLLRDRGEKKETGLSWVEERNKVHTFAAGERRHEKSKEIYEKLAELGEEM 493

Query: 536 QKLGYKAETEHDLHQVGESRKQVLLRHHSEKLALTMGLLFLSPNAPIRIMKNLRICGDCH 595
           +K GY A+T + L +V    K   +R+HSE+LA+  GL+    + PIR+MKNLR+CGDCH
Sbjct: 494 EKAGYIADTSYVLREVDGDEKNQTIRYHSERLAIAFGLITFPADRPIRVMKNLRVCGDCH 553

Query: 596 SFMKLASRFVRRDVIVRDTNRFHHFKNGSCSCGDFW 632
           + +K  S   RR +IVRD NRFH F++G CSC D+W
Sbjct: 554 NAIKFMSVCTRRVIIVRDNNRFHRFEDGKCSCNDYW 588

BLAST of MC04g1418 vs. TAIR 10
Match: AT3G24000.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 454.1 bits (1167), Expect = 1.8e-127
Identity = 232/566 (40.99%), Positives = 361/566 (63.78%), Query Frame = 0

Query: 59  YVHELLKLCAERRLPIQGKACHAQILLMGLQKDTSTSNILINMYSKCGLVDFARKVFDEM 118
           + + LLK C   +L IQG+  HA IL    + D    N L+NMY+KCG ++ ARKVF++M
Sbjct: 62  FYNTLLKKCTVFKLLIQGRIVHAHILQSIFRHDIVMGNTLLNMYAKCGSLEEARKVFEKM 121

Query: 119 PNRNLVSWSTMIGSLTQNGEENQALGLLLQMKREGIPFSEFTISSVLCACAAKCALYECQ 178
           P R+ V+W+T+I   +Q+     AL    QM R G   +EFT+SSV+ A AA+       
Sbjct: 122 PQRDFVTWTTLISGYSQHDRPCDALLFFNQMLRFGYSPNEFTLSSVIKAAAAERRGCCGH 181

Query: 179 LLHAFAIKAAMNLNVFVATALLDVYAKCGLMNDAARVFESMPERSAVTWSSMAAGYVQNE 238
            LH F +K   + NV V +ALLD+Y + GLM+DA  VF+++  R+ V+W+++ AG+ +  
Sbjct: 182 QLHGFCVKCGFDSNVHVGSALLDLYTRYGLMDDAQLVFDALESRNDVSWNALIAGHARRS 241

Query: 239 LYEEALALFRKAQGIGLRQDQFFMSSLVCACARLAAMIEGNQVNALLSKSGFCSNIFVAS 298
             E+AL LF+     G R   F  +SL  AC+    + +G  V+A + KSG     F  +
Sbjct: 242 GTEKALELFQGMLRDGFRPSHFSYASLFGACSSTGFLEQGKWVHAYMIKSGEKLVAFAGN 301

Query: 299 SLIDMYAKCGGIEEAYKVFRDVEERNVVLWNAMISGLSRHARSLEVMILFEKMQQMGLSP 358
           +L+DMYAK G I +A K+F  + +R+VV WN++++  ++H    E +  FE+M+++G+ P
Sbjct: 302 TLLDMYAKSGSIHDARKIFDRLAKRDVVSWNSLLTAYAQHGFGKEAVWWFEEMRRVGIRP 361

Query: 359 NDVTFVSVLSACGHMGLVEKGQKYFDLMIKEHHLSPNVLHYSCMVDTLSRAGQTFKAYDL 418
           N+++F+SVL+AC H GL+++G  Y++LM K+  + P   HY  +VD L RAG   +A   
Sbjct: 362 NEISFLSVLTACSHSGLLDEGWHYYELM-KKDGIVPEAWHYVTVVDLLGRAGDLNRALRF 421

Query: 419 MSNMAFNATASMWGSLLASCRTHGNLELAEVAARNLFEIEPHNAGNYLLLSNMYAAHGKW 478
           +  M    TA++W +LL +CR H N EL   AA ++FE++P + G +++L N+YA+ G+W
Sbjct: 422 IEEMPIEPTAAIWKALLNACRMHKNTELGAYAAEHVFELDPDDPGPHVILYNIYASGGRW 481

Query: 479 DEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELIEELQKL 538
           ++ A+ RK +KES VKKE   SW+EI++ +H F+  +  HP+  EI  K  E++ ++++L
Sbjct: 482 NDAARVRKKMKESGVKKEPACSWVEIENAIHMFVANDERHPQREEIARKWEEVLAKIKEL 541

Query: 539 GYKAETEHDLHQVGESRKQVLLRHHSEKLALTMGLLFLSPNAPIRIMKNLRICGDCHSFM 598
           GY  +T H +  V +  ++V L++HSEK+AL   LL   P + I I KN+R+CGDCH+ +
Sbjct: 542 GYVPDTSHVIVHVDQQEREVNLQYHSEKIALAFALLNTPPGSTIHIKKNIRVCGDCHTAI 601

Query: 599 KLASRFVRRDVIVRDTNRFHHFKNGS 625
           KLAS+ V R++IVRDTNRFHHFK+ S
Sbjct: 602 KLASKVVGREIIVRDTNRFHHFKDAS 626

BLAST of MC04g1418 vs. TAIR 10
Match: AT2G03880.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 449.5 bits (1155), Expect = 4.4e-126
Identity = 232/570 (40.70%), Positives = 349/570 (61.23%), Query Frame = 0

Query: 62  ELLKLCAERRLPIQGKACHAQILLMGLQKDTSTSNILINMYSKCGLVDFARKVFDEMPNR 121
           EL+K C   R   +G      +   G +      N+LINMY K  L++ A ++FD+MP R
Sbjct: 66  ELIKCCISNRAVHEGNLICRHLYFNGHRPMMFLVNVLINMYVKFNLLNDAHQLFDQMPQR 125

Query: 122 NLVSWSTMIGSLTQNGEENQALGLLLQMKREGIPFSEFTISSVLCACAAKCALYECQLLH 181
           N++SW+TMI + ++     +AL LL+ M R+ +  + +T SSVL +C     + + ++LH
Sbjct: 126 NVISWTTMISAYSKCKIHQKALELLVLMLRDNVRPNVYTYSSVLRSCN---GMSDVRMLH 185

Query: 182 AFAIKAAMNLNVFVATALLDVYAKCGLMNDAARVFESMPERSAVTWSSMAAGYVQNELYE 241
              IK  +  +VFV +AL+DV+AK G   DA  VF+ M    A+ W+S+  G+ QN   +
Sbjct: 186 CGIIKEGLESDVFVRSALIDVFAKLGEPEDALSVFDEMVTGDAIVWNSIIGGFAQNSRSD 245

Query: 242 EALALFRKAQGIGLRQDQFFMSSLVCACARLAAMIEGNQVNALLSKSGFCSNIFVASSLI 301
            AL LF++ +  G   +Q  ++S++ AC  LA +  G Q +  + K  +  ++ + ++L+
Sbjct: 246 VALELFKRMKRAGFIAEQATLTSVLRACTGLALLELGMQAHVHIVK--YDQDLILNNALV 305

Query: 302 DMYAKCGGIEEAYKVFRDVEERNVVLWNAMISGLSRHARSLEVMILFEKMQQMGLSPNDV 361
           DMY KCG +E+A +VF  ++ER+V+ W+ MISGL+++  S E + LFE+M+  G  PN +
Sbjct: 306 DMYCKCGSLEDALRVFNQMKERDVITWSTMISGLAQNGYSQEALKLFERMKSSGTKPNYI 365

Query: 362 TFVSVLSACGHMGLVEKGQKYFDLMIKEHHLSPNVLHYSCMVDTLSRAGQTFKAYDLMSN 421
           T V VL AC H GL+E G  YF  M K + + P   HY CM+D L +AG+   A  L++ 
Sbjct: 366 TIVGVLFACSHAGLLEDGWYYFRSMKKLYGIDPVREHYGCMIDLLGKAGKLDDAVKLLNE 425

Query: 422 MAFNATASMWGSLLASCRTHGNLELAEVAARNLFEIEPHNAGNYLLLSNMYAAHGKWDEV 481
           M     A  W +LL +CR   N+ LAE AA+ +  ++P +AG Y LLSN+YA   KWD V
Sbjct: 426 MECEPDAVTWRTLLGACRVQRNMVLAEYAAKKVIALDPEDAGTYTLLSNIYANSQKWDSV 485

Query: 482 AKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKISEIYSKLNELIEELQKLGYK 541
            + R  +++  +KKE G SWIE+  ++H+F++G+ +HP+I E+  KLN+LI  L  +GY 
Sbjct: 486 EEIRTRMRDRGIKKEPGCSWIEVNKQIHAFIIGDNSHPQIVEVSKKLNQLIHRLTGIGYV 545

Query: 542 AETEHDLHQVGESRKQVLLRHHSEKLALTMGLLFLSPNAPIRIMKNLRICGDCHSFMKLA 601
            ET   L  +   + +  LRHHSEKLAL  GL+ L     IRI KNLRICGDCH F KLA
Sbjct: 546 PETNFVLQDLEGEQMEDSLRHHSEKLALAFGLMTLPIEKVIRIRKNLRICGDCHVFCKLA 605

Query: 602 SRFVRRDVIVRDTNRFHHFKNGSCSCGDFW 632
           S+   R +++RD  R+HHF++G CSCGD+W
Sbjct: 606 SKLEIRSIVIRDPIRYHHFQDGKCSCGDYW 630

BLAST of MC04g1418 vs. TAIR 10
Match: AT2G27610.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 444.9 bits (1143), Expect = 1.1e-124
Identity = 233/590 (39.49%), Positives = 363/590 (61.53%), Query Frame = 0

Query: 52  FYATHFSYVH-------ELLKLCAERRLPIQGKACHAQILLMGLQKDTSTSNILINMYSK 111
           FY+   +YV         ++KLCA  +     +  H  ++  G   D +    L+  YSK
Sbjct: 283 FYSMRLNYVRLSESSFASVIKLCANLKELRFTEQLHCSVVKYGFLFDQNIRTALMVAYSK 342

Query: 112 CGLVDFARKVFDEMP-NRNLVSWSTMIGSLTQNGEENQALGLLLQMKREGIPFSEFTISS 171
           C  +  A ++F E+    N+VSW+ MI    QN  + +A+ L  +MKR+G+  +EFT S 
Sbjct: 343 CTAMLDALRLFKEIGCVGNVVSWTAMISGFLQNDGKEEAVDLFSEMKRKGVRPNEFTYSV 402

Query: 172 VLCACAAKCALYECQLLHAFAIKAAMNLNVFVATALLDVYAKCGLMNDAARVFESMPERS 231
           +L A      +     +HA  +K     +  V TALLD Y K G + +AA+VF  + ++ 
Sbjct: 403 ILTA----LPVISPSEVHAQVVKTNYERSSTVGTALLDAYVKLGKVEEAAKVFSGIDDKD 462

Query: 232 AVTWSSMAAGYVQNELYEEALALFRKAQGIGLRQDQFFMSSLVCACARL-AAMIEGNQVN 291
            V WS+M AGY Q    E A+ +F +    G++ ++F  SS++  CA   A+M +G Q +
Sbjct: 463 IVAWSAMLAGYAQTGETEAAIKMFGELTKGGIKPNEFTFSSILNVCAATNASMGQGKQFH 522

Query: 292 ALLSKSGFCSNIFVASSLIDMYAKCGGIEEAYKVFRDVEERNVVLWNAMISGLSRHARSL 351
               KS   S++ V+S+L+ MYAK G IE A +VF+   E+++V WN+MISG ++H +++
Sbjct: 523 GFAIKSRLDSSLCVSSALLTMYAKKGNIESAEEVFKRQREKDLVSWNSMISGYAQHGQAM 582

Query: 352 EVMILFEKMQQMGLSPNDVTFVSVLSACGHMGLVEKGQKYFDLMIKEHHLSPNVLHYSCM 411
           + + +F++M++  +  + VTF+ V +AC H GLVE+G+KYFD+M+++  ++P   H SCM
Sbjct: 583 KALDVFKEMKKRKVKMDGVTFIGVFAACTHAGLVEEGEKYFDIMVRDCKIAPTKEHNSCM 642

Query: 412 VDTLSRAGQTFKAYDLMSNMAFNATASMWGSLLASCRTHGNLELAEVAARNLFEIEPHNA 471
           VD  SRAGQ  KA  ++ NM   A +++W ++LA+CR H   EL  +AA  +  ++P ++
Sbjct: 643 VDLYSRAGQLEKAMKVIENMPNPAGSTIWRTILAACRVHKKTELGRLAAEKIIAMKPEDS 702

Query: 472 GNYLLLSNMYAAHGKWDEVAKARKLLKESDVKKERGKSWIEIKDKVHSFMVGERNHPKIS 531
             Y+LLSNMYA  G W E AK RKL+ E +VKKE G SWIE+K+K +SF+ G+R+HP   
Sbjct: 703 AAYVLLSNMYAESGDWQERAKVRKLMNERNVKKEPGYSWIEVKNKTYSFLAGDRSHPLKD 762

Query: 532 EIYSKLNELIEELQKLGYKAETEHDLHQVGESRKQVLLRHHSEKLALTMGLLFLSPNAPI 591
           +IY KL +L   L+ LGY+ +T + L  + +  K+ +L  HSE+LA+  GL+     +P+
Sbjct: 763 QIYMKLEDLSTRLKDLGYEPDTSYVLQDIDDEHKEAVLAQHSERLAIAFGLIATPKGSPL 822

Query: 592 RIMKNLRICGDCHSFMKLASRFVRRDVIVRDTNRFHHF-KNGSCSCGDFW 632
            I+KNLR+CGDCH  +KL ++   R+++VRD+NRFHHF  +G CSCGDFW
Sbjct: 823 LIIKNLRVCGDCHLVIKLIAKIEEREIVVRDSNRFHHFSSDGVCSCGDFW 868

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
Q9LZ19	5.5e-206	59.28	Pentatricopeptide repeat-containing protein At5g04780, mitochondrial OS=Arabidop...
Q9LIQ7	2.6e-131	41.36	Pentatricopeptide repeat-containing protein At3g24000, mitochondrial OS=Arabidop...
Q9LTF4	4.1e-129	41.32	Putative pentatricopeptide repeat-containing protein At5g52630 OS=Arabidopsis th...
Q9SI53	6.2e-125	40.70	Pentatricopeptide repeat-containing protein At2g03880, mitochondrial OS=Arabidop...
Q9ZUW3	1.5e-123	39.49	Pentatricopeptide repeat-containing protein At2g27610 OS=Arabidopsis thaliana OX...

Match Name	E-value	Identity	Description
XP_022134327.1	0.0	99.84	pentatricopeptide repeat-containing protein At5g04780 [Momordica charantia]
KAG7016352.1	0.0	84.94	Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub...
XP_023550933.1	0.0	84.63	pentatricopeptide repeat-containing protein At5g04780, mitochondrial isoform X1 ...
XP_004140992.1	0.0	83.86	pentatricopeptide repeat-containing protein At5g04780, mitochondrial [Cucumis sa...
XP_008456610.1	0.0	83.64	PREDICTED: pentatricopeptide repeat-containing protein At5g04780 [Cucumis melo]

Match Name	E-value	Identity	Description
A0A6J1BZB9	0.0	99.84	pentatricopeptide repeat-containing protein At5g04780 OS=Momordica charantia OX=...
A0A1S3C4B6	0.0	83.64	pentatricopeptide repeat-containing protein At5g04780 OS=Cucumis melo OX=3656 GN...
A0A6J1JWF9	0.0	84.47	pentatricopeptide repeat-containing protein At5g04780, mitochondrial isoform X1 ...
A0A6J1FDG7	0.0	84.15	pentatricopeptide repeat-containing protein At5g04780, mitochondrial isoform X1 ...
A0A6J1JUA1	0.0	80.19	pentatricopeptide repeat-containing protein At5g04780, mitochondrial isoform X2 ...

Match Name	E-value	Identity	Description
AT5G04780.1	3.9e-207	59.28	Pentatricopeptide repeat (PPR) superfamily protein
AT5G52630.1	2.9e-130	41.32	mitochondrial RNAediting factor 1
AT3G24000.1	1.8e-127	40.99	Tetratricopeptide repeat (TPR)-like superfamily protein
AT2G03880.1	4.4e-126	40.70	Pentatricopeptide repeat (PPR) superfamily protein
AT2G27610.1	1.1e-124	39.49	Tetratricopeptide repeat (TPR)-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Coordinates	E-value	Score
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	361..395	0.0022	16.0
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	326..359	1.5E-4	19.7
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	124..155	1.7E-4	19.5
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	96..123	1.2E-4	20.0
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	225..254	8.7E-5	22.5
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	124..154	4.1E-5	23.5
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	398..422	1.4	9.3
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	197..222	0.0022	18.1
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	94..122	5.7E-6	26.2
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	323..370	6.9E-12	45.4
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	192..222	-	8.736214
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	91..125	-	10.873667
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	324..358	-	11.498462
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	223..257	-	9.196589
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	293..323	-	8.780059
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	49..177	5.4E-22	80.0
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	281..378	2.0E-24	88.0
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	178..280	1.1E-18	69.2
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	379..571	1.6E-11	46.3
IPR011990	Tetratricopeptide-like helical domain superfamily	SUPERFAMILY	48452	TPR-like	202..480	-	-
IPR032867	DYW domain	PFAM	PF14432	DYW_deaminase	498..621	1.5E-39	134.7
None	No IPR available	PANTHER	PTHR24015:SF1063	OS12G0156900 PROTEIN	61..622	-	-
None	No IPR available	PANTHER	PTHR24015	OS07G0578800 PROTEIN-RELATED	61..622	-	-
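
The InterPro hits are not listed in sequence order, so the repeat layout is easier to read once the coordinates are sorted. A minimal sketch for the five PROSITE PS51375 (PPR) hits above; it assumes the coordinates are 1-based and inclusive, and the names `ppr_hits` and `span_length` are illustrative only:

```python
# PROSITE PS51375 (PPR) hit coordinates copied from the InterPro listing above.
ppr_hits = [(192, 222), (91, 125), (324, 358), (223, 257), (293, 323)]

def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1

ordered = sorted(ppr_hits)  # sort by start position along the protein
lengths = [span_length(s, e) for s, e in ordered]
# Residues between each hit and the next one; 0 means the hits are back-to-back.
gaps = [nxt[0] - cur[1] - 1 for cur, nxt in zip(ordered, ordered[1:])]

for (s, e), n in zip(ordered, lengths):
    print(f"{s}..{e}\t{n} aa")
print("gaps:", gaps)
```

Under that assumption the hits are 31 or 35 residues long, and two pairs (192..222 with 223..257, and 293..323 with 324..358) are directly adjacent.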

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
MC04g1418.1	MC04g1418.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
molecular_function	GO:0005515	protein binding
molecular_function	GO:0008270	zinc ion binding