Moc03g30020 (gene) Bitter gourd (OHB3-1) v2

Overview
NameMoc03g30020
Typegene
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionPentatricopeptide repeat-containing protein
Locationchr3: 21393579 .. 21396683 (-)
RNA-Seq ExpressionMoc03g30020
SyntenyMoc03g30020
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTCTTGCGCACCAACACCACTTCCTCAACTCATTTCATTCCATTCGCAAGCCCAATCTGAACCAATACTCTCCCAATTCTCACTTGTTTTTTCAATTCAATACCCAGAAACTCGTCTGCTCATGTGCAACATCTCCAAATCCCACCACTCAGTCTCCATCCCCGATTTTCCTCCCTTTTCTCGAGGAAAACGAGGAACAAGAAGAAGAAGAAGAAGTTAAAGGAGCTCATCGCCATGAAGGAAACAAGATGGAAGATTGGGATGACCCACTGTTCAGATTCTTCAAATCACGCACTTCAACGACGCAAGACCCCCAGCGCGAAAGCAAAGTCTCCCTTCAGAAGAATCGCCGTTCTTCTTGGCATCTTGCCTCTGGTTCCGAATTTGCGGATGAAGCTGAAATTACAATCGACGAAGTCGCCGGACAATTGAGTTCTGTGAGTCAGAATTTTAGGGCCTTACCTGACGGCGTCGTCGGAGACATTATGCGAACTGCGAGGAACTTGCCGGAAAATACGACCCTGGGAGAGGCTTTGGCAGATTTCGATGGGAGAATTGGGGAGAAACAATGTATGGAGGTGTTACGTTTGTTGGGTGAGGAGAATCTTGTGGTATGTTGCTTGTATTTCTTTGAATGGATGGGTTTGCAGGAGCCTTCGCTTGTTACACCTCGTGCTTATTCTCTTATTTTTCCGTCGCTGGGGAGAGCTGGAATGGGAGAAAAAATTATGGTATTGTTCAAGAACCTTCCGCTCAAGAAGGAATTCCAGGATGTTCATGTGTATAATTCAGCAATGTCTGGGCTCATGTTCTGTAACAGGTACGAGATAGGCCTAAATTTATGACATTGTTTATGTTCTGTTTGTTCTATAGTTGATAATTTTTCTTGCTGCTTAATCTAGTCATCTGTCATCATTTATCTTTATGGAGGAACTTGGATTGAACTCGGAGCTGTTGAAATGATAAATTTCTGATAAGTTGCTTGCCATGGGGATTAGGGGTTTTGTGTAAGTTGAGGCATGCCATTTTGAAACATCACAGCATTTTGTTTTCCTTTTGATGCTTATAGTCTACCCCTACTAATCTATGGATTTCGCCCGCCCCAAAACCAACTTCAAGAGATGCCAGGGATACTGAGGGGTTTGTTTAGTAACGTTCTCGTATCTTGTTTTCATTTTCAATTTTATTAAGAAATTTAAATATTTGACAACTATTTATGTGTGTTGTTTCTCATTTCTTCTAGAAACATTCTAAAAATTGTATCCTTCTTGAATCTATAGAGTTGATGCCATGCAATTATATTTTTCTTTATTTAAAAAAGGAAATAGAGCAAGAAATGAAAACGTTATCAAACTTATAAACTCGAAGCAGGATATTGAGCATATTTCAGAATATTCGAAGATCTTTATCATCATAGCACCCTTTAGAATAGCATATATAAGTAGATCTTTCTTTTCTATTTTTATCAAAATTAAGTAGAACTACACCTTGTTAGTATTAATCCCTCCTTAATCTATGTTCTGTTCACATATGATTTATGGGGAGAGGAAAGACAGGGTTCTGTGCCATACTCTTCCAATCCAATGAAAACAATACTTGAATAAACCTTCTGGTATATGGAAGAACTAGCCTTCAAATTGGCTGTCAAATACAAATTATTTGAAGTCATATTTTTTTGGGGGGAAATAGCATGGTTTATATCTTTCGAGGTAGTCGTAGTAGTTGGGGAGGCTCGACTTTGATTATTCTCTAATCCTCATTTCCATGTTATAACTTTTAGGTATAATGATGCTTGCAAGGTCTATGAGGCCATGGAAACAAATAATGTTAATCCAGATCATGTGACATGTTCTATAATGATCACAGTCATGAGAAAAATCGGCCGCAGTGCAAAGGATTCCTGGGATTACTTTGAGAAAATGAACAAGAAAGGAGTAAAATGGAGTCCAGAAGTTTTGGGTGCCCTGATTAAATCGTTCTGCGACGAGGGGCTCAAGAGTCAAGCGCTTATCATCCAATTGGAGATGGAGAAGAAGGGGGTTGCTTCGAATGCGATCGTGTATAACACGATCTTGGATGCTTTTAGTAAATCGAACCAAATCGAGGAAGCCGAAGGTCTCTTTGCTGAAATGAAAGCTAAAGGGGTGAAACCGACTAGTGCAACATATAACGTTTTGATGGATGCATACAGTAGGAGGATGCAACCTGAGGTTGTTGAGAAGCTTATGCTTGAGATGAAGGATGTGGGTTTTGAGCCTAATGTGAAGTCATACACTTGCTTGATTAGTGCTTATGGAAGGCAAAAGAAAATGAGTGACATGGCTGCAGATGCATTTTTGAGGATGAAGAAAAATGGCATTAGGCCAACTTCACATTCATATACAGCTCTTATCCATGCCTATTCTGTAAGCGGTTGGCACGAGAAAGCCTACTTGACATTCGAGAACATGCAACGAGAAGGTTTAAAGCCATCCATTGAAACTTACACGACTCTGCTCGATGCATTTAGGCGTGCGGGTGACACTGTGGCATTGATGAAGATTTGGAAACTTATGATTAGAGAAAAAATAGAAGGGACAAGAGTAACATTCAACATACTGCTAGATGGGTTTGCAAAACAGGGTCATTATGTTGAAGCTAGAGATGTGATTTCTGAGTTTGGAAAGATTGGATTGCAACCAACCGTCATGACATACAATATGTTGATGAATGCATATGCAAGGGGAGGGCAACATGCAAAGATGCCACAGCTGCTGCAGGAGATGGCTGCTCGAGAACTAAAACCCGACTCGGTTACTTATTCAACCATGATCTACGCCTATGTTCGTGTTCGCGACTTCAAACGAGCTTTCTTCTATCACAAGAAGATGGTAAAAAGTGGACAGGTGCCAGATGTGAAGTCATACCAGAAACTTAGAGCAATTTTGGATGTAAAACTTGCTACAAAGAACAGGAAAGACAAGAGTGCCATTCTGGGTATAATAAACAGCAAAATGGGGATGGTGAAAGCCAAGAAAAAGGGCAAGAAAGATGAGTTCTGGAAGAACAAGAGAAAGCATGTGAAAACTCAGAGAGTTTCTCCCATGGAGGCAAAGAGTTGA

mRNA sequence

ATGGCTCTTGCGCACCAACACCACTTCCTCAACTCATTTCATTCCATTCGCAAGCCCAATCTGAACCAATACTCTCCCAATTCTCACTTGTTTTTTCAATTCAATACCCAGAAACTCGTCTGCTCATGTGCAACATCTCCAAATCCCACCACTCAGTCTCCATCCCCGATTTTCCTCCCTTTTCTCGAGGAAAACGAGGAACAAGAAGAAGAAGAAGAAGTTAAAGGAGCTCATCGCCATGAAGGAAACAAGATGGAAGATTGGGATGACCCACTGTTCAGATTCTTCAAATCACGCACTTCAACGACGCAAGACCCCCAGCGCGAAAGCAAAGTCTCCCTTCAGAAGAATCGCCGTTCTTCTTGGCATCTTGCCTCTGGTTCCGAATTTGCGGATGAAGCTGAAATTACAATCGACGAAGTCGCCGGACAATTGAGTTCTGTGAGTCAGAATTTTAGGGCCTTACCTGACGGCGTCGTCGGAGACATTATGCGAACTGCGAGGAACTTGCCGGAAAATACGACCCTGGGAGAGGCTTTGGCAGATTTCGATGGGAGAATTGGGGAGAAACAATGTATGGAGGTGTTACGTTTGTTGGGTGAGGAGAATCTTGTGGTATGTTGCTTGTATTTCTTTGAATGGATGGGTTTGCAGGAGCCTTCGCTTGTTACACCTCGTGCTTATTCTCTTATTTTTCCGTCGCTGGGGAGAGCTGGAATGGGAGAAAAAATTATGGTATTGTTCAAGAACCTTCCGCTCAAGAAGGAATTCCAGGATGTTCATGTGTATAATTCAGCAATGTCTGGGCTCATGTTCTGTAACAGGTATAATGATGCTTGCAAGGTCTATGAGGCCATGGAAACAAATAATGTTAATCCAGATCATGTGACATGTTCTATAATGATCACAGTCATGAGAAAAATCGGCCGCAGTGCAAAGGATTCCTGGGATTACTTTGAGAAAATGAACAAGAAAGGAGTAAAATGGAGTCCAGAAGTTTTGGGTGCCCTGATTAAATCGTTCTGCGACGAGGGGCTCAAGAGTCAAGCGCTTATCATCCAATTGGAGATGGAGAAGAAGGGGGTTGCTTCGAATGCGATCGTGTATAACACGATCTTGGATGCTTTTAGTAAATCGAACCAAATCGAGGAAGCCGAAGGTCTCTTTGCTGAAATGAAAGCTAAAGGGGTGAAACCGACTAGTGCAACATATAACGTTTTGATGGATGCATACAGTAGGAGGATGCAACCTGAGGTTGTTGAGAAGCTTATGCTTGAGATGAAGGATGTGGGTTTTGAGCCTAATGTGAAGTCATACACTTGCTTGATTAGTGCTTATGGAAGGCAAAAGAAAATGAGTGACATGGCTGCAGATGCATTTTTGAGGATGAAGAAAAATGGCATTAGGCCAACTTCACATTCATATACAGCTCTTATCCATGCCTATTCTGTAAGCGGTTGGCACGAGAAAGCCTACTTGACATTCGAGAACATGCAACGAGAAGGTTTAAAGCCATCCATTGAAACTTACACGACTCTGCTCGATGCATTTAGGCGTGCGGGTGACACTGTGGCATTGATGAAGATTTGGAAACTTATGATTAGAGAAAAAATAGAAGGGACAAGAGTAACATTCAACATACTGCTAGATGGGTTTGCAAAACAGGGTCATTATGTTGAAGCTAGAGATGTGATTTCTGAGTTTGGAAAGATTGGATTGCAACCAACCGTCATGACATACAATATGTTGATGAATGCATATGCAAGGGGAGGGCAACATGCAAAGATGCCACAGCTGCTGCAGGAGATGGCTGCTCGAGAACTAAAACCCGACTCGGTTACTTATTCAACCATGATCTACGCCTATGTTCGTGTTCGCGACTTCAAACGAGCTTTCTTCTATCACAAGAAGATGGTAAAAAGTGGACAGGTGCCAGATGTGAAGTCATACCAGAAACTTAGAGCAATTTTGGATGTAAAACTTGCTACAAAGAACAGGAAAGACAAGAGTGCCATTCTGGGTATAATAAACAGCAAAATGGGGATGGTGAAAGCCAAGAAAAAGGGCAAGAAAGATGAGTTCTGGAAGAACAAGAGAAAGCATGTGAAAACTCAGAGAGTTTCTCCCATGGAGGCAAAGAGTTGA

Coding sequence (CDS)

ATGGCTCTTGCGCACCAACACCACTTCCTCAACTCATTTCATTCCATTCGCAAGCCCAATCTGAACCAATACTCTCCCAATTCTCACTTGTTTTTTCAATTCAATACCCAGAAACTCGTCTGCTCATGTGCAACATCTCCAAATCCCACCACTCAGTCTCCATCCCCGATTTTCCTCCCTTTTCTCGAGGAAAACGAGGAACAAGAAGAAGAAGAAGAAGTTAAAGGAGCTCATCGCCATGAAGGAAACAAGATGGAAGATTGGGATGACCCACTGTTCAGATTCTTCAAATCACGCACTTCAACGACGCAAGACCCCCAGCGCGAAAGCAAAGTCTCCCTTCAGAAGAATCGCCGTTCTTCTTGGCATCTTGCCTCTGGTTCCGAATTTGCGGATGAAGCTGAAATTACAATCGACGAAGTCGCCGGACAATTGAGTTCTGTGAGTCAGAATTTTAGGGCCTTACCTGACGGCGTCGTCGGAGACATTATGCGAACTGCGAGGAACTTGCCGGAAAATACGACCCTGGGAGAGGCTTTGGCAGATTTCGATGGGAGAATTGGGGAGAAACAATGTATGGAGGTGTTACGTTTGTTGGGTGAGGAGAATCTTGTGGTATGTTGCTTGTATTTCTTTGAATGGATGGGTTTGCAGGAGCCTTCGCTTGTTACACCTCGTGCTTATTCTCTTATTTTTCCGTCGCTGGGGAGAGCTGGAATGGGAGAAAAAATTATGGTATTGTTCAAGAACCTTCCGCTCAAGAAGGAATTCCAGGATGTTCATGTGTATAATTCAGCAATGTCTGGGCTCATGTTCTGTAACAGGTATAATGATGCTTGCAAGGTCTATGAGGCCATGGAAACAAATAATGTTAATCCAGATCATGTGACATGTTCTATAATGATCACAGTCATGAGAAAAATCGGCCGCAGTGCAAAGGATTCCTGGGATTACTTTGAGAAAATGAACAAGAAAGGAGTAAAATGGAGTCCAGAAGTTTTGGGTGCCCTGATTAAATCGTTCTGCGACGAGGGGCTCAAGAGTCAAGCGCTTATCATCCAATTGGAGATGGAGAAGAAGGGGGTTGCTTCGAATGCGATCGTGTATAACACGATCTTGGATGCTTTTAGTAAATCGAACCAAATCGAGGAAGCCGAAGGTCTCTTTGCTGAAATGAAAGCTAAAGGGGTGAAACCGACTAGTGCAACATATAACGTTTTGATGGATGCATACAGTAGGAGGATGCAACCTGAGGTTGTTGAGAAGCTTATGCTTGAGATGAAGGATGTGGGTTTTGAGCCTAATGTGAAGTCATACACTTGCTTGATTAGTGCTTATGGAAGGCAAAAGAAAATGAGTGACATGGCTGCAGATGCATTTTTGAGGATGAAGAAAAATGGCATTAGGCCAACTTCACATTCATATACAGCTCTTATCCATGCCTATTCTGTAAGCGGTTGGCACGAGAAAGCCTACTTGACATTCGAGAACATGCAACGAGAAGGTTTAAAGCCATCCATTGAAACTTACACGACTCTGCTCGATGCATTTAGGCGTGCGGGTGACACTGTGGCATTGATGAAGATTTGGAAACTTATGATTAGAGAAAAAATAGAAGGGACAAGAGTAACATTCAACATACTGCTAGATGGGTTTGCAAAACAGGGTCATTATGTTGAAGCTAGAGATGTGATTTCTGAGTTTGGAAAGATTGGATTGCAACCAACCGTCATGACATACAATATGTTGATGAATGCATATGCAAGGGGAGGGCAACATGCAAAGATGCCACAGCTGCTGCAGGAGATGGCTGCTCGAGAACTAAAACCCGACTCGGTTACTTATTCAACCATGATCTACGCCTATGTTCGTGTTCGCGACTTCAAACGAGCTTTCTTCTATCACAAGAAGATGGTAAAAAGTGGACAGGTGCCAGATGTGAAGTCATACCAGAAACTTAGAGCAATTTTGGATGTAAAACTTGCTACAAAGAACAGGAAAGACAAGAGTGCCATTCTGGGTATAATAAACAGCAAAATGGGGATGGTGAAAGCCAAGAAAAAGGGCAAGAAAGATGAGTTCTGGAAGAACAAGAGAAAGCATGTGAAAACTCAGAGAGTTTCTCCCATGGAGGCAAAGAGTTGA

Protein sequence

MALAHQHHFLNSFHSIRKPNLNQYSPNSHLFFQFNTQKLVCSCATSPNPTTQSPSPIFLPFLEENEEQEEEEEVKGAHRHEGNKMEDWDDPLFRFFKSRTSTTQDPQRESKVSLQKNRRSSWHLASGSEFADEAEITIDEVAGQLSSVSQNFRALPDGVVGDIMRTARNLPENTTLGEALADFDGRIGEKQCMEVLRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYSLIFPSLGRAGMGEKIMVLFKNLPLKKEFQDVHVYNSAMSGLMFCNRYNDACKVYEAMETNNVNPDHVTCSIMITVMRKIGRSAKDSWDYFEKMNKKGVKWSPEVLGALIKSFCDEGLKSQALIIQLEMEKKGVASNAIVYNTILDAFSKSNQIEEAEGLFAEMKAKGVKPTSATYNVLMDAYSRRMQPEVVEKLMLEMKDVGFEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTALIHAYSVSGWHEKAYLTFENMQREGLKPSIETYTTLLDAFRRAGDTVALMKIWKLMIREKIEGTRVTFNILLDGFAKQGHYVEARDVISEFGKIGLQPTVMTYNMLMNAYARGGQHAKMPQLLQEMAARELKPDSVTYSTMIYAYVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLRAILDVKLATKNRKDKSAILGIINSKMGMVKAKKKGKKDEFWKNKRKHVKTQRVSPMEAKS
Homology
BLAST of Moc03g30020 vs. NCBI nr
Match: XP_022137654.1 (pentatricopeptide repeat-containing protein At5g50280, chloroplastic [Momordica charantia] >XP_022137656.1 pentatricopeptide repeat-containing protein At5g50280, chloroplastic [Momordica charantia])

HSP 1 Score: 1416.7 bits (3666), Expect = 0.0e+00
Identity = 714/714 (100.00%), Postives = 714/714 (100.00%), Query Frame = 0

Query: 1   MALAHQHHFLNSFHSIRKPNLNQYSPNSHLFFQFNTQKLVCSCATSPNPTTQSPSPIFLP 60
           MALAHQHHFLNSFHSIRKPNLNQYSPNSHLFFQFNTQKLVCSCATSPNPTTQSPSPIFLP
Sbjct: 1   MALAHQHHFLNSFHSIRKPNLNQYSPNSHLFFQFNTQKLVCSCATSPNPTTQSPSPIFLP 60

Query: 61  FLEENEEQEEEEEVKGAHRHEGNKMEDWDDPLFRFFKSRTSTTQDPQRESKVSLQKNRRS 120
           FLEENEEQEEEEEVKGAHRHEGNKMEDWDDPLFRFFKSRTSTTQDPQRESKVSLQKNRRS
Sbjct: 61  FLEENEEQEEEEEVKGAHRHEGNKMEDWDDPLFRFFKSRTSTTQDPQRESKVSLQKNRRS 120

Query: 121 SWHLASGSEFADEAEITIDEVAGQLSSVSQNFRALPDGVVGDIMRTARNLPENTTLGEAL 180
           SWHLASGSEFADEAEITIDEVAGQLSSVSQNFRALPDGVVGDIMRTARNLPENTTLGEAL
Sbjct: 121 SWHLASGSEFADEAEITIDEVAGQLSSVSQNFRALPDGVVGDIMRTARNLPENTTLGEAL 180

Query: 181 ADFDGRIGEKQCMEVLRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYSLIFPSLGRAGM 240
           ADFDGRIGEKQCMEVLRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYSLIFPSLGRAGM
Sbjct: 181 ADFDGRIGEKQCMEVLRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYSLIFPSLGRAGM 240

Query: 241 GEKIMVLFKNLPLKKEFQDVHVYNSAMSGLMFCNRYNDACKVYEAMETNNVNPDHVTCSI 300
           GEKIMVLFKNLPLKKEFQDVHVYNSAMSGLMFCNRYNDACKVYEAMETNNVNPDHVTCSI
Sbjct: 241 GEKIMVLFKNLPLKKEFQDVHVYNSAMSGLMFCNRYNDACKVYEAMETNNVNPDHVTCSI 300

Query: 301 MITVMRKIGRSAKDSWDYFEKMNKKGVKWSPEVLGALIKSFCDEGLKSQALIIQLEMEKK 360
           MITVMRKIGRSAKDSWDYFEKMNKKGVKWSPEVLGALIKSFCDEGLKSQALIIQLEMEKK
Sbjct: 301 MITVMRKIGRSAKDSWDYFEKMNKKGVKWSPEVLGALIKSFCDEGLKSQALIIQLEMEKK 360

Query: 361 GVASNAIVYNTILDAFSKSNQIEEAEGLFAEMKAKGVKPTSATYNVLMDAYSRRMQPEVV 420
           GVASNAIVYNTILDAFSKSNQIEEAEGLFAEMKAKGVKPTSATYNVLMDAYSRRMQPEVV
Sbjct: 361 GVASNAIVYNTILDAFSKSNQIEEAEGLFAEMKAKGVKPTSATYNVLMDAYSRRMQPEVV 420

Query: 421 EKLMLEMKDVGFEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTALIH 480
           EKLMLEMKDVGFEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTALIH
Sbjct: 421 EKLMLEMKDVGFEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTALIH 480

Query: 481 AYSVSGWHEKAYLTFENMQREGLKPSIETYTTLLDAFRRAGDTVALMKIWKLMIREKIEG 540
           AYSVSGWHEKAYLTFENMQREGLKPSIETYTTLLDAFRRAGDTVALMKIWKLMIREKIEG
Sbjct: 481 AYSVSGWHEKAYLTFENMQREGLKPSIETYTTLLDAFRRAGDTVALMKIWKLMIREKIEG 540

Query: 541 TRVTFNILLDGFAKQGHYVEARDVISEFGKIGLQPTVMTYNMLMNAYARGGQHAKMPQLL 600
           TRVTFNILLDGFAKQGHYVEARDVISEFGKIGLQPTVMTYNMLMNAYARGGQHAKMPQLL
Sbjct: 541 TRVTFNILLDGFAKQGHYVEARDVISEFGKIGLQPTVMTYNMLMNAYARGGQHAKMPQLL 600

Query: 601 QEMAARELKPDSVTYSTMIYAYVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLRAILDVK 660
           QEMAARELKPDSVTYSTMIYAYVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLRAILDVK
Sbjct: 601 QEMAARELKPDSVTYSTMIYAYVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLRAILDVK 660

Query: 661 LATKNRKDKSAILGIINSKMGMVKAKKKGKKDEFWKNKRKHVKTQRVSPMEAKS 715
           LATKNRKDKSAILGIINSKMGMVKAKKKGKKDEFWKNKRKHVKTQRVSPMEAKS
Sbjct: 661 LATKNRKDKSAILGIINSKMGMVKAKKKGKKDEFWKNKRKHVKTQRVSPMEAKS 714

BLAST of Moc03g30020 vs. NCBI nr
Match: XP_038877040.1 (pentatricopeptide repeat-containing protein At5g50280, chloroplastic [Benincasa hispida])

HSP 1 Score: 1220.3 bits (3156), Expect = 0.0e+00
Identity = 619/717 (86.33%), Postives = 660/717 (92.05%), Query Frame = 0

Query: 1   MALAHQHHFLNSFHSIRKPNLNQYSPNSHLFFQFNTQKLVCS-CATSPNPTTQSPSPIFL 60
           MAL HQHH  NSFHSIRKPNL + S NS LFFQFNTQKL C  CA SPNP +QSPSPIFL
Sbjct: 1   MALVHQHHLTNSFHSIRKPNLKRQSSNSCLFFQFNTQKLACCLCAASPNPISQSPSPIFL 60

Query: 61  PFLEENEEQEEEEEVKGAHR----HEGNKM-EDWDDPLFRFFKSRTSTTQDPQRESKVSL 120
            FLEE EE+EEEEE +        H GNK  EDW DPLFRF KSRTS TQDP RESK+SL
Sbjct: 61  HFLEEEEEEEEEEEEEEEEEINGGHGGNKTEEDWKDPLFRFLKSRTSATQDPPRESKLSL 120

Query: 121 QKNRRSSWHLASGSEFADEAEITIDEVAGQLSSVSQNFRALPDGVVGDIMRTARNLPENT 180
           Q+NRRSSWHLAS  EF DEAEI+++E  G+L SVS++ R LPDG+VG+I+RTARNLP+N 
Sbjct: 121 QRNRRSSWHLASDVEFFDEAEISLEEDKGKLGSVSRDSRVLPDGLVGEIVRTARNLPQNM 180

Query: 181 TLGEALADFDGRIGEKQCMEVLRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYSLIFPS 240
           TLGEAL DF+GRI EK+C+EVLRLLGEENLVVCCLYFFEWMGLQE SLVTPRAYSL+FP 
Sbjct: 181 TLGEALEDFEGRISEKECLEVLRLLGEENLVVCCLYFFEWMGLQETSLVTPRAYSLLFPL 240

Query: 241 LGRAGMGEKIMVLFKNLPLKKEFQDVHVYNSAMSGLMFCNRYNDACKVYEAMETNNVNPD 300
           LGRAGMGEKIMVLFKNLP KKEFQDVHVYNSAMSGLM C RY+DA KVYEAMETN+VNPD
Sbjct: 241 LGRAGMGEKIMVLFKNLPCKKEFQDVHVYNSAMSGLMVCKRYDDAYKVYEAMETNSVNPD 300

Query: 301 HVTCSIMITVMRKIGRSAKDSWDYFEKMNKKGVKWSPEVLGALIKSFCDEGLKSQALIIQ 360
           HVTCSIMITVMRKIGRSAKDSWDYFEKMNKKGVKWSPEVLGALIK FCDEGLKSQA+IIQ
Sbjct: 301 HVTCSIMITVMRKIGRSAKDSWDYFEKMNKKGVKWSPEVLGALIKLFCDEGLKSQAIIIQ 360

Query: 361 LEMEKKGVASNAIVYNTILDAFSKSNQIEEAEGLFAEMKAKGVKPTSATYNVLMDAYSRR 420
           LEMEKKGVASNAI+YNTI+DAFSKSNQIEEAEG+FAEMK+KGVKPTSA++N+LMDAYSRR
Sbjct: 361 LEMEKKGVASNAIMYNTIMDAFSKSNQIEEAEGVFAEMKSKGVKPTSASFNILMDAYSRR 420

Query: 421 MQPEVVEKLMLEMKDVGFEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHS 480
           MQPE+VEKL++EMKD+G EPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHS
Sbjct: 421 MQPEIVEKLLVEMKDMGLEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHS 480

Query: 481 YTALIHAYSVSGWHEKAYLTFENMQREGLKPSIETYTTLLDAFRRAGDTVALMKIWKLMI 540
           YTALIHAYSVS WH+KAY TF+NM REGLKPSIETYTTLLDAFRRAGDTVALMKIWKLMI
Sbjct: 481 YTALIHAYSVSDWHQKAYSTFKNMLREGLKPSIETYTTLLDAFRRAGDTVALMKIWKLMI 540

Query: 541 REKIEGTRVTFNILLDGFAKQGHYVEARDVISEFGKIGLQPTVMTYNMLMNAYARGGQHA 600
           REKI GTRVTFNILLDGFAKQGHYVEARDVISEF KIGLQPTVMTYNMLMNAYARGGQH 
Sbjct: 541 REKILGTRVTFNILLDGFAKQGHYVEARDVISEFDKIGLQPTVMTYNMLMNAYARGGQHL 600

Query: 601 KMPQLLQEMAARELKPDSVTYSTMIYAYVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLR 660
           KMPQLLQEMAARELKPDSVTYSTMIYA+VRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLR
Sbjct: 601 KMPQLLQEMAARELKPDSVTYSTMIYAFVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLR 660

Query: 661 AILDVKLATKNRKDKSAILGIINSKMGMVKAKKKGKKDEFWKNKRKHVKTQRVSPME 712
           +ILDVKL+TKNRKDKSAILGIINSKMGMVKAKKKGKKDEFWK KRKHV+T+RVSP E
Sbjct: 661 SILDVKLSTKNRKDKSAILGIINSKMGMVKAKKKGKKDEFWKTKRKHVRTRRVSPSE 717

BLAST of Moc03g30020 vs. NCBI nr
Match: KAG6584120.1 (Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1204.5 bits (3115), Expect = 0.0e+00
Identity = 611/714 (85.57%), Postives = 652/714 (91.32%), Query Frame = 0

Query: 1   MALAHQHHFLNSFHSIRKPNLNQYSPNSHLFFQFNTQKLV-CSCATSPNPTTQSPSPIFL 60
           MAL HQHH   SF SIRK NL Q + NS LFFQF+T+KL  C CATSPNP +QSPSPIFL
Sbjct: 1   MALVHQHHLPQSFQSIRKANLKQSTSNSCLFFQFSTRKLAGCLCATSPNPISQSPSPIFL 60

Query: 61  PFLEENEEQEEEEEVKGAHRH--EGNKMEDWDDPLFRFFKSRTSTTQDPQRESKVSLQKN 120
           PFLEE EE+EEEEE +  H+    GN  EDW+DPL RFFKSRTSTTQDP  ESK+SLQKN
Sbjct: 61  PFLEE-EEEEEEEEDEEEHKEVLGGNTTEDWNDPLVRFFKSRTSTTQDPLPESKLSLQKN 120

Query: 121 RRSSWHLASGSEFADEAEITIDEVAGQLSSVSQNFRALPDGVVGDIMRTARNLPENTTLG 180
           RRSSWHLAS  E + EAEI  DE   Q   VS+N RALPDGVVGDI+RTARNLP+NTTLG
Sbjct: 121 RRSSWHLASDVECSVEAEIAPDEDKKQSGLVSRNSRALPDGVVGDIVRTARNLPQNTTLG 180

Query: 181 EALADFDGRIGEKQCMEVLRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYSLIFPSLGR 240
           EAL DF+G+I EK+C+EVLRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYS++FP LGR
Sbjct: 181 EALGDFEGKIDEKECLEVLRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYSILFPLLGR 240

Query: 241 AGMGEKIMVLFKNLPLKKEFQDVHVYNSAMSGLMFCNRYNDACKVYEAMETNNVNPDHVT 300
           AGMG+KIMVLFKN+PLKKE QDVHVYNSAMSGLM C RYNDAC+VY AMETN VNPDHVT
Sbjct: 241 AGMGDKIMVLFKNIPLKKELQDVHVYNSAMSGLMVCKRYNDACEVYAAMETNKVNPDHVT 300

Query: 301 CSIMITVMRKIGRSAKDSWDYFEKMNKKGVKWSPEVLGALIKSFCDEGLKSQALIIQLEM 360
           CSIMITVMRKIGRSAKDSWDYFEKMN+KGVKWSPEVLGALIK+FCDEGLKSQALIIQLEM
Sbjct: 301 CSIMITVMRKIGRSAKDSWDYFEKMNEKGVKWSPEVLGALIKAFCDEGLKSQALIIQLEM 360

Query: 361 EKKGVASNAIVYNTILDAFSKSNQIEEAEGLFAEMKAKGVKPTSATYNVLMDAYSRRMQP 420
           EKKGV SNAIVYNTI+DAFSKSNQIEEAEGLFAEMKAKGVKPTSAT+N+LMDAYSRRMQP
Sbjct: 361 EKKGVTSNAIVYNTIMDAFSKSNQIEEAEGLFAEMKAKGVKPTSATFNILMDAYSRRMQP 420

Query: 421 EVVEKLMLEMKDVGFEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTA 480
           E+VEKL++EMKD GFEPNVKSYTCLISAYGR+K MSDMAADAFLRMKKNGI+P SHSYTA
Sbjct: 421 EIVEKLLIEMKDTGFEPNVKSYTCLISAYGRKKTMSDMAADAFLRMKKNGIKPNSHSYTA 480

Query: 481 LIHAYSVSGWHEKAYLTFENMQREGLKPSIETYTTLLDAFRRAGDTVALMKIWKLMIREK 540
           LIHAYSVSGWHEKAY TFENM +EGLKPSIETYTTLLDAFRRAGDT ALMKIWKLMIREK
Sbjct: 481 LIHAYSVSGWHEKAYSTFENMLQEGLKPSIETYTTLLDAFRRAGDTEALMKIWKLMIREK 540

Query: 541 IEGTRVTFNILLDGFAKQGHYVEARDVISEFGKIGLQPTVMTYNMLMNAYARGGQHAKMP 600
           + GTRVTFNILLDGFAKQGHY+EARDVISEF KIGLQPTVMTYNMLMNAYARGGQH KMP
Sbjct: 541 VAGTRVTFNILLDGFAKQGHYIEARDVISEFSKIGLQPTVMTYNMLMNAYARGGQHLKMP 600

Query: 601 QLLQEMAARELKPDSVTYSTMIYAYVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLRAIL 660
           QLLQEMAARELKPDSVTYSTMIYA+VRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLR+IL
Sbjct: 601 QLLQEMAARELKPDSVTYSTMIYAFVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLRSIL 660

Query: 661 DVKLATKNRKDKSAILGIINSKMGMVKAKKKGKKDEFWKNKRKHVKTQRVSPME 712
           D KL TKNRKDKSAILGI+NSK+GMVKAKKKGKKDEFWKNKRK+VKT R+SP E
Sbjct: 661 DAKLDTKNRKDKSAILGIMNSKLGMVKAKKKGKKDEFWKNKRKYVKTLRISPNE 713

BLAST of Moc03g30020 vs. NCBI nr
Match: XP_022924164.1 (pentatricopeptide repeat-containing protein At5g50280, chloroplastic [Cucurbita moschata])

HSP 1 Score: 1203.7 bits (3113), Expect = 0.0e+00
Identity = 608/714 (85.15%), Postives = 650/714 (91.04%), Query Frame = 0

Query: 1   MALAHQHHFLNSFHSIRKPNLNQYSPNSHLFFQFNTQKLV-CSCATSPNPTTQSPSPIFL 60
           MAL HQHH   SF SIRK NL Q + NS LFFQF+T+KL  C CATSPNP +QSPSPIFL
Sbjct: 1   MALVHQHHLPQSFQSIRKANLKQSTSNSCLFFQFSTRKLAGCLCATSPNPISQSPSPIFL 60

Query: 61  PFLEENEEQEEEEEVKGAHRH--EGNKMEDWDDPLFRFFKSRTSTTQDPQRESKVSLQKN 120
           PFLEE EE+EEEEE +  H+    GN  EDW+DPL RFFKSRTSTTQDP  ESK+SLQKN
Sbjct: 61  PFLEEEEEEEEEEEDEEEHKEVLGGNTTEDWNDPLVRFFKSRTSTTQDPLPESKLSLQKN 120

Query: 121 RRSSWHLASGSEFADEAEITIDEVAGQLSSVSQNFRALPDGVVGDIMRTARNLPENTTLG 180
           RRSSWHLAS  E + EAEI   E   Q   VS+N RALPDGVVGDI+RTARNLP+NTTLG
Sbjct: 121 RRSSWHLASDVECSVEAEIAPGEDKKQSGLVSRNSRALPDGVVGDIVRTARNLPQNTTLG 180

Query: 181 EALADFDGRIGEKQCMEVLRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYSLIFPSLGR 240
           EAL DF+G+I EK+C+EVLRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYS++FP LGR
Sbjct: 181 EALGDFEGKIDEKECLEVLRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYSILFPLLGR 240

Query: 241 AGMGEKIMVLFKNLPLKKEFQDVHVYNSAMSGLMFCNRYNDACKVYEAMETNNVNPDHVT 300
           AGMG+KIMVLFKN+PLKKE QDVHVYNSAMSGLM C RYNDAC+VY AMETN VNPDHVT
Sbjct: 241 AGMGDKIMVLFKNIPLKKELQDVHVYNSAMSGLMVCKRYNDACEVYAAMETNKVNPDHVT 300

Query: 301 CSIMITVMRKIGRSAKDSWDYFEKMNKKGVKWSPEVLGALIKSFCDEGLKSQALIIQLEM 360
           CSIMITVMRKIGRSAKDSWDYFEKMN+KGVKWSPEVLGALIK+FCDEGLKSQALIIQLEM
Sbjct: 301 CSIMITVMRKIGRSAKDSWDYFEKMNEKGVKWSPEVLGALIKAFCDEGLKSQALIIQLEM 360

Query: 361 EKKGVASNAIVYNTILDAFSKSNQIEEAEGLFAEMKAKGVKPTSATYNVLMDAYSRRMQP 420
           EKKGV SNAIVYNTI+DAFSKSNQIEEAEGLFAEMKAKGVKPTSAT+N+LMDAYSRRMQP
Sbjct: 361 EKKGVTSNAIVYNTIMDAFSKSNQIEEAEGLFAEMKAKGVKPTSATFNILMDAYSRRMQP 420

Query: 421 EVVEKLMLEMKDVGFEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTA 480
           E+VEKL++EMKD GFEPNVKSYTCLISAYGR+K MSDMAADAFLRMKKNGI+P SHSYTA
Sbjct: 421 EIVEKLLIEMKDTGFEPNVKSYTCLISAYGRKKTMSDMAADAFLRMKKNGIKPNSHSYTA 480

Query: 481 LIHAYSVSGWHEKAYLTFENMQREGLKPSIETYTTLLDAFRRAGDTVALMKIWKLMIREK 540
           LIHAYSVSGWHEKAY TFENM +EGLKPSIETYTTLLDAFRRAGDT ALMKIWKLMIREK
Sbjct: 481 LIHAYSVSGWHEKAYSTFENMLQEGLKPSIETYTTLLDAFRRAGDTEALMKIWKLMIREK 540

Query: 541 IEGTRVTFNILLDGFAKQGHYVEARDVISEFGKIGLQPTVMTYNMLMNAYARGGQHAKMP 600
           + GTRVTFNILLDGFAKQGHY+EARDVISEF K GLQPT+MTYNMLMNAYARGGQH KMP
Sbjct: 541 VAGTRVTFNILLDGFAKQGHYIEARDVISEFSKTGLQPTIMTYNMLMNAYARGGQHLKMP 600

Query: 601 QLLQEMAARELKPDSVTYSTMIYAYVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLRAIL 660
           QLLQEMAARELKPDSVTYSTMIYA+VRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLR+IL
Sbjct: 601 QLLQEMAARELKPDSVTYSTMIYAFVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLRSIL 660

Query: 661 DVKLATKNRKDKSAILGIINSKMGMVKAKKKGKKDEFWKNKRKHVKTQRVSPME 712
           D KL TKNRKDKSAILGI+NSK+GMVKAKKKGKKDEFWKNKRK+VKT R+SP E
Sbjct: 661 DAKLDTKNRKDKSAILGIMNSKLGMVKAKKKGKKDEFWKNKRKYVKTLRISPNE 714

BLAST of Moc03g30020 vs. NCBI nr
Match: XP_023001232.1 (pentatricopeptide repeat-containing protein At5g50280, chloroplastic [Cucurbita maxima])

HSP 1 Score: 1203.3 bits (3112), Expect = 0.0e+00
Identity = 609/714 (85.29%), Postives = 652/714 (91.32%), Query Frame = 0

Query: 1   MALAHQHHFLNSFHSIRKPNLNQYSPNSHLFFQFNTQKLVCS-CATSPNPTTQSPSPIFL 60
           MAL HQHH   SF SIRK NLNQ + NS LFFQF+T+KL C  CATSPNPT+QSPSPIFL
Sbjct: 8   MALVHQHHLPQSFQSIRKANLNQSTSNSCLFFQFSTRKLACCLCATSPNPTSQSPSPIFL 67

Query: 61  PFLEENEEQEEEEEVKGAHRH--EGNKMEDWDDPLFRFFKSRTSTTQDPQRESKVSLQKN 120
            FLEE EE+EEEEE +   +    GN  EDW+DPL RFFKSR STTQDP  ESK+SLQKN
Sbjct: 68  RFLEEEEEEEEEEEEEEEPKDVLGGNTTEDWNDPLVRFFKSRNSTTQDPLPESKLSLQKN 127

Query: 121 RRSSWHLASGSEFADEAEITIDEVAGQLSSVSQNFRALPDGVVGDIMRTARNLPENTTLG 180
           RRSSWHLAS  E + EAEI  D+   Q  SVS+N R LPDGVVGDI+RTARNLP+NTTLG
Sbjct: 128 RRSSWHLASEVECSVEAEIAPDDDKKQSGSVSRNSRVLPDGVVGDIVRTARNLPQNTTLG 187

Query: 181 EALADFDGRIGEKQCMEVLRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYSLIFPSLGR 240
           EAL DF+G+I EK+C+EVLRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYS++FP LGR
Sbjct: 188 EALGDFEGKIDEKECLEVLRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYSILFPLLGR 247

Query: 241 AGMGEKIMVLFKNLPLKKEFQDVHVYNSAMSGLMFCNRYNDACKVYEAMETNNVNPDHVT 300
           AGMG+KIMVLFKN+PLKKE QDVHVYNSAMSGLM C RYNDAC+VYEAMETN VNPDHVT
Sbjct: 248 AGMGDKIMVLFKNIPLKKELQDVHVYNSAMSGLMVCKRYNDACEVYEAMETNKVNPDHVT 307

Query: 301 CSIMITVMRKIGRSAKDSWDYFEKMNKKGVKWSPEVLGALIKSFCDEGLKSQALIIQLEM 360
           CSIMITVMRKIGRSAKDSWDYFEKMN+KGVKWSPEVLGALIK+FCDEGLKSQALIIQLEM
Sbjct: 308 CSIMITVMRKIGRSAKDSWDYFEKMNEKGVKWSPEVLGALIKAFCDEGLKSQALIIQLEM 367

Query: 361 EKKGVASNAIVYNTILDAFSKSNQIEEAEGLFAEMKAKGVKPTSATYNVLMDAYSRRMQP 420
           EKKGVASNAIVYNTI+DAFSKSNQIEEAEGLFAEMKAKGVKPTSAT+N+LMDAYSRRMQP
Sbjct: 368 EKKGVASNAIVYNTIMDAFSKSNQIEEAEGLFAEMKAKGVKPTSATFNILMDAYSRRMQP 427

Query: 421 EVVEKLMLEMKDVGFEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTA 480
           E+VEKL++EMK++GFEPNVKSYTCLISAYGR+K MSDMAADAFLRMKKNGI+P SHSYTA
Sbjct: 428 EIVEKLLIEMKEMGFEPNVKSYTCLISAYGRKKTMSDMAADAFLRMKKNGIKPNSHSYTA 487

Query: 481 LIHAYSVSGWHEKAYLTFENMQREGLKPSIETYTTLLDAFRRAGDTVALMKIWKLMIREK 540
           LIHAYSVSGWHEKAY TFENM REGLKPSIETYTTLLDAFRRAGDT ALMKIWK M+REK
Sbjct: 488 LIHAYSVSGWHEKAYSTFENMLREGLKPSIETYTTLLDAFRRAGDTEALMKIWKFMVREK 547

Query: 541 IEGTRVTFNILLDGFAKQGHYVEARDVISEFGKIGLQPTVMTYNMLMNAYARGGQHAKMP 600
           + GTRVTFNILLDGFAKQGHY+EARDVISEF KIGLQPTVMTYNMLMNAYARGGQH KMP
Sbjct: 548 VAGTRVTFNILLDGFAKQGHYIEARDVISEFSKIGLQPTVMTYNMLMNAYARGGQHLKMP 607

Query: 601 QLLQEMAARELKPDSVTYSTMIYAYVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLRAIL 660
           QLLQEMAA ELKPDSVTYSTMIYA+VRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLR+IL
Sbjct: 608 QLLQEMAAWELKPDSVTYSTMIYAFVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLRSIL 667

Query: 661 DVKLATKNRKDKSAILGIINSKMGMVKAKKKGKKDEFWKNKRKHVKTQRVSPME 712
           D KL TKNRKDKSAILGI+NSK+GMVKAKKKGKKDEFWKNKRK+VKT RVSP E
Sbjct: 668 DEKLDTKNRKDKSAILGIMNSKLGMVKAKKKGKKDEFWKNKRKYVKTMRVSPNE 721

BLAST of Moc03g30020 vs. ExPASy Swiss-Prot
Match: Q9FGR7 (Pentatricopeptide repeat-containing protein At5g50280, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=EMB1006 PE=2 SV=1)

HSP 1 Score: 840.1 bits (2169), Expect = 1.8e-242
Identity = 427/669 (63.83%), Postives = 523/669 (78.18%), Query Frame = 0

Query: 44  ATSPNPTTQSPSPIFLPF-------LEENEEQEEEEEVKGAHRHEGNKMEDWDDPLFRFF 103
           ATSP+ ++ SPS     F       +++ E      E       +  + +D+ DP+ +FF
Sbjct: 45  ATSPSSSSSSPSIFLSCFDDALPDKIQQPENSTINSEESECEEEDDEEGDDFTDPILKFF 104

Query: 104 KSRTST---TQDPQRESKVSLQKNRRSSWHLASGSEFAD-EAEITIDEVAGQLSSVSQNF 163
           KSRT T   T DP RESK SLQKNRR+SWHLA   +FAD E EI          +  Q  
Sbjct: 105 KSRTLTSESTADPARESKFSLQKNRRTSWHLA--PDFADPETEIESKPEESVFVTNQQTL 164

Query: 164 RA---LPDGVVGDIMRTARNLPENTTLGEALADFDGRIGEKQCMEVLRLLGEENLVVCCL 223
                   GV  +I+  A+NL EN TLGE L+ F+ R+ + +C+E L ++GE   V  CL
Sbjct: 165 GVHIPFESGVAREILELAKNLKENQTLGEMLSGFERRVSDTECVEALVMMGESGFVKSCL 224

Query: 224 YFFEWMGLQEPSLVTPRAYSLIFPSLGRAGMGEKIMVLFKNLPLKKEFQDVHVYNSAMSG 283
           YF+EWM LQEPSL +PRA S++F  LGR  M + I++L  NLP K+EF+DV +YN+A+SG
Sbjct: 225 YFYEWMSLQEPSLASPRACSVLFTLLGRERMADYILLLLSNLPDKEEFRDVRLYNAAISG 284

Query: 284 LMFCNRYNDACKVYEAMETNNVNPDHVTCSIMITVMRKIGRSAKDSWDYFEKMNKKGVKW 343
           L    RY+DA +VYEAM+  NV PD+VTC+I+IT +RK GRSAK+ W+ FEKM++KGVKW
Sbjct: 285 LSASQRYDDAWEVYEAMDKINVYPDNVTCAILITTLRKAGRSAKEVWEIFEKMSEKGVKW 344

Query: 344 SPEVLGALIKSFCDEGLKSQALIIQLEMEKKGVASNAIVYNTILDAFSKSNQIEEAEGLF 403
           S +V G L+KSFCDEGLK +AL+IQ EMEKKG+ SN IVYNT++DA++KSN IEE EGLF
Sbjct: 345 SQDVFGGLVKSFCDEGLKEEALVIQTEMEKKGIRSNTIVYNTLMDAYNKSNHIEEVEGLF 404

Query: 404 AEMKAKGVKPTSATYNVLMDAYSRRMQPEVVEKLMLEMKDVGFEPNVKSYTCLISAYGRQ 463
            EM+ KG+KP++ATYN+LMDAY+RRMQP++VE L+ EM+D+G EPNVKSYTCLISAYGR 
Sbjct: 405 TEMRDKGLKPSAATYNILMDAYARRMQPDIVETLLREMEDLGLEPNVKSYTCLISAYGRT 464

Query: 464 KKMSDMAADAFLRMKKNGIRPTSHSYTALIHAYSVSGWHEKAYLTFENMQREGLKPSIET 523
           KKMSDMAADAFLRMKK G++P+SHSYTALIHAYSVSGWHEKAY +FE M +EG+KPS+ET
Sbjct: 465 KKMSDMAADAFLRMKKVGLKPSSHSYTALIHAYSVSGWHEKAYASFEEMCKEGIKPSVET 524

Query: 524 YTTLLDAFRRAGDTVALMKIWKLMIREKIEGTRVTFNILLDGFAKQGHYVEARDVISEFG 583
           YT++LDAFRR+GDT  LM+IWKLM+REKI+GTR+T+N LLDGFAKQG Y+EARDV+SEF 
Sbjct: 525 YTSVLDAFRRSGDTGKLMEIWKLMLREKIKGTRITYNTLLDGFAKQGLYIEARDVVSEFS 584

Query: 584 KIGLQPTVMTYNMLMNAYARGGQHAKMPQLLQEMAARELKPDSVTYSTMIYAYVRVRDFK 643
           K+GLQP+VMTYNMLMNAYARGGQ AK+PQLL+EMAA  LKPDS+TYSTMIYA+VRVRDFK
Sbjct: 585 KMGLQPSVMTYNMLMNAYARGGQDAKLPQLLKEMAALNLKPDSITYSTMIYAFVRVRDFK 644

Query: 644 RAFFYHKKMVKSGQVPDVKSYQKLRAILDVKLATKNRKDKSAILGIINSKMGMVKAKKKG 699
           RAFFYHK MVKSGQVPD +SY+KLRAIL+ K  TKNRKDK+AILGIINSK G VKAK KG
Sbjct: 645 RAFFYHKMMVKSGQVPDPRSYEKLRAILEDKAKTKNRKDKTAILGIINSKFGRVKAKTKG 704

BLAST of Moc03g30020 vs. ExPASy Swiss-Prot
Match: Q9LYZ9 (Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana OX=3702 GN=At5g02860 PE=2 SV=1)

HSP 1 Score: 186.4 bits (472), Expect = 1.1e-45
Identity = 126/473 (26.64%), Postives = 216/473 (45.67%), Query Frame = 0

Query: 157 DGVVGDIMRTARNLPENTTLGEALADFDGRIGEKQCMEVLRLLGEENLVVCCLYFFEWMG 216
           D V+ ++    ++ PE+T+               + +  L+ LG        L  F+W  
Sbjct: 117 DSVLSELFEPFKDKPESTS--------------SELLAFLKGLGFHKKFDLALRAFDWFM 176

Query: 217 LQE--PSLVTPRAYSLIFPSLGRAGMGEKIMVLFKNLPLKKEFQDVHVYNSAMSGLMFCN 276
            Q+   S++     ++I   LG+ G       +F  L       DV+ Y S +S      
Sbjct: 177 KQKDYQSMLDNSVVAIIISMLGKEGRVSSAANMFNGLQEDGFSLDVYSYTSLISAFANSG 236

Query: 277 RYNDACKVYEAMETNNVNPDHVTCSIMITVMRKIGRSAKDSWDYFEKMNKKGVKWSPEVL 336
           RY +A  V++ ME +   P  +T ++++ V  K+G          EKM   G+       
Sbjct: 237 RYREAVNVFKKMEEDGCKPTLITYNVILNVFGKMGTPWNKITSLVEKMKSDGIAPDAYTY 296

Query: 337 GALIKSFCDEGLKSQALIIQLEMEKKGVASNAIVYNTILDAFSKSNQIEEAEGLFAEMKA 396
             LI       L  +A  +  EM+  G + + + YN +LD + KS++ +EA  +  EM  
Sbjct: 297 NTLITCCKRGSLHQEAAQVFEEMKAAGFSYDKVTYNALLDVYGKSHRPKEAMKVLNEMVL 356

Query: 397 KGVKPTSATYNVLMDAYSRRMQPEVVEKLMLEMKDVGFEPNVKSYTCLISAYGRQKKMSD 456
            G  P+  TYN L+ AY+R    +   +L  +M + G +P+V +YT L+S + R  K+ +
Sbjct: 357 NGFSPSIVTYNSLISAYARDGMLDEAMELKNQMAEKGTKPDVFTYTTLLSGFERAGKV-E 416

Query: 457 MAADAFLRMKKNGIRPTSHSYTALIHAYSVSGWHEKAYLTFENMQREGLKPSIETYTTLL 516
            A   F  M+  G +P   ++ A I  Y   G   +    F+ +   GL P I T+ TLL
Sbjct: 417 SAMSIFEEMRNAGCKPNICTFNAFIKMYGNRGKFTEMMKIFDEINVCGLSPDIVTWNTLL 476

Query: 517 DAFRRAGDTVALMKIWKLMIREKIEGTRVTFNILLDGFAKQGHYVEARDVISEFGKIGLQ 576
             F + G    +  ++K M R      R TFN L+  +++ G + +A  V       G+ 
Sbjct: 477 AVFGQNGMDSEVSGVFKEMKRAGFVPERETFNTLISAYSRCGSFEQAMTVYRRMLDAGVT 536

Query: 577 PTVMTYNMLMNAYARGGQHAKMPQLLQEMAARELKPDSVTYSTMIYAYVRVRD 628
           P + TYN ++ A ARGG   +  ++L EM     KP+ +TY ++++AY   ++
Sbjct: 537 PDLSTYNTVLAALARGGMWEQSEKVLAEMEDGRCKPNELTYCSLLHAYANGKE 574

BLAST of Moc03g30020 vs. ExPASy Swiss-Prot
Match: Q9FIX3 (Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX=3702 GN=EMB2745 PE=2 SV=1)

HSP 1 Score: 178.3 bits (451), Expect = 3.0e-43
Identity = 115/410 (28.05%), Postives = 197/410 (48.05%), Query Frame = 0

Query: 247 LFKNLPLKKEFQDVHVYNSAMSGLMFCNRYNDACKVYEAMETNNVNPDHVTCSIMITVMR 306
           +FK +   +   +V  YN  + G  F    + A  +++ MET    P+ VT + +I    
Sbjct: 192 VFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYC 251

Query: 307 KIGRSAKDSWDYFEKMNKKGVKWSPEVLGALIKSFCDEGLKSQALIIQLEMEKKGVASNA 366
           K+ R   D +     M  KG++ +      +I   C EG   +   +  EM ++G + + 
Sbjct: 252 KL-RKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDE 311

Query: 367 IVYNTILDAFSKSNQIEEAEGLFAEMKAKGVKPTSATYNVLMDAYSRRMQPEVVEKLMLE 426
           + YNT++  + K     +A  + AEM   G+ P+  TY  L+ +  +        + + +
Sbjct: 312 VTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQ 371

Query: 427 MKDVGFEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTALIHAYSVSG 486
           M+  G  PN ++YT L+  + ++  M++ A      M  NG  P+  +Y ALI+ + V+G
Sbjct: 372 MRVRGLCPNERTYTTLVDGFSQKGYMNE-AYRVLREMNDNGFSPSVVTYNALINGHCVTG 431

Query: 487 WHEKAYLTFENMQREGLKPSIETYTTLLDAFRRAGDTVALMKIWKLMIREKIEGTRVTFN 546
             E A    E+M+ +GL P + +Y+T+L  F R+ D    +++ + M+ + I+   +T++
Sbjct: 432 KMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTITYS 491

Query: 547 ILLDGFAKQGHYVEARDVISEFGKIGLQPTVMTYNMLMNAYARGGQHAKMPQLLQEMAAR 606
            L+ GF +Q    EA D+  E  ++GL P   TY  L+NAY   G   K  QL  EM  +
Sbjct: 492 SLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEK 551

Query: 607 ELKPDSVTYSTMIYA---YVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKL 654
            + PD VTYS +I       R R+ KR      K+     VP   +Y  L
Sbjct: 552 GVLPDVVTYSVLINGLNKQSRTREAKRLLL---KLFYEESVPSDVTYHTL 596

BLAST of Moc03g30020 vs. ExPASy Swiss-Prot
Match: O64624 (Pentatricopeptide repeat-containing protein At2g18940, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=At2g18940 PE=2 SV=1)

HSP 1 Score: 175.3 bits (443), Expect = 2.6e-42
Identity = 118/473 (24.95%), Postives = 226/473 (47.78%), Query Frame = 0

Query: 189 EKQCMEV-LRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYSLIFPSLGRAGMGEKIMVL 248
           + Q +E+ +R+LG E+         + + LQE  L+  RAY+ I  +  R G  EK + L
Sbjct: 174 DHQVIEIFVRILGRESQYSVAAKLLDKIPLQE-YLLDVRAYTTILHAYSRTGKYEKAIDL 233

Query: 249 FKNLPLKKEFQDVHVYNSAMSGLMFCNR-YNDACKVYEAMETNNVNPDHVTCSIMITVMR 308
           F+ +        +  YN  +       R +     V + M +  +  D  TCS +++   
Sbjct: 234 FERMKEMGPSPTLVTYNVILDVFGKMGRSWRKILGVLDEMRSKGLKFDEFTCSTVLSACA 293

Query: 309 KIGRSAKDSWDYFEKMNKKGVKWSPEVLGALIKSFCDEGLKSQALIIQLEMEKKGVASNA 368
           + G   +++ ++F ++   G +       AL++ F   G+ ++AL +  EME+    +++
Sbjct: 294 REG-LLREAKEFFAELKSCGYEPGTVTYNALLQVFGKAGVYTEALSVLKEMEENSCPADS 353

Query: 369 IVYNTILDAFSKSNQIEEAEGLFAEMKAKGVKPTSATYNVLMDAYSRRMQPEVVEKLMLE 428
           + YN ++ A+ ++   +EA G+   M  KGV P + TY  ++DAY +  + +   KL   
Sbjct: 354 VTYNELVAAYVRAGFSKEAAGVIEMMTKKGVMPNAITYTTVIDAYGKAGKEDEALKLFYS 413

Query: 429 MKDVGFEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTALIHAYSVSG 488
           MK+ G  PN  +Y  ++S  G++ + ++M       MK NG  P   ++  ++      G
Sbjct: 414 MKEAGCVPNTCTYNAVLSLLGKKSRSNEM-IKMLCDMKSNGCSPNRATWNTMLALCGNKG 473

Query: 489 WHEKAYLTFENMQREGLKPSIETYTTLLDAFRRAGDTVALMKIWKLMIREKIEGTRVTFN 548
             +     F  M+  G +P  +T+ TL+ A+ R G  V   K++  M R        T+N
Sbjct: 474 MDKFVNRVFREMKSCGFEPDRDTFNTLISAYGRCGSEVDASKMYGEMTRAGFNACVTTYN 533

Query: 549 ILLDGFAKQGHYVEARDVISEFGKIGLQPTVMTYNMLMNAYARGGQHAKMPQLLQEMAAR 608
            LL+  A++G +    +VIS+    G +PT  +Y++++  YA+GG +  + ++   +   
Sbjct: 534 ALLNALARKGDWRSGENVISDMKSKGFKPTETSYSLMLQCYAKGGNYLGIERIENRIKEG 593

Query: 609 ELKPDSVTYSTMIYAYVRVRDF---KRAFFYHKKMVKSGQVPDVKSYQKLRAI 657
           ++ P  +   T++ A  + R     +RAF   K   K G  PD+  +  + +I
Sbjct: 594 QIFPSWMLLRTLLLANFKCRALAGSERAFTLFK---KHGYKPDMVIFNSMLSI 640

BLAST of Moc03g30020 vs. ExPASy Swiss-Prot
Match: Q76C99 (Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV=1)

HSP 1 Score: 169.5 bits (428), Expect = 1.4e-40
Identity = 110/392 (28.06%), Postives = 186/392 (47.45%), Query Frame = 0

Query: 259 DVHVYNSAMSGLMFCNRYNDACKVYEAMETNNVNPDHVTCSIMITVMRKIGRSAKDSWDY 318
           DV  Y + ++G       + A   Y  M    + PD VT + +I  + K  ++   + + 
Sbjct: 195 DVVSYTTVINGFFKEGDSDKAYSTYHEMLDRGILPDVVTYNSIIAALCK-AQAMDKAMEV 254

Query: 319 FEKMNKKGVKWSPEVLGALIKSFCDEGLKSQALIIQLEMEKKGVASNAIVYNTILDAFSK 378
              M K GV        +++  +C  G   +A+    +M   GV  + + Y+ ++D   K
Sbjct: 255 LNTMVKNGVMPDCMTYNSILHGYCSSGQPKEAIGFLKKMRSDGVEPDVVTYSLLMDYLCK 314

Query: 379 SNQIEEAEGLFAEMKAKGVKPTSATYNVLMDAYSRRMQPEVVEKLMLEMKDVGFEPNVKS 438
           + +  EA  +F  M  +G+KP   TY  L+  Y+ +     +  L+  M   G  P+   
Sbjct: 315 NGRCMEARKIFDSMTKRGLKPEITTYGTLLQGYATKGALVEMHGLLDLMVRNGIHPDHYV 374

Query: 439 YTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTALIHAYSVSGWHEKAYLTFENM 498
           ++ LI AY +Q K+ D A   F +M++ G+ P + +Y A+I     SG  E A L FE M
Sbjct: 375 FSILICAYAKQGKV-DQAMLVFSKMRQQGLNPNAVTYGAVIGILCKSGRVEDAMLYFEQM 434

Query: 499 QREGLKPSIETYTTLLDAFRRAGDTVALMKIWKLMIREKIEGTRVTFNILLDGFAKQGHY 558
             EGL P    Y +L+             ++   M+   I    + FN ++D   K+G  
Sbjct: 435 IDEGLSPGNIVYNSLIHGLCTCNKWERAEELILEMLDRGICLNTIFFNSIIDSHCKEGRV 494

Query: 559 VEARDVISEFGKIGLQPTVMTYNMLMNAYARGGQHAKMPQLLQEMAARELKPDSVTYSTM 618
           +E+  +     +IG++P V+TYN L+N Y   G+  +  +LL  M +  LKP++VTYST+
Sbjct: 495 IESEKLFELMVRIGVKPNVITYNTLINGYCLAGKMDEAMKLLSGMVSVGLKPNTVTYSTL 554

Query: 619 IYAYVRVRDFKRAFFYHKKMVKSGQVPDVKSY 651
           I  Y ++   + A    K+M  SG  PD+ +Y
Sbjct: 555 INGYCKISRMEDALVLFKEMESSGVSPDIITY 584

BLAST of Moc03g30020 vs. ExPASy TrEMBL
Match: A0A6J1C7V4 (pentatricopeptide repeat-containing protein At5g50280, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111009047 PE=4 SV=1)

HSP 1 Score: 1416.7 bits (3666), Expect = 0.0e+00
Identity = 714/714 (100.00%), Postives = 714/714 (100.00%), Query Frame = 0

Query: 1   MALAHQHHFLNSFHSIRKPNLNQYSPNSHLFFQFNTQKLVCSCATSPNPTTQSPSPIFLP 60
           MALAHQHHFLNSFHSIRKPNLNQYSPNSHLFFQFNTQKLVCSCATSPNPTTQSPSPIFLP
Sbjct: 1   MALAHQHHFLNSFHSIRKPNLNQYSPNSHLFFQFNTQKLVCSCATSPNPTTQSPSPIFLP 60

Query: 61  FLEENEEQEEEEEVKGAHRHEGNKMEDWDDPLFRFFKSRTSTTQDPQRESKVSLQKNRRS 120
           FLEENEEQEEEEEVKGAHRHEGNKMEDWDDPLFRFFKSRTSTTQDPQRESKVSLQKNRRS
Sbjct: 61  FLEENEEQEEEEEVKGAHRHEGNKMEDWDDPLFRFFKSRTSTTQDPQRESKVSLQKNRRS 120

Query: 121 SWHLASGSEFADEAEITIDEVAGQLSSVSQNFRALPDGVVGDIMRTARNLPENTTLGEAL 180
           SWHLASGSEFADEAEITIDEVAGQLSSVSQNFRALPDGVVGDIMRTARNLPENTTLGEAL
Sbjct: 121 SWHLASGSEFADEAEITIDEVAGQLSSVSQNFRALPDGVVGDIMRTARNLPENTTLGEAL 180

Query: 181 ADFDGRIGEKQCMEVLRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYSLIFPSLGRAGM 240
           ADFDGRIGEKQCMEVLRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYSLIFPSLGRAGM
Sbjct: 181 ADFDGRIGEKQCMEVLRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYSLIFPSLGRAGM 240

Query: 241 GEKIMVLFKNLPLKKEFQDVHVYNSAMSGLMFCNRYNDACKVYEAMETNNVNPDHVTCSI 300
           GEKIMVLFKNLPLKKEFQDVHVYNSAMSGLMFCNRYNDACKVYEAMETNNVNPDHVTCSI
Sbjct: 241 GEKIMVLFKNLPLKKEFQDVHVYNSAMSGLMFCNRYNDACKVYEAMETNNVNPDHVTCSI 300

Query: 301 MITVMRKIGRSAKDSWDYFEKMNKKGVKWSPEVLGALIKSFCDEGLKSQALIIQLEMEKK 360
           MITVMRKIGRSAKDSWDYFEKMNKKGVKWSPEVLGALIKSFCDEGLKSQALIIQLEMEKK
Sbjct: 301 MITVMRKIGRSAKDSWDYFEKMNKKGVKWSPEVLGALIKSFCDEGLKSQALIIQLEMEKK 360

Query: 361 GVASNAIVYNTILDAFSKSNQIEEAEGLFAEMKAKGVKPTSATYNVLMDAYSRRMQPEVV 420
           GVASNAIVYNTILDAFSKSNQIEEAEGLFAEMKAKGVKPTSATYNVLMDAYSRRMQPEVV
Sbjct: 361 GVASNAIVYNTILDAFSKSNQIEEAEGLFAEMKAKGVKPTSATYNVLMDAYSRRMQPEVV 420

Query: 421 EKLMLEMKDVGFEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTALIH 480
           EKLMLEMKDVGFEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTALIH
Sbjct: 421 EKLMLEMKDVGFEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTALIH 480

Query: 481 AYSVSGWHEKAYLTFENMQREGLKPSIETYTTLLDAFRRAGDTVALMKIWKLMIREKIEG 540
           AYSVSGWHEKAYLTFENMQREGLKPSIETYTTLLDAFRRAGDTVALMKIWKLMIREKIEG
Sbjct: 481 AYSVSGWHEKAYLTFENMQREGLKPSIETYTTLLDAFRRAGDTVALMKIWKLMIREKIEG 540

Query: 541 TRVTFNILLDGFAKQGHYVEARDVISEFGKIGLQPTVMTYNMLMNAYARGGQHAKMPQLL 600
           TRVTFNILLDGFAKQGHYVEARDVISEFGKIGLQPTVMTYNMLMNAYARGGQHAKMPQLL
Sbjct: 541 TRVTFNILLDGFAKQGHYVEARDVISEFGKIGLQPTVMTYNMLMNAYARGGQHAKMPQLL 600

Query: 601 QEMAARELKPDSVTYSTMIYAYVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLRAILDVK 660
           QEMAARELKPDSVTYSTMIYAYVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLRAILDVK
Sbjct: 601 QEMAARELKPDSVTYSTMIYAYVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLRAILDVK 660

Query: 661 LATKNRKDKSAILGIINSKMGMVKAKKKGKKDEFWKNKRKHVKTQRVSPMEAKS 715
           LATKNRKDKSAILGIINSKMGMVKAKKKGKKDEFWKNKRKHVKTQRVSPMEAKS
Sbjct: 661 LATKNRKDKSAILGIINSKMGMVKAKKKGKKDEFWKNKRKHVKTQRVSPMEAKS 714

BLAST of Moc03g30020 vs. ExPASy TrEMBL
Match: A0A6J1E8D2 (pentatricopeptide repeat-containing protein At5g50280, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111431694 PE=4 SV=1)

HSP 1 Score: 1203.7 bits (3113), Expect = 0.0e+00
Identity = 608/714 (85.15%), Postives = 650/714 (91.04%), Query Frame = 0

Query: 1   MALAHQHHFLNSFHSIRKPNLNQYSPNSHLFFQFNTQKLV-CSCATSPNPTTQSPSPIFL 60
           MAL HQHH   SF SIRK NL Q + NS LFFQF+T+KL  C CATSPNP +QSPSPIFL
Sbjct: 1   MALVHQHHLPQSFQSIRKANLKQSTSNSCLFFQFSTRKLAGCLCATSPNPISQSPSPIFL 60

Query: 61  PFLEENEEQEEEEEVKGAHRH--EGNKMEDWDDPLFRFFKSRTSTTQDPQRESKVSLQKN 120
           PFLEE EE+EEEEE +  H+    GN  EDW+DPL RFFKSRTSTTQDP  ESK+SLQKN
Sbjct: 61  PFLEEEEEEEEEEEDEEEHKEVLGGNTTEDWNDPLVRFFKSRTSTTQDPLPESKLSLQKN 120

Query: 121 RRSSWHLASGSEFADEAEITIDEVAGQLSSVSQNFRALPDGVVGDIMRTARNLPENTTLG 180
           RRSSWHLAS  E + EAEI   E   Q   VS+N RALPDGVVGDI+RTARNLP+NTTLG
Sbjct: 121 RRSSWHLASDVECSVEAEIAPGEDKKQSGLVSRNSRALPDGVVGDIVRTARNLPQNTTLG 180

Query: 181 EALADFDGRIGEKQCMEVLRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYSLIFPSLGR 240
           EAL DF+G+I EK+C+EVLRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYS++FP LGR
Sbjct: 181 EALGDFEGKIDEKECLEVLRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYSILFPLLGR 240

Query: 241 AGMGEKIMVLFKNLPLKKEFQDVHVYNSAMSGLMFCNRYNDACKVYEAMETNNVNPDHVT 300
           AGMG+KIMVLFKN+PLKKE QDVHVYNSAMSGLM C RYNDAC+VY AMETN VNPDHVT
Sbjct: 241 AGMGDKIMVLFKNIPLKKELQDVHVYNSAMSGLMVCKRYNDACEVYAAMETNKVNPDHVT 300

Query: 301 CSIMITVMRKIGRSAKDSWDYFEKMNKKGVKWSPEVLGALIKSFCDEGLKSQALIIQLEM 360
           CSIMITVMRKIGRSAKDSWDYFEKMN+KGVKWSPEVLGALIK+FCDEGLKSQALIIQLEM
Sbjct: 301 CSIMITVMRKIGRSAKDSWDYFEKMNEKGVKWSPEVLGALIKAFCDEGLKSQALIIQLEM 360

Query: 361 EKKGVASNAIVYNTILDAFSKSNQIEEAEGLFAEMKAKGVKPTSATYNVLMDAYSRRMQP 420
           EKKGV SNAIVYNTI+DAFSKSNQIEEAEGLFAEMKAKGVKPTSAT+N+LMDAYSRRMQP
Sbjct: 361 EKKGVTSNAIVYNTIMDAFSKSNQIEEAEGLFAEMKAKGVKPTSATFNILMDAYSRRMQP 420

Query: 421 EVVEKLMLEMKDVGFEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTA 480
           E+VEKL++EMKD GFEPNVKSYTCLISAYGR+K MSDMAADAFLRMKKNGI+P SHSYTA
Sbjct: 421 EIVEKLLIEMKDTGFEPNVKSYTCLISAYGRKKTMSDMAADAFLRMKKNGIKPNSHSYTA 480

Query: 481 LIHAYSVSGWHEKAYLTFENMQREGLKPSIETYTTLLDAFRRAGDTVALMKIWKLMIREK 540
           LIHAYSVSGWHEKAY TFENM +EGLKPSIETYTTLLDAFRRAGDT ALMKIWKLMIREK
Sbjct: 481 LIHAYSVSGWHEKAYSTFENMLQEGLKPSIETYTTLLDAFRRAGDTEALMKIWKLMIREK 540

Query: 541 IEGTRVTFNILLDGFAKQGHYVEARDVISEFGKIGLQPTVMTYNMLMNAYARGGQHAKMP 600
           + GTRVTFNILLDGFAKQGHY+EARDVISEF K GLQPT+MTYNMLMNAYARGGQH KMP
Sbjct: 541 VAGTRVTFNILLDGFAKQGHYIEARDVISEFSKTGLQPTIMTYNMLMNAYARGGQHLKMP 600

Query: 601 QLLQEMAARELKPDSVTYSTMIYAYVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLRAIL 660
           QLLQEMAARELKPDSVTYSTMIYA+VRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLR+IL
Sbjct: 601 QLLQEMAARELKPDSVTYSTMIYAFVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLRSIL 660

Query: 661 DVKLATKNRKDKSAILGIINSKMGMVKAKKKGKKDEFWKNKRKHVKTQRVSPME 712
           D KL TKNRKDKSAILGI+NSK+GMVKAKKKGKKDEFWKNKRK+VKT R+SP E
Sbjct: 661 DAKLDTKNRKDKSAILGIMNSKLGMVKAKKKGKKDEFWKNKRKYVKTLRISPNE 714

BLAST of Moc03g30020 vs. ExPASy TrEMBL
Match: A0A6J1KFZ0 (pentatricopeptide repeat-containing protein At5g50280, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111495422 PE=4 SV=1)

HSP 1 Score: 1203.3 bits (3112), Expect = 0.0e+00
Identity = 609/714 (85.29%), Postives = 652/714 (91.32%), Query Frame = 0

Query: 1   MALAHQHHFLNSFHSIRKPNLNQYSPNSHLFFQFNTQKLVCS-CATSPNPTTQSPSPIFL 60
           MAL HQHH   SF SIRK NLNQ + NS LFFQF+T+KL C  CATSPNPT+QSPSPIFL
Sbjct: 8   MALVHQHHLPQSFQSIRKANLNQSTSNSCLFFQFSTRKLACCLCATSPNPTSQSPSPIFL 67

Query: 61  PFLEENEEQEEEEEVKGAHRH--EGNKMEDWDDPLFRFFKSRTSTTQDPQRESKVSLQKN 120
            FLEE EE+EEEEE +   +    GN  EDW+DPL RFFKSR STTQDP  ESK+SLQKN
Sbjct: 68  RFLEEEEEEEEEEEEEEEPKDVLGGNTTEDWNDPLVRFFKSRNSTTQDPLPESKLSLQKN 127

Query: 121 RRSSWHLASGSEFADEAEITIDEVAGQLSSVSQNFRALPDGVVGDIMRTARNLPENTTLG 180
           RRSSWHLAS  E + EAEI  D+   Q  SVS+N R LPDGVVGDI+RTARNLP+NTTLG
Sbjct: 128 RRSSWHLASEVECSVEAEIAPDDDKKQSGSVSRNSRVLPDGVVGDIVRTARNLPQNTTLG 187

Query: 181 EALADFDGRIGEKQCMEVLRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYSLIFPSLGR 240
           EAL DF+G+I EK+C+EVLRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYS++FP LGR
Sbjct: 188 EALGDFEGKIDEKECLEVLRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYSILFPLLGR 247

Query: 241 AGMGEKIMVLFKNLPLKKEFQDVHVYNSAMSGLMFCNRYNDACKVYEAMETNNVNPDHVT 300
           AGMG+KIMVLFKN+PLKKE QDVHVYNSAMSGLM C RYNDAC+VYEAMETN VNPDHVT
Sbjct: 248 AGMGDKIMVLFKNIPLKKELQDVHVYNSAMSGLMVCKRYNDACEVYEAMETNKVNPDHVT 307

Query: 301 CSIMITVMRKIGRSAKDSWDYFEKMNKKGVKWSPEVLGALIKSFCDEGLKSQALIIQLEM 360
           CSIMITVMRKIGRSAKDSWDYFEKMN+KGVKWSPEVLGALIK+FCDEGLKSQALIIQLEM
Sbjct: 308 CSIMITVMRKIGRSAKDSWDYFEKMNEKGVKWSPEVLGALIKAFCDEGLKSQALIIQLEM 367

Query: 361 EKKGVASNAIVYNTILDAFSKSNQIEEAEGLFAEMKAKGVKPTSATYNVLMDAYSRRMQP 420
           EKKGVASNAIVYNTI+DAFSKSNQIEEAEGLFAEMKAKGVKPTSAT+N+LMDAYSRRMQP
Sbjct: 368 EKKGVASNAIVYNTIMDAFSKSNQIEEAEGLFAEMKAKGVKPTSATFNILMDAYSRRMQP 427

Query: 421 EVVEKLMLEMKDVGFEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTA 480
           E+VEKL++EMK++GFEPNVKSYTCLISAYGR+K MSDMAADAFLRMKKNGI+P SHSYTA
Sbjct: 428 EIVEKLLIEMKEMGFEPNVKSYTCLISAYGRKKTMSDMAADAFLRMKKNGIKPNSHSYTA 487

Query: 481 LIHAYSVSGWHEKAYLTFENMQREGLKPSIETYTTLLDAFRRAGDTVALMKIWKLMIREK 540
           LIHAYSVSGWHEKAY TFENM REGLKPSIETYTTLLDAFRRAGDT ALMKIWK M+REK
Sbjct: 488 LIHAYSVSGWHEKAYSTFENMLREGLKPSIETYTTLLDAFRRAGDTEALMKIWKFMVREK 547

Query: 541 IEGTRVTFNILLDGFAKQGHYVEARDVISEFGKIGLQPTVMTYNMLMNAYARGGQHAKMP 600
           + GTRVTFNILLDGFAKQGHY+EARDVISEF KIGLQPTVMTYNMLMNAYARGGQH KMP
Sbjct: 548 VAGTRVTFNILLDGFAKQGHYIEARDVISEFSKIGLQPTVMTYNMLMNAYARGGQHLKMP 607

Query: 601 QLLQEMAARELKPDSVTYSTMIYAYVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLRAIL 660
           QLLQEMAA ELKPDSVTYSTMIYA+VRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLR+IL
Sbjct: 608 QLLQEMAAWELKPDSVTYSTMIYAFVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKLRSIL 667

Query: 661 DVKLATKNRKDKSAILGIINSKMGMVKAKKKGKKDEFWKNKRKHVKTQRVSPME 712
           D KL TKNRKDKSAILGI+NSK+GMVKAKKKGKKDEFWKNKRK+VKT RVSP E
Sbjct: 668 DEKLDTKNRKDKSAILGIMNSKLGMVKAKKKGKKDEFWKNKRKYVKTMRVSPNE 721

BLAST of Moc03g30020 vs. ExPASy TrEMBL
Match: A0A5A7UP11 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold80G001340 PE=4 SV=1)

HSP 1 Score: 1188.7 bits (3074), Expect = 0.0e+00
Identity = 606/712 (85.11%), Postives = 646/712 (90.73%), Query Frame = 0

Query: 1    MALAHQHHFLNSFHSIRKPNLNQYSPNSHLFFQFNTQKLVCS-CATSPNPTTQSPSPIFL 60
            MAL HQHH    F SI K N  Q + NS  FFQ NTQKL C  CA SPNPTTQSPSPIFL
Sbjct: 701  MALVHQHHLTFPFLSIGKANPKQNTSNSCSFFQSNTQKLACCLCAASPNPTTQSPSPIFL 760

Query: 61   PFLEENEEQE------EEEEVKGAHRHEGNKM-EDWDDPLFRFFKSRTSTTQDPQRESKV 120
             FL++ EE+E      EEEEV     H GNK  EDW+DPLFRFFKSRTSTTQDP RESK+
Sbjct: 761  HFLQKEEEEEEEVEEVEEEEVPSKEVHGGNKTEEDWNDPLFRFFKSRTSTTQDPSRESKL 820

Query: 121  SLQKNRRSSWHLASGSEFADEAEITIDEVAGQLSSVSQNFRALPDGVVGDIMRTARNLPE 180
            SLQKNRRSSWHLAS  EF DEAE+T++E   QL S S+N R LPDG+VG+I+  ARNL +
Sbjct: 821  SLQKNRRSSWHLASDFEFFDEAEVTLEEDKEQLGSASRNSRVLPDGLVGEIVGIARNLSQ 880

Query: 181  NTTLGEALADFDGRIGEKQCMEVLRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYSLIF 240
            N TLGEAL +F+GRI EK+C+EVLRLLGEENLVVCCLYFFEWMGLQE SLVT RAYSL+F
Sbjct: 881  NMTLGEALGEFEGRISEKECLEVLRLLGEENLVVCCLYFFEWMGLQETSLVTSRAYSLLF 940

Query: 241  PSLGRAGMGEKIMVLFKNLPLKKEFQDVHVYNSAMSGLMFCNRYNDACKVYEAMETNNVN 300
            P LGRAGMGEKIMVLFKNLPL+KEFQDVHVYNSAMSGLM C RY+DACKVYEAMETNNVN
Sbjct: 941  PLLGRAGMGEKIMVLFKNLPLRKEFQDVHVYNSAMSGLMVCKRYDDACKVYEAMETNNVN 1000

Query: 301  PDHVTCSIMITVMRKIGRSAKDSWDYFEKMNKKGVKWSPEVLGALIKSFCDEGLKSQALI 360
            PDHVTCSIMITVMRKIGRSAKDSWDYFEKMN+KGVKWS EVLGALIKSFCDEGLKSQALI
Sbjct: 1001 PDHVTCSIMITVMRKIGRSAKDSWDYFEKMNQKGVKWSSEVLGALIKSFCDEGLKSQALI 1060

Query: 361  IQLEMEKKGVASNAIVYNTILDAFSKSNQIEEAEGLFAEMKAKGVKPTSATYNVLMDAYS 420
            IQLEMEKKGVASN I+YNTI+DAFSKSNQIEEAEG+FAEMK+KGVKPTSA++N+LM+AYS
Sbjct: 1061 IQLEMEKKGVASNVIMYNTIMDAFSKSNQIEEAEGVFAEMKSKGVKPTSASFNILMNAYS 1120

Query: 421  RRMQPEVVEKLMLEMKDVGFEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTS 480
            RRMQPE+VEKL++EMKD+G EPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTS
Sbjct: 1121 RRMQPEIVEKLLVEMKDMGLEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTS 1180

Query: 481  HSYTALIHAYSVSGWHEKAYLTFENMQREGLKPSIETYTTLLDAFRRAGDTVALMKIWKL 540
            HSYTALIHAYSVSGWHEKAYL FENM REGLKPSIETYTTLLDAFRRAGDTV+LMKIWKL
Sbjct: 1181 HSYTALIHAYSVSGWHEKAYLIFENMLREGLKPSIETYTTLLDAFRRAGDTVSLMKIWKL 1240

Query: 541  MIREKIEGTRVTFNILLDGFAKQGHYVEARDVISEFGKIGLQPTVMTYNMLMNAYARGGQ 600
            MIREKI GTRVTFNILLDGFAKQGHYVEARDVISEF KIGLQPTVMTYNMLMNAYARGGQ
Sbjct: 1241 MIREKIVGTRVTFNILLDGFAKQGHYVEARDVISEFDKIGLQPTVMTYNMLMNAYARGGQ 1300

Query: 601  HAKMPQLLQEMAARELKPDSVTYSTMIYAYVRVRDFKRAFFYHKKMVKSGQVPDVKSYQK 660
            H KMPQLLQEMAARELKPDSVTYSTMIYA+VRVRDFKRAFFYHKKMVKSGQVPDVKSYQK
Sbjct: 1301 HLKMPQLLQEMAARELKPDSVTYSTMIYAFVRVRDFKRAFFYHKKMVKSGQVPDVKSYQK 1360

Query: 661  LRAILDVKLATKNRKDKSAILGIINSKMGMVKAKKKGKKDEFWKNKRKHVKT 705
            L++ILDVKLATKNRKDKSAILGIINSKMGMVKAKKKGKKDEFWK KR+HV+T
Sbjct: 1361 LKSILDVKLATKNRKDKSAILGIINSKMGMVKAKKKGKKDEFWKTKRRHVRT 1412

BLAST of Moc03g30020 vs. ExPASy TrEMBL
Match: A0A1S3AY38 (pentatricopeptide repeat-containing protein At5g50280, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103484023 PE=3 SV=1)

HSP 1 Score: 1188.3 bits (3073), Expect = 0.0e+00
Identity = 605/711 (85.09%), Postives = 647/711 (91.00%), Query Frame = 0

Query: 1   MALAHQHHFLNSFHSIRKPNLNQYSPNSHLFFQFNTQKLVCS-CATSPNPTTQSPSPIFL 60
           MAL HQHH    F SI K NL Q + NS  FFQ NTQKL C  CA SPNPTTQSPSPIFL
Sbjct: 1   MALVHQHHLTFPFLSIGKANLKQNTSNSCSFFQSNTQKLACCLCAASPNPTTQSPSPIFL 60

Query: 61  PFLEENEEQEEEEEVK-----GAHRHEGNKM-EDWDDPLFRFFKSRTSTTQDPQRESKVS 120
            FL+E EE+EEEEEV+         H GNK  EDW+DPLFRFFKSRTSTTQDP RESK+S
Sbjct: 61  HFLQEEEEEEEEEEVEEEEVPSKEVHGGNKTEEDWNDPLFRFFKSRTSTTQDPSRESKLS 120

Query: 121 LQKNRRSSWHLASGSEFADEAEITIDEVAGQLSSVSQNFRALPDGVVGDIMRTARNLPEN 180
           LQKNRRSSWHLAS  EF +EAE+T++E   QL S S+N R LPDG+VG+I+  ARNL +N
Sbjct: 121 LQKNRRSSWHLASDVEFFNEAEVTLEEDKEQLGSASRNSRVLPDGLVGEIVGIARNLSQN 180

Query: 181 TTLGEALADFDGRIGEKQCMEVLRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYSLIFP 240
            TLGEAL +F+GRI EK+C+EVLRLLGEENLVVCCLYFFEWMGLQE SLVT RAYSL+FP
Sbjct: 181 MTLGEALGEFEGRISEKECLEVLRLLGEENLVVCCLYFFEWMGLQETSLVTSRAYSLLFP 240

Query: 241 SLGRAGMGEKIMVLFKNLPLKKEFQDVHVYNSAMSGLMFCNRYNDACKVYEAMETNNVNP 300
            LGRAGMGEKIMVLFKNLPL+KEFQDVHVYNSAMSGLM C RY+DACKVYEAMETNNVNP
Sbjct: 241 LLGRAGMGEKIMVLFKNLPLRKEFQDVHVYNSAMSGLMVCKRYDDACKVYEAMETNNVNP 300

Query: 301 DHVTCSIMITVMRKIGRSAKDSWDYFEKMNKKGVKWSPEVLGALIKSFCDEGLKSQALII 360
           DHVTCSIMITVMRKIGRSAKDSWDYFEKMN+KGVKWS EVLGALIKSFCDEGLKSQALII
Sbjct: 301 DHVTCSIMITVMRKIGRSAKDSWDYFEKMNQKGVKWSSEVLGALIKSFCDEGLKSQALII 360

Query: 361 QLEMEKKGVASNAIVYNTILDAFSKSNQIEEAEGLFAEMKAKGVKPTSATYNVLMDAYSR 420
           QLEMEKKGVASN I+YNTI+DAFSKSNQIEEAEG+FAEMK+KGVKPTSA++N+LM+AYSR
Sbjct: 361 QLEMEKKGVASNVIMYNTIMDAFSKSNQIEEAEGVFAEMKSKGVKPTSASFNILMNAYSR 420

Query: 421 RMQPEVVEKLMLEMKDVGFEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSH 480
           RMQPE+VEKL++EMKD+G EPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSH
Sbjct: 421 RMQPEIVEKLLVEMKDMGLEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSH 480

Query: 481 SYTALIHAYSVSGWHEKAYLTFENMQREGLKPSIETYTTLLDAFRRAGDTVALMKIWKLM 540
           SYTALIHAYSVSGWHEKAY  FENM REGLKPSIETYTTLLDAFRRAGDTV+LMKIWKLM
Sbjct: 481 SYTALIHAYSVSGWHEKAYSIFENMLREGLKPSIETYTTLLDAFRRAGDTVSLMKIWKLM 540

Query: 541 IREKIEGTRVTFNILLDGFAKQGHYVEARDVISEFGKIGLQPTVMTYNMLMNAYARGGQH 600
           IREKI GTRVTFNILLDGFAKQGHYVEARDVISEF KIGLQPTVMTYNMLMNAYARGGQH
Sbjct: 541 IREKIVGTRVTFNILLDGFAKQGHYVEARDVISEFDKIGLQPTVMTYNMLMNAYARGGQH 600

Query: 601 AKMPQLLQEMAARELKPDSVTYSTMIYAYVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKL 660
            KMPQLLQEMAARELKPDSVTYSTMIYA+VRVRDFKRAFFYHKKMVKSGQVPDVKSYQKL
Sbjct: 601 LKMPQLLQEMAARELKPDSVTYSTMIYAFVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKL 660

Query: 661 RAILDVKLATKNRKDKSAILGIINSKMGMVKAKKKGKKDEFWKNKRKHVKT 705
           ++ILDVKLATKNRKDKSAILGIINSKMGMVKAK+KGKKDEFWK KR+HV+T
Sbjct: 661 KSILDVKLATKNRKDKSAILGIINSKMGMVKAKQKGKKDEFWKTKRRHVRT 711

BLAST of Moc03g30020 vs. TAIR 10
Match: AT5G50280.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 840.1 bits (2169), Expect = 1.3e-243
Identity = 427/669 (63.83%), Postives = 523/669 (78.18%), Query Frame = 0

Query: 44  ATSPNPTTQSPSPIFLPF-------LEENEEQEEEEEVKGAHRHEGNKMEDWDDPLFRFF 103
           ATSP+ ++ SPS     F       +++ E      E       +  + +D+ DP+ +FF
Sbjct: 45  ATSPSSSSSSPSIFLSCFDDALPDKIQQPENSTINSEESECEEEDDEEGDDFTDPILKFF 104

Query: 104 KSRTST---TQDPQRESKVSLQKNRRSSWHLASGSEFAD-EAEITIDEVAGQLSSVSQNF 163
           KSRT T   T DP RESK SLQKNRR+SWHLA   +FAD E EI          +  Q  
Sbjct: 105 KSRTLTSESTADPARESKFSLQKNRRTSWHLA--PDFADPETEIESKPEESVFVTNQQTL 164

Query: 164 RA---LPDGVVGDIMRTARNLPENTTLGEALADFDGRIGEKQCMEVLRLLGEENLVVCCL 223
                   GV  +I+  A+NL EN TLGE L+ F+ R+ + +C+E L ++GE   V  CL
Sbjct: 165 GVHIPFESGVAREILELAKNLKENQTLGEMLSGFERRVSDTECVEALVMMGESGFVKSCL 224

Query: 224 YFFEWMGLQEPSLVTPRAYSLIFPSLGRAGMGEKIMVLFKNLPLKKEFQDVHVYNSAMSG 283
           YF+EWM LQEPSL +PRA S++F  LGR  M + I++L  NLP K+EF+DV +YN+A+SG
Sbjct: 225 YFYEWMSLQEPSLASPRACSVLFTLLGRERMADYILLLLSNLPDKEEFRDVRLYNAAISG 284

Query: 284 LMFCNRYNDACKVYEAMETNNVNPDHVTCSIMITVMRKIGRSAKDSWDYFEKMNKKGVKW 343
           L    RY+DA +VYEAM+  NV PD+VTC+I+IT +RK GRSAK+ W+ FEKM++KGVKW
Sbjct: 285 LSASQRYDDAWEVYEAMDKINVYPDNVTCAILITTLRKAGRSAKEVWEIFEKMSEKGVKW 344

Query: 344 SPEVLGALIKSFCDEGLKSQALIIQLEMEKKGVASNAIVYNTILDAFSKSNQIEEAEGLF 403
           S +V G L+KSFCDEGLK +AL+IQ EMEKKG+ SN IVYNT++DA++KSN IEE EGLF
Sbjct: 345 SQDVFGGLVKSFCDEGLKEEALVIQTEMEKKGIRSNTIVYNTLMDAYNKSNHIEEVEGLF 404

Query: 404 AEMKAKGVKPTSATYNVLMDAYSRRMQPEVVEKLMLEMKDVGFEPNVKSYTCLISAYGRQ 463
            EM+ KG+KP++ATYN+LMDAY+RRMQP++VE L+ EM+D+G EPNVKSYTCLISAYGR 
Sbjct: 405 TEMRDKGLKPSAATYNILMDAYARRMQPDIVETLLREMEDLGLEPNVKSYTCLISAYGRT 464

Query: 464 KKMSDMAADAFLRMKKNGIRPTSHSYTALIHAYSVSGWHEKAYLTFENMQREGLKPSIET 523
           KKMSDMAADAFLRMKK G++P+SHSYTALIHAYSVSGWHEKAY +FE M +EG+KPS+ET
Sbjct: 465 KKMSDMAADAFLRMKKVGLKPSSHSYTALIHAYSVSGWHEKAYASFEEMCKEGIKPSVET 524

Query: 524 YTTLLDAFRRAGDTVALMKIWKLMIREKIEGTRVTFNILLDGFAKQGHYVEARDVISEFG 583
           YT++LDAFRR+GDT  LM+IWKLM+REKI+GTR+T+N LLDGFAKQG Y+EARDV+SEF 
Sbjct: 525 YTSVLDAFRRSGDTGKLMEIWKLMLREKIKGTRITYNTLLDGFAKQGLYIEARDVVSEFS 584

Query: 584 KIGLQPTVMTYNMLMNAYARGGQHAKMPQLLQEMAARELKPDSVTYSTMIYAYVRVRDFK 643
           K+GLQP+VMTYNMLMNAYARGGQ AK+PQLL+EMAA  LKPDS+TYSTMIYA+VRVRDFK
Sbjct: 585 KMGLQPSVMTYNMLMNAYARGGQDAKLPQLLKEMAALNLKPDSITYSTMIYAFVRVRDFK 644

Query: 644 RAFFYHKKMVKSGQVPDVKSYQKLRAILDVKLATKNRKDKSAILGIINSKMGMVKAKKKG 699
           RAFFYHK MVKSGQVPD +SY+KLRAIL+ K  TKNRKDK+AILGIINSK G VKAK KG
Sbjct: 645 RAFFYHKMMVKSGQVPDPRSYEKLRAILEDKAKTKNRKDKTAILGIINSKFGRVKAKTKG 704

BLAST of Moc03g30020 vs. TAIR 10
Match: AT5G02860.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 186.4 bits (472), Expect = 7.9e-47
Identity = 126/473 (26.64%), Postives = 216/473 (45.67%), Query Frame = 0

Query: 157 DGVVGDIMRTARNLPENTTLGEALADFDGRIGEKQCMEVLRLLGEENLVVCCLYFFEWMG 216
           D V+ ++    ++ PE+T+               + +  L+ LG        L  F+W  
Sbjct: 117 DSVLSELFEPFKDKPESTS--------------SELLAFLKGLGFHKKFDLALRAFDWFM 176

Query: 217 LQE--PSLVTPRAYSLIFPSLGRAGMGEKIMVLFKNLPLKKEFQDVHVYNSAMSGLMFCN 276
            Q+   S++     ++I   LG+ G       +F  L       DV+ Y S +S      
Sbjct: 177 KQKDYQSMLDNSVVAIIISMLGKEGRVSSAANMFNGLQEDGFSLDVYSYTSLISAFANSG 236

Query: 277 RYNDACKVYEAMETNNVNPDHVTCSIMITVMRKIGRSAKDSWDYFEKMNKKGVKWSPEVL 336
           RY +A  V++ ME +   P  +T ++++ V  K+G          EKM   G+       
Sbjct: 237 RYREAVNVFKKMEEDGCKPTLITYNVILNVFGKMGTPWNKITSLVEKMKSDGIAPDAYTY 296

Query: 337 GALIKSFCDEGLKSQALIIQLEMEKKGVASNAIVYNTILDAFSKSNQIEEAEGLFAEMKA 396
             LI       L  +A  +  EM+  G + + + YN +LD + KS++ +EA  +  EM  
Sbjct: 297 NTLITCCKRGSLHQEAAQVFEEMKAAGFSYDKVTYNALLDVYGKSHRPKEAMKVLNEMVL 356

Query: 397 KGVKPTSATYNVLMDAYSRRMQPEVVEKLMLEMKDVGFEPNVKSYTCLISAYGRQKKMSD 456
            G  P+  TYN L+ AY+R    +   +L  +M + G +P+V +YT L+S + R  K+ +
Sbjct: 357 NGFSPSIVTYNSLISAYARDGMLDEAMELKNQMAEKGTKPDVFTYTTLLSGFERAGKV-E 416

Query: 457 MAADAFLRMKKNGIRPTSHSYTALIHAYSVSGWHEKAYLTFENMQREGLKPSIETYTTLL 516
            A   F  M+  G +P   ++ A I  Y   G   +    F+ +   GL P I T+ TLL
Sbjct: 417 SAMSIFEEMRNAGCKPNICTFNAFIKMYGNRGKFTEMMKIFDEINVCGLSPDIVTWNTLL 476

Query: 517 DAFRRAGDTVALMKIWKLMIREKIEGTRVTFNILLDGFAKQGHYVEARDVISEFGKIGLQ 576
             F + G    +  ++K M R      R TFN L+  +++ G + +A  V       G+ 
Sbjct: 477 AVFGQNGMDSEVSGVFKEMKRAGFVPERETFNTLISAYSRCGSFEQAMTVYRRMLDAGVT 536

Query: 577 PTVMTYNMLMNAYARGGQHAKMPQLLQEMAARELKPDSVTYSTMIYAYVRVRD 628
           P + TYN ++ A ARGG   +  ++L EM     KP+ +TY ++++AY   ++
Sbjct: 537 PDLSTYNTVLAALARGGMWEQSEKVLAEMEDGRCKPNELTYCSLLHAYANGKE 574

BLAST of Moc03g30020 vs. TAIR 10
Match: AT5G39710.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 178.3 bits (451), Expect = 2.1e-44
Identity = 115/410 (28.05%), Postives = 197/410 (48.05%), Query Frame = 0

Query: 247 LFKNLPLKKEFQDVHVYNSAMSGLMFCNRYNDACKVYEAMETNNVNPDHVTCSIMITVMR 306
           +FK +   +   +V  YN  + G  F    + A  +++ MET    P+ VT + +I    
Sbjct: 192 VFKEMLESQVSPNVFTYNILIRGFCFAGNIDVALTLFDKMETKGCLPNVVTYNTLIDGYC 251

Query: 307 KIGRSAKDSWDYFEKMNKKGVKWSPEVLGALIKSFCDEGLKSQALIIQLEMEKKGVASNA 366
           K+ R   D +     M  KG++ +      +I   C EG   +   +  EM ++G + + 
Sbjct: 252 KL-RKIDDGFKLLRSMALKGLEPNLISYNVVINGLCREGRMKEVSFVLTEMNRRGYSLDE 311

Query: 367 IVYNTILDAFSKSNQIEEAEGLFAEMKAKGVKPTSATYNVLMDAYSRRMQPEVVEKLMLE 426
           + YNT++  + K     +A  + AEM   G+ P+  TY  L+ +  +        + + +
Sbjct: 312 VTYNTLIKGYCKEGNFHQALVMHAEMLRHGLTPSVITYTSLIHSMCKAGNMNRAMEFLDQ 371

Query: 427 MKDVGFEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTALIHAYSVSG 486
           M+  G  PN ++YT L+  + ++  M++ A      M  NG  P+  +Y ALI+ + V+G
Sbjct: 372 MRVRGLCPNERTYTTLVDGFSQKGYMNE-AYRVLREMNDNGFSPSVVTYNALINGHCVTG 431

Query: 487 WHEKAYLTFENMQREGLKPSIETYTTLLDAFRRAGDTVALMKIWKLMIREKIEGTRVTFN 546
             E A    E+M+ +GL P + +Y+T+L  F R+ D    +++ + M+ + I+   +T++
Sbjct: 432 KMEDAIAVLEDMKEKGLSPDVVSYSTVLSGFCRSYDVDEALRVKREMVEKGIKPDTITYS 491

Query: 547 ILLDGFAKQGHYVEARDVISEFGKIGLQPTVMTYNMLMNAYARGGQHAKMPQLLQEMAAR 606
            L+ GF +Q    EA D+  E  ++GL P   TY  L+NAY   G   K  QL  EM  +
Sbjct: 492 SLIQGFCEQRRTKEACDLYEEMLRVGLPPDEFTYTALINAYCMEGDLEKALQLHNEMVEK 551

Query: 607 ELKPDSVTYSTMIYA---YVRVRDFKRAFFYHKKMVKSGQVPDVKSYQKL 654
            + PD VTYS +I       R R+ KR      K+     VP   +Y  L
Sbjct: 552 GVLPDVVTYSVLINGLNKQSRTREAKRLLL---KLFYEESVPSDVTYHTL 596

BLAST of Moc03g30020 vs. TAIR 10
Match: AT2G18940.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 175.3 bits (443), Expect = 1.8e-43
Identity = 118/473 (24.95%), Postives = 226/473 (47.78%), Query Frame = 0

Query: 189 EKQCMEV-LRLLGEENLVVCCLYFFEWMGLQEPSLVTPRAYSLIFPSLGRAGMGEKIMVL 248
           + Q +E+ +R+LG E+         + + LQE  L+  RAY+ I  +  R G  EK + L
Sbjct: 174 DHQVIEIFVRILGRESQYSVAAKLLDKIPLQE-YLLDVRAYTTILHAYSRTGKYEKAIDL 233

Query: 249 FKNLPLKKEFQDVHVYNSAMSGLMFCNR-YNDACKVYEAMETNNVNPDHVTCSIMITVMR 308
           F+ +        +  YN  +       R +     V + M +  +  D  TCS +++   
Sbjct: 234 FERMKEMGPSPTLVTYNVILDVFGKMGRSWRKILGVLDEMRSKGLKFDEFTCSTVLSACA 293

Query: 309 KIGRSAKDSWDYFEKMNKKGVKWSPEVLGALIKSFCDEGLKSQALIIQLEMEKKGVASNA 368
           + G   +++ ++F ++   G +       AL++ F   G+ ++AL +  EME+    +++
Sbjct: 294 REG-LLREAKEFFAELKSCGYEPGTVTYNALLQVFGKAGVYTEALSVLKEMEENSCPADS 353

Query: 369 IVYNTILDAFSKSNQIEEAEGLFAEMKAKGVKPTSATYNVLMDAYSRRMQPEVVEKLMLE 428
           + YN ++ A+ ++   +EA G+   M  KGV P + TY  ++DAY +  + +   KL   
Sbjct: 354 VTYNELVAAYVRAGFSKEAAGVIEMMTKKGVMPNAITYTTVIDAYGKAGKEDEALKLFYS 413

Query: 429 MKDVGFEPNVKSYTCLISAYGRQKKMSDMAADAFLRMKKNGIRPTSHSYTALIHAYSVSG 488
           MK+ G  PN  +Y  ++S  G++ + ++M       MK NG  P   ++  ++      G
Sbjct: 414 MKEAGCVPNTCTYNAVLSLLGKKSRSNEM-IKMLCDMKSNGCSPNRATWNTMLALCGNKG 473

Query: 489 WHEKAYLTFENMQREGLKPSIETYTTLLDAFRRAGDTVALMKIWKLMIREKIEGTRVTFN 548
             +     F  M+  G +P  +T+ TL+ A+ R G  V   K++  M R        T+N
Sbjct: 474 MDKFVNRVFREMKSCGFEPDRDTFNTLISAYGRCGSEVDASKMYGEMTRAGFNACVTTYN 533

Query: 549 ILLDGFAKQGHYVEARDVISEFGKIGLQPTVMTYNMLMNAYARGGQHAKMPQLLQEMAAR 608
            LL+  A++G +    +VIS+    G +PT  +Y++++  YA+GG +  + ++   +   
Sbjct: 534 ALLNALARKGDWRSGENVISDMKSKGFKPTETSYSLMLQCYAKGGNYLGIERIENRIKEG 593

Query: 609 ELKPDSVTYSTMIYAYVRVRDF---KRAFFYHKKMVKSGQVPDVKSYQKLRAI 657
           ++ P  +   T++ A  + R     +RAF   K   K G  PD+  +  + +I
Sbjct: 594 QIFPSWMLLRTLLLANFKCRALAGSERAFTLFK---KHGYKPDMVIFNSMLSI 640

BLAST of Moc03g30020 vs. TAIR 10
Match: AT2G41720.2 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 169.1 bits (427), Expect = 1.3e-41
Identity = 122/458 (26.64%), Postives = 210/458 (45.85%), Query Frame = 0

Query: 228 YSLIFPSLGRAGMGEKIMVLFKNLPLKKEFQDVHVYNSAMSGLMFCNRYNDACKVYEAME 287
           Y  +  + GRAG     M L  ++           YN+ ++       + +A +V + M 
Sbjct: 49  YDALINAHGRAGQWRWAMNLMDDMLRAAIAPSRSTYNNLINACGSSGNWREALEVCKKMT 108

Query: 288 TNNVNPDHVTCSIMITVMRKIGRSAKDSWDYFEKMNKKGVKWSPEVLGALIKSFC--DEG 347
            N V PD VT +I+++   K GR    +  YFE M  KG K  P+     I  +C    G
Sbjct: 109 DNGVGPDLVTHNIVLSAY-KSGRQYSKALSYFELM--KGAKVRPDTTTFNIIIYCLSKLG 168

Query: 348 LKSQALIIQLEMEKKGV--ASNAIVYNTILDAFSKSNQIEEAEGLFAEMKAKGVKPTSAT 407
             SQAL +   M +K      + + + +I+  +S   +IE    +F  M A+G+KP   +
Sbjct: 169 QSSQALDLFNSMREKRAECRPDVVTFTSIMHLYSVKGEIENCRAVFEAMVAEGLKPNIVS 228

Query: 408 YNVLMDAYSRRMQPEVVEKLMLEMKDVGFEPNVKSYTCLISAYGRQKKMSDMAADAFLRM 467
           YN LM AY+          ++ ++K  G  P+V SYTCL+++YGR ++    A + FL M
Sbjct: 229 YNALMGAYAVHGMSGTALSVLGDIKQNGIIPDVVSYTCLLNSYGRSRQ-PGKAKEVFLMM 288

Query: 468 KKNGIRPTSHSYTALIHAYSVSGWHEKAYLTFENMQREGLKPSIETYTTLLDAFRR---- 527
           +K   +P   +Y ALI AY  +G+  +A   F  M+++G+KP++ +  TLL A  R    
Sbjct: 289 RKERRKPNVVTYNALIDAYGSNGFLAEAVEIFRQMEQDGIKPNVVSVCTLLAACSRSKKK 348

Query: 528 -------------------------------AGDTVALMKIWKLMIREKIEGTRVTFNIL 587
                                          A +    + +++ M ++K++   VTF IL
Sbjct: 349 VNVDTVLSAAQSRGINLNTAAYNSAIGSYINAAELEKAIALYQSMRKKKVKADSVTFTIL 408

Query: 588 LDGFAKQGHYVEARDVISEFGKIGLQPTVMTYNMLMNAYARGGQHAKMPQLLQEMAAREL 647
           + G  +   Y EA   + E   + +  T   Y+ ++ AY++ GQ  +   +  +M     
Sbjct: 409 ISGSCRMSKYPEAISYLKEMEDLSIPLTKEVYSSVLCAYSKQGQVTEAESIFNQMKMAGC 468

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022137654.10.0e+00100.00pentatricopeptide repeat-containing protein At5g50280, chloroplastic [Momordica ... [more]
XP_038877040.10.0e+0086.33pentatricopeptide repeat-containing protein At5g50280, chloroplastic [Benincasa ... [more]
KAG6584120.10.0e+0085.57Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita a... [more]
XP_022924164.10.0e+0085.15pentatricopeptide repeat-containing protein At5g50280, chloroplastic [Cucurbita ... [more]
XP_023001232.10.0e+0085.29pentatricopeptide repeat-containing protein At5g50280, chloroplastic [Cucurbita ... [more]
Match NameE-valueIdentityDescription
Q9FGR71.8e-24263.83Pentatricopeptide repeat-containing protein At5g50280, chloroplastic OS=Arabidop... [more]
Q9LYZ91.1e-4526.64Pentatricopeptide repeat-containing protein At5g02860 OS=Arabidopsis thaliana OX... [more]
Q9FIX33.0e-4328.05Pentatricopeptide repeat-containing protein At5g39710 OS=Arabidopsis thaliana OX... [more]
O646242.6e-4224.95Pentatricopeptide repeat-containing protein At2g18940, chloroplastic OS=Arabidop... [more]
Q76C991.4e-4028.06Protein Rf1, mitochondrial OS=Oryza sativa subsp. indica OX=39946 GN=Rf1 PE=2 SV... [more]
Match NameE-valueIdentityDescription
A0A6J1C7V40.0e+00100.00pentatricopeptide repeat-containing protein At5g50280, chloroplastic OS=Momordic... [more]
A0A6J1E8D20.0e+0085.15pentatricopeptide repeat-containing protein At5g50280, chloroplastic OS=Cucurbit... [more]
A0A6J1KFZ00.0e+0085.29pentatricopeptide repeat-containing protein At5g50280, chloroplastic OS=Cucurbit... [more]
A0A5A7UP110.0e+0085.11Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3AY380.0e+0085.09pentatricopeptide repeat-containing protein At5g50280, chloroplastic OS=Cucumis ... [more]
Match NameE-valueIdentityDescription
AT5G50280.11.3e-24363.83Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G02860.17.9e-4726.64Pentatricopeptide repeat (PPR) superfamily protein [more]
AT5G39710.12.1e-4428.05Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G18940.11.8e-4324.95Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G41720.21.3e-4126.64Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 312..431
e-value: 1.8E-27
score: 98.6
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 159..311
e-value: 3.9E-20
score: 74.2
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 447..556
e-value: 1.9E-26
score: 94.5
coord: 557..669
e-value: 3.4E-23
score: 83.9
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 251..641
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 543..567
e-value: 2.4E-4
score: 19.1
coord: 473..506
e-value: 1.8E-6
score: 25.7
coord: 613..647
e-value: 7.8E-7
score: 26.9
coord: 438..471
e-value: 1.8E-6
score: 25.8
coord: 579..612
e-value: 2.4E-7
score: 28.5
coord: 403..436
e-value: 1.5E-6
score: 26.0
coord: 509..539
e-value: 4.5E-5
score: 21.4
coord: 367..400
e-value: 5.6E-9
score: 33.6
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 423..482
e-value: 2.6E-11
score: 43.4
coord: 495..550
e-value: 1.5E-12
score: 47.3
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 575..623
e-value: 2.4E-11
score: 43.7
coord: 365..412
e-value: 5.6E-15
score: 55.3
coord: 259..304
e-value: 9.0E-11
score: 41.8
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 611..645
score: 11.32308
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 400..434
score: 10.89559
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 365..399
score: 13.230347
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 576..610
score: 10.89559
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 506..540
score: 8.61564
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 471..505
score: 11.268274
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 259..293
score: 9.930995
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 541..575
score: 10.457138
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 435..470
score: 11.268274
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 687..714
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 72..86
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 65..86
NoneNo IPR availablePANTHERPTHR47447OS03G0856100 PROTEINcoord: 1..705
NoneNo IPR availablePANTHERPTHR47447:SF5SUBFAMILY NOT NAMEDcoord: 1..705

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Moc03g30020.1Moc03g30020.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding