Moc08g41880 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc08g41880
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Pentatricopeptide repeat-containing protein
Location: chr8: 32162952 .. 32165111 (+)
RNA-Seq Expression: Moc08g41880
Synteny: Moc08g41880
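The Location field can be turned into a feature length with a few lines of Python. This is a minimal sketch, not part of the database: the helper name `parse_location` and the regular expression are illustrative, and the coordinates are assumed to be 1-based and inclusive.

```python
import re

# Assumed layout of the Location value: "<seqid>: <start> .. <end> (<strand>)"
LOCATION_RE = re.compile(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Return (seqid, start, end, strand, length) for a Location string."""
    match = LOCATION_RE.search(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    # Assumption: coordinates are 1-based and inclusive, hence the +1.
    return seqid, start, end, strand, end - start + 1


seqid, start, end, strand, length = parse_location("chr8: 32162952 .. 32165111 (+)")
print(seqid, strand, length)  # chr8 + 2160
```

Under that assumption the feature spans 2160 bp on the plus strand, which is a multiple of three.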
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGATACTGTTATTAAAAGTTCGGAAACCATATTTGACAGTTGACCTACGGCGAGGGTACTCTGCTCCCAAACGGAGCAAGTTCGACGGCGGTGGCTCATTCTCTAACAAAAACCAGTCGCCGTCTCCTAATCTAATTTCATGGATGAAACTCAAATGGGTATTCCAAAATCTGAGTTCCCGTCTTCCCTCTTGGGCTTCCTCTCTAACCTCCCCTTTCAGAAACCGATTCCTTCAAACCCAATTTGCAGAATCCTCCTCAAGATTCGTCCTCAATCATATAGACACAAGCCTCTTCTTATCCATTTGTGGAAGAGAAAATCTCCTCCATTTGGGCTCCTCCCTCCATGCCTCCATCATCAAGAGCTTCCAGCTCGCCAACCATGAAAATGGGGTCGTCATTGCGAATTCTCTCATCTCCATGTACCAGAGGTGCGGTAAATTGGCCGATGCAGTCAAGGTGTTTGATGAAATGTCTAGGAGAGATACTGTATCATGGAACGCATTGATCGCTGGGTTTATGAGAAATGGGGAGTCTTGTGTTGGTTTTAGCTACTTTAAAGCCATGTGTTTAGTTGGTGATTGTGAATTTGACAAAGCAACTTTGACGACGATTTTATCTGCTTGTGATGGCTTTGAGCTCTGTTATGTAATCAAAATGATTCACGGTTTGGCGTTTCTGAGTGGTTTTGTGCAAGAAATTACTGTGGGAAATGCTCTGATTAGTTCGTATTTTAAATGTGGATGTGTTGGTTTTGGGAGGCAAGTTTTTGATGAGATGGAGGAGAGAAATGTCATTACTTGGACAGCAGTGATCTCTGGCCTGGCTCATAATGGGCACCATGAGCACAGCCTGAAGCTTTTTAAGGAGATGATGAGTTGCGGCTCTGTGGAGCCAAATTCTTTAACTTATTTGAGTTTACTTACTGCTTGTTCTGGTTTGGAGGCACTAGAGGAAGGGCGTCAAATTCATGGTCTTATTTTGAAGTTGGGAATTCAGTCAGATTTGTGCATTGAGAGTGCTCTGATGGATATGTACTCGAAATCTGGAAGCATTGGATATGCTTGGAAGATTTTTGAGTCGGCTGAGGAACTTGATATGGTTTCATTGACTGTTATACTTGCAGGGTTCACTCAGAATGGGTGTGAGGATGAAGCCATTAGGATCTTTCTGAAAATGTTGAAGGTGGGGATCTTGATCGATGAGAATGTTGTTTCGGCCGCTCTTGGGGTGTTTGGTGCTGATACATCTTTGAGGCTGGGTCAGCAAGTTCACTCATTTGTTGTCAAAAGGAACTTTAGCTGCAATCCTTTTGTTGGCAATGGGCTTATAAACATGTACTCCAAGTGTGGAGCATTGGACGAGTCGGTCAAGGTCTTCGATAGGATGCGAGAAAGGAACTCGGTCACGTGGAACTCCATGATTGCAGCGTTTGCCCGGCATGGAGATGGCTCGAAAGCTCTATGTCTTTACGAGAATATGAAACTGGAAGGTGCAAAGCCAACCGACGTCACATTTCTATCGTTACTCCATGCGTGTAGCCATGTTGGCTTAGTCAGAAAAGGAATGGAGTTCCTTGAATCTATGACAAAAGATCACGGGATGAATCCAAGGACCGAACACTATGCTTGTGTTGTTGACATGTTGGGGAGGGCAGGACTGGTCTCAGAAGCTAGAAAATTTATTGAGGAACTACCTGAAAGGCCAGGTTTACTCGTGTGGCAGGCATTGCTCGGCGCCTGCAGCCTCTATGGCGATTCTGAAACAGGGAAATACGCAGCCGAGCATCTGTTTTCGGAGTCTCCACATAGCCCGGTCCCATATGTTCTGTTAGCCAATATATATTCTTCTGAAGGGAATTGGAAGGAGAGAGCAAGGACAATTAGGAAGATGAAGGAGGTGGGCGTGGCCAAAGAAACTGGCATCAGTTGGATTGAGATTGACAAGAAAGTCCATAGTTTTACTGTTTGGGACAAAATGCATCCACAAGCTGAGACCAT
TTATGGAGTTTTGATGGAGCTTTTTGTATTCATGGTAGATGAAGGATATGTGCCTGATAAGAAGTTCATCCTCTACTACTTGGATTCTGATGGCAAGAGGGATCCGGCCCGTGACAGTCGTCCTAACCATCAAAGTTCCATAAAATCCGAGGTTGTTTGA

mRNA sequence

ATGATACTGTTATTAAAAGTTCGGAAACCATATTTGACAGTTGACCTACGGCGAGGGTACTCTGCTCCCAAACGGAGCAAGTTCGACGGCGGTGGCTCATTCTCTAACAAAAACCAGTCGCCGTCTCCTAATCTAATTTCATGGATGAAACTCAAATGGGTATTCCAAAATCTGAGTTCCCGTCTTCCCTCTTGGGCTTCCTCTCTAACCTCCCCTTTCAGAAACCGATTCCTTCAAACCCAATTTGCAGAATCCTCCTCAAGATTCGTCCTCAATCATATAGACACAAGCCTCTTCTTATCCATTTGTGGAAGAGAAAATCTCCTCCATTTGGGCTCCTCCCTCCATGCCTCCATCATCAAGAGCTTCCAGCTCGCCAACCATGAAAATGGGGTCGTCATTGCGAATTCTCTCATCTCCATGTACCAGAGGTGCGGTAAATTGGCCGATGCAGTCAAGGTGTTTGATGAAATGTCTAGGAGAGATACTGTATCATGGAACGCATTGATCGCTGGGTTTATGAGAAATGGGGAGTCTTGTGTTGGTTTTAGCTACTTTAAAGCCATGTGTTTAGTTGGTGATTGTGAATTTGACAAAGCAACTTTGACGACGATTTTATCTGCTTGTGATGGCTTTGAGCTCTGTTATGTAATCAAAATGATTCACGGTTTGGCGTTTCTGAGTGGTTTTGTGCAAGAAATTACTGTGGGAAATGCTCTGATTAGTTCGTATTTTAAATGTGGATGTGTTGGTTTTGGGAGGCAAGTTTTTGATGAGATGGAGGAGAGAAATGTCATTACTTGGACAGCAGTGATCTCTGGCCTGGCTCATAATGGGCACCATGAGCACAGCCTGAAGCTTTTTAAGGAGATGATGAGTTGCGGCTCTGTGGAGCCAAATTCTTTAACTTATTTGAGTTTACTTACTGCTTGTTCTGGTTTGGAGGCACTAGAGGAAGGGCGTCAAATTCATGGTCTTATTTTGAAGTTGGGAATTCAGTCAGATTTGTGCATTGAGAGTGCTCTGATGGATATGTACTCGAAATCTGGAAGCATTGGATATGCTTGGAAGATTTTTGAGTCGGCTGAGGAACTTGATATGGTTTCATTGACTGTTATACTTGCAGGGTTCACTCAGAATGGGTGTGAGGATGAAGCCATTAGGATCTTTCTGAAAATGTTGAAGGTGGGGATCTTGATCGATGAGAATGTTGTTTCGGCCGCTCTTGGGGTGTTTGGTGCTGATACATCTTTGAGGCTGGGTCAGCAAGTTCACTCATTTGTTGTCAAAAGGAACTTTAGCTGCAATCCTTTTGTTGGCAATGGGCTTATAAACATGTACTCCAAGTGTGGAGCATTGGACGAGTCGGTCAAGGTCTTCGATAGGATGCGAGAAAGGAACTCGGTCACGTGGAACTCCATGATTGCAGCGTTTGCCCGGCATGGAGATGGCTCGAAAGCTCTATGTCTTTACGAGAATATGAAACTGGAAGGTGCAAAGCCAACCGACGTCACATTTCTATCGTTACTCCATGCGTGTAGCCATGTTGGCTTAGTCAGAAAAGGAATGGAGTTCCTTGAATCTATGACAAAAGATCACGGGATGAATCCAAGGACCGAACACTATGCTTGTGTTGTTGACATGTTGGGGAGGGCAGGACTGGTCTCAGAAGCTAGAAAATTTATTGAGGAACTACCTGAAAGGCCAGGTTTACTCGTGTGGCAGGCATTGCTCGGCGCCTGCAGCCTCTATGGCGATTCTGAAACAGGGAAATACGCAGCCGAGCATCTGTTTTCGGAGTCTCCACATAGCCCGGTCCCATATGTTCTGTTAGCCAATATATATTCTTCTGAAGGGAATTGGAAGGAGAGAGCAAGGACAATTAGGAAGATGAAGGAGGTGGGCGTGGCCAAAGAAACTGGCATCAGTTGGATTGAGATTGACAAGAAAGTCCATAGTTTTACTGTTTGGGACAAAATGCATCCACAAGCTGAGACCAT
TTATGGAGTTTTGATGGAGCTTTTTGTATTCATGGTAGATGAAGGATATGTGCCTGATAAGAAGTTCATCCTCTACTACTTGGATTCTGATGGCAAGAGGGATCCGGCCCGTGACAGTCGTCCTAACCATCAAAGTTCCATAAAATCCGAGGTTGTTTGA

Coding sequence (CDS)

ATGATACTGTTATTAAAAGTTCGGAAACCATATTTGACAGTTGACCTACGGCGAGGGTACTCTGCTCCCAAACGGAGCAAGTTCGACGGCGGTGGCTCATTCTCTAACAAAAACCAGTCGCCGTCTCCTAATCTAATTTCATGGATGAAACTCAAATGGGTATTCCAAAATCTGAGTTCCCGTCTTCCCTCTTGGGCTTCCTCTCTAACCTCCCCTTTCAGAAACCGATTCCTTCAAACCCAATTTGCAGAATCCTCCTCAAGATTCGTCCTCAATCATATAGACACAAGCCTCTTCTTATCCATTTGTGGAAGAGAAAATCTCCTCCATTTGGGCTCCTCCCTCCATGCCTCCATCATCAAGAGCTTCCAGCTCGCCAACCATGAAAATGGGGTCGTCATTGCGAATTCTCTCATCTCCATGTACCAGAGGTGCGGTAAATTGGCCGATGCAGTCAAGGTGTTTGATGAAATGTCTAGGAGAGATACTGTATCATGGAACGCATTGATCGCTGGGTTTATGAGAAATGGGGAGTCTTGTGTTGGTTTTAGCTACTTTAAAGCCATGTGTTTAGTTGGTGATTGTGAATTTGACAAAGCAACTTTGACGACGATTTTATCTGCTTGTGATGGCTTTGAGCTCTGTTATGTAATCAAAATGATTCACGGTTTGGCGTTTCTGAGTGGTTTTGTGCAAGAAATTACTGTGGGAAATGCTCTGATTAGTTCGTATTTTAAATGTGGATGTGTTGGTTTTGGGAGGCAAGTTTTTGATGAGATGGAGGAGAGAAATGTCATTACTTGGACAGCAGTGATCTCTGGCCTGGCTCATAATGGGCACCATGAGCACAGCCTGAAGCTTTTTAAGGAGATGATGAGTTGCGGCTCTGTGGAGCCAAATTCTTTAACTTATTTGAGTTTACTTACTGCTTGTTCTGGTTTGGAGGCACTAGAGGAAGGGCGTCAAATTCATGGTCTTATTTTGAAGTTGGGAATTCAGTCAGATTTGTGCATTGAGAGTGCTCTGATGGATATGTACTCGAAATCTGGAAGCATTGGATATGCTTGGAAGATTTTTGAGTCGGCTGAGGAACTTGATATGGTTTCATTGACTGTTATACTTGCAGGGTTCACTCAGAATGGGTGTGAGGATGAAGCCATTAGGATCTTTCTGAAAATGTTGAAGGTGGGGATCTTGATCGATGAGAATGTTGTTTCGGCCGCTCTTGGGGTGTTTGGTGCTGATACATCTTTGAGGCTGGGTCAGCAAGTTCACTCATTTGTTGTCAAAAGGAACTTTAGCTGCAATCCTTTTGTTGGCAATGGGCTTATAAACATGTACTCCAAGTGTGGAGCATTGGACGAGTCGGTCAAGGTCTTCGATAGGATGCGAGAAAGGAACTCGGTCACGTGGAACTCCATGATTGCAGCGTTTGCCCGGCATGGAGATGGCTCGAAAGCTCTATGTCTTTACGAGAATATGAAACTGGAAGGTGCAAAGCCAACCGACGTCACATTTCTATCGTTACTCCATGCGTGTAGCCATGTTGGCTTAGTCAGAAAAGGAATGGAGTTCCTTGAATCTATGACAAAAGATCACGGGATGAATCCAAGGACCGAACACTATGCTTGTGTTGTTGACATGTTGGGGAGGGCAGGACTGGTCTCAGAAGCTAGAAAATTTATTGAGGAACTACCTGAAAGGCCAGGTTTACTCGTGTGGCAGGCATTGCTCGGCGCCTGCAGCCTCTATGGCGATTCTGAAACAGGGAAATACGCAGCCGAGCATCTGTTTTCGGAGTCTCCACATAGCCCGGTCCCATATGTTCTGTTAGCCAATATATATTCTTCTGAAGGGAATTGGAAGGAGAGAGCAAGGACAATTAGGAAGATGAAGGAGGTGGGCGTGGCCAAAGAAACTGGCATCAGTTGGATTGAGATTGACAAGAAAGTCCATAGTTTTACTGTTTGGGACAAAATGCATCCACAAGCTGAGACCAT
TTATGGAGTTTTGATGGAGCTTTTTGTATTCATGGTAGATGAAGGATATGTGCCTGATAAGAAGTTCATCCTCTACTACTTGGATTCTGATGGCAAGAGGGATCCGGCCCGTGACAGTCGTCCTAACCATCAAAGTTCCATAAAATCCGAGGTTGTTTGA

Protein sequence

MILLLKVRKPYLTVDLRRGYSAPKRSKFDGGGSFSNKNQSPSPNLISWMKLKWVFQNLSSRLPSWASSLTSPFRNRFLQTQFAESSSRFVLNHIDTSLFLSICGRENLLHLGSSLHASIIKSFQLANHENGVVIANSLISMYQRCGKLADAVKVFDEMSRRDTVSWNALIAGFMRNGESCVGFSYFKAMCLVGDCEFDKATLTTILSACDGFELCYVIKMIHGLAFLSGFVQEITVGNALISSYFKCGCVGFGRQVFDEMEERNVITWTAVISGLAHNGHHEHSLKLFKEMMSCGSVEPNSLTYLSLLTACSGLEALEEGRQIHGLILKLGIQSDLCIESALMDMYSKSGSIGYAWKIFESAEELDMVSLTVILAGFTQNGCEDEAIRIFLKMLKVGILIDENVVSAALGVFGADTSLRLGQQVHSFVVKRNFSCNPFVGNGLINMYSKCGALDESVKVFDRMRERNSVTWNSMIAAFARHGDGSKALCLYENMKLEGAKPTDVTFLSLLHACSHVGLVRKGMEFLESMTKDHGMNPRTEHYACVVDMLGRAGLVSEARKFIEELPERPGLLVWQALLGACSLYGDSETGKYAAEHLFSESPHSPVPYVLLANIYSSEGNWKERARTIRKMKEVGVAKETGISWIEIDKKVHSFTVWDKMHPQAETIYGVLMELFVFMVDEGYVPDKKFILYYLDSDGKRDPARDSRPNHQSSIKSEVV
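The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1); the function name `translate` is illustrative, and the database's own pipeline may work differently.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of three")
    # Codons with ambiguous bases (e.g. N) are reported as 'X'.
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3))


# First eight codons and last four codons of the CDS listed above.
print(translate("ATGATACTGTTATTAAAAGTTCGG"))  # MILLLKVR
print(translate("GAGGTTGTTTGA"))              # EVV*
```

Because the helper strips whitespace first, a CDS that is wrapped across several lines can be passed in unchanged.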
Homology
BLAST of Moc08g41880 vs. NCBI nr
Match: XP_022150755.1 (pentatricopeptide repeat-containing protein At3g05340 [Momordica charantia])

HSP 1 Score: 1357.4 bits (3512), Expect = 0.0e+00
Identity = 671/671 (100.00%), Positives = 671/671 (100.00%), Query Frame = 0

Query: 49  MKLKWVFQNLSSRLPSWASSLTSPFRNRFLQTQFAESSSRFVLNHIDTSLFLSICGRENL 108
           MKLKWVFQNLSSRLPSWASSLTSPFRNRFLQTQFAESSSRFVLNHIDTSLFLSICGRENL
Sbjct: 1   MKLKWVFQNLSSRLPSWASSLTSPFRNRFLQTQFAESSSRFVLNHIDTSLFLSICGRENL 60

Query: 109 LHLGSSLHASIIKSFQLANHENGVVIANSLISMYQRCGKLADAVKVFDEMSRRDTVSWNA 168
           LHLGSSLHASIIKSFQLANHENGVVIANSLISMYQRCGKLADAVKVFDEMSRRDTVSWNA
Sbjct: 61  LHLGSSLHASIIKSFQLANHENGVVIANSLISMYQRCGKLADAVKVFDEMSRRDTVSWNA 120

Query: 169 LIAGFMRNGESCVGFSYFKAMCLVGDCEFDKATLTTILSACDGFELCYVIKMIHGLAFLS 228
           LIAGFMRNGESCVGFSYFKAMCLVGDCEFDKATLTTILSACDGFELCYVIKMIHGLAFLS
Sbjct: 121 LIAGFMRNGESCVGFSYFKAMCLVGDCEFDKATLTTILSACDGFELCYVIKMIHGLAFLS 180

Query: 229 GFVQEITVGNALISSYFKCGCVGFGRQVFDEMEERNVITWTAVISGLAHNGHHEHSLKLF 288
           GFVQEITVGNALISSYFKCGCVGFGRQVFDEMEERNVITWTAVISGLAHNGHHEHSLKLF
Sbjct: 181 GFVQEITVGNALISSYFKCGCVGFGRQVFDEMEERNVITWTAVISGLAHNGHHEHSLKLF 240

Query: 289 KEMMSCGSVEPNSLTYLSLLTACSGLEALEEGRQIHGLILKLGIQSDLCIESALMDMYSK 348
           KEMMSCGSVEPNSLTYLSLLTACSGLEALEEGRQIHGLILKLGIQSDLCIESALMDMYSK
Sbjct: 241 KEMMSCGSVEPNSLTYLSLLTACSGLEALEEGRQIHGLILKLGIQSDLCIESALMDMYSK 300

Query: 349 SGSIGYAWKIFESAEELDMVSLTVILAGFTQNGCEDEAIRIFLKMLKVGILIDENVVSAA 408
           SGSIGYAWKIFESAEELDMVSLTVILAGFTQNGCEDEAIRIFLKMLKVGILIDENVVSAA
Sbjct: 301 SGSIGYAWKIFESAEELDMVSLTVILAGFTQNGCEDEAIRIFLKMLKVGILIDENVVSAA 360

Query: 409 LGVFGADTSLRLGQQVHSFVVKRNFSCNPFVGNGLINMYSKCGALDESVKVFDRMRERNS 468
           LGVFGADTSLRLGQQVHSFVVKRNFSCNPFVGNGLINMYSKCGALDESVKVFDRMRERNS
Sbjct: 361 LGVFGADTSLRLGQQVHSFVVKRNFSCNPFVGNGLINMYSKCGALDESVKVFDRMRERNS 420

Query: 469 VTWNSMIAAFARHGDGSKALCLYENMKLEGAKPTDVTFLSLLHACSHVGLVRKGMEFLES 528
           VTWNSMIAAFARHGDGSKALCLYENMKLEGAKPTDVTFLSLLHACSHVGLVRKGMEFLES
Sbjct: 421 VTWNSMIAAFARHGDGSKALCLYENMKLEGAKPTDVTFLSLLHACSHVGLVRKGMEFLES 480

Query: 529 MTKDHGMNPRTEHYACVVDMLGRAGLVSEARKFIEELPERPGLLVWQALLGACSLYGDSE 588
           MTKDHGMNPRTEHYACVVDMLGRAGLVSEARKFIEELPERPGLLVWQALLGACSLYGDSE
Sbjct: 481 MTKDHGMNPRTEHYACVVDMLGRAGLVSEARKFIEELPERPGLLVWQALLGACSLYGDSE 540

Query: 589 TGKYAAEHLFSESPHSPVPYVLLANIYSSEGNWKERARTIRKMKEVGVAKETGISWIEID 648
           TGKYAAEHLFSESPHSPVPYVLLANIYSSEGNWKERARTIRKMKEVGVAKETGISWIEID
Sbjct: 541 TGKYAAEHLFSESPHSPVPYVLLANIYSSEGNWKERARTIRKMKEVGVAKETGISWIEID 600

Query: 649 KKVHSFTVWDKMHPQAETIYGVLMELFVFMVDEGYVPDKKFILYYLDSDGKRDPARDSRP 708
           KKVHSFTVWDKMHPQAETIYGVLMELFVFMVDEGYVPDKKFILYYLDSDGKRDPARDSRP
Sbjct: 601 KKVHSFTVWDKMHPQAETIYGVLMELFVFMVDEGYVPDKKFILYYLDSDGKRDPARDSRP 660

Query: 709 NHQSSIKSEVV 720
           NHQSSIKSEVV
Sbjct: 661 NHQSSIKSEVV 671
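The percentages on each HSP summary line are the identity and positive counts divided by the alignment length. As a minimal sketch, the line can be parsed and the percentages recomputed as below; the function name `hsp_percentages` and the regular expression are illustrative, and the pattern also tolerates the misspelling "Postives" in case a copy of the report uses it.

```python
import re

# Matches e.g. "Identity = 567/654 (86.70%), Positives = 607/654 (92.81%)".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def hsp_percentages(line):
    """Return (percent identity, percent positives) recomputed from the counts."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, ident_len, _, positive, pos_len, _ = match.groups()
    return (
        100 * int(identical) / int(ident_len),
        100 * int(positive) / int(pos_len),
    )


identity, positives = hsp_percentages(
    "Identity = 567/654 (86.70%), Positives = 607/654 (92.81%), Query Frame = 0"
)
print(f"{identity:.2f} {positives:.2f}")  # 86.70 92.81
```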

BLAST of Moc08g41880 vs. NCBI nr
Match: XP_022946254.1 (pentatricopeptide repeat-containing protein At3g05340 isoform X1 [Cucurbita moschata])

HSP 1 Score: 1159.8 bits (2999), Expect = 0.0e+00
Identity = 567/654 (86.70%), Positives = 607/654 (92.81%), Query Frame = 0

Query: 49  MKLKWVFQNLSSRLPSWASSLTSPFRNRFLQTQFAESSSRFVLNHIDTSLFLSICGRENL 108
           MKLKWVFQ LSS+LPSWA+S  SPFRN+F Q  FAE+SS FVLNH+D S  LS+CGR+  
Sbjct: 1   MKLKWVFQKLSSKLPSWATSRISPFRNQFHQNPFAETSSTFVLNHVDPSFLLSVCGRDGH 60

Query: 109 LHLGSSLHASIIKSFQLANHENGVVIANSLISMYQRCGKLADAVKVFDEMSRRDTVSWNA 168
           L+LGSSLHASIIKSF+L+NHENGVVI NSLISMY+RCGKL DAVKVFDEM  RDTVSWNA
Sbjct: 61  LYLGSSLHASIIKSFELSNHENGVVIMNSLISMYERCGKLPDAVKVFDEMPTRDTVSWNA 120

Query: 169 LIAGFMRNGESCVGFSYFKAMCLVGDCEFDKATLTTILSACDGFELCYVIKMIHGLAFLS 228
           LI GFMRNGE C GFSYFKAMCLVGDC+FDKATLTTILSACDG E+C +IKM+HGL FLS
Sbjct: 121 LIGGFMRNGEFCAGFSYFKAMCLVGDCKFDKATLTTILSACDGLEMCCIIKMMHGLTFLS 180

Query: 229 GFVQEITVGNALISSYFKCGCVGFGRQVFDEMEERNVITWTAVISGLAHNGHHEHSLKLF 288
           G+ QEITVGNALISSYFKCGCVGFG+QVF EMEERNVITWTAVISGLA NG+HEHSL+LF
Sbjct: 181 GYKQEITVGNALISSYFKCGCVGFGKQVFYEMEERNVITWTAVISGLAQNGYHEHSLELF 240

Query: 289 KEMMSCGSVEPNSLTYLSLLTACSGLEALEEGRQIHGLILKLGIQSDLCIESALMDMYSK 348
           +EM+SCGSVEPNSLTYLSLLTACSGLEALEEG QIHGLILKLGIQSDLCI SALMDMYSK
Sbjct: 241 REMLSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300

Query: 349 SGSIGYAWKIFESAEELDMVSLTVILAGFTQNGCEDEAIRIFLKMLKVGILIDENVVSAA 408
            GSIG AWKIFESAEELDMVSLTVILAGFTQNGCE+EAI+IFLKMLK+GI IDENVVSA 
Sbjct: 301 CGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIEIDENVVSAV 360

Query: 409 LGVFGADTSLRLGQQVHSFVVKRNFSCNPFVGNGLINMYSKCGALDESVKVFDRMRERNS 468
           LGVFGADTSLRLGQQVHSF+VK+NFSCNPFV NGLINMYSKCGALDESVKVFDRM+ RNS
Sbjct: 361 LGVFGADTSLRLGQQVHSFIVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQNRNS 420

Query: 469 VTWNSMIAAFARHGDGSKALCLYENMKLEGAKPTDVTFLSLLHACSHVGLVRKGMEFLES 528
           VTWNSMIAAFARHGDG KAL LYENMKLEGAKPTD+TFLSLLHACSHVGLV KGMEFLES
Sbjct: 421 VTWNSMIAAFARHGDGLKALHLYENMKLEGAKPTDITFLSLLHACSHVGLVNKGMEFLES 480

Query: 529 MTKDHGMNPRTEHYACVVDMLGRAGLVSEARKFIEELPERPGLLVWQALLGACSLYGDSE 588
           MTKDH MNPR+EHYACVVDMLGRAGL+SEAR FIE+LPE+PGLLVWQALLGACSLYGDSE
Sbjct: 481 MTKDHRMNPRSEHYACVVDMLGRAGLLSEARTFIEKLPEQPGLLVWQALLGACSLYGDSE 540

Query: 589 TGKYAAEHLFSESPHSPVPYVLLANIYSSEGNWKERARTIRKMKEVGVAKETGISWIEID 648
            GKYAAEHLFSE+P+S VPYVLLANIYSSEGNWKERARTIRKMKE G+AKETGISWIEID
Sbjct: 541 MGKYAAEHLFSETPNSSVPYVLLANIYSSEGNWKERARTIRKMKETGMAKETGISWIEID 600

Query: 649 KKVHSFTVWDKMHPQAETIYGVLMELFVFMVDEGYVPDKKFILYYLDSDGKRDP 703
           KKVHSFTV DK HPQA+ IYGVLM+LFV MVDEGYVPDKKFIL+YLD D K++P
Sbjct: 601 KKVHSFTVGDKRHPQADIIYGVLMDLFVHMVDEGYVPDKKFILFYLDPDDKKEP 654

BLAST of Moc08g41880 vs. NCBI nr
Match: XP_022946256.1 (pentatricopeptide repeat-containing protein At3g05340 isoform X3 [Cucurbita moschata])

HSP 1 Score: 1159.8 bits (2999), Expect = 0.0e+00
Identity = 567/654 (86.70%), Positives = 607/654 (92.81%), Query Frame = 0

Query: 49  MKLKWVFQNLSSRLPSWASSLTSPFRNRFLQTQFAESSSRFVLNHIDTSLFLSICGRENL 108
           MKLKWVFQ LSS+LPSWA+S  SPFRN+F Q  FAE+SS FVLNH+D S  LS+CGR+  
Sbjct: 1   MKLKWVFQKLSSKLPSWATSRISPFRNQFHQNPFAETSSTFVLNHVDPSFLLSVCGRDGH 60

Query: 109 LHLGSSLHASIIKSFQLANHENGVVIANSLISMYQRCGKLADAVKVFDEMSRRDTVSWNA 168
           L+LGSSLHASIIKSF+L+NHENGVVI NSLISMY+RCGKL DAVKVFDEM  RDTVSWNA
Sbjct: 61  LYLGSSLHASIIKSFELSNHENGVVIMNSLISMYERCGKLPDAVKVFDEMPTRDTVSWNA 120

Query: 169 LIAGFMRNGESCVGFSYFKAMCLVGDCEFDKATLTTILSACDGFELCYVIKMIHGLAFLS 228
           LI GFMRNGE C GFSYFKAMCLVGDC+FDKATLTTILSACDG E+C +IKM+HGL FLS
Sbjct: 121 LIGGFMRNGEFCAGFSYFKAMCLVGDCKFDKATLTTILSACDGLEMCCIIKMMHGLTFLS 180

Query: 229 GFVQEITVGNALISSYFKCGCVGFGRQVFDEMEERNVITWTAVISGLAHNGHHEHSLKLF 288
           G+ QEITVGNALISSYFKCGCVGFG+QVF EMEERNVITWTAVISGLA NG+HEHSL+LF
Sbjct: 181 GYKQEITVGNALISSYFKCGCVGFGKQVFYEMEERNVITWTAVISGLAQNGYHEHSLELF 240

Query: 289 KEMMSCGSVEPNSLTYLSLLTACSGLEALEEGRQIHGLILKLGIQSDLCIESALMDMYSK 348
           +EM+SCGSVEPNSLTYLSLLTACSGLEALEEG QIHGLILKLGIQSDLCI SALMDMYSK
Sbjct: 241 REMLSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300

Query: 349 SGSIGYAWKIFESAEELDMVSLTVILAGFTQNGCEDEAIRIFLKMLKVGILIDENVVSAA 408
            GSIG AWKIFESAEELDMVSLTVILAGFTQNGCE+EAI+IFLKMLK+GI IDENVVSA 
Sbjct: 301 CGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIEIDENVVSAV 360

Query: 409 LGVFGADTSLRLGQQVHSFVVKRNFSCNPFVGNGLINMYSKCGALDESVKVFDRMRERNS 468
           LGVFGADTSLRLGQQVHSF+VK+NFSCNPFV NGLINMYSKCGALDESVKVFDRM+ RNS
Sbjct: 361 LGVFGADTSLRLGQQVHSFIVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQNRNS 420

Query: 469 VTWNSMIAAFARHGDGSKALCLYENMKLEGAKPTDVTFLSLLHACSHVGLVRKGMEFLES 528
           VTWNSMIAAFARHGDG KAL LYENMKLEGAKPTD+TFLSLLHACSHVGLV KGMEFLES
Sbjct: 421 VTWNSMIAAFARHGDGLKALHLYENMKLEGAKPTDITFLSLLHACSHVGLVNKGMEFLES 480

Query: 529 MTKDHGMNPRTEHYACVVDMLGRAGLVSEARKFIEELPERPGLLVWQALLGACSLYGDSE 588
           MTKDH MNPR+EHYACVVDMLGRAGL+SEAR FIE+LPE+PGLLVWQALLGACSLYGDSE
Sbjct: 481 MTKDHRMNPRSEHYACVVDMLGRAGLLSEARTFIEKLPEQPGLLVWQALLGACSLYGDSE 540

Query: 589 TGKYAAEHLFSESPHSPVPYVLLANIYSSEGNWKERARTIRKMKEVGVAKETGISWIEID 648
            GKYAAEHLFSE+P+S VPYVLLANIYSSEGNWKERARTIRKMKE G+AKETGISWIEID
Sbjct: 541 MGKYAAEHLFSETPNSSVPYVLLANIYSSEGNWKERARTIRKMKETGMAKETGISWIEID 600

Query: 649 KKVHSFTVWDKMHPQAETIYGVLMELFVFMVDEGYVPDKKFILYYLDSDGKRDP 703
           KKVHSFTV DK HPQA+ IYGVLM+LFV MVDEGYVPDKKFIL+YLD D K++P
Sbjct: 601 KKVHSFTVGDKRHPQADIIYGVLMDLFVHMVDEGYVPDKKFILFYLDPDDKKEP 654

BLAST of Moc08g41880 vs. NCBI nr
Match: XP_022946255.1 (pentatricopeptide repeat-containing protein At3g05340 isoform X2 [Cucurbita moschata])

HSP 1 Score: 1159.8 bits (2999), Expect = 0.0e+00
Identity = 567/654 (86.70%), Positives = 607/654 (92.81%), Query Frame = 0

Query: 49  MKLKWVFQNLSSRLPSWASSLTSPFRNRFLQTQFAESSSRFVLNHIDTSLFLSICGRENL 108
           MKLKWVFQ LSS+LPSWA+S  SPFRN+F Q  FAE+SS FVLNH+D S  LS+CGR+  
Sbjct: 1   MKLKWVFQKLSSKLPSWATSRISPFRNQFHQNPFAETSSTFVLNHVDPSFLLSVCGRDGH 60

Query: 109 LHLGSSLHASIIKSFQLANHENGVVIANSLISMYQRCGKLADAVKVFDEMSRRDTVSWNA 168
           L+LGSSLHASIIKSF+L+NHENGVVI NSLISMY+RCGKL DAVKVFDEM  RDTVSWNA
Sbjct: 61  LYLGSSLHASIIKSFELSNHENGVVIMNSLISMYERCGKLPDAVKVFDEMPTRDTVSWNA 120

Query: 169 LIAGFMRNGESCVGFSYFKAMCLVGDCEFDKATLTTILSACDGFELCYVIKMIHGLAFLS 228
           LI GFMRNGE C GFSYFKAMCLVGDC+FDKATLTTILSACDG E+C +IKM+HGL FLS
Sbjct: 121 LIGGFMRNGEFCAGFSYFKAMCLVGDCKFDKATLTTILSACDGLEMCCIIKMMHGLTFLS 180

Query: 229 GFVQEITVGNALISSYFKCGCVGFGRQVFDEMEERNVITWTAVISGLAHNGHHEHSLKLF 288
           G+ QEITVGNALISSYFKCGCVGFG+QVF EMEERNVITWTAVISGLA NG+HEHSL+LF
Sbjct: 181 GYKQEITVGNALISSYFKCGCVGFGKQVFYEMEERNVITWTAVISGLAQNGYHEHSLELF 240

Query: 289 KEMMSCGSVEPNSLTYLSLLTACSGLEALEEGRQIHGLILKLGIQSDLCIESALMDMYSK 348
           +EM+SCGSVEPNSLTYLSLLTACSGLEALEEG QIHGLILKLGIQSDLCI SALMDMYSK
Sbjct: 241 REMLSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300

Query: 349 SGSIGYAWKIFESAEELDMVSLTVILAGFTQNGCEDEAIRIFLKMLKVGILIDENVVSAA 408
            GSIG AWKIFESAEELDMVSLTVILAGFTQNGCE+EAI+IFLKMLK+GI IDENVVSA 
Sbjct: 301 CGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIEIDENVVSAV 360

Query: 409 LGVFGADTSLRLGQQVHSFVVKRNFSCNPFVGNGLINMYSKCGALDESVKVFDRMRERNS 468
           LGVFGADTSLRLGQQVHSF+VK+NFSCNPFV NGLINMYSKCGALDESVKVFDRM+ RNS
Sbjct: 361 LGVFGADTSLRLGQQVHSFIVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQNRNS 420

Query: 469 VTWNSMIAAFARHGDGSKALCLYENMKLEGAKPTDVTFLSLLHACSHVGLVRKGMEFLES 528
           VTWNSMIAAFARHGDG KAL LYENMKLEGAKPTD+TFLSLLHACSHVGLV KGMEFLES
Sbjct: 421 VTWNSMIAAFARHGDGLKALHLYENMKLEGAKPTDITFLSLLHACSHVGLVNKGMEFLES 480

Query: 529 MTKDHGMNPRTEHYACVVDMLGRAGLVSEARKFIEELPERPGLLVWQALLGACSLYGDSE 588
           MTKDH MNPR+EHYACVVDMLGRAGL+SEAR FIE+LPE+PGLLVWQALLGACSLYGDSE
Sbjct: 481 MTKDHRMNPRSEHYACVVDMLGRAGLLSEARTFIEKLPEQPGLLVWQALLGACSLYGDSE 540

Query: 589 TGKYAAEHLFSESPHSPVPYVLLANIYSSEGNWKERARTIRKMKEVGVAKETGISWIEID 648
            GKYAAEHLFSE+P+S VPYVLLANIYSSEGNWKERARTIRKMKE G+AKETGISWIEID
Sbjct: 541 MGKYAAEHLFSETPNSSVPYVLLANIYSSEGNWKERARTIRKMKETGMAKETGISWIEID 600

Query: 649 KKVHSFTVWDKMHPQAETIYGVLMELFVFMVDEGYVPDKKFILYYLDSDGKRDP 703
           KKVHSFTV DK HPQA+ IYGVLM+LFV MVDEGYVPDKKFIL+YLD D K++P
Sbjct: 601 KKVHSFTVGDKRHPQADIIYGVLMDLFVHMVDEGYVPDKKFILFYLDPDDKKEP 654

BLAST of Moc08g41880 vs. NCBI nr
Match: XP_022999024.1 (pentatricopeptide repeat-containing protein At3g05340 [Cucurbita maxima])

HSP 1 Score: 1156.7 bits (2991), Expect = 0.0e+00
Identity = 567/654 (86.70%), Positives = 605/654 (92.51%), Query Frame = 0

Query: 49  MKLKWVFQNLSSRLPSWASSLTSPFRNRFLQTQFAESSSRFVLNHIDTSLFLSICGRENL 108
           MKLKWVFQ LSS+LPSWASS  SPFRN+F Q  FAE+SS FVLNH+D S  LS+CGR+  
Sbjct: 1   MKLKWVFQRLSSKLPSWASSRISPFRNQFHQNPFAETSSTFVLNHVDPSFLLSVCGRDGH 60

Query: 109 LHLGSSLHASIIKSFQLANHENGVVIANSLISMYQRCGKLADAVKVFDEMSRRDTVSWNA 168
           L+LGSSLHASIIKSF+L+NHENGVVI NSLISMY+RCGKL DAVKVFDEM  RDTVSWNA
Sbjct: 61  LYLGSSLHASIIKSFELSNHENGVVIMNSLISMYERCGKLPDAVKVFDEMPTRDTVSWNA 120

Query: 169 LIAGFMRNGESCVGFSYFKAMCLVGDCEFDKATLTTILSACDGFELCYVIKMIHGLAFLS 228
           LI GFMRNGE   GFSYFKAMCLVGDC+FDKATLTTILSACDG E+C +I+M+HGL FLS
Sbjct: 121 LIGGFMRNGEFYAGFSYFKAMCLVGDCKFDKATLTTILSACDGSEMCCIIEMMHGLTFLS 180

Query: 229 GFVQEITVGNALISSYFKCGCVGFGRQVFDEMEERNVITWTAVISGLAHNGHHEHSLKLF 288
           G+ QEITVGNALISSYFKCGCVGFGRQ+F EMEERNVITWTAVISGLA NGHHEHSL+LF
Sbjct: 181 GYEQEITVGNALISSYFKCGCVGFGRQLFYEMEERNVITWTAVISGLAQNGHHEHSLELF 240

Query: 289 KEMMSCGSVEPNSLTYLSLLTACSGLEALEEGRQIHGLILKLGIQSDLCIESALMDMYSK 348
           +EMMSCGSVEPNSLTYLSLLTACSGLEALEEG QIHGLILKLGIQSDLCI SALMDMYSK
Sbjct: 241 REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300

Query: 349 SGSIGYAWKIFESAEELDMVSLTVILAGFTQNGCEDEAIRIFLKMLKVGILIDENVVSAA 408
            GSIG AWKIFESAEELDMVSLTVILAGFTQNGCE+EAI+IFLKMLK+GI ID NVVSA 
Sbjct: 301 CGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIEIDANVVSAV 360

Query: 409 LGVFGADTSLRLGQQVHSFVVKRNFSCNPFVGNGLINMYSKCGALDESVKVFDRMRERNS 468
           LGVFGADTSLRLGQQVHSF+VK+NFSCNPFV NGLINMYSKCGALDESVKVFDRM+ RNS
Sbjct: 361 LGVFGADTSLRLGQQVHSFIVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQNRNS 420

Query: 469 VTWNSMIAAFARHGDGSKALCLYENMKLEGAKPTDVTFLSLLHACSHVGLVRKGMEFLES 528
           VTWNSMIAAFARHGDGSKAL LYENMKLEGAKPTD+TFLSLLHACSHVGLV KGMEFLES
Sbjct: 421 VTWNSMIAAFARHGDGSKALHLYENMKLEGAKPTDITFLSLLHACSHVGLVNKGMEFLES 480

Query: 529 MTKDHGMNPRTEHYACVVDMLGRAGLVSEARKFIEELPERPGLLVWQALLGACSLYGDSE 588
           MTKDH MNPR+EHYACVVDMLGRAGL+SEAR FIE+LPE+PGLLVWQALLGACSLYGDSE
Sbjct: 481 MTKDHRMNPRSEHYACVVDMLGRAGLLSEARTFIEKLPEQPGLLVWQALLGACSLYGDSE 540

Query: 589 TGKYAAEHLFSESPHSPVPYVLLANIYSSEGNWKERARTIRKMKEVGVAKETGISWIEID 648
            GKYAAEHLFSE+P+S VPYVLLANIYSSEGNWKERARTIRKMKE G+AKETGISWIEID
Sbjct: 541 MGKYAAEHLFSETPYSSVPYVLLANIYSSEGNWKERARTIRKMKETGMAKETGISWIEID 600

Query: 649 KKVHSFTVWDKMHPQAETIYGVLMELFVFMVDEGYVPDKKFILYYLDSDGKRDP 703
           KKVHSFTV DK HPQA+ IYGVLM+LFV MVDEGYVPDK FIL+YLD D K++P
Sbjct: 601 KKVHSFTVGDKRHPQADIIYGVLMDLFVLMVDEGYVPDKNFILFYLDPDDKKEP 654

BLAST of Moc08g41880 vs. ExPASy Swiss-Prot
Match: Q9MA85 (Pentatricopeptide repeat-containing protein At3g05340 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E83 PE=2 SV=2)

HSP 1 Score: 752.3 bits (1941), Expect = 5.1e-216
Identity = 392/656 (59.76%), Positives = 468/656 (71.34%), Query Frame = 0

Query: 49  MKLKWVFQNLSSRLPSWASSLTSPFRNRFLQTQFAESSSRFVLNHIDTSLFLSICGRENL 108
           M  +WV Q L+S LPS  S++ SP +    Q+   + S+ F+LNH+D SL LSICGRE  
Sbjct: 1   MNSRWVIQKLTSHLPSCLSTVLSPSKILIRQSPNYQVST-FLLNHVDMSLLLSICGREGW 60

Query: 109 L-HLGSSLHASIIKSFQLAN------HENGVVIANSLISMYQRCGKLADAVKVFDEMSRR 168
             HLG  LHASIIK+ +         H N +V+ NSL+S+Y +CGKL DA+K+FDEM  R
Sbjct: 61  FPHLGPCLHASIIKNPEFFEPVDADIHRNALVVWNSLLSLYAKCGKLVDAIKLFDEMPMR 120

Query: 169 DTVSWNALIAGFMRNGESCVGFSYFKAMCLVGDCEFDKATLTTILSACDGFELCYVIKMI 228
           D +S N +  GF+RN E+  GF   K M  +G   FD ATLT +LS CD  E C V KMI
Sbjct: 121 DVISQNIVFYGFLRNRETESGFVLLKRM--LGSGGFDHATLTIVLSVCDTPEFCLVTKMI 180

Query: 229 HGLAFLSGFVQEITVGNALISSYFKCGCVGFGRQVFDEMEERNVITWTAVISGLAHNGHH 288
           H LA LSG+ +EI+VGN LI+SYFKCGC   GR VFD M  RNVIT TAVISGL  N  H
Sbjct: 181 HALAILSGYDKEISVGNKLITSYFKCGCSVSGRGVFDGMSHRNVITLTAVISGLIENELH 240

Query: 289 EHSLKLFKEMMSCGSVEPNSLTYLSLLTACSGLEALEEGRQIHGLILKLGIQSDLCIESA 348
           E  L+LF  +M  G V PNS+TYLS L ACSG + + EG+QIH L+ K GI+S+LCIESA
Sbjct: 241 EDGLRLF-SLMRRGLVHPNSVTYLSALAACSGSQRIVEGQQIHALLWKYGIESELCIESA 300

Query: 349 LMDMYSKSGSIGYAWKIFESAEELDMVSLTVILAGFTQNGCEDEAIRIFLKMLKVGILID 408
           LMDMYSK GSI  AW IFES  E+D VS+TVIL G  QNG E+EAI+ F++ML+ G+ ID
Sbjct: 301 LMDMYSKCGSIEDAWTIFESTTEVDEVSMTVILVGLAQNGSEEEAIQFFIRMLQAGVEID 360

Query: 409 ENVVSAALGVFGADTSLRLGQQVHSFVVKRNFSCNPFVGNGLINMYSKCGALDESVKVFD 468
            NVVSA LGV   D SL LG+Q+HS V+KR FS N FV NGLINMYSKCG L +S  VF 
Sbjct: 361 ANVVSAVLGVSFIDNSLGLGKQLHSLVIKRKFSGNTFVNNGLINMYSKCGDLTDSQTVFR 420

Query: 469 RMRERNSVTWNSMIAAFARHGDGSKALCLYENMKLEGAKPTDVTFLSLLHACSHVGLVRK 528
           RM +RN V+WNSMIAAFARHG G  AL LYE M     KPTDVTFLSLLHACSHVGL+ K
Sbjct: 421 RMPKRNYVSWNSMIAAFARHGHGLAALKLYEEMTTLEVKPTDVTFLSLLHACSHVGLIDK 480

Query: 529 GMEFLESMTKDHGMNPRTEHYACVVDMLGRAGLVSEARKFIEELPERPGLLVWQALLGAC 588
           G E L  M + HG+ PRTEHY C++DMLGRAGL+ EA+ FI+ LP +P   +WQALLGAC
Sbjct: 481 GRELLNEMKEVHGIEPRTEHYTCIIDMLGRAGLLKEAKSFIDSLPLKPDCKIWQALLGAC 540

Query: 589 SLYGDSETGKYAAEHLFSESPHSPVPYVLLANIYSSEGNWKERARTIRKMKEVGVAKETG 648
           S +GD+E G+YAAE LF  +P S   ++L+ANIYSS G WKERA+TI++MK +GV KETG
Sbjct: 541 SFHGDTEVGEYAAEQLFQTAPDSSSAHILIANIYSSRGKWKERAKTIKRMKAMGVTKETG 600

Query: 649 ISWIEIDKKVHSFTVWDKMHPQAETIYGVLMELFVFMVDEGYVPDKKFILYYLDSD 698
           IS IEI+ K HSF V DK+HPQAE IY VL  LF  MVDEGY PDK+FIL Y   D
Sbjct: 601 ISSIEIEHKTHSFVVEDKLHPQAEAIYDVLSGLFPVMVDEGYRPDKRFILCYTGDD 652

BLAST of Moc08g41880 vs. ExPASy Swiss-Prot
Match: Q5G1T1 (Pentatricopeptide repeat-containing protein At3g49170, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=EMB2261 PE=2 SV=1)

HSP 1 Score: 405.6 bits (1041), Expect = 1.2e-111
Identity = 229/621 (36.88%), Positives = 347/621 (55.88%), Query Frame = 0

Query: 103 CGRENLLHLGSSLHASIIKSFQLANHENGVVIANSLISMYQRC-GKLADAVKVFDEMSRR 162
           C   + + +G      ++K+    + E+ V +  SLI M+ +      +A KVFD+MS  
Sbjct: 176 CSNSDFVGVGRVTLGFLMKT---GHFESDVCVGCSLIDMFVKGENSFENAYKVFDKMSEL 235

Query: 163 DTVSWNALIAGFMRNGESCVGFSYFKAMCLVGDCEFDKATLTTILSACDGFELCYVIKMI 222
           + V+W  +I   M+ G       +F  M L G  E DK TL+++ SAC   E   + K +
Sbjct: 236 NVVTWTLMITRCMQMGFPREAIRFFLDMVLSG-FESDKFTLSSVFSACAELENLSLGKQL 295

Query: 223 HGLAFLSGFVQEITVGNALISSYFKC---GCVGFGRQVFDEMEERNVITWTAVISGLAHN 282
           H  A  SG V ++    +L+  Y KC   G V   R+VFD ME+ +V++WTA+I+G   N
Sbjct: 296 HSWAIRSGLVDDVEC--SLVDMYAKCSADGSVDDCRKVFDRMEDHSVMSWTALITGYMKN 355

Query: 283 GH-HEHSLKLFKEMMSCGSVEPNSLTYLSLLTACSGLEALEEGRQIHGLILKLGIQSDLC 342
            +    ++ LF EM++ G VEPN  T+ S   AC  L     G+Q+ G   K G+ S+  
Sbjct: 356 CNLATEAINLFSEMITQGHVEPNHFTFSSAFKACGNLSDPRVGKQVLGQAFKRGLASNSS 415

Query: 343 IESALMDMYSKSGSIGYAWKIFESAEELDMVSLTVILAGFTQNGCEDEAIRIFLKMLKVG 402
           + ++++ M+ KS  +  A + FES  E ++VS    L G  +N   ++A ++  ++ +  
Sbjct: 416 VANSVISMFVKSDRMEDAQRAFESLSEKNLVSYNTFLDGTCRNLNFEQAFKLLSEITERE 475

Query: 403 ILIDENVVSAALGVFGADTSLRLGQQVHSFVVKRNFSCNPFVGNGLINMYSKCGALDESV 462
           + +     ++ L       S+R G+Q+HS VVK   SCN  V N LI+MYSKCG++D + 
Sbjct: 476 LGVSAFTFASLLSGVANVGSIRKGEQIHSQVVKLGLSCNQPVCNALISMYSKCGSIDTAS 535

Query: 463 KVFDRMRERNSVTWNSMIAAFARHGDGSKALCLYENMKLEGAKPTDVTFLSLLHACSHVG 522
           +VF+ M  RN ++W SMI  FA+HG   + L  +  M  EG KP +VT++++L ACSHVG
Sbjct: 536 RVFNFMENRNVISWTSMITGFAKHGFAIRVLETFNQMIEEGVKPNEVTYVAILSACSHVG 595

Query: 523 LVRKGMEFLESMTKDHGMNPRTEHYACVVDMLGRAGLVSEARKFIEELPERPGLLVWQAL 582
           LV +G     SM +DH + P+ EHYAC+VD+L RAGL+++A +FI  +P +  +LVW+  
Sbjct: 596 LVSEGWRHFNSMYEDHKIKPKMEHYACMVDLLCRAGLLTDAFEFINTMPFQADVLVWRTF 655

Query: 583 LGACSLYGDSETGKYAAEHLFSESPHSPVPYVLLANIYSSEGNWKERARTIRKMKEVGVA 642
           LGAC ++ ++E GK AA  +    P+ P  Y+ L+NIY+  G W+E     RKMKE  + 
Sbjct: 656 LGACRVHSNTELGKLAARKILELDPNEPAAYIQLSNIYACAGKWEESTEMRRKMKERNLV 715

Query: 643 KETGISWIEIDKKVHSFTVWDKMHPQAETIYGVLMELFVFMVDEGYVPDKKFILYYLDSD 702
           KE G SWIE+  K+H F V D  HP A  IY  L  L   +   GYVPD   +L+ L+ +
Sbjct: 716 KEGGCSWIEVGDKIHKFYVGDTAHPNAHQIYDELDRLITEIKRCGYVPDTDLVLHKLEEE 775

Query: 703 GKRDPARDSRPNHQSSIKSEV 719
              D A   R  +Q S K  V
Sbjct: 776 --NDEAEKERLLYQHSEKIAV 788

BLAST of Moc08g41880 vs. ExPASy Swiss-Prot
Match: Q9FIB2 (Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H35 PE=3 SV=1)

HSP 1 Score: 402.1 bits (1032), Expect = 1.3e-110
Identity = 219/575 (38.09%), Positives = 332/575 (57.74%), Query Frame = 0

Query: 132 VVIANSLISMYQRCGKLADAVKVFDEMSRRDTVSWNALIAGFMRNGESCVGFSYFKAMCL 191
           V I N L++MY +CG +ADA +VF  M+ +D+VSWN++I G  +NG        +K+M  
Sbjct: 349 VGIGNGLVNMYAKCGSIADARRVFYFMTDKDSVSWNSMITGLDQNGCFIEAVERYKSM-R 408

Query: 192 VGDCEFDKATLTTILSACDGFELCYVIKMIHGLAFLSGFVQEITVGNALISSYFKCGCVG 251
             D      TL + LS+C   +   + + IHG +   G    ++V NAL++ Y + G + 
Sbjct: 409 RHDILPGSFTLISSLSSCASLKWAKLGQQIHGESLKLGIDLNVSVSNALMTLYAETGYLN 468

Query: 252 FGRQVFDEMEERNVITWTAVISGLAHNGHH-EHSLKLFKEMMSCGSVEPNSLTYLSLLTA 311
             R++F  M E + ++W ++I  LA +      ++  F      G  + N +T+ S+L+A
Sbjct: 469 ECRKIFSSMPEHDQVSWNSIIGALARSERSLPEAVVCFLNAQRAGQ-KLNRITFSSVLSA 528

Query: 312 CSGLEALEEGRQIHGLILKLGIQSDLCIESALMDMYSKSGSIGYAWKIF-ESAEELDMVS 371
            S L   E G+QIHGL LK  I  +   E+AL+  Y K G +    KIF   AE  D V+
Sbjct: 529 VSSLSFGELGKQIHGLALKNNIADEATTENALIACYGKCGEMDGCEKIFSRMAERRDNVT 588

Query: 372 LTVILAGFTQNGCEDEAIRIFLKMLKVGILIDENVVSAALGVFGADTSLRLGQQVHSFVV 431
              +++G+  N    +A+ +   ML+ G  +D  + +  L  F +  +L  G +VH+  V
Sbjct: 589 WNSMISGYIHNELLAKALDLVWFMLQTGQRLDSFMYATVLSAFASVATLERGMEVHACSV 648

Query: 432 KRNFSCNPFVGNGLINMYSKCGALDESVKVFDRMRERNSVTWNSMIAAFARHGDGSKALC 491
           +     +  VG+ L++MYSKCG LD +++ F+ M  RNS +WNSMI+ +ARHG G +AL 
Sbjct: 649 RACLESDVVVGSALVDMYSKCGRLDYALRFFNTMPVRNSYSWNSMISGYARHGQGEEALK 708

Query: 492 LYENMKLEGAKPTD-VTFLSLLHACSHVGLVRKGMEFLESMTKDHGMNPRTEHYACVVDM 551
           L+E MKL+G  P D VTF+ +L ACSH GL+ +G +  ESM+  +G+ PR EH++C+ D+
Sbjct: 709 LFETMKLDGQTPPDHVTFVGVLSACSHAGLLEEGFKHFESMSDSYGLAPRIEHFSCMADV 768

Query: 552 LGRAGLVSEARKFIEELPERPGLLVWQALLGACSLYG--DSETGKYAAEHLFSESPHSPV 611
           LGRAG + +   FIE++P +P +L+W+ +LGAC       +E GK AAE LF   P + V
Sbjct: 769 LGRAGELDKLEDFIEKMPMKPNVLIWRTVLGACCRANGRKAELGKKAAEMLFQLEPENAV 828

Query: 612 PYVLLANIYSSEGNWKERARTIRKMKEVGVAKETGISWIEIDKKVHSFTVWDKMHPQAET 671
            YVLL N+Y++ G W++  +  +KMK+  V KE G SW+ +   VH F   DK HP A+ 
Sbjct: 829 NYVLLGNMYAAGGRWEDLVKARKKMKDADVKKEAGYSWVTMKDGVHMFVAGDKSHPDADV 888

Query: 672 IYGVLMELFVFMVDEGYVPDKKFILYYLDSDGKRD 702
           IY  L EL   M D GYVP   F LY L+ + K +
Sbjct: 889 IYKKLKELNRKMRDAGYVPQTGFALYDLEQENKEE 921

BLAST of Moc08g41880 vs. ExPASy Swiss-Prot
Match: Q7Y211 (Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H81 PE=2 SV=2)

HSP 1 Score: 399.4 bits (1025), Expect = 8.4e-110
Identity = 216/606 (35.64%), Positives = 337/606 (55.61%), Query Frame = 0

Query: 109 LHLGSSLHASIIKSFQLANHENGVVIANSLISMYQRCGKLADAVKVFDEMSRRDTVSWNA 168
           L +G  +HA     + L   E    I N+L++MY + GKLA +  +      RD V+WN 
Sbjct: 218 LMMGKQVHA-----YGLRKGELNSFIINTLVAMYGKLGKLASSKVLLGSFGGRDLVTWNT 277

Query: 169 LIAGFMRNGESCVGFSYFKAMCLVGDCEFDKATLTTILSACDGFELCYVIKMIHGLAFLS 228
           +++   +N +      Y + M L G  E D+ T++++L AC   E+    K +H  A  +
Sbjct: 278 VLSSLCQNEQLLEALEYLREMVLEG-VEPDEFTISSVLPACSHLEMLRTGKELHAYALKN 337

Query: 229 GFVQEIT-VGNALISSYFKCGCVGFGRQVFDEMEERNVITWTAVISGLAHNGHHEHSLKL 288
           G + E + VG+AL+  Y  C  V  GR+VFD M +R +  W A+I+G + N H + +L L
Sbjct: 338 GSLDENSFVGSALVDMYCNCKQVLSGRRVFDGMFDRKIGLWNAMIAGYSQNEHDKEALLL 397

Query: 289 FKEMMSCGSVEPNSLTYLSLLTACSGLEALEEGRQIHGLILKLGIQSDLCIESALMDMYS 348
           F  M     +  NS T   ++ AC    A      IHG ++K G+  D  +++ LMDMYS
Sbjct: 398 FIGMEESAGLLANSTTMAGVVPACVRSGAFSRKEAIHGFVVKRGLDRDRFVQNTLMDMYS 457

Query: 349 KSGSIGYAWKIFESAEELDMVSLTVILAGFTQNGCEDEAIRIFLKM-----------LKV 408
           + G I  A +IF   E+ D+V+   ++ G+  +   ++A+ +  KM            +V
Sbjct: 458 RLGKIDIAMRIFGKMEDRDLVTWNTMITGYVFSEHHEDALLLLHKMQNLERKVSKGASRV 517

Query: 409 GILIDENVVSAALGVFGADTSLRLGQQVHSFVVKRNFSCNPFVGNGLINMYSKCGALDES 468
            +  +   +   L    A ++L  G+++H++ +K N + +  VG+ L++MY+KCG L  S
Sbjct: 518 SLKPNSITLMTILPSCAALSALAKGKEIHAYAIKNNLATDVAVGSALVDMYAKCGCLQMS 577

Query: 469 VKVFDRMRERNSVTWNSMIAAFARHGDGSKALCLYENMKLEGAKPTDVTFLSLLHACSHV 528
            KVFD++ ++N +TWN +I A+  HG+G +A+ L   M ++G KP +VTF+S+  ACSH 
Sbjct: 578 RKVFDQIPQKNVITWNVIIMAYGMHGNGQEAIDLLRMMMVQGVKPNEVTFISVFAACSHS 637

Query: 529 GLVRKGMEFLESMTKDHGMNPRTEHYACVVDMLGRAGLVSEARKFIEELP---ERPGLLV 588
           G+V +G+     M  D+G+ P ++HYACVVD+LGRAG + EA + +  +P    + G   
Sbjct: 638 GMVDEGLRIFYVMKPDYGVEPSSDHYACVVDLLGRAGRIKEAYQLMNMMPRDFNKAG--A 697

Query: 589 WQALLGACSLYGDSETGKYAAEHLFSESPHSPVPYVLLANIYSSEGNWKERARTIRKMKE 648
           W +LLGA  ++ + E G+ AA++L    P+    YVLLANIYSS G W +     R MKE
Sbjct: 698 WSSLLGASRIHNNLEIGEIAAQNLIQLEPNVASHYVLLANIYSSAGLWDKATEVRRNMKE 757

Query: 649 VGVAKETGISWIEIDKKVHSFTVWDKMHPQAETIYGVLMELFVFMVDEGYVPDKKFILYY 700
            GV KE G SWIE   +VH F   D  HPQ+E + G L  L+  M  EGYVPD   +L+ 
Sbjct: 758 QGVRKEPGCSWIEHGDEVHKFVAGDSSHPQSEKLSGYLETLWERMRKEGYVPDTSCVLHN 815

BLAST of Moc08g41880 vs. ExPASy Swiss-Prot
Match: Q9LFL5 (Pentatricopeptide repeat-containing protein At5g16860 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H92 PE=2 SV=1)

HSP 1 Score: 397.9 bits (1021), Expect = 2.4e-109
Identity = 218/645 (33.80%), Positives = 350/645 (54.26%), Query Frame = 0

Query: 103 CGRENLLHLGSSLHASIIKSFQLANHENGVVIANSLISMYQRCGKLADAVKVFDEMSRRD 162
           CG  + +  G S HA  + +  ++N    V + N+L++MY RC  L+DA KVFDEMS  D
Sbjct: 137 CGEISSVRCGESAHALSLVTGFISN----VFVGNALVAMYSRCRSLSDARKVFDEMSVWD 196

Query: 163 TVSWNALIAGFMRNGESCVGFSYFKAMCLVGDCEFDKATLTTILSACDGFELCYVIKMIH 222
            VSWN++I  + + G+  V    F  M     C  D  TL  +L  C       + K +H
Sbjct: 197 VVSWNSIIESYAKLGKPKVALEMFSRMTNEFGCRPDNITLVNVLPPCASLGTHSLGKQLH 256

Query: 223 GLAFLSGFVQEITVGNALISSYFKCGCVGFGRQVFDEMEERNVITWTAVISGLAHNGHHE 282
             A  S  +Q + VGN L+  Y KCG +     VF  M  ++V++W A+++G +  G  E
Sbjct: 257 CFAVTSEMIQNMFVGNCLVDMYAKCGMMDEANTVFSNMSVKDVVSWNAMVAGYSQIGRFE 316

Query: 283 HSLKLFKEM----------------------------------MSCGSVEPNSLTYLSLL 342
            +++LF++M                                  M    ++PN +T +S+L
Sbjct: 317 DAVRLFEKMQEEKIKMDVVTWSAAISGYAQRGLGYEALGVCRQMLSSGIKPNEVTLISVL 376

Query: 343 TACSGLEALEEGRQIHGLILKLGIQ-------SDLCIESALMDMYSKSGSIGYAWKIFE- 402
           + C+ + AL  G++IH   +K  I         +  + + L+DMY+K   +  A  +F+ 
Sbjct: 377 SGCASVGALMHGKEIHCYAIKYPIDLRKNGHGDENMVINQLIDMYAKCKKVDTARAMFDS 436

Query: 403 -SAEELDMVSLTVILAGFTQNGCEDEAIRIFLKMLKVGILIDEN--VVSAALGVFGADTS 462
            S +E D+V+ TV++ G++Q+G  ++A+ +  +M +       N   +S AL    +  +
Sbjct: 437 LSPKERDVVTWTVMIGGYSQHGDANKALELLSEMFEEDCQTRPNAFTISCALVACASLAA 496

Query: 463 LRLGQQVHSFVVKRNFSCNP-FVGNGLINMYSKCGALDESVKVFDRMRERNSVTWNSMIA 522
           LR+G+Q+H++ ++   +  P FV N LI+MY+KCG++ ++  VFD M  +N VTW S++ 
Sbjct: 497 LRIGKQIHAYALRNQQNAVPLFVSNCLIDMYAKCGSISDARLVFDNMMAKNEVTWTSLMT 556

Query: 523 AFARHGDGSKALCLYENMKLEGAKPTDVTFLSLLHACSHVGLVRKGMEFLESMTKDHGMN 582
            +  HG G +AL +++ M+  G K   VT L +L+ACSH G++ +GME+   M    G++
Sbjct: 557 GYGMHGYGEEALGIFDEMRRIGFKLDGVTLLVVLYACSHSGMIDQGMEYFNRMKTVFGVS 616

Query: 583 PRTEHYACVVDMLGRAGLVSEARKFIEELPERPGLLVWQALLGACSLYGDSETGKYAAEH 642
           P  EHYAC+VD+LGRAG ++ A + IEE+P  P  +VW A L  C ++G  E G+YAAE 
Sbjct: 617 PGPEHYACLVDLLGRAGRLNAALRLIEEMPMEPPPVVWVAFLSCCRIHGKVELGEYAAEK 676

Query: 643 LFSESPHSPVPYVLLANIYSSEGNWKERARTIRKMKEVGVAKETGISWIEIDKKVHSFTV 702
           +   + +    Y LL+N+Y++ G WK+  R    M+  GV K  G SW+E  K   +F V
Sbjct: 677 ITELASNHDGSYTLLSNLYANAGRWKDVTRIRSLMRHKGVKKRPGCSWVEGIKGTTTFFV 736
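
The percentages in an HSP summary line are the raw counts divided by the alignment length. The sketch below recomputes them for the Q9LFL5 hit above (218/645 identical, 350/645 similar). The helper name `hsp_percentages` and the regular expression are illustrative assumptions, not part of the database output; the pattern also accepts the variant spelling "Postives" in case an export contains it.

```python
import re

# Hypothetical helper (not part of the database output): parse one HSP
# summary line and recompute both percentages from the raw counts.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) rounded to two decimals."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return (
        round(100 * int(ident) / int(ident_len), 2),
        round(100 * int(pos) / int(pos_len), 2),
    )

line = "Identity = 218/645 (33.80%), Positives = 350/645 (54.26%), Query Frame = 0"
print(hsp_percentages(line))
```

The recomputed values agree with the reported 33.80% and 54.26%, which is a quick consistency check when copying hits into a spreadsheet.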

BLAST of Moc08g41880 vs. ExPASy TrEMBL
Match: A0A6J1D9D8 (pentatricopeptide repeat-containing protein At3g05340 OS=Momordica charantia OX=3673 GN=LOC111018815 PE=3 SV=1)

HSP 1 Score: 1357.4 bits (3512), Expect = 0.0e+00
Identity = 671/671 (100.00%), Positives = 671/671 (100.00%), Query Frame = 0

Query: 49  MKLKWVFQNLSSRLPSWASSLTSPFRNRFLQTQFAESSSRFVLNHIDTSLFLSICGRENL 108
           MKLKWVFQNLSSRLPSWASSLTSPFRNRFLQTQFAESSSRFVLNHIDTSLFLSICGRENL
Sbjct: 1   MKLKWVFQNLSSRLPSWASSLTSPFRNRFLQTQFAESSSRFVLNHIDTSLFLSICGRENL 60

Query: 109 LHLGSSLHASIIKSFQLANHENGVVIANSLISMYQRCGKLADAVKVFDEMSRRDTVSWNA 168
           LHLGSSLHASIIKSFQLANHENGVVIANSLISMYQRCGKLADAVKVFDEMSRRDTVSWNA
Sbjct: 61  LHLGSSLHASIIKSFQLANHENGVVIANSLISMYQRCGKLADAVKVFDEMSRRDTVSWNA 120

Query: 169 LIAGFMRNGESCVGFSYFKAMCLVGDCEFDKATLTTILSACDGFELCYVIKMIHGLAFLS 228
           LIAGFMRNGESCVGFSYFKAMCLVGDCEFDKATLTTILSACDGFELCYVIKMIHGLAFLS
Sbjct: 121 LIAGFMRNGESCVGFSYFKAMCLVGDCEFDKATLTTILSACDGFELCYVIKMIHGLAFLS 180

Query: 229 GFVQEITVGNALISSYFKCGCVGFGRQVFDEMEERNVITWTAVISGLAHNGHHEHSLKLF 288
           GFVQEITVGNALISSYFKCGCVGFGRQVFDEMEERNVITWTAVISGLAHNGHHEHSLKLF
Sbjct: 181 GFVQEITVGNALISSYFKCGCVGFGRQVFDEMEERNVITWTAVISGLAHNGHHEHSLKLF 240

Query: 289 KEMMSCGSVEPNSLTYLSLLTACSGLEALEEGRQIHGLILKLGIQSDLCIESALMDMYSK 348
           KEMMSCGSVEPNSLTYLSLLTACSGLEALEEGRQIHGLILKLGIQSDLCIESALMDMYSK
Sbjct: 241 KEMMSCGSVEPNSLTYLSLLTACSGLEALEEGRQIHGLILKLGIQSDLCIESALMDMYSK 300

Query: 349 SGSIGYAWKIFESAEELDMVSLTVILAGFTQNGCEDEAIRIFLKMLKVGILIDENVVSAA 408
           SGSIGYAWKIFESAEELDMVSLTVILAGFTQNGCEDEAIRIFLKMLKVGILIDENVVSAA
Sbjct: 301 SGSIGYAWKIFESAEELDMVSLTVILAGFTQNGCEDEAIRIFLKMLKVGILIDENVVSAA 360

Query: 409 LGVFGADTSLRLGQQVHSFVVKRNFSCNPFVGNGLINMYSKCGALDESVKVFDRMRERNS 468
           LGVFGADTSLRLGQQVHSFVVKRNFSCNPFVGNGLINMYSKCGALDESVKVFDRMRERNS
Sbjct: 361 LGVFGADTSLRLGQQVHSFVVKRNFSCNPFVGNGLINMYSKCGALDESVKVFDRMRERNS 420

Query: 469 VTWNSMIAAFARHGDGSKALCLYENMKLEGAKPTDVTFLSLLHACSHVGLVRKGMEFLES 528
           VTWNSMIAAFARHGDGSKALCLYENMKLEGAKPTDVTFLSLLHACSHVGLVRKGMEFLES
Sbjct: 421 VTWNSMIAAFARHGDGSKALCLYENMKLEGAKPTDVTFLSLLHACSHVGLVRKGMEFLES 480

Query: 529 MTKDHGMNPRTEHYACVVDMLGRAGLVSEARKFIEELPERPGLLVWQALLGACSLYGDSE 588
           MTKDHGMNPRTEHYACVVDMLGRAGLVSEARKFIEELPERPGLLVWQALLGACSLYGDSE
Sbjct: 481 MTKDHGMNPRTEHYACVVDMLGRAGLVSEARKFIEELPERPGLLVWQALLGACSLYGDSE 540

Query: 589 TGKYAAEHLFSESPHSPVPYVLLANIYSSEGNWKERARTIRKMKEVGVAKETGISWIEID 648
           TGKYAAEHLFSESPHSPVPYVLLANIYSSEGNWKERARTIRKMKEVGVAKETGISWIEID
Sbjct: 541 TGKYAAEHLFSESPHSPVPYVLLANIYSSEGNWKERARTIRKMKEVGVAKETGISWIEID 600

Query: 649 KKVHSFTVWDKMHPQAETIYGVLMELFVFMVDEGYVPDKKFILYYLDSDGKRDPARDSRP 708
           KKVHSFTVWDKMHPQAETIYGVLMELFVFMVDEGYVPDKKFILYYLDSDGKRDPARDSRP
Sbjct: 601 KKVHSFTVWDKMHPQAETIYGVLMELFVFMVDEGYVPDKKFILYYLDSDGKRDPARDSRP 660

Query: 709 NHQSSIKSEVV 720
           NHQSSIKSEVV
Sbjct: 661 NHQSSIKSEVV 671

BLAST of Moc08g41880 vs. ExPASy TrEMBL
Match: A0A6J1G350 (pentatricopeptide repeat-containing protein At3g05340 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111450393 PE=3 SV=1)

HSP 1 Score: 1159.8 bits (2999), Expect = 0.0e+00
Identity = 567/654 (86.70%), Positives = 607/654 (92.81%), Query Frame = 0

Query: 49  MKLKWVFQNLSSRLPSWASSLTSPFRNRFLQTQFAESSSRFVLNHIDTSLFLSICGRENL 108
           MKLKWVFQ LSS+LPSWA+S  SPFRN+F Q  FAE+SS FVLNH+D S  LS+CGR+  
Sbjct: 1   MKLKWVFQKLSSKLPSWATSRISPFRNQFHQNPFAETSSTFVLNHVDPSFLLSVCGRDGH 60

Query: 109 LHLGSSLHASIIKSFQLANHENGVVIANSLISMYQRCGKLADAVKVFDEMSRRDTVSWNA 168
           L+LGSSLHASIIKSF+L+NHENGVVI NSLISMY+RCGKL DAVKVFDEM  RDTVSWNA
Sbjct: 61  LYLGSSLHASIIKSFELSNHENGVVIMNSLISMYERCGKLPDAVKVFDEMPTRDTVSWNA 120

Query: 169 LIAGFMRNGESCVGFSYFKAMCLVGDCEFDKATLTTILSACDGFELCYVIKMIHGLAFLS 228
           LI GFMRNGE C GFSYFKAMCLVGDC+FDKATLTTILSACDG E+C +IKM+HGL FLS
Sbjct: 121 LIGGFMRNGEFCAGFSYFKAMCLVGDCKFDKATLTTILSACDGLEMCCIIKMMHGLTFLS 180

Query: 229 GFVQEITVGNALISSYFKCGCVGFGRQVFDEMEERNVITWTAVISGLAHNGHHEHSLKLF 288
           G+ QEITVGNALISSYFKCGCVGFG+QVF EMEERNVITWTAVISGLA NG+HEHSL+LF
Sbjct: 181 GYKQEITVGNALISSYFKCGCVGFGKQVFYEMEERNVITWTAVISGLAQNGYHEHSLELF 240

Query: 289 KEMMSCGSVEPNSLTYLSLLTACSGLEALEEGRQIHGLILKLGIQSDLCIESALMDMYSK 348
           +EM+SCGSVEPNSLTYLSLLTACSGLEALEEG QIHGLILKLGIQSDLCI SALMDMYSK
Sbjct: 241 REMLSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300

Query: 349 SGSIGYAWKIFESAEELDMVSLTVILAGFTQNGCEDEAIRIFLKMLKVGILIDENVVSAA 408
            GSIG AWKIFESAEELDMVSLTVILAGFTQNGCE+EAI+IFLKMLK+GI IDENVVSA 
Sbjct: 301 CGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIEIDENVVSAV 360

Query: 409 LGVFGADTSLRLGQQVHSFVVKRNFSCNPFVGNGLINMYSKCGALDESVKVFDRMRERNS 468
           LGVFGADTSLRLGQQVHSF+VK+NFSCNPFV NGLINMYSKCGALDESVKVFDRM+ RNS
Sbjct: 361 LGVFGADTSLRLGQQVHSFIVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQNRNS 420

Query: 469 VTWNSMIAAFARHGDGSKALCLYENMKLEGAKPTDVTFLSLLHACSHVGLVRKGMEFLES 528
           VTWNSMIAAFARHGDG KAL LYENMKLEGAKPTD+TFLSLLHACSHVGLV KGMEFLES
Sbjct: 421 VTWNSMIAAFARHGDGLKALHLYENMKLEGAKPTDITFLSLLHACSHVGLVNKGMEFLES 480

Query: 529 MTKDHGMNPRTEHYACVVDMLGRAGLVSEARKFIEELPERPGLLVWQALLGACSLYGDSE 588
           MTKDH MNPR+EHYACVVDMLGRAGL+SEAR FIE+LPE+PGLLVWQALLGACSLYGDSE
Sbjct: 481 MTKDHRMNPRSEHYACVVDMLGRAGLLSEARTFIEKLPEQPGLLVWQALLGACSLYGDSE 540

Query: 589 TGKYAAEHLFSESPHSPVPYVLLANIYSSEGNWKERARTIRKMKEVGVAKETGISWIEID 648
            GKYAAEHLFSE+P+S VPYVLLANIYSSEGNWKERARTIRKMKE G+AKETGISWIEID
Sbjct: 541 MGKYAAEHLFSETPNSSVPYVLLANIYSSEGNWKERARTIRKMKETGMAKETGISWIEID 600

Query: 649 KKVHSFTVWDKMHPQAETIYGVLMELFVFMVDEGYVPDKKFILYYLDSDGKRDP 703
           KKVHSFTV DK HPQA+ IYGVLM+LFV MVDEGYVPDKKFIL+YLD D K++P
Sbjct: 601 KKVHSFTVGDKRHPQADIIYGVLMDLFVHMVDEGYVPDKKFILFYLDPDDKKEP 654

BLAST of Moc08g41880 vs. ExPASy TrEMBL
Match: A0A6J1G3C0 (pentatricopeptide repeat-containing protein At3g05340 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111450393 PE=3 SV=1)

HSP 1 Score: 1159.8 bits (2999), Expect = 0.0e+00
Identity = 567/654 (86.70%), Positives = 607/654 (92.81%), Query Frame = 0

Query: 49  MKLKWVFQNLSSRLPSWASSLTSPFRNRFLQTQFAESSSRFVLNHIDTSLFLSICGRENL 108
           MKLKWVFQ LSS+LPSWA+S  SPFRN+F Q  FAE+SS FVLNH+D S  LS+CGR+  
Sbjct: 1   MKLKWVFQKLSSKLPSWATSRISPFRNQFHQNPFAETSSTFVLNHVDPSFLLSVCGRDGH 60

Query: 109 LHLGSSLHASIIKSFQLANHENGVVIANSLISMYQRCGKLADAVKVFDEMSRRDTVSWNA 168
           L+LGSSLHASIIKSF+L+NHENGVVI NSLISMY+RCGKL DAVKVFDEM  RDTVSWNA
Sbjct: 61  LYLGSSLHASIIKSFELSNHENGVVIMNSLISMYERCGKLPDAVKVFDEMPTRDTVSWNA 120

Query: 169 LIAGFMRNGESCVGFSYFKAMCLVGDCEFDKATLTTILSACDGFELCYVIKMIHGLAFLS 228
           LI GFMRNGE C GFSYFKAMCLVGDC+FDKATLTTILSACDG E+C +IKM+HGL FLS
Sbjct: 121 LIGGFMRNGEFCAGFSYFKAMCLVGDCKFDKATLTTILSACDGLEMCCIIKMMHGLTFLS 180

Query: 229 GFVQEITVGNALISSYFKCGCVGFGRQVFDEMEERNVITWTAVISGLAHNGHHEHSLKLF 288
           G+ QEITVGNALISSYFKCGCVGFG+QVF EMEERNVITWTAVISGLA NG+HEHSL+LF
Sbjct: 181 GYKQEITVGNALISSYFKCGCVGFGKQVFYEMEERNVITWTAVISGLAQNGYHEHSLELF 240

Query: 289 KEMMSCGSVEPNSLTYLSLLTACSGLEALEEGRQIHGLILKLGIQSDLCIESALMDMYSK 348
           +EM+SCGSVEPNSLTYLSLLTACSGLEALEEG QIHGLILKLGIQSDLCI SALMDMYSK
Sbjct: 241 REMLSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300

Query: 349 SGSIGYAWKIFESAEELDMVSLTVILAGFTQNGCEDEAIRIFLKMLKVGILIDENVVSAA 408
            GSIG AWKIFESAEELDMVSLTVILAGFTQNGCE+EAI+IFLKMLK+GI IDENVVSA 
Sbjct: 301 CGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIEIDENVVSAV 360

Query: 409 LGVFGADTSLRLGQQVHSFVVKRNFSCNPFVGNGLINMYSKCGALDESVKVFDRMRERNS 468
           LGVFGADTSLRLGQQVHSF+VK+NFSCNPFV NGLINMYSKCGALDESVKVFDRM+ RNS
Sbjct: 361 LGVFGADTSLRLGQQVHSFIVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQNRNS 420

Query: 469 VTWNSMIAAFARHGDGSKALCLYENMKLEGAKPTDVTFLSLLHACSHVGLVRKGMEFLES 528
           VTWNSMIAAFARHGDG KAL LYENMKLEGAKPTD+TFLSLLHACSHVGLV KGMEFLES
Sbjct: 421 VTWNSMIAAFARHGDGLKALHLYENMKLEGAKPTDITFLSLLHACSHVGLVNKGMEFLES 480

Query: 529 MTKDHGMNPRTEHYACVVDMLGRAGLVSEARKFIEELPERPGLLVWQALLGACSLYGDSE 588
           MTKDH MNPR+EHYACVVDMLGRAGL+SEAR FIE+LPE+PGLLVWQALLGACSLYGDSE
Sbjct: 481 MTKDHRMNPRSEHYACVVDMLGRAGLLSEARTFIEKLPEQPGLLVWQALLGACSLYGDSE 540

Query: 589 TGKYAAEHLFSESPHSPVPYVLLANIYSSEGNWKERARTIRKMKEVGVAKETGISWIEID 648
            GKYAAEHLFSE+P+S VPYVLLANIYSSEGNWKERARTIRKMKE G+AKETGISWIEID
Sbjct: 541 MGKYAAEHLFSETPNSSVPYVLLANIYSSEGNWKERARTIRKMKETGMAKETGISWIEID 600

Query: 649 KKVHSFTVWDKMHPQAETIYGVLMELFVFMVDEGYVPDKKFILYYLDSDGKRDP 703
           KKVHSFTV DK HPQA+ IYGVLM+LFV MVDEGYVPDKKFIL+YLD D K++P
Sbjct: 601 KKVHSFTVGDKRHPQADIIYGVLMDLFVHMVDEGYVPDKKFILFYLDPDDKKEP 654

BLAST of Moc08g41880 vs. ExPASy TrEMBL
Match: A0A6J1G3A2 (pentatricopeptide repeat-containing protein At3g05340 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111450393 PE=3 SV=1)

HSP 1 Score: 1159.8 bits (2999), Expect = 0.0e+00
Identity = 567/654 (86.70%), Positives = 607/654 (92.81%), Query Frame = 0

Query: 49  MKLKWVFQNLSSRLPSWASSLTSPFRNRFLQTQFAESSSRFVLNHIDTSLFLSICGRENL 108
           MKLKWVFQ LSS+LPSWA+S  SPFRN+F Q  FAE+SS FVLNH+D S  LS+CGR+  
Sbjct: 1   MKLKWVFQKLSSKLPSWATSRISPFRNQFHQNPFAETSSTFVLNHVDPSFLLSVCGRDGH 60

Query: 109 LHLGSSLHASIIKSFQLANHENGVVIANSLISMYQRCGKLADAVKVFDEMSRRDTVSWNA 168
           L+LGSSLHASIIKSF+L+NHENGVVI NSLISMY+RCGKL DAVKVFDEM  RDTVSWNA
Sbjct: 61  LYLGSSLHASIIKSFELSNHENGVVIMNSLISMYERCGKLPDAVKVFDEMPTRDTVSWNA 120

Query: 169 LIAGFMRNGESCVGFSYFKAMCLVGDCEFDKATLTTILSACDGFELCYVIKMIHGLAFLS 228
           LI GFMRNGE C GFSYFKAMCLVGDC+FDKATLTTILSACDG E+C +IKM+HGL FLS
Sbjct: 121 LIGGFMRNGEFCAGFSYFKAMCLVGDCKFDKATLTTILSACDGLEMCCIIKMMHGLTFLS 180

Query: 229 GFVQEITVGNALISSYFKCGCVGFGRQVFDEMEERNVITWTAVISGLAHNGHHEHSLKLF 288
           G+ QEITVGNALISSYFKCGCVGFG+QVF EMEERNVITWTAVISGLA NG+HEHSL+LF
Sbjct: 181 GYKQEITVGNALISSYFKCGCVGFGKQVFYEMEERNVITWTAVISGLAQNGYHEHSLELF 240

Query: 289 KEMMSCGSVEPNSLTYLSLLTACSGLEALEEGRQIHGLILKLGIQSDLCIESALMDMYSK 348
           +EM+SCGSVEPNSLTYLSLLTACSGLEALEEG QIHGLILKLGIQSDLCI SALMDMYSK
Sbjct: 241 REMLSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300

Query: 349 SGSIGYAWKIFESAEELDMVSLTVILAGFTQNGCEDEAIRIFLKMLKVGILIDENVVSAA 408
            GSIG AWKIFESAEELDMVSLTVILAGFTQNGCE+EAI+IFLKMLK+GI IDENVVSA 
Sbjct: 301 CGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIEIDENVVSAV 360

Query: 409 LGVFGADTSLRLGQQVHSFVVKRNFSCNPFVGNGLINMYSKCGALDESVKVFDRMRERNS 468
           LGVFGADTSLRLGQQVHSF+VK+NFSCNPFV NGLINMYSKCGALDESVKVFDRM+ RNS
Sbjct: 361 LGVFGADTSLRLGQQVHSFIVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQNRNS 420

Query: 469 VTWNSMIAAFARHGDGSKALCLYENMKLEGAKPTDVTFLSLLHACSHVGLVRKGMEFLES 528
           VTWNSMIAAFARHGDG KAL LYENMKLEGAKPTD+TFLSLLHACSHVGLV KGMEFLES
Sbjct: 421 VTWNSMIAAFARHGDGLKALHLYENMKLEGAKPTDITFLSLLHACSHVGLVNKGMEFLES 480

Query: 529 MTKDHGMNPRTEHYACVVDMLGRAGLVSEARKFIEELPERPGLLVWQALLGACSLYGDSE 588
           MTKDH MNPR+EHYACVVDMLGRAGL+SEAR FIE+LPE+PGLLVWQALLGACSLYGDSE
Sbjct: 481 MTKDHRMNPRSEHYACVVDMLGRAGLLSEARTFIEKLPEQPGLLVWQALLGACSLYGDSE 540

Query: 589 TGKYAAEHLFSESPHSPVPYVLLANIYSSEGNWKERARTIRKMKEVGVAKETGISWIEID 648
            GKYAAEHLFSE+P+S VPYVLLANIYSSEGNWKERARTIRKMKE G+AKETGISWIEID
Sbjct: 541 MGKYAAEHLFSETPNSSVPYVLLANIYSSEGNWKERARTIRKMKETGMAKETGISWIEID 600

Query: 649 KKVHSFTVWDKMHPQAETIYGVLMELFVFMVDEGYVPDKKFILYYLDSDGKRDP 703
           KKVHSFTV DK HPQA+ IYGVLM+LFV MVDEGYVPDKKFIL+YLD D K++P
Sbjct: 601 KKVHSFTVGDKRHPQADIIYGVLMDLFVHMVDEGYVPDKKFILFYLDPDDKKEP 654

BLAST of Moc08g41880 vs. ExPASy TrEMBL
Match: A0A6J1KFW9 (pentatricopeptide repeat-containing protein At3g05340 OS=Cucurbita maxima OX=3661 GN=LOC111493539 PE=3 SV=1)

HSP 1 Score: 1156.7 bits (2991), Expect = 0.0e+00
Identity = 567/654 (86.70%), Positives = 605/654 (92.51%), Query Frame = 0

Query: 49  MKLKWVFQNLSSRLPSWASSLTSPFRNRFLQTQFAESSSRFVLNHIDTSLFLSICGRENL 108
           MKLKWVFQ LSS+LPSWASS  SPFRN+F Q  FAE+SS FVLNH+D S  LS+CGR+  
Sbjct: 1   MKLKWVFQRLSSKLPSWASSRISPFRNQFHQNPFAETSSTFVLNHVDPSFLLSVCGRDGH 60

Query: 109 LHLGSSLHASIIKSFQLANHENGVVIANSLISMYQRCGKLADAVKVFDEMSRRDTVSWNA 168
           L+LGSSLHASIIKSF+L+NHENGVVI NSLISMY+RCGKL DAVKVFDEM  RDTVSWNA
Sbjct: 61  LYLGSSLHASIIKSFELSNHENGVVIMNSLISMYERCGKLPDAVKVFDEMPTRDTVSWNA 120

Query: 169 LIAGFMRNGESCVGFSYFKAMCLVGDCEFDKATLTTILSACDGFELCYVIKMIHGLAFLS 228
           LI GFMRNGE   GFSYFKAMCLVGDC+FDKATLTTILSACDG E+C +I+M+HGL FLS
Sbjct: 121 LIGGFMRNGEFYAGFSYFKAMCLVGDCKFDKATLTTILSACDGSEMCCIIEMMHGLTFLS 180

Query: 229 GFVQEITVGNALISSYFKCGCVGFGRQVFDEMEERNVITWTAVISGLAHNGHHEHSLKLF 288
           G+ QEITVGNALISSYFKCGCVGFGRQ+F EMEERNVITWTAVISGLA NGHHEHSL+LF
Sbjct: 181 GYEQEITVGNALISSYFKCGCVGFGRQLFYEMEERNVITWTAVISGLAQNGHHEHSLELF 240

Query: 289 KEMMSCGSVEPNSLTYLSLLTACSGLEALEEGRQIHGLILKLGIQSDLCIESALMDMYSK 348
           +EMMSCGSVEPNSLTYLSLLTACSGLEALEEG QIHGLILKLGIQSDLCI SALMDMYSK
Sbjct: 241 REMMSCGSVEPNSLTYLSLLTACSGLEALEEGCQIHGLILKLGIQSDLCIGSALMDMYSK 300

Query: 349 SGSIGYAWKIFESAEELDMVSLTVILAGFTQNGCEDEAIRIFLKMLKVGILIDENVVSAA 408
            GSIG AWKIFESAEELDMVSLTVILAGFTQNGCE+EAI+IFLKMLK+GI ID NVVSA 
Sbjct: 301 CGSIGDAWKIFESAEELDMVSLTVILAGFTQNGCEEEAIQIFLKMLKMGIEIDANVVSAV 360

Query: 409 LGVFGADTSLRLGQQVHSFVVKRNFSCNPFVGNGLINMYSKCGALDESVKVFDRMRERNS 468
           LGVFGADTSLRLGQQVHSF+VK+NFSCNPFV NGLINMYSKCGALDESVKVFDRM+ RNS
Sbjct: 361 LGVFGADTSLRLGQQVHSFIVKKNFSCNPFVSNGLINMYSKCGALDESVKVFDRMQNRNS 420

Query: 469 VTWNSMIAAFARHGDGSKALCLYENMKLEGAKPTDVTFLSLLHACSHVGLVRKGMEFLES 528
           VTWNSMIAAFARHGDGSKAL LYENMKLEGAKPTD+TFLSLLHACSHVGLV KGMEFLES
Sbjct: 421 VTWNSMIAAFARHGDGSKALHLYENMKLEGAKPTDITFLSLLHACSHVGLVNKGMEFLES 480

Query: 529 MTKDHGMNPRTEHYACVVDMLGRAGLVSEARKFIEELPERPGLLVWQALLGACSLYGDSE 588
           MTKDH MNPR+EHYACVVDMLGRAGL+SEAR FIE+LPE+PGLLVWQALLGACSLYGDSE
Sbjct: 481 MTKDHRMNPRSEHYACVVDMLGRAGLLSEARTFIEKLPEQPGLLVWQALLGACSLYGDSE 540

Query: 589 TGKYAAEHLFSESPHSPVPYVLLANIYSSEGNWKERARTIRKMKEVGVAKETGISWIEID 648
            GKYAAEHLFSE+P+S VPYVLLANIYSSEGNWKERARTIRKMKE G+AKETGISWIEID
Sbjct: 541 MGKYAAEHLFSETPYSSVPYVLLANIYSSEGNWKERARTIRKMKETGMAKETGISWIEID 600

Query: 649 KKVHSFTVWDKMHPQAETIYGVLMELFVFMVDEGYVPDKKFILYYLDSDGKRDP 703
           KKVHSFTV DK HPQA+ IYGVLM+LFV MVDEGYVPDK FIL+YLD D K++P
Sbjct: 601 KKVHSFTVGDKRHPQADIIYGVLMDLFVLMVDEGYVPDKNFILFYLDPDDKKEP 654

BLAST of Moc08g41880 vs. TAIR 10
Match: AT3G05340.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 752.3 bits (1941), Expect = 3.6e-217
Identity = 392/656 (59.76%), Positives = 468/656 (71.34%), Query Frame = 0

Query: 49  MKLKWVFQNLSSRLPSWASSLTSPFRNRFLQTQFAESSSRFVLNHIDTSLFLSICGRENL 108
           M  +WV Q L+S LPS  S++ SP +    Q+   + S+ F+LNH+D SL LSICGRE  
Sbjct: 1   MNSRWVIQKLTSHLPSCLSTVLSPSKILIRQSPNYQVST-FLLNHVDMSLLLSICGREGW 60

Query: 109 L-HLGSSLHASIIKSFQLAN------HENGVVIANSLISMYQRCGKLADAVKVFDEMSRR 168
             HLG  LHASIIK+ +         H N +V+ NSL+S+Y +CGKL DA+K+FDEM  R
Sbjct: 61  FPHLGPCLHASIIKNPEFFEPVDADIHRNALVVWNSLLSLYAKCGKLVDAIKLFDEMPMR 120

Query: 169 DTVSWNALIAGFMRNGESCVGFSYFKAMCLVGDCEFDKATLTTILSACDGFELCYVIKMI 228
           D +S N +  GF+RN E+  GF   K M  +G   FD ATLT +LS CD  E C V KMI
Sbjct: 121 DVISQNIVFYGFLRNRETESGFVLLKRM--LGSGGFDHATLTIVLSVCDTPEFCLVTKMI 180

Query: 229 HGLAFLSGFVQEITVGNALISSYFKCGCVGFGRQVFDEMEERNVITWTAVISGLAHNGHH 288
           H LA LSG+ +EI+VGN LI+SYFKCGC   GR VFD M  RNVIT TAVISGL  N  H
Sbjct: 181 HALAILSGYDKEISVGNKLITSYFKCGCSVSGRGVFDGMSHRNVITLTAVISGLIENELH 240

Query: 289 EHSLKLFKEMMSCGSVEPNSLTYLSLLTACSGLEALEEGRQIHGLILKLGIQSDLCIESA 348
           E  L+LF  +M  G V PNS+TYLS L ACSG + + EG+QIH L+ K GI+S+LCIESA
Sbjct: 241 EDGLRLF-SLMRRGLVHPNSVTYLSALAACSGSQRIVEGQQIHALLWKYGIESELCIESA 300

Query: 349 LMDMYSKSGSIGYAWKIFESAEELDMVSLTVILAGFTQNGCEDEAIRIFLKMLKVGILID 408
           LMDMYSK GSI  AW IFES  E+D VS+TVIL G  QNG E+EAI+ F++ML+ G+ ID
Sbjct: 301 LMDMYSKCGSIEDAWTIFESTTEVDEVSMTVILVGLAQNGSEEEAIQFFIRMLQAGVEID 360

Query: 409 ENVVSAALGVFGADTSLRLGQQVHSFVVKRNFSCNPFVGNGLINMYSKCGALDESVKVFD 468
            NVVSA LGV   D SL LG+Q+HS V+KR FS N FV NGLINMYSKCG L +S  VF 
Sbjct: 361 ANVVSAVLGVSFIDNSLGLGKQLHSLVIKRKFSGNTFVNNGLINMYSKCGDLTDSQTVFR 420

Query: 469 RMRERNSVTWNSMIAAFARHGDGSKALCLYENMKLEGAKPTDVTFLSLLHACSHVGLVRK 528
           RM +RN V+WNSMIAAFARHG G  AL LYE M     KPTDVTFLSLLHACSHVGL+ K
Sbjct: 421 RMPKRNYVSWNSMIAAFARHGHGLAALKLYEEMTTLEVKPTDVTFLSLLHACSHVGLIDK 480

Query: 529 GMEFLESMTKDHGMNPRTEHYACVVDMLGRAGLVSEARKFIEELPERPGLLVWQALLGAC 588
           G E L  M + HG+ PRTEHY C++DMLGRAGL+ EA+ FI+ LP +P   +WQALLGAC
Sbjct: 481 GRELLNEMKEVHGIEPRTEHYTCIIDMLGRAGLLKEAKSFIDSLPLKPDCKIWQALLGAC 540

Query: 589 SLYGDSETGKYAAEHLFSESPHSPVPYVLLANIYSSEGNWKERARTIRKMKEVGVAKETG 648
           S +GD+E G+YAAE LF  +P S   ++L+ANIYSS G WKERA+TI++MK +GV KETG
Sbjct: 541 SFHGDTEVGEYAAEQLFQTAPDSSSAHILIANIYSSRGKWKERAKTIKRMKAMGVTKETG 600

Query: 649 ISWIEIDKKVHSFTVWDKMHPQAETIYGVLMELFVFMVDEGYVPDKKFILYYLDSD 698
           IS IEI+ K HSF V DK+HPQAE IY VL  LF  MVDEGY PDK+FIL Y   D
Sbjct: 601 ISSIEIEHKTHSFVVEDKLHPQAEAIYDVLSGLFPVMVDEGYRPDKRFILCYTGDD 652

BLAST of Moc08g41880 vs. TAIR 10
Match: AT3G49170.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 405.6 bits (1041), Expect = 8.3e-113
Identity = 229/621 (36.88%), Positives = 347/621 (55.88%), Query Frame = 0

Query: 103 CGRENLLHLGSSLHASIIKSFQLANHENGVVIANSLISMYQRC-GKLADAVKVFDEMSRR 162
           C   + + +G      ++K+    + E+ V +  SLI M+ +      +A KVFD+MS  
Sbjct: 176 CSNSDFVGVGRVTLGFLMKT---GHFESDVCVGCSLIDMFVKGENSFENAYKVFDKMSEL 235

Query: 163 DTVSWNALIAGFMRNGESCVGFSYFKAMCLVGDCEFDKATLTTILSACDGFELCYVIKMI 222
           + V+W  +I   M+ G       +F  M L G  E DK TL+++ SAC   E   + K +
Sbjct: 236 NVVTWTLMITRCMQMGFPREAIRFFLDMVLSG-FESDKFTLSSVFSACAELENLSLGKQL 295

Query: 223 HGLAFLSGFVQEITVGNALISSYFKC---GCVGFGRQVFDEMEERNVITWTAVISGLAHN 282
           H  A  SG V ++    +L+  Y KC   G V   R+VFD ME+ +V++WTA+I+G   N
Sbjct: 296 HSWAIRSGLVDDVEC--SLVDMYAKCSADGSVDDCRKVFDRMEDHSVMSWTALITGYMKN 355

Query: 283 GH-HEHSLKLFKEMMSCGSVEPNSLTYLSLLTACSGLEALEEGRQIHGLILKLGIQSDLC 342
            +    ++ LF EM++ G VEPN  T+ S   AC  L     G+Q+ G   K G+ S+  
Sbjct: 356 CNLATEAINLFSEMITQGHVEPNHFTFSSAFKACGNLSDPRVGKQVLGQAFKRGLASNSS 415

Query: 343 IESALMDMYSKSGSIGYAWKIFESAEELDMVSLTVILAGFTQNGCEDEAIRIFLKMLKVG 402
           + ++++ M+ KS  +  A + FES  E ++VS    L G  +N   ++A ++  ++ +  
Sbjct: 416 VANSVISMFVKSDRMEDAQRAFESLSEKNLVSYNTFLDGTCRNLNFEQAFKLLSEITERE 475

Query: 403 ILIDENVVSAALGVFGADTSLRLGQQVHSFVVKRNFSCNPFVGNGLINMYSKCGALDESV 462
           + +     ++ L       S+R G+Q+HS VVK   SCN  V N LI+MYSKCG++D + 
Sbjct: 476 LGVSAFTFASLLSGVANVGSIRKGEQIHSQVVKLGLSCNQPVCNALISMYSKCGSIDTAS 535

Query: 463 KVFDRMRERNSVTWNSMIAAFARHGDGSKALCLYENMKLEGAKPTDVTFLSLLHACSHVG 522
           +VF+ M  RN ++W SMI  FA+HG   + L  +  M  EG KP +VT++++L ACSHVG
Sbjct: 536 RVFNFMENRNVISWTSMITGFAKHGFAIRVLETFNQMIEEGVKPNEVTYVAILSACSHVG 595

Query: 523 LVRKGMEFLESMTKDHGMNPRTEHYACVVDMLGRAGLVSEARKFIEELPERPGLLVWQAL 582
           LV +G     SM +DH + P+ EHYAC+VD+L RAGL+++A +FI  +P +  +LVW+  
Sbjct: 596 LVSEGWRHFNSMYEDHKIKPKMEHYACMVDLLCRAGLLTDAFEFINTMPFQADVLVWRTF 655

Query: 583 LGACSLYGDSETGKYAAEHLFSESPHSPVPYVLLANIYSSEGNWKERARTIRKMKEVGVA 642
           LGAC ++ ++E GK AA  +    P+ P  Y+ L+NIY+  G W+E     RKMKE  + 
Sbjct: 656 LGACRVHSNTELGKLAARKILELDPNEPAAYIQLSNIYACAGKWEESTEMRRKMKERNLV 715

Query: 643 KETGISWIEIDKKVHSFTVWDKMHPQAETIYGVLMELFVFMVDEGYVPDKKFILYYLDSD 702
           KE G SWIE+  K+H F V D  HP A  IY  L  L   +   GYVPD   +L+ L+ +
Sbjct: 716 KEGGCSWIEVGDKIHKFYVGDTAHPNAHQIYDELDRLITEIKRCGYVPDTDLVLHKLEEE 775

Query: 703 GKRDPARDSRPNHQSSIKSEV 719
              D A   R  +Q S K  V
Sbjct: 776 --NDEAEKERLLYQHSEKIAV 788

BLAST of Moc08g41880 vs. TAIR 10
Match: AT5G09950.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 402.1 bits (1032), Expect = 9.2e-112
Identity = 219/575 (38.09%), Positives = 332/575 (57.74%), Query Frame = 0

Query: 132 VVIANSLISMYQRCGKLADAVKVFDEMSRRDTVSWNALIAGFMRNGESCVGFSYFKAMCL 191
           V I N L++MY +CG +ADA +VF  M+ +D+VSWN++I G  +NG        +K+M  
Sbjct: 349 VGIGNGLVNMYAKCGSIADARRVFYFMTDKDSVSWNSMITGLDQNGCFIEAVERYKSM-R 408

Query: 192 VGDCEFDKATLTTILSACDGFELCYVIKMIHGLAFLSGFVQEITVGNALISSYFKCGCVG 251
             D      TL + LS+C   +   + + IHG +   G    ++V NAL++ Y + G + 
Sbjct: 409 RHDILPGSFTLISSLSSCASLKWAKLGQQIHGESLKLGIDLNVSVSNALMTLYAETGYLN 468

Query: 252 FGRQVFDEMEERNVITWTAVISGLAHNGHH-EHSLKLFKEMMSCGSVEPNSLTYLSLLTA 311
             R++F  M E + ++W ++I  LA +      ++  F      G  + N +T+ S+L+A
Sbjct: 469 ECRKIFSSMPEHDQVSWNSIIGALARSERSLPEAVVCFLNAQRAGQ-KLNRITFSSVLSA 528

Query: 312 CSGLEALEEGRQIHGLILKLGIQSDLCIESALMDMYSKSGSIGYAWKIF-ESAEELDMVS 371
            S L   E G+QIHGL LK  I  +   E+AL+  Y K G +    KIF   AE  D V+
Sbjct: 529 VSSLSFGELGKQIHGLALKNNIADEATTENALIACYGKCGEMDGCEKIFSRMAERRDNVT 588

Query: 372 LTVILAGFTQNGCEDEAIRIFLKMLKVGILIDENVVSAALGVFGADTSLRLGQQVHSFVV 431
              +++G+  N    +A+ +   ML+ G  +D  + +  L  F +  +L  G +VH+  V
Sbjct: 589 WNSMISGYIHNELLAKALDLVWFMLQTGQRLDSFMYATVLSAFASVATLERGMEVHACSV 648

Query: 432 KRNFSCNPFVGNGLINMYSKCGALDESVKVFDRMRERNSVTWNSMIAAFARHGDGSKALC 491
           +     +  VG+ L++MYSKCG LD +++ F+ M  RNS +WNSMI+ +ARHG G +AL 
Sbjct: 649 RACLESDVVVGSALVDMYSKCGRLDYALRFFNTMPVRNSYSWNSMISGYARHGQGEEALK 708

Query: 492 LYENMKLEGAKPTD-VTFLSLLHACSHVGLVRKGMEFLESMTKDHGMNPRTEHYACVVDM 551
           L+E MKL+G  P D VTF+ +L ACSH GL+ +G +  ESM+  +G+ PR EH++C+ D+
Sbjct: 709 LFETMKLDGQTPPDHVTFVGVLSACSHAGLLEEGFKHFESMSDSYGLAPRIEHFSCMADV 768

Query: 552 LGRAGLVSEARKFIEELPERPGLLVWQALLGACSLYG--DSETGKYAAEHLFSESPHSPV 611
           LGRAG + +   FIE++P +P +L+W+ +LGAC       +E GK AAE LF   P + V
Sbjct: 769 LGRAGELDKLEDFIEKMPMKPNVLIWRTVLGACCRANGRKAELGKKAAEMLFQLEPENAV 828

Query: 612 PYVLLANIYSSEGNWKERARTIRKMKEVGVAKETGISWIEIDKKVHSFTVWDKMHPQAET 671
            YVLL N+Y++ G W++  +  +KMK+  V KE G SW+ +   VH F   DK HP A+ 
Sbjct: 829 NYVLLGNMYAAGGRWEDLVKARKKMKDADVKKEAGYSWVTMKDGVHMFVAGDKSHPDADV 888

Query: 672 IYGVLMELFVFMVDEGYVPDKKFILYYLDSDGKRD 702
           IY  L EL   M D GYVP   F LY L+ + K +
Sbjct: 889 IYKKLKELNRKMRDAGYVPQTGFALYDLEQENKEE 921

BLAST of Moc08g41880 vs. TAIR 10
Match: AT3G57430.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 399.4 bits (1025), Expect = 6.0e-111
Identity = 216/606 (35.64%), Positives = 337/606 (55.61%), Query Frame = 0

Query: 109 LHLGSSLHASIIKSFQLANHENGVVIANSLISMYQRCGKLADAVKVFDEMSRRDTVSWNA 168
           L +G  +HA     + L   E    I N+L++MY + GKLA +  +      RD V+WN 
Sbjct: 218 LMMGKQVHA-----YGLRKGELNSFIINTLVAMYGKLGKLASSKVLLGSFGGRDLVTWNT 277

Query: 169 LIAGFMRNGESCVGFSYFKAMCLVGDCEFDKATLTTILSACDGFELCYVIKMIHGLAFLS 228
           +++   +N +      Y + M L G  E D+ T++++L AC   E+    K +H  A  +
Sbjct: 278 VLSSLCQNEQLLEALEYLREMVLEG-VEPDEFTISSVLPACSHLEMLRTGKELHAYALKN 337

Query: 229 GFVQEIT-VGNALISSYFKCGCVGFGRQVFDEMEERNVITWTAVISGLAHNGHHEHSLKL 288
           G + E + VG+AL+  Y  C  V  GR+VFD M +R +  W A+I+G + N H + +L L
Sbjct: 338 GSLDENSFVGSALVDMYCNCKQVLSGRRVFDGMFDRKIGLWNAMIAGYSQNEHDKEALLL 397

Query: 289 FKEMMSCGSVEPNSLTYLSLLTACSGLEALEEGRQIHGLILKLGIQSDLCIESALMDMYS 348
           F  M     +  NS T   ++ AC    A      IHG ++K G+  D  +++ LMDMYS
Sbjct: 398 FIGMEESAGLLANSTTMAGVVPACVRSGAFSRKEAIHGFVVKRGLDRDRFVQNTLMDMYS 457

Query: 349 KSGSIGYAWKIFESAEELDMVSLTVILAGFTQNGCEDEAIRIFLKM-----------LKV 408
           + G I  A +IF   E+ D+V+   ++ G+  +   ++A+ +  KM            +V
Sbjct: 458 RLGKIDIAMRIFGKMEDRDLVTWNTMITGYVFSEHHEDALLLLHKMQNLERKVSKGASRV 517

Query: 409 GILIDENVVSAALGVFGADTSLRLGQQVHSFVVKRNFSCNPFVGNGLINMYSKCGALDES 468
            +  +   +   L    A ++L  G+++H++ +K N + +  VG+ L++MY+KCG L  S
Sbjct: 518 SLKPNSITLMTILPSCAALSALAKGKEIHAYAIKNNLATDVAVGSALVDMYAKCGCLQMS 577

Query: 469 VKVFDRMRERNSVTWNSMIAAFARHGDGSKALCLYENMKLEGAKPTDVTFLSLLHACSHV 528
            KVFD++ ++N +TWN +I A+  HG+G +A+ L   M ++G KP +VTF+S+  ACSH 
Sbjct: 578 RKVFDQIPQKNVITWNVIIMAYGMHGNGQEAIDLLRMMMVQGVKPNEVTFISVFAACSHS 637

Query: 529 GLVRKGMEFLESMTKDHGMNPRTEHYACVVDMLGRAGLVSEARKFIEELP---ERPGLLV 588
           G+V +G+     M  D+G+ P ++HYACVVD+LGRAG + EA + +  +P    + G   
Sbjct: 638 GMVDEGLRIFYVMKPDYGVEPSSDHYACVVDLLGRAGRIKEAYQLMNMMPRDFNKAG--A 697

Query: 589 WQALLGACSLYGDSETGKYAAEHLFSESPHSPVPYVLLANIYSSEGNWKERARTIRKMKE 648
           W +LLGA  ++ + E G+ AA++L    P+    YVLLANIYSS G W +     R MKE
Sbjct: 698 WSSLLGASRIHNNLEIGEIAAQNLIQLEPNVASHYVLLANIYSSAGLWDKATEVRRNMKE 757

Query: 649 VGVAKETGISWIEIDKKVHSFTVWDKMHPQAETIYGVLMELFVFMVDEGYVPDKKFILYY 700
            GV KE G SWIE   +VH F   D  HPQ+E + G L  L+  M  EGYVPD   +L+ 
Sbjct: 758 QGVRKEPGCSWIEHGDEVHKFVAGDSSHPQSEKLSGYLETLWERMRKEGYVPDTSCVLHN 815

BLAST of Moc08g41880 vs. TAIR 10
Match: AT5G16860.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 397.9 bits (1021), Expect = 1.7e-110
Identity = 218/645 (33.80%), Positives = 350/645 (54.26%), Query Frame = 0

Query: 103 CGRENLLHLGSSLHASIIKSFQLANHENGVVIANSLISMYQRCGKLADAVKVFDEMSRRD 162
           CG  + +  G S HA  + +  ++N    V + N+L++MY RC  L+DA KVFDEMS  D
Sbjct: 137 CGEISSVRCGESAHALSLVTGFISN----VFVGNALVAMYSRCRSLSDARKVFDEMSVWD 196

Query: 163 TVSWNALIAGFMRNGESCVGFSYFKAMCLVGDCEFDKATLTTILSACDGFELCYVIKMIH 222
            VSWN++I  + + G+  V    F  M     C  D  TL  +L  C       + K +H
Sbjct: 197 VVSWNSIIESYAKLGKPKVALEMFSRMTNEFGCRPDNITLVNVLPPCASLGTHSLGKQLH 256

Query: 223 GLAFLSGFVQEITVGNALISSYFKCGCVGFGRQVFDEMEERNVITWTAVISGLAHNGHHE 282
             A  S  +Q + VGN L+  Y KCG +     VF  M  ++V++W A+++G +  G  E
Sbjct: 257 CFAVTSEMIQNMFVGNCLVDMYAKCGMMDEANTVFSNMSVKDVVSWNAMVAGYSQIGRFE 316

Query: 283 HSLKLFKEM----------------------------------MSCGSVEPNSLTYLSLL 342
            +++LF++M                                  M    ++PN +T +S+L
Sbjct: 317 DAVRLFEKMQEEKIKMDVVTWSAAISGYAQRGLGYEALGVCRQMLSSGIKPNEVTLISVL 376

Query: 343 TACSGLEALEEGRQIHGLILKLGIQ-------SDLCIESALMDMYSKSGSIGYAWKIFE- 402
           + C+ + AL  G++IH   +K  I         +  + + L+DMY+K   +  A  +F+ 
Sbjct: 377 SGCASVGALMHGKEIHCYAIKYPIDLRKNGHGDENMVINQLIDMYAKCKKVDTARAMFDS 436

Query: 403 -SAEELDMVSLTVILAGFTQNGCEDEAIRIFLKMLKVGILIDEN--VVSAALGVFGADTS 462
            S +E D+V+ TV++ G++Q+G  ++A+ +  +M +       N   +S AL    +  +
Sbjct: 437 LSPKERDVVTWTVMIGGYSQHGDANKALELLSEMFEEDCQTRPNAFTISCALVACASLAA 496

Query: 463 LRLGQQVHSFVVKRNFSCNP-FVGNGLINMYSKCGALDESVKVFDRMRERNSVTWNSMIA 522
           LR+G+Q+H++ ++   +  P FV N LI+MY+KCG++ ++  VFD M  +N VTW S++ 
Sbjct: 497 LRIGKQIHAYALRNQQNAVPLFVSNCLIDMYAKCGSISDARLVFDNMMAKNEVTWTSLMT 556

Query: 523 AFARHGDGSKALCLYENMKLEGAKPTDVTFLSLLHACSHVGLVRKGMEFLESMTKDHGMN 582
            +  HG G +AL +++ M+  G K   VT L +L+ACSH G++ +GME+   M    G++
Sbjct: 557 GYGMHGYGEEALGIFDEMRRIGFKLDGVTLLVVLYACSHSGMIDQGMEYFNRMKTVFGVS 616

Query: 583 PRTEHYACVVDMLGRAGLVSEARKFIEELPERPGLLVWQALLGACSLYGDSETGKYAAEH 642
           P  EHYAC+VD+LGRAG ++ A + IEE+P  P  +VW A L  C ++G  E G+YAAE 
Sbjct: 617 PGPEHYACLVDLLGRAGRLNAALRLIEEMPMEPPPVVWVAFLSCCRIHGKVELGEYAAEK 676

Query: 643 LFSESPHSPVPYVLLANIYSSEGNWKERARTIRKMKEVGVAKETGISWIEIDKKVHSFTV 702
           +   + +    Y LL+N+Y++ G WK+  R    M+  GV K  G SW+E  K   +F V
Sbjct: 677 ITELASNHDGSYTLLSNLYANAGRWKDVTRIRSLMRHKGVKKRPGCSWVEGIKGTTTFFV 736

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_022150755.1	0.0e+00	100.00	pentatricopeptide repeat-containing protein At3g05340 [Momordica charantia]
XP_022946254.1	0.0e+00	86.70	pentatricopeptide repeat-containing protein At3g05340 isoform X1 [Cucurbita mosc...
XP_022946256.1	0.0e+00	86.70	pentatricopeptide repeat-containing protein At3g05340 isoform X3 [Cucurbita mosc...
XP_022946255.1	0.0e+00	86.70	pentatricopeptide repeat-containing protein At3g05340 isoform X2 [Cucurbita mosc...
XP_022999024.1	0.0e+00	86.70	pentatricopeptide repeat-containing protein At3g05340 [Cucurbita maxima]

Match Name	E-value	Identity	Description
Q9MA85	5.1e-216	59.76	Pentatricopeptide repeat-containing protein At3g05340 OS=Arabidopsis thaliana OX...
Q5G1T1	1.2e-111	36.88	Pentatricopeptide repeat-containing protein At3g49170, chloroplastic OS=Arabidop...
Q9FIB2	1.3e-110	38.09	Putative pentatricopeptide repeat-containing protein At5g09950 OS=Arabidopsis th...
Q7Y211	8.4e-110	35.64	Pentatricopeptide repeat-containing protein At3g57430, chloroplastic OS=Arabidop...
Q9LFL5	2.4e-109	33.80	Pentatricopeptide repeat-containing protein At5g16860 OS=Arabidopsis thaliana OX...

Match Name	E-value	Identity	Description
A0A6J1D9D8	0.0e+00	100.00	pentatricopeptide repeat-containing protein At3g05340 OS=Momordica charantia OX=...
A0A6J1G350	0.0e+00	86.70	pentatricopeptide repeat-containing protein At3g05340 isoform X3 OS=Cucurbita mo...
A0A6J1G3C0	0.0e+00	86.70	pentatricopeptide repeat-containing protein At3g05340 isoform X2 OS=Cucurbita mo...
A0A6J1G3A2	0.0e+00	86.70	pentatricopeptide repeat-containing protein At3g05340 isoform X1 OS=Cucurbita mo...
A0A6J1KFW9	0.0e+00	86.70	pentatricopeptide repeat-containing protein At3g05340 OS=Cucurbita maxima OX=366...

Match Name	E-value	Identity	Description
AT3G05340.1	3.6e-217	59.76	Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G49170.1	8.3e-113	36.88	Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G09950.1	9.2e-112	38.09	Tetratricopeptide repeat (TPR)-like superfamily protein
AT3G57430.1	6.0e-111	35.64	Tetratricopeptide repeat (TPR)-like superfamily protein
AT5G16860.1	1.7e-110	33.80	Tetratricopeptide repeat (TPR)-like superfamily protein
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR Term	IPR Description	Source	Source Term	Source Description	Coord	E-value	Score
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	317..415	4.4E-13	51.4
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	416..516	6.0E-25	89.6
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	74..211	1.2E-19	72.4
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	517..649	9.8E-14	53.1
IPR011990	Tetratricopeptide-like helical domain superfamily	GENE3D	1.25.40.10	Tetratricopeptide repeat domain	213..316	6.7E-21	76.4
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	263..311	3.2E-11	43.3
IPR002885	Pentatricopeptide repeat	PFAM	PF13041	PPR_2	467..514	1.4E-12	47.6
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	368..398	0.0036	17.5
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	164..179	0.43	10.9
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	542..568	1.0	9.8
IPR002885	Pentatricopeptide repeat	PFAM	PF01535	PPR	135..161	1.2E-4	22.1
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	469..502	2.0E-7	28.7
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	136..162	1.9E-4	19.4
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	266..300	1.3E-7	29.4
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	369..401	0.0018	16.3
IPR002885	Pentatricopeptide repeat	TIGRFAM	TIGR00756	TIGR00756	238..264	3.2E-4	18.7
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	467..501	-	12.276713
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	131..165	-	9.646002
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	436..466	-	9.382931
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	366..400	-	9.076014
IPR002885	Pentatricopeptide repeat	PROSITE	PS51375	PPR	264..298	-	10.665402
IPR032867	DYW domain	PFAM	PF14432	DYW_deaminase	641..700	1.1E-7	31.7
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	700..719	-	-
None	No IPR available	PANTHER	PTHR47925	OS01G0913400 PROTEIN-RELATED	78..690	-	-
None	No IPR available	PANTHER	PTHR47925:SF89	BNAA05G31840D PROTEIN	78..690	-	-
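
The five GENE3D 1.25.40.10 segments listed above tile most of the protein. As a rough illustration, the sketch below counts how many residues they cover; the function name `covered_residues` is a hypothetical helper, and treating the coordinates as 1-based inclusive is an assumption about the InterPro output.

```python
# GENE3D 1.25.40.10 segments copied from the InterPro rows above
# (assumed 1-based, inclusive coordinates).
segments = [(74, 211), (213, 316), (317, 415), (416, 516), (517, 649)]

def covered_residues(spans):
    """Count residues covered by at least one span, merging overlaps."""
    total = 0
    last_end = 0
    for start, end in sorted(spans):
        # Skip any part of this span already counted by an earlier one.
        start = max(start, last_end + 1)
        if end >= start:
            total += end - start + 1
            last_end = end
    return total

print(covered_residues(segments))
```

Under these assumptions the segments cover 575 of the 576 residues between positions 74 and 649; only residue 212 falls outside a predicted segment.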

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Moc08g41880.1	Moc08g41880.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
molecular_function	GO:0005515	protein binding
molecular_function	GO:0008270	zinc ion binding