Moc02g10830 (gene) Bitter gourd (OHB3-1) v2

Overview
NameMoc02g10830
Typegene
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionPentatricopeptide repeat-containing protein
Locationchr2: 7727588 .. 7729063 (+)
RNA-Seq ExpressionMoc02g10830
SyntenyMoc02g10830
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCCATCCCTCTGCTGAAACGAACCCTGTCGTCAATCCGAAACCCAGGATTTAAGCTCCCATTTTCCCCTTCATTCTCCTCTTCCTCGCCGTCGGCAAAACCCTCGATCTCGACCGTGGTTTCAGTTCTCACTCACCACCGCTCAAAATCCCGCTGGCGATTCCTCAACTCCCTCTGCCCCGACGGCTTCGATCCCGGCGAGTTTTCCGATATCGTCCTCCACATCAAGAACAATCCCCATCTCTCCCTTCGATTCTTTCTCTGGACTCAGAACAAGTCCCTCTGCACTCACAATCTCGTTTCCTACTCAACCGTCATCCACATCCTTGCCCGCGGTCGCCTCAGAACTCACGCCAAGGCCGTTATTCAGACCGCCATTAGGGCTGCGGAGCTCGAAGACGACGATGGCTGTTCCAATTGTAAGCAATTTTCTAGGCCTTTGAGGCTGTTTCAGACTCTCGTCAAGACGTATAAACGGTGTGGTTCTGCTCCCTTCGTGTTTGATTTATTGATTAAAGCTCTCCTGGATTCTAGAAAGCTCGAGCCGGCCATTCAAATTATTAGAATGTTAAGGTCTCGTGGGATTAGCCCGCAGGTTAGTACATTGAATTCGTTGATTTTGTGGGTGTCGAAGTGCGAGGGGGCTAACGCGGGTTATGCTATTTTTAGAGAGGTTTTTGGTTTAGATTGTGAAGTCAAGGAAGAACGTGTGAAAATGAAGGCTCAGGCTAGTCCCAATGTACATTCTTTTAATACATTAATGATGTGTTTTTATCAAGATGGATTGGTGGGGCGGGTGAAGGAGATATGGGATCAATTAACCGAGTCAAATTCGATTCCGAACAGTTACAGTTATAGTATTCTAATGACGGTTTTCTGTGATCAAAGAAGAATGGTTGAAGCAGAGGAGTTGTGGAAAGAAATGAGATTGAAGAAGTTGGAGCTTGATGCTGTAGCTTATAACACTATAATTGGAGGGTTTTGTAAAGCAGGAAGTATTCAGAGAGCTGAAGAGCTTTTCAGAGAAATGGAACTGAGTGGAATAGAGAGTACTTTCTCCACCTTTGAGCATCTCATCAATGGCTATTGTGAAACTGGAGATATTGACTCTGCATTACTGGTGTATAAGGATATGCGGAGGAAAAATTTTAGTCTGAACGCATCGACGTTGGAAGCGATTGTTAGAGGATTGGTTGCCGAGACTAGACTTTTAGAAGCTTTAGATGTTTTTGGGTTCACCACAGAGGACTCCAACTTTTGCCCAACAATGGAAACTTACGAACTTCTGATAAATGGTTTGTGTCGGGAAGGGGAAATTGAAGCTGCATTTAAGCTTCAGGCGCAGATGGTAGGGAAAGGCTTTAAGCCAGATTCGAAGGTTTACCGTTCTTTTATCGATGCTTATACGAACGAAGGAAATGAAGAAATGGTCGAGAAGTTGAGGAAGGAATTACTTGAAATCCAGCTGAGTTGA

mRNA sequence

ATGTCCATCCCTCTGCTGAAACGAACCCTGTCGTCAATCCGAAACCCAGGATTTAAGCTCCCATTTTCCCCTTCATTCTCCTCTTCCTCGCCGTCGGCAAAACCCTCGATCTCGACCGTGGTTTCAGTTCTCACTCACCACCGCTCAAAATCCCGCTGGCGATTCCTCAACTCCCTCTGCCCCGACGGCTTCGATCCCGGCGAGTTTTCCGATATCGTCCTCCACATCAAGAACAATCCCCATCTCTCCCTTCGATTCTTTCTCTGGACTCAGAACAAGTCCCTCTGCACTCACAATCTCGTTTCCTACTCAACCGTCATCCACATCCTTGCCCGCGGTCGCCTCAGAACTCACGCCAAGGCCGTTATTCAGACCGCCATTAGGGCTGCGGAGCTCGAAGACGACGATGGCTGTTCCAATTGTAAGCAATTTTCTAGGCCTTTGAGGCTGTTTCAGACTCTCGTCAAGACGTATAAACGGTGTGGTTCTGCTCCCTTCGTGTTTGATTTATTGATTAAAGCTCTCCTGGATTCTAGAAAGCTCGAGCCGGCCATTCAAATTATTAGAATGTTAAGGTCTCGTGGGATTAGCCCGCAGGTTAGTACATTGAATTCGTTGATTTTGTGGGTGTCGAAGTGCGAGGGGGCTAACGCGGGTTATGCTATTTTTAGAGAGGTTTTTGGTTTAGATTGTGAAGTCAAGGAAGAACGTGTGAAAATGAAGGCTCAGGCTAGTCCCAATGTACATTCTTTTAATACATTAATGATGTGTTTTTATCAAGATGGATTGGTGGGGCGGGTGAAGGAGATATGGGATCAATTAACCGAGTCAAATTCGATTCCGAACAGTTACAGTTATAGTATTCTAATGACGGTTTTCTGTGATCAAAGAAGAATGGTTGAAGCAGAGGAGTTGTGGAAAGAAATGAGATTGAAGAAGTTGGAGCTTGATGCTGTAGCTTATAACACTATAATTGGAGGGTTTTGTAAAGCAGGAAGTATTCAGAGAGCTGAAGAGCTTTTCAGAGAAATGGAACTGAGTGGAATAGAGAGTACTTTCTCCACCTTTGAGCATCTCATCAATGGCTATTGTGAAACTGGAGATATTGACTCTGCATTACTGGTGTATAAGGATATGCGGAGGAAAAATTTTAGTCTGAACGCATCGACGTTGGAAGCGATTGTTAGAGGATTGGTTGCCGAGACTAGACTTTTAGAAGCTTTAGATGTTTTTGGGTTCACCACAGAGGACTCCAACTTTTGCCCAACAATGGAAACTTACGAACTTCTGATAAATGGTTTGTGTCGGGAAGGGGAAATTGAAGCTGCATTTAAGCTTCAGGCGCAGATGGTAGGGAAAGGCTTTAAGCCAGATTCGAAGGTTTACCGTTCTTTTATCGATGCTTATACGAACGAAGGAAATGAAGAAATGGTCGAGAAGTTGAGGAAGGAATTACTTGAAATCCAGCTGAGTTGA

Coding sequence (CDS)

ATGTCCATCCCTCTGCTGAAACGAACCCTGTCGTCAATCCGAAACCCAGGATTTAAGCTCCCATTTTCCCCTTCATTCTCCTCTTCCTCGCCGTCGGCAAAACCCTCGATCTCGACCGTGGTTTCAGTTCTCACTCACCACCGCTCAAAATCCCGCTGGCGATTCCTCAACTCCCTCTGCCCCGACGGCTTCGATCCCGGCGAGTTTTCCGATATCGTCCTCCACATCAAGAACAATCCCCATCTCTCCCTTCGATTCTTTCTCTGGACTCAGAACAAGTCCCTCTGCACTCACAATCTCGTTTCCTACTCAACCGTCATCCACATCCTTGCCCGCGGTCGCCTCAGAACTCACGCCAAGGCCGTTATTCAGACCGCCATTAGGGCTGCGGAGCTCGAAGACGACGATGGCTGTTCCAATTGTAAGCAATTTTCTAGGCCTTTGAGGCTGTTTCAGACTCTCGTCAAGACGTATAAACGGTGTGGTTCTGCTCCCTTCGTGTTTGATTTATTGATTAAAGCTCTCCTGGATTCTAGAAAGCTCGAGCCGGCCATTCAAATTATTAGAATGTTAAGGTCTCGTGGGATTAGCCCGCAGGTTAGTACATTGAATTCGTTGATTTTGTGGGTGTCGAAGTGCGAGGGGGCTAACGCGGGTTATGCTATTTTTAGAGAGGTTTTTGGTTTAGATTGTGAAGTCAAGGAAGAACGTGTGAAAATGAAGGCTCAGGCTAGTCCCAATGTACATTCTTTTAATACATTAATGATGTGTTTTTATCAAGATGGATTGGTGGGGCGGGTGAAGGAGATATGGGATCAATTAACCGAGTCAAATTCGATTCCGAACAGTTACAGTTATAGTATTCTAATGACGGTTTTCTGTGATCAAAGAAGAATGGTTGAAGCAGAGGAGTTGTGGAAAGAAATGAGATTGAAGAAGTTGGAGCTTGATGCTGTAGCTTATAACACTATAATTGGAGGGTTTTGTAAAGCAGGAAGTATTCAGAGAGCTGAAGAGCTTTTCAGAGAAATGGAACTGAGTGGAATAGAGAGTACTTTCTCCACCTTTGAGCATCTCATCAATGGCTATTGTGAAACTGGAGATATTGACTCTGCATTACTGGTGTATAAGGATATGCGGAGGAAAAATTTTAGTCTGAACGCATCGACGTTGGAAGCGATTGTTAGAGGATTGGTTGCCGAGACTAGACTTTTAGAAGCTTTAGATGTTTTTGGGTTCACCACAGAGGACTCCAACTTTTGCCCAACAATGGAAACTTACGAACTTCTGATAAATGGTTTGTGTCGGGAAGGGGAAATTGAAGCTGCATTTAAGCTTCAGGCGCAGATGGTAGGGAAAGGCTTTAAGCCAGATTCGAAGGTTTACCGTTCTTTTATCGATGCTTATACGAACGAAGGAAATGAAGAAATGGTCGAGAAGTTGAGGAAGGAATTACTTGAAATCCAGCTGAGTTGA

Protein sequence

MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSAKPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTVFCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLEIQLS
Homology
BLAST of Moc02g10830 vs. NCBI nr
Match: XP_022148504.1 (pentatricopeptide repeat-containing protein At2g15980 [Momordica charantia])

HSP 1 Score: 979.5 bits (2531), Expect = 1.0e-281
Identity = 491/491 (100.00%), Postives = 491/491 (100.00%), Query Frame = 0

Query: 1   MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSAKPSISTVVSVLTHHRSKSRWRFLNSLC 60
           MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSAKPSISTVVSVLTHHRSKSRWRFLNSLC
Sbjct: 1   MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSAKPSISTVVSVLTHHRSKSRWRFLNSLC 60

Query: 61  PDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGRLRTHAK 120
           PDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGRLRTHAK
Sbjct: 61  PDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGRLRTHAK 120

Query: 121 AVIQTAIRAAELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRK 180
           AVIQTAIRAAELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRK
Sbjct: 121 AVIQTAIRAAELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRK 180

Query: 181 LEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKM 240
           LEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKM
Sbjct: 181 LEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKM 240

Query: 241 KAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTVFCDQRRMV 300
           KAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTVFCDQRRMV
Sbjct: 241 KAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTVFCDQRRMV 300

Query: 301 EAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLI 360
           EAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLI
Sbjct: 301 EAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLI 360

Query: 361 NGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFGFTTEDSNF 420
           NGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFGFTTEDSNF
Sbjct: 361 NGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFGFTTEDSNF 420

Query: 421 CPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEK 480
           CPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEK
Sbjct: 421 CPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEK 480

Query: 481 LRKELLEIQLS 492
           LRKELLEIQLS
Sbjct: 481 LRKELLEIQLS 491

BLAST of Moc02g10830 vs. NCBI nr
Match: KAG6571131.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 827.0 bits (2135), Expect = 8.5e-236
Identity = 405/499 (81.16%), Postives = 457/499 (91.58%), Query Frame = 0

Query: 1   MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSA------KPSISTVVSVLTHHRSKSRWR 60
           MSIPLLKR+L  I N  F LPFSPSF SSSP+A      KPSISTVVSVLTHHRSKSRWR
Sbjct: 1   MSIPLLKRSLWLIPNSTFNLPFSPSFFSSSPAAVPSPSTKPSISTVVSVLTHHRSKSRWR 60

Query: 61  FLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGR 120
           FLNSLCPDGFDPGEFSDIVL IKNN HL LRFFLWT++KSLC H+LVSYSTVIHILARGR
Sbjct: 61  FLNSLCPDGFDPGEFSDIVLQIKNNSHLVLRFFLWTRSKSLCNHDLVSYSTVIHILARGR 120

Query: 121 LRTHAKAVIQTAIRAAELEDDDGCSNCKQF--SRPLRLFQTLVKTYKRCGSAPFVFDLLI 180
           LRTHAK VIQTAIRA  LED DGCS C++F  SRPL+LF+TLVKTYK+CGSAPFVFDLLI
Sbjct: 121 LRTHAKDVIQTAIRATALEDGDGCSKCERFSSSRPLKLFETLVKTYKQCGSAPFVFDLLI 180

Query: 181 KALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCE 240
           KALLDS+KL+PAIQI+RMLRSRGISPQ+ TLNSLILW+SKCEGANAGYA+FREVFGL+CE
Sbjct: 181 KALLDSKKLDPAIQIVRMLRSRGISPQIGTLNSLILWMSKCEGANAGYALFREVFGLNCE 240

Query: 241 VKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV 300
           ++E+ VK+KA+ SPNVH+FNTLM+CFYQDGLVGRVKEIWDQL +SNSIPNSYSYSILM V
Sbjct: 241 IEEQNVKVKARVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLADSNSIPNSYSYSILMAV 300

Query: 301 FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIEST 360
            C+++RM EAEELW+EM++KKLE+DAVAYNTIIGGFCKAG+I+RAEE FREMEL G EST
Sbjct: 301 LCEEKRMGEAEELWEEMKMKKLEIDAVAYNTIIGGFCKAGNIRRAEEFFREMELCGTEST 360

Query: 361 FSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFG 420
           FSTFEHLINGYCE+GD+DSALLVYKDMRRK+FS+N   LEA+ RGL AETRLLEALDVFG
Sbjct: 361 FSTFEHLINGYCESGDVDSALLVYKDMRRKSFSINPLMLEAMTRGLCAETRLLEALDVFG 420

Query: 421 FTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNE 480
           F TED+N CPTMETYELLINGLC++G++EAAFKLQ+QMVGKGFKP+SK+Y+SFIDAY+ E
Sbjct: 421 FATEDANICPTMETYELLINGLCQKGKLEAAFKLQSQMVGKGFKPNSKIYQSFIDAYSKE 480

Query: 481 GNEEMVEKLRKELLEIQLS 492
           GNEEMV+KLR+E+LEIQLS
Sbjct: 481 GNEEMVKKLREEILEIQLS 499

BLAST of Moc02g10830 vs. NCBI nr
Match: KAG7010942.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 826.6 bits (2134), Expect = 1.1e-235
Identity = 405/499 (81.16%), Postives = 456/499 (91.38%), Query Frame = 0

Query: 1   MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSA------KPSISTVVSVLTHHRSKSRWR 60
           MSIPLLKR+L  I N  F LPFSPSF SSSP+A      KPSISTVVSVLTHHRSKSRWR
Sbjct: 1   MSIPLLKRSLWLIPNSTFNLPFSPSFFSSSPAAVPSPSTKPSISTVVSVLTHHRSKSRWR 60

Query: 61  FLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGR 120
           FLNSLCPDGFDPGEFSDIVL IKNN HL LRFFLWT++KSLC H+LVSYSTVIHILARGR
Sbjct: 61  FLNSLCPDGFDPGEFSDIVLQIKNNSHLVLRFFLWTRSKSLCNHDLVSYSTVIHILARGR 120

Query: 121 LRTHAKAVIQTAIRAAELEDDDGCSNCKQF--SRPLRLFQTLVKTYKRCGSAPFVFDLLI 180
           LRTHAK VIQTAIRA  LED DGCS C++F  SRPL+LF+TLVKTYK+CGSAPFVFDLLI
Sbjct: 121 LRTHAKDVIQTAIRATALEDGDGCSKCERFSSSRPLKLFETLVKTYKQCGSAPFVFDLLI 180

Query: 181 KALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCE 240
           KALLDS+KL+PAIQI+RMLRSRGISPQ+ TLNSLILW+SKCEGANAGYA+FREVFGL+CE
Sbjct: 181 KALLDSKKLDPAIQIVRMLRSRGISPQIGTLNSLILWMSKCEGANAGYALFREVFGLNCE 240

Query: 241 VKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV 300
           ++E+ VK+KA+ SPNVH+FNTLM+CFYQDGLVGRVKEIWDQL +SNSIPNSYSYSILM V
Sbjct: 241 IEEQNVKVKARVSPNVHTFNTLMVCFYQDGLVGRVKEIWDQLADSNSIPNSYSYSILMAV 300

Query: 301 FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIEST 360
            C+++RM EAEELW+EM++KKLE+DAVAYNTIIGGFCKAG+I+RAEE FREMEL G EST
Sbjct: 301 LCEEKRMGEAEELWEEMKMKKLEIDAVAYNTIIGGFCKAGNIRRAEEFFREMELCGTEST 360

Query: 361 FSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFG 420
           FSTFEHLINGYCE+GD+DSALLVYKDMRRK+FS+N   LEA+ RGL AETRLLEALDVFG
Sbjct: 361 FSTFEHLINGYCESGDVDSALLVYKDMRRKSFSINPLMLEAMTRGLCAETRLLEALDVFG 420

Query: 421 FTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNE 480
           F TED+N CPTMETYELLINGLC++G++EAAFKLQ QMVGKGFKP+SK+Y+SFIDAY+ E
Sbjct: 421 FATEDANICPTMETYELLINGLCQKGKLEAAFKLQGQMVGKGFKPNSKIYQSFIDAYSKE 480

Query: 481 GNEEMVEKLRKELLEIQLS 492
           GNEEMV+KLR+E+LEIQLS
Sbjct: 481 GNEEMVKKLREEILEIQLS 499

BLAST of Moc02g10830 vs. NCBI nr
Match: XP_038901621.1 (pentatricopeptide repeat-containing protein At2g15980 [Benincasa hispida])

HSP 1 Score: 822.8 bits (2124), Expect = 1.6e-234
Identity = 411/499 (82.36%), Postives = 448/499 (89.78%), Query Frame = 0

Query: 1   MSIPLLKRTLSSIRNPGFKLPFSPSFSSS------SPSAKPSISTVVSVLTHHRSKSRWR 60
           MS+PLL+RTL  IRN  F LPFS SF SS      SPS KPSISTVVSVLTHHRSKSRWR
Sbjct: 27  MSVPLLRRTLWPIRNSTFNLPFSHSFFSSSPPGEPSPSTKPSISTVVSVLTHHRSKSRWR 86

Query: 61  FLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGR 120
           FLNSLCPDGFDPGEFSDI+L IKNNPHL+LRFF WTQNKSLC HNLVSYSTVIHILARGR
Sbjct: 87  FLNSLCPDGFDPGEFSDILLQIKNNPHLALRFFHWTQNKSLCNHNLVSYSTVIHILARGR 146

Query: 121 LRTHAKAVIQTAIRAAELEDDDGCSNCKQF--SRPLRLFQTLVKTYKRCGSAPFVFDLLI 180
           LRTHAK VIQTAIRAAELED D CS C++F  SRPL+LF+TLVKTYKRCGSAPFVFDLLI
Sbjct: 147 LRTHAKDVIQTAIRAAELEDGDDCSKCERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLI 206

Query: 181 KALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCE 240
           KALLDS+KLE AIQI+RMLRSRGISPQV TLNSLIL VSK +GANAGYAIF+EVFGLDCE
Sbjct: 207 KALLDSKKLESAIQIVRMLRSRGISPQVGTLNSLILLVSKFQGANAGYAIFKEVFGLDCE 266

Query: 241 VKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV 300
           ++EE VK+KA  SPNVH+FNTLM CFYQDGLVGRVK+IWDQL +SNSIPNSYSYSILM V
Sbjct: 267 IEEENVKLKAGVSPNVHTFNTLMECFYQDGLVGRVKDIWDQLADSNSIPNSYSYSILMAV 326

Query: 301 FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIEST 360
           FC+++RM EAEELW EM++KKLELDAVAYNTIIGGFCKAG++ RAEE +REMELSGIEST
Sbjct: 327 FCEEKRMGEAEELWVEMKIKKLELDAVAYNTIIGGFCKAGNVHRAEEFYREMELSGIEST 386

Query: 361 FSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFG 420
           FSTFEHLINGYCETGD+DSALLVYKDMRRK F+ NA  LE ++RGL AETRLLEALDVF 
Sbjct: 387 FSTFEHLINGYCETGDVDSALLVYKDMRRKRFTPNALILEELIRGLCAETRLLEALDVFC 446

Query: 421 FTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNE 480
           F  EDSNFCPT+ETYELLINGLC+EG+IE AFKLQAQMVGKGFKP+ K+Y+SFIDAY  E
Sbjct: 447 FAIEDSNFCPTLETYELLINGLCQEGKIEVAFKLQAQMVGKGFKPNLKIYQSFIDAYIKE 506

Query: 481 GNEEMVEKLRKELLEIQLS 492
           GNEEMVEKL KE+LEIQLS
Sbjct: 507 GNEEMVEKLGKEILEIQLS 525

BLAST of Moc02g10830 vs. NCBI nr
Match: XP_022944388.1 (pentatricopeptide repeat-containing protein At2g15980 [Cucurbita moschata])

HSP 1 Score: 822.8 bits (2124), Expect = 1.6e-234
Identity = 405/499 (81.16%), Postives = 454/499 (90.98%), Query Frame = 0

Query: 1   MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSA------KPSISTVVSVLTHHRSKSRWR 60
           MSIPLLKR+L  I N  F LPFSPSF SSSP+A      KPSISTVVSVLTHHRSKSRWR
Sbjct: 1   MSIPLLKRSLWLIPNSTFNLPFSPSFFSSSPAAVPSPSTKPSISTVVSVLTHHRSKSRWR 60

Query: 61  FLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGR 120
           FLNSLCPDGFDPGEFSDIVL IKNN HL LRFFLWT++KSLC H+LVSYSTVIHILARGR
Sbjct: 61  FLNSLCPDGFDPGEFSDIVLQIKNNSHLVLRFFLWTRSKSLCNHDLVSYSTVIHILARGR 120

Query: 121 LRTHAKAVIQTAIRAAELEDDDGCSNCKQF--SRPLRLFQTLVKTYKRCGSAPFVFDLLI 180
           LRT AK VIQTAIRA  LED D CS C++F  SRPL+LF+TLVKTYK+CGSAPFVFDLLI
Sbjct: 121 LRTLAKDVIQTAIRATALEDGDDCSKCERFSSSRPLKLFETLVKTYKQCGSAPFVFDLLI 180

Query: 181 KALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCE 240
           KALLDS+KL+PAIQI+RMLRSRGISPQ+ TLNSLILW+SKCEGANAGYA+FREVFGL+CE
Sbjct: 181 KALLDSKKLDPAIQIVRMLRSRGISPQIGTLNSLILWMSKCEGANAGYALFREVFGLNCE 240

Query: 241 VKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV 300
           ++E+ VK+KA+ SPNVH+FNTLM+CFYQDGLVGR KEIWDQL +SNSIPNSYSYSILM V
Sbjct: 241 IEEQNVKVKARVSPNVHTFNTLMVCFYQDGLVGRAKEIWDQLADSNSIPNSYSYSILMAV 300

Query: 301 FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIEST 360
            C+++RM EAEELW+EM++KKLE+DAVAYNTIIGGFCKAG+I+RAEE FREMEL G EST
Sbjct: 301 LCEEKRMGEAEELWEEMKMKKLEIDAVAYNTIIGGFCKAGNIRRAEEFFREMELCGTEST 360

Query: 361 FSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFG 420
           FSTFEHLINGYCETGD+DSALLVYKDMRRK+FSLN   LEA+ RGL AETRLLEALD+FG
Sbjct: 361 FSTFEHLINGYCETGDVDSALLVYKDMRRKSFSLNHLMLEAMTRGLCAETRLLEALDIFG 420

Query: 421 FTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNE 480
           F TED+N CPTMETYELLINGLC+EG++EAAFKLQAQMVGKGFKP+SK+Y+SFIDAY+ E
Sbjct: 421 FATEDANICPTMETYELLINGLCQEGKLEAAFKLQAQMVGKGFKPNSKIYQSFIDAYSKE 480

Query: 481 GNEEMVEKLRKELLEIQLS 492
           GNEEMV+KLR+E+LEIQLS
Sbjct: 481 GNEEMVKKLREEILEIQLS 499

BLAST of Moc02g10830 vs. ExPASy Swiss-Prot
Match: Q9XIM8 (Pentatricopeptide repeat-containing protein At2g15980 OS=Arabidopsis thaliana OX=3702 GN=At2g15980 PE=2 SV=1)

HSP 1 Score: 457.2 bits (1175), Expect = 2.3e-127
Identity = 233/496 (46.98%), Postives = 332/496 (66.94%), Query Frame = 0

Query: 1   MSIPLLKRTLSSIRNPGFKLPFSPSF-----SSSSPSAKPSISTVVSVLTHHRSKSRWRF 60
           MS  +L+R L   R P      S S      S  SP + P IS  VS+LTHHRSKSRW  
Sbjct: 1   MSTSILRRILDPTRKPKPDAILSISLLTTVSSPPSPPSDPLISDAVSILTHHRSKSRWST 60

Query: 61  LNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGRL 120
           L SL P GF P +FS+I L ++NNPHLSLRFFL+T+  SLC+H+  S ST+IHIL+R RL
Sbjct: 61  LRSLQPSGFTPSQFSEITLCLRNNPHLSLRFFLFTRRYSLCSHDTHSCSTLIHILSRSRL 120

Query: 121 RTHAKAVIQTAIRAAELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKAL 180
           ++HA  +I+ A+R A  ++D+         R L++F++L+K+Y RCGSAPFVFDLLIK+ 
Sbjct: 121 KSHASEIIRLALRLAATDEDE--------DRVLKVFRSLIKSYNRCGSAPFVFDLLIKSC 180

Query: 181 LDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKE 240
           LDS++++ A+ ++R LRSRGI+ Q+ST N+LI  VS+  GA+ GY ++REVFGLD    +
Sbjct: 181 LDSKEIDGAVMVMRKLRSRGINAQISTCNALITEVSRRRGASNGYKMYREVFGLDDVSVD 240

Query: 241 ERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTES-NSIPNSYSYSILMTVFC 300
           E  KM  +  PN  +FN++M+ FY++G    V+ IW ++ E     PN YSY++LM  +C
Sbjct: 241 EAKKMIGKIKPNATTFNSMMVSFYREGETEMVERIWREMEEEVGCSPNVYSYNVLMEAYC 300

Query: 301 DQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFS 360
            +  M EAE++W+EM+++ +  D VAYNT+IGG C    + +A+ELFR+M L GIE T  
Sbjct: 301 ARGLMSEAEKVWEEMKVRGVVYDIVAYNTMIGGLCSNFEVVKAKELFRDMGLKGIECTCL 360

Query: 361 TFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAE---TRLLEALDVF 420
           T+EHL+NGYC+ GD+DS L+VY++M+RK F  +  T+EA+V GL  +    R++EA D+ 
Sbjct: 361 TYEHLVNGYCKAGDVDSGLVVYREMKRKGFEADGLTIEALVEGLCDDRDGQRVVEAADIV 420

Query: 421 GFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTN 480
                ++ F P+   YELL+  LC +G+++ A  +QA+MVGKGFKP  + YR+FID Y  
Sbjct: 421 KDAVREAMFYPSRNCYELLVKRLCEDGKMDRALNIQAEMVGKGFKPSQETYRAFIDGYGI 480

Query: 481 EGNEEMVEKLRKELLE 488
            G+EE    L  E+ E
Sbjct: 481 VGDEETSALLAIEMAE 488

BLAST of Moc02g10830 vs. ExPASy Swiss-Prot
Match: Q9SZ10 (Pentatricopeptide repeat-containing protein At4g26680, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At4g26680 PE=3 SV=1)

HSP 1 Score: 165.2 bits (417), Expect = 1.8e-39
Identity = 109/457 (23.85%), Postives = 204/457 (44.64%), Query Frame = 0

Query: 30  SPSAKPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLW 89
           +P  K      V+V   H  +S W  LN L  D  D     +++L I+ +  LSL FF W
Sbjct: 46  NPEPKGQDLDFVNVAHSHLIQSDWDKLNKL-SDHLDSFRVKNVLLKIQKDYLLSLEFFNW 105

Query: 90  TQNKSLCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFSRPLR 149
            + ++  +H+L +++ V+H L + R    A+++++  +    ++             P +
Sbjct: 106 AKTRNPGSHSLETHAIVLHTLTKNRKFKSAESILRDVLVNGGVD------------LPAK 165

Query: 150 LFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILW 209
           +F  L+ +Y+ C S P VFD L K     +K   A      ++  G  P V + N+   +
Sbjct: 166 VFDALLYSYRECDSTPRVFDSLFKTFAHLKKFRNATDTFMQMKDYGFLPTVESCNA---Y 225

Query: 210 VSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKE 269
           +S   G             +D  ++  R   + + SPN ++ N +M  + + G + +  E
Sbjct: 226 MSSLLGQGR----------VDIALRFYREMRRCKISPNPYTLNMVMSGYCRSGKLDKGIE 285

Query: 270 IWDQLTESNSIPNSYSYSILMTVFCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFC 329
           +   +          SY+ L+   C++  +  A +L   M    L+ + V +NT+I GFC
Sbjct: 286 LLQDMERLGFRATDVSYNTLIAGHCEKGLLSSALKLKNMMGKSGLQPNVVTFNTLIHGFC 345

Query: 330 KAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNAS 389
           +A  +Q A ++F EM+   +     T+  LINGY + GD + A   Y+DM       +  
Sbjct: 346 RAMKLQEASKVFGEMKAVNVAPNTVTYNTLINGYSQQGDHEMAFRFYEDMVCNGIQRDIL 405

Query: 390 TLEAIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQ 449
           T  A++ GL  + +  +A   F    +  N  P   T+  LI G C     +  F+L   
Sbjct: 406 TYNALIFGLCKQAKTRKAAQ-FVKELDKENLVPNSSTFSALIMGQCVRKNADRGFELYKS 465

Query: 450 MVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELL 487
           M+  G  P+ + +   + A+    + +   ++ +E++
Sbjct: 466 MIRSGCHPNEQTFNMLVSAFCRNEDFDGASQVLREMV 475

BLAST of Moc02g10830 vs. ExPASy Swiss-Prot
Match: Q9SXD1 (Pentatricopeptide repeat-containing protein At1g62670, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g62670 PE=3 SV=2)

HSP 1 Score: 158.7 bits (400), Expect = 1.7e-37
Identity = 107/395 (27.09%), Postives = 196/395 (49.62%), Query Frame = 0

Query: 99  NLVSYSTVIHILARGRLRTHAKAVIQTAIRAA---ELEDDDGCSN--CKQFSRPLRLFQT 158
           N V+++T+IH L      + A A+I   +      +L       N  CK+    L  F  
Sbjct: 185 NTVTFNTLIHGLFLHNKASEAMALIDRMVAKGCQPDLVTYGVVVNGLCKRGDTDL-AFNL 244

Query: 159 LVKTYK-RCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSK 218
           L K  + +      +++ +I  L   + ++ A+ + + + ++GI P V T +SLI     
Sbjct: 245 LNKMEQGKLEPGVLIYNTIIDGLCKYKHMDDALNLFKEMETKGIRPNVVTYSSLI----- 304

Query: 219 CEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWD 278
                  Y  + +   L  ++ E ++      +P+V +F+ L+  F ++G +   ++++D
Sbjct: 305 --SCLCNYGRWSDASRLLSDMIERKI------NPDVFTFSALIDAFVKEGKLVEAEKLYD 364

Query: 279 QLTESNSIPNSYSYSILMTVFCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAG 338
           ++ + +  P+  +YS L+  FC   R+ EA+++++ M  K    D V YNT+I GFCK  
Sbjct: 365 EMVKRSIDPSIVTYSSLINGFCMHDRLDEAKQMFEFMVSKHCFPDVVTYNTLIKGFCKYK 424

Query: 339 SIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLE 398
            ++   E+FREM   G+     T+  LI G  + GD D A  ++K+M       N  T  
Sbjct: 425 RVEEGMEVFREMSQRGLVGNTVTYNILIQGLFQAGDCDMAQEIFKEMVSDGVPPNIMTYN 484

Query: 399 AIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVG 458
            ++ GL    +L +A+ VF +  + S   PT+ TY ++I G+C+ G++E  + L   +  
Sbjct: 485 TLLDGLCKNGKLEKAMVVFEY-LQRSKMEPTIYTYNIMIEGMCKAGKVEDGWDLFCNLSL 544

Query: 459 KGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLE 488
           KG KPD   Y + I  +  +G++E  + L KE+ E
Sbjct: 545 KGVKPDVVAYNTMISGFCRKGSKEEADALFKEMKE 564

BLAST of Moc02g10830 vs. ExPASy Swiss-Prot
Match: Q0WVK7 (Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g05670 PE=2 SV=1)

HSP 1 Score: 158.3 bits (399), Expect = 2.2e-37
Identity = 126/474 (26.58%), Postives = 206/474 (43.46%), Query Frame = 0

Query: 72  IVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAE 131
           +++ IK +  L L FF W +++     NL S   VIH+    +    A+++I +     +
Sbjct: 93  VLMKIKCDYRLVLDFFDWARSRR--DSNLESLCIVIHLAVASKDLKVAQSLISSFWERPK 152

Query: 132 LEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRML 191
           L   D           ++ F  LV TYK  GS P VFD+  + L+D   L  A ++   +
Sbjct: 153 LNVTDSF---------VQFFDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKM 212

Query: 192 RSRGISPQVSTLNSLILWVSK-CEGANAGYAIFRE---------------VFGLDCE--- 251
            + G+   V + N  +  +SK C        +FRE               V    C+   
Sbjct: 213 LNYGLVLSVDSCNVYLTRLSKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGR 272

Query: 252 VKEER-----VKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYS 311
           +KE       +++K   +P+V S++T++  + + G + +V ++ + +      PNSY Y 
Sbjct: 273 IKEAHHLLLLMELKGY-TPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYG 332

Query: 312 ILMTVFCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAE--------- 371
            ++ + C   ++ EAEE + EM  + +  D V Y T+I GFCK G I+ A          
Sbjct: 333 SIIGLLCRICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSR 392

Query: 372 --------------------------ELFREMELSGIESTFSTFEHLINGYCETGDIDSA 431
                                     +LF EM   G+E    TF  LINGYC+ G +  A
Sbjct: 393 DITPDVLTYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDA 452

Query: 432 LLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLIN 487
             V+  M +   S N  T   ++ GL  E  L  A ++           P + TY  ++N
Sbjct: 453 FRVHNHMIQAGCSPNVVTYTTLIDGLCKEGDLDSANELL-HEMWKIGLQPNIFTYNSIVN 512

BLAST of Moc02g10830 vs. ExPASy Swiss-Prot
Match: Q9SH26 (Pentatricopeptide repeat-containing protein At1g63400 OS=Arabidopsis thaliana OX=3702 GN=At1g63400 PE=2 SV=1)

HSP 1 Score: 157.9 bits (398), Expect = 2.9e-37
Identity = 92/321 (28.66%), Postives = 168/321 (52.34%), Query Frame = 0

Query: 167 VFDLLIKALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGYAIFREV 226
           ++  +I +L   R  + A+ +   + ++G+ P V T +SLI            Y  + + 
Sbjct: 262 IYSTVIDSLCKYRHEDDALNLFTEMENKGVRPNVITYSSLI-------SCLCNYERWSDA 321

Query: 227 FGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSY 286
             L  ++ E ++      +PNV +FN L+  F ++G +   ++++D++ + +  P+ ++Y
Sbjct: 322 SRLLSDMIERKI------NPNVVTFNALIDAFVKEGKLVEAEKLYDEMIKRSIDPDIFTY 381

Query: 287 SILMTVFCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMEL 346
           S L+  FC   R+ EA+ +++ M  K    + V YNT+I GFCKA  I    ELFREM  
Sbjct: 382 SSLINGFCMHDRLDEAKHMFELMISKDCFPNVVTYNTLINGFCKAKRIDEGVELFREMSQ 441

Query: 347 SGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLE 406
            G+     T+  LI+G+ +  D D+A +V+K M       N  T   ++ GL    +L +
Sbjct: 442 RGLVGNTVTYTTLIHGFFQARDCDNAQMVFKQMVSDGVHPNIMTYNTLLDGLCKNGKLEK 501

Query: 407 ALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFI 466
           A+ VF +  + S   PT+ TY ++I G+C+ G++E  + L   +  KG KPD  +Y + I
Sbjct: 502 AMVVFEY-LQRSKMEPTIYTYNIMIEGMCKAGKVEDGWDLFCSLSLKGVKPDVIIYNTMI 561

Query: 467 DAYTNEGNEEMVEKLRKELLE 488
             +  +G +E  + L +++ E
Sbjct: 562 SGFCRKGLKEEADALFRKMRE 568

BLAST of Moc02g10830 vs. ExPASy TrEMBL
Match: A0A6J1D472 (pentatricopeptide repeat-containing protein At2g15980 OS=Momordica charantia OX=3673 GN=LOC111017136 PE=4 SV=1)

HSP 1 Score: 979.5 bits (2531), Expect = 4.9e-282
Identity = 491/491 (100.00%), Postives = 491/491 (100.00%), Query Frame = 0

Query: 1   MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSAKPSISTVVSVLTHHRSKSRWRFLNSLC 60
           MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSAKPSISTVVSVLTHHRSKSRWRFLNSLC
Sbjct: 1   MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSAKPSISTVVSVLTHHRSKSRWRFLNSLC 60

Query: 61  PDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGRLRTHAK 120
           PDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGRLRTHAK
Sbjct: 61  PDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGRLRTHAK 120

Query: 121 AVIQTAIRAAELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRK 180
           AVIQTAIRAAELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRK
Sbjct: 121 AVIQTAIRAAELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRK 180

Query: 181 LEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKM 240
           LEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKM
Sbjct: 181 LEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKEERVKM 240

Query: 241 KAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTVFCDQRRMV 300
           KAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTVFCDQRRMV
Sbjct: 241 KAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTVFCDQRRMV 300

Query: 301 EAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLI 360
           EAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLI
Sbjct: 301 EAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFSTFEHLI 360

Query: 361 NGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFGFTTEDSNF 420
           NGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFGFTTEDSNF
Sbjct: 361 NGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFGFTTEDSNF 420

Query: 421 CPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEK 480
           CPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEK
Sbjct: 421 CPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNEGNEEMVEK 480

Query: 481 LRKELLEIQLS 492
           LRKELLEIQLS
Sbjct: 481 LRKELLEIQLS 491

BLAST of Moc02g10830 vs. ExPASy TrEMBL
Match: A0A6J1FY05 (pentatricopeptide repeat-containing protein At2g15980 OS=Cucurbita moschata OX=3662 GN=LOC111448850 PE=4 SV=1)

HSP 1 Score: 822.8 bits (2124), Expect = 7.7e-235
Identity = 405/499 (81.16%), Postives = 454/499 (90.98%), Query Frame = 0

Query: 1   MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSA------KPSISTVVSVLTHHRSKSRWR 60
           MSIPLLKR+L  I N  F LPFSPSF SSSP+A      KPSISTVVSVLTHHRSKSRWR
Sbjct: 1   MSIPLLKRSLWLIPNSTFNLPFSPSFFSSSPAAVPSPSTKPSISTVVSVLTHHRSKSRWR 60

Query: 61  FLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGR 120
           FLNSLCPDGFDPGEFSDIVL IKNN HL LRFFLWT++KSLC H+LVSYSTVIHILARGR
Sbjct: 61  FLNSLCPDGFDPGEFSDIVLQIKNNSHLVLRFFLWTRSKSLCNHDLVSYSTVIHILARGR 120

Query: 121 LRTHAKAVIQTAIRAAELEDDDGCSNCKQF--SRPLRLFQTLVKTYKRCGSAPFVFDLLI 180
           LRT AK VIQTAIRA  LED D CS C++F  SRPL+LF+TLVKTYK+CGSAPFVFDLLI
Sbjct: 121 LRTLAKDVIQTAIRATALEDGDDCSKCERFSSSRPLKLFETLVKTYKQCGSAPFVFDLLI 180

Query: 181 KALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCE 240
           KALLDS+KL+PAIQI+RMLRSRGISPQ+ TLNSLILW+SKCEGANAGYA+FREVFGL+CE
Sbjct: 181 KALLDSKKLDPAIQIVRMLRSRGISPQIGTLNSLILWMSKCEGANAGYALFREVFGLNCE 240

Query: 241 VKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV 300
           ++E+ VK+KA+ SPNVH+FNTLM+CFYQDGLVGR KEIWDQL +SNSIPNSYSYSILM V
Sbjct: 241 IEEQNVKVKARVSPNVHTFNTLMVCFYQDGLVGRAKEIWDQLADSNSIPNSYSYSILMAV 300

Query: 301 FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIEST 360
            C+++RM EAEELW+EM++KKLE+DAVAYNTIIGGFCKAG+I+RAEE FREMEL G EST
Sbjct: 301 LCEEKRMGEAEELWEEMKMKKLEIDAVAYNTIIGGFCKAGNIRRAEEFFREMELCGTEST 360

Query: 361 FSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFG 420
           FSTFEHLINGYCETGD+DSALLVYKDMRRK+FSLN   LEA+ RGL AETRLLEALD+FG
Sbjct: 361 FSTFEHLINGYCETGDVDSALLVYKDMRRKSFSLNHLMLEAMTRGLCAETRLLEALDIFG 420

Query: 421 FTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNE 480
           F TED+N CPTMETYELLINGLC+EG++EAAFKLQAQMVGKGFKP+SK+Y+SFIDAY+ E
Sbjct: 421 FATEDANICPTMETYELLINGLCQEGKLEAAFKLQAQMVGKGFKPNSKIYQSFIDAYSKE 480

Query: 481 GNEEMVEKLRKELLEIQLS 492
           GNEEMV+KLR+E+LEIQLS
Sbjct: 481 GNEEMVKKLREEILEIQLS 499

BLAST of Moc02g10830 vs. ExPASy TrEMBL
Match: A0A6J1JGQ1 (pentatricopeptide repeat-containing protein At2g15980 OS=Cucurbita maxima OX=3661 GN=LOC111484227 PE=4 SV=1)

HSP 1 Score: 822.4 bits (2123), Expect = 1.0e-234
Identity = 406/499 (81.36%), Postives = 454/499 (90.98%), Query Frame = 0

Query: 1   MSIPLLKRTLSSIRNPGFKLPFSPSFSSSSPSA------KPSISTVVSVLTHHRSKSRWR 60
           MSIPLLKR+L  I N  F LPFSPSF SSSP+A      KPSISTVVSVLTHHRSKSRWR
Sbjct: 1   MSIPLLKRSLWLIPNSTFNLPFSPSFFSSSPAAVPLPSTKPSISTVVSVLTHHRSKSRWR 60

Query: 61  FLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGR 120
           FLNSLCPDGFDPGEFSDIVL IKNN HL LRFFLWT++KSLC H+LVSYSTVIHILARGR
Sbjct: 61  FLNSLCPDGFDPGEFSDIVLQIKNNSHLVLRFFLWTRSKSLCNHDLVSYSTVIHILARGR 120

Query: 121 LRTHAKAVIQTAIRAAELEDDDGCSNCKQF--SRPLRLFQTLVKTYKRCGSAPFVFDLLI 180
           LRTHAK VIQ AIRA  LEDDD CS C++F  SRPL+LF+TLVKTYK+CGSAPFVFDLLI
Sbjct: 121 LRTHAKDVIQNAIRATALEDDDDCSQCERFSSSRPLKLFETLVKTYKQCGSAPFVFDLLI 180

Query: 181 KALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCE 240
           KALLDS+KL+PAIQI+RMLRSRGISPQ+ TLNSLIL +SKCEGANAGYA+FREVFGL+CE
Sbjct: 181 KALLDSKKLDPAIQIVRMLRSRGISPQIGTLNSLILCMSKCEGANAGYALFREVFGLNCE 240

Query: 241 VKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV 300
           ++EE VK+KA+ASPNVH+FNTLM+CFYQDGLVGRVKEIWDQL +SNSIPNSYSYSILM V
Sbjct: 241 IEEENVKVKARASPNVHTFNTLMVCFYQDGLVGRVKEIWDQLADSNSIPNSYSYSILMAV 300

Query: 301 FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIEST 360
            C+++RM EAEELW+EM++KKLE+DAVAYNTIIGGFCKAG+++RAEE FREMEL G EST
Sbjct: 301 LCEEKRMGEAEELWEEMKMKKLEIDAVAYNTIIGGFCKAGNVRRAEEFFREMELCGTEST 360

Query: 361 FSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFG 420
           FSTFEHLINGYCETGD+DSALLVYKDMRRK+FSLN   LEAI RGL  ETRLLEALDVFG
Sbjct: 361 FSTFEHLINGYCETGDVDSALLVYKDMRRKSFSLNPLMLEAITRGLCVETRLLEALDVFG 420

Query: 421 FTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNE 480
           F TE +NFCPTMETYELLINGLC++G++EAAFKLQAQMVGKGFKP+SK+Y+SFIDAY+ E
Sbjct: 421 FATEHTNFCPTMETYELLINGLCQKGKLEAAFKLQAQMVGKGFKPNSKIYQSFIDAYSKE 480

Query: 481 GNEEMVEKLRKELLEIQLS 492
           GNEEMV+KL +E+LEIQLS
Sbjct: 481 GNEEMVKKLGEEILEIQLS 499

BLAST of Moc02g10830 vs. ExPASy TrEMBL
Match: A0A5D3CQ25 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold299G001590 PE=4 SV=1)

HSP 1 Score: 786.2 bits (2029), Expect = 8.0e-224
Identity = 394/499 (78.96%), Postives = 436/499 (87.37%), Query Frame = 0

Query: 1   MSIPLLKRTLSSIRNPGFKLPFSPSFSSS------SPSAKPSISTVVSVLTHHRSKSRWR 60
           MS PLLKRTL  I N    L FS SF SS      SPS KPSISTVVSVLTH RSKSRWR
Sbjct: 1   MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWR 60

Query: 61  FLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGR 120
           FLNSLCP+GFDPGEFSDIVL IKNNPHL+LRFFLWTQNKSLC HNL+SYST+IHILARGR
Sbjct: 61  FLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKSLCNHNLISYSTLIHILARGR 120

Query: 121 LRTHAKAVIQTAIRAAELEDDDGCSNCKQF--SRPLRLFQTLVKTYKRCGSAPFVFDLLI 180
           LRTHAK VIQTAIRAAELED D  S  ++F  SRPL+LF+TLVKTYKRCGSAPFVFDLLI
Sbjct: 121 LRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLI 180

Query: 181 KALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCE 240
           KALLDS+KL+ +I+I+RMLRSRGISPQVSTLNSLIL VSKC+GAN  YAIF EVFGLDCE
Sbjct: 181 KALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCE 240

Query: 241 VKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV 300
           +++E VK+K + SPNVH+FNTLM CFYQDG VGRVKEIWDQL +SNSIPNSYSYSILM V
Sbjct: 241 IEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV 300

Query: 301 FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIEST 360
            C+++RM EAEELW+EM++KKLELD VAYNTIIGGFCKAG+ QRAEE +REMELSGIEST
Sbjct: 301 LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNAQRAEEFYREMELSGIEST 360

Query: 361 FSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFG 420
           FST EHLINGYC+TGD+DSALLVYKDMRRK FSLNASTLE ++  L AE RLLEALDVFG
Sbjct: 361 FSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFG 420

Query: 421 FTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNE 480
           F  EDS+FCPTMET+E+LIN LC+EG+IE AFKLQAQMVGKGFKP+ K+Y+SFIDAY  E
Sbjct: 421 FAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKE 480

Query: 481 GNEEMVEKLRKELLEIQLS 492
           GN EMVEKL KE+ EIQLS
Sbjct: 481 GNAEMVEKLGKEMHEIQLS 499

BLAST of Moc02g10830 vs. ExPASy TrEMBL
Match: A0A5A7TQU6 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold243G004860 PE=4 SV=1)

HSP 1 Score: 786.2 bits (2029), Expect = 8.0e-224
Identity = 394/499 (78.96%), Postives = 436/499 (87.37%), Query Frame = 0

Query: 1   MSIPLLKRTLSSIRNPGFKLPFSPSFSSS------SPSAKPSISTVVSVLTHHRSKSRWR 60
           MS PLLKRTL  I N    L FS SF SS      SPS KPSISTVVSVLTH RSKSRWR
Sbjct: 1   MSGPLLKRTLRPIGNSTVNLQFSSSFFSSSPPAEPSPSTKPSISTVVSVLTHQRSKSRWR 60

Query: 61  FLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGR 120
           FLNSLCP+GFDPGEFSDIVL IKNNPHL+LRFFLWTQNKSLC HNL+SYST+IHILARGR
Sbjct: 61  FLNSLCPNGFDPGEFSDIVLQIKNNPHLALRFFLWTQNKSLCNHNLISYSTLIHILARGR 120

Query: 121 LRTHAKAVIQTAIRAAELEDDDGCSNCKQF--SRPLRLFQTLVKTYKRCGSAPFVFDLLI 180
           LRTHAK VIQTAIRAAELED D  S  ++F  SRPL+LF+TLVKTYKRCGSAPFVFDLLI
Sbjct: 121 LRTHAKDVIQTAIRAAELEDSDNYSESERFSSSRPLKLFETLVKTYKRCGSAPFVFDLLI 180

Query: 181 KALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCE 240
           KALLDS+KL+ +I+I+RMLRSRGISPQVSTLNSLIL VSKC+GAN  YAIF EVFGLDCE
Sbjct: 181 KALLDSKKLDSSIEIVRMLRSRGISPQVSTLNSLILLVSKCQGANVAYAIFTEVFGLDCE 240

Query: 241 VKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYSILMTV 300
           +++E VK+K + SPNVH+FNTLM CFYQDG VGRVKEIWDQL +SNSIPNSYSYSILM V
Sbjct: 241 IEKEHVKLKGRVSPNVHTFNTLMDCFYQDGFVGRVKEIWDQLADSNSIPNSYSYSILMAV 300

Query: 301 FCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIEST 360
            C+++RM EAEELW+EM++KKLELD VAYNTIIGGFCKAG+ QRAEE +REMELSGIEST
Sbjct: 301 LCEEKRMGEAEELWEEMKMKKLELDVVAYNTIIGGFCKAGNTQRAEEFYREMELSGIEST 360

Query: 361 FSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFG 420
           FST EHLINGYC+TGD+DSALLVYKDMRRK FSLNASTLE ++  L AE RLLEALDVFG
Sbjct: 361 FSTLEHLINGYCDTGDVDSALLVYKDMRRKRFSLNASTLEGLIEVLCAERRLLEALDVFG 420

Query: 421 FTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTNE 480
           F  EDS+FCPTMET+E+LIN LC+EG+IE AFKLQAQMVGKGFKP+ K+Y+SFIDAY  E
Sbjct: 421 FAVEDSSFCPTMETFEVLINWLCQEGKIEGAFKLQAQMVGKGFKPNLKIYQSFIDAYMKE 480

Query: 481 GNEEMVEKLRKELLEIQLS 492
           GN EMVEKL KE+ EIQLS
Sbjct: 481 GNAEMVEKLGKEMHEIQLS 499

BLAST of Moc02g10830 vs. TAIR 10
Match: AT2G15980.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 457.2 bits (1175), Expect = 1.6e-128
Identity = 233/496 (46.98%), Postives = 332/496 (66.94%), Query Frame = 0

Query: 1   MSIPLLKRTLSSIRNPGFKLPFSPSF-----SSSSPSAKPSISTVVSVLTHHRSKSRWRF 60
           MS  +L+R L   R P      S S      S  SP + P IS  VS+LTHHRSKSRW  
Sbjct: 1   MSTSILRRILDPTRKPKPDAILSISLLTTVSSPPSPPSDPLISDAVSILTHHRSKSRWST 60

Query: 61  LNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGRL 120
           L SL P GF P +FS+I L ++NNPHLSLRFFL+T+  SLC+H+  S ST+IHIL+R RL
Sbjct: 61  LRSLQPSGFTPSQFSEITLCLRNNPHLSLRFFLFTRRYSLCSHDTHSCSTLIHILSRSRL 120

Query: 121 RTHAKAVIQTAIRAAELEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKAL 180
           ++HA  +I+ A+R A  ++D+         R L++F++L+K+Y RCGSAPFVFDLLIK+ 
Sbjct: 121 KSHASEIIRLALRLAATDEDE--------DRVLKVFRSLIKSYNRCGSAPFVFDLLIKSC 180

Query: 181 LDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSKCEGANAGYAIFREVFGLDCEVKE 240
           LDS++++ A+ ++R LRSRGI+ Q+ST N+LI  VS+  GA+ GY ++REVFGLD    +
Sbjct: 181 LDSKEIDGAVMVMRKLRSRGINAQISTCNALITEVSRRRGASNGYKMYREVFGLDDVSVD 240

Query: 241 ERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTES-NSIPNSYSYSILMTVFC 300
           E  KM  +  PN  +FN++M+ FY++G    V+ IW ++ E     PN YSY++LM  +C
Sbjct: 241 EAKKMIGKIKPNATTFNSMMVSFYREGETEMVERIWREMEEEVGCSPNVYSYNVLMEAYC 300

Query: 301 DQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAEELFREMELSGIESTFS 360
            +  M EAE++W+EM+++ +  D VAYNT+IGG C    + +A+ELFR+M L GIE T  
Sbjct: 301 ARGLMSEAEKVWEEMKVRGVVYDIVAYNTMIGGLCSNFEVVKAKELFRDMGLKGIECTCL 360

Query: 361 TFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLEAIVRGLVAE---TRLLEALDVF 420
           T+EHL+NGYC+ GD+DS L+VY++M+RK F  +  T+EA+V GL  +    R++EA D+ 
Sbjct: 361 TYEHLVNGYCKAGDVDSGLVVYREMKRKGFEADGLTIEALVEGLCDDRDGQRVVEAADIV 420

Query: 421 GFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVGKGFKPDSKVYRSFIDAYTN 480
                ++ F P+   YELL+  LC +G+++ A  +QA+MVGKGFKP  + YR+FID Y  
Sbjct: 421 KDAVREAMFYPSRNCYELLVKRLCEDGKMDRALNIQAEMVGKGFKPSQETYRAFIDGYGI 480

Query: 481 EGNEEMVEKLRKELLE 488
            G+EE    L  E+ E
Sbjct: 481 VGDEETSALLAIEMAE 488

BLAST of Moc02g10830 vs. TAIR 10
Match: AT4G26680.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 165.2 bits (417), Expect = 1.3e-40
Identity = 109/457 (23.85%), Postives = 204/457 (44.64%), Query Frame = 0

Query: 30  SPSAKPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLW 89
           +P  K      V+V   H  +S W  LN L  D  D     +++L I+ +  LSL FF W
Sbjct: 46  NPEPKGQDLDFVNVAHSHLIQSDWDKLNKL-SDHLDSFRVKNVLLKIQKDYLLSLEFFNW 105

Query: 90  TQNKSLCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFSRPLR 149
            + ++  +H+L +++ V+H L + R    A+++++  +    ++             P +
Sbjct: 106 AKTRNPGSHSLETHAIVLHTLTKNRKFKSAESILRDVLVNGGVD------------LPAK 165

Query: 150 LFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILW 209
           +F  L+ +Y+ C S P VFD L K     +K   A      ++  G  P V + N+   +
Sbjct: 166 VFDALLYSYRECDSTPRVFDSLFKTFAHLKKFRNATDTFMQMKDYGFLPTVESCNA---Y 225

Query: 210 VSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKE 269
           +S   G             +D  ++  R   + + SPN ++ N +M  + + G + +  E
Sbjct: 226 MSSLLGQGR----------VDIALRFYREMRRCKISPNPYTLNMVMSGYCRSGKLDKGIE 285

Query: 270 IWDQLTESNSIPNSYSYSILMTVFCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFC 329
           +   +          SY+ L+   C++  +  A +L   M    L+ + V +NT+I GFC
Sbjct: 286 LLQDMERLGFRATDVSYNTLIAGHCEKGLLSSALKLKNMMGKSGLQPNVVTFNTLIHGFC 345

Query: 330 KAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNAS 389
           +A  +Q A ++F EM+   +     T+  LINGY + GD + A   Y+DM       +  
Sbjct: 346 RAMKLQEASKVFGEMKAVNVAPNTVTYNTLINGYSQQGDHEMAFRFYEDMVCNGIQRDIL 405

Query: 390 TLEAIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQ 449
           T  A++ GL  + +  +A   F    +  N  P   T+  LI G C     +  F+L   
Sbjct: 406 TYNALIFGLCKQAKTRKAAQ-FVKELDKENLVPNSSTFSALIMGQCVRKNADRGFELYKS 465

Query: 450 MVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELL 487
           M+  G  P+ + +   + A+    + +   ++ +E++
Sbjct: 466 MIRSGCHPNEQTFNMLVSAFCRNEDFDGASQVLREMV 475

BLAST of Moc02g10830 vs. TAIR 10
Match: AT4G26680.2 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 165.2 bits (417), Expect = 1.3e-40
Identity = 109/457 (23.85%), Postives = 204/457 (44.64%), Query Frame = 0

Query: 30  SPSAKPSISTVVSVLTHHRSKSRWRFLNSLCPDGFDPGEFSDIVLHIKNNPHLSLRFFLW 89
           +P  K      V+V   H  +S W  LN L  D  D     +++L I+ +  LSL FF W
Sbjct: 46  NPEPKGQDLDFVNVAHSHLIQSDWDKLNKL-SDHLDSFRVKNVLLKIQKDYLLSLEFFNW 105

Query: 90  TQNKSLCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAELEDDDGCSNCKQFSRPLR 149
            + ++  +H+L +++ V+H L + R    A+++++  +    ++             P +
Sbjct: 106 AKTRNPGSHSLETHAIVLHTLTKNRKFKSAESILRDVLVNGGVD------------LPAK 165

Query: 150 LFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILW 209
           +F  L+ +Y+ C S P VFD L K     +K   A      ++  G  P V + N+   +
Sbjct: 166 VFDALLYSYRECDSTPRVFDSLFKTFAHLKKFRNATDTFMQMKDYGFLPTVESCNA---Y 225

Query: 210 VSKCEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKE 269
           +S   G             +D  ++  R   + + SPN ++ N +M  + + G + +  E
Sbjct: 226 MSSLLGQGR----------VDIALRFYREMRRCKISPNPYTLNMVMSGYCRSGKLDKGIE 285

Query: 270 IWDQLTESNSIPNSYSYSILMTVFCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFC 329
           +   +          SY+ L+   C++  +  A +L   M    L+ + V +NT+I GFC
Sbjct: 286 LLQDMERLGFRATDVSYNTLIAGHCEKGLLSSALKLKNMMGKSGLQPNVVTFNTLIHGFC 345

Query: 330 KAGSIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNAS 389
           +A  +Q A ++F EM+   +     T+  LINGY + GD + A   Y+DM       +  
Sbjct: 346 RAMKLQEASKVFGEMKAVNVAPNTVTYNTLINGYSQQGDHEMAFRFYEDMVCNGIQRDIL 405

Query: 390 TLEAIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQ 449
           T  A++ GL  + +  +A   F    +  N  P   T+  LI G C     +  F+L   
Sbjct: 406 TYNALIFGLCKQAKTRKAAQ-FVKELDKENLVPNSSTFSALIMGQCVRKNADRGFELYKS 465

Query: 450 MVGKGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELL 487
           M+  G  P+ + +   + A+    + +   ++ +E++
Sbjct: 466 MIRSGCHPNEQTFNMLVSAFCRNEDFDGASQVLREMV 475

BLAST of Moc02g10830 vs. TAIR 10
Match: AT1G62670.1 (rna processing factor 2 )

HSP 1 Score: 158.7 bits (400), Expect = 1.2e-38
Identity = 107/395 (27.09%), Postives = 196/395 (49.62%), Query Frame = 0

Query: 99  NLVSYSTVIHILARGRLRTHAKAVIQTAIRAA---ELEDDDGCSN--CKQFSRPLRLFQT 158
           N V+++T+IH L      + A A+I   +      +L       N  CK+    L  F  
Sbjct: 185 NTVTFNTLIHGLFLHNKASEAMALIDRMVAKGCQPDLVTYGVVVNGLCKRGDTDL-AFNL 244

Query: 159 LVKTYK-RCGSAPFVFDLLIKALLDSRKLEPAIQIIRMLRSRGISPQVSTLNSLILWVSK 218
           L K  + +      +++ +I  L   + ++ A+ + + + ++GI P V T +SLI     
Sbjct: 245 LNKMEQGKLEPGVLIYNTIIDGLCKYKHMDDALNLFKEMETKGIRPNVVTYSSLI----- 304

Query: 219 CEGANAGYAIFREVFGLDCEVKEERVKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWD 278
                  Y  + +   L  ++ E ++      +P+V +F+ L+  F ++G +   ++++D
Sbjct: 305 --SCLCNYGRWSDASRLLSDMIERKI------NPDVFTFSALIDAFVKEGKLVEAEKLYD 364

Query: 279 QLTESNSIPNSYSYSILMTVFCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAG 338
           ++ + +  P+  +YS L+  FC   R+ EA+++++ M  K    D V YNT+I GFCK  
Sbjct: 365 EMVKRSIDPSIVTYSSLINGFCMHDRLDEAKQMFEFMVSKHCFPDVVTYNTLIKGFCKYK 424

Query: 339 SIQRAEELFREMELSGIESTFSTFEHLINGYCETGDIDSALLVYKDMRRKNFSLNASTLE 398
            ++   E+FREM   G+     T+  LI G  + GD D A  ++K+M       N  T  
Sbjct: 425 RVEEGMEVFREMSQRGLVGNTVTYNILIQGLFQAGDCDMAQEIFKEMVSDGVPPNIMTYN 484

Query: 399 AIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLINGLCREGEIEAAFKLQAQMVG 458
            ++ GL    +L +A+ VF +  + S   PT+ TY ++I G+C+ G++E  + L   +  
Sbjct: 485 TLLDGLCKNGKLEKAMVVFEY-LQRSKMEPTIYTYNIMIEGMCKAGKVEDGWDLFCNLSL 544

Query: 459 KGFKPDSKVYRSFIDAYTNEGNEEMVEKLRKELLE 488
           KG KPD   Y + I  +  +G++E  + L KE+ E
Sbjct: 545 KGVKPDVVAYNTMISGFCRKGSKEEADALFKEMKE 564

BLAST of Moc02g10830 vs. TAIR 10
Match: AT1G05670.1 (Pentatricopeptide repeat (PPR-like) superfamily protein )

HSP 1 Score: 158.3 bits (399), Expect = 1.6e-38
Identity = 126/474 (26.58%), Postives = 206/474 (43.46%), Query Frame = 0

Query: 72  IVLHIKNNPHLSLRFFLWTQNKSLCTHNLVSYSTVIHILARGRLRTHAKAVIQTAIRAAE 131
           +++ IK +  L L FF W +++     NL S   VIH+    +    A+++I +     +
Sbjct: 93  VLMKIKCDYRLVLDFFDWARSRR--DSNLESLCIVIHLAVASKDLKVAQSLISSFWERPK 152

Query: 132 LEDDDGCSNCKQFSRPLRLFQTLVKTYKRCGSAPFVFDLLIKALLDSRKLEPAIQIIRML 191
           L   D           ++ F  LV TYK  GS P VFD+  + L+D   L  A ++   +
Sbjct: 153 LNVTDSF---------VQFFDLLVYTYKDWGSDPRVFDVFFQVLVDFGLLREARRVFEKM 212

Query: 192 RSRGISPQVSTLNSLILWVSK-CEGANAGYAIFRE---------------VFGLDCE--- 251
            + G+   V + N  +  +SK C        +FRE               V    C+   
Sbjct: 213 LNYGLVLSVDSCNVYLTRLSKDCYKTATAIIVFREFPEVGVCWNVASYNIVIHFVCQLGR 272

Query: 252 VKEER-----VKMKAQASPNVHSFNTLMMCFYQDGLVGRVKEIWDQLTESNSIPNSYSYS 311
           +KE       +++K   +P+V S++T++  + + G + +V ++ + +      PNSY Y 
Sbjct: 273 IKEAHHLLLLMELKGY-TPDVISYSTVVNGYCRFGELDKVWKLIEVMKRKGLKPNSYIYG 332

Query: 312 ILMTVFCDQRRMVEAEELWKEMRLKKLELDAVAYNTIIGGFCKAGSIQRAE--------- 371
            ++ + C   ++ EAEE + EM  + +  D V Y T+I GFCK G I+ A          
Sbjct: 333 SIIGLLCRICKLAEAEEAFSEMIRQGILPDTVVYTTLIDGFCKRGDIRAASKFFYEMHSR 392

Query: 372 --------------------------ELFREMELSGIESTFSTFEHLINGYCETGDIDSA 431
                                     +LF EM   G+E    TF  LINGYC+ G +  A
Sbjct: 393 DITPDVLTYTAIISGFCQIGDMVEAGKLFHEMFCKGLEPDSVTFTELINGYCKAGHMKDA 452

Query: 432 LLVYKDMRRKNFSLNASTLEAIVRGLVAETRLLEALDVFGFTTEDSNFCPTMETYELLIN 487
             V+  M +   S N  T   ++ GL  E  L  A ++           P + TY  ++N
Sbjct: 453 FRVHNHMIQAGCSPNVVTYTTLIDGLCKEGDLDSANELL-HEMWKIGLQPNIFTYNSIVN 512

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022148504.11.0e-281100.00pentatricopeptide repeat-containing protein At2g15980 [Momordica charantia][more]
KAG6571131.18.5e-23681.16Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
KAG7010942.11.1e-23581.16Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
XP_038901621.11.6e-23482.36pentatricopeptide repeat-containing protein At2g15980 [Benincasa hispida][more]
XP_022944388.11.6e-23481.16pentatricopeptide repeat-containing protein At2g15980 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
Q9XIM82.3e-12746.98Pentatricopeptide repeat-containing protein At2g15980 OS=Arabidopsis thaliana OX... [more]
Q9SZ101.8e-3923.85Pentatricopeptide repeat-containing protein At4g26680, mitochondrial OS=Arabidop... [more]
Q9SXD11.7e-3727.09Pentatricopeptide repeat-containing protein At1g62670, mitochondrial OS=Arabidop... [more]
Q0WVK72.2e-3726.58Pentatricopeptide repeat-containing protein At1g05670, mitochondrial OS=Arabidop... [more]
Q9SH262.9e-3728.66Pentatricopeptide repeat-containing protein At1g63400 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A6J1D4724.9e-282100.00pentatricopeptide repeat-containing protein At2g15980 OS=Momordica charantia OX=... [more]
A0A6J1FY057.7e-23581.16pentatricopeptide repeat-containing protein At2g15980 OS=Cucurbita moschata OX=3... [more]
A0A6J1JGQ11.0e-23481.36pentatricopeptide repeat-containing protein At2g15980 OS=Cucurbita maxima OX=366... [more]
A0A5D3CQ258.0e-22478.96Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A5A7TQU68.0e-22478.96Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
Match NameE-valueIdentityDescription
AT2G15980.11.6e-12846.98Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G26680.11.3e-4023.85Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G26680.21.3e-4023.85Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G62670.11.2e-3827.09rna processing factor 2 [more]
AT1G05670.11.6e-3826.58Pentatricopeptide repeat (PPR-like) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 235..396
e-value: 3.8E-38
score: 133.6
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 400..491
e-value: 7.0E-16
score: 60.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 72..227
e-value: 2.8E-13
score: 51.8
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 355..398
e-value: 4.2E-8
score: 33.3
coord: 246..294
e-value: 7.2E-9
score: 35.7
coord: 424..469
e-value: 1.0E-9
score: 38.5
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 314..345
e-value: 1.2E-11
score: 44.1
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 319..350
e-value: 5.7E-9
score: 33.6
coord: 355..387
e-value: 1.7E-7
score: 29.0
coord: 426..459
e-value: 9.2E-8
score: 29.8
coord: 284..317
e-value: 4.4E-7
score: 27.7
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 247..281
score: 8.736214
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 317..351
score: 13.011121
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 423..457
score: 11.761533
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 282..316
score: 10.500983
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 164..198
score: 9.054091
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 352..386
score: 10.731171
NoneNo IPR availablePANTHERPTHR47942TETRATRICOPEPTIDE REPEAT (TPR)-LIKE SUPERFAMILY PROTEIN-RELATEDcoord: 7..489
NoneNo IPR availablePANTHERPTHR47942:SF14PPR CONTAINING PLANT-LIKE PROTEINcoord: 7..489

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Moc02g10830.1Moc02g10830.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding