MS011056 (gene) Bitter gourd (TR) v1

Overview
NameMS011056
Typegene
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPentatricopeptide repeat-containing protein
Locationscaffold35: 3937994 .. 3939664 (+)
RNA-Seq ExpressionMS011056
SyntenyMS011056
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
AAGCAAATCCACGGCTTCTCCATCAGACATGGCGTCCCACCCCACAATCCAGACATGGGCAAGCACCTCATCTTCGCCCTCGTCTCCCTCTCGGCCCCCATGCCCTACGCGACCCGAATTTTCCGTCTGATTCGAGCCCCCAATATCTTCACGTGGAACACCATGATTAGAGGGTTTGCCGAGAGCGAGAATCCGAGGCCGGCCGTGGAGTTGTACTGCCAAATGCACGCGTCTTCGGTTCTGCCTGATACGCATACTTTCCCTTTTCTTTTGAAGGCTGCTGCTAAGTTAATGGATGCTAGAGTAGGCGAGGAGATTCACTCGATTGTTGTTAAAAATGGGTTCGGTTCGTTGCTTTTTGTTCAGAATTCGTTGGTCCATATGTACTCTGTTTTTGGGTTTGCCGAGAGTGCGTACCAGGTGTTTGAGTTTATGCTTGAGAGAGATCTCGTGGCCTGGAACTCTGTTATTAATGGCTTTGCTCTTAATGGAATGGCTAACGAAGCTCTGACCCTTTTTAGGGAAATGGGTTTGGATGGCGTGGAGCCTGATGGGTTCACCATGGTTAGTCTGTTATCTGCTTGTGTTGAGCTTGGGGCCATGGCCTTGGGGGAGAGGGTTCATGTGTATATGTTGAAGGTTGGTTTAGTACACAATCCACATGCTAGCAATGCCCTCCTTGATCTCTACTCCAAATGTGGGAACATTAGAGATGCACTGAAGGTGTTTGATGAAATGGAAGAGAGGAGTGTGGTTTCTTGGACTTCTCTGATTGTTGGGTTGGCTGTTAATGGATTAGGAAATGAAGCTCTTGAGCTGTTTGGGGAGTTGGAAAGGAAGGGGTTGAAGCCTAGTGAGATCACATTTGTTGGAGTTTTGTATGCTTGTAGCCATTGTGGGATGGTTGAGGAAGGCTTCGATTACTTTAGAAGGATGAAAGATGAATATGGCATCTTGCCAAGGATAGAGCACCATGGCTGTATTGTTGATTTGCTGTGCAGGGCCGGCAAGGTTGGAGATGCTTATGAGTATATCCGAAACATGTCGATCCCGCCAAATGCAGTCATTTGGCGGACCTTACTGGGAGCTTGCACAATCCATGGGCATTTAGAATTGGGTGAGGTTGCAAGAGCTGAAGTCCTACGCTTGGAACCGAAGCATAGCGGAGACTATGTCCTTCTCTCGAACCTTTATGCATCGGAGCGACGTTGGCTGGATGTGCAAAACGTAAGGAGGACGATGCTTATGAAAGGAGTGAGGAAAACTCCCGGGTATAGCCTCGTTGAGTTGAAAAACCGTGTTTATGAGTTTATCATGGGTGATAGATCTCATCCCCAAAGTGAGGAGACATACGCAATGCTGGGGAAGATCACAGAGTTGTTGAAAATCGAAGGCTATGTTCCTCGCACGGTTAATGTTCTTGCTGATATAGAAGAGGAAGAAAAGGAGACGGCTCTGTCTCATCACACGGAGAAAGTTGCAATTGCTTTTATGTTGGTTAACACCCCACCAAGAACTCCAATTAGAATCATGAAGAATTTGAGAGTCTGTGCAGATTGTCATCTGGCGATCAAACTCGTATCCAAGGTTTTCGAACGTGAGATCATCGTAAGGGATCGTAGTAGGTTTCATCATTTTAAAGACGGTTCGTGCTCTTGTAGAGATTATTGG

mRNA sequence

AAGCAAATCCACGGCTTCTCCATCAGACATGGCGTCCCACCCCACAATCCAGACATGGGCAAGCACCTCATCTTCGCCCTCGTCTCCCTCTCGGCCCCCATGCCCTACGCGACCCGAATTTTCCGTCTGATTCGAGCCCCCAATATCTTCACGTGGAACACCATGATTAGAGGGTTTGCCGAGAGCGAGAATCCGAGGCCGGCCGTGGAGTTGTACTGCCAAATGCACGCGTCTTCGGTTCTGCCTGATACGCATACTTTCCCTTTTCTTTTGAAGGCTGCTGCTAAGTTAATGGATGCTAGAGTAGGCGAGGAGATTCACTCGATTGTTGTTAAAAATGGGTTCGGTTCGTTGCTTTTTGTTCAGAATTCGTTGGTCCATATGTACTCTGTTTTTGGGTTTGCCGAGAGTGCGTACCAGGTGTTTGAGTTTATGCTTGAGAGAGATCTCGTGGCCTGGAACTCTGTTATTAATGGCTTTGCTCTTAATGGAATGGCTAACGAAGCTCTGACCCTTTTTAGGGAAATGGGTTTGGATGGCGTGGAGCCTGATGGGTTCACCATGGTTAGTCTGTTATCTGCTTGTGTTGAGCTTGGGGCCATGGCCTTGGGGGAGAGGGTTCATGTGTATATGTTGAAGGTTGGTTTAGTACACAATCCACATGCTAGCAATGCCCTCCTTGATCTCTACTCCAAATGTGGGAACATTAGAGATGCACTGAAGGTGTTTGATGAAATGGAAGAGAGGAGTGTGGTTTCTTGGACTTCTCTGATTGTTGGGTTGGCTGTTAATGGATTAGGAAATGAAGCTCTTGAGCTGTTTGGGGAGTTGGAAAGGAAGGGGTTGAAGCCTAGTGAGATCACATTTGTTGGAGTTTTGTATGCTTGTAGCCATTGTGGGATGGTTGAGGAAGGCTTCGATTACTTTAGAAGGATGAAAGATGAATATGGCATCTTGCCAAGGATAGAGCACCATGGCTGTATTGTTGATTTGCTGTGCAGGGCCGGCAAGGTTGGAGATGCTTATGAGTATATCCGAAACATGTCGATCCCGCCAAATGCAGTCATTTGGCGGACCTTACTGGGAGCTTGCACAATCCATGGGCATTTAGAATTGGGTGAGGTTGCAAGAGCTGAAGTCCTACGCTTGGAACCGAAGCATAGCGGAGACTATGTCCTTCTCTCGAACCTTTATGCATCGGAGCGACGTTGGCTGGATGTGCAAAACGTAAGGAGGACGATGCTTATGAAAGGAGTGAGGAAAACTCCCGGGTATAGCCTCGTTGAGTTGAAAAACCGTGTTTATGAGTTTATCATGGGTGATAGATCTCATCCCCAAAGTGAGGAGACATACGCAATGCTGGGGAAGATCACAGAGTTGTTGAAAATCGAAGGCTATGTTCCTCGCACGGTTAATGTTCTTGCTGATATAGAAGAGGAAGAAAAGGAGACGGCTCTGTCTCATCACACGGAGAAAGTTGCAATTGCTTTTATGTTGGTTAACACCCCACCAAGAACTCCAATTAGAATCATGAAGAATTTGAGAGTCTGTGCAGATTGTCATCTGGCGATCAAACTCGTATCCAAGGTTTTCGAACGTGAGATCATCGTAAGGGATCGTAGTAGGTTTCATCATTTTAAAGACGGTTCGTGCTCTTGTAGAGATTATTGG

Coding sequence (CDS)

AAGCAAATCCACGGCTTCTCCATCAGACATGGCGTCCCACCCCACAATCCAGACATGGGCAAGCACCTCATCTTCGCCCTCGTCTCCCTCTCGGCCCCCATGCCCTACGCGACCCGAATTTTCCGTCTGATTCGAGCCCCCAATATCTTCACGTGGAACACCATGATTAGAGGGTTTGCCGAGAGCGAGAATCCGAGGCCGGCCGTGGAGTTGTACTGCCAAATGCACGCGTCTTCGGTTCTGCCTGATACGCATACTTTCCCTTTTCTTTTGAAGGCTGCTGCTAAGTTAATGGATGCTAGAGTAGGCGAGGAGATTCACTCGATTGTTGTTAAAAATGGGTTCGGTTCGTTGCTTTTTGTTCAGAATTCGTTGGTCCATATGTACTCTGTTTTTGGGTTTGCCGAGAGTGCGTACCAGGTGTTTGAGTTTATGCTTGAGAGAGATCTCGTGGCCTGGAACTCTGTTATTAATGGCTTTGCTCTTAATGGAATGGCTAACGAAGCTCTGACCCTTTTTAGGGAAATGGGTTTGGATGGCGTGGAGCCTGATGGGTTCACCATGGTTAGTCTGTTATCTGCTTGTGTTGAGCTTGGGGCCATGGCCTTGGGGGAGAGGGTTCATGTGTATATGTTGAAGGTTGGTTTAGTACACAATCCACATGCTAGCAATGCCCTCCTTGATCTCTACTCCAAATGTGGGAACATTAGAGATGCACTGAAGGTGTTTGATGAAATGGAAGAGAGGAGTGTGGTTTCTTGGACTTCTCTGATTGTTGGGTTGGCTGTTAATGGATTAGGAAATGAAGCTCTTGAGCTGTTTGGGGAGTTGGAAAGGAAGGGGTTGAAGCCTAGTGAGATCACATTTGTTGGAGTTTTGTATGCTTGTAGCCATTGTGGGATGGTTGAGGAAGGCTTCGATTACTTTAGAAGGATGAAAGATGAATATGGCATCTTGCCAAGGATAGAGCACCATGGCTGTATTGTTGATTTGCTGTGCAGGGCCGGCAAGGTTGGAGATGCTTATGAGTATATCCGAAACATGTCGATCCCGCCAAATGCAGTCATTTGGCGGACCTTACTGGGAGCTTGCACAATCCATGGGCATTTAGAATTGGGTGAGGTTGCAAGAGCTGAAGTCCTACGCTTGGAACCGAAGCATAGCGGAGACTATGTCCTTCTCTCGAACCTTTATGCATCGGAGCGACGTTGGCTGGATGTGCAAAACGTAAGGAGGACGATGCTTATGAAAGGAGTGAGGAAAACTCCCGGGTATAGCCTCGTTGAGTTGAAAAACCGTGTTTATGAGTTTATCATGGGTGATAGATCTCATCCCCAAAGTGAGGAGACATACGCAATGCTGGGGAAGATCACAGAGTTGTTGAAAATCGAAGGCTATGTTCCTCGCACGGTTAATGTTCTTGCTGATATAGAAGAGGAAGAAAAGGAGACGGCTCTGTCTCATCACACGGAGAAAGTTGCAATTGCTTTTATGTTGGTTAACACCCCACCAAGAACTCCAATTAGAATCATGAAGAATTTGAGAGTCTGTGCAGATTGTCATCTGGCGATCAAACTCGTATCCAAGGTTTTCGAACGTGAGATCATCGTAAGGGATCGTAGTAGGTTTCATCATTTTAAAGACGGTTCGTGCTCTTGTAGAGATTATTGG

Protein sequence

KQIHGFSIRHGVPPHNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFAESENPRPAVELYCQMHASSVLPDTHTFPFLLKAAAKLMDARVGEEIHSIVVKNGFGSLLFVQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLDGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDALKVFDEMEERSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNMSIPPNAVIWRTLLGACTIHGHLELGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEEEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKLVSKVFEREIIVRDRSRFHHFKDGSCSCRDYW
Homology
BLAST of MS011056 vs. NCBI nr
Match: XP_022146486.1 (pentatricopeptide repeat-containing protein At4g21065 [Momordica charantia])

HSP 1 Score: 1129.4 bits (2920), Expect = 0.0e+00
Identity = 551/557 (98.92%), Postives = 554/557 (99.46%), Query Frame = 0

Query: 1   KQIHGFSIRHGVPPHNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFA 60
           KQIHGFSIRHGVPPHNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFA
Sbjct: 47  KQIHGFSIRHGVPPHNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFA 106

Query: 61  ESENPRPAVELYCQMHASSVLPDTHTFPFLLKAAAKLMDARVGEEIHSIVVKNGFGSLLF 120
           ESENPRPAVELYCQMHASSVLPDTHTFPFLLKAAAKLMD RVGEEIHSIVV+NGFGSLLF
Sbjct: 107 ESENPRPAVELYCQMHASSVLPDTHTFPFLLKAAAKLMDVRVGEEIHSIVVRNGFGSLLF 166

Query: 121 VQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLDG 180
           VQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLDG
Sbjct: 167 VQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLDG 226

Query: 181 VEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDAL 240
           VEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDAL
Sbjct: 227 VEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDAL 286

Query: 241 KVFDEMEERSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCG 300
           KVFDEMEERSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCG
Sbjct: 287 KVFDEMEERSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCG 346

Query: 301 MVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNMSIPPNAVIWRTL 360
           MVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRN+SIPPNAVIWRTL
Sbjct: 347 MVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNISIPPNAVIWRTL 406

Query: 361 LGACTIHGHLELGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVR 420
           LGACTIHGHLELGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVR
Sbjct: 407 LGACTIHGHLELGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVR 466

Query: 421 KTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEE 480
           KTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEE
Sbjct: 467 KTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEE 526

Query: 481 EKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKLVSKVFEREIIVRDR 540
           EKETALSHHTEKVAIAFMLVNTP RTPIRIMKNLRVCADCHLAIKL+SKVFEREIIVRDR
Sbjct: 527 EKETALSHHTEKVAIAFMLVNTPQRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRDR 586

Query: 541 SRFHHFKDGSCSCRDYW 558
           SRFHHFKD SCSCRDYW
Sbjct: 587 SRFHHFKDSSCSCRDYW 603

BLAST of MS011056 vs. NCBI nr
Match: XP_023002974.1 (pentatricopeptide repeat-containing protein At4g21065 [Cucurbita maxima])

HSP 1 Score: 1053.5 bits (2723), Expect = 6.3e-304
Identity = 511/558 (91.58%), Postives = 535/558 (95.88%), Query Frame = 0

Query: 1   KQIHGFSIRHGVPPHNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFA 60
           KQIH FSIRHGVPP NPD  KHLIF+LVS+SAPM YATRIF+ I+APNIFTWNTM+RGFA
Sbjct: 45  KQIHAFSIRHGVPPPNPDFNKHLIFSLVSISAPMSYATRIFQQIQAPNIFTWNTMVRGFA 104

Query: 61  ESENPRPAVELYCQMH-ASSVLPDTHTFPFLLKAAAKLMDARVGEEIHSIVVKNGFGSLL 120
           ESENPRPAVELY QMH ASS+ PDTHTFPFL KA AKLMDAR+GE IHSIVV+NGF SLL
Sbjct: 105 ESENPRPAVELYSQMHAASSIQPDTHTFPFLFKAVAKLMDARLGEGIHSIVVRNGFDSLL 164

Query: 121 FVQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLD 180
           FVQNSLVHMYSVFGFAESAY+VFEFM ERDLVAWNSVINGFALNGMANEALTLF+EMG  
Sbjct: 165 FVQNSLVHMYSVFGFAESAYKVFEFMSERDLVAWNSVINGFALNGMANEALTLFKEMGSV 224

Query: 181 GVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDA 240
           GV+PDGFTMVSLLSACVELGA+ALGERVHVYMLKVGLV NPHASNALLDLYSKCGNI DA
Sbjct: 225 GVKPDGFTMVSLLSACVELGALALGERVHVYMLKVGLVQNPHASNALLDLYSKCGNIIDA 284

Query: 241 LKVFDEMEERSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHC 300
           LKVFDEM ERSVVSWTSLIVGLAVNGLGNEAL+LFGELERKGLKPSEITFVGVLYACSHC
Sbjct: 285 LKVFDEMHERSVVSWTSLIVGLAVNGLGNEALKLFGELERKGLKPSEITFVGVLYACSHC 344

Query: 301 GMVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNMSIPPNAVIWRT 360
           GMV+EGFDYFRRMK+EYGILPRIEHHGCIVDLLCRAGKV DAYEYIRNMS+PPNAVIWRT
Sbjct: 345 GMVDEGFDYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNMSVPPNAVIWRT 404

Query: 361 LLGACTIHGHLELGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGV 420
           LLGACTIHGHLELGE+ARAE+L+LEPKH GD+VLLSNLYASERRWLDVQNVRRTMLMKGV
Sbjct: 405 LLGACTIHGHLELGEIARAEILQLEPKHCGDFVLLSNLYASERRWLDVQNVRRTMLMKGV 464

Query: 421 RKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEE 480
           +KTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITE LKIEGYVPRTVNVLADIEE
Sbjct: 465 KKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITESLKIEGYVPRTVNVLADIEE 524

Query: 481 EEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKLVSKVFEREIIVRD 540
           EEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKL+SKVFEREI+VRD
Sbjct: 525 EEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKLISKVFEREIVVRD 584

Query: 541 RSRFHHFKDGSCSCRDYW 558
           RSRFHHFKDG CSC+DYW
Sbjct: 585 RSRFHHFKDGLCSCKDYW 602

BLAST of MS011056 vs. NCBI nr
Match: XP_022926338.1 (pentatricopeptide repeat-containing protein At4g21065 [Cucurbita moschata])

HSP 1 Score: 1050.8 bits (2716), Expect = 4.1e-303
Identity = 510/558 (91.40%), Postives = 533/558 (95.52%), Query Frame = 0

Query: 1   KQIHGFSIRHGVPPHNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFA 60
           KQIH FSIRHGVPP NPD  KHLIF+LVS+SAPM YATRIF+ I+APNIFTWNTM+RGFA
Sbjct: 45  KQIHAFSIRHGVPPPNPDFNKHLIFSLVSISAPMTYATRIFQQIQAPNIFTWNTMVRGFA 104

Query: 61  ESENPRPAVELYCQMH-ASSVLPDTHTFPFLLKAAAKLMDARVGEEIHSIVVKNGFGSLL 120
           ESENPRPAVELY QMH ASS+ PDTHTFPFL KA AKLMDAR+GE IHSIVV+NGF SLL
Sbjct: 105 ESENPRPAVELYSQMHAASSIQPDTHTFPFLFKAVAKLMDARLGEGIHSIVVRNGFDSLL 164

Query: 121 FVQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLD 180
           FVQNSLVHMYSVFGFAESAY+VFEFM ERDLVAWNSVINGFALNGMANEALTLF+EMG  
Sbjct: 165 FVQNSLVHMYSVFGFAESAYKVFEFMSERDLVAWNSVINGFALNGMANEALTLFKEMGSV 224

Query: 181 GVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDA 240
           GVEPDGFTMVSLLSACVELGA+ALGERVHVYMLKVGLV NPHASNALLDLYSKCGNI  A
Sbjct: 225 GVEPDGFTMVSLLSACVELGALALGERVHVYMLKVGLVQNPHASNALLDLYSKCGNITHA 284

Query: 241 LKVFDEMEERSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHC 300
           LKVFDEM ERSVVSWTSLIVGLAVNGLGNEAL+LFGELERKGLKPSEITFVGVLYACSHC
Sbjct: 285 LKVFDEMHERSVVSWTSLIVGLAVNGLGNEALKLFGELERKGLKPSEITFVGVLYACSHC 344

Query: 301 GMVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNMSIPPNAVIWRT 360
           GMV+EGFDYFRRMK+EYGILPRIEHHGCIVDLLCRAGKV DAY+YIRNMS+PPNAVIWRT
Sbjct: 345 GMVDEGFDYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYQYIRNMSVPPNAVIWRT 404

Query: 361 LLGACTIHGHLELGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGV 420
           LLGACTIHGHLELGE+ARAE+L+LEPKH GDYVLLSNLYASERRWLDVQNVRRTMLMKGV
Sbjct: 405 LLGACTIHGHLELGEIARAEILQLEPKHCGDYVLLSNLYASERRWLDVQNVRRTMLMKGV 464

Query: 421 RKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEE 480
           +KTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITE LKIEGYVPRTVNVLADIEE
Sbjct: 465 KKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITESLKIEGYVPRTVNVLADIEE 524

Query: 481 EEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKLVSKVFEREIIVRD 540
           EEKETALSHHTEKVAIAFMLVNTPP TPIRIMKNLRVCADCHLAIKL+SKVFEREI+VRD
Sbjct: 525 EEKETALSHHTEKVAIAFMLVNTPPGTPIRIMKNLRVCADCHLAIKLISKVFEREIVVRD 584

Query: 541 RSRFHHFKDGSCSCRDYW 558
           RSRFHHFKDG CSC+DYW
Sbjct: 585 RSRFHHFKDGLCSCKDYW 602

BLAST of MS011056 vs. NCBI nr
Match: XP_023517386.1 (pentatricopeptide repeat-containing protein At4g21065 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1048.5 bits (2710), Expect = 2.0e-302
Identity = 511/558 (91.58%), Postives = 531/558 (95.16%), Query Frame = 0

Query: 1   KQIHGFSIRHGVPPHNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFA 60
           KQIH FSIRHGVPP NPD  KHLIF+LVS+SAPM YATRIF+ I+APNIFTWNTM+RGFA
Sbjct: 44  KQIHAFSIRHGVPPPNPDFNKHLIFSLVSISAPMFYATRIFQQIQAPNIFTWNTMVRGFA 103

Query: 61  ESENPRPAVELYCQMH-ASSVLPDTHTFPFLLKAAAKLMDARVGEEIHSIVVKNGFGSLL 120
           ESENPRPAVELY QMH ASS+ PDTHTFPFL KA AKLMDAR+GE IHSIVV+NGF SL 
Sbjct: 104 ESENPRPAVELYSQMHAASSIQPDTHTFPFLFKAVAKLMDARLGEGIHSIVVRNGFDSLR 163

Query: 121 FVQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLD 180
           FVQNSLVHMYSVFGFAESAY+VFEFM ERDLVAWNSVINGFALNGMANEALTLF+EMG  
Sbjct: 164 FVQNSLVHMYSVFGFAESAYKVFEFMSERDLVAWNSVINGFALNGMANEALTLFKEMGSV 223

Query: 181 GVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDA 240
           GVEPDGFTMVSLLSACVELGA+ALGERVHVYMLKVGLV NPHASNALLDLYSKCGNI  A
Sbjct: 224 GVEPDGFTMVSLLSACVELGALALGERVHVYMLKVGLVQNPHASNALLDLYSKCGNIAHA 283

Query: 241 LKVFDEMEERSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHC 300
           LKVFDEM ERSVVSWTSLIVGLAVNGLGNEAL LFGELERKGLKPSEITFVGVLYACSHC
Sbjct: 284 LKVFDEMHERSVVSWTSLIVGLAVNGLGNEALRLFGELERKGLKPSEITFVGVLYACSHC 343

Query: 301 GMVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNMSIPPNAVIWRT 360
           GMV+EGFDYFRRMK+EYGILPRIEHHGCIVDLLCRAGKV DAYEYIRNMSIPPNAVIWRT
Sbjct: 344 GMVDEGFDYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNMSIPPNAVIWRT 403

Query: 361 LLGACTIHGHLELGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGV 420
           LLGACTIHGHLELGE+ARAE+L LEPKH GD+VLLSNLYASERRWLDVQNVRRTMLMKGV
Sbjct: 404 LLGACTIHGHLELGEIARAEILLLEPKHCGDFVLLSNLYASERRWLDVQNVRRTMLMKGV 463

Query: 421 RKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEE 480
           +KTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITE LKIEGYVPRTVNVLADIEE
Sbjct: 464 KKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITESLKIEGYVPRTVNVLADIEE 523

Query: 481 EEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKLVSKVFEREIIVRD 540
           EEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKL+SKVFEREI+VRD
Sbjct: 524 EEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKLISKVFEREIVVRD 583

Query: 541 RSRFHHFKDGSCSCRDYW 558
           RSRFHHFKDG CSC+DYW
Sbjct: 584 RSRFHHFKDGLCSCKDYW 601

BLAST of MS011056 vs. NCBI nr
Match: XP_038882791.1 (pentatricopeptide repeat-containing protein At4g21065 [Benincasa hispida])

HSP 1 Score: 1046.2 bits (2704), Expect = 1.0e-301
Identity = 504/558 (90.32%), Postives = 533/558 (95.52%), Query Frame = 0

Query: 1   KQIHGFSIRHGVPPHNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFA 60
           KQIH FSIRHGVPP NPD  KHLIFALVSLSAPM YAT IF  I+APNIFTWNTMIRGFA
Sbjct: 52  KQIHAFSIRHGVPPPNPDFNKHLIFALVSLSAPMSYATLIFNQIQAPNIFTWNTMIRGFA 111

Query: 61  ESENPRPAVELYCQMH-ASSVLPDTHTFPFLLKAAAKLMDARVGEEIHSIVVKNGFGSLL 120
           ESENP PA+ELY QM  ASS+LPDTHTFPFL KA AKLMD R+GE IHS+VV+NGF SLL
Sbjct: 112 ESENPSPAIELYSQMRAASSILPDTHTFPFLFKAVAKLMDVRLGEGIHSVVVRNGFDSLL 171

Query: 121 FVQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLD 180
           FVQNSLVHMYSVFGFAESAYQVFEFM +RDLVAWNSVINGFALNGMANEALTLFREMG +
Sbjct: 172 FVQNSLVHMYSVFGFAESAYQVFEFMSDRDLVAWNSVINGFALNGMANEALTLFREMGFE 231

Query: 181 GVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDA 240
           GVEPDGFTMVSLLSACVELGA+ALGERVHVYMLKVGL+ N HASNALLDLYSKCGNIRDA
Sbjct: 232 GVEPDGFTMVSLLSACVELGALALGERVHVYMLKVGLIQNLHASNALLDLYSKCGNIRDA 291

Query: 241 LKVFDEMEERSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHC 300
           LK+FDEMEERSVVSWTSLIVGLAVNGLGN ALELFGELERKGLKPSEITFVGVLYACSHC
Sbjct: 292 LKMFDEMEERSVVSWTSLIVGLAVNGLGNRALELFGELERKGLKPSEITFVGVLYACSHC 351

Query: 301 GMVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNMSIPPNAVIWRT 360
           GMV+EGF+YFRRMK+EYGILPRIEHHGC+VDLLCRAGKVGDAY+YIRNMS+PPNAVIWRT
Sbjct: 352 GMVDEGFNYFRRMKEEYGILPRIEHHGCMVDLLCRAGKVGDAYDYIRNMSVPPNAVIWRT 411

Query: 361 LLGACTIHGHLELGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGV 420
           LLGACTIHGHLELGEVARAE+L LEPKH+GD+VLLSNLYASERRWLDVQNVRR MLMKGV
Sbjct: 412 LLGACTIHGHLELGEVARAEILLLEPKHTGDFVLLSNLYASERRWLDVQNVRRMMLMKGV 471

Query: 421 RKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEE 480
           +KTPGYSLVELKNRVY+FIMGDRSHPQSEETYAML KITELLKIEGYVPRTVNVLADIEE
Sbjct: 472 KKTPGYSLVELKNRVYKFIMGDRSHPQSEETYAMLTKITELLKIEGYVPRTVNVLADIEE 531

Query: 481 EEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKLVSKVFEREIIVRD 540
           EEKETALSHHTEKVAIAFMLVNTPP+TPIRIMKNLR+CADCHLAIK++SKVFEREI++RD
Sbjct: 532 EEKETALSHHTEKVAIAFMLVNTPPKTPIRIMKNLRICADCHLAIKIISKVFEREIVIRD 591

Query: 541 RSRFHHFKDGSCSCRDYW 558
           RSRFHHFKDGSCSC+DYW
Sbjct: 592 RSRFHHFKDGSCSCKDYW 609

BLAST of MS011056 vs. ExPASy Swiss-Prot
Match: A8MQA3 (Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H28 PE=2 SV=2)

HSP 1 Score: 760.4 bits (1962), Expect = 1.4e-218
Identity = 359/562 (63.88%), Postives = 453/562 (80.60%), Query Frame = 0

Query: 1   KQIHGFSIRHGVPPHNPDMGKHLIFALVSLSAPMP--YATRIFRLIRAP-NIFTWNTMIR 60
           +QIH FSIRHGV   + ++GKHLIF LVSL +P P  YA ++F  I  P N+F WNT+IR
Sbjct: 34  RQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSKIEKPINVFIWNTLIR 93

Query: 61  GFAESENPRPAVELYCQMHASSVL-PDTHTFPFLLKAAAKLMDARVGEEIHSIVVKNGFG 120
           G+AE  N   A  LY +M  S ++ PDTHT+PFL+KA   + D R+GE IHS+V+++GFG
Sbjct: 94  GYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVRLGETIHSVVIRSGFG 153

Query: 121 SLLFVQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREM 180
           SL++VQNSL+H+Y+  G   SAY+VF+ M E+DLVAWNSVINGFA NG   EAL L+ EM
Sbjct: 154 SLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFAENGKPEEALALYTEM 213

Query: 181 GLDGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNI 240
              G++PDGFT+VSLLSAC ++GA+ LG+RVHVYM+KVGL  N H+SN LLDLY++CG +
Sbjct: 214 NSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSSNVLLDLYARCGRV 273

Query: 241 RDALKVFDEMEERSVVSWTSLIVGLAVNGLGNEALELFGELE-RKGLKPSEITFVGVLYA 300
            +A  +FDEM +++ VSWTSLIVGLAVNG G EA+ELF  +E  +GL P EITFVG+LYA
Sbjct: 274 EEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGLLPCEITFVGILYA 333

Query: 301 CSHCGMVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNMSIPPNAV 360
           CSHCGMV+EGF+YFRRM++EY I PRIEH GC+VDLL RAG+V  AYEYI++M + PN V
Sbjct: 334 CSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKKAYEYIKSMPMQPNVV 393

Query: 361 IWRTLLGACTIHGHLELGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTML 420
           IWRTLLGACT+HG  +L E AR ++L+LEP HSGDYVLLSN+YASE+RW DVQ +R+ ML
Sbjct: 394 IWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYASEQRWSDVQKIRKQML 453

Query: 421 MKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLA 480
             GV+K PG+SLVE+ NRV+EF+MGD+SHPQS+  YA L ++T  L+ EGYVP+  NV  
Sbjct: 454 RDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGRLRSEGYVPQISNVYV 513

Query: 481 DIEEEEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKLVSKVFEREI 540
           D+EEEEKE A+ +H+EK+AIAFML++TP R+PI ++KNLRVCADCHLAIKLVSKV+ REI
Sbjct: 514 DVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCADCHLAIKLVSKVYNREI 573

Query: 541 IVRDRSRFHHFKDGSCSCRDYW 558
           +VRDRSRFHHFK+GSCSC+DYW
Sbjct: 574 VVRDRSRFHHFKNGSCSCQDYW 595

BLAST of MS011056 vs. ExPASy Swiss-Prot
Match: Q9CA54 (Pentatricopeptide repeat-containing protein At1g74630 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H71 PE=2 SV=1)

HSP 1 Score: 483.8 bits (1244), Expect = 2.6e-135
Identity = 250/621 (40.26%), Postives = 364/621 (58.62%), Query Frame = 0

Query: 2   QIHGFSIRHGVPPHNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFAE 61
           QIHG  I++GV   +   GK ++   +S+S  +PYA R+      P+ F +NT++RG++E
Sbjct: 23  QIHGLFIKYGVDTDSYFTGKLILHCAISISDALPYARRLLLCFPEPDAFMFNTLVRGYSE 82

Query: 62  SENPRPAVELYCQ-MHASSVLPDTHTFPFLLKAAAKLMDARVGEEIHSIVVKNGFGSLLF 121
           S+ P  +V ++ + M    V PD+ +F F++KA       R G ++H   +K+G  S LF
Sbjct: 83  SDEPHNSVAVFVEMMRKGFVFPDSFSFAFVIKAVENFRSLRTGFQMHCQALKHGLESHLF 142

Query: 122 VQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSVIN---------------------- 181
           V  +L+ MY   G  E A +VF+ M + +LVAWN+VI                       
Sbjct: 143 VGTTLIGMYGGCGCVEFARKVFDEMHQPNLVAWNAVITACFRGNDVAGAREIFDKMLVRN 202

Query: 182 ----------------------------------------GFALNGMANEALTLFREMGL 241
                                                   G A NG  NE+   FRE+  
Sbjct: 203 HTSWNVMLAGYIKAGELESAKRIFSEMPHRDDVSWSTMIVGIAHNGSFNESFLYFRELQR 262

Query: 242 DGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRD 301
            G+ P+  ++  +LSAC + G+   G+ +H ++ K G       +NAL+D+YS+CGN+  
Sbjct: 263 AGMSPNEVSLTGVLSACSQSGSFEFGKILHGFVEKAGYSWIVSVNNALIDMYSRCGNVPM 322

Query: 302 ALKVFDEMEE-RSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACS 361
           A  VF+ M+E R +VSWTS+I GLA++G G EA+ LF E+   G+ P  I+F+ +L+ACS
Sbjct: 323 ARLVFEGMQEKRCIVSWTSMIAGLAMHGQGEEAVRLFNEMTAYGVTPDGISFISLLHACS 382

Query: 362 HCGMVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNMSIPPNAVIW 421
           H G++EEG DYF  MK  Y I P IEH+GC+VDL  R+GK+  AY++I  M IPP A++W
Sbjct: 383 HAGLIEEGEDYFSEMKRVYHIEPEIEHYGCMVDLYGRSGKLQKAYDFICQMPIPPTAIVW 442

Query: 422 RTLLGACTIHGHLELGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMK 481
           RTLLGAC+ HG++EL E  +  +  L+P +SGD VLLSN YA+  +W DV ++R++M+++
Sbjct: 443 RTLLGACSSHGNIELAEQVKQRLNELDPNNSGDLVLLSNAYATAGKWKDVASIRKSMIVQ 502

Query: 482 GVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITELLKIE-GYVPRTVNVLAD 541
            ++KT  +SLVE+   +Y+F  G++      E +  L +I   LK E GY P   + L D
Sbjct: 503 RIKKTTAWSLVEVGKTMYKFTAGEKKKGIDIEAHEKLKEIILRLKDEAGYTPEVASALYD 562

Query: 542 IEEEEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKLVSKVFEREII 558
           +EEEEKE  +S H+EK+A+AF L        IRI+KNLR+C DCH  +KL SKV+  EI+
Sbjct: 563 VEEEEKEDQVSKHSEKLALAFALARLSKGANIRIVKNLRICRDCHAVMKLTSKVYGVEIL 622

BLAST of MS011056 vs. ExPASy Swiss-Prot
Match: Q9FI80 (Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H38 PE=2 SV=1)

HSP 1 Score: 476.9 bits (1226), Expect = 3.2e-133
Identity = 241/606 (39.77%), Postives = 371/606 (61.22%), Query Frame = 0

Query: 2   QIHGFSIRHGVPPHNPDMGKHLIFALVS--LSAPMPYATRIFRLIRAPNIFTWNTMIRGF 61
           QIH   I+ G         + L F   S      + YA +IF  +   N F+WNT+IRGF
Sbjct: 41  QIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDLDYAHKIFNQMPQRNCFSWNTIIRGF 100

Query: 62  AESENPRPAVEL---YCQMHASSVLPDTHTFPFLLKAAAKLMDARVGEEIHSIVVKNGFG 121
           +ES+  +  + +   Y  M    V P+  TFP +LKA AK    + G++IH + +K GFG
Sbjct: 101 SESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACAKTGKIQEGKQIHGLALKYGFG 160

Query: 122 SLLFVQNSLVHMYSVFGF------------------------------------------ 181
              FV ++LV MY + GF                                          
Sbjct: 161 GDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMVVMTDRRKRDGEIVLWNVMIDGYMR 220

Query: 182 ---AESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLDGVEPDGFTMVSL 241
               ++A  +F+ M +R +V+WN++I+G++LNG   +A+ +FREM    + P+  T+VS+
Sbjct: 221 LGDCKAARMLFDKMRQRSVVSWNTMISGYSLNGFFKDAVEVFREMKKGDIRPNYVTLVSV 280

Query: 242 LSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDALKVFDEMEERSV 301
           L A   LG++ LGE +H+Y    G+  +    +AL+D+YSKCG I  A+ VF+ +   +V
Sbjct: 281 LPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALIDMYSKCGIIEKAIHVFERLPRENV 340

Query: 302 VSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVEEGFDYFRR 361
           ++W+++I G A++G   +A++ F ++ + G++PS++ ++ +L ACSH G+VEEG  YF +
Sbjct: 341 ITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDVAYINLLTACSHGGLVEEGRRYFSQ 400

Query: 362 MKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNMSIPPNAVIWRTLLGACTIHGHLE 421
           M    G+ PRIEH+GC+VDLL R+G + +A E+I NM I P+ VIW+ LLGAC + G++E
Sbjct: 401 MVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMPIKPDDVIWKALLGACRMQGNVE 460

Query: 422 LGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELK 481
           +G+     ++ + P  SG YV LSN+YAS+  W +V  +R  M  K +RK PG SL+++ 
Sbjct: 461 MGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEMRLRMKEKDIRKDPGCSLIDID 520

Query: 482 NRVYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEEEKETALSHHTE 541
             ++EF++ D SHP+++E  +ML +I++ L++ GY P T  VL ++EEE+KE  L +H+E
Sbjct: 521 GVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGYRPITTQVLLNLEEEDKENVLHYHSE 580

Query: 542 KVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKLVSKVFEREIIVRDRSRFHHFKDGSC 558
           K+A AF L++T P  PIRI+KNLR+C DCH +IKL+SKV++R+I VRDR RFHHF+DGSC
Sbjct: 581 KIATAFGLISTSPGKPIRIVKNLRICEDCHSSIKLISKVYKRKITVRDRKRFHHFQDGSC 640

BLAST of MS011056 vs. ExPASy Swiss-Prot
Match: Q9LN01 (Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H12 PE=2 SV=1)

HSP 1 Score: 476.9 bits (1226), Expect = 3.2e-133
Identity = 227/523 (43.40%), Postives = 350/523 (66.92%), Query Frame = 0

Query: 37  ATRIFRLIRAPNIFTWNTMIRGFAESENPRPAVELYCQMHASSVLPDTHTFPFLLKAAAK 96
           A ++F  I   ++ +WN MI G+AE+ N + A+EL+  M  ++V PD  T   ++ A A+
Sbjct: 219 AQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSACAQ 278

Query: 97  LMDARVGEEIHSIVVKNGFGSLLFVQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSV 156
                +G ++H  +  +GFGS L + N+L+ +YS  G  E+A  +FE +  +D+++WN++
Sbjct: 279 SGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISWNTL 338

Query: 157 INGFALNGMANEALTLFREMGLDGVEPDGFTMVSLLSACVELGAMALGERVHVYMLK--V 216
           I G+    +  EAL LF+EM   G  P+  TM+S+L AC  LGA+ +G  +HVY+ K   
Sbjct: 339 IGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLK 398

Query: 217 GLVHNPHASNALLDLYSKCGNIRDALKVFDEMEERSVVSWTSLIVGLAVNGLGNEALELF 276
           G+ +      +L+D+Y+KCG+I  A +VF+ +  +S+ SW ++I G A++G  + + +LF
Sbjct: 399 GVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLF 458

Query: 277 GELERKGLKPSEITFVGVLYACSHCGMVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCR 336
             + + G++P +ITFVG+L ACSH GM++ G   FR M  +Y + P++EH+GC++DLL  
Sbjct: 459 SRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGH 518

Query: 337 AGKVGDAYEYIRNMSIPPNAVIWRTLLGACTIHGHLELGEVARAEVLRLEPKHSGDYVLL 396
           +G   +A E I  M + P+ VIW +LL AC +HG++ELGE     ++++EP++ G YVLL
Sbjct: 519 SGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLL 578

Query: 397 SNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAML 456
           SN+YAS  RW +V   R  +  KG++K PG S +E+ + V+EFI+GD+ HP++ E Y ML
Sbjct: 579 SNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGML 638

Query: 457 GKITELLKIEGYVPRTVNVLADIEEEEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNL 516
            ++  LL+  G+VP T  VL ++EEE KE AL HH+EK+AIAF L++T P T + I+KNL
Sbjct: 639 EEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLTIVKNL 698

Query: 517 RVCADCHLAIKLVSKVFEREIIVRDRSRFHHFKDGSCSCRDYW 558
           RVC +CH A KL+SK+++REII RDR+RFHHF+DG CSC DYW
Sbjct: 699 RVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of MS011056 vs. ExPASy Swiss-Prot
Match: Q9LW63 (Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H32 PE=3 SV=1)

HSP 1 Score: 471.9 bits (1213), Expect = 1.0e-131
Identity = 218/519 (42.00%), Postives = 337/519 (64.93%), Query Frame = 0

Query: 39  RIFRLIRAPNIFTWNTMIRGFAESENPRPAVELYCQMHASSVLPDTHTFPFLLKAAAKLM 98
           R+F ++   ++ ++NT+I G+A+S     A+ +  +M  + + PD+ T   +L   ++ +
Sbjct: 197 RVFEVMPRKDVVSYNTIIAGYAQSGMYEDALRMVREMGTTDLKPDSFTLSSVLPIFSEYV 256

Query: 99  DARVGEEIHSIVVKNGFGSLLFVQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSVIN 158
           D   G+EIH  V++ G  S +++ +SLV MY+     E + +VF  +  RD ++WNS++ 
Sbjct: 257 DVIKGKEIHGYVIRKGIDSDVYIGSSLVDMYAKSARIEDSERVFSRLYCRDGISWNSLVA 316

Query: 159 GFALNGMANEALTLFREMGLDGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLVH 218
           G+  NG  NEAL LFR+M    V+P      S++ AC  L  + LG+++H Y+L+ G   
Sbjct: 317 GYVQNGRYNEALRLFRQMVTAKVKPGAVAFSSVIPACAHLATLHLGKQLHGYVLRGGFGS 376

Query: 219 NPHASNALLDLYSKCGNIRDALKVFDEMEERSVVSWTSLIVGLAVNGLGNEALELFGELE 278
           N   ++AL+D+YSKCGNI+ A K+FD M     VSWT++I+G A++G G+EA+ LF E++
Sbjct: 377 NIFIASALVDMYSKCGNIKAARKIFDRMNVLDEVSWTAIIMGHALHGHGHEAVSLFEEMK 436

Query: 279 RKGLKPSEITFVGVLYACSHCGMVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCRAGKV 338
           R+G+KP+++ FV VL ACSH G+V+E + YF  M   YG+   +EH+  + DLL RAGK+
Sbjct: 437 RQGVKPNQVAFVAVLTACSHVGLVDEAWGYFNSMTKVYGLNQELEHYAAVADLLGRAGKL 496

Query: 339 GDAYEYIRNMSIPPNAVIWRTLLGACTIHGHLELGEVARAEVLRLEPKHSGDYVLLSNLY 398
            +AY +I  M + P   +W TLL +C++H +LEL E    ++  ++ ++ G YVL+ N+Y
Sbjct: 497 EEAYNFISKMCVEPTGSVWSTLLSSCSVHKNLELAEKVAEKIFTVDSENMGAYVLMCNMY 556

Query: 399 ASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKIT 458
           AS  RW ++  +R  M  KG+RK P  S +E+KN+ + F+ GDRSHP  ++    L  + 
Sbjct: 557 ASNGRWKEMAKLRLRMRKKGLRKKPACSWIEMKNKTHGFVSGDRSHPSMDKINEFLKAVM 616

Query: 459 ELLKIEGYVPRTVNVLADIEEEEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLRVCA 518
           E ++ EGYV  T  VL D++EE K   L  H+E++A+AF ++NT P T IR+ KN+R+C 
Sbjct: 617 EQMEKEGYVADTSGVLHDVDEEHKRELLFGHSERLAVAFGIINTEPGTTIRVTKNIRICT 676

Query: 519 DCHLAIKLVSKVFEREIIVRDRSRFHHFKDGSCSCRDYW 558
           DCH+AIK +SK+ EREIIVRD SRFHHF  G+CSC DYW
Sbjct: 677 DCHVAIKFISKITEREIIVRDNSRFHHFNRGNCSCGDYW 715

BLAST of MS011056 vs. ExPASy TrEMBL
Match: A0A6J1CZH8 (pentatricopeptide repeat-containing protein At4g21065 OS=Momordica charantia OX=3673 GN=LOC111015693 PE=3 SV=1)

HSP 1 Score: 1129.4 bits (2920), Expect = 0.0e+00
Identity = 551/557 (98.92%), Postives = 554/557 (99.46%), Query Frame = 0

Query: 1   KQIHGFSIRHGVPPHNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFA 60
           KQIHGFSIRHGVPPHNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFA
Sbjct: 47  KQIHGFSIRHGVPPHNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFA 106

Query: 61  ESENPRPAVELYCQMHASSVLPDTHTFPFLLKAAAKLMDARVGEEIHSIVVKNGFGSLLF 120
           ESENPRPAVELYCQMHASSVLPDTHTFPFLLKAAAKLMD RVGEEIHSIVV+NGFGSLLF
Sbjct: 107 ESENPRPAVELYCQMHASSVLPDTHTFPFLLKAAAKLMDVRVGEEIHSIVVRNGFGSLLF 166

Query: 121 VQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLDG 180
           VQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLDG
Sbjct: 167 VQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLDG 226

Query: 181 VEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDAL 240
           VEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDAL
Sbjct: 227 VEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDAL 286

Query: 241 KVFDEMEERSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCG 300
           KVFDEMEERSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCG
Sbjct: 287 KVFDEMEERSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCG 346

Query: 301 MVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNMSIPPNAVIWRTL 360
           MVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRN+SIPPNAVIWRTL
Sbjct: 347 MVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNISIPPNAVIWRTL 406

Query: 361 LGACTIHGHLELGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVR 420
           LGACTIHGHLELGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVR
Sbjct: 407 LGACTIHGHLELGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVR 466

Query: 421 KTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEE 480
           KTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEE
Sbjct: 467 KTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEE 526

Query: 481 EKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKLVSKVFEREIIVRDR 540
           EKETALSHHTEKVAIAFMLVNTP RTPIRIMKNLRVCADCHLAIKL+SKVFEREIIVRDR
Sbjct: 527 EKETALSHHTEKVAIAFMLVNTPQRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRDR 586

Query: 541 SRFHHFKDGSCSCRDYW 558
           SRFHHFKD SCSCRDYW
Sbjct: 587 SRFHHFKDSSCSCRDYW 603

BLAST of MS011056 vs. ExPASy TrEMBL
Match: A0A6J1KRZ9 (pentatricopeptide repeat-containing protein At4g21065 OS=Cucurbita maxima OX=3661 GN=LOC111496722 PE=3 SV=1)

HSP 1 Score: 1053.5 bits (2723), Expect = 3.1e-304
Identity = 511/558 (91.58%), Postives = 535/558 (95.88%), Query Frame = 0

Query: 1   KQIHGFSIRHGVPPHNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFA 60
           KQIH FSIRHGVPP NPD  KHLIF+LVS+SAPM YATRIF+ I+APNIFTWNTM+RGFA
Sbjct: 45  KQIHAFSIRHGVPPPNPDFNKHLIFSLVSISAPMSYATRIFQQIQAPNIFTWNTMVRGFA 104

Query: 61  ESENPRPAVELYCQMH-ASSVLPDTHTFPFLLKAAAKLMDARVGEEIHSIVVKNGFGSLL 120
           ESENPRPAVELY QMH ASS+ PDTHTFPFL KA AKLMDAR+GE IHSIVV+NGF SLL
Sbjct: 105 ESENPRPAVELYSQMHAASSIQPDTHTFPFLFKAVAKLMDARLGEGIHSIVVRNGFDSLL 164

Query: 121 FVQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLD 180
           FVQNSLVHMYSVFGFAESAY+VFEFM ERDLVAWNSVINGFALNGMANEALTLF+EMG  
Sbjct: 165 FVQNSLVHMYSVFGFAESAYKVFEFMSERDLVAWNSVINGFALNGMANEALTLFKEMGSV 224

Query: 181 GVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDA 240
           GV+PDGFTMVSLLSACVELGA+ALGERVHVYMLKVGLV NPHASNALLDLYSKCGNI DA
Sbjct: 225 GVKPDGFTMVSLLSACVELGALALGERVHVYMLKVGLVQNPHASNALLDLYSKCGNIIDA 284

Query: 241 LKVFDEMEERSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHC 300
           LKVFDEM ERSVVSWTSLIVGLAVNGLGNEAL+LFGELERKGLKPSEITFVGVLYACSHC
Sbjct: 285 LKVFDEMHERSVVSWTSLIVGLAVNGLGNEALKLFGELERKGLKPSEITFVGVLYACSHC 344

Query: 301 GMVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNMSIPPNAVIWRT 360
           GMV+EGFDYFRRMK+EYGILPRIEHHGCIVDLLCRAGKV DAYEYIRNMS+PPNAVIWRT
Sbjct: 345 GMVDEGFDYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYEYIRNMSVPPNAVIWRT 404

Query: 361 LLGACTIHGHLELGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGV 420
           LLGACTIHGHLELGE+ARAE+L+LEPKH GD+VLLSNLYASERRWLDVQNVRRTMLMKGV
Sbjct: 405 LLGACTIHGHLELGEIARAEILQLEPKHCGDFVLLSNLYASERRWLDVQNVRRTMLMKGV 464

Query: 421 RKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEE 480
           +KTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITE LKIEGYVPRTVNVLADIEE
Sbjct: 465 KKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITESLKIEGYVPRTVNVLADIEE 524

Query: 481 EEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKLVSKVFEREIIVRD 540
           EEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKL+SKVFEREI+VRD
Sbjct: 525 EEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKLISKVFEREIVVRD 584

Query: 541 RSRFHHFKDGSCSCRDYW 558
           RSRFHHFKDG CSC+DYW
Sbjct: 585 RSRFHHFKDGLCSCKDYW 602

BLAST of MS011056 vs. ExPASy TrEMBL
Match: A0A6J1EHS2 (pentatricopeptide repeat-containing protein At4g21065 OS=Cucurbita moschata OX=3662 GN=LOC111433518 PE=3 SV=1)

HSP 1 Score: 1050.8 bits (2716), Expect = 2.0e-303
Identity = 510/558 (91.40%), Postives = 533/558 (95.52%), Query Frame = 0

Query: 1   KQIHGFSIRHGVPPHNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFA 60
           KQIH FSIRHGVPP NPD  KHLIF+LVS+SAPM YATRIF+ I+APNIFTWNTM+RGFA
Sbjct: 45  KQIHAFSIRHGVPPPNPDFNKHLIFSLVSISAPMTYATRIFQQIQAPNIFTWNTMVRGFA 104

Query: 61  ESENPRPAVELYCQMH-ASSVLPDTHTFPFLLKAAAKLMDARVGEEIHSIVVKNGFGSLL 120
           ESENPRPAVELY QMH ASS+ PDTHTFPFL KA AKLMDAR+GE IHSIVV+NGF SLL
Sbjct: 105 ESENPRPAVELYSQMHAASSIQPDTHTFPFLFKAVAKLMDARLGEGIHSIVVRNGFDSLL 164

Query: 121 FVQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLD 180
           FVQNSLVHMYSVFGFAESAY+VFEFM ERDLVAWNSVINGFALNGMANEALTLF+EMG  
Sbjct: 165 FVQNSLVHMYSVFGFAESAYKVFEFMSERDLVAWNSVINGFALNGMANEALTLFKEMGSV 224

Query: 181 GVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDA 240
           GVEPDGFTMVSLLSACVELGA+ALGERVHVYMLKVGLV NPHASNALLDLYSKCGNI  A
Sbjct: 225 GVEPDGFTMVSLLSACVELGALALGERVHVYMLKVGLVQNPHASNALLDLYSKCGNITHA 284

Query: 241 LKVFDEMEERSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHC 300
           LKVFDEM ERSVVSWTSLIVGLAVNGLGNEAL+LFGELERKGLKPSEITFVGVLYACSHC
Sbjct: 285 LKVFDEMHERSVVSWTSLIVGLAVNGLGNEALKLFGELERKGLKPSEITFVGVLYACSHC 344

Query: 301 GMVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNMSIPPNAVIWRT 360
           GMV+EGFDYFRRMK+EYGILPRIEHHGCIVDLLCRAGKV DAY+YIRNMS+PPNAVIWRT
Sbjct: 345 GMVDEGFDYFRRMKEEYGILPRIEHHGCIVDLLCRAGKVRDAYQYIRNMSVPPNAVIWRT 404

Query: 361 LLGACTIHGHLELGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGV 420
           LLGACTIHGHLELGE+ARAE+L+LEPKH GDYVLLSNLYASERRWLDVQNVRRTMLMKGV
Sbjct: 405 LLGACTIHGHLELGEIARAEILQLEPKHCGDYVLLSNLYASERRWLDVQNVRRTMLMKGV 464

Query: 421 RKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEE 480
           +KTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITE LKIEGYVPRTVNVLADIEE
Sbjct: 465 KKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITESLKIEGYVPRTVNVLADIEE 524

Query: 481 EEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKLVSKVFEREIIVRD 540
           EEKETALSHHTEKVAIAFMLVNTPP TPIRIMKNLRVCADCHLAIKL+SKVFEREI+VRD
Sbjct: 525 EEKETALSHHTEKVAIAFMLVNTPPGTPIRIMKNLRVCADCHLAIKLISKVFEREIVVRD 584

Query: 541 RSRFHHFKDGSCSCRDYW 558
           RSRFHHFKDG CSC+DYW
Sbjct: 585 RSRFHHFKDGLCSCKDYW 602

BLAST of MS011056 vs. ExPASy TrEMBL
Match: A0A5A7UEB3 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G009420 PE=3 SV=1)

HSP 1 Score: 1029.6 bits (2661), Expect = 4.7e-297
Identity = 496/558 (88.89%), Postives = 526/558 (94.27%), Query Frame = 0

Query: 1   KQIHGFSIRHGVPPHNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFA 60
           KQ+H FSIRHGVPP NPD  KHLIFALVSLSAPM +A RIF  I+APNIFTWNTMIRGFA
Sbjct: 51  KQVHAFSIRHGVPPQNPDFNKHLIFALVSLSAPMSFAARIFNQIQAPNIFTWNTMIRGFA 110

Query: 61  ESENPRPAVELYCQMH-ASSVLPDTHTFPFLLKAAAKLMDARVGEEIHSIVVKNGFGSLL 120
           ESENP PAVEL+ QMH ASS+LPDTHTFPFL KA AKLMD R+GE IHS+VV+NGF SLL
Sbjct: 111 ESENPSPAVELFSQMHAASSILPDTHTFPFLFKAVAKLMDVRLGEAIHSVVVRNGFDSLL 170

Query: 121 FVQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLD 180
           FVQNSLVHMYSVFGFAESAYQVFE M +RDLVAWNSVINGFALNGM NEALTLFREMG +
Sbjct: 171 FVQNSLVHMYSVFGFAESAYQVFEIMSDRDLVAWNSVINGFALNGMPNEALTLFREMGSE 230

Query: 181 GVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDA 240
           GVEPDGFTMVSLLSACVEL A+ALGER H+YM+KVGLV N HASNALLDLYSKCGN +DA
Sbjct: 231 GVEPDGFTMVSLLSACVELRALALGERAHMYMVKVGLVRNQHASNALLDLYSKCGNFKDA 290

Query: 241 LKVFDEMEERSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHC 300
            KVFDEMEERSVVSWTSLIVG AVNGLGNEAL+LFGELER+GLKPSEITFVGVLYACSHC
Sbjct: 291 QKVFDEMEERSVVSWTSLIVGSAVNGLGNEALKLFGELERQGLKPSEITFVGVLYACSHC 350

Query: 301 GMVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNMSIPPNAVIWRT 360
           GM++EGFDYFRRMK+EYGILPRIEHHGC+VDLLCRAGKVGDAY YIRNM +PPNAVIWRT
Sbjct: 351 GMLDEGFDYFRRMKEEYGILPRIEHHGCMVDLLCRAGKVGDAYNYIRNMPVPPNAVIWRT 410

Query: 361 LLGACTIHGHLELGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGV 420
           LLGACTIHGHLELGEVARAE+ RLEP+HSGD+VLLSNLYASE RWLDVQN+R+TML+KGV
Sbjct: 411 LLGACTIHGHLELGEVARAEIQRLEPRHSGDFVLLSNLYASEGRWLDVQNLRKTMLVKGV 470

Query: 421 RKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEE 480
           +KTPGYSLVELKNRVYEFIMGDRSHPQSEETYAML KITELLKIEGYVPRTVNVLADIEE
Sbjct: 471 KKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKIEGYVPRTVNVLADIEE 530

Query: 481 EEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKLVSKVFEREIIVRD 540
           EEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKL+SKVFEREIIVRD
Sbjct: 531 EEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRD 590

Query: 541 RSRFHHFKDGSCSCRDYW 558
           RSRFHHFKDGSCSC+DYW
Sbjct: 591 RSRFHHFKDGSCSCKDYW 608

BLAST of MS011056 vs. ExPASy TrEMBL
Match: A0A1S3AZ16 (pentatricopeptide repeat-containing protein At4g21065 OS=Cucumis melo OX=3656 GN=LOC103484330 PE=3 SV=1)

HSP 1 Score: 1029.6 bits (2661), Expect = 4.7e-297
Identity = 496/558 (88.89%), Postives = 526/558 (94.27%), Query Frame = 0

Query: 1   KQIHGFSIRHGVPPHNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFA 60
           KQ+H FSIRHGVPP NPD  KHLIFALVSLSAPM +A RIF  I+APNIFTWNTMIRGFA
Sbjct: 51  KQVHAFSIRHGVPPQNPDFNKHLIFALVSLSAPMSFAARIFNQIQAPNIFTWNTMIRGFA 110

Query: 61  ESENPRPAVELYCQMH-ASSVLPDTHTFPFLLKAAAKLMDARVGEEIHSIVVKNGFGSLL 120
           ESENP PAVEL+ QMH ASS+LPDTHTFPFL KA AKLMD R+GE IHS+VV+NGF SLL
Sbjct: 111 ESENPSPAVELFSQMHAASSILPDTHTFPFLFKAVAKLMDVRLGEAIHSVVVRNGFDSLL 170

Query: 121 FVQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLD 180
           FVQNSLVHMYSVFGFAESAYQVFE M +RDLVAWNSVINGFALNGM NEALTLFREMG +
Sbjct: 171 FVQNSLVHMYSVFGFAESAYQVFEIMSDRDLVAWNSVINGFALNGMPNEALTLFREMGSE 230

Query: 181 GVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDA 240
           GVEPDGFTMVSLLSACVEL A+ALGER H+YM+KVGLV N HASNALLDLYSKCGN +DA
Sbjct: 231 GVEPDGFTMVSLLSACVELRALALGERAHMYMVKVGLVRNQHASNALLDLYSKCGNFKDA 290

Query: 241 LKVFDEMEERSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHC 300
            KVFDEMEERSVVSWTSLIVG AVNGLGNEAL+LFGELER+GLKPSEITFVGVLYACSHC
Sbjct: 291 QKVFDEMEERSVVSWTSLIVGSAVNGLGNEALKLFGELERQGLKPSEITFVGVLYACSHC 350

Query: 301 GMVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNMSIPPNAVIWRT 360
           GM++EGFDYFRRMK+EYGILPRIEHHGC+VDLLCRAGKVGDAY YIRNM +PPNAVIWRT
Sbjct: 351 GMLDEGFDYFRRMKEEYGILPRIEHHGCMVDLLCRAGKVGDAYNYIRNMPVPPNAVIWRT 410

Query: 361 LLGACTIHGHLELGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGV 420
           LLGACTIHGHLELGEVARAE+ RLEP+HSGD+VLLSNLYASE RWLDVQN+R+TML+KGV
Sbjct: 411 LLGACTIHGHLELGEVARAEIQRLEPRHSGDFVLLSNLYASEGRWLDVQNLRKTMLVKGV 470

Query: 421 RKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEE 480
           +KTPGYSLVELKNRVYEFIMGDRSHPQSEETYAML KITELLKIEGYVPRTVNVLADIEE
Sbjct: 471 KKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLAKITELLKIEGYVPRTVNVLADIEE 530

Query: 481 EEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKLVSKVFEREIIVRD 540
           EEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKL+SKVFEREIIVRD
Sbjct: 531 EEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKLISKVFEREIIVRD 590

Query: 541 RSRFHHFKDGSCSCRDYW 558
           RSRFHHFKDGSCSC+DYW
Sbjct: 591 RSRFHHFKDGSCSCKDYW 608

BLAST of MS011056 vs. TAIR 10
Match: AT4G21065.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 760.4 bits (1962), Expect = 1.0e-219
Identity = 359/562 (63.88%), Postives = 453/562 (80.60%), Query Frame = 0

Query: 1   KQIHGFSIRHGVPPHNPDMGKHLIFALVSLSAPMP--YATRIFRLIRAP-NIFTWNTMIR 60
           +QIH FSIRHGV   + ++GKHLIF LVSL +P P  YA ++F  I  P N+F WNT+IR
Sbjct: 34  RQIHAFSIRHGVSISDAELGKHLIFYLVSLPSPPPMSYAHKVFSKIEKPINVFIWNTLIR 93

Query: 61  GFAESENPRPAVELYCQMHASSVL-PDTHTFPFLLKAAAKLMDARVGEEIHSIVVKNGFG 120
           G+AE  N   A  LY +M  S ++ PDTHT+PFL+KA   + D R+GE IHS+V+++GFG
Sbjct: 94  GYAEIGNSISAFSLYREMRVSGLVEPDTHTYPFLIKAVTTMADVRLGETIHSVVIRSGFG 153

Query: 121 SLLFVQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREM 180
           SL++VQNSL+H+Y+  G   SAY+VF+ M E+DLVAWNSVINGFA NG   EAL L+ EM
Sbjct: 154 SLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSVINGFAENGKPEEALALYTEM 213

Query: 181 GLDGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNI 240
              G++PDGFT+VSLLSAC ++GA+ LG+RVHVYM+KVGL  N H+SN LLDLY++CG +
Sbjct: 214 NSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGLTRNLHSSNVLLDLYARCGRV 273

Query: 241 RDALKVFDEMEERSVVSWTSLIVGLAVNGLGNEALELFGELE-RKGLKPSEITFVGVLYA 300
            +A  +FDEM +++ VSWTSLIVGLAVNG G EA+ELF  +E  +GL P EITFVG+LYA
Sbjct: 274 EEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKYMESTEGLLPCEITFVGILYA 333

Query: 301 CSHCGMVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNMSIPPNAV 360
           CSHCGMV+EGF+YFRRM++EY I PRIEH GC+VDLL RAG+V  AYEYI++M + PN V
Sbjct: 334 CSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARAGQVKKAYEYIKSMPMQPNVV 393

Query: 361 IWRTLLGACTIHGHLELGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTML 420
           IWRTLLGACT+HG  +L E AR ++L+LEP HSGDYVLLSN+YASE+RW DVQ +R+ ML
Sbjct: 394 IWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLSNMYASEQRWSDVQKIRKQML 453

Query: 421 MKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLA 480
             GV+K PG+SLVE+ NRV+EF+MGD+SHPQS+  YA L ++T  L+ EGYVP+  NV  
Sbjct: 454 RDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLKEMTGRLRSEGYVPQISNVYV 513

Query: 481 DIEEEEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKLVSKVFEREI 540
           D+EEEEKE A+ +H+EK+AIAFML++TP R+PI ++KNLRVCADCHLAIKLVSKV+ REI
Sbjct: 514 DVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLRVCADCHLAIKLVSKVYNREI 573

Query: 541 IVRDRSRFHHFKDGSCSCRDYW 558
           +VRDRSRFHHFK+GSCSC+DYW
Sbjct: 574 VVRDRSRFHHFKNGSCSCQDYW 595

BLAST of MS011056 vs. TAIR 10
Match: AT4G21065.2 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 661.8 bits (1706), Expect = 5.0e-190
Identity = 306/462 (66.23%), Postives = 385/462 (83.33%), Query Frame = 0

Query: 97  LMDARVGEEIHSIVVKNGFGSLLFVQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSV 156
           + D R+GE IHS+V+++GFGSL++VQNSL+H+Y+  G   SAY+VF+ M E+DLVAWNSV
Sbjct: 1   MADVRLGETIHSVVIRSGFGSLIYVQNSLLHLYANCGDVASAYKVFDKMPEKDLVAWNSV 60

Query: 157 INGFALNGMANEALTLFREMGLDGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGL 216
           INGFA NG   EAL L+ EM   G++PDGFT+VSLLSAC ++GA+ LG+RVHVYM+KVGL
Sbjct: 61  INGFAENGKPEEALALYTEMNSKGIKPDGFTIVSLLSACAKIGALTLGKRVHVYMIKVGL 120

Query: 217 VHNPHASNALLDLYSKCGNIRDALKVFDEMEERSVVSWTSLIVGLAVNGLGNEALELFGE 276
             N H+SN LLDLY++CG + +A  +FDEM +++ VSWTSLIVGLAVNG G EA+ELF  
Sbjct: 121 TRNLHSSNVLLDLYARCGRVEEAKTLFDEMVDKNSVSWTSLIVGLAVNGFGKEAIELFKY 180

Query: 277 LE-RKGLKPSEITFVGVLYACSHCGMVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCRA 336
           +E  +GL P EITFVG+LYACSHCGMV+EGF+YFRRM++EY I PRIEH GC+VDLL RA
Sbjct: 181 MESTEGLLPCEITFVGILYACSHCGMVKEGFEYFRRMREEYKIEPRIEHFGCMVDLLARA 240

Query: 337 GKVGDAYEYIRNMSIPPNAVIWRTLLGACTIHGHLELGEVARAEVLRLEPKHSGDYVLLS 396
           G+V  AYEYI++M + PN VIWRTLLGACT+HG  +L E AR ++L+LEP HSGDYVLLS
Sbjct: 241 GQVKKAYEYIKSMPMQPNVVIWRTLLGACTVHGDSDLAEFARIQILQLEPNHSGDYVLLS 300

Query: 397 NLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLG 456
           N+YASE+RW DVQ +R+ ML  GV+K PG+SLVE+ NRV+EF+MGD+SHPQS+  YA L 
Sbjct: 301 NMYASEQRWSDVQKIRKQMLRDGVKKVPGHSLVEVGNRVHEFLMGDKSHPQSDAIYAKLK 360

Query: 457 KITELLKIEGYVPRTVNVLADIEEEEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLR 516
           ++T  L+ EGYVP+  NV  D+EEEEKE A+ +H+EK+AIAFML++TP R+PI ++KNLR
Sbjct: 361 EMTGRLRSEGYVPQISNVYVDVEEEEKENAVVYHSEKIAIAFMLISTPERSPITVVKNLR 420

Query: 517 VCADCHLAIKLVSKVFEREIIVRDRSRFHHFKDGSCSCRDYW 558
           VCADCHLAIKLVSKV+ REI+VRDRSRFHHFK+GSCSC+DYW
Sbjct: 421 VCADCHLAIKLVSKVYNREIVVRDRSRFHHFKNGSCSCQDYW 462

BLAST of MS011056 vs. TAIR 10
Match: AT1G74630.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 483.8 bits (1244), Expect = 1.9e-136
Identity = 250/621 (40.26%), Postives = 364/621 (58.62%), Query Frame = 0

Query: 2   QIHGFSIRHGVPPHNPDMGKHLIFALVSLSAPMPYATRIFRLIRAPNIFTWNTMIRGFAE 61
           QIHG  I++GV   +   GK ++   +S+S  +PYA R+      P+ F +NT++RG++E
Sbjct: 23  QIHGLFIKYGVDTDSYFTGKLILHCAISISDALPYARRLLLCFPEPDAFMFNTLVRGYSE 82

Query: 62  SENPRPAVELYCQ-MHASSVLPDTHTFPFLLKAAAKLMDARVGEEIHSIVVKNGFGSLLF 121
           S+ P  +V ++ + M    V PD+ +F F++KA       R G ++H   +K+G  S LF
Sbjct: 83  SDEPHNSVAVFVEMMRKGFVFPDSFSFAFVIKAVENFRSLRTGFQMHCQALKHGLESHLF 142

Query: 122 VQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSVIN---------------------- 181
           V  +L+ MY   G  E A +VF+ M + +LVAWN+VI                       
Sbjct: 143 VGTTLIGMYGGCGCVEFARKVFDEMHQPNLVAWNAVITACFRGNDVAGAREIFDKMLVRN 202

Query: 182 ----------------------------------------GFALNGMANEALTLFREMGL 241
                                                   G A NG  NE+   FRE+  
Sbjct: 203 HTSWNVMLAGYIKAGELESAKRIFSEMPHRDDVSWSTMIVGIAHNGSFNESFLYFRELQR 262

Query: 242 DGVEPDGFTMVSLLSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRD 301
            G+ P+  ++  +LSAC + G+   G+ +H ++ K G       +NAL+D+YS+CGN+  
Sbjct: 263 AGMSPNEVSLTGVLSACSQSGSFEFGKILHGFVEKAGYSWIVSVNNALIDMYSRCGNVPM 322

Query: 302 ALKVFDEMEE-RSVVSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACS 361
           A  VF+ M+E R +VSWTS+I GLA++G G EA+ LF E+   G+ P  I+F+ +L+ACS
Sbjct: 323 ARLVFEGMQEKRCIVSWTSMIAGLAMHGQGEEAVRLFNEMTAYGVTPDGISFISLLHACS 382

Query: 362 HCGMVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNMSIPPNAVIW 421
           H G++EEG DYF  MK  Y I P IEH+GC+VDL  R+GK+  AY++I  M IPP A++W
Sbjct: 383 HAGLIEEGEDYFSEMKRVYHIEPEIEHYGCMVDLYGRSGKLQKAYDFICQMPIPPTAIVW 442

Query: 422 RTLLGACTIHGHLELGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMK 481
           RTLLGAC+ HG++EL E  +  +  L+P +SGD VLLSN YA+  +W DV ++R++M+++
Sbjct: 443 RTLLGACSSHGNIELAEQVKQRLNELDPNNSGDLVLLSNAYATAGKWKDVASIRKSMIVQ 502

Query: 482 GVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAMLGKITELLKIE-GYVPRTVNVLAD 541
            ++KT  +SLVE+   +Y+F  G++      E +  L +I   LK E GY P   + L D
Sbjct: 503 RIKKTTAWSLVEVGKTMYKFTAGEKKKGIDIEAHEKLKEIILRLKDEAGYTPEVASALYD 562

Query: 542 IEEEEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKLVSKVFEREII 558
           +EEEEKE  +S H+EK+A+AF L        IRI+KNLR+C DCH  +KL SKV+  EI+
Sbjct: 563 VEEEEKEDQVSKHSEKLALAFALARLSKGANIRIVKNLRICRDCHAVMKLTSKVYGVEIL 622

BLAST of MS011056 vs. TAIR 10
Match: AT1G08070.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 476.9 bits (1226), Expect = 2.3e-134
Identity = 227/523 (43.40%), Postives = 350/523 (66.92%), Query Frame = 0

Query: 37  ATRIFRLIRAPNIFTWNTMIRGFAESENPRPAVELYCQMHASSVLPDTHTFPFLLKAAAK 96
           A ++F  I   ++ +WN MI G+AE+ N + A+EL+  M  ++V PD  T   ++ A A+
Sbjct: 219 AQKLFDEIPVKDVVSWNAMISGYAETGNYKEALELFKDMMKTNVRPDESTMVTVVSACAQ 278

Query: 97  LMDARVGEEIHSIVVKNGFGSLLFVQNSLVHMYSVFGFAESAYQVFEFMLERDLVAWNSV 156
                +G ++H  +  +GFGS L + N+L+ +YS  G  E+A  +FE +  +D+++WN++
Sbjct: 279 SGSIELGRQVHLWIDDHGFGSNLKIVNALIDLYSKCGELETACGLFERLPYKDVISWNTL 338

Query: 157 INGFALNGMANEALTLFREMGLDGVEPDGFTMVSLLSACVELGAMALGERVHVYMLK--V 216
           I G+    +  EAL LF+EM   G  P+  TM+S+L AC  LGA+ +G  +HVY+ K   
Sbjct: 339 IGGYTHMNLYKEALLLFQEMLRSGETPNDVTMLSILPACAHLGAIDIGRWIHVYIDKRLK 398

Query: 217 GLVHNPHASNALLDLYSKCGNIRDALKVFDEMEERSVVSWTSLIVGLAVNGLGNEALELF 276
           G+ +      +L+D+Y+KCG+I  A +VF+ +  +S+ SW ++I G A++G  + + +LF
Sbjct: 399 GVTNASSLRTSLIDMYAKCGDIEAAHQVFNSILHKSLSSWNAMIFGFAMHGRADASFDLF 458

Query: 277 GELERKGLKPSEITFVGVLYACSHCGMVEEGFDYFRRMKDEYGILPRIEHHGCIVDLLCR 336
             + + G++P +ITFVG+L ACSH GM++ G   FR M  +Y + P++EH+GC++DLL  
Sbjct: 459 SRMRKIGIQPDDITFVGLLSACSHSGMLDLGRHIFRTMTQDYKMTPKLEHYGCMIDLLGH 518

Query: 337 AGKVGDAYEYIRNMSIPPNAVIWRTLLGACTIHGHLELGEVARAEVLRLEPKHSGDYVLL 396
           +G   +A E I  M + P+ VIW +LL AC +HG++ELGE     ++++EP++ G YVLL
Sbjct: 519 SGLFKEAEEMINMMEMEPDGVIWCSLLKACKMHGNVELGESFAENLIKIEPENPGSYVLL 578

Query: 397 SNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELKNRVYEFIMGDRSHPQSEETYAML 456
           SN+YAS  RW +V   R  +  KG++K PG S +E+ + V+EFI+GD+ HP++ E Y ML
Sbjct: 579 SNIYASAGRWNEVAKTRALLNDKGMKKVPGCSSIEIDSVVHEFIIGDKFHPRNREIYGML 638

Query: 457 GKITELLKIEGYVPRTVNVLADIEEEEKETALSHHTEKVAIAFMLVNTPPRTPIRIMKNL 516
            ++  LL+  G+VP T  VL ++EEE KE AL HH+EK+AIAF L++T P T + I+KNL
Sbjct: 639 EEMEVLLEKAGFVPDTSEVLQEMEEEWKEGALRHHSEKLAIAFGLISTKPGTKLTIVKNL 698

Query: 517 RVCADCHLAIKLVSKVFEREIIVRDRSRFHHFKDGSCSCRDYW 558
           RVC +CH A KL+SK+++REII RDR+RFHHF+DG CSC DYW
Sbjct: 699 RVCRNCHEATKLISKIYKREIIARDRTRFHHFRDGVCSCNDYW 741

BLAST of MS011056 vs. TAIR 10
Match: AT5G48910.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 476.9 bits (1226), Expect = 2.3e-134
Identity = 241/606 (39.77%), Postives = 371/606 (61.22%), Query Frame = 0

Query: 2   QIHGFSIRHGVPPHNPDMGKHLIFALVS--LSAPMPYATRIFRLIRAPNIFTWNTMIRGF 61
           QIH   I+ G         + L F   S      + YA +IF  +   N F+WNT+IRGF
Sbjct: 41  QIHAVFIKSGQMRDTLAAAEILRFCATSDLHHRDLDYAHKIFNQMPQRNCFSWNTIIRGF 100

Query: 62  AESENPRPAVEL---YCQMHASSVLPDTHTFPFLLKAAAKLMDARVGEEIHSIVVKNGFG 121
           +ES+  +  + +   Y  M    V P+  TFP +LKA AK    + G++IH + +K GFG
Sbjct: 101 SESDEDKALIAITLFYEMMSDEFVEPNRFTFPSVLKACAKTGKIQEGKQIHGLALKYGFG 160

Query: 122 SLLFVQNSLVHMYSVFGF------------------------------------------ 181
              FV ++LV MY + GF                                          
Sbjct: 161 GDEFVMSNLVRMYVMCGFMKDARVLFYKNIIEKDMVVMTDRRKRDGEIVLWNVMIDGYMR 220

Query: 182 ---AESAYQVFEFMLERDLVAWNSVINGFALNGMANEALTLFREMGLDGVEPDGFTMVSL 241
               ++A  +F+ M +R +V+WN++I+G++LNG   +A+ +FREM    + P+  T+VS+
Sbjct: 221 LGDCKAARMLFDKMRQRSVVSWNTMISGYSLNGFFKDAVEVFREMKKGDIRPNYVTLVSV 280

Query: 242 LSACVELGAMALGERVHVYMLKVGLVHNPHASNALLDLYSKCGNIRDALKVFDEMEERSV 301
           L A   LG++ LGE +H+Y    G+  +    +AL+D+YSKCG I  A+ VF+ +   +V
Sbjct: 281 LPAISRLGSLELGEWLHLYAEDSGIRIDDVLGSALIDMYSKCGIIEKAIHVFERLPRENV 340

Query: 302 VSWTSLIVGLAVNGLGNEALELFGELERKGLKPSEITFVGVLYACSHCGMVEEGFDYFRR 361
           ++W+++I G A++G   +A++ F ++ + G++PS++ ++ +L ACSH G+VEEG  YF +
Sbjct: 341 ITWSAMINGFAIHGQAGDAIDCFCKMRQAGVRPSDVAYINLLTACSHGGLVEEGRRYFSQ 400

Query: 362 MKDEYGILPRIEHHGCIVDLLCRAGKVGDAYEYIRNMSIPPNAVIWRTLLGACTIHGHLE 421
           M    G+ PRIEH+GC+VDLL R+G + +A E+I NM I P+ VIW+ LLGAC + G++E
Sbjct: 401 MVSVDGLEPRIEHYGCMVDLLGRSGLLDEAEEFILNMPIKPDDVIWKALLGACRMQGNVE 460

Query: 422 LGEVARAEVLRLEPKHSGDYVLLSNLYASERRWLDVQNVRRTMLMKGVRKTPGYSLVELK 481
           +G+     ++ + P  SG YV LSN+YAS+  W +V  +R  M  K +RK PG SL+++ 
Sbjct: 461 MGKRVANILMDMVPHDSGAYVALSNMYASQGNWSEVSEMRLRMKEKDIRKDPGCSLIDID 520

Query: 482 NRVYEFIMGDRSHPQSEETYAMLGKITELLKIEGYVPRTVNVLADIEEEEKETALSHHTE 541
             ++EF++ D SHP+++E  +ML +I++ L++ GY P T  VL ++EEE+KE  L +H+E
Sbjct: 521 GVLHEFVVEDDSHPKAKEINSMLVEISDKLRLAGYRPITTQVLLNLEEEDKENVLHYHSE 580

Query: 542 KVAIAFMLVNTPPRTPIRIMKNLRVCADCHLAIKLVSKVFEREIIVRDRSRFHHFKDGSC 558
           K+A AF L++T P  PIRI+KNLR+C DCH +IKL+SKV++R+I VRDR RFHHF+DGSC
Sbjct: 581 KIATAFGLISTSPGKPIRIVKNLRICEDCHSSIKLISKVYKRKITVRDRKRFHHFQDGSC 640

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022146486.10.0e+0098.92pentatricopeptide repeat-containing protein At4g21065 [Momordica charantia][more]
XP_023002974.16.3e-30491.58pentatricopeptide repeat-containing protein At4g21065 [Cucurbita maxima][more]
XP_022926338.14.1e-30391.40pentatricopeptide repeat-containing protein At4g21065 [Cucurbita moschata][more]
XP_023517386.12.0e-30291.58pentatricopeptide repeat-containing protein At4g21065 [Cucurbita pepo subsp. pep... [more]
XP_038882791.11.0e-30190.32pentatricopeptide repeat-containing protein At4g21065 [Benincasa hispida][more]
Match NameE-valueIdentityDescription
A8MQA31.4e-21863.88Pentatricopeptide repeat-containing protein At4g21065 OS=Arabidopsis thaliana OX... [more]
Q9CA542.6e-13540.26Pentatricopeptide repeat-containing protein At1g74630 OS=Arabidopsis thaliana OX... [more]
Q9FI803.2e-13339.77Pentatricopeptide repeat-containing protein At5g48910 OS=Arabidopsis thaliana OX... [more]
Q9LN013.2e-13343.40Pentatricopeptide repeat-containing protein At1g08070, chloroplastic OS=Arabidop... [more]
Q9LW631.0e-13142.00Putative pentatricopeptide repeat-containing protein At3g23330 OS=Arabidopsis th... [more]
Match NameE-valueIdentityDescription
A0A6J1CZH80.0e+0098.92pentatricopeptide repeat-containing protein At4g21065 OS=Momordica charantia OX=... [more]
A0A6J1KRZ93.1e-30491.58pentatricopeptide repeat-containing protein At4g21065 OS=Cucurbita maxima OX=366... [more]
A0A6J1EHS22.0e-30391.40pentatricopeptide repeat-containing protein At4g21065 OS=Cucurbita moschata OX=3... [more]
A0A5A7UEB34.7e-29788.89Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A1S3AZ164.7e-29788.89pentatricopeptide repeat-containing protein At4g21065 OS=Cucumis melo OX=3656 GN... [more]
Match NameE-valueIdentityDescription
AT4G21065.11.0e-21963.88Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT4G21065.25.0e-19066.23Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G74630.11.9e-13640.26Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G08070.12.3e-13443.40Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT5G48910.12.3e-13439.77Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 250..296
e-value: 1.5E-8
score: 34.7
coord: 149..195
e-value: 7.0E-10
score: 39.0
coord: 47..95
e-value: 5.4E-11
score: 42.5
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 50..84
e-value: 3.1E-6
score: 25.0
coord: 252..286
e-value: 2.4E-5
score: 22.2
coord: 151..184
e-value: 2.0E-7
score: 28.7
coord: 288..320
e-value: 2.1E-4
score: 19.3
coord: 224..252
e-value: 3.7E-6
score: 24.8
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 149..183
score: 12.079411
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 250..284
score: 11.279235
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 48..82
score: 11.070971
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 219..249
score: 10.183105
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 111..203
e-value: 2.7E-19
score: 71.2
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 204..290
e-value: 1.3E-20
score: 76.0
coord: 3..110
e-value: 1.4E-10
score: 43.0
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 291..444
e-value: 1.7E-17
score: 65.8
IPR032867DYW domainPFAMPF14432DYW_deaminasecoord: 423..547
e-value: 1.7E-36
score: 124.9
NoneNo IPR availablePANTHERPTHR47926PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 1..543
NoneNo IPR availablePANTHERPTHR47926:SF70PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEINcoord: 1..543

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
MS011056.1MS011056.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding