Sed0005021 (gene) Chayote v1

Overview
NameSed0005021
Typegene
OrganismSechium edule (Chayote v1)
DescriptionPentatricopeptide repeat-containing protein
LocationLG14: 21904488 .. 21907076 (+)
RNA-Seq ExpressionSed0005021
SyntenySed0005021
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTTCACCTGAAATTCCCGACTGAAATGCGCCTCTGTAATACTCGCCCTTCTTCTTCTTCTTCCAGCATCAATCATCTGTTCGCCAAGTCGATTTATGCCGTTCTTCATCGCCATAAATCTGAATGCGCCAACGCCAGGTTTCCTTCTTTCCTTCTTTATTCCGTCTTAAATCTTTCTTATACTTCGAATCTTTGCTTTCATTTCATATGCTCCATTAATAACTCTGCTGTTTCGTCCAATTTAGTTGCCATTGCTTCCTTTTCGATTTTAGGGTTTTGGGTTCTGTAGTTCCTTTTCGTTTCTTCTTCATTCTATCTTCTTTTTAGAGTTTCGAATTCTAATTTGCTGTAGAGTAGAATGTTAAGAATTGAAATGGTAGATAATAGATAATATTTCCCCCTTGATTGCCAATAATGGGCGAATTGTCCTCGTTACAGGAGGTGTCATTCCATGGCCACACTTATTAATTTCCTCCACGTTTCTTTGATATACAGATGTCGTAGGATAGGACAGTATGTCCGGTAACATTAGTCGAGATACGCACAAACTCACCTCATCGTGGAAATATAATTTTCGTAATTTTCCAGGTTCAACTTGATTTATATGTCTAATCTTAAAACATTTGAATGTCAAAGCAAATTGTAAAAGGTTCATTTATAAGTAGATGATACCATGAATTAAACTACCTGAAAGTTCTTCAGTACCTTATGCTATTCCATGTTGGTTAGAATTTCCTTTTTTTTCTCCTCTACTTTGACAACTGACAAGTCTAAATTTATTCACTCTTTCATATCTCATATTTGATTGTAGTATCTATCCTTGGACATTTCCAGCTTTCAGGTCAAGCCACATCAGAAAGATTCCTCTTTCTGGGATAGAACATTTAGGAACCTATGTTCAAGGGGGGAATTATCAGAGGCTGTTGCACTTCTGTGCTGTATGGCCTTACAATTTCACTCCAAAACCTACTGTCTTTTGTTACAAGAATGCATTTTCAGGAAAGAGTATATGAAAGGAAAAAGAATCCATGCCCAAATTGTTGTTGTTGGACATGTACCGAATGAATATCTGAAAACCAAACTACTGATATTATATGCCAAAGCAGGTGACTTAGAAACTGCATACATTCTTCATGAGAGATTGCTGGAAAAAAGTCTGGTTTCATGGAATGCAATGATTGCTGGATATGTACAAAATGGTTTTGGAGAAGTTGGATTGGAGCTTTATTTTAAGATGAGACAAAGTGGGCTGATGCCTGATCAGTATACATTTGCATCAGTTCTTAGATCCTGTGCTACTTTAGCTTCTTTGGAACATGGAAAAAGAGCACATGGAGTTCTGATTAAGTGTCAAATTGGTGACAATGTTGTTGTGTCTAGTGCCCTTGTTGATATGTACTTCAAATGCAGTAGCTTGTTGGATGGGCACAAAGCATTTAACAAATCTTCATCTAGGAATGTAATTACATGGACTGCTTTAATATCTGGGTATGGCCAGCATGGAAGAGTTTCTGAAGTTTTGGAATCGTTCAACAGTATGATCAACGAAGGTTACCGACCAAATTACGTTACTTTCCTTGCGGTTCTTGCTGCTTGTGGTCGTGGGGGTTTTGTCGCTGAAGCATGGCGATACTTTTCATTGATGACAACGAACTACGGAATTAAACCAAGAGGGCAACACTTCGCCGCCATGGCTGATCTTCTCGCGCGGGCAGGGAGGTTGCAAGAGGCATATAATATCGCTCTCGACGCACCTTGCAAGGAGCACCCAGTTACATGGGGTGCTTTGGTTGGGGCTTGTAAGGTTCATGAAGACATAAATTTGATGAAATGTGCCGCAGCAAATTACTTTGCCCTGGATTCTGAAAACTCTGGGAAGTATGTTGTTTTAGCAAATGCTTTTGCTGCATGTGGGATGTGGGACAATGTTTCGGAAATTAGAGATATGATGAAGAAATCAGGAATGAGTAAAGATCCTGGTTACAGCAGAATTGAGATACAAAGGGAGTTTCATTTCTTTGTGAAGAGTGATAAATCTCACGAAAAAACTGAGGAGATTTATAGAACCATAGACAGAATAACACCAATATTGAAGGATGTTGGTTATATTCCTGAATTAAATGAAAACTAATTGTGGGGTTAGTTTAGAATAACTTCAGTTTTGAAGGACGAGCTGATTATACATTGGTGCTAAATCTATACATTTGCACGGCAAGTTGAGATTTCTGATTGAAATTTAGAATACACATAACCTCATCATTAACTTTTAAGGTATTTTCGAGGTCTTAAATTCAGGTTAAGATCTTGGAACGGGCCCGAGTGCCTCATGTCTTCGAGGAACGGACTCATGAGTTGAGGTCTAGGACTACCAAAATATGTTCCTTCGGTACAGAGTCCCAAATATATGAGCGAAGCCAATTCCTGATCATCAAATTTGAAGGAATTGCTGCAGTTGCATCAGTTCAGTTCTGGGAACATTATGATTTCTTGAGATGAAGCTAGACGAAAAGCAACTTCAAAATGGGACAGATGGAAATCTTGAATTACCCCCCACATAATGAATGCTGCCAAGGTTTCAAGAGAACTGGGA

mRNA sequence

GTTCACCTGAAATTCCCGACTGAAATGCGCCTCTGTAATACTCGCCCTTCTTCTTCTTCTTCCAGCATCAATCATCTGTTCGCCAAGTCGATTTATGCCGTTCTTCATCGCCATAAATCTGAATGCGCCAACGCCAGCTTTCAGGTCAAGCCACATCAGAAAGATTCCTCTTTCTGGGATAGAACATTTAGGAACCTATGTTCAAGGGGGGAATTATCAGAGGCTGTTGCACTTCTGTGCTGTATGGCCTTACAATTTCACTCCAAAACCTACTGTCTTTTGTTACAAGAATGCATTTTCAGGAAAGAGTATATGAAAGGAAAAAGAATCCATGCCCAAATTGTTGTTGTTGGACATGTACCGAATGAATATCTGAAAACCAAACTACTGATATTATATGCCAAAGCAGGTGACTTAGAAACTGCATACATTCTTCATGAGAGATTGCTGGAAAAAAGTCTGGTTTCATGGAATGCAATGATTGCTGGATATGTACAAAATGGTTTTGGAGAAGTTGGATTGGAGCTTTATTTTAAGATGAGACAAAGTGGGCTGATGCCTGATCAGTATACATTTGCATCAGTTCTTAGATCCTGTGCTACTTTAGCTTCTTTGGAACATGGAAAAAGAGCACATGGAGTTCTGATTAAGTGTCAAATTGGTGACAATGTTGTTGTGTCTAGTGCCCTTGTTGATATGTACTTCAAATGCAGTAGCTTGTTGGATGGGCACAAAGCATTTAACAAATCTTCATCTAGGAATGTAATTACATGGACTGCTTTAATATCTGGGTATGGCCAGCATGGAAGAGTTTCTGAAGTTTTGGAATCGTTCAACAGTATGATCAACGAAGGTTACCGACCAAATTACGTTACTTTCCTTGCGGTTCTTGCTGCTTGTGGTCGTGGGGGTTTTGTCGCTGAAGCATGGCGATACTTTTCATTGATGACAACGAACTACGGAATTAAACCAAGAGGGCAACACTTCGCCGCCATGGCTGATCTTCTCGCGCGGGCAGGGAGGTTGCAAGAGGCATATAATATCGCTCTCGACGCACCTTGCAAGGAGCACCCAGTTACATGGGGTGCTTTGGTTGGGGCTTGTAAGGTTCATGAAGACATAAATTTGATGAAATGTGCCGCAGCAAATTACTTTGCCCTGGATTCTGAAAACTCTGGGAAGTATGTTGTTTTAGCAAATGCTTTTGCTGCATGTGGGATGTGGGACAATGTTTCGGAAATTAGAGATATGATGAAGAAATCAGGAATGAGTAAAGATCCTGGTTACAGCAGAATTGAGATACAAAGGGAGTTTCATTTCTTTGTGAAGAGTGATAAATCTCACGAAAAAACTGAGGAGATTTATAGAACCATAGACAGAATAACACCAATATTGAAGGATGTTGGTTATATTCCTGAATTAAATGAAAACTAATTGTGGGGTTAGTTTAGAATAACTTCAGTTTTGAAGGACGAGCTGATTATACATTGGTGCTAAATCTATACATTTGCACGGCAAGTTGAGATTTCTGATTGAAATTTAGAATACACATAACCTCATCATTAACTTTTAAGGTATTTTCGAGGTCTTAAATTCAGGTTAAGATCTTGGAACGGGCCCGAGTGCCTCATGTCTTCGAGGAACGGACTCATGAGTTGAGGTCTAGGACTACCAAAATATGTTCCTTCGGTACAGAGTCCCAAATATATGAGCGAAGCCAATTCCTGATCATCAAATTTGAAGGAATTGCTGCAGTTGCATCAGTTCAGTTCTGGGAACATTATGATTTCTTGAGATGAAGCTAGACGAAAAGCAACTTCAAAATGGGACAGATGGAAATCTTGAATTACCCCCCACATAATGAATGCTGCCAAGGTTTCAAGAGAACTGGGA

Coding sequence (CDS)

ATGCGCCTCTGTAATACTCGCCCTTCTTCTTCTTCTTCCAGCATCAATCATCTGTTCGCCAAGTCGATTTATGCCGTTCTTCATCGCCATAAATCTGAATGCGCCAACGCCAGCTTTCAGGTCAAGCCACATCAGAAAGATTCCTCTTTCTGGGATAGAACATTTAGGAACCTATGTTCAAGGGGGGAATTATCAGAGGCTGTTGCACTTCTGTGCTGTATGGCCTTACAATTTCACTCCAAAACCTACTGTCTTTTGTTACAAGAATGCATTTTCAGGAAAGAGTATATGAAAGGAAAAAGAATCCATGCCCAAATTGTTGTTGTTGGACATGTACCGAATGAATATCTGAAAACCAAACTACTGATATTATATGCCAAAGCAGGTGACTTAGAAACTGCATACATTCTTCATGAGAGATTGCTGGAAAAAAGTCTGGTTTCATGGAATGCAATGATTGCTGGATATGTACAAAATGGTTTTGGAGAAGTTGGATTGGAGCTTTATTTTAAGATGAGACAAAGTGGGCTGATGCCTGATCAGTATACATTTGCATCAGTTCTTAGATCCTGTGCTACTTTAGCTTCTTTGGAACATGGAAAAAGAGCACATGGAGTTCTGATTAAGTGTCAAATTGGTGACAATGTTGTTGTGTCTAGTGCCCTTGTTGATATGTACTTCAAATGCAGTAGCTTGTTGGATGGGCACAAAGCATTTAACAAATCTTCATCTAGGAATGTAATTACATGGACTGCTTTAATATCTGGGTATGGCCAGCATGGAAGAGTTTCTGAAGTTTTGGAATCGTTCAACAGTATGATCAACGAAGGTTACCGACCAAATTACGTTACTTTCCTTGCGGTTCTTGCTGCTTGTGGTCGTGGGGGTTTTGTCGCTGAAGCATGGCGATACTTTTCATTGATGACAACGAACTACGGAATTAAACCAAGAGGGCAACACTTCGCCGCCATGGCTGATCTTCTCGCGCGGGCAGGGAGGTTGCAAGAGGCATATAATATCGCTCTCGACGCACCTTGCAAGGAGCACCCAGTTACATGGGGTGCTTTGGTTGGGGCTTGTAAGGTTCATGAAGACATAAATTTGATGAAATGTGCCGCAGCAAATTACTTTGCCCTGGATTCTGAAAACTCTGGGAAGTATGTTGTTTTAGCAAATGCTTTTGCTGCATGTGGGATGTGGGACAATGTTTCGGAAATTAGAGATATGATGAAGAAATCAGGAATGAGTAAAGATCCTGGTTACAGCAGAATTGAGATACAAAGGGAGTTTCATTTCTTTGTGAAGAGTGATAAATCTCACGAAAAAACTGAGGAGATTTATAGAACCATAGACAGAATAACACCAATATTGAAGGATGTTGGTTATATTCCTGAATTAAATGAAAACTAA

Protein sequence

MRLCNTRPSSSSSSINHLFAKSIYAVLHRHKSECANASFQVKPHQKDSSFWDRTFRNLCSRGELSEAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQIVVVGHVPNEYLKTKLLILYAKAGDLETAYILHERLLEKSLVSWNAMIAGYVQNGFGEVGLELYFKMRQSGLMPDQYTFASVLRSCATLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSLLDGHKAFNKSSSRNVITWTALISGYGQHGRVSEVLESFNSMINEGYRPNYVTFLAVLAACGRGGFVAEAWRYFSLMTTNYGIKPRGQHFAAMADLLARAGRLQEAYNIALDAPCKEHPVTWGALVGACKVHEDINLMKCAAANYFALDSENSGKYVVLANAFAACGMWDNVSEIRDMMKKSGMSKDPGYSRIEIQREFHFFVKSDKSHEKTEEIYRTIDRITPILKDVGYIPELNEN
Homology
BLAST of Sed0005021 vs. NCBI nr
Match: XP_038881197.1 (pentatricopeptide repeat-containing protein At4g16470 [Benincasa hispida])

HSP 1 Score: 779.2 bits (2011), Expect = 1.9e-221
Identity = 389/464 (83.84%), Postives = 414/464 (89.22%), Query Frame = 0

Query: 11  SSSSINHLFAKSIYA----VLH---RHKSECANASFQVKPHQK-DSSFWDRTFRNLCSRG 70
           S S + HLF KSI A    ++H   RHKSE A  SFQVKPHQK D+SFWD+T R LCS G
Sbjct: 73  SFSGVIHLFTKSIVAAEIPIVHRRRRHKSESAYPSFQVKPHQKEDTSFWDKTLRGLCSTG 132

Query: 71  ELSEAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQIVVVGHVPNEYLKTKLL 130
            L EAVALLCCMALQFHSKTY LLLQECIFRKEYMKGKRIHAQ+VVVGHVPNEY+ TKLL
Sbjct: 133 RLPEAVALLCCMALQFHSKTYRLLLQECIFRKEYMKGKRIHAQMVVVGHVPNEYINTKLL 192

Query: 131 ILYAKAGDLETAYILHERLLEKSLVSWNAMIAGYVQNGFGEVGLELYFKMRQSGLMPDQY 190
           ILYAK+GDLETAYILHE LLEKSLVSWN++IAGYVQ G GEVGLE YFKMRQSGL+PDQY
Sbjct: 193 ILYAKSGDLETAYILHENLLEKSLVSWNSLIAGYVQKGLGEVGLEFYFKMRQSGLLPDQY 252

Query: 191 TFASVLRSCATLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSLLDGHKAFNKS 250
           TFASVLR+CA+LASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSL DGHKAFNKS
Sbjct: 253 TFASVLRACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSLSDGHKAFNKS 312

Query: 251 SSRNVITWTALISGYGQHGRVSEVLESFNSMINEGYRPNYVTFLAVLAACGRGGFVAEAW 310
           S+RNVITWTALISGYGQHGRV EVLESF+SMINEGYRPNYVTFLAVLAACG GGFV+EA 
Sbjct: 313 SNRNVITWTALISGYGQHGRVFEVLESFHSMINEGYRPNYVTFLAVLAACGHGGFVSEAL 372

Query: 311 RYFSLMTTNYGIKPRGQHFAAMADLLARAGRLQEAYNIALDAPCKEHPVTWGALVGACKV 370
           RYFSLMT  YGIKPRGQH+AAM DLLARAGRLQEAYN+ LDAPCKEH V WGALVG CKV
Sbjct: 373 RYFSLMTKMYGIKPRGQHYAAMVDLLARAGRLQEAYNLVLDAPCKEHSVMWGALVGGCKV 432

Query: 371 HEDINLMKCAAANYFALDSENSGKYVVLANAFAACGMWDNVSEIRDMMKKSGMSKDPGYS 430
           HEDI+LMK AAANYF LD ENSGK+VV +NAFA  G+WDNV EIR MMKKSGMSKDPGYS
Sbjct: 433 HEDIDLMKHAAANYFELDPENSGKFVVFSNAFATSGLWDNVEEIRAMMKKSGMSKDPGYS 492

Query: 431 RIEIQREFHFFVKSDKSHEKTEEIYRTIDRITPILKDVGYIPEL 467
           RIEIQREFHFFV SDKSH++TEEIYRTI+ ITPILKD GY PEL
Sbjct: 493 RIEIQREFHFFVMSDKSHQQTEEIYRTINSITPILKDAGYTPEL 536

BLAST of Sed0005021 vs. NCBI nr
Match: XP_022926503.1 (pentatricopeptide repeat-containing protein At4g16470 [Cucurbita moschata])

HSP 1 Score: 776.5 bits (2004), Expect = 1.3e-220
Identity = 386/474 (81.43%), Postives = 418/474 (88.19%), Query Frame = 0

Query: 1   MRLCNTRPSSSSSSINHLFAKSIYA-----VLHRHKSECANASFQVKPHQKDSSFWDRTF 60
           MRLC  RP  SSS + HLF KSI A     +  RHKSE AN  FQVKPHQKDSS WDRTF
Sbjct: 1   MRLCG-RP--SSSGVIHLFTKSIVAGATATIRRRHKSEYANDRFQVKPHQKDSSSWDRTF 60

Query: 61  RNLCSRGELSEAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQIVVVGHVPNE 120
           R+LC  G L+EAVALLCCM  QFHSKTYCLLLQECIFRKEYMKGKRIHAQ+VVVGH+PNE
Sbjct: 61  RSLCITGRLTEAVALLCCMPFQFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNE 120

Query: 121 YLKTKLLILYAKAGDLETAYILHERLLEKSLVSWNAMIAGYVQNGFGEVGLELYFKMRQS 180
           YLKTKLLILYAK GDLETA ILHE+LLE SLVSWNA+IAGYVQ GFGEVGLELYFKMR++
Sbjct: 121 YLKTKLLILYAKLGDLETANILHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRT 180

Query: 181 GLMPDQYTFASVLRSCATLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSLLDG 240
           GL+PDQYTFASV R+CA+LASLEHGKRAHGVLIKC+IGDNVVVSSALVDMYFKCSS+LDG
Sbjct: 181 GLIPDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSILDG 240

Query: 241 HKAFNKSSSRNVITWTALISGYGQHGRVSEVLESFNSMINEGYRPNYVTFLAVLAACGRG 300
           HK FNKS++RNVITWTALISGYG HGRVSEVLESFN MINEGYRPNYVTFLAVL ACG G
Sbjct: 241 HKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNRMINEGYRPNYVTFLAVLTACGHG 300

Query: 301 GFVAEAWRYFSLMTTNYGIKPRGQHFAAMADLLARAGRLQEAYNIALDAPCKEHPVTWGA 360
           GFV+EAWRY SLM T Y I+PRGQH+AAMADLLARAGRLQEAY+  +DAPCKEH V WGA
Sbjct: 301 GFVSEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYDFVVDAPCKEHAVIWGA 360

Query: 361 LVGACKVHEDINLMKCAAANYFALDSENSGKYVVLANAFAACGMWDNVSEIRDMMKKSGM 420
           LVG CKVHEDI+LMK AAANY ALD+ N+GKYVVLAN FAA G+WDNV+EIR MMKKSGM
Sbjct: 361 LVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGM 420

Query: 421 SKDPGYSRIEIQREFHFFVKSDKSHEKTEEIYRTIDRITPILKDVGYIPELNEN 470
           +K+PGYSRIEIQREFHFFVKSDKSH++ EEIYRTI  IT ILKD G I EL+EN
Sbjct: 421 NKEPGYSRIEIQREFHFFVKSDKSHKQAEEIYRTIHSITAILKDAGSIRELSEN 471

BLAST of Sed0005021 vs. NCBI nr
Match: KAG6594700.1 (Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 775.4 bits (2001), Expect = 2.8e-220
Identity = 386/474 (81.43%), Postives = 417/474 (87.97%), Query Frame = 0

Query: 1   MRLCNTRPSSSSSSINHLFAKSIYA-----VLHRHKSECANASFQVKPHQKDSSFWDRTF 60
           MRLC  RP  SSS + HLF KSI A     +  RHKSE AN   QVKPHQKDSS WDRTF
Sbjct: 1   MRLCG-RP--SSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTF 60

Query: 61  RNLCSRGELSEAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQIVVVGHVPNE 120
           R+LC  G LSEAVALLCCM  +FHSKTYCLLLQECIFRKEYMKGKRIHAQ+VVVGH+PNE
Sbjct: 61  RSLCITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNE 120

Query: 121 YLKTKLLILYAKAGDLETAYILHERLLEKSLVSWNAMIAGYVQNGFGEVGLELYFKMRQS 180
           YLKTKLLILYAK GDLETA +LHE+LLE SLVSWNA+IAGYVQ GFGEVGLELYFKMR++
Sbjct: 121 YLKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRT 180

Query: 181 GLMPDQYTFASVLRSCATLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSLLDG 240
           GL+PDQYTFASV R+CA+LASLEHGKRAHGVLIKC+IGDNVVVSSALVDMYFKCSS+ DG
Sbjct: 181 GLIPDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDG 240

Query: 241 HKAFNKSSSRNVITWTALISGYGQHGRVSEVLESFNSMINEGYRPNYVTFLAVLAACGRG 300
           HK FNKS++RNVITWTALISGYG HGRVSEVLESFNSMINEGYRPNYVTFLAVL ACG  
Sbjct: 241 HKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNSMINEGYRPNYVTFLAVLTACGHV 300

Query: 301 GFVAEAWRYFSLMTTNYGIKPRGQHFAAMADLLARAGRLQEAYNIALDAPCKEHPVTWGA 360
           GFV+EAWRY SLM T Y I+PRGQH+AAMADLLARAGRLQEAYN  +DAPCKEH V WGA
Sbjct: 301 GFVSEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGA 360

Query: 361 LVGACKVHEDINLMKCAAANYFALDSENSGKYVVLANAFAACGMWDNVSEIRDMMKKSGM 420
           LVG CKVHEDI+LMK AAANY ALD+ N+GKYVVLAN FAA G+WDNV+EIR MMKKSGM
Sbjct: 361 LVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGM 420

Query: 421 SKDPGYSRIEIQREFHFFVKSDKSHEKTEEIYRTIDRITPILKDVGYIPELNEN 470
           +K+PGYSRIEIQREFHFFVKSDKSHE+ EEIYRTI  ITPILKD G I EL+EN
Sbjct: 421 NKEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSEN 471

BLAST of Sed0005021 vs. NCBI nr
Match: XP_023518629.1 (pentatricopeptide repeat-containing protein At4g16470 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 774.6 bits (1999), Expect = 4.8e-220
Identity = 384/474 (81.01%), Postives = 416/474 (87.76%), Query Frame = 0

Query: 1   MRLCNTRPSSSSSSINHLFAKSIYA-----VLHRHKSECANASFQVKPHQKDSSFWDRTF 60
           MRLC  RP  SSS + HLF KSI A     +  RHKSE  N   QVKPHQKDSS WDRTF
Sbjct: 1   MRLCG-RP--SSSGVVHLFTKSIVAGATATIRRRHKSEYVNDRSQVKPHQKDSSSWDRTF 60

Query: 61  RNLCSRGELSEAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQIVVVGHVPNE 120
           R+LC  G LSEAVALLCCM  +FHSKTYCLLLQECIFRKEYMKGKRIHAQ+VVVGH+PNE
Sbjct: 61  RSLCITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNE 120

Query: 121 YLKTKLLILYAKAGDLETAYILHERLLEKSLVSWNAMIAGYVQNGFGEVGLELYFKMRQS 180
           YLKTKLLILYAK GDLETA ILHE+LLE SLVSWNA+IAGYVQ GFGEVGLE+YFKMR++
Sbjct: 121 YLKTKLLILYAKLGDLETANILHEKLLENSLVSWNALIAGYVQKGFGEVGLEIYFKMRRT 180

Query: 181 GLMPDQYTFASVLRSCATLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSLLDG 240
           GLMPDQYTFASV R+CA+LASLEHGKRAHGVLIKC+IGDNVVVSSALVDMYFKCSS+ DG
Sbjct: 181 GLMPDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDG 240

Query: 241 HKAFNKSSSRNVITWTALISGYGQHGRVSEVLESFNSMINEGYRPNYVTFLAVLAACGRG 300
            K FNKSS+RNVITWTALISGYG HGRVSEVLESFN+MINEGYRPNYVTFLAVL ACG G
Sbjct: 241 RKVFNKSSTRNVITWTALISGYGHHGRVSEVLESFNNMINEGYRPNYVTFLAVLTACGHG 300

Query: 301 GFVAEAWRYFSLMTTNYGIKPRGQHFAAMADLLARAGRLQEAYNIALDAPCKEHPVTWGA 360
           GFV+EAWRY SLM T Y I+PRGQH+AAMADLLARAGRLQEAY+  +DAPCKEH V WGA
Sbjct: 301 GFVSEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYDFVVDAPCKEHAVIWGA 360

Query: 361 LVGACKVHEDINLMKCAAANYFALDSENSGKYVVLANAFAACGMWDNVSEIRDMMKKSGM 420
           LVG CKVHEDI+LMK AAANY ALD+ N+GKYVVLAN FAA G+WDNV+EIR MMKKSGM
Sbjct: 361 LVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGM 420

Query: 421 SKDPGYSRIEIQREFHFFVKSDKSHEKTEEIYRTIDRITPILKDVGYIPELNEN 470
           +K+PGYSRIEIQREFHFFVKSDKSHE+ EEIYRTI  ITPI+KD G  PEL+EN
Sbjct: 421 NKEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPIIKDAGSFPELSEN 471

BLAST of Sed0005021 vs. NCBI nr
Match: KAG7026668.1 (Scarecrow-like protein 13, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 773.5 bits (1996), Expect = 1.1e-219
Identity = 385/474 (81.22%), Postives = 416/474 (87.76%), Query Frame = 0

Query: 1   MRLCNTRPSSSSSSINHLFAKSIYA-----VLHRHKSECANASFQVKPHQKDSSFWDRTF 60
           MRLC  RP  SSS + HLF KSI A     +  RHKSE AN   QVKPHQKDSS WDRTF
Sbjct: 506 MRLCG-RP--SSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTF 565

Query: 61  RNLCSRGELSEAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQIVVVGHVPNE 120
           R+LC  G LSEAVALLCCM  +FHSKTYCLLLQECIFRKEYMKGKRIHAQ+VVVGH+PNE
Sbjct: 566 RSLCITGRLSEAVALLCCMPFRFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNE 625

Query: 121 YLKTKLLILYAKAGDLETAYILHERLLEKSLVSWNAMIAGYVQNGFGEVGLELYFKMRQS 180
           YLKTKLLILYAK GDLETA +LHE+LLE SLVSWNA+IAGYVQ GFGEVGLELYFKMR++
Sbjct: 626 YLKTKLLILYAKLGDLETANVLHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRT 685

Query: 181 GLMPDQYTFASVLRSCATLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSLLDG 240
           GL+PDQYTFASV R+CA+LASLEHGKRAHGVLIKC+IGDNVVVSSALVDMYFKCSS+ DG
Sbjct: 686 GLIPDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSISDG 745

Query: 241 HKAFNKSSSRNVITWTALISGYGQHGRVSEVLESFNSMINEGYRPNYVTFLAVLAACGRG 300
           HK FNKS++RNVITWTALISGYG HGRVSEVLESFN MINEGYRPNYVTFLAVL ACG  
Sbjct: 746 HKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNRMINEGYRPNYVTFLAVLTACGHV 805

Query: 301 GFVAEAWRYFSLMTTNYGIKPRGQHFAAMADLLARAGRLQEAYNIALDAPCKEHPVTWGA 360
           GFV+EAWRY SLM T Y I+PRGQH+AAMADLLARAGRLQEAYN  +DAPCKEH V WGA
Sbjct: 806 GFVSEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYNFVVDAPCKEHAVIWGA 865

Query: 361 LVGACKVHEDINLMKCAAANYFALDSENSGKYVVLANAFAACGMWDNVSEIRDMMKKSGM 420
           LVG CKVHEDI+LMK AAANY ALD+ N+GKYVVLAN FAA G+WDNV+EIR MMKKSGM
Sbjct: 866 LVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGM 925

Query: 421 SKDPGYSRIEIQREFHFFVKSDKSHEKTEEIYRTIDRITPILKDVGYIPELNEN 470
           +K+PGYSRIEIQREFHFFVKSDKSHE+ EEIYRTI  ITPILKD G I EL+EN
Sbjct: 926 NKEPGYSRIEIQREFHFFVKSDKSHEQAEEIYRTIHSITPILKDAGSIHELSEN 976

BLAST of Sed0005021 vs. ExPASy Swiss-Prot
Match: O23491 (Pentatricopeptide repeat-containing protein At4g16470 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E12 PE=2 SV=2)

HSP 1 Score: 496.1 bits (1276), Expect = 4.3e-139
Identity = 242/457 (52.95%), Postives = 317/457 (69.37%), Query Frame = 0

Query: 9   SSSSSSINHLFAKSIYAVLHRHKSECANASFQVKPHQKDSSFWDRTFRNLCSRGELSEAV 68
           +S +S+   +F+ +   +L R  +E     FQV+ +Q+ +   D+T + LC  G L EAV
Sbjct: 38  ASQTSASGSMFSGNATTILRRMLAEKRIGRFQVE-NQRKTEKLDKTLKGLCVTGRLKEAV 97

Query: 69  ALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQIVVVGHVPNEYLKTKLLILYAKA 128
            LL    LQ   +TY +LLQEC  RKEY KGKRIHAQ+ VVG   NEYLK KLLILYA +
Sbjct: 98  GLLWSSGLQVEPETYAVLLQECKQRKEYTKGKRIHAQMFVVGFALNEYLKVKLLILYALS 157

Query: 129 GDLETAYILHERLLEKSLVSWNAMIAGYVQNGFGEVGLELYFKMRQSGLMPDQYTFASVL 188
           GDL+TA IL   L  + L+ WNAMI+GYVQ G  + GL +Y+ MRQ+ ++PDQYTFASV 
Sbjct: 158 GDLQTAGILFRSLKIRDLIPWNAMISGYVQKGLEQEGLFIYYDMRQNRIVPDQYTFASVF 217

Query: 189 RSCATLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSLLDGHKAFNKSSSRNVI 248
           R+C+ L  LEHGKRAH V+IK  I  N++V SALVDMYFKCSS  DGH+ F++ S+RNVI
Sbjct: 218 RACSALDRLEHGKRAHAVMIKRCIKSNIIVDSALVDMYFKCSSFSDGHRVFDQLSTRNVI 277

Query: 249 TWTALISGYGQHGRVSEVLESFNSMINEGYRPNYVTFLAVLAACGRGGFVAEAWRYFSLM 308
           TWT+LISGYG HG+VSEVL+ F  M  EG RPN VTFL VL AC  GG V + W +F  M
Sbjct: 278 TWTSLISGYGYHGKVSEVLKCFEKMKEEGCRPNPVTFLVVLTACNHGGLVDKGWEHFYSM 337

Query: 309 TTNYGIKPRGQHFAAMADLLARAGRLQEAYNIALDAPCKEHPVTWGALVGACKVHEDINL 368
             +YGI+P GQH+AAM D L RAGRLQEAY   + +PCKEHP  WG+L+GAC++H ++ L
Sbjct: 338 KRDYGIEPEGQHYAAMVDTLGRAGRLQEAYEFVMKSPCKEHPPVWGSLLGACRIHGNVKL 397

Query: 369 MKCAAANYFALDSENSGKYVVLANAFAACGMWDNVSEIRDMMKKSGMSKDPGYSRIEIQR 428
           ++ AA  +  LD  N G YVV AN +A+CG+ +  S++R  M+ +G+ KDPGYS+IE+Q 
Sbjct: 398 LELAATKFLELDPTNGGNYVVFANGYASCGLREAASKVRRKMENAGVKKDPGYSQIELQG 457

Query: 429 EFHFFVKSDKSHEKTEEIYRTIDRITPILKDVGYIPE 466
           E H F+K D SH  +E+IY+ +  +T    D+ Y P+
Sbjct: 458 EVHRFMKDDTSHRLSEKIYKKVHEMTSFFMDIDYYPD 493

BLAST of Sed0005021 vs. ExPASy Swiss-Prot
Match: Q9SI53 (Pentatricopeptide repeat-containing protein At2g03880, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PCMP-H44 PE=2 SV=1)

HSP 1 Score: 287.7 bits (735), Expect = 2.3e-76
Identity = 141/386 (36.53%), Postives = 230/386 (59.59%), Query Frame = 0

Query: 82  TYCLLLQECIFRKEYMKGKRIHAQIVVVGHVPNEYLKTKLLILYAKAGDLETAYILHERL 141
           TY  +L+ C    +    + +H  I+  G   + ++++ L+ ++AK G+ E A  + + +
Sbjct: 164 TYSSVLRSCNGMSDV---RMLHCGIIKEGLESDVFVRSALIDVFAKLGEPEDALSVFDEM 223

Query: 142 LEKSLVSWNAMIAGYVQNGFGEVGLELYFKMRQSGLMPDQYTFASVLRSCATLASLEHGK 201
           +    + WN++I G+ QN   +V LEL+ +M+++G + +Q T  SVLR+C  LA LE G 
Sbjct: 224 VTGDAIVWNSIIGGFAQNSRSDVALELFKRMKRAGFIAEQATLTSVLRACTGLALLELGM 283

Query: 202 RAHGVLIKCQIGDNVVVSSALVDMYFKCSSLLDGHKAFNKSSSRNVITWTALISGYGQHG 261
           +AH  ++K     ++++++ALVDMY KC SL D  + FN+   R+VITW+ +ISG  Q+G
Sbjct: 284 QAHVHIVK--YDQDLILNNALVDMYCKCGSLEDALRVFNQMKERDVITWSTMISGLAQNG 343

Query: 262 RVSEVLESFNSMINEGYRPNYVTFLAVLAACGRGGFVAEAWRYFSLMTTNYGIKPRGQHF 321
              E L+ F  M + G +PNY+T + VL AC   G + + W YF  M   YGI P  +H+
Sbjct: 344 YSQEALKLFERMKSSGTKPNYITIVGVLFACSHAGLLEDGWYYFRSMKKLYGIDPVREHY 403

Query: 322 AAMADLLARAGRLQEAYNIALDAPCKEHPVTWGALVGACKVHEDINLMKCAAANYFALDS 381
             M DLL +AG+L +A  +  +  C+   VTW  L+GAC+V  ++ L + AA    ALD 
Sbjct: 404 GCMIDLLGKAGKLDDAVKLLNEMECEPDAVTWRTLLGACRVQRNMVLAEYAAKKVIALDP 463

Query: 382 ENSGKYVVLANAFAACGMWDNVSEIRDMMKKSGMSKDPGYSRIEIQREFHFFVKSDKSHE 441
           E++G Y +L+N +A    WD+V EIR  M+  G+ K+PG S IE+ ++ H F+  D SH 
Sbjct: 464 EDAGTYTLLSNIYANSQKWDSVEEIRTRMRDRGIKKEPGCSWIEVNKQIHAFIIGDNSHP 523

Query: 442 KTEEIYRTIDRITPILKDVGYIPELN 468
           +  E+ + ++++   L  +GY+PE N
Sbjct: 524 QIVEVSKKLNQLIHRLTGIGYVPETN 544

BLAST of Sed0005021 vs. ExPASy Swiss-Prot
Match: Q9LTV8 (Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX=3702 GN=PCMP-H43 PE=2 SV=1)

HSP 1 Score: 283.5 bits (724), Expect = 4.4e-75
Identity = 138/370 (37.30%), Postives = 217/370 (58.65%), Query Frame = 0

Query: 94  KEYMKGKRIHAQIVVVGHVPNEYLKTKLLILYAKAGDLETAYILHERLLEKSLVSWNAMI 153
           ++  +G+ IHA +V +G      L   L  +YAK G + TA IL +++   +L+ WNAMI
Sbjct: 236 QDLKQGRSIHASVVKMGLEIEPDLLISLNTMYAKCGQVATAKILFDKMKSPNLILWNAMI 295

Query: 154 AGYVQNGFGEVGLELYFKMRQSGLMPDQYTFASVLRSCATLASLEHGKRAHGVLIKCQIG 213
           +GY +NG+    ++++ +M    + PD  +  S + +CA + SLE  +  +  + +    
Sbjct: 296 SGYAKNGYAREAIDMFHEMINKDVRPDTISITSAISACAQVGSLEQARSMYEYVGRSDYR 355

Query: 214 DNVVVSSALVDMYFKCSSLLDGHKAFNKSSSRNVITWTALISGYGQHGRVSEVLESFNSM 273
           D+V +SSAL+DM+ KC S+      F+++  R+V+ W+A+I GYG HGR  E +  + +M
Sbjct: 356 DDVFISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAM 415

Query: 274 INEGYRPNYVTFLAVLAACGRGGFVAEAWRYFSLMTTNYGIKPRGQHFAAMADLLARAGR 333
              G  PN VTFL +L AC   G V E W +F+ M  ++ I P+ QH+A + DLL RAG 
Sbjct: 416 ERGGVHPNDVTFLGLLMACNHSGMVREGWWFFNRM-ADHKINPQQQHYACVIDLLGRAGH 475

Query: 334 LQEAYNIALDAPCKEHPVTWGALVGACKVHEDINLMKCAAANYFALDSENSGKYVVLANA 393
           L +AY +    P +     WGAL+ ACK H  + L + AA   F++D  N+G YV L+N 
Sbjct: 476 LDQAYEVIKCMPVQPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYVQLSNL 535

Query: 394 FAACGMWDNVSEIRDMMKKSGMSKDPGYSRIEIQREFHFFVKSDKSHEKTEEIYRTIDRI 453
           +AA  +WD V+E+R  MK+ G++KD G S +E++     F   DKSH + EEI R ++ I
Sbjct: 536 YAAARLWDRVAEVRVRMKEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIERQVEWI 595

Query: 454 TPILKDVGYI 464
              LK+ G++
Sbjct: 596 ESRLKEGGFV 604

BLAST of Sed0005021 vs. ExPASy Swiss-Prot
Match: Q9SIT7 (Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX=3702 GN=PCMP-E76 PE=3 SV=1)

HSP 1 Score: 282.7 bits (722), Expect = 7.5e-75
Identity = 140/364 (38.46%), Postives = 219/364 (60.16%), Query Frame = 0

Query: 96  YMKGKRIHAQIVVVGHVP--NEYLKTKLLILYAKAGDLETAYILHERLLEKSLVSWNAMI 155
           Y K  RI     +   +P  N   +T ++  YA A   + A ++  ++ E+++VSWNA+I
Sbjct: 299 YAKCSRIKEARFIFDSMPIRNVIAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALI 358

Query: 156 AGYVQNGFGEVGLELYFKMRQSGLMPDQYTFASVLRSCATLASLEHGKRAHGVLI----K 215
           AGY QNG  E  L L+  +++  + P  Y+FA++L++CA LA L  G +AH  ++    K
Sbjct: 359 AGYTQNGENEEALSLFCLLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFK 418

Query: 216 CQIG--DNVVVSSALVDMYFKCSSLLDGHKAFNKSSSRNVITWTALISGYGQHGRVSEVL 275
            Q G  D++ V ++L+DMY KC  + +G+  F K   R+ ++W A+I G+ Q+G  +E L
Sbjct: 419 FQSGEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEAL 478

Query: 276 ESFNSMINEGYRPNYVTFLAVLAACGRGGFVAEAWRYFSLMTTNYGIKPRGQHFAAMADL 335
           E F  M+  G +P+++T + VL+ACG  GFV E   YFS MT ++G+ P   H+  M DL
Sbjct: 479 ELFREMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDL 538

Query: 336 LARAGRLQEAYNIALDAPCKEHPVTWGALVGACKVHEDINLMKCAAANYFALDSENSGKY 395
           L RAG L+EA ++  + P +   V WG+L+ ACKVH +I L K  A     ++  NSG Y
Sbjct: 539 LGRAGFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPY 598

Query: 396 VVLANAFAACGMWDNVSEIRDMMKKSGMSKDPGYSRIEIQREFHFFVKSDKSHEKTEEIY 452
           V+L+N +A  G W++V  +R  M+K G++K PG S I+IQ   H F+  DKSH + ++I+
Sbjct: 599 VLLSNMYAELGKWEDVMNVRKSMRKEGVTKQPGCSWIKIQGHDHVFMVKDKSHPRKKQIH 658

BLAST of Sed0005021 vs. ExPASy Swiss-Prot
Match: Q3E6Q1 (Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PCMP-H40 PE=2 SV=1)

HSP 1 Score: 282.3 bits (721), Expect = 9.7e-75
Identity = 140/381 (36.75%), Postives = 220/381 (57.74%), Query Frame = 0

Query: 87  LQECIFRKEYMKGKRIHAQIVVVGHVPNEYLKTKLLILYAKAGDLETAYILHERLLEKSL 146
           L  C    +  +G+ IH   V +G   N  +   L+ +Y K  +++TA  +  +L  ++L
Sbjct: 344 LHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTL 403

Query: 147 VSWNAMIAGYVQNGFGEVGLELYFKMRQSGLMPDQYTFASVLRSCATLASLEHGKRAHGV 206
           VSWNAMI G+ QNG     L  + +MR   + PD +T+ SV+ + A L+   H K  HGV
Sbjct: 404 VSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGV 463

Query: 207 LIKCQIGDNVVVSSALVDMYFKCSSLLDGHKAFNKSSSRNVITWTALISGYGQHGRVSEV 266
           +++  +  NV V++ALVDMY KC +++     F+  S R+V TW A+I GYG HG     
Sbjct: 464 VMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAA 523

Query: 267 LESFNSMINEGYRPNYVTFLAVLAACGRGGFVAEAWRYFSLMTTNYGIKPRGQHFAAMAD 326
           LE F  M     +PN VTFL+V++AC   G V    + F +M  NY I+    H+ AM D
Sbjct: 524 LELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVD 583

Query: 327 LLARAGRLQEAYNIALDAPCKEHPVTWGALVGACKVHEDINLMKCAAANYFALDSENSGK 386
           LL RAGRL EA++  +  P K     +GA++GAC++H+++N  + AA   F L+ ++ G 
Sbjct: 584 LLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGY 643

Query: 387 YVVLANAFAACGMWDNVSEIRDMMKKSGMSKDPGYSRIEIQREFHFFVKSDKSHEKTEEI 446
           +V+LAN + A  MW+ V ++R  M + G+ K PG S +EI+ E H F     +H  +++I
Sbjct: 644 HVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKI 703

Query: 447 YRTIDRITPILKDVGYIPELN 468
           Y  ++++   +K+ GY+P+ N
Sbjct: 704 YAFLEKLICHIKEAGYVPDTN 724

BLAST of Sed0005021 vs. ExPASy TrEMBL
Match: A0A6J1EI86 (pentatricopeptide repeat-containing protein At4g16470 OS=Cucurbita moschata OX=3662 GN=LOC111433634 PE=4 SV=1)

HSP 1 Score: 776.5 bits (2004), Expect = 6.1e-221
Identity = 386/474 (81.43%), Postives = 418/474 (88.19%), Query Frame = 0

Query: 1   MRLCNTRPSSSSSSINHLFAKSIYA-----VLHRHKSECANASFQVKPHQKDSSFWDRTF 60
           MRLC  RP  SSS + HLF KSI A     +  RHKSE AN  FQVKPHQKDSS WDRTF
Sbjct: 1   MRLCG-RP--SSSGVIHLFTKSIVAGATATIRRRHKSEYANDRFQVKPHQKDSSSWDRTF 60

Query: 61  RNLCSRGELSEAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQIVVVGHVPNE 120
           R+LC  G L+EAVALLCCM  QFHSKTYCLLLQECIFRKEYMKGKRIHAQ+VVVGH+PNE
Sbjct: 61  RSLCITGRLTEAVALLCCMPFQFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNE 120

Query: 121 YLKTKLLILYAKAGDLETAYILHERLLEKSLVSWNAMIAGYVQNGFGEVGLELYFKMRQS 180
           YLKTKLLILYAK GDLETA ILHE+LLE SLVSWNA+IAGYVQ GFGEVGLELYFKMR++
Sbjct: 121 YLKTKLLILYAKLGDLETANILHEKLLENSLVSWNALIAGYVQKGFGEVGLELYFKMRRT 180

Query: 181 GLMPDQYTFASVLRSCATLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSLLDG 240
           GL+PDQYTFASV R+CA+LASLEHGKRAHGVLIKC+IGDNVVVSSALVDMYFKCSS+LDG
Sbjct: 181 GLIPDQYTFASVFRACASLASLEHGKRAHGVLIKCRIGDNVVVSSALVDMYFKCSSILDG 240

Query: 241 HKAFNKSSSRNVITWTALISGYGQHGRVSEVLESFNSMINEGYRPNYVTFLAVLAACGRG 300
           HK FNKS++RNVITWTALISGYG HGRVSEVLESFN MINEGYRPNYVTFLAVL ACG G
Sbjct: 241 HKVFNKSTTRNVITWTALISGYGHHGRVSEVLESFNRMINEGYRPNYVTFLAVLTACGHG 300

Query: 301 GFVAEAWRYFSLMTTNYGIKPRGQHFAAMADLLARAGRLQEAYNIALDAPCKEHPVTWGA 360
           GFV+EAWRY SLM T Y I+PRGQH+AAMADLLARAGRLQEAY+  +DAPCKEH V WGA
Sbjct: 301 GFVSEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYDFVVDAPCKEHAVIWGA 360

Query: 361 LVGACKVHEDINLMKCAAANYFALDSENSGKYVVLANAFAACGMWDNVSEIRDMMKKSGM 420
           LVG CKVHEDI+LMK AAANY ALD+ N+GKYVVLAN FAA G+WDNV+EIR MMKKSGM
Sbjct: 361 LVGGCKVHEDIDLMKHAAANYLALDAGNAGKYVVLANGFAASGLWDNVAEIRGMMKKSGM 420

Query: 421 SKDPGYSRIEIQREFHFFVKSDKSHEKTEEIYRTIDRITPILKDVGYIPELNEN 470
           +K+PGYSRIEIQREFHFFVKSDKSH++ EEIYRTI  IT ILKD G I EL+EN
Sbjct: 421 NKEPGYSRIEIQREFHFFVKSDKSHKQAEEIYRTIHSITAILKDAGSIRELSEN 471

BLAST of Sed0005021 vs. ExPASy TrEMBL
Match: A0A6J1KS92 (pentatricopeptide repeat-containing protein At4g16470 OS=Cucurbita maxima OX=3661 GN=LOC111496780 PE=4 SV=1)

HSP 1 Score: 771.2 bits (1990), Expect = 2.6e-219
Identity = 383/474 (80.80%), Postives = 416/474 (87.76%), Query Frame = 0

Query: 1   MRLCNTRPSSSSSSINHLFAKSIYA-----VLHRHKSECANASFQVKPHQKDSSFWDRTF 60
           MRLC     SSSS + HLF KSI A     +  RHKSE AN   QVKPHQKDSS WDRTF
Sbjct: 1   MRLCG---RSSSSGVIHLFTKSIVAGATATIRRRHKSEYANDRSQVKPHQKDSSSWDRTF 60

Query: 61  RNLCSRGELSEAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQIVVVGHVPNE 120
           R+LC  G LSEAVALLC M  QFHSKTYCLLLQECIFRKEYMKGKRIHAQ+VVVGH+PNE
Sbjct: 61  RSLCITGRLSEAVALLCSMPFQFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGHLPNE 120

Query: 121 YLKTKLLILYAKAGDLETAYILHERLLEKSLVSWNAMIAGYVQNGFGEVGLELYFKMRQS 180
           YLKTKLLILYAK GDLETAYILHE+LL+ SLVSWNA+IAG VQ G GEVGLELYFKMR++
Sbjct: 121 YLKTKLLILYAKLGDLETAYILHEKLLDNSLVSWNALIAGCVQKGLGEVGLELYFKMRRT 180

Query: 181 GLMPDQYTFASVLRSCATLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSLLDG 240
           GL+PDQYTFASV+R+CA+LASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSS+ DG
Sbjct: 181 GLIPDQYTFASVIRACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSISDG 240

Query: 241 HKAFNKSSSRNVITWTALISGYGQHGRVSEVLESFNSMINEGYRPNYVTFLAVLAACGRG 300
           HK F+KSS+RNVITWTALISGYG HGRVSEVLESFNSMINEGYRPNYVTFLAVL ACG G
Sbjct: 241 HKVFDKSSTRNVITWTALISGYGHHGRVSEVLESFNSMINEGYRPNYVTFLAVLTACGHG 300

Query: 301 GFVAEAWRYFSLMTTNYGIKPRGQHFAAMADLLARAGRLQEAYNIALDAPCKEHPVTWGA 360
           GFV+EAWRY SLM T Y I+PRGQH+AAMADLLARAGRLQEAY+  +DAPCKEH V WGA
Sbjct: 301 GFVSEAWRYLSLMKTTYEIEPRGQHYAAMADLLARAGRLQEAYDFVVDAPCKEHSVIWGA 360

Query: 361 LVGACKVHEDINLMKCAAANYFALDSENSGKYVVLANAFAACGMWDNVSEIRDMMKKSGM 420
           LVG CKVHEDI+LMK AAA+Y ALD+ N+GKYVVLAN FAA G+WDNV+EIR MMKKSGM
Sbjct: 361 LVGGCKVHEDIDLMKHAAAHYLALDASNAGKYVVLANGFAASGLWDNVAEIRCMMKKSGM 420

Query: 421 SKDPGYSRIEIQREFHFFVKSDKSHEKTEEIYRTIDRITPILKDVGYIPELNEN 470
           +K+PGYSRIEIQREFHFFVKSDKSH++  EIYRTI  ITPILKD G IPEL+EN
Sbjct: 421 NKEPGYSRIEIQREFHFFVKSDKSHKQAVEIYRTIHSITPILKDAGSIPELSEN 471

BLAST of Sed0005021 vs. ExPASy TrEMBL
Match: A0A0A0KJS7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G511640 PE=4 SV=1)

HSP 1 Score: 769.2 bits (1985), Expect = 9.7e-219
Identity = 379/466 (81.33%), Postives = 411/466 (88.20%), Query Frame = 0

Query: 10  SSSSSINHLFAKSIYA-------VLHRHKSECANASFQVKPHQKDSSFWDRTFRNLCSRG 69
           SS S++ HL + SI         +  RHKSE A+ SFQVKPH KD+S WD+T R LC  G
Sbjct: 11  SSFSAVIHLISTSIVPTQLPTVHLPRRHKSEFASPSFQVKPHHKDTSSWDKTLRGLCLTG 70

Query: 70  ELSEAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQIVVVGHVPNEYLKTKLL 129
           +L+EAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQ+VVVG+VPNEYL TKLL
Sbjct: 71  KLAEAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQMVVVGYVPNEYLNTKLL 130

Query: 130 ILYAKAGDLETAYILHERLLEKSLVSWNAMIAGYVQNGFGEVGLELYFKMRQSGLMPDQY 189
           ILYAK+GDLETAY+LHE LLEKSLVSWN++IAGYVQ G  EVGLE Y KMRQSGLMPDQY
Sbjct: 131 ILYAKSGDLETAYVLHEHLLEKSLVSWNSLIAGYVQKGLAEVGLEFYLKMRQSGLMPDQY 190

Query: 190 TFASVLRSCATLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSLLDGHKAFNKS 249
           TFASVLR+CA+LASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSL DGHKAFNKS
Sbjct: 191 TFASVLRACASLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSLSDGHKAFNKS 250

Query: 250 SSRNVITWTALISGYGQHGRVSEVLESFNSMINEGYRPNYVTFLAVLAACGRGGFVAEAW 309
           S+RNVITWTALISGYGQHGR+SEVLESF+SMIN+GYRPNYVTFLAVLAAC RGGFV+EAW
Sbjct: 251 SNRNVITWTALISGYGQHGRISEVLESFHSMINKGYRPNYVTFLAVLAACSRGGFVSEAW 310

Query: 310 RYFSLMTTNYGIKPRGQHFAAMADLLARAGRLQEAYNIALDAPCKEHPVTWGALVGACKV 369
            YFSLMT  YGI+PRGQH+AAMADLLARAGRLQEAY+  LDAPCKEH V WGALVGACKV
Sbjct: 311 NYFSLMTKTYGIEPRGQHYAAMADLLARAGRLQEAYDFVLDAPCKEHSVMWGALVGACKV 370

Query: 370 HEDINLMKCAAANYFALDSENSGKYVVLANAFAACGMWDNVSEIRDMMKKSGMSKDPGYS 429
           HED++LMK  AA+YF LD +NSGK VV +NAFA  G+WDNV EIR MMKKSGMSKDPG S
Sbjct: 371 HEDVDLMKHVAASYFELDPKNSGKLVVFSNAFATSGLWDNVEEIRAMMKKSGMSKDPGCS 430

Query: 430 RIEIQREFHFFVKSDKSHEKTEEIYRTIDRITPILKDVGYIPELNE 469
           RIEIQREFH FVK DKSH +TEEIYRTIDRITPILKD GYIPEL E
Sbjct: 431 RIEIQREFHIFVKGDKSHRETEEIYRTIDRITPILKDAGYIPELCE 476

BLAST of Sed0005021 vs. ExPASy TrEMBL
Match: A0A6J1BXR4 (pentatricopeptide repeat-containing protein At4g16470 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111005684 PE=4 SV=1)

HSP 1 Score: 765.8 bits (1976), Expect = 1.1e-217
Identity = 382/474 (80.59%), Postives = 418/474 (88.19%), Query Frame = 0

Query: 1   MRLCNTRPSSSSSSINHLFAKSIYA----VLHRHK-SECANASFQVKPHQKDSSFWDRTF 60
           MRLC  RPSS +    HLF KSI A     +HR K SE   ASFQV PHQKD+SFWD+TF
Sbjct: 1   MRLC-CRPSSGAI---HLFTKSILAATTLTIHRRKISEYCTASFQVNPHQKDASFWDKTF 60

Query: 61  RNLCSRGELSEAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQIVVVGHVPNE 120
           R LC  G LSEAVALLCCMALQFHS+TYCLLLQECIFRKEYMKGKRIHAQIVVVGH+P+E
Sbjct: 61  RGLCLTGRLSEAVALLCCMALQFHSRTYCLLLQECIFRKEYMKGKRIHAQIVVVGHLPSE 120

Query: 121 YLKTKLLILYAKAGDLETAYILHERLLEKSLVSWNAMIAGYVQNGFGEVGLELYFKMRQS 180
           YL TKLLILYAK+GDL TAYILHE+LL KSLVSWNAMIAGYVQ G GEVGLE Y KMRQS
Sbjct: 121 YLTTKLLILYAKSGDLGTAYILHEKLLGKSLVSWNAMIAGYVQKGLGEVGLEFYLKMRQS 180

Query: 181 GLMPDQYTFASVLRSCATLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSLLDG 240
           G++PDQYTFASV ++CATLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSL DG
Sbjct: 181 GMVPDQYTFASVFKACATLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSLSDG 240

Query: 241 HKAFNKSSSRNVITWTALISGYGQHGRVSEVLESFNSMINEGYRPNYVTFLAVLAACGRG 300
            + FN SS+RNVITWTALI GY QHGRV EVLESFNSMINEGYRPN+VTFLAVLAACGRG
Sbjct: 241 RRVFNISSNRNVITWTALIFGYAQHGRVFEVLESFNSMINEGYRPNHVTFLAVLAACGRG 300

Query: 301 GFVAEAWRYFSLMTTNYGIKPRGQHFAAMADLLARAGRLQEAYNIALDAPCKEHPVTWGA 360
           GFV+EA RYFSLM T+Y I+PRGQH+AAMADLLARAGRL+EAYN  L+APCKEH V WGA
Sbjct: 301 GFVSEAQRYFSLMMTDYRIEPRGQHYAAMADLLARAGRLEEAYNFVLNAPCKEHSVIWGA 360

Query: 361 LVGACKVHEDINLMKCAAANYFALDSENSGKYVVLANAFAACGMWDNVSEIRDMMKKSGM 420
           LVG CKVHEDI+LMK AAANYFALD+ENSGKYVVL+NA+A  G+WDNV+E+RDMMKK+GM
Sbjct: 361 LVGGCKVHEDIDLMKYAAANYFALDAENSGKYVVLSNAYATSGLWDNVAEVRDMMKKTGM 420

Query: 421 SKDPGYSRIEIQREFHFFVKSDKSHEKTEEIYRTIDRITPILKDVGYIPELNEN 470
           +KDPGYSRIEIQREF FF KSD+SH++TEEIYRTI+R+TPILKD GYI EL  N
Sbjct: 421 TKDPGYSRIEIQREFRFFFKSDESHKETEEIYRTINRLTPILKDAGYIAELRGN 470

BLAST of Sed0005021 vs. ExPASy TrEMBL
Match: A0A6J1BVB9 (pentatricopeptide repeat-containing protein At4g16470 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111005684 PE=4 SV=1)

HSP 1 Score: 761.9 bits (1966), Expect = 1.5e-216
Identity = 380/477 (79.66%), Postives = 417/477 (87.42%), Query Frame = 0

Query: 1   MRLCNTRPSSSSSSINHLFAKSIYA----VLHRHK----SECANASFQVKPHQKDSSFWD 60
           MRLC  RPSS +    HLF KSI A     +HR K        N+SFQV PHQKD+SFWD
Sbjct: 1   MRLC-CRPSSGAI---HLFTKSILAATTLTIHRRKISEYCTARNSSFQVNPHQKDASFWD 60

Query: 61  RTFRNLCSRGELSEAVALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQIVVVGHV 120
           +TFR LC  G LSEAVALLCCMALQFHS+TYCLLLQECIFRKEYMKGKRIHAQIVVVGH+
Sbjct: 61  KTFRGLCLTGRLSEAVALLCCMALQFHSRTYCLLLQECIFRKEYMKGKRIHAQIVVVGHL 120

Query: 121 PNEYLKTKLLILYAKAGDLETAYILHERLLEKSLVSWNAMIAGYVQNGFGEVGLELYFKM 180
           P+EYL TKLLILYAK+GDL TAYILHE+LL KSLVSWNAMIAGYVQ G GEVGLE Y KM
Sbjct: 121 PSEYLTTKLLILYAKSGDLGTAYILHEKLLGKSLVSWNAMIAGYVQKGLGEVGLEFYLKM 180

Query: 181 RQSGLMPDQYTFASVLRSCATLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSL 240
           RQSG++PDQYTFASV ++CATLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSL
Sbjct: 181 RQSGMVPDQYTFASVFKACATLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSL 240

Query: 241 LDGHKAFNKSSSRNVITWTALISGYGQHGRVSEVLESFNSMINEGYRPNYVTFLAVLAAC 300
            DG + FN SS+RNVITWTALI GY QHGRV EVLESFNSMINEGYRPN+VTFLAVLAAC
Sbjct: 241 SDGRRVFNISSNRNVITWTALIFGYAQHGRVFEVLESFNSMINEGYRPNHVTFLAVLAAC 300

Query: 301 GRGGFVAEAWRYFSLMTTNYGIKPRGQHFAAMADLLARAGRLQEAYNIALDAPCKEHPVT 360
           GRGGFV+EA RYFSLM T+Y I+PRGQH+AAMADLLARAGRL+EAYN  L+APCKEH V 
Sbjct: 301 GRGGFVSEAQRYFSLMMTDYRIEPRGQHYAAMADLLARAGRLEEAYNFVLNAPCKEHSVI 360

Query: 361 WGALVGACKVHEDINLMKCAAANYFALDSENSGKYVVLANAFAACGMWDNVSEIRDMMKK 420
           WGALVG CKVHEDI+LMK AAANYFALD+ENSGKYVVL+NA+A  G+WDNV+E+RDMMKK
Sbjct: 361 WGALVGGCKVHEDIDLMKYAAANYFALDAENSGKYVVLSNAYATSGLWDNVAEVRDMMKK 420

Query: 421 SGMSKDPGYSRIEIQREFHFFVKSDKSHEKTEEIYRTIDRITPILKDVGYIPELNEN 470
           +GM+KDPGYSRIEIQREF FF KSD+SH++TEEIYRTI+R+TPILKD GYI EL  N
Sbjct: 421 TGMTKDPGYSRIEIQREFRFFFKSDESHKETEEIYRTINRLTPILKDAGYIAELRGN 473

BLAST of Sed0005021 vs. TAIR 10
Match: AT4G16470.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 496.1 bits (1276), Expect = 3.0e-140
Identity = 242/457 (52.95%), Postives = 317/457 (69.37%), Query Frame = 0

Query: 9   SSSSSSINHLFAKSIYAVLHRHKSECANASFQVKPHQKDSSFWDRTFRNLCSRGELSEAV 68
           +S +S+   +F+ +   +L R  +E     FQV+ +Q+ +   D+T + LC  G L EAV
Sbjct: 38  ASQTSASGSMFSGNATTILRRMLAEKRIGRFQVE-NQRKTEKLDKTLKGLCVTGRLKEAV 97

Query: 69  ALLCCMALQFHSKTYCLLLQECIFRKEYMKGKRIHAQIVVVGHVPNEYLKTKLLILYAKA 128
            LL    LQ   +TY +LLQEC  RKEY KGKRIHAQ+ VVG   NEYLK KLLILYA +
Sbjct: 98  GLLWSSGLQVEPETYAVLLQECKQRKEYTKGKRIHAQMFVVGFALNEYLKVKLLILYALS 157

Query: 129 GDLETAYILHERLLEKSLVSWNAMIAGYVQNGFGEVGLELYFKMRQSGLMPDQYTFASVL 188
           GDL+TA IL   L  + L+ WNAMI+GYVQ G  + GL +Y+ MRQ+ ++PDQYTFASV 
Sbjct: 158 GDLQTAGILFRSLKIRDLIPWNAMISGYVQKGLEQEGLFIYYDMRQNRIVPDQYTFASVF 217

Query: 189 RSCATLASLEHGKRAHGVLIKCQIGDNVVVSSALVDMYFKCSSLLDGHKAFNKSSSRNVI 248
           R+C+ L  LEHGKRAH V+IK  I  N++V SALVDMYFKCSS  DGH+ F++ S+RNVI
Sbjct: 218 RACSALDRLEHGKRAHAVMIKRCIKSNIIVDSALVDMYFKCSSFSDGHRVFDQLSTRNVI 277

Query: 249 TWTALISGYGQHGRVSEVLESFNSMINEGYRPNYVTFLAVLAACGRGGFVAEAWRYFSLM 308
           TWT+LISGYG HG+VSEVL+ F  M  EG RPN VTFL VL AC  GG V + W +F  M
Sbjct: 278 TWTSLISGYGYHGKVSEVLKCFEKMKEEGCRPNPVTFLVVLTACNHGGLVDKGWEHFYSM 337

Query: 309 TTNYGIKPRGQHFAAMADLLARAGRLQEAYNIALDAPCKEHPVTWGALVGACKVHEDINL 368
             +YGI+P GQH+AAM D L RAGRLQEAY   + +PCKEHP  WG+L+GAC++H ++ L
Sbjct: 338 KRDYGIEPEGQHYAAMVDTLGRAGRLQEAYEFVMKSPCKEHPPVWGSLLGACRIHGNVKL 397

Query: 369 MKCAAANYFALDSENSGKYVVLANAFAACGMWDNVSEIRDMMKKSGMSKDPGYSRIEIQR 428
           ++ AA  +  LD  N G YVV AN +A+CG+ +  S++R  M+ +G+ KDPGYS+IE+Q 
Sbjct: 398 LELAATKFLELDPTNGGNYVVFANGYASCGLREAASKVRRKMENAGVKKDPGYSQIELQG 457

Query: 429 EFHFFVKSDKSHEKTEEIYRTIDRITPILKDVGYIPE 466
           E H F+K D SH  +E+IY+ +  +T    D+ Y P+
Sbjct: 458 EVHRFMKDDTSHRLSEKIYKKVHEMTSFFMDIDYYPD 493

BLAST of Sed0005021 vs. TAIR 10
Match: AT2G03880.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 287.7 bits (735), Expect = 1.6e-77
Identity = 141/386 (36.53%), Postives = 230/386 (59.59%), Query Frame = 0

Query: 82  TYCLLLQECIFRKEYMKGKRIHAQIVVVGHVPNEYLKTKLLILYAKAGDLETAYILHERL 141
           TY  +L+ C    +    + +H  I+  G   + ++++ L+ ++AK G+ E A  + + +
Sbjct: 164 TYSSVLRSCNGMSDV---RMLHCGIIKEGLESDVFVRSALIDVFAKLGEPEDALSVFDEM 223

Query: 142 LEKSLVSWNAMIAGYVQNGFGEVGLELYFKMRQSGLMPDQYTFASVLRSCATLASLEHGK 201
           +    + WN++I G+ QN   +V LEL+ +M+++G + +Q T  SVLR+C  LA LE G 
Sbjct: 224 VTGDAIVWNSIIGGFAQNSRSDVALELFKRMKRAGFIAEQATLTSVLRACTGLALLELGM 283

Query: 202 RAHGVLIKCQIGDNVVVSSALVDMYFKCSSLLDGHKAFNKSSSRNVITWTALISGYGQHG 261
           +AH  ++K     ++++++ALVDMY KC SL D  + FN+   R+VITW+ +ISG  Q+G
Sbjct: 284 QAHVHIVK--YDQDLILNNALVDMYCKCGSLEDALRVFNQMKERDVITWSTMISGLAQNG 343

Query: 262 RVSEVLESFNSMINEGYRPNYVTFLAVLAACGRGGFVAEAWRYFSLMTTNYGIKPRGQHF 321
              E L+ F  M + G +PNY+T + VL AC   G + + W YF  M   YGI P  +H+
Sbjct: 344 YSQEALKLFERMKSSGTKPNYITIVGVLFACSHAGLLEDGWYYFRSMKKLYGIDPVREHY 403

Query: 322 AAMADLLARAGRLQEAYNIALDAPCKEHPVTWGALVGACKVHEDINLMKCAAANYFALDS 381
             M DLL +AG+L +A  +  +  C+   VTW  L+GAC+V  ++ L + AA    ALD 
Sbjct: 404 GCMIDLLGKAGKLDDAVKLLNEMECEPDAVTWRTLLGACRVQRNMVLAEYAAKKVIALDP 463

Query: 382 ENSGKYVVLANAFAACGMWDNVSEIRDMMKKSGMSKDPGYSRIEIQREFHFFVKSDKSHE 441
           E++G Y +L+N +A    WD+V EIR  M+  G+ K+PG S IE+ ++ H F+  D SH 
Sbjct: 464 EDAGTYTLLSNIYANSQKWDSVEEIRTRMRDRGIKKEPGCSWIEVNKQIHAFIIGDNSHP 523

Query: 442 KTEEIYRTIDRITPILKDVGYIPELN 468
           +  E+ + ++++   L  +GY+PE N
Sbjct: 524 QIVEVSKKLNQLIHRLTGIGYVPETN 544

BLAST of Sed0005021 vs. TAIR 10
Match: AT3G12770.1 (mitochondrial editing factor 22 )

HSP 1 Score: 283.5 bits (724), Expect = 3.1e-76
Identity = 138/370 (37.30%), Postives = 217/370 (58.65%), Query Frame = 0

Query: 94  KEYMKGKRIHAQIVVVGHVPNEYLKTKLLILYAKAGDLETAYILHERLLEKSLVSWNAMI 153
           ++  +G+ IHA +V +G      L   L  +YAK G + TA IL +++   +L+ WNAMI
Sbjct: 236 QDLKQGRSIHASVVKMGLEIEPDLLISLNTMYAKCGQVATAKILFDKMKSPNLILWNAMI 295

Query: 154 AGYVQNGFGEVGLELYFKMRQSGLMPDQYTFASVLRSCATLASLEHGKRAHGVLIKCQIG 213
           +GY +NG+    ++++ +M    + PD  +  S + +CA + SLE  +  +  + +    
Sbjct: 296 SGYAKNGYAREAIDMFHEMINKDVRPDTISITSAISACAQVGSLEQARSMYEYVGRSDYR 355

Query: 214 DNVVVSSALVDMYFKCSSLLDGHKAFNKSSSRNVITWTALISGYGQHGRVSEVLESFNSM 273
           D+V +SSAL+DM+ KC S+      F+++  R+V+ W+A+I GYG HGR  E +  + +M
Sbjct: 356 DDVFISSALIDMFAKCGSVEGARLVFDRTLDRDVVVWSAMIVGYGLHGRAREAISLYRAM 415

Query: 274 INEGYRPNYVTFLAVLAACGRGGFVAEAWRYFSLMTTNYGIKPRGQHFAAMADLLARAGR 333
              G  PN VTFL +L AC   G V E W +F+ M  ++ I P+ QH+A + DLL RAG 
Sbjct: 416 ERGGVHPNDVTFLGLLMACNHSGMVREGWWFFNRM-ADHKINPQQQHYACVIDLLGRAGH 475

Query: 334 LQEAYNIALDAPCKEHPVTWGALVGACKVHEDINLMKCAAANYFALDSENSGKYVVLANA 393
           L +AY +    P +     WGAL+ ACK H  + L + AA   F++D  N+G YV L+N 
Sbjct: 476 LDQAYEVIKCMPVQPGVTVWGALLSACKKHRHVELGEYAAQQLFSIDPSNTGHYVQLSNL 535

Query: 394 FAACGMWDNVSEIRDMMKKSGMSKDPGYSRIEIQREFHFFVKSDKSHEKTEEIYRTIDRI 453
           +AA  +WD V+E+R  MK+ G++KD G S +E++     F   DKSH + EEI R ++ I
Sbjct: 536 YAAARLWDRVAEVRVRMKEKGLNKDVGCSWVEVRGRLEAFRVGDKSHPRYEEIERQVEWI 595

Query: 454 TPILKDVGYI 464
              LK+ G++
Sbjct: 596 ESRLKEGGFV 604

BLAST of Sed0005021 vs. TAIR 10
Match: AT2G13600.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 282.7 bits (722), Expect = 5.3e-76
Identity = 140/364 (38.46%), Postives = 219/364 (60.16%), Query Frame = 0

Query: 96  YMKGKRIHAQIVVVGHVP--NEYLKTKLLILYAKAGDLETAYILHERLLEKSLVSWNAMI 155
           Y K  RI     +   +P  N   +T ++  YA A   + A ++  ++ E+++VSWNA+I
Sbjct: 299 YAKCSRIKEARFIFDSMPIRNVIAETSMISGYAMAASTKAARLMFTKMAERNVVSWNALI 358

Query: 156 AGYVQNGFGEVGLELYFKMRQSGLMPDQYTFASVLRSCATLASLEHGKRAHGVLI----K 215
           AGY QNG  E  L L+  +++  + P  Y+FA++L++CA LA L  G +AH  ++    K
Sbjct: 359 AGYTQNGENEEALSLFCLLKRESVCPTHYSFANILKACADLAELHLGMQAHVHVLKHGFK 418

Query: 216 CQIG--DNVVVSSALVDMYFKCSSLLDGHKAFNKSSSRNVITWTALISGYGQHGRVSEVL 275
            Q G  D++ V ++L+DMY KC  + +G+  F K   R+ ++W A+I G+ Q+G  +E L
Sbjct: 419 FQSGEEDDIFVGNSLIDMYVKCGCVEEGYLVFRKMMERDCVSWNAMIIGFAQNGYGNEAL 478

Query: 276 ESFNSMINEGYRPNYVTFLAVLAACGRGGFVAEAWRYFSLMTTNYGIKPRGQHFAAMADL 335
           E F  M+  G +P+++T + VL+ACG  GFV E   YFS MT ++G+ P   H+  M DL
Sbjct: 479 ELFREMLESGEKPDHITMIGVLSACGHAGFVEEGRHYFSSMTRDFGVAPLRDHYTCMVDL 538

Query: 336 LARAGRLQEAYNIALDAPCKEHPVTWGALVGACKVHEDINLMKCAAANYFALDSENSGKY 395
           L RAG L+EA ++  + P +   V WG+L+ ACKVH +I L K  A     ++  NSG Y
Sbjct: 539 LGRAGFLEEAKSMIEEMPMQPDSVIWGSLLAACKVHRNITLGKYVAEKLLEVEPSNSGPY 598

Query: 396 VVLANAFAACGMWDNVSEIRDMMKKSGMSKDPGYSRIEIQREFHFFVKSDKSHEKTEEIY 452
           V+L+N +A  G W++V  +R  M+K G++K PG S I+IQ   H F+  DKSH + ++I+
Sbjct: 599 VLLSNMYAELGKWEDVMNVRKSMRKEGVTKQPGCSWIKIQGHDHVFMVKDKSHPRKKQIH 658

BLAST of Sed0005021 vs. TAIR 10
Match: AT1G11290.1 (Pentatricopeptide repeat (PPR) superfamily protein )

HSP 1 Score: 282.3 bits (721), Expect = 6.9e-76
Identity = 140/381 (36.75%), Postives = 220/381 (57.74%), Query Frame = 0

Query: 87  LQECIFRKEYMKGKRIHAQIVVVGHVPNEYLKTKLLILYAKAGDLETAYILHERLLEKSL 146
           L  C    +  +G+ IH   V +G   N  +   L+ +Y K  +++TA  +  +L  ++L
Sbjct: 344 LHACADLGDLERGRFIHKLSVELGLDRNVSVVNSLISMYCKCKEVDTAASMFGKLQSRTL 403

Query: 147 VSWNAMIAGYVQNGFGEVGLELYFKMRQSGLMPDQYTFASVLRSCATLASLEHGKRAHGV 206
           VSWNAMI G+ QNG     L  + +MR   + PD +T+ SV+ + A L+   H K  HGV
Sbjct: 404 VSWNAMILGFAQNGRPIDALNYFSQMRSRTVKPDTFTYVSVITAIAELSITHHAKWIHGV 463

Query: 207 LIKCQIGDNVVVSSALVDMYFKCSSLLDGHKAFNKSSSRNVITWTALISGYGQHGRVSEV 266
           +++  +  NV V++ALVDMY KC +++     F+  S R+V TW A+I GYG HG     
Sbjct: 464 VMRSCLDKNVFVTTALVDMYAKCGAIMIARLIFDMMSERHVTTWNAMIDGYGTHGFGKAA 523

Query: 267 LESFNSMINEGYRPNYVTFLAVLAACGRGGFVAEAWRYFSLMTTNYGIKPRGQHFAAMAD 326
           LE F  M     +PN VTFL+V++AC   G V    + F +M  NY I+    H+ AM D
Sbjct: 524 LELFEEMQKGTIKPNGVTFLSVISACSHSGLVEAGLKCFYMMKENYSIELSMDHYGAMVD 583

Query: 327 LLARAGRLQEAYNIALDAPCKEHPVTWGALVGACKVHEDINLMKCAAANYFALDSENSGK 386
           LL RAGRL EA++  +  P K     +GA++GAC++H+++N  + AA   F L+ ++ G 
Sbjct: 584 LLGRAGRLNEAWDFIMQMPVKPAVNVYGAMLGACQIHKNVNFAEKAAERLFELNPDDGGY 643

Query: 387 YVVLANAFAACGMWDNVSEIRDMMKKSGMSKDPGYSRIEIQREFHFFVKSDKSHEKTEEI 446
           +V+LAN + A  MW+ V ++R  M + G+ K PG S +EI+ E H F     +H  +++I
Sbjct: 644 HVLLANIYRAASMWEKVGQVRVSMLRQGLRKTPGCSMVEIKNEVHSFFSGSTAHPDSKKI 703

Query: 447 YRTIDRITPILKDVGYIPELN 468
           Y  ++++   +K+ GY+P+ N
Sbjct: 704 YAFLEKLICHIKEAGYVPDTN 724

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038881197.11.9e-22183.84pentatricopeptide repeat-containing protein At4g16470 [Benincasa hispida][more]
XP_022926503.11.3e-22081.43pentatricopeptide repeat-containing protein At4g16470 [Cucurbita moschata][more]
KAG6594700.12.8e-22081.43Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma sub... [more]
XP_023518629.14.8e-22081.01pentatricopeptide repeat-containing protein At4g16470 [Cucurbita pepo subsp. pep... [more]
KAG7026668.11.1e-21981.22Scarecrow-like protein 13, partial [Cucurbita argyrosperma subsp. argyrosperma][more]
Match NameE-valueIdentityDescription
O234914.3e-13952.95Pentatricopeptide repeat-containing protein At4g16470 OS=Arabidopsis thaliana OX... [more]
Q9SI532.3e-7636.53Pentatricopeptide repeat-containing protein At2g03880, mitochondrial OS=Arabidop... [more]
Q9LTV84.4e-7537.30Pentatricopeptide repeat-containing protein At3g12770 OS=Arabidopsis thaliana OX... [more]
Q9SIT77.5e-7538.46Pentatricopeptide repeat-containing protein At2g13600 OS=Arabidopsis thaliana OX... [more]
Q3E6Q19.7e-7536.75Pentatricopeptide repeat-containing protein At1g11290, chloroplastic OS=Arabidop... [more]
Match NameE-valueIdentityDescription
A0A6J1EI866.1e-22181.43pentatricopeptide repeat-containing protein At4g16470 OS=Cucurbita moschata OX=3... [more]
A0A6J1KS922.6e-21980.80pentatricopeptide repeat-containing protein At4g16470 OS=Cucurbita maxima OX=366... [more]
A0A0A0KJS79.7e-21981.33Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G511640 PE=4 SV=1[more]
A0A6J1BXR41.1e-21780.59pentatricopeptide repeat-containing protein At4g16470 isoform X2 OS=Momordica ch... [more]
A0A6J1BVB91.5e-21679.66pentatricopeptide repeat-containing protein At4g16470 isoform X1 OS=Momordica ch... [more]
Match NameE-valueIdentityDescription
AT4G16470.13.0e-14052.95Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT2G03880.11.6e-7736.53Pentatricopeptide repeat (PPR) superfamily protein [more]
AT3G12770.13.1e-7637.30mitochondrial editing factor 22 [more]
AT2G13600.15.3e-7638.46Pentatricopeptide repeat (PPR) superfamily protein [more]
AT1G11290.16.9e-7636.75Pentatricopeptide repeat (PPR) superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Chayote (edule) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 147..180
e-value: 1.7E-6
score: 25.8
coord: 283..316
e-value: 0.0012
score: 16.8
coord: 248..281
e-value: 2.4E-7
score: 28.5
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 387..414
e-value: 0.23
score: 11.8
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 145..192
e-value: 1.4E-9
score: 38.0
coord: 245..292
e-value: 1.1E-9
score: 38.3
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 145..179
score: 11.213468
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 246..280
score: 12.561707
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 200..462
e-value: 2.9E-24
score: 88.1
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 15..199
e-value: 4.1E-19
score: 70.6
NoneNo IPR availablePANTHERPTHR47925OS01G0913400 PROTEIN-RELATEDcoord: 37..174
coord: 62..466
NoneNo IPR availablePANTHERPTHR47925:SF61PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN DOT4, CHLOROPLASTICcoord: 37..174
coord: 62..466

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sed0005021.1Sed0005021.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009157 deoxyribonucleoside monophosphate biosynthetic process
biological_process GO:0071897 DNA biosynthetic process
biological_process GO:0016310 phosphorylation
molecular_function GO:0005524 ATP binding
molecular_function GO:0005515 protein binding
molecular_function GO:0004797 thymidine kinase activity