CsaV3_2G011310 (gene) Cucumber (Chinese Long) v3

NameCsaV3_2G011310
Typegene
OrganismCucumis sativus (Cucumber (Chinese Long) v3)
Descriptionpentatricopeptide repeat-containing protein At5g47360
Locationchr2 : 8559328 .. 8560755 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTCTCTTTCGAATCTCTTGCCCCCGATCATCTTCATTTCTTCTCAACATCTCTACCTTATCTACCTTTCACCTAAATACACTCTCTTCTTCCGATTTATTCTATGACCATTTGGAGAAAAGCAATGGTAATCTGGATAAAACCCTTGCTACTCTAAAGACCAAGTTGGATTCTAGATGTGTCAACGAAGTATTATATAAATGTTCCTTCGAACTATCGCAAATGGGTCTTAGATTTTTTATATGGGCTGGTCGACAGCCTAATTATAGGCATTCTTCTTTTATGTACAGTAGAGCTTGTGAACTGATTGGAATTAATGTAAGCCCATGTTTGCTTTTTAACGTTATTGAAGATTATAGAAGGGAGGGTTGCCTTGTTGATATTAGGATGTTTAAGATTATTTTAAACTTGTGTAAAGAAGCTAAGCTTGCAAAAGAGGCTTTGTCTATTTTAAGGAAAATGTCTGAATTTCATTTGCGTGCTGACACTACAATGTATAATCTGGTTATAAGGTTATTTACTGAGAAGGGTGAGATGGATAAGGCGATGGAGTTGATGAAAGAGATGGATTCAGTTGACATACATCCTAATATGATCACTTATATTTCCATGCTCAAAGGATTCTGTGATGTGGGTCGTTGGGAGGATGCTTATGGGTTATTTAAGGATATGAAGGAAAATGGATGTGCCCCAAATACAGTGGTTTACTCTGTGCTAGTGAATGGCGCCATTCGACTCAGAATTATGGATAGGCTAATGGAAATGTTGAAGGAGATGGAAAAACAAGGGGGAACTTGCAGTCCAAATACTGTCACATACACTTCTATAATCCAGAGTCTATGTGAAGAAGGCCACCCTCTGGAGGCATTGAAGGTATTAGACAGAATGGAAGAGTATGGTTATGCTCCAAATCGTGTTGCAGTTAGCTTTTTAGTTAAGGAATTTTGTAAAGATGGCCATGTGGAGGAGGCTTATAAGTTGATTGATAGAGTTGTTGCGAGAGGTGGTGTTTCATATGGTGATTGTTATAGCTCACTTGTGGTAACTTTAGTTAAGATGAAAAAGATTGCAGAGGCAGAGAAGCTATTTAGAAACATGTTAGCCAACGGGGTGAAGCCAGATGGTGTGGCTTGCAGTCTCATGATCAGGGAACTGTGCTTAGAGGAGCGAGTGCTAGATGGTTTTAACTTATGCTACGAAGTCGATAGGAATGGATATTTATGTTCCATTGATGCTGATATTTATTCTCTTCTTTTAGTTGGACTTTGTGAACATGACCACTCTGTGGATGCTGCAAAACTAGCAAGGTTGATGCTTAAAAAGGGAATTCGTTTAAAACCTCACTATGCTGAAAGTATCATCAAACATCTAAAGAAATTTGAAGACCGAGAGTTAGTTATGCATTTGGGCGGAATAAGGAAATAA

mRNA sequence

ATGGCTCTCTTTCGAATCTCTTGCCCCCGATCATCTTCATTTCTTCTCAACATCTCTACCTTATCTACCTTTCACCTAAATACACTCTCTTCTTCCGATTTATTCTATGACCATTTGGAGAAAAGCAATGGTAATCTGGATAAAACCCTTGCTACTCTAAAGACCAAGTTGGATTCTAGATGTGTCAACGAAGTATTATATAAATGTTCCTTCGAACTATCGCAAATGGGTCTTAGATTTTTTATATGGGCTGGTCGACAGCCTAATTATAGGCATTCTTCTTTTATGTACAGTAGAGCTTGTGAACTGATTGGAATTAATGTAAGCCCATGTTTGCTTTTTAACGTTATTGAAGATTATAGAAGGGAGGGTTGCCTTGTTGATATTAGGATGTTTAAGATTATTTTAAACTTGTGTAAAGAAGCTAAGCTTGCAAAAGAGGCTTTGTCTATTTTAAGGAAAATGTCTGAATTTCATTTGCGTGCTGACACTACAATGTATAATCTGGTTATAAGGTTATTTACTGAGAAGGGTGAGATGGATAAGGCGATGGAGTTGATGAAAGAGATGGATTCAGTTGACATACATCCTAATATGATCACTTATATTTCCATGCTCAAAGGATTCTGTGATGTGGGTCGTTGGGAGGATGCTTATGGGTTATTTAAGGATATGAAGGAAAATGGATGTGCCCCAAATACAGTGGTTTACTCTGTGCTAGTGAATGGCGCCATTCGACTCAGAATTATGGATAGGCTAATGGAAATGTTGAAGGAGATGGAAAAACAAGGGGGAACTTGCAGTCCAAATACTGTCACATACACTTCTATAATCCAGAGTCTATGTGAAGAAGGCCACCCTCTGGAGGCATTGAAGGTATTAGACAGAATGGAAGAGTATGGTTATGCTCCAAATCGTGTTGCAGTTAGCTTTTTAGTTAAGGAATTTTGTAAAGATGGCCATGTGGAGGAGGCTTATAAGTTGATTGATAGAGTTGTTGCGAGAGGTGGTGTTTCATATGGTGATTGTTATAGCTCACTTGTGGTAACTTTAGTTAAGATGAAAAAGATTGCAGAGGCAGAGAAGCTATTTAGAAACATGTTAGCCAACGGGGTGAAGCCAGATGGTGTGGCTTGCAGTCTCATGATCAGGGAACTGTGCTTAGAGGAGCGAGTGCTAGATGGTTTTAACTTATGCTACGAAGTCGATAGGAATGGATATTTATGTTCCATTGATGCTGATATTTATTCTCTTCTTTTAGTTGGACTTTGTGAACATGACCACTCTGTGGATGCTGCAAAACTAGCAAGGTTGATGCTTAAAAAGGGAATTCGTTTAAAACCTCACTATGCTGAAAGTATCATCAAACATCTAAAGAAATTTGAAGACCGAGAGTTAGTTATGCATTTGGGCGGAATAAGGAAATAA

Coding sequence (CDS)

ATGGCTCTCTTTCGAATCTCTTGCCCCCGATCATCTTCATTTCTTCTCAACATCTCTACCTTATCTACCTTTCACCTAAATACACTCTCTTCTTCCGATTTATTCTATGACCATTTGGAGAAAAGCAATGGTAATCTGGATAAAACCCTTGCTACTCTAAAGACCAAGTTGGATTCTAGATGTGTCAACGAAGTATTATATAAATGTTCCTTCGAACTATCGCAAATGGGTCTTAGATTTTTTATATGGGCTGGTCGACAGCCTAATTATAGGCATTCTTCTTTTATGTACAGTAGAGCTTGTGAACTGATTGGAATTAATGTAAGCCCATGTTTGCTTTTTAACGTTATTGAAGATTATAGAAGGGAGGGTTGCCTTGTTGATATTAGGATGTTTAAGATTATTTTAAACTTGTGTAAAGAAGCTAAGCTTGCAAAAGAGGCTTTGTCTATTTTAAGGAAAATGTCTGAATTTCATTTGCGTGCTGACACTACAATGTATAATCTGGTTATAAGGTTATTTACTGAGAAGGGTGAGATGGATAAGGCGATGGAGTTGATGAAAGAGATGGATTCAGTTGACATACATCCTAATATGATCACTTATATTTCCATGCTCAAAGGATTCTGTGATGTGGGTCGTTGGGAGGATGCTTATGGGTTATTTAAGGATATGAAGGAAAATGGATGTGCCCCAAATACAGTGGTTTACTCTGTGCTAGTGAATGGCGCCATTCGACTCAGAATTATGGATAGGCTAATGGAAATGTTGAAGGAGATGGAAAAACAAGGGGGAACTTGCAGTCCAAATACTGTCACATACACTTCTATAATCCAGAGTCTATGTGAAGAAGGCCACCCTCTGGAGGCATTGAAGGTATTAGACAGAATGGAAGAGTATGGTTATGCTCCAAATCGTGTTGCAGTTAGCTTTTTAGTTAAGGAATTTTGTAAAGATGGCCATGTGGAGGAGGCTTATAAGTTGATTGATAGAGTTGTTGCGAGAGGTGGTGTTTCATATGGTGATTGTTATAGCTCACTTGTGGTAACTTTAGTTAAGATGAAAAAGATTGCAGAGGCAGAGAAGCTATTTAGAAACATGTTAGCCAACGGGGTGAAGCCAGATGGTGTGGCTTGCAGTCTCATGATCAGGGAACTGTGCTTAGAGGAGCGAGTGCTAGATGGTTTTAACTTATGCTACGAAGTCGATAGGAATGGATATTTATGTTCCATTGATGCTGATATTTATTCTCTTCTTTTAGTTGGACTTTGTGAACATGACCACTCTGTGGATGCTGCAAAACTAGCAAGGTTGATGCTTAAAAAGGGAATTCGTTTAAAACCTCACTATGCTGAAAGTATCATCAAACATCTAAAGAAATTTGAAGACCGAGAGTTAGTTATGCATTTGGGCGGAATAAGGAAATAA

Protein sequence

MALFRISCPRSSSFLLNISTLSTFHLNTLSSSDLFYDHLEKSNGNLDKTLATLKTKLDSRCVNEVLYKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINVSPCLLFNVIEDYRREGCLVDIRMFKIILNLCKEAKLAKEALSILRKMSEFHLRADTTMYNLVIRLFTEKGEMDKAMELMKEMDSVDIHPNMITYISMLKGFCDVGRWEDAYGLFKDMKENGCAPNTVVYSVLVNGAIRLRIMDRLMEMLKEMEKQGGTCSPNTVTYTSIIQSLCEEGHPLEALKVLDRMEEYGYAPNRVAVSFLVKEFCKDGHVEEAYKLIDRVVARGGVSYGDCYSSLVVTLVKMKKIAEAEKLFRNMLANGVKPDGVACSLMIRELCLEERVLDGFNLCYEVDRNGYLCSIDADIYSLLLVGLCEHDHSVDAAKLARLMLKKGIRLKPHYAESIIKHLKKFEDRELVMHLGGIRK
BLAST of CsaV3_2G011310 vs. NCBI nr
Match: XP_004139002.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g47360 [Cucumis sativus] >XP_011649081.1 PREDICTED: pentatricopeptide repeat-containing protein At5g47360 [Cucumis sativus] >KGN61448.1 hypothetical protein Csa_2G123590 [Cucumis sativus])

HSP 1 Score: 496.9 bits (1278), Expect = 7.7e-137
Identity = 475/475 (100.00%), Postives = 475/475 (100.00%), Query Frame = 0

Query: 1   MALFRISCPRSSSFLLNISTLSTFHLNTLSSSDLFYDHLEKSNGNLDKTLATLKTKLDSR 60
           MALFRISCPRSSSFLLNISTLSTFHLNTLSSSDLFYDHLEKSNGNLDKTLATLKTKLDSR
Sbjct: 1   MALFRISCPRSSSFLLNISTLSTFHLNTLSSSDLFYDHLEKSNGNLDKTLATLKTKLDSR 60

Query: 61  CVNEVLYKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINVSPCLLFNVIEDY 120
           CVNEVLYKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINVSPCLLFNVIEDY
Sbjct: 61  CVNEVLYKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINVSPCLLFNVIEDY 120

Query: 121 RREGCLVDIRMFKIILNLCKEAKLAKEALSILRKMSEFHLRXXXXXXXXXXXXXXXXXXX 180
           RREGCLVDIRMFKIILNLCKEAKLAKEALSILRKMSEFHLRXXXXXXXXXXXXXXXXXXX
Sbjct: 121 RREGCLVDIRMFKIILNLCKEAKLAKEALSILRKMSEFHLRXXXXXXXXXXXXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXXXXXXXXXXGTCSPNTVTYTSIIQSLCEEGHPLEALKVLDRMEEY 300
           XXXXXXXXXXXXXXXXXXXXXXXXGTCSPNTVTYTSIIQSLCEEGHPLEALKVLDRMEEY
Sbjct: 241 XXXXXXXXXXXXXXXXXXXXXXXXGTCSPNTVTYTSIIQSLCEEGHPLEALKVLDRMEEY 300

Query: 301 GYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           GYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 GYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVLDGFNLCYEVDRNGYLCSIDADIYSLLL 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVLDGFNLCYEVDRNGYLCSIDADIYSLLL
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVLDGFNLCYEVDRNGYLCSIDADIYSLLL 420

Query: 421 VGLCEHDHSVDAAKLARLMLKKGIRLKPHYAESIIKHLKKFEDRELVMHLGGIRK 476
           VGLCEHDHSVDAAKLARLMLKKGIRLKPHYAESIIKHLKKFEDRELVMHLGGIRK
Sbjct: 421 VGLCEHDHSVDAAKLARLMLKKGIRLKPHYAESIIKHLKKFEDRELVMHLGGIRK 475

BLAST of CsaV3_2G011310 vs. NCBI nr
Match: XP_008441677.1 (PREDICTED: pentatricopeptide repeat-containing protein At5g47360 [Cucumis melo] >XP_008441678.1 PREDICTED: pentatricopeptide repeat-containing protein At5g47360 [Cucumis melo])

HSP 1 Score: 440.7 bits (1132), Expect = 6.5e-120
Identity = 427/475 (89.89%), Postives = 442/475 (93.05%), Query Frame = 0

Query: 1   MALFRISCPRSSSFLLNISTLSTFHLNTLSSSDLFYDHLEKSNGNLDKTLATLKTKLDSR 60
           MALFRIS PRSSS LLNISTLSTFHL+TLSSSDLFYDHLEK+NGN++KTLAT+KTKLDSR
Sbjct: 1   MALFRISYPRSSSILLNISTLSTFHLSTLSSSDLFYDHLEKNNGNVEKTLATVKTKLDSR 60

Query: 61  CVNEVLYKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINVSPCLLFNVIEDY 120
           CVNEVLYKCS ELSQMGLRFFIWAGRQPNYRH+SFMYSRACELIGINVSPCLLFNVIEDY
Sbjct: 61  CVNEVLYKCSSELSQMGLRFFIWAGRQPNYRHTSFMYSRACELIGINVSPCLLFNVIEDY 120

Query: 121 RREGCLVDIRMFKIILNLCKEAKLAKEALSILRKMSEFHLRXXXXXXXXXXXXXXXXXXX 180
           RREGCLVDIR+F+IILNLCKEAKL KEALSILRKMSEFHLRXXXXXXXXXXXXXXXXXXX
Sbjct: 121 RREGCLVDIRIFQIILNLCKEAKLTKEALSILRKMSEFHLRXXXXXXXXXXXXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXXXXXXXXXXGTCSPNTVTYTSIIQSLCEEGHPLEALKVLDRMEEY 300
           XXXXX                   GTC PNTVTYTSIIQSLCE+G  LEALKVLDRMEEY
Sbjct: 241 XXXXXRLRIMDKLMEMLEEMEKQGGTCRPNTVTYTSIIQSLCEQGFLLEALKVLDRMEEY 300

Query: 301 GYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           G+ XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 GHAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVLDGFNLCYEVDRNGYLCSIDADIYSLLL 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   GF+LCYEVDRNGYLC IDAD+YSLLL
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGFSLCYEVDRNGYLCYIDADVYSLLL 420

Query: 421 VGLCEHDHSVDAAKLARLMLKKGIRLKPHYAESIIKHLKKFEDRELVMHLGGIRK 476
           VGL +HDHSVDAA LARLMLKKGIRLKPHYAESIIKHLKKFED+EL+MHLGGIRK
Sbjct: 421 VGLYQHDHSVDAAILARLMLKKGIRLKPHYAESIIKHLKKFEDQELIMHLGGIRK 475

BLAST of CsaV3_2G011310 vs. NCBI nr
Match: XP_022928928.1 (pentatricopeptide repeat-containing protein At5g47360 [Cucurbita moschata])

HSP 1 Score: 320.9 bits (821), Expect = 7.5e-84
Identity = 394/475 (82.95%), Postives = 414/475 (87.16%), Query Frame = 0

Query: 1   MALFRISCPRSSSFLLNISTLSTFHLNTLSSSDLFYDHLEKSNGNLDKTLATLKTKLDSR 60
           MALFRI  PR SSF   ISTLST  L+T+SS+DLFYDHL+K NGN++KTLAT+KTKLDSR
Sbjct: 1   MALFRIFYPRPSSFRFKISTLSTLQLSTVSSADLFYDHLQKKNGNVEKTLATVKTKLDSR 60

Query: 61  CVNEVLYKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINVSPCLLFNVIEDY 120
           CVN+VL+KCS ELSQMGLRFFIWAGRQPNYRHSSFMY+RACELIG+N SPCLLFNVIEDY
Sbjct: 61  CVNQVLHKCSLELSQMGLRFFIWAGRQPNYRHSSFMYTRACELIGLNRSPCLLFNVIEDY 120

Query: 121 RREGCLVDIRMFKIILNLCKEAKLAKEALSILRKMSEFHLRXXXXXXXXXXXXXXXXXXX 180
           RREGCLVDI MFK+ILNLCKE KLAKEALSIL KM+EFHLRXXXXXXXXXXXXXXXXXXX
Sbjct: 121 RREGCLVDIGMFKVILNLCKEGKLAKEALSILGKMAEFHLRXXXXXXXXXXXXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXXXXXXXXXXGTCSPNTVTYTSIIQSLCEEGHPLEALKVLDRMEEY 300
           XXXXXXXXXXXXXXXXXXXXXXXXGT                                ++
Sbjct: 241 XXXXXXXXXXXXXXXXXXXXXXXXGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDF 300

Query: 301 GYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
             XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVLDGFNLCYEVDRNGYLCSIDADIYSLLL 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX          VDRNGYL SID+DIYSLLL
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVDRNGYLSSIDSDIYSLLL 420

Query: 421 VGLCEHDHSVDAAKLARLMLKKGIRLKPHYAESIIKHLKKFEDRELVMHLGGIRK 476
           VGLCEHDHSVDAAKLARLML+KGIRLKPHYAESIIKH+KKF D+ LVMHLGGIR+
Sbjct: 421 VGLCEHDHSVDAAKLARLMLQKGIRLKPHYAESIIKHVKKFGDQNLVMHLGGIRE 475

BLAST of CsaV3_2G011310 vs. NCBI nr
Match: XP_023551479.1 (pentatricopeptide repeat-containing protein At5g47360 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 320.5 bits (820), Expect = 9.8e-84
Identity = 393/475 (82.74%), Postives = 415/475 (87.37%), Query Frame = 0

Query: 1   MALFRISCPRSSSFLLNISTLSTFHLNTLSSSDLFYDHLEKSNGNLDKTLATLKTKLDSR 60
           MALFRI  PR SSF   ISTLST  L+T+SS+DLFYDHL+K+NGN++KTL T+KTKLDSR
Sbjct: 1   MALFRIFYPRPSSFRFKISTLSTLQLSTVSSADLFYDHLQKNNGNVEKTLTTVKTKLDSR 60

Query: 61  CVNEVLYKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINVSPCLLFNVIEDY 120
           CVNEVL+KCSFELSQMGLRFFIWAGRQPNYRHSSFMY+RACELIG+N SPCL+FNVIEDY
Sbjct: 61  CVNEVLHKCSFELSQMGLRFFIWAGRQPNYRHSSFMYTRACELIGLNRSPCLVFNVIEDY 120

Query: 121 RREGCLVDIRMFKIILNLCKEAKLAKEALSILRKMSEFHLRXXXXXXXXXXXXXXXXXXX 180
           RREGCLVDI MFK+ILNLCKE KLAKEALSIL +M+EFHLRXXXXXXXXXXXXXXXXXXX
Sbjct: 121 RREGCLVDIGMFKVILNLCKEGKLAKEALSILGEMAEFHLRXXXXXXXXXXXXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXXXXXXXXXXGTCSPNTVTYTSIIQSLCEEGHPLEALKVLDRMEEY 300
           XXXXXXXXXXXXXXXXXXXXXXXXGT                                ++
Sbjct: 241 XXXXXXXXXXXXXXXXXXXXXXXXGTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDF 300

Query: 301 GYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
             XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVLDGFNLCYEVDRNGYLCSIDADIYSLLL 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX          VDRNGYL SID+DIYSLLL
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVDRNGYLSSIDSDIYSLLL 420

Query: 421 VGLCEHDHSVDAAKLARLMLKKGIRLKPHYAESIIKHLKKFEDRELVMHLGGIRK 476
           VGLCEHDHSVDAAKLARLML+KGIRLKPHYAESIIKH+KKF D+ LVMHLGGIR+
Sbjct: 421 VGLCEHDHSVDAAKLARLMLQKGIRLKPHYAESIIKHVKKFGDQNLVMHLGGIRE 475

BLAST of CsaV3_2G011310 vs. NCBI nr
Match: XP_022155853.1 (pentatricopeptide repeat-containing protein At5g47360 [Momordica charantia] >XP_022155854.1 pentatricopeptide repeat-containing protein At5g47360 [Momordica charantia] >XP_022155855.1 pentatricopeptide repeat-containing protein At5g47360 [Momordica charantia] >XP_022155856.1 pentatricopeptide repeat-containing protein At5g47360 [Momordica charantia])

HSP 1 Score: 309.7 bits (792), Expect = 1.7e-80
Identity = 380/475 (80.00%), Postives = 400/475 (84.21%), Query Frame = 0

Query: 1   MALFRISCPRSSSFLLNISTLSTFHLNTLSSSDLFYDHLEKSNGNLDKTLATLKTKLDSR 60
           MALF I   RS SF L IS LS  HL+T+SS+DLFYDHL+K+NGN++K LAT+KT LDSR
Sbjct: 1   MALFGIFSFRSFSFGLKISKLSALHLSTVSSADLFYDHLQKNNGNVEKILATVKTTLDSR 60

Query: 61  CVNEVLYKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINVSPCLLFNVIEDY 120
           CVN+VL+KCSFELS MGLRFFIWAGRQPNYRHSSFMYSRACELIGI+ SPCLL NVIEDY
Sbjct: 61  CVNQVLHKCSFELSLMGLRFFIWAGRQPNYRHSSFMYSRACELIGIDRSPCLLLNVIEDY 120

Query: 121 RREGCLVDIRMFKIILNLCKEAKLAKEALSILRKMSEFHLRXXXXXXXXXXXXXXXXXXX 180
           RREGC+VDIRMFK++LNLCKEAKLA EAL IL KM EFHLR   XXXXXXXXXXXXXXXX
Sbjct: 121 RREGCVVDIRMFKVMLNLCKEAKLANEALLILGKMPEFHLRADTXXXXXXXXXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXXXXXXXXXXGTCSPNTVTYTSIIQSLCEEGHPLEALKVLDRMEEY 300
           XXXXXXXXXXXXXXXXXXXXXXXX                                    
Sbjct: 241 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300

Query: 301 GYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
             XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVLDGFNLCYEVDRNGYLCSIDADIYSLLL 420
           XXXXXXXXXXXXXXXXXXXXXXXX       VLDG+NLC EVDRNGYL SID+DIYSLLL
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXELCLEERVLDGYNLCNEVDRNGYLSSIDSDIYSLLL 420

Query: 421 VGLCEHDHSVDAAKLARLMLKKGIRLKPHYAESIIKHLKKFEDRELVMHLGGIRK 476
           VGLCEHDH +DA KLARLMLKKGIRLKPHYA+ +IKHL KF D+ELVM LGGIRK
Sbjct: 421 VGLCEHDHPMDAEKLARLMLKKGIRLKPHYADHVIKHLNKFGDQELVMQLGGIRK 475

BLAST of CsaV3_2G011310 vs. TAIR10
Match: AT5G47360.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 134.0 bits (336), Expect = 2.4e-31
Identity = 62/135 (45.93%), Postives = 92/135 (68.15%), Query Frame = 0

Query: 26  LNTLSSSDLFYDHLEKSNGNLDKTLATLKTKLDSRCVNEVLYKCSFELSQMGLRFFIWAG 85
           L T+S+++  Y  L+    NL+K LA+   +LDS C+NEVL +C     Q GLRFFIWAG
Sbjct: 27  LTTVSAAERLYGQLQGCTSNLEKELASANVQLDSSCINEVLRRCDPNQFQSGLRFFIWAG 86

Query: 86  RQPNYRHSSFMYSRACELIGINVSPCLLFNVIEDYRREGCLVDIRMFKIILNLCKEAKLA 145
              ++RHS++MY++AC+++ I   P L+  VIE YR+E C V+++  +I+L LC +A LA
Sbjct: 87  TLSSHRHSAYMYTKACDILKIRAKPDLIKYVIESYRKEECFVNVKTMRIVLTLCNQANLA 146

Query: 146 KEALSILRKMSEFHL 161
            EAL +LRK  EF++
Sbjct: 147 DEALWVLRKFPEFNV 161

BLAST of CsaV3_2G011310 vs. TAIR10
Match: AT5G65820.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 54.7 bits (130), Expect = 1.8e-07
Identity = 27/98 (27.55%), Postives = 54/98 (55.10%), Query Frame = 0

Query: 62  VNEVLYKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINVSPCLLFNVIEDYR 121
           +  VL +C  +   +G RFF+WA +QP Y HS  +Y    +++        ++ +IE+ R
Sbjct: 116 IERVLNRCG-DAGNLGYRFFVWAAKQPRYCHSIEVYKSMVKILSKMRQFGAVWGLIEEMR 175

Query: 122 REG-CLVDIRMFKIILNLCKEAKLAKEALSILRKMSEF 159
           +E   L++  +F +++     A + K+A+ +L +M +F
Sbjct: 176 KENPQLIEPELFVVLVQRFASADMVKKAIEVLDEMPKF 212

BLAST of CsaV3_2G011310 vs. TAIR10
Match: AT1G52640.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 50.8 bits (120), Expect = 2.6e-06
Identity = 45/159 (28.30%), Postives = 73/159 (45.91%), Query Frame = 0

Query: 9   PRSSSFLLNISTL-----STFHLNTLSSSDLFYDHLEKSNGNLDKTLATLKTKLDSRCVN 68
           P+S SF +  STL     S   +N +S   +  DH      +L+ TL     ++ S  V 
Sbjct: 17  PKSQSFRI-FSTLLHDPPSPDLVNEISR--VLSDH-RNPKDDLEHTLVAYSPRVSSNLVE 76

Query: 69  EVLYKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINVSPCLLFNVIEDYRRE 128
           +VL +C   L     RFF+WA R P++ HS   Y    E++G +    LL++ + + R  
Sbjct: 77  QVLKRCK-NLGFPAHRFFLWARRIPDFAHSLESYHILVEILGSSKQFALLWDFLIEAREY 136

Query: 129 GCL-VDIRMFKIILNLCKEAKLAKEALSILRKMSEFHLR 162
               +  ++F I+      A L  EA     +M EF ++
Sbjct: 137 NYFEISSKVFWIVFRAYSRANLPSEACRAFNRMVEFGIK 170

BLAST of CsaV3_2G011310 vs. TAIR10
Match: AT3G49730.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 45.4 bits (106), Expect = 1.1e-04
Identity = 29/114 (25.44%), Postives = 59/114 (51.75%), Query Frame = 0

Query: 50  LATLKTKLDSR--CVNEVLYKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGIN 109
           LA  ++ +D R   +  VL +C  +   +G RFF+WA +QP Y HS  +      ++   
Sbjct: 86  LALNESGIDLRPGLIIRVLSRCG-DAGNLGYRFFLWATKQPGYFHSYEVCKSMVMILSKM 145

Query: 110 VSPCLLFNVIEDYRREGC-LVDIRMFKIILNLCKEAKLAKEALSILRKMSEFHL 161
                ++ +IE+ R+    L++  +F +++     A + K+A+ +L +M ++ L
Sbjct: 146 RQFGAVWGLIEEMRKTNPELIEPELFVVLMRRFASANMVKKAVEVLDEMPKYGL 198

BLAST of CsaV3_2G011310 vs. Swiss-Prot
Match: sp|Q9LVS3|PP422_ARATH (Pentatricopeptide repeat-containing protein At5g47360 OS=Arabidopsis thaliana OX=3702 GN=At5g47360 PE=2 SV=1)

HSP 1 Score: 134.0 bits (336), Expect = 4.3e-30
Identity = 62/135 (45.93%), Postives = 92/135 (68.15%), Query Frame = 0

Query: 26  LNTLSSSDLFYDHLEKSNGNLDKTLATLKTKLDSRCVNEVLYKCSFELSQMGLRFFIWAG 85
           L T+S+++  Y  L+    NL+K LA+   +LDS C+NEVL +C     Q GLRFFIWAG
Sbjct: 27  LTTVSAAERLYGQLQGCTSNLEKELASANVQLDSSCINEVLRRCDPNQFQSGLRFFIWAG 86

Query: 86  RQPNYRHSSFMYSRACELIGINVSPCLLFNVIEDYRREGCLVDIRMFKIILNLCKEAKLA 145
              ++RHS++MY++AC+++ I   P L+  VIE YR+E C V+++  +I+L LC +A LA
Sbjct: 87  TLSSHRHSAYMYTKACDILKIRAKPDLIKYVIESYRKEECFVNVKTMRIVLTLCNQANLA 146

Query: 146 KEALSILRKMSEFHL 161
            EAL +LRK  EF++
Sbjct: 147 DEALWVLRKFPEFNV 161

BLAST of CsaV3_2G011310 vs. Swiss-Prot
Match: sp|Q9FH87|PP447_ARATH (Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis thaliana OX=3702 GN=At5g65820 PE=3 SV=1)

HSP 1 Score: 54.7 bits (130), Expect = 3.3e-06
Identity = 27/98 (27.55%), Postives = 54/98 (55.10%), Query Frame = 0

Query: 62  VNEVLYKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINVSPCLLFNVIEDYR 121
           +  VL +C  +   +G RFF+WA +QP Y HS  +Y    +++        ++ +IE+ R
Sbjct: 116 IERVLNRCG-DAGNLGYRFFVWAAKQPRYCHSIEVYKSMVKILSKMRQFGAVWGLIEEMR 175

Query: 122 REG-CLVDIRMFKIILNLCKEAKLAKEALSILRKMSEF 159
           +E   L++  +F +++     A + K+A+ +L +M +F
Sbjct: 176 KENPQLIEPELFVVLVQRFASADMVKKAIEVLDEMPKF 212

BLAST of CsaV3_2G011310 vs. Swiss-Prot
Match: sp|Q9SSR6|PPR78_ARATH (Pentatricopeptide repeat-containing protein At1g52640, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At1g52640 PE=2 SV=1)

HSP 1 Score: 50.8 bits (120), Expect = 4.8e-05
Identity = 45/159 (28.30%), Postives = 73/159 (45.91%), Query Frame = 0

Query: 9   PRSSSFLLNISTL-----STFHLNTLSSSDLFYDHLEKSNGNLDKTLATLKTKLDSRCVN 68
           P+S SF +  STL     S   +N +S   +  DH      +L+ TL     ++ S  V 
Sbjct: 17  PKSQSFRI-FSTLLHDPPSPDLVNEISR--VLSDH-RNPKDDLEHTLVAYSPRVSSNLVE 76

Query: 69  EVLYKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINVSPCLLFNVIEDYRRE 128
           +VL +C   L     RFF+WA R P++ HS   Y    E++G +    LL++ + + R  
Sbjct: 77  QVLKRCK-NLGFPAHRFFLWARRIPDFAHSLESYHILVEILGSSKQFALLWDFLIEAREY 136

Query: 129 GCL-VDIRMFKIILNLCKEAKLAKEALSILRKMSEFHLR 162
               +  ++F I+      A L  EA     +M EF ++
Sbjct: 137 NYFEISSKVFWIVFRAYSRANLPSEACRAFNRMVEFGIK 170

BLAST of CsaV3_2G011310 vs. TrEMBL
Match: tr|A0A0A0LI44|A0A0A0LI44_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G123590 PE=4 SV=1)

HSP 1 Score: 496.9 bits (1278), Expect = 5.1e-137
Identity = 475/475 (100.00%), Postives = 475/475 (100.00%), Query Frame = 0

Query: 1   MALFRISCPRSSSFLLNISTLSTFHLNTLSSSDLFYDHLEKSNGNLDKTLATLKTKLDSR 60
           MALFRISCPRSSSFLLNISTLSTFHLNTLSSSDLFYDHLEKSNGNLDKTLATLKTKLDSR
Sbjct: 1   MALFRISCPRSSSFLLNISTLSTFHLNTLSSSDLFYDHLEKSNGNLDKTLATLKTKLDSR 60

Query: 61  CVNEVLYKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINVSPCLLFNVIEDY 120
           CVNEVLYKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINVSPCLLFNVIEDY
Sbjct: 61  CVNEVLYKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINVSPCLLFNVIEDY 120

Query: 121 RREGCLVDIRMFKIILNLCKEAKLAKEALSILRKMSEFHLRXXXXXXXXXXXXXXXXXXX 180
           RREGCLVDIRMFKIILNLCKEAKLAKEALSILRKMSEFHLRXXXXXXXXXXXXXXXXXXX
Sbjct: 121 RREGCLVDIRMFKIILNLCKEAKLAKEALSILRKMSEFHLRXXXXXXXXXXXXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXXXXXXXXXXGTCSPNTVTYTSIIQSLCEEGHPLEALKVLDRMEEY 300
           XXXXXXXXXXXXXXXXXXXXXXXXGTCSPNTVTYTSIIQSLCEEGHPLEALKVLDRMEEY
Sbjct: 241 XXXXXXXXXXXXXXXXXXXXXXXXGTCSPNTVTYTSIIQSLCEEGHPLEALKVLDRMEEY 300

Query: 301 GYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           GYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 GYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVLDGFNLCYEVDRNGYLCSIDADIYSLLL 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVLDGFNLCYEVDRNGYLCSIDADIYSLLL
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVLDGFNLCYEVDRNGYLCSIDADIYSLLL 420

Query: 421 VGLCEHDHSVDAAKLARLMLKKGIRLKPHYAESIIKHLKKFEDRELVMHLGGIRK 476
           VGLCEHDHSVDAAKLARLMLKKGIRLKPHYAESIIKHLKKFEDRELVMHLGGIRK
Sbjct: 421 VGLCEHDHSVDAAKLARLMLKKGIRLKPHYAESIIKHLKKFEDRELVMHLGGIRK 475

BLAST of CsaV3_2G011310 vs. TrEMBL
Match: tr|A0A1S3B4L9|A0A1S3B4L9_CUCME (pentatricopeptide repeat-containing protein At5g47360 OS=Cucumis melo OX=3656 GN=LOC103485755 PE=4 SV=1)

HSP 1 Score: 440.7 bits (1132), Expect = 4.3e-120
Identity = 427/475 (89.89%), Postives = 442/475 (93.05%), Query Frame = 0

Query: 1   MALFRISCPRSSSFLLNISTLSTFHLNTLSSSDLFYDHLEKSNGNLDKTLATLKTKLDSR 60
           MALFRIS PRSSS LLNISTLSTFHL+TLSSSDLFYDHLEK+NGN++KTLAT+KTKLDSR
Sbjct: 1   MALFRISYPRSSSILLNISTLSTFHLSTLSSSDLFYDHLEKNNGNVEKTLATVKTKLDSR 60

Query: 61  CVNEVLYKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINVSPCLLFNVIEDY 120
           CVNEVLYKCS ELSQMGLRFFIWAGRQPNYRH+SFMYSRACELIGINVSPCLLFNVIEDY
Sbjct: 61  CVNEVLYKCSSELSQMGLRFFIWAGRQPNYRHTSFMYSRACELIGINVSPCLLFNVIEDY 120

Query: 121 RREGCLVDIRMFKIILNLCKEAKLAKEALSILRKMSEFHLRXXXXXXXXXXXXXXXXXXX 180
           RREGCLVDIR+F+IILNLCKEAKL KEALSILRKMSEFHLRXXXXXXXXXXXXXXXXXXX
Sbjct: 121 RREGCLVDIRIFQIILNLCKEAKLTKEALSILRKMSEFHLRXXXXXXXXXXXXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXXXXXXXXXXGTCSPNTVTYTSIIQSLCEEGHPLEALKVLDRMEEY 300
           XXXXX                   GTC PNTVTYTSIIQSLCE+G  LEALKVLDRMEEY
Sbjct: 241 XXXXXRLRIMDKLMEMLEEMEKQGGTCRPNTVTYTSIIQSLCEQGFLLEALKVLDRMEEY 300

Query: 301 GYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           G+ XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 GHAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVLDGFNLCYEVDRNGYLCSIDADIYSLLL 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   GF+LCYEVDRNGYLC IDAD+YSLLL
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGFSLCYEVDRNGYLCYIDADVYSLLL 420

Query: 421 VGLCEHDHSVDAAKLARLMLKKGIRLKPHYAESIIKHLKKFEDRELVMHLGGIRK 476
           VGL +HDHSVDAA LARLMLKKGIRLKPHYAESIIKHLKKFED+EL+MHLGGIRK
Sbjct: 421 VGLYQHDHSVDAAILARLMLKKGIRLKPHYAESIIKHLKKFEDQELIMHLGGIRK 475

BLAST of CsaV3_2G011310 vs. TrEMBL
Match: tr|A0A067KUM2|A0A067KUM2_JATCU (Uncharacterized protein OS=Jatropha curcas OX=180498 GN=JCGZ_03442 PE=4 SV=1)

HSP 1 Score: 229.6 bits (584), Expect = 1.5e-56
Identity = 333/474 (70.25%), Postives = 372/474 (78.48%), Query Frame = 0

Query: 1   MALFRISCPRSSSFLLNISTLSTFHLNTLSSSDLFYDHLEKSNGNLDKTLATLKTKLDSR 60
           M++  +S   S S     S  S  H  T S SD  Y HL+K+  N ++ L ++K KLDS 
Sbjct: 1   MSISSLSRFVSLSITPQTSKFSMSHFTT-SLSDALYTHLQKNPNNTERALNSIKPKLDSI 60

Query: 61  CVNEVLYKCSFE-LSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINVSPCLLFNVIED 120
           CVNEVL KCS +   Q+GLRFFIWAG Q NYRHSSFMYSRAC+L  I  +P ++ N+IE 
Sbjct: 61  CVNEVLDKCSLDSYFQIGLRFFIWAGYQSNYRHSSFMYSRACQLFKIKQNPQVVLNLIEA 120

Query: 121 YRREGCLVDIRMFKIILNLCKEAKLAKEALSILRKMSEFHLRXXXXXXXXXXXXXXXXXX 180
           YR E C+V ++ FKI+LNLCKE +LA EAL +LRKM EF LR      XXXXXXXXXXXX
Sbjct: 121 YRAEKCVVSVKTFKIVLNLCKEGRLANEALLVLRKMPEFDLRADTNVYXXXXXXXXXXXX 180

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 XXXXXXXXXXXXXXXXXXXXXXXXXGTCSPNTVTYTSIIQSLCEEGHPLEALKVLDRMEE 300
           XXXXXXXXXXXXXXXXXXXXX    G CSPN +TYTS+IQ+LCE+G  L+A  +LDRME 
Sbjct: 241 XXXXXXXXXXXXXXXXXXXXXEKDGGDCSPNLLTYTSVIQNLCEKGGSLDAFAILDRMEA 300

Query: 301 YGYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           +  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 FXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVLDGFNLCYEVDRNGYLCSIDADIYSLL 420
           XXXXXXXXXXXXXXXXXXXXXXX         VLDG+ L  E+++ G L SID+DIYS+L
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXIRELCLENRVLDGYCLYDEIEKIGCLSSIDSDIYSVL 420

Query: 421 LVGLCEHDHSVDAAKLARLMLKKGIRLKPHYAESIIKHLKKFEDRELVMHLGGI 474
           LVGLC+  HS++AAKLAR ML+KGIRLKP Y   I  HLKKF D EL   L  I
Sbjct: 421 LVGLCQQSHSMEAAKLARSMLEKGIRLKPPYVNKIADHLKKFGDMELFTRLSSI 473

BLAST of CsaV3_2G011310 vs. TrEMBL
Match: tr|A0A2P5FIU2|A0A2P5FIU2_9ROSA (Pentatricopeptide repeat OS=Trema orientalis OX=63057 GN=TorRG33x02_066350 PE=4 SV=1)

HSP 1 Score: 183.7 bits (465), Expect = 9.5e-43
Identity = 274/473 (57.93%), Postives = 311/473 (65.75%), Query Frame = 0

Query: 1   MALFRISCPRSSSFLLNISTLSTFHLNTLSSSDLFYDHLEKSNGNLDKTLATLKTKLDSR 60
           MAL  IS   SSS  L     ST      SS+D+F++HL+K+ GN++KTL T+K ++DS+
Sbjct: 25  MALCSISRLLSSSIRLQNPKFSTVRSTAASSADIFFNHLQKNGGNIEKTLVTVKAQVDSK 84

Query: 61  CVNEVLYKCSFELSQMGLRFFIWAGRQPNYRHSSFMYSRACELIGINVSPCLLFNVIEDY 120
           CV+ VLY+C    SQMGLRFFIWAG Q +YRH+S+MYS+AC    I  +P LL++VIE Y
Sbjct: 85  CVSGVLYRCYPSQSQMGLRFFIWAGLQSDYRHTSYMYSKACNFYKITQNPKLLYDVIEAY 144

Query: 121 RREGCLVDIRMFKIILNLCKEAKLAKEALSILRKMSEFHLRXXXXXXXXXXXXXXXXXXX 180
           R E C V ++ FK+ILNL KEAKLA EAL +LRKM EF LR      XXXXXXXXXXXXX
Sbjct: 145 RAERCSVTVKTFKVILNLYKEAKLADEALWVLRKMPEFGLRADTTMYXXXXXXXXXXXXX 204

Query: 181 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 240
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 205 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 264

Query: 241 XXXXXXXXXXXXXXXXXXXXXXXXGTCSPNTVTYTSIIQSLCEEGHPLEALKVLDRMEEY 300
           XXXXXXXXXXXXXXXXXXXXXXXXG+                                 Y
Sbjct: 265 XXXXXXXXXXXXXXXXXXXXXXXXGSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXY 324

Query: 301 GYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
             XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX                        
Sbjct: 325 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGVSYGECYSSLVVCLKRSRNTEEA 384

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVLDGFNLCYEVDRNGYLCSIDADIYSLLL 420
                                          VLDG+ L  E+++ GYL SID+D+YSLLL
Sbjct: 385 EKVFRKMLTSGLKPDGLACSIMIKELCLVGRVLDGYQLFDEIEKIGYLISIDSDVYSLLL 444

Query: 421 VGLCEHDHSVDAAKLARLMLKKGIRLKPHYAESIIKHLKKFEDRELVMHLGGI 474
           VGLCE  HSV+A  LARLMLKK IRLK  Y++SI + LKK  + ELV HL  I
Sbjct: 445 VGLCEQSHSVEAKTLARLMLKKRIRLKAPYSDSIGEILKKSGEEELVNHLTAI 497

BLAST of CsaV3_2G011310 vs. TrEMBL
Match: tr|A0A061G4E0|A0A061G4E0_THECC (Tetratricopeptide repeat-like superfamily protein, putative isoform 1 OS=Theobroma cacao OX=3641 GN=TCM_015917 PE=4 SV=1)

HSP 1 Score: 183.0 bits (463), Expect = 1.6e-42
Identity = 312/451 (69.18%), Postives = 348/451 (77.16%), Query Frame = 0

Query: 23  TFHLNTLSSSDLFYDHLEKSNGNLDKTLATLKTKLDSRCVNEVLYKCSFELSQMGLRFFI 82
           TF  +T SS+D F+ HL+K   N++KTLA + +KLDS CV EVL +C F+ SQMGLRFFI
Sbjct: 23  TFLFSTASSADKFFTHLQKKQSNIEKTLALVNSKLDSNCVCEVLERCCFDKSQMGLRFFI 82

Query: 83  WAGRQPNYRHSSFMYSRACELIGINVSPCLLFNVIEDYRREGCLVDIRMFKIILNLCKEA 142
           WAG Q NYRHSS+MYS+ACE + I  +P L+ +VIE Y+ E CLV+++MFK++LNLC+EA
Sbjct: 83  WAGLQSNYRHSSYMYSKACEFLKIKQNPFLVLDVIEAYKVEKCLVNVKMFKVVLNLCREA 142

Query: 143 KLAKEALSILRKMSEFHLRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 202
           ++  EAL +LRKM EF+LR    XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 143 RITDEALLVLRKMPEFNLRPDTTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 202

Query: 203 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 262
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  
Sbjct: 203 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEK 262

Query: 263 XXGTCSPNTVTYTSIIQSLCEEGHPLEALKVLDRMEEYGYXXXXXXXXXXXXXXXXXXXX 322
                                                   XXXXXXXXXXXXXXXXXXXX
Sbjct: 263 EGDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGTCXXXXXXXXXXXXXXXXXXXXXX 322

Query: 323 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 382
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 323 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 382

Query: 383 XXXXXXXXXVLDGFNLCYEVDRNGYLCSIDADIYSLLLVGLCEHDHSVDAAKLARLMLKK 442
           XXX      VLDGF L  E++R  YL SIDADIYS+LLVGLC   HSV+AAKLAR ML+K
Sbjct: 383 XXXICQEGRVLDGFYLYEEIERMRYLSSIDADIYSILLVGLCRQSHSVEAAKLARSMLEK 442

Query: 443 GIRLKPHYAESIIKHLKKFEDRELVMHLGGI 474
            IRLK  Y + II+HLK   D++LV  LG I
Sbjct: 443 RIRLKAPYVDKIIEHLKNCGDKQLVTELGRI 473

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004139002.17.7e-137100.00PREDICTED: pentatricopeptide repeat-containing protein At5g47360 [Cucumis sativu... [more]
XP_008441677.16.5e-12089.89PREDICTED: pentatricopeptide repeat-containing protein At5g47360 [Cucumis melo] ... [more]
XP_022928928.17.5e-8482.95pentatricopeptide repeat-containing protein At5g47360 [Cucurbita moschata][more]
XP_023551479.19.8e-8482.74pentatricopeptide repeat-containing protein At5g47360 [Cucurbita pepo subsp. pep... [more]
XP_022155853.11.7e-8080.00pentatricopeptide repeat-containing protein At5g47360 [Momordica charantia] >XP_... [more]
Match NameE-valueIdentityDescription
AT5G47360.12.4e-3145.93Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G65820.11.8e-0727.55Pentatricopeptide repeat (PPR) superfamily protein[more]
AT1G52640.12.6e-0628.30Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G49730.11.1e-0425.44Tetratricopeptide repeat (TPR)-like superfamily protein[more]
Match NameE-valueIdentityDescription
sp|Q9LVS3|PP422_ARATH4.3e-3045.93Pentatricopeptide repeat-containing protein At5g47360 OS=Arabidopsis thaliana OX... [more]
sp|Q9FH87|PP447_ARATH3.3e-0627.55Putative pentatricopeptide repeat-containing protein At5g65820 OS=Arabidopsis th... [more]
sp|Q9SSR6|PPR78_ARATH4.8e-0528.30Pentatricopeptide repeat-containing protein At1g52640, mitochondrial OS=Arabidop... [more]
Match NameE-valueIdentityDescription
tr|A0A0A0LI44|A0A0A0LI44_CUCSA5.1e-137100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G123590 PE=4 SV=1[more]
tr|A0A1S3B4L9|A0A1S3B4L9_CUCME4.3e-12089.89pentatricopeptide repeat-containing protein At5g47360 OS=Cucumis melo OX=3656 GN... [more]
tr|A0A067KUM2|A0A067KUM2_JATCU1.5e-5670.25Uncharacterized protein OS=Jatropha curcas OX=180498 GN=JCGZ_03442 PE=4 SV=1[more]
tr|A0A2P5FIU2|A0A2P5FIU2_9ROSA9.5e-4357.93Pentatricopeptide repeat OS=Trema orientalis OX=63057 GN=TorRG33x02_066350 PE=4 ... [more]
tr|A0A061G4E0|A0A061G4E0_THECC1.6e-4269.18Tetratricopeptide repeat-like superfamily protein, putative isoform 1 OS=Theobro... [more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR011990TPR-like_helical_dom_sf
IPR002885Pentatricopeptide_repeat
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_2G011310.1CsaV3_2G011310.1mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 343..372
e-value: 0.038
score: 14.1
coord: 166..192
e-value: 1.0E-4
score: 22.2
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 200..234
e-value: 2.5E-10
score: 37.9
coord: 166..198
e-value: 7.1E-6
score: 23.9
coord: 272..305
e-value: 1.8E-8
score: 32.1
coord: 343..375
e-value: 7.4E-6
score: 23.8
IPR002885Pentatricopeptide repeatPFAMPF13041PPR_2coord: 269..318
e-value: 3.8E-12
score: 46.1
coord: 197..243
e-value: 1.3E-14
score: 54.0
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 412..446
score: 8.681
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 128..162
score: 7.498
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 233..267
score: 8.32
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 305..339
score: 8.495
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 375..409
score: 6.007
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 270..304
score: 12.792
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 198..232
score: 13.68
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 340..374
score: 9.591
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 163..197
score: 11.016
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 37..213
e-value: 1.0E-18
score: 69.3
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 344..471
e-value: 4.5E-16
score: 61.3
coord: 214..343
e-value: 6.2E-33
score: 116.6
NoneNo IPR availablePANTHERPTHR24015:SF719SUBFAMILY NOT NAMEDcoord: 1..472
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 1..472

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
CsaV3_2G011310Cla002019Watermelon (97103) v1cucwmB189
The following gene(s) are paralogous to this gene:

None