CsaV3_4G037400 (gene) Cucumber (Chinese Long) v3

NameCsaV3_4G037400
Typegene
OrganismCucumis sativus (Cucumber (Chinese Long) v3)
DescriptionPentatricopeptide repeat-containing protein
Locationchr4 : 26214951 .. 26218353 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
CGAAAAGTTAATATAAATCGTAAATTTGTTAATATATTTACGAATTTTGGTCGATTCCGGTGATAAACTTTATTCAACGGCGGTACTGAACCCGCCGTCATCACCGTTAAGCAGTTCACACCAGGCGAAAGTCCAATCGTCGTTTTGCCTCACCTCCTTGAGCCACTCCTATACTTATCTATCTGGCTTCCAGCGCCGATAAATCGATACCTTCTATGGCATTCGGCGCTTCCCGGCGTCTTATTCCCTATCAACTCAGAGCCTGCTTTTTGGGGCTTATTGCCAGTGGCAGGTATCACTATCCCTTAATCCACTCGCCGTCGCCGGCTTTATCATACTTGTTTTCAACCCTAGATGAACCATCAAATCTATTTGATGATGGTCTTTCGGGTAATGGGGATCGAAATCAACGCTGCATAGACGAGCGATTCGTTATCAGTGAACTTTCTGATCTTCTACTAGTTAATCCTTATGGTTCGGTTTATAACACTCTCAAAGAGAATTCCATTGAGAAACAGATGCCAGTTAGGGCAGTTGATGGATTCTTGCTTCCAGAAGAGAAATTGCGAGGTGTTTTCCTTCAAAAACTGAATGGTAAAACCGCAATTGAGCATGCATTAGCTAATACTGATGTGATTTTGAGTCAAGATGTTGTCAGCAAAGTATTAAACACTGGGAGTTTAGGTAGCGAAGCAATGGTTACCTTCTTTTATTGGGCTATTAAACAGCCGTCGATACCTAAAGATGCTTCTAGTTACAACATAATTCTTAAAGCTTTAGGTAGAAGGGGTTTTTTTGACTCCATGATGGATGTTTTGTACAACATGACACGGGAGGGAGTGGAGGCTACATTGGAAATGGTCTCCATTGTAGTAGACAGTCTGGTCAAGGGTCACCAAGTTTCTAAGGCACTTCAATTTTTCAGAAACTTGAAAGAAATTGGGTTGAAATGTGATACTGAAACCTTGAACATTCTTCTACAATGCATGTGTCGACGATCCCACGTTGGTGCTGCAAACTCCTTCTTTAATTTAACCAAGGGGAATATCCCTTTCAATGTCATGACATATAACATTGTAATTGGTGGATGGTCAAGATACGGTAGGCACGGTGAAGTTGAGCAAATGTTGAAAGCAATGGAACTTGATGGATTTTCTCCAGACTGTCTGACCCACACCTATCTTATTGAGTGTCTTGGCAGAGCTAATCAGATTGATGATGCTGTCAAGATTTTTGATAAAATGGATGAAAACGGCTGTACACCAGATGTTGATGCTTACAATGCAATGATCTCCAACTTTATATGTATAGGTGATTTTGATCAATGCCTGACCTATTACGAGCGTATGTTGAGCAACAGATGTGAACCTGACATGAACACCTATTCGAATTTGATTACTGGCTTTCTCAAGGCCAAGAAAGTAGCCGATGCACTAGAAATGTTTGATGAAATGGTGGCAAGAATAATTCCCACTACGGGGGCAATAACATCCTTTATTCAACTTAGCTGTAGTTATGGTCCTCCACACGCAGCTATGTTAATCTACAAGAAAGCAAGAAAAGTTGGATGTAGGATATCCAAGAATGCATACAAATTGTTGCTAATGCGGCTTTCTTTGTTTGGTAAATTTGGCATGCTATTAAATATATGGAATGAGATGCAAGAAAGTGGTTATGATCCTGATGTGGAGACTTATGAGCATGCCATTGACTGTCTCTGTAAAACAGGGCAGCTTGAAAATGCTGTACTCGTCATGGAGGAATGTTTACGTCAGGGTTTCTTCCCAAGTAGGCGAACACGTAGTAAGCTTAATAACAAACTATTGGCCTGTAATAGGACAGAGATGGCATATAAACTCTGGTTGAAAATCAAAGTTGCTCGTCATCAGGAAAATCTGCAAAGATGTTGGCGTGCCAAGGGATGGCATTATTGAACTTTCAGGTCATATCAATTACAAACATGTCTTTTTCTTCTATTGTTTCGGTCCTTTGAAGGTTAGTAATATTTCACAGAAGTCATTCTACTAATGTGTGCAGGTCGTACTTAGGATGTTTTATAGTTGTTGACTGCTTACTTTTTTTTTTTGTTACATATACGAGAGTGTCTATGAAATTTCTCGGGCTTCCATTAAGGTAAATCTACAAAAAAGTGGACTTGGGCTTTGTTTATTGAATCTGGATTGCAAAATGGAAGAATTCTTATCTTTACATCACATTTATTCTATTTGCATTGACTTGAGTCCTGTGAGCCCTGTTTCTTATGTTTTGTTCGTCGGGATTCTCCGACTCTTTTTATCGAAATGAAGGATGAAAGTTAGTTAAATGGGGTGCAGGGAAGTTTTTTTCACTGGGTTTCTTATGTTACTATTCTATTTGCTCTGTTTCCTTGTTTATTACGTTTCGTTTATCAAGGTTTTCATTTATTCAATTAGTTCTGACATTGCAGGGCCGTTGTTAATGCATATGGAGCTTCATCTGATGCCAACGGCCACATCTCGGGTGACATGTCAGACCACTTTGGTCATTCATTTAGTTACGGATGTACACGCCTTTGATAAGAATGTTTTTCTACCTCAAGTTGACCACAACCGCAACATAAAGGACTGATGTGCTGAGGTGCCCACCTTAAGCTTTCTTAGTAGGATTTAAACCATGTTTCAAATATGGATCAACCTACTTGATTATATGCTTCTCAATCCTAGCCAATGCCAGCCACCCATGAATCAATGAGGCCTAGGTATGTACTGGCAAGTGGTAATGAACATAACTAGAATGACGGCATTCTTGTCAAGATGTATCTGGATATTTTCTAGTGAACCCAACCCAGCACATTGTCACGTAAAATCATTTCTAGTAGGTATGTACATACGTTAGAGTAGCTTCAGAATCATTTTAATGTAGAAATAAGCACTAGCATCCATCTTTGAACTGCAAGTTTTTCCCACTTTTTTCAGGTTTCTCTATCAAGTGGACCATACCCATTGAAAGATATGGGCAATAACTTAGTTTGAAGAATAACCATCCAAGGCAGGGATTAATAACTTGAGTAAAAGCACTTTCCACTGAAAGTACGAGTAGAGTCTTCAAGTAAAGCTAAGTCAGAAGCTGTCGGCGAGGTTCACATCTGCAAGAAAAAAGAGAAGAGAGCAAGTAGACAGAGAAGAAGACAATGTCCACAATCATTAATTTAACCCAGAGGAGCACCGGAACCATCTAATTTTCTTGTAAATTGTTGCGCAAAAATAAAATTTCAAGTTTGCTTATATAAATAAATAAACAACAATGTTATCCATCAACAACGTCGTTATTGAAACAGATGATTTTCGCAAGGATTGTTTTTATTCTTTTTTCTTTTATGAGTGAATCCTTTTTTGCTTTCTATTTGATTGAAGAATGTATTTCT

mRNA sequence

ATGGCATTCGGCGCTTCCCGGCGTCTTATTCCCTATCAACTCAGAGCCTGCTTTTTGGGGCTTATTGCCAGTGGCAGGTATCACTATCCCTTAATCCACTCGCCGTCGCCGGCTTTATCATACTTGTTTTCAACCCTAGATGAACCATCAAATCTATTTGATGATGGTCTTTCGGGTAATGGGGATCGAAATCAACGCTGCATAGACGAGCGATTCGTTATCAGTGAACTTTCTGATCTTCTACTAGTTAATCCTTATGGTTCGGTTTATAACACTCTCAAAGAGAATTCCATTGAGAAACAGATGCCAGTTAGGGCAGTTGATGGATTCTTGCTTCCAGAAGAGAAATTGCGAGGTGTTTTCCTTCAAAAACTGAATGGTAAAACCGCAATTGAGCATGCATTAGCTAATACTGATGTGATTTTGAGTCAAGATGTTGTCAGCAAAGTATTAAACACTGGGAGTTTAGGTAGCGAAGCAATGGTTACCTTCTTTTATTGGGCTATTAAACAGCCGTCGATACCTAAAGATGCTTCTAGTTACAACATAATTCTTAAAGCTTTAGGTAGAAGGGGTTTTTTTGACTCCATGATGGATGTTTTGTACAACATGACACGGGAGGGAGTGGAGGCTACATTGGAAATGGTCTCCATTGTAGTAGACAGTCTGGTCAAGGGTCACCAAGTTTCTAAGGCACTTCAATTTTTCAGAAACTTGAAAGAAATTGGGTTGAAATGTGATACTGAAACCTTGAACATTCTTCTACAATGCATGTGTCGACGATCCCACGTTGGTGCTGCAAACTCCTTCTTTAATTTAACCAAGGGGAATATCCCTTTCAATGTCATGACATATAACATTGTAATTGGTGGATGGTCAAGATACGGTAGGCACGGTGAAGTTGAGCAAATGTTGAAAGCAATGGAACTTGATGGATTTTCTCCAGACTGTCTGACCCACACCTATCTTATTGAGTGTCTTGGCAGAGCTAATCAGATTGATGATGCTGTCAAGATTTTTGATAAAATGGATGAAAACGGCTGTACACCAGATGTTGATGCTTACAATGCAATGATCTCCAACTTTATATGTATAGGTGATTTTGATCAATGCCTGACCTATTACGAGCGTATGTTGAGCAACAGATGTGAACCTGACATGAACACCTATTCGAATTTGATTACTGGCTTTCTCAAGGCCAAGAAAGTAGCCGATGCACTAGAAATGTTTGATGAAATGGTGGCAAGAATAATTCCCACTACGGGGGCAATAACATCCTTTATTCAACTTAGCTGTAGTTATGGTCCTCCACACGCAGCTATGTTAATCTACAAGAAAGCAAGAAAAGTTGGATGTAGGATATCCAAGAATGCATACAAATTGTTGCTAATGCGGCTTTCTTTGTTTGGTAAATTTGGCATGCTATTAAATATATGGAATGAGATGCAAGAAAGTGGTTATGATCCTGATGTGGAGACTTATGAGCATGCCATTGACTGTCTCTGTAAAACAGGGCAGCTTGAAAATGCTGTACTCGTCATGGAGGAATGTTTACGTCAGGGTTTCTTCCCAAGTAGGCGAACACGTAGTAAGCTTAATAACAAACTATTGGCCTGTAATAGGACAGAGATGGCATATAAACTCTGGTTGAAAATCAAAGTTGCTCGTCATCAGGAAAATCTGCAAAGATGTTGGCGTGCCAAGGGATGGCATTATTGA

Coding sequence (CDS)

ATGGCATTCGGCGCTTCCCGGCGTCTTATTCCCTATCAACTCAGAGCCTGCTTTTTGGGGCTTATTGCCAGTGGCAGGTATCACTATCCCTTAATCCACTCGCCGTCGCCGGCTTTATCATACTTGTTTTCAACCCTAGATGAACCATCAAATCTATTTGATGATGGTCTTTCGGGTAATGGGGATCGAAATCAACGCTGCATAGACGAGCGATTCGTTATCAGTGAACTTTCTGATCTTCTACTAGTTAATCCTTATGGTTCGGTTTATAACACTCTCAAAGAGAATTCCATTGAGAAACAGATGCCAGTTAGGGCAGTTGATGGATTCTTGCTTCCAGAAGAGAAATTGCGAGGTGTTTTCCTTCAAAAACTGAATGGTAAAACCGCAATTGAGCATGCATTAGCTAATACTGATGTGATTTTGAGTCAAGATGTTGTCAGCAAAGTATTAAACACTGGGAGTTTAGGTAGCGAAGCAATGGTTACCTTCTTTTATTGGGCTATTAAACAGCCGTCGATACCTAAAGATGCTTCTAGTTACAACATAATTCTTAAAGCTTTAGGTAGAAGGGGTTTTTTTGACTCCATGATGGATGTTTTGTACAACATGACACGGGAGGGAGTGGAGGCTACATTGGAAATGGTCTCCATTGTAGTAGACAGTCTGGTCAAGGGTCACCAAGTTTCTAAGGCACTTCAATTTTTCAGAAACTTGAAAGAAATTGGGTTGAAATGTGATACTGAAACCTTGAACATTCTTCTACAATGCATGTGTCGACGATCCCACGTTGGTGCTGCAAACTCCTTCTTTAATTTAACCAAGGGGAATATCCCTTTCAATGTCATGACATATAACATTGTAATTGGTGGATGGTCAAGATACGGTAGGCACGGTGAAGTTGAGCAAATGTTGAAAGCAATGGAACTTGATGGATTTTCTCCAGACTGTCTGACCCACACCTATCTTATTGAGTGTCTTGGCAGAGCTAATCAGATTGATGATGCTGTCAAGATTTTTGATAAAATGGATGAAAACGGCTGTACACCAGATGTTGATGCTTACAATGCAATGATCTCCAACTTTATATGTATAGGTGATTTTGATCAATGCCTGACCTATTACGAGCGTATGTTGAGCAACAGATGTGAACCTGACATGAACACCTATTCGAATTTGATTACTGGCTTTCTCAAGGCCAAGAAAGTAGCCGATGCACTAGAAATGTTTGATGAAATGGTGGCAAGAATAATTCCCACTACGGGGGCAATAACATCCTTTATTCAACTTAGCTGTAGTTATGGTCCTCCACACGCAGCTATGTTAATCTACAAGAAAGCAAGAAAAGTTGGATGTAGGATATCCAAGAATGCATACAAATTGTTGCTAATGCGGCTTTCTTTGTTTGGTAAATTTGGCATGCTATTAAATATATGGAATGAGATGCAAGAAAGTGGTTATGATCCTGATGTGGAGACTTATGAGCATGCCATTGACTGTCTCTGTAAAACAGGGCAGCTTGAAAATGCTGTACTCGTCATGGAGGAATGTTTACGTCAGGGTTTCTTCCCAAGTAGGCGAACACGTAGTAAGCTTAATAACAAACTATTGGCCTGTAATAGGACAGAGATGGCATATAAACTCTGGTTGAAAATCAAAGTTGCTCGTCATCAGGAAAATCTGCAAAGATGTTGGCGTGCCAAGGGATGGCATTATTGA

Protein sequence

MAFGASRRLIPYQLRACFLGLIASGRYHYPLIHSPSPALSYLFSTLDEPSNLFDDGLSGNGDRNQRCIDERFVISELSDLLLVNPYGSVYNTLKENSIEKQMPVRAVDGFLLPEEKLRGVFLQKLNGKTAIEHALANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDASSYNIILKALGRRGFFDSMMDVLYNMTREGVEATLEMVSIVVDSLVKGHQVSKALQFFRNLKEIGLKCDTETLNILLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGRHGEVEQMLKAMELDGFSPDCLTHTYLIECLGRANQIDDAVKIFDKMDENGCTPDVDAYNAMISNFICIGDFDQCLTYYERMLSNRCEPDMNTYSNLITGFLKAKKVADALEMFDEMVARIIPTTGAITSFIQLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRRTRSKLNNKLLACNRTEMAYKLWLKIKVARHQENLQRCWRAKGWHY
BLAST of CsaV3_4G037400 vs. NCBI nr
Match: XP_011654277.1 (PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820 [Cucumis sativus] >XP_011654278.1 PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820 [Cucumis sativus] >KGN55514.1 hypothetical protein Csa_4G663730 [Cucumis sativus])

HSP 1 Score: 958.4 bits (2476), Expect = 1.1e-275
Identity = 572/572 (100.00%), Postives = 572/572 (100.00%), Query Frame = 0

Query: 1   MAFGASRRLIPYQLRACFLGLIASGRYHYPLIHSPSPALSYLFSTLDEPSNLFDDGLSGN 60
           MAFGASRRLIPYQLRACFLGLIASGRYHYPLIHSPSPALSYLFSTLDEPSNLFDDGLSGN
Sbjct: 1   MAFGASRRLIPYQLRACFLGLIASGRYHYPLIHSPSPALSYLFSTLDEPSNLFDDGLSGN 60

Query: 61  GDRNQRCIDERFVISELSDLLLVNPYGSVYNTLKENSIEKQMPVRAVDGFLLPEEKLRGV 120
           GDRNQRCIDERFVISELSDLLLVNPYGSVYNTLKENSIEKQMPVRAVDGFLLPEEKLRGV
Sbjct: 61  GDRNQRCIDERFVISELSDLLLVNPYGSVYNTLKENSIEKQMPVRAVDGFLLPEEKLRGV 120

Query: 121 FLQKLNGKTAIEHALANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDASS 180
           FLQKLNGKTAIEHALANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDASS
Sbjct: 121 FLQKLNGKTAIEHALANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDASS 180

Query: 181 YNIILKALGRRGFFDSMMDVLYNMTREGVEATLEMVSIVVDSLVKGHQVSKALQFFRNLK 240
           YNIILKALGRRGFFDSMMDVLYNMTREGVEATLEMVSIVVDSLVKGHQVSKALQFFRNLK
Sbjct: 181 YNIILKALGRRGFFDSMMDVLYNMTREGVEATLEMVSIVVDSLVKGHQVSKALQFFRNLK 240

Query: 241 EIGLKCDTETLNILLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGRHGE 300
           EIGLKCDTETLNILLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGRHGE
Sbjct: 241 EIGLKCDTETLNILLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGRHGE 300

Query: 301 VEQMLKAMELDGFSPDCLTHTYLIECLGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           VEQMLKAMELDGFSPDCLTHTYLIECLGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 VEQMLKAMELDGFSPDCLTHTYLIECLGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARIIPT 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARIIPT
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARIIPT 420

Query: 421 TGAITSFIQLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWN 480
           TGAITSFIQLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWN
Sbjct: 421 TGAITSFIQLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWN 480

Query: 481 EMQESGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRRTRSKLNNKLLACN 540
           EMQESGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRRTRSKLNNKLLACN
Sbjct: 481 EMQESGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRRTRSKLNNKLLACN 540

Query: 541 RTEMAYKLWLKIKVARHQENLQRCWRAKGWHY 573
           RTEMAYKLWLKIKVARHQENLQRCWRAKGWHY
Sbjct: 541 RTEMAYKLWLKIKVARHQENLQRCWRAKGWHY 572

BLAST of CsaV3_4G037400 vs. NCBI nr
Match: XP_008452985.1 (PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820 [Cucumis melo] >XP_008452986.1 PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820 [Cucumis melo] >XP_008452987.1 PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820 [Cucumis melo] >XP_016901338.1 PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820 [Cucumis melo])

HSP 1 Score: 879.0 bits (2270), Expect = 8.6e-252
Identity = 525/572 (91.78%), Postives = 541/572 (94.58%), Query Frame = 0

Query: 1   MAFGASRRLIPYQLRACFLGLIASGRYHYPLIHSPSPALSYLFSTLDEPSNLFDDGLSGN 60
           MAFGASRRL+PYQ++ACFLGLIASGRYHYPLIHSPSPALSYLFSTLDEPSNLFDDG+SGN
Sbjct: 1   MAFGASRRLLPYQVKACFLGLIASGRYHYPLIHSPSPALSYLFSTLDEPSNLFDDGVSGN 60

Query: 61  GDRNQRCIDERFVISELSDLLLVNPYGSVYNTLKENSIEKQMPVRAVDGFLLPEEKLRGV 120
           GDRNQRCIDERFVISELSDLLLVNP+GSV NT+KEN  EKQ+P+RAVDGFLLPEEKLRGV
Sbjct: 61  GDRNQRCIDERFVISELSDLLLVNPHGSVSNTVKENLTEKQVPIRAVDGFLLPEEKLRGV 120

Query: 121 FLQKLNGKTAIEHALANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDASS 180
           FLQKLNGKTAIEHALANTDV LSQDVVSKVLNTGSLGSEAMVTFFYW+IKQPSIPKDASS
Sbjct: 121 FLQKLNGKTAIEHALANTDVNLSQDVVSKVLNTGSLGSEAMVTFFYWSIKQPSIPKDASS 180

Query: 181 YNIILKALGRRGFFDSMMDVLYNMTREGVEATLEMVSIVVDSLVKGHQVSKALQFFRNLK 240
           YNIILKALGRRGFFDSMMDVLY+MTREGV+ATLE VSIVVDSLVK HQVSKALQFFRNLK
Sbjct: 181 YNIILKALGRRGFFDSMMDVLYSMTREGVDATLETVSIVVDSLVKAHQVSKALQFFRNLK 240

Query: 241 EIGLKCDTETLNILLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGRHGE 300
           EIGLKCDTETLNILLQCMCRRSHVGAANSF NLTKG+IPFNVMTYNI+IGGWSRYGRH E
Sbjct: 241 EIGLKCDTETLNILLQCMCRRSHVGAANSFLNLTKGSIPFNVMTYNIIIGGWSRYGRHSE 300

Query: 301 VEQMLKAMELDGFSPDCLTHTYLIECLGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           VEQ LKAME+DGFSPD            XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 VEQTLKAMEVDGFSPDYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARIIPT 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX        ARIIPT
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLEMFDEMVARIIPT 420

Query: 421 TGAITSFIQLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWN 480
           TGAITSFIQLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLL+IWN
Sbjct: 421 TGAITSFIQLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLSIWN 480

Query: 481 EMQESGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRRTRSKLNNKLLACN 540
           EMQESGYDPDVETYEHAI CLCKTGQLENAVLVMEECLRQGFFPSR+ RSKLNNKLLACN
Sbjct: 481 EMQESGYDPDVETYEHAIGCLCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLACN 540

Query: 541 RTEMAYKLWLKIKVARHQENLQRCWRAKGWHY 573
           RTEMAYKLWLKIKVARHQENLQRCWRAKGWHY
Sbjct: 541 RTEMAYKLWLKIKVARHQENLQRCWRAKGWHY 572

BLAST of CsaV3_4G037400 vs. NCBI nr
Match: XP_022138983.1 (putative pentatricopeptide repeat-containing protein At5g43820 isoform X1 [Momordica charantia] >XP_022138992.1 putative pentatricopeptide repeat-containing protein At5g43820 isoform X1 [Momordica charantia])

HSP 1 Score: 649.8 bits (1675), Expect = 8.5e-183
Identity = 425/574 (74.04%), Postives = 458/574 (79.79%), Query Frame = 0

Query: 1   MAFGASRRLIPYQLRACFLGLIASGRYHYPLIHSPSP--ALSYLFSTLDEPSNLFDDGLS 60
           MAFGASRR++ YQ + CFL     GRY Y L++SPSP  +LSYLFSTLDEPSNLFD G+ 
Sbjct: 1   MAFGASRRILSYQFQGCFLARRGRGRYPYRLLYSPSPSSSLSYLFSTLDEPSNLFDGGVL 60

Query: 61  GNGDRNQRCIDERFVISELSDLLLVNPYGSVYNTLKENSIEKQMPVRAVDGFLLPEEKLR 120
           GNG +NQ  IDERF+I ELSDLLL NPYGSV NTLKE    KQMP+RAVDGFL PEEKLR
Sbjct: 61  GNGIQNQSTIDERFIIGELSDLLLANPYGSVPNTLKEKHTVKQMPIRAVDGFLRPEEKLR 120

Query: 121 GVFLQKLNGKTAIEHALANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDA 180
           GVFLQKLNGKTAIEHAL + DV LS DV+++VLNTGSLGSEAM+TFFYWAIKQP+I +D 
Sbjct: 121 GVFLQKLNGKTAIEHALDSADVNLSLDVINEVLNTGSLGSEAMITFFYWAIKQPTIAEDT 180

Query: 181 SSYNIILKALGRRGFFDSMMDVLYNMTREGVEATLEMVSIVVDSLVKGHQVSKALQFFRN 240
           SSYN+ILKALGRR FFDSMMDVL+NMTR+GV A +E VSIVVDSLVK HQ          
Sbjct: 181 SSYNVILKALGRRRFFDSMMDVLHNMTRKGVNANIETVSIVVDSLVKAHQXXXXXXXXXX 240

Query: 241 LKEIGLKCDTETLNILLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGRH 300
                                         SFFNL + N+PFN MTYNI++ GWSR+GR 
Sbjct: 241 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSFFNLIRKNVPFNAMTYNIILSGWSRHGRL 300

Query: 301 GEVEQMLKAMELDGFSPDCLTHTYLIECLGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
            EVE++L AME DGFSPD            XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 SEVERVLNAMEGDGFSPDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARII 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX        R+I
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEMFDEMVVRVI 420

Query: 421 PTTGAITSFIQLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNI 480
           PTTGAITSFI+L CSYGPPHAAMLIYKKARKVGCRISKNAYKLL+MRLSLFGKFG LLNI
Sbjct: 421 PTTGAITSFIELCCSYGPPHAAMLIYKKARKVGCRISKNAYKLLMMRLSLFGKFGTLLNI 480

Query: 481 WNEMQESGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRRTRSKLNNKLLA 540
           WNEMQESGYDPDVETYEHAIDCL KTGQLENAVLVMEECLRQGFFPSR+  SKL NKLLA
Sbjct: 481 WNEMQESGYDPDVETYEHAIDCLWKTGQLENAVLVMEECLRQGFFPSRQICSKLYNKLLA 540

Query: 541 CNRTEMAYKLWLKIKVARHQENLQRCWRAKGWHY 573
             R EMAY+LWLKIKVAR QENLQR WRA GWHY
Sbjct: 541 SERIEMAYRLWLKIKVARQQENLQRRWRANGWHY 574

BLAST of CsaV3_4G037400 vs. NCBI nr
Match: XP_022975779.1 (putative pentatricopeptide repeat-containing protein At5g43820 [Cucurbita maxima] >XP_022975781.1 putative pentatricopeptide repeat-containing protein At5g43820 [Cucurbita maxima])

HSP 1 Score: 649.4 bits (1674), Expect = 1.1e-182
Identity = 427/575 (74.26%), Postives = 454/575 (78.96%), Query Frame = 0

Query: 1   MAFGASRRLIPYQLRACFLGLIASGRYHYPLIHSP--SPALSYLFSTLDEPSNLFDD-GL 60
           MAFGASRR + YQ + C LG + SGRYHYPL + P  S ALSYLFSTLDEPSNLFDD  +
Sbjct: 1   MAFGASRRYLSYQFKGCLLGRLTSGRYHYPLPYLPSLSSALSYLFSTLDEPSNLFDDVSV 60

Query: 61  SGNGDRNQRCIDERFVISELSDLLLVNPYGSVYNTLKENSIEKQMPVRAVDGFLLPEEKL 120
            GNG R QR IDERFV+ ELSDLLLVNPYGSV +T+KEN  EKQ+ +RAVDGFL PEEKL
Sbjct: 61  LGNGIRYQRSIDERFVMRELSDLLLVNPYGSVSDTVKENHTEKQVSIRAVDGFLRPEEKL 120

Query: 121 RGVFLQKLNGKTAIEHALANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKD 180
           RGVFLQKLNGKTAIEHALANTDV LSQDVV+KVLNTGSLGSEAM+TFF WAIKQPSIP D
Sbjct: 121 RGVFLQKLNGKTAIEHALANTDVNLSQDVVNKVLNTGSLGSEAMITFFDWAIKQPSIPTD 180

Query: 181 ASSYNIILKALGRRGFFDSMMDVLYNMTREGVEATLEMVSIVVDSLVKGHQVSKALQFFR 240
            SSYNIILKALGRR FFDSM+DVL+NMTR+GV   +E                       
Sbjct: 181 TSSYNIILKALGRRRFFDSMIDVLHNMTRKGVIVNMETXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 NLKEIGLKCDTETLNILLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGR 300
                                            FNL KG++PFN M YNI+I GWSRYGR
Sbjct: 241 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFNLIKGSVPFNAMAYNILISGWSRYGR 300

Query: 301 HGEVEQMLKAMELDGFSPDCLTHTYLIECLGXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
             EVE++LKAME DGFSPD            XXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 LSEVERILKAMEADGFSPDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARI 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX          RI
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXALEMFDEMVVRI 420

Query: 421 IPTTGAITSFIQLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLN 480
           IPTTGAITSF++LSCSYGPPHAAMLIYKKA+KVGCRISKNAYKLLLMRLSLFGKFGMLLN
Sbjct: 421 IPTTGAITSFMKLSCSYGPPHAAMLIYKKAKKVGCRISKNAYKLLLMRLSLFGKFGMLLN 480

Query: 481 IWNEMQESGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRRTRSKLNNKLL 540
           IWNEMQESGYD DVETYEHAIDCLCKTGQLENAVLVM ECLRQGFFPSR+ RSKLNNKLL
Sbjct: 481 IWNEMQESGYDLDVETYEHAIDCLCKTGQLENAVLVMGECLRQGFFPSRKIRSKLNNKLL 540

Query: 541 ACNRTEMAYKLWLKIKVARHQENLQRCWRAKGWHY 573
           A NRTEMAY+LWLKIKVARHQ+NLQRCWRAKGWHY
Sbjct: 541 ASNRTEMAYRLWLKIKVARHQDNLQRCWRAKGWHY 575

BLAST of CsaV3_4G037400 vs. NCBI nr
Match: XP_022936331.1 (putative pentatricopeptide repeat-containing protein At5g43820 [Cucurbita moschata] >XP_022936332.1 putative pentatricopeptide repeat-containing protein At5g43820 [Cucurbita moschata])

HSP 1 Score: 640.2 bits (1650), Expect = 6.8e-180
Identity = 428/575 (74.43%), Postives = 455/575 (79.13%), Query Frame = 0

Query: 1   MAFGASRRLIPYQLRACFLGLIASGRYHYPLIHSP--SPALSYLFSTLDEPSNLFDD-GL 60
           MAFGASRR + YQ + C LG +  GRYHYPL + P  S ALSYLFSTLDEPSNLFDD  +
Sbjct: 1   MAFGASRRYLSYQFKGCLLGRLTGGRYHYPLPYLPSLSSALSYLFSTLDEPSNLFDDVSV 60

Query: 61  SGNGDRNQRCIDERFVISELSDLLLVNPYGSVYNTLKENSIEKQMPVRAVDGFLLPEEKL 120
            GNG R QR IDERFV+ ELSDLLLVNPYGSV +T+KEN  EKQ+ +RAVDGFL PEEKL
Sbjct: 61  LGNGIRYQRSIDERFVMRELSDLLLVNPYGSVSDTVKENHTEKQVSIRAVDGFLRPEEKL 120

Query: 121 RGVFLQKLNGKTAIEHALANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKD 180
           RGVFLQKLNGKTAIEHALANTDV LSQDVV+KVLNTGSLGS+AM+TFF WAIKQPSIP D
Sbjct: 121 RGVFLQKLNGKTAIEHALANTDVNLSQDVVNKVLNTGSLGSKAMITFFDWAIKQPSIPTD 180

Query: 181 ASSYNIILKALGRRGFFDSMMDVLYNMTREGVEATLEMVSIVVDSLVKGHQVSKALQFFR 240
            SSYNIILKALGRR FFDSMMDVL+NMTR+GV   +E                       
Sbjct: 181 TSSYNIILKALGRRRFFDSMMDVLHNMTRKGVIVNMEXXXXXXXXXXXXXXXXXXXXXXX 240

Query: 241 NLKEIGLKCDTETLNILLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGR 300
                                              L KG++PFN M YNI+I GWSRYGR
Sbjct: 241 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLIKGSVPFNAMAYNILISGWSRYGR 300

Query: 301 HGEVEQMLKAMELDGFSPDCLTHTYLIECLGXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
             EVE++LKAME DGFSPD            XXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 LSEVERILKAMEADGFSPDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARI 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX     RI
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDEMVVRI 420

Query: 421 IPTTGAITSFIQLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLN 480
           IPTTGAITSF++LSCSYGPPHAAMLIYKKA+KVGCRISKNAYKLLLMRLSLFGKFGMLLN
Sbjct: 421 IPTTGAITSFMKLSCSYGPPHAAMLIYKKAKKVGCRISKNAYKLLLMRLSLFGKFGMLLN 480

Query: 481 IWNEMQESGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRRTRSKLNNKLL 540
           IWNEMQESGYD DVETYEHAIDCLCKTGQLENAVLVM ECLRQGF PSR+ RSKLNNKLL
Sbjct: 481 IWNEMQESGYDLDVETYEHAIDCLCKTGQLENAVLVMGECLRQGFLPSRKIRSKLNNKLL 540

Query: 541 ACNRTEMAYKLWLKIKVARHQENLQRCWRAKGWHY 573
           A NRTEMAY+LWLKIKVARHQ+NLQRCWRAKGWHY
Sbjct: 541 ASNRTEMAYRLWLKIKVARHQDNLQRCWRAKGWHY 575

BLAST of CsaV3_4G037400 vs. TAIR10
Match: AT5G43820.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 247.7 bits (631), Expect = 1.8e-65
Identity = 124/252 (49.21%), Postives = 170/252 (67.46%), Query Frame = 0

Query: 64  NQRCIDERFVISELSDLLLVNPYGSVYNTLKENSIEKQMPVRAVDGFLLPEEKLRGVFLQ 123
           N   +DE +V++ELS LL ++   +  +  KE+S  K     A+D FL  E+KLRGVFLQ
Sbjct: 41  NHGVVDESYVLAELSSLLPIS--SNKTSVSKEDSSSKNQV--AIDSFLSAEDKLRGVFLQ 100

Query: 124 KLNGKTAIEHALANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDASSYNI 183
           KL GK+AI+ +L++  + LS D+V+ VLN G+L  EAMVTFF WA+++P + KD  SY++
Sbjct: 101 KLKGKSAIQKSLSSLGIGLSIDIVADVLNRGNLSGEAMVTFFDWAVREPGVTKDVGSYSV 160

Query: 184 ILKALGRRGFFDSMMDVLYNMTREGVEATLEMVSIVVDSLVKGHQVSKALQFFRNLKEIG 243
           IL+ALGRR  F  MMDVL  M  EGV   LE ++I +DS V+ H V +A++ F   +  G
Sbjct: 161 ILRALGRRKLFSFMMDVLKGMVCEGVNPDLECLTIAMDSFVRVHYVRRAIELFEESESFG 220

Query: 244 LKCDTETLNILLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGRHGEVEQ 303
           +KC TE+ N LL+C+C RSHV AA S FN  KGNIPF+  +YNI+I GWS+ G   E+E+
Sbjct: 221 VKCSTESFNALLRCLCERSHVSAAKSVFNAKKGNIPFDSCSYNIMISGWSKLGEVEEMEK 280

Query: 304 MLKAMELDGFSP 316
           +LK M   GF P
Sbjct: 281 VLKEMVESGFGP 288

BLAST of CsaV3_4G037400 vs. TAIR10
Match: AT3G22670.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 79.3 bits (194), Expect = 8.3e-15
Identity = 61/245 (24.90%), Postives = 113/245 (46.12%), Query Frame = 0

Query: 69  DERFVISELSDLLLVNPYGSVYNTLKENSIEKQMPVRAVDGFLLPEEKLRGVFLQKLNGK 128
           DE FVI  L++ +    +      + E ++ K+ PV  +D       K+     +K    
Sbjct: 67  DEDFVIPSLANWVESQKFSR--QQVSEGNVVKK-PVEDID-------KVCDFLNKKDTSH 126

Query: 129 TAIEHALANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDASSYNIILKAL 188
             +   L+  DV++++ +V +VL   S G      FF WA  Q        +YN ++  L
Sbjct: 127 EDVVKELSKCDVVVTESLVLQVLRRFSNGWNQAYGFFIWANSQTGYVHSGHTYNAMVDVL 186

Query: 189 GRRGFFDSMMDVLYNMTR--EGVEATLEMVSIVVDSLVKGHQVSKALQFFRNL-KEIGLK 248
           G+   FD M +++  M +  E    TL+ +S V+  L K  + +KA+  F  + K  G+K
Sbjct: 187 GKCRNFDLMWELVNEMNKNEESKLVTLDTMSKVMRRLAKSGKYNKAVDAFLEMEKSYGVK 246

Query: 249 CDTETLNILLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGRHGEVEQML 308
            DT  +N L+  + + + +  A+  F      I  +  T+NI+I G+ +  +  +   M+
Sbjct: 247 TDTIAMNSLMDALVKENSIEHAHEVFLKLFDTIKPDARTFNILIHGFCKARKFDDARAMM 301

Query: 309 KAMEL 311
             M++
Sbjct: 307 DLMKV 301

BLAST of CsaV3_4G037400 vs. TAIR10
Match: AT5G15010.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 79.3 bits (194), Expect = 8.3e-15
Identity = 50/198 (25.25%), Postives = 86/198 (43.43%), Query Frame = 0

Query: 131 IEHALANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDASSYNIILKALGR 190
           + + L   DV  S ++V ++L+      E   TFF WA KQ    +    Y+ ++  LG+
Sbjct: 114 LRNKLEECDVKPSNELVVEILSRVRNDWETAFTFFVWAGKQQGYVRSVREYHSMISILGK 173

Query: 191 RGFFDSMMDVLYNMTREGVE-ATLEMVSIVVDSLVKGHQVSKALQFFRNLKEIGLKCDTE 250
              FD+   ++  M +        + + I++      H V KA+  F   K   L+   +
Sbjct: 174 MRKFDTAWTLIDEMRKFSPSLVNSQTLLIMIRKYCAVHDVGKAINTFHAYKRFKLEMGID 233

Query: 251 TLNILLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSR-YGRHGEVEQMLKAM 310
               LL  +CR  +V  A       K   PF+  ++NIV+ GW    G   E E++   M
Sbjct: 234 DFQSLLSALCRYKNVSDAGHLIFCNKDKYPFDAKSFNIVLNGWCNVIGSPREAERVWMEM 293

Query: 311 ELDGFSPDCLTHTYLIEC 327
              G   D ++++ +I C
Sbjct: 294 GNVGVKHDVVSYSSMISC 311

BLAST of CsaV3_4G037400 vs. TAIR10
Match: AT3G15200.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 64.7 bits (156), Expect = 2.1e-10
Identity = 41/186 (22.04%), Postives = 77/186 (41.40%), Query Frame = 0

Query: 131 IEHALANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDASSYNIILKALGR 190
           I+  L    + L++++V +V+N      +         +KQ      +  YN IL  LG+
Sbjct: 96  IKRILDKCGIDLTEELVLEVVNRNRSDWKPAYILSQLVVKQSVHLSSSMLYNEILDVLGK 155

Query: 191 RGFFDSMMDVLYNMTREGVEATLEMVSIVVDSLVKGHQVSKALQFFRNLKEIGLKCDTET 250
              F+    V   M++       +   ++++     H+V +A+  F   KE G+  D   
Sbjct: 156 MRRFEEFHQVFDEMSKRDGFVNEKTYEVLLNRYAAAHKVDEAVGVFERRKEFGIDDDLVA 215

Query: 251 LNILLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGRHGEVEQMLKAMEL 310
            + LL  +CR  HV  A + F   +     ++   N+++ GW   G   E ++  K +  
Sbjct: 216 FHGLLMWLCRYKHVEFAETLFCSRRREFGCDIKAMNMILNGWCVLGNVHEAKRFWKDIIA 275

Query: 311 DGFSPD 317
               PD
Sbjct: 276 SKCRPD 281

BLAST of CsaV3_4G037400 vs. TAIR10
Match: AT5G60960.1 (Pentatricopeptide repeat (PPR) superfamily protein)

HSP 1 Score: 61.2 bits (147), Expect = 2.3e-09
Identity = 39/194 (20.10%), Postives = 81/194 (41.75%), Query Frame = 0

Query: 135 LANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDASSYNIILKALGRRGFF 194
           L+ + +  + D++ + LN       A + F  W     +      + +  +   GRR  F
Sbjct: 100 LSFSHITPNPDLILQTLNLSPEAGRAALGFNEWLDSNSNFSHTDETVSFFVDYFGRRKDF 159

Query: 195 DSMMDVLYNMTREGVEATLEMVSIVVDSLVKGHQVSKALQFFRNLK-EIGLKCDTETLNI 254
             M++++          TLE     +D LV+  +  +   FF  ++ + GLK D E+L +
Sbjct: 160 KGMLEIISKYKGIAGGKTLES---AIDRLVRAGRPKQVTDFFEKMENDYGLKRDKESLTL 219

Query: 255 LLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGRHGEVEQMLKAMELDGF 314
           +++ +C + H   A      T   I  +    +++I GW    +  E  ++   M   GF
Sbjct: 220 VVKKLCEKGHASIAEKMVKNTANEIFPDENICDLLISGWCIAEKLDEATRLAGEMSRGGF 279

Query: 315 SPDCLTHTYLIECL 328
                 +  +++C+
Sbjct: 280 EIGTKAYNMMLDCV 290

BLAST of CsaV3_4G037400 vs. Swiss-Prot
Match: sp|P0C8R0|PP416_ARATH (Putative pentatricopeptide repeat-containing protein At5g43820 OS=Arabidopsis thaliana OX=3702 GN=At5g43820 PE=3 SV=1)

HSP 1 Score: 247.7 bits (631), Expect = 3.2e-64
Identity = 124/252 (49.21%), Postives = 170/252 (67.46%), Query Frame = 0

Query: 64  NQRCIDERFVISELSDLLLVNPYGSVYNTLKENSIEKQMPVRAVDGFLLPEEKLRGVFLQ 123
           N   +DE +V++ELS LL ++   +  +  KE+S  K     A+D FL  E+KLRGVFLQ
Sbjct: 41  NHGVVDESYVLAELSSLLPIS--SNKTSVSKEDSSSKNQV--AIDSFLSAEDKLRGVFLQ 100

Query: 124 KLNGKTAIEHALANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDASSYNI 183
           KL GK+AI+ +L++  + LS D+V+ VLN G+L  EAMVTFF WA+++P + KD  SY++
Sbjct: 101 KLKGKSAIQKSLSSLGIGLSIDIVADVLNRGNLSGEAMVTFFDWAVREPGVTKDVGSYSV 160

Query: 184 ILKALGRRGFFDSMMDVLYNMTREGVEATLEMVSIVVDSLVKGHQVSKALQFFRNLKEIG 243
           IL+ALGRR  F  MMDVL  M  EGV   LE ++I +DS V+ H V +A++ F   +  G
Sbjct: 161 ILRALGRRKLFSFMMDVLKGMVCEGVNPDLECLTIAMDSFVRVHYVRRAIELFEESESFG 220

Query: 244 LKCDTETLNILLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGRHGEVEQ 303
           +KC TE+ N LL+C+C RSHV AA S FN  KGNIPF+  +YNI+I GWS+ G   E+E+
Sbjct: 221 VKCSTESFNALLRCLCERSHVSAAKSVFNAKKGNIPFDSCSYNIMISGWSKLGEVEEMEK 280

Query: 304 MLKAMELDGFSP 316
           +LK M   GF P
Sbjct: 281 VLKEMVESGFGP 288

BLAST of CsaV3_4G037400 vs. Swiss-Prot
Match: sp|Q9LUJ4|PP248_ARATH (Pentatricopeptide repeat-containing protein At3g22670, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At3g22670 PE=2 SV=1)

HSP 1 Score: 79.3 bits (194), Expect = 1.5e-13
Identity = 61/245 (24.90%), Postives = 113/245 (46.12%), Query Frame = 0

Query: 69  DERFVISELSDLLLVNPYGSVYNTLKENSIEKQMPVRAVDGFLLPEEKLRGVFLQKLNGK 128
           DE FVI  L++ +    +      + E ++ K+ PV  +D       K+     +K    
Sbjct: 67  DEDFVIPSLANWVESQKFSR--QQVSEGNVVKK-PVEDID-------KVCDFLNKKDTSH 126

Query: 129 TAIEHALANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDASSYNIILKAL 188
             +   L+  DV++++ +V +VL   S G      FF WA  Q        +YN ++  L
Sbjct: 127 EDVVKELSKCDVVVTESLVLQVLRRFSNGWNQAYGFFIWANSQTGYVHSGHTYNAMVDVL 186

Query: 189 GRRGFFDSMMDVLYNMTR--EGVEATLEMVSIVVDSLVKGHQVSKALQFFRNL-KEIGLK 248
           G+   FD M +++  M +  E    TL+ +S V+  L K  + +KA+  F  + K  G+K
Sbjct: 187 GKCRNFDLMWELVNEMNKNEESKLVTLDTMSKVMRRLAKSGKYNKAVDAFLEMEKSYGVK 246

Query: 249 CDTETLNILLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGRHGEVEQML 308
            DT  +N L+  + + + +  A+  F      I  +  T+NI+I G+ +  +  +   M+
Sbjct: 247 TDTIAMNSLMDALVKENSIEHAHEVFLKLFDTIKPDARTFNILIHGFCKARKFDDARAMM 301

Query: 309 KAMEL 311
             M++
Sbjct: 307 DLMKV 301

BLAST of CsaV3_4G037400 vs. Swiss-Prot
Match: sp|Q9LFQ4|PP383_ARATH (Pentatricopeptide repeat-containing protein At5g15010, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=At5g15010 PE=2 SV=2)

HSP 1 Score: 79.3 bits (194), Expect = 1.5e-13
Identity = 50/198 (25.25%), Postives = 86/198 (43.43%), Query Frame = 0

Query: 131 IEHALANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDASSYNIILKALGR 190
           + + L   DV  S ++V ++L+      E   TFF WA KQ    +    Y+ ++  LG+
Sbjct: 114 LRNKLEECDVKPSNELVVEILSRVRNDWETAFTFFVWAGKQQGYVRSVREYHSMISILGK 173

Query: 191 RGFFDSMMDVLYNMTREGVE-ATLEMVSIVVDSLVKGHQVSKALQFFRNLKEIGLKCDTE 250
              FD+   ++  M +        + + I++      H V KA+  F   K   L+   +
Sbjct: 174 MRKFDTAWTLIDEMRKFSPSLVNSQTLLIMIRKYCAVHDVGKAINTFHAYKRFKLEMGID 233

Query: 251 TLNILLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSR-YGRHGEVEQMLKAM 310
               LL  +CR  +V  A       K   PF+  ++NIV+ GW    G   E E++   M
Sbjct: 234 DFQSLLSALCRYKNVSDAGHLIFCNKDKYPFDAKSFNIVLNGWCNVIGSPREAERVWMEM 293

Query: 311 ELDGFSPDCLTHTYLIEC 327
              G   D ++++ +I C
Sbjct: 294 GNVGVKHDVVSYSSMISC 311

BLAST of CsaV3_4G037400 vs. Swiss-Prot
Match: sp|Q9LIL5|PP233_ARATH (Putative pentatricopeptide repeat-containing protein At3g15200 OS=Arabidopsis thaliana OX=3702 GN=At3g15200 PE=3 SV=1)

HSP 1 Score: 64.7 bits (156), Expect = 3.8e-09
Identity = 41/186 (22.04%), Postives = 77/186 (41.40%), Query Frame = 0

Query: 131 IEHALANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDASSYNIILKALGR 190
           I+  L    + L++++V +V+N      +         +KQ      +  YN IL  LG+
Sbjct: 96  IKRILDKCGIDLTEELVLEVVNRNRSDWKPAYILSQLVVKQSVHLSSSMLYNEILDVLGK 155

Query: 191 RGFFDSMMDVLYNMTREGVEATLEMVSIVVDSLVKGHQVSKALQFFRNLKEIGLKCDTET 250
              F+    V   M++       +   ++++     H+V +A+  F   KE G+  D   
Sbjct: 156 MRRFEEFHQVFDEMSKRDGFVNEKTYEVLLNRYAAAHKVDEAVGVFERRKEFGIDDDLVA 215

Query: 251 LNILLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGRHGEVEQMLKAMEL 310
            + LL  +CR  HV  A + F   +     ++   N+++ GW   G   E ++  K +  
Sbjct: 216 FHGLLMWLCRYKHVEFAETLFCSRRREFGCDIKAMNMILNGWCVLGNVHEAKRFWKDIIA 275

Query: 311 DGFSPD 317
               PD
Sbjct: 276 SKCRPD 281

BLAST of CsaV3_4G037400 vs. Swiss-Prot
Match: sp|Q9FME4|PP438_ARATH (Pentatricopeptide repeat-containing protein PNM1, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=PNM1 PE=1 SV=1)

HSP 1 Score: 61.2 bits (147), Expect = 4.2e-08
Identity = 39/194 (20.10%), Postives = 81/194 (41.75%), Query Frame = 0

Query: 135 LANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDASSYNIILKALGRRGFF 194
           L+ + +  + D++ + LN       A + F  W     +      + +  +   GRR  F
Sbjct: 100 LSFSHITPNPDLILQTLNLSPEAGRAALGFNEWLDSNSNFSHTDETVSFFVDYFGRRKDF 159

Query: 195 DSMMDVLYNMTREGVEATLEMVSIVVDSLVKGHQVSKALQFFRNLK-EIGLKCDTETLNI 254
             M++++          TLE     +D LV+  +  +   FF  ++ + GLK D E+L +
Sbjct: 160 KGMLEIISKYKGIAGGKTLES---AIDRLVRAGRPKQVTDFFEKMENDYGLKRDKESLTL 219

Query: 255 LLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGRHGEVEQMLKAMELDGF 314
           +++ +C + H   A      T   I  +    +++I GW    +  E  ++   M   GF
Sbjct: 220 VVKKLCEKGHASIAEKMVKNTANEIFPDENICDLLISGWCIAEKLDEATRLAGEMSRGGF 279

Query: 315 SPDCLTHTYLIECL 328
                 +  +++C+
Sbjct: 280 EIGTKAYNMMLDCV 290

BLAST of CsaV3_4G037400 vs. TrEMBL
Match: tr|A0A0A0L3E0|A0A0A0L3E0_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G663730 PE=4 SV=1)

HSP 1 Score: 958.4 bits (2476), Expect = 7.4e-276
Identity = 572/572 (100.00%), Postives = 572/572 (100.00%), Query Frame = 0

Query: 1   MAFGASRRLIPYQLRACFLGLIASGRYHYPLIHSPSPALSYLFSTLDEPSNLFDDGLSGN 60
           MAFGASRRLIPYQLRACFLGLIASGRYHYPLIHSPSPALSYLFSTLDEPSNLFDDGLSGN
Sbjct: 1   MAFGASRRLIPYQLRACFLGLIASGRYHYPLIHSPSPALSYLFSTLDEPSNLFDDGLSGN 60

Query: 61  GDRNQRCIDERFVISELSDLLLVNPYGSVYNTLKENSIEKQMPVRAVDGFLLPEEKLRGV 120
           GDRNQRCIDERFVISELSDLLLVNPYGSVYNTLKENSIEKQMPVRAVDGFLLPEEKLRGV
Sbjct: 61  GDRNQRCIDERFVISELSDLLLVNPYGSVYNTLKENSIEKQMPVRAVDGFLLPEEKLRGV 120

Query: 121 FLQKLNGKTAIEHALANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDASS 180
           FLQKLNGKTAIEHALANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDASS
Sbjct: 121 FLQKLNGKTAIEHALANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDASS 180

Query: 181 YNIILKALGRRGFFDSMMDVLYNMTREGVEATLEMVSIVVDSLVKGHQVSKALQFFRNLK 240
           YNIILKALGRRGFFDSMMDVLYNMTREGVEATLEMVSIVVDSLVKGHQVSKALQFFRNLK
Sbjct: 181 YNIILKALGRRGFFDSMMDVLYNMTREGVEATLEMVSIVVDSLVKGHQVSKALQFFRNLK 240

Query: 241 EIGLKCDTETLNILLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGRHGE 300
           EIGLKCDTETLNILLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGRHGE
Sbjct: 241 EIGLKCDTETLNILLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGRHGE 300

Query: 301 VEQMLKAMELDGFSPDCLTHTYLIECLGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           VEQMLKAMELDGFSPDCLTHTYLIECLGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 VEQMLKAMELDGFSPDCLTHTYLIECLGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARIIPT 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARIIPT
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARIIPT 420

Query: 421 TGAITSFIQLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWN 480
           TGAITSFIQLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWN
Sbjct: 421 TGAITSFIQLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWN 480

Query: 481 EMQESGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRRTRSKLNNKLLACN 540
           EMQESGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRRTRSKLNNKLLACN
Sbjct: 481 EMQESGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRRTRSKLNNKLLACN 540

Query: 541 RTEMAYKLWLKIKVARHQENLQRCWRAKGWHY 573
           RTEMAYKLWLKIKVARHQENLQRCWRAKGWHY
Sbjct: 541 RTEMAYKLWLKIKVARHQENLQRCWRAKGWHY 572

BLAST of CsaV3_4G037400 vs. TrEMBL
Match: tr|A0A1S4DZE5|A0A1S4DZE5_CUCME (putative pentatricopeptide repeat-containing protein At5g43820 OS=Cucumis melo OX=3656 GN=LOC103493826 PE=4 SV=1)

HSP 1 Score: 879.0 bits (2270), Expect = 5.7e-252
Identity = 525/572 (91.78%), Postives = 541/572 (94.58%), Query Frame = 0

Query: 1   MAFGASRRLIPYQLRACFLGLIASGRYHYPLIHSPSPALSYLFSTLDEPSNLFDDGLSGN 60
           MAFGASRRL+PYQ++ACFLGLIASGRYHYPLIHSPSPALSYLFSTLDEPSNLFDDG+SGN
Sbjct: 1   MAFGASRRLLPYQVKACFLGLIASGRYHYPLIHSPSPALSYLFSTLDEPSNLFDDGVSGN 60

Query: 61  GDRNQRCIDERFVISELSDLLLVNPYGSVYNTLKENSIEKQMPVRAVDGFLLPEEKLRGV 120
           GDRNQRCIDERFVISELSDLLLVNP+GSV NT+KEN  EKQ+P+RAVDGFLLPEEKLRGV
Sbjct: 61  GDRNQRCIDERFVISELSDLLLVNPHGSVSNTVKENLTEKQVPIRAVDGFLLPEEKLRGV 120

Query: 121 FLQKLNGKTAIEHALANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDASS 180
           FLQKLNGKTAIEHALANTDV LSQDVVSKVLNTGSLGSEAMVTFFYW+IKQPSIPKDASS
Sbjct: 121 FLQKLNGKTAIEHALANTDVNLSQDVVSKVLNTGSLGSEAMVTFFYWSIKQPSIPKDASS 180

Query: 181 YNIILKALGRRGFFDSMMDVLYNMTREGVEATLEMVSIVVDSLVKGHQVSKALQFFRNLK 240
           YNIILKALGRRGFFDSMMDVLY+MTREGV+ATLE VSIVVDSLVK HQVSKALQFFRNLK
Sbjct: 181 YNIILKALGRRGFFDSMMDVLYSMTREGVDATLETVSIVVDSLVKAHQVSKALQFFRNLK 240

Query: 241 EIGLKCDTETLNILLQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGRHGE 300
           EIGLKCDTETLNILLQCMCRRSHVGAANSF NLTKG+IPFNVMTYNI+IGGWSRYGRH E
Sbjct: 241 EIGLKCDTETLNILLQCMCRRSHVGAANSFLNLTKGSIPFNVMTYNIIIGGWSRYGRHSE 300

Query: 301 VEQMLKAMELDGFSPDCLTHTYLIECLGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360
           VEQ LKAME+DGFSPD            XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 301 VEQTLKAMEVDGFSPDYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 360

Query: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARIIPT 420
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX        ARIIPT
Sbjct: 361 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLEMFDEMVARIIPT 420

Query: 421 TGAITSFIQLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWN 480
           TGAITSFIQLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLL+IWN
Sbjct: 421 TGAITSFIQLSCSYGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLSIWN 480

Query: 481 EMQESGYDPDVETYEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRRTRSKLNNKLLACN 540
           EMQESGYDPDVETYEHAI CLCKTGQLENAVLVMEECLRQGFFPSR+ RSKLNNKLLACN
Sbjct: 481 EMQESGYDPDVETYEHAIGCLCKTGQLENAVLVMEECLRQGFFPSRQIRSKLNNKLLACN 540

Query: 541 RTEMAYKLWLKIKVARHQENLQRCWRAKGWHY 573
           RTEMAYKLWLKIKVARHQENLQRCWRAKGWHY
Sbjct: 541 RTEMAYKLWLKIKVARHQENLQRCWRAKGWHY 572

BLAST of CsaV3_4G037400 vs. TrEMBL
Match: tr|A0A2I4DPG5|A0A2I4DPG5_9ROSI (putative pentatricopeptide repeat-containing protein At5g43820 OS=Juglans regia OX=51240 GN=LOC108982187 PE=4 SV=1)

HSP 1 Score: 468.0 bits (1203), Expect = 3.0e-128
Identity = 352/559 (62.97%), Postives = 411/559 (73.52%), Query Frame = 0

Query: 18  FLGLIASGRYHYPLIHSPSPALSYLFST-LDEPSNLFDDGLSGNGDRNQRCIDERFVISE 77
           FL      RYH P +  PS   S+ +ST LD PSN  +D    +    +  ++ERFV+ +
Sbjct: 11  FLVKFNRARYHSPYLSLPSSLFSFPYSTGLDFPSNSLNDRPPPDHISCKTNLEERFVLEQ 70

Query: 78  LSDLLLVNP-YGSVYNTLKENSIEKQM-PVRAVDGFLLPEEKLRGVFLQKLNGKTAIEHA 137
           LSDLL ++P   S  N  K+ +  KQ+  VRAVDGFLLPEEKLRG+FLQKL GK AIE A
Sbjct: 71  LSDLLPISPGNASAPNLFKDCNPRKQIAQVRAVDGFLLPEEKLRGIFLQKLRGKAAIEEA 130

Query: 138 LANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDASSYNIILKALGRRGFF 197
           L N  V LS DVV+KV+N G+LG EAMV FF WA KQP IPKD +SY +I+KALGRR FF
Sbjct: 131 LTNVGVELSLDVVAKVVNRGNLGGEAMVIFFNWATKQPEIPKDINSYRVIIKALGRRKFF 190

Query: 198 DSMMDVLYNMTREGVEATLEMVSIVVDSLVKGHQVSKALQFFRNLKEIGLKCDTETLNIL 257
             +M +L +M  EGV   LE +SIV+DS V+ HQVSKA+Q F N +E GLKCDTE+LN+L
Sbjct: 191 RFVMTMLRDMRIEGVSIDLETLSIVLDSFVRAHQVSKAIQIFGNSEEFGLKCDTESLNVL 250

Query: 258 LQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGRHGEVEQMLKAMELDGFS 317
           L+ +C+RSHVGAANSF N  +G  PF+ MTYNI++GGWS++GR  E+E +L+AM  DGFS
Sbjct: 251 LRSLCQRSHVGAANSFLNSIRGKTPFDSMTYNIIVGGWSKFGRVSEMECVLEAMLADGFS 310

Query: 318 PDCLTHTYLIECLGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 377
           P             XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 311 PXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 370

Query: 378 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARIIPTTGAITSFIQLS-CS 437
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX                  CS
Sbjct: 371 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPLCS 430

Query: 438 YGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESGYDPDVET 497
           YGPPHAAM+IY KARK G RIS + YKLLLMRLS FGK GMLLN+W++M E G+  D+E 
Sbjct: 431 YGPPHAAMVIYNKARKFGRRISLSGYKLLLMRLSRFGKCGMLLNVWDDMHECGHSSDMEV 490

Query: 498 YEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRRTRSKLNNKLLACNRTEMAYKLWLKIK 557
           YE+ I+ LC TGQLENAVLVMEECLR+GF P R   SKLNNKLLA ++ E AYKL+LKIK
Sbjct: 491 YEYVINGLCNTGQLENAVLVMEECLRKGFCPGRLIYSKLNNKLLASDKVERAYKLYLKIK 550

Query: 558 VARHQENLQRCWRAKGWHY 573
            AR  EN +  WRA GWH+
Sbjct: 551 RARRDENARIYWRANGWHF 569

BLAST of CsaV3_4G037400 vs. TrEMBL
Match: tr|A0A2I4HI33|A0A2I4HI33_9ROSI (putative pentatricopeptide repeat-containing protein At5g43820 OS=Juglans regia OX=51240 GN=LOC109018011 PE=4 SV=1)

HSP 1 Score: 465.7 bits (1197), Expect = 1.5e-127
Identity = 351/559 (62.79%), Postives = 410/559 (73.35%), Query Frame = 0

Query: 18  FLGLIASGRYHYPLIHSPSPALSYLFST-LDEPSNLFDDGLSGNGDRNQRCIDERFVISE 77
           FL      RYH P +  PS   S+ +ST LD PSN  +D    +    +  ++ERFV+ +
Sbjct: 11  FLVKFNRARYHSPYLSLPSSLFSFPYSTGLDFPSNSLNDRPPPDHISCKTNLEERFVLEQ 70

Query: 78  LSDLLLVNP-YGSVYNTLKENSIEKQM-PVRAVDGFLLPEEKLRGVFLQKLNGKTAIEHA 137
           LSDLL ++P   S  N  K+ +  KQ+  VRAVDGFLLPEEKLRG+FLQKL GK AIE A
Sbjct: 71  LSDLLPISPGNASAPNLFKDCNPRKQIAQVRAVDGFLLPEEKLRGIFLQKLRGKAAIEEA 130

Query: 138 LANTDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDASSYNIILKALGRRGFF 197
           L N  V LS DVV+KV+N G+LG EAMV FF WA KQP IPKD +SY +I+KALGRR FF
Sbjct: 131 LTNVGVELSLDVVAKVVNRGNLGGEAMVIFFNWATKQPEIPKDINSYRVIIKALGRRKFF 190

Query: 198 DSMMDVLYNMTREGVEATLEMVSIVVDSLVKGHQVSKALQFFRNLKEIGLKCDTETLNIL 257
             +M +L +M  EGV   LE +SIV+DS V+ HQVSKA+Q F N +E GLKCDTE+L +L
Sbjct: 191 RFVMTMLRDMRIEGVSIDLETLSIVLDSFVRAHQVSKAIQIFGNSEEFGLKCDTESLTVL 250

Query: 258 LQCMCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGRHGEVEQMLKAMELDGFS 317
           L+ +C+RSHVGAANSF N  +G  PF+ MTYNI++GGWS++GR  E+E +L+AM  DGFS
Sbjct: 251 LRSLCQRSHVGAANSFLNSIRGKTPFDSMTYNIIVGGWSKFGRVSEMECVLEAMLADGFS 310

Query: 318 PDCLTHTYLIECLGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 377
           P             XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 311 PXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 370

Query: 378 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARIIPTTGAITSFIQLS-CS 437
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX                  CS
Sbjct: 371 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPLCS 430

Query: 438 YGPPHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESGYDPDVET 497
           YGPPHAAM+IY KARK G RIS + YKLLLMRLS FGK GMLLN+W++M E G+  D+E 
Sbjct: 431 YGPPHAAMVIYNKARKFGRRISLSGYKLLLMRLSRFGKCGMLLNVWDDMHECGHSSDMEV 490

Query: 498 YEHAIDCLCKTGQLENAVLVMEECLRQGFFPSRRTRSKLNNKLLACNRTEMAYKLWLKIK 557
           YE+ I+ LC TGQLENAVLVMEECLR+GF P R   SKLNNKLLA ++ E AYKL+LKIK
Sbjct: 491 YEYVINGLCNTGQLENAVLVMEECLRKGFCPGRLIYSKLNNKLLASDKVERAYKLYLKIK 550

Query: 558 VARHQENLQRCWRAKGWHY 573
            AR  EN +  WRA GWH+
Sbjct: 551 RARRDENARIYWRANGWHF 569

BLAST of CsaV3_4G037400 vs. TrEMBL
Match: tr|A5C8V0|A5C8V0_VITVI (Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VITISV_018999 PE=4 SV=1)

HSP 1 Score: 460.3 bits (1183), Expect = 6.3e-126
Identity = 343/556 (61.69%), Postives = 402/556 (72.30%), Query Frame = 0

Query: 18  FLGLIASGRYHYPLIHSPSPALSYLFSTLDEPSNLFDDGLSGNGDRNQRCIDERFVISEL 77
           FL   +  RYH   +  PS    + FSTL   SN   D  + N  +     +ER V+ +L
Sbjct: 8   FLSRFSRTRYHTRYL--PSSVSLFQFSTLQVTSNPLMDEPTDNQIKRPSNFNERDVLYQL 67

Query: 78  SDLLLVNPYGSVYNTLKENSIEKQMPVRAVDGFLLPEEKLRGVFLQKLNGKTAIEHALAN 137
           S LL +    S+     ENS ++Q+  RAVDGFL P EKLRGVF+Q+L GK AIE AL N
Sbjct: 68  SGLLPICCNTSISKPFTENSPKEQLKTRAVDGFLSPGEKLRGVFIQRLRGKAAIELALTN 127

Query: 138 TDVILSQDVVSKVLNTGSLGSEAMVTFFYWAIKQPSIPKDASSYNIILKALGRRGFFDSM 197
             + L+ D+VS+V N G+LG EAMV FF WA+KQP+IPKD  +YN+I+KALGRR F +  
Sbjct: 128 VGIDLTIDIVSEVXNRGNLGGEAMVXFFNWAVKQPTIPKDVDTYNVIIKALGRRKFIEFX 187

Query: 198 MDVLYNMTREGVEATLEMVSIVVDSLVKGHQVSKALQFFRNLKEIGLKCDTETLNILLQC 257
           + VL +M  +G+    E +SIV+DS +K  QVSKA++ FRNL+E G KCDTE+LN+LLQC
Sbjct: 188 VXVLKDMHIQGISPNYETLSIVMDSFIKARQVSKAIEMFRNLEEFGGKCDTESLNVLLQC 247

Query: 258 MCRRSHVGAANSFFNLTKGNIPFNVMTYNIVIGGWSRYGRHGEVEQMLKAM-ELDGFSPD 317
           +C+RSHVGAAN FFN  KG IPFN MTYNI+IGGWS+YG+ GE+E+ LKAM         
Sbjct: 248 LCQRSHVGAANLFFNAMKGGIPFNCMTYNIIIGGWSKYGKIGEMERCLKAMVAXXXXXXX 307

Query: 318 CLTHTYLIECLGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 377
                       XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Sbjct: 308 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 367

Query: 378 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARIIPTTGAITSFIQLSCSYGP 437
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX              +  C YGP
Sbjct: 368 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEPLCQYGP 427

Query: 438 PHAAMLIYKKARKVGCRISKNAYKLLLMRLSLFGKFGMLLNIWNEMQESGYDPDVETYEH 497
           PHAAM+IYKKARKVGCRIS +AYKLLLMRLS FGK GMLLN+W+EMQESGY  D E YE+
Sbjct: 428 PHAAMMIYKKARKVGCRISLSAYKLLLMRLSRFGKCGMLLNLWDEMQESGYSSDTEVYEY 487

Query: 498 AIDCLCKTGQLENAVLVMEECLRQGFFPSRRTRSKLNNKLLACNRTEMAYKLWLKIKVAR 557
            I+ LC  GQL+ AVLVMEE L +GF PSR  RSKLNNKLLA N+ EMAYKL+LKIK AR
Sbjct: 488 VINGLCNIGQLDTAVLVMEESLXKGFCPSRLIRSKLNNKLLASNKVEMAYKLFLKIKXAR 547

Query: 558 HQENLQRCWRAKGWHY 573
             +N +R WR  GWH+
Sbjct: 548 QNDNARRFWRGNGWHF 561

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_011654277.11.1e-275100.00PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820 [Cucum... [more]
XP_008452985.18.6e-25291.78PREDICTED: putative pentatricopeptide repeat-containing protein At5g43820 [Cucum... [more]
XP_022138983.18.5e-18374.04putative pentatricopeptide repeat-containing protein At5g43820 isoform X1 [Momor... [more]
XP_022975779.11.1e-18274.26putative pentatricopeptide repeat-containing protein At5g43820 [Cucurbita maxima... [more]
XP_022936331.16.8e-18074.43putative pentatricopeptide repeat-containing protein At5g43820 [Cucurbita moscha... [more]
Match NameE-valueIdentityDescription
AT5G43820.11.8e-6549.21Pentatricopeptide repeat (PPR) superfamily protein[more]
AT3G22670.18.3e-1524.90Pentatricopeptide repeat (PPR) superfamily protein[more]
AT5G15010.18.3e-1525.25Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT3G15200.12.1e-1022.04Tetratricopeptide repeat (TPR)-like superfamily protein[more]
AT5G60960.12.3e-0920.10Pentatricopeptide repeat (PPR) superfamily protein[more]
Match NameE-valueIdentityDescription
sp|P0C8R0|PP416_ARATH3.2e-6449.21Putative pentatricopeptide repeat-containing protein At5g43820 OS=Arabidopsis th... [more]
sp|Q9LUJ4|PP248_ARATH1.5e-1324.90Pentatricopeptide repeat-containing protein At3g22670, mitochondrial OS=Arabidop... [more]
sp|Q9LFQ4|PP383_ARATH1.5e-1325.25Pentatricopeptide repeat-containing protein At5g15010, mitochondrial OS=Arabidop... [more]
sp|Q9LIL5|PP233_ARATH3.8e-0922.04Putative pentatricopeptide repeat-containing protein At3g15200 OS=Arabidopsis th... [more]
sp|Q9FME4|PP438_ARATH4.2e-0820.10Pentatricopeptide repeat-containing protein PNM1, mitochondrial OS=Arabidopsis t... [more]
Match NameE-valueIdentityDescription
tr|A0A0A0L3E0|A0A0A0L3E0_CUCSA7.4e-276100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G663730 PE=4 SV=1[more]
tr|A0A1S4DZE5|A0A1S4DZE5_CUCME5.7e-25291.78putative pentatricopeptide repeat-containing protein At5g43820 OS=Cucumis melo O... [more]
tr|A0A2I4DPG5|A0A2I4DPG5_9ROSI3.0e-12862.97putative pentatricopeptide repeat-containing protein At5g43820 OS=Juglans regia ... [more]
tr|A0A2I4HI33|A0A2I4HI33_9ROSI1.5e-12762.79putative pentatricopeptide repeat-containing protein At5g43820 OS=Juglans regia ... [more]
tr|A5C8V0|A5C8V0_VITVI6.3e-12661.69Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VITISV_018999 PE=4 SV=1[more]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
Vocabulary: INTERPRO
TermDefinition
IPR002885Pentatricopeptide_repeat
IPR011990TPR-like_helical_dom_sf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0090305 nucleic acid phosphodiester bond hydrolysis
biological_process GO:0009451 RNA modification
cellular_component GO:0005575 cellular_component
cellular_component GO:0043231 intracellular membrane-bounded organelle
molecular_function GO:0005515 protein binding
molecular_function GO:0004519 endonuclease activity
molecular_function GO:0003723 RNA binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_4G037400.1CsaV3_4G037400.1mRNA


Analysis Name: InterPro Annotations of cucumber chineselong genome (v3)
Date Performed: 2019-03-04
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 428..571
e-value: 2.0E-16
score: 62.4
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 267..427
e-value: 6.3E-36
score: 126.3
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3DG3DSA:1.25.40.10coord: 132..266
e-value: 4.3E-20
score: 73.8
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILYSSF48452TPR-likecoord: 317..414
IPR002885Pentatricopeptide repeatTIGRFAMTIGR00756TIGR00756coord: 180..211
e-value: 3.2E-5
score: 21.8
coord: 354..386
e-value: 5.6E-7
score: 27.3
coord: 284..317
e-value: 2.8E-6
score: 25.1
coord: 389..416
e-value: 9.5E-6
score: 23.5
coord: 319..352
e-value: 3.1E-9
score: 34.4
IPR002885Pentatricopeptide repeatPFAMPF13812PPR_3coord: 305..361
e-value: 5.6E-10
score: 39.0
IPR002885Pentatricopeptide repeatPFAMPF12854PPR_1coord: 382..414
e-value: 2.5E-7
score: 30.2
IPR002885Pentatricopeptide repeatPFAMPF01535PPRcoord: 180..209
e-value: 0.0018
score: 18.3
coord: 493..522
e-value: 0.055
score: 13.6
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 351..385
score: 10.282
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 386..420
score: 10.008
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 316..350
score: 11.52
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 455..489
score: 7.892
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 490..524
score: 10.468
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 281..315
score: 10.885
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 212..246
score: 7.311
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 247..277
score: 6.358
IPR002885Pentatricopeptide repeatPROSITEPS51375PPRcoord: 177..211
score: 10.413
NoneNo IPR availablePANTHERPTHR24015:SF378SUBFAMILY NOT NAMEDcoord: 93..572
NoneNo IPR availablePANTHERPTHR24015FAMILY NOT NAMEDcoord: 93..572