Cp4.1LG04g04690 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG04g04690
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: LEA_2 domain-containing protein
Location: Cp4.1LG04: 6408102 .. 6411235 (+)
RNA-Seq Expression: Cp4.1LG04g04690
Synteny: Cp4.1LG04g04690
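The length of the gene span follows from the Location coordinates by simple arithmetic. The sketch below assumes both coordinates are 1-based and inclusive (the usual GFF3 convention, which this page does not state); `parse_location` and `span_length` are hypothetical helper names introduced only for illustration.

```python
import re

def parse_location(text):
    """Split 'SEQID: START .. END (STRAND)' into its four parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: start and end are both inclusive, so add 1.
    return end - start + 1

seqid, start, end, strand = parse_location("Cp4.1LG04: 6408102 .. 6411235 (+)")
length = span_length(start, end)
print(seqid, strand, length)
```

Under that assumption the feature spans 3134 bases on the plus strand of Cp4.1LG04.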
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ACACTTTCGTTTCATTTTTCTCTCGGCGTCCTGGCACGATGCACGCCAAATCGTACTCGGAGGTCACCAGCGTGGACCAGTCCTCGCCGGCGCGATCGCCGCGCCGGCCGCTGTACTACGTGCAGAGCCCTTCGAACCACGACGTCGAGAAAATGTCATACGGTTCGAGCCCTATGGGATCGCCGCCGCACCCTTTTTACCACGCCTCCCCAATCCACCATTCTCGTGAGTCGTCGACCTCACGATTCTCAGCGTCGCTTAAGAACAATATGAATCGGAACGGGAATCTCTCCGCGTGGAGGAAGCTTCACCGGCCGCCGGGTTACGACGAGGAGGAGGAGGAGGATGACGACGGCGACAACGACGGCGATCGGGATTCGAAATGGAACCGGAAGTTCCGATTGTACTTGTTTTTGTTTGTGCTGTTTGTTCTTCTTTTCACTGTTTTTTCCCTCATCCTCTGGGGCGCCAGCAAGTCCTTCCACCCACAAATTCTCGTTCAGGTAATTATTTTTAATTTTTTATTTTATTTTCAAGTAATAATTAATTTAATAATTTTCTAATTAACCATGGTTTGACTTAGTAATCGACCTAGACGAAGAACCGAGTCAAATTTTGGATAGAGCTTGTAGTCACGATGACCCATAACGACCCATAAGATAAGTTTGATCTTAGGCGACGTTTGGTCGAGTTTAGTTAATAAATCTTTAGTTAGAAGTCCGTATTTGGTACCAAACATGCTGGAATGACCCCTGCTCATCTGTGGGATAGTCTTTACTCTTAGAAGATACAATCTTAGTCCGGCGTTGGTACAAATTAACCATGGGTTGACTTAGTAATCCGTAAGGACGGAAAATCGAGTCAAATTTTAGATAGATAGATTAAAGTCGCGAAGACCCATAAGATAATTTTGATCTTAGGAGGCGTTTGGACGAGTTTAGTTAATAAATCTTTAGTTAGAAGTCTCTATTTGGTACAAAACATGCTAGAATGACTCGTGCAGGTTTGTGGGGGATAGTCTTCACTCTTAGAAGATTCAATCTTAGTCACAAGTCGTTAGAACGGAAAATCGAGTCAAATTTTAGAGAGATAGATTGAAGTCACGAAGACCCATAAGATAATTTTGATCTTAGGAGGCGTTTGGACGAGTTTAGTTAATAAATCTTTAGTTATAAGTATATGTTCATGCTGGAAGGACTCGTGTTGGTTCATGAAGGTAGTCTTCACTCTTAGGAACAAGATGTAAGTAATGTGACCATGTTGTAGGAGAATGTCGGGACCATCCTGGTTTGAATCTAAGACATGAGTCGTTACAAATTAACCATAATTTGGTCTAGTAATTGGTCGGGACGGAGAACCGAGCACATAGTCAAGTAATAGAGAGAATAGGCTGAAGATTCAATAAGTCCGTGTTTAAAAGTCGATCGAGTTTAATAAATCTTCGATTAGGGTTTACGAAGGATTATAAAATTGGAGCATAATTAAATTAAATTAATTTAATAAACTTTAAGGCTGAAAAGTATTTAAAAAATATTAATAAATTTTGGAACATATTACAGAGCATGGTGTTCGAGAAGTTTAACGTACAAGCAGGGAGTGATCCGGGAGGTGTGGCAACGGATCTGATGTCACTAAATTCAACGGTCAGGATCACGTACAAAAATCCTGCCACGTTTTTCGGGGTCCACGTCAGCTCCACTCCTTTTCAGCTCCATTATTTCCAGCTTCAAATAGCTTCTGGCCAGGTAATTTACTAAATTACCCCTCCCTTTAATTCCAATTTTAATTACGCTTTAATTTAATTTATTCGTAATTTAATATTATAAAATGGTGGGCACGTCGTTTAGCAATCAGAAATTATGGAAAAAATAATATATTTTTTTTTTAAATTTAAAATTTATTATTAATTTTAATTTATTTTGGATGAGGCTTTTCAGACATTGGGAGTAGTGACGATTACAACTCGCAATCAAGTGTCTGAATCTCTGAACACTTGTAGT
CTCTTTTCCCATGGTCTTTTTTTTTGTAATTTAGCCGTTGGGAGATAAAAGTAGGGGCATTTTAGTCTTTTCACTTACATGGGTTTAGGTTGAGTTAGATTTTAACCTTATGGGTTAGGTTATTCAGGTGCCTGATTGAGTTGAACCGAACCAACCTTATTAAATATTACGTTGATTCAGTATCTCCTCTAAAATTAATTTGAGTTGAACCGAACCAACTTTATGAATTAGGTTGATTCACGAACTTCTCTAAAATTAATTTGAGTTGAAACCGAATCAACTTTTATGAATCAGATTGATTCACGAGGTTCTCTAAAATTAATTTGAGTTGAACCGAACCAACTTTATGGAGTAGGTTGATTGAATTAATTGATTTTTTTACTTATTATAATTTTTTCAATTTTTAATTATCTTTTTGTATTTCTATTTTCGAAAAACTCTTATAAATAAAATTTTAAATTAAAACCCGAACAATTGAATTGGATCTAAAAAAATGTCCCAACCTTACCCGATTCAACCCATGAATACGCCCACTCATATATGTATATGTGTGTGTATATATATATATATATACATTTAAAGGGGAATAATACCAAAACCCTTTAGTACTCTATATTTGACCAAACATTAGGTGTACTAAATCCAATTATTGTCACGAGCGTGAGCCATTGATCAACCAAATGTGGAAACTGTTCACCAGATGGAGGAGTTCTACCAAAAGCGGCAGAGCTCTCGTAAGGTGACGACGTCTGTGTCGGGGCACCAAGTCCCGCTCTACGGCGGGATCTCGGCGATCGGGAATTGGAGAGACCAACGACAAGACGGGGTCGAGGTGCTACTGAACTTGACGGTGGCCGTGAGGTCCCGAGCTTACATTCTCGGGAGGCTGGTGAAGTCCACATTCCATACAAAGATTACATGTCCTGTGACTCTTAGTAACAAAAAGCTTGGGAAATCACACTCTTTCAATAAAACTTGTACTTATAATTGAGCTTTGTGTTCTCCTTTTGGGTGAGATTATGAAATTTTGTATCTACTAAGTAGCTTTTGAGAGAATCGTCTTTGTAATGTCGTACTTAATTATGTTTATTGGTAAAGATTTGACTTTTTTAGATCGAGACTGACTAGGGTTGT

mRNA sequence

ACACTTTCGTTTCATTTTTCTCTCGGCGTCCTGGCACGATGCACGCCAAATCGTACTCGGAGGTCACCAGCGTGGACCAGTCCTCGCCGGCGCGATCGCCGCGCCGGCCGCTGTACTACGTGCAGAGCCCTTCGAACCACGACGTCGAGAAAATGTCATACGGTTCGAGCCCTATGGGATCGCCGCCGCACCCTTTTTACCACGCCTCCCCAATCCACCATTCTCGTGAGTCGTCGACCTCACGATTCTCAGCGTCGCTTAAGAACAATATGAATCGGAACGGGAATCTCTCCGCGTGGAGGAAGCTTCACCGGCCGCCGGGTTACGACGAGGAGGAGGAGGAGGATGACGACGGCGACAACGACGGCGATCGGGATTCGAAATGGAACCGGAAGTTCCGATTGTACTTGTTTTTGTTTGTGCTGTTTGTTCTTCTTTTCACTGTTTTTTCCCTCATCCTCTGGGGCGCCAGCAAGTCCTTCCACCCACAAATTCTCGTTCAGAGCATGGTGTTCGAGAAGTTTAACGTACAAGCAGGGAGTGATCCGGGAGGTGTGGCAACGGATCTGATGTCACTAAATTCAACGGTCAGGATCACGTACAAAAATCCTGCCACGTTTTTCGGGGTCCACGTCAGCTCCACTCCTTTTCAGCTCCATTATTTCCAGCTTCAAATAGCTTCTGGCCAGATGGAGGAGTTCTACCAAAAGCGGCAGAGCTCTCGTAAGGTGACGACGTCTGTGTCGGGGCACCAAGTCCCGCTCTACGGCGGGATCTCGGCGATCGGGAATTGGAGAGACCAACGACAAGACGGGGTCGAGGTGCTACTGAACTTGACGGTGGCCGTGAGGTCCCGAGCTTACATTCTCGGGAGGCTGGTGAAGTCCACATTCCATACAAAGATTACATGTCCTGTGACTCTTAGTAACAAAAAGCTTGGGAAATCACACTCTTTCAATAAAACTTGTACTTATAATTGAGCTTTGTGTTCTCCTTTTGGGTGAGATTATGAAATTTTGTATCTACTAAGTAGCTTTTGAGAGAATCGTCTTTGTAATGTCGTACTTAATTATGTTTATTGGTAAAGATTTGACTTTTTTAGATCGAGACTGACTAGGGTTGT

Coding sequence (CDS)

ATGCACGCCAAATCGTACTCGGAGGTCACCAGCGTGGACCAGTCCTCGCCGGCGCGATCGCCGCGCCGGCCGCTGTACTACGTGCAGAGCCCTTCGAACCACGACGTCGAGAAAATGTCATACGGTTCGAGCCCTATGGGATCGCCGCCGCACCCTTTTTACCACGCCTCCCCAATCCACCATTCTCGTGAGTCGTCGACCTCACGATTCTCAGCGTCGCTTAAGAACAATATGAATCGGAACGGGAATCTCTCCGCGTGGAGGAAGCTTCACCGGCCGCCGGGTTACGACGAGGAGGAGGAGGAGGATGACGACGGCGACAACGACGGCGATCGGGATTCGAAATGGAACCGGAAGTTCCGATTGTACTTGTTTTTGTTTGTGCTGTTTGTTCTTCTTTTCACTGTTTTTTCCCTCATCCTCTGGGGCGCCAGCAAGTCCTTCCACCCACAAATTCTCGTTCAGAGCATGGTGTTCGAGAAGTTTAACGTACAAGCAGGGAGTGATCCGGGAGGTGTGGCAACGGATCTGATGTCACTAAATTCAACGGTCAGGATCACGTACAAAAATCCTGCCACGTTTTTCGGGGTCCACGTCAGCTCCACTCCTTTTCAGCTCCATTATTTCCAGCTTCAAATAGCTTCTGGCCAGATGGAGGAGTTCTACCAAAAGCGGCAGAGCTCTCGTAAGGTGACGACGTCTGTGTCGGGGCACCAAGTCCCGCTCTACGGCGGGATCTCGGCGATCGGGAATTGGAGAGACCAACGACAAGACGGGGTCGAGGTGCTACTGAACTTGACGGTGGCCGTGAGGTCCCGAGCTTACATTCTCGGGAGGCTGGTGAAGTCCACATTCCATACAAAGATTACATGTCCTGTGACTCTTAGTAACAAAAAGCTTGGGAAATCACACTCTTTCAATAAAACTTGTACTTATAATTGA

Protein sequence

MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIHHSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEEEDDDGDNDGDRDSKWNRKFRLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSLNSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQVPLYGGISAIGNWRDQRQDGVEVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNKKLGKSHSFNKTCTYN
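The protein sequence corresponds to a codon-by-codon translation of the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper, and the two inputs are the first 27 and last 18 bases of the CDS listed above, not the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered T, C, A, G
# at each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Step through complete codons only; trailing bases are ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGCACGCCAAATCGTACTCGGAGGTC"  # first 9 codons of the CDS
cds_end = "ACTTGTACTTATAATTGA"             # last 6 codons, ending in TGA
print(translate(cds_start), translate(cds_end))
```

The two fragments translate to the first nine and last five residues of the protein sequence shown above, with the terminal TGA read as the stop codon.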
Homology
BLAST of Cp4.1LG04g04690 vs. NCBI nr
Match: XP_022989441.1 (uncharacterized protein LOC111486495 [Cucurbita maxima] >XP_023529756.1 uncharacterized protein LOC111792484 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 623 bits (1607), Expect = 4.24e-225
Identity = 313/313 (100.00%), Positives = 313/313 (100.00%), Query Frame = 0

Query: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIH 60
           MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIH
Sbjct: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIH 60

Query: 61  HSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEEEDDDGDNDGDRDSKWNRKF 120
           HSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEEEDDDGDNDGDRDSKWNRKF
Sbjct: 61  HSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEEEDDDGDNDGDRDSKWNRKF 120

Query: 121 RLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSL 180
           RLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSL
Sbjct: 121 RLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSL 180

Query: 181 NSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQV 240
           NSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQV
Sbjct: 181 NSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQV 240

Query: 241 PLYGGISAIGNWRDQRQDGVEVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNKKL 300
           PLYGGISAIGNWRDQRQDGVEVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNKKL
Sbjct: 241 PLYGGISAIGNWRDQRQDGVEVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNKKL 300

Query: 301 GKSHSFNKTCTYN 313
           GKSHSFNKTCTYN
Sbjct: 301 GKSHSFNKTCTYN 313

BLAST of Cp4.1LG04g04690 vs. NCBI nr
Match: KAG6589033.1 (hypothetical protein SDJN03_17598, partial [Cucurbita argyrosperma subsp. sororia] >KAG7022748.1 hypothetical protein SDJN02_16484, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 622 bits (1604), Expect = 1.22e-224
Identity = 312/313 (99.68%), Positives = 313/313 (100.00%), Query Frame = 0

Query: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIH 60
           MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIH
Sbjct: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIH 60

Query: 61  HSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEEEDDDGDNDGDRDSKWNRKF 120
           HSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEE+DDDGDNDGDRDSKWNRKF
Sbjct: 61  HSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEEDDDDGDNDGDRDSKWNRKF 120

Query: 121 RLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSL 180
           RLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSL
Sbjct: 121 RLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSL 180

Query: 181 NSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQV 240
           NSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQV
Sbjct: 181 NSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQV 240

Query: 241 PLYGGISAIGNWRDQRQDGVEVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNKKL 300
           PLYGGISAIGNWRDQRQDGVEVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNKKL
Sbjct: 241 PLYGGISAIGNWRDQRQDGVEVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNKKL 300

Query: 301 GKSHSFNKTCTYN 313
           GKSHSFNKTCTYN
Sbjct: 301 GKSHSFNKTCTYN 313

BLAST of Cp4.1LG04g04690 vs. NCBI nr
Match: XP_022928427.1 (uncharacterized protein LOC111435243 [Cucurbita moschata])

HSP 1 Score: 621 bits (1601), Expect = 3.49e-224
Identity = 311/313 (99.36%), Positives = 313/313 (100.00%), Query Frame = 0

Query: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIH 60
           MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIH
Sbjct: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIH 60

Query: 61  HSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEEEDDDGDNDGDRDSKWNRKF 120
           HSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEE++DDDGDNDGDRDSKWNRKF
Sbjct: 61  HSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEDDDDDGDNDGDRDSKWNRKF 120

Query: 121 RLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSL 180
           RLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSL
Sbjct: 121 RLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSL 180

Query: 181 NSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQV 240
           NSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQV
Sbjct: 181 NSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQV 240

Query: 241 PLYGGISAIGNWRDQRQDGVEVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNKKL 300
           PLYGGISAIGNWRDQRQDGVEVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNKKL
Sbjct: 241 PLYGGISAIGNWRDQRQDGVEVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNKKL 300

Query: 301 GKSHSFNKTCTYN 313
           GKSHSFNKTCTYN
Sbjct: 301 GKSHSFNKTCTYN 313

BLAST of Cp4.1LG04g04690 vs. NCBI nr
Match: XP_008447896.1 (PREDICTED: uncharacterized protein LOC103490245 [Cucumis melo])

HSP 1 Score: 552 bits (1423), Expect = 5.05e-197
Identity = 279/315 (88.57%), Positives = 294/315 (93.33%), Query Frame = 0

Query: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIH 60
           MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPH FYHASPIH
Sbjct: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60

Query: 61  HSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEEEDDDGDNDGDRDSKWNRKF 120
           HSRESSTSRFSASLK+N NRNGN+SAWRKLH     D+++EEDD  + + DRDSKWNRKF
Sbjct: 61  HSRESSTSRFSASLKSNQNRNGNVSAWRKLHLAEDSDDDDEEDDGDEENEDRDSKWNRKF 120

Query: 121 RLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSL 180
           RLYL LF+ FVLLFTVFSLILWGASKSFHPQIL+QSMVF KFNVQAGSDPGGVATDLMSL
Sbjct: 121 RLYLILFLFFVLLFTVFSLILWGASKSFHPQILIQSMVFSKFNVQAGSDPGGVATDLMSL 180

Query: 181 NSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQV 240
           NSTVRI+Y+NPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSR++ TSV+GHQV
Sbjct: 181 NSTVRISYRNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRRMVTSVAGHQV 240

Query: 241 PLYGGISAIGNWRDQRQDGV--EVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNK 300
           PLYGGISAIGNWRDQRQDGV  EV LNLTVAVRSRAYILGRLVKSTFHT ITCP+TLS K
Sbjct: 241 PLYGGISAIGNWRDQRQDGVGVEVSLNLTVAVRSRAYILGRLVKSTFHTTITCPITLSTK 300

Query: 301 KLGKSHSFNKTCTYN 313
           KLGKSHSFN TCTYN
Sbjct: 301 KLGKSHSFNNTCTYN 315
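The Identity figure in each HSP header is the number of alignment columns with identical residues divided by the alignment length, which includes gap columns. The sketch below illustrates this on a 13-column excerpt of the alignment above (the region around the two-residue gap); `percent_identity` is a hypothetical helper, and gaps are assumed to be written as `-`.

```python
def percent_identity(query, subject):
    """Return (identical columns, alignment length, percent identity)."""
    if len(query) != len(subject):
        raise ValueError("aligned sequences must have equal length")
    # A column counts as identical only if both residues match
    # and neither is a gap character.
    identical = sum(
        1 for q, s in zip(query, subject) if q == s and q != "-"
    )
    length = len(query)
    return identical, length, round(100.0 * identical / length, 2)

# Excerpt: query carries a two-column gap, plus one L/S mismatch.
excerpt = percent_identity("RQDGV--EVLLNL", "RQDGVGVEVSLNL")

# Reproducing the header arithmetic from the reported counts.
reported_identity = round(100.0 * 279 / 315, 2)
reported_positives = round(100.0 * 294 / 315, 2)
print(excerpt, reported_identity, reported_positives)
```

The alignment length of 315 exceeds the 313-residue query because the two gap columns are counted in the denominator.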

BLAST of Cp4.1LG04g04690 vs. NCBI nr
Match: XP_038888376.1 (uncharacterized protein LOC120078225 [Benincasa hispida])

HSP 1 Score: 551 bits (1421), Expect = 9.12e-197
Identity = 279/313 (89.14%), Positives = 296/313 (94.57%), Query Frame = 0

Query: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIH 60
           MHAKSYSEVTS+DQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPH FYHASPIH
Sbjct: 1   MHAKSYSEVTSMDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60

Query: 61  HSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEEEDDDGDNDGDRDSKWNRKF 120
           HSRESSTSRFSASLKNN NRNGNLSAWRKLHRP   D++EE+D+D +ND DRDSKWNRKF
Sbjct: 61  HSRESSTSRFSASLKNNPNRNGNLSAWRKLHRPQDSDDDEEDDEDEEND-DRDSKWNRKF 120

Query: 121 RLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSL 180
           RLYLFLF+LFVLLFTVFSLILWGAS+SFHPQIL+QSMVFEKFNVQAGSDPGGVATDLMSL
Sbjct: 121 RLYLFLFLLFVLLFTVFSLILWGASRSFHPQILIQSMVFEKFNVQAGSDPGGVATDLMSL 180

Query: 181 NSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQV 240
           NSTVRITY+NPATFFGVHVSSTPF L Y+QLQIASGQMEEFYQKRQSSR+V TSV+GHQ+
Sbjct: 181 NSTVRITYRNPATFFGVHVSSTPFHLQYYQLQIASGQMEEFYQKRQSSRRVKTSVAGHQI 240

Query: 241 PLYGGISAIGNWRDQRQDGV--EVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNK 300
           PLYGGISAIGNWRDQRQDGV  E+ LNLTVAVRSRAYILGRLVKSTFHT ITCP+TLS K
Sbjct: 241 PLYGGISAIGNWRDQRQDGVGVEIPLNLTVAVRSRAYILGRLVKSTFHTTITCPITLSTK 300

Query: 301 KLGKSHSFNKTCT 311
           KLGK HSFN +CT
Sbjct: 301 KLGKFHSFNNSCT 312

BLAST of Cp4.1LG04g04690 vs. ExPASy TrEMBL
Match: A0A6J1JK28 (uncharacterized protein LOC111486495 OS=Cucurbita maxima OX=3661 GN=LOC111486495 PE=4 SV=1)

HSP 1 Score: 623 bits (1607), Expect = 2.05e-225
Identity = 313/313 (100.00%), Positives = 313/313 (100.00%), Query Frame = 0

Query: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIH 60
           MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIH
Sbjct: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIH 60

Query: 61  HSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEEEDDDGDNDGDRDSKWNRKF 120
           HSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEEEDDDGDNDGDRDSKWNRKF
Sbjct: 61  HSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEEEDDDGDNDGDRDSKWNRKF 120

Query: 121 RLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSL 180
           RLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSL
Sbjct: 121 RLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSL 180

Query: 181 NSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQV 240
           NSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQV
Sbjct: 181 NSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQV 240

Query: 241 PLYGGISAIGNWRDQRQDGVEVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNKKL 300
           PLYGGISAIGNWRDQRQDGVEVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNKKL
Sbjct: 241 PLYGGISAIGNWRDQRQDGVEVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNKKL 300

Query: 301 GKSHSFNKTCTYN 313
           GKSHSFNKTCTYN
Sbjct: 301 GKSHSFNKTCTYN 313

BLAST of Cp4.1LG04g04690 vs. ExPASy TrEMBL
Match: A0A6J1EJW4 (uncharacterized protein LOC111435243 OS=Cucurbita moschata OX=3662 GN=LOC111435243 PE=4 SV=1)

HSP 1 Score: 621 bits (1601), Expect = 1.69e-224
Identity = 311/313 (99.36%), Positives = 313/313 (100.00%), Query Frame = 0

Query: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIH 60
           MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIH
Sbjct: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIH 60

Query: 61  HSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEEEDDDGDNDGDRDSKWNRKF 120
           HSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEE++DDDGDNDGDRDSKWNRKF
Sbjct: 61  HSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEDDDDDGDNDGDRDSKWNRKF 120

Query: 121 RLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSL 180
           RLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSL
Sbjct: 121 RLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSL 180

Query: 181 NSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQV 240
           NSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQV
Sbjct: 181 NSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQV 240

Query: 241 PLYGGISAIGNWRDQRQDGVEVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNKKL 300
           PLYGGISAIGNWRDQRQDGVEVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNKKL
Sbjct: 241 PLYGGISAIGNWRDQRQDGVEVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNKKL 300

Query: 301 GKSHSFNKTCTYN 313
           GKSHSFNKTCTYN
Sbjct: 301 GKSHSFNKTCTYN 313

BLAST of Cp4.1LG04g04690 vs. ExPASy TrEMBL
Match: A0A1S3BJ42 (uncharacterized protein LOC103490245 OS=Cucumis melo OX=3656 GN=LOC103490245 PE=4 SV=1)

HSP 1 Score: 552 bits (1423), Expect = 2.45e-197
Identity = 279/315 (88.57%), Positives = 294/315 (93.33%), Query Frame = 0

Query: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIH 60
           MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPH FYHASPIH
Sbjct: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60

Query: 61  HSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEEEDDDGDNDGDRDSKWNRKF 120
           HSRESSTSRFSASLK+N NRNGN+SAWRKLH     D+++EEDD  + + DRDSKWNRKF
Sbjct: 61  HSRESSTSRFSASLKSNQNRNGNVSAWRKLHLAEDSDDDDEEDDGDEENEDRDSKWNRKF 120

Query: 121 RLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSL 180
           RLYL LF+ FVLLFTVFSLILWGASKSFHPQIL+QSMVF KFNVQAGSDPGGVATDLMSL
Sbjct: 121 RLYLILFLFFVLLFTVFSLILWGASKSFHPQILIQSMVFSKFNVQAGSDPGGVATDLMSL 180

Query: 181 NSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQV 240
           NSTVRI+Y+NPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSR++ TSV+GHQV
Sbjct: 181 NSTVRISYRNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRRMVTSVAGHQV 240

Query: 241 PLYGGISAIGNWRDQRQDGV--EVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNK 300
           PLYGGISAIGNWRDQRQDGV  EV LNLTVAVRSRAYILGRLVKSTFHT ITCP+TLS K
Sbjct: 241 PLYGGISAIGNWRDQRQDGVGVEVSLNLTVAVRSRAYILGRLVKSTFHTTITCPITLSTK 300

Query: 301 KLGKSHSFNKTCTYN 313
           KLGKSHSFN TCTYN
Sbjct: 301 KLGKSHSFNNTCTYN 315

BLAST of Cp4.1LG04g04690 vs. ExPASy TrEMBL
Match: A0A0A0K4T2 (LEA_2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G018790 PE=4 SV=1)

HSP 1 Score: 543 bits (1400), Expect = 7.81e-194
Identity = 275/315 (87.30%), Positives = 289/315 (91.75%), Query Frame = 0

Query: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIH 60
           MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPH FYHASPIH
Sbjct: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60

Query: 61  HSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEEEDDDGDNDGDRDSKWNRKF 120
           HSRESSTSRFSASLK N NRNGN+SAWRKLH     D ++EEDD+ + + DRDSKWNRKF
Sbjct: 61  HSRESSTSRFSASLKINQNRNGNVSAWRKLHHAQDSDGDDEEDDEEEENEDRDSKWNRKF 120

Query: 121 RLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSL 180
           RLYL LF+ F+LLFTVFSLILWGASKSFHPQIL+QSMVF KFNVQAGSDPGGVATDLMSL
Sbjct: 121 RLYLILFLFFILLFTVFSLILWGASKSFHPQILIQSMVFSKFNVQAGSDPGGVATDLMSL 180

Query: 181 NSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQV 240
           NSTVRI+YKNPATFFGVHVSSTP QLHY QLQ+ASGQMEEFYQKRQSSR+V TSV+GHQV
Sbjct: 181 NSTVRISYKNPATFFGVHVSSTPIQLHYLQLQVASGQMEEFYQKRQSSRRVVTSVAGHQV 240

Query: 241 PLYGGISAIGNWRDQRQDG--VEVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNK 300
           PLYGGISAIGNWRDQRQDG  VEV LNLTVAVRSRAYILGRLVKSTFHT ITCP+TLS  
Sbjct: 241 PLYGGISAIGNWRDQRQDGAGVEVSLNLTVAVRSRAYILGRLVKSTFHTTITCPITLSTN 300

Query: 301 KLGKSHSFNKTCTYN 313
           KLGKSHSFN TC YN
Sbjct: 301 KLGKSHSFNNTCIYN 315

BLAST of Cp4.1LG04g04690 vs. ExPASy TrEMBL
Match: A0A6J1C5K4 (uncharacterized protein LOC111007587 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111007587 PE=4 SV=1)

HSP 1 Score: 494 bits (1271), Expect = 2.90e-174
Identity = 252/312 (80.77%), Positives = 276/312 (88.46%), Query Frame = 0

Query: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIH 60
           MHAKSYSEVTSV+QSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPH FYHASPIH
Sbjct: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60

Query: 61  HSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEEEDDDGDNDGDRDSKWNRKF 120
           HSRESSTSRFSASLK N     NL+AWRKLHRP   D ++++DD      DRDS+W RKF
Sbjct: 61  HSRESSTSRFSASLKPNXR---NLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKF 120

Query: 121 RLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSL 180
           RLYLFLFV FVLLFTVFSLILWGAS+SFHPQI++QSMVFE+FNVQAGSD GGVATDLMSL
Sbjct: 121 RLYLFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSL 180

Query: 181 NSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQV 240
           NSTVRI Y+NPATFFGVHVSS+P QL+YFQLQIASGQM EFY+KRQSSR+V T+V+GHQV
Sbjct: 181 NSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQV 240

Query: 241 PLYGGISAIGNWRDQRQDGVEVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNKKL 300
           PLYGGI+ IGNWR+QRQ+GVEV LNLTVAVRSRAYILG+LVKSTFH  ITC +TL  K L
Sbjct: 241 PLYGGIAVIGNWREQRQEGVEVPLNLTVAVRSRAYILGKLVKSTFHXTITCSLTLRTKNL 300

Query: 301 GKSHSFNKTCTY 312
           GK HS N +C Y
Sbjct: 301 GKFHSLNNSCIY 309

BLAST of Cp4.1LG04g04690 vs. TAIR 10
Match: AT2G41990.1 (CONTAINS InterPro DOMAIN/s: Late embryogenesis abundant protein, group 2 (InterPro:IPR004864); BEST Arabidopsis thaliana protein match is: Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (TAIR:AT4G35170.1); Has 172 Blast hits to 168 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 172; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 273.1 bits (697), Expect = 2.8e-73
Identity = 160/315 (50.79%), Positives = 207/315 (65.71%), Query Frame = 0

Query: 1   MHAKSYSEVTSVDQS--SPARSPRRPLYYVQSPSNHDVEKMSYGS--SPMGSPPHP-FYH 60
           MHAK+ SE TS+D +  SP RS  RPLYYVQSPSNHDVEKMS+GS  S MGSP HP +YH
Sbjct: 1   MHAKTDSEATSIDAAALSPPRSAIRPLYYVQSPSNHDVEKMSFGSGCSLMGSPTHPHYYH 60

Query: 61  ASPIHHSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEEEDDDGDNDGDRDSK 120
            SPIHHSRESSTSRFS         +  L +++ +     Y  + ++  DG +D D    
Sbjct: 61  CSPIHHSRESSTSRFS---------DRALLSYKSIRERRRYINDGDDKTDGGDDDDP--- 120

Query: 121 WNRKFRLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVAT 180
             R  RLY++L +  + LFTVFSLILWGASKS+ P++ V+ M+    N+QAG+D  GV T
Sbjct: 121 -FRNVRLYVWLLLSVIFLFTVFSLILWGASKSYPPKVTVKGMLVRDLNLQAGNDLSGVPT 180

Query: 181 DLMSLNSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSV 240
           D++SLNSTVRI Y+NP+TFF VHV+++P  LHY  L ++SG+M +F   R     V T V
Sbjct: 181 DMLSLNSTVRIYYRNPSTFFAVHVTASPLLLHYSNLLLSSGEMNKFTVGRNGETNVVTVV 240

Query: 241 SGHQVPLYGGISAIGNWRDQRQDGVEVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTL 300
            GHQ+PLYGG+S          D + + LNLT+ + S+AYILGRLV S F+T+I C  TL
Sbjct: 241 QGHQIPLYGGVSF-------HLDTLSLPLNLTIVLHSKAYILGRLVTSKFYTRIICSFTL 295

Query: 301 SNKKLGKSHSFNKTC 311
               L KS S  ++C
Sbjct: 301 DANHLPKSISLLRSC 295

BLAST of Cp4.1LG04g04690 vs. TAIR 10
Match: AT1G45688.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G42860.1); Has 258 Blast hits to 242 proteins in 39 species: Archae - 0; Bacteria - 11; Metazoa - 10; Fungi - 14; Plants - 198; Viruses - 17; Other Eukaryotes - 8 (source: NCBI BLink). )

HSP 1 Score: 229.6 bits (584), Expect = 3.6e-60
Identity = 155/346 (44.80%), Positives = 206/346 (59.54%), Query Frame = 0

Query: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPS--NHDVEK--MSYGS----SPMGSPPHP 60
           MHAK+ SEVTS+  SSPARSPRRP+YYVQSPS  +HD EK   S+ S    SPMGSPPH 
Sbjct: 1   MHAKTDSEVTSLAASSPARSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLSPMGSPPHS 60

Query: 61  FYHASPIHHSRESSTSRFSASLKNNMNR-NGNLSAWRKLH------RPPGYDEEEEEDDD 120
             H+S   HSRESS+SRFS SLK    + N N  + RK H      +     EEE   DD
Sbjct: 61  --HSSMGRHSRESSSSRFSGSLKPGSRKVNPNDGSKRKGHGGEKQWKECAVIEEEGLLDD 120

Query: 121 GDNDGDRDSKWNRKFRLYLFLFVL-FVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNV 180
           GD DG          R Y+  F++ F +LF  FSLIL+GA+K   P+I V+S+ FE   +
Sbjct: 121 GDRDGGVPR------RCYVLAFIVGFFILFGFFSLILYGAAKPMKPKITVKSITFETLKI 180

Query: 181 QAGSDPGGVATDLMSLNSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQK 240
           QAG D GGV TD++++N+T+R+ Y+N  TFFGVHV+STP  L + Q++I SG +++FYQ 
Sbjct: 181 QAGQDAGGVGTDMITMNATLRMLYRNTGTFFGVHVTSTPIDLSFSQIKIGSGSVKKFYQG 240

Query: 241 RQSSRKVTTSVSGHQVPLYGGISAI-------GNWRDQRQDG------------VEVLLN 300
           R+S R V   V G ++PLYG  S +          + +++ G              V + 
Sbjct: 241 RKSERTVLVHVIGEKIPLYGSGSTLLPPAPPAPLPKPKKKKGAPVPIPDPPAPPAPVPMT 300

Query: 301 LTVAVRSRAYILGRLVKSTFHTKITCPVTLSNKKLGKSHSFNKTCT 312
           L+  VRSRAY+LG+LV+  F+ KI C +   +K L K     K CT
Sbjct: 301 LSFVVRSRAYVLGKLVQPKFYKKIECDINFEHKNLNKHIVITKNCT 338

BLAST of Cp4.1LG04g04690 vs. TAIR 10
Match: AT4G35170.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 224.9 bits (572), Expect = 8.7e-59
Identity = 142/303 (46.86%), Positives = 185/303 (61.06%), Query Frame = 0

Query: 14  QSSPARSPRRPLYYVQSPSNHDVEKMSYGS--SPMGSPPHPFYHASPIHH---SRESSTS 73
           +SSP ++ R+P+Y V SP N DV+K+S GS  SP GSP +     S   H   +  SS  
Sbjct: 7   RSSP-QNTRKPVYVVHSPPNTDVDKISTGSGFSPFGSPLNDQGQVSNFQHHSVAESSSYP 66

Query: 74  RFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEEEDDDGDNDGDRDSKWNRKFRLYLFLFV 133
           R S  L+N  +         ++H     D    ED+D D     D K  R  R Y  L  
Sbjct: 67  RSSGPLRNEYSS-------VQVH---DLDRRTHEDEDYDEMDGPDEKRRRITRFYSCLLF 126

Query: 134 LFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSLNSTVRITY 193
             VL FT+F LILWG SKSF P   ++ MV E  NVQ+G+D  GV TD+++LNSTVRI Y
Sbjct: 127 TLVLAFTLFCLILWGVSKSFAPIATLKEMVLENLNVQSGNDQSGVLTDMLTLNSTVRILY 186

Query: 194 KNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQVPLYGGISA 253
           +NPATFF VHV+S P QL Y QL +ASGQM EF Q+R+S R + T V G Q+PLYGG+ A
Sbjct: 187 RNPATFFTVHVTSAPLQLSYSQLILASGQMGEFSQRRKSERIIETKVFGDQIPLYGGVPA 246

Query: 254 IGNWRDQRQDGVEVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNKKLGKSHSFNK 312
           +   R +  D V + LNLT  +R+RAY+LGRLVK+TFH+ I C +T    KLGK+   +K
Sbjct: 247 LFGQRAE-PDQVVLPLNLTFTLRARAYVLGRLVKTTFHSNIKCSITFYGDKLGKTLDLSK 297

BLAST of Cp4.1LG04g04690 vs. TAIR 10
Match: AT5G42860.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 19 plant structures; EXPRESSED DURING: 11 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G45688.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 200.3 bits (508), Expect = 2.3e-51
Identity = 136/339 (40.12%), Positives = 191/339 (56.34%), Query Frame = 0

Query: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPS--NHDVEKMSYG-------SSPMGSPPH 60
           MHAK+ SEVTS+  SSP RSPRRP Y+VQSPS  +HD EK +         +SPMGSPPH
Sbjct: 1   MHAKTDSEVTSLSASSPTRSPRRPAYFVQSPSRDSHDGEKTATSFHSTPVLTSPMGSPPH 60

Query: 61  PFYHASPIHHSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEEEDDDGDNDGD 120
                        SS+SRFS    N   R G+        +     EEE   DDGD + +
Sbjct: 61  -----------SHSSSSRFSKI--NGSKRKGHAG-----EKQFAMIEEEGLLDDGDREQE 120

Query: 121 RDSKWNRKFRLYLFLFVL-FVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDP 180
              +     R Y+  F++ F LLF  FSLIL+ A+K   P+I V+S+ FE+  VQAG D 
Sbjct: 121 ALPR-----RCYVLAFIVGFSLLFAFFSLILYAAAKPQKPKISVKSITFEQLKVQAGQDA 180

Query: 181 GGVATDLMSLNSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRK 240
           GG+ TD++++N+T+R+ Y+N  TFFGVHV+S+P  L + Q+ I SG +++FYQ R+S R 
Sbjct: 181 GGIGTDMITMNATLRMLYRNTGTFFGVHVTSSPIDLSFSQITIGSGSIKKFYQSRKSQRT 240

Query: 241 VTTSVSGHQVPLYGGISAI-------GNWRDQRQDGVEVL-----------LNLTVAVRS 300
           V  +V G ++PLYG  S +          + +++ G  V+           + L   VRS
Sbjct: 241 VVVNVLGDKIPLYGSGSTLVPPPPPAPIPKPKKKKGPIVIVEPPAPPAPVPMRLNFTVRS 300

Query: 301 RAYILGRLVKSTFHTKITCPVTLSNKKLGKSHSFNKTCT 312
           RAY+LG+LV+  F+ +I C +   +KKL K       CT
Sbjct: 301 RAYVLGKLVQPKFYKRIVCLINFEHKKLSKHIPITNNCT 316

BLAST of Cp4.1LG04g04690 vs. TAIR 10
Match: AT1G45688.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G42860.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 176.4 bits (446), Expect = 3.6e-44
Identity = 121/246 (49.19%), Positives = 155/246 (63.01%), Query Frame = 0

Query: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPS--NHDVEK--MSYGS----SPMGSPPHP 60
           MHAK+ SEVTS+  SSPARSPRRP+YYVQSPS  +HD EK   S+ S    SPMGSPPH 
Sbjct: 1   MHAKTDSEVTSLAASSPARSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLSPMGSPPHS 60

Query: 61  FYHASPIHHSRESSTSRFSASLKNNMNR-NGNLSAWRKLH------RPPGYDEEEEEDDD 120
             H+S   HSRESS+SRFS SLK    + N N  + RK H      +     EEE   DD
Sbjct: 61  --HSSMGRHSRESSSSRFSGSLKPGSRKVNPNDGSKRKGHGGEKQWKECAVIEEEGLLDD 120

Query: 121 GDNDGDRDSKWNRKFRLYLFLFVL-FVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNV 180
           GD DG          R Y+  F++ F +LF  FSLIL+GA+K   P+I V+S+ FE   +
Sbjct: 121 GDRDGGVPR------RCYVLAFIVGFFILFGFFSLILYGAAKPMKPKITVKSITFETLKI 180

Query: 181 QAGSDPGGVATDLMSLNSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQK 231
           QAG D GGV TD++++N+T+R+ Y+N  TFFGVHV+STP  L + Q++I SG +    QK
Sbjct: 181 QAGQDAGGVGTDMITMNATLRMLYRNTGTFFGVHVTSTPIDLSFSQIKIGSGSVSLPIQK 238

The following BLAST results are available for this feature:
Match Name      E-value    Identity  Description
XP_022989441.1  4.24e-225  100.00    uncharacterized protein LOC111486495 [Cucurbita maxima] >XP_023529756.1 uncharac...
KAG6589033.1    1.22e-224  99.68     hypothetical protein SDJN03_17598, partial [Cucurbita argyrosperma subsp. sorori...
XP_022928427.1  3.49e-224  99.36     uncharacterized protein LOC111435243 [Cucurbita moschata]
XP_008447896.1  5.05e-197  88.57     PREDICTED: uncharacterized protein LOC103490245 [Cucumis melo]
XP_038888376.1  9.12e-197  89.14     uncharacterized protein LOC120078225 [Benincasa hispida]

Match Name      E-value    Identity  Description
A0A6J1JK28      2.05e-225  100.00    uncharacterized protein LOC111486495 OS=Cucurbita maxima OX=3661 GN=LOC111486495...
A0A6J1EJW4      1.69e-224  99.36     uncharacterized protein LOC111435243 OS=Cucurbita moschata OX=3662 GN=LOC1114352...
A0A1S3BJ42      2.45e-197  88.57     uncharacterized protein LOC103490245 OS=Cucumis melo OX=3656 GN=LOC103490245 PE=...
A0A0A0K4T2      7.81e-194  87.30     LEA_2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G018790 PE=4 ...
A0A6J1C5K4      2.90e-174  80.77     uncharacterized protein LOC111007587 isoform X1 OS=Momordica charantia OX=3673 G...

Match Name      E-value    Identity  Description
AT2G41990.1     2.8e-73    50.79     CONTAINS InterPro DOMAIN/s: Late embryogenesis abundant protein, group 2 (InterP...
AT1G45688.1     3.6e-60    44.80     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G35170.1     8.7e-59    46.86     Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
AT5G42860.1     2.3e-51    40.12     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G45688.2     3.6e-44    49.19     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                                      Source       Source Term      Source Description                                                         Alignment
IPR004864  Late embryogenesis abundant protein, LEA_2 subgroup  PFAM         PF03168          LEA_2                                                                      coord: 186..291; e-value: 8.2E-11; score: 42.4
None       No IPR available                                     MOBIDB_LITE  mobidb-lite      disorder_prediction                                                        coord: 1..58
None       No IPR available                                     MOBIDB_LITE  mobidb-lite      disorder_prediction                                                        coord: 84..111
None       No IPR available                                     PANTHER      PTHR31852        LATE EMBRYOGENESIS ABUNDANT (LEA) HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY  coord: 52..313
None       No IPR available                                     PANTHER      PTHR31852:SF175  LATE EMBRYOGENESIS ABUNDANT PROTEIN                                        coord: 52..313

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG04g04690.1  Cp4.1LG04g04690.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane