CmoCh19G004420 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh19G004420
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
Location: Cmo_Chr19: 5389392 .. 5394402 (+)
RNA-Seq Expression: CmoCh19G004420
Synteny: CmoCh19G004420
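
The location value above can be split into its parts and turned into a genomic span. The sketch below is illustrative only: the function name parse_location is hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual GFF-style convention), which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Split 'Chrom: start .. end (strand)' into its four parts."""
    m = re.fullmatch(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Cmo_Chr19: 5389392 .. 5394402 (+)")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the gene spans 5011 bp on the plus strand of Cmo_Chr19.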
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCGTTATCAAGAGGCGCCGCCCAGAGCCGCCCTCCGTACATCCCCAGATCCTCCTCCTCCTCCGCCTCCTTCAAGGGCTGCTGCTGCTGCCTGTTCCTCCTCTTCTCCTTCCTAGCTCTCTTAATCCTCGCCGTCGTCCTTGTCGTCGTCCTCGCCCTAAAGCCAAAGAAGCCCCAATTCGATCTCCAGCAGGTGGGCATCCAGTACATGAACATAACCACTCCCAATCCCACCGCCGCCTCCCTTTCCCTCGACATTCGAATGATTTTCACCGCCGTTAACCCCAACAAAGTGGGAATCAAGTACGGGGAGTCTCGGTTCACGGTCATGTACCGAGGGATTCCACTTGGGAGAGCTTCAATTCCCGGATTCTTTCAAGACCCACATAGCCAACGCCAGGTCGACGCCACCATCGCCGTCGATCGTGTCAGCCTCCTCCAAGCCGACGCCGCTGATTTGATCCGTGACGCCTCGTTAAACGACCGTGTGGAGCTGAGGATACTCGGCGATGTTGCCGCTAAGATCCGCCTCTTGTCCTTTGATTCCCCCGGCGTTCAGGTACCACTAATCTCTCTAAGGGCATCCTTGTAATTTCATCATTCTAATGCTCTGCCCCTGTCCTGTCCTGTCCTGTCCTTCCCTGCCCTATTGTCTCTGCCATTTGTCCCCCATTTTAATGCTCTTTTTCCCCCACATTTCTAAATCCTCAAGGCCACATTATGCATGACTAGAAAAGATTTAGATAGTTGTACTAAAACCCAATTAGGATCGAGCTAAACTCATTTTAACGAAAACCCAATTCCGATCGAGTTAAACTCATTTTAACGAAAACCCAATTATGATCGAGCTGAACTCATTTTAACGAAAACCCTTTGATGTATGATTGTTTTATAGGATAATACGAACCATGAAAAGGATTTGTGTTAAAAACGAGAAACAGAAGTGAGATTGAAAGTGTTATCTAATCGTATACAAGCTACCTAACTATCAGTGATGGTTTAAAAAATTTGATGAATTCAGACTACCCACCCAAACTATAAAAGTTGAGTTGTATTAGATTTATTTGTTTTTTTGATTGAAAATCTAAAAATTCAATCCAGTTTGCTAGTTGACAATTTTTTTTATCGAGTCAATTAGACCAATTTAAAAATAGGGTTACAATCTAGTGCAACGTGACTTTTGGATGAATATTTGTTAATGTTTAGGTATGGGTGGATTGTGCAATTGTGATTAGTCCAAGGAAGCAATCTCTCACATACAAGCAATGTGGTTTTGATGGCTTAAATGTATGACTCACTCCCTCTTCCCTTCCCATCTAAAAAAATTCTTTCATTACTTTGTCTTTTAACCCCTTCGTTTCGGTTCCTAAATAATTAGGTCATTCAGTCTTTACCAATAATTCGAAGGGCTTAAATAGATGAAAATTTAGAGACTAAATTGAATACTTCAAGATTTTTTTAGAAACCCGAAGTGTGTAATATAGTGGAGTAAATTATATTAAATGAAATATAATTATAAATTTTATACAATATTAATAATGTATTAAAAAATAATAAATAAAATATTTAAATGTATATATAAAATAATTATTACAATAGAATACTTATTTATTTTTTATTAATTATAATTAGAATTTGAATCAATTAATTATACCCCTAAAATATATCTAATTGAAAAAAGAAAATAGTTATAAAAGAGAGAGAAACAATACTAAACTTATATAGTTGAAAAAAAAAAATAATGTGTAACAAACCTAAATTTTTATATATTTAGAGTCGTTATTGATGTCTCTTAACGTAGAAATATAACTCTTAAATGAACATTATTTATAGCATATACTTTGAAAACATTCGGAACAACCTTAAATTTCCAGAAAATACGGTCGGTTTTACGTGTTTCGAACAAAAACACCGGAATTTAAAACAATAAAAATAATTACAAAATAAAATAATTTAAATGATGAAATGCCATCATCCTATTCTAACCAAAGACTAGGA
AATTAAATATGTGACTACCTTATGCACGTGCCACAGTCTCTATTTGCGATGCTGCTGTCATCCGTATATGAGCGTCTTGCCTTTACCTGAAAAATGATATGACACACATCTTGAGTATTGAAGAATACTCAGTAAGTGATCCCACTATAGGGGCCATGTTAATGCAAACACATGCAATCCTGCTCTTGGGACCTATCTTTATTCTCTTGGGGTGGCCTCTAGTCTCAGACTAATTTGGATGCGTAGTACTTCCCTACTCACGACCCACACGTGCGAGTGTGAATCCCTAGGGGGCTTGCACACCCCCTGGACCCTCTCCGGTCAAGCGAATCTGTAAGGACGTCCGAAGGTCAACTGAACCTCTTGGTGTCATGCTCCCCATGAACACACTCATCATCATACTGTGCGTTCGCACTCATCGTCTTTGAGGAAGTATTTCTATGCTTATAAACATTCAACATGCATGAGATCCCTCTAATGTCTACCGTATCATCTCTTCTGACACAATCCTCTAGTTTCATTTATTTGTTACATTAACACATGCTCGTGAATATGTATATTAATATCATTTTCATGCTCTTAGGGTTGTGTCCTATGTCGATCTATCGACATGATGCAATATGACACTCATAATCATAAAGCATGCTTAACATGAAAAACATCAACATATTAAGATAAACATCATGTAATCATCATACTCATCATAATGTCATCAAATGCATCAAACATATCATGTACGTCATGCATCGGAACATAAATCACGTGCATATAACATGCATCACAGCTCAAACATCATACTCATGCATCATCATAGCTGATACATCATCATGCTCGTAAATATAACTCTAACGCATACAATTTTCTAGCCTATCCTATAGTAAGGCCACTTCCTTGGTTGGCCTTGGCCGAAGTACCCCCTAATTAGCTAAACGTAAGCTCTCGTTCCTTGTGAGCTGCTCCAAACGTTGCTGGAGTTTGTTTCCTATCACATCGTTCGTCACCTTCCATTAGTATTTCACAATTAAATTAATAATGAATTAATTAACGTGAAATATGGCCCAAAATGACCTCAGTTACCTTAATCGGGGTCGAAAATAGTCCAAAAGAATCGAAACAGTGGAAACTAGGCCAAAATCAAAGAGTCGAGCCAAACAGCCGCTGGGAGCCTCCGGAGAGCCTTCGAAAAATTTGTCGGAGAGCCGCTGGAGGAAGACTACGAGCCGTCTAAGCTGAGACGGGCCGAGAGATTGCCTGTGTTGCTGGGTGCCACGTGGCAGCAACAAAGCGTGCCACATCGTATAGGCACTCTGGGTCGAGGCACGGGTCGAAGGCTGGTTCGGGTCATGTCTGCAGGCTAGCTTCTGGACTTTGATTCACGGGCTGGGCCGCAGGTGAGTTTGGGTCGTGGTCTTAAGGATATTTAGTAATAAATTATAATTTACCATATCTATTATATTTGATATTTTATTATTGTCTTTCTTACCATAAGTATCTTTCCATTTATTACTTTTTCCATAACCTTGTAATTTATTTGATTATAAATAAGATAACTTTCACACCATTTAGGTGTGGTGGATTAAACAAAAATTCTCATGGTATCAGAGCCTTTCGGTTTAGCAACTTCGATCCTTTATTTTTAATGGCTTATGAATCCCCTACTCATCTTCTTCTCCACCACCCCTGCTGGCTTTGATGTCGCAGGCGTAGTCTCCATGGCTGCCGCAAGCGGAGCCGATTGATTACTCTTCGCTCCACTCTTTTGACCTCCAAATCACTCTTCGTCATACTCTATACATACAACCTCACCTTAGCTAACATAGCCTCTCTCATTTTCTTTCCAATCTTCCCTTCATCCTTCGTCAAAACCCTAACCTAAACCTATCGACCTTTTGCAGAGATTTACCTTTGGAATCTGGGTGCCGCACTGGCCTTTTCGCCAACCAAGTCGCTGCCGCCCTTCTCTCCAAAATTGGCTCCTTCGCCCACTGTGGCGACTCTTC
CGGCTTCAACATTGTCTTTGTCCCTTGCTCTAGGTTTCCCATTCTACTTCTTGGGTTAGACATATTCTCACCAGAGCCAAGGCAGCATCGTCGCTACCTCTGCAATTTTCGATGGCCAAGAAAGTGTTTCTCTATGCCTCCTCTGAACCAGGTTGTTGACATCTTCACGAAAAGTGTTTCTCAACCTCTCTTCGAATTTTTCAGATCCAAGCTTCACGTTCATTTAAATCCGACGCTCAGCTTGCGGTGAGGTGTTAAAGATATTTAGTAATAACTTATAATTTACCATATATATTATATTTGATATTTTATTATTGTCTTTCTTACCATAGGTATCTTTCCATGTATTACTCTTTCCATATTCTTATAATTTATTTGATTATAAATAAGATAACTTTCACACTATGTAGGTGTGGTGGTGGGCTACTGGGCCGAAAGATTTTTGGGTCGAGCTAGAATTTGGGCCTCGGGTTTGCAGGCTGGACCGCAAGTTTGGGCCGCTGAACTTGGGTCGTGGGTATCCGATTCGGGTTGACTCGGATTCGACTGATCCTTCTTCCTTACCCAGTTTGTCGCGATTTTCTTACGTCAATTTTCTCTCTCTTCTGTGCGGTGGTTTTCGACGTCTTTTTCGATCAATTCTAGCACCTTCCTCTTTGTCTCCGTTAGCTTACTCTTCCCTAAGGTATTTCGAGGTCGTTTTTTTGCTTGTGACGCAAATCCGTCATCTCAAATCTGGTTTTGAAACCTACGACTTATGTGCAGCGTCCGATCGCCCGACGGTGAAGGTCACGGCCTAAACGTGACGTATTTGGTATTCTCAATCATGTCCTCTCTCTTTTATACCATCCCTAGAGTTATCCTCACGTATTTTCTCATCGAATTGTGTTCACCGGAAACTTACAAATCCTTTCTCTCTCTCTTTCTGAGTTTTTCTCTGTTTGCAGGAGTTGTGGGAAATGGTTGAAGGCTGGGAGGTGTTGGCGATCCATGGAGTTCGCTGTTGTGTGA

mRNA sequence

ATGCCGTTATCAAGAGGCGCCGCCCAGAGCCGCCCTCCGTACATCCCCAGATCCTCCTCCTCCTCCGCCTCCTTCAAGGGCTGCTGCTGCTGCCTGTTCCTCCTCTTCTCCTTCCTAGCTCTCTTAATCCTCGCCGTCGTCCTTGTCGTCGTCCTCGCCCTAAAGCCAAAGAAGCCCCAATTCGATCTCCAGCAGGTGGGCATCCAGTACATGAACATAACCACTCCCAATCCCACCGCCGCCTCCCTTTCCCTCGACATTCGAATGATTTTCACCGCCGTTAACCCCAACAAAGTGGGAATCAAGTACGGGGAGTCTCGGTTCACGGTCATGTACCGAGGGATTCCACTTGGGAGAGCTTCAATTCCCGGATTCTTTCAAGACCCACATAGCCAACGCCAGGTCGACGCCACCATCGCCGTCGATCGTGTCAGCCTCCTCCAAGCCGACGCCGCTGATTTGATCCGTGACGCCTCGTTAAACGACCGTGTGGAGCTGAGGATACTCGGCGATGTTGCCGCTAAGATCCGCCTCTTGTCCTTTGATTCCCCCGGCGTTCAGGTACCACTAATCTCTCTAAGGGCATCCTTCGTCCGATCGCCCGACGGTGAAGGTCACGGCCTAAACGTGACGAGTTGTGGGAAATGGTTGAAGGCTGGGAGGTGTTGGCGATCCATGGAGTTCGCTGTTGTGTGA

Coding sequence (CDS)

ATGCCGTTATCAAGAGGCGCCGCCCAGAGCCGCCCTCCGTACATCCCCAGATCCTCCTCCTCCTCCGCCTCCTTCAAGGGCTGCTGCTGCTGCCTGTTCCTCCTCTTCTCCTTCCTAGCTCTCTTAATCCTCGCCGTCGTCCTTGTCGTCGTCCTCGCCCTAAAGCCAAAGAAGCCCCAATTCGATCTCCAGCAGGTGGGCATCCAGTACATGAACATAACCACTCCCAATCCCACCGCCGCCTCCCTTTCCCTCGACATTCGAATGATTTTCACCGCCGTTAACCCCAACAAAGTGGGAATCAAGTACGGGGAGTCTCGGTTCACGGTCATGTACCGAGGGATTCCACTTGGGAGAGCTTCAATTCCCGGATTCTTTCAAGACCCACATAGCCAACGCCAGGTCGACGCCACCATCGCCGTCGATCGTGTCAGCCTCCTCCAAGCCGACGCCGCTGATTTGATCCGTGACGCCTCGTTAAACGACCGTGTGGAGCTGAGGATACTCGGCGATGTTGCCGCTAAGATCCGCCTCTTGTCCTTTGATTCCCCCGGCGTTCAGGTACCACTAATCTCTCTAAGGGCATCCTTCGTCCGATCGCCCGACGGTGAAGGTCACGGCCTAAACGTGACGAGTTGTGGGAAATGGTTGAAGGCTGGGAGGTGTTGGCGATCCATGGAGTTCGCTGTTGTGTGA

Protein sequence

MPLSRGAAQSRPPYIPRSSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGIQYMNITTPNPTAASLSLDIRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRASIPGFFQDPHSQRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFDSPGVQVPLISLRASFVRSPDGEGHGLNVTSCGKWLKAGRCWRSMEFAVV
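
The protein sequence corresponds to a translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code, the first and last codons of the CDS can be translated as follows; the names CODON_TABLE and translate are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 and last 21 bases of the CDS shown above.
print(translate("ATGCCGTTATCAAGAGGCGCCGCCCAGAGC"))
print(translate("ATGGAGTTCGCTGTTGTGTGA"))
```

The two fragments give MPLSRGAAQS and MEFAVV, matching the start and end of the protein sequence listed above.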
Homology
BLAST of CmoCh19G004420 vs. ExPASy Swiss-Prot
Match: Q9FI03 (NDR1/HIN1-like protein 26 OS=Arabidopsis thaliana OX=3702 GN=NHL26 PE=2 SV=1)

HSP 1 Score: 47.0 bits (110), Expect = 3.4e-04
Identity = 37/130 (28.46%), Positives = 64/130 (49.23%), Query Frame = 0

Query: 32  LFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGIQYMNITTPNPTAASLSLDIRMIF 91
           LF  FS     +L ++ +V L L P++P+F L +  I  +N+TT   +   L+  +++  
Sbjct: 27  LFFTFSTFFSGLLLIIFLVWLILHPERPEFSLTEADIYSLNLTT--SSTHLLNSSVQLTL 86

Query: 92  TAVNPN-KVGIKYGESRFTVMYRGIPL-GRASIPGFFQDPHSQRQVDATIAVDRVSLLQA 151
            + NPN KVGI Y +      YRG  +   AS+P F+Q       + A +    + + Q+
Sbjct: 87  FSKNPNKKVGIYYDKLLVYAAYRGQQITSEASLPPFYQSHEEINLLTAFLQGTELPVAQS 146

Query: 152 DAADLIRDAS 160
               + R+ S
Sbjct: 147 FGYQISRERS 154
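
The percentages in each HSP header are the matched-residue counts divided by the alignment length. A small sketch of that arithmetic, reading the two count/length pairs by position from the statistics line; the helper name hsp_percentages is hypothetical.

```python
import re

def hsp_percentages(stats_line):
    """Return (identity %, positives %) from a BLAST HSP statistics line."""
    # Each statistic looks like '= matched/length'; 'Query Frame = 0' has no slash.
    pairs = re.findall(r"=\s*(\d+)/(\d+)", stats_line)
    if len(pairs) < 2:
        raise ValueError("expected identity and positives counts")
    (ident, ident_len), (pos, pos_len) = pairs[0], pairs[1]
    identity_pct = round(100 * int(ident) / int(ident_len), 2)
    positives_pct = round(100 * int(pos) / int(pos_len), 2)
    return identity_pct, positives_pct

line = "Identity = 37/130 (28.46%), Positives = 64/130 (49.23%), Query Frame = 0"
identity_pct, positives_pct = hsp_percentages(line)
print(identity_pct, positives_pct)
```

For the Swiss-Prot hit above, 37 identical and 64 similar positions over 130 aligned columns give 28.46% and 49.23%.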

BLAST of CmoCh19G004420 vs. ExPASy TrEMBL
Match: A0A6J1GKY5 (uncharacterized protein LOC111455278 OS=Cucurbita moschata OX=3662 GN=LOC111455278 PE=4 SV=1)

HSP 1 Score: 357.8 bits (917), Expect = 3.3e-95
Identity = 188/188 (100.00%), Positives = 188/188 (100.00%), Query Frame = 0

Query: 1   MPLSRGAAQSRPPYIPRSSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQ 60
           MPLSRGAAQSRPPYIPRSSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQ
Sbjct: 22  MPLSRGAAQSRPPYIPRSSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQ 81

Query: 61  FDLQQVGIQYMNITTPNPTAASLSLDIRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRA 120
           FDLQQVGIQYMNITTPNPTAASLSLDIRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRA
Sbjct: 82  FDLQQVGIQYMNITTPNPTAASLSLDIRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRA 141

Query: 121 SIPGFFQDPHSQRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKIRLLS 180
           SIPGFFQDPHSQRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKIRLLS
Sbjct: 142 SIPGFFQDPHSQRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKIRLLS 201

Query: 181 FDSPGVQV 189
           FDSPGVQV
Sbjct: 202 FDSPGVQV 209

BLAST of CmoCh19G004420 vs. ExPASy TrEMBL
Match: A0A6J1I8L5 (uncharacterized protein LOC111470648 OS=Cucurbita maxima OX=3661 GN=LOC111470648 PE=4 SV=1)

HSP 1 Score: 346.7 bits (888), Expect = 7.6e-92
Identity = 190/214 (88.79%), Positives = 193/214 (90.19%), Query Frame = 0

Query: 1   MPLSRGAAQSRPPYIPRSSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQ 60
           MP SRGA  SRPPYIPR SSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQ
Sbjct: 1   MPSSRGATHSRPPYIPR-SSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQ 60

Query: 61  FDLQQVGIQYMNITTPNPTAASLSLDIRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRA 120
           FDLQQVGIQYMNITTPNPTAASLSL+IRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRA
Sbjct: 61  FDLQQVGIQYMNITTPNPTAASLSLNIRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRA 120

Query: 121 SIPGFFQDPHSQRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKIRLLS 180
           SIPGFFQDPHSQRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKIRLLS
Sbjct: 121 SIPGFFQDPHSQRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKIRLLS 180

Query: 181 FDSPGVQVPLISLRASFVRSPDGEGHGLNVTSCG 215
           FDSPGVQV   S+  + V SP      L    CG
Sbjct: 181 FDSPGVQV---SVDCAIVISP--RKQSLTYKQCG 208

BLAST of CmoCh19G004420 vs. ExPASy TrEMBL
Match: A0A1S3BGU8 (uncharacterized protein LOC103489700 OS=Cucumis melo OX=3656 GN=LOC103489700 PE=4 SV=1)

HSP 1 Score: 318.2 bits (814), Expect = 2.9e-83
Identity = 170/203 (83.74%), Positives = 180/203 (88.67%), Query Frame = 0

Query: 12  PPYIPRSSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGIQYM 71
           P Y PRSSSSSA+FKGCCCCLFLLFSFLALL+LA+VLVVVLALKPKKPQFDLQQVG+QYM
Sbjct: 67  PRYNPRSSSSSATFKGCCCCLFLLFSFLALLVLAIVLVVVLALKPKKPQFDLQQVGVQYM 126

Query: 72  NITTPNPTAASLSLDIRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRASIPGFFQDPHS 131
            IT PNPT ASLSL+IRMIFTAVNPNKVGIKY ESRFTVMYRGIPLGRAS+PGFFQDPHS
Sbjct: 127 GITNPNPTTASLSLNIRMIFTAVNPNKVGIKYEESRFTVMYRGIPLGRASVPGFFQDPHS 186

Query: 132 QRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFDSPGVQVPLI 191
           QRQVDATIAVDRV+LLQADAADLIRDASLNDRVELRILGDVAAKIRLLSF+SPGVQV   
Sbjct: 187 QRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQV--- 246

Query: 192 SLRASFVRSPDGEGHGLNVTSCG 215
           S+  + V SP      L    CG
Sbjct: 247 SVDCAIVISP--RKQSLTYKQCG 264

BLAST of CmoCh19G004420 vs. ExPASy TrEMBL
Match: A0A5D3D361 (Late embryogenesis abundant protein, LEA-14 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold306G001600 PE=4 SV=1)

HSP 1 Score: 314.3 bits (804), Expect = 4.2e-82
Identity = 162/176 (92.05%), Positives = 170/176 (96.59%), Query Frame = 0

Query: 12  PPYIPRSSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGIQYM 71
           P Y PRSSSSSA+FKGCCCCLFLLFSFLALL+LA+VLVVVLALKPKKPQFDLQQVG+QYM
Sbjct: 67  PRYNPRSSSSSATFKGCCCCLFLLFSFLALLVLAIVLVVVLALKPKKPQFDLQQVGVQYM 126

Query: 72  NITTPNPTAASLSLDIRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRASIPGFFQDPHS 131
            IT PNPT ASLSL+IRMIFTAVNPNKVGIKY ESRFTVMYRGIPLGRAS+PGFFQDPHS
Sbjct: 127 GITNPNPTTASLSLNIRMIFTAVNPNKVGIKYEESRFTVMYRGIPLGRASVPGFFQDPHS 186

Query: 132 QRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFDSPGVQ 188
           QRQVDATIAVDRV+LLQADAADLIRDASLNDRVELRILGDVAAKIRLLSF+SPGVQ
Sbjct: 187 QRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQ 242

BLAST of CmoCh19G004420 vs. ExPASy TrEMBL
Match: A0A0A0K5H7 (LEA_2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G223380 PE=4 SV=1)

HSP 1 Score: 313.5 bits (802), Expect = 7.2e-82
Identity = 168/203 (82.76%), Positives = 179/203 (88.18%), Query Frame = 0

Query: 12  PPYIPRSSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGIQYM 71
           P Y PRSSSSSA+FKGCCCCLFLLFSFLALL+LA+VLVVVLALKPKKPQFDLQQV +QY+
Sbjct: 14  PRYNPRSSSSSATFKGCCCCLFLLFSFLALLVLAIVLVVVLALKPKKPQFDLQQVKVQYV 73

Query: 72  NITTPNPTAASLSLDIRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRASIPGFFQDPHS 131
            IT PNPT ASLSL+IRMIFTAVNPNKVGIKY ESRFTVMYRGIPLGRAS+PGFFQDPHS
Sbjct: 74  GITNPNPTTASLSLNIRMIFTAVNPNKVGIKYEESRFTVMYRGIPLGRASVPGFFQDPHS 133

Query: 132 QRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFDSPGVQVPLI 191
           QRQVDATIAVDRV+LLQADAADLIRDASLNDRVELRILGDVAAKIRLLSF+SPGVQV   
Sbjct: 134 QRQVDATIAVDRVNLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSFNSPGVQV--- 193

Query: 192 SLRASFVRSPDGEGHGLNVTSCG 215
           S+  + V SP      L    CG
Sbjct: 194 SVDCAIVISP--RKQSLTYKQCG 211

BLAST of CmoCh19G004420 vs. NCBI nr
Match: XP_022952647.1 (uncharacterized protein LOC111455278 [Cucurbita moschata])

HSP 1 Score: 357.8 bits (917), Expect = 6.9e-95
Identity = 188/188 (100.00%), Positives = 188/188 (100.00%), Query Frame = 0

Query: 1   MPLSRGAAQSRPPYIPRSSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQ 60
           MPLSRGAAQSRPPYIPRSSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQ
Sbjct: 22  MPLSRGAAQSRPPYIPRSSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQ 81

Query: 61  FDLQQVGIQYMNITTPNPTAASLSLDIRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRA 120
           FDLQQVGIQYMNITTPNPTAASLSLDIRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRA
Sbjct: 82  FDLQQVGIQYMNITTPNPTAASLSLDIRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRA 141

Query: 121 SIPGFFQDPHSQRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKIRLLS 180
           SIPGFFQDPHSQRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKIRLLS
Sbjct: 142 SIPGFFQDPHSQRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKIRLLS 201

Query: 181 FDSPGVQV 189
           FDSPGVQV
Sbjct: 202 FDSPGVQV 209

BLAST of CmoCh19G004420 vs. NCBI nr
Match: KAG6571818.1 (NDR1/HIN1-like protein 13, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 356.3 bits (913), Expect = 2.0e-94
Identity = 193/214 (90.19%), Positives = 196/214 (91.59%), Query Frame = 0

Query: 1   MPLSRGAAQSRPPYIPRSSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQ 60
           MP SRGAAQSRPPYIPRSSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQ
Sbjct: 1   MPSSRGAAQSRPPYIPRSSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQ 60

Query: 61  FDLQQVGIQYMNITTPNPTAASLSLDIRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRA 120
           FDLQQVGIQYMNITTPNPTAASLSL+IRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRA
Sbjct: 61  FDLQQVGIQYMNITTPNPTAASLSLNIRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRA 120

Query: 121 SIPGFFQDPHSQRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKIRLLS 180
           SIPGFFQDPHSQRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKIRLLS
Sbjct: 121 SIPGFFQDPHSQRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKIRLLS 180

Query: 181 FDSPGVQVPLISLRASFVRSPDGEGHGLNVTSCG 215
           FDSPGVQV   S+  + V SP      L    CG
Sbjct: 181 FDSPGVQV---SVDCAIVISP--RKQSLTYKQCG 209

BLAST of CmoCh19G004420 vs. NCBI nr
Match: XP_023554780.1 (NDR1/HIN1-like protein 10 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 355.1 bits (910), Expect = 4.4e-94
Identity = 192/214 (89.72%), Positives = 196/214 (91.59%), Query Frame = 0

Query: 1   MPLSRGAAQSRPPYIPRSSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQ 60
           MP SRGAA+SRPPYIPRSSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQ
Sbjct: 1   MPSSRGAAESRPPYIPRSSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQ 60

Query: 61  FDLQQVGIQYMNITTPNPTAASLSLDIRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRA 120
           FDLQQVGIQYMNITTPNPTAASLSL+IRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRA
Sbjct: 61  FDLQQVGIQYMNITTPNPTAASLSLNIRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRA 120

Query: 121 SIPGFFQDPHSQRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKIRLLS 180
           SIPGFFQDPHSQRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKIRLLS
Sbjct: 121 SIPGFFQDPHSQRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKIRLLS 180

Query: 181 FDSPGVQVPLISLRASFVRSPDGEGHGLNVTSCG 215
           FDSPGVQV   S+  + V SP      L    CG
Sbjct: 181 FDSPGVQV---SVDCAIVISP--RKQSLTYKQCG 209

BLAST of CmoCh19G004420 vs. NCBI nr
Match: KAG7018167.1 (hypothetical protein SDJN02_20035, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 352.4 bits (903), Expect = 2.9e-93
Identity = 185/188 (98.40%), Positives = 187/188 (99.47%), Query Frame = 0

Query: 1   MPLSRGAAQSRPPYIPRSSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQ 60
           MP SRGAAQSRPPYIPRSSS+SASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQ
Sbjct: 1   MPSSRGAAQSRPPYIPRSSSASASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQ 60

Query: 61  FDLQQVGIQYMNITTPNPTAASLSLDIRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRA 120
           FDLQQVGIQYMNITTPNPTAASLSL+IRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRA
Sbjct: 61  FDLQQVGIQYMNITTPNPTAASLSLNIRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRA 120

Query: 121 SIPGFFQDPHSQRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKIRLLS 180
           SIPGFFQDPHSQRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKIRLLS
Sbjct: 121 SIPGFFQDPHSQRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKIRLLS 180

Query: 181 FDSPGVQV 189
           FDSPGVQV
Sbjct: 181 FDSPGVQV 188

BLAST of CmoCh19G004420 vs. NCBI nr
Match: XP_022971998.1 (uncharacterized protein LOC111470648 [Cucurbita maxima])

HSP 1 Score: 346.7 bits (888), Expect = 1.6e-91
Identity = 190/214 (88.79%), Positives = 193/214 (90.19%), Query Frame = 0

Query: 1   MPLSRGAAQSRPPYIPRSSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQ 60
           MP SRGA  SRPPYIPR SSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQ
Sbjct: 1   MPSSRGATHSRPPYIPR-SSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQ 60

Query: 61  FDLQQVGIQYMNITTPNPTAASLSLDIRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRA 120
           FDLQQVGIQYMNITTPNPTAASLSL+IRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRA
Sbjct: 61  FDLQQVGIQYMNITTPNPTAASLSLNIRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRA 120

Query: 121 SIPGFFQDPHSQRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKIRLLS 180
           SIPGFFQDPHSQRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKIRLLS
Sbjct: 121 SIPGFFQDPHSQRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKIRLLS 180

Query: 181 FDSPGVQVPLISLRASFVRSPDGEGHGLNVTSCG 215
           FDSPGVQV   S+  + V SP      L    CG
Sbjct: 181 FDSPGVQV---SVDCAIVISP--RKQSLTYKQCG 208

BLAST of CmoCh19G004420 vs. TAIR 10
Match: AT2G01080.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 264.6 bits (675), Expect = 7.4e-71
Identity = 136/187 (72.73%), Positives = 163/187 (87.17%), Query Frame = 0

Query: 7   AAQSRPPYI-PRSSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQ 66
           AAQ++ PY    SSSSSAS KGCCCCLFLLF+FLALL+LAVVL+V+LA+KPKKPQFDLQQ
Sbjct: 18  AAQNQQPYYRSYSSSSSASLKGCCCCLFLLFAFLALLVLAVVLIVILAVKPKKPQFDLQQ 77

Query: 67  VGIQYMNITTP----NPTAASLSLDIRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRAS 126
           V + YM I+ P    +PT ASLSL IRM+FTAVNPNKVGI+YGES FTVMY+G+PLGRA+
Sbjct: 78  VAVVYMGISNPSAVLDPTTASLSLTIRMLFTAVNPNKVGIRYGESSFTVMYKGMPLGRAT 137

Query: 127 IPGFFQDPHSQRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKIRLLSF 186
           +PGF+QD HS + V+ATI+VDRV+L+QA AADL+RDASLNDRVEL + GDV AKIR+++F
Sbjct: 138 VPGFYQDAHSTKNVEATISVDRVNLMQAHAADLVRDASLNDRVELTVRGDVGAKIRVMNF 197

Query: 187 DSPGVQV 189
           DSPGVQV
Sbjct: 198 DSPGVQV 204

BLAST of CmoCh19G004420 vs. TAIR 10
Match: AT1G17620.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 53.5 bits (127), Expect = 2.6e-07
Identity = 57/221 (25.79%), Positives = 101/221 (45.70%), Query Frame = 0

Query: 10  SRPPYIPRSSSSSASF-KGCC--CCLFLLFSFLALLIL--AVVLVVVLALKPKKPQFDLQ 69
           +RP Y P +     S  +GCC  CC + +F  + LL++  A   VV L  +P++P F + 
Sbjct: 38  NRPAYRPPAGRRRTSHTRGCCCRCCCWTIFVIILLLLIVAAASAVVYLIYRPQRPSFTVS 97

Query: 70  QVGIQYMNITTPNPTAASLSLDIRMIFTAVNPNK-VGIKYGESRFTVMYRG-------IP 129
           ++ I  +N T    +A  L+  I +   A NPNK VG  Y  +  T +Y+        + 
Sbjct: 98  ELKISTLNFT----SAVRLTTAISLSVIARNPNKNVGFIYDVTDIT-LYKASTGGDDDVV 157

Query: 130 LGRASIPGFFQDPHSQRQVDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAKI 189
           +G+ +I  F     +   + +TI      L +  A  L  D      V ++I+ +   K+
Sbjct: 158 IGKGTIAAFSHGKKNTTTLRSTIGSPPDELDEISAGKLKGDLKAKKAVAIKIVLNSKVKV 217

Query: 190 RLLSFDSP--GVQVPLISLRASFVRSPDGEGHGLNVTSCGK 216
           ++ +  +P  G++V    ++   V +P G+      TS  K
Sbjct: 218 KMGALKTPKSGIRVTCEGIK---VVAPTGKKATTATTSAAK 250

BLAST of CmoCh19G004420 vs. TAIR 10
Match: AT5G53730.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 47.0 bits (110), Expect = 2.4e-05
Identity = 37/130 (28.46%), Positives = 64/130 (49.23%), Query Frame = 0

Query: 32  LFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGIQYMNITTPNPTAASLSLDIRMIF 91
           LF  FS     +L ++ +V L L P++P+F L +  I  +N+TT   +   L+  +++  
Sbjct: 27  LFFTFSTFFSGLLLIIFLVWLILHPERPEFSLTEADIYSLNLTT--SSTHLLNSSVQLTL 86

Query: 92  TAVNPN-KVGIKYGESRFTVMYRGIPL-GRASIPGFFQDPHSQRQVDATIAVDRVSLLQA 151
            + NPN KVGI Y +      YRG  +   AS+P F+Q       + A +    + + Q+
Sbjct: 87  FSKNPNKKVGIYYDKLLVYAAYRGQQITSEASLPPFYQSHEEINLLTAFLQGTELPVAQS 146

Query: 152 DAADLIRDAS 160
               + R+ S
Sbjct: 147 FGYQISRERS 154

BLAST of CmoCh19G004420 vs. TAIR 10
Match: AT3G54200.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 44.7 bits (104), Expect = 1.2e-04
Identity = 27/97 (27.84%), Positives = 51/97 (52.58%), Query Frame = 0

Query: 30  CCLFLLFSFLALLILAVVLVVV--LALKPKKPQFDLQQVGIQYMNIT-TPNPTAASLSLD 89
           C + + F+ L +L++A+V+V++     KPK+P   +  V +  +  +  P      L+L 
Sbjct: 51  CKICICFTILLILLIAIVIVILAFTLFKPKRPTTTIDSVTVDRLQASVNPLLLKVLLNLT 110

Query: 90  IRMIFTAVNPNKVGIKYGESRFTVMYRGIPLGRASIP 124
           + +  +  NPN++G  Y  S   + YRG  +G A +P
Sbjct: 111 LNVDLSLKNPNRIGFSYDSSSALLNYRGQVIGEAPLP 147

BLAST of CmoCh19G004420 vs. TAIR 10
Match: AT4G26490.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 44.7 bits (104), Expect = 1.2e-04
Identity = 41/161 (25.47%), Positives = 68/161 (42.24%), Query Frame = 0

Query: 16  PRSSSSSASFKGCCCCLFLLFSFLALLILAVVLVVVLALKPKKPQFDLQQVGIQYMNITT 75
           PRSS +S  +  C      +FS L +      L+V LA++P+ P FD+    +  +   T
Sbjct: 77  PRSSRTSL-WIWCVAGFCFVFSLLLIFFAIATLIVFLAIRPRIPVFDIPNANLHTIYFDT 136

Query: 76  PNPTAASLSLDIRMIFTAVNPN-KVGIKYGESRFTVMYRGIPLGRASIPGFFQDPHSQRQ 135
           P       + D+ M+    NPN K+ +K+ + R  + +    +    +  F Q  H  R 
Sbjct: 137 PE----FFNGDLSMLVNFTNPNKKIEVKFEKLRIELFFFNRLIAAQVVQPFLQKKHETRL 196

Query: 136 VDATIAVDRVSLLQADAADLIRDASLNDRVELRILGDVAAK 176
               +    V L    A +L R    N+++E  I G    K
Sbjct: 197 EPIRLISSLVGLPVNHAVELRRQLE-NNKIEYEIRGTFKVK 231

The following BLAST results are available for this feature:

ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q9FI03 | 3.4e-04 | 28.46 | NDR1/HIN1-like protein 26 OS=Arabidopsis thaliana OX=3702 GN=NHL26 PE=2 SV=1

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1GKY5 | 3.3e-95 | 100.00 | uncharacterized protein LOC111455278 OS=Cucurbita moschata OX=3662 GN=LOC1114552...
A0A6J1I8L5 | 7.6e-92 | 88.79 | uncharacterized protein LOC111470648 OS=Cucurbita maxima OX=3661 GN=LOC111470648...
A0A1S3BGU8 | 2.9e-83 | 83.74 | uncharacterized protein LOC103489700 OS=Cucumis melo OX=3656 GN=LOC103489700 PE=...
A0A5D3D361 | 4.2e-82 | 92.05 | Late embryogenesis abundant protein, LEA-14 OS=Cucumis melo var. makuwa OX=11946...
A0A0A0K5H7 | 7.2e-82 | 82.76 | LEA_2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G223380 PE=4 ...

NCBI nr
Match Name | E-value | Identity (%) | Description
XP_022952647.1 | 6.9e-95 | 100.00 | uncharacterized protein LOC111455278 [Cucurbita moschata]
KAG6571818.1 | 2.0e-94 | 90.19 | NDR1/HIN1-like protein 13, partial [Cucurbita argyrosperma subsp. sororia]
XP_023554780.1 | 4.4e-94 | 89.72 | NDR1/HIN1-like protein 10 [Cucurbita pepo subsp. pepo]
KAG7018167.1 | 2.9e-93 | 98.40 | hypothetical protein SDJN02_20035, partial [Cucurbita argyrosperma subsp. argyro...
XP_022971998.1 | 1.6e-91 | 88.79 | uncharacterized protein LOC111470648 [Cucurbita maxima]

TAIR 10
Match Name | E-value | Identity (%) | Description
AT2G01080.1 | 7.4e-71 | 72.73 | Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
AT1G17620.1 | 2.6e-07 | 25.79 | Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
AT5G53730.1 | 2.4e-05 | 28.46 | Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
AT3G54200.1 | 1.2e-04 | 27.84 | Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
AT4G26490.1 | 1.2e-04 | 25.47 | Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family

InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR004864 | Late embryogenesis abundant protein, LEA_2 subgroup | PFAM | PF03168 | LEA_2 | coord: 93..173; e-value: 3.9E-10; score: 40.2
None | No IPR available | GENE3D | 2.60.40.1820 | | coord: 52..190; e-value: 2.0E-6; score: 29.7
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..23
None | No IPR available | PANTHER | PTHR31234 | LATE EMBRYOGENESIS ABUNDANT (LEA) HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY | coord: 14..189
None | No IPR available | PANTHER | PTHR31234:SF8 | EXPRESSED PROTEIN | coord: 14..189
None | No IPR available | SUPERFAMILY | 117070 | LEA14-like | coord: 40..172

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmoCh19G004420.1 | CmoCh19G004420.1 | mRNA

GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0016021 | integral component of membrane