CaUC01G012540 (gene) Watermelon (USVL246-FR2) v1

Overview
Name: CaUC01G012540
Type: gene
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
Location: Ciama_Chr01: 24538047 .. 24539227 (-)
RNA-Seq Expression: CaUC01G012540
Synteny: CaUC01G012540
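As a small illustration of how the Location field can be read, the sketch below parses the coordinate string and derives the span of the gene on the chromosome. It assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which this page does not state explicitly; the function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'seqid: start .. end (strand)' into its parts.

    Assumption: coordinates are 1-based and inclusive, so the
    span length is end - start + 1.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("Ciama_Chr01: 24538047 .. 24539227 (-)")
print(loc["seqid"], loc["strand"], loc["length"])  # Ciama_Chr01 - 1181
```

Under that assumption the feature spans 1181 bp on the minus strand of Ciama_Chr01.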
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGGTATGTAAATTTTGTTTCATAACAATAACCAATCACTCAAAATATTCTTTGCATTCTTAAAGGTATAGTTGGAACCTTAGAAAGTCATAGCATATAACTGAATAAAAAAAAAGAAGGGAGAGTTTAGAAGTTTTGGACTTGGCATATCTTGCAGGCAAAGCAACATTTGGTTCATTTAAAACAAAAGAGAGGGCAAATACCAAATCAATCCTCTATTGATCTCTGTCTTTCATTTATAATTCCATATTTAATGGTTGGCAAAATTCTTACATAGTGCTAAAAAGGCATTCAAATGCTTCCATTAATAGTCCAAGAAAAGAACCTGAATCCCTTTTGATATCTTGAGTTTCATGCAAGAAATCAAAAAGAGAGAGAGGAAATTAATAAGTGTTGTAATAATGAGAGAAAAGAGAATACAAGGCAAGTTTTTTAGTTAAATAATTCCATTCAAAACTTGCAAAACCTCTCTTATATATTAATCCACTCTCATTTCCATCTTTTTTGTAGTACTCTTCTTTAAAGCAAAGCCTCTTCATCAAAGTTGGCTTCTTTTCCTTAGTAGAGAACTATCCATCCTTGAGCATGGCAGGTCCTCCACAGCCACCGTCGCGAGCTGGCCCCTCGAGGATATTGCGCTTCGTCGCATTGTTCATAGTGGCACTGATAGTACTCGTTGGGCTTGCCGTGCTCATTATTTGGCTGACCATTAGGCCGAAACGACTAAGCTACACAGTGGAAAGCGCTTCGGTCCATAACTTCGACATGACGGACACTCAACTCAACGCCTCCTTTAGCTTTGGGGTAAGAGCATATAATCCCAATAAACGAGTCTCAGTTTACTATGATTCCATCACCGCCACCGTTGGCTTTGGCGATCAAGACTTGGCATTTGGCGTGCTCAATCCCTTCTACCAACCTCACAAAAACGAGCAATGGTTGAACATCAACCTCAACGCTCAGAACTTTCTATTGCATGACTCTGTGTCGAAGGACTTGGCCCTCGAAAAGGCGGCGGGAGAGATGGATTTGGATCTTTGGATCAAGGCAAGAATTAGGTTTAAGGTCGGGGTATGGAAGTCCGCGCATAGGACGCTTCGAATCCGGTGTTCGCCAGTGATTGTTTACTTGTCTAAATCCAAGACTTTCAAGAAGACTACTTGCTTTACAGAAGTCTAA

mRNA sequence

ATGGGGTATTACTCTTCTTTAAAGCAAAGCCTCTTCATCAAAGTTGGCTTCTTTTCCTTAGTAGAGAACTATCCATCCTTGAGCATGGCAGGTCCTCCACAGCCACCGTCGCGAGCTGGCCCCTCGAGGATATTGCGCTTCGTCGCATTGTTCATAGTGGCACTGATAGTACTCGTTGGGCTTGCCGTGCTCATTATTTGGCTGACCATTAGGCCGAAACGACTAAGCTACACAGTGGAAAGCGCTTCGGTCCATAACTTCGACATGACGGACACTCAACTCAACGCCTCCTTTAGCTTTGGGGTAAGAGCATATAATCCCAATAAACGAGTCTCAGTTTACTATGATTCCATCACCGCCACCGTTGGCTTTGGCGATCAAGACTTGGCATTTGGCGTGCTCAATCCCTTCTACCAACCTCACAAAAACGAGCAATGGTTGAACATCAACCTCAACGCTCAGAACTTTCTATTGCATGACTCTGTGTCGAAGGACTTGGCCCTCGAAAAGGCGGCGGGAGAGATGGATTTGGATCTTTGGATCAAGGCAAGAATTAGGTTTAAGGTCGGGGTATGGAAGTCCGCGCATAGGACGCTTCGAATCCGGTGTTCGCCAGTGATTGTTTACTTGTCTAAATCCAAGACTTTCAAGAAGACTACTTGCTTTACAGAAGTCTAA

Coding sequence (CDS)

ATGGGGTATTACTCTTCTTTAAAGCAAAGCCTCTTCATCAAAGTTGGCTTCTTTTCCTTAGTAGAGAACTATCCATCCTTGAGCATGGCAGGTCCTCCACAGCCACCGTCGCGAGCTGGCCCCTCGAGGATATTGCGCTTCGTCGCATTGTTCATAGTGGCACTGATAGTACTCGTTGGGCTTGCCGTGCTCATTATTTGGCTGACCATTAGGCCGAAACGACTAAGCTACACAGTGGAAAGCGCTTCGGTCCATAACTTCGACATGACGGACACTCAACTCAACGCCTCCTTTAGCTTTGGGGTAAGAGCATATAATCCCAATAAACGAGTCTCAGTTTACTATGATTCCATCACCGCCACCGTTGGCTTTGGCGATCAAGACTTGGCATTTGGCGTGCTCAATCCCTTCTACCAACCTCACAAAAACGAGCAATGGTTGAACATCAACCTCAACGCTCAGAACTTTCTATTGCATGACTCTGTGTCGAAGGACTTGGCCCTCGAAAAGGCGGCGGGAGAGATGGATTTGGATCTTTGGATCAAGGCAAGAATTAGGTTTAAGGTCGGGGTATGGAAGTCCGCGCATAGGACGCTTCGAATCCGGTGTTCGCCAGTGATTGTTTACTTGTCTAAATCCAAGACTTTCAAGAAGACTACTTGCTTTACAGAAGTCTAA

Protein sequence

MGYYSSLKQSLFIKVGFFSLVENYPSLSMAGPPQPPSRAGPSRILRFVALFIVALIVLVGLAVLIIWLTIRPKRLSYTVESASVHNFDMTDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVLNPFYQPHKNEQWLNINLNAQNFLLHDSVSKDLALEKAAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIVYLSKSKTFKKTTCFTEV
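The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. The sketch below is a minimal, dependency-free example that assumes the standard genetic code (a reasonable assumption for a nuclear plant gene, but not stated on this page); `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 and last 24 nucleotides of the CDS listed above.
cds_start = "ATGGGGTATTACTCTTCTTTAAAGCAAAGC"
cds_end = "ACTACTTGCTTTACAGAAGTCTAA"

print(translate(cds_start))  # MGYYSSLKQS
print(translate(cds_end))    # TTCFTEV
```

Both fragments agree with the start (MGYYSSLKQS) and end (TTCFTEV) of the protein sequence shown above.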
Homology
BLAST of CaUC01G012540 vs. NCBI nr
Match: KAE8652783.1 (hypothetical protein Csa_022856 [Cucumis sativus])

HSP 1 Score: 401.7 bits (1031), Expect = 4.0e-108
Identity = 199/220 (90.45%), Positives = 213/220 (96.82%), Query Frame = 0

Query: 6   SLKQSLFIKVGFFSLVENYPSLSMAGPPQPPSRAGPSRILRFVALFIVALIVLVGLAVLI 65
           +L ++  IKVGFFSL ENYPSLSMAGPPQP SR+GPSRILRFV +F+VALI+LVGLAVLI
Sbjct: 4   TLNKASSIKVGFFSLAENYPSLSMAGPPQPLSRSGPSRILRFVIIFLVALIILVGLAVLI 63

Query: 66  IWLTIRPKRLSYTVESASVHNFDMTDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFG 125
           IWLT+RPKRLSYTVESA VHNFDMTDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFG
Sbjct: 64  IWLTVRPKRLSYTVESAEVHNFDMTDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFG 123

Query: 126 DQDLAFGVLNPFYQPHKNEQWLNINLNAQNFLLHDSVSKDLALEKAAGEMDLDLWIKARI 185
           DQDL+FGVL+PFYQPHKNEQWLNI+LNAQNFLLHDSVSK+LALE++AGEMDLDLWIKARI
Sbjct: 124 DQDLSFGVLSPFYQPHKNEQWLNIHLNAQNFLLHDSVSKELALERSAGEMDLDLWIKARI 183

Query: 186 RFKVGVWKSAHRTLRIRCSPVIVYLSKSKTFKKTTCFTEV 226
           RFKVGVWKSAHRTLRIRCSPVIVYLSKSKTFKKTTCFTEV
Sbjct: 184 RFKVGVWKSAHRTLRIRCSPVIVYLSKSKTFKKTTCFTEV 223
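The percentages on each HSP summary line are the match counts divided by the alignment length. The sketch below recomputes them from one such line; the function name `hsp_percentages` is hypothetical, and the regular expression is an assumption about this report's layout rather than a general BLAST parser.

```python
import re

# Matches the "Identity = a/b (x%), Positives = c/d (y%)" summary line.
# The optional "i" tolerates reports that spell the label "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return (
        round(100 * int(ident) / int(ident_len), 2),
        round(100 * int(pos) / int(pos_len), 2),
    )

line = "Identity = 199/220 (90.45%), Positives = 213/220 (96.82%), Query Frame = 0"
identity_pct, positives_pct = hsp_percentages(line)
print(identity_pct, positives_pct)  # 90.45 96.82
```

The recomputed values match the percentages reported for the first hit (KAE8652783.1).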

BLAST of CaUC01G012540 vs. NCBI nr
Match: XP_008464346.1 (PREDICTED: uncharacterized protein At1g08160 [Cucumis melo] >KAA0032608.1 uncharacterized protein E6C27_scaffold43053G00810 [Cucumis melo var. makuwa] >TYK20878.1 uncharacterized protein E5676_scaffold284G00070 [Cucumis melo var. makuwa])

HSP 1 Score: 375.6 bits (963), Expect = 3.1e-100
Identity = 186/197 (94.42%), Positives = 195/197 (98.98%), Query Frame = 0

Query: 29  MAGPPQPPSRAGPSRILRFVALFIVALIVLVGLAVLIIWLTIRPKRLSYTVESASVHNFD 88
           MAGPPQPPSRAGPSRILRFV +F+VALI+LVGLAVLIIWLTIRPKRLSYTVESA VHNFD
Sbjct: 1   MAGPPQPPSRAGPSRILRFVIIFLVALIILVGLAVLIIWLTIRPKRLSYTVESAEVHNFD 60

Query: 89  MTDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVLNPFYQPHKNEQWLN 148
           MT+TQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVL+PFYQPHK+EQWLN
Sbjct: 61  MTNTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVLSPFYQPHKDEQWLN 120

Query: 149 INLNAQNFLLHDSVSKDLALEKAAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV 208
           I+LNAQNFLLHDSVSKDLALE++AGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV
Sbjct: 121 IHLNAQNFLLHDSVSKDLALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV 180

Query: 209 YLSKSKTFKKTTCFTEV 226
           YLSKSKTFKKTTCFTEV
Sbjct: 181 YLSKSKTFKKTTCFTEV 197

BLAST of CaUC01G012540 vs. NCBI nr
Match: XP_011654115.1 (uncharacterized protein At1g08160 [Cucumis sativus])

HSP 1 Score: 371.3 bits (952), Expect = 5.8e-99
Identity = 183/197 (92.89%), Positives = 194/197 (98.48%), Query Frame = 0

Query: 29  MAGPPQPPSRAGPSRILRFVALFIVALIVLVGLAVLIIWLTIRPKRLSYTVESASVHNFD 88
           MAGPPQP SR+GPSRILRFV +F+VALI+LVGLAVLIIWLT+RPKRLSYTVESA VHNFD
Sbjct: 1   MAGPPQPLSRSGPSRILRFVIIFLVALIILVGLAVLIIWLTVRPKRLSYTVESAEVHNFD 60

Query: 89  MTDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVLNPFYQPHKNEQWLN 148
           MTDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDL+FGVL+PFYQPHKNEQWLN
Sbjct: 61  MTDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLSFGVLSPFYQPHKNEQWLN 120

Query: 149 INLNAQNFLLHDSVSKDLALEKAAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV 208
           I+LNAQNFLLHDSVSK+LALE++AGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV
Sbjct: 121 IHLNAQNFLLHDSVSKELALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV 180

Query: 209 YLSKSKTFKKTTCFTEV 226
           YLSKSKTFKKTTCFTEV
Sbjct: 181 YLSKSKTFKKTTCFTEV 197

BLAST of CaUC01G012540 vs. NCBI nr
Match: XP_038894942.1 (uncharacterized protein At1g08160-like [Benincasa hispida])

HSP 1 Score: 370.2 bits (949), Expect = 1.3e-98
Identity = 183/197 (92.89%), Positives = 194/197 (98.48%), Query Frame = 0

Query: 29  MAGPPQPPSRAGPSRILRFVALFIVALIVLVGLAVLIIWLTIRPKRLSYTVESASVHNFD 88
           MAGPPQPP RAGPSRILRFV +FIVALI+LVGLAVLIIWLTIRPKRLSYTVESASVHNFD
Sbjct: 1   MAGPPQPPPRAGPSRILRFVLMFIVALIILVGLAVLIIWLTIRPKRLSYTVESASVHNFD 60

Query: 89  MTDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVLNPFYQPHKNEQWLN 148
           MT TQLNASFSFGVRAYNPNKRV++YYDSITATVGFGDQDLAFGVL+PFYQPHK+E+WLN
Sbjct: 61  MTSTQLNASFSFGVRAYNPNKRVAIYYDSITATVGFGDQDLAFGVLSPFYQPHKDERWLN 120

Query: 149 INLNAQNFLLHDSVSKDLALEKAAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV 208
           I+L+AQNFLLHDSVSKDLALE+AAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV
Sbjct: 121 IHLDAQNFLLHDSVSKDLALERAAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV 180

Query: 209 YLSKSKTFKKTTCFTEV 226
           YLSKS+TFKKTTCFTEV
Sbjct: 181 YLSKSQTFKKTTCFTEV 197

BLAST of CaUC01G012540 vs. NCBI nr
Match: KAG7020792.1 (hypothetical protein SDJN02_17480, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 327.8 bits (839), Expect = 7.4e-86
Identity = 161/197 (81.73%), Positives = 177/197 (89.85%), Query Frame = 0

Query: 29  MAGPPQPPSRAGPSRILRFVALFIVALIVLVGLAVLIIWLTIRPKRLSYTVESASVHNFD 88
           MAGPPQ  SR   S ILR+V L +VALIVLVGL VLIIWLT+RPKRLSYTVESA+VHNFD
Sbjct: 1   MAGPPQLSSRPARSNILRYVILVLVALIVLVGLVVLIIWLTVRPKRLSYTVESAAVHNFD 60

Query: 89  MTDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVLNPFYQPHKNEQWLN 148
           M+ TQLNASF+FGV+AYNPN+ VSVYYD +T TVGFGDQDLAFGV+ PFYQPHK+  WLN
Sbjct: 61  MSTTQLNASFNFGVKAYNPNRHVSVYYDHVTVTVGFGDQDLAFGVIKPFYQPHKDVTWLN 120

Query: 149 INLNAQNFLLHDSVSKDLALEKAAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV 208
           ++LNA+NFLLHDSVSKDLALEKAAGEMDLDLWIKARIRFKVGVWKS HRTLRIRCSPVIV
Sbjct: 121 MDLNAKNFLLHDSVSKDLALEKAAGEMDLDLWIKARIRFKVGVWKSGHRTLRIRCSPVIV 180

Query: 209 YLSKSKTFKKTTCFTEV 226
           YLSK K FK+T CFTE+
Sbjct: 181 YLSKDKEFKRTACFTEI 197

BLAST of CaUC01G012540 vs. ExPASy Swiss-Prot
Match: Q8VZ13 (Uncharacterized protein At1g08160 OS=Arabidopsis thaliana OX=3702 GN=At1g08160 PE=2 SV=1)

HSP 1 Score: 129.4 bits (324), Expect = 5.1e-29
Identity = 76/199 (38.19%), Positives = 122/199 (61.31%), Query Frame = 0

Query: 34  QPPSRAGPSRILRFVALFIVALI---VLVGLAVLIIWLTIRPKRLSYTVESASVHNFDM- 93
           QP ++  P R +  V   IVAL+   +LVGLA+LI +LT+RPKRL YTVE+ASV  F + 
Sbjct: 23  QPRAQPLPGRRMNPVLCIIVALVLLGLLVGLAILITYLTLRPKRLIYTVEAASVQEFAIG 82

Query: 94  -TDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVLNPFYQPHKNEQWLN 153
             D  +NA FS+ +++YNP K VSV Y S+  +    +Q +A   ++PF Q  KNE  + 
Sbjct: 83  NNDDHINAKFSYVIKSYNPEKHVSVRYHSMRISTAHHNQSVAHKNISPFKQRPKNETRIE 142

Query: 154 INLNAQNFLLHDSVSKDLALEKAAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV 213
             L + N  L    ++DL  EK+ G ++++++I AR+ +K  +++S  RTL+  C+PV++
Sbjct: 143 TQLVSHNVALSKFNARDLRAEKSKGTIEMEVYITARVSYKTWIFRSRRRTLKAVCTPVMI 202

Query: 214 YLSKSKT--FKKTTCFTEV 226
            ++ S    F++  C T +
Sbjct: 203 NVTSSSLDGFQRVLCKTRL 221

BLAST of CaUC01G012540 vs. ExPASy Swiss-Prot
Match: Q9SJ52 (NDR1/HIN1-like protein 10 OS=Arabidopsis thaliana OX=3702 GN=NHL10 PE=2 SV=1)

HSP 1 Score: 94.4 bits (233), Expect = 1.8e-18
Identity = 63/192 (32.81%), Positives = 99/192 (51.56%), Query Frame = 0

Query: 32  PPQPPS--RAGPSR-----ILRFVALFIVALIVLVGLAVLIIWLTIRPKRLSYTVESASV 91
           PP P    R G  R     +L      I++LIV++G+A LI WL +RP+ + + V  AS+
Sbjct: 18  PPAPKGYYRRGHGRGCGCCLLSLFVKVIISLIVILGVAALIFWLIVRPRAIKFHVTDASL 77

Query: 92  HNFDMT--DTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVLNPFYQPHK 151
             FD T  D  L  + +  V   NPNKR+ +YYD I A   +  +  +   L PFYQ HK
Sbjct: 78  TRFDHTSPDNILRYNLALTVPVRNPNKRIGLYYDRIEAHAYYEGKRFSTITLTPFYQGHK 137

Query: 152 NEQWLNINLNAQNFLLHDS-VSKDLALEKAAGEMDLDLWIKARIRFKVGVWKSAHRTLRI 211
           N   L      QN ++ ++  S+ L  E+ +G  ++++  + R+RFK+G  K      ++
Sbjct: 138 NTTVLTPTFQGQNLVIFNAGQSRTLNAERISGVYNIEIKFRLRVRFKLGDLKFRRIKPKV 197

Query: 212 RCSPVIVYLSKS 214
            C  + + LS S
Sbjct: 198 DCDDLRLPLSTS 209

BLAST of CaUC01G012540 vs. ExPASy Swiss-Prot
Match: Q9SRN1 (NDR1/HIN1-like protein 2 OS=Arabidopsis thaliana OX=3702 GN=NHL2 PE=2 SV=1)

HSP 1 Score: 83.2 bits (204), Expect = 4.2e-15
Identity = 51/174 (29.31%), Positives = 88/174 (50.57%), Query Frame = 0

Query: 44  ILRFVALFIVALIVLVGLAVLIIWLTIRPKRLSYTVESASVHNFDM-TDTQLNASFSFGV 103
           IL  +   ++A+ V++G+A LI+WL  RP  + + V  A+++ F    +  L+ S     
Sbjct: 51  ILSLICNILIAVAVILGVAALILWLIFRPNAVKFYVADANLNRFSFDPNNNLHYSLDLNF 110

Query: 104 RAYNPNKRVSVYYDSITATVGFGDQDLAFGVLNPFYQPHKNEQWLNINLNAQNF-LLHDS 163
              NPN+RV VYYD  + +  +GDQ      ++ FYQ HKN   +   +  QN  +L D 
Sbjct: 111 TIRNPNQRVGVYYDEFSVSGYYGDQRFGSANVSSFYQGHKNTTVILTKIEGQNLVVLGDG 170

Query: 164 VSKDLALEKAAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIVYLSKSKT 216
              DL  ++ +G   ++  ++  +RFK    KS     +I+C  + + L  S +
Sbjct: 171 ARTDLKDDEKSGIYRINAKLRLSVRFKFWFIKSWKLKPKIKCDDLKIPLGSSNS 224

BLAST of CaUC01G012540 vs. ExPASy Swiss-Prot
Match: Q9FNH6 (NDR1/HIN1-like protein 3 OS=Arabidopsis thaliana OX=3702 GN=NHL3 PE=1 SV=1)

HSP 1 Score: 82.8 bits (203), Expect = 5.4e-15
Identity = 63/188 (33.51%), Positives = 94/188 (50.00%), Query Frame = 0

Query: 44  ILRFVALFIVALIVLVGLAVLIIWLTIRPKRLSYTVESASVHNFDMTDT---QLNASFSF 103
           IL  +   ++ + VL+G+A LIIWL  RP  + + V  A +  F +  T   + N   +F
Sbjct: 44  ILSVIFNILITIAVLLGIAALIIWLIFRPNAIKFHVTDAKLTEFTLDPTNNLRYNLDLNF 103

Query: 104 GVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVLN---PFYQPHKNEQWLNINLNAQNFL 163
            +R  NPN+R+ VYYD I     +GDQ   FG+ N    FYQ HKN   +   L  Q  +
Sbjct: 104 TIR--NPNRRIGVYYDEIEVRGYYGDQ--RFGMSNNISKFYQGHKNTTVVGTKLVGQQLV 163

Query: 164 LHD-SVSKDLALEKAAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIVYLSKSKT- 222
           L D    KDL  +  +    +D  ++ +IRFK G+ KS     +I+C   +   S S + 
Sbjct: 164 LLDGGERKDLNEDVNSQIYRIDAKLRLKIRFKFGLIKSWRFKPKIKCDLKVPLTSNSTSG 223

BLAST of CaUC01G012540 vs. ExPASy Swiss-Prot
Match: Q9FI03 (NDR1/HIN1-like protein 26 OS=Arabidopsis thaliana OX=3702 GN=NHL26 PE=2 SV=1)

HSP 1 Score: 75.9 bits (185), Expect = 6.6e-13
Identity = 45/189 (23.81%), Positives = 90/189 (47.62%), Query Frame = 0

Query: 27  LSMAGPPQPPSRAGPSRILRFVALFIVALIVLVGLAVLI--IWLTIRPKRLSYTVESASV 86
           +S+  P     + G +   R   LF        GL ++I  +WL + P+R  +++  A +
Sbjct: 4   ISITSPKHCAKKGGININNRHKKLFFTFSTFFSGLLLIIFLVWLILHPERPEFSLTEADI 63

Query: 87  HNFDMTDTQ---LNASFSFGVRAYNPNKRVSVYYDSITATVGF-GDQDLAFGVLNPFYQP 146
           ++ ++T +    LN+S    + + NPNK+V +YYD +     + G Q  +   L PFYQ 
Sbjct: 64  YSLNLTTSSTHLLNSSVQLTLFSKNPNKKVGIYYDKLLVYAAYRGQQITSEASLPPFYQS 123

Query: 147 HKNEQWLNINLNAQNFLLHDSVSKDLALEKAAGEMDLDLWIKARIRFKVGVWKSAHRTLR 206
           H+    L   L      +  S    ++ E++ G++ + + +  ++R+K+G W S      
Sbjct: 124 HEEINLLTAFLQGTELPVAQSFGYQISRERSTGKIIIGMKMDGKLRWKIGTWVSGAYRFN 183

Query: 207 IRCSPVIVY 210
           + C  ++ +
Sbjct: 184 VNCLAIVAF 192

BLAST of CaUC01G012540 vs. ExPASy TrEMBL
Match: A0A5D3DBK2 (LEA_2 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold284G00070 PE=4 SV=1)

HSP 1 Score: 375.6 bits (963), Expect = 1.5e-100
Identity = 186/197 (94.42%), Positives = 195/197 (98.98%), Query Frame = 0

Query: 29  MAGPPQPPSRAGPSRILRFVALFIVALIVLVGLAVLIIWLTIRPKRLSYTVESASVHNFD 88
           MAGPPQPPSRAGPSRILRFV +F+VALI+LVGLAVLIIWLTIRPKRLSYTVESA VHNFD
Sbjct: 1   MAGPPQPPSRAGPSRILRFVIIFLVALIILVGLAVLIIWLTIRPKRLSYTVESAEVHNFD 60

Query: 89  MTDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVLNPFYQPHKNEQWLN 148
           MT+TQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVL+PFYQPHK+EQWLN
Sbjct: 61  MTNTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVLSPFYQPHKDEQWLN 120

Query: 149 INLNAQNFLLHDSVSKDLALEKAAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV 208
           I+LNAQNFLLHDSVSKDLALE++AGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV
Sbjct: 121 IHLNAQNFLLHDSVSKDLALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV 180

Query: 209 YLSKSKTFKKTTCFTEV 226
           YLSKSKTFKKTTCFTEV
Sbjct: 181 YLSKSKTFKKTTCFTEV 197

BLAST of CaUC01G012540 vs. ExPASy TrEMBL
Match: A0A1S3CLP6 (uncharacterized protein At1g08160 OS=Cucumis melo OX=3656 GN=LOC103502251 PE=4 SV=1)

HSP 1 Score: 375.6 bits (963), Expect = 1.5e-100
Identity = 186/197 (94.42%), Positives = 195/197 (98.98%), Query Frame = 0

Query: 29  MAGPPQPPSRAGPSRILRFVALFIVALIVLVGLAVLIIWLTIRPKRLSYTVESASVHNFD 88
           MAGPPQPPSRAGPSRILRFV +F+VALI+LVGLAVLIIWLTIRPKRLSYTVESA VHNFD
Sbjct: 1   MAGPPQPPSRAGPSRILRFVIIFLVALIILVGLAVLIIWLTIRPKRLSYTVESAEVHNFD 60

Query: 89  MTDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVLNPFYQPHKNEQWLN 148
           MT+TQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVL+PFYQPHK+EQWLN
Sbjct: 61  MTNTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVLSPFYQPHKDEQWLN 120

Query: 149 INLNAQNFLLHDSVSKDLALEKAAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV 208
           I+LNAQNFLLHDSVSKDLALE++AGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV
Sbjct: 121 IHLNAQNFLLHDSVSKDLALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV 180

Query: 209 YLSKSKTFKKTTCFTEV 226
           YLSKSKTFKKTTCFTEV
Sbjct: 181 YLSKSKTFKKTTCFTEV 197

BLAST of CaUC01G012540 vs. ExPASy TrEMBL
Match: A0A0A0LVK3 (LEA_2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G132720 PE=4 SV=1)

HSP 1 Score: 371.3 bits (952), Expect = 2.8e-99
Identity = 183/197 (92.89%), Positives = 194/197 (98.48%), Query Frame = 0

Query: 29  MAGPPQPPSRAGPSRILRFVALFIVALIVLVGLAVLIIWLTIRPKRLSYTVESASVHNFD 88
           MAGPPQP SR+GPSRILRFV +F+VALI+LVGLAVLIIWLT+RPKRLSYTVESA VHNFD
Sbjct: 1   MAGPPQPLSRSGPSRILRFVIIFLVALIILVGLAVLIIWLTVRPKRLSYTVESAEVHNFD 60

Query: 89  MTDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVLNPFYQPHKNEQWLN 148
           MTDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDL+FGVL+PFYQPHKNEQWLN
Sbjct: 61  MTDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLSFGVLSPFYQPHKNEQWLN 120

Query: 149 INLNAQNFLLHDSVSKDLALEKAAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV 208
           I+LNAQNFLLHDSVSK+LALE++AGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV
Sbjct: 121 IHLNAQNFLLHDSVSKELALERSAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV 180

Query: 209 YLSKSKTFKKTTCFTEV 226
           YLSKSKTFKKTTCFTEV
Sbjct: 181 YLSKSKTFKKTTCFTEV 197

BLAST of CaUC01G012540 vs. ExPASy TrEMBL
Match: A0A6J1FH28 (uncharacterized protein At1g08160 OS=Cucurbita moschata OX=3662 GN=LOC111443929 PE=4 SV=1)

HSP 1 Score: 326.2 bits (835), Expect = 1.0e-85
Identity = 161/197 (81.73%), Positives = 176/197 (89.34%), Query Frame = 0

Query: 29  MAGPPQPPSRAGPSRILRFVALFIVALIVLVGLAVLIIWLTIRPKRLSYTVESASVHNFD 88
           MAGPPQ  SR     ILR+V L +VALIVLVGL VLIIWLT+RPKRLSYTVESA+VHNFD
Sbjct: 1   MAGPPQLSSRPARPNILRYVILVLVALIVLVGLVVLIIWLTVRPKRLSYTVESAAVHNFD 60

Query: 89  MTDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVLNPFYQPHKNEQWLN 148
           M+ TQLNASF+FGV+AYNPN+ VSVYYD +T TVGFGDQDLAFGV+ PFYQPHK+  WLN
Sbjct: 61  MSTTQLNASFNFGVKAYNPNRHVSVYYDHVTVTVGFGDQDLAFGVIKPFYQPHKDVTWLN 120

Query: 149 INLNAQNFLLHDSVSKDLALEKAAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV 208
           ++LNA+NFLLHDSVSKDLALEKAAGEMDLDLWIKARIRFKVGVWKS HRTLRIRCSPVIV
Sbjct: 121 MDLNAKNFLLHDSVSKDLALEKAAGEMDLDLWIKARIRFKVGVWKSGHRTLRIRCSPVIV 180

Query: 209 YLSKSKTFKKTTCFTEV 226
           YLSK K FK+T CFTEV
Sbjct: 181 YLSKDKEFKRTACFTEV 197

BLAST of CaUC01G012540 vs. ExPASy TrEMBL
Match: A0A6J1I194 (uncharacterized protein At1g08160 OS=Cucurbita maxima OX=3661 GN=LOC111468522 PE=4 SV=1)

HSP 1 Score: 318.9 bits (816), Expect = 1.7e-83
Identity = 157/197 (79.70%), Positives = 173/197 (87.82%), Query Frame = 0

Query: 29  MAGPPQPPSRAGPSRILRFVALFIVALIVLVGLAVLIIWLTIRPKRLSYTVESASVHNFD 88
           MAGPPQ   R     ILR+V L +VALIVLVGL VLIIWLT+RPKRL YTVESA+VHNFD
Sbjct: 1   MAGPPQLSPRPARPNILRYVILVLVALIVLVGLVVLIIWLTVRPKRLRYTVESAAVHNFD 60

Query: 89  MTDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVLNPFYQPHKNEQWLN 148
           M+ TQLNASF+FGV+AYNPN+ VSVYYD +T TVGFGDQDLAFGV+ PFYQPHK+  WLN
Sbjct: 61  MSTTQLNASFNFGVKAYNPNRHVSVYYDHVTVTVGFGDQDLAFGVIKPFYQPHKDVTWLN 120

Query: 149 INLNAQNFLLHDSVSKDLALEKAAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV 208
           ++LNA+NFLLHDSVSKDLALEKAAGEMDLDLWIKARIR+KVGVWK  HRTLRIRCSPVIV
Sbjct: 121 MDLNAKNFLLHDSVSKDLALEKAAGEMDLDLWIKARIRYKVGVWKLGHRTLRIRCSPVIV 180

Query: 209 YLSKSKTFKKTTCFTEV 226
           YLSK K FK+T CFTEV
Sbjct: 181 YLSKDKEFKRTACFTEV 197

BLAST of CaUC01G012540 vs. TAIR 10
Match: AT5G22870.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 157.5 bits (397), Expect = 1.2e-38
Identity = 82/195 (42.05%), Positives = 129/195 (66.15%), Query Frame = 0

Query: 32  PPQPPSRAGPSRILRFVALFIVALIVLVGLAVLIIWLTIRPKRLSYTVESASVHNFDMT- 91
           P QP  R  PS ++ ++ L I+ LI +  +  LI WL  +PK+L YTVE+ASV NF++T 
Sbjct: 16  PAQPLRR--PS-LICYIFLVILTLIFMAAVGFLITWLETKPKKLRYTVENASVQNFNLTN 75

Query: 92  DTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVLNPFYQPHKNEQWLNIN 151
           D  ++A+F F ++++NPN R+SVYY S+   V F DQ LAF  + PF+QP  N + ++  
Sbjct: 76  DNHMSATFQFTIQSHNPNHRISVYYSSVEIFVKFKDQTLAFDTVEPFHQPRMNVKQIDET 135

Query: 152 LNAQNFLLHDSVSKDLALEKAAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIVYL 211
           L A+N  +  S  KDL  + + G++  ++++KAR+RFKVG+WKS+HRT +I+CS V V L
Sbjct: 136 LIAENVAVSKSNGKDLRSQNSLGKIGFEVFVKARVRFKVGIWKSSHRTAKIKCSHVTVSL 195

Query: 212 SKSKTFKKTTCFTEV 226
           S+    + ++C  ++
Sbjct: 196 SQPNKSQNSSCDADI 207

BLAST of CaUC01G012540 vs. TAIR 10
Match: AT1G08160.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 129.4 bits (324), Expect = 3.6e-30
Identity = 76/199 (38.19%), Positives = 122/199 (61.31%), Query Frame = 0

Query: 34  QPPSRAGPSRILRFVALFIVALI---VLVGLAVLIIWLTIRPKRLSYTVESASVHNFDM- 93
           QP ++  P R +  V   IVAL+   +LVGLA+LI +LT+RPKRL YTVE+ASV  F + 
Sbjct: 23  QPRAQPLPGRRMNPVLCIIVALVLLGLLVGLAILITYLTLRPKRLIYTVEAASVQEFAIG 82

Query: 94  -TDTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVLNPFYQPHKNEQWLN 153
             D  +NA FS+ +++YNP K VSV Y S+  +    +Q +A   ++PF Q  KNE  + 
Sbjct: 83  NNDDHINAKFSYVIKSYNPEKHVSVRYHSMRISTAHHNQSVAHKNISPFKQRPKNETRIE 142

Query: 154 INLNAQNFLLHDSVSKDLALEKAAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIV 213
             L + N  L    ++DL  EK+ G ++++++I AR+ +K  +++S  RTL+  C+PV++
Sbjct: 143 TQLVSHNVALSKFNARDLRAEKSKGTIEMEVYITARVSYKTWIFRSRRRTLKAVCTPVMI 202

Query: 214 YLSKSKT--FKKTTCFTEV 226
            ++ S    F++  C T +
Sbjct: 203 NVTSSSLDGFQRVLCKTRL 221

BLAST of CaUC01G012540 vs. TAIR 10
Match: AT2G35980.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 94.4 bits (233), Expect = 1.3e-19
Identity = 63/192 (32.81%), Positives = 99/192 (51.56%), Query Frame = 0

Query: 32  PPQPPS--RAGPSR-----ILRFVALFIVALIVLVGLAVLIIWLTIRPKRLSYTVESASV 91
           PP P    R G  R     +L      I++LIV++G+A LI WL +RP+ + + V  AS+
Sbjct: 18  PPAPKGYYRRGHGRGCGCCLLSLFVKVIISLIVILGVAALIFWLIVRPRAIKFHVTDASL 77

Query: 92  HNFDMT--DTQLNASFSFGVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVLNPFYQPHK 151
             FD T  D  L  + +  V   NPNKR+ +YYD I A   +  +  +   L PFYQ HK
Sbjct: 78  TRFDHTSPDNILRYNLALTVPVRNPNKRIGLYYDRIEAHAYYEGKRFSTITLTPFYQGHK 137

Query: 152 NEQWLNINLNAQNFLLHDS-VSKDLALEKAAGEMDLDLWIKARIRFKVGVWKSAHRTLRI 211
           N   L      QN ++ ++  S+ L  E+ +G  ++++  + R+RFK+G  K      ++
Sbjct: 138 NTTVLTPTFQGQNLVIFNAGQSRTLNAERISGVYNIEIKFRLRVRFKLGDLKFRRIKPKV 197

Query: 212 RCSPVIVYLSKS 214
            C  + + LS S
Sbjct: 198 DCDDLRLPLSTS 209

BLAST of CaUC01G012540 vs. TAIR 10
Match: AT3G11650.1 (NDR1/HIN1-like 2 )

HSP 1 Score: 83.2 bits (204), Expect = 3.0e-16
Identity = 51/174 (29.31%), Positives = 88/174 (50.57%), Query Frame = 0

Query: 44  ILRFVALFIVALIVLVGLAVLIIWLTIRPKRLSYTVESASVHNFDM-TDTQLNASFSFGV 103
           IL  +   ++A+ V++G+A LI+WL  RP  + + V  A+++ F    +  L+ S     
Sbjct: 51  ILSLICNILIAVAVILGVAALILWLIFRPNAVKFYVADANLNRFSFDPNNNLHYSLDLNF 110

Query: 104 RAYNPNKRVSVYYDSITATVGFGDQDLAFGVLNPFYQPHKNEQWLNINLNAQNF-LLHDS 163
              NPN+RV VYYD  + +  +GDQ      ++ FYQ HKN   +   +  QN  +L D 
Sbjct: 111 TIRNPNQRVGVYYDEFSVSGYYGDQRFGSANVSSFYQGHKNTTVILTKIEGQNLVVLGDG 170

Query: 164 VSKDLALEKAAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIVYLSKSKT 216
              DL  ++ +G   ++  ++  +RFK    KS     +I+C  + + L  S +
Sbjct: 171 ARTDLKDDEKSGIYRINAKLRLSVRFKFWFIKSWKLKPKIKCDDLKIPLGSSNS 224

BLAST of CaUC01G012540 vs. TAIR 10
Match: AT5G06320.1 (NDR1/HIN1-like 3 )

HSP 1 Score: 82.8 bits (203), Expect = 3.9e-16
Identity = 63/188 (33.51%), Positives = 94/188 (50.00%), Query Frame = 0

Query: 44  ILRFVALFIVALIVLVGLAVLIIWLTIRPKRLSYTVESASVHNFDMTDT---QLNASFSF 103
           IL  +   ++ + VL+G+A LIIWL  RP  + + V  A +  F +  T   + N   +F
Sbjct: 44  ILSVIFNILITIAVLLGIAALIIWLIFRPNAIKFHVTDAKLTEFTLDPTNNLRYNLDLNF 103

Query: 104 GVRAYNPNKRVSVYYDSITATVGFGDQDLAFGVLN---PFYQPHKNEQWLNINLNAQNFL 163
            +R  NPN+R+ VYYD I     +GDQ   FG+ N    FYQ HKN   +   L  Q  +
Sbjct: 104 TIR--NPNRRIGVYYDEIEVRGYYGDQ--RFGMSNNISKFYQGHKNTTVVGTKLVGQQLV 163

Query: 164 LHD-SVSKDLALEKAAGEMDLDLWIKARIRFKVGVWKSAHRTLRIRCSPVIVYLSKSKT- 222
           L D    KDL  +  +    +D  ++ +IRFK G+ KS     +I+C   +   S S + 
Sbjct: 164 LLDGGERKDLNEDVNSQIYRIDAKLRLKIRFKFGLIKSWRFKPKIKCDLKVPLTSNSTSG 223

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value   Identity (%)  Description
KAE8652783.1    4.0e-108  90.45         hypothetical protein Csa_022856 [Cucumis sativus]
XP_008464346.1  3.1e-100  94.42         PREDICTED: uncharacterized protein At1g08160 [Cucumis melo] >KAA0032608.1 unchar...
XP_011654115.1  5.8e-99   92.89         uncharacterized protein At1g08160 [Cucumis sativus]
XP_038894942.1  1.3e-98   92.89         uncharacterized protein At1g08160-like [Benincasa hispida]
KAG7020792.1    7.4e-86   81.73         hypothetical protein SDJN02_17480, partial [Cucurbita argyrosperma subsp. argyro...

ExPASy Swiss-Prot
Match Name      E-value   Identity (%)  Description
Q8VZ13          5.1e-29   38.19         Uncharacterized protein At1g08160 OS=Arabidopsis thaliana OX=3702 GN=At1g08160 P...
Q9SJ52          1.8e-18   32.81         NDR1/HIN1-like protein 10 OS=Arabidopsis thaliana OX=3702 GN=NHL10 PE=2 SV=1
Q9SRN1          4.2e-15   29.31         NDR1/HIN1-like protein 2 OS=Arabidopsis thaliana OX=3702 GN=NHL2 PE=2 SV=1
Q9FNH6          5.4e-15   33.51         NDR1/HIN1-like protein 3 OS=Arabidopsis thaliana OX=3702 GN=NHL3 PE=1 SV=1
Q9FI03          6.6e-13   23.81         NDR1/HIN1-like protein 26 OS=Arabidopsis thaliana OX=3702 GN=NHL26 PE=2 SV=1

ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A5D3DBK2      1.5e-100  94.42         LEA_2 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_...
A0A1S3CLP6      1.5e-100  94.42         uncharacterized protein At1g08160 OS=Cucumis melo OX=3656 GN=LOC103502251 PE=4 S...
A0A0A0LVK3      2.8e-99   92.89         LEA_2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G132720 PE=4 ...
A0A6J1FH28      1.0e-85   81.73         uncharacterized protein At1g08160 OS=Cucurbita moschata OX=3662 GN=LOC111443929 ...
A0A6J1I194      1.7e-83   79.70         uncharacterized protein At1g08160 OS=Cucurbita maxima OX=3661 GN=LOC111468522 PE...

TAIR 10
Match Name      E-value   Identity (%)  Description
AT5G22870.1     1.2e-38   42.05         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
AT1G08160.1     3.6e-30   38.19         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
AT2G35980.1     1.3e-19   32.81         Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
AT3G11650.1     3.0e-16   29.31         NDR1/HIN1-like 2
AT5G06320.1     3.9e-16   33.51         NDR1/HIN1-like 3

InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31

IPR Term   IPR Description                                      Source   Source Term    Source Description                                                         Alignment
IPR004864  Late embryogenesis abundant protein, LEA_2 subgroup  PFAM     PF03168        LEA_2                                                                      coord: 102..203; e-value: 3.5E-14; score: 53.2
None       No IPR available                                     PANTHER  PTHR31852      LATE EMBRYOGENESIS ABUNDANT (LEA) HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY  coord: 40..224
None       No IPR available                                     PANTHER  PTHR31852:SF5  GB|AAF18257.1                                                              coord: 40..224

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CaUC01G012540.1  CaUC01G012540.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0046658      anchored component of plasma membrane
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0009506      plasmodesma