CaUC04G072820 (gene) Watermelon (USVL246-FR2) v1

Overview
NameCaUC04G072820
Typegene
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
Descriptionlate embryogenesis abundant protein B19.4
LocationCiama_Chr04: 21855312 .. 21858998 (+)
RNA-Seq ExpressionCaUC04G072820
SyntenyCaUC04G072820
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAGTTTCTCTAGAAGAATTAGAAGGTCGGGCAACACAAATGGCATCGCAACAGGAAAGATCAGAGCTTGACGCTAAAGCAAAGCAGGGTGAGACTGTGGTTCCTGGCGGAACCGGCGGCAAAAGCTTTGAAGCTCAAGAACGTCTCGCTGAAGGTTCTTAACTGTCTCTCAATTGTTAATGTACTATGTATTTCTGCTTTAATAACACAATACGATAGAAAACATTTTTAAGTATTTCTTGAAAAAAACTTAGATTTTATCTTTGTCCAACAAAGAAAAAAACTCTAATGTTGATAATCTTTTTTCAAAAAAAATGATTTTTTTTATAGTTTTGTTAACTCGTTTGTTTCTTGGGTCGTGGCGACATGGAGGTCGGAGTCGGGGTGGGCAGACGAGGAAGGAGCAATTGGGGCATGAAGGGTATCAGGAGCTAGGGCACCAGGGAGGAGAGGCAAGAAGGGAGCAGATGGGCCATGAAGGCTACCAAGAAATGGGTCGTAAAGGAGGGCTGAGCACCATGGACAAGTCCGGTGGTGAGCGGGCAGCGGAGGAAGGAATCGAGATCGACGAGTCCAAGTTCAGGACCAAGGACCGTTGATCAGAGCTTATGGCTCTGTGTGCATACAAGATGATGATCCAGCTGACGTTGTTTTGAAAGTAGCTGTGTGGCGTGTTCTTGTCGTGTTTTTTCTTTTTTTCTTTTCTTTTTTCCTTCTTGTTTATGTATTTCGGTAAGGTTGCTTTGTAATAGTTAATGTCTTTTTAAGTCTGTTTTTTTTTCCGTTTCTGGCCACATGTACGTCGACACGTCAGGGTTTTGGTTAAATTGGACTTGAACAAGGACGAACTTTGCTTTATTATGTGAATTTTAGATGTCTTATTTTTATTTGCATGGAAAAAGGAAAGGATCTTGGCCAACTTTCATTCGCATTATCATGAACACTCTCCCTATTCTCAATGGGTTAAGATGGAAATTTCATGTTTCTCTTTCCAGATCGAATTTGTGCTTAGTTAAGTTCACTATGTGAAGATAAAGAAACTGGAACAAACTAACTCATAATTGTTTCAATTAAATGTTGGAACTAATGTGAGTTTGATTTTGAAATTGAAATTTAGGGCGGAGTGTAAATTGATATATCCAAATATAAATTGATTGATTAGACTATGGGAGATTATGGAAGATTTTTTAGAGTTTGATCCACTCAAATTAGAAGTTTCATCAATGAAATTAAGGAAATACACAAAAAAAAAATTAAATGTGGTTTTCTAAAAATATGAAGAATAAAGTACCACATCAATTTTCATCTTTCATTTAGAGAAAATAAAGTTCAAATAAGAATTTTGTACGATTTAATTTTAAAAAAAGAAAACCAACTTACAGATGCGCTTCCAACTTATATGTGGTTTTCTAGAGCTAATGCTTTGTTTGACATTTTACCATGATACCTTACTGGATAAAAACTTGAAGTTGGACTACTATGATGACATCAAAATCCACTCCATTTGATCCCAACTTCTAATTAGATATTAGGTAAACACTTTTGGTCCAATAAAAGTAAAGATTAAATTAAACCATTATCAGCAAAGGTTTTAAGCCTAAATTGTTATTTCAAATAGAAAGATAAAAACTACTTTTTAACTGATTTAAAAATAGCTATGAGCCAATAATCTAATGACAAAATTTTAGCATTCTGTTCCCACTATAAATTTGTGTTAGTAAATTGAGCACTTAAATATTTATCGTGCTGAATATTTTAAGGGTTCTTGATTTTTGTTTTTCATGGTTTAAAAAACTAAATCTAAAGCTGGGGTTTAAATTTCTAGTTAAAATATTATTTTGGTATCTAACTGGTCAAAATTTTAATCCCAACGTCGAATTTTAATCTATATCCTTAGACTTATTTTAAAACAACTTTGTCATACATATTTATTTATTTTATTTTCTGAAAATTATTGTTATCATTATTATTTAATCAATTTTGGTAAAAGTTAGTTTCAGGAATTAAAACGGGACTAAAATGGAATAAACTGGGACTGAAAATGGTATTTGACCTAAATTTTTCTACCTTATCGTAGTACTGTTGATAGAAGGGAATCAGATCTTAGAACCGCTATCTCGTCTCAACACGTGTCATCCGTGGGACACTAGAAGAAACTTTTTGACTGAGTAAGCACCTACGTGTCACCCACGCAATGTTCTACAATTCGTTATCAGCCACGTAGGCACTCCAGAGTTGGGAATAAGAGTTCTCAAATTCTATATAAACCCAGAGGGACAAGAAGAGACAGGCATAAAACTCTTATCTTCTCTTGTCGAAGCTAAGCATTTGGAAGAGTTGAAGAAAGAGATGTCGTCTGAGCAAGAAAGAAGTGAACTCGACGCCAGGGCCAGGCAAGGGGAGACTGTCGTCCCCGGTGGAACTGGGGGCAAGAGTCTCGAAGCTCAGGAGCACCTTGCTGAAGGTCTGACGACTTGTCGACTTTTCTAGTTTCCAACCCTATTGCTCCATTTTAAACTTACCTTAGTTAAAAAGCACTTTTTATCTAAATTTAACCATTTGAATCCTTTAGCTCTGCTACTTAATTATTTAAGTCTTAATACTTTTTAACTTGATATAATATAATCAAAATGAAGTGTCGCTACTATAAAACAAGGTTTATTATTCTACAAATTTAATAAATTTCATTTGGTATCTAAAACTTTTTAAATGGTTAAATTAAAATTCTAGTCTCTAATTATTTTTATCTCGTTGATTCGAGACCTGACCGAACTTCAAATCTTGTGGCTGAAGTCTCTATACTTCTAATTTTCTATATTTAATAAGGCTATGATATATGCAACTTTTTTTCTTGTCTTTTTTTCTTAAAAAAAAAACTAAAAACTTATAGGAAAGGGATAGTTTTAAAAAAAACTTATAAAAAAGGGATAGTTTTTAAATATAACAAAATCCGTGTAGCGAAATGTTTAATAAATATAGTTAAATTAGACTACTATCATTCATGACCAACTACTACGGATAAATCGACTACTATTTATTTAGAATTATTGCCATCACTAAAAAGAATGGCTATATTTAAAAATATTTTAAGCAATTTCGCTATTTAAAATAATTATTCAAAAAAATATTATATTTGATAATTTTTTAAGTAAATTTTAGTTAATAATTGATCTATTAGACCTACTCTAAAAATTTTGTCTTATTAGATACAATTTTACAGAAAATTTTGTAGGACTGAAAATTGAAACTAATTTATAGGCTAAACTTTTAAAGAAAATTATTATAAATTGAAAAAATATTAGAACTTCTATCGCTAACATCCCAGAACTTTTAAGTTAATGTTTAAAAATGTCTCTAAATAGCATAGAGACCACGCAAAACACTCTTGAAAGTACTAATGTTGAGGCACATTTTAAACCAATGCCAACCGCCAAAGTGAAAAGTGGTGACTCATTCTGATGGTGATGCAGGGCGGAGCCGTGGGGGCCAGACAAGGAAGGAGCAGCTAGGACACGAAGGGTACCAAGAGATGGGCCGTAAAGGAGGGCTAAGCAACACGGGTATGCCAGGAGGAGAGCGTGCTGCTGAGGAAGGGGTTGAAATTGACGAATCCAAGTTCAGGACTAAGTAGAAGAAAGCCTTTCACAATGTCGTTGAGTTTCAAGTTCTAAGTTCCATTTTCAGCTTT

mRNA sequence

AAGTTTCTCTAGAAGAATTAGAAGGTCGGGCAACACAAATGGCATCGCAACAGGAAAGATCAGAGCTTGACGCTAAAGCAAAGCAGGGTGAGACTGTGGTTCCTGGCGGAACCGGCGGCAAAAGCTTTGAAGCTCAAGAACGTCTCGCTGAAGGTCGGAGTCGGGGTGGGCAGACGAGGAAGGAGCAATTGGGGCATGAAGGGTATCAGGAGCTAGGGCACCAGGGAGGAGAGGCAAGAAGGGAGCAGATGGGCCATGAAGGCTACCAAGAAATGGGTCGTAAAGGAGGGCTGAGCACCATGGACAAGTCCGGTGGTGAGCGGGCAGCGGAGGAAGGAATCGAGATCGACGAGTCCAAGTTCAGGACCAAGGACCCTAAGCATTTGGAAGAGTTGAAGAAAGAGATGTCGTCTGAGCAAGAAAGAAGTGAACTCGACGCCAGGGCCAGGCAAGGGGAGACTGTCGTCCCCGGTGGAACTGGGGGCAAGAGTCTCGAAGCTCAGGAGCACCTTGCTGAAGGGCGGAGCCGTGGGGGCCAGACAAGGAAGGAGCAGCTAGGACACGAAGGGTACCAAGAGATGGGCCGTAAAGGAGGGCTAAGCAACACGGGTATGCCAGGAGGAGAGCGTGCTGCTGAGGAAGGGGTTGAAATTGACGAATCCAAGTTCAGGACTAAGTAGAAGAAAGCCTTTCACAATGTCGTTGAGTTTCAAGTTCTAAGTTCCATTTTCAGCTTT

Coding sequence (CDS)

ATGGCATCGCAACAGGAAAGATCAGAGCTTGACGCTAAAGCAAAGCAGGGTGAGACTGTGGTTCCTGGCGGAACCGGCGGCAAAAGCTTTGAAGCTCAAGAACGTCTCGCTGAAGGTCGGAGTCGGGGTGGGCAGACGAGGAAGGAGCAATTGGGGCATGAAGGGTATCAGGAGCTAGGGCACCAGGGAGGAGAGGCAAGAAGGGAGCAGATGGGCCATGAAGGCTACCAAGAAATGGGTCGTAAAGGAGGGCTGAGCACCATGGACAAGTCCGGTGGTGAGCGGGCAGCGGAGGAAGGAATCGAGATCGACGAGTCCAAGTTCAGGACCAAGGACCCTAAGCATTTGGAAGAGTTGAAGAAAGAGATGTCGTCTGAGCAAGAAAGAAGTGAACTCGACGCCAGGGCCAGGCAAGGGGAGACTGTCGTCCCCGGTGGAACTGGGGGCAAGAGTCTCGAAGCTCAGGAGCACCTTGCTGAAGGGCGGAGCCGTGGGGGCCAGACAAGGAAGGAGCAGCTAGGACACGAAGGGTACCAAGAGATGGGCCGTAAAGGAGGGCTAAGCAACACGGGTATGCCAGGAGGAGAGCGTGCTGCTGAGGAAGGGGTTGAAATTGACGAATCCAAGTTCAGGACTAAGTAG

Protein sequence

MASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKHLEELKKEMSSEQERSELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK
Homology
BLAST of CaUC04G072820 vs. NCBI nr
Match: KAG7033764.1 (Em-like protein GEA1, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 321.2 bits (822), Expect = 6.6e-84
Identity = 180/213 (84.51%), Postives = 188/213 (88.26%), Query Frame = 0

Query: 1   MASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELG 60
           MA+QQERSEL+AKAKQGETVVPGGTGGKS EAQER    RSRGGQTRKEQLGHEGYQE+G
Sbjct: 1   MAAQQERSELEAKAKQGETVVPGGTGGKSLEAQER----RSRGGQTRKEQLGHEGYQEMG 60

Query: 61  HQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKHLEELK 120
           H+GGE RREQMG EGYQEMG+KGGLSTMDKS  ER  EEGIEIDESK            +
Sbjct: 61  HRGGETRREQMGQEGYQEMGKKGGLSTMDKSAAERVEEEGIEIDESK------------E 120

Query: 121 KEMSSEQERSELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQE 180
           +EMSSEQER ELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQE
Sbjct: 121 REMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQE 180

Query: 181 MGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
           MGRKGGLSN+GMPGGERAAEEGVEIDESKFR K
Sbjct: 181 MGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK 197

BLAST of CaUC04G072820 vs. NCBI nr
Match: KAF4401425.1 (hypothetical protein G4B88_001619 [Cannabis sativa])

HSP 1 Score: 266.5 bits (680), Expect = 1.9e-67
Identity = 160/271 (59.04%), Postives = 178/271 (65.68%), Query Frame = 0

Query: 3   SQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGH- 62
           SQ++R ELDAKAKQGETVVPGGTGG+S EAQE LAEGRSRGGQTR EQLGHEGYQE+G  
Sbjct: 89  SQKQRQELDAKAKQGETVVPGGTGGQSLEAQEHLAEGRSRGGQTRSEQLGHEGYQEMGRK 148

Query: 63  -----------------------------------------------------------Q 122
                                                                      +
Sbjct: 149 GGLSTTDKSGGERAEEEGIQIDESKQELDAKARQGETVIPGGTGGKSLEAQEHLAEGRSR 208

Query: 123 GGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKHLEELKKE 182
           GG+ R EQ+GHEGYQEMGRKGGLST DKSGG+RA EEGI+IDES                
Sbjct: 209 GGQTRSEQLGHEGYQEMGRKGGLSTTDKSGGDRAEEEGIQIDES---------------- 268

Query: 183 MSSEQERSELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMG 214
            +S+++R ELDA+A+QGETVVPGGTGG+SLEAQEHLAEGRSRGGQTR EQLGHEGYQEMG
Sbjct: 269 -NSQKQRQELDAKAKQGETVVPGGTGGQSLEAQEHLAEGRSRGGQTRSEQLGHEGYQEMG 328

BLAST of CaUC04G072820 vs. NCBI nr
Match: XP_016183975.2 (LOW QUALITY PROTEIN: late embryogenesis abundant protein B19.4 [Arachis ipaensis])

HSP 1 Score: 248.8 bits (634), Expect = 4.1e-62
Identity = 150/233 (64.38%), Postives = 167/233 (71.67%), Query Frame = 0

Query: 1   MASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELG 60
           MAS+Q++ ELD +AKQGETVVPGGTGGKS EAQE LAEGRS+GGQT              
Sbjct: 1   MASKQQKQELDERAKQGETVVPGGTGGKSLEAQEHLAEGRSKGGQT-------------- 60

Query: 61  HQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTK------DPK 120
                 RREQ+G EGYQEMGRKGG STM+KSGGERA EEG+EIDESKF TK      + +
Sbjct: 61  ------RREQLGTEGYQEMGRKGGFSTMEKSGGERAEEEGVEIDESKFVTKNLNKYPEYQ 120

Query: 121 HLE--------------ELKKEMSSEQERSELDARARQGETVVPGGTGGKSLEAQEHLAE 180
           H+E              ++    S +Q R ELD RA+QGETVVPGGTGGKSLEAQEHLAE
Sbjct: 121 HIESKYNMLSLLSSNSIQVISMASKQQNRQELDERAKQGETVVPGGTGGKSLEAQEHLAE 180

Query: 181 GRSRGGQTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
           GRS+GGQTR+EQLG EGYQEMGRKGG S     GGERA EEGVEIDESKF TK
Sbjct: 181 GRSKGGQTRREQLGTEGYQEMGRKGGFSTMEKSGGERAEEEGVEIDESKFTTK 213

BLAST of CaUC04G072820 vs. NCBI nr
Match: XP_007206180.2 (late embryogenesis abundant protein B19.3 [Prunus persica])

HSP 1 Score: 236.9 bits (603), Expect = 1.6e-58
Identity = 144/235 (61.28%), Postives = 163/235 (69.36%), Query Frame = 0

Query: 5   QERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGG 64
           Q+R ELD KA++GE V+PGGTGGKS EAQE LAEGRSRGGQTRK ++             
Sbjct: 9   QKRRELDEKARKGEVVIPGGTGGKSLEAQEHLAEGRSRGGQTRKNEI------------- 68

Query: 65  EARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKHLEELKKEMS 124
                  GHEGY EMG+KGGLST DKSGGERAAEEGI +DESK++T    +     KEM+
Sbjct: 69  -------GHEGYHEMGKKGGLSTTDKSGGERAAEEGIPLDESKYKTNGRSN---DSKEMA 128

Query: 125 SEQERS------ELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQ----------- 184
           SEQERS      ELD +ARQGE VVPGGTGGKSLEAQEHLAEGRSRGGQ           
Sbjct: 129 SEQERSDPSRRKELDEKARQGEVVVPGGTGGKSLEAQEHLAEGRSRGGQTRREQVGHEGY 188

Query: 185 ---------TRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
                    TRKEQ+GHEGY+EMG+KGGLS     GGERAAEEG+ IDESK++TK
Sbjct: 189 RELGHRGGETRKEQIGHEGYREMGKKGGLSTKDKSGGERAAEEGIPIDESKYKTK 220

BLAST of CaUC04G072820 vs. NCBI nr
Match: XP_021296037.1 (late embryogenesis abundant protein B19.4 [Herrania umbratica])

HSP 1 Score: 216.1 bits (549), Expect = 3.0e-52
Identity = 135/225 (60.00%), Postives = 158/225 (70.22%), Query Frame = 0

Query: 1   MASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELG 60
           M+ QQ+R ELD +A++ E V+PGGTGGKS EA+E LAEGRSRGGQTRKEQ+  EGYQE+G
Sbjct: 1   MSYQQQREELDHRAREVEIVIPGGTGGKSLEAEEHLAEGRSRGGQTRKEQIRTEGYQEMG 60

Query: 61  HQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFR-TKDPKHLEEL 120
           HQ                    GGLST DKSGGERA EEG++I++SK+R ++  K    +
Sbjct: 61  HQ--------------------GGLSTGDKSGGERAEEEGVQIEKSKYRASQRQKRRSSV 120

Query: 121 KK------EMSSEQ-------ERSELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGG 180
           KK       M+SEQ       ER+ELDARARQGE VVPGGT GKSLEAQE LAEGR  GG
Sbjct: 121 KKPRQKGTRMASEQVKNASDEERAELDARARQGEVVVPGGTSGKSLEAQERLAEGRHPGG 180

Query: 181 QTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAEEGVEIDESKFR 212
           +  K+Q+G EGYQEMGRKGGLS T    GERAAEEG+ IDESK R
Sbjct: 181 EAGKQQIGREGYQEMGRKGGLSTTDKSDGERAAEEGMPIDESKHR 205

BLAST of CaUC04G072820 vs. ExPASy Swiss-Prot
Match: Q07187 (Em-like protein GEA1 OS=Arabidopsis thaliana OX=3702 GN=EM1 PE=2 SV=1)

HSP 1 Score: 176.0 bits (445), Expect = 4.5e-43
Identity = 112/214 (52.34%), Postives = 125/214 (58.41%), Query Frame = 0

Query: 1   MASQQ-ERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQEL 60
           MAS+Q  R ELD KAKQGETVVPGGTGG S EAQE LAEGRS+GGQTRKEQLGHEGYQE+
Sbjct: 1   MASKQLSREELDEKAKQGETVVPGGTGGHSLEAQEHLAEGRSKGGQTRKEQLGHEGYQEI 60

Query: 61  GHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKHLEEL 120
           GH+GGEAR+EQ+GHEGYQEMG KGG +  ++ G E   E G                   
Sbjct: 61  GHKGGEARKEQLGHEGYQEMGHKGGEARKEQLGHEGYQEMG------------------- 120

Query: 121 KKEMSSEQERSELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQ 180
                                                       +GG+ RKEQLGHEGY+
Sbjct: 121 -------------------------------------------HKGGEARKEQLGHEGYK 152

Query: 181 EMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
           EMGRKGGLS     GGERA EEG+EIDESKF  K
Sbjct: 181 EMGRKGGLSTMEKSGGERAEEEGIEIDESKFTNK 152

BLAST of CaUC04G072820 vs. ExPASy Swiss-Prot
Match: Q05191 (Late embryogenesis abundant protein B19.4 OS=Hordeum vulgare OX=4513 GN=B19.4 PE=2 SV=1)

HSP 1 Score: 167.2 bits (422), Expect = 2.1e-40
Identity = 105/212 (49.53%), Postives = 125/212 (58.96%), Query Frame = 0

Query: 2   ASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGH 61
           + QQERSELD  A++GETVVPGGTGGK+ EAQE LAEGRSRGGQTRKEQLG EGY+E+GH
Sbjct: 3   SGQQERSELDRMAREGETVVPGGTGGKTLEAQEHLAEGRSRGGQTRKEQLGEEGYREMGH 62

Query: 62  QGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKHLEELKK 121
           +GGE R+EQ+G EGY+EMG KGG                                 E +K
Sbjct: 63  KGGETRKEQLGEEGYREMGHKGG---------------------------------ETRK 122

Query: 122 EMSSEQERSELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEM 181
           E   E+   E+                               +GG+TRKEQ+G EGY+EM
Sbjct: 123 EQLGEEGYREMG-----------------------------HKGGETRKEQMGEEGYREM 152

Query: 182 GRKGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
           GRKGGLS     GGERAA EG++IDESKF+TK
Sbjct: 183 GRKGGLSTMNESGGERAAREGIDIDESKFKTK 152

BLAST of CaUC04G072820 vs. ExPASy Swiss-Prot
Match: I1N2Z5 (Protein SLE1 OS=Glycine max OX=3847 GN=SLE1 PE=2 SV=1)

HSP 1 Score: 163.7 bits (413), Expect = 2.3e-39
Identity = 90/110 (81.82%), Postives = 97/110 (88.18%), Query Frame = 0

Query: 1   MASQQ-ERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQEL 60
           M SQQ  R ELD KA+QGETVVPGGTGGKS EAQE LAEGRSRGGQTRK+QLG EGY E+
Sbjct: 1   MESQQANREELDEKARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKQQLGSEGYHEM 60

Query: 61  GHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFR 110
           G +GG+ R+EQMG EGYQEMGRKGGLSTMDKSGGERA EEGIEIDESKF+
Sbjct: 61  GTKGGQTRKEQMGREGYQEMGRKGGLSTMDKSGGERAEEEGIEIDESKFK 110

BLAST of CaUC04G072820 vs. ExPASy Swiss-Prot
Match: Q5KTS7 (Carrot ABA-induced in somatic embryos 3 OS=Daucus carota OX=4039 GN=CAISE3 PE=2 SV=1)

HSP 1 Score: 161.8 bits (408), Expect = 8.7e-39
Identity = 86/110 (78.18%), Postives = 98/110 (89.09%), Query Frame = 0

Query: 2   ASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGH 61
           + Q++RSELDA+AKQGETVVPGGTGGKS EAQE LAEGRS+GG TRKEQLG EGYQE+G 
Sbjct: 3   SGQEKRSELDARAKQGETVVPGGTGGKSLEAQEHLAEGRSKGGHTRKEQLGTEGYQEIGT 62

Query: 62  QGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTK 112
           +GGE RREQMG EGY++MGR GGL+T DKSG ERA EEGI+ID+SKFRTK
Sbjct: 63  KGGETRREQMGKEGYEQMGRMGGLATKDKSGAERAEEEGIDIDQSKFRTK 112

BLAST of CaUC04G072820 vs. ExPASy Swiss-Prot
Match: Q02400 (Late embryogenesis abundant protein B19.3 OS=Hordeum vulgare OX=4513 GN=B19.3 PE=2 SV=1)

HSP 1 Score: 156.8 bits (395), Expect = 2.8e-37
Identity = 88/130 (67.69%), Postives = 100/130 (76.92%), Query Frame = 0

Query: 2   ASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQ---------------- 61
           + QQERSELD  A++GETVVPGGTGGK+ EAQE LAEGRSRGGQ                
Sbjct: 3   SGQQERSELDRMAREGETVVPGGTGGKTLEAQEHLAEGRSRGGQTRKDQLGEEGYREMGH 62

Query: 62  ----TRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGI 112
               TRKEQLG EGY+E+GH+GGE R+EQMG EGY EMGRKGGLSTM++SGGERAA EGI
Sbjct: 63  KGGETRKEQLGEEGYREMGHKGGETRKEQMGEEGYHEMGRKGGLSTMEESGGERAAREGI 122

BLAST of CaUC04G072820 vs. ExPASy TrEMBL
Match: A0A7J6I2G2 (Uncharacterized protein OS=Cannabis sativa OX=3483 GN=G4B88_001619 PE=3 SV=1)

HSP 1 Score: 266.5 bits (680), Expect = 9.3e-68
Identity = 160/271 (59.04%), Postives = 178/271 (65.68%), Query Frame = 0

Query: 3   SQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGH- 62
           SQ++R ELDAKAKQGETVVPGGTGG+S EAQE LAEGRSRGGQTR EQLGHEGYQE+G  
Sbjct: 89  SQKQRQELDAKAKQGETVVPGGTGGQSLEAQEHLAEGRSRGGQTRSEQLGHEGYQEMGRK 148

Query: 63  -----------------------------------------------------------Q 122
                                                                      +
Sbjct: 149 GGLSTTDKSGGERAEEEGIQIDESKQELDAKARQGETVIPGGTGGKSLEAQEHLAEGRSR 208

Query: 123 GGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKHLEELKKE 182
           GG+ R EQ+GHEGYQEMGRKGGLST DKSGG+RA EEGI+IDES                
Sbjct: 209 GGQTRSEQLGHEGYQEMGRKGGLSTTDKSGGDRAEEEGIQIDES---------------- 268

Query: 183 MSSEQERSELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMG 214
            +S+++R ELDA+A+QGETVVPGGTGG+SLEAQEHLAEGRSRGGQTR EQLGHEGYQEMG
Sbjct: 269 -NSQKQRQELDAKAKQGETVVPGGTGGQSLEAQEHLAEGRSRGGQTRSEQLGHEGYQEMG 328

BLAST of CaUC04G072820 vs. ExPASy TrEMBL
Match: A0A6J1B9A4 (late embryogenesis abundant protein B19.4 OS=Herrania umbratica OX=108875 GN=LOC110425442 PE=4 SV=1)

HSP 1 Score: 216.1 bits (549), Expect = 1.4e-52
Identity = 135/225 (60.00%), Postives = 158/225 (70.22%), Query Frame = 0

Query: 1   MASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELG 60
           M+ QQ+R ELD +A++ E V+PGGTGGKS EA+E LAEGRSRGGQTRKEQ+  EGYQE+G
Sbjct: 1   MSYQQQREELDHRAREVEIVIPGGTGGKSLEAEEHLAEGRSRGGQTRKEQIRTEGYQEMG 60

Query: 61  HQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFR-TKDPKHLEEL 120
           HQ                    GGLST DKSGGERA EEG++I++SK+R ++  K    +
Sbjct: 61  HQ--------------------GGLSTGDKSGGERAEEEGVQIEKSKYRASQRQKRRSSV 120

Query: 121 KK------EMSSEQ-------ERSELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGG 180
           KK       M+SEQ       ER+ELDARARQGE VVPGGT GKSLEAQE LAEGR  GG
Sbjct: 121 KKPRQKGTRMASEQVKNASDEERAELDARARQGEVVVPGGTSGKSLEAQERLAEGRHPGG 180

Query: 181 QTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAEEGVEIDESKFR 212
           +  K+Q+G EGYQEMGRKGGLS T    GERAAEEG+ IDESK R
Sbjct: 181 EAGKQQIGREGYQEMGRKGGLSTTDKSDGERAAEEGMPIDESKHR 205

BLAST of CaUC04G072820 vs. ExPASy TrEMBL
Match: A0A498K5A7 (Uncharacterized protein OS=Malus domestica OX=3750 GN=DVH24_014793 PE=4 SV=1)

HSP 1 Score: 197.6 bits (501), Expect = 5.3e-47
Identity = 136/279 (48.75%), Postives = 154/279 (55.20%), Query Frame = 0

Query: 1   MASQQE------RSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHE 60
           MAS+QE      R+ELD KA++GET+VPGGTGG S EAQE LAEGRSRGGQTRK Q+   
Sbjct: 1   MASEQEKQDPQKRNELDEKARRGETIVPGGTGGHSLEAQEHLAEGRSRGGQTRKGQI--- 60

Query: 61  GYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPK 120
                            G EGY EMG+KGGLST DK GGERAAEEGI+IDESK   +DP+
Sbjct: 61  -----------------GEEGYHEMGKKGGLSTTDKPGGERAAEEGIKIDESK---RDPR 120

Query: 121 HLEELKKEMSSEQERSELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQL- 180
                         R ELD +ARQGE VVPGGTG K+L AQEHLAEGR RGG+ RKEQL 
Sbjct: 121 -------------RRQELDQKARQGENVVPGGTGSKTLNAQEHLAEGRHRGGEARKEQLG 180

Query: 181 -----------------------------------------------------------G 214
                                                                      G
Sbjct: 181 SEGYGEIGHRGGEARKEQLGHEGYRDMGHRRCEASKKQLGHEGYQEMGRHGGEMRKEQIG 240

BLAST of CaUC04G072820 vs. ExPASy TrEMBL
Match: A0A446J0E8 (Uncharacterized protein OS=Triticum turgidum subsp. durum OX=4567 GN=TRITD_1Av1G143760 PE=3 SV=1)

HSP 1 Score: 195.3 bits (495), Expect = 2.6e-46
Identity = 116/212 (54.72%), Postives = 134/212 (63.21%), Query Frame = 0

Query: 2   ASQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGH 61
           + QQERSELD  A++GETVVPGGTGGKS EAQE LA+GRSRGG+TRKEQLG EGY+E+GH
Sbjct: 3   SGQQERSELDRMAREGETVVPGGTGGKSLEAQEHLADGRSRGGETRKEQLGEEGYREMGH 62

Query: 62  QGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKHLEELKK 121
           +GGE R+EQ+G EGY+EMGRKGGLSTM++SGGERAA                        
Sbjct: 63  KGGETRKEQLGEEGYREMGRKGGLSTMEESGGERAAR----------------------- 122

Query: 122 EMSSEQERSELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEM 181
                                                 EGRSRGGQTR+EQ+G EGY EM
Sbjct: 123 --------------------------------------EGRSRGGQTRREQMGEEGYSEM 153

Query: 182 GRKGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
           GRKGGLS     GGERAA EG++IDESKF+TK
Sbjct: 183 GRKGGLSTNDESGGERAAREGIDIDESKFKTK 153

BLAST of CaUC04G072820 vs. ExPASy TrEMBL
Match: R0HE60 (Uncharacterized protein OS=Capsella rubella OX=81985 GN=CARUB_v10019332mg PE=4 SV=1)

HSP 1 Score: 192.6 bits (488), Expect = 1.7e-45
Identity = 123/214 (57.48%), Postives = 145/214 (67.76%), Query Frame = 0

Query: 1   MASQQ-ERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQEL 60
           MAS+Q  R ELD KAKQGETVV GGTGGKS EAQE LAEGRS+GGQTRKEQLGHEGYQE+
Sbjct: 1   MASKQLSREELDEKAKQGETVVQGGTGGKSLEAQEHLAEGRSKGGQTRKEQLGHEGYQEI 60

Query: 61  GHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKHLEEL 120
           G +GGE R+EQ+GHEGYQEMGRKGG +  ++ G E   E G +  E++      +  +E+
Sbjct: 61  GSKGGETRKEQLGHEGYQEMGRKGGETRREQLGHEGYQEMGRKGGETRKEQLGHEGYQEM 120

Query: 121 KKEMSSEQERSELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQ 180
            ++   E  + +L     Q E    GG   K     E   E   +GG+ RKEQLGHEGYQ
Sbjct: 121 GRK-GGETRKEQLGHEGYQ-EMGQKGGEARKEQLGHEGYQEMGRKGGEARKEQLGHEGYQ 180

Query: 181 EMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
           EMGRKGGLS     GGERA EEG+EIDESKF  K
Sbjct: 181 EMGRKGGLSTMDKSGGERAEEEGIEIDESKFTNK 212

BLAST of CaUC04G072820 vs. TAIR 10
Match: AT3G51810.1 (Stress induced protein )

HSP 1 Score: 176.0 bits (445), Expect = 3.2e-44
Identity = 112/214 (52.34%), Postives = 125/214 (58.41%), Query Frame = 0

Query: 1   MASQQ-ERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQEL 60
           MAS+Q  R ELD KAKQGETVVPGGTGG S EAQE LAEGRS+GGQTRKEQLGHEGYQE+
Sbjct: 1   MASKQLSREELDEKAKQGETVVPGGTGGHSLEAQEHLAEGRSKGGQTRKEQLGHEGYQEI 60

Query: 61  GHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKHLEEL 120
           GH+GGEAR+EQ+GHEGYQEMG KGG +  ++ G E   E G                   
Sbjct: 61  GHKGGEARKEQLGHEGYQEMGHKGGEARKEQLGHEGYQEMG------------------- 120

Query: 121 KKEMSSEQERSELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQ 180
                                                       +GG+ RKEQLGHEGY+
Sbjct: 121 -------------------------------------------HKGGEARKEQLGHEGYK 152

Query: 181 EMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
           EMGRKGGLS     GGERA EEG+EIDESKF  K
Sbjct: 181 EMGRKGGLSTMEKSGGERAEEEGIEIDESKFTNK 152

BLAST of CaUC04G072820 vs. TAIR 10
Match: AT2G40170.1 (Stress induced protein )

HSP 1 Score: 141.0 bits (354), Expect = 1.1e-33
Identity = 73/91 (80.22%), Postives = 81/91 (89.01%), Query Frame = 0

Query: 123 MSSEQERSELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMG 182
           M+S+QE+ +LD RA++GETVVPGGTGGKS EAQ+HLAEGRSRGGQTRKEQLG EGYQ+MG
Sbjct: 1   MASQQEKKQLDERAKKGETVVPGGTGGKSFEAQQHLAEGRSRGGQTRKEQLGTEGYQQMG 60

Query: 183 RKGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
           RKGGLS    PGGE A EEGVEIDESKFRTK
Sbjct: 61  RKGGLSTGDKPGGEHAEEEGVEIDESKFRTK 91

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAG7033764.16.6e-8484.51Em-like protein GEA1, partial [Cucurbita argyrosperma subsp. argyrosperma][more]
KAF4401425.11.9e-6759.04hypothetical protein G4B88_001619 [Cannabis sativa][more]
XP_016183975.24.1e-6264.38LOW QUALITY PROTEIN: late embryogenesis abundant protein B19.4 [Arachis ipaensis... [more]
XP_007206180.21.6e-5861.28late embryogenesis abundant protein B19.3 [Prunus persica][more]
XP_021296037.13.0e-5260.00late embryogenesis abundant protein B19.4 [Herrania umbratica][more]
Match NameE-valueIdentityDescription
Q071874.5e-4352.34Em-like protein GEA1 OS=Arabidopsis thaliana OX=3702 GN=EM1 PE=2 SV=1[more]
Q051912.1e-4049.53Late embryogenesis abundant protein B19.4 OS=Hordeum vulgare OX=4513 GN=B19.4 PE... [more]
I1N2Z52.3e-3981.82Protein SLE1 OS=Glycine max OX=3847 GN=SLE1 PE=2 SV=1[more]
Q5KTS78.7e-3978.18Carrot ABA-induced in somatic embryos 3 OS=Daucus carota OX=4039 GN=CAISE3 PE=2 ... [more]
Q024002.8e-3767.69Late embryogenesis abundant protein B19.3 OS=Hordeum vulgare OX=4513 GN=B19.3 PE... [more]
Match NameE-valueIdentityDescription
A0A7J6I2G29.3e-6859.04Uncharacterized protein OS=Cannabis sativa OX=3483 GN=G4B88_001619 PE=3 SV=1[more]
A0A6J1B9A41.4e-5260.00late embryogenesis abundant protein B19.4 OS=Herrania umbratica OX=108875 GN=LOC... [more]
A0A498K5A75.3e-4748.75Uncharacterized protein OS=Malus domestica OX=3750 GN=DVH24_014793 PE=4 SV=1[more]
A0A446J0E82.6e-4654.72Uncharacterized protein OS=Triticum turgidum subsp. durum OX=4567 GN=TRITD_1Av1G... [more]
R0HE601.7e-4557.48Uncharacterized protein OS=Capsella rubella OX=81985 GN=CARUB_v10019332mg PE=4 S... [more]
Match NameE-valueIdentityDescription
AT3G51810.13.2e-4452.34Stress induced protein [more]
AT2G40170.11.1e-3380.22Stress induced protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR038956Late embryogenesis abundant protein, LEA_5 subgroupPFAMPF00477LEA_5coord: 124..178
e-value: 5.0E-25
score: 88.1
coord: 2..108
e-value: 4.4E-56
score: 188.0
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 199..213
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 92..136
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 155..178
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..213
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 33..76
NoneNo IPR availablePANTHERPTHR34671:SF13SUBFAMILY NOT NAMEDcoord: 123..213
coord: 62..112
NoneNo IPR availablePANTHERPTHR34671:SF13SUBFAMILY NOT NAMEDcoord: 1..64
IPR000389Small hydrophilic plant seed proteinPANTHERPTHR34671EM-LIKE PROTEIN GEA1coord: 1..64
coord: 123..213
coord: 62..112
IPR022377Small hydrophilic plant seed protein, conserved sitePROSITEPS00431SMALL_HYDR_PLANT_SEEDcoord: 17..25
IPR022377Small hydrophilic plant seed protein, conserved sitePROSITEPS00431SMALL_HYDR_PLANT_SEEDcoord: 139..147

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CaUC04G072820.1CaUC04G072820.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009737 response to abscisic acid