CcUC04G065080.1 (mRNA) Watermelon (PI 537277) v1

Overview
NameCcUC04G065080.1
TypemRNA
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
Descriptionlate embryogenesis abundant protein B19.4
LocationCicolChr04: 21976968 .. 21981250 (+)
Sequence length737
RNA-Seq ExpressionCcUC04G065080.1
SyntenyCcUC04G065080.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTTTCTATATAGAAGAATTAGAAGGTCGGGCAACACAAATGGCAACGCAACAGGAAAGATCAGAGCTCGACGCTAAAGCAAAGCAGGGTGAGACTGTGGTTCCTGGCGGAACCGGCGGCAAAAGCTTTGAAGCTCAAGAACGTCTCGCTGAAGGTGTTCTTAACTGTCTCTCAATTAATTGTTAATGTACTATATATTTCTGCTCTAATAACACAATACAGTAGAAAACATTTTCAAATATTTCTTGAGGAAAACTTAGATTTTATCTTTGTCCAACAAAGAAAAAAACTTTCATGTGATAATTCTTTTTCAAAAAAATGATTTTTTTAGAGTTTTGTTAACTCGTTTGTTTCTTGGGTCGTGGCGACATGGAGGTCGGAGTCGGGGTGGGCAGACGAGGAAAGAGCAATTGGGGCATGAAGGGTATCAGGAGCTAGGGCACCAGGGAGGAGAGGCAAGAAGGGAGCAGATGGGCCATGAAGGCTACCAAGAAATGGGTCGTAAAGGAGGGCTGAGCACCATGGACAAGTCCGGTGGTGAGCGGGCAGCGGAGGAAGGAATCGAGATCGACGAGTCCAAGTTCAGGACCAAGGACCGTTGATCAGAGCTTATGGCTCTGTGTGCATACCAGATGATGATCCAGCTGACTTTGTTTTGAAAGTAGCTGTGTGGCGTGTTCTTGTCGTGTTTTTTTTTTTTCCCCCCCCCTTCATGTTTATGTATTTTGGTAAGGTTGCTTTGTAATAGCTAATGTCTTTTTAAGTCTGTTTTTCTTTCCGTTTCTGGCCACATGTACGTCGGCACGTCAGGGTTTTGGTTATTGGACTTGAACAAGGAGAAACTTTGCTTTATTATGTGAATTTTAGATGTCTTATTTTATTTGCATAGAAAAAGGAAAGGACCTTGGCCAACTTTAATTCGCATTGCCATGAACACTCTCCCTATTCTCAATGGGTTAAGATGGAAATTTCATGTTTCTCTTTCCAGATCCAATTTGTGCTTAGCTAAGTTCATAGTGAAGATAAAGAAACTGGAACAAACTAACTTTGCTTTATTATGTGAATTTTAGATGTCTTATTTTATTTGCATAGAAAAAGGAAAGGATCTTGGCCAACTTTAATTCGCATTGCCATGAACACTCCCCCTATTCTCAATGGGTTAAGATGGAAATTTCATGTTTCTCTTTCCAGATCAAATTTGTGCTTAGTTAAGTTCACTATGTGAAGATAAAGCAAACTAACTCATAATTGTTTCAATTAAATGTTAGAACTAATGTGAGTTTGATTCTGAAATTGAAATTTAGGGCGGAGTGTAAATTAATATGATATATCCTATTAGACAATGGAGATATGATATATCCTATTAGACAATGGGAGATTATGCAAGATTTTCTAGAGTTTGACCCACTCGAATTAAAAGTTTTATCAACGAAATTAAGGAAATACACAAAAAAATTAAAAGAATTTAGAAGATAATTAGGAGTTTATATTTAGGGGTGACTTAATATTTTTTCTAAAAATATAAAAAGAATGAAGTTCGACATCAATTTTCATCTTTCATTTAGTGAAGAAGTTCAAATAAGAATTTTGTACGATTTAATTTTGAAATAAGAAAACGAAATTACAAATGCGCTTCCAACTTATGTTTGGGGTTTAAAATCAAAATGAGTAGGTTTAAACTGATTCTTTTTAAATATTTCAATATTTCTTAAGATATCAAGGAAGAGCTAATGCTTTGTTTGACATTTTACCATGATACCTTACTGGATGAAAACTTGAAGTTGGACTACTATGATGACATCAGAATCACTCCCCATTTGATCCCAACTTCTAATTAGACATTAGGTAAGCACTTTTGGTCCAATAAAAGTAAATATTAAATTAAACCATTATCAGCAAAGGTTCTAAGCCTAAATTGTTATTTCAAATAGAAAGATAAAAACTACTTTTTAACTGATTTTAAAATAGCTATGAGCCAATAATCTAATGACAAAATTTTAGCATTTTGTTCCCACTATAAATTTGTGTTAGTAAATTGAGCACTTAAATATTCATCGTGCTGAATATTTTAAGGGTTCTTGATTTTTGTTTTTCATGGTTTAGAAAACTAAATCTAAAGCTGGGGTTTAAATTTCTAGTCAAAATATTATATTGGTATCTAACTGGTCAAAATTTCAATCCCAACGTCGAATTTTAATCTATATCCTTAGACTTATTTTGAAACAACTTTGTCATACATATTTATTTATTTTATTTTCTGAAAATTATTGTTATCATTATTATTTAATCGGTTTTGGTAAAAGTCAGTTTCAGGAATTAAAACTGGACTAAAATGGAATAAACTGGGACTGAAATGGTATTTGACCTAAAATTTTCTACCTTATCGTAGTACTGGACCTAAATTTTTCTACCTTATCGTTGTACTGTTGATAGAAGGGAATCAGATCTTAGAACCGCTATCTCGTCTCAACACGTGTCACCTGTGGGACACTAGAAGAAACTTTTTGACTGAGTAAGCACCTACGTGTCACCCAAGCAATGTTCTACAATTCGTTATCAGCCACGCAGGCACTCCAGAGTTGGGAATAAGAGTTCTCAAATTCTATATAAACCCAAAGGGACAAGAAGAGACAGGCATAAAACTCTTATCTTCTCTTGTCGAAGCTAAGTATTTGGAAGAGTTGAAGAAAGAGATGTCGTCTGAGCAAGAAAGATGTGAACTCGACGCCAGGGCCAGGCAAGGGGAGACTGTTGTCCCCGGTGGAACTGGGGGTAAGAGTCTCGAAGCTCAGGAGCACCTTGCTGAAGGTGTGACGACTTGTCGACTTTTCTAGTTTCCAACCCTATTGTTCCATTTTAAACTTACCTTTTCTTAGTTAAAAAGCACTTTTTAACTAAATTTAACCATTTGAATCCTTTAGCTCTGGTACTTAATTATTTAAGTCTTAATACTTTTTAACTCGATATAATATAAATCAAAATGAAGTGTCGCTACTATAAAACAAGGTTTATTATTCAACAAATTTAATAAATTTCATTTGGTATCTAAAATTTTTAAAATGGTTAAATTAAAATTCTAGTCTCTAATTATTTTTATCTCGTTGATTCGAGACCTGACCGAACTTCAAATCTTGTGTCTGAAGTCTCTACTTCTAATTTTCTATATTTAATAAGGCTATGATATATGCAACTTTTTTTCTTTTCTTTTCTTTTAAAAAAACTAGGAAAGGGATAGTTTAAAAAAAAAAAAAAAAAGGAAAGGGATAGTTTAAAAAAAAAAAAAAAAAGGAAAGGGATAGTTTAAAAAAAAAAAAAAAAAGGAAAGGGATAGTTTAAAAAAAAAAAAAAAAAGGAAAGGGATAGTTTAAAAAAAAAAAAAAAAAGGAAAGGGATAGTTTAAAAAAAAAAAAAAAAAGGAAAGGGATAGTTTAAAAAAAAAAAAAAAAAGGAAAGGGATAGTTTAAAAAAAAAAAAAAAAAGGAAAGGGATAGTTTAAAAAAAAAAAAAAAAAAGAAAAAAAAAGAAAGAACTTATAGGAAAGAGGTAGTTTTAAAAAATATAACAAAATCCGTGTAGCGAAATGTTTAATAAATATAGTTAAATTAGACTCCTATCATTCATGACCGACTACTACGGATAAGTCGACTACTATTTACTTAGAATTATTGCCAGAAAAAGAATTGCTATATTTTAAAATATTTTAAGCAATTTTGTTATTTAAAACAATTATTTAAAAAAAATATATTATATTTGATAATTTTTTAAGTAAATTTTAGTTAATAATTGATCTATTAGACATACTCTAAAAATTTTATGTCTTATAAGATACAATACTACAGAAAATTTTATAAGACTGAAAATTGGAACTAATTTATAGGATAAACTTTTAAGAAAATTATTATAAATTGAAAAGACATCCCAAAACTTTTAATTTAATCTTTAAAAATATCTCTAAATAGCATAGAGACCACGTAAAACACTCTTGAAAGTACTAATGTTGGGGCCATTTTAAACCCAAGCCAACCGCCAAAGTGGAAAGTGGTGACTCATTCTGATGGTGATGCAGGGCGGAGCCGTGGGGGCCAGACAAGGAAGGAGCAGCTAGGACACGAAGGGTACCAAGAGATGGGCCGTAAAGGAGGGCTAAGCAACACGGGTATGCCAGGAGGAGAGCGTGCTGCTGAGGAAGGGGTTGAAATTGACGAATCCAAGTTCAGGACTAAGTAGAAGAAAGCCTTTCACAATGTCGTTGAGTTTCAAGTTCTAAGTTCCATTTTCAGCTTT

mRNA sequence

GTTTCTATATAGAAGAATTAGAAGGTCGGGCAACACAAATGGCAACGCAACAGGAAAGATCAGAGCTCGACGCTAAAGCAAAGCAGGGTGAGACTGTGGTTCCTGGCGGAACCGGCGGCAAAAGCTTTGAAGCTCAAGAACGTCTCGCTGAAGGTCGGAGTCGGGGTGGGCAGACGAGGAAAGAGCAATTGGGGCATGAAGGGTATCAGGAGCTAGGGCACCAGGGAGGAGAGGCAAGAAGGGAGCAGATGGGCCATGAAGGCTACCAAGAAATGGGTCGTAAAGGAGGGCTGAGCACCATGGACAAGTCCGGTGGTGAGCGGGCAGCGGAGGAAGGAATCGAGATCGACGAGTCCAAGTTCAGGACCAAGGACCCTAAGTATTTGGAAGAGTTGAAGAAAGAGATGTCGTCTGAGCAAGAAAGATGTGAACTCGACGCCAGGGCCAGGCAAGGGGAGACTGTTGTCCCCGGTGGAACTGGGGGTAAGAGTCTCGAAGCTCAGGAGCACCTTGCTGAAGGGCGGAGCCGTGGGGGCCAGACAAGGAAGGAGCAGCTAGGACACGAAGGGTACCAAGAGATGGGCCGTAAAGGAGGGCTAAGCAACACGGGTATGCCAGGAGGAGAGCGTGCTGCTGAGGAAGGGGTTGAAATTGACGAATCCAAGTTCAGGACTAAGTAGAAGAAAGCCTTTCACAATGTCGTTGAGTTTCAAGTTCTAAGTTCCATTTTCAGCTTT

Coding sequence (CDS)

ATGGCAACGCAACAGGAAAGATCAGAGCTCGACGCTAAAGCAAAGCAGGGTGAGACTGTGGTTCCTGGCGGAACCGGCGGCAAAAGCTTTGAAGCTCAAGAACGTCTCGCTGAAGGTCGGAGTCGGGGTGGGCAGACGAGGAAAGAGCAATTGGGGCATGAAGGGTATCAGGAGCTAGGGCACCAGGGAGGAGAGGCAAGAAGGGAGCAGATGGGCCATGAAGGCTACCAAGAAATGGGTCGTAAAGGAGGGCTGAGCACCATGGACAAGTCCGGTGGTGAGCGGGCAGCGGAGGAAGGAATCGAGATCGACGAGTCCAAGTTCAGGACCAAGGACCCTAAGTATTTGGAAGAGTTGAAGAAAGAGATGTCGTCTGAGCAAGAAAGATGTGAACTCGACGCCAGGGCCAGGCAAGGGGAGACTGTTGTCCCCGGTGGAACTGGGGGTAAGAGTCTCGAAGCTCAGGAGCACCTTGCTGAAGGGCGGAGCCGTGGGGGCCAGACAAGGAAGGAGCAGCTAGGACACGAAGGGTACCAAGAGATGGGCCGTAAAGGAGGGCTAAGCAACACGGGTATGCCAGGAGGAGAGCGTGCTGCTGAGGAAGGGGTTGAAATTGACGAATCCAAGTTCAGGACTAAGTAG

Protein sequence

MATQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKYLEELKKEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK
Homology
BLAST of CcUC04G065080.1 vs. NCBI nr
Match: KAG7033764.1 (Em-like protein GEA1, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 326.2 bits (835), Expect = 2.0e-85
Identity = 181/213 (84.98%), Postives = 188/213 (88.26%), Query Frame = 0

Query: 1   MATQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELG 60
           MA QQERSEL+AKAKQGETVVPGGTGGKS EAQER    RSRGGQTRKEQLGHEGYQE+G
Sbjct: 1   MAAQQERSELEAKAKQGETVVPGGTGGKSLEAQER----RSRGGQTRKEQLGHEGYQEMG 60

Query: 61  HQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKYLEELK 120
           H+GGE RREQMG EGYQEMG+KGGLSTMDKS  ER  EEGIEIDESK            +
Sbjct: 61  HRGGETRREQMGQEGYQEMGKKGGLSTMDKSAAERVEEEGIEIDESK------------E 120

Query: 121 KEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQE 180
           +EMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQE
Sbjct: 121 REMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQE 180

Query: 181 MGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
           MGRKGGLSN+GMPGGERAAEEGVEIDESKFR K
Sbjct: 181 MGRKGGLSNSGMPGGERAAEEGVEIDESKFRAK 197

BLAST of CcUC04G065080.1 vs. NCBI nr
Match: KAF4401425.1 (hypothetical protein G4B88_001619 [Cannabis sativa])

HSP 1 Score: 265.4 bits (677), Expect = 4.3e-67
Identity = 159/271 (58.67%), Postives = 178/271 (65.68%), Query Frame = 0

Query: 3   TQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGH- 62
           +Q++R ELDAKAKQGETVVPGGTGG+S EAQE LAEGRSRGGQTR EQLGHEGYQE+G  
Sbjct: 89  SQKQRQELDAKAKQGETVVPGGTGGQSLEAQEHLAEGRSRGGQTRSEQLGHEGYQEMGRK 148

Query: 63  -----------------------------------------------------------Q 122
                                                                      +
Sbjct: 149 GGLSTTDKSGGERAEEEGIQIDESKQELDAKARQGETVIPGGTGGKSLEAQEHLAEGRSR 208

Query: 123 GGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKYLEELKKE 182
           GG+ R EQ+GHEGYQEMGRKGGLST DKSGG+RA EEGI+IDES                
Sbjct: 209 GGQTRSEQLGHEGYQEMGRKGGLSTTDKSGGDRAEEEGIQIDES---------------- 268

Query: 183 MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMG 214
            +S+++R ELDA+A+QGETVVPGGTGG+SLEAQEHLAEGRSRGGQTR EQLGHEGYQEMG
Sbjct: 269 -NSQKQRQELDAKAKQGETVVPGGTGGQSLEAQEHLAEGRSRGGQTRSEQLGHEGYQEMG 328

BLAST of CcUC04G065080.1 vs. NCBI nr
Match: XP_016183975.2 (LOW QUALITY PROTEIN: late embryogenesis abundant protein B19.4 [Arachis ipaensis])

HSP 1 Score: 249.2 bits (635), Expect = 3.2e-62
Identity = 152/233 (65.24%), Postives = 167/233 (71.67%), Query Frame = 0

Query: 1   MATQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELG 60
           MA++Q++ ELD +AKQGETVVPGGTGGKS EAQE LAEGRS+GGQT              
Sbjct: 1   MASKQQKQELDERAKQGETVVPGGTGGKSLEAQEHLAEGRSKGGQT-------------- 60

Query: 61  HQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKD----PKYL 120
                 RREQ+G EGYQEMGRKGG STM+KSGGERA EEG+EIDESKF TK+    P+Y 
Sbjct: 61  ------RREQLGTEGYQEMGRKGGFSTMEKSGGERAEEEGVEIDESKFVTKNLNKYPEYQ 120

Query: 121 E-ELKKEM---------------SSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAE 180
             E K  M               S +Q R ELD RA+QGETVVPGGTGGKSLEAQEHLAE
Sbjct: 121 HIESKYNMLSLLSSNSIQVISMASKQQNRQELDERAKQGETVVPGGTGGKSLEAQEHLAE 180

Query: 181 GRSRGGQTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
           GRS+GGQTR+EQLG EGYQEMGRKGG S     GGERA EEGVEIDESKF TK
Sbjct: 181 GRSKGGQTRREQLGTEGYQEMGRKGGFSTMEKSGGERAEEEGVEIDESKFTTK 213

BLAST of CcUC04G065080.1 vs. NCBI nr
Match: XP_007206180.2 (late embryogenesis abundant protein B19.3 [Prunus persica])

HSP 1 Score: 235.7 bits (600), Expect = 3.6e-58
Identity = 143/235 (60.85%), Postives = 161/235 (68.51%), Query Frame = 0

Query: 5   QERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGG 64
           Q+R ELD KA++GE V+PGGTGGKS EAQE LAEGRSRGGQTRK ++             
Sbjct: 9   QKRRELDEKARKGEVVIPGGTGGKSLEAQEHLAEGRSRGGQTRKNEI------------- 68

Query: 65  EARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKYLEELKKEMS 124
                  GHEGY EMG+KGGLST DKSGGERAAEEGI +DESK++T          KEM+
Sbjct: 69  -------GHEGYHEMGKKGGLSTTDKSGGERAAEEGIPLDESKYKTNG---RSNDSKEMA 128

Query: 125 SEQERC------ELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQ----------- 184
           SEQER       ELD +ARQGE VVPGGTGGKSLEAQEHLAEGRSRGGQ           
Sbjct: 129 SEQERSDPSRRKELDEKARQGEVVVPGGTGGKSLEAQEHLAEGRSRGGQTRREQVGHEGY 188

Query: 185 ---------TRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
                    TRKEQ+GHEGY+EMG+KGGLS     GGERAAEEG+ IDESK++TK
Sbjct: 189 RELGHRGGETRKEQIGHEGYREMGKKGGLSTKDKSGGERAAEEGIPIDESKYKTK 220

BLAST of CcUC04G065080.1 vs. NCBI nr
Match: XP_021296037.1 (late embryogenesis abundant protein B19.4 [Herrania umbratica])

HSP 1 Score: 217.2 bits (552), Expect = 1.3e-52
Identity = 135/225 (60.00%), Postives = 157/225 (69.78%), Query Frame = 0

Query: 1   MATQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELG 60
           M+ QQ+R ELD +A++ E V+PGGTGGKS EA+E LAEGRSRGGQTRKEQ+  EGYQE+G
Sbjct: 1   MSYQQQREELDHRAREVEIVIPGGTGGKSLEAEEHLAEGRSRGGQTRKEQIRTEGYQEMG 60

Query: 61  HQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFR-TKDPKYLEEL 120
           HQ                    GGLST DKSGGERA EEG++I++SK+R ++  K    +
Sbjct: 61  HQ--------------------GGLSTGDKSGGERAEEEGVQIEKSKYRASQRQKRRSSV 120

Query: 121 KK------EMSSEQ-------ERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGG 180
           KK       M+SEQ       ER ELDARARQGE VVPGGT GKSLEAQE LAEGR  GG
Sbjct: 121 KKPRQKGTRMASEQVKNASDEERAELDARARQGEVVVPGGTSGKSLEAQERLAEGRHPGG 180

Query: 181 QTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAEEGVEIDESKFR 212
           +  K+Q+G EGYQEMGRKGGLS T    GERAAEEG+ IDESK R
Sbjct: 181 EAGKQQIGREGYQEMGRKGGLSTTDKSDGERAAEEGMPIDESKHR 205

BLAST of CcUC04G065080.1 vs. ExPASy Swiss-Prot
Match: Q07187 (Em-like protein GEA1 OS=Arabidopsis thaliana OX=3702 GN=EM1 PE=2 SV=1)

HSP 1 Score: 175.3 bits (443), Expect = 7.6e-43
Identity = 111/214 (51.87%), Postives = 125/214 (58.41%), Query Frame = 0

Query: 1   MATQQ-ERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQEL 60
           MA++Q  R ELD KAKQGETVVPGGTGG S EAQE LAEGRS+GGQTRKEQLGHEGYQE+
Sbjct: 1   MASKQLSREELDEKAKQGETVVPGGTGGHSLEAQEHLAEGRSKGGQTRKEQLGHEGYQEI 60

Query: 61  GHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKYLEEL 120
           GH+GGEAR+EQ+GHEGYQEMG KGG +  ++ G E   E G                   
Sbjct: 61  GHKGGEARKEQLGHEGYQEMGHKGGEARKEQLGHEGYQEMG------------------- 120

Query: 121 KKEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQ 180
                                                       +GG+ RKEQLGHEGY+
Sbjct: 121 -------------------------------------------HKGGEARKEQLGHEGYK 152

Query: 181 EMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
           EMGRKGGLS     GGERA EEG+EIDESKF  K
Sbjct: 181 EMGRKGGLSTMEKSGGERAEEEGIEIDESKFTNK 152

BLAST of CcUC04G065080.1 vs. ExPASy Swiss-Prot
Match: Q05191 (Late embryogenesis abundant protein B19.4 OS=Hordeum vulgare OX=4513 GN=B19.4 PE=2 SV=1)

HSP 1 Score: 168.3 bits (425), Expect = 9.3e-41
Identity = 104/210 (49.52%), Postives = 123/210 (58.57%), Query Frame = 0

Query: 4   QQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQG 63
           QQERSELD  A++GETVVPGGTGGK+ EAQE LAEGRSRGGQTRKEQLG EGY+E+GH+G
Sbjct: 5   QQERSELDRMAREGETVVPGGTGGKTLEAQEHLAEGRSRGGQTRKEQLGEEGYREMGHKG 64

Query: 64  GEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKYLEELKKEM 123
           GE R+EQ+G EGY+EMG KGG +  ++ G E   E G                       
Sbjct: 65  GETRKEQLGEEGYREMGHKGGETRKEQLGEEGYREMG----------------------- 124

Query: 124 SSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGR 183
                                                   +GG+TRKEQ+G EGY+EMGR
Sbjct: 125 ---------------------------------------HKGGETRKEQMGEEGYREMGR 152

Query: 184 KGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
           KGGLS     GGERAA EG++IDESKF+TK
Sbjct: 185 KGGLSTMNESGGERAAREGIDIDESKFKTK 152

BLAST of CcUC04G065080.1 vs. ExPASy Swiss-Prot
Match: I1N2Z5 (Protein SLE1 OS=Glycine max OX=3847 GN=SLE1 PE=2 SV=1)

HSP 1 Score: 162.9 bits (411), Expect = 3.9e-39
Identity = 87/106 (82.08%), Postives = 94/106 (88.68%), Query Frame = 0

Query: 4   QQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQG 63
           Q  R ELD KA+QGETVVPGGTGGKS EAQE LAEGRSRGGQTRK+QLG EGY E+G +G
Sbjct: 5   QANREELDEKARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKQQLGSEGYHEMGTKG 64

Query: 64  GEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFR 110
           G+ R+EQMG EGYQEMGRKGGLSTMDKSGGERA EEGIEIDESKF+
Sbjct: 65  GQTRKEQMGREGYQEMGRKGGLSTMDKSGGERAEEEGIEIDESKFK 110

BLAST of CcUC04G065080.1 vs. ExPASy Swiss-Prot
Match: Q5KTS7 (Carrot ABA-induced in somatic embryos 3 OS=Daucus carota OX=4039 GN=CAISE3 PE=2 SV=1)

HSP 1 Score: 161.8 bits (408), Expect = 8.7e-39
Identity = 86/108 (79.63%), Postives = 97/108 (89.81%), Query Frame = 0

Query: 4   QQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQG 63
           Q++RSELDA+AKQGETVVPGGTGGKS EAQE LAEGRS+GG TRKEQLG EGYQE+G +G
Sbjct: 5   QEKRSELDARAKQGETVVPGGTGGKSLEAQEHLAEGRSKGGHTRKEQLGTEGYQEIGTKG 64

Query: 64  GEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTK 112
           GE RREQMG EGY++MGR GGL+T DKSG ERA EEGI+ID+SKFRTK
Sbjct: 65  GETRREQMGKEGYEQMGRMGGLATKDKSGAERAEEEGIDIDQSKFRTK 112

BLAST of CcUC04G065080.1 vs. ExPASy Swiss-Prot
Match: Q02400 (Late embryogenesis abundant protein B19.3 OS=Hordeum vulgare OX=4513 GN=B19.3 PE=2 SV=1)

HSP 1 Score: 156.8 bits (395), Expect = 2.8e-37
Identity = 88/128 (68.75%), Postives = 99/128 (77.34%), Query Frame = 0

Query: 4   QQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQ------------------ 63
           QQERSELD  A++GETVVPGGTGGK+ EAQE LAEGRSRGGQ                  
Sbjct: 5   QQERSELDRMAREGETVVPGGTGGKTLEAQEHLAEGRSRGGQTRKDQLGEEGYREMGHKG 64

Query: 64  --TRKEQLGHEGYQELGHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEI 112
             TRKEQLG EGY+E+GH+GGE R+EQMG EGY EMGRKGGLSTM++SGGERAA EGI+I
Sbjct: 65  GETRKEQLGEEGYREMGHKGGETRKEQMGEEGYHEMGRKGGLSTMEESGGERAAREGIDI 124

BLAST of CcUC04G065080.1 vs. ExPASy TrEMBL
Match: A0A7J6I2G2 (Uncharacterized protein OS=Cannabis sativa OX=3483 GN=G4B88_001619 PE=3 SV=1)

HSP 1 Score: 265.4 bits (677), Expect = 2.1e-67
Identity = 159/271 (58.67%), Postives = 178/271 (65.68%), Query Frame = 0

Query: 3   TQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGH- 62
           +Q++R ELDAKAKQGETVVPGGTGG+S EAQE LAEGRSRGGQTR EQLGHEGYQE+G  
Sbjct: 89  SQKQRQELDAKAKQGETVVPGGTGGQSLEAQEHLAEGRSRGGQTRSEQLGHEGYQEMGRK 148

Query: 63  -----------------------------------------------------------Q 122
                                                                      +
Sbjct: 149 GGLSTTDKSGGERAEEEGIQIDESKQELDAKARQGETVIPGGTGGKSLEAQEHLAEGRSR 208

Query: 123 GGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKYLEELKKE 182
           GG+ R EQ+GHEGYQEMGRKGGLST DKSGG+RA EEGI+IDES                
Sbjct: 209 GGQTRSEQLGHEGYQEMGRKGGLSTTDKSGGDRAEEEGIQIDES---------------- 268

Query: 183 MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMG 214
            +S+++R ELDA+A+QGETVVPGGTGG+SLEAQEHLAEGRSRGGQTR EQLGHEGYQEMG
Sbjct: 269 -NSQKQRQELDAKAKQGETVVPGGTGGQSLEAQEHLAEGRSRGGQTRSEQLGHEGYQEMG 328

BLAST of CcUC04G065080.1 vs. ExPASy TrEMBL
Match: A0A6J1B9A4 (late embryogenesis abundant protein B19.4 OS=Herrania umbratica OX=108875 GN=LOC110425442 PE=4 SV=1)

HSP 1 Score: 217.2 bits (552), Expect = 6.5e-53
Identity = 135/225 (60.00%), Postives = 157/225 (69.78%), Query Frame = 0

Query: 1   MATQQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELG 60
           M+ QQ+R ELD +A++ E V+PGGTGGKS EA+E LAEGRSRGGQTRKEQ+  EGYQE+G
Sbjct: 1   MSYQQQREELDHRAREVEIVIPGGTGGKSLEAEEHLAEGRSRGGQTRKEQIRTEGYQEMG 60

Query: 61  HQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFR-TKDPKYLEEL 120
           HQ                    GGLST DKSGGERA EEG++I++SK+R ++  K    +
Sbjct: 61  HQ--------------------GGLSTGDKSGGERAEEEGVQIEKSKYRASQRQKRRSSV 120

Query: 121 KK------EMSSEQ-------ERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGG 180
           KK       M+SEQ       ER ELDARARQGE VVPGGT GKSLEAQE LAEGR  GG
Sbjct: 121 KKPRQKGTRMASEQVKNASDEERAELDARARQGEVVVPGGTSGKSLEAQERLAEGRHPGG 180

Query: 181 QTRKEQLGHEGYQEMGRKGGLSNTGMPGGERAAEEGVEIDESKFR 212
           +  K+Q+G EGYQEMGRKGGLS T    GERAAEEG+ IDESK R
Sbjct: 181 EAGKQQIGREGYQEMGRKGGLSTTDKSDGERAAEEGMPIDESKHR 205

BLAST of CcUC04G065080.1 vs. ExPASy TrEMBL
Match: A0A498K5A7 (Uncharacterized protein OS=Malus domestica OX=3750 GN=DVH24_014793 PE=4 SV=1)

HSP 1 Score: 197.2 bits (500), Expect = 6.9e-47
Identity = 132/269 (49.07%), Postives = 150/269 (55.76%), Query Frame = 0

Query: 5   QERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQGG 64
           Q+R+ELD KA++GET+VPGGTGG S EAQE LAEGRSRGGQTRK Q+             
Sbjct: 11  QKRNELDEKARRGETIVPGGTGGHSLEAQEHLAEGRSRGGQTRKGQI------------- 70

Query: 65  EARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKYLEELKKEMS 124
                  G EGY EMG+KGGLST DK GGERAAEEGI+IDESK   +DP+          
Sbjct: 71  -------GEEGYHEMGKKGGLSTTDKPGGERAAEEGIKIDESK---RDPR---------- 130

Query: 125 SEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQL----------- 184
               R ELD +ARQGE VVPGGTG K+L AQEHLAEGR RGG+ RKEQL           
Sbjct: 131 ---RRQELDQKARQGENVVPGGTGSKTLNAQEHLAEGRHRGGEARKEQLGSEGYGEIGHR 190

Query: 185 -------------------------------------------------GHEGYQEMGRK 214
                                                            G EGYQEMG+K
Sbjct: 191 GGEARKEQLGHEGYRDMGHRRCEASKKQLGHEGYQEMGRHGGEMRKEQIGEEGYQEMGKK 243

BLAST of CcUC04G065080.1 vs. ExPASy TrEMBL
Match: A0A446J0E8 (Uncharacterized protein OS=Triticum turgidum subsp. durum OX=4567 GN=TRITD_1Av1G143760 PE=3 SV=1)

HSP 1 Score: 195.3 bits (495), Expect = 2.6e-46
Identity = 116/210 (55.24%), Postives = 133/210 (63.33%), Query Frame = 0

Query: 4   QQERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQELGHQG 63
           QQERSELD  A++GETVVPGGTGGKS EAQE LA+GRSRGG+TRKEQLG EGY+E+GH+G
Sbjct: 5   QQERSELDRMAREGETVVPGGTGGKSLEAQEHLADGRSRGGETRKEQLGEEGYREMGHKG 64

Query: 64  GEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKYLEELKKEM 123
           GE R+EQ+G EGY+EMGRKGGLSTM++SGGERAA                          
Sbjct: 65  GETRKEQLGEEGYREMGRKGGLSTMEESGGERAAR------------------------- 124

Query: 124 SSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMGR 183
                                               EGRSRGGQTR+EQ+G EGY EMGR
Sbjct: 125 ------------------------------------EGRSRGGQTRREQMGEEGYSEMGR 153

Query: 184 KGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
           KGGLS     GGERAA EG++IDESKF+TK
Sbjct: 185 KGGLSTNDESGGERAAREGIDIDESKFKTK 153

BLAST of CcUC04G065080.1 vs. ExPASy TrEMBL
Match: R0HE60 (Uncharacterized protein OS=Capsella rubella OX=81985 GN=CARUB_v10019332mg PE=4 SV=1)

HSP 1 Score: 191.8 bits (486), Expect = 2.9e-45
Identity = 123/215 (57.21%), Postives = 145/215 (67.44%), Query Frame = 0

Query: 1   MATQQ-ERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQEL 60
           MA++Q  R ELD KAKQGETVV GGTGGKS EAQE LAEGRS+GGQTRKEQLGHEGYQE+
Sbjct: 1   MASKQLSREELDEKAKQGETVVQGGTGGKSLEAQEHLAEGRSKGGQTRKEQLGHEGYQEI 60

Query: 61  GHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDES-KFRTKDPKYLEE 120
           G +GGE R+EQ+GHEGYQEMGRKGG +  ++ G E   E G +  E+ K +     Y E 
Sbjct: 61  GSKGGETRKEQLGHEGYQEMGRKGGETRREQLGHEGYQEMGRKGGETRKEQLGHEGYQEM 120

Query: 121 LKKEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGY 180
            +K   + +E+   +     G+    GG   K     E   E   +GG+ RKEQLGHEGY
Sbjct: 121 GRKGGETRKEQLGHEGYQEMGQ---KGGEARKEQLGHEGYQEMGRKGGEARKEQLGHEGY 180

Query: 181 QEMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
           QEMGRKGGLS     GGERA EEG+EIDESKF  K
Sbjct: 181 QEMGRKGGLSTMDKSGGERAEEEGIEIDESKFTNK 212

BLAST of CcUC04G065080.1 vs. TAIR 10
Match: AT3G51810.1 (Stress induced protein )

HSP 1 Score: 175.3 bits (443), Expect = 5.4e-44
Identity = 111/214 (51.87%), Postives = 125/214 (58.41%), Query Frame = 0

Query: 1   MATQQ-ERSELDAKAKQGETVVPGGTGGKSFEAQERLAEGRSRGGQTRKEQLGHEGYQEL 60
           MA++Q  R ELD KAKQGETVVPGGTGG S EAQE LAEGRS+GGQTRKEQLGHEGYQE+
Sbjct: 1   MASKQLSREELDEKAKQGETVVPGGTGGHSLEAQEHLAEGRSKGGQTRKEQLGHEGYQEI 60

Query: 61  GHQGGEARREQMGHEGYQEMGRKGGLSTMDKSGGERAAEEGIEIDESKFRTKDPKYLEEL 120
           GH+GGEAR+EQ+GHEGYQEMG KGG +  ++ G E   E G                   
Sbjct: 61  GHKGGEARKEQLGHEGYQEMGHKGGEARKEQLGHEGYQEMG------------------- 120

Query: 121 KKEMSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQ 180
                                                       +GG+ RKEQLGHEGY+
Sbjct: 121 -------------------------------------------HKGGEARKEQLGHEGYK 152

Query: 181 EMGRKGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
           EMGRKGGLS     GGERA EEG+EIDESKF  K
Sbjct: 181 EMGRKGGLSTMEKSGGERAEEEGIEIDESKFTNK 152

BLAST of CcUC04G065080.1 vs. TAIR 10
Match: AT2G40170.1 (Stress induced protein )

HSP 1 Score: 141.0 bits (354), Expect = 1.1e-33
Identity = 73/91 (80.22%), Postives = 81/91 (89.01%), Query Frame = 0

Query: 123 MSSEQERCELDARARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKEQLGHEGYQEMG 182
           M+S+QE+ +LD RA++GETVVPGGTGGKS EAQ+HLAEGRSRGGQTRKEQLG EGYQ+MG
Sbjct: 1   MASQQEKKQLDERAKKGETVVPGGTGGKSFEAQQHLAEGRSRGGQTRKEQLGTEGYQQMG 60

Query: 183 RKGGLSNTGMPGGERAAEEGVEIDESKFRTK 214
           RKGGLS    PGGE A EEGVEIDESKFRTK
Sbjct: 61  RKGGLSTGDKPGGEHAEEEGVEIDESKFRTK 91

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAG7033764.12.0e-8584.98Em-like protein GEA1, partial [Cucurbita argyrosperma subsp. argyrosperma][more]
KAF4401425.14.3e-6758.67hypothetical protein G4B88_001619 [Cannabis sativa][more]
XP_016183975.23.2e-6265.24LOW QUALITY PROTEIN: late embryogenesis abundant protein B19.4 [Arachis ipaensis... [more]
XP_007206180.23.6e-5860.85late embryogenesis abundant protein B19.3 [Prunus persica][more]
XP_021296037.11.3e-5260.00late embryogenesis abundant protein B19.4 [Herrania umbratica][more]
Match NameE-valueIdentityDescription
Q071877.6e-4351.87Em-like protein GEA1 OS=Arabidopsis thaliana OX=3702 GN=EM1 PE=2 SV=1[more]
Q051919.3e-4149.52Late embryogenesis abundant protein B19.4 OS=Hordeum vulgare OX=4513 GN=B19.4 PE... [more]
I1N2Z53.9e-3982.08Protein SLE1 OS=Glycine max OX=3847 GN=SLE1 PE=2 SV=1[more]
Q5KTS78.7e-3979.63Carrot ABA-induced in somatic embryos 3 OS=Daucus carota OX=4039 GN=CAISE3 PE=2 ... [more]
Q024002.8e-3768.75Late embryogenesis abundant protein B19.3 OS=Hordeum vulgare OX=4513 GN=B19.3 PE... [more]
Match NameE-valueIdentityDescription
A0A7J6I2G22.1e-6758.67Uncharacterized protein OS=Cannabis sativa OX=3483 GN=G4B88_001619 PE=3 SV=1[more]
A0A6J1B9A46.5e-5360.00late embryogenesis abundant protein B19.4 OS=Herrania umbratica OX=108875 GN=LOC... [more]
A0A498K5A76.9e-4749.07Uncharacterized protein OS=Malus domestica OX=3750 GN=DVH24_014793 PE=4 SV=1[more]
A0A446J0E82.6e-4655.24Uncharacterized protein OS=Triticum turgidum subsp. durum OX=4567 GN=TRITD_1Av1G... [more]
R0HE602.9e-4557.21Uncharacterized protein OS=Capsella rubella OX=81985 GN=CARUB_v10019332mg PE=4 S... [more]
Match NameE-valueIdentityDescription
AT3G51810.15.4e-4451.87Stress induced protein [more]
AT2G40170.11.1e-3380.22Stress induced protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (PI 537277) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 116..136
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 199..213
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 155..178
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 120..213
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 120..136
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 33..76
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..105
NoneNo IPR availablePANTHERPTHR34671:SF13SUBFAMILY NOT NAMEDcoord: 1..64
NoneNo IPR availablePANTHERPTHR34671:SF13SUBFAMILY NOT NAMEDcoord: 123..213
coord: 62..112
IPR038956Late embryogenesis abundant protein, LEA_5 subgroupPFAMPF00477LEA_5coord: 3..108
e-value: 7.3E-56
score: 187.3
coord: 124..178
e-value: 8.8E-25
score: 87.3
IPR000389Small hydrophilic plant seed proteinPANTHERPTHR34671EM-LIKE PROTEIN GEA1coord: 123..213
coord: 62..112
IPR000389Small hydrophilic plant seed proteinPANTHERPTHR34671EM-LIKE PROTEIN GEA1coord: 1..64
IPR022377Small hydrophilic plant seed protein, conserved sitePROSITEPS00431SMALL_HYDR_PLANT_SEEDcoord: 17..25
IPR022377Small hydrophilic plant seed protein, conserved sitePROSITEPS00431SMALL_HYDR_PLANT_SEEDcoord: 139..147

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CcUC04G065080CcUC04G065080gene


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CcUC04G065080.1-exonCcUC04G065080.1-exon-CicolChr04:21976968..21977120exon
CcUC04G065080.1-exonCcUC04G065080.1-exon-CicolChr04:21977342..21977563exon
CcUC04G065080.1-exonCcUC04G065080.1-exon-CicolChr04:21979631..21979774exon
CcUC04G065080.1-exonCcUC04G065080.1-exon-CicolChr04:21981033..21981250exon


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CcUC04G065080.1-five_prime_utrCcUC04G065080.1-five_prime_utr-CicolChr04:21976968..21977005five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CcUC04G065080.1-cdsCcUC04G065080.1-cds-CicolChr04:21977006..21977120CDS
CcUC04G065080.1-cdsCcUC04G065080.1-cds-CicolChr04:21977342..21977563CDS
CcUC04G065080.1-cdsCcUC04G065080.1-cds-CicolChr04:21979631..21979774CDS
CcUC04G065080.1-cdsCcUC04G065080.1-cds-CicolChr04:21981033..21981193CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CcUC04G065080.1-three_prime_utrCcUC04G065080.1-three_prime_utr-CicolChr04:21981194..21981250three_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CcUC04G065080.1CcUC04G065080.1-proteinpolypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009737 response to abscisic acid