CmoCh15G003530 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh15G003530
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: universal stress protein A-like protein
Location: Cmo_Chr15: 1654009 .. 1659089 (+)
RNA-Seq Expression: CmoCh15G003530
Synteny: CmoCh15G003530
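The Location field can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention, which this record does not state explicitly); `parse_location` and `LOCATION_RE` are hypothetical names introduced only for illustration.

```python
import re

# Pattern for strings such as "Cmo_Chr15: 1654009 .. 1659089 (+)".
LOCATION_RE = re.compile(r"^([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into chromosome, start, end, strand and length.

    Assumption: start and end are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


loc = parse_location("Cmo_Chr15: 1654009 .. 1659089 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the gene spans 5081 bp on the plus strand of Cmo_Chr15.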
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTTCGCACACATTTGTATGCTGAATCCAACATCACTCCCGATTACGATAATTAAATCTTTTTTTGTTTGTTTGTATTTTCATATATCGCACAGAATACAAGAAGATCGATCCCGAACTCAGAAACTCTTCCACAACGACCTCGCTTTTCCGGCAACCGCCAAGAAATCTCCAACGCTCCTCTCCTTTCCGGTCAGTTTTCCGATCGCCTTGTTTATTTAGGGTTTCGCGTCTTTTCTCAGATGCCTCTGTTTTCATCTTCAAATTAGTTTTAATGTTTGTTATTGAATTTCAAATGCAATTCAAATCATATATTATTATTATTATTACATTTGTAGTGTTTTCTTCTCAAATATAGAAATGCTGACTATATCCAATTAAAATCTTTCTATATATATATATATATATATATTTTTTTTTTTTTTGTTGGAAATCATAATTTTATTTTCAAATCGATATATTTCTTCAGTTTCTTGAAAATTAAACTCATTTAAAAAATAAAAAAGAATAATAATTTGTGTGGTTGGATATGGGAGCGTTTTCTTTTTCCAACATAGGAAATACTGGCCATCCAATTAATTTTTTCTTTAACGTTTTTTTTAAATAATTCATTTTCTTAATTTTCTCAGACAGAAAATACCAATTATTATAGAGTTATAGTTAACTATTCTTAACTAAAATAAATAAATAAATAAATAAAAAGGGTAAATTATTTTAATGAAAAATAAAGAAATAAAATAGGAGTAATTTGGGTGGGCTATCCCAGTCATGAAGTTGACCCATCATCCCACGTGTTATCTTATCCATTTAAAACTAATTATATTATTTGGTCATATTTTTTAACCTATTTTTGTGTGTATTTTTTACCCATTTATTTTACTTCCCAAAATAATGAAATAAAAGTGTTTTTTTTTAAGTAAAAATCCTATTTAATCAAAATCACTTTTTGCTGGAGAATGAAAATTGCAAGCTAATTTAAGAAATGATAGTAAGTTTATAATCAAAGAATACTCCCTCCACCTCCAATTGGTATGAGGTTTTTTTGGAAAATCAAAAGCAAAGCCATGAGAGCTTATGCTCAAAGTAGACAATATCATATCATTGTAGAGAGTTTGTTGGAGGACAGGTATTGGAAGATGAAAGTCTTTATAACAAAGAATGAATGTAGAGCAAATTGTCATGAGAGCTGCTCAAAATTGACACTATCATGGTCTGCGTTCTTCTAGCACTCTTTACCTTAAAAAAAAAATTCTCATGCAAAATCCATGTAATGAACAAAAGAATATTAAATGAGAGAGAACAGTTTTTTTTTTTTTAAATTTTAATTTTTTTTTATTTTTAATTTCGTAGACTACATTCTAAGTGTTCATCTAAATTTCTTGCAGATTCTGATCAAGAAATCAGAAAAAGAGAATTCAAATATGGAAAAAGACTCATCTCGACCCACACGGCGGCAGAGATTCGCGGTGGACGACGGCGCCGATCTAATAGACTGCTCCGGCAAGCAATGCCAGTCGTGCACCACCGGCTTGGTTGCGGACTGCGTCGCCATCTGGTTATGCCCATGCTCGGTTGTCAGCTTCTTGGCTCTGGCTCTCGTCAAACTTCCGTGGATGATCGGGCGGCGGTGTCTGCAGCAGAGAAGGCAGAAGAGGAAATTGATTGGGCGGAGAGGAGAACGCGAGGGCGGTGGTGTGGCGGCGGAGAGTGGTGGGGGTGCGGCTAGTGAGGAGGGGCTGCCGCCGTGGTTCGGAGAGGAAGAAGCAGGGATGGGGAATTTGAGTGCGAGGTTTGAAGCAGAGAGAATTTGGGTGCAGTTGCATCAGGTGGGTCAGTTGGGGTTTGGAACTGTTTCCTTCACTGGGAATACAAATTTGTAAAAGGTTACAAATTATCTAAATTATTATTAATTTTGCATTAATTTAGCCTTTAAATTCACTAAATATGCTATCAATTTATAATAATCTAAACCTTATTATAAAAAATCTCATCAAAATTA
AATGTTAATATATGTTTTTAATGAATTAAGTTTTGTAGTTTATAAACACATTTAATCCCATAATTAATTGATTAAATTCTATTAAATTTTGATATATTTTTTTGGTGTAGCAACTAAATGGTTAGGAAATTTTAAATAAAATAATAAAATTGTTTTGTTTTTGTTTTTTTTTTTTTTTTTTTTGTAGACAAGAAATAAGAACGGAATCCAGGCTACCATTGAGCATCGAGACTCGAGAGATAGAGATCGTATGAGGAAAGAGCTTCTTGCGAAGAATCGACTACAGAGACAACGATGGAGGATCAACCGACTCGAATAATGCTTGCAGTGAATCAGTCAACCATCAATGGCTATCCGCATGCATCCATAAGCTGTAGAAGGGCCTTCGAATGGACTCTCCAGAAGATCGTCCGCTCCAACACCTCTGGTTTCCACTTGCTCTTCCTCCATGTCCAAGTTCCTGACGAAGACGGTTCGTTCTCCGTCTCCTCTCGATTTCTTTACTTTCTTATTCTTTTCTTTATGGCCTCTTTTCCTAATGCTGATCGATTATGTGAAATGCGATGCTATTCTGTTGGTTGTTTACTGTTTTGCGGAATCTTGAGCTTCGTTGGATGAACTTCGACCTTTAATTTGATCATTTTTTGGTTGATTGGGAGCAGAGAAGGGAAGTGTTTCTTTGTAAGAAGGTGTTATTTCTGCTACCGCCCTGTCGTCTCGGCTGATCTTAAAATTTTGTAAATTAGTTGTAAATATATCCAGTGATTGCACCGATGATTGGTTTATATGATTTTCTTGTACACAATGGATTCTCTGTTTCAAAATTTCTTGAAGTTTCGAATCTATAAATAAGAAACGAGCTTACTCGGTCTTCCTCTTGATGTATTTTTTCTGTCCGATACAGATACGTGCAATATTTAGGTGTTATTCGTTTTTTCCTGGTAATTGATTCAACCATCTTTTTCATCAACATCTTTGGAGAAAGACTAATGCTATTTCATCTGTCCAATCGTCAAGGAACTTGTTAAATTCAAAACATCCAAATATGACCAAAACATTCACAAACCAAATGCAAAGCTCAAGCAAGCCCGTGCATAAAATTAGCAGAAGTGTGGTGTCTGATTATCTAACATAACATTGAATTTAGGATGACTCTTGAGAAAAAGTGTTTTTTAGGTTACTCCTAAAAGAGCTTATAGAACACATAAGAATTAAGTGTGTCTTGTTATATTACTTTTACCCTCTTCATGGATTTTTTAGATTCGTTGGCCATTTTTGAGGAAGACTATCTCCCATTTCACAAGATAATGGTTATTGGTTCCGTTAGTAGGTACTATACTATCGCTTGAGGAAGTGGATCAATTTTTTTTTTGCAGTATTAGTATAATGCTAGACTCATTAGATCACCTTGAATAGTTTGGCCTTCTGGACGACTCCTCCAATCTCTCCGATGCGTCCATTCTTTATCTTACATAATCACATTGAGAACTGATGTGTGGGATATTACAAAATGATGCTGATTCCATGTATCATATGCTAGAAAATAACTGCTAACCAAAACAGTACTCTGCTCATTTGATTTCTTGTAGGTTTTGATGGCTCAGATAGTATCTCTGCGTCCTCTGATGATTTTAAAGACCAGAACCCTAGGTACAATTCAAGAGGGCTTCAACTACTGGAATTCTTTGTCACGAGATGTCATGAACTTGGGGTAATGTAATCCATGTTTATACGATTTTTCATGGTGGAGTGATCTGTCAATATAAATATGCAAATTATACAGCCAGCTTTACATGATGTTCATATTCCACATAAATCAAATGTGAGATCCCACGTTGGTGGGAGAGGGGAACGAAACATTCTTTATAAGGGTGTGTAAACCTCCCTCTTGTAGATGCGTCTTAAAACCGTGAGGCTGACGGCGATACGTAAAGTGCCAAAGCAGACAATATCTGCTAGCAGTGGGTTTGAACTGTTACAAATGGTATTAGAGCTAAGCA
CCAAGGGGTGTGCCAGCGAGGAAGCTGTGCCTCGAAAGGGGTGGATTGTGAGATCCCACGTCAATTGGAGAGGAGAATAAAACATTATTTATAAAGGTGTGGAAACTTCTCCCTAGTAGATGTGTCTTAAAATCGTGAAGCCGACGACGTTACGTAATTGGCTAAAACGGACAATATCTACAAGTGGTGAGTTTGAGCTGTTACATTTGTGTTACAATCATTGTGTGGCAATTTTGTAGGTCACTTGCGAATCTTGGCTTAAGAAAGGCGATCCAACTGAAGTAATTTGCCTAGAGGCGAAACGCGTGCAACCTGATTTTCTAGTTCTGGGAAGCAGGGGGCTTGGCCCGTTCAAGAAGTATGCTCTAGTTTTATTAGCGCTTCTACATTAGTGATTCTAAAGTACAAACCACATTTGATCATTTCTTCCAATCTCCGGTCTTAGAGCCGTCCGGAGTCTGTGTTTACCATTGACCCTTTACGGAATTGGAAACCTAACTGCCTGAAAATTCCTCTTGTGGATGTCTGTAGGGTTTTTGTGGGCACTGTAAGTGAGTTCTGTGCAAAACATGCCGAGTGCCCTGTCATTACAATCAAACGCAGGGAGGACGAGACGCCCGAAGATCCCATCGATGACTGAGTTTTCAGCCAGTAAAACTCACGGCCTCTTTGATTTCCCTTCATCATGTTTATAAATAAGAAACTTTTGGTGAGCGTCTTGTGCATTTAGTGTGTAAAGGTTGCAAATTCGCCACTGATTTGCATTTGTGTTGTGATTTGTTCAAGCATACAGCCATTATTTTGTTCATTTGATTGATGACTCATTTGGGATTTGGTAACATCCTTTCCATGTGAGATATATAGTTAAAACCATTCTCTTTCTCTCTAGAAAACAGGAATTTTGTATGGAATTTCACAGAAATAGACGTTGAAATACAAGTTTATGTAATTTTACAGATCCAATGAAGGAAGAAGAAGAACAAGGGCAACAATTTGGTGAATAGAAGCAGAAAATAAGCATGGCTTCACAGTTCATATCAAGAAAAATGGAAAATCCGCAAGACTGAGTAAGTTTCTCTAA

mRNA sequence

ATGCTTCGCACACATTTAATACAAGAAGATCGATCCCGAACTCAGAAACTCTTCCACAACGACCTCGCTTTTCCGGCAACCGCCAAGAAATCTCCAACGCTCCTCTCCTTTCCGATTCTGATCAAGAAATCAGAAAAAGAGAATTCAAATATGGAAAAAGACTCATCTCGACCCACACGGCGGCAGAGATTCGCGGTGGACGACGGCGCCGATCTAATAGACTGCTCCGGCAAGCAATGCCAGTCGTGCACCACCGGCTTGGTTGCGGACTGCGTCGCCATCTGGTTATGCCCATGCTCGGTTGTCAGCTTCTTGGCTCTGGCTCTCGTCAAACTTCCGTGGATGATCGGGCGGCGGTGTCTGCAGCAGAGAAGGCAGAAGAGGAAATTGATTGGGCGGAGAGGAGAACGCGAGGGCGGTGGTGTGGCGGCGGAGAGTGGTGGGGGTGCGGCTAGTGAGGAGGGGCTGCCGCCGTGGTTCGGAGAGGAAGAAGCAGGGATGGGGAATTTGAGTGCGAGGTTTGAAGCAGAGAGAATTTGGGTGCAGTTGCATCAGACAAGAAATAAGAACGGAATCCAGGCTACCATTGAGCATCGAGACTCGAGAGATAGAGATCTGAATCAGTCAACCATCAATGGCTATCCGCATGCATCCATAAGCTGTAGAAGGGCCTTCGAATGGACTCTCCAGAAGATCGTCCGCTCCAACACCTCTGGTTTCCACTTGCTCTTCCTCCATGTCCAAGTTCCTGACGAAGACGGTTTTGATGGCTCAGATAGTATCTCTGCGTCCTCTGATGATTTTAAAGACCAGAACCCTAGGTACAATTCAAGAGGGCTTCAACTACTGGAATTCTTTGTCACGAGATGTCATGAACTTGGGGTCACTTGCGAATCTTGGCTTAAGAAAGGCGATCCAACTGAAGTAATTTGCCTAGAGGCGAAACGCGTGCAACCTGATTTTCTAGTTCTGGGAAGCAGGGGGCTTGGCCCGTTCAAGAAGGTTTTTGTGGGCACTGTAAAAGCAGAAAATAAGCATGGCTTCACAGTTCATATCAAGAAAAATGGAAAATCCGCAAGACTGAGTAAGTTTCTCTAA

Coding sequence (CDS)

ATGCTTCGCACACATTTAATACAAGAAGATCGATCCCGAACTCAGAAACTCTTCCACAACGACCTCGCTTTTCCGGCAACCGCCAAGAAATCTCCAACGCTCCTCTCCTTTCCGATTCTGATCAAGAAATCAGAAAAAGAGAATTCAAATATGGAAAAAGACTCATCTCGACCCACACGGCGGCAGAGATTCGCGGTGGACGACGGCGCCGATCTAATAGACTGCTCCGGCAAGCAATGCCAGTCGTGCACCACCGGCTTGGTTGCGGACTGCGTCGCCATCTGGTTATGCCCATGCTCGGTTGTCAGCTTCTTGGCTCTGGCTCTCGTCAAACTTCCGTGGATGATCGGGCGGCGGTGTCTGCAGCAGAGAAGGCAGAAGAGGAAATTGATTGGGCGGAGAGGAGAACGCGAGGGCGGTGGTGTGGCGGCGGAGAGTGGTGGGGGTGCGGCTAGTGAGGAGGGGCTGCCGCCGTGGTTCGGAGAGGAAGAAGCAGGGATGGGGAATTTGAGTGCGAGGTTTGAAGCAGAGAGAATTTGGGTGCAGTTGCATCAGACAAGAAATAAGAACGGAATCCAGGCTACCATTGAGCATCGAGACTCGAGAGATAGAGATCTGAATCAGTCAACCATCAATGGCTATCCGCATGCATCCATAAGCTGTAGAAGGGCCTTCGAATGGACTCTCCAGAAGATCGTCCGCTCCAACACCTCTGGTTTCCACTTGCTCTTCCTCCATGTCCAAGTTCCTGACGAAGACGGTTTTGATGGCTCAGATAGTATCTCTGCGTCCTCTGATGATTTTAAAGACCAGAACCCTAGGTACAATTCAAGAGGGCTTCAACTACTGGAATTCTTTGTCACGAGATGTCATGAACTTGGGGTCACTTGCGAATCTTGGCTTAAGAAAGGCGATCCAACTGAAGTAATTTGCCTAGAGGCGAAACGCGTGCAACCTGATTTTCTAGTTCTGGGAAGCAGGGGGCTTGGCCCGTTCAAGAAGGTTTTTGTGGGCACTGTAAAAGCAGAAAATAAGCATGGCTTCACAGTTCATATCAAGAAAAATGGAAAATCCGCAAGACTGAGTAAGTTTCTCTAA

Protein sequence

MLRTHLIQEDRSRTQKLFHNDLAFPATAKKSPTLLSFPILIKKSEKENSNMEKDSSRPTRRQRFAVDDGADLIDCSGKQCQSCTTGLVADCVAIWLCPCSVVSFLALALVKLPWMIGRRCLQQRRQKRKLIGRRGEREGGGVAAESGGGAASEEGLPPWFGEEEAGMGNLSARFEAERIWVQLHQTRNKNGIQATIEHRDSRDRDLNQSTINGYPHASISCRRAFEWTLQKIVRSNTSGFHLLFLHVQVPDEDGFDGSDSISASSDDFKDQNPRYNSRGLQLLEFFVTRCHELGVTCESWLKKGDPTEVICLEAKRVQPDFLVLGSRGLGPFKKVFVGTVKAENKHGFTVHIKKNGKSARLSKFL
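The protein sequence corresponds to the CDS read codon by codon. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1); the function name `translate` and the table layout are illustrative choices, not part of this record.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS from its first base, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the CDS listed above.
print(translate("ATGCTTCGCACACATTTAATACAAGAAGAT"))
print(translate("AGTAAGTTTCTCTAA"))
```

The two fragments translate to the first ten residues (MLRTHLIQED) and the last four residues (SKFL) of the protein sequence shown above; passing the full CDS string to `translate` would be the natural next check.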
Homology
BLAST of CmoCh15G003530 vs. ExPASy Swiss-Prot
Match: Q8LGG8 (Universal stress protein A-like protein OS=Arabidopsis thaliana OX=3702 GN=At3g01520 PE=1 SV=2)

HSP 1 Score: 201.4 bits (511), Expect = 1.7e-50
Identity = 93/137 (67.88%), Positives = 110/137 (80.29%), Query Frame = 0

Query: 206 LNQSTINGYPHASISCRRAFEWTLQKIVRSNTSGFHLLFLHVQVPDEDGFDGSDSISASS 265
           +N STI  YP+ SISC+RAFEWTL+KIVRSNTS F +L LHVQV DEDGFD  DSI AS 
Sbjct: 12  VNASTIKDYPNPSISCKRAFEWTLEKIVRSNTSDFKILLLHVQVVDEDGFDDVDSIYASP 71

Query: 266 DDFKDQNPRYNSRGLQLLEFFVTRCHELGVTCESWLKKGDPTEVICLEAKRVQPDFLVLG 325
           +DF+D      ++GL LLEFFV +CHE+GV CE+W+K GDP +VIC E KRV+PDFLV+G
Sbjct: 72  EDFRDMRQSNKAKGLHLLEFFVNKCHEIGVGCEAWIKTGDPKDVICQEVKRVRPDFLVVG 131

Query: 326 SRGLGPFKKVFVGTVKA 343
           SRGLG F+KVFVGTV A
Sbjct: 132 SRGLGRFQKVFVGTVSA 148
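The percentages in each HSP header are simple ratios over the alignment length. A short sketch of the arithmetic, using the counts from the Swiss-Prot hit above; `percent` is a hypothetical helper name.

```python
def percent(count, alignment_length):
    """Return count / alignment_length as a percentage rounded to 2 decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * count / alignment_length, 2)


# Counts taken from the Q8LGG8 HSP header: 93 identical and 110 similar
# positions over an alignment of 137 columns.
identity = percent(93, 137)
positives = percent(110, 137)
print(identity, positives)
```

The results match the 67.88% and 80.29% reported in the header.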

BLAST of CmoCh15G003530 vs. ExPASy TrEMBL
Match: A0A6J1FI52 (uncharacterized protein LOC111445546 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111445546 PE=4 SV=1)

HSP 1 Score: 325.9 bits (834), Expect = 2.2e-85
Identity = 169/182 (92.86%), Positives = 170/182 (93.41%), Query Frame = 0

Query: 4   THLIQEDRSRTQKLFHNDLAFPATAKKSPTLLSFPILIKKSEKENSNMEKDSSRPTRRQR 63
           +H IQEDRSRTQKLFHNDLAFPATAKKSPTLLSFPILIKKSEKENSNMEKDSSRPTRRQR
Sbjct: 24  SHRIQEDRSRTQKLFHNDLAFPATAKKSPTLLSFPILIKKSEKENSNMEKDSSRPTRRQR 83

Query: 64  FAVDDGADLIDCSGKQCQSCTTGLVADCVAIWLCPCSVVSFLALALVKLPWMIGRRCLQQ 123
           FAVDDGADLIDCSGKQCQSCTTGLVADCVAIWLCPCSVVSFLALALVKLPWMIGRRCLQQ
Sbjct: 84  FAVDDGADLIDCSGKQCQSCTTGLVADCVAIWLCPCSVVSFLALALVKLPWMIGRRCLQQ 143

Query: 124 RRQKRKLIGRRGEREGGGVAAESGGGAASEEGLPPWFGEEEAGMGNLSARFEAERIWVQL 183
           RRQKRKLIGRRGEREGGGVAAESGGGAASEEGLPPWFGEE           EAERIWVQL
Sbjct: 144 RRQKRKLIGRRGEREGGGVAAESGGGAASEEGLPPWFGEE-----------EAERIWVQL 194

Query: 184 HQ 186
           HQ
Sbjct: 204 HQ 194

BLAST of CmoCh15G003530 vs. ExPASy TrEMBL
Match: A0A6J1K2Z6 (universal stress protein A-like protein OS=Cucurbita maxima OX=3661 GN=LOC111489596 PE=4 SV=1)

HSP 1 Score: 284.3 bits (726), Expect = 7.4e-73
Identity = 134/135 (99.26%), Positives = 135/135 (100.00%), Query Frame = 0

Query: 206 LNQSTINGYPHASISCRRAFEWTLQKIVRSNTSGFHLLFLHVQVPDEDGFDGSDSISASS 265
           +NQSTINGYPHASISCRRAFEWTLQKIVRSNTSGFHLLFLHVQVPDEDGFDGSDSISASS
Sbjct: 12  VNQSTINGYPHASISCRRAFEWTLQKIVRSNTSGFHLLFLHVQVPDEDGFDGSDSISASS 71

Query: 266 DDFKDQNPRYNSRGLQLLEFFVTRCHELGVTCESWLKKGDPTEVICLEAKRVQPDFLVLG 325
           DDFKDQNPRYNSRGLQLLEFFVTRCHELGVTCESWLKKGDPTEVICLEAKRVQPDFLVLG
Sbjct: 72  DDFKDQNPRYNSRGLQLLEFFVTRCHELGVTCESWLKKGDPTEVICLEAKRVQPDFLVLG 131

Query: 326 SRGLGPFKKVFVGTV 341
           SRGLGPFKKVFVGTV
Sbjct: 132 SRGLGPFKKVFVGTV 146

BLAST of CmoCh15G003530 vs. ExPASy TrEMBL
Match: A0A6J1FNN1 (universal stress protein A-like protein isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111445546 PE=4 SV=1)

HSP 1 Score: 284.3 bits (726), Expect = 7.4e-73
Identity = 134/135 (99.26%), Positives = 135/135 (100.00%), Query Frame = 0

Query: 206 LNQSTINGYPHASISCRRAFEWTLQKIVRSNTSGFHLLFLHVQVPDEDGFDGSDSISASS 265
           +NQSTINGYPHASISCRRAFEWTLQKIVRSNTSGFHLLFLHVQVPDEDGFDGSDSISASS
Sbjct: 12  VNQSTINGYPHASISCRRAFEWTLQKIVRSNTSGFHLLFLHVQVPDEDGFDGSDSISASS 71

Query: 266 DDFKDQNPRYNSRGLQLLEFFVTRCHELGVTCESWLKKGDPTEVICLEAKRVQPDFLVLG 325
           DDFKDQNPRYNSRGLQLLEFFVTRCHELGVTCESWLKKGDPTEVICLEAKRVQPDFLVLG
Sbjct: 72  DDFKDQNPRYNSRGLQLLEFFVTRCHELGVTCESWLKKGDPTEVICLEAKRVQPDFLVLG 131

Query: 326 SRGLGPFKKVFVGTV 341
           SRGLGPFKKVFVGTV
Sbjct: 132 SRGLGPFKKVFVGTV 146

BLAST of CmoCh15G003530 vs. ExPASy TrEMBL
Match: A0A7G2ELA3 ((thale cress) hypothetical protein OS=Arabidopsis thaliana OX=3702 GN=AT9943_LOCUS9818 PE=4 SV=1)

HSP 1 Score: 236.1 bits (601), Expect = 2.3e-58
Identity = 148/345 (42.90%), Positives = 189/345 (54.78%), Query Frame = 0

Query: 51  MEKDSSRPTRRQR-----FAVDDGAD---LIDCSGKQCQSCTTGLVADCVAIWLCPCSVV 110
           ME+++ +P R  R          GAD      CSGK+C+S     +ADCVA+  CPC+VV
Sbjct: 1   MEEENQKPHRVSRKDQSGSHWSQGADEEPRARCSGKRCRSWAAAAIADCVALCCCPCAVV 60

Query: 111 SFLALALVKLPWMIGRRCLQQ---RRQKRKLIGRR------------------------- 170
           +   LA VK+PWMIGR+C+ +    +++ K I R                          
Sbjct: 61  NIFTLAFVKVPWMIGRKCIGRGGPSKKRMKKINREDRFHHHHHHRRSAEMVSGGCCGGGD 120

Query: 171 --GEREGGGVAAESGGGAASEEGLPPWFGEEEAGMGNLSARFEAERIWVQLHQTRNKNGI 230
             GE +      E  G    EE       EEE     +SAR EAER+W++L+Q  +    
Sbjct: 121 GDGEFDDHRFVVERDGSLTKEEAKTASLKEEEE--TRISARVEAERVWLELYQIGHLGFA 180

Query: 231 QATIEHRDSRDRDL---------------NQSTINGYPHASISCRRAFEWTLQKIVRSNT 290
            ++     SRD  +               N STI  YP+ SISC+RAFEWTL+KIVRSNT
Sbjct: 181 SSSSSENWSRDTKIWRKMGSEPTKVMVAVNASTIKDYPNPSISCKRAFEWTLEKIVRSNT 240

Query: 291 SGFHLLFLHVQVPDEDGFDGSDSISASSDDFKDQNPRYNSRGLQLLEFFVTRCHELGVTC 343
           S F +L LHVQV DEDGFD  DSI AS +DF+D      ++GL LLEFF        V C
Sbjct: 241 SDFKILLLHVQVVDEDGFDDVDSIYASPEDFRDMRQSNKAKGLHLLEFF--------VGC 300

BLAST of CmoCh15G003530 vs. ExPASy TrEMBL
Match: A0A6J1JLH1 (universal stress protein A-like protein OS=Cucurbita maxima OX=3661 GN=LOC111486957 PE=4 SV=1)

HSP 1 Score: 234.6 bits (597), Expect = 6.7e-58
Identity = 111/135 (82.22%), Positives = 118/135 (87.41%), Query Frame = 0

Query: 206 LNQSTINGYPHASISCRRAFEWTLQKIVRSNTSGFHLLFLHVQVPDEDGFDGSDSISASS 265
           +NQSTI GYPHASIS  RAFEWTLQKIVRSNTSGF  LFLHVQVPDEDGFD  DSI AS 
Sbjct: 12  VNQSTIKGYPHASISSSRAFEWTLQKIVRSNTSGFKFLFLHVQVPDEDGFDDVDSIFASP 71

Query: 266 DDFKDQNPRYNSRGLQLLEFFVTRCHELGVTCESWLKKGDPTEVICLEAKRVQPDFLVLG 325
           DDFKD   R N+RGLQLLEFFV RCHE+GV CE+WLKKGDPTEVICLE KRVQPDFLV+G
Sbjct: 72  DDFKDLKQRDNARGLQLLEFFVNRCHEIGVACEAWLKKGDPTEVICLEVKRVQPDFLVVG 131

Query: 326 SRGLGPFKKVFVGTV 341
           SRG+G FK+VFVGTV
Sbjct: 132 SRGIGRFKRVFVGTV 146

BLAST of CmoCh15G003530 vs. NCBI nr
Match: KAG6578655.1 (Universal stress protein A-like protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 574.7 bits (1480), Expect = 5.6e-160
Identity = 284/290 (97.93%), Positives = 285/290 (98.28%), Query Frame = 0

Query: 51  MEKDSSRPTRRQRFAVDDGADLIDCSGKQCQSCTTGLVADCVAIWLCPCSVVSFLALALV 110
           MEK  SRPTRRQRFAVDDG DLIDCSGKQCQSCTTGLVADCVAI LCPCSVVSFLALALV
Sbjct: 1   MEKGPSRPTRRQRFAVDDGVDLIDCSGKQCQSCTTGLVADCVAICLCPCSVVSFLALALV 60

Query: 111 KLPWMIGRRCLQQRRQKRKLIGRRGEREGGGVAAESGGGAASEEGLPPWFGEEEAGMGNL 170
           KLPWMIGRRCLQQ RQKRKLIGRRGER+GGGVAAESGGGAASEEGLPPWFGEEEAGMGNL
Sbjct: 61  KLPWMIGRRCLQQSRQKRKLIGRRGERKGGGVAAESGGGAASEEGLPPWFGEEEAGMGNL 120

Query: 171 SARFEAERIWVQLHQTRNKNGIQATIEHRDSRDRDLNQSTINGYPHASISCRRAFEWTLQ 230
           SARFEAERIWVQLHQTRNKNGIQATIEHRDSRDRDLNQSTINGYPHASISCRRAFEWTLQ
Sbjct: 121 SARFEAERIWVQLHQTRNKNGIQATIEHRDSRDRDLNQSTINGYPHASISCRRAFEWTLQ 180

Query: 231 KIVRSNTSGFHLLFLHVQVPDEDGFDGSDSISASSDDFKDQNPRYNSRGLQLLEFFVTRC 290
           KIVRSNTSGFHLLFLHVQVPDEDGFDGSDSISASSDDFKDQNPRYNSRGLQLLEFFVTRC
Sbjct: 181 KIVRSNTSGFHLLFLHVQVPDEDGFDGSDSISASSDDFKDQNPRYNSRGLQLLEFFVTRC 240

Query: 291 HELGVTCESWLKKGDPTEVICLEAKRVQPDFLVLGSRGLGPFKKVFVGTV 341
           HELGVTCESWLKKGDPTEVICLEAKRVQPDFLVLGSRGLGPFKKVFVGTV
Sbjct: 241 HELGVTCESWLKKGDPTEVICLEAKRVQPDFLVLGSRGLGPFKKVFVGTV 290

BLAST of CmoCh15G003530 vs. NCBI nr
Match: XP_022939759.1 (uncharacterized protein LOC111445546 isoform X1 [Cucurbita moschata])

HSP 1 Score: 325.9 bits (834), Expect = 4.6e-85
Identity = 169/182 (92.86%), Positives = 170/182 (93.41%), Query Frame = 0

Query: 4   THLIQEDRSRTQKLFHNDLAFPATAKKSPTLLSFPILIKKSEKENSNMEKDSSRPTRRQR 63
           +H IQEDRSRTQKLFHNDLAFPATAKKSPTLLSFPILIKKSEKENSNMEKDSSRPTRRQR
Sbjct: 24  SHRIQEDRSRTQKLFHNDLAFPATAKKSPTLLSFPILIKKSEKENSNMEKDSSRPTRRQR 83

Query: 64  FAVDDGADLIDCSGKQCQSCTTGLVADCVAIWLCPCSVVSFLALALVKLPWMIGRRCLQQ 123
           FAVDDGADLIDCSGKQCQSCTTGLVADCVAIWLCPCSVVSFLALALVKLPWMIGRRCLQQ
Sbjct: 84  FAVDDGADLIDCSGKQCQSCTTGLVADCVAIWLCPCSVVSFLALALVKLPWMIGRRCLQQ 143

Query: 124 RRQKRKLIGRRGEREGGGVAAESGGGAASEEGLPPWFGEEEAGMGNLSARFEAERIWVQL 183
           RRQKRKLIGRRGEREGGGVAAESGGGAASEEGLPPWFGEE           EAERIWVQL
Sbjct: 144 RRQKRKLIGRRGEREGGGVAAESGGGAASEEGLPPWFGEE-----------EAERIWVQL 194

Query: 184 HQ 186
           HQ
Sbjct: 204 HQ 194

BLAST of CmoCh15G003530 vs. NCBI nr
Match: XP_023551092.1 (uncharacterized protein LOC111809018 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 308.9 bits (790), Expect = 5.8e-80
Identity = 161/182 (88.46%), Positives = 166/182 (91.21%), Query Frame = 0

Query: 4   THLIQEDRSRTQKLFHNDLAFPATAKKSPTLLSFPILIKKSEKENSNMEKDSSRPTRRQR 63
           +H IQEDRSRTQKLFHNDLAFPAT KKSPTL SFPILIKKSE+ENSNMEK+SS+PTRRQR
Sbjct: 24  SHRIQEDRSRTQKLFHNDLAFPATTKKSPTLFSFPILIKKSEQENSNMEKNSSQPTRRQR 83

Query: 64  FAVDDGADLIDCSGKQCQSCTTGLVADCVAIWLCPCSVVSFLALALVKLPWMIGRRCLQQ 123
           FAVDDGADLIDCSGKQCQSCTTGLVADCVAI LCPCSVVSFLALALVKLPWMIGRRCLQQ
Sbjct: 84  FAVDDGADLIDCSGKQCQSCTTGLVADCVAICLCPCSVVSFLALALVKLPWMIGRRCLQQ 143

Query: 124 RRQKRKLIGRRGEREGGGVAAESGGGAASEEGLPPWFGEEEAGMGNLSARFEAERIWVQL 183
            RQKRKLIGRRGER+GGGVAAESGGGAASEEGLPPWFGEE           EAERIWVQL
Sbjct: 144 SRQKRKLIGRRGERKGGGVAAESGGGAASEEGLPPWFGEE-----------EAERIWVQL 194

Query: 184 HQ 186
           HQ
Sbjct: 204 HQ 194

BLAST of CmoCh15G003530 vs. NCBI nr
Match: XP_022939760.1 (universal stress protein A-like protein isoform X2 [Cucurbita moschata] >XP_022939761.1 universal stress protein A-like protein isoform X2 [Cucurbita moschata] >XP_022993673.1 universal stress protein A-like protein [Cucurbita maxima] >XP_022993674.1 universal stress protein A-like protein [Cucurbita maxima] >XP_023551094.1 universal stress protein A-like protein isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 284.3 bits (726), Expect = 1.5e-72
Identity = 134/135 (99.26%), Positives = 135/135 (100.00%), Query Frame = 0

Query: 206 LNQSTINGYPHASISCRRAFEWTLQKIVRSNTSGFHLLFLHVQVPDEDGFDGSDSISASS 265
           +NQSTINGYPHASISCRRAFEWTLQKIVRSNTSGFHLLFLHVQVPDEDGFDGSDSISASS
Sbjct: 12  VNQSTINGYPHASISCRRAFEWTLQKIVRSNTSGFHLLFLHVQVPDEDGFDGSDSISASS 71

Query: 266 DDFKDQNPRYNSRGLQLLEFFVTRCHELGVTCESWLKKGDPTEVICLEAKRVQPDFLVLG 325
           DDFKDQNPRYNSRGLQLLEFFVTRCHELGVTCESWLKKGDPTEVICLEAKRVQPDFLVLG
Sbjct: 72  DDFKDQNPRYNSRGLQLLEFFVTRCHELGVTCESWLKKGDPTEVICLEAKRVQPDFLVLG 131

Query: 326 SRGLGPFKKVFVGTV 341
           SRGLGPFKKVFVGTV
Sbjct: 132 SRGLGPFKKVFVGTV 146

BLAST of CmoCh15G003530 vs. NCBI nr
Match: XP_010485410.1 (PREDICTED: uncharacterized protein LOC104763741 [Camelina sativa] >XP_010485411.1 PREDICTED: uncharacterized protein LOC104763741 [Camelina sativa])

HSP 1 Score: 255.4 bits (651), Expect = 7.6e-64
Identity = 155/337 (45.99%), Positives = 199/337 (59.05%), Query Frame = 0

Query: 51  MEKDSSRPTRRQR-------FAVDDGAD----LIDCSGKQCQSCTTGLVADCVAIWLCPC 110
           ME++S +P R  R            GAD       CS K+C+S     +ADCVA+  CPC
Sbjct: 1   MEEESQKPHRVSRKLEQSGSHCSSQGADEEPRSARCSRKRCRSWVAAGIADCVALCCCPC 60

Query: 111 SVVSFLALALVKLPWMIGRRCLQQRRQK-RKLIGRRGEREGGGVAAE--SGGG------- 170
           +V++ L LA VK+PWMIGR+C+ + ++K RK   R  +++    +AE  SGGG       
Sbjct: 61  AVLNLLTLAFVKVPWMIGRKCVGRSKKKGRKREDRLHQQQQKRRSAEMVSGGGCCGGGDD 120

Query: 171 ----AASEEGLPPWFGEEEAGMG------NLSARFEAERIWVQLHQTRNKNGIQATIEHR 230
                   +G     GEE  G         +SAR EAER+W++L+Q  +  G +     R
Sbjct: 121 DHRFVVERDGSLTKEGEERRGASLEGEETRISARVEAERVWLELYQIGHL-GFERRFAER 180

Query: 231 DSRDRD--------------LNQSTINGYPHASISCRRAFEWTLQKIVRSNTSGFHLLFL 290
           + R                 +N STI  YPH SIS +RAFEWTL+KIVRSNT  F +L L
Sbjct: 181 EKRGDSGDMGSEQPTKVMVAVNGSTIKEYPHPSISSKRAFEWTLEKIVRSNTCDFKILLL 240

Query: 291 HVQVPDEDGFDGSDSISASSDDFKDQNPRYNSRGLQLLEFFVTRCHELGVTCESWLKKGD 343
           HVQV DEDGFD  DSI AS DDF+       ++GL LLEFFVT+CHE+GV CE+W+K GD
Sbjct: 241 HVQVLDEDGFDDVDSIYASPDDFRSMRETNKAKGLHLLEFFVTKCHEIGVACEAWIKIGD 300

BLAST of CmoCh15G003530 vs. TAIR 10
Match: AT5G14680.1 (Adenine nucleotide alpha hydrolases-like superfamily protein )

HSP 1 Score: 211.8 bits (538), Expect = 8.9e-55
Identity = 95/135 (70.37%), Positives = 115/135 (85.19%), Query Frame = 0

Query: 206 LNQSTINGYPHASISCRRAFEWTLQKIVRSNTSGFHLLFLHVQVPDEDGFDGSDSISASS 265
           +N+ST+ GYPHASIS ++AFEWTL+KIVRSNTSGF LL LHVQV DEDGFD  DSI AS 
Sbjct: 12  VNESTLKGYPHASISSKKAFEWTLKKIVRSNTSGFKLLLLHVQVQDEDGFDDMDSIYASP 71

Query: 266 DDFKDQNPRYNSRGLQLLEFFVTRCHELGVTCESWLKKGDPTEVICLEAKRVQPDFLVLG 325
           DDF+    R  ++GL LLEFFV +CH++GV CE+W++KGDPTE+IC E +RV+PDFLV+G
Sbjct: 72  DDFRQMRERNKAKGLHLLEFFVKKCHDIGVGCEAWIRKGDPTELICHEVRRVRPDFLVVG 131

Query: 326 SRGLGPFKKVFVGTV 341
           SRGLGPF+KVFVGTV
Sbjct: 132 SRGLGPFQKVFVGTV 146

BLAST of CmoCh15G003530 vs. TAIR 10
Match: AT3G01520.1 (Adenine nucleotide alpha hydrolases-like superfamily protein )

HSP 1 Score: 201.4 bits (511), Expect = 1.2e-51
Identity = 93/137 (67.88%), Positives = 110/137 (80.29%), Query Frame = 0

Query: 206 LNQSTINGYPHASISCRRAFEWTLQKIVRSNTSGFHLLFLHVQVPDEDGFDGSDSISASS 265
           +N STI  YP+ SISC+RAFEWTL+KIVRSNTS F +L LHVQV DEDGFD  DSI AS 
Sbjct: 12  VNASTIKDYPNPSISCKRAFEWTLEKIVRSNTSDFKILLLHVQVVDEDGFDDVDSIYASP 71

Query: 266 DDFKDQNPRYNSRGLQLLEFFVTRCHELGVTCESWLKKGDPTEVICLEAKRVQPDFLVLG 325
           +DF+D      ++GL LLEFFV +CHE+GV CE+W+K GDP +VIC E KRV+PDFLV+G
Sbjct: 72  EDFRDMRQSNKAKGLHLLEFFVNKCHEIGVGCEAWIKTGDPKDVICQEVKRVRPDFLVVG 131

Query: 326 SRGLGPFKKVFVGTVKA 343
           SRGLG F+KVFVGTV A
Sbjct: 132 SRGLGRFQKVFVGTVSA 148

BLAST of CmoCh15G003530 vs. TAIR 10
Match: AT5G14690.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G01516.1); Has 86 Blast hits to 86 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 84; Viruses - 2; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 74.7 bits (182), Expect = 1.7e-13
Identity = 58/175 (33.14%), Positives = 80/175 (45.71%), Query Frame = 0

Query: 60  RRQRFAVDDGADLIDCSGKQCQSCTTGLVADCVAIWLCPCSVVSFLALALVKLPWMIGRR 119
           RR+R       D + CS K+C+S     +ADCVA+  CPC++++ L L LVK+PWMIGRR
Sbjct: 28  RRRRHNHHTHGDEVKCSSKRCRSWAAAAIADCVALCCCPCAIINLLTLTLVKVPWMIGRR 87

Query: 120 CL---QQRRQKRKLIGRR-------GERE----------------------GGGVAAESG 179
           CL    + ++KR++I RR       GE E                      GGG     G
Sbjct: 88  CLGGGGRNKKKRRVIHRRKRRGNINGEDEFYHHNNHHRRFETAEEGEKCGCGGGGGGCYG 147

Query: 180 GG-----------------AASEEGLPPWFGEEEAGMGNLSARFEAERIWVQLHQ 186
           GG                    EE        E+     +SAR EAER+W++L+Q
Sbjct: 148 GGDYDDHRFVVERDGSLTKEEEEEERTTSCKGEDHDESRISARVEAERVWLELYQ 202

BLAST of CmoCh15G003530 vs. TAIR 10
Match: AT3G01516.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G14690.1); Has 67 Blast hits to 67 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 67; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 70.5 bits (171), Expect = 3.2e-12
Identity = 56/174 (32.18%), Positives = 80/174 (45.98%), Query Frame = 0

Query: 45  EKENSNMEKDSSRPTRRQRFAVDDGAD---LIDCSGKQCQSCTTGLVADCVAIWLCPCSV 104
           E+EN    + S +      ++   GAD      CSGK+C+S     +ADCVA+  CPC+V
Sbjct: 2   EEENQKSHRVSRKDQSGSHWS--QGADEEPRARCSGKRCRSWAAAAIADCVALCCCPCAV 61

Query: 105 VSFLALALVKLPWMIGRRCLQQ---RRQKRKLIGRR------------------------ 164
           V+   LA VK+PWMIGR+C+ +    +++ K I R                         
Sbjct: 62  VNIFTLAFVKVPWMIGRKCIGRGGPSKKRMKKINREDRFHHHHHHRRSAEMVSGGCCGGG 121

Query: 165 ---GEREGGGVAAESGGGAASEEGLPPWFGEEEAGMGNLSARFEAERIWVQLHQ 186
              GE +      E  G    EE       EEE     +SAR EAER+W++L+Q
Sbjct: 122 DGDGEFDDHRFVVERDGSLTKEEAKTASLKEEEE--TRISARVEAERVWLELYQ 171

BLAST of CmoCh15G003530 vs. TAIR 10
Match: AT1G68300.1 (Adenine nucleotide alpha hydrolases-like superfamily protein )

HSP 1 Score: 56.2 bits (134), Expect = 6.3e-08
Identity = 36/119 (30.25%), Positives = 55/119 (46.22%), Query Frame = 0

Query: 222 RRAFEWTLQKIVRSNTSGFHLLFLHVQVPDEDGFDGSDSISASSDDFKDQNPRYNSRGLQ 281
           +RA +WTL  +  S      +LF      D      S   +A  +        + + GL 
Sbjct: 23  KRALQWTLVYLKDSLADSDIILFTAQPHLDLSCVYASSYGAAPIELINSLQESHKNAGLN 82

Query: 282 LLEFFVTRCHELGVTCESWLKKGDPTEVICLEAKRVQPDFLVLGSRGLGPFKKVFVGTV 341
            L+     C E GVT    L+ G+P E IC  A+++  D LV+GS G G  ++ F+G+V
Sbjct: 83  RLDEGTKICAETGVTPRKVLEFGNPKEAICEAAEKLGVDMLVVGSHGKGALQRTFLGSV 141

The following BLAST results are available for this feature:

ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q8LGG8 | 1.7e-50 | 67.88 | Universal stress protein A-like protein OS=Arabidopsis thaliana OX=3702 GN=At3g0...

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1FI52 | 2.2e-85 | 92.86 | uncharacterized protein LOC111445546 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1K2Z6 | 7.4e-73 | 99.26 | universal stress protein A-like protein OS=Cucurbita maxima OX=3661 GN=LOC111489...
A0A6J1FNN1 | 7.4e-73 | 99.26 | universal stress protein A-like protein isoform X2 OS=Cucurbita moschata OX=3662...
A0A7G2ELA3 | 2.3e-58 | 42.90 | (thale cress) hypothetical protein OS=Arabidopsis thaliana OX=3702 GN=AT9943_LOC...
A0A6J1JLH1 | 6.7e-58 | 82.22 | universal stress protein A-like protein OS=Cucurbita maxima OX=3661 GN=LOC111486...

NCBI nr
Match Name | E-value | Identity (%) | Description
KAG6578655.1 | 5.6e-160 | 97.93 | Universal stress protein A-like protein, partial [Cucurbita argyrosperma subsp. ...
XP_022939759.1 | 4.6e-85 | 92.86 | uncharacterized protein LOC111445546 isoform X1 [Cucurbita moschata]
XP_023551092.1 | 5.8e-80 | 88.46 | uncharacterized protein LOC111809018 isoform X1 [Cucurbita pepo subsp. pepo]
XP_022939760.1 | 1.5e-72 | 99.26 | universal stress protein A-like protein isoform X2 [Cucurbita moschata] >XP_0229...
XP_010485410.1 | 7.6e-64 | 45.99 | PREDICTED: uncharacterized protein LOC104763741 [Camelina sativa] >XP_010485411....

TAIR 10
Match Name | E-value | Identity (%) | Description
AT5G14680.1 | 8.9e-55 | 70.37 | Adenine nucleotide alpha hydrolases-like superfamily protein
AT3G01520.1 | 1.2e-51 | 67.88 | Adenine nucleotide alpha hydrolases-like superfamily protein
AT5G14690.1 | 1.7e-13 | 33.14 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT3G01516.1 | 3.2e-12 | 32.18 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G68300.1 | 6.3e-08 | 30.25 | Adenine nucleotide alpha hydrolases-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR014729 | Rossmann-like alpha/beta/alpha sandwich fold | GENE3D | 3.40.50.620 | HUPs | coord: 203..342; e-value: 8.1E-46; score: 158.5
IPR006016 | UspA | PFAM | PF00582 | Usp | coord: 218..340; e-value: 2.6E-10; score: 41.0
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 131..156
None | No IPR available | CDD | cd00293 | USP_Like | coord: 218..340; e-value: 2.9385E-13; score: 64.3134
None | No IPR available | SUPERFAMILY | 52402 | Adenine nucleotide alpha hydrolases-like | coord: 211..340
IPR044187 | Universal stress protein A-like protein, plant | PANTHER | PTHR47710 | ADENINE NUCLEOTIDE ALPHA HYDROLASES-LIKE SUPERFAMILY PROTEIN | coord: 206..342
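
The `coord` values in the InterPro alignments give residue ranges on the protein sequence. The following is a small sketch of how such a range could be turned into a length and a subsequence, assuming the coordinates are 1-based and inclusive (this record does not say so explicitly); `span_length` and `domain_slice` are hypothetical helper names.

```python
def span_length(start, end):
    """Number of residues in an inclusive, 1-based range start..end."""
    if start < 1 or end < start:
        raise ValueError("expected 1 <= start <= end")
    return end - start + 1


def domain_slice(protein, start, end):
    """Return residues start..end (1-based, inclusive) of a protein string."""
    if not (1 <= start <= end <= len(protein)):
        raise ValueError("range falls outside the protein sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return protein[start - 1:end]


# The PFAM Usp hit is reported at coord 218..340.
print(span_length(218, 340))

# Illustration on the first ten residues of the protein sequence.
print(domain_slice("MLRTHLIQED", 2, 4))
```

Under this assumption the PFAM Usp hit covers 123 residues; applying `domain_slice` to the full protein sequence with the same coordinates would return the matching region.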

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmoCh15G003530.1 | CmoCh15G003530.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0016208 | AMP binding