CsGy1G023950 (gene) Cucumber (Gy14) v2

NameCsGy1G023950
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v2)
Descriptionglycine-rich protein
LocationChr1 : 22671164 .. 22674434 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GAAGGAAGGGAATTGGTGAAATTGAGCATAGTTGTGAGTTAGCCCAATTATTACTCTCTCATAACCTAAAAAGCCCATTTCTCTATCAGTCCATTGCCACCTACCGATGCTCATTATCCCTTCTTCCTTCTCCGCAACAACCTCCACCGCCACACCCCCATTCCACCGTCATTCACTCGCCGACACGCACCACTACCCACCACATTCCCTCATCTTCCGCCCAGGTTATCTCTCTCTCTCTCTCTCTCTCTCTCTCAGTCTCAGTCTCTGGTTTCTAAGCCTTTTCTTTTCCTTTCTATTCTCTTTTCTTTCATCGATTCGACCAATTCGATAACTGGGTACTTTTGGAATGATCAACTGATTAAAATTACACTTGTATAGCCTCTACTTAAGCTTTTGAATTTAGGGTAGATATTACAAAAATATGGCTCTATGCGATTATGAATAGCTATTTTATCTGGTTATGTCTGGTAATTGTTATGATTCATACATGATATGAGAACTTGGATGTCAGGGTGCTGGAGTGTGGCATATGACCATAGGGTGTTTATCAAAGGATGAGCAGCATGCAGATAACTGCTACACAGAATTCTATTTGTGCCAATAAATCAATATGCCTTGTTTCTAAGTCAATATATCCATCATTCCATGCTAATCAGTCACGACGTGCTGTTGTAAATCTAAGTGCCAATGCATCTTATTTCAAGCAGGGTCTACCAGTTTTGAAGTATGAACATCGGAGGGTTGGATTAAAATATCAGCATACACCAATTGTTTCCTTATACGGTAGCAAGGGAAAGGGCAGTGATGATGGGGTGAGAGCATTTCTTAATTGTTCGTTTGTGATCATTAGATTGTCTATTCGGCTCTATTCTTCTATATGTTTCTTTCTGATATAACTATTAGTAAATTTTATTTTCTGATGCTTTGCTCTACCTATAATACTCTGCTTGGGTTTCTTCTTTCTTCTTTCTTCTTTCTTCTTTTTCCTTTCCTTTTTTATTTTAAATTGTTATCTTTGACGTGTTAAAAGGGAAAAAAAGTTCATAATATGGTTTCTAATGTTCTCCGAATGATTGAAGGATTGGTGGAACTACGCTTCTCTGAGACTTCAGATTGTGTAACATGCTTCTTGAATTTCTCGTGTGTCATTTTCTGTTCCACTAAGCAACGGTGGATGGTCATGAAGTACATTTTAATTGCTGGAATTTTTCAAACTTGAGGAATTTAGAATGGCACTGTTAGTTCTGTTTTCCCACGAGTGTTGAATATTCATCAATATGCTTACTTTGCAAAAGTCAGCCATTGTAATCTTTTTTTTCATAAACCATTAAGGTTACTCCTTATGAAGATTGGAACATTTAACAAGAACATGTGTTTGGTTTGATATCAAGTGTCAACCATCTTTTTCACTCTTTATTTCATGAATGTTCCTCATGCTAGGAATTATTGTTGTTCATTCACTTTTCCAGAAAGTGAGCAAGTTTGTTTTCCCCCCCTAATTTCCTTTTTTGGTTCAAGTTGCTTGATATTCAATACACAATGGTAGGAGGATTTGGAGCGTTTATTCATTTAACCACTATTATGTTCATGTCCCTATTTTTGAATTTCAAATGTGCAAATGATAACCTTGGAGTTTGACGAGTTTCTATTTGTTTAAGAGGTATCTGTTAATCTAGATATAGTAATAGTACTTTGTTTCTAGCAGGGTTCTCCATGGAAAGGTTTGGACAAAGTTGTTGAAAGTTTTAAGGGACGATCAGTAGAAGATGTCTTGCGACAACAAATTGAAAAGAAAGAGTTCTATGATGGTGGAGATGGTGGCAAAAGACCTCCAGGTGGCGGCGGCGGCAGTGGTGGTGGCGACAGCGGTGATGGTGGCGAGGATAGCTCTAGTGGATCTGAGGATTATAGTCTCACAGGAATTATGGATGAAATACTGCAAGTCATTCTGGCGACCCTCGGCTTAGTTTTCGTAGTAATATACTCATTTAACATTTTCTATTGGAAATGGCATTTCAATCAGTCTTAAAAAGTTACAACTAACCAAACAGAAAGTAGAGGAATTCACTGGTAAAAAAAGTCTCGAGTTAGCTTAATAGTACATATTTCGATGTAGGGCCTGATAACAAGGAGATTTCCTAGACTTGATATTTATAGTTAACTACTAGATCTACTGTTTTGACTCCATAAAGCCTTTGATGAAGAAAATAACTGGCCTTCATATATTCAGGACTAGTAAAGTTAAACTCTAGATGTGAGCCAACAAAGTCAACCTTCACGAAATTGGATGGTAATGCTGTAAAAACCGCAACCTTTTCTCCTTTTGCTCTCTCTGTTGTCAAGTTAAGAGAAAACACAATATTCATAACTCAGAAAGCTGGAAAGGAATAAGAGTATGTACCATGAAAATCATGTTTATTTGAATACATATCTTGACATTGAATATGTACTAAATGAGGAAATTTTTTAAACATGATTGTTGGTTGTGTACTTGGGACAAGTTAATTAGTTACAAAGCTTGGCTTATGTTTTCTTGTTTGTGGTGGGCAGTACATTTATATACTCAGTGGGGAAGAGCTATCGCGATTAGCGAAGGATTACATAAAGTATCTATTTGGAGGAAGCAAGAGTGTGCGTTTGAAGCGAGCGATGTACAACTGGGGAAAGTTTTACCAAAGCCTCATGAAAAAGAAGAAATATGATCAATATTGGCTGGAGAAAGCTATTCTTAGCACTCCAACATGGTGGGATAATCCTGATAAGTATATGCCTAAGAAGGCACAGAATCAGAAACAGAATGTTGCATCAGATGATTATGATGAAACCGATTACCTAGACTCTGATTATGGTGAAATTGATTTCTAAAGTCTGATTATGGTGAAATCGATTAAGTAAAGTCTGATGATGAGGAATTCTGAATCCTTCTGCTTTTCTGTAAACCAACCTGATGTGTCTTTTTCTGCTGTCAATTTTTGTTTTCTGAGATGCTAAATTTGATATGTTATGTTGTAGGATCTTGGTGGACTGATGTACATAACATGAGATTTTGTCACTTTTTATAATGTTCTTAGTTATGTGATCTCAAAGTCTTTCTGGTGATTGGAAGCGACTTAGTCCCAATGACCATAAATCTTAGAGGCATGGGCAAGTAGTAGAGTTTGTATAGTCAAGCATGAAATTAATTTTTTTAATTGTGGAGCTCACAAATTCCTTGGCCCAAAGAGGAGCTCTCAACTCCATTATCTTCCCTAACAACTTTGAAATCTAG

mRNA sequence

GAAGGAAGGGAATTGGTGAAATTGAGCATAGTTGTGAGTTAGCCCAATTATTACTCTCTCATAACCTAAAAAGCCCATTTCTCTATCAGTCCATTGCCACCTACCGATGCTCATTATCCCTTCTTCCTTCTCCGCAACAACCTCCACCGCCACACCCCCATTCCACCGTCATTCACTCGCCGACACGCACCACTACCCACCACATTCCCTCATCTTCCGCCCAGGGTGCTGGAGTGTGGCATATGACCATAGGGTGTTTATCAAAGGATGAGCAGCATGCAGATAACTGCTACACAGAATTCTATTTGTGCCAATAAATCAATATGCCTTGTTTCTAAGTCAATATATCCATCATTCCATGCTAATCAGTCACGACGTGCTGTTGTAAATCTAAGTGCCAATGCATCTTATTTCAAGCAGGGTCTACCAGTTTTGAAGTATGAACATCGGAGGGTTGGATTAAAATATCAGCATACACCAATTGTTTCCTTATACGGTAGCAAGGGAAAGGGCAGTGATGATGGGGGTTCTCCATGGAAAGGTTTGGACAAAGTTGTTGAAAGTTTTAAGGGACGATCAGTAGAAGATGTCTTGCGACAACAAATTGAAAAGAAAGAGTTCTATGATGGTGGAGATGGTGGCAAAAGACCTCCAGGTGGCGGCGGCGGCAGTGGTGGTGGCGACAGCGGTGATGGTGGCGAGGATAGCTCTAGTGGATCTGAGGATTATAGTCTCACAGGAATTATGGATGAAATACTGCAAGTCATTCTGGCGACCCTCGGCTTAGTTTTCGTATACATTTATATACTCAGTGGGGAAGAGCTATCGCGATTAGCGAAGGATTACATAAAGTATCTATTTGGAGGAAGCAAGAGTGTGCGTTTGAAGCGAGCGATGTACAACTGGGGAAAGTTTTACCAAAGCCTCATGAAAAAGAAGAAATATGATCAATATTGGCTGGAGAAAGCTATTCTTAGCACTCCAACATGGTGGGATAATCCTGATAAGTATATGCCTAAGAAGGCACAGAATCAGAAACAGAATGTTGCATCAGATGATTATGATGAAACCGATTACCTAGACTCTGATTATGGTGAAATTGATTTCTAAAGTCTGATTATGGTGAAATCGATTAAGTAAAGTCTGATGATGAGGAATTCTGAATCCTTCTGCTTTTCTGTAAACCAACCTGATGTGTCTTTTTCTGCTGTCAATTTTTGTTTTCTGAGATGCTAAATTTGATATGTTATGTTGTAGGATCTTGGTGGACTGATGTACATAACATGAGATTTTGTCACTTTTTATAATGTTCTTAGTTATGTGATCTCAAAGTCTTTCTGGTGATTGGAAGCGACTTAGTCCCAATGACCATAAATCTTAGAGGCATGGGCAAGTAGTAGAGTTTGTATAGTCAAGCATGAAATTAATTTTTTTAATTGTGGAGCTCACAAATTCCTTGGCCCAAAGAGGAGCTCTCAACTCCATTATCTTCCCTAACAACTTTGAAATCTAG

Coding sequence (CDS)

ATGAGCAGCATGCAGATAACTGCTACACAGAATTCTATTTGTGCCAATAAATCAATATGCCTTGTTTCTAAGTCAATATATCCATCATTCCATGCTAATCAGTCACGACGTGCTGTTGTAAATCTAAGTGCCAATGCATCTTATTTCAAGCAGGGTCTACCAGTTTTGAAGTATGAACATCGGAGGGTTGGATTAAAATATCAGCATACACCAATTGTTTCCTTATACGGTAGCAAGGGAAAGGGCAGTGATGATGGGGGTTCTCCATGGAAAGGTTTGGACAAAGTTGTTGAAAGTTTTAAGGGACGATCAGTAGAAGATGTCTTGCGACAACAAATTGAAAAGAAAGAGTTCTATGATGGTGGAGATGGTGGCAAAAGACCTCCAGGTGGCGGCGGCGGCAGTGGTGGTGGCGACAGCGGTGATGGTGGCGAGGATAGCTCTAGTGGATCTGAGGATTATAGTCTCACAGGAATTATGGATGAAATACTGCAAGTCATTCTGGCGACCCTCGGCTTAGTTTTCGTATACATTTATATACTCAGTGGGGAAGAGCTATCGCGATTAGCGAAGGATTACATAAAGTATCTATTTGGAGGAAGCAAGAGTGTGCGTTTGAAGCGAGCGATGTACAACTGGGGAAAGTTTTACCAAAGCCTCATGAAAAAGAAGAAATATGATCAATATTGGCTGGAGAAAGCTATTCTTAGCACTCCAACATGGTGGGATAATCCTGATAAGTATATGCCTAAGAAGGCACAGAATCAGAAACAGAATGTTGCATCAGATGATTATGATGAAACCGATTACCTAGACTCTGATTATGGTGAAATTGATTTCTAA

Protein sequence

MSSMQITATQNSICANKSICLVSKSIYPSFHANQSRRAVVNLSANASYFKQGLPVLKYEHRRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFKGRSVEDVLRQQIEKKEFYDGGDGGKRPPGGGGGSGGGDSGDGGEDSSSGSEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPTWWDNPDKYMPKKAQNQKQNVASDDYDETDYLDSDYGEIDF
BLAST of CsGy1G023950 vs. NCBI nr
Match: XP_004145228.1 (PREDICTED: uncharacterized protein LOC101222813 [Cucumis sativus] >KGN65868.1 hypothetical protein Csa_1G534740 [Cucumis sativus])

HSP 1 Score: 462.2 bits (1188), Expect = 1.2e-126
Identity = 280/280 (100.00%), Postives = 280/280 (100.00%), Query Frame = 0

Query: 1   MSSMQITATQNSICANKSICLVSKSIYPSFHANQSRRAVVNLSANASYFKQGLPVLKYEH 60
           MSSMQITATQNSICANKSICLVSKSIYPSFHANQSRRAVVNLSANASYFKQGLPVLKYEH
Sbjct: 1   MSSMQITATQNSICANKSICLVSKSIYPSFHANQSRRAVVNLSANASYFKQGLPVLKYEH 60

Query: 61  RRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFKGRSVEDVLRQQIEXXXXXX 120
           RRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFKGRSVEDVLRQQIEXXXXXX
Sbjct: 61  RRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFKGRSVEDVLRQQIEXXXXXX 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYSLTGIMDEILQVILATLGLVFVYIYI 180
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYSLTGIMDEILQVILATLGLVFVYIYI
Sbjct: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYSLTGIMDEILQVILATLGLVFVYIYI 180

Query: 181 LSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPT 240
           LSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPT
Sbjct: 181 LSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPT 240

Query: 241 WWDNPDKYMPKKAQNQKQNVASDDYDETDYLDSDYGEIDF 281
           WWDNPDKYMPKKAQNQKQNVASDDYDETDYLDSDYGEIDF
Sbjct: 241 WWDNPDKYMPKKAQNQKQNVASDDYDETDYLDSDYGEIDF 280

BLAST of CsGy1G023950 vs. NCBI nr
Match: XP_008444591.1 (PREDICTED: uncharacterized protein LOC103487859 [Cucumis melo])

HSP 1 Score: 342.8 bits (878), Expect = 1.1e-90
Identity = 219/277 (79.06%), Postives = 243/277 (87.73%), Query Frame = 0

Query: 1   MSSMQITATQNSICANKSICLVSKSIYPSFHANQSRRAVVNLSANASYFKQGLPVLKYEH 60
           MSSMQITATQNSICANKSICLVSKSIYPSFHANQS RAVVNLSANASYFKQGLP+LKY+H
Sbjct: 1   MSSMQITATQNSICANKSICLVSKSIYPSFHANQSLRAVVNLSANASYFKQGLPILKYKH 60

Query: 61  RRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESF-KGRSVEDVLRQQIE--XXX 120
           RRVGLK+QHTPIVSL+GSKGKGSDDGGSPWK  DKVVESF KG SVEDVLR+QIE  XXX
Sbjct: 61  RRVGLKHQHTPIVSLFGSKGKGSDDGGSPWKAFDKVVESFKKGGSVEDVLRKQIEXXXXX 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYSLTGIMDEILQVILATLGLVFVY 180
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   +SL   +DE LQV+LATLG +F+Y
Sbjct: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKDFSLAEALDETLQVVLATLGFIFMY 180

Query: 181 IYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILS 240
            Y+L+GEE++RL KDYIKY FGGSKSVRL+RAMY WG+FYQ L  KKKYD++WLEKAI++
Sbjct: 181 FYLLNGEEVTRLLKDYIKYRFGGSKSVRLRRAMYEWGRFYQRLTAKKKYDEFWLEKAIIN 240

Query: 241 TPTWWDNPDKYMPK-----KAQNQKQNVASDDYDETD 270
           TPTWWD+PD Y        KA+NQ++N ASDD  ETD
Sbjct: 241 TPTWWDHPDNYRHAAMAYGKAENQEKNFASDDDGETD 277

BLAST of CsGy1G023950 vs. NCBI nr
Match: XP_022140099.1 (uncharacterized protein LOC111010834 [Momordica charantia])

HSP 1 Score: 315.1 bits (806), Expect = 2.4e-82
Identity = 202/282 (71.63%), Postives = 231/282 (81.91%), Query Frame = 0

Query: 1   MSSMQITATQNSICANKSICLVSKSIYPSFHANQSRRAVVNLSANASYFKQGLPVLKYEH 60
           MSSMQITATQNSIC+++SIC+ SKSIYPSF A +SR A+VNLSANASYFKQGLPVLKY+H
Sbjct: 1   MSSMQITATQNSICSSRSICIASKSIYPSFRATRSRSALVNLSANASYFKQGLPVLKYKH 60

Query: 61  RRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESF-KGRSVEDVLRQQIEXXXXX 120
           RR GL +QHTPIVSL+GSKGK S DGGSPWK  DKVVE+F KGRSVEDVLRQQIE     
Sbjct: 61  RRAGLNHQHTPIVSLFGSKGKESGDGGSPWKTFDKVVENFKKGRSVEDVLRQQIE----- 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYSLTGIMDEILQVILATLGLVFVYIY 180
               XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX     I+DE LQVILAT+G +F+YIY
Sbjct: 121 ----XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIIDETLQVILATIGFIFLYIY 180

Query: 181 ILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTP 240
           I+SGEEL+RLAKDYIK++FGGSKSVRLKRAMY WG+FYQ L +KK+YD+YWLEKAI++TP
Sbjct: 181 IISGEELTRLAKDYIKFVFGGSKSVRLKRAMYKWGRFYQKLTEKKQYDEYWLEKAIINTP 240

Query: 241 TWWDNPDK-------YMPKKAQNQKQNVASDDYDETDYLDSD 275
           TWWD+PDK       YM  + +NQ      +D  E D  +SD
Sbjct: 241 TWWDHPDKYRRAVMDYMESQYENQHSASNVNDDAEMDVSNSD 273

BLAST of CsGy1G023950 vs. NCBI nr
Match: XP_022955156.1 (uncharacterized protein LOC111457207 [Cucurbita moschata])

HSP 1 Score: 283.9 bits (725), Expect = 6.0e-73
Identity = 180/249 (72.29%), Postives = 206/249 (82.73%), Query Frame = 0

Query: 2   SSMQITATQNSICANKSICLVSKSIYPSFHANQSRRAVVNLSANASYFKQGLPVLKYEHR 61
           S MQITATQNS+C NKSICLVSKS YPSF A+Q+R A VN SANASY K+GLPVLKY+HR
Sbjct: 3   SMMQITATQNSLCPNKSICLVSKSTYPSFLASQTRSAFVNPSANASYLKKGLPVLKYDHR 62

Query: 62  RVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESF-KGRSVEDVLRQQIEXXXXXX 121
           RVGLK+++TPI SL+GSKGK + DGGSPWK  DKVVE+F KGRSVED+LRQQIE      
Sbjct: 63  RVGLKHRYTPIASLFGSKGKDNADGGSPWKAFDKVVENFKKGRSVEDILRQQIENKDFYD 122

Query: 122 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYSLTGIMDEILQVILATLGLVFVYIYI 181
           XXXXXXXXXXXXXXXXXXXXXXXXXXXX      S+ GI++E + V+LAT+GLV VYIYI
Sbjct: 123 XXXXXXXXXXXXXXXXXXXXXXXXXXXXP-----SILGILEETMHVVLATIGLVLVYIYI 182

Query: 182 LSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFY-QSLMKKKKYDQYWLEKAILSTP 241
           + G+EL  LAKDYIKYLFG  +S RLK AMY+WGKFY +   KK K D+YWLEKAIL+TP
Sbjct: 183 IEGQELVLLAKDYIKYLFGADRSARLKSAMYSWGKFYKRRTQKKPKPDEYWLEKAILNTP 242

Query: 242 TWWDNPDKY 249
           TWWD+PDKY
Sbjct: 243 TWWDHPDKY 246

BLAST of CsGy1G023950 vs. NCBI nr
Match: XP_023542514.1 (uncharacterized protein LOC111802398 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 282.3 bits (721), Expect = 1.7e-72
Identity = 177/249 (71.08%), Postives = 204/249 (81.93%), Query Frame = 0

Query: 2   SSMQITATQNSICANKSICLVSKSIYPSFHANQSRRAVVNLSANASYFKQGLPVLKYEHR 61
           S MQITATQNS+C NKSICLVSKS+YPSF A+Q+R A VN SANASY K+GLPVLKY+HR
Sbjct: 3   SMMQITATQNSLCPNKSICLVSKSMYPSFLASQTRSAFVNPSANASYLKKGLPVLKYDHR 62

Query: 62  RVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESF-KGRSVEDVLRQQIEXXXXXX 121
           RVGLK+++TPI SL+GSKGK S DG SPWK  DKVVE+F KGRSVED+LRQQIE      
Sbjct: 63  RVGLKHRYTPIASLFGSKGKDSGDGASPWKAFDKVVENFKKGRSVEDILRQQIENKDFYD 122

Query: 122 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYSLTGIMDEILQVILATLGLVFVYIYI 181
           XXXXXXXXXXXXXXXXXXXXXXXXXX        S+ GI++E + V+LAT+GLV VYIYI
Sbjct: 123 XXXXXXXXXXXXXXXXXXXXXXXXXXEGP-----SILGILEETMHVVLATIGLVLVYIYI 182

Query: 182 LSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKK-KYDQYWLEKAILSTP 241
           + G+EL  L KDYIKYLFG  +S RLK AMY+WGKFY+   +KK K D+YWLEKAIL+TP
Sbjct: 183 IEGQELVLLVKDYIKYLFGADQSARLKSAMYSWGKFYKRRTRKKPKPDEYWLEKAILNTP 242

Query: 242 TWWDNPDKY 249
           TWWD+PDKY
Sbjct: 243 TWWDHPDKY 246

BLAST of CsGy1G023950 vs. TAIR10
Match: AT2G43630.1 (FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 142.1 bits (357), Expect = 5.1e-34
Identity = 118/231 (51.08%), Postives = 156/231 (67.53%), Query Frame = 0

Query: 20  CLVSKSIYPSFHANQSRRAVVNLSANASYFKQGLPVLKYEHRRVGLKYQHTPIVSLYGSK 79
           C+ S  I  S   +   R    L A A+   Q  P+L +  R    K + +  V L+G K
Sbjct: 23  CISSVPIRSSVRFDHFPRTSFTLRATAAVSTQFSPLLDHRRRLPTGKSKQSSAVCLFGGK 82

Query: 80  GK--GSDDGGSPWKGLDKVVESFKGRSVEDVLRQQIEXXXXXXXXXXXXXXXXXXXXXXX 139
            K  GSD+  SPWK ++K +     +SVED+LR+QI+         XXXXXXXXXXXXXX
Sbjct: 83  DKPDGSDE-ISPWKAIEKAMGK---KSVEDMLREQIQ-KKDFYDTDXXXXXXXXXXXXXX 142

Query: 140 XXXXXXXXXXXXXXXXYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYL 199
           XXXXXXXXXXXXXXXX  L GI DE LQV+LATLG +F+Y YI++GEEL +LA+DYI++L
Sbjct: 143 XXXXXXXXXXXXXXXXXGLAGIADETLQVVLATLGFIFLYTYIITGEELVKLARDYIRFL 202

Query: 200 FGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPTWWDNPDKY 249
            G  K+VRL RAM +W  F + + +++ YD+YWLEKAI++TPTW+D+P+KY
Sbjct: 203 MGRPKTVRLTRAMDSWNGFLEKMSRQRVYDEYWLEKAIINTPTWYDSPEKY 248

BLAST of CsGy1G023950 vs. TAIR10
Match: AT3G59640.1 (glycine-rich protein)

HSP 1 Score: 104.8 bits (260), Expect = 9.1e-23
Identity = 97/239 (40.59%), Postives = 137/239 (57.32%), Query Frame = 0

Query: 6   ITATQNSICANKSICL-------VSKSIYPS---FHANQSRRAVVNLSANASYFKQGLPV 65
           +++TQ ++C     C        VS + + S   F      +  +  SA++S   Q  P+
Sbjct: 1   MSSTQANLCRPSLFCARTTQTRHVSSAPFMSSLRFDYRPLPKLAIRASASSSMSSQFSPL 60

Query: 66  LKYEHRRVGLKYQHTPIVSLYGSKGK--GSDDGGSPWKGLDKVVESFKGRSVEDVLRQQI 125
             +  R      +  P+V L G K K  GS++  S W+ ++K +     +SVED+LR+QI
Sbjct: 61  QNHRCR----NQRQGPVVCLLGGKDKSNGSNELSSTWEAIEKAMGK---KSVEDMLREQI 120

Query: 126 EXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYSLTGIMDEILQVILATLGL 185
           +                  XXXXXXXXXXXXXXXXXXXXX  L    DE LQV+LATLG 
Sbjct: 121 Q------KKDTGGIPPRGRXXXXXXXXXXXXXXXXXXXXXXXLASFGDETLQVVLATLGF 180

Query: 186 VFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLE 233
           +F+Y YI++GEEL RLA+DYI+YL G  KSVRL R M  W +F++ + +KK Y++YWL+
Sbjct: 181 IFLYFYIINGEELFRLARDYIRYLIGRPKSVRLTRVMEGWSRFFEKMSRKKVYNEYWLK 226

BLAST of CsGy1G023950 vs. TrEMBL
Match: tr|A0A0A0LVP5|A0A0A0LVP5_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G534740 PE=4 SV=1)

HSP 1 Score: 462.2 bits (1188), Expect = 8.1e-127
Identity = 280/280 (100.00%), Postives = 280/280 (100.00%), Query Frame = 0

Query: 1   MSSMQITATQNSICANKSICLVSKSIYPSFHANQSRRAVVNLSANASYFKQGLPVLKYEH 60
           MSSMQITATQNSICANKSICLVSKSIYPSFHANQSRRAVVNLSANASYFKQGLPVLKYEH
Sbjct: 1   MSSMQITATQNSICANKSICLVSKSIYPSFHANQSRRAVVNLSANASYFKQGLPVLKYEH 60

Query: 61  RRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFKGRSVEDVLRQQIEXXXXXX 120
           RRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFKGRSVEDVLRQQIEXXXXXX
Sbjct: 61  RRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFKGRSVEDVLRQQIEXXXXXX 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYSLTGIMDEILQVILATLGLVFVYIYI 180
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYSLTGIMDEILQVILATLGLVFVYIYI
Sbjct: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYSLTGIMDEILQVILATLGLVFVYIYI 180

Query: 181 LSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPT 240
           LSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPT
Sbjct: 181 LSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPT 240

Query: 241 WWDNPDKYMPKKAQNQKQNVASDDYDETDYLDSDYGEIDF 281
           WWDNPDKYMPKKAQNQKQNVASDDYDETDYLDSDYGEIDF
Sbjct: 241 WWDNPDKYMPKKAQNQKQNVASDDYDETDYLDSDYGEIDF 280

BLAST of CsGy1G023950 vs. TrEMBL
Match: tr|A0A1S3BA69|A0A1S3BA69_CUCME (uncharacterized protein LOC103487859 OS=Cucumis melo OX=3656 GN=LOC103487859 PE=4 SV=1)

HSP 1 Score: 342.8 bits (878), Expect = 7.2e-91
Identity = 219/277 (79.06%), Postives = 243/277 (87.73%), Query Frame = 0

Query: 1   MSSMQITATQNSICANKSICLVSKSIYPSFHANQSRRAVVNLSANASYFKQGLPVLKYEH 60
           MSSMQITATQNSICANKSICLVSKSIYPSFHANQS RAVVNLSANASYFKQGLP+LKY+H
Sbjct: 1   MSSMQITATQNSICANKSICLVSKSIYPSFHANQSLRAVVNLSANASYFKQGLPILKYKH 60

Query: 61  RRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESF-KGRSVEDVLRQQIE--XXX 120
           RRVGLK+QHTPIVSL+GSKGKGSDDGGSPWK  DKVVESF KG SVEDVLR+QIE  XXX
Sbjct: 61  RRVGLKHQHTPIVSLFGSKGKGSDDGGSPWKAFDKVVESFKKGGSVEDVLRKQIEXXXXX 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYSLTGIMDEILQVILATLGLVFVY 180
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX   +SL   +DE LQV+LATLG +F+Y
Sbjct: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKDFSLAEALDETLQVVLATLGFIFMY 180

Query: 181 IYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILS 240
            Y+L+GEE++RL KDYIKY FGGSKSVRL+RAMY WG+FYQ L  KKKYD++WLEKAI++
Sbjct: 181 FYLLNGEEVTRLLKDYIKYRFGGSKSVRLRRAMYEWGRFYQRLTAKKKYDEFWLEKAIIN 240

Query: 241 TPTWWDNPDKYMPK-----KAQNQKQNVASDDYDETD 270
           TPTWWD+PD Y        KA+NQ++N ASDD  ETD
Sbjct: 241 TPTWWDHPDNYRHAAMAYGKAENQEKNFASDDDGETD 277

BLAST of CsGy1G023950 vs. TrEMBL
Match: tr|A0A2P4KWK7|A0A2P4KWK7_QUESU (Uncharacterized protein OS=Quercus suber OX=58331 GN=CFP56_31071 PE=4 SV=1)

HSP 1 Score: 189.9 bits (481), Expect = 7.8e-45
Identity = 142/252 (56.35%), Postives = 174/252 (69.05%), Query Frame = 0

Query: 1   MSSMQITATQNSICAN---KSICLVSKSIYPSFHANQSRRAVVNLSANASYFKQGLPVLK 60
           MS+MQITA Q  IC      S  L S          +  R  ++ + NAS  +Q +PV  
Sbjct: 1   MSTMQITAYQPKICVRHIPHSYRLPSNPCILPTLPKRVPRTSLSTNVNASRHQQCVPV-- 60

Query: 61  YEHRRVGLKYQHTPIVSLYGSKGK-GSDDGGSPWKGLDKVVESFKGRSVEDVLRQQIEXX 120
                   KYQ +  V L+G KGK G ++GGSPW     ++  FKG+SVEDVLRQQI XX
Sbjct: 61  -------SKYQQSMPVCLFGGKGKTGGENGGSPWNAFQNILGRFKGKSVEDVLRQQIXXX 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYSLTGIMDEILQVILATLGLVFV 180
           XXXXXXXXXXXXXXXXXXXXXXXXXXX            L GI+DE +QVILAT+G +F+
Sbjct: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXDE---------DLAGIIDETVQVILATIGFIFM 180

Query: 181 YIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAIL 240
           Y+YI+SGEE++RLAKDYIKYLFGGS+S RLKRAMY WG+FY+ L +KK  D +WLEKAI+
Sbjct: 181 YVYIISGEEMARLAKDYIKYLFGGSQSARLKRAMYKWGRFYKKLTEKKVVDPFWLEKAII 234

Query: 241 STPTWWDNPDKY 249
           +TPTWWD+P+KY
Sbjct: 241 NTPTWWDSPEKY 234

BLAST of CsGy1G023950 vs. TrEMBL
Match: tr|A0A251R0R7|A0A251R0R7_PRUPE (Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_1G205800 PE=4 SV=1)

HSP 1 Score: 188.0 bits (476), Expect = 3.0e-44
Identity = 143/261 (54.79%), Postives = 181/261 (69.35%), Query Frame = 0

Query: 1   MSSMQITATQNSICANKSICLVSKSIYPSFHANQSRRAVVNLSANASYFKQGLPVLKYEH 60
           MS+ QIT++Q SIC   +    S+++ PS          +  + NAS  +Q +PVLK + 
Sbjct: 1   MSTFQITSSQPSICFRNT----SQALRPS--PKLLVPTCLQPTYNASIRQQRMPVLKAQQ 60

Query: 61  RRVGLKYQHTPIVSLYGSKGKG-SDDGGSPWKGLDKVVESF-KGRSVEDVLRQQIEXXXX 120
            RV  KY  +  V L G KGK  S D GSPWK L+K + +  K +S+EDVLRQQIE    
Sbjct: 61  CRVLSKYHQSAPVCLLGGKGKSESGDEGSPWKALEKAMGNLKKDQSIEDVLRQQIE---R 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYSLTGIMDEILQVILATLGLVFVYI 180
                   XXXXXXXXXXXXXXXXXXXXXXXXXXX  L GIMDE LQVILAT+G +F+Y 
Sbjct: 121 NEFYEERGXXXXXXXXXXXXXXXXXXXXXXXXXXXXGLAGIMDETLQVILATVGFLFLYF 180

Query: 181 YILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILST 240
           YI+SGEE +RLAKDYIK+L  GSKS+RL+R+MY WG+FY++L +KK YD++WLEKAI++T
Sbjct: 181 YIISGEEWTRLAKDYIKFLLSGSKSIRLQRSMYKWGRFYKNLTEKKYYDKFWLEKAIITT 240

Query: 241 PTWWDNPDKYMPKKAQNQKQN 260
           PTWWD+P+KY      N + N
Sbjct: 241 PTWWDSPEKYRHIVRSNLESN 252

BLAST of CsGy1G023950 vs. TrEMBL
Match: tr|A0A2P6QUA4|A0A2P6QUA4_ROSCH (Uncharacterized protein OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr4g0406201 PE=4 SV=1)

HSP 1 Score: 182.2 bits (461), Expect = 1.6e-42
Identity = 146/254 (57.48%), Postives = 183/254 (72.05%), Query Frame = 0

Query: 1   MSSMQITATQNSI---CANKSICLVSKSIYPSFHANQS-RRAVVNLSANASYFKQGLPVL 60
           MS++QI++    I    +++S+ L SK      HA  +  R  +  + NAS  ++ + VL
Sbjct: 1   MSTVQISSCNPCIRLRNSSQSLRLPSKPYILPIHAKHALLRNPLVATCNASIQQRCMTVL 60

Query: 61  KYEHRRVGLKYQHTPIVSLYGSKGK-GSDDGGSPWKGLDKVVESFKGR-SVEDVLRQQIE 120
           K +  R    YQ +  V L G KGK GSDD  SPWK L+K + + K   S+EDVLRQQIE
Sbjct: 61  KNQRCRAVSTYQQSFPVCLLGGKGKNGSDDEASPWKSLEKAMSNLKKESSIEDVLRQQIE 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXYSLTGIMDEILQVILATLGLV 180
                 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX    GI+DE LQVILAT+G +
Sbjct: 121 -----KXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGFAGIIDETLQVILATIGFI 180

Query: 181 FVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKA 240
           F+Y+YI++GEE +RLAKDY+K+LF GS+SVRLKRAMY WG+FYQ L +KK YD++WLEKA
Sbjct: 181 FLYVYIITGEEWARLAKDYLKFLFSGSESVRLKRAMYKWGRFYQKLTEKKVYDKFWLEKA 240

Query: 241 ILSTPTWWDNPDKY 249
           I+STPTWWD+P KY
Sbjct: 241 IISTPTWWDSPAKY 249

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_004145228.11.2e-126100.00PREDICTED: uncharacterized protein LOC101222813 [Cucumis sativus] >KGN65868.1 hy... [more]
XP_008444591.11.1e-9079.06PREDICTED: uncharacterized protein LOC103487859 [Cucumis melo][more]
XP_022140099.12.4e-8271.63uncharacterized protein LOC111010834 [Momordica charantia][more]
XP_022955156.16.0e-7372.29uncharacterized protein LOC111457207 [Cucurbita moschata][more]
XP_023542514.11.7e-7271.08uncharacterized protein LOC111802398 [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
AT2G43630.15.1e-3451.08FUNCTIONS IN: molecular_function unknown[more]
AT3G59640.19.1e-2340.59glycine-rich protein[more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
tr|A0A0A0LVP5|A0A0A0LVP5_CUCSA8.1e-127100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G534740 PE=4 SV=1[more]
tr|A0A1S3BA69|A0A1S3BA69_CUCME7.2e-9179.06uncharacterized protein LOC103487859 OS=Cucumis melo OX=3656 GN=LOC103487859 PE=... [more]
tr|A0A2P4KWK7|A0A2P4KWK7_QUESU7.8e-4556.35Uncharacterized protein OS=Quercus suber OX=58331 GN=CFP56_31071 PE=4 SV=1[more]
tr|A0A251R0R7|A0A251R0R7_PRUPE3.0e-4454.79Uncharacterized protein OS=Prunus persica OX=3760 GN=PRUPE_1G205800 PE=4 SV=1[more]
tr|A0A2P6QUA4|A0A2P6QUA4_ROSCH1.6e-4257.48Uncharacterized protein OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr4g0406201 PE=4... [more]
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsGy1G023950.1CsGy1G023950.1mRNA


Analysis Name: InterPro Annotations of cucumber Gy14 genome (v2)
Date Performed: 2018-09-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 265..280
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 252..280
NoneNo IPR availablePANTHERPTHR35483FAMILY NOT NAMEDcoord: 1..260
NoneNo IPR availablePANTHERPTHR35483:SF1GLYCINE-RICH PROTEINcoord: 1..260