HG10012224 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10012224
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionKeratin, type I cytoskeletal 9-like
LocationChr01: 19035702 .. 19036865 (+)
RNA-Seq ExpressionHG10012224
SyntenyHG10012224
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTTCTCATAAACTTTCTTTTCTTGTGTTCTTCCTTTTGTTCGGTATTGAAATTTCTACTGCATCCAGATTTCTATCAACTTTTGGTGAAGGAGGATATGGCAATCCAAGTAGCCTCGGTGATAGTAAGTATAGTTCTATAGAAAAAAATGGTGTTGGAGGCTATGATGGTGAATATGGTGGTGGATATAGTCCAAAAGATTATGGATATGGAAAGTATAAGAATGATTATTATTTGGGTTGGGAAAGGTACGGTAAGGAGAGTACTGATTATGGTTATAGTGTTCCCAAATATGATATTGAAAAAGGTAAAGGGGGTGATTATTATGTTTATGGTGGAGGAAAACGTGGAGGATCATCTAAAGATTATGATACTTATGCAAGTAGCTGCAAAGATGGAGGAGTGTACAAATATCCTAGTTATGGAGACAATGACAAAGGTGGAGAATCACCCAAGAAATATGTTGATTATAGAAATGGTGACAAAGTGGGAGAATCATCCGGAGAATATGGTGGTGATAATGCATACAATGGCAAAACTGGAGCATACAACTATCCTAATTATGAAAATAGTGAGAAAAGTGCAGAATCATCCAAAGAATATGTTGATTACAGAAATGGTGGCAAAGGTAGAAAATCATCCGAAGAATATAGTGGTAGTGCATACAGTGGCAAAGCTGGAGCATACGACTATCCTAATTATGGAAATGGTGAGAAAAGTGGAGAATCATCCAAAGAATATGTTGGTTATAGAAATGGTGGCACAGGTAGAGAATCATCTGAAGAATATAGTGGTAAAGTATATAGTGACAAAGCTGGAGTATACAACTATCCTAGTTATGGAAATGGTGAAAAAAGTGGAGAATCATCCAAAGAATATGTTGATTACAGAAATGGTGGCAAAGGTAGAGAATCATCTGAAGAATATAGTGGTAATGCATACAGTGGCAAAGCTGGAGCATACAACTATCCTAATTATGGAAATGGTGAAAAAAGTGGAGAATCATCCAAAGAATATGTTGATTATAGAACTGGTGGCAAAGTTGTAGCATACAAATATCCTAATTATGGAAATGGTGAAAAAGATGACAAATCATCTAAAAGATATGGTGGCCATCAAGAGAGCAATATATATGGCGGAGGTAGCGGCACTGAACCTTAA

mRNA sequence

ATGGCTTCTCATAAACTTTCTTTTCTTGTGTTCTTCCTTTTGTTCGGTATTGAAATTTCTACTGCATCCAGATTTCTATCAACTTTTGGTGAAGGAGGATATGGCAATCCAAGTAGCCTCGGTGATAGTAAGTATAGTTCTATAGAAAAAAATGGTGTTGGAGGCTATGATGGTGAATATGGTGGTGGATATAGTCCAAAAGATTATGGATATGGAAAGTATAAGAATGATTATTATTTGGGTTGGGAAAGGTACGGTAAGGAGAGTACTGATTATGGTTATAGTGTTCCCAAATATGATATTGAAAAAGGTAAAGGGGGTGATTATTATGTTTATGGTGGAGGAAAACGTGGAGGATCATCTAAAGATTATGATACTTATGCAAGTAGCTGCAAAGATGGAGGAGTGTACAAATATCCTAGTTATGGAGACAATGACAAAGGTGGAGAATCACCCAAGAAATATGTTGATTATAGAAATGGTGACAAAGTGGGAGAATCATCCGGAGAATATGGTGGTGATAATGCATACAATGGCAAAACTGGAGCATACAACTATCCTAATTATGAAAATAGTGAGAAAAGTGCAGAATCATCCAAAGAATATGTTGATTACAGAAATGGTGGCAAAGGTAGAAAATCATCCGAAGAATATAGTGGTAGTGCATACAGTGGCAAAGCTGGAGCATACGACTATCCTAATTATGGAAATGGTGAGAAAAGTGGAGAATCATCCAAAGAATATGTTGGTTATAGAAATGGTGGCACAGGTAGAGAATCATCTGAAGAATATAGTGGTAAAGTATATAGTGACAAAGCTGGAGTATACAACTATCCTAGTTATGGAAATGGTGAAAAAAGTGGAGAATCATCCAAAGAATATGTTGATTACAGAAATGGTGGCAAAGGTAGAGAATCATCTGAAGAATATAGTGGTAATGCATACAGTGGCAAAGCTGGAGCATACAACTATCCTAATTATGGAAATGGTGAAAAAAGTGGAGAATCATCCAAAGAATATGTTGATTATAGAACTGGTGGCAAAGTTGTAGCATACAAATATCCTAATTATGGAAATGGTGAAAAAGATGACAAATCATCTAAAAGATATGGTGGCCATCAAGAGAGCAATATATATGGCGGAGGTAGCGGCACTGAACCTTAA

Coding sequence (CDS)

ATGGCTTCTCATAAACTTTCTTTTCTTGTGTTCTTCCTTTTGTTCGGTATTGAAATTTCTACTGCATCCAGATTTCTATCAACTTTTGGTGAAGGAGGATATGGCAATCCAAGTAGCCTCGGTGATAGTAAGTATAGTTCTATAGAAAAAAATGGTGTTGGAGGCTATGATGGTGAATATGGTGGTGGATATAGTCCAAAAGATTATGGATATGGAAAGTATAAGAATGATTATTATTTGGGTTGGGAAAGGTACGGTAAGGAGAGTACTGATTATGGTTATAGTGTTCCCAAATATGATATTGAAAAAGGTAAAGGGGGTGATTATTATGTTTATGGTGGAGGAAAACGTGGAGGATCATCTAAAGATTATGATACTTATGCAAGTAGCTGCAAAGATGGAGGAGTGTACAAATATCCTAGTTATGGAGACAATGACAAAGGTGGAGAATCACCCAAGAAATATGTTGATTATAGAAATGGTGACAAAGTGGGAGAATCATCCGGAGAATATGGTGGTGATAATGCATACAATGGCAAAACTGGAGCATACAACTATCCTAATTATGAAAATAGTGAGAAAAGTGCAGAATCATCCAAAGAATATGTTGATTACAGAAATGGTGGCAAAGGTAGAAAATCATCCGAAGAATATAGTGGTAGTGCATACAGTGGCAAAGCTGGAGCATACGACTATCCTAATTATGGAAATGGTGAGAAAAGTGGAGAATCATCCAAAGAATATGTTGGTTATAGAAATGGTGGCACAGGTAGAGAATCATCTGAAGAATATAGTGGTAAAGTATATAGTGACAAAGCTGGAGTATACAACTATCCTAGTTATGGAAATGGTGAAAAAAGTGGAGAATCATCCAAAGAATATGTTGATTACAGAAATGGTGGCAAAGGTAGAGAATCATCTGAAGAATATAGTGGTAATGCATACAGTGGCAAAGCTGGAGCATACAACTATCCTAATTATGGAAATGGTGAAAAAAGTGGAGAATCATCCAAAGAATATGTTGATTATAGAACTGGTGGCAAAGTTGTAGCATACAAATATCCTAATTATGGAAATGGTGAAAAAGATGACAAATCATCTAAAAGATATGGTGGCCATCAAGAGAGCAATATATATGGCGGAGGTAGCGGCACTGAACCTTAA

Protein sequence

MASHKLSFLVFFLLFGIEISTASRFLSTFGEGGYGNPSSLGDSKYSSIEKNGVGGYDGEYGGGYSPKDYGYGKYKNDYYLGWERYGKESTDYGYSVPKYDIEKGKGGDYYVYGGGKRGGSSKDYDTYASSCKDGGVYKYPSYGDNDKGGESPKKYVDYRNGDKVGESSGEYGGDNAYNGKTGAYNYPNYENSEKSAESSKEYVDYRNGGKGRKSSEEYSGSAYSGKAGAYDYPNYGNGEKSGESSKEYVGYRNGGTGRESSEEYSGKVYSDKAGVYNYPSYGNGEKSGESSKEYVDYRNGGKGRESSEEYSGNAYSGKAGAYNYPNYGNGEKSGESSKEYVDYRTGGKVVAYKYPNYGNGEKDDKSSKRYGGHQESNIYGGGSGTEP
Homology
BLAST of HG10012224 vs. NCBI nr
Match: XP_038887039.1 (shematrin-like protein 2 [Benincasa hispida])

HSP 1 Score: 179.9 bits (455), Expect = 4.3e-41
Identity = 114/232 (49.14%), Postives = 139/232 (59.91%), Query Frame = 0

Query: 1   MASHKLSFLVFFLLFGIEISTASRFLSTFGEGGYGNPSSLGDSKYSSIEKNGVGGYDGEY 60
           MAS+KLS LVF LLFGIEIS A+R L+T+ +G YGNPS+  +  Y+ + +NGV    G Y
Sbjct: 1   MASYKLSSLVFLLLFGIEISIATRALTTYSQGEYGNPSTYSNGGYNYVGENGV----GRY 60

Query: 61  GGGYSPKDYGYGKYKNDYYLGWERYGKESTDYGYSVPKYDIEKGKG---------GDYYV 120
            GGY+PK YGYGKYKNDYY G E+YG+ +T +GY V  YD  K KG         G  Y 
Sbjct: 61  AGGYTPKGYGYGKYKNDYYSGREKYGQGTTSFGYGVSGYDTGKSKGNDDDYSEQYGQGYG 120

Query: 121 YGGGKRGGSSKDYDTYASSCKDGGVYKYPSYGDNDKGGESPKKYVDYRNGDKVGESSGEY 180
            GGGKRGGS  +Y++YAS CKDGGV  YPSYG ++KGGE  K+Y                
Sbjct: 121 DGGGKRGGSFNEYNSYASGCKDGGVNNYPSYGYDEKGGEMSKEY---------------- 180

Query: 181 GGDNAYNGKTG-AYNYPNYENSEKSAESSKEYVDYRNGGKGRKSSEEYSGSA 223
             DN Y GK+G  Y YPNY    K  +S KEY        GR+ S  Y G +
Sbjct: 181 -NDNRYGGKSGEMYEYPNYGAGNKGDKSYKEY-------GGRQGSSAYDGDS 204

BLAST of HG10012224 vs. NCBI nr
Match: XP_038887038.1 (heterogeneous nuclear ribonucleoprotein A3-like [Benincasa hispida])

HSP 1 Score: 140.2 bits (352), Expect = 3.8e-29
Identity = 102/206 (49.51%), Postives = 120/206 (58.25%), Query Frame = 0

Query: 16  GIEISTASRFLSTFGEGGYGNPSSLGDSKYSSIEKNGVGGYDGEYGGGYSPKDYGYGKYK 75
           GI+I  A+R L+T  EGGYG PSS      + + +NGV    G YGG Y+PK YGYGKY 
Sbjct: 2   GIQICLAARVLTTSDEGGYGYPSS-----GAFVGENGV----GSYGGDYNPKGYGYGKYG 61

Query: 76  NDYYLGWERYGKESTDYGY-SVPKYDIE-KGKGGDYYVY---------GGGKRGGSSKDY 135
           NDYY GW  YG+ ST  GY S P Y+   K KGG+Y  Y         GGG+RGGS K Y
Sbjct: 62  NDYYNGWRWYGQGSTISGYGSAPGYEYNGKDKGGEYDYYEQYGQGYGGGGGERGGSFKGY 121

Query: 136 DTYASSCKDGGVYKYPSYGDNDKGGESPKKYVDYRNGDKVGESSGEYGGDNAYNGKTGAY 195
           D+  S   DGG Y Y SYG+N+K  ES K Y  YR G K  ES  EYGG+   + K G  
Sbjct: 122 DSRTS---DGGAYNYRSYGNNEKTRESSKDYFGYRYGGKSRESFKEYGGNELNSEKWGGM 181

Query: 196 N-YPNYENSEKSAESSKEYVDYRNGG 210
           N YP Y NS K  +SS+EY     GG
Sbjct: 182 NKYPYYGNSNKDEKSSQEYSGGIYGG 195

BLAST of HG10012224 vs. NCBI nr
Match: KAA0038572.1 (keratin, type I cytoskeletal 9-like [Cucumis melo var. makuwa])

HSP 1 Score: 120.9 bits (302), Expect = 2.4e-23
Identity = 161/442 (36.43%), Postives = 202/442 (45.70%), Query Frame = 0

Query: 1   MASHKLSF-LVFFLLFGIEISTASRFLSTFGEGGYGNPSSLGDSKYSSIEKNGVGGYDGE 60
           MAS +LSF LVFF+LF IEIS     +++ GE  YGNPS+ G  KY  I KNG+GGY  +
Sbjct: 1   MASPRLSFSLVFFILFAIEIS----LIASEGE-KYGNPSNFGYRKYGFIGKNGLGGYAEK 60

Query: 61  YGGGYSPKDYGYGKYKNDYYLGWERYGKESTDYGYSVPKYDIEKGKGGDYYVYGGGKRGG 120
                         Y NDYY  W R+ K      YS                 G G  GG
Sbjct: 61  -------------NYNNDYY--WGRWRKSDQGSTYS-----------------GNGASGG 120

Query: 121 SSKDYDTYASSC-------------------------KDGGVYKYPSYG--DNDKGGESP 180
           SS +YD + SSC                         K+ G Y  PS G   N+ GGE+ 
Sbjct: 121 SSNEYDDHTSSCKSPSNGSGYKSEEPSKENYGNANNGKNAGEYNNPSTGGNSNNGGGEAS 180

Query: 181 KKYVDYR-NGDKVGESSGEYGGDNAYNGKTGAYNYPNYENSEKSAESSKEYVDYRNGGKG 240
           K Y     NG   GE S    G N  NG  G      Y  SE + +S  EY +   GG G
Sbjct: 181 KGYSGSEGNGKSGGEYSNPSTGGNGNNG--GGEASKGYSGSEGNGKSGGEYSNPSTGGNG 240

Query: 241 R----KSSEEYSGSAYSGKAGAYDYPNYGNGEK----SGESSKEYVGYRN---------- 300
                ++S+ YSGS  +GK+G  +Y N   GE      GE+SK Y G  N          
Sbjct: 241 NNGGGEASKGYSGSGSNGKSGG-EYSNPSTGENGNNGGGEASKGYSGSENSGQSGGSGYS 300

Query: 301 --GGTGRESSEEYSGKVYSDKAGVYNYPSYGNGEKSGE--SSKEYVDYRNGGKGRESSEE 360
             GG+G +S EE S K Y+           GNG KSGE  S+KEY    N G G E+S+ 
Sbjct: 301 NPGGSGYKSGEEPSAKEYNGN---------GNGYKSGEEPSAKEYNGNGNNG-GGEASKG 360

Query: 361 YSGNAYSGKAGA-YNYPNYGNGEKSGESSKEYVDYRTGGKVVA-YKYPNY-GNGEK-DDK 388
           YSG+  +GK+G  Y+ P+ GN  K GE SK+Y    + GK    Y  P+  GNG     +
Sbjct: 361 YSGSGSNGKSGGEYSNPSTGNDNKGGEPSKDYSGSGSNGKSGGEYSNPSTGGNGNNGGGE 391

BLAST of HG10012224 vs. NCBI nr
Match: TYK31171.1 (keratin, type I cytoskeletal 9-like [Cucumis melo var. makuwa])

HSP 1 Score: 104.0 bits (258), Expect = 3.0e-18
Identity = 143/402 (35.57%), Postives = 182/402 (45.27%), Query Frame = 0

Query: 32  GGYGNPSSLGDSKYSSIEK------NGVGGYDGEYGGGYSPKDYGYGKYKNDYYLGWERY 91
           G Y NPS+ G+      E       +G GG  G  G GYS    G   YK+      + Y
Sbjct: 331 GEYSNPSTGGNGNNGGGEASKGYSGSGNGGQSG--GSGYSNPSTGGNGYKSGEEPSAKEY 390

Query: 92  GKESTDYGYSVPKYDIEKG----KGGDY---YVYGGGKRGG--SSKDYDTYASSCKDGGV 151
                + G    K     G     GG+Y      G G  GG  +SK Y    S+ K GG 
Sbjct: 391 NGNGNNGGGEASKGYSGSGSNGKSGGEYSNPSTGGNGNNGGGETSKGYSGSGSNGKSGGG 450

Query: 152 YKYPSYGDNDKGGESPKKYVDYRNGDKVGESSGEYG----GDNAYNGKTGAYNYPNYENS 211
           Y  PS G+++KGGE  K   DY      G+S GEY     G N  NG  G      Y  S
Sbjct: 451 YSNPSTGNDNKGGEPSK---DYNGSGSNGKSGGEYSNPSTGGNGNNG--GGEASKGYSGS 510

Query: 212 EKSAESSKEYVDYRNGGKGR----KSSEEYSGSAYSGKAGAYDYPN-YGNGEKSGE--SS 271
             + +S  EY +   G  G     ++S+ YSGS  SG++G   Y N  G+G KSGE  S+
Sbjct: 511 GSNGKSGGEYSNPSTGENGNNGGGEASKGYSGSENSGQSGGSGYSNPGGSGYKSGEEPSA 570

Query: 272 KEYVGYRNGGTGRESSEEYSGKVYSDKAG--VYNYPSYGNGEKSGESSKEYV-------- 331
           KEY G  N G G E+S+ YSG   S ++G   Y+ PS GN  K GE SK+Y         
Sbjct: 571 KEYNGNGNNG-GGEASKGYSGSGNSGQSGGSGYSNPSTGNDNKGGEPSKDYSGSGSNGKS 630

Query: 332 --DYRN---GGKGRESSE-----EYSGNAYSGKAGAYNYPNYGNGEKSGESSKEYVDYRT 388
             +Y N   GG G +S E     EY+GN   GK+  YN P+ G  E +G           
Sbjct: 631 GGEYSNPSTGGNGYKSGEEPSTKEYNGNMNGGKSEEYNGPSKGGDENNG----------- 690

BLAST of HG10012224 vs. NCBI nr
Match: XP_022963750.1 (glycine-rich cell wall structural protein 1.8-like isoform X46 [Cucurbita moschata])

HSP 1 Score: 88.2 bits (217), Expect = 1.7e-13
Identity = 144/433 (33.26%), Postives = 188/433 (43.42%), Query Frame = 0

Query: 1   MASHKLSFLVFFLLFGIEISTASRFLSTFGEGGYGNPSSLGDSKYSSIEKNGVGGY---- 60
           MASHKLS LVFFL FGI I +A+R LS+  E GYG+ S  G S Y+S+ + GV  Y    
Sbjct: 3   MASHKLSSLVFFLFFGIGICSAARRLSSSYE-GYGSSSGHGYSSYASVGEYGVENYGNGY 62

Query: 61  --DGEYGGGYSPKDYGYGKYKNDYYLGWERYGKESTDY-GYSVPKYDIEKGK------GG 120
             DG YGG Y   +Y  GKY    Y G +    +S +Y GYS  +++   G+      GG
Sbjct: 63  GKDGAYGGKYG--EYNGGKYGG--YDGGKHERYDSGNYGGYSGGRHEGYDGENYGGYGGG 122

Query: 121 DYYVYGGGKRGGSSK----DYDTYASSCKDGGVY--------------KYPSYGDNDKGG 180
           +Y  YGGGK GG S+    +YD       DGG Y              K+  Y     GG
Sbjct: 123 NYGGYGGGKYGGYSRGKHEEYDKDKHEGYDGGNYGGYSGGKHEEYDKDKHEGYDGGKYGG 182

Query: 181 ESPKKYVDYRNGDKVGESSGEYGGDNAYNGKTGAYNYPNYENSEKSAESSKEYVDYRNGG 240
               K+ +Y      G   G YGG +   G  G Y+   +E  +K      +  +Y  GG
Sbjct: 183 YGGSKHEEYDKDKHEGYDGGNYGGYS--GGNYGGYSRSKHEEYDKDKHEGYDGGNY--GG 242

Query: 241 KGRKSSEEYSGSAYSGKAGAYDYPNYG--NGEKSGESSKE----YVGYRNGGTGRESSEE 300
                 EEY+   + G    YD   YG  +G K  E  K+    Y G + GG      EE
Sbjct: 243 YSGGKHEEYNKDKHDG----YDGGKYGGYSGGKHEEYDKDKHEVYDGGKYGGYSGGKHEE 302

Query: 301 YSGKVYSDKAGVYNYPSYG--NGEKSGESSKE----YVDYRNGGKGRESSEEYSGNAYSG 360
           Y      DK   Y+  +YG  NG K  E  K+    Y     GG      EEY  + + G
Sbjct: 303 YE----KDKHEGYDGGNYGGYNGGKYEEYDKDKHEGYDGGNYGGYSGGKHEEYDKDKHEG 362

Query: 361 KAGAYNYPNYGNGEKSGESSKEYVDYRTGGKVVAY--KYPNYGNGEKDDKSSKRYGGHQE 387
             G   Y  YG G K  E  K+  +   GG    Y  K+  Y  G+ +      +GG+  
Sbjct: 363 YDGG-KYGGYG-GSKHEEYDKDKHEGYDGGNYGGYRSKHEEYDRGKHEGYDGVNHGGYGG 416

BLAST of HG10012224 vs. ExPASy TrEMBL
Match: A0A5A7T532 (Keratin, type I cytoskeletal 9-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold92G001010 PE=4 SV=1)

HSP 1 Score: 120.9 bits (302), Expect = 1.1e-23
Identity = 161/442 (36.43%), Postives = 202/442 (45.70%), Query Frame = 0

Query: 1   MASHKLSF-LVFFLLFGIEISTASRFLSTFGEGGYGNPSSLGDSKYSSIEKNGVGGYDGE 60
           MAS +LSF LVFF+LF IEIS     +++ GE  YGNPS+ G  KY  I KNG+GGY  +
Sbjct: 1   MASPRLSFSLVFFILFAIEIS----LIASEGE-KYGNPSNFGYRKYGFIGKNGLGGYAEK 60

Query: 61  YGGGYSPKDYGYGKYKNDYYLGWERYGKESTDYGYSVPKYDIEKGKGGDYYVYGGGKRGG 120
                         Y NDYY  W R+ K      YS                 G G  GG
Sbjct: 61  -------------NYNNDYY--WGRWRKSDQGSTYS-----------------GNGASGG 120

Query: 121 SSKDYDTYASSC-------------------------KDGGVYKYPSYG--DNDKGGESP 180
           SS +YD + SSC                         K+ G Y  PS G   N+ GGE+ 
Sbjct: 121 SSNEYDDHTSSCKSPSNGSGYKSEEPSKENYGNANNGKNAGEYNNPSTGGNSNNGGGEAS 180

Query: 181 KKYVDYR-NGDKVGESSGEYGGDNAYNGKTGAYNYPNYENSEKSAESSKEYVDYRNGGKG 240
           K Y     NG   GE S    G N  NG  G      Y  SE + +S  EY +   GG G
Sbjct: 181 KGYSGSEGNGKSGGEYSNPSTGGNGNNG--GGEASKGYSGSEGNGKSGGEYSNPSTGGNG 240

Query: 241 R----KSSEEYSGSAYSGKAGAYDYPNYGNGEK----SGESSKEYVGYRN---------- 300
                ++S+ YSGS  +GK+G  +Y N   GE      GE+SK Y G  N          
Sbjct: 241 NNGGGEASKGYSGSGSNGKSGG-EYSNPSTGENGNNGGGEASKGYSGSENSGQSGGSGYS 300

Query: 301 --GGTGRESSEEYSGKVYSDKAGVYNYPSYGNGEKSGE--SSKEYVDYRNGGKGRESSEE 360
             GG+G +S EE S K Y+           GNG KSGE  S+KEY    N G G E+S+ 
Sbjct: 301 NPGGSGYKSGEEPSAKEYNGN---------GNGYKSGEEPSAKEYNGNGNNG-GGEASKG 360

Query: 361 YSGNAYSGKAGA-YNYPNYGNGEKSGESSKEYVDYRTGGKVVA-YKYPNY-GNGEK-DDK 388
           YSG+  +GK+G  Y+ P+ GN  K GE SK+Y    + GK    Y  P+  GNG     +
Sbjct: 361 YSGSGSNGKSGGEYSNPSTGNDNKGGEPSKDYSGSGSNGKSGGEYSNPSTGGNGNNGGGE 391

BLAST of HG10012224 vs. ExPASy TrEMBL
Match: A0A5D3E6D9 (Keratin, type I cytoskeletal 9-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold455G004520 PE=4 SV=1)

HSP 1 Score: 104.0 bits (258), Expect = 1.4e-18
Identity = 143/402 (35.57%), Postives = 182/402 (45.27%), Query Frame = 0

Query: 32  GGYGNPSSLGDSKYSSIEK------NGVGGYDGEYGGGYSPKDYGYGKYKNDYYLGWERY 91
           G Y NPS+ G+      E       +G GG  G  G GYS    G   YK+      + Y
Sbjct: 331 GEYSNPSTGGNGNNGGGEASKGYSGSGNGGQSG--GSGYSNPSTGGNGYKSGEEPSAKEY 390

Query: 92  GKESTDYGYSVPKYDIEKG----KGGDY---YVYGGGKRGG--SSKDYDTYASSCKDGGV 151
                + G    K     G     GG+Y      G G  GG  +SK Y    S+ K GG 
Sbjct: 391 NGNGNNGGGEASKGYSGSGSNGKSGGEYSNPSTGGNGNNGGGETSKGYSGSGSNGKSGGG 450

Query: 152 YKYPSYGDNDKGGESPKKYVDYRNGDKVGESSGEYG----GDNAYNGKTGAYNYPNYENS 211
           Y  PS G+++KGGE  K   DY      G+S GEY     G N  NG  G      Y  S
Sbjct: 451 YSNPSTGNDNKGGEPSK---DYNGSGSNGKSGGEYSNPSTGGNGNNG--GGEASKGYSGS 510

Query: 212 EKSAESSKEYVDYRNGGKGR----KSSEEYSGSAYSGKAGAYDYPN-YGNGEKSGE--SS 271
             + +S  EY +   G  G     ++S+ YSGS  SG++G   Y N  G+G KSGE  S+
Sbjct: 511 GSNGKSGGEYSNPSTGENGNNGGGEASKGYSGSENSGQSGGSGYSNPGGSGYKSGEEPSA 570

Query: 272 KEYVGYRNGGTGRESSEEYSGKVYSDKAG--VYNYPSYGNGEKSGESSKEYV-------- 331
           KEY G  N G G E+S+ YSG   S ++G   Y+ PS GN  K GE SK+Y         
Sbjct: 571 KEYNGNGNNG-GGEASKGYSGSGNSGQSGGSGYSNPSTGNDNKGGEPSKDYSGSGSNGKS 630

Query: 332 --DYRN---GGKGRESSE-----EYSGNAYSGKAGAYNYPNYGNGEKSGESSKEYVDYRT 388
             +Y N   GG G +S E     EY+GN   GK+  YN P+ G  E +G           
Sbjct: 631 GGEYSNPSTGGNGYKSGEEPSTKEYNGNMNGGKSEEYNGPSKGGDENNG----------- 690

BLAST of HG10012224 vs. ExPASy TrEMBL
Match: A0A6J1HIW2 (glycine-rich cell wall structural protein 1.8-like isoform X46 OS=Cucurbita moschata OX=3662 GN=LOC111463951 PE=4 SV=1)

HSP 1 Score: 88.2 bits (217), Expect = 8.2e-14
Identity = 144/433 (33.26%), Postives = 188/433 (43.42%), Query Frame = 0

Query: 1   MASHKLSFLVFFLLFGIEISTASRFLSTFGEGGYGNPSSLGDSKYSSIEKNGVGGY---- 60
           MASHKLS LVFFL FGI I +A+R LS+  E GYG+ S  G S Y+S+ + GV  Y    
Sbjct: 3   MASHKLSSLVFFLFFGIGICSAARRLSSSYE-GYGSSSGHGYSSYASVGEYGVENYGNGY 62

Query: 61  --DGEYGGGYSPKDYGYGKYKNDYYLGWERYGKESTDY-GYSVPKYDIEKGK------GG 120
             DG YGG Y   +Y  GKY    Y G +    +S +Y GYS  +++   G+      GG
Sbjct: 63  GKDGAYGGKYG--EYNGGKYGG--YDGGKHERYDSGNYGGYSGGRHEGYDGENYGGYGGG 122

Query: 121 DYYVYGGGKRGGSSK----DYDTYASSCKDGGVY--------------KYPSYGDNDKGG 180
           +Y  YGGGK GG S+    +YD       DGG Y              K+  Y     GG
Sbjct: 123 NYGGYGGGKYGGYSRGKHEEYDKDKHEGYDGGNYGGYSGGKHEEYDKDKHEGYDGGKYGG 182

Query: 181 ESPKKYVDYRNGDKVGESSGEYGGDNAYNGKTGAYNYPNYENSEKSAESSKEYVDYRNGG 240
               K+ +Y      G   G YGG +   G  G Y+   +E  +K      +  +Y  GG
Sbjct: 183 YGGSKHEEYDKDKHEGYDGGNYGGYS--GGNYGGYSRSKHEEYDKDKHEGYDGGNY--GG 242

Query: 241 KGRKSSEEYSGSAYSGKAGAYDYPNYG--NGEKSGESSKE----YVGYRNGGTGRESSEE 300
                 EEY+   + G    YD   YG  +G K  E  K+    Y G + GG      EE
Sbjct: 243 YSGGKHEEYNKDKHDG----YDGGKYGGYSGGKHEEYDKDKHEVYDGGKYGGYSGGKHEE 302

Query: 301 YSGKVYSDKAGVYNYPSYG--NGEKSGESSKE----YVDYRNGGKGRESSEEYSGNAYSG 360
           Y      DK   Y+  +YG  NG K  E  K+    Y     GG      EEY  + + G
Sbjct: 303 YE----KDKHEGYDGGNYGGYNGGKYEEYDKDKHEGYDGGNYGGYSGGKHEEYDKDKHEG 362

Query: 361 KAGAYNYPNYGNGEKSGESSKEYVDYRTGGKVVAY--KYPNYGNGEKDDKSSKRYGGHQE 387
             G   Y  YG G K  E  K+  +   GG    Y  K+  Y  G+ +      +GG+  
Sbjct: 363 YDGG-KYGGYG-GSKHEEYDKDKHEGYDGGNYGGYRSKHEEYDRGKHEGYDGVNHGGYGG 416

BLAST of HG10012224 vs. ExPASy TrEMBL
Match: A0A6J1HIT1 (uncharacterized PE-PGRS family protein PE_PGRS54-like isoform X18 OS=Cucurbita moschata OX=3662 GN=LOC111463951 PE=4 SV=1)

HSP 1 Score: 87.0 bits (214), Expect = 1.8e-13
Identity = 139/436 (31.88%), Postives = 186/436 (42.66%), Query Frame = 0

Query: 1   MASHKLSFLVFFLLFGIEISTASRFLSTFGEGGYGNPSSLGDSKYSSIEKNGVGGY---- 60
           MASHKLS LVFFL FGI I +A+R LS+  E GYG+ S  G S Y+S+ + GV  Y    
Sbjct: 3   MASHKLSSLVFFLFFGIGICSAARRLSSSYE-GYGSSSGHGYSSYASVGEYGVENYGNGY 62

Query: 61  --DGEYGGGYSPKDYGYGKYKNDYYLGWERYGKESTDY-GYSVPKYDIEKGKGGDYYVYG 120
             DG YGG Y   +Y  GKY    Y G +    +S +Y GYS  ++  E   G +Y  YG
Sbjct: 63  GKDGAYGGKYG--EYNGGKYGG--YDGGKHERYDSGNYGGYSGGRH--EGYDGENYGGYG 122

Query: 121 GGKR----GGSSKDYDTYASSCKDGGVY--------------KYPSYGDNDKGGESPKKY 180
           GGK     GG  K Y        DGG Y              K+  Y   + GG    KY
Sbjct: 123 GGKHEEYGGGKHKGY--------DGGNYGGYGFGKHEEYDKDKHEGYNGGNYGGYGGGKY 182

Query: 181 VDYRNGDKVGESSGEYGGDNAYNGKTGAYNYPNYENSEKSAESSKEYVDYR--NGGK--- 240
            +Y  G+  G   G +  +    GK   Y+   +E  +K      +  +YR  +GGK   
Sbjct: 183 EEYDRGNYGGFGGGRH--EEYDRGKHEEYDKGKHEEYDKDKHEGYDRGNYRGYDGGKHEE 242

Query: 241 -GRKSSEEYSGSAYSGKAGA----YD---YPNYGNGEKSGESSKEYVGY---RNGGTGRE 300
            GR   E Y G  Y G  G     YD   +  Y  G+  G     Y GY   + GG G  
Sbjct: 243 YGRGKHEGYDGGNYGGYGGGKHDEYDRGKHEEYDKGKHEGYEGGNYGGYGGGKYGGYGGS 302

Query: 301 SSEEYSGKVYSDKAGVYNYPSYGNGEKSGESSKEYVDYRNGGKGRESSEEYSGNAYSGKA 360
             EEY    +    G  NY  Y  G   G S  ++ +Y      ++  E Y G  Y G +
Sbjct: 303 KHEEYDKDKHEGYDG-GNYGGYSGGNYGGYSRSKHEEY-----DKDKHEGYDGGNYGGYS 362

Query: 361 GAYNYPNYGNGEKSGESSKEYVDYRTGGKVVAY-----------KYPNYGNGEKDDKSSK 385
           G   +  Y   +  G    +Y  Y +GGK   Y           KY  Y  G+ ++    
Sbjct: 363 GG-KHEEYNKDKHDGYDGGKYGGY-SGGKHEEYDKDKHEVYDGGKYGGYSGGKHEEYEKD 412

BLAST of HG10012224 vs. ExPASy TrEMBL
Match: A0A6J1HL17 (glycine-rich cell wall structural protein 1.8-like isoform X22 OS=Cucurbita moschata OX=3662 GN=LOC111463951 PE=4 SV=1)

HSP 1 Score: 86.7 bits (213), Expect = 2.4e-13
Identity = 140/432 (32.41%), Postives = 188/432 (43.52%), Query Frame = 0

Query: 1   MASHKLSFLVFFLLFGIEISTASRFLSTFGEGGYGNPSSLGDSKYSSIEKNGVGGY---- 60
           MASHKLS LVFFL FGI I +A+R LS+  E GYG+ S  G S Y+S+ + GV  Y    
Sbjct: 3   MASHKLSSLVFFLFFGIGICSAARRLSSSYE-GYGSSSGHGYSSYASVGEYGVENYGNGY 62

Query: 61  --DGEYGGGYSPKDYGYGKYKNDYYLGWERYGKESTDY-GYSVPKYDIEKGKGGDYYVYG 120
             DG YGG Y   +Y  GKY    Y G +    +S +Y GYS  ++  E   G +Y  YG
Sbjct: 63  GKDGAYGGKYG--EYNGGKYGG--YDGGKHERYDSGNYGGYSGGRH--EGYDGENYGGYG 122

Query: 121 GGKR----GGSSKDYDTYASSCKDGGVY--------------KYPSYGDNDKGGESPKKY 180
           GGK     GG  K Y        DGG Y              K+  Y   + GG    KY
Sbjct: 123 GGKHEEYGGGKHKGY--------DGGNYGGYGFGKHEEYDKDKHEGYNGGNYGGYGGGKY 182

Query: 181 VDYRNGDKVGESSGEYGGDNAYNGKTGAYNYPNYENSEKSAESSKEYVDYR--NGGK--- 240
            +Y  G+  G   G +  +    GK   Y+   +E  +K      +  +YR  +GGK   
Sbjct: 183 EEYDRGNYGGFGGGRH--EEYDRGKHEEYDKGKHEEYDKDKHEGYDRGNYRGYDGGKHEE 242

Query: 241 -GRKSSEEYSGSAYSGKAGAYDYPNYG--NGEKSGESSKE----YVGYRNGGTGRESSEE 300
            GR   EEY    + G    YD  NYG  +G K  E  K+    Y G + GG G    EE
Sbjct: 243 YGRGKHEEYDKDKHEG----YDGGNYGGYSGGKHEEYDKDKHEGYDGGKYGGYGGSKHEE 302

Query: 301 YSGKVYSDKAGVYNYPSYGNGEKSGESSKEYVDYRNGGKGRESSEEYSGNAYSGKAGAYN 360
           Y    +    G  NY  Y  G   G S  ++ +Y      ++  E Y G  Y G +G   
Sbjct: 303 YDKDKHEGYDG-GNYGGYSGGNYGGYSRSKHEEY-----DKDKHEGYDGGNYGGYSGG-K 362

Query: 361 YPNYGNGEKSGESSKEYVDYRTGGKVVAY-----------KYPNYGNGEKDDKSSKRYGG 385
           +  Y   +  G    +Y  Y +GGK   Y           KY  Y  G+ ++    ++ G
Sbjct: 363 HEEYNKDKHDGYDGGKYGGY-SGGKHEEYDKDKHEVYDGGKYGGYSGGKHEEYEKDKHEG 404

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038887039.14.3e-4149.14shematrin-like protein 2 [Benincasa hispida][more]
XP_038887038.13.8e-2949.51heterogeneous nuclear ribonucleoprotein A3-like [Benincasa hispida][more]
KAA0038572.12.4e-2336.43keratin, type I cytoskeletal 9-like [Cucumis melo var. makuwa][more]
TYK31171.13.0e-1835.57keratin, type I cytoskeletal 9-like [Cucumis melo var. makuwa][more]
XP_022963750.11.7e-1333.26glycine-rich cell wall structural protein 1.8-like isoform X46 [Cucurbita moscha... [more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A5A7T5321.1e-2336.43Keratin, type I cytoskeletal 9-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6... [more]
A0A5D3E6D91.4e-1835.57Keratin, type I cytoskeletal 9-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5... [more]
A0A6J1HIW28.2e-1433.26glycine-rich cell wall structural protein 1.8-like isoform X46 OS=Cucurbita mosc... [more]
A0A6J1HIT11.8e-1331.88uncharacterized PE-PGRS family protein PE_PGRS54-like isoform X18 OS=Cucurbita m... [more]
A0A6J1HL172.4e-1332.41glycine-rich cell wall structural protein 1.8-like isoform X22 OS=Cucurbita mosc... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 147..164
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 194..213
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 139..387
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 289..305

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10012224.1HG10012224.1mRNA