HG10018445 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10018445
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: TIP41-like protein
Location: Chr04: 4229722 .. 4231288 (-)
RNA-Seq Expression: HG10018445
Synteny: HG10018445
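
The Location field packs the chromosome, start and end coordinates, and strand into a single string. The following is a minimal Python sketch of splitting it into its parts. The helper name `parse_location` and the regular expression are illustrative assumptions, not part of the database's own tooling, and the length calculation assumes the coordinates are 1-based and inclusive.

```python
import re

# Hypothetical pattern for strings such as "Chr04: 4229722 .. 4231288 (-)".
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text):
    """Split a location string into chromosome, start, end, strand and span length."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumes inclusive coordinates, so both end points are counted.
        "length": end - start + 1,
    }


loc = parse_location("Chr04: 4229722 .. 4231288 (-)")
print(loc)
```

Under the inclusive-coordinate assumption, the span on Chr04 works out to 1567 bases on the minus strand.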
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCAACTCCCTTTGAACAACCCCTCTCTTTGACCTTCCCTACTGAGTTTCCTTACGATTTTGACTCCTTTCTCTCTAACTCCGACCTCAACTCCCCCGTCGAATCGCTTGGGAGTTCTACTACGGATTCCACTGATAGCTGTGGCAGCGATGATGATGACTTCTTTGTGGGTTTGGCTCAGCAACTTGCATGGACCTCCCTTTGTGAAACTGAGAAATCTGTATCTCCTTCCTTTAATCCAAACGAATTTGAGGTACCCTTTTCATGAAATCTTCACTTCTCCCTATCTTTTATGTGTTTGAATTAGTTTGTTTCTGATTTGATTTTTTTTGGTTTTTTTTAGAAGAAGTATGTTAAAGCTGGCTCTCCTCAGTCGACTTTAAGTGGAATAGACGCCTGGTTTCGCCCTGAAAGTCCTTCCTCTCAGTTGCAATCGCCTCCCATGGCGGTTTTTGGGGCTGAGAATGATGCTAGAGCCCTTCTACATGCAGCTGCTAGAGAGGCGGCGAGGTTGAAGATGAGCGGCGAAACAACCCTGTTTCACAACAATGACCCTTTTATGAGAGGATTTGTGGGTGCTCGATCTTCCATTCCATTTAAATCTGCAAACAATGTGGATTATGGGCTTTACTCGAATCAAAACTGTGCTCGGAATCTGGCATTTGCCGCGCAGGTAAATTACTTCCAACTTCCAAATGTCCAAATCCTTCCATCTGTTGAATATCTCAGAACAGAGTATTTTAACAGTTGGGTACTCTGTTTAAGTTATGGGTCCTCTTTCATTCCTGAGGTTTTTCTGTTTTGTTGTTCTATTAAGTTATGGGTCCTCTTTCATTCCTGAGGTTTTTCTGTTTTGTTGTTCTACAGGTGCAGCAAGTGAAACACGATCTTGTATTACAGGCGATTCGTGCCTCTTCTTGGGGCGGAAGACAAGCTAAAGTCAGCTGGTCTGCTCAGCCACACTGGAAGCCGGAGATTCAAAACAGAGAGAGATATGTTGTTAATGCAAGTCGCAACGGCGGATTGTACCCTTCCCCATGGCTTCCGCCGCCCCAAATTCAGCAACCACCGTCCAACGCCTCCGTCATGCGTTGCATTCATCCCAGCAGATCTGGCGTGAAAAGAGCTTCCTCCTGCACCGGCGTTTTCTTGCCTCGCAGATATGTAAACCCTTCAGAATGCCGCCAGAAACAAGGTATTTCAGAACCTCCCTTCCTTCATCTGAAGAATCACCTTTGTTTCTTTAATTAAGCTTCATAATCATCATTGTCATGCCTAATTAGGAAGCCCAGCCGTTCGATTTCCAGAGGAGATGAAAAGCCCCATTCAAGCCCCATTAAACGGTTGCCTAGCACCTGGCTCCGGTAAGCAAGAATTTCCCCAAAATTTTCACCACATGGCCCAAAATTAAGATTGTTGTAATGGGTTTCCTTGATGGACTTTTACAAATCACAGATTCCATTTTATCTCGAAGAAATAATCCCCTTCTGCCATCGCCAAGGATTTTCCGGACAGATGGAGCCATGAATCAAGAACATCACCTACCTCAGGAATGGACATACTGA

mRNA sequence

ATGGCAACTCCCTTTGAACAACCCCTCTCTTTGACCTTCCCTACTGAGTTTCCTTACGATTTTGACTCCTTTCTCTCTAACTCCGACCTCAACTCCCCCGTCGAATCGCTTGGGAGTTCTACTACGGATTCCACTGATAGCTGTGGCAGCGATGATGATGACTTCTTTGTGGGTTTGGCTCAGCAACTTGCATGGACCTCCCTTTGTGAAACTGAGAAATCTGTATCTCCTTCCTTTAATCCAAACGAATTTGAGAAGAAGTATGTTAAAGCTGGCTCTCCTCAGTCGACTTTAAGTGGAATAGACGCCTGGTTTCGCCCTGAAAGTCCTTCCTCTCAGTTGCAATCGCCTCCCATGGCGGTTTTTGGGGCTGAGAATGATGCTAGAGCCCTTCTACATGCAGCTGCTAGAGAGGCGGCGAGGTTGAAGATGAGCGGCGAAACAACCCTGTTTCACAACAATGACCCTTTTATGAGAGGATTTGTGGGTGCTCGATCTTCCATTCCATTTAAATCTGCAAACAATGTGGATTATGGGCTTTACTCGAATCAAAACTGTGCTCGGAATCTGGCATTTGCCGCGCAGGTGCAGCAAGTGAAACACGATCTTGTATTACAGGCGATTCGTGCCTCTTCTTGGGGCGGAAGACAAGCTAAAGTCAGCTGGTCTGCTCAGCCACACTGGAAGCCGGAGATTCAAAACAGAGAGAGATATGTTGTTAATGCAAGTCGCAACGGCGGATTGTACCCTTCCCCATGGCTTCCGCCGCCCCAAATTCAGCAACCACCGTCCAACGCCTCCGTCATGCGTTGCATTCATCCCAGCAGATCTGGCGTGAAAAGAGCTTCCTCCTGCACCGGCGTTTTCTTGCCTCGCAGATATGTAAACCCTTCAGAATGCCGCCAGAAACAAGGAAGCCCAGCCGTTCGATTTCCAGAGGAGATGAAAAGCCCCATTCAAGCCCCATTAAACGGTTGCCTAGCACCTGGCTCCGATTCCATTTTATCTCGAAGAAATAATCCCCTTCTGCCATCGCCAAGGATTTTCCGGACAGATGGAGCCATGAATCAAGAACATCACCTACCTCAGGAATGGACATACTGA

Coding sequence (CDS)

ATGGCAACTCCCTTTGAACAACCCCTCTCTTTGACCTTCCCTACTGAGTTTCCTTACGATTTTGACTCCTTTCTCTCTAACTCCGACCTCAACTCCCCCGTCGAATCGCTTGGGAGTTCTACTACGGATTCCACTGATAGCTGTGGCAGCGATGATGATGACTTCTTTGTGGGTTTGGCTCAGCAACTTGCATGGACCTCCCTTTGTGAAACTGAGAAATCTGTATCTCCTTCCTTTAATCCAAACGAATTTGAGAAGAAGTATGTTAAAGCTGGCTCTCCTCAGTCGACTTTAAGTGGAATAGACGCCTGGTTTCGCCCTGAAAGTCCTTCCTCTCAGTTGCAATCGCCTCCCATGGCGGTTTTTGGGGCTGAGAATGATGCTAGAGCCCTTCTACATGCAGCTGCTAGAGAGGCGGCGAGGTTGAAGATGAGCGGCGAAACAACCCTGTTTCACAACAATGACCCTTTTATGAGAGGATTTGTGGGTGCTCGATCTTCCATTCCATTTAAATCTGCAAACAATGTGGATTATGGGCTTTACTCGAATCAAAACTGTGCTCGGAATCTGGCATTTGCCGCGCAGGTGCAGCAAGTGAAACACGATCTTGTATTACAGGCGATTCGTGCCTCTTCTTGGGGCGGAAGACAAGCTAAAGTCAGCTGGTCTGCTCAGCCACACTGGAAGCCGGAGATTCAAAACAGAGAGAGATATGTTGTTAATGCAAGTCGCAACGGCGGATTGTACCCTTCCCCATGGCTTCCGCCGCCCCAAATTCAGCAACCACCGTCCAACGCCTCCGTCATGCGTTGCATTCATCCCAGCAGATCTGGCGTGAAAAGAGCTTCCTCCTGCACCGGCGTTTTCTTGCCTCGCAGATATGTAAACCCTTCAGAATGCCGCCAGAAACAAGGAAGCCCAGCCGTTCGATTTCCAGAGGAGATGAAAAGCCCCATTCAAGCCCCATTAAACGGTTGCCTAGCACCTGGCTCCGATTCCATTTTATCTCGAAGAAATAATCCCCTTCTGCCATCGCCAAGGATTTTCCGGACAGATGGAGCCATGAATCAAGAACATCACCTACCTCAGGAATGGACATACTGA

Protein sequence

MATPFEQPLSLTFPTEFPYDFDSFLSNSDLNSPVESLGSSTTDSTDSCGSDDDDFFVGLAQQLAWTSLCETEKSVSPSFNPNEFEKKYVKAGSPQSTLSGIDAWFRPESPSSQLQSPPMAVFGAENDARALLHAAAREAARLKMSGETTLFHNNDPFMRGFVGARSSIPFKSANNVDYGLYSNQNCARNLAFAAQVQQVKHDLVLQAIRASSWGGRQAKVSWSAQPHWKPEIQNRERYVVNASRNGGLYPSPWLPPPQIQQPPSNASVMRCIHPSRSGVKRASSCTGVFLPRRYVNPSECRQKQGSPAVRFPEEMKSPIQAPLNGCLAPGSDSILSRRNNPLLPSPRIFRTDGAMNQEHHLPQEWTY
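
The protein sequence corresponds to the CDS read codon by codon. As a rough check, the sketch below translates the first seven and the last five codons of the CDS listed above using the standard genetic code. The function name `translate` and the compact table construction are illustrative choices, and the use of the standard nuclear code is an assumption rather than something stated on this page.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGCAACTCCCTTTGAACAA"  # first 21 nt of the CDS above
cds_end = "GAATGGACATACTGA"          # last 15 nt of the CDS above
print(translate(cds_start))  # MATPFEQ
print(translate(cds_end))    # EWTY
```

Both fragments agree with the start (MATPFEQ) and end (EWTY) of the protein sequence shown above. Passing the full CDS string to `translate` would be the natural next step.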
Homology
BLAST of HG10018445 vs. NCBI nr
Match: XP_038884388.1 (uncharacterized protein LOC120075245 isoform X1 [Benincasa hispida])

HSP 1 Score: 629.8 bits (1623), Expect = 1.5e-176
Identity = 317/372 (85.22%), Positives = 332/372 (89.25%), Query Frame = 0

Query: 1   MATPFEQPLSLTFPTEFPYDFDSFLSNSDLNSPVESLGSSTTDSTDSCGSDDDDFFVGLA 60
           MATP     SL+FPTEFPYDFDSF SNSDLNSPVES+GSS TDSTDS GSDDDDFFVGLA
Sbjct: 1   MATP-----SLSFPTEFPYDFDSFFSNSDLNSPVESVGSSVTDSTDSSGSDDDDFFVGLA 60

Query: 61  QQLAWTSLCETEKSVSPSFNPNEFEKKYVKAGSPQSTLSGIDAWFRPESPSSQLQSPPMA 120
           QQLAWTSLCETEKS SPSFNP  FEK YVKAGSPQSTL+GID WFRPESPSSQLQSPPMA
Sbjct: 61  QQLAWTSLCETEKSTSPSFNPKNFEKMYVKAGSPQSTLTGIDTWFRPESPSSQLQSPPMA 120

Query: 121 VFGAENDARALLHAAAREAARLKMSGETTLFHNNDPFMRGFVGARSSIPFKSANNVDYGL 180
           VFGAENDARALLHAAAREAARLKMSGETT F NNDPFMR +VGARSSIP KS NNVDYG+
Sbjct: 121 VFGAENDARALLHAAAREAARLKMSGETTPFRNNDPFMREYVGARSSIPVKSTNNVDYGV 180

Query: 181 YSNQNCARNLAFAAQVQQVKHDLVLQAIRASSWGGRQAKVSWSAQPHWKPEIQNRERYVV 240
           +SNQNCARNLAFAAQ+QQVK DLVLQA+RASSWGGRQAKVSWSAQPHWK EIQ+RER V+
Sbjct: 181 FSNQNCARNLAFAAQMQQVKQDLVLQALRASSWGGRQAKVSWSAQPHWKAEIQSRERNVL 240

Query: 241 NAS-----RNGGLYPSPWLPPPQIQQPPSNASVMRCIHPSRSGVKRASSCTGVFLPRRYV 300
           NAS       GGLY SPWLPPPQ QQPPSN SVMRCIHP RSGVKRASS TGVFLPRRY+
Sbjct: 241 NASGRCGGNTGGLYHSPWLPPPQNQQPPSNPSVMRCIHPGRSGVKRASSGTGVFLPRRYI 300

Query: 301 NPSECRQKQGSPAVRFPEEMKSPIQAPLNGCLAPGSDSILSRRNNPLLPSPRIFRTDGAM 360
           +PSECRQKQGSPAVRF EEMKSPIQAPLNG L+P  DS+LSRRNNPLLP PR FRT+GAM
Sbjct: 301 SPSECRQKQGSPAVRFAEEMKSPIQAPLNGWLSPSIDSMLSRRNNPLLPLPRSFRTEGAM 360

Query: 361 NQEHHLPQEWTY 368
           NQE HLPQEWTY
Sbjct: 361 NQELHLPQEWTY 367

BLAST of HG10018445 vs. NCBI nr
Match: XP_038884389.1 (uncharacterized protein LOC120075245 isoform X2 [Benincasa hispida])

HSP 1 Score: 623.6 bits (1607), Expect = 1.1e-174
Identity = 316/372 (84.95%), Positives = 331/372 (88.98%), Query Frame = 0

Query: 1   MATPFEQPLSLTFPTEFPYDFDSFLSNSDLNSPVESLGSSTTDSTDSCGSDDDDFFVGLA 60
           MATP     SL+FPTEFPYDFDSF SNSDLNSPVES+GSS TDSTDS GSDDDDFFVGLA
Sbjct: 1   MATP-----SLSFPTEFPYDFDSFFSNSDLNSPVESVGSSVTDSTDSSGSDDDDFFVGLA 60

Query: 61  QQLAWTSLCETEKSVSPSFNPNEFEKKYVKAGSPQSTLSGIDAWFRPESPSSQLQSPPMA 120
           QQLAWTSLCETEKS SPSFNP  FE  YVKAGSPQSTL+GID WFRPESPSSQLQSPPMA
Sbjct: 61  QQLAWTSLCETEKSTSPSFNPKNFE-MYVKAGSPQSTLTGIDTWFRPESPSSQLQSPPMA 120

Query: 121 VFGAENDARALLHAAAREAARLKMSGETTLFHNNDPFMRGFVGARSSIPFKSANNVDYGL 180
           VFGAENDARALLHAAAREAARLKMSGETT F NNDPFMR +VGARSSIP KS NNVDYG+
Sbjct: 121 VFGAENDARALLHAAAREAARLKMSGETTPFRNNDPFMREYVGARSSIPVKSTNNVDYGV 180

Query: 181 YSNQNCARNLAFAAQVQQVKHDLVLQAIRASSWGGRQAKVSWSAQPHWKPEIQNRERYVV 240
           +SNQNCARNLAFAAQ+QQVK DLVLQA+RASSWGGRQAKVSWSAQPHWK EIQ+RER V+
Sbjct: 181 FSNQNCARNLAFAAQMQQVKQDLVLQALRASSWGGRQAKVSWSAQPHWKAEIQSRERNVL 240

Query: 241 NAS-----RNGGLYPSPWLPPPQIQQPPSNASVMRCIHPSRSGVKRASSCTGVFLPRRYV 300
           NAS       GGLY SPWLPPPQ QQPPSN SVMRCIHP RSGVKRASS TGVFLPRRY+
Sbjct: 241 NASGRCGGNTGGLYHSPWLPPPQNQQPPSNPSVMRCIHPGRSGVKRASSGTGVFLPRRYI 300

Query: 301 NPSECRQKQGSPAVRFPEEMKSPIQAPLNGCLAPGSDSILSRRNNPLLPSPRIFRTDGAM 360
           +PSECRQKQGSPAVRF EEMKSPIQAPLNG L+P  DS+LSRRNNPLLP PR FRT+GAM
Sbjct: 301 SPSECRQKQGSPAVRFAEEMKSPIQAPLNGWLSPSIDSMLSRRNNPLLPLPRSFRTEGAM 360

Query: 361 NQEHHLPQEWTY 368
           NQE HLPQEWTY
Sbjct: 361 NQELHLPQEWTY 366

BLAST of HG10018445 vs. NCBI nr
Match: XP_008441420.1 (PREDICTED: uncharacterized protein LOC103485542 [Cucumis melo] >KAA0041485.1 uncharacterized protein E6C27_scaffold6G00720 [Cucumis melo var. makuwa] >TYK24358.1 uncharacterized protein E5676_scaffold205G001390 [Cucumis melo var. makuwa])

HSP 1 Score: 595.9 bits (1535), Expect = 2.4e-166
Identity = 304/364 (83.52%), Positives = 319/364 (87.64%), Query Frame = 0

Query: 10  SLTFPTEFPYDFDSFLSNSDLNSPVESLGSSTTDSTDSCGSDDDDFFVGLAQQLAWTSLC 69
           SLTFPTEFPYDFDSFLSNSDLNSPVES+GSSTTDSTDSCGSDDD+FFVGLAQQLAWTSLC
Sbjct: 5   SLTFPTEFPYDFDSFLSNSDLNSPVESVGSSTTDSTDSCGSDDDEFFVGLAQQLAWTSLC 64

Query: 70  ETEKSVSPSFNPNEFEKKYVKAGSPQSTLSGIDAWFRPESPSSQLQSPPMAVFGAENDAR 129
           E E         N FEKKYVKAGSPQSTLSGID WFRPESPSSQL+SPPMAVFGAENDAR
Sbjct: 65  EAE---------NTFEKKYVKAGSPQSTLSGIDTWFRPESPSSQLKSPPMAVFGAENDAR 124

Query: 130 ALLHAAAREAARLKMSGETTLFHNNDPFMRGFVGARSSIPFKSANNVDYGLYSNQNCARN 189
           A+LHAAAREAARLKMSGETT F N DPFMRGFVGARSSIP KS NNVDYG++SNQN ARN
Sbjct: 125 AILHAAAREAARLKMSGETTPFQNIDPFMRGFVGARSSIPVKSTNNVDYGVFSNQNSARN 184

Query: 190 LAFAAQVQQVKHDLVLQAIRASSWGGRQAKVSWSAQPHWKPEIQNRERYVVNASRN---- 249
           LAFAAQVQQVK DLVLQA+RASS GGRQAKVSWSAQPHWK EIQNRER VVNAS      
Sbjct: 185 LAFAAQVQQVKQDLVLQALRASSLGGRQAKVSWSAQPHWKQEIQNRERNVVNASGRCGVG 244

Query: 250 -GGLYPSPWLPPPQIQQPPSNASVMRCIHPSRSGVKRASSCTGVFLPRRYVNPSECRQKQ 309
            GGLY SPWLPP Q QQP  N +V+RCIHP RSGVKRASS TGVFLPRRY+NP++CRQKQ
Sbjct: 245 AGGLYHSPWLPPLQNQQPTPNTTVVRCIHPVRSGVKRASSGTGVFLPRRYINPTDCRQKQ 304

Query: 310 GSPAVRFPEEMKSPIQAPLNGCLAPGSDSILSRRNNPLLPSPRIFRTDGAMNQE-HHLPQ 368
           G P+VRF EEMKSPIQAPLNGCL+PG D ILSRRNNPLLP PR FRT+G MNQE HHLPQ
Sbjct: 305 GIPSVRFAEEMKSPIQAPLNGCLSPGFDPILSRRNNPLLPLPRSFRTEGGMNQEHHHLPQ 359

BLAST of HG10018445 vs. NCBI nr
Match: XP_004138440.2 (uncharacterized protein LOC101208139 [Cucumis sativus] >KGN45759.1 hypothetical protein Csa_005113 [Cucumis sativus])

HSP 1 Score: 583.9 bits (1504), Expect = 9.3e-163
Identity = 298/364 (81.87%), Positives = 315/364 (86.54%), Query Frame = 0

Query: 10  SLTFPTEFPYDFDSFLSNSDLNSPVESLGSSTTDSTDSCGSDDDDFFVGLAQQLAWTSLC 69
           +LTFPTEFPYDFDSFLSN DLNSPVES+GSSTTDSTDSCGSDDD+FFVGLAQ+LAWTSLC
Sbjct: 5   TLTFPTEFPYDFDSFLSNCDLNSPVESVGSSTTDSTDSCGSDDDEFFVGLAQKLAWTSLC 64

Query: 70  ETEKSVSPSFNPNEFEKKYVKAGSPQSTLSGIDAWFRPESPSSQLQSPPMAVFGAENDAR 129
           E E         N FEKKYVKAGSPQSTLSGID WFRPESPSSQL+SPPMAVFGAENDAR
Sbjct: 65  EAE---------NTFEKKYVKAGSPQSTLSGIDTWFRPESPSSQLKSPPMAVFGAENDAR 124

Query: 130 ALLHAAAREAARLKMSGETTLFHNNDPFMRGFVGARSSIPFKSANNVDYGLYSNQNCARN 189
           A+LHAAAREAA+LKMSGETT F NNDPFMRGFVGARSS+P KS NNVDYG++S QN ARN
Sbjct: 125 AILHAAAREAAKLKMSGETTPFQNNDPFMRGFVGARSSVPVKSTNNVDYGVFSTQNSARN 184

Query: 190 LAFAAQVQQVKHDLVLQAIRASSWGGRQAKVSWSAQPHWKPEIQNRERYVVNAS-----R 249
           LAFAAQVQQVK DLVLQA+RASS   RQAK SWSAQPHWK EIQNRER VVNAS      
Sbjct: 185 LAFAAQVQQVKQDLVLQALRASSLRERQAKASWSAQPHWKQEIQNRERNVVNASGRCGGG 244

Query: 250 NGGLYPSPWLPPPQIQQPPSNASVMRCIHPSRSGVKRASSCTGVFLPRRYVNPSECRQKQ 309
            GGLY SPWLPP Q QQP SN +V+RCIHP RSGVKRASS TGVFLPRRY+NPSECRQKQ
Sbjct: 245 TGGLYHSPWLPPLQNQQPTSNPTVVRCIHPVRSGVKRASSGTGVFLPRRYINPSECRQKQ 304

Query: 310 GSPAVRFPEEMKSPIQAPLNGCLAPGSDSILSRRNNPLLPSPRIFRTDGAMNQE-HHLPQ 368
           G P+VRF EEMKSPIQAPLNGC +PG D ILSRRNNPLLP PR FRT+G MNQE HHLPQ
Sbjct: 305 GIPSVRFVEEMKSPIQAPLNGCHSPGFDPILSRRNNPLLPLPRSFRTEGVMNQEHHHLPQ 359

BLAST of HG10018445 vs. NCBI nr
Match: XP_022134429.1 (uncharacterized protein LOC111006679 [Momordica charantia])

HSP 1 Score: 537.7 bits (1384), Expect = 7.7e-149
Identity = 274/362 (75.69%), Positives = 305/362 (84.25%), Query Frame = 0

Query: 10  SLTFPTEFPYDFDSFLSNSDLNSPVESLGSSTTDSTDSCGSDDDDFFVGLAQQLAWTSLC 69
           +L+FPT+FPY+FDS  SN DLNSPVES+ SS T+STDS  SDDDDFFVGLA+QLAWT LC
Sbjct: 18  TLSFPTDFPYEFDSLASNFDLNSPVESVVSS-TESTDS--SDDDDFFVGLARQLAWTYLC 77

Query: 70  ETEKSVSPSFNPNEFEKKYVKAGSPQSTLSGIDAWFRPESPSSQLQSPPMAVFGAENDAR 129
           ETE S +P FNPN+FEKKYVKAGSPQSTLSGIDAWFRP SPSSQL+SPP+AVFGAENDAR
Sbjct: 78  ETESSPTPCFNPNKFEKKYVKAGSPQSTLSGIDAWFRPHSPSSQLKSPPIAVFGAENDAR 137

Query: 130 ALLHAAAREAARLKMSGETTLFHNNDPFMRGFVGARSSIPFKSANNVDYGLYSNQNCARN 189
            L+HAAAREAARLKMS ETT F +NDPF+RGF+GARSSIP KS +NVDYGL+SN+ CARN
Sbjct: 138 VLVHAAAREAARLKMSAETTPFQSNDPFVRGFMGARSSIPVKSTSNVDYGLFSNEGCARN 197

Query: 190 LAFAAQVQQVKHDLVLQAIRASSWGGRQAKVSWSAQPHWKPEIQNRERYV----VNASRN 249
           LAF+AQVQQV+HDLVLQAI ASSW GRQAKV W+A PH KPEIQNRER +       S  
Sbjct: 198 LAFSAQVQQVRHDLVLQAICASSW-GRQAKVDWAAPPHRKPEIQNRERNIGLGGGRCSGA 257

Query: 250 GGLYPSPWLPPPQIQQPPSNASVMRCIHPSRSGVKRASSCTGVFLPRRYVNPSECRQKQG 309
            GLY S WLPPPQ Q PP NAS +RCIHP    VKRASS TGVFLPRRYVNPSECRQKQG
Sbjct: 258 AGLYQSAWLPPPQSQPPPPNASAVRCIHPGGPAVKRASSGTGVFLPRRYVNPSECRQKQG 317

Query: 310 SPAVRFPEEMKSPIQAPLNGCLAPGSDSILSRRNNPLLPSPRIFRTDGAMNQEHHLPQEW 368
           +PAVRFPEEM +PIQAP NGCL+PG D++L+RRN  LLP PR  R + A+NQE HLPQEW
Sbjct: 318 TPAVRFPEEMINPIQAPFNGCLSPGFDAMLARRNTSLLPLPRSLRGEAAINQELHLPQEW 375

BLAST of HG10018445 vs. ExPASy TrEMBL
Match: A0A5A7TJ83 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold205G001390 PE=4 SV=1)

HSP 1 Score: 595.9 bits (1535), Expect = 1.1e-166
Identity = 304/364 (83.52%), Positives = 319/364 (87.64%), Query Frame = 0

Query: 10  SLTFPTEFPYDFDSFLSNSDLNSPVESLGSSTTDSTDSCGSDDDDFFVGLAQQLAWTSLC 69
           SLTFPTEFPYDFDSFLSNSDLNSPVES+GSSTTDSTDSCGSDDD+FFVGLAQQLAWTSLC
Sbjct: 5   SLTFPTEFPYDFDSFLSNSDLNSPVESVGSSTTDSTDSCGSDDDEFFVGLAQQLAWTSLC 64

Query: 70  ETEKSVSPSFNPNEFEKKYVKAGSPQSTLSGIDAWFRPESPSSQLQSPPMAVFGAENDAR 129
           E E         N FEKKYVKAGSPQSTLSGID WFRPESPSSQL+SPPMAVFGAENDAR
Sbjct: 65  EAE---------NTFEKKYVKAGSPQSTLSGIDTWFRPESPSSQLKSPPMAVFGAENDAR 124

Query: 130 ALLHAAAREAARLKMSGETTLFHNNDPFMRGFVGARSSIPFKSANNVDYGLYSNQNCARN 189
           A+LHAAAREAARLKMSGETT F N DPFMRGFVGARSSIP KS NNVDYG++SNQN ARN
Sbjct: 125 AILHAAAREAARLKMSGETTPFQNIDPFMRGFVGARSSIPVKSTNNVDYGVFSNQNSARN 184

Query: 190 LAFAAQVQQVKHDLVLQAIRASSWGGRQAKVSWSAQPHWKPEIQNRERYVVNASRN---- 249
           LAFAAQVQQVK DLVLQA+RASS GGRQAKVSWSAQPHWK EIQNRER VVNAS      
Sbjct: 185 LAFAAQVQQVKQDLVLQALRASSLGGRQAKVSWSAQPHWKQEIQNRERNVVNASGRCGVG 244

Query: 250 -GGLYPSPWLPPPQIQQPPSNASVMRCIHPSRSGVKRASSCTGVFLPRRYVNPSECRQKQ 309
            GGLY SPWLPP Q QQP  N +V+RCIHP RSGVKRASS TGVFLPRRY+NP++CRQKQ
Sbjct: 245 AGGLYHSPWLPPLQNQQPTPNTTVVRCIHPVRSGVKRASSGTGVFLPRRYINPTDCRQKQ 304

Query: 310 GSPAVRFPEEMKSPIQAPLNGCLAPGSDSILSRRNNPLLPSPRIFRTDGAMNQE-HHLPQ 368
           G P+VRF EEMKSPIQAPLNGCL+PG D ILSRRNNPLLP PR FRT+G MNQE HHLPQ
Sbjct: 305 GIPSVRFAEEMKSPIQAPLNGCLSPGFDPILSRRNNPLLPLPRSFRTEGGMNQEHHHLPQ 359

BLAST of HG10018445 vs. ExPASy TrEMBL
Match: A0A1S3B425 (uncharacterized protein LOC103485542 OS=Cucumis melo OX=3656 GN=LOC103485542 PE=4 SV=1)

HSP 1 Score: 595.9 bits (1535), Expect = 1.1e-166
Identity = 304/364 (83.52%), Positives = 319/364 (87.64%), Query Frame = 0

Query: 10  SLTFPTEFPYDFDSFLSNSDLNSPVESLGSSTTDSTDSCGSDDDDFFVGLAQQLAWTSLC 69
           SLTFPTEFPYDFDSFLSNSDLNSPVES+GSSTTDSTDSCGSDDD+FFVGLAQQLAWTSLC
Sbjct: 5   SLTFPTEFPYDFDSFLSNSDLNSPVESVGSSTTDSTDSCGSDDDEFFVGLAQQLAWTSLC 64

Query: 70  ETEKSVSPSFNPNEFEKKYVKAGSPQSTLSGIDAWFRPESPSSQLQSPPMAVFGAENDAR 129
           E E         N FEKKYVKAGSPQSTLSGID WFRPESPSSQL+SPPMAVFGAENDAR
Sbjct: 65  EAE---------NTFEKKYVKAGSPQSTLSGIDTWFRPESPSSQLKSPPMAVFGAENDAR 124

Query: 130 ALLHAAAREAARLKMSGETTLFHNNDPFMRGFVGARSSIPFKSANNVDYGLYSNQNCARN 189
           A+LHAAAREAARLKMSGETT F N DPFMRGFVGARSSIP KS NNVDYG++SNQN ARN
Sbjct: 125 AILHAAAREAARLKMSGETTPFQNIDPFMRGFVGARSSIPVKSTNNVDYGVFSNQNSARN 184

Query: 190 LAFAAQVQQVKHDLVLQAIRASSWGGRQAKVSWSAQPHWKPEIQNRERYVVNASRN---- 249
           LAFAAQVQQVK DLVLQA+RASS GGRQAKVSWSAQPHWK EIQNRER VVNAS      
Sbjct: 185 LAFAAQVQQVKQDLVLQALRASSLGGRQAKVSWSAQPHWKQEIQNRERNVVNASGRCGVG 244

Query: 250 -GGLYPSPWLPPPQIQQPPSNASVMRCIHPSRSGVKRASSCTGVFLPRRYVNPSECRQKQ 309
            GGLY SPWLPP Q QQP  N +V+RCIHP RSGVKRASS TGVFLPRRY+NP++CRQKQ
Sbjct: 245 AGGLYHSPWLPPLQNQQPTPNTTVVRCIHPVRSGVKRASSGTGVFLPRRYINPTDCRQKQ 304

Query: 310 GSPAVRFPEEMKSPIQAPLNGCLAPGSDSILSRRNNPLLPSPRIFRTDGAMNQE-HHLPQ 368
           G P+VRF EEMKSPIQAPLNGCL+PG D ILSRRNNPLLP PR FRT+G MNQE HHLPQ
Sbjct: 305 GIPSVRFAEEMKSPIQAPLNGCLSPGFDPILSRRNNPLLPLPRSFRTEGGMNQEHHHLPQ 359

BLAST of HG10018445 vs. ExPASy TrEMBL
Match: A0A0A0KA94 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G009380 PE=4 SV=1)

HSP 1 Score: 583.9 bits (1504), Expect = 4.5e-163
Identity = 298/364 (81.87%), Positives = 315/364 (86.54%), Query Frame = 0

Query: 10  SLTFPTEFPYDFDSFLSNSDLNSPVESLGSSTTDSTDSCGSDDDDFFVGLAQQLAWTSLC 69
           +LTFPTEFPYDFDSFLSN DLNSPVES+GSSTTDSTDSCGSDDD+FFVGLAQ+LAWTSLC
Sbjct: 5   TLTFPTEFPYDFDSFLSNCDLNSPVESVGSSTTDSTDSCGSDDDEFFVGLAQKLAWTSLC 64

Query: 70  ETEKSVSPSFNPNEFEKKYVKAGSPQSTLSGIDAWFRPESPSSQLQSPPMAVFGAENDAR 129
           E E         N FEKKYVKAGSPQSTLSGID WFRPESPSSQL+SPPMAVFGAENDAR
Sbjct: 65  EAE---------NTFEKKYVKAGSPQSTLSGIDTWFRPESPSSQLKSPPMAVFGAENDAR 124

Query: 130 ALLHAAAREAARLKMSGETTLFHNNDPFMRGFVGARSSIPFKSANNVDYGLYSNQNCARN 189
           A+LHAAAREAA+LKMSGETT F NNDPFMRGFVGARSS+P KS NNVDYG++S QN ARN
Sbjct: 125 AILHAAAREAAKLKMSGETTPFQNNDPFMRGFVGARSSVPVKSTNNVDYGVFSTQNSARN 184

Query: 190 LAFAAQVQQVKHDLVLQAIRASSWGGRQAKVSWSAQPHWKPEIQNRERYVVNAS-----R 249
           LAFAAQVQQVK DLVLQA+RASS   RQAK SWSAQPHWK EIQNRER VVNAS      
Sbjct: 185 LAFAAQVQQVKQDLVLQALRASSLRERQAKASWSAQPHWKQEIQNRERNVVNASGRCGGG 244

Query: 250 NGGLYPSPWLPPPQIQQPPSNASVMRCIHPSRSGVKRASSCTGVFLPRRYVNPSECRQKQ 309
            GGLY SPWLPP Q QQP SN +V+RCIHP RSGVKRASS TGVFLPRRY+NPSECRQKQ
Sbjct: 245 TGGLYHSPWLPPLQNQQPTSNPTVVRCIHPVRSGVKRASSGTGVFLPRRYINPSECRQKQ 304

Query: 310 GSPAVRFPEEMKSPIQAPLNGCLAPGSDSILSRRNNPLLPSPRIFRTDGAMNQE-HHLPQ 368
           G P+VRF EEMKSPIQAPLNGC +PG D ILSRRNNPLLP PR FRT+G MNQE HHLPQ
Sbjct: 305 GIPSVRFVEEMKSPIQAPLNGCHSPGFDPILSRRNNPLLPLPRSFRTEGVMNQEHHHLPQ 359

BLAST of HG10018445 vs. ExPASy TrEMBL
Match: A0A6J1BYQ6 (uncharacterized protein LOC111006679 OS=Momordica charantia OX=3673 GN=LOC111006679 PE=4 SV=1)

HSP 1 Score: 537.7 bits (1384), Expect = 3.7e-149
Identity = 274/362 (75.69%), Positives = 305/362 (84.25%), Query Frame = 0

Query: 10  SLTFPTEFPYDFDSFLSNSDLNSPVESLGSSTTDSTDSCGSDDDDFFVGLAQQLAWTSLC 69
           +L+FPT+FPY+FDS  SN DLNSPVES+ SS T+STDS  SDDDDFFVGLA+QLAWT LC
Sbjct: 18  TLSFPTDFPYEFDSLASNFDLNSPVESVVSS-TESTDS--SDDDDFFVGLARQLAWTYLC 77

Query: 70  ETEKSVSPSFNPNEFEKKYVKAGSPQSTLSGIDAWFRPESPSSQLQSPPMAVFGAENDAR 129
           ETE S +P FNPN+FEKKYVKAGSPQSTLSGIDAWFRP SPSSQL+SPP+AVFGAENDAR
Sbjct: 78  ETESSPTPCFNPNKFEKKYVKAGSPQSTLSGIDAWFRPHSPSSQLKSPPIAVFGAENDAR 137

Query: 130 ALLHAAAREAARLKMSGETTLFHNNDPFMRGFVGARSSIPFKSANNVDYGLYSNQNCARN 189
            L+HAAAREAARLKMS ETT F +NDPF+RGF+GARSSIP KS +NVDYGL+SN+ CARN
Sbjct: 138 VLVHAAAREAARLKMSAETTPFQSNDPFVRGFMGARSSIPVKSTSNVDYGLFSNEGCARN 197

Query: 190 LAFAAQVQQVKHDLVLQAIRASSWGGRQAKVSWSAQPHWKPEIQNRERYV----VNASRN 249
           LAF+AQVQQV+HDLVLQAI ASSW GRQAKV W+A PH KPEIQNRER +       S  
Sbjct: 198 LAFSAQVQQVRHDLVLQAICASSW-GRQAKVDWAAPPHRKPEIQNRERNIGLGGGRCSGA 257

Query: 250 GGLYPSPWLPPPQIQQPPSNASVMRCIHPSRSGVKRASSCTGVFLPRRYVNPSECRQKQG 309
            GLY S WLPPPQ Q PP NAS +RCIHP    VKRASS TGVFLPRRYVNPSECRQKQG
Sbjct: 258 AGLYQSAWLPPPQSQPPPPNASAVRCIHPGGPAVKRASSGTGVFLPRRYVNPSECRQKQG 317

Query: 310 SPAVRFPEEMKSPIQAPLNGCLAPGSDSILSRRNNPLLPSPRIFRTDGAMNQEHHLPQEW 368
           +PAVRFPEEM +PIQAP NGCL+PG D++L+RRN  LLP PR  R + A+NQE HLPQEW
Sbjct: 318 TPAVRFPEEMINPIQAPFNGCLSPGFDAMLARRNTSLLPLPRSLRGEAAINQELHLPQEW 375

BLAST of HG10018445 vs. ExPASy TrEMBL
Match: A0A6J1JZ67 (uncharacterized protein LOC111489193 OS=Cucurbita maxima OX=3661 GN=LOC111489193 PE=4 SV=1)

HSP 1 Score: 498.0 bits (1281), Expect = 3.3e-137
Identity = 265/374 (70.86%), Positives = 295/374 (78.88%), Query Frame = 0

Query: 1   MATPFEQPLSLTFPTEFPYDFDSFLSNSDLNSPVESLGSSTTDSTDSCGSDDDDFFVGLA 60
           MATP + P   TFPTEFPY+ DSF S SDLNSP+ES  SS TDSTDS GSDDDDFF GLA
Sbjct: 1   MATPLD-PSLFTFPTEFPYESDSFASFSDLNSPLESAVSS-TDSTDSSGSDDDDFFEGLA 60

Query: 61  QQLAWTSLCETEKSVSPSFNPNEFEKKYVKAGSPQSTLSGIDAWFRPESPSSQLQSPPMA 120
            Q AWTSL ET+KS SPS   N FE KYVK GSPQSTL+GID WFR E+P SQ+QSPP+A
Sbjct: 61  HQFAWTSLSETDKSTSPSSKANNFENKYVKTGSPQSTLAGIDTWFRSETP-SQVQSPPLA 120

Query: 121 VFGAENDARALLHAAAREAARLKMSGETTLFHNNDPFMRGFVGARSSIPFKSANNVDYGL 180
           + GA NDARAL+HAAA EA R +MS  TTLFH N PF+RGF+GARSS+P +S N +DYGL
Sbjct: 121 LSGAVNDARALVHAAAIEATRFQMSRGTTLFHTNVPFVRGFLGARSSVPVESVNELDYGL 180

Query: 181 YSNQNCARNLAFAAQVQQVKHDLVLQAIRASSWGGRQAKVSWSAQPHWKPEIQNRERYVV 240
           +SN+NC RNLAFAAQ QQVK DLVLQA+ ASSW GRQAKV WSAQPHWKPE +NRE   V
Sbjct: 181 FSNRNCDRNLAFAAQAQQVKRDLVLQALSASSW-GRQAKVGWSAQPHWKPEFRNREGTFV 240

Query: 241 NA-----SRNGGLYPSPWLPPPQIQQPPSNAS-VMRCIHPSRSGVKRASSCTGVFLPRRY 300
           +A       +G  Y SPWLPPP+ QQP  NAS   RCIHP RSGVKRASS TGVFLPR+Y
Sbjct: 241 DARGRCSGGSGDFYHSPWLPPPRHQQPTPNASAATRCIHPGRSGVKRASSGTGVFLPRKY 300

Query: 301 VNPSECRQKQGSPAVRFPEEMKSPIQAPLNGCLAPGSDSILSRRNNPLLPSPRIFRTDGA 360
           VNPSE  QK+GSPAVRFPEEM+SPIQAPLNG L PG +SI SRRN P+LP PR FR + A
Sbjct: 301 VNPSESFQKKGSPAVRFPEEMRSPIQAPLNGFLWPGFNSISSRRNKPVLPLPRSFRGEVA 360

Query: 361 MNQEH-HLPQEWTY 368
           +NQE  HLPQEWTY
Sbjct: 361 INQEQLHLPQEWTY 370

BLAST of HG10018445 vs. TAIR 10
Match: AT3G55690.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G39870.1); Has 76 Blast hits to 69 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 3; Plants - 69; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 88.6 bits (218), Expect = 1.1e-17
Identity = 104/365 (28.49%), Positives = 150/365 (41.10%), Query Frame = 0

Query: 5   FEQPL-SLTFPTEFPYDFDSFLSNSDLNSPVESLGSSTTDSTDSCGSDDDDFFVGLAQQL 64
           FEQP+  LTFP EFPY+   F S++   SP +S     T++ D    D+DDF  GL ++L
Sbjct: 21  FEQPMEKLTFPNEFPYE---FASSTFSTSPEDS-----TETEDETTDDEDDFLAGLTRRL 80

Query: 65  AWTSLCETEKSVSPSFNPNEFEKKYVKAGSPQSTLSGIDAWFRPESPSSQLQSPPMAVFG 124
           A      T++  SPSF     +K  +K    +ST SG+ +   P  P SQ+ SPP +   
Sbjct: 81  A----LSTQRLSSPSF---VTDKSQMKPKVTESTQSGLGS---PNGPFSQVPSPPTSP-S 140

Query: 125 AENDARALLHAAAREAARLKMSGETTLFHNNDPFMRGFVGARSSIPFKSANNVDYGLYSN 184
            E D+  +L AAA E A++K +       N D     +     +       NV Y     
Sbjct: 141 REEDSLKVLSAAAGEVAKIKKA-------NFDAKPISYPNPNPNYLTSFPQNVAY----- 200

Query: 185 QNCARNLAFAAQVQQVKHDLVLQAIRASSWGGRQAKVSWSAQPHWKPEIQNRERYVVNAS 244
            NC                                   W  +PH+ P+ Q          
Sbjct: 201 YNC----------------------------------YWLWEPHY-PQSQM--------- 260

Query: 245 RNGGLYPSPWLPPPQIQQPPSNASVMRCIHPSRSGVKRASSCTGVFLPRRYVNPSEC-RQ 304
              G+ P+ W  PP         S +R  +   + VK  S+ TGVFLPR+Y NPS+  ++
Sbjct: 261 ---GIVPNAWHIPP---------SPVRAFYTLPTAVKSPSTGTGVFLPRKYSNPSDSPKK 293

Query: 305 KQGSPAVRFPEEMKSPIQAPLNGCLAPGSDSILSRRNNPLLPSPRIFRTDGAMNQEHHLP 364
           K G   V+   + K  I+     C  P S + LS   + +          G +  E  L 
Sbjct: 321 KSGDGCVKVVNQQKPKIEVLPVRC-KPNSKAGLSTGRSKI----DYVAGGGCLKHEKPLL 293

Query: 365 QEWTY 368
           QEW Y
Sbjct: 381 QEWMY 293

BLAST of HG10018445 vs. TAIR 10
Match: AT2G39870.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G55690.1); Has 73 Blast hits to 71 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 2; Fungi - 2; Plants - 69; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 86.3 bits (212), Expect = 5.7e-17
Identity = 98/325 (30.15%), Positives = 133/325 (40.92%), Query Frame = 0

Query: 8   PLSLTFPTEFPYDFDSFLSNSDLNSPVESLGSSTTDSTDSCGSDDDDFFVGLAQQLAWTS 67
           P  L FP EFPY+FDS   +    SP +S     T++ D    D++DF  GL ++LA   
Sbjct: 26  PTRLGFPNEFPYEFDSPSFSPGFTSPGDS-----TETEDESSDDEEDFLAGLTRRLA--- 85

Query: 68  LCETEKSVSPSFNPNEFEKKYVKAGSPQSTLSGIDAWFRPESPSSQLQSPPMAVFGAEND 127
              T++  SP F     EK+ V A SPQSTLSG+ ++    S S  L SPP        D
Sbjct: 86  -PSTQRLPSPLFKSE--EKRQVAATSPQSTLSGLGSFSNSGSRSPILPSPPAPTSSFRRD 145

Query: 128 -ARALLHAAAREAARLKMSGETTLFHNNDPFMRGFVGARSSIPFKSANNVDYGLYSNQNC 187
            A  ++ AAA E ARLK+ G     H               +P ++  +    L   QN 
Sbjct: 146 NAWDVISAAAGEVARLKL-GSYEPHH---------------LPLQTPES----LLRRQNA 205

Query: 188 ARNLAFAAQVQQVKHDLVLQAIRASSWGGRQAKVSWSAQPHWKPEIQNRERYVVNASRNG 247
           A           +  +L  Q +    W         SAQ  +K       R VVN     
Sbjct: 206 A-----------IHAELQHQRLIEQMW-------LCSAQSRFKLSENRIPRRVVNEE--- 265

Query: 248 GLYPSP---------WLPPPQIQQPPSNASVMRCIHPSRSGVKRASSCTGVFLPRRYVNP 307
           GL+ +P         WLPP Q   P                +KR S+ TGVFLPRRY   
Sbjct: 266 GLFENPRYVRRNNPTWLPPQQAAAP----------------LKRPSAGTGVFLPRRY--- 270

Query: 308 SECRQKQGSPAVRFPEEMKSPIQAP 323
                    P+    + +K+P+  P
Sbjct: 326 ---------PSAAPSDSLKTPVNTP 270
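
The percentages in each HSP header follow directly from the residue counts: matches divided by alignment length. The sketch below recomputes them from a header line. The function name `hsp_percentages` and the regular expression are illustrative assumptions about this page's text layout, not part of BLAST itself; the pattern accepts either spelling of the "Positives" label.

```python
import re

# Hypothetical pattern for lines such as
# "Identity = 317/372 (85.22%), Positives = 332/372 (89.25%), Query Frame = 0".
HSP_RE = re.compile(r"Identity = (\d+)/(\d+).*?Pos\w* = (\d+)/(\d+)")


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the residue counts."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return (
        round(100 * ident / ident_len, 2),
        round(100 * pos / pos_len, 2),
    )


best_hit = hsp_percentages(
    "Identity = 317/372 (85.22%), Positives = 332/372 (89.25%), Query Frame = 0"
)
print(best_hit)  # (85.22, 89.25)
```

The recomputed values match the figures reported for the top NCBI nr hit, XP_038884388.1.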

The following BLAST results are available for this feature:
BLAST vs. NCBI nr
Match Name        E-value     Identity (%)   Description
XP_038884388.1    1.5e-176    85.22          uncharacterized protein LOC120075245 isoform X1 [Benincasa hispida]
XP_038884389.1    1.1e-174    84.95          uncharacterized protein LOC120075245 isoform X2 [Benincasa hispida]
XP_008441420.1    2.4e-166    83.52          PREDICTED: uncharacterized protein LOC103485542 [Cucumis melo] >KAA0041485.1 unc...
XP_004138440.2    9.3e-163    81.87          uncharacterized protein LOC101208139 [Cucumis sativus] >KGN45759.1 hypothetical ...
XP_022134429.1    7.7e-149    75.69          uncharacterized protein LOC111006679 [Momordica charantia]

BLAST vs. ExPASy TrEMBL
Match Name        E-value     Identity (%)   Description
A0A5A7TJ83        1.1e-166    83.52          Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3B425        1.1e-166    83.52          uncharacterized protein LOC103485542 OS=Cucumis melo OX=3656 GN=LOC103485542 PE=...
A0A0A0KA94        4.5e-163    81.87          Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G009380 PE=4 SV=1
A0A6J1BYQ6        3.7e-149    75.69          uncharacterized protein LOC111006679 OS=Momordica charantia OX=3673 GN=LOC111006...
A0A6J1JZ67        3.3e-137    70.86          uncharacterized protein LOC111489193 OS=Cucurbita maxima OX=3661 GN=LOC111489193...

BLAST vs. TAIR 10
Match Name        E-value     Identity (%)   Description
AT3G55690.1       1.1e-17     28.49          unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G39870.1       5.7e-17     30.15          unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description    Source    Source Term      Source Description                                           Alignment
None       No IPR available   PANTHER   PTHR33356:SF17   H/ACA RIBONUCLEOPROTEIN COMPLEX NON-CORE SUBUNIT NAF1-LIKE   coord: 6..367
None       No IPR available   PANTHER   PTHR33356        TIP41-LIKE PROTEIN                                           coord: 6..367

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
HG10018445.1    HG10018445.1    mRNA