HG10012226 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10012226
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: proline-, glutamic acid- and leucine-rich protein 1-like isoform X1
Location: Chr01: 19058614 .. 19059505 (-)
RNA-Seq Expression: HG10012226
Synteny: HG10012226
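The location string can be turned into a feature length with a few lines of Python. This is a minimal sketch that assumes the coordinates are 1-based and inclusive, as is common for GFF-style annotation; the record itself does not state the convention, and the function names `parse_location` and `feature_length` are hypothetical helpers, not part of any portal API.

```python
import re

def parse_location(text):
    """Split a location such as 'Chr01: 19058614 .. 19059505 (-)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def feature_length(start, end):
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1

chrom, start, end, strand = parse_location("Chr01: 19058614 .. 19059505 (-)")
print(chrom, strand, feature_length(start, end))
```

Under that assumption the gene spans 892 bp on the minus strand of Chr01.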
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGCAATATAATTCAAGAATCTTCAGAACCCCAAAACCAAGAAGAATCTTTCGATCCTTTCCGTTTCTCCACCCTTTGTCTCAACTCCTCCGCCGTCGACCCTCCACTCTGTTCTTCGTGCGCTCGCCGTCAACCTCGCCTCGCATCCACTCCCATGAAACGCCCTTCCCCCACGCCGTCGCAACACCCCTCCACCACCACCTCCAAGAAGCAATTTCTTGATCATCAACAACCCAATTCCACCCCTTTCTCCAAGATCGATCTCCCCATTCCTTTTGATCATTCTGTTTCCCCTCTCCGCCGCTCTTTTTCCGACCCCACCGAAGCCCTGAATTTCTCCCCTCAGTCCCCTGCAAAACGGTTATGTCTCAACTCACCCCTGCCGCCTCTGCCTCTCCGGCGTACTGTCTCTGACCCAAATCCGTCCCCTGAAAATACTTTCGATTCCCCAATTAAAATTGGGAAATCCAACGATTTGATCATAGAAGACAACCCCGAATCAAAGGTTTGTTGATAATTCACCTGTTGTTGTTTATTCTTGTGGGTAATTTAATTGATAAAAATTTTTCTTAGAGACTTAGAAGGATCAAGGATCGATTGAAGGAGATGAATCAGTGGTGGAACGAAGTGATAAGTGAAGAAGAACACGATGAAGTTAATACAAAAAAGGTATGATTTATTTCATTTCGTTGTCAATTTTATTGATGAGAAATGTTGTAAATTGGGAATTTGAAATGCAGAGAGATTGTTGCAAGGAAGAAGAAGATGATGAAGAAACAGTGGGAGTGGAAAGAGTGGGAGATTCATTGGTGCTACGTTTAAAGTGTTCATGTGGGAAAGGATTTGAGATTCTTCTTTCTGGGAGAAGCTGTTTCTACAAGCTGCTGTAG

mRNA sequence

ATGAGCAATATAATTCAAGAATCTTCAGAACCCCAAAACCAAGAAGAATCTTTCGATCCTTTCCGTTTCTCCACCCTTTGTCTCAACTCCTCCGCCGTCGACCCTCCACTCTGTTCTTCGTGCGCTCGCCGTCAACCTCGCCTCGCATCCACTCCCATGAAACGCCCTTCCCCCACGCCGTCGCAACACCCCTCCACCACCACCTCCAAGAAGCAATTTCTTGATCATCAACAACCCAATTCCACCCCTTTCTCCAAGATCGATCTCCCCATTCCTTTTGATCATTCTGTTTCCCCTCTCCGCCGCTCTTTTTCCGACCCCACCGAAGCCCTGAATTTCTCCCCTCAGTCCCCTGCAAAACGGTTATGTCTCAACTCACCCCTGCCGCCTCTGCCTCTCCGGCGTACTGTCTCTGACCCAAATCCGTCCCCTGAAAATACTTTCGATTCCCCAATTAAAATTGGGAAATCCAACGATTTGATCATAGAAGACAACCCCGAATCAAAGAGACTTAGAAGGATCAAGGATCGATTGAAGGAGATGAATCAGTGGTGGAACGAAGTGATAAGTGAAGAAGAACACGATGAAGTTAATACAAAAAAGAGAGATTGTTGCAAGGAAGAAGAAGATGATGAAGAAACAGTGGGAGTGGAAAGAGTGGGAGATTCATTGGTGCTACGTTTAAAGTGTTCATGTGGGAAAGGATTTGAGATTCTTCTTTCTGGGAGAAGCTGTTTCTACAAGCTGCTGTAG

Coding sequence (CDS)

ATGAGCAATATAATTCAAGAATCTTCAGAACCCCAAAACCAAGAAGAATCTTTCGATCCTTTCCGTTTCTCCACCCTTTGTCTCAACTCCTCCGCCGTCGACCCTCCACTCTGTTCTTCGTGCGCTCGCCGTCAACCTCGCCTCGCATCCACTCCCATGAAACGCCCTTCCCCCACGCCGTCGCAACACCCCTCCACCACCACCTCCAAGAAGCAATTTCTTGATCATCAACAACCCAATTCCACCCCTTTCTCCAAGATCGATCTCCCCATTCCTTTTGATCATTCTGTTTCCCCTCTCCGCCGCTCTTTTTCCGACCCCACCGAAGCCCTGAATTTCTCCCCTCAGTCCCCTGCAAAACGGTTATGTCTCAACTCACCCCTGCCGCCTCTGCCTCTCCGGCGTACTGTCTCTGACCCAAATCCGTCCCCTGAAAATACTTTCGATTCCCCAATTAAAATTGGGAAATCCAACGATTTGATCATAGAAGACAACCCCGAATCAAAGAGACTTAGAAGGATCAAGGATCGATTGAAGGAGATGAATCAGTGGTGGAACGAAGTGATAAGTGAAGAAGAACACGATGAAGTTAATACAAAAAAGAGAGATTGTTGCAAGGAAGAAGAAGATGATGAAGAAACAGTGGGAGTGGAAAGAGTGGGAGATTCATTGGTGCTACGTTTAAAGTGTTCATGTGGGAAAGGATTTGAGATTCTTCTTTCTGGGAGAAGCTGTTTCTACAAGCTGCTGTAG

Protein sequence

MSNIIQESSEPQNQEESFDPFRFSTLCLNSSAVDPPLCSSCARRQPRLASTPMKRPSPTPSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHSVSPLRRSFSDPTEALNFSPQSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRLRRIKDRLKEMNQWWNEVISEEEHDEVNTKKRDCCKEEEDDEETVGVERVGDSLVLRLKCSCGKGFEILLSGRSCFYKLL
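The relationship between the CDS and the protein sequence can be checked by translating the CDS with the standard genetic code. The sketch below is illustrative only: `translate` is a hypothetical helper, the codon table is the standard nuclear code (assumed to apply to this gene), and only the first and last few codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons and last four codons of the CDS shown above.
print(translate("ATGAGCAATATAATTCAAGAATCT"))
print(translate("AAGCTGCTGTAG"))
```

The two fragments translate to `MSNIIQES` and `KLL`, matching the start and end of the protein sequence listed above.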
Homology
BLAST of HG10012226 vs. NCBI nr
Match: XP_038888901.1 (uncharacterized protein LOC120078676 [Benincasa hispida])

HSP 1 Score: 376.3 bits (965), Expect = 2.0e-100
Identity = 209/257 (81.32%), Positives = 216/257 (84.05%), Query Frame = 0

Query: 1   MSNIIQESSEPQNQEESFDPF--RFSTLCLNSSAVDPPLCSSCARRQPRLASTPMKRPSP 60
           MSN+IQESSEPQN EE FDPF  RFSTLCLN SAVDP LCSSCARR PR A+TPMKRP+P
Sbjct: 1   MSNLIQESSEPQNPEEPFDPFHSRFSTLCLNPSAVDPSLCSSCARRHPRSAATPMKRPTP 60

Query: 61  T-PSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHSVSPLRRSFSDPTEALNFSP-- 120
           T P QHP    SK  FLDHQQP+ST FSKIDLPIPFD SV PLRRS SDPTEA NFSP  
Sbjct: 61  TPPQQHP----SKNLFLDHQQPDST-FSKIDLPIPFDPSVFPLRRSVSDPTEARNFSPTP 120

Query: 121 --QSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRLRR 180
             QSPAKRLCLNSPLPPLPLRRTVSDPNPSPE T DSPIKIGK       DNPESKRLRR
Sbjct: 121 VIQSPAKRLCLNSPLPPLPLRRTVSDPNPSPEKTSDSPIKIGK-------DNPESKRLRR 180

Query: 181 IKDRLKEMNQWWNEVISEEEHDEVNTKKRDCCKEEEDDEETVGVERVGDSLVLRLKCSCG 240
           IKDRLKEMNQWWNEV+SEE+ DE  TKK DC KEEE+DEETVGVERVGDSL L LKCSCG
Sbjct: 181 IKDRLKEMNQWWNEVMSEEQ-DENETKKSDCLKEEEEDEETVGVERVGDSLALHLKCSCG 240

Query: 241 KGFEILLSGRSCFYKLL 251
           KGFEILLSGRSCFYKLL
Sbjct: 241 KGFEILLSGRSCFYKLL 244
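The percentages in an HSP summary line follow directly from the match counts and the alignment length (gaps included). As a minimal sketch, the hypothetical helper `hsp_percentages` below parses such a line and recomputes both values; the regular expression accepts both "Positives" and the "Postives" spelling and is an assumption about this report layout, not a general BLAST parser.

```python
import re

def hsp_percentages(line):
    """Recompute identity and positives percentages from an HSP summary line."""
    ident = re.search(r"Identity\s*=\s*(\d+)/(\d+)", line)
    posit = re.search(r"Posi?tives\s*=\s*(\d+)/(\d+)", line)
    if ident is None or posit is None:
        raise ValueError("line does not look like an HSP summary")
    id_n, id_len = map(int, ident.groups())
    po_n, po_len = map(int, posit.groups())
    return round(100 * id_n / id_len, 2), round(100 * po_n / po_len, 2)

line = "Identity = 209/257 (81.32%), Positives = 216/257 (84.05%), Query Frame = 0"
print(hsp_percentages(line))
```

For the Benincasa hispida hit this reproduces the reported 81.32% identity and 84.05% positives over 257 aligned columns.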

BLAST of HG10012226 vs. NCBI nr
Match: XP_022995232.1 (proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 319.7 bits (818), Expect = 2.2e-83
Identity = 180/265 (67.92%), Positives = 200/265 (75.47%), Query Frame = 0

Query: 1   MSNIIQESSEPQNQEESFDPFRFSTLCLNSSAVD---PPLCSSCARRQPRLASTPMKRPS 60
           MSN+IQES+EPQN E+     RFSTLCLN        PPLCSSC RR PR A+T  KR S
Sbjct: 1   MSNLIQESAEPQNPEQQHFDSRFSTLCLNPGGTHHRRPPLCSSCGRRPPRCAATHKKRRS 60

Query: 61  PTPSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHS------VSPLRRSFSDPTEAL 120
           PT  Q P+ TT KK  LD +Q N T FSKIDLPIPF  S       SPL RS SDPTEA 
Sbjct: 61  PTQIQDPAATT-KKHLLDPKQHNLTSFSKIDLPIPFGPSSAHPTPFSPLSRSVSDPTEAR 120

Query: 121 NFSPQSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRL 180
           NFSP SPAKRLC NS LPPLPLRRTVSDP PS E T +SP+ IG+ ND I ED+P+SKRL
Sbjct: 121 NFSPPSPAKRLCPNSALPPLPLRRTVSDPTPSAERTSESPLTIGRVNDSIKEDSPDSKRL 180

Query: 181 RRIKDRLKEMNQWWNEVISEEEH-----DEVNTKKR-DCCKEEEDDEETVGVERVGDSLV 240
           R+IK+RLKEMN+WWNEV+SE+EH     DE  TKK+ +CCK+EED+EETVGVERVGDSL 
Sbjct: 181 RKIKNRLKEMNEWWNEVMSEQEHEEEKRDENETKKKVECCKDEEDEEETVGVERVGDSLE 240

Query: 241 LRLKCSCGKGFEILLSGRSCFYKLL 251
           LRLKC CGKGFEILLSG SCFYKLL
Sbjct: 241 LRLKCPCGKGFEILLSGTSCFYKLL 264

BLAST of HG10012226 vs. NCBI nr
Match: XP_022995233.1 (proline-, glutamic acid- and leucine-rich protein 1-like isoform X2 [Cucurbita maxima])

HSP 1 Score: 318.5 bits (815), Expect = 5.0e-83
Identity = 180/264 (68.18%), Positives = 198/264 (75.00%), Query Frame = 0

Query: 1   MSNIIQESSEPQNQEESFDPFRFSTLCLNSSAVD---PPLCSSCARRQPRLASTPMKRPS 60
           MSN+IQES+EPQN E+     RFSTLCLN        PPLCSSC RR PR A+T  KR S
Sbjct: 1   MSNLIQESAEPQNPEQQHFDSRFSTLCLNPGGTHHRRPPLCSSCGRRPPRCAATHKKRRS 60

Query: 61  PTPSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHS------VSPLRRSFSDPTEAL 120
           PT  Q P+ TT KK  LD +Q N T FSKIDLPIPF  S       SPL RS SDPTEA 
Sbjct: 61  PTQIQDPAATT-KKHLLDPKQHNLTSFSKIDLPIPFGPSSAHPTPFSPLSRSVSDPTEAR 120

Query: 121 NFSPQSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRL 180
           NFSP SPAKRLC NS LPPLPLRRTVSDP PS E T +SP+ IG+ ND I ED+P+SKRL
Sbjct: 121 NFSPPSPAKRLCPNSALPPLPLRRTVSDPTPSAERTSESPLTIGRVNDSIKEDSPDSKRL 180

Query: 181 RRIKDRLKEMNQWWNEVISEEEH-----DEVNTKKRDCCKEEEDDEETVGVERVGDSLVL 240
           R+IK+RLKEMN+WWNEV+SE+EH     DE  TKK  CCK+EED+EETVGVERVGDSL L
Sbjct: 181 RKIKNRLKEMNEWWNEVMSEQEHEEEKRDENETKK--CCKDEEDEEETVGVERVGDSLEL 240

Query: 241 RLKCSCGKGFEILLSGRSCFYKLL 251
           RLKC CGKGFEILLSG SCFYKLL
Sbjct: 241 RLKCPCGKGFEILLSGTSCFYKLL 261

BLAST of HG10012226 vs. NCBI nr
Match: KAG6606253.1 (hypothetical protein SDJN03_03570, partial [Cucurbita argyrosperma subsp. sororia] >KAG7036195.1 hypothetical protein SDJN02_02996, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 318.2 bits (814), Expect = 6.5e-83
Identity = 179/266 (67.29%), Positives = 197/266 (74.06%), Query Frame = 0

Query: 1   MSNIIQESSEPQNQEESFDPFRFSTLCLNSSAVD---PPLCSSCARRQPRLASTPMKRPS 60
           MSN+IQES+EPQN E+     RFSTLCLN        PPLCSSC RR PR A+T  KR S
Sbjct: 1   MSNLIQESAEPQNPEQQHFDSRFSTLCLNPGGTHHRRPPLCSSCGRRPPRCAATHKKRRS 60

Query: 61  PTPSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHS------VSPLRRSFSDPTEAL 120
           PT  Q P T T+KK  LD +Q N T FSKIDLPIPF  S       SPL RS SDPTEA 
Sbjct: 61  PTQIQDP-TATTKKHLLDPKQHNLTSFSKIDLPIPFGPSSAHPTPFSPLSRSVSDPTEAR 120

Query: 121 NFSPQSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRL 180
           NFSP SPAKRLC NS LPPLPLRRTVSDP PS + T  SP+ IG+ ND I ED+P+SKRL
Sbjct: 121 NFSPPSPAKRLCPNSALPPLPLRRTVSDPTPSTDKTSVSPLTIGRVNDSIKEDSPDSKRL 180

Query: 181 RRIKDRLKEMNQWWNEVISEEEHDE-------VNTKKRDCCKEEEDDEETVGVERVGDSL 240
           R+IKDRLKEMN+WWNEV+SE+EH+E          KK +CCKEEED+EETVGVERVGDSL
Sbjct: 181 RKIKDRLKEMNEWWNEVMSEQEHEEEKRDEKNETKKKVECCKEEEDEEETVGVERVGDSL 240

Query: 241 VLRLKCSCGKGFEILLSGRSCFYKLL 251
            LRLKC CGKGFEILLSG SCFYKLL
Sbjct: 241 ELRLKCPCGKGFEILLSGTSCFYKLL 265

BLAST of HG10012226 vs. NCBI nr
Match: XP_022930995.1 (uncharacterized protein LOC111437321 isoform X2 [Cucurbita moschata])

HSP 1 Score: 317.4 bits (812), Expect = 1.1e-82
Identity = 180/264 (68.18%), Positives = 197/264 (74.62%), Query Frame = 0

Query: 1   MSNIIQESSEPQNQEESFDPFRFSTLCLNSSAVD---PPLCSSCARRQPRLASTPMKRPS 60
           MSN+IQES+EPQN E+     RFSTLCLN        PPLCSSC RR PR A+T  KR S
Sbjct: 1   MSNLIQESAEPQNPEQQHFDSRFSTLCLNPGGTHHRRPPLCSSCGRRPPRCAATHKKRRS 60

Query: 61  PTPSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHS------VSPLRRSFSDPTEAL 120
           PT  Q P T T+KK  LD +Q N T FSKIDLPIPF  S       SPL RS SDPTEA 
Sbjct: 61  PTQIQDP-TATTKKHLLDPKQHNLTSFSKIDLPIPFGPSSAHPTPFSPLSRSVSDPTEAR 120

Query: 121 NFSPQSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRL 180
           NFSP SPAKRLC NS LPPLPLRRTVSDP PS + T  SP+ IG+ ND I ED+P+SKRL
Sbjct: 121 NFSPPSPAKRLCPNSALPPLPLRRTVSDPTPSTDKTSVSPLTIGRVNDSIKEDSPDSKRL 180

Query: 181 RRIKDRLKEMNQWWNEVISEEEH-----DEVNTKKRDCCKEEEDDEETVGVERVGDSLVL 240
           R+IKDRLKEMN+WWNEV+SE+EH     DE  TKK  CCKE+ED+EETVGVERVGDSL L
Sbjct: 181 RKIKDRLKEMNEWWNEVMSEQEHEEEKRDENETKK--CCKEDEDEEETVGVERVGDSLEL 240

Query: 241 RLKCSCGKGFEILLSGRSCFYKLL 251
           RLKC CGKGFEILLSG SCFYKLL
Sbjct: 241 RLKCPCGKGFEILLSGTSCFYKLL 261

BLAST of HG10012226 vs. ExPASy TrEMBL
Match: A0A6J1JY87 (proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111490841 PE=4 SV=1)

HSP 1 Score: 319.7 bits (818), Expect = 1.1e-83
Identity = 180/265 (67.92%), Positives = 200/265 (75.47%), Query Frame = 0

Query: 1   MSNIIQESSEPQNQEESFDPFRFSTLCLNSSAVD---PPLCSSCARRQPRLASTPMKRPS 60
           MSN+IQES+EPQN E+     RFSTLCLN        PPLCSSC RR PR A+T  KR S
Sbjct: 1   MSNLIQESAEPQNPEQQHFDSRFSTLCLNPGGTHHRRPPLCSSCGRRPPRCAATHKKRRS 60

Query: 61  PTPSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHS------VSPLRRSFSDPTEAL 120
           PT  Q P+ TT KK  LD +Q N T FSKIDLPIPF  S       SPL RS SDPTEA 
Sbjct: 61  PTQIQDPAATT-KKHLLDPKQHNLTSFSKIDLPIPFGPSSAHPTPFSPLSRSVSDPTEAR 120

Query: 121 NFSPQSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRL 180
           NFSP SPAKRLC NS LPPLPLRRTVSDP PS E T +SP+ IG+ ND I ED+P+SKRL
Sbjct: 121 NFSPPSPAKRLCPNSALPPLPLRRTVSDPTPSAERTSESPLTIGRVNDSIKEDSPDSKRL 180

Query: 181 RRIKDRLKEMNQWWNEVISEEEH-----DEVNTKKR-DCCKEEEDDEETVGVERVGDSLV 240
           R+IK+RLKEMN+WWNEV+SE+EH     DE  TKK+ +CCK+EED+EETVGVERVGDSL 
Sbjct: 181 RKIKNRLKEMNEWWNEVMSEQEHEEEKRDENETKKKVECCKDEEDEEETVGVERVGDSLE 240

Query: 241 LRLKCSCGKGFEILLSGRSCFYKLL 251
           LRLKC CGKGFEILLSG SCFYKLL
Sbjct: 241 LRLKCPCGKGFEILLSGTSCFYKLL 264

BLAST of HG10012226 vs. ExPASy TrEMBL
Match: A0A6J1K7B1 (proline-, glutamic acid- and leucine-rich protein 1-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111490841 PE=4 SV=1)

HSP 1 Score: 318.5 bits (815), Expect = 2.4e-83
Identity = 180/264 (68.18%), Positives = 198/264 (75.00%), Query Frame = 0

Query: 1   MSNIIQESSEPQNQEESFDPFRFSTLCLNSSAVD---PPLCSSCARRQPRLASTPMKRPS 60
           MSN+IQES+EPQN E+     RFSTLCLN        PPLCSSC RR PR A+T  KR S
Sbjct: 1   MSNLIQESAEPQNPEQQHFDSRFSTLCLNPGGTHHRRPPLCSSCGRRPPRCAATHKKRRS 60

Query: 61  PTPSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHS------VSPLRRSFSDPTEAL 120
           PT  Q P+ TT KK  LD +Q N T FSKIDLPIPF  S       SPL RS SDPTEA 
Sbjct: 61  PTQIQDPAATT-KKHLLDPKQHNLTSFSKIDLPIPFGPSSAHPTPFSPLSRSVSDPTEAR 120

Query: 121 NFSPQSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRL 180
           NFSP SPAKRLC NS LPPLPLRRTVSDP PS E T +SP+ IG+ ND I ED+P+SKRL
Sbjct: 121 NFSPPSPAKRLCPNSALPPLPLRRTVSDPTPSAERTSESPLTIGRVNDSIKEDSPDSKRL 180

Query: 181 RRIKDRLKEMNQWWNEVISEEEH-----DEVNTKKRDCCKEEEDDEETVGVERVGDSLVL 240
           R+IK+RLKEMN+WWNEV+SE+EH     DE  TKK  CCK+EED+EETVGVERVGDSL L
Sbjct: 181 RKIKNRLKEMNEWWNEVMSEQEHEEEKRDENETKK--CCKDEEDEEETVGVERVGDSLEL 240

Query: 241 RLKCSCGKGFEILLSGRSCFYKLL 251
           RLKC CGKGFEILLSG SCFYKLL
Sbjct: 241 RLKCPCGKGFEILLSGTSCFYKLL 261

BLAST of HG10012226 vs. ExPASy TrEMBL
Match: A0A6J1EYB4 (uncharacterized protein LOC111437321 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111437321 PE=4 SV=1)

HSP 1 Score: 317.4 bits (812), Expect = 5.4e-83
Identity = 180/264 (68.18%), Positives = 197/264 (74.62%), Query Frame = 0

Query: 1   MSNIIQESSEPQNQEESFDPFRFSTLCLNSSAVD---PPLCSSCARRQPRLASTPMKRPS 60
           MSN+IQES+EPQN E+     RFSTLCLN        PPLCSSC RR PR A+T  KR S
Sbjct: 1   MSNLIQESAEPQNPEQQHFDSRFSTLCLNPGGTHHRRPPLCSSCGRRPPRCAATHKKRRS 60

Query: 61  PTPSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHS------VSPLRRSFSDPTEAL 120
           PT  Q P T T+KK  LD +Q N T FSKIDLPIPF  S       SPL RS SDPTEA 
Sbjct: 61  PTQIQDP-TATTKKHLLDPKQHNLTSFSKIDLPIPFGPSSAHPTPFSPLSRSVSDPTEAR 120

Query: 121 NFSPQSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRL 180
           NFSP SPAKRLC NS LPPLPLRRTVSDP PS + T  SP+ IG+ ND I ED+P+SKRL
Sbjct: 121 NFSPPSPAKRLCPNSALPPLPLRRTVSDPTPSTDKTSVSPLTIGRVNDSIKEDSPDSKRL 180

Query: 181 RRIKDRLKEMNQWWNEVISEEEH-----DEVNTKKRDCCKEEEDDEETVGVERVGDSLVL 240
           R+IKDRLKEMN+WWNEV+SE+EH     DE  TKK  CCKE+ED+EETVGVERVGDSL L
Sbjct: 181 RKIKDRLKEMNEWWNEVMSEQEHEEEKRDENETKK--CCKEDEDEEETVGVERVGDSLEL 240

Query: 241 RLKCSCGKGFEILLSGRSCFYKLL 251
           RLKC CGKGFEILLSG SCFYKLL
Sbjct: 241 RLKCPCGKGFEILLSGTSCFYKLL 261

BLAST of HG10012226 vs. ExPASy TrEMBL
Match: A0A6J1ET23 (proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111437321 PE=4 SV=1)

HSP 1 Score: 316.6 bits (810), Expect = 9.2e-83
Identity = 180/265 (67.92%), Positives = 198/265 (74.72%), Query Frame = 0

Query: 1   MSNIIQESSEPQNQEESFDPFRFSTLCLNSSAVD---PPLCSSCARRQPRLASTPMKRPS 60
           MSN+IQES+EPQN E+     RFSTLCLN        PPLCSSC RR PR A+T  KR S
Sbjct: 1   MSNLIQESAEPQNPEQQHFDSRFSTLCLNPGGTHHRRPPLCSSCGRRPPRCAATHKKRRS 60

Query: 61  PTPSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHS------VSPLRRSFSDPTEAL 120
           PT  Q P T T+KK  LD +Q N T FSKIDLPIPF  S       SPL RS SDPTEA 
Sbjct: 61  PTQIQDP-TATTKKHLLDPKQHNLTSFSKIDLPIPFGPSSAHPTPFSPLSRSVSDPTEAR 120

Query: 121 NFSPQSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRL 180
           NFSP SPAKRLC NS LPPLPLRRTVSDP PS + T  SP+ IG+ ND I ED+P+SKRL
Sbjct: 121 NFSPPSPAKRLCPNSALPPLPLRRTVSDPTPSTDKTSVSPLTIGRVNDSIKEDSPDSKRL 180

Query: 181 RRIKDRLKEMNQWWNEVISEEEH-----DEVNTKK-RDCCKEEEDDEETVGVERVGDSLV 240
           R+IKDRLKEMN+WWNEV+SE+EH     DE  TKK  +CCKE+ED+EETVGVERVGDSL 
Sbjct: 181 RKIKDRLKEMNEWWNEVMSEQEHEEEKRDENETKKVVECCKEDEDEEETVGVERVGDSLE 240

Query: 241 LRLKCSCGKGFEILLSGRSCFYKLL 251
           LRLKC CGKGFEILLSG SCFYKLL
Sbjct: 241 LRLKCPCGKGFEILLSGTSCFYKLL 264

BLAST of HG10012226 vs. ExPASy TrEMBL
Match: A0A0A0LI25 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G902250 PE=4 SV=1)

HSP 1 Score: 305.8 bits (782), Expect = 1.6e-79
Identity = 184/267 (68.91%), Positives = 201/267 (75.28%), Query Frame = 0

Query: 14  QEESFDPFR-FSTLCLN---SSAVDPPLCSSCARRQPRLASTPMKRPSPTP--SQHPST- 73
           QE+ +DPF+ FSTLCLN   SSAVDP LCSSC R   R ++TPMKRPSPTP  SQ  ST 
Sbjct: 6   QEQPYDPFQSFSTLCLNSSSSSAVDPSLCSSCFRPHSRSSATPMKRPSPTPPSSQQLSTV 65

Query: 74  TTSKKQFLDHQQPNSTPFSKIDLPIPFDHSVSPLRRSFSDPTEALNFSP----QSPAKRL 133
           TTSK   LD QQPNS PFSKI+LPIPF  SVSPLRRS SDPT+A NFSP    QSPAKRL
Sbjct: 66  TTSKNLLLDPQQPNSIPFSKINLPIPFPPSVSPLRRSLSDPTDARNFSPPLQTQSPAKRL 125

Query: 134 CLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRLRRIKDRLKEMN 193
           CLNSPLPPLPLRRTVSDPNP+PE T DSPIKI K       D+PESKRL+RIKDRLKEMN
Sbjct: 126 CLNSPLPPLPLRRTVSDPNPAPEKTSDSPIKIQK-------DSPESKRLKRIKDRLKEMN 185

Query: 194 QWWNEVISEEE--HDEVNTKK-----------RDCCKEEE------DDEETVGVERVGDS 251
            WWNEV+SEEE  +DE   KK           RD  +EEE      DDEETVGVERVGDS
Sbjct: 186 HWWNEVMSEEEEHNDEKEIKKEWFVNGVFEIQRDDEEEEEEEEEEKDDEETVGVERVGDS 245

BLAST of HG10012226 vs. TAIR 10
Match: AT2G32235.1 (unknown protein; Has 38 Blast hits to 38 proteins in 14 species: Archae - 0; Bacteria - 4; Metazoa - 11; Fungi - 11; Plants - 11; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 71.2 bits (173), Expect = 1.3e-12
Identity = 78/242 (32.23%), Positives = 122/242 (50.41%), Query Frame = 0

Query: 50  STPMKRPSPTPSQHPSTTTSKKQFL----DHQQPNSTPFSKIDLP-IPFDHSV--SPL-R 109
           ++P+KRPSP  S+       KK F+    + + PN   +SKI LP + F+ +   SPL +
Sbjct: 72  TSPVKRPSP-ESKQGDEPRRKKLFIPRPEEEEDPNLMGYSKIPLPVVEFNPTQIRSPLYK 131

Query: 110 RSFSDP----------TEALNFSPQSPAKRLCLNS----PLPPLP--LRRTVSDPNPSPE 169
           RS SD           +    ++  S A+     S     LPP P   RR+VSD +P+P 
Sbjct: 132 RSLSDTFASPVGSTFGSGGSGYTRNSVAQETSPPSGNVPSLPPRPRMFRRSVSDLSPAPS 191

Query: 170 NTFDSPIKIGKSNDLIIED--NPES----KRLRRIKDRLKEMNQWWNEVISEEEHDEVNT 229
           +   S +   +SN +   D  NPES    K L  IKD ++E++QW N+++   E     +
Sbjct: 192 S--KSLLGSSRSNAIPEGDLANPESSDANKMLYIIKDGVRELDQWCNKLLKYGEAVSSGS 251

Query: 230 KKRDCCKEEEDD-----------EETVGVERVGDSLVLRLKCSCGKGFEILLSGRSCFYK 251
            K+D   +  D+           +E V V R+G++ V+ + C CG+ ++ L SGR C+YK
Sbjct: 252 VKQDDSPKAVDEVVQQEEQPKECKEGVKVNRLGEAFVVEINCPCGRNYQTLFSGRDCYYK 310

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name       E-value    Identity (%)   Description
XP_038888901.1   2.0e-100   81.32          uncharacterized protein LOC120078676 [Benincasa hispida]
XP_022995232.1   2.2e-83    67.92          proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 [Cucurbita m...
XP_022995233.1   5.0e-83    68.18          proline-, glutamic acid- and leucine-rich protein 1-like isoform X2 [Cucurbita m...
KAG6606253.1     6.5e-83    67.29          hypothetical protein SDJN03_03570, partial [Cucurbita argyrosperma subsp. sorori...
XP_022930995.1   1.1e-82    68.18          uncharacterized protein LOC111437321 isoform X2 [Cucurbita moschata]

vs. ExPASy TrEMBL
Match Name       E-value    Identity (%)   Description
A0A6J1JY87       1.1e-83    67.92          proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 OS=Cucurbita...
A0A6J1K7B1       2.4e-83    68.18          proline-, glutamic acid- and leucine-rich protein 1-like isoform X2 OS=Cucurbita...
A0A6J1EYB4       5.4e-83    68.18          uncharacterized protein LOC111437321 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1ET23       9.2e-83    67.92          proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 OS=Cucurbita...
A0A0A0LI25       1.6e-79    68.91          Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G902250 PE=4 SV=1

vs. TAIR 10
Match Name       E-value    Identity (%)   Description
AT2G32235.1      1.3e-12    32.23          unknown protein; Has 38 Blast hits to 38 proteins in 14 species: Archae - 0; Bac...
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description    Source        Source Term   Source Description    Alignment
None       No IPR available   MOBIDB_LITE   mobidb-lite   disorder_prediction   coord: 134..153
None       No IPR available   MOBIDB_LITE   mobidb-lite   disorder_prediction   coord: 139..153
None       No IPR available   MOBIDB_LITE   mobidb-lite   disorder_prediction   coord: 52..85
None       No IPR available   MOBIDB_LITE   mobidb-lite   disorder_prediction   coord: 32..91
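The four disorder predictions overlap, so the total disordered span is easier to read after merging them. The following is a small sketch, assuming the coordinates are 1-based inclusive residue positions; `merge_intervals` and `covered_residues` are hypothetical helper names, not part of InterPro or MobiDB-lite.

```python
def merge_intervals(intervals):
    """Merge overlapping (start, end) ranges given as inclusive residue coordinates."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous range: extend it if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

def covered_residues(intervals):
    """Number of residues covered by at least one interval."""
    return sum(end - start + 1 for start, end in merge_intervals(intervals))

# Coordinates taken from the table above.
disorder = [(134, 153), (139, 153), (52, 85), (32, 91)]
print(merge_intervals(disorder))
print(covered_residues(disorder))
```

With these inputs the predictions collapse to two regions, 32..91 and 134..153, covering 80 residues in total.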

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
HG10012226.1   HG10012226.1   mRNA