HG10012231 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10012231
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Unknown protein
Location: Chr01: 19145890 .. 19146216 (+)
RNA-Seq Expression: HG10012231
Synteny: HG10012231
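
The Location field can be turned into a feature length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based with both ends included; `parse_location` and `feature_length` are hypothetical helper names, not part of the database.

```python
import re

def parse_location(text):
    """Parse 'Chr01: 19145890 .. 19146216 (+)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def feature_length(start, end):
    # Assumption: 1-based coordinates, both ends inclusive.
    return end - start + 1

seqid, start, end, strand = parse_location("Chr01: 19145890 .. 19146216 (+)")
length = feature_length(start, end)
print(seqid, strand, length, length // 3)  # Chr01 + 327 109
```

Under that assumption the span is 327 nt, which is 109 codons: enough for a 108-residue protein plus a stop codon.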
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAATTGCCTAACATGCCAAGCCCTTCCAAGGACCCAATCTGATAGGGAGAATAATCATGGTTATGAAACCCCCTCTAGTAGAGGCAAATCATGTTGCCTATACGTCCCGCGGAGGTGGTCGGCCGAATTGACTCCCCCGTCGTACGAGTCGATTAAAATCGACGACGATATTAGCTCTTCTTCGCACAAGAAGGCCAAGACTCGCCGCATTCATAGTGACTGTAGCGGGAACGAGCCGAGGTTGGTGAGGAGCTCGGGGATGAGAAGGGATTGGAGCTTTGAGGATTTGGGATTGAGAGATCAAAAGAAAGGGAGATTCCATTGA

mRNA sequence

ATGAATTGCCTAACATGCCAAGCCCTTCCAAGGACCCAATCTGATAGGGAGAATAATCATGGTTATGAAACCCCCTCTAGTAGAGGCAAATCATGTTGCCTATACGTCCCGCGGAGGTGGTCGGCCGAATTGACTCCCCCGTCGTACGAGTCGATTAAAATCGACGACGATATTAGCTCTTCTTCGCACAAGAAGGCCAAGACTCGCCGCATTCATAGTGACTGTAGCGGGAACGAGCCGAGGTTGGTGAGGAGCTCGGGGATGAGAAGGGATTGGAGCTTTGAGGATTTGGGATTGAGAGATCAAAAGAAAGGGAGATTCCATTGA

Coding sequence (CDS)

ATGAATTGCCTAACATGCCAAGCCCTTCCAAGGACCCAATCTGATAGGGAGAATAATCATGGTTATGAAACCCCCTCTAGTAGAGGCAAATCATGTTGCCTATACGTCCCGCGGAGGTGGTCGGCCGAATTGACTCCCCCGTCGTACGAGTCGATTAAAATCGACGACGATATTAGCTCTTCTTCGCACAAGAAGGCCAAGACTCGCCGCATTCATAGTGACTGTAGCGGGAACGAGCCGAGGTTGGTGAGGAGCTCGGGGATGAGAAGGGATTGGAGCTTTGAGGATTTGGGATTGAGAGATCAAAAGAAAGGGAGATTCCATTGA

Protein sequence

MNCLTCQALPRTQSDRENNHGYETPSSRGKSCCLYVPRRWSAELTPPSYESIKIDDDISSSSHKKAKTRRIHSDCSGNEPRLVRSSGMRRDWSFEDLGLRDQKKGRFH
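
The relationship between the coding sequence and the protein sequence can be checked by translating the CDS in frame 0. The sketch below assumes the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper, and only the first ten and last three codons of the CDS are used as input to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons and last three codons of the CDS listed above.
print(translate("ATGAATTGCCTAACATGCCAAGCCCTTCCA"))  # MNCLTCQALP
print(translate("TTCCATTGA"))                       # FH
```

Passing the full CDS to `translate` should reproduce the protein sequence in the same way; the terminal TGA is the stop codon and is not part of the protein.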
Homology
BLAST of HG10012231 vs. NCBI nr
Match: XP_038887010.1 (uncharacterized protein LOC120077178 [Benincasa hispida])

HSP 1 Score: 198.0 bits (502), Expect = 4.2e-47
Identity = 97/108 (89.81%), Positives = 98/108 (90.74%), Query Frame = 0

Query: 1   MNCLTCQALPRTQSDRENNHGYETPSSRGKSCCLYVPRRWSAELTPPSYESIKIDDDISS 60
           MNCLTCQALPRTQSDRE NHGYET SSRGKSCCLYV RRWSAELTPPSYESIK DDDISS
Sbjct: 3   MNCLTCQALPRTQSDRE-NHGYETSSSRGKSCCLYVSRRWSAELTPPSYESIKNDDDISS 62

Query: 61  SSHKKAKTRRIHSDCSGNEPRLVRSSGMRRDWSFEDLGLRDQKKGRFH 109
           S  KKAK RR+HSDC  NEPRLVRSSGMRRDWSFEDLGLRDQKK RFH
Sbjct: 63  SLQKKAKNRRVHSDCGRNEPRLVRSSGMRRDWSFEDLGLRDQKKVRFH 109
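
The Identity and Positives figures reported for a hit can be recomputed from the midline printed between the Query and Sbjct rows. This is a minimal sketch, assuming the usual BLASTP convention (a letter for an identical pair, `+` for a conservative substitution, a space otherwise); `midline_stats` is a hypothetical helper, and the two strings are the midline rows of the Benincasa hispida hit above.

```python
def midline_stats(midline):
    """Return (identities, positives, alignment_length) from a BLAST midline.

    Assumes a letter marks an identical pair, '+' marks a conservative
    substitution, and a space marks a mismatch or gap.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)

# Midline rows of the first hit, copied with their spaces preserved.
rows = [
    "MNCLTCQALPRTQSDRE NHGYET SSRGKSCCLYV RRWSAELTPPSYESIK DDDISS",
    "S  KKAK RR+HSDC  NEPRLVRSSGMRRDWSFEDLGLRDQKK RFH",
]
identities, positives, length = midline_stats("".join(rows))
print(f"Identity = {identities}/{length} ({100 * identities / length:.2f}%)")
print(f"Positives = {positives}/{length} ({100 * positives / length:.2f}%)")
```

The denominator is the alignment length including gap columns, not the query length, which is why hits with more gaps report totals such as 111 or 112.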

BLAST of HG10012231 vs. NCBI nr
Match: KGN60385.1 (hypothetical protein Csa_002007 [Cucumis sativus])

HSP 1 Score: 185.7 bits (470), Expect = 2.2e-43
Identity = 94/112 (83.93%), Positives = 101/112 (90.18%), Query Frame = 0

Query: 1   MNCLTCQALPRTQSDRENNHGYETPSSRGKSCCLYVPRRWSAEL-TPPSYESIKIDDD-- 60
           MNCLTCQALPRTQSDRE N GYETPS+RGKSCCLYVPRRWS EL TP SY+SIKIDDD  
Sbjct: 3   MNCLTCQALPRTQSDRE-NRGYETPSTRGKSCCLYVPRRWSTELTTPSSYDSIKIDDDHH 62

Query: 61  ISSSS-HKKAKTRRIHSDCSGNEPRLVRSSGMRRDWSFEDLGLRDQKKGRFH 109
           ISSS+ HKKA+TRR+ S+C GNEP+LVRSSGMRRDWSFEDLGLR QKKGRFH
Sbjct: 63  ISSSNVHKKARTRRVRSECGGNEPKLVRSSGMRRDWSFEDLGLRGQKKGRFH 113

BLAST of HG10012231 vs. NCBI nr
Match: XP_031737602.1 (uncharacterized protein LOC116402474 [Cucumis sativus])

HSP 1 Score: 185.7 bits (470), Expect = 2.2e-43
Identity = 94/112 (83.93%), Positives = 101/112 (90.18%), Query Frame = 0

Query: 1   MNCLTCQALPRTQSDRENNHGYETPSSRGKSCCLYVPRRWSAEL-TPPSYESIKIDDD-- 60
           MNCLTCQALPRTQSDRE N GYETPS+RGKSCCLYVPRRWS EL TP SY+SIKIDDD  
Sbjct: 1   MNCLTCQALPRTQSDRE-NRGYETPSTRGKSCCLYVPRRWSTELTTPSSYDSIKIDDDHH 60

Query: 61  ISSSS-HKKAKTRRIHSDCSGNEPRLVRSSGMRRDWSFEDLGLRDQKKGRFH 109
           ISSS+ HKKA+TRR+ S+C GNEP+LVRSSGMRRDWSFEDLGLR QKKGRFH
Sbjct: 61  ISSSNVHKKARTRRVRSECGGNEPKLVRSSGMRRDWSFEDLGLRGQKKGRFH 111

BLAST of HG10012231 vs. NCBI nr
Match: XP_008465978.2 (PREDICTED: uncharacterized protein LOC103503546 [Cucumis melo] >TYK31168.1 uncharacterized protein E5676_scaffold455G004480 [Cucumis melo var. makuwa])

HSP 1 Score: 181.8 bits (460), Expect = 3.1e-42
Identity = 93/111 (83.78%), Positives = 98/111 (88.29%), Query Frame = 0

Query: 1   MNCLTCQALPRTQSDRENNHGYETPSSRGKSCCLYVPRRWSAELTPPSYESIKIDDDISS 60
           MNCLTCQALPRTQSDRENNHGYETPSSRGKSCCLYVPRRWSAELTP  +  IK+D+DISS
Sbjct: 1   MNCLTCQALPRTQSDRENNHGYETPSSRGKSCCLYVPRRWSAELTP--HNPIKLDEDISS 60

Query: 61  SS-HKKAKTRRIHS-DCSGNEPRLVRSSGMRRDWSFEDLGLRD-QKKGRFH 109
           S  HKK ++RR  S D  GNEPRLVRSSGMRRDWSFEDLGLRD QKKGRFH
Sbjct: 61  SDLHKKPRSRRARSGDYGGNEPRLVRSSGMRRDWSFEDLGLRDHQKKGRFH 109

BLAST of HG10012231 vs. NCBI nr
Match: KAA0038574.1 (uncharacterized protein E6C27_scaffold92G001040 [Cucumis melo var. makuwa])

HSP 1 Score: 181.8 bits (460), Expect = 3.1e-42
Identity = 93/111 (83.78%), Positives = 98/111 (88.29%), Query Frame = 0

Query: 1   MNCLTCQALPRTQSDRENNHGYETPSSRGKSCCLYVPRRWSAELTPPSYESIKIDDDISS 60
           MNCLTCQALPRTQSDRENNHGYETPSSRGKSCCLYVPRRWSAELTP  +  IK+D+DISS
Sbjct: 3   MNCLTCQALPRTQSDRENNHGYETPSSRGKSCCLYVPRRWSAELTP--HNPIKLDEDISS 62

Query: 61  SS-HKKAKTRRIHS-DCSGNEPRLVRSSGMRRDWSFEDLGLRD-QKKGRFH 109
           S  HKK ++RR  S D  GNEPRLVRSSGMRRDWSFEDLGLRD QKKGRFH
Sbjct: 63  SDLHKKPRSRRARSGDYGGNEPRLVRSSGMRRDWSFEDLGLRDHQKKGRFH 111

BLAST of HG10012231 vs. ExPASy TrEMBL
Match: A0A0A0LEC0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G902280 PE=4 SV=1)

HSP 1 Score: 185.7 bits (470), Expect = 1.1e-43
Identity = 94/112 (83.93%), Positives = 101/112 (90.18%), Query Frame = 0

Query: 1   MNCLTCQALPRTQSDRENNHGYETPSSRGKSCCLYVPRRWSAEL-TPPSYESIKIDDD-- 60
           MNCLTCQALPRTQSDRE N GYETPS+RGKSCCLYVPRRWS EL TP SY+SIKIDDD  
Sbjct: 3   MNCLTCQALPRTQSDRE-NRGYETPSTRGKSCCLYVPRRWSTELTTPSSYDSIKIDDDHH 62

Query: 61  ISSSS-HKKAKTRRIHSDCSGNEPRLVRSSGMRRDWSFEDLGLRDQKKGRFH 109
           ISSS+ HKKA+TRR+ S+C GNEP+LVRSSGMRRDWSFEDLGLR QKKGRFH
Sbjct: 63  ISSSNVHKKARTRRVRSECGGNEPKLVRSSGMRRDWSFEDLGLRGQKKGRFH 113

BLAST of HG10012231 vs. ExPASy TrEMBL
Match: A0A5A7T546 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold92G001040 PE=4 SV=1)

HSP 1 Score: 181.8 bits (460), Expect = 1.5e-42
Identity = 93/111 (83.78%), Positives = 98/111 (88.29%), Query Frame = 0

Query: 1   MNCLTCQALPRTQSDRENNHGYETPSSRGKSCCLYVPRRWSAELTPPSYESIKIDDDISS 60
           MNCLTCQALPRTQSDRENNHGYETPSSRGKSCCLYVPRRWSAELTP  +  IK+D+DISS
Sbjct: 3   MNCLTCQALPRTQSDRENNHGYETPSSRGKSCCLYVPRRWSAELTP--HNPIKLDEDISS 62

Query: 61  SS-HKKAKTRRIHS-DCSGNEPRLVRSSGMRRDWSFEDLGLRD-QKKGRFH 109
           S  HKK ++RR  S D  GNEPRLVRSSGMRRDWSFEDLGLRD QKKGRFH
Sbjct: 63  SDLHKKPRSRRARSGDYGGNEPRLVRSSGMRRDWSFEDLGLRDHQKKGRFH 111

BLAST of HG10012231 vs. ExPASy TrEMBL
Match: A0A5D3E5T4 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold455G004480 PE=4 SV=1)

HSP 1 Score: 181.8 bits (460), Expect = 1.5e-42
Identity = 93/111 (83.78%), Positives = 98/111 (88.29%), Query Frame = 0

Query: 1   MNCLTCQALPRTQSDRENNHGYETPSSRGKSCCLYVPRRWSAELTPPSYESIKIDDDISS 60
           MNCLTCQALPRTQSDRENNHGYETPSSRGKSCCLYVPRRWSAELTP  +  IK+D+DISS
Sbjct: 1   MNCLTCQALPRTQSDRENNHGYETPSSRGKSCCLYVPRRWSAELTP--HNPIKLDEDISS 60

Query: 61  SS-HKKAKTRRIHS-DCSGNEPRLVRSSGMRRDWSFEDLGLRD-QKKGRFH 109
           S  HKK ++RR  S D  GNEPRLVRSSGMRRDWSFEDLGLRD QKKGRFH
Sbjct: 61  SDLHKKPRSRRARSGDYGGNEPRLVRSSGMRRDWSFEDLGLRDHQKKGRFH 109

BLAST of HG10012231 vs. ExPASy TrEMBL
Match: A0A1S3CRL3 (uncharacterized protein LOC103503546 OS=Cucumis melo OX=3656 GN=LOC103503546 PE=4 SV=1)

HSP 1 Score: 181.8 bits (460), Expect = 1.5e-42
Identity = 93/111 (83.78%), Positives = 98/111 (88.29%), Query Frame = 0

Query: 1   MNCLTCQALPRTQSDRENNHGYETPSSRGKSCCLYVPRRWSAELTPPSYESIKIDDDISS 60
           MNCLTCQALPRTQSDRENNHGYETPSSRGKSCCLYVPRRWSAELTP  +  IK+D+DISS
Sbjct: 1   MNCLTCQALPRTQSDRENNHGYETPSSRGKSCCLYVPRRWSAELTP--HNPIKLDEDISS 60

Query: 61  SS-HKKAKTRRIHS-DCSGNEPRLVRSSGMRRDWSFEDLGLRD-QKKGRFH 109
           S  HKK ++RR  S D  GNEPRLVRSSGMRRDWSFEDLGLRD QKKGRFH
Sbjct: 61  SDLHKKPRSRRARSGDYGGNEPRLVRSSGMRRDWSFEDLGLRDHQKKGRFH 109

BLAST of HG10012231 vs. ExPASy TrEMBL
Match: A0A6J1DZN2 (uncharacterized protein LOC111026067 OS=Momordica charantia OX=3673 GN=LOC111026067 PE=4 SV=1)

HSP 1 Score: 144.1 bits (362), Expect = 3.5e-31
Identity = 83/125 (66.40%), Positives = 87/125 (69.60%), Query Frame = 0

Query: 1   MNCLTCQALPRTQSDRENN----HGYETPSS--RGKSCCLYVPRRWSAELTPPSYESIKI 60
           MNCLTCQALPRTQSDRENN    + YET SS  RGKSCCL+V RRWSAELTP SY  IK 
Sbjct: 3   MNCLTCQALPRTQSDRENNNNNYYAYETSSSRGRGKSCCLHVTRRWSAELTPTSYGHIKT 62

Query: 61  DDDIS--------SSSHKKAKTRRIHSDCSG---NEPRLVRSSGMRRDWSFEDLGLRDQK 109
             D +        S S KK K R + S       NEPRLVRSSGMRRDWSFEDL LRDQK
Sbjct: 63  TADNAREGCSSRKSFSDKKVKDRYLRSGSMNEKENEPRLVRSSGMRRDWSFEDLRLRDQK 122

BLAST of HG10012231 vs. TAIR 10
Match: AT5G46770.1 (unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 70.9 bits (172), Expect = 7.3e-13
Identity = 50/129 (38.76%), Positives = 72/129 (55.81%), Query Frame = 0

Query: 1   MNCLTCQALPRTQSDRENNHG------YETPSSRGKSCCL--YVPRRWSAELTPPSYESI 60
           +NCL+CQALPRT S+++ +         E  +  GK+CC+     R WS  L+P  YE  
Sbjct: 3   LNCLSCQALPRTDSNKDVDLSGPGPPRVEINNVLGKTCCVNPIGGRNWSGNLSPRIYE-- 62

Query: 61  KIDDDISSSSHKKAKTRRIHS-DCSG-------------NEPRLVRSSGMRRDWSFEDL- 105
           KI    SS +HK  K ++IH    SG              +P+LVRS+G+RR+WSFE+L 
Sbjct: 63  KIGRPGSSLAHKMKKVKKIHHVRLSGPVGSSPSNVPTRPEQPKLVRSTGVRRNWSFENLR 122

BLAST of HG10012231 vs. TAIR 10
Match: AT2G35215.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G46770.1); Has 19 Blast hits to 19 proteins in 6 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 19; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 47.0 bits (110), Expect = 1.1e-05
Identity = 31/97 (31.96%), Positives = 48/97 (49.48%), Query Frame = 0

Query: 1  MNCLTCQALPRTQSDRENNHGYETPSSRGKSCCLYVPRRWSAELTPPSYESIKIDDDISS 60
          +NCL C  L RT SDR+     ++      +   +       E    +  S+ +   + +
Sbjct: 3  LNCLACHILQRTDSDRDMGSRKDSSFKENFATSAF-------EKMVRNRSSLPVVRRV-N 62

Query: 61 SSHKKAKTRRIHSDCSGNEPRLVRSSGMRRDWSFEDL 98
            H++  +  I      +EP+LVRSSG+RRDWSFEDL
Sbjct: 63 KGHRRLYSADIMVYGELDEPKLVRSSGIRRDWSFEDL 91

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity (%)  Description
XP_038887010.1  4.2e-47  89.81         uncharacterized protein LOC120077178 [Benincasa hispida]
KGN60385.1      2.2e-43  83.93         hypothetical protein Csa_002007 [Cucumis sativus]
XP_031737602.1  2.2e-43  83.93         uncharacterized protein LOC116402474 [Cucumis sativus]
XP_008465978.2  3.1e-42  83.78         PREDICTED: uncharacterized protein LOC103503546 [Cucumis melo] >TYK31168.1 uncha...
KAA0038574.1    3.1e-42  83.78         uncharacterized protein E6C27_scaffold92G001040 [Cucumis melo var. makuwa]

ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A0A0LEC0      1.1e-43  83.93         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G902280 PE=4 SV=1
A0A5A7T546      1.5e-42  83.78         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold...
A0A5D3E5T4      1.5e-42  83.78         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3CRL3      1.5e-42  83.78         uncharacterized protein LOC103503546 OS=Cucumis melo OX=3656 GN=LOC103503546 PE=...
A0A6J1DZN2      3.5e-31  66.40         uncharacterized protein LOC111026067 OS=Momordica charantia OX=3673 GN=LOC111026...

TAIR 10
Match Name      E-value  Identity (%)  Description
AT5G46770.1     7.3e-13  38.76         unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae -...
AT2G35215.1     1.1e-05  31.96         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source       Source Term  Source Description   Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 52..83
None      No IPR available  PANTHER      PTHR36019    PLANT/PROTEIN        coord: 1..103
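
The `coord` ranges in the Alignment column can be mapped back onto the protein sequence. The sketch below assumes the coordinates are 1-based and inclusive, which is a common convention for domain annotations but is not stated on this page; `region` is a hypothetical helper.

```python
def region(sequence, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]

# Protein sequence of HG10012231 as listed in the Sequences section.
PROTEIN = (
    "MNCLTCQALPRTQSDRENNHGYETPSSRGKSCCLYVPRRWSAELTPPSYESIKIDDDISS"
    "SSHKKAKTRRIHSDCSGNEPRLVRSSGMRRDWSFEDLGLRDQKKGRFH"
)

disordered = region(PROTEIN, 52, 83)   # mobidb-lite disorder prediction
panther = region(PROTEIN, 1, 103)      # PANTHER PTHR36019 match
print(len(disordered), disordered)
print(len(panther))
```

Under this assumption the predicted disordered region is 32 residues long and the PANTHER match covers all but the last five residues of the 108-residue protein.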

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10012231.1  HG10012231.1  mRNA