Lsi05G022010 (gene) Bottle gourd (USVL1VR-Ls)

Name: Lsi05G022010
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: Zinc ion binding protein
Location: chr05 : 28577295 .. 28577966 (-)
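
The length of the annotated region follows directly from the location coordinates. The sketch below assumes the coordinates are 1-based and inclusive (a common genome-browser convention that this page does not state explicitly); `parse_location` and `span_length` are hypothetical helper names used only for illustration.

```python
import re

def parse_location(text):
    """Parse a location string such as 'chr05 : 28577295 .. 28577966 (-)'."""
    m = re.search(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1

chrom, start, end, strand = parse_location("chr05 : 28577295 .. 28577966 (-)")
print(chrom, strand, span_length(start, end))  # chr05 - 672
```

The `(-)` strand flag indicates that the feature lies on the reverse strand of the reference chromosome.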
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
ATGGATTTGAATCGCCCACTCTGCTGCTTGTGTGGCGACGTTGGTTTTCCAGCGAAACTCTTCCGCTGCCTTGATTGCTCCAATCGATTTCAGCACTCGTAAGACCGTCTTTTGTTTTAATTTATTAGGCTATGATACTATGTTAGATGTTGTATTATCTTAAACTTACATATTTTGGTAGTTCAGTTACTGCAGCAACTACTACTGTGGGGAATCGGGGGATGCAATAGTTCAAGTATGCGATTGGTGTCGAAGTGAACAGAGAACCCACCGTCCTGCCTCTGCAACTGCGAATAATAATAGTGGTACTATGTCCCACAAGGATCATAAAATCACTGATCAAATTACACAAATAAGAAGCTCCGGCGCCGGACTGCCTTCGTCTCGCCCTGCTCCACGCCGATACAAGCTTCTCAAGGATGTCATGTGTTAGAAACTTTTCCATCAACCCCCAACAAACCAAATACTTTTTCAATTTTCATTTTCTTATTTACATATATGCACCTATATCATCTTAATTAATATAATCCTGTATGTGTATGTCGCCTTTCAAAATTAAATTATCCACAACATAAGATGGTTCTCTGTATTTTATAATCAAGGCATTTTCAATTACTTCAACCTTTTTAATGCATTCGTTTGCTCTTGAAATAATCTGAATATATATTCAAC

mRNA sequence

ATGGATTTGAATCGCCCACTCTGCTGCTTGTGTGGCGACGTTGGTTTTCCAGCGAAACTCTTCCGCTGCCTTGATTGCTCCAATCGATTTCAGCACTCTTACTGCAGCAACTACTACTGTGGGGAATCGGGGGATGCAATAGTTCAAGTATGCGATTGGTGTCGAAGTGAACAGAGAACCCACCGTCCTGCCTCTGCAACTGCGAATAATAATAGTGGTACTATGTCCCACAAGGATCATAAAATCACTGATCAAATTACACAAATAAGAAGCTCCGGCGCCGGACTGCCTTCGTCTCGCCCTGCTCCACGCCGATACAAGCTTCTCAAGGATGTCATGTGTTAGAAACTTTTCCATCAACCCCCAACAAACCAAATACTTTTTCAATTTTCATTTTCTTATTTACATATATGCACCTATATCATCTTAATTAATATAATCCTGTATGTGTATGTCGCCTTTCAAAATTAAATTATCCACAACATAAGATGGTTCTCTGTATTTTATAATCAAGGCATTTTCAATTACTTCAACCTTTTTAATGCATTCGTTTGCTCTTGAAATAATCTGAATATATATTCAAC
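
The gene and mRNA sequences differ by one contiguous segment, the intron. A minimal sketch of how that segment can be located by comparing the two strings is shown below. It assumes a single intron and uses short excerpts copied from the sequences above rather than the full records; `locate_single_intron` is a hypothetical helper name.

```python
def locate_single_intron(gene, mrna):
    """Return (start, end) of the one gene segment absent from mrna.

    Indices are 0-based and end-exclusive. Assumes exactly one intron.
    """
    # Longest common prefix corresponds to the upstream exon.
    prefix = 0
    while prefix < len(mrna) and gene[prefix] == mrna[prefix]:
        prefix += 1
    # Longest common suffix (not overlapping the prefix) is the downstream exon.
    suffix = 0
    while suffix < len(mrna) - prefix and gene[-1 - suffix] == mrna[-1 - suffix]:
        suffix += 1
    if prefix + suffix != len(mrna):
        raise ValueError("sequences do not differ by a single contiguous gap")
    return prefix, len(gene) - suffix

# Excerpts around the exon junction, copied from the sequences above.
gene_excerpt = (
    "AATCGATTTCAGCACTC"
    "GTAAGACCGTCTTTTGTTTTAATTTATTAGGCTATGATACTATGTTAGATG"
    "TTGTATTATCTTAAACTTACATATTTTGGTAGTTCAG"
    "TTACTGCAGCAACTAC"
)
mrna_excerpt = "AATCGATTTCAGCACTCTTACTGCAGCAACTAC"

start, end = locate_single_intron(gene_excerpt, mrna_excerpt)
intron = gene_excerpt[start:end]
print(intron[:2], intron[-2:])  # GT AG
```

The recovered segment begins with GT and ends with AG, the canonical splice-site dinucleotides.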

Coding sequence (CDS)

ATGGATTTGAATCGCCCACTCTGCTGCTTGTGTGGCGACGTTGGTTTTCCAGCGAAACTCTTCCGCTGCCTTGATTGCTCCAATCGATTTCAGCACTCTTACTGCAGCAACTACTACTGTGGGGAATCGGGGGATGCAATAGTTCAAGTATGCGATTGGTGTCGAAGTGAACAGAGAACCCACCGTCCTGCCTCTGCAACTGCGAATAATAATAGTGGTACTATGTCCCACAAGGATCATAAAATCACTGATCAAATTACACAAATAAGAAGCTCCGGCGCCGGACTGCCTTCGTCTCGCCCTGCTCCACGCCGATACAAGCTTCTCAAGGATGTCATGTGTTAG

Protein sequence

MDLNRPLCCLCGDVGFPAKLFRCLDCSNRFQHSYCSNYYCGESGDAIVQVCDWCRSEQRTHRPASATANNNSGTMSHKDHKITDQITQIRSSGAGLPSSRPAPRRYKLLKDVMC
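
The protein sequence is the translation of the CDS. The following sketch shows that relationship using the standard genetic code; it is an illustration rather than the pipeline used to produce this record, and `translate` is a hypothetical helper name. Only the first and last few codons of the CDS are used as examples.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGATTTGAATCGC"))  # MDLNR  (first five codons of the CDS)
print(translate("ATGTGTTAG"))        # MC     (last three codons, ending in TAG)
```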
BLAST of Lsi05G022010 vs. TrEMBL
Match: A0A0A0L5P1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G117940 PE=4 SV=1)

HSP 1 Score: 180.3 bits (456), Expect = 1.4e-42
Identity = 86/114 (75.44%), Positives = 93/114 (81.58%), Query Frame = 1

Query: 1   MDLNRPLCCLCGDVGFPAKLFRCLDCSNRFQHSYCSNYYCGESGDAIVQVCDWCRSEQRT 60
           MDLN PLCCLCGDVGFPAKLFRC +CSNRFQHSYCSNYYCGESGDA ++VCDWCRSEQRT
Sbjct: 1   MDLNPPLCCLCGDVGFPAKLFRCTNCSNRFQHSYCSNYYCGESGDATIRVCDWCRSEQRT 60

Query: 61  HRPASATANNNSGTMSHKDHKITDQITQIRSSGAGLPSSRPAPRRYKLLKDVMC 115
            RPA A+      T S K +KIT++    RSS  GLPS RPAPRRYKLLKDVMC
Sbjct: 61  CRPAFAS------TTSQKSNKITER----RSSAVGLPSPRPAPRRYKLLKDVMC 104
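
The percentages in each HSP header are the matched counts divided by the alignment length (for example 86/114 for identity). The sketch below recomputes them from a header line; `hsp_percentages` is a hypothetical helper name and the regular expressions are an assumption about the header layout shown on this page.

```python
import re

def hsp_percentages(header):
    """Return (identity %, positives %) recomputed from the raw counts."""
    ident = re.search(r"Identity\s*=\s*(\d+)/(\d+)", header)
    # 'Pos\w*' accepts 'Positives' as well as misspelt variants of the label.
    pos = re.search(r"Pos\w*\s*=\s*(\d+)/(\d+)", header)
    if ident is None or pos is None:
        raise ValueError("not an HSP header line")
    id_n, id_len = map(int, ident.groups())
    pos_n, pos_len = map(int, pos.groups())
    return round(100 * id_n / id_len, 2), round(100 * pos_n / pos_len, 2)

line = "Identity = 86/114 (75.44%), Positives = 93/114 (81.58%), Query Frame = 1"
print(hsp_percentages(line))  # (75.44, 81.58)
```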

BLAST of Lsi05G022010 vs. TrEMBL
Match: A0A022RQV6_ERYGU (Uncharacterized protein OS=Erythranthe guttata GN=MIMGU_mgv1a016574mg PE=4 SV=1)

HSP 1 Score: 122.9 bits (307), Expect = 2.6e-25
Identity = 62/111 (55.86%), Positives = 76/111 (68.47%), Query Frame = 1

Query: 5   RPLCCLCGDVGFPAKLFRCLDCSNRFQHSYCSNYYCGESGD-AIVQVCDWCRSEQRTHRP 64
           R +CCLCGDVGF  KLFRC  C  RFQHSYCSNY C E  + A++QVCDWCR ++R +  
Sbjct: 5   RTVCCLCGDVGFSDKLFRCSKCLTRFQHSYCSNYNCREYAEAAVLQVCDWCRCDEREYNI 64

Query: 65  ASATANNNSGTMSHKDHKITDQITQIRSSGAGLPSSRPAPRRYKLLKDVMC 115
           +S+  NN+     +   KI  Q T  +S+G  LPS RP+ RRYKLLKDVMC
Sbjct: 65  SSSKNNNSHIRSPYSGEKIKQQ-TPEKSTGK-LPSPRPSARRYKLLKDVMC 113

BLAST of Lsi05G022010 vs. TrEMBL
Match: A0A061DUE3_THECC (Late cornified envelope protein 1E OS=Theobroma cacao GN=TCM_005618 PE=4 SV=1)

HSP 1 Score: 117.1 bits (292), Expect = 1.4e-23
Identity = 59/119 (49.58%), Positives = 77/119 (64.71%), Query Frame = 1

Query: 7   LCCLCGDVGFPAKLFRCLDCSNRFQHSYCSNYYCGESGDAIVQVCDWCRSEQRTHRPAS- 66
           +CC+CGDVGFP KLFRC  C +RFQHSYCSNYY  E  + I ++CDWC+SE+R+ R  S 
Sbjct: 7   VCCMCGDVGFPDKLFRCNKCRHRFQHSYCSNYY-SELAEPI-ELCDWCQSEERSSRHGSS 66

Query: 67  ---ATANNNSGTMSHKDHKITDQITQ-------IRSSGAGLPSSRPAPRRYKLLKDVMC 115
              ++  N +G  +  ++  TD+I Q        +   +G PS RP  RRYKLLKDVMC
Sbjct: 67  SKKSSTGNETGITNRSEYSGTDKIKQQDRDESAEKGKSSGTPSPRPTTRRYKLLKDVMC 123

BLAST of Lsi05G022010 vs. TrEMBL
Match: A9PG29_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0014s05730g PE=2 SV=1)

HSP 1 Score: 117.1 bits (292), Expect = 1.4e-23
Identity = 65/127 (51.18%), Positives = 79/127 (62.20%), Query Frame = 1

Query: 7   LCCLCGDVGFPAKLFRCLDCSNRFQHSYCSNYYCGESGDAIVQVCDWCRSEQRTHRPASA 66
           +CC+CGDVGFP KLFRC  C NRFQH YCSNYY GE  + I Q CDWC+SE+R  R  ++
Sbjct: 7   VCCMCGDVGFPDKLFRCNKCRNRFQHLYCSNYY-GEFSEPIEQ-CDWCQSEERNARHGNS 66

Query: 67  T----ANNNSGTMSHK------DHKI---------TDQITQIRSSGAGLPSSRPAPRRYK 115
           +    A ++SGT+  K      DHKI         T    Q   + +G+PS RP  RRYK
Sbjct: 67  SKKSGAEHDSGTLVTKRSEYSGDHKIKQHDREENSTTSSDQKGKNPSGIPSPRPTTRRYK 126

BLAST of Lsi05G022010 vs. TrEMBL
Match: D7LRR6_ARALL (Zinc ion binding protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_907580 PE=4 SV=1)

HSP 1 Score: 114.4 bits (285), Expect = 9.2e-23
Identity = 64/130 (49.23%), Positives = 78/130 (60.00%), Query Frame = 1

Query: 1   MDLNRPLCCLCGDVGFPAKLFRCLDCSNRFQHSYCSNYYCGESGDAIVQVCDWCRSEQRT 60
           +DL R +CC+CGDVGF  KLF C  C NRFQHSYCS+YY  E  D I ++CDWC+ E ++
Sbjct: 2   VDLERRVCCMCGDVGFFDKLFHCSKCLNRFQHSYCSSYY-KEQADPI-KICDWCQCEAKS 61

Query: 61  HRPASATANNNSGTMSHKD------HKITDQ-ITQIRSSG---------AGLPSSRPAPR 115
              A   AN  S   S++       H+I  Q I Q  SS          +G+PS RPA R
Sbjct: 62  RTGAKHGANGGSSKRSYRSEYSSAHHQIKQQEIHQTTSSSIPPAAEKGKSGVPSPRPATR 121

BLAST of Lsi05G022010 vs. TAIR10
Match: AT3G60520.1 (AT3G60520.1 unknown protein)

HSP 1 Score: 113.2 bits (282), Expect = 1.0e-25
Identity = 63/130 (48.46%), Positives = 76/130 (58.46%), Query Frame = 1

Query: 1   MDLNRPLCCLCGDVGFPAKLFRCLDCSNRFQHSYCSNYYCGESGDAIVQVCDWCRSEQRT 60
           +DL R +CC+CGDVGF  KLF C  C NRFQHSYCS+YY  E  D I ++CDWC+ E ++
Sbjct: 2   VDLERRVCCMCGDVGFFDKLFHCSKCLNRFQHSYCSSYY-KEQADPI-KICDWCQCEAKS 61

Query: 61  HRPASATANNNSGTMSHKD------HKITDQ-ITQIRSSG---------AGLPSSRPAPR 115
              A    N  S   S++       H+I  Q I Q  SS           G+PS RPA R
Sbjct: 62  RTGAKHGVNGGSSKRSYRSEYSSPHHQIKQQEINQTTSSSIPPAADKGKTGVPSPRPATR 121

BLAST of Lsi05G022010 vs. TAIR10
Match: AT1G02070.1 (AT1G02070.1 unknown protein)

HSP 1 Score: 95.1 bits (235), Expect = 2.9e-20
Identity = 51/130 (39.23%), Positives = 70/130 (53.85%), Query Frame = 1

Query: 7   LCCLCGDVGFPAKLFRCLDCSNRFQHSYCSNYYCGESGDAIVQVCDWCRSEQR------- 66
           +CC+CGDVGF  KLF C  C  RFQHSYCSNYY G+  +   ++CDWCRS+ R       
Sbjct: 4   VCCMCGDVGFSDKLFSCGHCRCRFQHSYCSNYY-GQFAEP-TEICDWCRSDDRKLSNVAR 63

Query: 67  -----THRPASATANNNSGT----------MSHKDHKITDQITQIRSSGAGLPSSRPAPR 115
                + +P+S+    N  +          + H +++       +   G G+ S + A R
Sbjct: 64  HGGSSSKKPSSSVKYENDFSNRSEYSPGHRIKHNNNRHDQVAKGVAGDGGGVTSPKTATR 123

BLAST of Lsi05G022010 vs. NCBI nr
Match: gi|659074774|ref|XP_008437789.1| (PREDICTED: uncharacterized protein LOC103483122 [Cucumis melo])

HSP 1 Score: 181.4 bits (459), Expect = 8.8e-43
Identity = 87/114 (76.32%), Positives = 92/114 (80.70%), Query Frame = 1

Query: 1   MDLNRPLCCLCGDVGFPAKLFRCLDCSNRFQHSYCSNYYCGESGDAIVQVCDWCRSEQRT 60
           MDLN PLCCLCGDVGFPA LFRC +CSNRFQHSYCSNYY GESGDAI++VCDWCRSEQRT
Sbjct: 41  MDLNPPLCCLCGDVGFPANLFRCTNCSNRFQHSYCSNYYSGESGDAIIRVCDWCRSEQRT 100

Query: 61  HRPASATANNNSGTMSHKDHKITDQITQIRSSGAGLPSSRPAPRRYKLLKDVMC 115
            RPA+        T S KD K    IT+IRSS AGLPS RPAPRRYKLLKDVMC
Sbjct: 101 RRPAAFAT-----TTSQKDRK----ITEIRSSAAGLPSPRPAPRRYKLLKDVMC 145

BLAST of Lsi05G022010 vs. NCBI nr
Match: gi|449433002|ref|XP_004134287.1| (PREDICTED: uncharacterized protein LOC101222685 [Cucumis sativus])

HSP 1 Score: 180.3 bits (456), Expect = 2.0e-42
Identity = 86/114 (75.44%), Positives = 93/114 (81.58%), Query Frame = 1

Query: 1   MDLNRPLCCLCGDVGFPAKLFRCLDCSNRFQHSYCSNYYCGESGDAIVQVCDWCRSEQRT 60
           MDLN PLCCLCGDVGFPAKLFRC +CSNRFQHSYCSNYYCGESGDA ++VCDWCRSEQRT
Sbjct: 1   MDLNPPLCCLCGDVGFPAKLFRCTNCSNRFQHSYCSNYYCGESGDATIRVCDWCRSEQRT 60

Query: 61  HRPASATANNNSGTMSHKDHKITDQITQIRSSGAGLPSSRPAPRRYKLLKDVMC 115
            RPA A+      T S K +KIT++    RSS  GLPS RPAPRRYKLLKDVMC
Sbjct: 61  CRPAFAS------TTSQKSNKITER----RSSAVGLPSPRPAPRRYKLLKDVMC 104

BLAST of Lsi05G022010 vs. NCBI nr
Match: gi|848861144|ref|XP_012831396.1| (PREDICTED: uncharacterized protein LOC105952400 [Erythranthe guttata])

HSP 1 Score: 122.9 bits (307), Expect = 3.7e-25
Identity = 62/111 (55.86%), Positives = 76/111 (68.47%), Query Frame = 1

Query: 5   RPLCCLCGDVGFPAKLFRCLDCSNRFQHSYCSNYYCGESGD-AIVQVCDWCRSEQRTHRP 64
           R +CCLCGDVGF  KLFRC  C  RFQHSYCSNY C E  + A++QVCDWCR ++R +  
Sbjct: 5   RTVCCLCGDVGFSDKLFRCSKCLTRFQHSYCSNYNCREYAEAAVLQVCDWCRCDEREYNI 64

Query: 65  ASATANNNSGTMSHKDHKITDQITQIRSSGAGLPSSRPAPRRYKLLKDVMC 115
           +S+  NN+     +   KI  Q T  +S+G  LPS RP+ RRYKLLKDVMC
Sbjct: 65  SSSKNNNSHIRSPYSGEKIKQQ-TPEKSTGK-LPSPRPSARRYKLLKDVMC 113

BLAST of Lsi05G022010 vs. NCBI nr
Match: gi|743788321|ref|XP_011033224.1| (PREDICTED: uncharacterized protein LOC105131778 [Populus euphratica])

HSP 1 Score: 117.1 bits (292), Expect = 2.0e-23
Identity = 65/127 (51.18%), Positives = 79/127 (62.20%), Query Frame = 1

Query: 7   LCCLCGDVGFPAKLFRCLDCSNRFQHSYCSNYYCGESGDAIVQVCDWCRSEQRTHRPASA 66
           +CC+CGDVGFP KLFRC  C NRFQH YCSNYY GE  + I Q CDWC+SE+R  R  ++
Sbjct: 7   VCCMCGDVGFPDKLFRCNKCRNRFQHLYCSNYY-GEFSEPIEQ-CDWCQSEERNARHGNS 66

Query: 67  T----ANNNSGTMSHK------DHKI---------TDQITQIRSSGAGLPSSRPAPRRYK 115
           +    A ++SGT+  K      DHKI         T    Q   + +G+PS RP  RRYK
Sbjct: 67  SKKSVAEHDSGTLVTKRSEYSGDHKIKQHDREENSTTSSDQKGKNPSGIPSPRPTTRRYK 126

BLAST of Lsi05G022010 vs. NCBI nr
Match: gi|566202783|ref|XP_006375260.1| (hypothetical protein POPTR_0014s05730g [Populus trichocarpa])

HSP 1 Score: 117.1 bits (292), Expect = 2.0e-23
Identity = 65/127 (51.18%), Positives = 79/127 (62.20%), Query Frame = 1

Query: 7   LCCLCGDVGFPAKLFRCLDCSNRFQHSYCSNYYCGESGDAIVQVCDWCRSEQRTHRPASA 66
           +CC+CGDVGFP KLFRC  C NRFQH YCSNYY GE  + I Q CDWC+SE+R  R  ++
Sbjct: 7   VCCMCGDVGFPDKLFRCNKCRNRFQHLYCSNYY-GEFSEPIEQ-CDWCQSEERNARHGNS 66

Query: 67  T----ANNNSGTMSHK------DHKI---------TDQITQIRSSGAGLPSSRPAPRRYK 115
           +    A ++SGT+  K      DHKI         T    Q   + +G+PS RP  RRYK
Sbjct: 67  SKKSGAEHDSGTLVTKRSEYSGDHKIKQHDREENSTTSSDQKGKNPSGIPSPRPTTRRYK 126

The following BLAST results are available for this feature:

vs. TrEMBL
Match Name        E-value  Identity (%)  Description
A0A0A0L5P1_CUCSA  1.4e-42  75.44         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G117940 PE=4 SV=1
A0A022RQV6_ERYGU  2.6e-25  55.86         Uncharacterized protein OS=Erythranthe guttata GN=MIMGU_mgv1a016574mg PE=4 SV=1
A0A061DUE3_THECC  1.4e-23  49.58         Late cornified envelope protein 1E OS=Theobroma cacao GN=TCM_005618 PE=4 SV=1
A9PG29_POPTR      1.4e-23  51.18         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0014s05730g PE=2 SV=1
D7LRR6_ARALL      9.2e-23  49.23         Zinc ion binding protein OS=Arabidopsis lyrata subsp. lyrata GN=ARALYDRAFT_90758...

vs. TAIR10
Match Name   E-value  Identity (%)  Description
AT3G60520.1  1.0e-25  48.46         unknown protein
AT1G02070.1  2.9e-20  39.23         unknown protein

vs. NCBI nr
Match Name                        E-value  Identity (%)  Description
gi|659074774|ref|XP_008437789.1|  8.8e-43  76.32         PREDICTED: uncharacterized protein LOC103483122 [Cucumis melo]
gi|449433002|ref|XP_004134287.1|  2.0e-42  75.44         PREDICTED: uncharacterized protein LOC101222685 [Cucumis sativus]
gi|848861144|ref|XP_012831396.1|  3.7e-25  55.86         PREDICTED: uncharacterized protein LOC105952400 [Erythranthe guttata]
gi|743788321|ref|XP_011033224.1|  2.0e-23  51.18         PREDICTED: uncharacterized protein LOC105131778 [Populus euphratica]
gi|566202783|ref|XP_006375260.1|  2.0e-23  51.18         hypothetical protein POPTR_0014s05730g [Populus trichocarpa]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lsi05G022010.1  Lsi05G022010.1  mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term  IPR Description   Source   Source Term    Source Description   Alignment
None      No IPR available  unknown  Coil           Coil                 coord: 112..114, score:
None      No IPR available  PANTHER  PTHR33779      FAMILY NOT NAMED     coord: 8..114, score: 1.2
None      No IPR available  PANTHER  PTHR33779:SF2  SUBFAMILY NOT NAMED  coord: 8..114, score: 1.2