LsiUNG000940 (gene) Bottle gourd (USVL1VR-Ls)

Name: LsiUNG000940
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls))
Description: BNR/Asp-box repeat family protein
Location: chr00 : 2645765 .. 2650675 (+)
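The location string can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: `parse_location` is a hypothetical helper, and the length calculation assumes the coordinates are 1-based and inclusive (as in GFF3), which the page itself does not state.

```python
import re

def parse_location(text):
    """Split 'seqid : start .. end (strand)' into typed fields."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("chr00 : 2645765 .. 2650675 (+)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
length = end - start + 1
print(seqid, start, end, strand, length)
```

Under that assumption the feature spans 4911 bp on the plus strand of chr00.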
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, polypeptide, CDS
CAACATTCGTTAAGCGGAGGATTATTTATCAATTATCCAATTGTGAGCTGGTTCGGTCATGCTTACAAACAACACAACAATACAATACAAGTTATATCCAAAGTCAACTATTTCATGCTCAGCTCTCGATTTTTCTCGTCAATTTCCTGTCTGTATCAGACTATCAAATTTCAGAGTTTGCGAGTTCCGACTGTTTAGGTACAAAACCATTCAACTGCTTCCCTCTTACTTCCATTCTTTTCTGCTCTTACTTCTTACCTCCCTTTTTATCCAAAATGTAATGTCTTCATCTTCCTAAGTTGACTTCACTTGAAGATGCGGAAATCAGTGGGGACTCTTCTGATTTTCTTTTTCCTGCTCTCCATTTTCAGTATTTCATCCCCAGGATTTGGAGACACCTTCACCTCTAAATTCAATTTGTTAACCCCAAAATTCTCCTCGCGTTTTCATCGCTTATATCGTCATTCGCATATATTTGGACCAAGGTTGTTAATTTTTGTTTTTTTTATCCCTTTTTCTTTTACACCCACAATTGTAGATTTTCATCTTGGATGAAAATTTTGAGATACTCAATTTTTGGATTCTGTTTCGAGCTGGTGTTGTATGTTAATTCTTTAATATGAACTAAGTGTTTGATTAAATGCGCAAGTGAGAACTTTTTTTTTCCTTATGGTCAACGTGTTGTGCAGAGCATTAGTTAGAGGACATTCGAATATAGCTTCAGTCGTTAGAAGTTTGAAAATGGGAAAAGGGCCCTTGCTAGAGGAGTTTACATTTCCTGCTAATTCTGTTCCTTTCAACAACTGTCATGCTTCTACAATTGTGGAGGTTTGGTTTGATGGAGAAACTTCATTTGATCTTATAATAATTTTGTTAGGGGAGAATCTGAGTTTTTTCTTTCTGGATAGAGCAGGTTGATAAAGATCATTACTTGGTTGCATATTTTGGGGGTACATTTGAGGGCGCACCGGATGTGAAGATTTGGTTGCAGACATATAAGGTAAACAAGCTGATCCGATTTATTATTCCCCAATGCCTATGAAAGAAGGCCTAATTACTTGAACATTGTTCTCATCTGAGTTATGTTCTGCTTGATTGCTGATATGCTTTCTCAAATTATACTACTGTAAAATTCACTCTCCCTCCTTGCCTTTGTTTATGTCAAAACAGGCAGCTGGCTTTTTTATCCATCTTATTGAGTGGTTTAGAAGAGATTTATATAACTTTTTAACTTTCCTCGTTGATGTTCATTTCTTGCTAGATTGTCCACTTGTAATTTTTCGCTAGCATTCATTAGAAGGGCTGCTAACAATTTTACTCAAGTTATGCAGACATAGTTGGACAAGCAATAACTCTTTTTATCTTGCCATGCCCACGTTGAGGCCCTTATTTCTGTAATGATTTCTTGTTAAAATTATTGGTAAATTTAACTTGCAGAACGGCTCCTGGCACCCACCAATCGTGGCTGATGAGGAGCCTGATGTTCCAATGTGGAACCCTGTCTTGTTCAAACTTCCATCAGATGAGCTGCTTTTGTTTTATAGAGTGGGCCAAGACGTTCAAAGGTATATCATTTTTAAGCTCATAATGGCTTTCTGAAATAACAGCTCTACACAAGTTAATTTCTTTTATATTTGATTCAAGTTTCAATAAATTTATTGGACCATACGACTCCCATGGTGGCAGTGGCACTGTCTTTTCAGCCAGCAATTGCGAGACATGGATTTGGAATCCATTTCATTTGATAACATTATTTAAATCCATGTCATTTGAAAACATTATTTATCATATCAAAGAGTTCAGTATCAGGGCCGCCAAACCCATGTTACGGAATTCTTAATCGTCAAGAAGCTAATAATTGCTACCAAGGATATATATAAGATGTCTTATAAATAAAATGATCATAGGAAGTTGATGTGCTCAAGTTAGTTGGTTTCATAAATTATGCTTTCTAAAAAGAGCCTTAATACCTTTTCCTTGAAATCCCAAGGCACCCTTCT
TTTGACTGGTTCACTCCAGTCTCCAGTGACCAACTGTCTAACATATTGAATAACTTTTTAAGTGAAAAGTTAATCCCTTCTGCTTCCAATTTTGTTTGATTCTATCTTATGCTGATGGAGATAACAATCAGATGGAGTGGGTGCATGAAGCGGTCATATGACAAAGGCGTTACTTGGACAGCAAGAGAACAGCTTCCTCCTGGTATCTTAGGACCAATAAAGAACAAGGTATAATATATGTGCTGATATCTACCATTTTCAATTAAACTGTTTTTGCTCAAGAAGCTGTCTTCTTTCTTTGTCTGATGCTGTAGCCAATCTTGCTGGAAAATGGGGTTTTGCTCTGCGGATCTTCAGTTGAAAGTTGGAACTCTTGGGGTGCATGGATGGAGGTTATTATGAATAGTTCTTTAGTTTTGACTTGTCCTAAAATCTGGAAAATGTGACTGACATGTCAATATAACAGGCCACATCTGATTCAGGTAGAACATGGAGAAAGTTTGGCCCAATATACATGGAAAATCGATCTCTCAGTGTCATTCAACCAGCTCCTTACCAGACTGCAAATGGAAATCTGCGTGTTTTACTTCGATCCTTCACAGGTATTGGTAGTATTTGTATGTCAGAATCCACTGATGGTGGTCACAACTGGAGCTATGCAAAACCTACAAATCTTCCTAATCCAAACTCAGGTAACATGGTTGAGCTACATGCATTACCTGTAACTGCAAGCATCTTTGTACTGTTAGGAATGCAACTAGGAATATTAAGGTATATAACTAGATAGGGAGGCAGTTGTGGAGTTCAGTTATAAATAGAGGGAGTTAGGCATTTGGTGGGTGAATTAGGGTTTGGGAGATATCTCTAGTAATTTCTTCTTTGTATTGCAATATATTAGTTTCGTATGTTTTCTGTATTTGGATACCTAGCACATACATGAACGTCAGTCTTGTGAGCATCTTCATTATGAAATTGAATTACTTCCCAGATTTAGCAATTTCTGATCTTATAAGCTTGTCATGCATTGTCACAAGCAGAGATGGCTGATCTTACTACAACTTAGATTCCAACATTTCATCTTTGACTATGTAATCTTATCAATAGGTATTGATGGAGTCAAACTGAGGGATGGTCGTTTATTACTGGTTTATAACACGGTATCGAGAGGTGTGCTTAAGGTTGGACTGTCTCTGGATGATGGTGACTCATGGCTAGATGTCATGACCTTGGAGGATGAGCCAGGAAAGGAATTCTCATATCCAGCCGTCATTGAGGCTAGTGATGGCTCAATTCACATCACATACACATACAAGAGAGAGCAGATTAAGGTATGAAGAGGTTGATATCTGATCTAAAAGATACATTTGTGACCAACAACATCAAGTTTTTGTTATTGCCTACTTTCCCATTTTCCATTGATCGCTCTATTAGATACCATAAAACTTCATGTTTTTTTTTCTTTAGTTCTGAAATTTCTTAACCTATGTTTCTATACTTGATTGCCGGAGATTAACTCCATCATCATTTTTTGGGGCAGCATGTTGTCTTCAAACTGAAAATGCATGGGAAACATTGACGATGATAACTAAGAAAGCAAAGCTCTGTATCTGTTTGAAAGCGATGGTGAAGCTGTGATATTGGAATAAAGTGCTGAGTTATGGTGCATCATTTGTACATTTAGGTAAATATGATTGTTTTTACTGTGGTGTATCCATCAAGATGACTCGGATTGAGCTTATGGCAGAACTCTTGGATGTATTACTATAATTTTCTTTTTGGGTGAGAATGGATTGTAGTTGTAATGTTGTATCAGTTAAAGACGTGTTTGAGTGAAAGAATAAGTTAATATCATTCTTATAGAATCTATCTGTAAAATAATCCTTCGGGAATTTCCAAGATCGTTAAATGTTAGGAGTAGTTGCAAATATAGTAATTAGCAGATATAGCACAGTGCAAAAAAATTTTACAAATATAACAAAATTTAGATCAAACTCTGTGAG
TATTAGTGATGAACCATATCATTGATAGAGTCTATCACTAATAAGAATCGATCAACAATAGTCTATTATTGAATGATGGATTTTGCTATATTTGTAGTTTTCTTTTTAAAATATTGCTATACACGTGATTATTATCCCTAAAGGTGCTACTGATTGCAATTAGGAGTGTTACAAAAAACTGAAAAATCGAAAACCAAACCAATCCAAATTGATTCATTGGTTTGGTTTGGTTTTTTAGTATAAAATTGGACATATTGGTTTTTATTTGGAGAAAACTAAAAAGTATTGGTTCAGATCGAATTTTGATTGAAAAAACCAATCAAAACCAAACCAAATAGATGGTATGTATACTCTTAAAAATAAACTCTAACAATATTAGGTATTTTTTAATGGTTTGATTATTATGGTGCATGTATATTTATTGTTTAGAAAAAAAAGATCCATTATTATGGAGTGTGTATGATATTTATTTAGAAAGTACTAAGAAAAAAAAATTAGAATGTCCAATTTCAATCTATAAAATAGAAAATTGACCCAAATTTAATTTAAAAATAGCAAAAATGACAAATTAAAAAAAAAGAAATAAAAATTAAAATTTGGAAGAGAAAAAAATAAAATTTCAAAACCTAAAACCAAACCAAGTAAAAACTAATCCATTGGTTTTGTTCTTCATTAGACATCAGTTTAGATCTCTTCTTTATAAAATCAATTAGTTTAATTTGGTTCTTAGTTGACCTCAAAACCGACAAAATCAAACCAACAATCACTCTAATTGTAATTACCGATATCTAAACCCAAATTGGTTGGGCTCAAAACTCATACTTCTATAGGCCTTAGGTTGGATTCTGAGAGTAACTGCGCCACTTCAGGATGCACTAAGACAATGCAGAGGCCCAAAAGGTCGGCCTTAA

mRNA sequence

CAACATTCGTTAAGCGGAGGATTATTTATCAATTATCCAATTGTGAGCTGGTTCGGTCATGCTTACAAACAACACAACAATACAATACAAGTTATATCCAAAGTCAACTATTTCATGCTCAGCTCTCGATTTTTCTCGTCAATTTCCTGTCTGTATCAGACTATCAAATTTCAGAGTTTGCGAGTTCCGACTGTTTAGTATTTCATCCCCAGGATTTGGAGACACCTTCACCTCTAAATTCAATTTGTTAACCCCAAAATTCTCCTCGCGTTTTCATCGCTTATATCGTCATTCGCATATATTTGGACCAAGAGCATTAGTTAGAGGACATTCGAATATAGCTTCAGTCGTTAGAAGTTTGAAAATGGGAAAAGGGCCCTTGCTAGAGGAGTTTACATTTCCTGCTAATTCTGTTCCTTTCAACAACTGTCATGCTTCTACAATTGTGGAGGTTGATAAAGATCATTACTTGGTTGCATATTTTGGGGGTACATTTGAGGGCGCACCGGATGTGAAGATTTGGTTGCAGACATATAAGAACGGCTCCTGGCACCCACCAATCGTGGCTGATGAGGAGCCTGATGTTCCAATGTGGAACCCTGTCTTGTTCAAACTTCCATCAGATGAGCTGCTTTTGTTTTATAGAGTGGGCCAAGACGTTCAAAGATGGAGTGGGTGCATGAAGCGGTCATATGACAAAGGCGTTACTTGGACAGCAAGAGAACAGCTTCCTCCTGGTATCTTAGGACCAATAAAGAACAAGCCAATCTTGCTGGAAAATGGGGTTTTGCTCTGCGGATCTTCAGTTGAAAGTTGGAACTCTTGGGGTGCATGGATGGAGGCCACATCTGATTCAGGTAGAACATGGAGAAAGTTTGGCCCAATATACATGGAAAATCGATCTCTCAGTGTCATTCAACCAGCTCCTTACCAGACTGCAAATGGAAATCTGCGTGTTTTACTTCGATCCTTCACAGGTATTGATGGAGTCAAACTGAGGGATGGTCGTTTATTACTGGTTTATAACACGGTATCGAGAGGTGTGCTTAAGGTTGGACTGTCTCTGGATGATGGTGACTCATGGCTAGATGTCATGACCTTGGAGGATGAGCCAGGAAAGGAATTCTCATATCCAGCCGTCATTGAGGCTAGTGATGGCTCAATTCACATCACATACACATACAAGAGAGAGCAGATTAAGGCCTTAGGTTGGATTCTGAGAGTAACTGCGCCACTTCAGGATGCACTAAGACAATGCAGAGGCCCAAAAGGTCGGCCTTAA

Coding sequence (CDS)

ATGCTTACAAACAACACAACAATACAATACAAGTTATATCCAAAGTCAACTATTTCATGCTCAGCTCTCGATTTTTCTCGTCAATTTCCTGTCTGTATCAGACTATCAAATTTCAGAGTTTGCGAGTTCCGACTGTTTAGTATTTCATCCCCAGGATTTGGAGACACCTTCACCTCTAAATTCAATTTGTTAACCCCAAAATTCTCCTCGCGTTTTCATCGCTTATATCGTCATTCGCATATATTTGGACCAAGAGCATTAGTTAGAGGACATTCGAATATAGCTTCAGTCGTTAGAAGTTTGAAAATGGGAAAAGGGCCCTTGCTAGAGGAGTTTACATTTCCTGCTAATTCTGTTCCTTTCAACAACTGTCATGCTTCTACAATTGTGGAGGTTGATAAAGATCATTACTTGGTTGCATATTTTGGGGGTACATTTGAGGGCGCACCGGATGTGAAGATTTGGTTGCAGACATATAAGAACGGCTCCTGGCACCCACCAATCGTGGCTGATGAGGAGCCTGATGTTCCAATGTGGAACCCTGTCTTGTTCAAACTTCCATCAGATGAGCTGCTTTTGTTTTATAGAGTGGGCCAAGACGTTCAAAGATGGAGTGGGTGCATGAAGCGGTCATATGACAAAGGCGTTACTTGGACAGCAAGAGAACAGCTTCCTCCTGGTATCTTAGGACCAATAAAGAACAAGCCAATCTTGCTGGAAAATGGGGTTTTGCTCTGCGGATCTTCAGTTGAAAGTTGGAACTCTTGGGGTGCATGGATGGAGGCCACATCTGATTCAGGTAGAACATGGAGAAAGTTTGGCCCAATATACATGGAAAATCGATCTCTCAGTGTCATTCAACCAGCTCCTTACCAGACTGCAAATGGAAATCTGCGTGTTTTACTTCGATCCTTCACAGGTATTGATGGAGTCAAACTGAGGGATGGTCGTTTATTACTGGTTTATAACACGGTATCGAGAGGTGTGCTTAAGGTTGGACTGTCTCTGGATGATGGTGACTCATGGCTAGATGTCATGACCTTGGAGGATGAGCCAGGAAAGGAATTCTCATATCCAGCCGTCATTGAGGCTAGTGATGGCTCAATTCACATCACATACACATACAAGAGAGAGCAGATTAAGGCCTTAGGTTGGATTCTGAGAGTAACTGCGCCACTTCAGGATGCACTAAGACAATGCAGAGGCCCAAAAGGTCGGCCTTAA

Protein sequence

MLTNNTTIQYKLYPKSTISCSALDFSRQFPVCIRLSNFRVCEFRLFSISSPGFGDTFTSKFNLLTPKFSSRFHRLYRHSHIFGPRALVRGHSNIASVVRSLKMGKGPLLEEFTFPANSVPFNNCHASTIVEVDKDHYLVAYFGGTFEGAPDVKIWLQTYKNGSWHPPIVADEEPDVPMWNPVLFKLPSDELLLFYRVGQDVQRWSGCMKRSYDKGVTWTAREQLPPGILGPIKNKPILLENGVLLCGSSVESWNSWGAWMEATSDSGRTWRKFGPIYMENRSLSVIQPAPYQTANGNLRVLLRSFTGIDGVKLRDGRLLLVYNTVSRGVLKVGLSLDDGDSWLDVMTLEDEPGKEFSYPAVIEASDGSIHITYTYKREQIKALGWILRVTAPLQDALRQCRGPKGRP
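The relationship between the coding sequence and the protein sequence can be checked by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which the record does not state explicitly; `translate` is an illustrative helper, and only the first and last codons of the CDS above are used as inputs.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 12 codons and last 5 codons of the CDS listed above.
cds_start = "ATGCTTACAAACAACACAACAATACAATACAAGTTA"
cds_end = "AAAGGTCGGCCTTAA"
print(translate(cds_start))  # MLTNNTTIQYKL
print(translate(cds_end))    # KGRP
```

Both fragments agree with the start (`MLTNNTTIQYKL...`) and end (`...KGRP`) of the protein sequence shown above.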
BLAST of LsiUNG000940 vs. TrEMBL
Match: R0F0G7_9BRAS (Uncharacterized protein OS=Capsella rubella GN=CARUB_v10028058mg PE=4 SV=1)

HSP 1 Score: 461.1 bits (1185), Expect = 1.4e-126
Identity = 210/282 (74.47%), Positives = 239/282 (84.75%), Query Frame = 1

Query: 108 LLEEFTFPANSVPFNNCHASTIVEVDKDHYLVAYFGGTFEGAPDVKIWLQTYKNGSWHPP 167
           +LE FT+PANS PF +CHASTIVEVDKDH+L AYFGG+ EGAPDVKIWLQ +K+G W  P
Sbjct: 9   VLETFTYPANSAPFKSCHASTIVEVDKDHFLAAYFGGSREGAPDVKIWLQHFKDGQWDTP 68

Query: 168 IVADEEPDVPMWNPVLFKLPSDELLLFYRVGQDVQRWSGCMKRSYDKGVTWTAREQLPPG 227
           ++ DE+P VPMWNPVLFKLPS +LLLFY++GQ+VQ+WSGCMKRS DKG TWT REQLPPG
Sbjct: 69  VIVDEQPGVPMWNPVLFKLPSQQLLLFYKIGQEVQKWSGCMKRSNDKGRTWTEREQLPPG 128

Query: 228 ILGPIKNKPILLENGVLLCGSSVESWNSWGAWMEATSDSGRTWRKFGPIYMENRSLSVIQ 287
           ILGPIKNKPIL E+G LLCGSSVESWNSWGAWME TSD+GR+WRK GPIY++ +SLSVIQ
Sbjct: 129 ILGPIKNKPILFEDGTLLCGSSVESWNSWGAWMEVTSDAGRSWRKQGPIYIQGKSLSVIQ 188

Query: 288 PAPYQTANGNLR--------VLLRSFTGIDGVKLRDGRLLLVYNTVSRGVLKVGLSLDDG 347
           P PYQTA G LR        VL    +GIDGVKL+DGRL+L YNT SRGVLKVG+SLDDG
Sbjct: 189 PVPYQTAAGTLRNWSFAVPTVLPNPNSGIDGVKLKDGRLVLAYNTDSRGVLKVGVSLDDG 248

Query: 348 DSWLDVMTLEDEPGKEFSYPAVIEASDGSIHITYTYKREQIK 382
           DSW DV+TLED PG EFSYPAVIEA DG++H+TYTY R QIK
Sbjct: 249 DSWTDVLTLEDSPGMEFSYPAVIEAGDGNVHVTYTYNRTQIK 290
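The percentages on an HSP summary line are the match counts divided by the alignment length. As a rough sketch (the helper name `hsp_percentages` is an assumption, not part of any BLAST tool), the line for this hit can be re-derived as follows; the labels are captured as they appear in the text, whatever their spelling.

```python
import re

def hsp_percentages(line):
    """Recompute 'Label = n/m' fractions on an HSP line as percentages."""
    stats = {}
    for label, num, den in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        stats[label] = round(100 * int(num) / int(den), 2)
    return stats

line = "Identity = 210/282 (74.47%), Positives = 239/282 (84.75%), Query Frame = 1"
stats = hsp_percentages(line)
print(stats)
```

"Query Frame = 1" is skipped because it has no `n/m` fraction; the two recomputed values match the percentages printed in parentheses.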

BLAST of LsiUNG000940 vs. TrEMBL
Match: A0A059CSZ9_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_C02440 PE=4 SV=1)

HSP 1 Score: 456.4 bits (1173), Expect = 3.5e-125
Identity = 212/328 (64.63%), Positives = 254/328 (77.44%), Query Frame = 1

Query: 84  PRALVRGHSNIASVVRSLKMGKGPLLEEFTFPANSVPFNNCHASTIVEVDKDHYLVAYFG 143
           P +++   SN+    R     +GP+ ++FTFPA S PFNNCHASTIVEV++DH+LVAYFG
Sbjct: 66  PTSIMAQSSNLTKDSRV----EGPVEQDFTFPAKSAPFNNCHASTIVEVNEDHFLVAYFG 125

Query: 144 GTFEGAPDVKIWLQTYKNGSWHPPIVADEEPDVPMWNPVLFKLPSDELLLFYRVGQDVQR 203
           GT EGAPDVKIWLQ Y++G WHPPI+ DE+ DVPMWNPVL+K PS ELLLFY++GQD QR
Sbjct: 126 GTEEGAPDVKIWLQRYQHGRWHPPIIIDEQEDVPMWNPVLYKFPSGELLLFYKIGQDFQR 185

Query: 204 WSGCMKRSYDKGVTWTAREQLPPGILGPIKNKPILLENGVLLCGSSVESWNSWGAWMEAT 263
           WSG +KRSY++GV+W+ REQLPPGILGPIKNKPILL+NG LLCGSSVESWNSWGAWME T
Sbjct: 186 WSGFLKRSYNQGVSWSEREQLPPGILGPIKNKPILLDNGDLLCGSSVESWNSWGAWMEIT 245

Query: 264 SDSGRTWRKFGPIYMENRSLSVIQPAPYQTANGNLRVLLRSFT----------------- 323
            D+GRTWRK+GPIY++N SLSVIQP P++TA G LRVLLRSFT                 
Sbjct: 246 PDAGRTWRKYGPIYLKNESLSVIQPVPFKTARGTLRVLLRSFTGIDRICMSESHDGGMNW 305

Query: 324 -------------GIDGVKLRDGRLLLVYNTVSRGVLKVGLSLDDGDSWLDVMTLEDEPG 382
                        GIDGVKL++G L++ YNT+SRG+LKV +SLDDGDSW + +TLE+   
Sbjct: 306 GYAEFTELPNPNSGIDGVKLKNGLLIVAYNTISRGILKVAVSLDDGDSWHEALTLEENLE 365

BLAST of LsiUNG000940 vs. TrEMBL
Match: A0A059CSB9_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_C02440 PE=4 SV=1)

HSP 1 Score: 455.3 bits (1170), Expect = 7.8e-125
Identity = 208/307 (67.75%), Positives = 246/307 (80.13%), Query Frame = 1

Query: 105 KGPLLEEFTFPANSVPFNNCHASTIVEVDKDHYLVAYFGGTFEGAPDVKIWLQTYKNGSW 164
           +GP+ ++FTFPA S PFNNCHASTIVEV++DH+LVAYFGGT EGAPDVKIWLQ Y++G W
Sbjct: 14  EGPVEQDFTFPAKSAPFNNCHASTIVEVNEDHFLVAYFGGTEEGAPDVKIWLQRYQHGRW 73

Query: 165 HPPIVADEEPDVPMWNPVLFKLPSDELLLFYRVGQDVQRWSGCMKRSYDKGVTWTAREQL 224
           HPPI+ DE+ DVPMWNPVL+K PS ELLLFY++GQD QRWSG +KRSY++GV+W+ REQL
Sbjct: 74  HPPIIIDEQEDVPMWNPVLYKFPSGELLLFYKIGQDFQRWSGFLKRSYNQGVSWSEREQL 133

Query: 225 PPGILGPIKNKPILLENGVLLCGSSVESWNSWGAWMEATSDSGRTWRKFGPIYMENRSLS 284
           PPGILGPIKNKPILL+NG LLCGSSVESWNSWGAWME T D+GRTWRK+GPIY++N SLS
Sbjct: 134 PPGILGPIKNKPILLDNGDLLCGSSVESWNSWGAWMEITPDAGRTWRKYGPIYLKNESLS 193

Query: 285 VIQPAPYQTANGNLRVLLRSFT------------------------------GIDGVKLR 344
           VIQP P++TA G LRVLLRSFT                              GIDGVKL+
Sbjct: 194 VIQPVPFKTARGTLRVLLRSFTGIDRICMSESHDGGMNWGYAEFTELPNPNSGIDGVKLK 253

Query: 345 DGRLLLVYNTVSRGVLKVGLSLDDGDSWLDVMTLEDEPGKEFSYPAVIEASDGSIHITYT 382
           +G L++ YNT+SRG+LKV +SLDDGDSW + +TLE+    EFSYPAVIEASDGS+H+TYT
Sbjct: 254 NGLLIVAYNTISRGILKVAVSLDDGDSWHEALTLEENLEMEFSYPAVIEASDGSVHVTYT 313

BLAST of LsiUNG000940 vs. TrEMBL
Match: V4WIY4_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10008429mg PE=4 SV=1)

HSP 1 Score: 448.7 bits (1153), Expect = 7.3e-123
Identity = 220/333 (66.07%), Positives = 250/333 (75.08%), Query Frame = 1

Query: 85  RALVRGHSN-IASVVRSLKMGK-----GPLLEEFTFPANSVPFNNCHASTIVEVDKDHYL 144
           R + R H N I S+  SL M K     G + EEFTFPANS PF +CHASTIVEVDK H+L
Sbjct: 79  RKVARVHPNTITSISESLNMKKDCGIKGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFL 138

Query: 145 VAYFGGTFEGAPDVKIWLQTYKNGSWHPPIVADEEPDVPMWNPVLFKLPSDELLLFYRVG 204
           VAYFGG+ EGAPDVKIWLQT+K+G W  PI+ADEEP+VPMWNPVLFKLPS+ LLLFY++G
Sbjct: 139 VAYFGGSCEGAPDVKIWLQTFKDGRWQSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIG 198

Query: 205 QDVQRWSGCMKRSYDKGVTWTAREQLPPGILGPIKNKPILLENGVLLCGSSVESWNSWGA 264
           Q+VQ+WSGCMKRSY+KGVTW+ REQLPPGILGP KNKPILLENG+LLCGSSVESWNSWG+
Sbjct: 199 QEVQKWSGCMKRSYNKGVTWSEREQLPPGILGPSKNKPILLENGLLLCGSSVESWNSWGS 258

Query: 265 WMEATSDSGRTWRKFGPIYMENRSLSVIQPAPYQTANGNLRVLLRSFTGIDGVKLR---D 324
           WME T D+GR+WRK+GPIY+ N SLSVIQP P+ TAN  LRVL+RSF GI  V +    D
Sbjct: 259 WMEVTVDAGRSWRKYGPIYIPNESLSVIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCD 318

Query: 325 GRLL---------------------------LVYNTVSRGVLKVGLSLDDGDSWLDVMTL 382
           G L                            L YNTVSRGVLKV LS DDGDSW D +TL
Sbjct: 319 GGLTWSYAKPTQLLNPNSGIDGVKLKDGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTL 378

BLAST of LsiUNG000940 vs. TrEMBL
Match: A0A067F4S0_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g020796mg PE=4 SV=1)

HSP 1 Score: 443.4 bits (1139), Expect = 3.1e-121
Identity = 211/307 (68.73%), Positives = 239/307 (77.85%), Query Frame = 1

Query: 105 KGPLLEEFTFPANSVPFNNCHASTIVEVDKDHYLVAYFGGTFEGAPDVKIWLQTYKNGSW 164
           KG + EEFTFPANS PF +CHASTIVEVDK H+LVAYFGG+ EGAPDVKIWLQT+K+G W
Sbjct: 8   KGLVAEEFTFPANSAPFKSCHASTIVEVDKGHFLVAYFGGSCEGAPDVKIWLQTFKDGRW 67

Query: 165 HPPIVADEEPDVPMWNPVLFKLPSDELLLFYRVGQDVQRWSGCMKRSYDKGVTWTAREQL 224
             PI+ADEEP+VPMWNPVLFKLPS+ LLLFY++GQ+VQ+WSGCMKRSY+KGVTW+ REQL
Sbjct: 68  QSPIIADEEPNVPMWNPVLFKLPSNGLLLFYKIGQEVQKWSGCMKRSYNKGVTWSEREQL 127

Query: 225 PPGILGPIKNKPILLENGVLLCGSSVESWNSWGAWMEATSDSGRTWRKFGPIYMENRSLS 284
           PPGILGP KNKPILLENG+LLCGSSVESWNSWG+WME T D+GR+WRK+GPIY+ N SLS
Sbjct: 128 PPGILGPSKNKPILLENGLLLCGSSVESWNSWGSWMEVTVDAGRSWRKYGPIYIPNESLS 187

Query: 285 VIQPAPYQTANGNLRVLLRSFTGIDGVKLR---DGRLL---------------------- 344
           VIQP P+ TAN  LRVL+RSF GI  V +    DG L                       
Sbjct: 188 VIQPVPFHTANRTLRVLMRSFNGIGRVCMSESCDGGLTWSYAKPTQLLNPNSGIDGVKLK 247

Query: 345 -----LVYNTVSRGVLKVGLSLDDGDSWLDVMTLEDEPGKEFSYPAVIEASDGSIHITYT 382
                L YNTVSRGVLKV LS DDGDSW D +TLE+    EFSYPAVI+ASDGS+HITYT
Sbjct: 248 DGRLLLAYNTVSRGVLKVALSKDDGDSWHDALTLEENLAMEFSYPAVIQASDGSVHITYT 307

BLAST of LsiUNG000940 vs. TAIR10
Match: AT5G57700.3 (AT5G57700.3 BNR/Asp-box repeat family protein)

HSP 1 Score: 372.5 bits (955), Expect = 3.4e-103
Identity = 180/295 (61.02%), Positives = 214/295 (72.54%), Query Frame = 1

Query: 94  IASVVRSLKMGKGP---LLEEFTFPANSVPFNNCHASTIVEVDKDHYLVAYFGGTFEGAP 153
           I  V++S +M +     LLE FTFPA+S PF +CHASTIVEV KDH+L AYFGGT EGAP
Sbjct: 3   ICLVMKSSQMSETEFKVLLETFTFPADSAPFKSCHASTIVEVVKDHFLAAYFGGTREGAP 62

Query: 154 DVKIWLQTYKNGSWHPPIVADEEPDVPMWNPVLFKLPSDELLLFYRVGQDVQRWSGCMKR 213
           DVKIWLQ +K+G W  P++ DEEP VPM+NPVLFKLPS ELLLFY++GQ+VQ+WSGCMKR
Sbjct: 63  DVKIWLQHFKDGQWDSPVIVDEEPGVPMYNPVLFKLPSHELLLFYKIGQEVQKWSGCMKR 122

Query: 214 SYDKGVTWTAREQLPPGILGPIKNKPILLENGVLLCGSSVESWNSWGAWMEATSDSGRTW 273
           SYDKG+TWT REQLPPGILGPIKNKPILLE+G LLCGSSVESWNSWGAWME TSD+GRTW
Sbjct: 123 SYDKGITWTEREQLPPGILGPIKNKPILLEDGTLLCGSSVESWNSWGAWMEVTSDAGRTW 182

Query: 274 RKFGPIYMENRSLSVIQPAPYQTANGNLRVLLRSFTGIDGVKLRDGRLLLVYNTVSRGVL 333
           RK GPIY++ +SLSVIQP PYQTA GNLR+LLRSFTGID + + +               
Sbjct: 183 RKKGPIYIQGKSLSVIQPVPYQTAAGNLRILLRSFTGIDRICISE--------------- 242

Query: 334 KVGLSLDDGDSW-LDVMTLEDEPGKEFSYPAVIEASDGSIHITYTYKREQIKALG 385
               SLD G++W   V T+   P         ++  DG + + Y      +  LG
Sbjct: 243 ----SLDGGENWSFAVPTVLPNPNSGID---GVKLKDGRLVLAYNTDSRGVLKLG 275

BLAST of LsiUNG000940 vs. NCBI nr
Match: gi|659107416|ref|XP_008453663.1| (PREDICTED: uncharacterized protein LOC103494313 isoform X1 [Cucumis melo])

HSP 1 Score: 526.2 bits (1354), Expect = 5.2e-146
Identity = 256/316 (81.01%), Positives = 273/316 (86.39%), Query Frame = 1

Query: 1   MLTNNTTIQYKLYPKSTISCSALDFSRQFPVCIRLSNFRVCEFRLFSISSPGFGDTFTSK 60
           MLTN      + + +S +S   ++F  +    I+    RV    LFSISSPGFG+TFTSK
Sbjct: 1   MLTNK-----RQHFQSQLSSFLVNFLPESHQTIKFQTLRV---PLFSISSPGFGNTFTSK 60

Query: 61  FNLLTPKFSSRFHRLYRHSHIFGPRALVRGHSNIASVVRSLKMGKG-----PLLEEFTFP 120
           FNLLTPKFSSR HRLY HSHIFGPR LVRGHSNI  VVRSLKMGKG     PLLEEFTFP
Sbjct: 61  FNLLTPKFSSRIHRLYHHSHIFGPRILVRGHSNI--VVRSLKMGKGSYVEGPLLEEFTFP 120

Query: 121 ANSVPFNNCHASTIVEVDKDHYLVAYFGGTFEGAPDVKIWLQTYKNGSWHPPIVADEEPD 180
           ANSVPFNNCHASTIVEVDKDHYLVAYFGGT EGAPDVKIWLQ +KNGSWH PIVADEEPD
Sbjct: 121 ANSVPFNNCHASTIVEVDKDHYLVAYFGGTLEGAPDVKIWLQAFKNGSWHSPIVADEEPD 180

Query: 181 VPMWNPVLFKLPSDELLLFYRVGQDVQRWSGCMKRSYDKGVTWTAREQLPPGILGPIKNK 240
           +PMWNPVLFKLPSDELLLFY+VGQDVQ+WSGCMKRSYDKG+TWTAREQLPPGILGPIKNK
Sbjct: 181 IPMWNPVLFKLPSDELLLFYKVGQDVQKWSGCMKRSYDKGITWTAREQLPPGILGPIKNK 240

Query: 241 PILLENGVLLCGSSVESWNSWGAWMEATSDSGRTWRKFGPIYMENRSLSVIQPAPYQTAN 300
           PILLENGVLLCGSSVESWNSWGAWME TSDSGR+WRKFGPIYM+NRSLSVIQP PYQTAN
Sbjct: 241 PILLENGVLLCGSSVESWNSWGAWMEVTSDSGRSWRKFGPIYMKNRSLSVIQPVPYQTAN 300

Query: 301 GNLRVLLRSFTGIDGV 312
           GNLRVLLRSFTGI  +
Sbjct: 301 GNLRVLLRSFTGIGSI 306

BLAST of LsiUNG000940 vs. NCBI nr
Match: gi|449464334|ref|XP_004149884.1| (PREDICTED: uncharacterized protein LOC101219119 isoform X1 [Cucumis sativus])

HSP 1 Score: 519.2 bits (1336), Expect = 6.3e-144
Identity = 243/272 (89.34%), Positives = 254/272 (93.38%), Query Frame = 1

Query: 45  LFSISSPGFGDTFTSKFNLLTPKFSSRFHRLYRHSHIFGPRALVRGHSNIASVVRSLKMG 104
           +FSISSPGFG+TF SKFNLLTPKFSSR HRLY HSHIFGPR LVRGHSNI  VVRSLKMG
Sbjct: 17  VFSISSPGFGNTFISKFNLLTPKFSSRIHRLYHHSHIFGPRVLVRGHSNI--VVRSLKMG 76

Query: 105 KG-----PLLEEFTFPANSVPFNNCHASTIVEVDKDHYLVAYFGGTFEGAPDVKIWLQTY 164
           KG     PL EEFTFPANSVPFNNCHASTIVEVDKDHYLVAYFGGT EGAPDVKIWLQ +
Sbjct: 77  KGSYVDGPLQEEFTFPANSVPFNNCHASTIVEVDKDHYLVAYFGGTLEGAPDVKIWLQAF 136

Query: 165 KNGSWHPPIVADEEPDVPMWNPVLFKLPSDELLLFYRVGQDVQRWSGCMKRSYDKGVTWT 224
           KNGSWH PIVADEEPD+PMWNPVLFKLPSDELLLFY+VGQ+VQ+WSGCMKRSYDKG+TWT
Sbjct: 137 KNGSWHSPIVADEEPDIPMWNPVLFKLPSDELLLFYKVGQEVQKWSGCMKRSYDKGITWT 196

Query: 225 AREQLPPGILGPIKNKPILLENGVLLCGSSVESWNSWGAWMEATSDSGRTWRKFGPIYME 284
           AREQLPPGILGPIKNKPILLENGVLLCGSSVESWNSWGAWME TSDSGR+WRKFGPIYM+
Sbjct: 197 AREQLPPGILGPIKNKPILLENGVLLCGSSVESWNSWGAWMEVTSDSGRSWRKFGPIYMK 256

Query: 285 NRSLSVIQPAPYQTANGNLRVLLRSFTGIDGV 312
           NRSLSVIQP PYQTANGNLRVLLRSFTGID +
Sbjct: 257 NRSLSVIQPVPYQTANGNLRVLLRSFTGIDSI 286

BLAST of LsiUNG000940 vs. NCBI nr
Match: gi|731439085|ref|XP_010647630.1| (PREDICTED: uncharacterized protein LOC104878719 [Vitis vinifera])

HSP 1 Score: 476.5 bits (1225), Expect = 4.7e-131
Identity = 224/307 (72.96%), Positives = 251/307 (81.76%), Query Frame = 1

Query: 105 KGPLLEEFTFPANSVPFNNCHASTIVEVDKDHYLVAYFGGTFEGAPDVKIWLQTYKNGSW 164
           +GP+LEEFTFP+NS PFN CHASTIVEV K H+LVAYFGGT EGAPDVKIWLQTYK+G W
Sbjct: 6   EGPVLEEFTFPSNSAPFNCCHASTIVEVGKLHFLVAYFGGTAEGAPDVKIWLQTYKDGFW 65

Query: 165 HPPIVADEEPDVPMWNPVLFKLPSDELLLFYRVGQDVQRWSGCMKRSYDKGVTWTAREQL 224
           H PI  DEEPDVPMWNPVLFKLPSDELLLFY++GQ+VQ+WSGCMKRS+D GVTWT REQL
Sbjct: 66  HFPIPIDEEPDVPMWNPVLFKLPSDELLLFYKIGQEVQKWSGCMKRSFDGGVTWTEREQL 125

Query: 225 PPGILGPIKNKPILLENGVLLCGSSVESWNSWGAWMEATSDSGRTWRKFGPIYMENRSLS 284
           PPGILGPIKNKPILLENG+LLCGSSVESWNSWGAWME T DSGR+WRK+GPI+++N +LS
Sbjct: 126 PPGILGPIKNKPILLENGLLLCGSSVESWNSWGAWMEVTEDSGRSWRKYGPIFIKNETLS 185

Query: 285 VIQPAPYQTANGNLRVLLRSF------------------------------TGIDGVKLR 344
           VIQP PYQTANG LRVLLRSF                              +GIDGVKL 
Sbjct: 186 VIQPVPYQTANGALRVLLRSFDGIDRVCMSDSLVGGQSWIFAKLTALPNPNSGIDGVKLW 245

Query: 345 DGRLLLVYNTVSRGVLKVGLSLDDGDSWLDVMTLEDEPGKEFSYPAVIEASDGSIHITYT 382
           DGRLLL YNT+SR VLKV +S DDGDSW +V+TLE++ G EFSYPAVI+A+DGS+HITYT
Sbjct: 246 DGRLLLAYNTISREVLKVAISADDGDSWQEVVTLEEKTGMEFSYPAVIQATDGSVHITYT 305

BLAST of LsiUNG000940 vs. NCBI nr
Match: gi|565437231|ref|XP_006281863.1| (hypothetical protein CARUB_v10028058mg [Capsella rubella])

HSP 1 Score: 461.1 bits (1185), Expect = 2.1e-126
Identity = 210/282 (74.47%), Positives = 239/282 (84.75%), Query Frame = 1

Query: 108 LLEEFTFPANSVPFNNCHASTIVEVDKDHYLVAYFGGTFEGAPDVKIWLQTYKNGSWHPP 167
           +LE FT+PANS PF +CHASTIVEVDKDH+L AYFGG+ EGAPDVKIWLQ +K+G W  P
Sbjct: 9   VLETFTYPANSAPFKSCHASTIVEVDKDHFLAAYFGGSREGAPDVKIWLQHFKDGQWDTP 68

Query: 168 IVADEEPDVPMWNPVLFKLPSDELLLFYRVGQDVQRWSGCMKRSYDKGVTWTAREQLPPG 227
           ++ DE+P VPMWNPVLFKLPS +LLLFY++GQ+VQ+WSGCMKRS DKG TWT REQLPPG
Sbjct: 69  VIVDEQPGVPMWNPVLFKLPSQQLLLFYKIGQEVQKWSGCMKRSNDKGRTWTEREQLPPG 128

Query: 228 ILGPIKNKPILLENGVLLCGSSVESWNSWGAWMEATSDSGRTWRKFGPIYMENRSLSVIQ 287
           ILGPIKNKPIL E+G LLCGSSVESWNSWGAWME TSD+GR+WRK GPIY++ +SLSVIQ
Sbjct: 129 ILGPIKNKPILFEDGTLLCGSSVESWNSWGAWMEVTSDAGRSWRKQGPIYIQGKSLSVIQ 188

Query: 288 PAPYQTANGNLR--------VLLRSFTGIDGVKLRDGRLLLVYNTVSRGVLKVGLSLDDG 347
           P PYQTA G LR        VL    +GIDGVKL+DGRL+L YNT SRGVLKVG+SLDDG
Sbjct: 189 PVPYQTAAGTLRNWSFAVPTVLPNPNSGIDGVKLKDGRLVLAYNTDSRGVLKVGVSLDDG 248

Query: 348 DSWLDVMTLEDEPGKEFSYPAVIEASDGSIHITYTYKREQIK 382
           DSW DV+TLED PG EFSYPAVIEA DG++H+TYTY R QIK
Sbjct: 249 DSWTDVLTLEDSPGMEFSYPAVIEAGDGNVHVTYTYNRTQIK 290

BLAST of LsiUNG000940 vs. NCBI nr
Match: gi|702299579|ref|XP_010048700.1| (PREDICTED: uncharacterized protein LOC104437457 isoform X1 [Eucalyptus grandis])

HSP 1 Score: 458.8 bits (1179), Expect = 1.0e-125
Identity = 221/370 (59.73%), Positives = 270/370 (72.97%), Query Frame = 1

Query: 52  GFGDTFTSKFNLLTPKFSSRFHRLYRHSHIFGPRALVRGHSNIASVVRSLKMG------- 111
           GF  +F+S+   L+ +  +    L R+  +   R +  G S+ AS   S+          
Sbjct: 28  GFSGSFSSEHGHLSSRLKAHSGYLLRYHQVHLARKV--GISSKASTPTSIMAQSSNLTKD 87

Query: 112 ---KGPLLEEFTFPANSVPFNNCHASTIVEVDKDHYLVAYFGGTFEGAPDVKIWLQTYKN 171
              +GP+ ++FTFPA S PFNNCHASTIVEV++DH+LVAYFGGT EGAPDVKIWLQ Y++
Sbjct: 88  SRVEGPVEQDFTFPAKSAPFNNCHASTIVEVNEDHFLVAYFGGTEEGAPDVKIWLQRYQH 147

Query: 172 GSWHPPIVADEEPDVPMWNPVLFKLPSDELLLFYRVGQDVQRWSGCMKRSYDKGVTWTAR 231
           G WHPPI+ DE+ DVPMWNPVL+K PS ELLLFY++GQD QRWSG +KRSY++GV+W+ R
Sbjct: 148 GRWHPPIIIDEQEDVPMWNPVLYKFPSGELLLFYKIGQDFQRWSGFLKRSYNQGVSWSER 207

Query: 232 EQLPPGILGPIKNKPILLENGVLLCGSSVESWNSWGAWMEATSDSGRTWRKFGPIYMENR 291
           EQLPPGILGPIKNKPILL+NG LLCGSSVESWNSWGAWME T D+GRTWRK+GPIY++N 
Sbjct: 208 EQLPPGILGPIKNKPILLDNGDLLCGSSVESWNSWGAWMEITPDAGRTWRKYGPIYLKNE 267

Query: 292 SLSVIQPAPYQTANGNLRVLLRSFT------------------------------GIDGV 351
           SLSVIQP P++TA G LRVLLRSFT                              GIDGV
Sbjct: 268 SLSVIQPVPFKTARGTLRVLLRSFTGIDRICMSESHDGGMNWGYAEFTELPNPNSGIDGV 327

Query: 352 KLRDGRLLLVYNTVSRGVLKVGLSLDDGDSWLDVMTLEDEPGKEFSYPAVIEASDGSIHI 382
           KL++G L++ YNT+SRG+LKV +SLDDGDSW + +TLE+    EFSYPAVIEASDGS+H+
Sbjct: 328 KLKNGLLIVAYNTISRGILKVAVSLDDGDSWHEALTLEENLEMEFSYPAVIEASDGSVHV 387

The following BLAST results are available for this feature:
Match Name        E-value   Identity (%)  Description
R0F0G7_9BRAS      1.4e-126  74.47         Uncharacterized protein OS=Capsella rubella GN=CARUB_v10028058mg PE=4 SV=1
A0A059CSZ9_EUCGR  3.5e-125  64.63         Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_C02440 PE=4 SV=1
A0A059CSB9_EUCGR  7.8e-125  67.75         Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_C02440 PE=4 SV=1
V4WIY4_9ROSI      7.3e-123  66.07         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10008429mg PE=4 SV=1
A0A067F4S0_CITSI  3.1e-121  68.73         Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g020796mg PE=4 SV=1

Match Name   E-value   Identity (%)  Description
AT5G57700.3  3.4e-103  61.02         BNR/Asp-box repeat family protein

Match Name                        E-value   Identity (%)  Description
gi|659107416|ref|XP_008453663.1|  5.2e-146  81.01         PREDICTED: uncharacterized protein LOC103494313 isoform X1 [Cucumis melo]
gi|449464334|ref|XP_004149884.1|  6.3e-144  89.34         PREDICTED: uncharacterized protein LOC101219119 isoform X1 [Cucumis sativus]
gi|731439085|ref|XP_010647630.1|  4.7e-131  72.96         PREDICTED: uncharacterized protein LOC104878719 [Vitis vinifera]
gi|565437231|ref|XP_006281863.1|  2.1e-126  74.47         hypothetical protein CARUB_v10028058mg [Capsella rubella]
gi|702299579|ref|XP_010048700.1|  1.0e-125  59.73         PREDICTED: uncharacterized protein LOC104437457 isoform X1 [Eucalyptus grandis]
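The hit lists above can be ranked either by E-value or by percent identity, and the two orderings need not agree. The sketch below copies the accessions, E-values and identities from the BLAST results on this page; the tuple layout and variable names are illustrative assumptions. E-values depend on database size, so comparing them across TrEMBL, TAIR10 and NCBI nr is only a rough guide.

```python
# (database, accession, E-value as printed, percent identity)
hits = [
    ("TrEMBL", "R0F0G7_9BRAS", "1.4e-126", 74.47),
    ("TrEMBL", "A0A059CSZ9_EUCGR", "3.5e-125", 64.63),
    ("TrEMBL", "A0A059CSB9_EUCGR", "7.8e-125", 67.75),
    ("TrEMBL", "V4WIY4_9ROSI", "7.3e-123", 66.07),
    ("TrEMBL", "A0A067F4S0_CITSI", "3.1e-121", 68.73),
    ("TAIR10", "AT5G57700.3", "3.4e-103", 61.02),
    ("NCBI nr", "XP_008453663.1", "5.2e-146", 81.01),
    ("NCBI nr", "XP_004149884.1", "6.3e-144", 89.34),
    ("NCBI nr", "XP_010647630.1", "4.7e-131", 72.96),
    ("NCBI nr", "XP_006281863.1", "2.1e-126", 74.47),
    ("NCBI nr", "XP_010048700.1", "1.0e-125", 59.73),
]

# Smaller E-value means a more significant match; parse the text as a float.
by_evalue = sorted(hits, key=lambda h: float(h[2]))
# Higher identity means a larger fraction of identical aligned residues.
by_identity = sorted(hits, key=lambda h: h[3], reverse=True)

print("lowest E-value:  ", by_evalue[0][1])
print("highest identity:", by_identity[0][1])
```

Here the lowest E-value belongs to the Cucumis melo hit (XP_008453663.1), while the highest identity belongs to the Cucumis sativus hit (XP_004149884.1), whose alignment is shorter (272 versus 316 aligned positions).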
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR011040  Sialidase
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
LsiUNG000940.1  LsiUNG000940.1  mRNA


Analysis Name: InterPro Annotations of Lagenaria siceraria
Date Performed: 2017-09-18
IPR Term   IPR Description   Source   Source Term        Source Description                 Alignment
IPR011040  Sialidases        GENE3D   G3DSA:2.120.10.10  -                                  coord: 112..306, score: 3.3E-36; coord: 307..382, score: 9.1
IPR011040  Sialidases        PFAM     PF13088            BNR_2                              coord: 307..371, score: 2.3E-15; coord: 138..304, score: 2.4
IPR011040  Sialidases        unknown  SSF50939           Sialidases                         coord: 114..381, score: 6.45
None       No IPR available  PANTHER  PTHR33307          FAMILY NOT NAMED                   coord: 108..381, score: 1.0E
None       No IPR available  PANTHER  PTHR33307:SF2      BNR/ASP-BOX REPEAT FAMILY PROTEIN  coord: 108..381, score: 1.0E
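The `coord` ranges in the InterPro annotations overlap heavily, so it can help to merge them into the protein region they jointly cover. This is a minimal sketch under the assumption that the coordinates are 1-based, inclusive residue positions; `merge_intervals` is a hypothetical helper, and the spans are copied from the alignment column above.

```python
def merge_intervals(spans):
    """Merge overlapping or directly adjacent inclusive (start, end) spans."""
    merged = []
    for start, end in sorted(spans):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous span: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

# Residue ranges reported by GENE3D, PFAM, SSF50939 and PANTHER.
spans = [(112, 306), (307, 382), (307, 371), (138, 304), (114, 381), (108, 381)]

merged = merge_intervals(spans)
covered = sum(end - start + 1 for start, end in merged)
print(merged, covered)
```

All signatures collapse into a single region, residues 108 to 382 (275 positions), which coincides with the query range aligned in most of the BLAST hits above.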