Lsi04G013820.1 (mRNA) Bottle gourd (USVL1VR-Ls) v1

Overview
Name: Lsi04G013820.1
Type: mRNA
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls) v1)
Description: Hybrid signal transduction histidine kinase M-like protein
Location: chr04: 21629992 .. 21631846 (+)
Sequence length: 939
RNA-Seq Expression: Lsi04G013820.1
Synteny: Lsi04G013820.1
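
As a quick consistency check on the overview fields, the genomic span implied by the Location value can be computed from its start and end positions. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(location: str):
    """Split a 'chrom: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", location)
    if m is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start: int, end: int) -> int:
    """Length of an inclusive interval (assumption: 1-based, inclusive coordinates)."""
    return end - start + 1

chrom, start, end, strand = parse_location("chr04: 21629992 .. 21631846 (+)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the span is 1855 positions. It covers the intron as well, so it is expected to be longer than the sequence length of 939 listed for the mRNA.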
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CCAAAATCCCTCAATTCTTCCATTTATAAAAACCCGTTTCCTCTTCTTTCTTCTGCTTCTTCTTCTCGTCTGTGTTTTTACTTTCCTTCTCTCTCTCTCTCTCTCTCCTTGTTGTCACTGTTCAAATTCAAAGTCGTTATGTCGATTACTTTGGATAGGGTTCGGCAGGTTGAAGATAAACGCTCTGGATTCTCGCCGGAGGATTTGGGTTATGGATCGGTGTTTCAGAGGTTGGTCACTCACGATGTGGATCGCGCTGTTGAAGGGTTTGAAAAGGAGGAGGCTGACGAATCTAATACTTGCAGCTCCGCTTCTACTTCTTCGTCTTCTTCGATCGGGAGGAATAGTGATCAGTCCGCTAGATCGTCGGACGGTGAGGATAGCGGCGAGAATGATGAGGTTCAGAGCTCCTATAAAGGGCCGCTGGATATGATGGACTCCTTGGAGGAGGTTTTGCCTGTCAGGTTGGTGTTCTTTCTATTTGTTTTGTGGATGCTTTGAACTCGTATTTTCTTGCGCGATCTTAGTTTGGTAATTTCGAATTGCAAATCGAAATCATTCCTTGGTTCTAGTTCTTAAAATTCATGGATGATCTGAAACGAGTTTAGGGAAGGCTTGATTAAGCTGGAAAATTTAACACTCTTGTAGTACATTTGATCTAGGGATTGAGGATCGTTAATCTGTCCTGTAGAATGGTGGAGTCATTTGATCTGTGCTTGATTATTGATCAATTACTTTTGCCTCATTTTGGATTTATTACTGGGGGGATTACGTTTACATCCTGGCGGTGACTAGTATTTATTGCTGACAAACAGAGAAAATATTTAGATTCGAGTCTTGGAAGAGTCATTAGTCTGGGCATGAACAAGTCCTGGCCTGTGGAAACATGTTTGCTGATTTCTTGTGGATAAATGCAATGATAAACTCGAGATAACTATATGATTTCGGTGGGTTAATTATGAGAAAGAGAGCATTCCGAAGCTGAAGATTAGTAAATTACATCATTTCTTTAGTGAACTTTAATCGTGAGTTTACCATCGATTTCTATTGTCATCTTTCTTGCCAATCTGTGTAGTGAAGATTGTGTTTCTGAGATCTTTCTAAGCTCAGTCAGAAAGAAATCATTGCTTTGTAGTGCACTAACATTATAAAAATTGTTCTGCAACCACTGCAAGACAATGACTACTTTTGATTTTAGCTTTATGGTTTCAGTAACACTATCTCTACTTTTGAGATTGGAATAAGGGAGAGTACCCATTAAGAGTACCCATTAATACAGTGGAGTTTTTATGCAATTTTACAGCAGAACACACACAAACATACATGATACACGTATTCTATTTGTTTCTTTATCTAATAAATAATTATTACAACATGCAGAAAAGGTATCTCAAAGTTCTATAGTGGAAAATCAAAGTCTTTCACGAGTCTGGCCGATGCTTCCTCTGTTAACTCCATGAAGGAGATTGCAAAGCCAGAGAATGCTTATTCCAAGAAACGTAGGAATCTTATGGCGTACAATCTTGTGTGGGAGAAGAACCGCAGTTTTCCTCTCAAGAATAATGGTGGTGGGATATCAAAGAGACCCATCAGCTCAAGCAGAAGCTCCTTAGCTTTGGCTGTCGCTATGAGCAGCTCCGAAAGTAACAGCAGTGAGGATTCAAATTCTAGCTCATATTCAAGCTCTCCACCGCCTCGCCCACCTCTACACCCACAATCCAGACCATGTAACAGCAATTTCACTTCCATGGTGCCACCTCAAAGAACTTTCTCCACTTGGCGTTCATATTCCTTGGCCGATTTACAAGAATGCGCCACCTTTACTAATAAGGCCAACCTAACCAATCTAAACTAA

mRNA sequence

CCAAAATCCCTCAATTCTTCCATTTATAAAAACCCGTTTCCTCTTCTTTCTTCTGCTTCTTCTTCTCGTCTGTGTTTTTACTTTCCTTCTCTCTCTCTCTCTCTCTCCTTGTTGTCACTGTTCAAATTCAAAGTCGTTATGTCGATTACTTTGGATAGGGTTCGGCAGGTTGAAGATAAACGCTCTGGATTCTCGCCGGAGGATTTGGGTTATGGATCGGTGTTTCAGAGGTTGGTCACTCACGATGTGGATCGCGCTGTTGAAGGGTTTGAAAAGGAGGAGGCTGACGAATCTAATACTTGCAGCTCCGCTTCTACTTCTTCGTCTTCTTCGATCGGGAGGAATAGTGATCAGTCCGCTAGATCGTCGGACGGTGAGGATAGCGGCGAGAATGATGAGGTTCAGAGCTCCTATAAAGGGCCGCTGGATATGATGGACTCCTTGGAGGAGGTTTTGCCTGTCAGAAAAGGTATCTCAAAGTTCTATAGTGGAAAATCAAAGTCTTTCACGAGTCTGGCCGATGCTTCCTCTGTTAACTCCATGAAGGAGATTGCAAAGCCAGAGAATGCTTATTCCAAGAAACGTAGGAATCTTATGGCGTACAATCTTGTGTGGGAGAAGAACCGCAGTTTTCCTCTCAAGAATAATGGTGGTGGGATATCAAAGAGACCCATCAGCTCAAGCAGAAGCTCCTTAGCTTTGGCTGTCGCTATGAGCAGCTCCGAAAGTAACAGCAGTGAGGATTCAAATTCTAGCTCATATTCAAGCTCTCCACCGCCTCGCCCACCTCTACACCCACAATCCAGACCATGTAACAGCAATTTCACTTCCATGGTGCCACCTCAAAGAACTTTCTCCACTTGGCGTTCATATTCCTTGGCCGATTTACAAGAATGCGCCACCTTTACTAATAAGGCCAACCTAACCAATCTAAACTAA

Coding sequence (CDS)

ATGTCGATTACTTTGGATAGGGTTCGGCAGGTTGAAGATAAACGCTCTGGATTCTCGCCGGAGGATTTGGGTTATGGATCGGTGTTTCAGAGGTTGGTCACTCACGATGTGGATCGCGCTGTTGAAGGGTTTGAAAAGGAGGAGGCTGACGAATCTAATACTTGCAGCTCCGCTTCTACTTCTTCGTCTTCTTCGATCGGGAGGAATAGTGATCAGTCCGCTAGATCGTCGGACGGTGAGGATAGCGGCGAGAATGATGAGGTTCAGAGCTCCTATAAAGGGCCGCTGGATATGATGGACTCCTTGGAGGAGGTTTTGCCTGTCAGAAAAGGTATCTCAAAGTTCTATAGTGGAAAATCAAAGTCTTTCACGAGTCTGGCCGATGCTTCCTCTGTTAACTCCATGAAGGAGATTGCAAAGCCAGAGAATGCTTATTCCAAGAAACGTAGGAATCTTATGGCGTACAATCTTGTGTGGGAGAAGAACCGCAGTTTTCCTCTCAAGAATAATGGTGGTGGGATATCAAAGAGACCCATCAGCTCAAGCAGAAGCTCCTTAGCTTTGGCTGTCGCTATGAGCAGCTCCGAAAGTAACAGCAGTGAGGATTCAAATTCTAGCTCATATTCAAGCTCTCCACCGCCTCGCCCACCTCTACACCCACAATCCAGACCATGTAACAGCAATTTCACTTCCATGGTGCCACCTCAAAGAACTTTCTCCACTTGGCGTTCATATTCCTTGGCCGATTTACAAGAATGCGCCACCTTTACTAATAAGGCCAACCTAACCAATCTAAACTAA

Protein sequence

MSITLDRVRQVEDKRSGFSPEDLGYGSVFQRLVTHDVDRAVEGFEKEEADESNTCSSASTSSSSSIGRNSDQSARSSDGEDSGENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYSGKSKSFTSLADASSVNSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPISSSRSSLALAVAMSSSESNSSEDSNSSSYSSSPPPRPPLHPQSRPCNSNFTSMVPPQRTFSTWRSYSLADLQECATFTNKANLTNLN
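
The relationship between the coding sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the standard genetic code; the function name `translate` and the table construction are illustrative and are not taken from the database. Only the first seven codons and the last four codons of the CDS are used, so nothing here depends on re-typing the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (an assumption of
# this sketch; the record does not state which translation table was used).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons of the CDS above.
print(translate("ATGTCGATTACTTTGGATAGG"))
# Last four codons of the CDS above, including the TAA stop.
print(translate("AATCTAAACTAA"))
```

The two calls give `MSITLDR` and `NLN`, which agree with the start and end of the protein sequence shown above.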
Homology
BLAST of Lsi04G013820.1 vs. ExPASy TrEMBL
Match: A0A0A0L451 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G648530 PE=4 SV=1)

HSP 1 Score: 461.8 bits (1187), Expect = 1.9e-126
Identity = 250/266 (93.98%), Positives = 256/266 (96.24%), Query Frame = 0

Query: 1   MSITLDRVRQVEDKRSGFSPEDLGYGSVFQRLVTHDVDRAVEGFEKEEADESNTCSSAST 60
           MSITLDRVRQV+DKRSGFSPEDLGYGSVFQRLVTHDVDR VEGF+KEEADESNTCSSAST
Sbjct: 1   MSITLDRVRQVDDKRSGFSPEDLGYGSVFQRLVTHDVDRVVEGFQKEEADESNTCSSAST 60

Query: 61  SSSSSIGRNSDQSARSSDGEDSGENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYSGKS 120
           SSSSSIGRNSDQ    SD ED+GENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYSGKS
Sbjct: 61  SSSSSIGRNSDQ----SDDEDNGENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYSGKS 120

Query: 121 KSFTSLADASSVNSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPIS 180
           KSFTSLADASSVNSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPIS
Sbjct: 121 KSFTSLADASSVNSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPIS 180

Query: 181 SSRSSLALAVAMSSSESNSSEDSNSSSYSSSPPPRPPLHPQSRPCNSNFTSMVPPQRTFS 240
           SS+SSLALAVAMSSSESNSSEDSN SSYSSSPPPRPPLHPQSRP N+NF SMVPPQ+TFS
Sbjct: 181 SSKSSLALAVAMSSSESNSSEDSNCSSYSSSPPPRPPLHPQSRPSNNNFPSMVPPQKTFS 240

Query: 241 TWRSYSLADLQECATFTNKANLTNLN 267
           TWRSYSLADLQECATF NKANLTNLN
Sbjct: 241 TWRSYSLADLQECATFANKANLTNLN 262
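
The percentages in an HSP header follow directly from the reported counts: matching positions divided by alignment length. The short sketch below reproduces the two values for this first hit; the helper name `percent` is hypothetical and the two-decimal rounding is an assumption chosen to match the figures shown.

```python
def percent(matches: int, aligned: int) -> str:
    """Return matches/aligned as a percentage with two decimals."""
    if aligned <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100 * matches / aligned:.2f}"

# Counts taken from the HSP header above: 250 identical and 256 positive
# positions over an alignment length of 266.
print(percent(250, 266))
print(percent(256, 266))
```

These evaluate to 93.98 and 96.24, the same values as in the header.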

BLAST of Lsi04G013820.1 vs. ExPASy TrEMBL
Match: A0A5A7VFR8 (Vitellogenin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold134G001690 PE=4 SV=1)

HSP 1 Score: 454.9 bits (1169), Expect = 2.3e-124
Identity = 250/267 (93.63%), Positives = 256/267 (95.88%), Query Frame = 0

Query: 1   MSITLDRVRQVEDKRSGFSPEDLGYGSVFQRLVTHDVDRAVEGFEKEE-ADESNTCSSAS 60
           MSITLDRVRQV+ KRSGFSPEDLGYGSVFQRLVT DVDRAVEGF+KEE ADESNTCSSAS
Sbjct: 1   MSITLDRVRQVDGKRSGFSPEDLGYGSVFQRLVTQDVDRAVEGFQKEEDADESNTCSSAS 60

Query: 61  TSSSSSIGRNSDQSARSSDGEDSGENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYSGK 120
           TSSSSSIGRNSDQ    SD ED+GENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYSGK
Sbjct: 61  TSSSSSIGRNSDQ----SDDEDNGENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYSGK 120

Query: 121 SKSFTSLADASSVNSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPI 180
           SKSFTSLADASSVNSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPI
Sbjct: 121 SKSFTSLADASSVNSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPI 180

Query: 181 SSSRSSLALAVAMSSSESNSSEDSNSSSYSSSPPPRPPLHPQSRPCNSNFTSMVPPQRTF 240
           SSS+SSLALAVAMSSSESNSSEDSNSSSYSSSPPPRPPLHPQSRP N+NF SMVPPQ+TF
Sbjct: 181 SSSKSSLALAVAMSSSESNSSEDSNSSSYSSSPPPRPPLHPQSRPSNNNFPSMVPPQKTF 240

Query: 241 STWRSYSLADLQECATFTNKANLTNLN 267
           STWRSYSLADLQECATF NKANLTNLN
Sbjct: 241 STWRSYSLADLQECATFANKANLTNLN 263

BLAST of Lsi04G013820.1 vs. ExPASy TrEMBL
Match: A0A1S3BU42 (vitellogenin-2 OS=Cucumis melo OX=3656 GN=LOC103493705 PE=4 SV=1)

HSP 1 Score: 454.9 bits (1169), Expect = 2.3e-124
Identity = 250/267 (93.63%), Positives = 256/267 (95.88%), Query Frame = 0

Query: 1   MSITLDRVRQVEDKRSGFSPEDLGYGSVFQRLVTHDVDRAVEGFEKEE-ADESNTCSSAS 60
           MSITLDRVRQV+ KRSGFSPEDLGYGSVFQRLVT DVDRAVEGF+KEE ADESNTCSSAS
Sbjct: 1   MSITLDRVRQVDGKRSGFSPEDLGYGSVFQRLVTQDVDRAVEGFQKEEDADESNTCSSAS 60

Query: 61  TSSSSSIGRNSDQSARSSDGEDSGENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYSGK 120
           TSSSSSIGRNSDQ    SD ED+GENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYSGK
Sbjct: 61  TSSSSSIGRNSDQ----SDDEDNGENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYSGK 120

Query: 121 SKSFTSLADASSVNSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPI 180
           SKSFTSLADASSVNSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPI
Sbjct: 121 SKSFTSLADASSVNSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPI 180

Query: 181 SSSRSSLALAVAMSSSESNSSEDSNSSSYSSSPPPRPPLHPQSRPCNSNFTSMVPPQRTF 240
           SSS+SSLALAVAMSSSESNSSEDSNSSSYSSSPPPRPPLHPQSRP N+NF SMVPPQ+TF
Sbjct: 181 SSSKSSLALAVAMSSSESNSSEDSNSSSYSSSPPPRPPLHPQSRPSNNNFPSMVPPQKTF 240

Query: 241 STWRSYSLADLQECATFTNKANLTNLN 267
           STWRSYSLADLQECATF NKANLTNLN
Sbjct: 241 STWRSYSLADLQECATFANKANLTNLN 263

BLAST of Lsi04G013820.1 vs. ExPASy TrEMBL
Match: A0A6J1FCW4 (uncharacterized protein LOC111442964 OS=Cucurbita moschata OX=3662 GN=LOC111442964 PE=4 SV=1)

HSP 1 Score: 425.6 bits (1093), Expect = 1.5e-115
Identity = 236/266 (88.72%), Positives = 249/266 (93.61%), Query Frame = 0

Query: 1   MSITLDRVRQVEDKRSGFSPEDLGYGSVFQRLVTHDVDRAVEGFEKEEADESNTCSSAST 60
           MSITLDRVR+V D+ SGFSPEDLGYGS+F+RL T DVDR VEGFEK EA+ESNTCSSAST
Sbjct: 1   MSITLDRVRRV-DQPSGFSPEDLGYGSMFERLETGDVDRDVEGFEK-EAEESNTCSSAST 60

Query: 61  SSSSSIGRNSDQSARSSDGEDSGENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYSGKS 120
           SSSSSIGRNSDQSARSSD EDSGENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFY+GKS
Sbjct: 61  SSSSSIGRNSDQSARSSDDEDSGENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYNGKS 120

Query: 121 KSFTSLADASSVNSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPIS 180
           KSFTSLADAS+V+S++EI KPENAYSKKRRNLMA+NLVWEKNRSFPLKNNGGGISKRPIS
Sbjct: 121 KSFTSLADASTVSSVEEIVKPENAYSKKRRNLMAFNLVWEKNRSFPLKNNGGGISKRPIS 180

Query: 181 SSRSSLALAVAMSSSESNSSEDSNSSSYSSSPPPRPPLHPQSRPCNSNFTSMVPPQRTFS 240
           SSRSS ALAVAMSSSESNSSEDSN SSYSSSPPPRPPLHPQS+  NSN  S+VPPQR FS
Sbjct: 181 SSRSSFALAVAMSSSESNSSEDSNCSSYSSSPPPRPPLHPQSKASNSNLASIVPPQRNFS 240

Query: 241 TWRSYSLADLQECATFTNKANLTNLN 267
           TWRSYSLADLQECAT  NKANLTNLN
Sbjct: 241 TWRSYSLADLQECATSANKANLTNLN 264

BLAST of Lsi04G013820.1 vs. ExPASy TrEMBL
Match: A0A6J1FT55 (uncharacterized protein LOC111446629 OS=Cucurbita moschata OX=3662 GN=LOC111446629 PE=4 SV=1)

HSP 1 Score: 421.0 bits (1081), Expect = 3.7e-114
Identity = 237/266 (89.10%), Positives = 246/266 (92.48%), Query Frame = 0

Query: 1   MSITLDRVRQVEDKRSGFSPEDLGYGSVFQRLVTHDVDRAVEGFEKEEADESNTCSSAST 60
           MSI LDRV++VEDK SGFSPEDLGYGSVF+RL T DVD  VEGFEK EA+ESNTCSSAS 
Sbjct: 1   MSIALDRVQRVEDKGSGFSPEDLGYGSVFERLDTGDVDHGVEGFEK-EAEESNTCSSAS- 60

Query: 61  SSSSSIGRNSDQSARSSDGEDSGENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYSGKS 120
           SSSSSIGRNSDQSARSSDGE+SGE+DEVQSSYKGPLDMMDSLEEVLPVRKGISKFY+GKS
Sbjct: 61  SSSSSIGRNSDQSARSSDGEESGESDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYNGKS 120

Query: 121 KSFTSLADASSVNSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPIS 180
           KS+TSLADA SV+SMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPIS
Sbjct: 121 KSYTSLADA-SVSSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPIS 180

Query: 181 SSRSSLALAVAMSSSESNSSEDSNSSSYSSSPPPRPPLHPQSRPCNSNFTSMVPPQRTFS 240
           SSRSSLALAVAMSSSESN SEDSNSSSYS SPPP PPLHPQSR  N N  SMVPPQR FS
Sbjct: 181 SSRSSLALAVAMSSSESNCSEDSNSSSYSGSPPPLPPLHPQSRASNCNLASMVPPQRNFS 240

Query: 241 TWRSYSLADLQECATFTNKANLTNLN 267
           TWRSYSLADLQECAT  NKANLTNLN
Sbjct: 241 TWRSYSLADLQECATSANKANLTNLN 263

BLAST of Lsi04G013820.1 vs. NCBI nr
Match: XP_038896868.1 (uncharacterized protein LOC120085087 [Benincasa hispida])

HSP 1 Score: 472.6 bits (1215), Expect = 2.2e-129
Identity = 256/266 (96.24%), Positives = 258/266 (96.99%), Query Frame = 0

Query: 1   MSITLDRVRQVEDKRSGFSPEDLGYGSVFQRLVTHDVDRAVEGFEKEEADESNTCSSAST 60
           MSITLDRVRQVEDKRSGFSPEDLGYGSVFQRLVT DVDRAVEGF+K EADESNTCSSAST
Sbjct: 1   MSITLDRVRQVEDKRSGFSPEDLGYGSVFQRLVTQDVDRAVEGFQKGEADESNTCSSAST 60

Query: 61  SSSSSIGRNSDQSARSSDGEDSGENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYSGKS 120
           SSSSSIGRNSDQSARSSDGED GENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYSGKS
Sbjct: 61  SSSSSIGRNSDQSARSSDGEDCGENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYSGKS 120

Query: 121 KSFTSLADASSVNSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPIS 180
           KSFTSLADAS+VNSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPIS
Sbjct: 121 KSFTSLADASAVNSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPIS 180

Query: 181 SSRSSLALAVAMSSSESNSSEDSNSSSYSSSPPPRPPLHPQSRPCNSNFTSMVPPQRTFS 240
           SSRSSLALAVAMSSSESNSSEDSNSSSYSSSPPPRPPLHPQSRP N NF SM PPQRTFS
Sbjct: 181 SSRSSLALAVAMSSSESNSSEDSNSSSYSSSPPPRPPLHPQSRPSNGNFPSMAPPQRTFS 240

Query: 241 TWRSYSLADLQECATFTNKANLTNLN 267
           TWRSYSLADLQECATF NKANLTNLN
Sbjct: 241 TWRSYSLADLQECATFANKANLTNLN 266

BLAST of Lsi04G013820.1 vs. NCBI nr
Match: XP_004141345.1 (vitellogenin-2 [Cucumis sativus] >KGN55382.1 hypothetical protein Csa_012692 [Cucumis sativus])

HSP 1 Score: 461.8 bits (1187), Expect = 3.9e-126
Identity = 250/266 (93.98%), Positives = 256/266 (96.24%), Query Frame = 0

Query: 1   MSITLDRVRQVEDKRSGFSPEDLGYGSVFQRLVTHDVDRAVEGFEKEEADESNTCSSAST 60
           MSITLDRVRQV+DKRSGFSPEDLGYGSVFQRLVTHDVDR VEGF+KEEADESNTCSSAST
Sbjct: 1   MSITLDRVRQVDDKRSGFSPEDLGYGSVFQRLVTHDVDRVVEGFQKEEADESNTCSSAST 60

Query: 61  SSSSSIGRNSDQSARSSDGEDSGENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYSGKS 120
           SSSSSIGRNSDQ    SD ED+GENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYSGKS
Sbjct: 61  SSSSSIGRNSDQ----SDDEDNGENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYSGKS 120

Query: 121 KSFTSLADASSVNSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPIS 180
           KSFTSLADASSVNSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPIS
Sbjct: 121 KSFTSLADASSVNSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPIS 180

Query: 181 SSRSSLALAVAMSSSESNSSEDSNSSSYSSSPPPRPPLHPQSRPCNSNFTSMVPPQRTFS 240
           SS+SSLALAVAMSSSESNSSEDSN SSYSSSPPPRPPLHPQSRP N+NF SMVPPQ+TFS
Sbjct: 181 SSKSSLALAVAMSSSESNSSEDSNCSSYSSSPPPRPPLHPQSRPSNNNFPSMVPPQKTFS 240

Query: 241 TWRSYSLADLQECATFTNKANLTNLN 267
           TWRSYSLADLQECATF NKANLTNLN
Sbjct: 241 TWRSYSLADLQECATFANKANLTNLN 262

BLAST of Lsi04G013820.1 vs. NCBI nr
Match: XP_008452790.1 (PREDICTED: vitellogenin-2 [Cucumis melo] >KAA0064541.1 vitellogenin-2 [Cucumis melo var. makuwa] >TYK20049.1 vitellogenin-2 [Cucumis melo var. makuwa])

HSP 1 Score: 454.9 bits (1169), Expect = 4.7e-124
Identity = 250/267 (93.63%), Positives = 256/267 (95.88%), Query Frame = 0

Query: 1   MSITLDRVRQVEDKRSGFSPEDLGYGSVFQRLVTHDVDRAVEGFEKEE-ADESNTCSSAS 60
           MSITLDRVRQV+ KRSGFSPEDLGYGSVFQRLVT DVDRAVEGF+KEE ADESNTCSSAS
Sbjct: 1   MSITLDRVRQVDGKRSGFSPEDLGYGSVFQRLVTQDVDRAVEGFQKEEDADESNTCSSAS 60

Query: 61  TSSSSSIGRNSDQSARSSDGEDSGENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYSGK 120
           TSSSSSIGRNSDQ    SD ED+GENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYSGK
Sbjct: 61  TSSSSSIGRNSDQ----SDDEDNGENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYSGK 120

Query: 121 SKSFTSLADASSVNSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPI 180
           SKSFTSLADASSVNSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPI
Sbjct: 121 SKSFTSLADASSVNSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPI 180

Query: 181 SSSRSSLALAVAMSSSESNSSEDSNSSSYSSSPPPRPPLHPQSRPCNSNFTSMVPPQRTF 240
           SSS+SSLALAVAMSSSESNSSEDSNSSSYSSSPPPRPPLHPQSRP N+NF SMVPPQ+TF
Sbjct: 181 SSSKSSLALAVAMSSSESNSSEDSNSSSYSSSPPPRPPLHPQSRPSNNNFPSMVPPQKTF 240

Query: 241 STWRSYSLADLQECATFTNKANLTNLN 267
           STWRSYSLADLQECATF NKANLTNLN
Sbjct: 241 STWRSYSLADLQECATFANKANLTNLN 263

BLAST of Lsi04G013820.1 vs. NCBI nr
Match: XP_022936302.1 (uncharacterized protein LOC111442964 [Cucurbita moschata] >XP_022936303.1 uncharacterized protein LOC111442964 [Cucurbita moschata])

HSP 1 Score: 425.6 bits (1093), Expect = 3.1e-115
Identity = 236/266 (88.72%), Positives = 249/266 (93.61%), Query Frame = 0

Query: 1   MSITLDRVRQVEDKRSGFSPEDLGYGSVFQRLVTHDVDRAVEGFEKEEADESNTCSSAST 60
           MSITLDRVR+V D+ SGFSPEDLGYGS+F+RL T DVDR VEGFEK EA+ESNTCSSAST
Sbjct: 1   MSITLDRVRRV-DQPSGFSPEDLGYGSMFERLETGDVDRDVEGFEK-EAEESNTCSSAST 60

Query: 61  SSSSSIGRNSDQSARSSDGEDSGENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYSGKS 120
           SSSSSIGRNSDQSARSSD EDSGENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFY+GKS
Sbjct: 61  SSSSSIGRNSDQSARSSDDEDSGENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYNGKS 120

Query: 121 KSFTSLADASSVNSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPIS 180
           KSFTSLADAS+V+S++EI KPENAYSKKRRNLMA+NLVWEKNRSFPLKNNGGGISKRPIS
Sbjct: 121 KSFTSLADASTVSSVEEIVKPENAYSKKRRNLMAFNLVWEKNRSFPLKNNGGGISKRPIS 180

Query: 181 SSRSSLALAVAMSSSESNSSEDSNSSSYSSSPPPRPPLHPQSRPCNSNFTSMVPPQRTFS 240
           SSRSS ALAVAMSSSESNSSEDSN SSYSSSPPPRPPLHPQS+  NSN  S+VPPQR FS
Sbjct: 181 SSRSSFALAVAMSSSESNSSEDSNCSSYSSSPPPRPPLHPQSKASNSNLASIVPPQRNFS 240

Query: 241 TWRSYSLADLQECATFTNKANLTNLN 267
           TWRSYSLADLQECAT  NKANLTNLN
Sbjct: 241 TWRSYSLADLQECATSANKANLTNLN 264

BLAST of Lsi04G013820.1 vs. NCBI nr
Match: XP_022941285.1 (uncharacterized protein LOC111446629 [Cucurbita moschata])

HSP 1 Score: 421.0 bits (1081), Expect = 7.6e-114
Identity = 237/266 (89.10%), Positives = 246/266 (92.48%), Query Frame = 0

Query: 1   MSITLDRVRQVEDKRSGFSPEDLGYGSVFQRLVTHDVDRAVEGFEKEEADESNTCSSAST 60
           MSI LDRV++VEDK SGFSPEDLGYGSVF+RL T DVD  VEGFEK EA+ESNTCSSAS 
Sbjct: 1   MSIALDRVQRVEDKGSGFSPEDLGYGSVFERLDTGDVDHGVEGFEK-EAEESNTCSSAS- 60

Query: 61  SSSSSIGRNSDQSARSSDGEDSGENDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYSGKS 120
           SSSSSIGRNSDQSARSSDGE+SGE+DEVQSSYKGPLDMMDSLEEVLPVRKGISKFY+GKS
Sbjct: 61  SSSSSIGRNSDQSARSSDGEESGESDEVQSSYKGPLDMMDSLEEVLPVRKGISKFYNGKS 120

Query: 121 KSFTSLADASSVNSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPIS 180
           KS+TSLADA SV+SMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPIS
Sbjct: 121 KSYTSLADA-SVSSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKNNGGGISKRPIS 180

Query: 181 SSRSSLALAVAMSSSESNSSEDSNSSSYSSSPPPRPPLHPQSRPCNSNFTSMVPPQRTFS 240
           SSRSSLALAVAMSSSESN SEDSNSSSYS SPPP PPLHPQSR  N N  SMVPPQR FS
Sbjct: 181 SSRSSLALAVAMSSSESNCSEDSNSSSYSGSPPPLPPLHPQSRASNCNLASMVPPQRNFS 240

Query: 241 TWRSYSLADLQECATFTNKANLTNLN 267
           TWRSYSLADLQECAT  NKANLTNLN
Sbjct: 241 TWRSYSLADLQECATSANKANLTNLN 263

BLAST of Lsi04G013820.1 vs. TAIR 10
Match: AT5G21940.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G43850.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 151.0 bits (380), Expect = 1.4e-36
Identity = 116/229 (50.66%), Positives = 155/229 (67.69%), Query Frame = 0

Query: 52  SNTCSSASTSSSSSIGRNSDQSARSSD--GEDSGENDEVQSSYKGPLDMMDSLEEVLPVR 111
           S++ SS S+S+SSSIGRNSD   +SS+  G+D+GEN EV+S YKGPL+MM+SLE+VLPVR
Sbjct: 30  SDSSSSPSSSASSSIGRNSDDGEKSSEDGGDDAGEN-EVESPYKGPLEMMESLEQVLPVR 89

Query: 112 KGISKFYSGKSKSFTSL-ADASSV----NSMKEIAKPENAYSKKRRNLMAYNLVWEKNRS 171
           KGISK+YSGKSKSFT+L A+A+S     +SMK++AKPEN YS++RRNL+ +  +WE N++
Sbjct: 90  KGISKYYSGKSKSFTNLTAEAASALTSSSSMKDLAKPENPYSRRRRNLLCHQ-IWENNKT 149

Query: 172 FPLKNNGGGISKRPI-SSSRSSLALAVA-----MSSSESNSSEDSN--SSSYSSSPPPR- 231
            P     GGISK+ + SSSRS+L LA+A     M+   S+S  DS+  SS  +S  PPR 
Sbjct: 150 TP----RGGISKKHVMSSSRSALTLAMAVAAGVMTGEGSSSGGDSSPGSSPTTSGSPPRQ 209

Query: 232 -----------PPLHPQSRPCNSNFTSMVPPQRTFSTWRSYSLADLQEC 254
                      PPL+P+S+    N TS       F  WRS+S+AD   C
Sbjct: 210 LHHHQHQMKKLPPLYPRSQGSFGNLTSS-QSSLGFCAWRSFSVADFPRC 251

BLAST of Lsi04G013820.1 vs. TAIR 10
Match: AT3G43850.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: vacuole; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G21940.1); Has 215 Blast hits to 215 proteins in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 213; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 102.4 bits (254), Expect = 5.6e-22
Identity = 71/157 (45.22%), Positives = 98/157 (62.42%), Query Frame = 0

Query: 50  DESNTCSSASTSSSSSIGRNSDQSARSSDGEDSGENDEVQSSYKGPLDMMDSLEEVLPVR 109
           D+   C S+ST SS SIG NSD        +D G  +E++SSY GPLDMM+SLEE LP++
Sbjct: 15  DQEFACLSSST-SSDSIGENSD--------DDEGGENEIESSYNGPLDMMESLEEALPIK 74

Query: 110 KGISKFYSGKSKSFTSLADASSVNSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPLKN 169
           + ISKFY GKSKSF SL++ SS+  +K++ KPEN YS++RRNL+++ +            
Sbjct: 75  RAISKFYKGKSKSFMSLSETSSL-PVKDLTKPENLYSRRRRNLLSHRIC----------- 134

Query: 170 NGGGISKRPISSSRSSLALAVAMSSSESNSSEDSNSS 207
           + GGISK+P  S        +AMS  E +SS   + S
Sbjct: 135 SRGGISKKPFKS-------VLAMSQREGDSSSSGDDS 143

BLAST of Lsi04G013820.1 vs. TAIR 10
Match: AT5G24890.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24550.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 59.3 bits (142), Expect = 5.4e-09
Identity = 46/116 (39.66%), Positives = 64/116 (55.17%), Query Frame = 0

Query: 60  TSSSSSIGRNSDQSARSSDGEDSGENDEVQSSYKG--PLDMMDSLEEVLPVRKGISKFYS 119
           +S SSSIG   D  +   + E   END+V S   G   L  M SLE+ LP ++G+S  Y 
Sbjct: 59  SSDSSSIGTPGD--SEEDEEESENENDDVSSKELGLRGLASMSSLEDSLPSKRGLSNHYK 118

Query: 120 GKSKSFTSLADASSVNSMKEIAKPENAYSKKRRNLMAYNLV------WEKNRSFPL 168
           GKSKSF +L +   + S+KE+AK EN  +K+RR  +   L       W+  +S PL
Sbjct: 119 GKSKSFGNLGE---IGSVKEVAKQENPLNKRRRLQICNKLARKSFYSWQNPKSMPL 169

BLAST of Lsi04G013820.1 vs. TAIR 10
Match: AT2G24550.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G31510.1); Has 219 Blast hits to 219 proteins in 33 species: Archae - 0; Bacteria - 0; Metazoa - 16; Fungi - 2; Plants - 184; Viruses - 0; Other Eukaryotes - 17 (source: NCBI BLink). )

HSP 1 Score: 57.8 bits (138), Expect = 1.6e-08
Identity = 46/122 (37.70%), Positives = 71/122 (58.20%), Query Frame = 0

Query: 49  ADESNTCSSASTSSSSSIGRNSDQSARSSDGEDSGENDEVQSSYKGPLD-MMDSLEEVLP 108
           ++ +N     S+ SSSSIG        SS+ E+  E D+  S  +G LD    SLE+ LP
Sbjct: 54  SNNNNKSPEESSDSSSSIG-------ESSENEEEEEEDDAVSCQRGTLDSFSSSLEDSLP 113

Query: 109 VRKGISKFYSGKSKSFTSLADASSVNSMKEIAKPENAYSKKRRNLMAYNLVWEKNRSFPL 168
           +++G+S  Y GKSKSF +L +A+S    K++ K EN ++K+RR ++A N +  + RS   
Sbjct: 114 IKRGLSNHYVGKSKSFGNLMEAAS--KAKDLEKVENPFNKRRRLVIA-NKLRRRGRSMSA 165

Query: 169 KN 170
            N
Sbjct: 174 SN 165

BLAST of Lsi04G013820.1 vs. TAIR 10
Match: AT5G56550.1 (oxidative stress 3 )

HSP 1 Score: 46.2 bits (108), Expect = 4.7e-05
Identity = 47/145 (32.41%), Positives = 75/145 (51.72%), Query Frame = 0

Query: 45  EKEEADESNTCSSASTSSSSSIGRNSDQSARSSDGEDSGENDEVQSSYKGPLDMMDSLEE 104
           E++   E +T  S    +SSS   +S  S  S   ED  ++D   SS  GPL+ +  L  
Sbjct: 26  EEDIVQEVSTTFSDEEDNSSSCSLSS--SMCSDFTEDDDDDDVSSSSSNGPLEDLSDLMS 85

Query: 105 VLPVRKGISKFYSGKSKSFTSLADASSVNSMKEIAKPENAYSKKRRNLMAYNLVWEKN-- 164
            LP+++G+SKFY GKS+SFTSL +  S+  + +       Y  KR++  +   + +++  
Sbjct: 86  HLPIKRGLSKFYEGKSQSFTSLGNVKSLEDLMKRGFKSRNYGAKRKSSRSTGGILDQSYK 145

Query: 165 RSFPLKNNGGGISKRPISSSRSSLA 188
           R F  K     ISK+P  +  S L+
Sbjct: 146 RVFSPK---ATISKKPNRTPSSVLS 165

The following BLAST results are available for this feature:

ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A0A0L451      1.9e-126  93.98         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G648530 PE=4 SV=1
A0A5A7VFR8      2.3e-124  93.63         Vitellogenin-2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold134G00169...
A0A1S3BU42      2.3e-124  93.63         vitellogenin-2 OS=Cucumis melo OX=3656 GN=LOC103493705 PE=4 SV=1
A0A6J1FCW4      1.5e-115  88.72         uncharacterized protein LOC111442964 OS=Cucurbita moschata OX=3662 GN=LOC1114429...
A0A6J1FT55      3.7e-114  89.10         uncharacterized protein LOC111446629 OS=Cucurbita moschata OX=3662 GN=LOC1114466...

NCBI nr
Match Name      E-value   Identity (%)  Description
XP_038896868.1  2.2e-129  96.24         uncharacterized protein LOC120085087 [Benincasa hispida]
XP_004141345.1  3.9e-126  93.98         vitellogenin-2 [Cucumis sativus] >KGN55382.1 hypothetical protein Csa_012692 [Cu...
XP_008452790.1  4.7e-124  93.63         PREDICTED: vitellogenin-2 [Cucumis melo] >KAA0064541.1 vitellogenin-2 [Cucumis m...
XP_022936302.1  3.1e-115  88.72         uncharacterized protein LOC111442964 [Cucurbita moschata] >XP_022936303.1 unchar...
XP_022941285.1  7.6e-114  89.10         uncharacterized protein LOC111446629 [Cucurbita moschata]

TAIR 10
Match Name      E-value   Identity (%)  Description
AT5G21940.1     1.4e-36   50.66         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT3G43850.1     5.6e-22   45.22         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G24890.1     5.4e-09   39.66         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G24550.1     1.6e-08   37.70         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G56550.1     4.7e-05   32.41         oxidative stress 3

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (USVL1VR-Ls) v1
Date Performed: 2021-10-18
IPR Term  IPR Description   Source       Source Term     Source Description                                         Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                                        coord: 1..24
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                                        coord: 191..232
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                                        coord: 43..97
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                                        coord: 191..209
None      No IPR available  MOBIDB_LITE  mobidb-lite     disorder_prediction                                        coord: 53..73
None      No IPR available  PANTHER      PTHR33172       OS08G0516900 PROTEIN                                       coord: 44..257
None      No IPR available  PANTHER      PTHR33172:SF37  MYOSIN LIGHT CHAIN KINASE DDB_G0279831 ISOFORM X1-RELATED  coord: 44..257

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name   Type
Lsi04G013820  Lsi04G013820  gene


The following exon feature(s) are a part of this mRNA:

Feature Name           Unique Name            Type
Lsi04G013820.1.exon.2  Lsi04G013820.1.exon.2  exon
Lsi04G013820.1.exon.1  Lsi04G013820.1.exon.1  exon


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name                     Unique Name                      Type
Lsi04G013820.1.five_prime_UTR.1  Lsi04G013820.1.five_prime_UTR.1  five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
Lsi04G013820.1.CDS.1  Lsi04G013820.1.CDS.1  CDS
Lsi04G013820.1.CDS.2  Lsi04G013820.1.CDS.2  CDS


The following polypeptide feature(s) derives from this mRNA:

Feature Name    Unique Name             Type
Lsi04G013820.1  Lsi04G013820.1-protein  polypeptide