Lsi03G003750 (gene) Bottle gourd (USVL1VR-Ls) v1

Overview
Name: Lsi03G003750
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls) v1)
Description: Unknown protein
Location: chr03: 4381584 .. 4382474 (+)
RNA-Seq Expression: Lsi03G003750
Synteny: Lsi03G003750
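
The Location field gives a span on chr03. Assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention), the feature length is end - start + 1. The following is a minimal Python sketch of that arithmetic; `parse_location` and `feature_length` are hypothetical helper names, not part of any database API.

```python
import re

def parse_location(text):
    """Split a location string like 'chr03: 4381584 .. 4382474 (+)' into its parts."""
    match = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def feature_length(start, end):
    # Assumption: 1-based, inclusive coordinates (not stated on the page).
    return end - start + 1

chrom, start, end, strand = parse_location("chr03: 4381584 .. 4382474 (+)")
length = feature_length(start, end)
print(chrom, strand, length)   # chr03 + 891
print(length % 3 == 0)         # True
```

Under that assumption the span is 891 bp, that is 297 codons, which is consistent with the 296-residue protein sequence listed for this feature plus one stop codon.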
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATGGCGGTGAAGCAGCGGCGCGGCGGATGGTTATCGGATCTTGTGGTAAGGCGATTATCAATCCGATAATGATGAGATTCCGGCCGATTGCGCCGAAGCCTGTTCCTGGAGGCTCTATTTCCGGTGGTGCATCGTCGTTGGAGAGTAAGAACTCGTTGATTTCCAAAGGAAGAACGAAGAGGAAGTACGTTAGGGTTAGGAGATATAATAGGAGGAAGAAAACAACAAGAAACACCACCGCCAGTACTGACTGTAAAACAAACGGGGAGATTGTCGATCATAATCAGACCGCCGTAACGACGACGACGACGACACTACAACTGCTACCGGTGGAATTGATCGGCGTCGGATCTGAGTGGACGGAGAATAAAAGGGATGGTGGAAGGATGGAGAAGGAGATAGTATTGGTATCAGATCCGAAACGTGGAGCGGTGGAATTGTGGGTGACGGTAGAATGCGTGACGGACGCATGCATGGATCTAGGCGTGAAGGAGGCAGAGATAGGGTGTACGGATGAGGAGAGGATAAAGAATCTAGAGGAAGACACGTGTCCAGGGTTTGTATCGGACGGTATGAATAGAGTAGAGTGGTTAAACAAGGCATTTAAGAGGATGGTTTGGCAAAGGCAAAGGGAAAAGAGAAGGAACGAGGGTGAGTCACCTGCAGAAGTAGGTGTATGGTTAATATCAAAACCCAAATTACCTTGTTTGAGGAGAGTATTTACGTGCCAAATAAAGCTGCAATTCAAAAGAGGTACGGAAATGGAGAAGGACTCCAAAGTTGTGCCTTGTGATGCGTGGAGAATGGACGGTGGAGGATTTGCATGGAAACTAGACGTCAAAGCTGCTCTTACTTTAGCTCCTTTGCTTCAAGAAGATTTTGATTGA

mRNA sequence

ATGGATGGCGGTGAAGCAGCGGCGCGGCGGATGGTTATCGGATCTTGTGGTAAGGCGATTATCAATCCGATAATGATGAGATTCCGGCCGATTGCGCCGAAGCCTGTTCCTGGAGGCTCTATTTCCGGTGGTGCATCGTCGTTGGAGAGTAAGAACTCGTTGATTTCCAAAGGAAGAACGAAGAGGAAGTACGTTAGGGTTAGGAGATATAATAGGAGGAAGAAAACAACAAGAAACACCACCGCCAGTACTGACTGTAAAACAAACGGGGAGATTGTCGATCATAATCAGACCGCCGTAACGACGACGACGACGACACTACAACTGCTACCGGTGGAATTGATCGGCGTCGGATCTGAGTGGACGGAGAATAAAAGGGATGGTGGAAGGATGGAGAAGGAGATAGTATTGGTATCAGATCCGAAACGTGGAGCGGTGGAATTGTGGGTGACGGTAGAATGCGTGACGGACGCATGCATGGATCTAGGCGTGAAGGAGGCAGAGATAGGGTGTACGGATGAGGAGAGGATAAAGAATCTAGAGGAAGACACGTGTCCAGGGTTTGTATCGGACGGTATGAATAGAGTAGAGTGGTTAAACAAGGCATTTAAGAGGATGGTTTGGCAAAGGCAAAGGGAAAAGAGAAGGAACGAGGGTGAGTCACCTGCAGAAGTAGGTGTATGGTTAATATCAAAACCCAAATTACCTTGTTTGAGGAGAGTATTTACGTGCCAAATAAAGCTGCAATTCAAAAGAGGTACGGAAATGGAGAAGGACTCCAAAGTTGTGCCTTGTGATGCGTGGAGAATGGACGGTGGAGGATTTGCATGGAAACTAGACGTCAAAGCTGCTCTTACTTTAGCTCCTTTGCTTCAAGAAGATTTTGATTGA

Coding sequence (CDS)

ATGGATGGCGGTGAAGCAGCGGCGCGGCGGATGGTTATCGGATCTTGTGGTAAGGCGATTATCAATCCGATAATGATGAGATTCCGGCCGATTGCGCCGAAGCCTGTTCCTGGAGGCTCTATTTCCGGTGGTGCATCGTCGTTGGAGAGTAAGAACTCGTTGATTTCCAAAGGAAGAACGAAGAGGAAGTACGTTAGGGTTAGGAGATATAATAGGAGGAAGAAAACAACAAGAAACACCACCGCCAGTACTGACTGTAAAACAAACGGGGAGATTGTCGATCATAATCAGACCGCCGTAACGACGACGACGACGACACTACAACTGCTACCGGTGGAATTGATCGGCGTCGGATCTGAGTGGACGGAGAATAAAAGGGATGGTGGAAGGATGGAGAAGGAGATAGTATTGGTATCAGATCCGAAACGTGGAGCGGTGGAATTGTGGGTGACGGTAGAATGCGTGACGGACGCATGCATGGATCTAGGCGTGAAGGAGGCAGAGATAGGGTGTACGGATGAGGAGAGGATAAAGAATCTAGAGGAAGACACGTGTCCAGGGTTTGTATCGGACGGTATGAATAGAGTAGAGTGGTTAAACAAGGCATTTAAGAGGATGGTTTGGCAAAGGCAAAGGGAAAAGAGAAGGAACGAGGGTGAGTCACCTGCAGAAGTAGGTGTATGGTTAATATCAAAACCCAAATTACCTTGTTTGAGGAGAGTATTTACGTGCCAAATAAAGCTGCAATTCAAAAGAGGTACGGAAATGGAGAAGGACTCCAAAGTTGTGCCTTGTGATGCGTGGAGAATGGACGGTGGAGGATTTGCATGGAAACTAGACGTCAAAGCTGCTCTTACTTTAGCTCCTTTGCTTCAAGAAGATTTTGATTGA

Protein sequence

MDGGEAAARRMVIGSCGKAIINPIMMRFRPIAPKPVPGGSISGGASSLESKNSLISKGRTKRKYVRVRRYNRRKKTTRNTTASTDCKTNGEIVDHNQTAVTTTTTTLQLLPVELIGVGSEWTENKRDGGRMEKEIVLVSDPKRGAVELWVTVECVTDACMDLGVKEAEIGCTDEERIKNLEEDTCPGFVSDGMNRVEWLNKAFKRMVWQRQREKRRNEGESPAEVGVWLISKPKLPCLRRVFTCQIKLQFKRGTEMEKDSKVVPCDAWRMDGGGFAWKLDVKAALTLAPLLQEDFD
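
The protein sequence can be cross-checked against the coding sequence by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1) and builds the codon table by hand so that it runs without third-party libraries; `translate` is a hypothetical helper, and only the first and last codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":          # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 21 nt of the CDS listed above.
print(translate("ATGGATGGCGGTGAAGCAGCGGCGCGGCGG"))  # MDGGEAAARR
print(translate("CTTCAAGAAGATTTTGATTGA"))           # LQEDFD
```

Both fragments agree with the start (MDGGEAAARR) and the end (LQEDFD) of the protein sequence; passing the full CDS to the same function would check the whole sequence.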
Homology
BLAST of Lsi03G003750 vs. ExPASy TrEMBL
Match: A0A6J1FKE3 (uncharacterized protein LOC111444824 OS=Cucurbita moschata OX=3662 GN=LOC111444824 PE=4 SV=1)

HSP 1 Score: 425.2 bits (1092), Expect = 2.2e-115
Identity = 225/306 (73.53%), Positives = 248/306 (81.05%), Query Frame = 0

Query: 1   MDGGEAAARRMVIGSCGKAIINPIMMRFRPIAPKPVPGGSISGGASSLESKNSLISKGRT 60
           MDGG AAAR M +GSCGK II+PIMMRFRPIAPKPV G S+S G SS++SKNS ISK RT
Sbjct: 1   MDGGGAAARPM-LGSCGKTIIHPIMMRFRPIAPKPVVGPSVSLGTSSVDSKNSSISKPRT 60

Query: 61  KRKYVRVRRYNRRKKTTRNTTASTDCKTNGEIVDHNQTAVTTTTTTLQLLP---VELIGV 120
           KRKYVRVRRYNR+KKTTRNTT STD   NGE VD        TTTTLQLLP   +EL+G 
Sbjct: 61  KRKYVRVRRYNRKKKTTRNTTGSTDGNKNGEFVDR----TAPTTTTLQLLPERHIELVGG 120

Query: 121 G--SEWTENKRDGGRMEKEIVLVSDPKRGAVELWVTVECVTDACMDLGVKEAEIGCTDEE 180
           G  SEW+E KR+ GRME ++VLVSDPKRGA+ELWVTVECV +ACMDL V+E EIGCTDEE
Sbjct: 121 GGRSEWSEKKREVGRMENDVVLVSDPKRGALELWVTVECVREACMDLEVEEGEIGCTDEE 180

Query: 181 RIKNLEEDTCPGFVSDGMNRVEWLNKAFKRMVWQRQREKRRNEGES------PAEVGVWL 240
           RIKNLEEDTCPGF+S+GMNRVEWLNKAFKRMV Q++  +R+ E  S       AEV VWL
Sbjct: 181 RIKNLEEDTCPGFISNGMNRVEWLNKAFKRMVRQKRERERQEETRSGSESGGAAEVAVWL 240

Query: 241 ISKPKLPCLRRVFTCQIKLQFKRGTEMEKDSKVVPCDAWRMDGGGFAWKLDVKAALTLAP 296
           IS PKL    RVF+CQIKLQFKR TEMEKDSKVVPCDAWR+D GGFAWKLDVKAALTLAP
Sbjct: 241 ISNPKLGMTSRVFSCQIKLQFKRDTEMEKDSKVVPCDAWRLDRGGFAWKLDVKAALTLAP 300
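
The percentages in each HSP header are simply the listed fractions over the alignment length. As a hedged illustration, the sketch below recomputes them for the first hit; `check_percentages` is a hypothetical helper, and the regular expression only assumes the `n/m (p%)` layout seen in the header lines.

```python
import re

# Matches fractions written as "225/306 (73.53%)".
FRACTION = re.compile(r"(\d+)/(\d+) \((\d+(?:\.\d+)?)%\)")

def check_percentages(text, tolerance=0.005):
    """Return (numerator, denominator, reported %, matches?) for each fraction."""
    results = []
    for num, den, reported in FRACTION.findall(text):
        recomputed = 100 * int(num) / int(den)
        agrees = abs(recomputed - float(reported)) <= tolerance
        results.append((int(num), int(den), float(reported), agrees))
    return results

# Identity and similarity fractions reported for the first TrEMBL hit.
print(check_percentages("225/306 (73.53%), 248/306 (81.05%)"))
# [(225, 306, 73.53, True), (248, 306, 81.05, True)]
```

The denominator (306) is the alignment length including gap columns, which is why it exceeds the 296-residue query.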

BLAST of Lsi03G003750 vs. ExPASy TrEMBL
Match: A0A6J1JZR6 (uncharacterized protein LOC111488911 OS=Cucurbita maxima OX=3661 GN=LOC111488911 PE=4 SV=1)

HSP 1 Score: 422.9 bits (1086), Expect = 1.1e-114
Identity = 224/303 (73.93%), Positives = 245/303 (80.86%), Query Frame = 0

Query: 1   MDGGEAAARRMVIGSCGKAIINPIMMRFRPIAPKPVPGGSISGGASSLESKNSLISKGRT 60
           MDGG AAAR M +GSCGK II+PIMMRFRPIAPKPV GGS+S G SS++SK S ISK RT
Sbjct: 1   MDGGGAAARPM-LGSCGKTIIHPIMMRFRPIAPKPVVGGSVSLGTSSVDSKTSSISKPRT 60

Query: 61  KRKYVRVRRYNRRKKTTRNTTASTDCKTNGEIVDHNQTAVTTTTTTLQLLP---VELIGV 120
           KRKYVRVRRYNR+KKTTRNTT STD   NGE VD        TTTTLQLLP   +EL+G 
Sbjct: 61  KRKYVRVRRYNRKKKTTRNTTGSTDGNKNGEFVDR---TAPPTTTTLQLLPERHIELVGG 120

Query: 121 G--SEWTENKRDGGRMEKEIVLVSDPKRGAVELWVTVECVTDACMDLGVKEAEIGCTDEE 180
           G  SEW+E KR+ GRME ++VLVSDPKRGA+ELWVTVECV +ACMDL V+E EIGCTDEE
Sbjct: 121 GGRSEWSEKKREVGRMENDVVLVSDPKRGALELWVTVECVREACMDLEVEEGEIGCTDEE 180

Query: 181 RIKNLEEDTCPGFVSDGMNRVEWLNKAFKRMV---WQRQREKRRNEGESPAEVGVWLISK 240
           RIKNL+EDTCPGFVS+GMNRVEWLNKAFKRMV    QRQ + R       AEV VWLIS 
Sbjct: 181 RIKNLDEDTCPGFVSNGMNRVEWLNKAFKRMVRQKRQRQEQTRSGSESGAAEVAVWLISN 240

Query: 241 PKLPCLRRVFTCQIKLQFKRGTEMEKDSKVVPCDAWRMDGGGFAWKLDVKAALTLAPLLQ 296
           PKL    RVF+CQIKLQFKR TEMEKDSKVVPCDAWR++ GGFAWKLDVKAALTLAP L 
Sbjct: 241 PKLGMTSRVFSCQIKLQFKRDTEMEKDSKVVPCDAWRLNRGGFAWKLDVKAALTLAPFLP 299

BLAST of Lsi03G003750 vs. ExPASy TrEMBL
Match: A0A5D3DL11 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold205G00490 PE=4 SV=1)

HSP 1 Score: 420.2 bits (1079), Expect = 7.0e-114
Identity = 238/325 (73.23%), Positives = 259/325 (79.69%), Query Frame = 0

Query: 1   MDGGEAAARRMVIGSCG-----KAIINPIMMRFRPIAPKPVPGGSISGGASSLESK--NS 60
           MDGG+AA+ R +IGSC      K IINPIMMRFRPIAPKP+PGGSI    SSL+SK  NS
Sbjct: 1   MDGGDAASTRRLIGSCTGGSNCKPIINPIMMRFRPIAPKPLPGGSI---PSSLDSKNNNS 60

Query: 61  LISKGRTKRKYVRVRRYNRRKK-TTRNTTASTDCKT-NGEIVDHNQTAVTTTTTTLQLLP 120
            ISKGRTKRKYVRVRRYNR+KK TTRN   + +  T +GE++DH+QTAV    TTLQLLP
Sbjct: 61  SISKGRTKRKYVRVRRYNRKKKTTTRNNHNNNNSTTEDGELMDHDQTAV----TTLQLLP 120

Query: 121 VELIGVGSEWTENKRDGGR--MEKEIVLVSDPKRG-AVELWVTVECVTDACMDLGVKEAE 180
           V  IG G   +ENK + GR  MEKE+ LVSDPK G  VELWVTVECVTDACMDL ++E E
Sbjct: 121 V--IG-GGGGSENKTETGRRMMEKEMRLVSDPKGGVGVELWVTVECVTDACMDLELREGE 180

Query: 181 IGCTDEERIKNLEEDTCPGFVSDGMNRVEWLNKAFKRMVWQRQRE--------KRRNEG- 240
           IGCTDEERIKNLE+DTCPGFVSDGMNRVEWLNKAFKRMVWQRQR         K+  EG 
Sbjct: 181 IGCTDEERIKNLEKDTCPGFVSDGMNRVEWLNKAFKRMVWQRQRNNNDNKSKMKKEEEGG 240

Query: 241 --------ESPAEVGVWLISKPKLPCLRRVFTCQIKLQFKRGTEMEKDSKVVPCDAWRMD 297
                   ESPAEV VWLISKPKLP +RRVFTCQIKLQFKRGTEMEKDS+VVPCDAWRMD
Sbjct: 241 KGDCEGDCESPAEVSVWLISKPKLPNMRRVFTCQIKLQFKRGTEMEKDSRVVPCDAWRMD 300

BLAST of Lsi03G003750 vs. ExPASy TrEMBL
Match: A0A1S3B3A6 (uncharacterized protein LOC103485503 OS=Cucumis melo OX=3656 GN=LOC103485503 PE=4 SV=1)

HSP 1 Score: 420.2 bits (1079), Expect = 7.0e-114
Identity = 238/325 (73.23%), Positives = 259/325 (79.69%), Query Frame = 0

Query: 1   MDGGEAAARRMVIGSCG-----KAIINPIMMRFRPIAPKPVPGGSISGGASSLESK--NS 60
           MDGG+AA+ R +IGSC      K IINPIMMRFRPIAPKP+PGGSI    SSL+SK  NS
Sbjct: 1   MDGGDAASTRRLIGSCTGGSNCKPIINPIMMRFRPIAPKPLPGGSI---PSSLDSKNNNS 60

Query: 61  LISKGRTKRKYVRVRRYNRRKK-TTRNTTASTDCKT-NGEIVDHNQTAVTTTTTTLQLLP 120
            ISKGRTKRKYVRVRRYNR+KK TTRN   + +  T +GE++DH+QTAV    TTLQLLP
Sbjct: 61  SISKGRTKRKYVRVRRYNRKKKTTTRNNHNNNNSTTEDGELMDHDQTAV----TTLQLLP 120

Query: 121 VELIGVGSEWTENKRDGGR--MEKEIVLVSDPKRG-AVELWVTVECVTDACMDLGVKEAE 180
           V  IG G   +ENK + GR  MEKE+ LVSDPK G  VELWVTVECVTDACMDL ++E E
Sbjct: 121 V--IG-GGGGSENKTETGRRMMEKEMRLVSDPKGGVGVELWVTVECVTDACMDLELREGE 180

Query: 181 IGCTDEERIKNLEEDTCPGFVSDGMNRVEWLNKAFKRMVWQRQRE--------KRRNEG- 240
           IGCTDEERIKNLE+DTCPGFVSDGMNRVEWLNKAFKRMVWQRQR         K+  EG 
Sbjct: 181 IGCTDEERIKNLEKDTCPGFVSDGMNRVEWLNKAFKRMVWQRQRNNNDNKSKMKKEEEGG 240

Query: 241 --------ESPAEVGVWLISKPKLPCLRRVFTCQIKLQFKRGTEMEKDSKVVPCDAWRMD 297
                   ESPAEV VWLISKPKLP +RRVFTCQIKLQFKRGTEMEKDS+VVPCDAWRMD
Sbjct: 241 KGDCEGDCESPAEVSVWLISKPKLPNMRRVFTCQIKLQFKRGTEMEKDSRVVPCDAWRMD 300

BLAST of Lsi03G003750 vs. ExPASy TrEMBL
Match: A0A0A0K8B5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G008060 PE=4 SV=1)

HSP 1 Score: 416.8 bits (1070), Expect = 7.7e-113
Identity = 235/328 (71.65%), Positives = 258/328 (78.66%), Query Frame = 0

Query: 1   MDGGEAAARRMVIGSCG-----KAIINPIMMRFRPIAPKPVPGGSISGGASSLESK--NS 60
           MDGGEAA+ R +IGSC      K I+NPIMMRFRPIAPKP+PGGS+    SSL+SK  NS
Sbjct: 1   MDGGEAASTRRMIGSCTGGSNCKPILNPIMMRFRPIAPKPLPGGSV---PSSLDSKNNNS 60

Query: 61  LISKGRTKRKYVRVRRYNRRKK-TTRNTTASTDCKTNGEIVDHNQTAVTTTTTTLQLLPV 120
            ISKGRTKRKYVRVRRYNR+KK TTRN  ++T+   +GE++DH+QTAV    TTLQLLPV
Sbjct: 61  SISKGRTKRKYVRVRRYNRKKKTTTRNNNSTTE---DGELMDHDQTAV----TTLQLLPV 120

Query: 121 ELIGVGSEWTENKRDGGR--MEKEIVLVSDPKRG-AVELWVTVECVTDACMDLGVKEAEI 180
             IG G   +ENK + GR  MEKE+ LVSDPK G  VELWVTVECV DACMDL ++E EI
Sbjct: 121 --IG-GGGGSENKTETGRRMMEKEMRLVSDPKGGVGVELWVTVECVRDACMDLELREGEI 180

Query: 181 GCTDEERIKNLEEDTCPGFVSDGMNRVEWLNKAFKRMVWQRQR----------EKRRNEG 240
           GCTDEERIKNLE DTCPGFVSDGMNRVEWLNKAFKRMVWQR+R          +K   EG
Sbjct: 181 GCTDEERIKNLEMDTCPGFVSDGMNRVEWLNKAFKRMVWQRERNNKDSKSKKKKKEEEEG 240

Query: 241 -----------ESPAEVGVWLISKPKLPCLRRVFTCQIKLQFKRGTEMEKDSKVVPCDAW 297
                      ESPAEV VWLISKPKLP +RRVFTCQIKLQFKRGTEMEKDS+VVPCDAW
Sbjct: 241 GKGECDCDCDCESPAEVSVWLISKPKLPNMRRVFTCQIKLQFKRGTEMEKDSRVVPCDAW 300

BLAST of Lsi03G003750 vs. NCBI nr
Match: XP_038886679.1 (uncharacterized protein LOC120076820 [Benincasa hispida])

HSP 1 Score: 504.6 bits (1298), Expect = 5.8e-139
Identity = 257/305 (84.26%), Positives = 270/305 (88.52%), Query Frame = 0

Query: 1   MDGGEAAARRMVIGSCGKAIINPIMMRFRPIAPKPVPGGSISGGASSLESKNSLISKGRT 60
           MDGGEAAA+RM++GSCGK IINPIMMRFRPIAPKPVPGGSISGGASSLESKNS ISKGRT
Sbjct: 1   MDGGEAAAQRMIVGSCGKTIINPIMMRFRPIAPKPVPGGSISGGASSLESKNSSISKGRT 60

Query: 61  KRKYVRVRRYNRRKKTTRNTTASTDCKTNGEIVDHNQTAVTT--TTTTLQLLPVELIGVG 120
           KRKYVRVRRYNR+KKTTR          NGEI +H++ AVTT   TTTLQLLPVE IG G
Sbjct: 61  KRKYVRVRRYNRKKKTTR----------NGEIANHDEAAVTTAEATTTLQLLPVEFIGGG 120

Query: 121 SEWTENKRDGGRMEKEIVLVSDPKRGAVELWVTVECVTDACMDLGVKEAEIGCTDEERIK 180
           SEW+ENK   GRMEKE+VLVSDPKR AVELWVTVECVTDACMDL +KE EIGCTDEERIK
Sbjct: 121 SEWSENKTVTGRMEKEMVLVSDPKREAVELWVTVECVTDACMDLELKEGEIGCTDEERIK 180

Query: 181 NLEEDTCPGFVSDGMNRVEWLNKAFKRMVWQRQREKRR-------NEGESPAEVGVWLIS 240
           NLEEDTCPGFVSDGMNRVEWLNKAFKRMVWQR+RE+ +        EGESPAEVGVWLIS
Sbjct: 181 NLEEDTCPGFVSDGMNRVEWLNKAFKRMVWQREREREKRIRNINEGEGESPAEVGVWLIS 240

Query: 241 KPKLPCLRRVFTCQIKLQFKRGTEMEKDSKVVPCDAWRMDGGGFAWKLDVKAALTLAPLL 297
           KPKLP LRRVFTCQIKLQFKRGTEMEKDSKVVPCDAWRMDGGGFAWKLDVKAALTLAPLL
Sbjct: 241 KPKLPNLRRVFTCQIKLQFKRGTEMEKDSKVVPCDAWRMDGGGFAWKLDVKAALTLAPLL 295

BLAST of Lsi03G003750 vs. NCBI nr
Match: KAG7015962.1 (hypothetical protein SDJN02_21066, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 426.0 bits (1094), Expect = 2.6e-115
Identity = 225/306 (73.53%), Positives = 248/306 (81.05%), Query Frame = 0

Query: 1   MDGGEAAARRMVIGSCGKAIINPIMMRFRPIAPKPVPGGSISGGASSLESKNSLISKGRT 60
           MDGG AAAR M +GSCGK II+PIMMRFRPIAPKPV G S+S G SS++SKNS ISK RT
Sbjct: 1   MDGGGAAARPM-LGSCGKTIIHPIMMRFRPIAPKPVVGASVSLGTSSVDSKNSSISKPRT 60

Query: 61  KRKYVRVRRYNRRKKTTRNTTASTDCKTNGEIVDHNQTAVTTTTTTLQLLP---VELIGV 120
           KRKYVRVRRYNR+KKTTRNTT STD   NGE VD        TTTTLQLLP   +EL+G 
Sbjct: 61  KRKYVRVRRYNRKKKTTRNTTGSTDGNKNGEFVDR----TAPTTTTLQLLPERHIELVGG 120

Query: 121 G--SEWTENKRDGGRMEKEIVLVSDPKRGAVELWVTVECVTDACMDLGVKEAEIGCTDEE 180
           G  SEW+E KR+ GRME ++VLVSDPKRGA+ELWVTVECV +ACMDL V+E EIGCTDEE
Sbjct: 121 GGRSEWSEKKREVGRMENDVVLVSDPKRGALELWVTVECVREACMDLEVEEGEIGCTDEE 180

Query: 181 RIKNLEEDTCPGFVSDGMNRVEWLNKAFKRMVWQRQREKRRNEGES------PAEVGVWL 240
           RIKNLEEDTCPGF+S+GMNRVEWLNKAFKRMV Q++  +R+ E  S       AEV VWL
Sbjct: 181 RIKNLEEDTCPGFISNGMNRVEWLNKAFKRMVRQKRERERQEETRSGSESGGAAEVAVWL 240

Query: 241 ISKPKLPCLRRVFTCQIKLQFKRGTEMEKDSKVVPCDAWRMDGGGFAWKLDVKAALTLAP 296
           IS PKL    RVF+CQIKLQFKR TEMEKDSKVVPCDAWR+D GGFAWKLDVKAALTLAP
Sbjct: 241 ISNPKLGMTSRVFSCQIKLQFKRDTEMEKDSKVVPCDAWRLDRGGFAWKLDVKAALTLAP 300

BLAST of Lsi03G003750 vs. NCBI nr
Match: XP_022938665.1 (uncharacterized protein LOC111444824 [Cucurbita moschata] >XP_022938666.1 uncharacterized protein LOC111444824 [Cucurbita moschata] >KAG6578382.1 hypothetical protein SDJN03_22830, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 425.2 bits (1092), Expect = 4.5e-115
Identity = 225/306 (73.53%), Positives = 248/306 (81.05%), Query Frame = 0

Query: 1   MDGGEAAARRMVIGSCGKAIINPIMMRFRPIAPKPVPGGSISGGASSLESKNSLISKGRT 60
           MDGG AAAR M +GSCGK II+PIMMRFRPIAPKPV G S+S G SS++SKNS ISK RT
Sbjct: 1   MDGGGAAARPM-LGSCGKTIIHPIMMRFRPIAPKPVVGPSVSLGTSSVDSKNSSISKPRT 60

Query: 61  KRKYVRVRRYNRRKKTTRNTTASTDCKTNGEIVDHNQTAVTTTTTTLQLLP---VELIGV 120
           KRKYVRVRRYNR+KKTTRNTT STD   NGE VD        TTTTLQLLP   +EL+G 
Sbjct: 61  KRKYVRVRRYNRKKKTTRNTTGSTDGNKNGEFVDR----TAPTTTTLQLLPERHIELVGG 120

Query: 121 G--SEWTENKRDGGRMEKEIVLVSDPKRGAVELWVTVECVTDACMDLGVKEAEIGCTDEE 180
           G  SEW+E KR+ GRME ++VLVSDPKRGA+ELWVTVECV +ACMDL V+E EIGCTDEE
Sbjct: 121 GGRSEWSEKKREVGRMENDVVLVSDPKRGALELWVTVECVREACMDLEVEEGEIGCTDEE 180

Query: 181 RIKNLEEDTCPGFVSDGMNRVEWLNKAFKRMVWQRQREKRRNEGES------PAEVGVWL 240
           RIKNLEEDTCPGF+S+GMNRVEWLNKAFKRMV Q++  +R+ E  S       AEV VWL
Sbjct: 181 RIKNLEEDTCPGFISNGMNRVEWLNKAFKRMVRQKRERERQEETRSGSESGGAAEVAVWL 240

Query: 241 ISKPKLPCLRRVFTCQIKLQFKRGTEMEKDSKVVPCDAWRMDGGGFAWKLDVKAALTLAP 296
           IS PKL    RVF+CQIKLQFKR TEMEKDSKVVPCDAWR+D GGFAWKLDVKAALTLAP
Sbjct: 241 ISNPKLGMTSRVFSCQIKLQFKRDTEMEKDSKVVPCDAWRLDRGGFAWKLDVKAALTLAP 300

BLAST of Lsi03G003750 vs. NCBI nr
Match: XP_022992618.1 (uncharacterized protein LOC111488911 [Cucurbita maxima] >XP_022992619.1 uncharacterized protein LOC111488911 [Cucurbita maxima])

HSP 1 Score: 422.9 bits (1086), Expect = 2.2e-114
Identity = 224/303 (73.93%), Positives = 245/303 (80.86%), Query Frame = 0

Query: 1   MDGGEAAARRMVIGSCGKAIINPIMMRFRPIAPKPVPGGSISGGASSLESKNSLISKGRT 60
           MDGG AAAR M +GSCGK II+PIMMRFRPIAPKPV GGS+S G SS++SK S ISK RT
Sbjct: 1   MDGGGAAARPM-LGSCGKTIIHPIMMRFRPIAPKPVVGGSVSLGTSSVDSKTSSISKPRT 60

Query: 61  KRKYVRVRRYNRRKKTTRNTTASTDCKTNGEIVDHNQTAVTTTTTTLQLLP---VELIGV 120
           KRKYVRVRRYNR+KKTTRNTT STD   NGE VD        TTTTLQLLP   +EL+G 
Sbjct: 61  KRKYVRVRRYNRKKKTTRNTTGSTDGNKNGEFVDR---TAPPTTTTLQLLPERHIELVGG 120

Query: 121 G--SEWTENKRDGGRMEKEIVLVSDPKRGAVELWVTVECVTDACMDLGVKEAEIGCTDEE 180
           G  SEW+E KR+ GRME ++VLVSDPKRGA+ELWVTVECV +ACMDL V+E EIGCTDEE
Sbjct: 121 GGRSEWSEKKREVGRMENDVVLVSDPKRGALELWVTVECVREACMDLEVEEGEIGCTDEE 180

Query: 181 RIKNLEEDTCPGFVSDGMNRVEWLNKAFKRMV---WQRQREKRRNEGESPAEVGVWLISK 240
           RIKNL+EDTCPGFVS+GMNRVEWLNKAFKRMV    QRQ + R       AEV VWLIS 
Sbjct: 181 RIKNLDEDTCPGFVSNGMNRVEWLNKAFKRMVRQKRQRQEQTRSGSESGAAEVAVWLISN 240

Query: 241 PKLPCLRRVFTCQIKLQFKRGTEMEKDSKVVPCDAWRMDGGGFAWKLDVKAALTLAPLLQ 296
           PKL    RVF+CQIKLQFKR TEMEKDSKVVPCDAWR++ GGFAWKLDVKAALTLAP L 
Sbjct: 241 PKLGMTSRVFSCQIKLQFKRDTEMEKDSKVVPCDAWRLNRGGFAWKLDVKAALTLAPFLP 299

BLAST of Lsi03G003750 vs. NCBI nr
Match: XP_023551035.1 (uncharacterized protein LOC111808988 [Cucurbita pepo subsp. pepo] >XP_023551036.1 uncharacterized protein LOC111808988 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 422.2 bits (1084), Expect = 3.8e-114
Identity = 224/306 (73.20%), Positives = 248/306 (81.05%), Query Frame = 0

Query: 1   MDGGEAAARRMVIGSCGKAIINPIMMRFRPIAPKPVPGGSISGGASSLESKNSLISKGRT 60
           MDGG AAAR M +GSCGK II+PIMMRFRPIAPKPV G S+S G SS++SKNS ISK RT
Sbjct: 1   MDGGGAAARPM-LGSCGKTIIHPIMMRFRPIAPKPVVGPSVSLGTSSVDSKNSSISKPRT 60

Query: 61  KRKYVRVRRYNRRKKTTRNTTASTDCKTNGEIVDHNQTAVTTTTTTLQLLP---VELIGV 120
           KRKYVRVRRYNR+KKTTRNTT STD   NGE VD      TTTTTTL+LLP   +EL+G 
Sbjct: 61  KRKYVRVRRYNRKKKTTRNTTGSTDGNKNGEFVDR----TTTTTTTLKLLPERHIELVGG 120

Query: 121 G--SEWTENKRDGGRMEKEIVLVSDPKRGAVELWVTVECVTDACMDLGVKEAEIGCTDEE 180
           G  SEW+E KR+  RME ++VLVSDPKRGA+ELWVTVECV +ACMDL V+E EIGCTDEE
Sbjct: 121 GGRSEWSEKKREVERMENDVVLVSDPKRGALELWVTVECVREACMDLEVEEGEIGCTDEE 180

Query: 181 RIKNLEEDTCPGFVSDGMNRVEWLNKAFKRMVWQRQREKRRNEGES------PAEVGVWL 240
           RIKNLEEDTCPGF+S+GMNRVEWLNKAFKRMV Q++  +R+ E  S       AEV VWL
Sbjct: 181 RIKNLEEDTCPGFISNGMNRVEWLNKAFKRMVRQKRERERQEETRSGSESGGAAEVAVWL 240

Query: 241 ISKPKLPCLRRVFTCQIKLQFKRGTEMEKDSKVVPCDAWRMDGGGFAWKLDVKAALTLAP 296
           IS PKL     VF+CQIKLQFKR TEMEKDSKVVPCDAWR+D GGFAWKLDVKAALTLAP
Sbjct: 241 ISNPKLGMTSPVFSCQIKLQFKRDTEMEKDSKVVPCDAWRLDRGGFAWKLDVKAALTLAP 300

BLAST of Lsi03G003750 vs. TAIR 10
Match: AT3G27250.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G40800.1); Has 104 Blast hits to 104 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 104; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 146.4 bits (368), Expect = 3.7e-35
Identity = 100/285 (35.09%), Positives = 148/285 (51.93%), Query Frame = 0

Query: 18  KAIINPIMMRFRPIAPKPVPGGSISGGASSLESKNSLISKGRTKRKYVRVRRYNRRKKTT 77
           K  ++ +M+R+RPIAPKP  G     G +   + +S     RTKRKYVRV + N  K T 
Sbjct: 19  KVSVDALMLRYRPIAPKPTTGQPC--GVADNNNNSSYGMSKRTKRKYVRVSKNN--KGTC 78

Query: 78  RNTTASTDCKTNGEIVDHNQTAVTTTTTTLQLLPVELIGVGSEWTENKRDGGRMEKEIVL 137
           R        K+  ++ D  +    T   TLQLLP E   +  E++   +D      + ++
Sbjct: 79  RG-------KSRSDLSDDRE---QTDVVTLQLLP-EKSDISGEYSPLDQDSLDPSVKSII 138

Query: 138 VSDPKR------------GAVELWVTVECVTDACMDLGVKEAEIGCTDEERIKNLEEDTC 197
             + +               +E WVTVE VT  C +  +    +G TD E + NL +DTC
Sbjct: 139 GEETQETNTWGMFNGSVTAEMETWVTVESVTSVC-EGSLSSHAVGITDVEIVDNLGKDTC 198

Query: 198 PGFVSDGMNRVEWLNKAFKRMVWQRQREKRRNEGESPAEVGVWLISK---PKLPCLRRVF 257
           P FVSDG NRV W+N+A++R V         +      EV VWL+++     + C  + F
Sbjct: 199 PAFVSDGSNRVVWVNEAYRRNV-----SGDDSTASVSPEVVVWLVAEEATAAMHCNYQAF 258

Query: 258 TCQIKLQFKRGTEMEKDSKVVPCDAWRMDGGGFAWKLDVKAALTL 288
           TC++++Q+    +  K +K VPCD W+M+ GGFAW+LD  AALTL
Sbjct: 259 TCRVRMQYT--WKETKYTKTVPCDVWKMEFGGFAWRLDTTAALTL 280

BLAST of Lsi03G003750 vs. TAIR 10
Match: AT5G63350.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G48510.1); Has 103 Blast hits to 102 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 2; Fungi - 0; Plants - 101; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 139.0 bits (349), Expect = 6.0e-33
Identity = 113/301 (37.54%), Positives = 149/301 (49.50%), Query Frame = 0

Query: 14  GSCGKAIINPIMMRFRPIAPKPV-PGGSI------------SGGASSLESKNSLISKGRT 73
           G  G +  + IM+RFRPIAPKP   GGS+            SGG+S L  K+     GR 
Sbjct: 17  GRYGLSKADRIMLRFRPIAPKPASDGGSVSLTGKSGSTTTTSGGSSDLSGKS-----GRG 76

Query: 74  KRKYVR------VRRYNRRKKTTRNTTASTDCKTNGEIVDHNQTAVTTTTTTLQLLPVEL 133
           KRKY +       RR N++K+     TA+T   T   +    +T        L   PVE 
Sbjct: 77  KRKYQKDCSGGNSRRCNKKKRDLSGDTATTTAVTLSLL---PETPEKRVFPDLNAFPVEK 136

Query: 134 IGVGSEWTENKRDGGRMEKEIVLVSDPKRG-AVELWVTVECVTDACMDLGVKEAEIGCTD 193
                    +   GG +          +R   V   VTVE VTDA +D       +G T+
Sbjct: 137 QKRNGPLWLSFNGGGEILTPYKTAEISRRTVVVSSCVTVERVTDAWID----GYGLGETN 196

Query: 194 EERIKNLEEDTCPGFVSDGMNRVEWLNKAFKRMVWQRQREKRRNEGESPAE-------VG 253
           +ER  NL EDTCPGF+SDG+ RV W N+A+K+M   R+      E   P +       V 
Sbjct: 197 QERKMNLVEDTCPGFISDGVGRVTWTNEAYKKMA--REDINIPMEEGVPEDISYDNFHVN 256

Query: 254 VWLISKPKLPCLRRVFTCQIKLQFKRGTEMEKDSKVVPCDAWRMDGGGFAWKLDVKAALT 288
           V L+ K +       FTC+++LQ+    + E+ S  VPCD WRMDGGGFAW+LDVKAAL 
Sbjct: 257 VRLVMKERPMLTYPAFTCRVRLQY-TCQDRERGSVTVPCDVWRMDGGGFAWRLDVKAALC 302

BLAST of Lsi03G003750 vs. TAIR 10
Match: AT5G50360.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G48510.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 136.3 bits (342), Expect = 3.9e-32
Identity = 103/298 (34.56%), Positives = 142/298 (47.65%), Query Frame = 0

Query: 21  INPIMMRFRPIAPKPVPGGSISGGASSLESKNSLIS-----KGRTKRKYVRVRRYNRRKK 80
           ++ IM+R+RPIAP+P  GGS    AS  E   S+I+       R KRKY +         
Sbjct: 24  VDRIMLRYRPIAPRPDSGGS---PASPTEKNGSVITNVSSRSRRGKRKYSK----ENNSS 83

Query: 81  TTRNTTASTDCKTNGEIVDHNQTAVTTTTTTLQLLPVELIGVGSEWTENKRDGGRMEKEI 140
           +T +  ++ + K        N +       TL LLP          T  K+D     K  
Sbjct: 84  STGSVNSNGNSKRQRNDETKNGSGGGREIVTLPLLPE---------TPEKKDSPLKAK-- 143

Query: 141 VLVSDPKRGAVELW--------------------------VTVECVTDACMDLGVKEAEI 200
              + P+ GA  LW                          +TVECVT+  M+    E E+
Sbjct: 144 ---AAPELGAAALWLSFNDGASYNRRYQTELMTETVVSSLLTVECVTERLME---GEYEL 203

Query: 201 GCTDEERIKNLEEDTCPGFVSDGMNRVEWLNKAFKRMVWQRQREKRRNEGESPAEVGVWL 260
           GCTDEER  NLE DTCPGF+SDG+ RV W N +++ +V  +  E+        +++ VWL
Sbjct: 204 GCTDEERKMNLERDTCPGFISDGLGRVIWTNGSYRELVVGKDHEQ-------CSKMSVWL 263

Query: 261 ISKPKLPCLRRVFTCQIKLQFKRGTEMEKDSKVVPCDAWRMDGGGFAWKLDVKAALTL 288
           + K K     R FTC+++LQ+    + E  S    CD WRM  GGFAW+LDV AAL L
Sbjct: 264 VMKEKPLVTYRTFTCRMRLQY-TCRDKEVSSITSFCDVWRMSDGGFAWRLDVDAALCL 289

BLAST of Lsi03G003750 vs. TAIR 10
Match: AT5G40800.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G27250.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 134.4 bits (337), Expect = 1.5e-31
Identity = 107/294 (36.39%), Positives = 146/294 (49.66%), Query Frame = 0

Query: 18  KAIINPIMMRFRPIAPKPVPGGSISGGASSLESKNSLISKGRTKRKYVRVRRYNR---RK 77
           K  ++ +M+++RPIAPKP   G    G +S        S  RTKRKYVRV + N+   R 
Sbjct: 20  KTSVDTLMLKYRPIAPKPTTTGQPLVGDTS--------STRRTKRKYVRVSKNNKATCRS 79

Query: 78  KTTRNTTASTDCKTNGEIVDHNQTAVTTTTTTLQLLP----------------VELIGVG 137
           KT    ++STD +   E +            TLQLLP                VE I  G
Sbjct: 80  KTNGFRSSSTDPENGREDI-----------VTLQLLPERSTPLSLDHNNLDPTVETIN-G 139

Query: 138 SEW----TENKRDGGRMEKEIVLVSDPKRGAVELWVTVECVTDACMDLGVKEAEIGCTDE 197
            E     T  K +GG    + V         VE WVTVE V       G+    +G TDE
Sbjct: 140 DETCNTDTWLKFNGGDDALQQV--------PVETWVTVESVNS-----GLVSHAVGLTDE 199

Query: 198 ERIKNLEEDTCPGFVSDGMNRVEWLNKAFKRMVWQRQREKRRNEGESPAEVGVWL-ISKP 257
           E    L++DTCPGF+SDG NRV  +N+A++R+V          +G    EV VWL + + 
Sbjct: 200 ELTYALDKDTCPGFISDGSNRVVMVNEAYRRIV--------TGDGGFGREVIVWLVVDQT 259

Query: 258 KLPCLRRVFTCQIKLQFKRGTEMEKDSKVVPCDAWRMDGGGFAWKLDVKAALTL 288
              C  R FTC++++++       K +K +PCD W+M+ GGFAW+LD  AALTL
Sbjct: 260 ATFCDYRTFTCKVRMEYT--WRETKYTKTLPCDVWKMEFGGFAWRLDTTAALTL 270

BLAST of Lsi03G003750 vs. TAIR 10
Match: AT5G40790.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G27250.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 133.3 bits (334), Expect = 3.3e-31
Identity = 106/301 (35.22%), Positives = 152/301 (50.50%), Query Frame = 0

Query: 18  KAIINPIMMRFRPIAPKPVPGGSISGGASSLESKNSLISKGRTKRKYVRVRRYNRRKKTT 77
           K +I+ IM RFRPIAPKP  G S    +    S   L    R+KRKYVRVR  +++  + 
Sbjct: 21  KTVISKIMQRFRPIAPKPAVGES----SDDTNSDRFLGRNRRSKRKYVRVR--DKKNSSG 80

Query: 78  RNTTASTDCKTNGEIVDHNQTAV--------TTTTTTLQLLPVELIGVG----------S 137
            N       K NG    + +T +         T   TLQLLP +   +G          S
Sbjct: 81  SNNKKDITGKKNGCDRGNIKTDLDKKIDGDDRTDIVTLQLLPEKDRDIGNNGDKAGEFCS 140

Query: 138 EWTENKRDGGRMEKEIVLVSDPKRGAVELWVTVECVTDACMDLGVKE--AEIGCTD--EE 197
           + ++           I L S   R  VE W+TVECV+D C DLG      ++G  D  EE
Sbjct: 141 DLSDMDPKKSLYNSIIGLSSSFDRKVVESWLTVECVSDTCTDLGWYHILEQLGRMDQAEE 200

Query: 198 RI-KNLEEDTCPGFVSDGMNRVEWLNKAFKRMVWQRQREKRRNEGESPAEV-GVWLI--- 257
           R+ + LE DTCP  VSDG NRV W+N+A++RM+           G    +V  VWL+   
Sbjct: 201 RVMRMLEVDTCPWLVSDGSNRVCWVNRAYRRMM-----------GAPDVDVIRVWLVVAM 260

Query: 258 -SKPKLPCLRRVF---TCQIKLQFKRGTEMEKDSKVVPCDAWRMDGGGFAWKLDVKAALT 288
               ++ C+  ++   TC+++++++  T  +     VPCD WR+  GGFAW+LDV++AL 
Sbjct: 261 DLMEEIACMVELYGAVTCRVRVRYEPSTWRK---MTVPCDVWRIRSGGFAWRLDVESALR 301

The following BLAST results are available for this feature:

BLAST of Lsi03G003750 vs. ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A6J1FKE3      2.2e-115  73.53         uncharacterized protein LOC111444824 OS=Cucurbita moschata OX=3662 GN=LOC1114448...
A0A6J1JZR6      1.1e-114  73.93         uncharacterized protein LOC111488911 OS=Cucurbita maxima OX=3661 GN=LOC111488911...
A0A5D3DL11      7.0e-114  73.23         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3B3A6      7.0e-114  73.23         uncharacterized protein LOC103485503 OS=Cucumis melo OX=3656 GN=LOC103485503 PE=...
A0A0A0K8B5      7.7e-113  71.65         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G008060 PE=4 SV=1

BLAST of Lsi03G003750 vs. NCBI nr
Match Name      E-value   Identity (%)  Description
XP_038886679.1  5.8e-139  84.26         uncharacterized protein LOC120076820 [Benincasa hispida]
KAG7015962.1    2.6e-115  73.53         hypothetical protein SDJN02_21066, partial [Cucurbita argyrosperma subsp. argyro...
XP_022938665.1  4.5e-115  73.53         uncharacterized protein LOC111444824 [Cucurbita moschata] >XP_022938666.1 unchar...
XP_022992618.1  2.2e-114  73.93         uncharacterized protein LOC111488911 [Cucurbita maxima] >XP_022992619.1 uncharac...
XP_023551035.1  3.8e-114  73.20         uncharacterized protein LOC111808988 [Cucurbita pepo subsp. pepo] >XP_023551036....

BLAST of Lsi03G003750 vs. TAIR 10
Match Name      E-value   Identity (%)  Description
AT3G27250.1     3.7e-35   35.09         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G63350.1     6.0e-33   37.54         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G50360.1     3.9e-32   34.56         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G40800.1     1.5e-31   36.39         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G40790.1     3.3e-31   35.22         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (USVL1VR-Ls) v1
Date Performed: 2021-10-18
IPR Term  IPR Description   Source   Source Term    Source Description                      Alignment
None      No IPR available  PANTHER  PTHR33595      VON WILLEBRAND FACTOR A DOMAIN PROTEIN  coord: 1..114
None      No IPR available  PANTHER  PTHR33595      VON WILLEBRAND FACTOR A DOMAIN PROTEIN  coord: 137..288
None      No IPR available  PANTHER  PTHR33595:SF4  EMB|CAB62340.1                          coord: 137..288
None      No IPR available  PANTHER  PTHR33595:SF4  EMB|CAB62340.1                          coord: 1..114
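
The PANTHER matches are reported as two segments, 1..114 and 137..288. As a rough sketch, assuming these are 1-based inclusive residue positions on the 296-residue protein listed above, the covered and uncovered stretches can be derived as follows; `parse_coord` and `uncovered_ranges` are hypothetical helper names.

```python
import re

PROTEIN_LENGTH = 296  # length of the protein sequence listed for this feature

def parse_coord(text):
    """Turn a string like 'coord: 137..288' into a (start, end) tuple."""
    match = re.fullmatch(r"coord:\s*(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized coordinate: {text!r}")
    return int(match.group(1)), int(match.group(2))

def uncovered_ranges(segments, length):
    """List the 1-based inclusive ranges not covered by any segment."""
    gaps = []
    next_free = 1
    for start, end in sorted(segments):
        if start > next_free:
            gaps.append((next_free, start - 1))
        next_free = max(next_free, end + 1)
    if next_free <= length:
        gaps.append((next_free, length))
    return gaps

# The family and subfamily rows repeat the same two segments, so a set suffices.
segments = {parse_coord("coord: 1..114"), parse_coord("coord: 137..288")}
gaps = uncovered_ranges(segments, PROTEIN_LENGTH)
covered = PROTEIN_LENGTH - sum(end - start + 1 for start, end in gaps)
print(gaps)      # [(115, 136), (289, 296)]
print(covered)   # 266
```

Under these assumptions 266 of 296 residues fall inside the PANTHER matches, leaving residues 115..136 and the last eight residues unmatched.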

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lsi03G003750.1  Lsi03G003750.1  mRNA