Tan0010199 (gene) Snake gourd v1

Overview
Name: Tan0010199
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Unknown protein
Location: LG10: 3716739 .. 3718122 (+)
RNA-Seq Expression: Tan0010199
Synteny: Tan0010199
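
The Location field gives a start and an end coordinate on linkage group LG10. As a minimal sketch, assuming those coordinates are 1-based and inclusive (a common annotation convention, but not stated on this page), the genomic span of the gene can be computed as below. The function name `span_length` is hypothetical and used only for illustration.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive interval (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field: LG10: 3716739 .. 3718122 (+)
gene_span = span_length(3716739, 3718122)
print(gene_span)  # 1384
```

Under that assumption the feature spans 1384 bp on the plus strand.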
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTCTGTCGTGTAAAATTTCAATTCCTTCTCTAGCGAAGTTTCTTCTTCTGCTTCCTCTTCGTTCTCATCGTCCAAAAATCTCCGATCAAATTCTCTGTAAATCCTCACAATCCGCCATTGAAACTCCACACGCCGAGTACGAGTTTGCGGTTTTACTAGTAAATGAGGTGCAAGAAGCACCAATTCGACTTCAGCAGTAGCATCGGCGCATGTGCTTCCTGTTTACGGGAGCGTCTTTTTTCGATCATCGCCGCTCAGGCACAGGCGGAGAAGCAATCGGAGCTCGGTTCCGCTGCCGGCGATATTCATGCGGCGGATGATACACCTCTTCCGCCGCCGCTCGTTTTTCTTCGTTCTGTCTCTCCTCATGATCGTCGTCCGAAATCGGATGAAGATTTGTGGAGCGATCTCGATCGCGACGGAAATCGACGCCATCGCCATCAGCGATTCTACAGTACTCCGCAAATCGGACCTAATTGCATAACGAACAACAGTGCGAACACTACATTCGTCAGTACAGGCTCGTTCGATCGGAAGCCGAGGAGGAAATTCTCGCTCTGGTCCAAGCTTTTTAGGTCTAGATCCGAGAAATTCGAGAAGAATCATAAAAGTCCTTCTCGAGAATCGAATGGTCCGGATTCGTCTTCTTTATCGCCGTCGTGGTTCTCGACGATCTTCAACGGCCGTCGGATTAAGCAGCAATCGAGCCTTGCGACAGTCGAGGAATCGATCGCCGGTGCCGAACGGAGGCATCATTGCCAGACGATTGAGCGCGGAATGTCGCCGGTTAGAGTTTCGGACTCGGATGAAGAAGAATGCGAAGGTCCCGATCGTTCTCCAATTTCTCAAAAGTTTCAACAATCTCCAATGGCGGCTCCTGGATCGGCGAAACGCGGGAGATTAGGGCACAAACAAAACGTTTCGGGATTCGCATTTTGTTTGAGTCCACTCGTACGAGCAAGTCCGAACCGGAATTGGAACCAGAAGGTAACTCCGCCGGAAAATTCTTTCTCCGGCAACCTCAGGGTTCCGGCGAAGCCTCATCTCTGTGCGAACCGGTCGAGGAAGATAGCGGACTTTGGGAGAGTCAATCACAACCGTTGATATCGACTACCGTTGACATTTGACCACAGAATTGATTGTTGATGGTGATACATTTGGCAAAAGTACGATTCTACCCCTCCTGTTTTCTTTTAAGATTTTTAATCTTTTTTCCTCTGTGAGAAAAGGCCACGCGGTGTCAGGGATTAGTTTCTTTGCTATTTCCCTATTCTTCATATGATCATGTGCATTAATTTATTATCAATACGCACACAAAAAAAAAAAAAGCAAACAAACAATAACACTTAATTCGACCTTGGGATTTGGGATTTGTGGAATTTTC

mRNA sequence

CTCTGTCGTGTAAAATTTCAATTCCTTCTCTAGCGAAGTTTCTTCTTCTGCTTCCTCTTCGTTCTCATCGTCCAAAAATCTCCGATCAAATTCTCTGTAAATCCTCACAATCCGCCATTGAAACTCCACACGCCGAGTACGAGTTTGCGGTTTTACTAGTAAATGAGGTGCAAGAAGCACCAATTCGACTTCAGCAGTAGCATCGGCGCATGTGCTTCCTGTTTACGGGAGCGTCTTTTTTCGATCATCGCCGCTCAGGCACAGGCGGAGAAGCAATCGGAGCTCGGTTCCGCTGCCGGCGATATTCATGCGGCGGATGATACACCTCTTCCGCCGCCGCTCGTTTTTCTTCGTTCTGTCTCTCCTCATGATCGTCGTCCGAAATCGGATGAAGATTTGTGGAGCGATCTCGATCGCGACGGAAATCGACGCCATCGCCATCAGCGATTCTACAGTACTCCGCAAATCGGACCTAATTGCATAACGAACAACAGTGCGAACACTACATTCGTCAGTACAGGCTCGTTCGATCGGAAGCCGAGGAGGAAATTCTCGCTCTGGTCCAAGCTTTTTAGGTCTAGATCCGAGAAATTCGAGAAGAATCATAAAAGTCCTTCTCGAGAATCGAATGGTCCGGATTCGTCTTCTTTATCGCCGTCGTGGTTCTCGACGATCTTCAACGGCCGTCGGATTAAGCAGCAATCGAGCCTTGCGACAGTCGAGGAATCGATCGCCGGTGCCGAACGGAGGCATCATTGCCAGACGATTGAGCGCGGAATGTCGCCGGTTAGAGTTTCGGACTCGGATGAAGAAGAATGCGAAGGTCCCGATCGTTCTCCAATTTCTCAAAAGTTTCAACAATCTCCAATGGCGGCTCCTGGATCGGCGAAACGCGGGAGATTAGGGCACAAACAAAACGTTTCGGGATTCGCATTTTGTTTGAGTCCACTCGTACGAGCAAGTCCGAACCGGAATTGGAACCAGAAGGTAACTCCGCCGGAAAATTCTTTCTCCGGCAACCTCAGGGTTCCGGCGAAGCCTCATCTCTGTGCGAACCGGTCGAGGAAGATAGCGGACTTTGGGAGAGTCAATCACAACCGTTGATATCGACTACCGTTGACATTTGACCACAGAATTGATTGTTGATGGTGATACATTTGGCAAAAGTACGATTCTACCCCTCCTGTTTTCTTTTAAGATTTTTAATCTTTTTTCCTCTGTGAGAAAAGGCCACGCGGTGTCAGGGATTAGTTTCTTTGCTATTTCCCTATTCTTCATATGATCATGTGCATTAATTTATTATCAATACGCACACAAAAAAAAAAAAAGCAAACAAACAATAACACTTAATTCGACCTTGGGATTTGGGATTTGTGGAATTTTC

Coding sequence (CDS)

ATGAGGTGCAAGAAGCACCAATTCGACTTCAGCAGTAGCATCGGCGCATGTGCTTCCTGTTTACGGGAGCGTCTTTTTTCGATCATCGCCGCTCAGGCACAGGCGGAGAAGCAATCGGAGCTCGGTTCCGCTGCCGGCGATATTCATGCGGCGGATGATACACCTCTTCCGCCGCCGCTCGTTTTTCTTCGTTCTGTCTCTCCTCATGATCGTCGTCCGAAATCGGATGAAGATTTGTGGAGCGATCTCGATCGCGACGGAAATCGACGCCATCGCCATCAGCGATTCTACAGTACTCCGCAAATCGGACCTAATTGCATAACGAACAACAGTGCGAACACTACATTCGTCAGTACAGGCTCGTTCGATCGGAAGCCGAGGAGGAAATTCTCGCTCTGGTCCAAGCTTTTTAGGTCTAGATCCGAGAAATTCGAGAAGAATCATAAAAGTCCTTCTCGAGAATCGAATGGTCCGGATTCGTCTTCTTTATCGCCGTCGTGGTTCTCGACGATCTTCAACGGCCGTCGGATTAAGCAGCAATCGAGCCTTGCGACAGTCGAGGAATCGATCGCCGGTGCCGAACGGAGGCATCATTGCCAGACGATTGAGCGCGGAATGTCGCCGGTTAGAGTTTCGGACTCGGATGAAGAAGAATGCGAAGGTCCCGATCGTTCTCCAATTTCTCAAAAGTTTCAACAATCTCCAATGGCGGCTCCTGGATCGGCGAAACGCGGGAGATTAGGGCACAAACAAAACGTTTCGGGATTCGCATTTTGTTTGAGTCCACTCGTACGAGCAAGTCCGAACCGGAATTGGAACCAGAAGGTAACTCCGCCGGAAAATTCTTTCTCCGGCAACCTCAGGGTTCCGGCGAAGCCTCATCTCTGTGCGAACCGGTCGAGGAAGATAGCGGACTTTGGGAGAGTCAATCACAACCGTTGA

Protein sequence

MRCKKHQFDFSSSIGACASCLRERLFSIIAAQAQAEKQSELGSAAGDIHAADDTPLPPPLVFLRSVSPHDRRPKSDEDLWSDLDRDGNRRHRHQRFYSTPQIGPNCITNNSANTTFVSTGSFDRKPRRKFSLWSKLFRSRSEKFEKNHKSPSRESNGPDSSSLSPSWFSTIFNGRRIKQQSSLATVEESIAGAERRHHCQTIERGMSPVRVSDSDEEECEGPDRSPISQKFQQSPMAAPGSAKRGRLGHKQNVSGFAFCLSPLVRASPNRNWNQKVTPPENSFSGNLRVPAKPHLCANRSRKIADFGRVNHNR
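
The protein sequence above corresponds to the coding sequence read codon by codon. The following is a minimal sketch of that translation, assuming the standard genetic code and an in-frame CDS that contains only A, C, G and T; the names `CODON_TABLE` and `translate` are hypothetical, and this is not necessarily how the database produced its protein record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS shown above
print(translate("ATGAGGTGCAAGAAGCACCAATTCGACTTC"))  # MRCKKHQFDF
```

The first ten codons give MRCKKHQFDF, matching the start of the protein sequence, and the final codons of the CDS (AAC CGT TGA) give NR followed by a stop.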
Homology
BLAST of Tan0010199 vs. NCBI nr
Match: KAG6597247.1 (hypothetical protein SDJN03_10427, partial [Cucurbita argyrosperma subsp. sororia] >KAG7028716.1 hypothetical protein SDJN02_09897, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 516.5 bits (1329), Expect = 1.6e-142
Identity = 267/315 (84.76%), Positives = 283/315 (89.84%), Query Frame = 0

Query: 1   MRCKKHQFDFSSSIGACASCLRERLFSIIAAQAQAEKQSELGSAA-GDIHAADDTPL-PP 60
           MRCKKHQFD SSSIG CASCLRERLF IIAAQAQAEKQSE+ +AA GD  AADD PL PP
Sbjct: 1   MRCKKHQFDCSSSIGVCASCLRERLFLIIAAQAQAEKQSEIRAAASGDNRAADDRPLPPP 60

Query: 61  PLVFLRSVSPHDRRPKSDEDLWSDLDRDGNRRHRHQRFYSTPQIGPNCITNNSANTTFVS 120
           PLVFLRSVSP+D R KSDE L SDLDR+GNRR+RHQRFYSTPQIGP+  TNN+AN+TFV+
Sbjct: 61  PLVFLRSVSPYDHRLKSDEGLLSDLDREGNRRNRHQRFYSTPQIGPDYRTNNTANSTFVT 120

Query: 121 TGSFDRKPRRKFSLWSKLFRSRSEKFEKNHKSPSRESNGPDSSSLSPSWFSTIFNGRRIK 180
           TGSFDRK R KFSLWSKLF+SRSEKFEK HKSPS ES+G DS+SLSPSWFSTI NGRRIK
Sbjct: 121 TGSFDRKKRSKFSLWSKLFKSRSEKFEKKHKSPSHESHGSDSASLSPSWFSTILNGRRIK 180

Query: 181 QQSSLATVEESIAGAERRHHCQTIERGMSPVRVSDSDEEECEGPDRSPISQKFQQSPMAA 240
           QQSSLATVEESIAGAERRHHCQTI+RGMSP  +S+SDEEECE   RSPISQKFQQSPM  
Sbjct: 181 QQSSLATVEESIAGAERRHHCQTIKRGMSPAIISESDEEECESHGRSPISQKFQQSPMTI 240

Query: 241 PGSAKRGRLGHKQNVSGFAFCLSPLVRASPNRNWNQKVTPPENSFSGNLRVPAKPHLCAN 300
           PGSAKR +LGHKQNVSGFAFCLSPLVRASPNRNWNQKV PPE SFSGNLRVP KPHLCAN
Sbjct: 241 PGSAKREKLGHKQNVSGFAFCLSPLVRASPNRNWNQKVMPPEVSFSGNLRVPGKPHLCAN 300

Query: 301 RSRKIADFGRVNHNR 314
           RSRKIADFGRVNHNR
Sbjct: 301 RSRKIADFGRVNHNR 315
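
The Identity and Positives figures in each HSP header are a count of matching (or conservatively substituted) columns over the alignment length, expressed as a percentage. A minimal sketch of that arithmetic, using the counts from the first hit above; the helper name `percent` is hypothetical:

```python
def percent(matches: int, aligned_length: int) -> float:
    """Share of alignment columns that match, as a percentage (2 decimals)."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts copied from the HSP header of KAG6597247.1
identity = percent(267, 315)
positives = percent(283, 315)
print(identity, positives)  # 84.76 89.84
```

The alignment length (315 here) counts alignment columns including gaps, which is why it can exceed the length of the query protein.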

BLAST of Tan0010199 vs. NCBI nr
Match: XP_022974739.1 (uncharacterized protein LOC111473474 [Cucurbita maxima] >XP_022975245.1 uncharacterized protein LOC111474358 [Cucurbita maxima])

HSP 1 Score: 513.1 bits (1320), Expect = 1.7e-141
Identity = 265/315 (84.13%), Positives = 282/315 (89.52%), Query Frame = 0

Query: 1   MRCKKHQFDFSSSIGACASCLRERLFSIIAAQAQAEKQSEL-GSAAGDIHAADDTPL-PP 60
           MRCKKHQFD SSSIG CASCLRERLF IIAAQAQAEKQSE+  +AAGD  AADD PL PP
Sbjct: 1   MRCKKHQFDCSSSIGVCASCLRERLFLIIAAQAQAEKQSEIRAAAAGDNRAADDRPLPPP 60

Query: 61  PLVFLRSVSPHDRRPKSDEDLWSDLDRDGNRRHRHQRFYSTPQIGPNCITNNSANTTFVS 120
           PLVFLRSVSP+D R KSDE L SDLDR+GNRR+RHQRFYSTPQIGP+  TNN+AN+TFV+
Sbjct: 61  PLVFLRSVSPYDHRLKSDEGLLSDLDREGNRRNRHQRFYSTPQIGPDYRTNNTANSTFVT 120

Query: 121 TGSFDRKPRRKFSLWSKLFRSRSEKFEKNHKSPSRESNGPDSSSLSPSWFSTIFNGRRIK 180
           TGSFDRK R KFSLWSKLF+SRSEKFEK HKSPS ES+G DS+SLSPSWFSTI NGRRIK
Sbjct: 121 TGSFDRKKRSKFSLWSKLFKSRSEKFEKKHKSPSHESHGSDSASLSPSWFSTILNGRRIK 180

Query: 181 QQSSLATVEESIAGAERRHHCQTIERGMSPVRVSDSDEEECEGPDRSPISQKFQQSPMAA 240
           QQSSLATVEESIAGAERRHHC+TI+RGMSP  +S+SDEEECE   RSPISQKFQQSPM  
Sbjct: 181 QQSSLATVEESIAGAERRHHCETIKRGMSPAIISESDEEECESHGRSPISQKFQQSPMTI 240

Query: 241 PGSAKRGRLGHKQNVSGFAFCLSPLVRASPNRNWNQKVTPPENSFSGNLRVPAKPHLCAN 300
           PGSAKR +LGHKQNVSGFAFCLSPLVRASPNRNWNQKV  PE SFSGNLRVP KPHLCAN
Sbjct: 241 PGSAKREKLGHKQNVSGFAFCLSPLVRASPNRNWNQKVMSPEVSFSGNLRVPGKPHLCAN 300

Query: 301 RSRKIADFGRVNHNR 314
           RSRKIADFGRVNHNR
Sbjct: 301 RSRKIADFGRVNHNR 315

BLAST of Tan0010199 vs. NCBI nr
Match: XP_023539051.1 (uncharacterized protein LOC111799804 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 511.9 bits (1317), Expect = 3.8e-141
Identity = 264/315 (83.81%), Positives = 282/315 (89.52%), Query Frame = 0

Query: 1   MRCKKHQFDFSSSIGACASCLRERLFSIIAAQAQAEKQSELGSA-AGDIHAADDTPL-PP 60
           MRCKKHQFD SSSIG CASCLRERLF IIAAQAQAEKQSE+ +A +GD  AA+D PL PP
Sbjct: 1   MRCKKHQFDCSSSIGVCASCLRERLFLIIAAQAQAEKQSEIRAAVSGDNRAAEDRPLPPP 60

Query: 61  PLVFLRSVSPHDRRPKSDEDLWSDLDRDGNRRHRHQRFYSTPQIGPNCITNNSANTTFVS 120
           PLVFLRSVSP+D R KSDE L SDLDR+GNRR+RHQRFYSTPQIGP+  TNN+AN+TFV+
Sbjct: 61  PLVFLRSVSPYDHRLKSDEGLLSDLDREGNRRNRHQRFYSTPQIGPDYRTNNTANSTFVT 120

Query: 121 TGSFDRKPRRKFSLWSKLFRSRSEKFEKNHKSPSRESNGPDSSSLSPSWFSTIFNGRRIK 180
           TGSFDRK R KFSLWSKLF+SRSEKFEK HKSPS ES+  DS+SLSPSWFSTI NGRRIK
Sbjct: 121 TGSFDRKKRSKFSLWSKLFKSRSEKFEKKHKSPSHESHASDSASLSPSWFSTILNGRRIK 180

Query: 181 QQSSLATVEESIAGAERRHHCQTIERGMSPVRVSDSDEEECEGPDRSPISQKFQQSPMAA 240
           QQSSLATVEESIAGAERRHHCQTI+RGMSP  +S+SDEEECE   RSPISQKFQQSPM  
Sbjct: 181 QQSSLATVEESIAGAERRHHCQTIQRGMSPAIISESDEEECESHGRSPISQKFQQSPMTI 240

Query: 241 PGSAKRGRLGHKQNVSGFAFCLSPLVRASPNRNWNQKVTPPENSFSGNLRVPAKPHLCAN 300
           PGSAKR +LGHKQNVSGFAFCLSPLVRASPNRNWNQKV PPE SFSGNLRVP KPHLCAN
Sbjct: 241 PGSAKREKLGHKQNVSGFAFCLSPLVRASPNRNWNQKVMPPEVSFSGNLRVPGKPHLCAN 300

Query: 301 RSRKIADFGRVNHNR 314
           RSRKIADFGRVNHNR
Sbjct: 301 RSRKIADFGRVNHNR 315

BLAST of Tan0010199 vs. NCBI nr
Match: XP_023539052.1 (uncharacterized protein LOC111799804 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 508.8 bits (1309), Expect = 3.3e-140
Identity = 263/315 (83.49%), Positives = 281/315 (89.21%), Query Frame = 0

Query: 1   MRCKKHQFDFSSSIGACASCLRERLFSIIAAQAQAEKQSELGSA-AGDIHAADDTPL-PP 60
           MRCKKHQFD SSSIG CASCLRERLF IIAAQAQAEKQSE+ +A +GD  AA+D PL PP
Sbjct: 1   MRCKKHQFDCSSSIGVCASCLRERLFLIIAAQAQAEKQSEIRAAVSGDNRAAEDRPLPPP 60

Query: 61  PLVFLRSVSPHDRRPKSDEDLWSDLDRDGNRRHRHQRFYSTPQIGPNCITNNSANTTFVS 120
           PLVFLRSVSP+D R KSDE L SDLDR+GNRR+RHQRFYSTPQIGP+  TNN+AN+TFV+
Sbjct: 61  PLVFLRSVSPYDHRLKSDEGLLSDLDREGNRRNRHQRFYSTPQIGPDYRTNNTANSTFVT 120

Query: 121 TGSFDRKPRRKFSLWSKLFRSRSEKFEKNHKSPSRESNGPDSSSLSPSWFSTIFNGRRIK 180
           TGSFDRK R KFSLWSKLF+SRSEKFEK HKSPS ES+  DS+SLSPSWFSTI NGRRIK
Sbjct: 121 TGSFDRKKRSKFSLWSKLFKSRSEKFEKKHKSPSHESHASDSASLSPSWFSTILNGRRIK 180

Query: 181 QQSSLATVEESIAGAERRHHCQTIERGMSPVRVSDSDEEECEGPDRSPISQKFQQSPMAA 240
           QQSSLATVEESIAGAERRHHCQT +RGMSP  +S+SDEEECE   RSPISQKFQQSPM  
Sbjct: 181 QQSSLATVEESIAGAERRHHCQTNKRGMSPAIISESDEEECESHGRSPISQKFQQSPMTI 240

Query: 241 PGSAKRGRLGHKQNVSGFAFCLSPLVRASPNRNWNQKVTPPENSFSGNLRVPAKPHLCAN 300
           PGSAKR +LGHKQNVSGFAFCLSPLVRASPNRNWNQKV PPE SFSGNLRVP KPHLCAN
Sbjct: 241 PGSAKREKLGHKQNVSGFAFCLSPLVRASPNRNWNQKVMPPEVSFSGNLRVPGKPHLCAN 300

Query: 301 RSRKIADFGRVNHNR 314
           RSRKIADFGRVNHNR
Sbjct: 301 RSRKIADFGRVNHNR 315

BLAST of Tan0010199 vs. NCBI nr
Match: XP_022942680.1 (uncharacterized protein LOC111447641 [Cucurbita moschata])

HSP 1 Score: 506.9 bits (1304), Expect = 1.2e-139
Identity = 265/315 (84.13%), Positives = 279/315 (88.57%), Query Frame = 0

Query: 1   MRCKKHQFDFSSSIGACASCLRERLFSIIAAQAQAEKQSEL-GSAAGDIHAADDTPL-PP 60
           MRCKKHQFD SSSIG CASCLRERLF IIAAQAQAEKQSE+  +AAGD  AADD PL PP
Sbjct: 1   MRCKKHQFDCSSSIGVCASCLRERLFLIIAAQAQAEKQSEIRAAAAGDNRAADDRPLPPP 60

Query: 61  PLVFLRSVSPHDRRPKSDEDLWSDLDRDGNRRHRHQRFYSTPQIGPNCITNNSANTTFVS 120
           PLVFLRSVSP D R KSDE L SDLDR+GN   RHQRFYSTPQIGP+  TNN+AN+TFV+
Sbjct: 61  PLVFLRSVSPFDHRLKSDEGLLSDLDREGN---RHQRFYSTPQIGPDYRTNNTANSTFVT 120

Query: 121 TGSFDRKPRRKFSLWSKLFRSRSEKFEKNHKSPSRESNGPDSSSLSPSWFSTIFNGRRIK 180
           TGSFDRK R KFSLWSKLF+SRSEKFEK HKSPS ES+G DS+SLSPSWFSTI NGRRIK
Sbjct: 121 TGSFDRKKRSKFSLWSKLFKSRSEKFEKKHKSPSHESHGSDSASLSPSWFSTILNGRRIK 180

Query: 181 QQSSLATVEESIAGAERRHHCQTIERGMSPVRVSDSDEEECEGPDRSPISQKFQQSPMAA 240
           QQSSLATVEESIAGAERRHHCQTI+RGMSP  +S+SDEEECE   RSPISQKFQQSPM  
Sbjct: 181 QQSSLATVEESIAGAERRHHCQTIKRGMSPAIISESDEEECESHGRSPISQKFQQSPMTI 240

Query: 241 PGSAKRGRLGHKQNVSGFAFCLSPLVRASPNRNWNQKVTPPENSFSGNLRVPAKPHLCAN 300
           PGSAKR +LGHKQNVSGFAFCLSPLVRASPNRNWNQKV PPE SFSGNLRVP KPHLCAN
Sbjct: 241 PGSAKREKLGHKQNVSGFAFCLSPLVRASPNRNWNQKVIPPEVSFSGNLRVPGKPHLCAN 300

Query: 301 RSRKIADFGRVNHNR 314
           RSRKIADFGRVNHNR
Sbjct: 301 RSRKIADFGRVNHNR 312

BLAST of Tan0010199 vs. ExPASy TrEMBL
Match: A0A6J1IG75 (uncharacterized protein LOC111473474 OS=Cucurbita maxima OX=3661 GN=LOC111474358 PE=4 SV=1)

HSP 1 Score: 513.1 bits (1320), Expect = 8.4e-142
Identity = 265/315 (84.13%), Positives = 282/315 (89.52%), Query Frame = 0

Query: 1   MRCKKHQFDFSSSIGACASCLRERLFSIIAAQAQAEKQSEL-GSAAGDIHAADDTPL-PP 60
           MRCKKHQFD SSSIG CASCLRERLF IIAAQAQAEKQSE+  +AAGD  AADD PL PP
Sbjct: 1   MRCKKHQFDCSSSIGVCASCLRERLFLIIAAQAQAEKQSEIRAAAAGDNRAADDRPLPPP 60

Query: 61  PLVFLRSVSPHDRRPKSDEDLWSDLDRDGNRRHRHQRFYSTPQIGPNCITNNSANTTFVS 120
           PLVFLRSVSP+D R KSDE L SDLDR+GNRR+RHQRFYSTPQIGP+  TNN+AN+TFV+
Sbjct: 61  PLVFLRSVSPYDHRLKSDEGLLSDLDREGNRRNRHQRFYSTPQIGPDYRTNNTANSTFVT 120

Query: 121 TGSFDRKPRRKFSLWSKLFRSRSEKFEKNHKSPSRESNGPDSSSLSPSWFSTIFNGRRIK 180
           TGSFDRK R KFSLWSKLF+SRSEKFEK HKSPS ES+G DS+SLSPSWFSTI NGRRIK
Sbjct: 121 TGSFDRKKRSKFSLWSKLFKSRSEKFEKKHKSPSHESHGSDSASLSPSWFSTILNGRRIK 180

Query: 181 QQSSLATVEESIAGAERRHHCQTIERGMSPVRVSDSDEEECEGPDRSPISQKFQQSPMAA 240
           QQSSLATVEESIAGAERRHHC+TI+RGMSP  +S+SDEEECE   RSPISQKFQQSPM  
Sbjct: 181 QQSSLATVEESIAGAERRHHCETIKRGMSPAIISESDEEECESHGRSPISQKFQQSPMTI 240

Query: 241 PGSAKRGRLGHKQNVSGFAFCLSPLVRASPNRNWNQKVTPPENSFSGNLRVPAKPHLCAN 300
           PGSAKR +LGHKQNVSGFAFCLSPLVRASPNRNWNQKV  PE SFSGNLRVP KPHLCAN
Sbjct: 241 PGSAKREKLGHKQNVSGFAFCLSPLVRASPNRNWNQKVMSPEVSFSGNLRVPGKPHLCAN 300

Query: 301 RSRKIADFGRVNHNR 314
           RSRKIADFGRVNHNR
Sbjct: 301 RSRKIADFGRVNHNR 315

BLAST of Tan0010199 vs. ExPASy TrEMBL
Match: A0A6J1FWR4 (uncharacterized protein LOC111447641 OS=Cucurbita moschata OX=3662 GN=LOC111447641 PE=4 SV=1)

HSP 1 Score: 506.9 bits (1304), Expect = 6.0e-140
Identity = 265/315 (84.13%), Positives = 279/315 (88.57%), Query Frame = 0

Query: 1   MRCKKHQFDFSSSIGACASCLRERLFSIIAAQAQAEKQSEL-GSAAGDIHAADDTPL-PP 60
           MRCKKHQFD SSSIG CASCLRERLF IIAAQAQAEKQSE+  +AAGD  AADD PL PP
Sbjct: 1   MRCKKHQFDCSSSIGVCASCLRERLFLIIAAQAQAEKQSEIRAAAAGDNRAADDRPLPPP 60

Query: 61  PLVFLRSVSPHDRRPKSDEDLWSDLDRDGNRRHRHQRFYSTPQIGPNCITNNSANTTFVS 120
           PLVFLRSVSP D R KSDE L SDLDR+GN   RHQRFYSTPQIGP+  TNN+AN+TFV+
Sbjct: 61  PLVFLRSVSPFDHRLKSDEGLLSDLDREGN---RHQRFYSTPQIGPDYRTNNTANSTFVT 120

Query: 121 TGSFDRKPRRKFSLWSKLFRSRSEKFEKNHKSPSRESNGPDSSSLSPSWFSTIFNGRRIK 180
           TGSFDRK R KFSLWSKLF+SRSEKFEK HKSPS ES+G DS+SLSPSWFSTI NGRRIK
Sbjct: 121 TGSFDRKKRSKFSLWSKLFKSRSEKFEKKHKSPSHESHGSDSASLSPSWFSTILNGRRIK 180

Query: 181 QQSSLATVEESIAGAERRHHCQTIERGMSPVRVSDSDEEECEGPDRSPISQKFQQSPMAA 240
           QQSSLATVEESIAGAERRHHCQTI+RGMSP  +S+SDEEECE   RSPISQKFQQSPM  
Sbjct: 181 QQSSLATVEESIAGAERRHHCQTIKRGMSPAIISESDEEECESHGRSPISQKFQQSPMTI 240

Query: 241 PGSAKRGRLGHKQNVSGFAFCLSPLVRASPNRNWNQKVTPPENSFSGNLRVPAKPHLCAN 300
           PGSAKR +LGHKQNVSGFAFCLSPLVRASPNRNWNQKV PPE SFSGNLRVP KPHLCAN
Sbjct: 241 PGSAKREKLGHKQNVSGFAFCLSPLVRASPNRNWNQKVIPPEVSFSGNLRVPGKPHLCAN 300

Query: 301 RSRKIADFGRVNHNR 314
           RSRKIADFGRVNHNR
Sbjct: 301 RSRKIADFGRVNHNR 312

BLAST of Tan0010199 vs. ExPASy TrEMBL
Match: A0A5D3D3J1 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold434G003230 PE=4 SV=1)

HSP 1 Score: 496.9 bits (1278), Expect = 6.2e-137
Identity = 263/315 (83.49%), Positives = 281/315 (89.21%), Query Frame = 0

Query: 1   MRCKKHQFDFSSSIGACASCLRERLFSIIAAQAQAEK-QSELGSAAGDIHAADDTPLPPP 60
           MRCKKH  DF+S++G CASCLRERL SIIAAQAQAEK QS+L S  G I +ADD PLPPP
Sbjct: 1   MRCKKHHSDFTSTVGVCASCLRERLLSIIAAQAQAEKNQSQLTS--GGIRSADD-PLPPP 60

Query: 61  LVFLRSVSPHDRRPKSDEDLWSDLDRDGNRRHRHQRFYSTPQIGPNCITNNSANTTFVST 120
           L+FL SVSPH    KS+EDLW++LDR+GN R RHQRFYSTPQIGPNC TNNSANTTFV+T
Sbjct: 61  LLFLHSVSPH--ATKSNEDLWTNLDREGNLRSRHQRFYSTPQIGPNCRTNNSANTTFVTT 120

Query: 121 GSFDRKPR-RKFSLWSKLFRSRSEKFEKNHKSPSRESNGPDSSSLSPSWFSTIFNGRRIK 180
           GSFDRK R +KFSLWSKLFRSRSEKFEKNHKSPSRES+GP SSS SPSWFSTIF+GRRIK
Sbjct: 121 GSFDRKQRSKKFSLWSKLFRSRSEKFEKNHKSPSRESHGPGSSSSSPSWFSTIFHGRRIK 180

Query: 181 QQSSLATVEESIAGAERRHHCQTIERGMSPVRVSDSDEEECEGPDRSPISQKFQQSPMAA 240
           +QSSL  VEESI  AERR HCQ IERGMSPVRVSDSD EECEGPDRSPISQKFQ SPMAA
Sbjct: 181 RQSSLTPVEESIPVAERR-HCQAIERGMSPVRVSDSD-EECEGPDRSPISQKFQLSPMAA 240

Query: 241 PGSAKRGRLGHKQNVSGFAFCLSPLVRASPNRNWNQKVTPPENSFSGNLRVPAKPHLCAN 300
           PGSAKRGRLGH QNVSGFAFCLSPLVRASPNRNWNQKV PPE +FSGN++VPAKPHLCAN
Sbjct: 241 PGSAKRGRLGHNQNVSGFAFCLSPLVRASPNRNWNQKVIPPETAFSGNIKVPAKPHLCAN 300

Query: 301 RSRKIADFGRVNHNR 314
           RSRKIADFGRVNHNR
Sbjct: 301 RSRKIADFGRVNHNR 308

BLAST of Tan0010199 vs. ExPASy TrEMBL
Match: A0A1S3AVZ2 (uncharacterized protein LOC103483430 OS=Cucumis melo OX=3656 GN=LOC103483430 PE=4 SV=1)

HSP 1 Score: 496.9 bits (1278), Expect = 6.2e-137
Identity = 263/315 (83.49%), Positives = 281/315 (89.21%), Query Frame = 0

Query: 1   MRCKKHQFDFSSSIGACASCLRERLFSIIAAQAQAEK-QSELGSAAGDIHAADDTPLPPP 60
           MRCKKH  DF+S++G CASCLRERL SIIAAQAQAEK QS+L S  G I +ADD PLPPP
Sbjct: 1   MRCKKHHSDFTSTVGVCASCLRERLLSIIAAQAQAEKNQSQLTS--GGIRSADD-PLPPP 60

Query: 61  LVFLRSVSPHDRRPKSDEDLWSDLDRDGNRRHRHQRFYSTPQIGPNCITNNSANTTFVST 120
           L+FL SVSPH    KS+EDLW++LDR+GN R RHQRFYSTPQIGPNC TNNSANTTFV+T
Sbjct: 61  LLFLHSVSPH--ATKSNEDLWTNLDREGNLRSRHQRFYSTPQIGPNCRTNNSANTTFVTT 120

Query: 121 GSFDRKPR-RKFSLWSKLFRSRSEKFEKNHKSPSRESNGPDSSSLSPSWFSTIFNGRRIK 180
           GSFDRK R +KFSLWSKLFRSRSEKFEKNHKSPSRES+GP SSS SPSWFSTIF+GRRIK
Sbjct: 121 GSFDRKQRSKKFSLWSKLFRSRSEKFEKNHKSPSRESHGPGSSSSSPSWFSTIFHGRRIK 180

Query: 181 QQSSLATVEESIAGAERRHHCQTIERGMSPVRVSDSDEEECEGPDRSPISQKFQQSPMAA 240
           +QSSL  VEESI  AERR HCQ IERGMSPVRVSDSD EECEGPDRSPISQKFQ SPMAA
Sbjct: 181 RQSSLTPVEESIPVAERR-HCQAIERGMSPVRVSDSD-EECEGPDRSPISQKFQLSPMAA 240

Query: 241 PGSAKRGRLGHKQNVSGFAFCLSPLVRASPNRNWNQKVTPPENSFSGNLRVPAKPHLCAN 300
           PGSAKRGRLGH QNVSGFAFCLSPLVRASPNRNWNQKV PPE +FSGN++VPAKPHLCAN
Sbjct: 241 PGSAKRGRLGHNQNVSGFAFCLSPLVRASPNRNWNQKVIPPETAFSGNIKVPAKPHLCAN 300

Query: 301 RSRKIADFGRVNHNR 314
           RSRKIADFGRVNHNR
Sbjct: 301 RSRKIADFGRVNHNR 308

BLAST of Tan0010199 vs. ExPASy TrEMBL
Match: A0A0A0L4I6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G128830 PE=4 SV=1)

HSP 1 Score: 484.2 bits (1245), Expect = 4.2e-133
Identity = 256/315 (81.27%), Positives = 277/315 (87.94%), Query Frame = 0

Query: 1   MRCKKHQFDFSSSIGACASCLRERLFSIIAAQAQAEK-QSELGSAAGDIHAADDTPLPPP 60
           MRCKKH  DF+S++G CASCLRERL SIIAAQAQAEK QS+L    G I +ADD PLPPP
Sbjct: 1   MRCKKHHSDFTSTVGVCASCLRERLLSIIAAQAQAEKNQSQL--TYGGIRSADD-PLPPP 60

Query: 61  LVFLRSVSPHDRRPKSDEDLWSDLDRDGNRRHRHQRFYSTPQIGPNCITNNSANTTFVST 120
           L+F+ SVSPH    KSDEDLWS+LDR+GNRR  HQRFYSTPQIGPN  TNNSANTTFV+T
Sbjct: 61  LLFIHSVSPH--ATKSDEDLWSNLDREGNRRFLHQRFYSTPQIGPNGRTNNSANTTFVTT 120

Query: 121 GSFDRKPR-RKFSLWSKLFRSRSEKFEKNHKSPSRESNGPDSSSLSPSWFSTIFNGRRIK 180
           GSFDRK R +KFSLWSKLFRSRS+KFEKNHKSPSRES+GP SSS SPSWFSTIF+G R K
Sbjct: 121 GSFDRKQRSKKFSLWSKLFRSRSDKFEKNHKSPSRESHGPGSSSSSPSWFSTIFHGHRTK 180

Query: 181 QQSSLATVEESIAGAERRHHCQTIERGMSPVRVSDSDEEECEGPDRSPISQKFQQSPMAA 240
           +QSSL+ VEESI+ AERR HC  IERGMSPVRVSDSD EECEGPDRSPISQKFQ SPMAA
Sbjct: 181 RQSSLSPVEESISVAERR-HCHAIERGMSPVRVSDSD-EECEGPDRSPISQKFQLSPMAA 240

Query: 241 PGSAKRGRLGHKQNVSGFAFCLSPLVRASPNRNWNQKVTPPENSFSGNLRVPAKPHLCAN 300
           PGSAKRGRLGH QNVSGFAFCLSPL+RASPNRNWNQK  PPE +FSGN++VPAKPHLCAN
Sbjct: 241 PGSAKRGRLGHNQNVSGFAFCLSPLMRASPNRNWNQKAIPPETAFSGNIKVPAKPHLCAN 300

Query: 301 RSRKIADFGRVNHNR 314
           RSRKIADFGRVNHNR
Sbjct: 301 RSRKIADFGRVNHNR 308

BLAST of Tan0010199 vs. TAIR 10
Match: AT2G44600.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G60200.1); Has 56 Blast hits to 55 proteins in 11 species: Archae - 0; Bacteria - 0; Metazoa - 2; Fungi - 0; Plants - 52; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 177.2 bits (448), Expect = 2.1e-44
Identity = 128/331 (38.67%), Positives = 177/331 (53.47%), Query Frame = 0

Query: 1   MRCKKHQFDFSSSIGACASCLRERLFSIIAAQAQAEKQSELGSAAGDIHAADDTPLPPPL 60
           MRCK+H  DFSSSIG CASCLRERLF++  + A +E          D         PPPL
Sbjct: 1   MRCKRHTVDFSSSIGVCASCLRERLFTLAVSTAASEND--------DNDHRHSRISPPPL 60

Query: 61  VFLRSVSPHDRRPKSDEDLWSDLDRDGNRRHRHQRFYSTPQIGPNCITNNSANTTFVSTG 120
           +F RSVSP+    KSD    +      +    + RF++TPQ+    + ++S+   F S  
Sbjct: 61  LFPRSVSPYVAPRKSDAGTGTGAGGRDSVASSNNRFFATPQV---VVQSSSSEKVFESDR 120

Query: 121 SFDRKPRRKFSLWSKLFRSRSEKFEKNHKS--PSRESNGPDSSSLSPSWFSTIFNGRRIK 180
           SF +K +   S +S  FR+RS+ ++    S   S   + P S++ S SWFS + + R  K
Sbjct: 121 SF-KKKKSGLSRFSSFFRTRSDDYDSRRDSCDASTVFSQPSSATTSRSWFSKVISVRSKK 180

Query: 181 QQSSLATVEESIAGAERRHH-------CQTIERGMSPVRVSDSDEEECEGPDRSPISQKF 240
           Q ++     E +  +E  HH        Q   RGMSP   S +++E  E    SP   + 
Sbjct: 181 QSTTNTCYIEDLIASESDHHHHNQNRPRQRYCRGMSPAGDSTTNDESVE---ESP--GRL 240

Query: 241 QQSP-MAAPGSAKRGRLGHKQNVSGFAFCLSPLVRASPN--RNWNQKVTPPENSFSGNLR 300
           +++P M  PG  K   +G  ++VSG AFCLSPLVRA PN   NW  K  PP+  +SG L+
Sbjct: 241 RRTPVMGTPGRKKTATIGIGRSVSGMAFCLSPLVRAKPNCSSNWKAKF-PPDFGYSGELK 300

Query: 301 VPAKPHL------CANRSRKIADFGRVNHNR 314
            PAKPHL      C NRS+K+ D GRV+H R
Sbjct: 301 SPAKPHLSTAASFCGNRSKKLVDLGRVDHRR 313

BLAST of Tan0010199 vs. TAIR 10
Match: AT3G60200.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G44600.1); Has 60 Blast hits to 60 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 8; Fungi - 0; Plants - 51; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 154.1 bits (388), Expect = 1.9e-37
Identity = 130/333 (39.04%), Positives = 179/333 (53.75%), Query Frame = 0

Query: 1   MRCKKHQFDFSSSIGACASCLRERLFSIIAAQAQAEKQSELGSAAGDIHAADDTPLPPPL 60
           MRCK+H  D SSS G CASCLRERL S+ A+ A +    +  S       +++   PP L
Sbjct: 1   MRCKRHTVDLSSSNGVCASCLRERLLSLAASAAVSAAVEDNQS-----KKSNNNNHPPLL 60

Query: 61  VFLRSVSPHDRRPKSDEDLWSDLDRDGNRRHRHQRFYSTPQI---GPNCITNNSANTTFV 120
           +F RSVSP+  R KSD           + R    RF +TPQI   G +C         F 
Sbjct: 61  IFPRSVSPYVTRRKSDAGAGGGDPLVSSNR----RFITTPQIDLVGYSC-------KDFE 120

Query: 121 STGSFDRKPRRKFSLWSKLFRSRSEKFEKNHKSPSRESNGPD------SSSLSPSWFSTI 180
           S  S   K  +K S +S LFR+RSE F+ N KS +   +  D      SSS S SW STI
Sbjct: 121 SNRSNKSKQGKKVSRFSNLFRARSEDFDTNPKSNNPRFSSCDASEISSSSSSSRSWISTI 180

Query: 181 FN-GRRIKQQSSLATVEESIAGAE-RRHHCQTIERGMSPVRVSDSDEEECEGPDRSPIS- 240
            + GRR KQ ++   +E+ IA    +R +C    RGMSPVR ++        P++S  S 
Sbjct: 181 LSTGRRKKQPTTACYIEDVIAARRPQRIYC----RGMSPVRDTE--------PEQSAESI 240

Query: 241 QKFQQSPMAAPGSAKRGRLGHKQNVSGFAFCLSPLVRASPNRNWNQKVT-PPENSFSGNL 300
           ++ +++P       ++  +G  +++SG AFCLSPLVRASPN  + +K+  P E   SG +
Sbjct: 241 EELRRTPATKTPGRRKIAMGIGKSMSGMAFCLSPLVRASPNCPFKRKMRFPSEFGNSGEV 300

Query: 301 -RVPAKPHL------CANRSRKIADFGRVNHNR 314
             VP KPH+      CANRS+K+ D GRV+  R
Sbjct: 301 TAVPEKPHISAAASFCANRSKKLVDLGRVDRRR 305

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name      E-value   Identity (%)  Description
KAG6597247.1    1.6e-142  84.76         hypothetical protein SDJN03_10427, partial [Cucurbita argyrosperma subsp. sorori...
XP_022974739.1  1.7e-141  84.13         uncharacterized protein LOC111473474 [Cucurbita maxima] >XP_022975245.1 uncharac...
XP_023539051.1  3.8e-141  83.81         uncharacterized protein LOC111799804 isoform X1 [Cucurbita pepo subsp. pepo]
XP_023539052.1  3.3e-140  83.49         uncharacterized protein LOC111799804 isoform X2 [Cucurbita pepo subsp. pepo]
XP_022942680.1  1.2e-139  84.13         uncharacterized protein LOC111447641 [Cucurbita moschata]

vs. ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A6J1IG75      8.4e-142  84.13         uncharacterized protein LOC111473474 OS=Cucurbita maxima OX=3661 GN=LOC111474358...
A0A6J1FWR4      6.0e-140  84.13         uncharacterized protein LOC111447641 OS=Cucurbita moschata OX=3662 GN=LOC1114476...
A0A5D3D3J1      6.2e-137  83.49         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3AVZ2      6.2e-137  83.49         uncharacterized protein LOC103483430 OS=Cucumis melo OX=3656 GN=LOC103483430 PE=...
A0A0A0L4I6      4.2e-133  81.27         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G128830 PE=4 SV=1

vs. TAIR 10
Match Name      E-value   Identity (%)  Description
AT2G44600.1     2.1e-44   38.67         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G60200.1     1.9e-37   39.04         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term  Source Description   Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 208..247
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 67..85
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 208..224
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 143..165
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 63..85
None      No IPR available  PANTHER      PTHR35486    EXPRESSED PROTEIN    coord: 1..313
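
Several of the mobidb-lite disorder_prediction coordinates listed above overlap (for example 63..85 and 67..85). A minimal sketch of collapsing them into non-overlapping regions, assuming 1-based inclusive residue coordinates; the function name `merge_intervals` is hypothetical, and treating directly adjacent intervals as one region is a design choice made here, not something the page specifies.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive (start, end) pairs."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous region: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# mobidb-lite coordinates copied from the InterPro table
disorder = [(208, 247), (67, 85), (208, 224), (143, 165), (63, 85)]
regions = merge_intervals(disorder)
total_residues = sum(end - start + 1 for start, end in regions)
print(regions)         # [(63, 85), (143, 165), (208, 247)]
print(total_residues)  # 86
```

Under these assumptions the five predictions reduce to three regions covering 86 residues in total.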

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0010199.1  Tan0010199.1  mRNA