Tan0007671 (gene) Snake gourd v1

Overview
NameTan0007671
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionMicronuclear linker histone polyprotein-like protein
LocationLG04: 8183156 .. 8184316 (-)
RNA-Seq ExpressionTan0007671
SyntenyTan0007671
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGAGAGAGAGGCAAAGCTCTGGAAGTTTACAGCAACGACATGGACTTCTATTCCTCTGCTTCTGAGTTCCCCTGTAAGAAACACCCTTCTTCTTCCTCTGTCGGGGTCTGTGCTGATTGTTTGAAAGATCGGTTGATCAAACTTGTTTGTTCTGACTGTGGTGAGCAGAGGCTCTCCTCCTGCTCCTGCTCTGAGATCTCATCTAATCGAAATTCTTGTACACTGGAAGTGGGAAGTGTTGGGAGAGTTTCGTTTTTGATAGAGAATGAAAGAAATGGCGTTTCGTTGTTGGGTCCGATGAAGCCCAAGATGGAGAAAAGGGAGGAAGTTGTGCTTTTGGAGAGAAGCAGTAGTAGCTGTGTGGAGATTAAGAAGAGTGGGTTTTGGAGGATCGGGAAATTTTTCAGGAAGAAGAGGGAGAAGGGATGTGAGAGATCAAGCGTTTGTGGGTTTGATGATAAGAGTGATATTTGTATGGTTGATTACATGGGTGTGTCGAGGTCGAGATCTCTCTGTAGTTTTCGTGGTGGTGGGTTTTTCGGGTCGGAGGACGGTGGAGACATGGTGGTTTCCGGCGGCCGGAGCTCAATCTCCGGTGCCAGAACTTCGAGTGTCAATGGAGGACTCGTCTGTGATTCCGCCAGAAGAAGTGGTTTCAGCGAAACAGAGCCTAGAAAAAGTGGTTTTGAAAGTGATCACAGAGAATGTGGGAATTATGAAAGCGATCATAATGGGTTTAGTTTAGCAAATAGACGTGTCTTTTCTTTGAAAGAGAGCGATTTCAATGGAATGGACGAATCTGGGTTCATAGATTTCAAGTTGGATTTCATATCAGAAACGAAGGCAGAATTCTCTGTTCCCAAAATGGGTATGGGACTGGGTTTAGGCCCTCTCTCAACCCCCAACTCTGCATTTGGGAGCACGAGAGCTTTTGACACGGCGGCGGCGCATGAATACGGCAGAGGGCTGTACGGTGGCACCGCCGGCGAGGGGATCATCGGCAGCGGCGGAGGGTCCTGTAGGATAACAGTCAGTGACAGGGGGATAAAGAAGGGGAGGAAAAGCTTGAAAGCATGGAAATGGATATTCAAGCATCCACCAAACTGGGCAAATGCAACTGGGAGGAAGAAAGAAGAAGATTTAATGACTAAAACTTGA

mRNA sequence

ATGAGAGAGAGAGGCAAAGCTCTGGAAGTTTACAGCAACGACATGGACTTCTATTCCTCTGCTTCTGAGTTCCCCTGTAAGAAACACCCTTCTTCTTCCTCTGTCGGGGTCTGTGCTGATTGTTTGAAAGATCGGTTGATCAAACTTGTTTGTTCTGACTGTGGTGAGCAGAGGCTCTCCTCCTGCTCCTGCTCTGAGATCTCATCTAATCGAAATTCTTGTACACTGGAAGTGGGAAGTGTTGGGAGAGTTTCGTTTTTGATAGAGAATGAAAGAAATGGCGTTTCGTTGTTGGGTCCGATGAAGCCCAAGATGGAGAAAAGGGAGGAAGTTGTGCTTTTGGAGAGAAGCAGTAGTAGCTGTGTGGAGATTAAGAAGAGTGGGTTTTGGAGGATCGGGAAATTTTTCAGGAAGAAGAGGGAGAAGGGATGTGAGAGATCAAGCGTTTGTGGGTTTGATGATAAGAGTGATATTTGTATGGTTGATTACATGGGTGTGTCGAGGTCGAGATCTCTCTGTAGTTTTCGTGGTGGTGGGTTTTTCGGGTCGGAGGACGGTGGAGACATGGTGGTTTCCGGCGGCCGGAGCTCAATCTCCGGTGCCAGAACTTCGAGTGTCAATGGAGGACTCGTCTGTGATTCCGCCAGAAGAAGTGGTTTCAGCGAAACAGAGCCTAGAAAAAGTGGTTTTGAAAGTGATCACAGAGAATGTGGGAATTATGAAAGCGATCATAATGGGTTTAGTTTAGCAAATAGACGTGTCTTTTCTTTGAAAGAGAGCGATTTCAATGGAATGGACGAATCTGGGTTCATAGATTTCAAGTTGGATTTCATATCAGAAACGAAGGCAGAATTCTCTGTTCCCAAAATGGGTATGGGACTGGGTTTAGGCCCTCTCTCAACCCCCAACTCTGCATTTGGGAGCACGAGAGCTTTTGACACGGCGGCGGCGCATGAATACGGCAGAGGGCTGTACGGTGGCACCGCCGGCGAGGGGATCATCGGCAGCGGCGGAGGGTCCTGTAGGATAACAGTCAGTGACAGGGGGATAAAGAAGGGGAGGAAAAGCTTGAAAGCATGGAAATGGATATTCAAGCATCCACCAAACTGGGCAAATGCAACTGGGAGGAAGAAAGAAGAAGATTTAATGACTAAAACTTGA

Coding sequence (CDS)

ATGAGAGAGAGAGGCAAAGCTCTGGAAGTTTACAGCAACGACATGGACTTCTATTCCTCTGCTTCTGAGTTCCCCTGTAAGAAACACCCTTCTTCTTCCTCTGTCGGGGTCTGTGCTGATTGTTTGAAAGATCGGTTGATCAAACTTGTTTGTTCTGACTGTGGTGAGCAGAGGCTCTCCTCCTGCTCCTGCTCTGAGATCTCATCTAATCGAAATTCTTGTACACTGGAAGTGGGAAGTGTTGGGAGAGTTTCGTTTTTGATAGAGAATGAAAGAAATGGCGTTTCGTTGTTGGGTCCGATGAAGCCCAAGATGGAGAAAAGGGAGGAAGTTGTGCTTTTGGAGAGAAGCAGTAGTAGCTGTGTGGAGATTAAGAAGAGTGGGTTTTGGAGGATCGGGAAATTTTTCAGGAAGAAGAGGGAGAAGGGATGTGAGAGATCAAGCGTTTGTGGGTTTGATGATAAGAGTGATATTTGTATGGTTGATTACATGGGTGTGTCGAGGTCGAGATCTCTCTGTAGTTTTCGTGGTGGTGGGTTTTTCGGGTCGGAGGACGGTGGAGACATGGTGGTTTCCGGCGGCCGGAGCTCAATCTCCGGTGCCAGAACTTCGAGTGTCAATGGAGGACTCGTCTGTGATTCCGCCAGAAGAAGTGGTTTCAGCGAAACAGAGCCTAGAAAAAGTGGTTTTGAAAGTGATCACAGAGAATGTGGGAATTATGAAAGCGATCATAATGGGTTTAGTTTAGCAAATAGACGTGTCTTTTCTTTGAAAGAGAGCGATTTCAATGGAATGGACGAATCTGGGTTCATAGATTTCAAGTTGGATTTCATATCAGAAACGAAGGCAGAATTCTCTGTTCCCAAAATGGGTATGGGACTGGGTTTAGGCCCTCTCTCAACCCCCAACTCTGCATTTGGGAGCACGAGAGCTTTTGACACGGCGGCGGCGCATGAATACGGCAGAGGGCTGTACGGTGGCACCGCCGGCGAGGGGATCATCGGCAGCGGCGGAGGGTCCTGTAGGATAACAGTCAGTGACAGGGGGATAAAGAAGGGGAGGAAAAGCTTGAAAGCATGGAAATGGATATTCAAGCATCCACCAAACTGGGCAAATGCAACTGGGAGGAAGAAAGAAGAAGATTTAATGACTAAAACTTGA

Protein sequence

MRERGKALEVYSNDMDFYSSASEFPCKKHPSSSSVGVCADCLKDRLIKLVCSDCGEQRLSSCSCSEISSNRNSCTLEVGSVGRVSFLIENERNGVSLLGPMKPKMEKREEVVLLERSSSSCVEIKKSGFWRIGKFFRKKREKGCERSSVCGFDDKSDICMVDYMGVSRSRSLCSFRGGGFFGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRECGNYESDHNGFSLANRRVFSLKESDFNGMDESGFIDFKLDFISETKAEFSVPKMGMGLGLGPLSTPNSAFGSTRAFDTAAAHEYGRGLYGGTAGEGIIGSGGGSCRITVSDRGIKKGRKSLKAWKWIFKHPPNWANATGRKKEEDLMTKT
Homology
BLAST of Tan0007671 vs. NCBI nr
Match: XP_008446515.1 (PREDICTED: uncharacterized protein LOC103489222 [Cucumis melo] >KAA0034504.1 putative lysozyme-like protein [Cucumis melo var. makuwa] >TYK09058.1 putative lysozyme-like protein [Cucumis melo var. makuwa])

HSP 1 Score: 712.2 bits (1837), Expect = 2.4e-201
Identity = 358/386 (92.75%), Postives = 370/386 (95.85%), Query Frame = 0

Query: 1   MRERGKALEVYSNDMDFYSSASEFPCKKHPSSSSVGVCADCLKDRLIKLVCSDCGEQRLS 60
           MRERGKALEVYSNDMDFYSSASEFPCKKHPSSSSVG+CADCLKDRLIKLVCSDCGEQRLS
Sbjct: 1   MRERGKALEVYSNDMDFYSSASEFPCKKHPSSSSVGICADCLKDRLIKLVCSDCGEQRLS 60

Query: 61  SCSCSEISSNRNSCTLEVGSVGRVSFLIENERNGVSLLGPMKPKMEKREEVVLLERSSSS 120
           SCSCSEISSNRNSCT+EVGSVGRVSFLIENERNGVSLLGP+KPK+EKREEVVLLERSSSS
Sbjct: 61  SCSCSEISSNRNSCTVEVGSVGRVSFLIENERNGVSLLGPIKPKLEKREEVVLLERSSSS 120

Query: 121 CVEIKKSGFWRIGKFFRKKREKGCERSSVCGFDDKSDICMVDYMGVSRSRSLCSFRGGGF 180
           CVEIKKSGFWRIGKFFRKKREKGCERSSVCGFD+KSDICMVDYMGVSRSRSLCSFRG GF
Sbjct: 121 CVEIKKSGFWRIGKFFRKKREKGCERSSVCGFDEKSDICMVDYMGVSRSRSLCSFRGNGF 180

Query: 181 FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRECGNY 240
           FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRECGNY
Sbjct: 181 FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRECGNY 240

Query: 241 ESDHNGFSLANRRVFSLKESDFNGMDESGFIDFKLDFISETKAEFSVPKMGMGLGLGPLS 300
           +SDHNGFSLANRRVFSLKESDFNGMDESGFIDFKLDF SE+K +  VPKMG+GL    LS
Sbjct: 241 DSDHNGFSLANRRVFSLKESDFNGMDESGFIDFKLDFTSESKPDIFVPKMGLGL----LS 300

Query: 301 TPNSAFGSTRAFDTAAAHEYGRGLYGGTAGEGIIGSGGGSCRITVSDRGIKKGRKSLKAW 360
            PNSAFGSTRA D  AAHE  RGLYGG AGEGIIGSGGGSCRITVSDRGIKKGRKSLKAW
Sbjct: 301 NPNSAFGSTRALD-MAAHECSRGLYGGAAGEGIIGSGGGSCRITVSDRGIKKGRKSLKAW 360

Query: 361 KWIFKHPPNWANATGRKKEEDLMTKT 387
           KWIFKHPPNWANA+GRKKEE+LM+KT
Sbjct: 361 KWIFKHPPNWANASGRKKEEELMSKT 381

BLAST of Tan0007671 vs. NCBI nr
Match: XP_038892601.1 (uncharacterized protein LOC120081636 [Benincasa hispida])

HSP 1 Score: 712.2 bits (1837), Expect = 2.4e-201
Identity = 360/388 (92.78%), Postives = 372/388 (95.88%), Query Frame = 0

Query: 1   MRERGKALEVYSNDMDFYSSASEFPCKKHPSSSSVGVCADCLKDRLIKLVCSDCGEQRLS 60
           MRERGKALEVYSNDMDFYSSASEFPCKKHPSSSSVG+CADCLKDRLIKLVCSDCGEQRLS
Sbjct: 27  MRERGKALEVYSNDMDFYSSASEFPCKKHPSSSSVGICADCLKDRLIKLVCSDCGEQRLS 86

Query: 61  SCSCSEISSNRNSCTLEVGSVGRVSFLIENERNGVSLLGPMKPKMEKREEVVLLERSSSS 120
           SCSCSEISSNRNSCT+EVGSVGRVSFLIENERNGVSLLGPMKPKMEKREEVVLLERSSSS
Sbjct: 87  SCSCSEISSNRNSCTVEVGSVGRVSFLIENERNGVSLLGPMKPKMEKREEVVLLERSSSS 146

Query: 121 CVEIKKSGFWRIGKFFRKKREKGCERSSVCGFDDKSDICMVDYMGVSRSRSLCSFRGGGF 180
           CVEIKKSGFWRIGKFFRKKREKGCERSSVCGFD+KSDICMVDYMGVSRSRSLCSFRGGG+
Sbjct: 147 CVEIKKSGFWRIGKFFRKKREKGCERSSVCGFDEKSDICMVDYMGVSRSRSLCSFRGGGY 206

Query: 181 FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRECGNY 240
           FGSEDGGDMV SGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRECGNY
Sbjct: 207 FGSEDGGDMVASGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRECGNY 266

Query: 241 ESDHNGFSLANRRVFSLKESDFNGMDESGFIDFKLDFISETKAEFSVPKMGMGLGLGPLS 300
           ESD NGFSLANRRVFSLKESDFNGMDESGFIDFKLDF SE+K++ SVPKMGMG+GLG LS
Sbjct: 267 ESDQNGFSLANRRVFSLKESDFNGMDESGFIDFKLDFTSESKSDISVPKMGMGMGLGLLS 326

Query: 301 TPNSAFGSTRAFDTAAAHEYGRGLYGGTAGEGII--GSGGGSCRITVSDRGIKKGRKSLK 360
            PNSAFGSTRAF+  AAH+  RGLYGG AGEGII  G GGGSCRITVSDRGIKKGRKSLK
Sbjct: 327 NPNSAFGSTRAFE-MAAHDCSRGLYGGAAGEGIIGGGGGGGSCRITVSDRGIKKGRKSLK 386

Query: 361 AWKWIFKHPPNWANATGRKKEEDLMTKT 387
           AWKWIFKHPPNWANA+ RKK EDLM+KT
Sbjct: 387 AWKWIFKHPPNWANASARKK-EDLMSKT 412

BLAST of Tan0007671 vs. NCBI nr
Match: XP_004135128.1 (uncharacterized protein LOC101207638 [Cucumis sativus] >KGN52019.1 hypothetical protein Csa_008787 [Cucumis sativus])

HSP 1 Score: 710.7 bits (1833), Expect = 7.0e-201
Identity = 358/390 (91.79%), Postives = 370/390 (94.87%), Query Frame = 0

Query: 1   MRERGKALEVYSNDMDFYSSASEFPCKKHPSSSSVGVCADCLKDRLIKLVCSDCGEQRLS 60
           MRERGKALEVYSNDMDFYSSASEFPCKKHPSSSSVG+CADCLKDRLIKLVCSDCGEQRLS
Sbjct: 1   MRERGKALEVYSNDMDFYSSASEFPCKKHPSSSSVGICADCLKDRLIKLVCSDCGEQRLS 60

Query: 61  SCSCSEISSNRNSCTLEVGSVGRVSFLIENERNGVSLLGPMKPKMEKREEVVLLERSSSS 120
           SCSCSEISS RNSCT+EVGSVGRVSFLIENERNGVSLLGP+KPK+EKREEVVLLERSSSS
Sbjct: 61  SCSCSEISSKRNSCTVEVGSVGRVSFLIENERNGVSLLGPIKPKIEKREEVVLLERSSSS 120

Query: 121 CVEIKKSGFWRIGKFFRKKREKGCERSSVCGFDDKSDICMVDYMGVSRSRSLCSFRGGGF 180
           CVEIKKSGFWRIGKFFRKKREKGCERSSVCGFD+KSDICMVDYMGVSRSRSLCSFRG GF
Sbjct: 121 CVEIKKSGFWRIGKFFRKKREKGCERSSVCGFDEKSDICMVDYMGVSRSRSLCSFRGNGF 180

Query: 181 FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRECGNY 240
           FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRECGNY
Sbjct: 181 FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRECGNY 240

Query: 241 ESDHNGFSLANRRVFSLKESDFNGMDESGFIDFKLDFISETKAEFSVPKMGMGLGLGPLS 300
           +SDHNGFSLANRRVFSLKESDFNGMDESGFIDFKLDF SE+K + SVPKMG G+GLG LS
Sbjct: 241 DSDHNGFSLANRRVFSLKESDFNGMDESGFIDFKLDFTSESKQDISVPKMGFGMGLGLLS 300

Query: 301 TPNSAFGSTRAFDTAAAHEYGRGLYGGTAGEGIIGS----GGGSCRITVSDRGIKKGRKS 360
            PNS FGSTRAFD  AAHE  RGLY GTAGEGIIG+    GGGSCRITVSDRGIKKGRKS
Sbjct: 301 NPNSTFGSTRAFD-MAAHECSRGLYCGTAGEGIIGNGAGGGGGSCRITVSDRGIKKGRKS 360

Query: 361 LKAWKWIFKHPPNWANATGRKKEEDLMTKT 387
           LKAWKWIFKHPPNW NAT RKKEE+LM+KT
Sbjct: 361 LKAWKWIFKHPPNWTNATARKKEEELMSKT 389

BLAST of Tan0007671 vs. NCBI nr
Match: KAG7032127.1 (hypothetical protein SDJN02_06170, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 692.6 bits (1786), Expect = 2.0e-195
Identity = 349/391 (89.26%), Postives = 365/391 (93.35%), Query Frame = 0

Query: 1   MRERGKALEVYSNDMDFYSSASEFPCKKHPSSSSVGVCADCLKDRLIKLVCSDCGEQRLS 60
           MRERGKAL VYSNDMDFYSSASEFPCKKHPSSSSVG+CADCLKDRLI+LVCSDCGEQRLS
Sbjct: 80  MRERGKALAVYSNDMDFYSSASEFPCKKHPSSSSVGICADCLKDRLIRLVCSDCGEQRLS 139

Query: 61  SCSCSEISSNRNSCTLEVGSVGRVSFLIENERNGVSLLGPMKPKMEKREEVVLLERSSSS 120
           SCSCSEISSNRNSCT+EVGSVGRVSFLIENERNGVSLLGPMKPKMEKREE+VLLERSSSS
Sbjct: 140 SCSCSEISSNRNSCTVEVGSVGRVSFLIENERNGVSLLGPMKPKMEKREEIVLLERSSSS 199

Query: 121 CVEIKKSGFWRIGKFFRKKREKGCERSSVCGFDDKSDICMVDYMGVSRSRSLCSFRGGGF 180
           CVE KKSGFWRIG FFRKKREKGCERSSVCGFD+KSDICMVDYMGVSRSRSLCSFRGGG+
Sbjct: 200 CVESKKSGFWRIGNFFRKKREKGCERSSVCGFDEKSDICMVDYMGVSRSRSLCSFRGGGY 259

Query: 181 FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRECGNY 240
           FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHR+CGNY
Sbjct: 260 FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRDCGNY 319

Query: 241 ESDHNGFSLANRRVFSLKESDFNGMDESGFIDFKLDFISETKAEFSVPKM----GMGLGL 300
           ESDHNG SLA+RRVFSLKESDFNGMDESGFIDFKLDF SETK + SVPKM    G+GLGL
Sbjct: 320 ESDHNGLSLASRRVFSLKESDFNGMDESGFIDFKLDFTSETKPDISVPKMGLGLGLGLGL 379

Query: 301 GPLSTPNSAFGSTRAFDTAAAHEYGRGLYGGTAGEGIIGSGGGSCRITVSDRGIKKGRKS 360
           GP S PNSAFGS+RAFD A        LY G+ GEGIIGSGGGSCRITVSDRGIKKGRKS
Sbjct: 380 GPFSNPNSAFGSSRAFDMA--------LYSGSTGEGIIGSGGGSCRITVSDRGIKKGRKS 439

Query: 361 LKAWKWIFKHPPNWAN-ATGRKKEEDLMTKT 387
           LK+WKW+FKHPPNWAN AT RKKEE+LM+KT
Sbjct: 440 LKSWKWMFKHPPNWANAATSRKKEEELMSKT 462

BLAST of Tan0007671 vs. NCBI nr
Match: XP_022957131.1 (uncharacterized protein LOC111458603 isoform X3 [Cucurbita moschata] >KAG6601341.1 hypothetical protein SDJN03_06574, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 692.6 bits (1786), Expect = 2.0e-195
Identity = 349/391 (89.26%), Postives = 365/391 (93.35%), Query Frame = 0

Query: 1   MRERGKALEVYSNDMDFYSSASEFPCKKHPSSSSVGVCADCLKDRLIKLVCSDCGEQRLS 60
           MRERGKAL VYSNDMDFYSSASEFPCKKHPSSSSVG+CADCLKDRLI+LVCSDCGEQRLS
Sbjct: 1   MRERGKALAVYSNDMDFYSSASEFPCKKHPSSSSVGICADCLKDRLIRLVCSDCGEQRLS 60

Query: 61  SCSCSEISSNRNSCTLEVGSVGRVSFLIENERNGVSLLGPMKPKMEKREEVVLLERSSSS 120
           SCSCSEISSNRNSCT+EVGSVGRVSFLIENERNGVSLLGPMKPKMEKREE+VLLERSSSS
Sbjct: 61  SCSCSEISSNRNSCTVEVGSVGRVSFLIENERNGVSLLGPMKPKMEKREEIVLLERSSSS 120

Query: 121 CVEIKKSGFWRIGKFFRKKREKGCERSSVCGFDDKSDICMVDYMGVSRSRSLCSFRGGGF 180
           CVE KKSGFWRIG FFRKKREKGCERSSVCGFD+KSDICMVDYMGVSRSRSLCSFRGGG+
Sbjct: 121 CVESKKSGFWRIGNFFRKKREKGCERSSVCGFDEKSDICMVDYMGVSRSRSLCSFRGGGY 180

Query: 181 FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRECGNY 240
           FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHR+CGNY
Sbjct: 181 FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRDCGNY 240

Query: 241 ESDHNGFSLANRRVFSLKESDFNGMDESGFIDFKLDFISETKAEFSVPKM----GMGLGL 300
           ESDHNG SLA+RRVFSLKESDFNGMDESGFIDFKLDF SETK + SVPKM    G+GLGL
Sbjct: 241 ESDHNGLSLASRRVFSLKESDFNGMDESGFIDFKLDFTSETKPDISVPKMGLGLGLGLGL 300

Query: 301 GPLSTPNSAFGSTRAFDTAAAHEYGRGLYGGTAGEGIIGSGGGSCRITVSDRGIKKGRKS 360
           GP S PNSAFGS+RAFD A        LY G+ GEGIIGSGGGSCRITVSDRGIKKGRKS
Sbjct: 301 GPFSNPNSAFGSSRAFDMA--------LYSGSTGEGIIGSGGGSCRITVSDRGIKKGRKS 360

Query: 361 LKAWKWIFKHPPNWAN-ATGRKKEEDLMTKT 387
           LK+WKW+FKHPPNWAN AT RKKEE+LM+KT
Sbjct: 361 LKSWKWMFKHPPNWANAATSRKKEEELMSKT 383

BLAST of Tan0007671 vs. ExPASy TrEMBL
Match: A0A5A7STG5 (Putative lysozyme-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold475G00470 PE=4 SV=1)

HSP 1 Score: 712.2 bits (1837), Expect = 1.2e-201
Identity = 358/386 (92.75%), Postives = 370/386 (95.85%), Query Frame = 0

Query: 1   MRERGKALEVYSNDMDFYSSASEFPCKKHPSSSSVGVCADCLKDRLIKLVCSDCGEQRLS 60
           MRERGKALEVYSNDMDFYSSASEFPCKKHPSSSSVG+CADCLKDRLIKLVCSDCGEQRLS
Sbjct: 1   MRERGKALEVYSNDMDFYSSASEFPCKKHPSSSSVGICADCLKDRLIKLVCSDCGEQRLS 60

Query: 61  SCSCSEISSNRNSCTLEVGSVGRVSFLIENERNGVSLLGPMKPKMEKREEVVLLERSSSS 120
           SCSCSEISSNRNSCT+EVGSVGRVSFLIENERNGVSLLGP+KPK+EKREEVVLLERSSSS
Sbjct: 61  SCSCSEISSNRNSCTVEVGSVGRVSFLIENERNGVSLLGPIKPKLEKREEVVLLERSSSS 120

Query: 121 CVEIKKSGFWRIGKFFRKKREKGCERSSVCGFDDKSDICMVDYMGVSRSRSLCSFRGGGF 180
           CVEIKKSGFWRIGKFFRKKREKGCERSSVCGFD+KSDICMVDYMGVSRSRSLCSFRG GF
Sbjct: 121 CVEIKKSGFWRIGKFFRKKREKGCERSSVCGFDEKSDICMVDYMGVSRSRSLCSFRGNGF 180

Query: 181 FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRECGNY 240
           FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRECGNY
Sbjct: 181 FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRECGNY 240

Query: 241 ESDHNGFSLANRRVFSLKESDFNGMDESGFIDFKLDFISETKAEFSVPKMGMGLGLGPLS 300
           +SDHNGFSLANRRVFSLKESDFNGMDESGFIDFKLDF SE+K +  VPKMG+GL    LS
Sbjct: 241 DSDHNGFSLANRRVFSLKESDFNGMDESGFIDFKLDFTSESKPDIFVPKMGLGL----LS 300

Query: 301 TPNSAFGSTRAFDTAAAHEYGRGLYGGTAGEGIIGSGGGSCRITVSDRGIKKGRKSLKAW 360
            PNSAFGSTRA D  AAHE  RGLYGG AGEGIIGSGGGSCRITVSDRGIKKGRKSLKAW
Sbjct: 301 NPNSAFGSTRALD-MAAHECSRGLYGGAAGEGIIGSGGGSCRITVSDRGIKKGRKSLKAW 360

Query: 361 KWIFKHPPNWANATGRKKEEDLMTKT 387
           KWIFKHPPNWANA+GRKKEE+LM+KT
Sbjct: 361 KWIFKHPPNWANASGRKKEEELMSKT 381

BLAST of Tan0007671 vs. ExPASy TrEMBL
Match: A0A1S3BG21 (uncharacterized protein LOC103489222 OS=Cucumis melo OX=3656 GN=LOC103489222 PE=4 SV=1)

HSP 1 Score: 712.2 bits (1837), Expect = 1.2e-201
Identity = 358/386 (92.75%), Postives = 370/386 (95.85%), Query Frame = 0

Query: 1   MRERGKALEVYSNDMDFYSSASEFPCKKHPSSSSVGVCADCLKDRLIKLVCSDCGEQRLS 60
           MRERGKALEVYSNDMDFYSSASEFPCKKHPSSSSVG+CADCLKDRLIKLVCSDCGEQRLS
Sbjct: 1   MRERGKALEVYSNDMDFYSSASEFPCKKHPSSSSVGICADCLKDRLIKLVCSDCGEQRLS 60

Query: 61  SCSCSEISSNRNSCTLEVGSVGRVSFLIENERNGVSLLGPMKPKMEKREEVVLLERSSSS 120
           SCSCSEISSNRNSCT+EVGSVGRVSFLIENERNGVSLLGP+KPK+EKREEVVLLERSSSS
Sbjct: 61  SCSCSEISSNRNSCTVEVGSVGRVSFLIENERNGVSLLGPIKPKLEKREEVVLLERSSSS 120

Query: 121 CVEIKKSGFWRIGKFFRKKREKGCERSSVCGFDDKSDICMVDYMGVSRSRSLCSFRGGGF 180
           CVEIKKSGFWRIGKFFRKKREKGCERSSVCGFD+KSDICMVDYMGVSRSRSLCSFRG GF
Sbjct: 121 CVEIKKSGFWRIGKFFRKKREKGCERSSVCGFDEKSDICMVDYMGVSRSRSLCSFRGNGF 180

Query: 181 FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRECGNY 240
           FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRECGNY
Sbjct: 181 FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRECGNY 240

Query: 241 ESDHNGFSLANRRVFSLKESDFNGMDESGFIDFKLDFISETKAEFSVPKMGMGLGLGPLS 300
           +SDHNGFSLANRRVFSLKESDFNGMDESGFIDFKLDF SE+K +  VPKMG+GL    LS
Sbjct: 241 DSDHNGFSLANRRVFSLKESDFNGMDESGFIDFKLDFTSESKPDIFVPKMGLGL----LS 300

Query: 301 TPNSAFGSTRAFDTAAAHEYGRGLYGGTAGEGIIGSGGGSCRITVSDRGIKKGRKSLKAW 360
            PNSAFGSTRA D  AAHE  RGLYGG AGEGIIGSGGGSCRITVSDRGIKKGRKSLKAW
Sbjct: 301 NPNSAFGSTRALD-MAAHECSRGLYGGAAGEGIIGSGGGSCRITVSDRGIKKGRKSLKAW 360

Query: 361 KWIFKHPPNWANATGRKKEEDLMTKT 387
           KWIFKHPPNWANA+GRKKEE+LM+KT
Sbjct: 361 KWIFKHPPNWANASGRKKEEELMSKT 381

BLAST of Tan0007671 vs. ExPASy TrEMBL
Match: A0A0A0KT28 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G608070 PE=4 SV=1)

HSP 1 Score: 710.7 bits (1833), Expect = 3.4e-201
Identity = 358/390 (91.79%), Postives = 370/390 (94.87%), Query Frame = 0

Query: 1   MRERGKALEVYSNDMDFYSSASEFPCKKHPSSSSVGVCADCLKDRLIKLVCSDCGEQRLS 60
           MRERGKALEVYSNDMDFYSSASEFPCKKHPSSSSVG+CADCLKDRLIKLVCSDCGEQRLS
Sbjct: 1   MRERGKALEVYSNDMDFYSSASEFPCKKHPSSSSVGICADCLKDRLIKLVCSDCGEQRLS 60

Query: 61  SCSCSEISSNRNSCTLEVGSVGRVSFLIENERNGVSLLGPMKPKMEKREEVVLLERSSSS 120
           SCSCSEISS RNSCT+EVGSVGRVSFLIENERNGVSLLGP+KPK+EKREEVVLLERSSSS
Sbjct: 61  SCSCSEISSKRNSCTVEVGSVGRVSFLIENERNGVSLLGPIKPKIEKREEVVLLERSSSS 120

Query: 121 CVEIKKSGFWRIGKFFRKKREKGCERSSVCGFDDKSDICMVDYMGVSRSRSLCSFRGGGF 180
           CVEIKKSGFWRIGKFFRKKREKGCERSSVCGFD+KSDICMVDYMGVSRSRSLCSFRG GF
Sbjct: 121 CVEIKKSGFWRIGKFFRKKREKGCERSSVCGFDEKSDICMVDYMGVSRSRSLCSFRGNGF 180

Query: 181 FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRECGNY 240
           FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRECGNY
Sbjct: 181 FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRECGNY 240

Query: 241 ESDHNGFSLANRRVFSLKESDFNGMDESGFIDFKLDFISETKAEFSVPKMGMGLGLGPLS 300
           +SDHNGFSLANRRVFSLKESDFNGMDESGFIDFKLDF SE+K + SVPKMG G+GLG LS
Sbjct: 241 DSDHNGFSLANRRVFSLKESDFNGMDESGFIDFKLDFTSESKQDISVPKMGFGMGLGLLS 300

Query: 301 TPNSAFGSTRAFDTAAAHEYGRGLYGGTAGEGIIGS----GGGSCRITVSDRGIKKGRKS 360
            PNS FGSTRAFD  AAHE  RGLY GTAGEGIIG+    GGGSCRITVSDRGIKKGRKS
Sbjct: 301 NPNSTFGSTRAFD-MAAHECSRGLYCGTAGEGIIGNGAGGGGGSCRITVSDRGIKKGRKS 360

Query: 361 LKAWKWIFKHPPNWANATGRKKEEDLMTKT 387
           LKAWKWIFKHPPNW NAT RKKEE+LM+KT
Sbjct: 361 LKAWKWIFKHPPNWTNATARKKEEELMSKT 389

BLAST of Tan0007671 vs. ExPASy TrEMBL
Match: A0A6J1GYD7 (uncharacterized protein LOC111458603 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111458603 PE=4 SV=1)

HSP 1 Score: 692.6 bits (1786), Expect = 9.5e-196
Identity = 349/391 (89.26%), Postives = 365/391 (93.35%), Query Frame = 0

Query: 1   MRERGKALEVYSNDMDFYSSASEFPCKKHPSSSSVGVCADCLKDRLIKLVCSDCGEQRLS 60
           MRERGKAL VYSNDMDFYSSASEFPCKKHPSSSSVG+CADCLKDRLI+LVCSDCGEQRLS
Sbjct: 1   MRERGKALAVYSNDMDFYSSASEFPCKKHPSSSSVGICADCLKDRLIRLVCSDCGEQRLS 60

Query: 61  SCSCSEISSNRNSCTLEVGSVGRVSFLIENERNGVSLLGPMKPKMEKREEVVLLERSSSS 120
           SCSCSEISSNRNSCT+EVGSVGRVSFLIENERNGVSLLGPMKPKMEKREE+VLLERSSSS
Sbjct: 61  SCSCSEISSNRNSCTVEVGSVGRVSFLIENERNGVSLLGPMKPKMEKREEIVLLERSSSS 120

Query: 121 CVEIKKSGFWRIGKFFRKKREKGCERSSVCGFDDKSDICMVDYMGVSRSRSLCSFRGGGF 180
           CVE KKSGFWRIG FFRKKREKGCERSSVCGFD+KSDICMVDYMGVSRSRSLCSFRGGG+
Sbjct: 121 CVESKKSGFWRIGNFFRKKREKGCERSSVCGFDEKSDICMVDYMGVSRSRSLCSFRGGGY 180

Query: 181 FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRECGNY 240
           FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHR+CGNY
Sbjct: 181 FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRDCGNY 240

Query: 241 ESDHNGFSLANRRVFSLKESDFNGMDESGFIDFKLDFISETKAEFSVPKM----GMGLGL 300
           ESDHNG SLA+RRVFSLKESDFNGMDESGFIDFKLDF SETK + SVPKM    G+GLGL
Sbjct: 241 ESDHNGLSLASRRVFSLKESDFNGMDESGFIDFKLDFTSETKPDISVPKMGLGLGLGLGL 300

Query: 301 GPLSTPNSAFGSTRAFDTAAAHEYGRGLYGGTAGEGIIGSGGGSCRITVSDRGIKKGRKS 360
           GP S PNSAFGS+RAFD A        LY G+ GEGIIGSGGGSCRITVSDRGIKKGRKS
Sbjct: 301 GPFSNPNSAFGSSRAFDMA--------LYSGSTGEGIIGSGGGSCRITVSDRGIKKGRKS 360

Query: 361 LKAWKWIFKHPPNWAN-ATGRKKEEDLMTKT 387
           LK+WKW+FKHPPNWAN AT RKKEE+LM+KT
Sbjct: 361 LKSWKWMFKHPPNWANAATSRKKEEELMSKT 383

BLAST of Tan0007671 vs. ExPASy TrEMBL
Match: A0A6J1J1U6 (uncharacterized protein LOC111480310 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111480310 PE=4 SV=1)

HSP 1 Score: 690.6 bits (1781), Expect = 3.6e-195
Identity = 346/387 (89.41%), Postives = 362/387 (93.54%), Query Frame = 0

Query: 1   MRERGKALEVYSNDMDFYSSASEFPCKKHPSSSSVGVCADCLKDRLIKLVCSDCGEQRLS 60
           MRERGKAL VYSNDMDFYSSASEFPCKKHPSSSSVG+CADCLKDRLI+LVCSDCGEQRLS
Sbjct: 1   MRERGKALAVYSNDMDFYSSASEFPCKKHPSSSSVGICADCLKDRLIRLVCSDCGEQRLS 60

Query: 61  SCSCSEISSNRNSCTLEVGSVGRVSFLIENERNGVSLLGPMKPKMEKREEVVLLERSSSS 120
           SCSCSEISSNRNSCT+EVGSVGRVSFLIENERNGVSLLGPMKPKMEKREE+VLLERSSSS
Sbjct: 61  SCSCSEISSNRNSCTVEVGSVGRVSFLIENERNGVSLLGPMKPKMEKREEIVLLERSSSS 120

Query: 121 CVEIKKSGFWRIGKFFRKKREKGCERSSVCGFDDKSDICMVDYMGVSRSRSLCSFRGGGF 180
           CVE KKSGFWRIG FFRKKREK CERSSVCGFD+KSDICMVD+MGVSRSRSLCSFRGGGF
Sbjct: 121 CVESKKSGFWRIGNFFRKKREKECERSSVCGFDEKSDICMVDHMGVSRSRSLCSFRGGGF 180

Query: 181 FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRECGNY 240
           FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHR+CGNY
Sbjct: 181 FGSEDGGDMVVSGGRSSISGARTSSVNGGLVCDSARRSGFSETEPRKSGFESDHRDCGNY 240

Query: 241 ESDHNGFSLANRRVFSLKESDFNGMDESGFIDFKLDFISETKAEFSVPKMGMGLGLGPLS 300
           ESDHNGF LA+RRVFSLKESDFNGM ESGFIDFKLDF SETK + SVPKMG+GLGLGP S
Sbjct: 241 ESDHNGFGLASRRVFSLKESDFNGMGESGFIDFKLDFTSETKPDISVPKMGLGLGLGPFS 300

Query: 301 TPNSAFGSTRAFDTAAAHEYGRGLYGGTAGEGIIGSGGGSCRITVSDRGIKKGRKSLKAW 360
            PN AFGS+RAFD A        LY G+ G+GIIGSGGGSCRITVSDRGIKKGRKSLK+W
Sbjct: 301 NPNPAFGSSRAFDMA--------LYSGSTGQGIIGSGGGSCRITVSDRGIKKGRKSLKSW 360

Query: 361 KWIFKHPPNWAN-ATGRKKEEDLMTKT 387
           KWIFKHPPNWAN AT RKKEE+LM+KT
Sbjct: 361 KWIFKHPPNWANAATSRKKEEELMSKT 379

BLAST of Tan0007671 vs. TAIR 10
Match: AT3G25590.1 (unknown protein; Has 149 Blast hits to 140 proteins in 44 species: Archae - 0; Bacteria - 6; Metazoa - 40; Fungi - 6; Plants - 39; Viruses - 0; Other Eukaryotes - 58 (source: NCBI BLink). )

HSP 1 Score: 184.9 bits (468), Expect = 1.2e-46
Identity = 173/441 (39.23%), Postives = 231/441 (52.38%), Query Frame = 0

Query: 1   MRERGK--ALEVYSNDMD-FYSSASEFPCKKHPS-SSSVGVCADCLKDRLIKLVCSDCGE 60
           M+ERGK  A+E  ++D+  +YS+ SEF C+KHPS SS VG+C  CL DRL+ LVCS+CGE
Sbjct: 1   MKERGKRTAMERRNDDVSLYYSTPSEFTCRKHPSVSSGVGICPYCLNDRLVNLVCSECGE 60

Query: 61  QRLSSCSCSEISSNRNSCTLEVGSVG----RVSFLIENERNGVSLLGPMKPKMEKREEVV 120
           QRLSSCSCS+IS  R        +VG    R+S LI+ ER         + K  K EEVV
Sbjct: 61  QRLSSCSCSDISPTRTVDAAVDAAVGENVVRISSLIDEER----AKQRKETKQRKTEEVV 120

Query: 121 LLERSSSSCVEI----KKSGFWRIGKFFRK---KREKGCERSSVCGFDDKSDICMVDY-- 180
           + +RSSSSCVEI    K   F RIG+FFRK   K+E+  E+      ++ +D  ++DY  
Sbjct: 121 VFKRSSSSCVEINKRTKNHRFSRIGRFFRKINLKKERDFEK------NNNNDSWVLDYNN 180

Query: 181 ----MGVSRSRSLCSFRGGGFF--GSEDGGDMVVSGGRSSISGARTSSVNGGL-VCDS-- 240
               +GVSRSRSLCSFRG   +  GSE+ G    S    + S AR+SSVNGGL +C++  
Sbjct: 181 DVKKLGVSRSRSLCSFRGKDLYCIGSEEDG----SSYSGAFSAARSSSVNGGLGLCETEY 240

Query: 241 -------ARRSGFSE-TEPRKSGFESDHRECGNYESDHNGFSLANRRVFSLKESDFNGMD 300
                   R+S FSE TE  KS FE      G  +S+ +  +  NR+      S+F+  +
Sbjct: 241 SRKSNFEGRKSNFSETTEHWKSNFE------GGRKSNFSETTTENRK------SNFSESE 300

Query: 301 ---ESGFIDFKLDFISET-----KAEFSVPKMGMGLGLGPLSTPN-----SAFGSTRAFD 360
               SGF   K +F SET     ++ FS  +     G  P +  N     S + + R  D
Sbjct: 301 PPRRSGFEARKSNF-SETEYPTRRSNFSETEYNTRRGNNPATAENHPRRSSNYEAARKSD 360

Query: 361 TAAAHEYGR--------------------------GLYGGTAGEGII-GSGGGSCRITVS 367
           +AA +   R                          G  GG   +G++   GGGSCR    
Sbjct: 361 SAAMNFTRRVMSMKESSYFTGGEEPGFIDLKFDSSGGGGGDVNDGVLEHGGGGSCR---K 411

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_008446515.12.4e-20192.75PREDICTED: uncharacterized protein LOC103489222 [Cucumis melo] >KAA0034504.1 put... [more]
XP_038892601.12.4e-20192.78uncharacterized protein LOC120081636 [Benincasa hispida][more]
XP_004135128.17.0e-20191.79uncharacterized protein LOC101207638 [Cucumis sativus] >KGN52019.1 hypothetical ... [more]
KAG7032127.12.0e-19589.26hypothetical protein SDJN02_06170, partial [Cucurbita argyrosperma subsp. argyro... [more]
XP_022957131.12.0e-19589.26uncharacterized protein LOC111458603 isoform X3 [Cucurbita moschata] >KAG6601341... [more]
Match NameE-valueIdentityDescription
A0A5A7STG51.2e-20192.75Putative lysozyme-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_s... [more]
A0A1S3BG211.2e-20192.75uncharacterized protein LOC103489222 OS=Cucumis melo OX=3656 GN=LOC103489222 PE=... [more]
A0A0A0KT283.4e-20191.79Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G608070 PE=4 SV=1[more]
A0A6J1GYD79.5e-19689.26uncharacterized protein LOC111458603 isoform X3 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1J1U63.6e-19589.41uncharacterized protein LOC111480310 isoform X3 OS=Cucurbita maxima OX=3661 GN=L... [more]
Match NameE-valueIdentityDescription
AT3G25590.11.2e-4639.23unknown protein; Has 149 Blast hits to 140 proteins in 44 species: Archae - 0; B... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR34197:SF3BNACNNG49610D PROTEINcoord: 1..385
NoneNo IPR availablePANTHERPTHR34197OS04G0591300 PROTEINcoord: 1..385

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0007671.1Tan0007671.1mRNA