Tan0015238 (gene) Snake gourd v1

Overview
Name: Tan0015238
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Embryo sac development arrest protein
Location: LG06: 76122795 .. 76127231 (+)
RNA-Seq Expression: Tan0015238
Synteny: Tan0015238
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TTCTCAGATTTTCTCCTCTGTTTCTCTCTCTCTCTTCTTCTCTCTTTTCGTTTTCCTCTTTCGCTCTCTCAATCTCTCTTTATATTAATCTCAATAACCCTAGAACCCTAAGTTCTCTTTTTCCTCCGCCGTCTACGGCGGCGACTTTCCAGAACGTCGTCGGTGTGTAAATCTGGCCGTACTTTCATCCGTCACCGCCGCAACAACCACCGCCTCTGGATCCGACTCCGTCGTATCGACGTTTCGGATTCGGTTTAAACATTCCATTTTCTCTCTCTCTCTCAAATTTTCTTGGATTTATCCTGGGAATTTACGCTTTTGACCCTTTTTCGTTGCTAAAGTTACTGACATTCACCGTCTGCTTCTTCTTCTTCTTTTTTATTTTTTAACCGCACCAAGTCCTTTATTATTATTATTCCTTTTTTAGTTTTTTTTTTTCATTTCCTCTTATCATTTACGATTTGAAATTTGTTACGTCCGAAACGTCCAACTCTAGTGATTTGACGGGTGAACCTTTCTAGGAGTGTGATTTTACGATTTTACCCTTTGGAAGTTACACGTGGCATGATCTAAGGTTGAAGATGAGTGTTCACTCACGCACCACCACCACCATGTTGCCTCCTGGGGCGTCAAGGAAGCGAAAAGAAGTTGAGCCGTTTGTGAAGCCGAAAGCTGTGGGAGCCGACTCGGTCGCATCGAATCGGCTTCTGGCCGGCTACCTGGCTCACGAGTTTCTCACTAAAGGTACTTTGTTTGGGGAGAAGTACGAGCCGGCTCGAAGCGAAGCCGTTGGAATGGCTTGCTCGCAACCAGCCGAGGTTAAGAGGACGAAGCCGGAAGCCACCGCCGCGCCGAGGGTGAAAAAGGAAAATCAGAGCTATGCTGAGGTGGCAAGCATCTTGAAGATGGATGGGGCCCACTTGCCTGGGATTGTTAACCCGGCCCAGCTCGCTCGGTGGATTAAAATGTGACTGCCGACGTGGCGTCTTTTCATTGGATGTAAATTTTAGTTTTCCAGTTTGCGTTGAAATTTCCCATCGATGGATCATGAGAGGAGAGACGAGTTCTCTCTCCCTCTCTCACTGAAAATGCTTTCTGGAAATTTTTTTCTCTCTTTCTCTTTTGTTCTCTCGATTTTCAAATCGGCGTAGCGAAATCTCTGACTGATCGGACTGAGTTTGGAAGCGGAGAATCGGCTCGGTTTGTTCTAATTCTGTAATCGTACAGTTTCTCTTTAAATGACTATACATATTACTCCTCTTTTTTCTTCTGTTTCAGATTTTTTATTGCTCCGATTTCTGAAATACAGGTCTGTAATGGATGCATGCATACATACATTTCAAATGCTGCAAATGTTGCATCGACATTTGCATTAGAAAACGGAGTCATGAATGATCATTGAGGTACGTATGCTGTTGAAATTTTGTGCGTTTTACGAATGTGTTTGGTTATGTGTGGATCGATGAATGATTCTTCTCGTGTTCGTCGTATATGATCGGTTCGATCGATCCTCTTTTGTTAAGTCATAGCCGTTTAGGTTCGTTCATCCGTCGTGCCAATTTACAATTTGTTACGGCCTTAGATTTTCGCTGCGAATTTTTTCCGTTTCCGTCGATATATAGCTTTAATTTCATTGGAAGGAAACCGCAATTAATTTTGGAAACGTTCTTTGTGGAAGAACGTTGGATGATCGGATAATTGATTGCATCATATGCACTAACAAGTTTCTTTGTGGAAGAACTTAGAACTTGTTTTGGCTTTAGATATTCGGTGCGAATCTTCTCTGTTTCTGTCGTTATCTTCACTTGGAAGAAAACCGTCATTGATTTTAGAAACATTCTTTCTGGAAGAAACGTTGGATGATTGGATCCATGGCCACCGTGCATTTGGAATATCGCCGATAGTATTCAATTTTTAGAAGTTATGTTTGCTTCTAACCAAGGTTTTAAATGTTGATGGGATGGAAATTTCTGAAAATAATTTAAAAAATATATTTAAAT
TAAAAAATAAACCTCCTACACATTTTAAATGAGTTAAAATAGAATATTATATTCACATTATATTACATTTTGTTACTATTTTTTTTAATATTTCATCACTTTTGTACGATAATATAATGGATATGTCGATTCACCCCTATATGGAAATATCGATGGAAATGTCGTCATGTTGATGAAAATTTAATACCATGCTTCTCAACACGTGTTCGCCATTAAATTTTAGTTTAAAAATTTTGGTTTGAATTTTTGTACTTTGTATTTTTTTAAAAAAAAAGTAAAGTAGAAAGCAAAACAAATGATACAGAAGTAGTTTCTAAAGCTTAATTTTCAAGGCCCGGTGCATAATTATTTAGGCCATAATTATTTGATTTTTGGTTTTTGAAAATTTTGCTTAATTTTTTACTATTTATCTTTATCTTCTATATAATGATTTCATATTTCTTAAGATACCATGCTAAATTTCAAAAATAAGTTTTTAAAAGCTATTTTGGTAAGTTTTCAAAACTTAGCTTGTTTTTTTAAAATATAGGTAAAAGTAAATAAGAAAGCATAGATACTCATTGGTAGAAGTAGTATTTGTAGGTTTAATTTTCAAAAATCAAAAACAAAAAATCAAATGGTTATCAAATGGAGTCTTAGTTTCTGGTATCTAAAAATTAAGTTTGTGTTCTCTCATTTTTCCTAAAATAATTTTCATCTCTTCTAAGGAAACATTTAGACGGATGCCTAGCTAATTTCAAAAATAAAAATAATTTTTGAAAACTAAAAAAAAATAGTTTCTAAAAACTTATTTTTGTTTTTGAAATTTGAGTACACATTCACACATTTCTTTGAGAAAGATGAAAAATATTGTAAAGAAATTGAAATAAAAACAATTTTAATTTTTAAAAATCAAATAATTATGAAATTGGAGCTTAGTTTTCTTTCTTTCTTTAATTTTTTTTGACACTTAAGCTTATAACGGGAAAATCAATTATTTTTTACATTCTTTTGATAGAGTAGACAGTAAAAACCAAGAAACAAATGTGTTTTTGTCTGTTAAAGCGCTCTCTCTCTTTGTATAGTTATGTTTTTTTTTCCTTTACTCCCGCATGCGGAAACTCAAAAGAAACTTATTTTTCTAAGTTTCAACTGAGCCATTCAATGAAGAAAGAGAAATAATAAAAGAGAAATAATAAAAGAAAGAGAAAAGAGATGGATGCTTAAAAAATATGTCAGTTTCATTTTTAAAATCGATTATGTGTGTGACTGTTTGTTGTTTTTGGAGGAAAGAATAAAGGTCTGTTAAAGCACTCAAATTATCATAAATGATAGAACAAACATAATGTGTGAACAAGAGCATACTCAATTGACATCAAATATGATTTATGACTTAGAGGTCATGGAGATTCGAATTCTCCTACTCTCAATTGTTGATCTACAAAAAAAAAACAACATTTTTATGTAAAAGTTTATTATTATCATAAATTTGGATAGAAAAAGGAATTTGAGATTATTAATGAATTTTACCTTTATGTCGTTAATTTCAAAGCTTAGTTTGATTGCTCCACCGTCTAACTTCATTGTCAAGTCACTTATTTATTCCTCTTATTAAGGCGTGATAGCATAATCATTTGATTTTTTTTAAAAAAATTATCTTTATTTTCTTATAGTTTTTTTGCAATAGATTTTATTTTTGTCAAGAAAATATTTGAATTTTTAATTAAATTTCAAAAACAAAAGTAAGTTTTTAAAAATTACTTTTTTTTTTTAATTTTCAAATTTTGGTTTGAATTTTGAAAACATTTTAAAAAAATAGATAACAAAATAAAGAAAATCTATAGGTGATAAACTTAAATTTTCAAAAATTAGAAATCAATAGTTATCAAACTGAGTTTAAGCTTTTCTTTTTCAAAACTATTTTGCATGCTTGAATCGATGGAAAAAGGAATCGACGCAATTATAAAGATTGTATGGAGACTGTATTGTAAGTCGTAAGGAGCTTTGTGCTTGTGTTTTTT
CAACACTGTTTATGTGATAGTGGTTTGGAGTGTTTGTAAAACACTATAGTGAAAAGAAGTTTAATAATAAATGATTTTTAAAAGTGCTTTTGGAATGATTTTGGGTCAAACACAGGTAAAAAAGCTATTATCACATAGCAATTTAACTTAAAGTATTTAAGTACTTTTTTTTTTTTTTTTAAGTAAGAATTACAAACACCACCCCCTATGCTACAAATGGAGTTTGGCGTGGGCGTGGGGTTGGACATCTTTAAGCCTAGTATTTAGTTGGGGTCCAAATACCTAGTTTGTGTTTAGAGAAATTGTGTTGGAATGTAGGTTTTGAAAGTACCCACCAGAGCACGTAGAGATGCCCATTTTTACCTGGGGACTCACCCTTAACGGAGCAGAGAATTCTCGTTTAAGTGGGAATTAGGGAGGGAATAGGAAAAAAAAAT

mRNA sequence

TTCTCAGATTTTCTCCTCTGTTTCTCTCTCTCTCTTCTTCTCTCTTTTCGTTTTCCTCTTTCGCTCTCTCAATCTCTCTTTATATTAATCTCAATAACCCTAGAACCCTAAGTTCTCTTTTTCCTCCGCCGTCTACGGCGGCGACTTTCCAGAACGTCGTCGGTGTGTAAATCTGGCCGTACTTTCATCCGTCACCGCCGCAACAACCACCGCCTCTGGATCCGACTCCGTCGTATCGACGTTTCGGATTCGGTTTAAACATTCCATTTTCTCTCTCTCTCTCAAATTTTCTTGGATTTATCCTGGGAATTTACGCTTTTGACCCTTTTTCGTTGCTAAAGTTACTGACATTCACCGTCTGCTTCTTCTTCTTCTTTTTTATTTTTTAACCGCACCAAGTCCTTTATTATTATTATTCCTTTTTTAGTTTTTTTTTTTCATTTCCTCTTATCATTTACGATTTGAAATTTGTTACGTCCGAAACGTCCAACTCTAGTGATTTGACGGGTGAACCTTTCTAGGAGTGTGATTTTACGATTTTACCCTTTGGAAGTTACACGTGGCATGATCTAAGGTTGAAGATGAGTGTTCACTCACGCACCACCACCACCATGTTGCCTCCTGGGGCGTCAAGGAAGCGAAAAGAAGTTGAGCCGTTTGTGAAGCCGAAAGCTGTGGGAGCCGACTCGGTCGCATCGAATCGGCTTCTGGCCGGCTACCTGGCTCACGAGTTTCTCACTAAAGGTACTTTGTTTGGGGAGAAGTACGAGCCGGCTCGAAGCGAAGCCGTTGGAATGGCTTGCTCGCAACCAGCCGAGGTTAAGAGGACGAAGCCGGAAGCCACCGCCGCGCCGAGGGTGAAAAAGGAAAATCAGAGCTATGCTGAGGTGGCAAGCATCTTGAAGATGGATGGGGCCCACTTGCCTGGGATTGTTAACCCGGCCCAGCTCGCTCGGTGGATTAAAATGTGACTGCCGACGTGGCGTCTTTTCATTGGATGTAAATTTTAGTTTTCCAGTTTGCGTTGAAATTTCCCATCGATGGATCATGAGAGGAGAGACGAGTTCTCTCTCCCTCTCTCACTGAAAATGCTTTCTGGAAATTTTTTTCTCTCTTTCTCTTTTGTTCTCTCGATTTTCAAATCGGCGTAGCGAAATCTCTGACTGATCGGACTGAGTTTGGAAGCGGAGAATCGGCTCGGTCTGTAATGGATGCATGCATACATACATTTCAAATGCTGCAAATGTTGCATCGACATTTGCATTAGAAAACGGAGTCATGAATGATCATTGAGGTTTTGAAAGTACCCACCAGAGCACGTAGAGATGCCCATTTTTACCTGGGGACTCACCCTTAACGGAGCAGAGAATTCTCGTTTAAGTGGGAATTAGGGAGGGAATAGGAAAAAAAAAT

Coding sequence (CDS)

ATGAGTGTTCACTCACGCACCACCACCACCATGTTGCCTCCTGGGGCGTCAAGGAAGCGAAAAGAAGTTGAGCCGTTTGTGAAGCCGAAAGCTGTGGGAGCCGACTCGGTCGCATCGAATCGGCTTCTGGCCGGCTACCTGGCTCACGAGTTTCTCACTAAAGGTACTTTGTTTGGGGAGAAGTACGAGCCGGCTCGAAGCGAAGCCGTTGGAATGGCTTGCTCGCAACCAGCCGAGGTTAAGAGGACGAAGCCGGAAGCCACCGCCGCGCCGAGGGTGAAAAAGGAAAATCAGAGCTATGCTGAGGTGGCAAGCATCTTGAAGATGGATGGGGCCCACTTGCCTGGGATTGTTAACCCGGCCCAGCTCGCTCGGTGGATTAAAATGTGA

Protein sequence

MSVHSRTTTTMLPPGASRKRKEVEPFVKPKAVGADSVASNRLLAGYLAHEFLTKGTLFGEKYEPARSEAVGMACSQPAEVKRTKPEATAAPRVKKENQSYAEVASILKMDGAHLPGIVNPAQLARWIKM
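
The protein sequence above is the conceptual translation of the CDS. A minimal sketch of that translation, assuming the standard nuclear genetic code (NCBI translation table 1) and an input that contains only A, C, G and T; the names `CODON_TABLE` and `translate` are illustrative and not part of the database:

```python
from itertools import product

# Standard genetic code (NCBI table 1), written in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Assumes unambiguous bases; a codon containing N raises KeyError.
    Trailing bases that do not fill a whole codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGAGTGTTCACTCACGC"))  # MSVHSR
```

Passing the full CDS string to `translate` should reproduce the 129-residue protein, since the CDS ends in the stop codon TGA.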
Homology
BLAST of Tan0015238 vs. NCBI nr
Match: XP_022964143.1 (uncharacterized protein LOC111464257 [Cucurbita moschata])

HSP 1 Score: 232.6 bits (592), Expect = 1.9e-57
Identity = 117/128 (91.41%), Positives = 122/128 (95.31%), Query Frame = 0

Query: 2   SVHSRTTTTMLPPGASRKRKEVEPFVKPKAVGADSVASNRLLAGYLAHEFLTKGTLFGEK 61
           SVHSRTTTTMLPPGASRKRKEVE FVKPKAVGADSV SNRLLAGYLAHEFL+KGTLFGEK
Sbjct: 3   SVHSRTTTTMLPPGASRKRKEVEAFVKPKAVGADSVVSNRLLAGYLAHEFLSKGTLFGEK 62

Query: 62  YEPARSEAVGMACSQPAEVKRTKPEATAAPRVKKENQSYAEVASILKMDGAHLPGIVNPA 121
           YEPARSEAVGM CS+PAE K+TKPEA AAP V+KENQSYAEVASILKM+GAHLPGIVNPA
Sbjct: 63  YEPARSEAVGMTCSRPAEYKKTKPEAAAAPSVEKENQSYAEVASILKMEGAHLPGIVNPA 122

Query: 122 QLARWIKM 130
           QLARWIKM
Sbjct: 123 QLARWIKM 130
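
The identity figure reported for an HSP is the number of alignment columns in which query and subject carry the same residue, divided by the alignment length. A minimal sketch that recomputes it from the aligned rows of the HSP above; the function name `identity` is hypothetical, and gap columns (`-`) are assumed to count toward the length but never as matches:

```python
def identity(query, subject):
    """Return (identical columns, alignment length) for two aligned rows."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    matches = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    return matches, len(query)


# Aligned rows of the XP_022964143.1 HSP, concatenated across its three blocks.
query = (
    "SVHSRTTTTMLPPGASRKRKEVEPFVKPKAVGADSVASNRLLAGYLAHEFLTKGTLFGEK"
    "YEPARSEAVGMACSQPAEVKRTKPEATAAPRVKKENQSYAEVASILKMDGAHLPGIVNPA"
    "QLARWIKM"
)
subject = (
    "SVHSRTTTTMLPPGASRKRKEVEAFVKPKAVGADSVVSNRLLAGYLAHEFLSKGTLFGEK"
    "YEPARSEAVGMTCSRPAEYKKTKPEAAAAPSVEKENQSYAEVASILKMEGAHLPGIVNPA"
    "QLARWIKM"
)

matches, length = identity(query, subject)
print(f"{matches}/{length} ({100 * matches / length:.2f}%)")  # 117/128 (91.41%)
```

The similarity count additionally includes conservative substitutions (the `+` columns in the midline), which requires a scoring matrix and is not covered by this sketch.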

BLAST of Tan0015238 vs. NCBI nr
Match: KAG6593829.1 (hypothetical protein SDJN03_13305, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 232.6 bits (592), Expect = 1.9e-57
Identity = 117/128 (91.41%), Positives = 122/128 (95.31%), Query Frame = 0

Query: 2   SVHSRTTTTMLPPGASRKRKEVEPFVKPKAVGADSVASNRLLAGYLAHEFLTKGTLFGEK 61
           SVHSRTTTTMLPPGASRKRKEVE FVKPKAVGADSV SNRLLAGYLAHEFL+KGTLFGEK
Sbjct: 3   SVHSRTTTTMLPPGASRKRKEVETFVKPKAVGADSVVSNRLLAGYLAHEFLSKGTLFGEK 62

Query: 62  YEPARSEAVGMACSQPAEVKRTKPEATAAPRVKKENQSYAEVASILKMDGAHLPGIVNPA 121
           YEPARSEAVGM CS+PAE K+TKPEA AAP V+KENQSYAEVASILKM+GAHLPGIVNPA
Sbjct: 63  YEPARSEAVGMTCSRPAEYKKTKPEAAAAPSVEKENQSYAEVASILKMEGAHLPGIVNPA 122

Query: 122 QLARWIKM 130
           QLARWIKM
Sbjct: 123 QLARWIKM 130

BLAST of Tan0015238 vs. NCBI nr
Match: KAG7026157.1 (hypothetical protein SDJN02_12656, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 231.1 bits (588), Expect = 5.4e-57
Identity = 116/128 (90.62%), Positives = 122/128 (95.31%), Query Frame = 0

Query: 2   SVHSRTTTTMLPPGASRKRKEVEPFVKPKAVGADSVASNRLLAGYLAHEFLTKGTLFGEK 61
           SVHSRTTTTMLPPGASRKRKEVE FVKPKAVGADSV SNRLLAGYLAHEFL+KGTLFGEK
Sbjct: 3   SVHSRTTTTMLPPGASRKRKEVETFVKPKAVGADSVVSNRLLAGYLAHEFLSKGTLFGEK 62

Query: 62  YEPARSEAVGMACSQPAEVKRTKPEATAAPRVKKENQSYAEVASILKMDGAHLPGIVNPA 121
           YEPARSEAVGM CS+PAE K+TKPEA AAP V+KENQSYAEVASILKM+GAHLPGIVNPA
Sbjct: 63  YEPARSEAVGMTCSRPAEYKKTKPEAAAAPSVEKENQSYAEVASILKMEGAHLPGIVNPA 122

Query: 122 QLARWIKM 130
           QLARWIK+
Sbjct: 123 QLARWIKI 130

BLAST of Tan0015238 vs. NCBI nr
Match: XP_023514270.1 (uncharacterized protein LOC111778588 [Cucurbita pepo subsp. pepo] >XP_023524524.1 uncharacterized protein LOC111788416 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 229.6 bits (584), Expect = 1.6e-56
Identity = 116/128 (90.62%), Positives = 121/128 (94.53%), Query Frame = 0

Query: 2   SVHSRTTTTMLPPGASRKRKEVEPFVKPKAVGADSVASNRLLAGYLAHEFLTKGTLFGEK 61
           SVHSRTTTTMLPPGASRKRKEVE FVKPKAVGADSV SNRLLAGYLAHEFL+KGTLFGE 
Sbjct: 3   SVHSRTTTTMLPPGASRKRKEVEAFVKPKAVGADSVVSNRLLAGYLAHEFLSKGTLFGEI 62

Query: 62  YEPARSEAVGMACSQPAEVKRTKPEATAAPRVKKENQSYAEVASILKMDGAHLPGIVNPA 121
           YEPARSEAVGM CS+PAE K+TKPEA AAP V+KENQSYAEVASILKM+GAHLPGIVNPA
Sbjct: 63  YEPARSEAVGMTCSRPAEYKKTKPEAAAAPSVEKENQSYAEVASILKMEGAHLPGIVNPA 122

Query: 122 QLARWIKM 130
           QLARWIKM
Sbjct: 123 QLARWIKM 130

BLAST of Tan0015238 vs. NCBI nr
Match: XP_023514451.1 (uncharacterized protein LOC111778710 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 229.6 bits (584), Expect = 1.6e-56
Identity = 116/128 (90.62%), Positives = 121/128 (94.53%), Query Frame = 0

Query: 2   SVHSRTTTTMLPPGASRKRKEVEPFVKPKAVGADSVASNRLLAGYLAHEFLTKGTLFGEK 61
           SVHSRTTTTMLPPGASRKRKEVE FVKPKAVGADSV SNRLLAGYLAHEFL+KGTLFGE 
Sbjct: 3   SVHSRTTTTMLPPGASRKRKEVEAFVKPKAVGADSVVSNRLLAGYLAHEFLSKGTLFGEI 62

Query: 62  YEPARSEAVGMACSQPAEVKRTKPEATAAPRVKKENQSYAEVASILKMDGAHLPGIVNPA 121
           YEPARSEAVGM CS+PAE K+TKPEA AAP V+KENQSYAEVASILKM+GAHLPGIVNPA
Sbjct: 63  YEPARSEAVGMTCSRPAEYKKTKPEAAAAPSVEKENQSYAEVASILKMEGAHLPGIVNPA 122

Query: 122 QLARWIKM 130
           QLARWIKM
Sbjct: 123 QLARWIKM 130

BLAST of Tan0015238 vs. ExPASy TrEMBL
Match: A0A6J1HJY7 (uncharacterized protein LOC111464257 OS=Cucurbita moschata OX=3662 GN=LOC111464257 PE=4 SV=1)

HSP 1 Score: 232.6 bits (592), Expect = 9.0e-58
Identity = 117/128 (91.41%), Positives = 122/128 (95.31%), Query Frame = 0

Query: 2   SVHSRTTTTMLPPGASRKRKEVEPFVKPKAVGADSVASNRLLAGYLAHEFLTKGTLFGEK 61
           SVHSRTTTTMLPPGASRKRKEVE FVKPKAVGADSV SNRLLAGYLAHEFL+KGTLFGEK
Sbjct: 3   SVHSRTTTTMLPPGASRKRKEVEAFVKPKAVGADSVVSNRLLAGYLAHEFLSKGTLFGEK 62

Query: 62  YEPARSEAVGMACSQPAEVKRTKPEATAAPRVKKENQSYAEVASILKMDGAHLPGIVNPA 121
           YEPARSEAVGM CS+PAE K+TKPEA AAP V+KENQSYAEVASILKM+GAHLPGIVNPA
Sbjct: 63  YEPARSEAVGMTCSRPAEYKKTKPEAAAAPSVEKENQSYAEVASILKMEGAHLPGIVNPA 122

Query: 122 QLARWIKM 130
           QLARWIKM
Sbjct: 123 QLARWIKM 130

BLAST of Tan0015238 vs. ExPASy TrEMBL
Match: A0A6J1KXG9 (uncharacterized protein LOC111499092 OS=Cucurbita maxima OX=3661 GN=LOC111499092 PE=4 SV=1)

HSP 1 Score: 221.9 bits (564), Expect = 1.6e-54
Identity = 112/129 (86.82%), Positives = 117/129 (90.70%), Query Frame = 0

Query: 1   MSVHSRTTTTMLPPGASRKRKEVEPFVKPKAVGADSVASNRLLAGYLAHEFLTKGTLFGE 60
           MS HSRT TTMLPP ASRKRKEVEPFVKPKA G DSV+SN+LLAGYLAHEFL+KGTLFGE
Sbjct: 1   MSFHSRTITTMLPPRASRKRKEVEPFVKPKATGVDSVSSNQLLAGYLAHEFLSKGTLFGE 60

Query: 61  KYEPARSEAVGMACSQPAEVKRTKPEATAAPRVKKENQSYAEVASILKMDGAHLPGIVNP 120
           KYEP RSEAVGM  SQP+E KRTKPEA AAP V+KEN SYAEVASILKMDGAHLPGIVNP
Sbjct: 61  KYEPPRSEAVGMTSSQPSECKRTKPEAAAAPSVRKENHSYAEVASILKMDGAHLPGIVNP 120

Query: 121 AQLARWIKM 130
           AQLARWIKM
Sbjct: 121 AQLARWIKM 129

BLAST of Tan0015238 vs. ExPASy TrEMBL
Match: A0A6J1H3W4 (uncharacterized protein LOC111460190 OS=Cucurbita moschata OX=3662 GN=LOC111460190 PE=4 SV=1)

HSP 1 Score: 212.6 bits (540), Expect = 9.6e-52
Identity = 111/135 (82.22%), Positives = 117/135 (86.67%), Query Frame = 0

Query: 1   MSVHSRTTTTMLPPGASRKRKEVEPFVKPKAVGADSVASNRLLAGYLAHEFLTKGTLFGE 60
           MSVHSRT TTMLPP ASRKRKEVEPFVKPKA   DSV+SN+LLAGYLAHEFL+KGTLFGE
Sbjct: 1   MSVHSRTITTMLPPRASRKRKEVEPFVKPKATAVDSVSSNQLLAGYLAHEFLSKGTLFGE 60

Query: 61  KYEPARSEAVGMACSQPAEVKRTKPE------ATAAPRVKKENQSYAEVASILKMDGAHL 120
           KYEPARSEAVGM  SQP+E KR KPE      A AAP V+KE+ SYAEVASILKMDGAHL
Sbjct: 61  KYEPARSEAVGMTSSQPSECKRMKPEAATAAAAAAAPSVRKEDHSYAEVASILKMDGAHL 120

Query: 121 PGIVNPAQLARWIKM 130
           PGIVNPAQLARWIKM
Sbjct: 121 PGIVNPAQLARWIKM 135

BLAST of Tan0015238 vs. ExPASy TrEMBL
Match: A0A5A7UAS9 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold772G00360 PE=4 SV=1)

HSP 1 Score: 171.4 bits (433), Expect = 2.5e-39
Identity = 92/120 (76.67%), Positives = 100/120 (83.33%), Query Frame = 0

Query: 11  MLPPGAS-RKRKEVEPFVKPKAVGADSVASNRLLAGYLAHEFLTKGTLFGEKYEPARSEA 70
           M PPG S RKRKEVEP VKPK   AD++++NRLLAGYLAHEFL+ GTLFGEKYE A++EA
Sbjct: 1   MSPPGGSPRKRKEVEPLVKPKVAEADAISANRLLAGYLAHEFLSNGTLFGEKYETAQTEA 60

Query: 71  VGMACSQPAEVKRTKPEATAAPRVKKENQSYAEVASILKMDGAHLPGIVNPAQLARWIKM 130
           VGMA  Q AE KRTKPEA AA  +KK N SYAEVASILKMDGAHLPGIVNP QLA WIKM
Sbjct: 61  VGMANLQSAECKRTKPEAAAA-GIKKLNPSYAEVASILKMDGAHLPGIVNPGQLAWWIKM 119

BLAST of Tan0015238 vs. ExPASy TrEMBL
Match: A0A0A0LI65 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G173040 PE=4 SV=1)

HSP 1 Score: 169.9 bits (429), Expect = 7.2e-39
Identity = 92/120 (76.67%), Positives = 97/120 (80.83%), Query Frame = 0

Query: 11  MLPPGAS-RKRKEVEPFVKPKAVGADSVASNRLLAGYLAHEFLTKGTLFGEKYEPARSEA 70
           M PPG S RKRKEVEP VKPK   ADS+++NRLLAGYLAHEFL  GTLFGEKYEPA +EA
Sbjct: 1   MSPPGGSPRKRKEVEPLVKPKVAEADSISANRLLAGYLAHEFLCNGTLFGEKYEPALNEA 60

Query: 71  VGMACSQPAEVKRTKPEATAAPRVKKENQSYAEVASILKMDGAHLPGIVNPAQLARWIKM 130
           VGMA SQ  E KRTK EA AA  +KK N SYAEVA ILKMDGAHLPGIVNP QLA WIKM
Sbjct: 61  VGMANSQSTECKRTKLEAAAA-SIKKVNHSYAEVARILKMDGAHLPGIVNPGQLAWWIKM 119

BLAST of Tan0015238 vs. TAIR 10
Match: AT5G44060.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G04000.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 96.7 bits (239), Expect = 1.5e-20
Identity = 50/96 (52.08%), Positives = 68/96 (70.83%), Query Frame = 0

Query: 34  ADSVASNRLLAGYLAHEFLTKGTLFGEKYEPARSEAVGMACSQPAEVKRTKPEATAAPRV 93
           A+ V SN+LLAGYLAHEFL  GTLFGE + P +++A G   +Q  + ++ KP     P  
Sbjct: 57  AEPVCSNQLLAGYLAHEFLNNGTLFGELWNPTKAQA-GPLTTQSIDPRKNKPSHDIEPSD 116

Query: 94  KKENQSYAEVASILKMDGAHLPGIVNPAQLARWIKM 130
            K  + Y EVA+IL++DG HLPGIVNP+QLAR++K+
Sbjct: 117 HKRRR-YVEVANILRVDGTHLPGIVNPSQLARFLKL 150

BLAST of Tan0015238 vs. TAIR 10
Match: AT3G23440.1 (embryo sac development arrest 6 )

HSP 1 Score: 92.0 bits (227), Expect = 3.6e-19
Identity = 50/115 (43.48%), Positives = 68/115 (59.13%), Query Frame = 0

Query: 15  GASRKRKEVEPFVKPKAVGADSVASNRLLAGYLAHEFLTKGTLFGEKYEPARSEAVGMAC 74
           GASRKRK+ E  ++      ++   N LLAGY+AHE+LT GT+ G K     +E   +  
Sbjct: 10  GASRKRKDTESDLR------EAATPNWLLAGYMAHEYLTCGTMLGRKLYSGWAEVGPLVS 69

Query: 75  SQPAEVKRTKPEATAAPRVKKENQSYAEVASILKMDGAHLPGIVNPAQLARWIKM 130
             P + +           VKK  QSY+EVAS+ K DG H+PG+VNP QLA+WI+M
Sbjct: 70  PSPLQSR----------EVKKARQSYSEVASVFKTDGNHVPGVVNPTQLAKWIQM 108

BLAST of Tan0015238 vs. TAIR 10
Match: AT1G04000.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G44060.1); Has 62 Blast hits to 62 proteins in 10 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 62; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 88.6 bits (218), Expect = 4.0e-18
Identity = 49/105 (46.67%), Positives = 69/105 (65.71%), Query Frame = 0

Query: 28  KPKAVGADSVASNRL-LAGYLAHEFLTKGTLFGEKYEPARSEAVGMACSQPAEVKRTKPE 87
           K     A+ + SN+L LAGYL+HE+LT+GTLFGE++  AR++         AE  + KP 
Sbjct: 57  KSSTAAAEPIGSNQLMLAGYLSHEYLTQGTLFGEQWNQARAQ---------AESSKIKPS 116

Query: 88  ATAAP--RVKKENQSYAEVASILKMDGAHLPGIVNPAQLARWIKM 130
            T  P    + + + Y EVA++L+ DGA LPGIVNPAQLAR++K+
Sbjct: 117 HTVEPAEECEPKRKRYREVANLLRSDGAQLPGIVNPAQLARFLKL 152

The following BLAST results are available for this feature:
Match Name        E-value    Identity    Description
XP_022964143.1    1.9e-57    91.41       uncharacterized protein LOC111464257 [Cucurbita moschata]
KAG6593829.1      1.9e-57    91.41       hypothetical protein SDJN03_13305, partial [Cucurbita argyrosperma subsp. sorori...
KAG7026157.1      5.4e-57    90.63       hypothetical protein SDJN02_12656, partial [Cucurbita argyrosperma subsp. argyro...
XP_023514270.1    1.6e-56    90.63       uncharacterized protein LOC111778588 [Cucurbita pepo subsp. pepo] >XP_023524524....
XP_023514451.1    1.6e-56    90.63       uncharacterized protein LOC111778710 [Cucurbita pepo subsp. pepo]

Match Name        E-value    Identity    Description
A0A6J1HJY7        9.0e-58    91.41       uncharacterized protein LOC111464257 OS=Cucurbita moschata OX=3662 GN=LOC1114642...
A0A6J1KXG9        1.6e-54    86.82       uncharacterized protein LOC111499092 OS=Cucurbita maxima OX=3661 GN=LOC111499092...
A0A6J1H3W4        9.6e-52    82.22       uncharacterized protein LOC111460190 OS=Cucurbita moschata OX=3662 GN=LOC1114601...
A0A5A7UAS9        2.5e-39    76.67       Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A0A0LI65        7.2e-39    76.67       Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G173040 PE=4 SV=1

Match Name        E-value    Identity    Description
AT5G44060.1       1.5e-20    52.08       unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT3G23440.1       3.6e-19    43.48       embryo sac development arrest 6
AT1G04000.1       4.0e-18    46.67       unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term    IPR Description     Source         Source Term    Source Description                 Alignment
None        No IPR available    MOBIDB_LITE    mobidb-lite    disorder_prediction                coord: 1..24
None        No IPR available    PANTHER        PTHR34657      EMBRYO SAC DEVELOPMENT ARREST 6    coord: 19..129

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Tan0015238.1    Tan0015238.1    mRNA
Tan0015238.2    Tan0015238.2    mRNA