Tan0020347 (gene) Snake gourd v1

Overview
NameTan0020347
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
LocationLG03: 66088629 .. 66098717 (+)
RNA-Seq ExpressionTan0020347
SyntenyTan0020347
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTTATTTTCTATTTCGGCTTATTCTGTAGATTGTAACTCATAGTGGCTCACTTTCTGGGTTAGTCGTTTTTCATACTTCAGAGGTTTGCTTCATCCTTTAATGTGCCACTTTTTTTTTAACTGATCGTAATCTGCTTAATGACGGTGTTATTTTGGTAGCTGATTCTGAGAACTGTAGCTCATAGTGCCTCATGTTCTTATTGAATTGCACTGTTAAATAGATGTTCTAGCATTATCGGAGATTTCTTGGTATTTTATAATCTACAAATGTTGGTTTCTCTCTTCCAGGTTTACTTTTTATCTCATGCGGGATTTTGGGGTCTAATTAGTTTCAGGAAATGTTCAATTTCTGAAGGTTTTTAAATTTTAGCTCTAGGAAGAGCTTAAATTAATGAGGCTTCGAATAGTTGAGGAGGGCAATTTCTTGGAACGCGAACATGTTTTAGGGGCTTGGGCCAGAGCCGGAGTGCTTAATAGTTATTATGTTTTAAATATTGTTTCATGCCCTTGTTTAACTTTTTCTTAAAAAAACGAATAATTTATTTCAGTGTTTGCTTTGCTTCTTTGTATTTAGGGCTTCAAGTCTTCGTATATTTGAAACTTTGGATTTGTTGGTGGAGTAGCAGAGAGCAATCATGGATGGTAAGTTGACAATTGCACTCAATCAATAGTTAGTGGATATTCTCTTTGGCAAATACTTTGCGATTGAATTTAACGTGTGGTTAAGTGAGTAACCTCATTTTCTGCTTCTGTTTTCTTAACTTTTATTTCTGTGTGTGTATGTGATGGGGGTTGTTCATAGTGCTGGATTCTCTTTTGTTTAATCCCGTTATTCTGTGGCCTGATCTCTGATCAATCGTTTGATAATGTATTCCCTGCTGTAAATTGAATAGAAAGGTTGTGTTTGTCTACAAATAATAAGGAACATGTATATTGTTGACTCGATGGGGTTTTGATGTGTTTATTATTCTAGAGATTGCTTCTTTAATTGTGTTGCATATGCATATTAACAATCCCAGGAAAGGTTGATTCTGTATGTTGCCTAAGTTTCAAAACTATTAGCTGTATTAAAATTTAAATTTGGAATCCCTTTATTTGCCTTTAGATTTCTTGAGATGACGTATGAGATTTATTTTCTGTTTGATTTTTTACTTTTTCTAGTGCAAAAGGAAATATCTAATTACAAACATCATAAAAATATTGGTTTTGTTTTTTTTTCCTTTCATTTTTTATCAAACAACTGCAGCTATAGATAAGTGGCTCAATTTCATATGCGGGTTTGGAACAATATCATTTTTGAATTTTGAACTCCCCATCGTCTCTCTCAAGCTTTTTAGTTTTCTTAATGTCCTTATGTGCGTCTAATTTAATATCTGATCCATTGTTACCTGCAAAAGTATGAATAGAACTCAAATGATTGCTTAACTTGGTTATTGAATTCGTTTGAGCTGAAACAGAGCTATAATTGACTAATTTTCACCATTTTGGCTTCAGTTGACTCTCAACCAACCATGGAAGAAACCATTTTGGTTGGTGATGATTTAATGATGGGGCCACCATCGCCAATCATTCCACCTGAAATTGCATCTCACGTGCTTGAAGATATTGATTTATGTGATGGGATATTGAGGAATCTCTTTTTATGTAAGTTTTGTTCTTTACTAGTAATTATCGTTATCTTTTAGAAGCTGCTCTTATACTGTAACATAGAATTTAAGCTTGTTTTCTTATTTTTGTTTATAGGCTTGCAAATCAATGATATTGAACCTTTTTGTCAAGATGAAATAGCTGTTTATCGTCAGTGTGCTGAAAAACGGGTAAGATTATAACTTGCATATTTTATCAATCTCTCTCTCTTTAGTTTAAGGTTTATTGAAATGGAAATTCGAGATGCCAGTTTTTCCTTATTTCTAGAATTGTTCAGTTGAGGGCTTCTGGGTCCTGGAAATCCATTAATTCAAGGCTTCTCTTTGGACTTTTGACTCTACCCTTCTTTTATTAATTAACTCTTCTTTTTTATTACTGTCAAGTGGAAGATCTTTCCATAATCCCCATGATATTCTTTTGCGTTGATTCCTTATCTCCCATTATATTTTGTTTCCTTGCTTCTTTAATGAAGTTCAGTTTCATATCAGAAAAATATGTCATTTTTCCTGTTCTTTGACACCTTCCTAATTATTTGGCACTTGATAAATACATTGTAGGATAGAGAACTAAGACAGCGGCTTCAAGACAGTGAACGGAAATTAGGGTCATCAATGCCTCTAGATGCAGCAAAGGAAAGGGCTACACAACTTGAATCAGAAGTTACATCACTGGAGAGGTATGCTACACTCTCCGGTAATAAGGCTACAATCACAAACCATTATTGTCAATTCTGAAACTTGTGGTAGATACTATTTCCCCTTGTTAATTTTTTACATTAGTTTCATCAAACTTGGTTGATCTTCAGTACAAATTAAATTCCCTCTGCGTTTCTTAAAGCTAGCAAGGTTGACCAGTGTATATCAAGAAAACTTTCTAGGATACCTCTTCTATATAAGAGAGGTTTTAGGGAAAAAAAAAAACTGGAGCACTTCCTGTAGTTTTTTCCAGATCTCCTGCTACAAATCTTTTTGGGTGACTATCTCAGGAACAATGTCGATGATTTATGGATCAGCATGGCTATGGATATAGATGGGATGTGGTGATTATTGAACCCTCTTACTTTTGAATATTGATCCTTGGGTAGGATGGGAAAGTGAGGATCTTGTGTTGTTCCTGGATGCTAGCTGGGTTTCTATGTTAGTTCTTTTGCAATTACTTATCTTTTTTGATTCAAGCCTATTGAAGAGTATTTTTTTTAACTGTCTACAGCAAGTTCTACAAGGAGAATCTCTTATCCTTCTTCCTTTTTGCCTTCTGTTTTGTGAATTAAGAGATATAACTACCCAACTGATGAACATTTTCTAATTTTCCTAGCTATCCTTATGCTGAAATGGAAAAAAAAAAAGTTCTAACATTCATACACTTATAGAATTATTCCAAATTCTCTCTTCTCTCTCCTCCCTCCTCCCCCCTCCCTCCCACCAATCCCTAGATCTCTACTGATCTCCTCTGGCCACCATGGATGCCACCAGCCACCAGCCTTCTTCCTTATGGAATTATTCCACTCGCTCAGTCTTTATTGGCCGAAAAACAAGGGTGGTTTTCATTCTTCTCCCTTATCTATGATTACCCAAGACAAGCACATAAGAAGGTAAGTCATCTCACTTACAGCGAAGTTGTTAAAAAATCGGAACAATTGGGGACTTTCGAACCAGCGGATGGTGTTGTTTTTCCAAATAGTTCTCCTCCAGACATTGATGCATTGATGAACTGTAGTGGTATTCTCATTTGCCAACGATATTCCCAGAAAGATCCTTGGCCTTAGATTAAGGTTGCACTTAAAGAAGTTTCTCTCCAAGATGTACTATGAATCCATTTCAAGATCATAAGGGCAGTGATTCATATCTATGATCAAAAACTCCCTGAAAAACTTGGTGATAGCTCTTGTTGGCATTTGATTTGAGGTCTCAAGTTGAATTTCCACACTTTCTCCACTGCCTTTTCTTGAAGGATAGAATGATCGCTTCATATGGTGGTTGGATTGAAATTTTTTTTAGTACATCAACAATTGGGGGTGGGGGAATTCGAACCCATGACCTCTTAGTCATGAGTCATACTTGATGCCAATTGGGCTATGCTTTTGTTGGCGGTTGGATTGAAATTTTGGACTTCCTTTAAGCCTCTGGACTAACAAGGCTTTCAGGTACATTGGAGATCATTGTGGTGGCTTCCTACAGACCTCAAATTGCTCGCGTGAATGTTCAGGGTCTTGACTCTTGGAAAAAACAATCTTTGATTAAGAAATTTATCTTAAAACAAAATCCAGGGATTGTTCTTTTACAAGAGACCAAGTTATCTTCTGTTAATCATCAACTGATAAAGGATATTTGAAGTTCATCACACATTGGTTGGGCATCTCTTGACGCAGTTAATTCTACGGGAGGAATTTTGATCCTTTGGAGTGAACTGGACTTCTTGGTTAAGGAAGTCATACAGGGTTTGTGTACTCTTTCCATTCATGTTTTTTTGATTGATGGGTTTTCTTTTTGGCTTACATCAGTATATGGTCCTTCTGGAAACAACTATTATGATGACTATTGGAGAGAATTGGATGGTTTGGCTGGTTTGGGTGGAAATCAATGGATTATTGGAGGGGATTTTAATGTTACTCGTTGGTCCTGGGAGAAGTCGCACGATCCTCTATCAGCCAGAGTATGAACACGTTCAATCAGTGGATTTCCAATTATAATTTGCTTGATGTTCCTTTGCAGAATGGTAGCTATACTTGGTTGAGTTTTGACCCTACACAATACTTATCTCTCTTGGATAGGTTCCTTATTACTGATGGATGCACAAACAAATTTGGTTTGGCTACTCTTAGACGAATGGATAGGGTTACTTCAGATCCTTATCCACCTTCTCTTTCCTTTGATAATATTATTTGGGGTCCTTGTCCTTTTCGATTTGAGAATTCGTGGTTGCAAATTGGTTCAATTCAGGATGATGTATCTGAATGGTGGACTGAAAACCCTTTTGTTGGCTGGCCAAGACACGATTTAATGATGAAATTGAGCCAGCACGGTTGCAATCTTTTATTTCTCAATTGTAGATTTTGGACAACTTAGAAGATAGTACACCTTTAACAACATCTCAAATTGAATCTCGTCGTCTTTTTCGTGGACAGATTGAGGTTTTGACAGCTCAGGATCATTTTTATTGGTAACAAAGATGCAAATTGAAATGGGTTAAGGAGGGTGATGTGAACACAAATTTTTTACATAGAATTTTAGCGGCGAGGAAAAGGAAGAATTCGATTACTGAAGTTATTTCAAGGAATGGATCAGTTTTCTAACCGCCGATGAAATTGAACATGAGTTTATTGATTTTTACTCTAATTTGTTCAAGAAAGATGCCAACCTTCGTTTTCTTCCATCTAATATTGGTTGGAGGCCTATATCTGTAGATCAGGCAAACAACCTAGAACGGGTAATCACAGAGGAGGAAGTTCGACAAGCTTTCTTTTCCTTGGGTTCTAGTAAATCACCGGGTCCTGATGGTTTTAAAATCGCGTTCTTCAAGTTTTTTTTTGGACATCATCAAGATTGATATTCTCACCATGATCCACGACTTCTTTTTCTTTTAGGGCTATTATTGCATCTGTGAATGAGACATACATTTGGTAAATGAGTTTCTTCCTATTAGCCTTATCTCTTGTGTTTATCTCTTGAGTCATGTCTAATCGGTTGAAGTTGGTCTTGCCCACCACTATAGCTGAAAATCAACTTGCTTTTGTAGCACATAGGCATATTATTGACGCTTCATTAATAGCCAATGAATTGATTGATGATTGGAGCCTTCGTCATCTTCAGGGCATGGTTTTGAAACTGGATTTAGAAAAAGCTTGTGATACAGTGGACTGGGATTTCTTAGATGCGATACTACATGCAAAAGGGTTTGGGGATCTATGGAGGAAATGGATTTGAGGTTGCCTTTCTCGTGCAAATTACTCCATTATCATCAATGGTCGGCCATGTGGGAAAATCATTCCTACTTGTGGAGTTCAACAAGGTGACCCTTTATCCTCTTTGCTCTTTATTTTGGTCTCTGATTGTCTAAGTTGACTACTGGCTCATAGTGCTAGTTTGGGTCATATATCAGCTCATCCAACGAGTGCATCACCGTTTTGTTTAAACCATTTGCAATTTACGAGTGATACTTTGCTCTTTTCTACTTCTGATCAAACTACACCACAACACCTATTTCAAGTCATTAAAATCTTTGAACAAACTTCTGATTGAAAATTAATTTTGCAAAGAGTGAGATTTTGGGAATTAACATTGATGACTTTGAGATGGAATGGATTTTGTCGACCTTTGGTTGTAAACAAAGATTTTGGCCTAACACTTACCTTGGTTTTCCTATGGGAGGAAGTTTAAAAATTGTTTCCTTTTGGCAGCCAATAATTGAACGTTTGCGACACAAGCTTCATAATAGGAAGTTTGCTGTCATTTCAAAAGGTGGGAGACGTACTATGATTCAAGCTATGTTGTCTAGCATGCCTACTTATTATATGTCCTTGTTTCAACTCCCTTCAAAGGTTATTAAGGTCTTGGATAAGATTGTTCATGATTTCTTTTGGGAAGGTTTGAGGGGTGATGGCGGAGTACATAATGTTAATTGGGCAACGACACAACTTCCTAAACTTATGGGAGGTCTTGGGATTCACAATTTTGGTCATCAGAACTCAGCTCTATTGGCTAAATGGATCTAGTGTTTCACGCATGAACCAAATTCTCTTTGGCGAAAGCTTGTAGTTGCTAAATACTATGGGGAGACGGTAACAGACAGTTGGGCATTTACCATTAGGAATATGTTAGGGGTTTAGGGATAATAGTAGTATGGTATTAGCAAGGGCATAAGGGTAATTAGTTAGGGAGTTAGTTAGTGTTTTGGTTATAAATAGAGGGAGTGAGTTTAAGGGAAGGGAGGAAAATTGTTGAGTGATTTTAGGGCTTGGGTTAGCATACTCAAGGGAGGTTCCAAGTGCCTTATACTTGGTTTATCTTGCTACGTTTTATCATTCATATTTCAATATATTTCCATTATCGTGTTCCTAGTATTTGGGTTCCTAACAATTGGTATCAGAGCGCGTTCATCTTGGGTATCGTCCTTAGTAATGGATCAAGATTGTGTGCTATTGAAGATACTGGGAATTCTCGAGGAAATGCAACAAACAGTGAAAGGAATGCAAAAACAGTTAGAGGAAAGTGATTGCCAAAAGATGATGAGAGAACAAAAACAAAAGATGGAGAGAGAACAAAAATGTGAAAAGGAGAAGGAAGACAGTAGTTCGGTGGGAGTAATCAAAGAAGCCAAAATGAAGAAGAGCAAGGAAAATCAGAGTAGATCAAGGTTGGGCAGCAAAATCAACCGAAAATTATCGATTCGTAGTCGAAGAGTGGCGGGAAGGCGTCGATCCTCCGTGAGAGCACCTCCAAAATCACAAGGGAAAACCAATCAGCAAGGTGAGAAGAAAGGGAGGGAGGAATCACCGGCGAGAACAGAGTCGGAAAAACAACAACACCGCAAGAGGAAGGAGAACCTACCAAGAAACCCATGGTTGAAGAATCGGAGAGGAATTTTGCAGCAGGTGAAGACGACCCGTGTGGCTAAAATTTCGGAAAATGACCAAGGGACAAAGAAGCAATGGAAGAACTTACGTGCGGTCAAATGGCGGGAAAAGCGAGGTGGTAGCCGAAAGAAGAAACGTCGACGGCAGGCGTGGCTGAACATGATGGCGGCTAGCCCTAATCACGTGGAGAAACGAATTACGGAAGGATCTGGAATAAAGAGGGTTGGCGAGTTGTTTAATGGGCTGGATCCAAATAAAGAGAGGGAGGCCCATCTGCTTACAAGGGTAAAATATAGTGGGGTTTTGAAAGGGTTGGATACAAATTGTAATGTGGAGGCCCAGTTGATTCCTAAACCAAAAAAAAAGAACTATTATACCAGACGGAAATTTGGGCTGTGTTGCACCTATTTTAAGGCCCAACCCAAGATCTCATCTTACTCACAAGAGCTAATCAACCCTAGTTCCCACTGTGTTCTACTATTTACTGCCACTGCAACCATCTTCATTGGGCAAGCTCACATCGATGGATGTCAACACCTGTTGTCACCACCACCCATTGCTACCGCCCATGGATGTACATTGAGCTCACCGACCAGTTCGAGGAAGTGGCGGCTAAATACAATGCAGAATATATCGTTTGTTTCTGTTCTTTTTATGTTTATGTTGAGTCTTGTAAGCACACCTTGAGGACAAGGTGTTTGAAGGGCCCAGTAATGTTAGGGGTTTAGGGATAATAGTAGTATGGTATTAGCAAGGGCATAAGGGTAATTAGTTAGGGAGTTAGTTAGTGTTTTGGTTATAAATAGAGGGAGTGGGTTTGAGGGAAGGGAGGAAAATTGTTGAGTGATTTTAGGGCTTGGGTTAGCATACTCAAGGGAGGTTCCAAGTGCCTTATACTTGGTTTATCTTGTTACGTTTTATCATTCATATTTCAATATATTTCCATTATCGTGTTCCTAGTATTTGGGTTCCTAACAGAATACCTCTTATAAATCTCCTTGGAAATATATTTGTAATGTTCGAGATTTGATTTCTTCTCAATCTGTTAGGCGTATTGGAAATGGTAAGTCTACAACATTTTAGACTGACTCTTGGCTTAGTTGCGGTTCTCTTGATGTTGTTTTCCCACGTCTCTTTCGGCTTACATTGCACCCTAATTCCTCGGTGGCTGGATTGTGAAATTTCATGGACTCTGCGTGGGATCTAAAGCTTTGTCAAAATTTAACAGATTTGAAAACCAATGAATAGGCTTGCCCCTCTCACCTGCTTTCTCATATTAGGCTTTCGGAGAGACATAATTTCGGGGTTTGGCCTTTGGAGCCTTCTAGAGGCTTTTCTGTTAAATCACTTACGGATTGTTTGTTGAGTTCTGGTGATACGAGGTTAAGTGATCTTTACTCTATTATCTGGAAGGATTAGTATCCTAAGAAGATCAAGATCTTCCTTGGGAGCTTAGTTTGGGAGCGGTTAATACTGCTGATAGGTTACAAAGAAAAATGTCTTATATGAGTCTTTCTCTTTCGTGGTGTATTATGTGTGGTTTAGATTCAGAGTCAACAGGTCATCTTTTTGTGCATTGTTCTTTTGCTTAACAGTTTTGGTGGTGTATTTTGGATGGCTTTAGTTGGTCAACAATTTTATCAAACAATATCTTTGATTTCCTTTCATCCGATTTGGTGGGACATCCTTTCAGAGGCGTACAGAAGTCTCTCTGGCTAGCCTTTGTTTGTGCTTTATATGGAATCTTTAGATGGAGAGGAATGGTCGTCATTTCAAGGATGTTTCTTATACCTTTGATCATTTTATGGATCATGTTCTTTCTATTGTTTATTTGTGAGCAAAACTTGACCACCTTTTGACCTTTATAGTTTATCTTTCTTTATTTCTAATTGGTACTCTTTCTTGTAATTCACCTACCGGTGCTCGGAGTTTATCAATGATATTCTTTCTTATTAAAAAAAAGTTTTGTGAATTAAGAGAATATTTCTCGTAGATTTGTGTTTCTGGTCTAGCAGTGTTTATGTTTCCCTTCTAGGACTCGATTGCAGTAATATTCTGTTTCATGTCCCTAACTTCACTCATTTATCGTTTTGATCTGCAGACGCTTGATTCTTGCTAGTGGAGTTGAAGGCATTGAAGGATTCCGTCAAAGATGGAGTTTGCACGGTCAGCTTACCGACACCAAGTAATCTAGATCAGCTCTCCCTTTAAGCTGTTGCTTTAGTTCATCGACGCCTTTTTCCTGATGAGTTTTATCAGTTGGTTTAGGTTTGCTACCTGAGCTACAATCAGGATTTTTCTCAAAAGACGCGATTCTAAAGAATGATTTTCTATTACATCACTCAGTGTTGATGTAACCCTTAGCAAAAATTTCAGTCTCTACATTCTTAACAACTGATTAAACCGATTTCAGTAACTCGCTAACCATCGACTTTAACTGTTGTCTTGTGTCCATTTTCATACAGGAAAAGGCTGGAATCTTTAAAACAAGGAATCGAGAACAGGAGAGACGATGAGCCTGTGGAGAAAACATCATCTTCCAAAAGATGGTTTTTTTAGTGTAAGTGATTGCTTCAAATACCAGGCTTCTTCCTATTTGGTTCTTAAAGGCTCAAAGAACACAATGATTCTTCTGAATACAAATAAAGCTTGAGTTTTAGGTTCTATATATCTAGTCTGAAAAGCCATCTTTTTACATTTTTGGCATAAAAAGAAGCTACTGGATATGAAGTTACTGTGTTATATCAGATAATACAAGCAGAGAAATTTATGTAATCATGAGTAGAGTCAGAAAAGTAGCTTTTTTTACACCCATCAAAGGGAAACTGCATCCATTGCACAATAGAAGTTATTGAGGAGAATTTAACTCT

mRNA sequence

GTTATTTTCTATTTCGGCTTATTCTGTAGATTGTAACTCATAGTGGCTCACTTTCTGGGTTAGTCGTTTTTCATACTTCAGAGGGCTTCAAGTCTTCGTATATTTGAAACTTTGGATTTGTTGGTGGAGTAGCAGAGAGCAATCATGGATGTTGACTCTCAACCAACCATGGAAGAAACCATTTTGGTTGGTGATGATTTAATGATGGGGCCACCATCGCCAATCATTCCACCTGAAATTGCATCTCACGTGCTTGAAGATATTGATTTATGTGATGGGATATTGAGGAATCTCTTTTTATGCTTGCAAATCAATGATATTGAACCTTTTTGTCAAGATGAAATAGCTGTTTATCGTCAGTGTGCTGAAAAACGGGATAGAGAACTAAGACAGCGGCTTCAAGACAGTGAACGGAAATTAGGGTCATCAATGCCTCTAGATGCAGCAAAGGAAAGGGCTACACAACTTGAATCAGAAGTTACATCACTGGAGAGACGCTTGATTCTTGCTAGTGGAGTTGAAGGCATTGAAGGATTCCGTCAAAGATGGAGTTTGCACGGTCAGCTTACCGACACCAAGAAAAGGCTGGAATCTTTAAAACAAGGAATCGAGAACAGGAGAGACGATGAGCCTGTGGAGAAAACATCATCTTCCAAAAGATGGTTTTTTTAGTGTAAGTGATTGCTTCAAATACCAGGCTTCTTCCTATTTGGTTCTTAAAGGCTCAAAGAACACAATGATTCTTCTGAATACAAATAAAGCTTGAGTTTTAGGTTCTATATATCTAGTCTGAAAAGCCATCTTTTTACATTTTTGGCATAAAAAGAAGCTACTGGATATGAAGTTACTGTGTTATATCAGATAATACAAGCAGAGAAATTTATGTAATCATGAGTAGAGTCAGAAAAGTAGCTTTTTTTACACCCATCAAAGGGAAACTGCATCCATTGCACAATAGAAGTTATTGAGGAGAATTTAACTCT

Coding sequence (CDS)

ATGGATGTTGACTCTCAACCAACCATGGAAGAAACCATTTTGGTTGGTGATGATTTAATGATGGGGCCACCATCGCCAATCATTCCACCTGAAATTGCATCTCACGTGCTTGAAGATATTGATTTATGTGATGGGATATTGAGGAATCTCTTTTTATGCTTGCAAATCAATGATATTGAACCTTTTTGTCAAGATGAAATAGCTGTTTATCGTCAGTGTGCTGAAAAACGGGATAGAGAACTAAGACAGCGGCTTCAAGACAGTGAACGGAAATTAGGGTCATCAATGCCTCTAGATGCAGCAAAGGAAAGGGCTACACAACTTGAATCAGAAGTTACATCACTGGAGAGACGCTTGATTCTTGCTAGTGGAGTTGAAGGCATTGAAGGATTCCGTCAAAGATGGAGTTTGCACGGTCAGCTTACCGACACCAAGAAAAGGCTGGAATCTTTAAAACAAGGAATCGAGAACAGGAGAGACGATGAGCCTGTGGAGAAAACATCATCTTCCAAAAGATGGTTTTTTTAG

Protein sequence

MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLEDIDLCDGILRNLFLCLQINDIEPFCQDEIAVYRQCAEKRDRELRQRLQDSERKLGSSMPLDAAKERATQLESEVTSLERRLILASGVEGIEGFRQRWSLHGQLTDTKKRLESLKQGIENRRDDEPVEKTSSSKRWFF
Homology
BLAST of Tan0020347 vs. NCBI nr
Match: XP_022972788.1 (uncharacterized protein LOC111471293 [Cucurbita maxima] >XP_022972789.1 uncharacterized protein LOC111471293 [Cucurbita maxima])

HSP 1 Score: 329.3 bits (843), Expect = 2.0e-86
Identity = 168/176 (95.45%), Postives = 173/176 (98.30%), Query Frame = 0

Query: 1   MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLEDIDLCDGILRNLFLCLQINDIE 60
           MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLE +DLCDGILRNLFLCLQINDIE
Sbjct: 1   MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLEGVDLCDGILRNLFLCLQINDIE 60

Query: 61  PFCQDEIAVYRQCAEKRDRELRQRLQDSERKLGSSMPLDAAKERATQLESEVTSLERRLI 120
           PFCQDEIAVYRQCAEKRDRELRQRLQDSERKLGSSMPLDAAKERATQLESEVTSLERRLI
Sbjct: 61  PFCQDEIAVYRQCAEKRDRELRQRLQDSERKLGSSMPLDAAKERATQLESEVTSLERRLI 120

Query: 121 LASGVEGIEGFRQRWSLHGQLTDTKKRLESLKQGIENRRDDEP-VEKTSSSKRWFF 176
           LASGVEGIEGFRQRWSLHG+LTDTKKR+ESLKQGIENRRDD+P  EKTS+SKRWFF
Sbjct: 121 LASGVEGIEGFRQRWSLHGRLTDTKKRVESLKQGIENRRDDKPAAEKTSTSKRWFF 176

BLAST of Tan0020347 vs. NCBI nr
Match: XP_023517156.1 (uncharacterized protein LOC111780996 [Cucurbita pepo subsp. pepo] >XP_023517158.1 uncharacterized protein LOC111780996 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 328.6 bits (841), Expect = 3.4e-86
Identity = 168/176 (95.45%), Postives = 172/176 (97.73%), Query Frame = 0

Query: 1   MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLEDIDLCDGILRNLFLCLQINDIE 60
           MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLE +DLCDGILRNLFLCLQINDIE
Sbjct: 1   MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLEGVDLCDGILRNLFLCLQINDIE 60

Query: 61  PFCQDEIAVYRQCAEKRDRELRQRLQDSERKLGSSMPLDAAKERATQLESEVTSLERRLI 120
           PFCQDEIAVYRQCAEKRDRELRQRLQDSERKLGSSMPLDAAKERATQLESEVTSLER LI
Sbjct: 61  PFCQDEIAVYRQCAEKRDRELRQRLQDSERKLGSSMPLDAAKERATQLESEVTSLERHLI 120

Query: 121 LASGVEGIEGFRQRWSLHGQLTDTKKRLESLKQGIENRRDDEP-VEKTSSSKRWFF 176
           LASGVEGIEGFRQRWSLHG+LTDTKKRLESLKQGIENRRDD+P  EKTS+SKRWFF
Sbjct: 121 LASGVEGIEGFRQRWSLHGRLTDTKKRLESLKQGIENRRDDKPAAEKTSTSKRWFF 176

BLAST of Tan0020347 vs. NCBI nr
Match: KAG6595315.1 (hypothetical protein SDJN03_11868, partial [Cucurbita argyrosperma subsp. sororia] >KAG7027324.1 hypothetical protein SDJN02_11336, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 327.8 bits (839), Expect = 5.8e-86
Identity = 167/176 (94.89%), Postives = 173/176 (98.30%), Query Frame = 0

Query: 1   MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLEDIDLCDGILRNLFLCLQINDIE 60
           MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLE ++LCDGILRNLFLCLQINDIE
Sbjct: 1   MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLEGVELCDGILRNLFLCLQINDIE 60

Query: 61  PFCQDEIAVYRQCAEKRDRELRQRLQDSERKLGSSMPLDAAKERATQLESEVTSLERRLI 120
           PFCQDEIAVYRQCAEKRDRELRQRLQDSERKLGSSMPLDAAKERATQLESEVTSLERRLI
Sbjct: 61  PFCQDEIAVYRQCAEKRDRELRQRLQDSERKLGSSMPLDAAKERATQLESEVTSLERRLI 120

Query: 121 LASGVEGIEGFRQRWSLHGQLTDTKKRLESLKQGIENRRDDEP-VEKTSSSKRWFF 176
           LASGVEG+EGFRQRWSLHG+LTDTKKRLESLKQGIENRRDD+P  EKTS+SKRWFF
Sbjct: 121 LASGVEGMEGFRQRWSLHGRLTDTKKRLESLKQGIENRRDDKPAAEKTSTSKRWFF 176

BLAST of Tan0020347 vs. NCBI nr
Match: XP_022143937.1 (uncharacterized protein LOC111013729 [Momordica charantia])

HSP 1 Score: 327.8 bits (839), Expect = 5.8e-86
Identity = 165/175 (94.29%), Postives = 172/175 (98.29%), Query Frame = 0

Query: 1   MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLEDIDLCDGILRNLFLCLQINDIE 60
           MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLE +DLCDGILRNLFLCLQINDIE
Sbjct: 1   MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLEGVDLCDGILRNLFLCLQINDIE 60

Query: 61  PFCQDEIAVYRQCAEKRDRELRQRLQDSERKLGSSMPLDAAKERATQLESEVTSLERRLI 120
           PFCQDEIAVYR+CAEKRDRELRQRLQDSE+KLGSSMPLDAAKERATQLESEVT LERRLI
Sbjct: 61  PFCQDEIAVYRRCAEKRDRELRQRLQDSEQKLGSSMPLDAAKERATQLESEVTLLERRLI 120

Query: 121 LASGVEGIEGFRQRWSLHGQLTDTKKRLESLKQGIENRRDDEPVEKTSSSKRWFF 176
           LASG+EG+EGFRQRWSLHG+LTDTKKRLESLKQGIENRR+DEP EKTSSSKRWFF
Sbjct: 121 LASGLEGVEGFRQRWSLHGRLTDTKKRLESLKQGIENRRNDEPREKTSSSKRWFF 175

BLAST of Tan0020347 vs. NCBI nr
Match: XP_011653622.1 (uncharacterized protein LOC101214968 [Cucumis sativus] >XP_031740768.1 uncharacterized protein LOC101214968 [Cucumis sativus] >XP_031740769.1 uncharacterized protein LOC101214968 [Cucumis sativus] >KGN54242.1 hypothetical protein Csa_018052 [Cucumis sativus])

HSP 1 Score: 327.4 bits (838), Expect = 7.5e-86
Identity = 166/175 (94.86%), Postives = 170/175 (97.14%), Query Frame = 0

Query: 1   MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLEDIDLCDGILRNLFLCLQINDIE 60
           MDVDSQP+MEETILVGDDLMMGPPSPIIPPEIASHVLE +DLCDGILRNLFLCLQINDIE
Sbjct: 1   MDVDSQPSMEETILVGDDLMMGPPSPIIPPEIASHVLEGVDLCDGILRNLFLCLQINDIE 60

Query: 61  PFCQDEIAVYRQCAEKRDRELRQRLQDSERKLGSSMPLDAAKERATQLESEVTSLERRLI 120
           PFCQDEIAVYRQCAEKRDRELRQRLQDSE KLGSSMPLDAAKERATQLESEVT LERRLI
Sbjct: 61  PFCQDEIAVYRQCAEKRDRELRQRLQDSECKLGSSMPLDAAKERATQLESEVTLLERRLI 120

Query: 121 LASGVEGIEGFRQRWSLHGQLTDTKKRLESLKQGIENRRDDEPVEKTSSSKRWFF 176
           LASGVEGIEGFRQRWSLHG+LTDTKKRLESLK+GIENRRDDEP  KTSSSKRWFF
Sbjct: 121 LASGVEGIEGFRQRWSLHGRLTDTKKRLESLKKGIENRRDDEPARKTSSSKRWFF 175

BLAST of Tan0020347 vs. ExPASy TrEMBL
Match: A0A6J1ICK9 (uncharacterized protein LOC111471293 OS=Cucurbita maxima OX=3661 GN=LOC111471293 PE=4 SV=1)

HSP 1 Score: 329.3 bits (843), Expect = 9.6e-87
Identity = 168/176 (95.45%), Postives = 173/176 (98.30%), Query Frame = 0

Query: 1   MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLEDIDLCDGILRNLFLCLQINDIE 60
           MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLE +DLCDGILRNLFLCLQINDIE
Sbjct: 1   MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLEGVDLCDGILRNLFLCLQINDIE 60

Query: 61  PFCQDEIAVYRQCAEKRDRELRQRLQDSERKLGSSMPLDAAKERATQLESEVTSLERRLI 120
           PFCQDEIAVYRQCAEKRDRELRQRLQDSERKLGSSMPLDAAKERATQLESEVTSLERRLI
Sbjct: 61  PFCQDEIAVYRQCAEKRDRELRQRLQDSERKLGSSMPLDAAKERATQLESEVTSLERRLI 120

Query: 121 LASGVEGIEGFRQRWSLHGQLTDTKKRLESLKQGIENRRDDEP-VEKTSSSKRWFF 176
           LASGVEGIEGFRQRWSLHG+LTDTKKR+ESLKQGIENRRDD+P  EKTS+SKRWFF
Sbjct: 121 LASGVEGIEGFRQRWSLHGRLTDTKKRVESLKQGIENRRDDKPAAEKTSTSKRWFF 176

BLAST of Tan0020347 vs. ExPASy TrEMBL
Match: A0A6J1CSA0 (uncharacterized protein LOC111013729 OS=Momordica charantia OX=3673 GN=LOC111013729 PE=4 SV=1)

HSP 1 Score: 327.8 bits (839), Expect = 2.8e-86
Identity = 165/175 (94.29%), Postives = 172/175 (98.29%), Query Frame = 0

Query: 1   MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLEDIDLCDGILRNLFLCLQINDIE 60
           MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLE +DLCDGILRNLFLCLQINDIE
Sbjct: 1   MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLEGVDLCDGILRNLFLCLQINDIE 60

Query: 61  PFCQDEIAVYRQCAEKRDRELRQRLQDSERKLGSSMPLDAAKERATQLESEVTSLERRLI 120
           PFCQDEIAVYR+CAEKRDRELRQRLQDSE+KLGSSMPLDAAKERATQLESEVT LERRLI
Sbjct: 61  PFCQDEIAVYRRCAEKRDRELRQRLQDSEQKLGSSMPLDAAKERATQLESEVTLLERRLI 120

Query: 121 LASGVEGIEGFRQRWSLHGQLTDTKKRLESLKQGIENRRDDEPVEKTSSSKRWFF 176
           LASG+EG+EGFRQRWSLHG+LTDTKKRLESLKQGIENRR+DEP EKTSSSKRWFF
Sbjct: 121 LASGLEGVEGFRQRWSLHGRLTDTKKRLESLKQGIENRRNDEPREKTSSSKRWFF 175

BLAST of Tan0020347 vs. ExPASy TrEMBL
Match: A0A0A0L0W6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G295470 PE=4 SV=1)

HSP 1 Score: 327.4 bits (838), Expect = 3.6e-86
Identity = 166/175 (94.86%), Postives = 170/175 (97.14%), Query Frame = 0

Query: 1   MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLEDIDLCDGILRNLFLCLQINDIE 60
           MDVDSQP+MEETILVGDDLMMGPPSPIIPPEIASHVLE +DLCDGILRNLFLCLQINDIE
Sbjct: 1   MDVDSQPSMEETILVGDDLMMGPPSPIIPPEIASHVLEGVDLCDGILRNLFLCLQINDIE 60

Query: 61  PFCQDEIAVYRQCAEKRDRELRQRLQDSERKLGSSMPLDAAKERATQLESEVTSLERRLI 120
           PFCQDEIAVYRQCAEKRDRELRQRLQDSE KLGSSMPLDAAKERATQLESEVT LERRLI
Sbjct: 61  PFCQDEIAVYRQCAEKRDRELRQRLQDSECKLGSSMPLDAAKERATQLESEVTLLERRLI 120

Query: 121 LASGVEGIEGFRQRWSLHGQLTDTKKRLESLKQGIENRRDDEPVEKTSSSKRWFF 176
           LASGVEGIEGFRQRWSLHG+LTDTKKRLESLK+GIENRRDDEP  KTSSSKRWFF
Sbjct: 121 LASGVEGIEGFRQRWSLHGRLTDTKKRLESLKKGIENRRDDEPARKTSSSKRWFF 175

BLAST of Tan0020347 vs. ExPASy TrEMBL
Match: A0A1S4DXE6 (uncharacterized protein LOC103490140 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103490140 PE=4 SV=1)

HSP 1 Score: 325.1 bits (832), Expect = 1.8e-85
Identity = 164/175 (93.71%), Postives = 169/175 (96.57%), Query Frame = 0

Query: 1   MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLEDIDLCDGILRNLFLCLQINDIE 60
           MDVDSQP+MEETILVGDDLMMGPPSPIIPPEIASHVLE +DLCDGILRNLFLCLQINDIE
Sbjct: 1   MDVDSQPSMEETILVGDDLMMGPPSPIIPPEIASHVLEGVDLCDGILRNLFLCLQINDIE 60

Query: 61  PFCQDEIAVYRQCAEKRDRELRQRLQDSERKLGSSMPLDAAKERATQLESEVTSLERRLI 120
           PFCQDEIAVYRQCAE+RDRELRQRLQDSE KLGSSMPLDAAKERATQLESEVT LERRLI
Sbjct: 61  PFCQDEIAVYRQCAERRDRELRQRLQDSEHKLGSSMPLDAAKERATQLESEVTLLERRLI 120

Query: 121 LASGVEGIEGFRQRWSLHGQLTDTKKRLESLKQGIENRRDDEPVEKTSSSKRWFF 176
           LASGVEGIEGFRQRWSLHG+LTDTKKRLESLK+GIENRRDDEP  KT SSKRWFF
Sbjct: 121 LASGVEGIEGFRQRWSLHGRLTDTKKRLESLKKGIENRRDDEPARKTLSSKRWFF 175

BLAST of Tan0020347 vs. ExPASy TrEMBL
Match: A0A6J1HFJ1 (uncharacterized protein LOC111463112 OS=Cucurbita moschata OX=3662 GN=LOC111463112 PE=4 SV=1)

HSP 1 Score: 324.3 bits (830), Expect = 3.1e-85
Identity = 166/176 (94.32%), Postives = 172/176 (97.73%), Query Frame = 0

Query: 1   MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLEDIDLCDGILRNLFLCLQINDIE 60
           MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLE ++LCDGILRNLFLCLQINDIE
Sbjct: 1   MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLEGVELCDGILRNLFLCLQINDIE 60

Query: 61  PFCQDEIAVYRQCAEKRDRELRQRLQDSERKLGSSMPLDAAKERATQLESEVTSLERRLI 120
           PFCQDEIAVYRQCAEKRDRELRQRLQDSERKLGSSMPLDAAKERATQLESEVTSLERRLI
Sbjct: 61  PFCQDEIAVYRQCAEKRDRELRQRLQDSERKLGSSMPLDAAKERATQLESEVTSLERRLI 120

Query: 121 LASGVEGIEGFRQRWSLHGQLTDTKKRLESLKQGIENRRDDEP-VEKTSSSKRWFF 176
           LASGVEG+EGFRQRWSLHG+LTDTKKRLESLKQGIE RRDD+P  EKTS+SKRWFF
Sbjct: 121 LASGVEGMEGFRQRWSLHGRLTDTKKRLESLKQGIEIRRDDKPAAEKTSTSKRWFF 176

BLAST of Tan0020347 vs. TAIR 10
Match: AT4G04190.1 (unknown protein; Has 35 Blast hits to 35 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 33; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 274.2 bits (700), Expect = 7.0e-74
Identity = 141/185 (76.22%), Postives = 155/185 (83.78%), Query Frame = 0

Query: 1   MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLEDIDLCDGILRNLFLCLQINDIE 60
           MDVDSQP MEETILVGDDLM GPPSP+IPPEIASHVLE +DLCDGILRNLFLCLQINDIE
Sbjct: 1   MDVDSQPMMEETILVGDDLMTGPPSPVIPPEIASHVLEGVDLCDGILRNLFLCLQINDIE 60

Query: 61  PFCQDEIAVYRQCAEKRDRELRQRLQDSERKLGSSMPLDAAKERATQLESEVTSLERRLI 120
           PFCQDE+A+YRQCAEKRD+ LR RLQ+SE KLG SMP++ AKER TQLE+E TS ER LI
Sbjct: 61  PFCQDELALYRQCAEKRDKILRVRLQESEHKLGLSMPIELAKERITQLEAEATSFERHLI 120

Query: 121 LASGVEGIEGFRQRWSLHGQLTDTKKRLESLKQGIENR----------RDDEPVEKTSSS 176
           LASG EGIEGFR+RWSLHG++TDTKKRLESLKQG+ENR           D  P  K S+ 
Sbjct: 121 LASGAEGIEGFRRRWSLHGRMTDTKKRLESLKQGMENRNKVEHEHNQHHDQSP--KPSAP 180

BLAST of Tan0020347 vs. TAIR 10
Match: AT4G04190.2 (unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 274.2 bits (700), Expect = 7.0e-74
Identity = 141/185 (76.22%), Postives = 155/185 (83.78%), Query Frame = 0

Query: 1   MDVDSQPTMEETILVGDDLMMGPPSPIIPPEIASHVLEDIDLCDGILRNLFLCLQINDIE 60
           MDVDSQP MEETILVGDDLM GPPSP+IPPEIASHVLE +DLCDGILRNLFLCLQINDIE
Sbjct: 1   MDVDSQPMMEETILVGDDLMTGPPSPVIPPEIASHVLEGVDLCDGILRNLFLCLQINDIE 60

Query: 61  PFCQDEIAVYRQCAEKRDRELRQRLQDSERKLGSSMPLDAAKERATQLESEVTSLERRLI 120
           PFCQDE+A+YRQCAEKRD+ LR RLQ+SE KLG SMP++ AKER TQLE+E TS ER LI
Sbjct: 61  PFCQDELALYRQCAEKRDKILRVRLQESEHKLGLSMPIELAKERITQLEAEATSFERHLI 120

Query: 121 LASGVEGIEGFRQRWSLHGQLTDTKKRLESLKQGIENR----------RDDEPVEKTSSS 176
           LASG EGIEGFR+RWSLHG++TDTKKRLESLKQG+ENR           D  P  K S+ 
Sbjct: 121 LASGAEGIEGFRRRWSLHGRMTDTKKRLESLKQGMENRNKVEHEHNQHHDQSP--KPSAP 180

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_022972788.12.0e-8695.45uncharacterized protein LOC111471293 [Cucurbita maxima] >XP_022972789.1 uncharac... [more]
XP_023517156.13.4e-8695.45uncharacterized protein LOC111780996 [Cucurbita pepo subsp. pepo] >XP_023517158.... [more]
KAG6595315.15.8e-8694.89hypothetical protein SDJN03_11868, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_022143937.15.8e-8694.29uncharacterized protein LOC111013729 [Momordica charantia][more]
XP_011653622.17.5e-8694.86uncharacterized protein LOC101214968 [Cucumis sativus] >XP_031740768.1 uncharact... [more]
Match NameE-valueIdentityDescription
A0A6J1ICK99.6e-8795.45uncharacterized protein LOC111471293 OS=Cucurbita maxima OX=3661 GN=LOC111471293... [more]
A0A6J1CSA02.8e-8694.29uncharacterized protein LOC111013729 OS=Momordica charantia OX=3673 GN=LOC111013... [more]
A0A0A0L0W63.6e-8694.86Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G295470 PE=4 SV=1[more]
A0A1S4DXE61.8e-8593.71uncharacterized protein LOC103490140 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A6J1HFJ13.1e-8594.32uncharacterized protein LOC111463112 OS=Cucurbita moschata OX=3662 GN=LOC1114631... [more]
Match NameE-valueIdentityDescription
AT4G04190.17.0e-7476.22unknown protein; Has 35 Blast hits to 35 proteins in 13 species: Archae - 0; Bac... [more]
AT4G04190.27.0e-7476.22unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae ... [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 71..91
NoneNo IPR availableCOILSCoilCoilcoord: 98..118
NoneNo IPR availablePANTHERPTHR36047OS01G0191000 PROTEINcoord: 1..145

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0020347.1Tan0020347.1mRNA
Tan0020347.2Tan0020347.2mRNA
Tan0020347.3Tan0020347.3mRNA