Tan0017332 (gene) Snake gourd v1

Overview
NameTan0017332
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionMacro domain-containing protein
LocationLG04: 16290908 .. 16302214 (-)
RNA-Seq ExpressionTan0017332
SyntenyTan0017332
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GTCGGATTGCGTTCTGCGGAGATTTTCGTGTTTGCACGTGAGTTCATGTTCTTATTAAACGCAAAATAATAAATTCAAAATTTCCGAGTTCCCTATCTGCCCTTTCCATCTCTCGCATGATTCTGATGACAAGCGCGGCTTGGCGGAACAGTGTAGCAGTGGGCAGCTCCATATTCCGCAGGGTATCGCAAACTCTCACCTTCTCTCCTAATTTCACGTCTTCGCCTCATTTCCGGCGATCAAGAGCAAGGACCATGGCCGTCTCAATGTCGAATGGATCGGGCAACGGAGTGGTTCGCTTCAAAATCTCTCACTCAACCGCATGCGTTATTCAGAAGGGTGATATCACGCAGTGGTTCATCGACGGTTCCTCTGACGCTATTGTTAGTGACTCTCTCTCTCTCTTTCTCTCTCTCTACTGAGAGGCGGCTATTTTACTAAGGGTTTATACTGATTTGCTTACTGGAAAGGGTGGTGGGAAGTAATCAGTTGAAGTCTCCTCTGCTTTTGATTAATGATTGGAATGATCTTTTGTTCAAATTCATGAATTGCTTTAAGTTTGTAGTTGATTTTCAAAATGGGAAGATGCTGTAGTGTAGATTTTTTTTCCTCAGTGTTATGCTTCGCGACCTGGAGGTACTAGAATCATCGATCAAGTCACTGAGTAACTATTTTGTAGAATTGTGATTGGGGGCTTGTATGGATTTAGGTTACTGCATACTCGAATTCTTTGGGTATGATTCCTATGAAATGGATCATTGATGAGAATGAACTCAGTGAAACAACTAAAAATACTGGTGCAAAATGTTTTAAGCTCGCTCAAGTTTGAAATAGCTCCCAATAATTTTTTCAGCTCCATGAAGTGTTGTTGCATCATAAACTATGCAATGACACCTAGAAAGAAGATAAAGAAAGAGGGTACCAAACTAATGAACTACCTTGCCTTGCATGCCAAAATTGCACCCTGTCTTAGAATGATTTCCTGTCGAACATTATTATGAAGACAAACCCTTTTCCATGTGTTTGACTTTGATTCATCTAATATTTTGCGTCTTATATTCAACGCACTCTTGTCTGGAATTCGAAGTGGTCATGCCTGGTTTGGCTCTATTGCATGCACTGTGAAGTTCAATGTATCCATTCAACTTTATGGCATGCTGTTTGTATCTGATGATTAAATACCTGTATTAGCTTATTCTGTAAGTGGTGCAACCAAGCATCAAGGAAATAAATCTCCAAGAACTCTCAACAAAATGTGTTGATTATTAGAATTCAAGCTCATATAAGAGGTTGACCCTCATATTTATAGAAGATTTAAGCCCACTAACGAAGTAAGTTATCAAACTAACCTACAAACTAGAACAAACTCAAACTAAAAATTACAGAACTATTACTAATTATACTACATCAGTAAGAAGGATTCATATCTTTTGTCCTGTATTTTCTTATTTTTAGAAGTTTCCAATACCATGAAAACTATTAAGATTTTGTTGCTTTGGCTTAAATCTTATGTAGGTTAATCCAGCAAATGAGGTAATGCTAGGAGGAGGTGGTGCTGATGGAGGTATGGAGTGTAAAACATTATTTTAGTTTTATAAGTTGTAAGACGAACTTACGATCTTGAGTTTTCTTTCCTCACCGTTTTGTAGCTATACATGATGCTGCTGGGCCAGATCTCGTACAGGCATGCTATTCTGTCCCAGAAGTCCAACCTGGAATTCGTTGTCCAACTGGAGAAGCAAGGATTACTCCGTAGGCTCTTCTAGTTCTAGTAACTGATCATGACCAACTGTTTCAATTGTAAATGCTAATATGATCTAAAAATGCAGAGGTTTTCGATTGCCAGCATCTCATGTAATCCATACTGTTGGACCAATCTACAACACCAGTAGTAACCCTCAAGCCTTACTGAGAAGTGCATATAGGTTCTGTTATGTAATTGCGCTTTCCTTTTCTGTTTGATTTACCATTCAGGACTTAACGTTTTCTATTGCCATATTGGTTTAGAAATTCCTTGGCTGTGGCAAAGGAGAATAACATTCAATACCTTGCCTTTCCTGCCATATCCTGTGGTGTATATCGGTAAGTTGACATGAAACGACTTCATTTTTTGCTTGTCTCTATGTTTTTTATCCCATTTTTTATTATAAAAGATTAACTGTTTCGTGCTTGATGGATCCAGGTTCCTCTTAAATTAGTTTTTCAATAACTTGTGTGTGTTAGTTTATACCATTCTTTTATTAAAATTAGTTAATTCATGATTGATTGATCCAGATTTCTCTGTCAAATATTTGTTTATTGATTTTTATTCTACCCTTGCAGATATCCTTTAAATGAAGCTGCCACAATAGCCTTATCTACCGTTAGAGAGTTTTCTCAGGGCTTGAAAGAGGTAATGGATTATTCTTCGGTACTTGGAAAACGTATCCTTCTTTTGATTGCAATATGTGGACACTTCCTTCTATCTTTTGGACATGTTTGAATTATCCTTCATTCTCGGTCAATTTTATTAAGGGCCTTTCCCCTTTCCAGTATATTATTTCTTCAGTTAAGAGCCTGAAACTACTCCAGGCTACCCTCCAGGTCTTTGGTAGCCAGTACCCTCCCTACTAATCTCTTCACTGAGGAGCCCCTTCTATTTGATCAATTATTATAGAAGCTCTAGTCATAGTCTTCAAATTCTGTGGGTTCTCTTGATTAAGACTTCTTCAACCTCTTTATATCCTCCGAGTCTTCATGTTGACTAGGAATCTGAGCTCCTCATTGTAGGACTCTTCTAAGTTAGGTTGGTAAGGCTTTGGATAAATAACATGGATGATAGGTTGAATCTTTATCTATGATGTTTATCAAATCTTTATGAGAAGTTTTACCGACTTTTTGGATGATCTTTACAGGATCCCATACTTCCTCACTAAGAAGCACGGATACGGATACGAGACACGGATACGATACGACACGGACACGGCGACACGTCATTTCTTTAGGCTAGGACACGTTTATTAAATATACATTTTTTAAAATATATATCATTTTTATATAGAAAAAATTTAAAGTTAATAAGTTTATGTATCTATATGCTTAAAAAAATGATTTTGATGTATTTCGTTCTCAAACTTCATTAATTTTGTCCTATATTACATGTATTTTAGTATACTTGACTAGTGTCATTATGTGCCTAACAAATAATGGACCTATATTGACTTTGTACAACTAGTTTCTAACACATCTATAATGCTAACAAATGTCCGATACGTGTCCAACAAGTGTCGGAGTGTCTAAGTGTCCGACACGTGTCGGACACGGACACGCTAGCCAAATTAAAGTGTCCGTGCTTCTTAGCTTCCTCATGGGGATGTTGGTTTTTACTTGGAACTCTCTGTTGCTCAAGTCTTAGCTTAAGCTTGATTAACACCTTGTCTCCAGCTCAGAACTTCGAAGGGTGACACTGATCTGTCCATTTCTTTCATGTGTCTTGAGACCTTTTTAGGTTGGCTTGATCCACTTTTGAAGTTTGGTCCATCTCTTTAGACATCATAAGTTGCCGACTCGAAGTAACCTCAAGAAGGCCCTTTCGATTTGGGGAACTCTTTTAACAATTAAAACAGGATTGTGTCGCATTAAGCAATTGAACTAAGTTCTTTTGTCGGGCATCAACAACTTAATCAGCTTGGTCTAAAGGTACTTGTAAATCTCTCATCTCTATTGCTAACAATCTTTTCCAAGATGCCTTGACCAATTGTTAAATGAATCCTTGTCAAAGAGAATAGAACAATTGTTATCCTTGCATAGTTGGTTAACAGATAGTAAATTTGTTGTCATGGAGGGAACATGAAGAATAGAAGGAAGATTGAAAGTGAAAGAAGGAGTAGGAAGGAGTCCTTTACCATTGTGTGCTACCAATAGGCCTTGACCATCTCTCAATGTCATTTTCTCATCTCCTTCACAAGGCACATGAGAGTTGATGAGGGCCATATCCTGGGCGACATGTGCATTACAACCTGAGTCCGTCAGCAAGACAGGGTCGTTAGCAGAGGAATCACTAGAAGAATAAAAAGCAACCATTGCTGCAAGTTGGGCAAGTGGGTGTCTCCCTTGGTAGGCACAATCCATTATGTGGAAGCAATCTAAGGCTGAATGCCCTGGTCGATTGCAGATTTGACACACATGCCTATTTCCATTGCTATGGTCAAAACCATTTGGAGTGAAGCCTCCACAATTTTGTGTGTTACTGCCTCGTCCTCCTTTGCGATGCCCATTCATTCCACGGCCAGATGAAGAGTGTCCTCCATGCCCCCTATTCCAAGAATTATTAGAGGAAGTCGTTCTGCCTTTGGTCTGTCTGATATTACGTTGCAGAGAGAGTAAGATCGTGGCAGATACAAGACTCTCCGTTGCTTTGTGTTGACTCTCAAGTGTTTGTTCTTCAACTTTAAGCAAGGTATACAGACTTTCAAAAGAAACAACATCAGATCGATTCCTGACAGACGTGCGAAAGGCATGATAATCAGAGGGTAAGCCATTTAGTACACAGATCAACATTTCTTCATCGTCAATAAGAACATTCATGTTTTCAAGTCGATCTTTAATGTCTTTTGCTCGTTGGATATATCATCAACGGATTCATCGGCCTTCTTGGTTAAAGATTGTTGGGAAGAGCGCAGATGAATGATGTTAGAACAAGAAACAGAGGAAAATCGCGTTTCGAGACATATCCAGGTTTCACGAGCAGTGGTGCAACCAACAACGAAGGCAAGTACCTGGGCTGAAAGCGTGGAGTTTATCAACACAGTGATTTCTCGATCTTGGGCAATCCAGTCGCCATGAAGAGGATTTACTTCAATCGTCAATCGTCCTTGGTCATCAAGGAGAAATCGTGTGGGTTTTGTAACAGATCCATCAACAAACCCAAAAAGCTTATGAGCATGGAGGATCGGTGCTTTCTGGTGCTTCCAAAGGGAGTAATTACTAGAATCCAACTTGAGTGAGACAAGGTTGCAGACGTTGGAGAGTAAAAGAATGGGCAACGAAGTTCTCGAGAGAACAGAGTTTTGGGACGTTGTAGAAGAAGAAGGTGGAATTCTGGCGTTAGTCAATAAGATGGCTCTGATATCATAAAACAAGAGAGATTACCGTGGTAAAGAATCTGATTTATTCAATACTCACATGCTTTACACAAGACCTTTTTATAGAAGGTTTGAGGTGTGCAATAACTGATCTGTTACAACAGATTCAGTTCTTAACTAATACAACAATCTAACTTAATTAAAAACAATCTAACAAAGTATAACATATATCTAGCAAAGGCTTCATGGAATTTTTGAAGAATAGTAGGGCTATCATCTCAGCGAAGCACGACTTGATTGTGGGAATAAAGGTGACATACTTCGAAATTTGGTCTACAAATCTTGGATGCCATTCTTGTTTGAAAAGCAGTAGAATGGAAATCTCTAAAGAAAAGAGGTTTCAACTGAAAGCATATTTTGAAAAAGCCTATGAGGCACATTGGTCTTACCTTGACGACCTTAGAGCTTAGGGGTCGTTTGGTGCAAGGTTTTGTGGATGGGAATGGGTTAGAGTTTTAAATCTCCTTGTTTGGAACAAGGTTTGTAAGCATGGGCATGGGTATCACTTATTCCCATGCATCACTTTTTCCCATGGTGGATGGGTTTCTTATACCCTTCAAGAATGATGGGTTTTCTTTATACCCATCCATCTCTCTCTTTCTCTCTCTTCTTTATTTATATCTCTCTCTTCTTTATTAATATTTATATATTTTTATTTATTTATTTATTTATTTTCAATAATCATAATTAATTATTTAATTAAAAATATTTATAATTAATTCATTTATTTTTATTAGTTGACATTAATTTTGATTAGTTAATAAAATAATGTTTTTAAGATTAATTTAATCAAATAAGCATAAACATTCATTATAAAAATATTAAATCACTAATTAATATTATATTAATCAATTAATTATAAATATTATTAATTTATGGTAAGGATAATCATTATAATTAATTATTCAATTCGTGATCAATTAATAATTAATTTTTTTATTAAAAAATATTAATTAATTATTATTTTTTAAATGCATTTTTATTATTTTATTTATAATAGATAATAATATATAAAATTTATAAGATAAATATCATAAAATAAAATTTCACATTATTTATTAGATCGTAAGTATATATTACACAAGATTTATATATTAATTTCGTCAAATAAATATAAAATATCATTTTTTACATAAAAATCTGATAACATTCCCATGTAGAACCAAACACAGTCATCTTATTCCCATGCATTTATTATTTCCAGACATCTATTACCCATTCCCATCCAATAAAACCATAAACCAAACGGAATGTTAAACCAAGCCTACATGATATAAACGGATTGTAGACTGTGTCTCTTCGTCAACAGTATCATCAATGAGAAATTAAGGGACAATTTTTTTGCTTGTAGCAGCTTGGACCCTTCTCTCCCTTCCTGTTCATGCCAGTGGGATACTCCTTTAGCTGTCTTATTCGTCATTGTCGTGGTAAGCAGACCCTTAAAGGCTTGGAGTTAGGAAAGTAGAAAATAGAGGTTTTTTGCTTTCAATACATAGATGACACCATTAATTCTTGTCCTTTATGAAGGTGGTTTGTTGGAAAATGTGTAATTAATTCTCGATCTCTTACTTAAAGCTTCTAGCCTCTATCAATCGTTCAAATCCACTATCATTGGCCCTATTCTAAGAAGCACGGACACTCTATTTTAGACAAAGTGTCCGTGTCGGATACGTGTCGGACACGCTCCGGACACGTGTCAGACACGCCAATTTAAGTGTCCCTTATTATTATTATTATTTTTAATTTCGGACACGTTTTGGACACGTTGTGGACACGTGACGGACACGCTGGATAAATTTTTAACCCAAAAAAGGTCTAACCCATTTATTTTGAGCCCAAAGTTAAAAGCGCATTTAATTTTATACACAAAAAAGACATTAAAAATATAAATAATAAAATTAAAAATTAAAAACTCTTCTACCACTTAGGAACAAAACATGTTTTCTATTTTCTTTTTTATATAAAACTGTTTTTTTTTTTTTTTTGTATTTACTCTTATTTCATTTGGAAACTTTGTTTATTGAATGTTAGATAATGTGTTTTGTACAATATGTTTTTTAGTGGATTAAGTTTAGTATTTTATGTTAAATGTACATATATCCTAAAATAATAATAAGAAAAGAACGTATCCCCAACGTGTCCGTGTCCTAGTTTTTTAGAAATTGGCGTATCGTCGTGTCGTGTCGTATCTGTATCCGTGTCTCGTGTGCGTGTCCGTGCTTCTTAGGCCTTATTTATGAGGAAGCTTCTCCTAAAGCCGGTTCTTGGTTGTGAGGTAGACCCTTTGTTGTTTAACTATCTTGGTCTCCTTGAATTCTAGACGTGGCTTTCTTTTTTGGATTCAAGTTGGCCAAGTGGGGGAACATCTGTATCGTTGCTCAATTTGGTGCAGTCAGTACTTTCAGACATCCCGCTCTGTTTTTTTTCCTCCCTCTTCAAAATTCCTATCAAGGTGACTAAAGCCTTGGAGAAAATGATAAGGGATTTCACGTGGAGAGGAGCTACCCTCGATCTAGGTGACCTTTTGAGCTGGAAATGGGCTACCCTTACTCAACTTTTAGAAGTTATTGAATAAGAACCTCAAAAGAAAGGAAAAAAAGGTTCTTCTGCCCCTTCAGAGGCAGGTGATCAACAACAAATATGGAGCTTCAAATTGGACCAAGATTCCTCAAGGCAGTAAAAAAGGTTGCTGTCTGGTGCATATGACCATAGGCAAATAGGACAATCGTGGCCATGACATGATAAACGAGCCTCCCCCACTTCCCTCTTTTTAAAAAGAGAAATATGTTGCATGAAGGCATTTATCCTTTACACAATCTTTGAAAGAAACACTTATCTTTTAATTAAAAGAAGCAATTATCTCACTATTACCAAATTTTTATTAATTCCAGATGGTGGTAATTTCAATAAGTTTCGTGCACGGATGAAATTGAGTTCATTCTTTTATTATTACTGCTATTAATATTACTATAATTTTTAATTTGCCATTGTGAAAAAGAAAAGTAGGGGAAAAGCTTAAGACGATAACTTGCATTACTGTTAATAATAAGTTAAATAACTCAAACAATTTGTTATCGACCCAAGTTACTTTACATCTTTTCAAAGAAATGTACAACTGTGAAAAGAGGTACAAGTCATCCTAAATTTTCAGACATTTGGTGTATTCATTTACAATCTTTCTTTATCTTAAAACAATATATCAATCCCCAGACTTTGTGATATCTTTCTTTATCAAAGGGTTTTTGTAACCTAGGTTTTGAGTTACATGTAGTTTCACAACATGTAGGTGTTAGGAACCCAAATACTAAGAACACAAGAATTGAAATATATTGTAAGATAAAGGATAAAAAGTAACCAGATAAACCAAGTATAAGACACTTGGAACCTCCTTCTATGCTAACCCAAACCCTAAAAACACTCAACAATTGTTTGTCTTCCCTCAATCCCAAAGCACTAACTAACTCCCTAACTAATTACCCTTATGCCCTTACTAATACCATACTACTACTATCCATAAACCCCTATTAGTTCCCTTACAATAGGGTAGAGATATTCGAATCTCTGACCTCTTGGTCGAGGATATATGCCTCATCCAGTTGAGCTATGCTCAGGTTGACACTTAGATTTTAAAAAATAAATAGCGGCATAAATACTGTTATGATTCTAATGGGATTTCTTATTAGGATATATTAGTAATTAGCTAGAGAGTTAGCTATGCTATTTGGTTATAAATAGAGGAGGTGAGTTAAAGGGGGAAGGGACATTAATTCAGTGAGTTTGGTCTAAAGGCTTGAGTGAGAGTGTGGAGGGTCCAAATTCCTTAAATTACTTAGTTTATCTTACTATTTTGTTATCTTTCATATTTCAATATATTTGGGTTCTATCAATTTGGTATCAGAGCGGTCAGTTTTGGCCCTGAGTTAGATGCAAATGAACGTGAAGATGTTAGAGTTGTTTTTAAGTTTACGACGATGTCGGGAAGAAATGAAGTCGATATTCCAATCTCTTCAGGATGAAATGAAGAGTTTCAAGGAGATGAGGATTCAAATAAATTAGGGTGAGTATAGTAGCGATAAAAGAAGAGATTGTAGAAATAAAGAAAGAGATGACTAAACGAAAAATCATTAGAGAAAAAAAAAAGAAAAGGACGAAGAAGCAGTAGAAGAAGCGGATTAGGGCGCACAAAAGAGGTGTAGAGAGAATTCTAAGACACGGTTGTTGGAGAACACCAAAGGTATGGTTGACAAAAGCAAAAAAACAAAGTCGAAGACGATCAGAAAAAAATTAAGAATCAGTTGAAGGAGGTTAGACAAAAGCGATTGATGGGAAACCAACCTGTATTTTGTCGGAATTGGAAGATCACGGGCGGATGGGGGAGAAAGAGAACGGGCTTTGCGAAGGCAGAATGTCCGGAGTGGGATCATTGGCCATTGTGGAAGACACAACGCTTCAGTAGCCACACAGTTCGAAAGAAAAGAAAAGAAAAGGCCAACAGAGGGCATTACCTACAACCCTGGCGATGGTGGCCAGAGGTGAAGAGCCAAGAACGACATCGATGCAAATGTCGTGAAGAGGCAGTGGAGGAATCGAAAGAAATGGATGGAAAAAATAAGAAAAAGGCGAATCATACCTAAAAGAGACCGATGCCAGTGCCGAGTGCAAGTGTGGTTGACGATGAAGTCCTAAAAAAGGTATTGCTGCAACTTTCATGAAGATGAATTCGTGGGTAGGCACGTGAGTCTTCAAAAAGAAGAATCTAAGTGGGCTTCCTTTTATTAAATGGAAAGGAGCCTATTAATTCCCAGTCTGGTTGAGGTAAGAAATTATCCATCATTTTGTTATCTCTCGGCCCCACACATTTGAAAAAAAAAAATAGGGCTTTGTACCCAATTTTGCTATTTGTGTTTGTTTAACAACACCTTGTCCTCAAGGTGTGTTTTAGGGGCAGGTAATGATAGGATTACGGTATTAGTAAATAGTTATAGAGTTAGTTATGCTATTTGGTCATAAATAGAGGGAGTGAGTTAGGGGAAGGGGCATTAGTTTAGTGAGTTTGTTTTAGGCTTGAGTGAGAGCAGTGTTTTAAAAAGCCCCTTCGGGCGCGCGCCTAGGCGCAAGGCGCACCTTAAGTGCCAGGCGCATGTAAGAAGGTGCGCGCCTCTGGGGCTTCTTAAGTTTATTTTATTTTTTTCATTTTTAATAGTAAAACGTATTGTTTTACTTATTTTAAATATATATTTAAGGCTATTTAGTGAAGAAGAGATTGATATTTAAGGCTTTTGATCTTGTATTGTTTATAATTTATTAGTGTTGTATTGTTTCATGCTTTTTAATGGAGTATCTATTGTCTCTATGTTCAAAAACTTATACTTGAACCTTATTTCTTTTACCTATTTTTCTTGGTGAAAACTTAATTTCTTTCCATGAAGGGTTTTTTTTTATTGTGTATGTATACATATATATTTTATATTTTTTTTATATATAGTACGCCTTAGAAGAAAAGCCTGTGCCTTTTGGTGCGCCTTGCACCTAGGCTCCGGAGGGCTATTGCGCCTTGAGCCTTTAAAAGCACTGAGTGAGAGTGCTCAAGAAAGGAGAAAGTTCAAGTTCCTCAAATTACTTGGTTTATCTTATTGCTTTGTTATCTTTCATATTTTAATACATTTGGGTTCGGTGACATTCTACTTTCTTATGGATCTCTTCGCAAAGCTTGATTAATGATCGTGTTCTATTATTTGCAGGTGCACTTTGTCCTTTATTCTTCTGATGTTTACAATGTTTGGTTGGACAAGGCCAATGAATTGCTCAAGAACTAATGCAGGCTGCGTTGGTAATGGTATCATTCTCCTTTATCATCTAAAGTTCTAGTGTTGTATTCATCTGAACACAGTCATGTTTTCTTATCAAATACTCTGAAGGGGTTACTAAAATAAACTTTGGATCCTTTCTCCATTGAAAGAATGGTTAGAAGTAAGTGCTTGGATCTTCTTGACCTATACTCTACTGCTAGCTTTTGTCTGTATATTTAATGCTTGGAAATACTTTCTTGATCAGACATTGGATTAGATCAAGGGGTTTAAGTTTAACTGTACTCACTGGCACTCCAATGCCCAAAGTGCGATTCTTAATGTCTTAGGAGTTGTTTGGGGCACTGAGTTGATTATAATAACATGAGTTATAATAA

mRNA sequence

GTCGGATTGCGTTCTGCGGAGATTTTCGTGTTTGCACGTGAGTTCATGTTCTTATTAAACGCAAAATAATAAATTCAAAATTTCCGAGTTCCCTATCTGCCCTTTCCATCTCTCGCATGATTCTGATGACAAGCGCGGCTTGGCGGAACAGTGTAGCAGTGGGCAGCTCCATATTCCGCAGGGTATCGCAAACTCTCACCTTCTCTCCTAATTTCACGTCTTCGCCTCATTTCCGGCGATCAAGAGCAAGGACCATGGCCGTCTCAATGTCGAATGGATCGGGCAACGGAGTGGTTCGCTTCAAAATCTCTCACTCAACCGCATGCGTTATTCAGAAGGGTGATATCACGCAGTGGTTCATCGACGGTTCCTCTGACGCTATTGTTAATCCAGCAAATGAGGTAATGCTAGGAGGAGGTGGTGCTGATGGAGCTATACATGATGCTGCTGGGCCAGATCTCGTACAGGCATGCTATTCTGTCCCAGAAGTCCAACCTGGAATTCGTTGTCCAACTGGAGAAGCAAGGATTACTCCAGGTTTTCGATTGCCAGCATCTCATGTAATCCATACTGTTGGACCAATCTACAACACCAGTAGTAACCCTCAAGCCTTACTGAGAAGTGCATATAGAAATTCCTTGGCTGTGGCAAAGGAGAATAACATTCAATACCTTGCCTTTCCTGCCATATCCTGTGGTGTATATCGATATCCTTTAAATGAAGCTGCCACAATAGCCTTATCTACCGTTAGAGAGTTTTCTCAGGGCTTGAAAGAGGTGCACTTTGTCCTTTATTCTTCTGATGTTTACAATGTTTGGTTGGACAAGGCCAATGAATTGCTCAAGAACTAATGCAGGCTGCGTTGGTAATGGTATCATTCTCCTTTATCATCTAAAGTTCTAGTGTTGTATTCATCTGAACACAGTCATGTTTTCTTATCAAATACTCTGAAGGGGTTACTAAAATAAACTTTGGATCCTTTCTCCATTGAAAGAATGGTTAGAAGTAAGTGCTTGGATCTTCTTGACCTATACTCTACTGCTAGCTTTTGTCTGTATATTTAATGCTTGGAAATACTTTCTTGATCAGACATTGGATTAGATCAAGGGGTTTAAGTTTAACTGTACTCACTGGCACTCCAATGCCCAAAGTGCGATTCTTAATGTCTTAGGAGTTGTTTGGGGCACTGAGTTGATTATAATAACATGAGTTATAATAA

Coding sequence (CDS)

ATGATTCTGATGACAAGCGCGGCTTGGCGGAACAGTGTAGCAGTGGGCAGCTCCATATTCCGCAGGGTATCGCAAACTCTCACCTTCTCTCCTAATTTCACGTCTTCGCCTCATTTCCGGCGATCAAGAGCAAGGACCATGGCCGTCTCAATGTCGAATGGATCGGGCAACGGAGTGGTTCGCTTCAAAATCTCTCACTCAACCGCATGCGTTATTCAGAAGGGTGATATCACGCAGTGGTTCATCGACGGTTCCTCTGACGCTATTGTTAATCCAGCAAATGAGGTAATGCTAGGAGGAGGTGGTGCTGATGGAGCTATACATGATGCTGCTGGGCCAGATCTCGTACAGGCATGCTATTCTGTCCCAGAAGTCCAACCTGGAATTCGTTGTCCAACTGGAGAAGCAAGGATTACTCCAGGTTTTCGATTGCCAGCATCTCATGTAATCCATACTGTTGGACCAATCTACAACACCAGTAGTAACCCTCAAGCCTTACTGAGAAGTGCATATAGAAATTCCTTGGCTGTGGCAAAGGAGAATAACATTCAATACCTTGCCTTTCCTGCCATATCCTGTGGTGTATATCGATATCCTTTAAATGAAGCTGCCACAATAGCCTTATCTACCGTTAGAGAGTTTTCTCAGGGCTTGAAAGAGGTGCACTTTGTCCTTTATTCTTCTGATGTTTACAATGTTTGGTTGGACAAGGCCAATGAATTGCTCAAGAACTAA

Protein sequence

MILMTSAAWRNSVAVGSSIFRRVSQTLTFSPNFTSSPHFRRSRARTMAVSMSNGSGNGVVRFKISHSTACVIQKGDITQWFIDGSSDAIVNPANEVMLGGGGADGAIHDAAGPDLVQACYSVPEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYLAFPAISCGVYRYPLNEAATIALSTVREFSQGLKEVHFVLYSSDVYNVWLDKANELLKN
Homology
BLAST of Tan0017332 vs. ExPASy Swiss-Prot
Match: Q87JZ5 (Macro domain-containing protein VPA0103 OS=Vibrio parahaemolyticus serotype O3:K6 (strain RIMD 2210633) OX=223926 GN=VPA0103 PE=4 SV=1)

HSP 1 Score: 156.8 bits (395), Expect = 3.2e-37
Identity = 82/166 (49.40%), Postives = 110/166 (66.27%), Query Frame = 0

Query: 69  ACVIQKGDITQWFIDGSSDAIVNPANEVMLGGGGADGAIHDAAGPDLVQACYSVPEVQPG 128
           A  + +GDIT   +    DAIVN AN  MLGGGG DGAIH AAGP L+ ACY+V +V  G
Sbjct: 3   AISLVQGDITTAHV----DAIVNAANPRMLGGGGVDGAIHRAAGPALINACYAVDDVD-G 62

Query: 129 IRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYLAF 188
           IRCP G+ARIT    L A +VIH VGPIY+  ++P+ +L SAY+ SL +A  N+ Q +A 
Sbjct: 63  IRCPFGDARITEAGNLNARYVIHAVGPIYDKFADPKTVLESAYQRSLDLALANHCQSVAL 122

Query: 189 PAISCGVYRYPLNEAATIALSTVREFSQGLKEVHFVLYSSDVYNVW 235
           PAISCGVY YP  EAA +A++  +       ++ F L+S ++ ++W
Sbjct: 123 PAISCGVYGYPPQEAAEVAMAVCQRPEYAALDMRFYLFSEEMLSIW 163

BLAST of Tan0017332 vs. ExPASy Swiss-Prot
Match: Q8P5Z8 (Macro domain-containing protein XCC3184 OS=Xanthomonas campestris pv. campestris (strain ATCC 33913 / DSM 3586 / NCPPB 528 / LMG 568 / P 25) OX=190485 GN=XCC3184 PE=4 SV=1)

HSP 1 Score: 155.6 bits (392), Expect = 7.2e-37
Identity = 80/167 (47.90%), Postives = 109/167 (65.27%), Query Frame = 0

Query: 72  IQKGDITQWFIDGSSDAIVNPANEVMLGGGGADGAIHDAAGPDLVQACYSVPEVQPGIRC 131
           + +GDITQ  +    D IVN ANE +LGGGG DGAIH AAGP L++AC ++PEV+PG+RC
Sbjct: 5   VWQGDITQLDV----DVIVNAANESLLGGGGVDGAIHRAAGPRLLEACEALPEVRPGVRC 64

Query: 132 PTGEARITPGFRLPASHVIHTVGPIYNTSS-NPQALLRSAYRNSLAVAKENNIQYLAFPA 191
           PTGE RIT GF L A H+ HTVGP++     N    L + Y  SL +A++  +  +AFPA
Sbjct: 65  PTGEIRITDGFDLKARHIFHTVGPVWRDGKHNEPEQLANCYWQSLKLAEQMMLHSIAFPA 124

Query: 192 ISCGVYRYPLNEAATIALSTVREFSQG---LKEVHFVLYSSDVYNVW 235
           ISCG+Y YPL +AA IA++  R++ +     K +  V Y+   Y  +
Sbjct: 125 ISCGIYGYPLYQAARIAVTETRDWQRSHKVPKHIVLVAYNEATYKAY 167

BLAST of Tan0017332 vs. ExPASy Swiss-Prot
Match: Q8PHB6 (Macro domain-containing protein XAC3343 OS=Xanthomonas axonopodis pv. citri (strain 306) OX=190486 GN=XAC3343 PE=4 SV=2)

HSP 1 Score: 154.1 bits (388), Expect = 2.1e-36
Identity = 78/167 (46.71%), Postives = 110/167 (65.87%), Query Frame = 0

Query: 72  IQKGDITQWFIDGSSDAIVNPANEVMLGGGGADGAIHDAAGPDLVQACYSVPEVQPGIRC 131
           + +GDIT+  +    D IVN ANE +LGGGG DGAIH AAGP L++AC ++P+V+PG+RC
Sbjct: 5   VWQGDITELDV----DVIVNAANESLLGGGGVDGAIHRAAGPRLLEACEALPQVRPGVRC 64

Query: 132 PTGEARITPGFRLPASHVIHTVGPIYNTS-SNPQALLRSAYRNSLAVAKENNIQYLAFPA 191
           PTGE RIT GF L A H+ HTVGP++     N    L + Y  SL +A++  +  +AFPA
Sbjct: 65  PTGEIRITDGFDLKARHIFHTVGPVWRDGRHNEPEQLANCYWQSLKLAEQMMLHSIAFPA 124

Query: 192 ISCGVYRYPLNEAATIALSTVREFSQG---LKEVHFVLYSSDVYNVW 235
           ISCG+Y YPL++AA IA++  R++ +     K +  V Y+   Y  +
Sbjct: 125 ISCGIYGYPLHQAARIAVTETRDWQRSHKVPKHIVLVAYNEATYKAY 167

BLAST of Tan0017332 vs. ExPASy Swiss-Prot
Match: Q9HXU7 (Macro domain-containing protein PA3693 OS=Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1) OX=208964 GN=PA3693 PE=4 SV=1)

HSP 1 Score: 143.7 bits (361), Expect = 2.8e-33
Identity = 82/163 (50.31%), Postives = 105/163 (64.42%), Query Frame = 0

Query: 72  IQKGDITQWFIDGSSDAIVNPANEVMLGGGGADGAIHDAAGPDLVQACYSVPEVQPGIRC 131
           + +GDIT+  +    DAIVN AN  +LGGGG DGAIH AAG +LV AC  +        C
Sbjct: 6   VWQGDITRLAV----DAIVNAANSSLLGGGGVDGAIHRAAGAELVAACRLLH------GC 65

Query: 132 PTGEARITPGFRLPASHVIHTVGPIYNTSSNPQA-LLRSAYRNSLAVAKENNIQYLAFPA 191
            TGEA+IT GFRLPA+HVIHTVGP++    N +A LL S YR SLA+A++     +AFPA
Sbjct: 66  KTGEAKITRGFRLPAAHVIHTVGPVWRGGDNGEAELLASCYRRSLALAEQAGAASVAFPA 125

Query: 192 ISCGVYRYPLNEAATIALSTV---REFSQGLKEVHFVLYSSDV 231
           ISCG+Y YPL +AA IA+  V   R     L+E+  V + S +
Sbjct: 126 ISCGIYGYPLEQAAAIAVEEVCRQRPAHSSLEEIVLVAFDSSM 158

BLAST of Tan0017332 vs. ExPASy Swiss-Prot
Match: Q8KAE4 (Macro domain-containing protein CT2219 OS=Chlorobaculum tepidum (strain ATCC 49652 / DSM 12025 / NBRC 103806 / TLS) OX=194439 GN=CT2219 PE=4 SV=1)

HSP 1 Score: 142.9 bits (359), Expect = 4.8e-33
Identity = 82/165 (49.70%), Postives = 112/165 (67.88%), Query Frame = 0

Query: 74  KGDITQWFIDGSSDAIVNPANEVMLGGGGADGAIHDAAGPDLVQACYSVPEVQPGIRCPT 133
           K DIT   +    DAIVN AN  +LGGGG DGAIH AAGP L++AC  +        C T
Sbjct: 11  KADITSLTV----DAIVNAANTSLLGGGGVDGAIHRAAGPKLLEACRELG------GCLT 70

Query: 134 GEARITPGFRLPASHVIHTVGPIYNTSSNPQA-LLRSAYRNSLAVAKENNIQYLAFPAIS 193
           GEA+IT G+RLPA+ VIHTVGP+++  ++ +A LL S YRNSL +A E++ + +AFP+IS
Sbjct: 71  GEAKITKGYRLPATFVIHTVGPVWHGGNHGEAELLASCYRNSLKLAIEHHCRTIAFPSIS 130

Query: 194 CGVYRYPLNEAATIALSTVREF---SQGLKEVHFVLYSS---DVY 232
            G+Y YP+ +AA IA++TVRE     +G+++V F  +S    DVY
Sbjct: 131 TGIYGYPVEQAAAIAITTVREMLADERGIEKVIFCCFSDRDLDVY 165

BLAST of Tan0017332 vs. NCBI nr
Match: XP_038891619.1 (macro domain-containing protein VPA0103, partial [Benincasa hispida])

HSP 1 Score: 419.9 bits (1078), Expect = 1.6e-113
Identity = 211/247 (85.43%), Postives = 228/247 (92.31%), Query Frame = 0

Query: 2   ILMTSAAWRNSVAVGSSIFRRVS----QTLTFSPNFTSSPHFRRSRARTMAVSMSNGSGN 61
           +LMTS A RNSVAVGSSI RR S    Q+ T S N + SPHFRRS ART+AVSM+N SG+
Sbjct: 17  VLMTSTARRNSVAVGSSILRRASLSQFQSFTLSSNLSFSPHFRRSTARTLAVSMANESGS 76

Query: 62  GVVRFKISHSTACVIQKGDITQWFIDGSSDAIVNPANEVMLGGGGADGAIHDAAGPDLVQ 121
           GVVRFK+S STACVIQKGDITQWFIDGSSDAIVNPAN+VMLGGGGADGAIH+AAGPDLVQ
Sbjct: 77  GVVRFKVSPSTACVIQKGDITQWFIDGSSDAIVNPANKVMLGGGGADGAIHNAAGPDLVQ 136

Query: 122 ACYSVPEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAV 181
           ACYSV EVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYN SSNPQALLRSAYRNSLAV
Sbjct: 137 ACYSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNVSSNPQALLRSAYRNSLAV 196

Query: 182 AKENNIQYLAFPAISCGVYRYPLNEAATIALSTVREFSQGLKEVHFVLYSSDVYNVWLDK 241
           AKENNIQY+AFPAISCGV+RYP +EAAT+ALSTV+EFSQGLKEVHFVLY+SD+YNVWLDK
Sbjct: 197 AKENNIQYIAFPAISCGVFRYPYDEAATVALSTVKEFSQGLKEVHFVLYASDIYNVWLDK 256

Query: 242 ANELLKN 245
           AN+LLKN
Sbjct: 257 ANKLLKN 263

BLAST of Tan0017332 vs. NCBI nr
Match: XP_022943121.1 (uncharacterized protein LOC111447945 [Cucurbita moschata] >XP_023529382.1 uncharacterized protein LOC111792254 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 413.7 bits (1062), Expect = 1.1e-111
Identity = 208/245 (84.90%), Postives = 224/245 (91.43%), Query Frame = 0

Query: 4   MTSAAWRNSVAVGSSIFRRVS----QTLTFSPNFTSSPHFRRSRARTMAVSMSNGSGNGV 63
           MTS  WRNSVA+GSSIFRRVS    Q+ T S N + S   RRS A+ +A++MSNGSG+GV
Sbjct: 1   MTSTVWRNSVALGSSIFRRVSPSQLQSFTVSSNLSFSLPLRRSTAKILAMAMSNGSGSGV 60

Query: 64  VRFKISHSTACVIQKGDITQWFIDGSSDAIVNPANEVMLGGGGADGAIHDAAGPDLVQAC 123
           VRFK+S STACVIQKGDIT+WFIDGSSDAIVNPANEVMLGGGGADGAIH+AAGPDLVQAC
Sbjct: 61  VRFKVSPSTACVIQKGDITKWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDLVQAC 120

Query: 124 YSVPEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAK 183
           YSV EVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYN SSNPQALLRSAYRNSLAVAK
Sbjct: 121 YSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNASSNPQALLRSAYRNSLAVAK 180

Query: 184 ENNIQYLAFPAISCGVYRYPLNEAATIALSTVREFSQGLKEVHFVLYSSDVYNVWLDKAN 243
           ENNIQY+AFPAISCGVYRYP +EAATIALSTV+EFS GLKEVHFVLYSSD+YNVWL+KAN
Sbjct: 181 ENNIQYIAFPAISCGVYRYPFDEAATIALSTVKEFSNGLKEVHFVLYSSDIYNVWLEKAN 240

Query: 244 ELLKN 245
           ELLKN
Sbjct: 241 ELLKN 245

BLAST of Tan0017332 vs. NCBI nr
Match: XP_022987338.1 (uncharacterized protein LOC111484918 [Cucurbita maxima])

HSP 1 Score: 411.4 bits (1056), Expect = 5.5e-111
Identity = 207/245 (84.49%), Postives = 223/245 (91.02%), Query Frame = 0

Query: 4   MTSAAWRNSVAVGSSIFRRVS----QTLTFSPNFTSSPHFRRSRARTMAVSMSNGSGNGV 63
           MTS  WRNSVAVGSSIFRRVS    Q+ T S N + S   RRS A+ +A++MSNGSG+GV
Sbjct: 1   MTSTVWRNSVAVGSSIFRRVSPSQLQSFTVSSNLSFSLPLRRSTAKILAMAMSNGSGSGV 60

Query: 64  VRFKISHSTACVIQKGDITQWFIDGSSDAIVNPANEVMLGGGGADGAIHDAAGPDLVQAC 123
           VRFK+S STACVIQKGDIT+WFIDGSSDAIVNPANEVMLGGGGADGAIH+AAGPDLVQAC
Sbjct: 61  VRFKVSPSTACVIQKGDITKWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDLVQAC 120

Query: 124 YSVPEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAK 183
           YSV EVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYN +SNPQALLRSAYRNSLAVAK
Sbjct: 121 YSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNVNSNPQALLRSAYRNSLAVAK 180

Query: 184 ENNIQYLAFPAISCGVYRYPLNEAATIALSTVREFSQGLKEVHFVLYSSDVYNVWLDKAN 243
           ENNIQY+AFPAISCGVYRYP +EAATIALSTV+EFS GLKEVHFVLYSSD+YNVWL+ AN
Sbjct: 181 ENNIQYIAFPAISCGVYRYPFDEAATIALSTVKEFSNGLKEVHFVLYSSDIYNVWLETAN 240

Query: 244 ELLKN 245
           ELLKN
Sbjct: 241 ELLKN 245

BLAST of Tan0017332 vs. NCBI nr
Match: KAG6600880.1 (hypothetical protein SDJN03_06113, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 408.7 bits (1049), Expect = 3.6e-110
Identity = 205/245 (83.67%), Postives = 222/245 (90.61%), Query Frame = 0

Query: 4   MTSAAWRNSVAVGSSIFRRVS----QTLTFSPNFTSSPHFRRSRARTMAVSMSNGSGNGV 63
           MTS  WRNSV +GSSIFRRVS    Q+ T S N + S   RRS A+ +A++MSNGS +GV
Sbjct: 1   MTSTVWRNSVVLGSSIFRRVSPSQLQSFTVSSNLSFSLPLRRSTAKILAMAMSNGSSSGV 60

Query: 64  VRFKISHSTACVIQKGDITQWFIDGSSDAIVNPANEVMLGGGGADGAIHDAAGPDLVQAC 123
           VRFK+S ST+CVIQKGDIT+WFIDGSSDAIVNPANEVMLGGGGADGAIH+AAGPDLVQAC
Sbjct: 61  VRFKVSPSTSCVIQKGDITKWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDLVQAC 120

Query: 124 YSVPEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAK 183
           YSV EVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYN SSNPQALLRSAYRNSLAVAK
Sbjct: 121 YSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNASSNPQALLRSAYRNSLAVAK 180

Query: 184 ENNIQYLAFPAISCGVYRYPLNEAATIALSTVREFSQGLKEVHFVLYSSDVYNVWLDKAN 243
           ENNIQY+AFPAISCGVYRYP +EAATIALSTV+EFS GLKEVHFVLYSSD+YNVWL+KAN
Sbjct: 181 ENNIQYIAFPAISCGVYRYPFDEAATIALSTVKEFSNGLKEVHFVLYSSDIYNVWLEKAN 240

Query: 244 ELLKN 245
           ELLKN
Sbjct: 241 ELLKN 245

BLAST of Tan0017332 vs. NCBI nr
Match: KAG7031514.1 (hypothetical protein SDJN02_05554 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 407.1 bits (1045), Expect = 1.0e-109
Identity = 204/245 (83.27%), Postives = 222/245 (90.61%), Query Frame = 0

Query: 4   MTSAAWRNSVAVGSSIFRRVS----QTLTFSPNFTSSPHFRRSRARTMAVSMSNGSGNGV 63
           MTS  WRNSV +GSSIFRRVS    Q+ T S N + S   R+S A+ +A++MSNGS +GV
Sbjct: 1   MTSTVWRNSVVLGSSIFRRVSPSQLQSFTVSSNLSFSLPLRQSTAKILAMAMSNGSSSGV 60

Query: 64  VRFKISHSTACVIQKGDITQWFIDGSSDAIVNPANEVMLGGGGADGAIHDAAGPDLVQAC 123
           VRFK+S ST+CVIQKGDIT+WFIDGSSDAIVNPANEVMLGGGGADGAIH+AAGPDLVQAC
Sbjct: 61  VRFKVSPSTSCVIQKGDITKWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDLVQAC 120

Query: 124 YSVPEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAK 183
           YSV EVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYN SSNPQALLRSAYRNSLAVAK
Sbjct: 121 YSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNASSNPQALLRSAYRNSLAVAK 180

Query: 184 ENNIQYLAFPAISCGVYRYPLNEAATIALSTVREFSQGLKEVHFVLYSSDVYNVWLDKAN 243
           ENNIQY+AFPAISCGVYRYP +EAATIALSTV+EFS GLKEVHFVLYSSD+YNVWL+KAN
Sbjct: 181 ENNIQYIAFPAISCGVYRYPFDEAATIALSTVKEFSNGLKEVHFVLYSSDIYNVWLEKAN 240

Query: 244 ELLKN 245
           ELLKN
Sbjct: 241 ELLKN 245

BLAST of Tan0017332 vs. ExPASy TrEMBL
Match: A0A6J1FQU9 (uncharacterized protein LOC111447945 OS=Cucurbita moschata OX=3662 GN=LOC111447945 PE=4 SV=1)

HSP 1 Score: 413.7 bits (1062), Expect = 5.4e-112
Identity = 208/245 (84.90%), Postives = 224/245 (91.43%), Query Frame = 0

Query: 4   MTSAAWRNSVAVGSSIFRRVS----QTLTFSPNFTSSPHFRRSRARTMAVSMSNGSGNGV 63
           MTS  WRNSVA+GSSIFRRVS    Q+ T S N + S   RRS A+ +A++MSNGSG+GV
Sbjct: 1   MTSTVWRNSVALGSSIFRRVSPSQLQSFTVSSNLSFSLPLRRSTAKILAMAMSNGSGSGV 60

Query: 64  VRFKISHSTACVIQKGDITQWFIDGSSDAIVNPANEVMLGGGGADGAIHDAAGPDLVQAC 123
           VRFK+S STACVIQKGDIT+WFIDGSSDAIVNPANEVMLGGGGADGAIH+AAGPDLVQAC
Sbjct: 61  VRFKVSPSTACVIQKGDITKWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDLVQAC 120

Query: 124 YSVPEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAK 183
           YSV EVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYN SSNPQALLRSAYRNSLAVAK
Sbjct: 121 YSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNASSNPQALLRSAYRNSLAVAK 180

Query: 184 ENNIQYLAFPAISCGVYRYPLNEAATIALSTVREFSQGLKEVHFVLYSSDVYNVWLDKAN 243
           ENNIQY+AFPAISCGVYRYP +EAATIALSTV+EFS GLKEVHFVLYSSD+YNVWL+KAN
Sbjct: 181 ENNIQYIAFPAISCGVYRYPFDEAATIALSTVKEFSNGLKEVHFVLYSSDIYNVWLEKAN 240

Query: 244 ELLKN 245
           ELLKN
Sbjct: 241 ELLKN 245

BLAST of Tan0017332 vs. ExPASy TrEMBL
Match: A0A6J1JJ63 (uncharacterized protein LOC111484918 OS=Cucurbita maxima OX=3661 GN=LOC111484918 PE=4 SV=1)

HSP 1 Score: 411.4 bits (1056), Expect = 2.7e-111
Identity = 207/245 (84.49%), Postives = 223/245 (91.02%), Query Frame = 0

Query: 4   MTSAAWRNSVAVGSSIFRRVS----QTLTFSPNFTSSPHFRRSRARTMAVSMSNGSGNGV 63
           MTS  WRNSVAVGSSIFRRVS    Q+ T S N + S   RRS A+ +A++MSNGSG+GV
Sbjct: 1   MTSTVWRNSVAVGSSIFRRVSPSQLQSFTVSSNLSFSLPLRRSTAKILAMAMSNGSGSGV 60

Query: 64  VRFKISHSTACVIQKGDITQWFIDGSSDAIVNPANEVMLGGGGADGAIHDAAGPDLVQAC 123
           VRFK+S STACVIQKGDIT+WFIDGSSDAIVNPANEVMLGGGGADGAIH+AAGPDLVQAC
Sbjct: 61  VRFKVSPSTACVIQKGDITKWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDLVQAC 120

Query: 124 YSVPEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAK 183
           YSV EVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYN +SNPQALLRSAYRNSLAVAK
Sbjct: 121 YSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNVNSNPQALLRSAYRNSLAVAK 180

Query: 184 ENNIQYLAFPAISCGVYRYPLNEAATIALSTVREFSQGLKEVHFVLYSSDVYNVWLDKAN 243
           ENNIQY+AFPAISCGVYRYP +EAATIALSTV+EFS GLKEVHFVLYSSD+YNVWL+ AN
Sbjct: 181 ENNIQYIAFPAISCGVYRYPFDEAATIALSTVKEFSNGLKEVHFVLYSSDIYNVWLETAN 240

Query: 244 ELLKN 245
           ELLKN
Sbjct: 241 ELLKN 245

BLAST of Tan0017332 vs. ExPASy TrEMBL
Match: A0A1S3C4C8 (macro domain-containing protein VPA0103 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103496741 PE=4 SV=1)

HSP 1 Score: 396.4 bits (1017), Expect = 8.9e-107
Identity = 198/245 (80.82%), Postives = 217/245 (88.57%), Query Frame = 0

Query: 4   MTSAAWRNSVAVGSSIFRRVS----QTLTFSPNFTSSPHFRRSRARTMAVSMSNGSGNGV 63
           +   AW+NSV+VG S+ RRVS    Q+ T S N + SPHF RS  RT AVSM+N S +GV
Sbjct: 3   IAGTAWQNSVSVGISLLRRVSPLHFQSFTLSSNLSFSPHFPRSTPRTFAVSMANESRSGV 62

Query: 64  VRFKISHSTACVIQKGDITQWFIDGSSDAIVNPANEVMLGGGGADGAIHDAAGPDLVQAC 123
           V FK+S ST CVIQKGDIT+WFIDGSSDAIVNPANEVMLGGGGADGAIH+AAGPDLV+AC
Sbjct: 63  VGFKVSPSTDCVIQKGDITKWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDLVRAC 122

Query: 124 YSVPEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAK 183
           YSV EVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYN S NPQALLRSAYRNSLAVAK
Sbjct: 123 YSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNASRNPQALLRSAYRNSLAVAK 182

Query: 184 ENNIQYLAFPAISCGVYRYPLNEAATIALSTVREFSQGLKEVHFVLYSSDVYNVWLDKAN 243
           ENNIQY+AFPAISCGV+RYP +EAATIALST++EFSQGLKEVHFVLY+ D+YNVWLDKAN
Sbjct: 183 ENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLKEVHFVLYAPDIYNVWLDKAN 242

Query: 244 ELLKN 245
           ELLKN
Sbjct: 243 ELLKN 247

BLAST of Tan0017332 vs. ExPASy TrEMBL
Match: A0A6J1CYJ5 (uncharacterized protein LOC111015968 OS=Momordica charantia OX=3673 GN=LOC111015968 PE=4 SV=1)

HSP 1 Score: 392.9 bits (1008), Expect = 9.8e-106
Identity = 195/232 (84.05%), Postives = 215/232 (92.67%), Query Frame = 0

Query: 15  VGSSIFRRVSQ--TLTFSPNFTSSPHFRRSRARTMAVSMSNGSGNGVVRFKISHSTACVI 74
           +  S+F RVSQ  +LTFSPNF+SS HF+RSRAR +A SMSNG+GNGVVRFKIS ST  VI
Sbjct: 6   IAGSLFSRVSQPKSLTFSPNFSSSSHFQRSRARILAASMSNGAGNGVVRFKISPSTDFVI 65

Query: 75  QKGDITQWFIDGSSDAIVNPANEVMLGGGGADGAIHDAAGPDLVQACYSVPEVQPGIRCP 134
           QKGDIT WFIDGSSDAIVNPANEVMLGGGGADGAIH+AAGPDLVQACY+VPEVQPGIRCP
Sbjct: 66  QKGDITHWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDLVQACYAVPEVQPGIRCP 125

Query: 135 TGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKENNIQYLAFPAIS 194
           TGEARITPGF LPASHVIHTVGPIY  SSNPQALLRSAYRNSLAVAKENNIQY+AFPAIS
Sbjct: 126 TGEARITPGFGLPASHVIHTVGPIYYKSSNPQALLRSAYRNSLAVAKENNIQYIAFPAIS 185

Query: 195 CGVYRYPLNEAATIALSTVREFSQGLKEVHFVLYSSDVYNVWLDKANELLKN 245
           CGV+RYP +EAATIA+ST++EFS+ LKEVHFVL+SSD+Y+VWL+KANELLKN
Sbjct: 186 CGVFRYPYDEAATIAISTIKEFSRALKEVHFVLFSSDIYDVWLNKANELLKN 237

BLAST of Tan0017332 vs. ExPASy TrEMBL
Match: A0A1S3C4G0 (macro domain-containing protein VPA0103 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103496741 PE=4 SV=1)

HSP 1 Score: 390.2 bits (1001), Expect = 6.4e-105
Identity = 198/250 (79.20%), Postives = 217/250 (86.80%), Query Frame = 0

Query: 4   MTSAAWRNSVAVGSSIFRRVS----QTLTFSPNFTSSPHFRRSRARTMAVSMSNGSGNGV 63
           +   AW+NSV+VG S+ RRVS    Q+ T S N + SPHF RS  RT AVSM+N S +GV
Sbjct: 3   IAGTAWQNSVSVGISLLRRVSPLHFQSFTLSSNLSFSPHFPRSTPRTFAVSMANESRSGV 62

Query: 64  VRFKISHSTACVIQKGDITQWFIDGSSDAIVNPANEVMLGGGGADGAIHDAAGPDLVQAC 123
           V FK+S ST CVIQKGDIT+WFIDGSSDAIVNPANEVMLGGGGADGAIH+AAGPDLV+AC
Sbjct: 63  VGFKVSPSTDCVIQKGDITKWFIDGSSDAIVNPANEVMLGGGGADGAIHNAAGPDLVRAC 122

Query: 124 YSVPEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAK 183
           YSV EVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYN S NPQALLRSAYRNSLAVAK
Sbjct: 123 YSVQEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNASRNPQALLRSAYRNSLAVAK 182

Query: 184 ENNIQYLAFPAISCGVYRYPLNEAATIALSTVREFSQGLKE-----VHFVLYSSDVYNVW 243
           ENNIQY+AFPAISCGV+RYP +EAATIALST++EFSQGLKE     VHFVLY+ D+YNVW
Sbjct: 183 ENNIQYIAFPAISCGVFRYPYDEAATIALSTIKEFSQGLKEVRIFLVHFVLYAPDIYNVW 242

Query: 244 LDKANELLKN 245
           LDKANELLKN
Sbjct: 243 LDKANELLKN 252

BLAST of Tan0017332 vs. TAIR 10
Match: AT2G40600.1 (appr-1-p processing enzyme family protein )

HSP 1 Score: 270.0 bits (689), Expect = 1.8e-72
Identity = 137/242 (56.61%), Postives = 175/242 (72.31%), Query Frame = 0

Query: 2   ILMTSAAWRNSVAVGSSIFRRVSQTLTFSPNFTSSPHFRRSRARTMAVSMSNGSGNGVVR 61
           +L +S+      +  SS    +S  +       SS     SR  T++ SM++G    V  
Sbjct: 16  LLHSSSTILRPTSSSSSRASLISFAVNNFHTIASSSSTLSSRLTTVSSSMASGDEGAV-- 75

Query: 62  FKISHSTACVIQKGDITQWFIDGSSDAIVNPANEVMLGGGGADGAIHDAAGPDLVQACYS 121
           F +S S+   I KGDIT+W +D SSDAIVNPANE MLGGGGADGAIH AAGP L  ACY 
Sbjct: 76  FNLSDSSLLKILKGDITKWSVDSSSDAIVNPANERMLGGGGADGAIHRAAGPQLRAACYE 135

Query: 122 VPEVQPGIRCPTGEARITPGFRLPASHVIHTVGPIYNTSSNPQALLRSAYRNSLAVAKEN 181
           VPEV+PG+RCPTGEARITPGF LPAS VIHTVGPIY++  NPQ  L ++Y+NSL VAKEN
Sbjct: 136 VPEVRPGVRCPTGEARITPGFNLPASRVIHTVGPIYDSDVNPQESLTNSYKNSLRVAKEN 195

Query: 182 NIQYLAFPAISCGVYRYPLNEAATIALSTVREFSQGLKEVHFVLYSSDVYNVWLDKANEL 241
           NI+Y+AFPAISCG+Y YP +EAA I +ST+++FS   KEVHFVL++ D+++VW++KA E+
Sbjct: 196 NIKYIAFPAISCGIYGYPFDEAAAIGISTIKQFSTDFKEVHFVLFADDIFSVWVNKAKEV 255

Query: 242 LK 244
           L+
Sbjct: 256 LQ 255

BLAST of Tan0017332 vs. TAIR 10
Match: AT1G69340.1 (appr-1-p processing enzyme family protein )

HSP 1 Score: 78.6 bits (192), Expect = 7.9e-15
Identity = 61/189 (32.28%), Postives = 91/189 (48.15%), Query Frame = 0

Query: 35  SSPHFRRSRARTMAVSMSNGSGNGVV-RFKISHSTACVIQKGDITQWFIDGSSDAIVNPA 94
           S+P +    A +   S +  SGNG+V +F + H     I       W ++   DA+VN  
Sbjct: 52  SNPRYANPLASS---SEAGSSGNGMVSKFPVDHEINSRIYLWRGEPWNLE--VDAVVNST 111

Query: 95  NEVMLGGGGADGAIHDAAGPDLVQACYSVPEVQPGIRCPTGEARITPGFRLPASHVIHTV 154
           NE +     + G +H AAGP L + C ++        C TG A++T  + LPA  VIHTV
Sbjct: 112 NENLDEAHSSPG-LHVAAGPGLAEQCATLG------GCRTGMAKVTNAYDLPARRVIHTV 171

Query: 155 GPIYNTSSNPQA--LLRSAYRNSLAVAKENNIQYLAFPAISCGVYRYPLNEAATIALSTV 214
           GP Y    +  A   L   YR+ L +  ++ +Q +A   I      YP   AA +A+ TV
Sbjct: 172 GPKYAVKYHTAAENALSHCYRSCLELLIDSGLQSIALGCIYTEAKNYPREPAAHVAIRTV 228

Query: 215 REFSQGLKE 221
           R F +  K+
Sbjct: 232 RRFLEKQKD 228

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q87JZ53.2e-3749.40Macro domain-containing protein VPA0103 OS=Vibrio parahaemolyticus serotype O3:K... [more]
Q8P5Z87.2e-3747.90Macro domain-containing protein XCC3184 OS=Xanthomonas campestris pv. campestris... [more]
Q8PHB62.1e-3646.71Macro domain-containing protein XAC3343 OS=Xanthomonas axonopodis pv. citri (str... [more]
Q9HXU72.8e-3350.31Macro domain-containing protein PA3693 OS=Pseudomonas aeruginosa (strain ATCC 15... [more]
Q8KAE44.8e-3349.70Macro domain-containing protein CT2219 OS=Chlorobaculum tepidum (strain ATCC 496... [more]
Match NameE-valueIdentityDescription
XP_038891619.11.6e-11385.43macro domain-containing protein VPA0103, partial [Benincasa hispida][more]
XP_022943121.11.1e-11184.90uncharacterized protein LOC111447945 [Cucurbita moschata] >XP_023529382.1 unchar... [more]
XP_022987338.15.5e-11184.49uncharacterized protein LOC111484918 [Cucurbita maxima][more]
KAG6600880.13.6e-11083.67hypothetical protein SDJN03_06113, partial [Cucurbita argyrosperma subsp. sorori... [more]
KAG7031514.11.0e-10983.27hypothetical protein SDJN02_05554 [Cucurbita argyrosperma subsp. argyrosperma][more]
Match NameE-valueIdentityDescription
A0A6J1FQU95.4e-11284.90uncharacterized protein LOC111447945 OS=Cucurbita moschata OX=3662 GN=LOC1114479... [more]
A0A6J1JJ632.7e-11184.49uncharacterized protein LOC111484918 OS=Cucurbita maxima OX=3661 GN=LOC111484918... [more]
A0A1S3C4C88.9e-10780.82macro domain-containing protein VPA0103 isoform X2 OS=Cucumis melo OX=3656 GN=LO... [more]
A0A6J1CYJ59.8e-10684.05uncharacterized protein LOC111015968 OS=Momordica charantia OX=3673 GN=LOC111015... [more]
A0A1S3C4G06.4e-10579.20macro domain-containing protein VPA0103 isoform X1 OS=Cucumis melo OX=3656 GN=LO... [more]
Match NameE-valueIdentityDescription
AT2G40600.11.8e-7256.61appr-1-p processing enzyme family protein [more]
AT1G69340.17.9e-1532.28appr-1-p processing enzyme family protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002589Macro domainSMARTSM00506YBR022w_8coord: 69..207
e-value: 3.5E-45
score: 166.1
IPR002589Macro domainPFAMPF01661Macrocoord: 90..207
e-value: 1.1E-36
score: 125.4
IPR002589Macro domainPROSITEPS51154MACROcoord: 57..226
score: 25.403543
IPR043472Macro domain-likeGENE3D3.40.220.10Leucine Aminopeptidase, subunit E, domain 1coord: 31..222
e-value: 1.4E-57
score: 197.0
IPR043472Macro domain-likeSUPERFAMILY52949Macro domain-likecoord: 69..218
NoneNo IPR availablePANTHERPTHR11106GANGLIOSIDE INDUCED DIFFERENTIATION ASSOCIATED PROTEIN 2-RELATEDcoord: 50..220
NoneNo IPR availablePANTHERPTHR11106:SF96MACRO DOMAIN-CONTAINING PROTEIN VPA0103-LIKE ISOFORM X1coord: 50..220
NoneNo IPR availableCDDcd02908Macro_OAADPr_deacetylasecoord: 71..214
e-value: 1.10895E-79
score: 234.321

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0017332.1Tan0017332.1mRNA
Tan0017332.2Tan0017332.2mRNA