Tan0012473 (gene) Snake gourd v1

Overview
NameTan0012473
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionheparan-alpha-glucosaminide N-acetyltransferase
LocationLG05: 10082063 .. 10091868 (-)
RNA-Seq ExpressionTan0012473
SyntenyTan0012473
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCCGATTCTCAACCGCTGCTGAAGAACCAACAGGAGTTGCCGGAGTCCAGTCGCAAAGCTCCGCGAGTTGTCTCACTCGACGTCTTTCGCGGCCTCAGCGTCTTTGTTAGTTTTCTCTTTCCCTCTCTGGATCTCGACTCACTATACACGCTGTGGACATCTCTTTGTTTGGCTTGGTTTTTATATGCTTCTGTTCTAACTACTGTTCATACGTAATGGTTTGCTTTTGTGCTTTTTGGTTGTGTTTGGCTATCGGAAAGAATGTGAGAGATGGCAAAGGATCTAATCGAAGCTGAATCTTACTCTTATGTTGAAACGAATGCAATTGATTAGTTTGGAGCACGCTGATTTTCTTTATCTGGAAGTGGCTATTGATCAATGTTATCTCTGTTAGTTCAGCGTTTGAGTTTTACTTTTCTAACGTATGGCATAAGATGACAATACGGTTTCGTCAGCTCTTCAGTTTAGCGAGTAAAATAAAATTAGTAACTACCAGCTAAATTCCTGATGATTGATTATGGAATGCTGTGCTGTGTGTATCAGATGATGATGTTCGTGGACTACGGTGGCTCGTTTTTGCCAATTATCGCTCATTCGTCATGGAATGGACTTCATTTGGCTGATTTTGTAATGCCTTGGTTTCTATTTATTGCGGGAGCCTCGCTTGGACTTGTTTATAAAGTAAGTGAATGGATTCAACCTAGAATCTTCGGCCTCTTGTTGGCTCCTTCGATCAACTAGCTACTGCCCCTAAGTGTGGAAACACAATTCGTGTAGACAAATCATGGTTTCTCCTGTTTTTATAAAATTTACCGGGATTTTGTTTTGACATAAAGAATGGCTTAAAATAGGAAGTACAAAGTAAAGTGACCGCTACAAGGAATGCAGCATGCAGGGCTTTGTACCTTTTTCTCCTGGGTGTTCTTCTTCAAGGTAAAGTTAATTATTGCGGGTAACTCTTGCGCGAAACATTTTAATATTCATGTGAATTGACCTACTTCCTAAATACATTATGGTAAGTAATCGCATGTGAACTTAAGAGTGTCAAGTTGTACTTAGTCCGTAATGTACTGTGGTTCAAAATTGCTATATTGAATGCATACTATTAAGATGAATCATCATTTCTCCAAATTATTTCAGGTGGTTACTTTCATGGAATAACATCTTTGACATATGGTGTTGATATGGAAAGGATTAGGTGGCTTGGCATTTTACAGGTGTTGCAATTCGTCACTCCTTTTCTTTGTGCAATTGCTACAATAGTCTAAGTTTGTTCTAACCAGAGCACATAACGATAAGGTTTCTGTTACATAGCCGGATGAATGGATGAGGATGAGGATTTTGAACTATATGATGTTGTTCTGAATATTTGCGAGATAGCTCAAACGCCATCCAAGCTAGATTTTAAGCGGACAAAATGACCTAACATTAAATTTATAGGCGGATTTGGATTAAAGAACTTTTCTTTTTCCTTCTTCTTTTACATTTAACATCATGTTTTTTTATCTATTTTGATTCAATGTAGAGAATATCTGTTGGATACTTGATTACTGCACTATGTGAGATCTGGCTAACTCGTTGTACACGTGAAAGAGCTCAAAATACTAAGAGTTTCAGCTGGCATTGGTAAGGATATATTTCGCACCATGATTCATATTACATTTGCAGAATATCAAGAGGCTATTATGCAGATTACATCTGCATAATTTTATTCTCCTTGATTGTAAGCCCTTTTTGTAGTTGGCTTCCTTCTTTTGTGGGCTCTCTTTTTTAATGTCCTTATATATTCTCTCATTTTTTCTCAATGAAAGTTTGATTATTGATTAAAAAAATTATTCTTTTTTGAGGTATGAATAGTAATCCCTTTTCTACTCTTACTAGATACGAGCAGACCTCCACTTTATGTATTTATAGTGACATGATTTAGTTGCAAATTTTTCCAAGAGAGTACAACAGACTCAGATATCGAGTGAAAGTGAGAGTTTACTACTCCATTTGTTGGTGTTTTACCAAACTATGCTATTTAAGCCCTTTATCGAGTAATTGAAATAACCATTAACAATAAGAAGCTGCATTAATAAGAATGTATGAACTTCTCGATTTTATTATGTTTCTTCATGTTTAGCATTTTAATTTGCAGGTATATTATATTTTTTCTGTTGTCATTGTATATGGGACTGTCGTATGGTTTATATGTTCCAGATTGGGACTTTAAAATATCAGCCACGAGCTCTTCACTTCCATCAAATGGAACCCATGTTTACATGGTAAGGAAATACTATGTACCATGTTCTTGTCTTTTCTCTAGTCGCTTCATTTTTAATGTCCATTACTTTCTATATAAAAAAAAAAAAGCAGAGAAACCGTTCATGGTTTAGTGCGAAGAAGGTGAGATTATTATCAAAATGCTTGTGTACTTACATCTGGGATGATACCCATGCAAAGTTTGACTTCCATCCATGTACCAGTTGCTTCAGCCTCAAATTTATAGGCTGCCGAACTATGATAATCCTTTATTTGATGAAAGAAAGGAAGAGCTTTAGACCATAACTTACACTGAAAATAAGATCATTCTTGTTATAGGTGAATTGTTCTGTTCGAGGTGATCTGGGACCTGCTTGTAATTCTGTTGGGATGATTGATCGTTATGTTCTTGGTATTCACCATTTGTATACTAAACCTGTGTACAGAAATCTGAAGGTATCAGTGAAGTCGGTTTTTTGATCTTGTAATGTTCACCATCGTTGTGCATTTTGATCAGCCCTAATGAAGAATGTGGTTTGAAGGAGTGCAACATTTCCTCCGGTGGTCAAGTTCCTGAGACTTCACCTTCATGGTGTCATGCTCCTTTTGAACCTGAAGGTCTTTTAAGGTGCAGATCTATGCTTCTTCTCAGTAAACTGTTTTTATAAAGATTTATGCATTCATTTGGATGCATGCGGGGCTTTTTCAAGGGGTGTGTCTTTTGATGAGAATCTGAGATATTGGGGAAAAGTTGGTGAGGATAAAATTGCTTGCAACCGTTTGATCTGAGAGAAATACCATACATTTTTGGGCAAACATAACATTGAGATGTATGCGGTAAAGTTTAACCTCTTTCAAATCGTTGATCTAAAATTGTGAAGCCTTTTGATTATGCATGTCTTATGCTTCATTGAATTCTAGATAAGATTTTTTTCTCTTTAAAAATTGTTATTATTTCAGGATTTCTCATTTTCTTGTATCTTATGTATTTTATTTATTTTGTTTTCTTAGTTCAACAACATTTAGAGGGAGCAAGATTCGAACTCTTGACTTCTTGGTTATAAGTCTATACTTAATACTAATTGAGCTATGCTTTTGTTGACTATGTATTTATTTTGTATTTATTAATATATTTTTTTGTTTTATCAAATAAAAGAGACAGAACTTGTAACCAGACCTTCAATTCGTCCTTAACATTCTGTCTGCACAGTGTTGAACTAAGTTTTGAATTTTAAAATCATAATTACCTAATGAAATTATGTAATTTTTTATTTTTGTTTTTTGCATATCCATGCTCTATAGTACAGCTAAAGACCAATAAGCCACAATCACTTTCTCTTTTGATTCATCCTTGATCGATTTTCTTTCTGTTTGTTTTCATTTTCCCATCTTTGAAATTTTTTTCGGTGTTTACTTCCCTCTGCTCGGTTGTTTACATAAATAAAGAACACAATGATTGAGGTTTAGCTACACCATTCACTGTTATGTTCATTCCTTTCATTCGTTATATTATTTCACAAGAGTGATTTTTTCGCTGTTGCAAATGTTTGCAGCTCTTTAACAGCTTCAGTGGCATGCATAATAGGACTTCAGTATGGTCACATTCTTGCCAATGTACAGGTGAGATTGAGCAATAACTTAGCATATTACTCATTACTCTGATTAGGCGGTTCAAATTTCGCACAATTGTCTTCTCTCAATGAAAAATTTAGTATAGTGGACATCTTGTTTCTGATATAAATACTCCTTACATAGTTTATGCATCTAACGTAAGCACTTCTTCCATCCTTTTGCTGATTTTCTTTGATTTCCTCGTTTCAGTCAAATATTGTTTACTAATTGCATTGCTTCAATATTTTCCAGGATCACAAAAGTCGCACCAATAGCTGGTTCTCACTCTCGCTTAAGATTTTGGCCCTCGGAATATTCCTCGTCTTTATAGGTATAAACTTGGTGCTGAAAACGTATCGATTATATCAACTAGGCTAACACTCTGGGACCGACTGTTGATTTATTCTTATCTCCGATTTTTTCCGCAAAGTTATGTTATGTATAACCTTCTACTATATCTGATTTGAAATTGATTCTTGTTTTGAAATTGTCCTTCAGGTATCCCTGTAAATAAGTCCCTCTACACAGTCAGCTACATGTTAATTACTTCAGCATCAGCAGGAATTATCTTTTGTGCTCTATATATCTTGGTGAGTTCATCAATTTAGATTTTGGATTGAAGATGAAACTTTCGTAACGGAAGTAAGAATTTTGAATGAACATCAACCTTTGTTTCTTTCCGTGTTATTCTTCAAACTGAATCATAATAATTATAATATCAGACGCGGCATCGTATAAAGAACAGCTTACCACCTTTTTTCCTCGAGTTAAATTCCATCCATGCATTTATTCTTTTTTAACCAAACTACATGGACAACTGAGGTCTGAAGGTCTCTTAATTTCTATAGGTGGATGTCCACGGCTATCGACGCTTGACATGTATTCTGGAGTGGATGGGGAAGCATGCTTTGAGTATTTATGTTTTAGTAATCTCTAACATACTCGTCATTGGGATCCAAGGATTCTACTGGAAATCTCCTAAAAATAACATTGTAAGCACTAAACAATGGAGCTATTTTTTGGTGGGAACTGGGAAGTCTATTACTTCTTACAATAAGCCCGATTACTTTGAGATTCATTTCAAACTTCCTGTGCTTGAAATTTTACAGGTGCACTGGATTGTAAGTCGTGTCAAAGCTTAAAGTTGAAATAATTCGAGTACCTGTATAGAGTATTTAGTATTTGGAGCTGCTAACTTGCTGGGGTTAATTCACTTATCAGTAATCATAGTGCTCGAGATCTGTCGACCGTATTCGAAATCACCCTGCAAATATTAAGCAAAAAGGCCGGTAAGAAGATAACAGAGCTTTTGCAAGTAAGATCATCAATTGATATGTTAAATGTCCCAAAGTTTTTAGTGTAGCATGTATGTATGTGTCTAAGATTTCAGACATTTGCTTGACACCGAAATTGTAATTGTTGTATTCTATGATTGCAACCTCTGGAATGAAATTTTCTACTTTGGAGCTTGAACATATGATGTGTGTTCATATGCTTATTATTAATGAAATTTTGTCTGTATTTGTGAGGAGAAAAGTTCATTTTTTATTATGACTAAGTTACACTTGCATCTAATCTTATTGTTTAAATTTCGTGGAATTGATCTAAAACTTCCTTTGGTAGAAAATTTATGGTATGTTGTTAGGTTCTTTTATTCCTTTTAGACTTAGTTTTGGATGTTTTGATATTCATCCAAAATAATGAAGTATTTTAAAAGATGATTTGTAAATTTATAGCCTGGACATTTGAATACGTGTGCATGATGAGTGGCAGCTGGAGGCGCATGATCTGACATCAGTCCAGATGGCAGCGTCTCCATTTTCCATCACATACTGTTCCTAAAACGTGGCTCACTTTTGCCCTAATGTCACCCAACATCCAAGCCGACTAAGCGTCTTCCAAATTTGTTTTTTTCCCCTCGTCACTTTTGTCCTAAACATCCCAAATTTGTTTTTTCCTATTTCAAAATTTTTAATTATTAAGACTAAGAGTCCTATCTGAACAACCCAAAAATTATTTTGCCTTCTGATTTCTTTCTTCAGCCCGAAAGAGCACAATTTGGGTGGGTGCAGTTCACTTGGATACTGCAAAACAAAATTTAGTTTTTTTTTTTTTTTAAATTTTCTGTAGTGTTGAATTGCACAGGACACGGCTGAACCGAGTTCGAAATGATTTTCCCATTTCCTAATAAAACAAGTCCATAATTTTCACTTTTAAGTTTTACCTTGTTACTAAACTTGACCACAAAATGTCTTTTAATTCTTTTGAACTCACTCGTTGCATGACAACCTATGTGCATGAAACAAATTCTTGTGCGAGAAAAAGAAAAGAACACAATGCCTTCTTCTTTCAGTAACTTAGTAAGTATCAAATTTTATTAAGGCATGTTTGGGGCACAAATTATAATAATCTGTGTTATTATAATCTATGAAATACTGTGCTCCTCAAACACAAATTATAATAACTTCAGACTATACTACTTTGTATTTATTTGAACTCAAAAGCTGTTATATTTGTATAAAGCAGATAAAATCTGACCACAAAAACTGTGGTTTAAGAATCCAATGCACCGAGGGAGGGAAGGGGGGTTTTTGTTTTTTTTTCCCTTTCTGAAAAGGAGTGAATAGAAAACAGCTCCCTTGGCTAGTATGTGTCTTTTTAATCGTGCATGAGAAAAAAATTACTATTCTGATTGAGTTTCTTTGTTTACTATATCCTCTCTCTAAAGCACATACCCACAAGACAACGAAATAGCCGTTAGTGTCTCTGTAGATGAATGACAGAGAGACCCTTTTTTTAAGCTTTATTGAAGAGACTAAACCTCTTCAAGCTCTTTATTATATCTTGTTCATTAGTTTTTTTTTCTAGAAATACCAAATCATTCAGAGCTATTAGAAGCTATATATAGAGAGAAAGAGTTCAATTTGCAGGTGCTTGAAGCTTTGGAAAAGCAATTGACAATGTCTTCTAAGGAAGTTATTGAAGAGGGAGAGAGTCAGCAGTCTAAGGACTATGTGGATCCGCCACCAGCTCCATTCATGGGGTTGTCCGAGCTGAAGCTATGGTCCTTTTACAGAGCTGTAATTGCAGAGTTCATGGCCACACTCCTTTTCCTGTACATCACCATCGCCACTGTCATTGGGAACAATGCACAAACAGGACCCTGTTCTGGCGTTGGCCTCCTCGGCATAGCATGGTCCTTCGGCGCCATGATCTTCGTCCTTGTTTACTGTACTGCTGGTATTTCAGGTACATCCTAATCCTAGTTGAATAGCTAAACTGTACCCTTTTTTTTCAACAAAATGTGAACAGTGCTACGTTCGCACTGAATTAGTGATAGGTGAGTTATATGTTAAAAACGGAAACATAGATAAAGAAAGAGTCACAGAAACATAGAGATTTAGGTGATCTACTAAAGTTACATTAGCTACCTTCACGGGCAGGGGAAGAGAATTTTATCAACGGGAGGGAAACAGAATATAGAATACGGTTTAATTTTTTAGATGGGAGTTTAGGGGTTATATATGGCAGTATGACACCCTTCTAAACCAAAATATAACATCGGTCCAATAAGCTCAAAAATAAACTTGATGTCTATATGACTTAAGTTTCAAATCATATGACGAATTGAAGTCATCTTCACATTATAAGTCATCCTGAGTATGAACTAAGAATGTTACTCTAAATCACAATTGAGTTATCGTATATTATATAATTTTTGTTAATTATATTTTGCCTGGAAACACACCAGAGATCATTCATACCTTTTGAATAGTTGGCCGGGGCCTCTATCTGACCAAATATTATGTTTTTATAAGCAGGAGAAGAATTCAGATAGAAGTAATTTTTTTCCATCAAAAAATAAATAGCCGTTAAATTTGTAGGGATGGTCGAAGGTTTCAAAGGATAATTTTTTTTTTCCCAACTTTTGATCTTAGAGAGGGGAGGGAAGAAAAGTAAACATGGGTTAAAAACTTGATAGGACAGAAGAAGAAATGTTGCCGGTAGTTTCTTATCATTTATAGAATTAAAAAATTATAATTTGAATCTCTATAATTTCGAACTTACATCCATGATCTTACAAGTAAATTATTTAGGGAGAGGTCTGATATTGTCTTAAATTATATTTGTTATAATAAGTTATAATATCAGTATTATTATAACCGGTGAAATACTCTGCACCCCCAAACACAAACTATTATAATCTCAAACTATACTACTCTGCATTTTAAATAATATTATATGATTTCACAAATTATAATAATCCACAGACTATTATAACTCTTTGTTATTATAACTCACTCAACACCCTAAACAATCCTAAGTTACTCTAAACAATCCTAAGTTATTATAATATGTGTAATCATATAATATTATTTAAAATGCAGAGTAGTATAGTATGTAGTTATAATAGTCTGTGTTTGAGGTGCAAAGTATTGTATGAGTTATAATAACACAGATTATTATAACCTGTACCCCAAACAATTGTTCTATGTTTAATTTTTTTGAAAACATGTGGCTTGAAAATGGACAATACTCGACAAGGGCATGATTAGAAGCTAAATCACATGGCTGTTTTTTTTTTTTTTAAAAAAAAAACATAATGAATCACATAGCTGTTAGTGTTGAATTGTATATTAAACTTTTTTAATTCATATTCTTGGTAAATCTCATTCTCACCCACTAAAAAAACCTCACAACCTTTCATGGTGGTTGATTTCCAGGTGGCCATATCAATCCCGCCGTGACGTTCGGGCTGCTGCTGGCCAGAAGGGTGTCCCTGATCCGCGCGGTGGCTTACATAGCGGCGCAGTGTCTGGGAGCCATTGCCGGCGTTGCATTGGTTGAATCTTTCATGAGGCACGCCTACAACGCCAACGGCGGCGGAGCCAACCTCGTCGCCGCCGGCTTCAGCAAAGGCACCGCCCTCGGGGCCGAGATCATTGGCACTTTTCTGCTGGTCTATACTGTGTTCTCTGCCACTGACCCAAAGAGAAACGCCCGCGACTCCCATGTTCCTGTAAGTCTCTTTTCTCCATATTTTTTTATTTCTTCATAATTTTTTTTTGGATGTGTTTCTTTCTAAAACATATTTGAATTTTTCGCAAATTAGTTAAAATTGAGTTATAAAAATTATGGTATTTTAAAAAGGTTTTAAATGGACAGAAAATTAAAGAAACAAGTAGTATTATAATCTTGATTTTGGAGAAAATAACTAATTTTTTGTGTTTGAGAAATAAAAGCATTTGATAATTGTTTCTATTTTTCATTTCTCATCCATAAGAAGAAACCTGTAATGTTTCTTTTCTTGTTTTTAATTTTTGAGAAATAAAAGCATTTGATGAATGTTTATGCTCTTCATTTTTTCTTTAGAAACCATAAAATCAAGTTTCTTTCTGATCCCTATTTTCAATTTTTTTTTTTCCTAAAGATAAACAAAAACATAATTTGATTTTATTAGTCTTGACTTATAAAAAGCGCAATAAATATTTAAAAATGTATAACATAAATGGGAGTTTCAGGTTTTGGCTCCATTGCCCATCGGACTTGCCGTCTTTGTGGTGCATCTGGCCACCATTCCAATTACAGGCACCGGCATCAACCCTGCTCGGAGCTTAGGCGCTGCGGTCATGTTCAACAACACAAGAGTCTGGAGCCACCATGTAAGCAAAGAAAACAATGTTGTCCTGAAAAAAAGGATATAATTTTTACCGTGGGAACTATTTTAACAGCTTAATTATTGATTGTTTCAGTTAGAAAAAAAAAAAAAAAGTTTATCTGTCGATGGAATGTGAGGTTAATTGTGTGTGTGTGTGTTGTTTGTGCAGTGGATTTTCTGGGTTGGACCATTTGTTGGGGCGATGGCAGCGGCGGTTTACCATCAATACGTTCTTAGAGCGGCGGCCGTGAAGGCACTTGTATCTTTCCGCAGCAACCGATCCACTTGA

mRNA sequence

ATGGCCGATTCTCAACCGCTGCTGAAGAACCAACAGGAGTTGCCGGAGTCCAGTCGCAAAGCTCCGCGAGTTGTCTCACTCGACGTCTTTCGCGGCCTCAGCGTCTTTATGATGATGTTCGTGGACTACGGTGGCTCGTTTTTGCCAATTATCGCTCATTCGTCATGGAATGGACTTCATTTGGCTGATTTTGTAATGCCTTGGTTTCTATTTATTGCGGGAGCCTCGCTTGGACTTGTTTATAAAGAAGTACAAAGTAAAGTGACCGCTACAAGGAATGCAGCATGCAGGGCTTTGTACCTTTTTCTCCTGGGTGTTCTTCTTCAAGGTGGTTACTTTCATGGAATAACATCTTTGACATATGGTGTTGATATGGAAAGGATTAGGTGGCTTGGCATTTTACAGAGAATATCTGTTGGATACTTGATTACTGCACTATGTGAGATCTGGCTAACTCGTTGTACACGTGAAAGAGCTCAAAATACTAAGAGTTTCAGCTGGCATTGGTATATTATATTTTTTCTGTTGTCATTGTATATGGGACTGTCGTATGGTTTATATGTTCCAGATTGGGACTTTAAAATATCAGCCACGAGCTCTTCACTTCCATCAAATGGAACCCATGTTTACATGGTGAATTGTTCTGTTCGAGGTGATCTGGGACCTGCTTGTAATTCTGTTGGGATGATTGATCGTTATGTTCTTGGTATTCACCATTTGTATACTAAACCTGTGTACAGAAATCTGAAGGAGTGCAACATTTCCTCCGGTGGTCAAGTTCCTGAGACTTCACCTTCATGGTGTCATGCTCCTTTTGAACCTGAAGGTCTTTTAAGCTCTTTAACAGCTTCAGTGGCATGCATAATAGGACTTCAGTATGGTCACATTCTTGCCAATGTACAGGATCACAAAAGTCGCACCAATAGCTGGTTCTCACTCTCGCTTAAGATTTTGGCCCTCGGAATATTCCTCGTCTTTATAGGTATCCCTGTAAATAAGTCCCTCTACACAGTCAGCTACATGTTAATTACTTCAGCATCAGCAGGAATTATCTTTTGTGCTCTATATATCTTGACGCGGCATCGTATAAAGAACAGCTTACCACCTTTTTTCCTCGAGTCTGAAGGTCTCTTAATTTCTATAGGTGGATGTCCACGGCTATCGACGCTTGACATGTATTCTGGAGTGGATGGGGAAGCATGCTTTGAATCTGTCGACCGTATTCGAAATCACCCTGCAAATATTAAGCAAAAAGGCCGTCAGCAGTCTAAGGACTATGTGGATCCGCCACCAGCTCCATTCATGGGGTTGTCCGAGCTGAAGCTATGGTCCTTTTACAGAGCTGTAATTGCAGAGTTCATGGCCACACTCCTTTTCCTGTACATCACCATCGCCACTGTCATTGGGAACAATGCACAAACAGGACCCTGTTCTGGCGTTGGCCTCCTCGGCATAGCATGGTCCTTCGGCGCCATGATCTTCGTCCTTGTTTACTGTACTGCTGGTATTTCAGGTGGCCATATCAATCCCGCCGTGACGTTCGGGCTGCTGCTGGCCAGAAGGGTGTCCCTGATCCGCGCGGTGGCTTACATAGCGGCGCAGTGTCTGGGAGCCATTGCCGGCGTTGCATTGGTTGAATCTTTCATGAGGCACGCCTACAACGCCAACGGCGGCGGAGCCAACCTCGTCGCCGCCGGCTTCAGCAAAGGCACCGCCCTCGGGGCCGAGATCATTGGCACTTTTCTGCTGGTCTATACTGTGTTCTCTGCCACTGACCCAAAGAGAAACGCCCGCGACTCCCATGTTCCTGTTTTGGCTCCATTGCCCATCGGACTTGCCGTCTTTGTGGTGCATCTGGCCACCATTCCAATTACAGGCACCGGCATCAACCCTGCTCGGAGCTTAGGCGCTGCGGTCATGTTCAACAACACAAGAGTCTGGAGCCACCATTGGATTTTCTGGGTTGGACCATTTGTTGGGGCGATGGCAGCGGCGGTTTACCATCAATACGTTCTTAGAGCGGCGGCCGTGAAGGCACTTGTATCTTTCCGCAGCAACCGATCCACTTGA

Coding sequence (CDS)

ATGGCCGATTCTCAACCGCTGCTGAAGAACCAACAGGAGTTGCCGGAGTCCAGTCGCAAAGCTCCGCGAGTTGTCTCACTCGACGTCTTTCGCGGCCTCAGCGTCTTTATGATGATGTTCGTGGACTACGGTGGCTCGTTTTTGCCAATTATCGCTCATTCGTCATGGAATGGACTTCATTTGGCTGATTTTGTAATGCCTTGGTTTCTATTTATTGCGGGAGCCTCGCTTGGACTTGTTTATAAAGAAGTACAAAGTAAAGTGACCGCTACAAGGAATGCAGCATGCAGGGCTTTGTACCTTTTTCTCCTGGGTGTTCTTCTTCAAGGTGGTTACTTTCATGGAATAACATCTTTGACATATGGTGTTGATATGGAAAGGATTAGGTGGCTTGGCATTTTACAGAGAATATCTGTTGGATACTTGATTACTGCACTATGTGAGATCTGGCTAACTCGTTGTACACGTGAAAGAGCTCAAAATACTAAGAGTTTCAGCTGGCATTGGTATATTATATTTTTTCTGTTGTCATTGTATATGGGACTGTCGTATGGTTTATATGTTCCAGATTGGGACTTTAAAATATCAGCCACGAGCTCTTCACTTCCATCAAATGGAACCCATGTTTACATGGTGAATTGTTCTGTTCGAGGTGATCTGGGACCTGCTTGTAATTCTGTTGGGATGATTGATCGTTATGTTCTTGGTATTCACCATTTGTATACTAAACCTGTGTACAGAAATCTGAAGGAGTGCAACATTTCCTCCGGTGGTCAAGTTCCTGAGACTTCACCTTCATGGTGTCATGCTCCTTTTGAACCTGAAGGTCTTTTAAGCTCTTTAACAGCTTCAGTGGCATGCATAATAGGACTTCAGTATGGTCACATTCTTGCCAATGTACAGGATCACAAAAGTCGCACCAATAGCTGGTTCTCACTCTCGCTTAAGATTTTGGCCCTCGGAATATTCCTCGTCTTTATAGGTATCCCTGTAAATAAGTCCCTCTACACAGTCAGCTACATGTTAATTACTTCAGCATCAGCAGGAATTATCTTTTGTGCTCTATATATCTTGACGCGGCATCGTATAAAGAACAGCTTACCACCTTTTTTCCTCGAGTCTGAAGGTCTCTTAATTTCTATAGGTGGATGTCCACGGCTATCGACGCTTGACATGTATTCTGGAGTGGATGGGGAAGCATGCTTTGAATCTGTCGACCGTATTCGAAATCACCCTGCAAATATTAAGCAAAAAGGCCGTCAGCAGTCTAAGGACTATGTGGATCCGCCACCAGCTCCATTCATGGGGTTGTCCGAGCTGAAGCTATGGTCCTTTTACAGAGCTGTAATTGCAGAGTTCATGGCCACACTCCTTTTCCTGTACATCACCATCGCCACTGTCATTGGGAACAATGCACAAACAGGACCCTGTTCTGGCGTTGGCCTCCTCGGCATAGCATGGTCCTTCGGCGCCATGATCTTCGTCCTTGTTTACTGTACTGCTGGTATTTCAGGTGGCCATATCAATCCCGCCGTGACGTTCGGGCTGCTGCTGGCCAGAAGGGTGTCCCTGATCCGCGCGGTGGCTTACATAGCGGCGCAGTGTCTGGGAGCCATTGCCGGCGTTGCATTGGTTGAATCTTTCATGAGGCACGCCTACAACGCCAACGGCGGCGGAGCCAACCTCGTCGCCGCCGGCTTCAGCAAAGGCACCGCCCTCGGGGCCGAGATCATTGGCACTTTTCTGCTGGTCTATACTGTGTTCTCTGCCACTGACCCAAAGAGAAACGCCCGCGACTCCCATGTTCCTGTTTTGGCTCCATTGCCCATCGGACTTGCCGTCTTTGTGGTGCATCTGGCCACCATTCCAATTACAGGCACCGGCATCAACCCTGCTCGGAGCTTAGGCGCTGCGGTCATGTTCAACAACACAAGAGTCTGGAGCCACCATTGGATTTTCTGGGTTGGACCATTTGTTGGGGCGATGGCAGCGGCGGTTTACCATCAATACGTTCTTAGAGCGGCGGCCGTGAAGGCACTTGTATCTTTCCGCAGCAACCGATCCACTTGA

Protein sequence

MADSQPLLKNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLHLADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRALYLFLLGVLLQGGYFHGITSLTYGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYMGLSYGLYVPDWDFKISATSSSLPSNGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHLYTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANVQDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILTRHRIKNSLPPFFLESEGLLISIGGCPRLSTLDMYSGVDGEACFESVDRIRNHPANIKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSNRST
Homology
BLAST of Tan0012473 vs. ExPASy Swiss-Prot
Match: Q9ZVX8 (Probable aquaporin PIP2-8 OS=Arabidopsis thaliana OX=3702 GN=PIP2-8 PE=1 SV=1)

HSP 1 Score: 430.3 bits (1105), Expect = 4.2e-119
Identity = 214/272 (78.68%), Postives = 240/272 (88.24%), Query Frame = 0

Query: 415 IKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQT 474
           + ++GR   KDYVDPPPAP + ++ELKLWSFYRA+IAEF+ATLLFLY+T+ATVIG+  QT
Sbjct: 5   VSEEGR-HGKDYVDPPPAPLLDMAELKLWSFYRAIIAEFIATLLFLYVTVATVIGHKNQT 64

Query: 475 GPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQ 534
           GPC GVGLLGIAW+FG MIFVLVYCTAGISGGHINPAVTFGL LAR+VSL RAVAY+ AQ
Sbjct: 65  GPCGGVGLLGIAWAFGGMIFVLVYCTAGISGGHINPAVTFGLFLARKVSLPRAVAYMVAQ 124

Query: 535 CLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPK 594
           CLGAI GV LV++FM   Y   GGGAN VA G+S GTALGAEIIGTF+LVYTVFSATDPK
Sbjct: 125 CLGAICGVGLVKAFMMTPYKRLGGGANTVADGYSTGTALGAEIIGTFVLVYTVFSATDPK 184

Query: 595 RNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFW 654
           R+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS GAAV++NN + W  HWIFW
Sbjct: 185 RSARDSHVPVLAPLPIGFAVFMVHLATIPITGTGINPARSFGAAVIYNNEKAWDDHWIFW 244

Query: 655 VGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN 687
           VGPFVGA+AAA YHQY+LRAAA+KAL SFRSN
Sbjct: 245 VGPFVGALAAAAYHQYILRAAAIKALASFRSN 275

BLAST of Tan0012473 vs. ExPASy Swiss-Prot
Match: P93004 (Aquaporin PIP2-7 OS=Arabidopsis thaliana OX=3702 GN=PIP2-7 PE=1 SV=2)

HSP 1 Score: 422.5 bits (1085), Expect = 8.9e-117
Identity = 207/273 (75.82%), Postives = 239/273 (87.55%), Query Frame = 0

Query: 415 IKQKGR-QQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQ 474
           + ++G+    KDYVDPPPAP + + ELK WSFYRA+IAEF+ATLLFLY+T+ATVIG+  Q
Sbjct: 5   VSEEGKTHHGKDYVDPPPAPLLDMGELKSWSFYRALIAEFIATLLFLYVTVATVIGHKKQ 64

Query: 475 TGPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAA 534
           TGPC GVGLLGIAW+FG MIFVLVYCTAGISGGHINPAVTFGL LAR+VSL+RA+ Y+ A
Sbjct: 65  TGPCDGVGLLGIAWAFGGMIFVLVYCTAGISGGHINPAVTFGLFLARKVSLVRALGYMIA 124

Query: 535 QCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDP 594
           QCLGAI GV  V++FM+  YN  GGGAN VA G+SKGTALGAEIIGTF+LVYTVFSATDP
Sbjct: 125 QCLGAICGVGFVKAFMKTPYNTLGGGANTVADGYSKGTALGAEIIGTFVLVYTVFSATDP 184

Query: 595 KRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIF 654
           KR+ARDSH+PVLAPLPIG AVF+VHLATIPITGTGINPARS GAAV++NN + W   WIF
Sbjct: 185 KRSARDSHIPVLAPLPIGFAVFMVHLATIPITGTGINPARSFGAAVIYNNEKAWDDQWIF 244

Query: 655 WVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN 687
           WVGPF+GA+AAA YHQY+LRA+A+KAL SFRSN
Sbjct: 245 WVGPFLGALAAAAYHQYILRASAIKALGSFRSN 277

BLAST of Tan0012473 vs. ExPASy Swiss-Prot
Match: P42767 (Aquaporin PIP-type OS=Atriplex canescens OX=35922 PE=2 SV=1)

HSP 1 Score: 421.8 bits (1083), Expect = 1.5e-116
Identity = 211/274 (77.01%), Postives = 238/274 (86.86%), Query Frame = 0

Query: 415 IKQKGR--QQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNA 474
           I ++G+  Q  KDYVDPPPAPF  + ELKLWSF+RA IAEF+ATLLFLYIT+ATVIG   
Sbjct: 6   ISEEGQVHQHGKDYVDPPPAPFFDMGELKLWSFWRAAIAEFIATLLFLYITVATVIGYKK 65

Query: 475 QTGPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIA 534
           +T PC+ VGLLGIAWSFG MIFVLVYCTAGISGGHINPAVTFGL LAR+VSL+RA+ Y+ 
Sbjct: 66  ETDPCASVGLLGIAWSFGGMIFVLVYCTAGISGGHINPAVTFGLFLARKVSLLRALVYMI 125

Query: 535 AQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATD 594
           AQC GAI GV LV++FM+  YN  GGGAN VA G++KGTA GAE+IGTF+LVYTVFSATD
Sbjct: 126 AQCAGAICGVGLVKAFMKGPYNQFGGGANSVALGYNKGTAFGAELIGTFVLVYTVFSATD 185

Query: 595 PKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWI 654
           PKR+ARDSHVP+LAPLPIG AVF+VHLATIPITGTGINPARS GAAV++N  RVW  HWI
Sbjct: 186 PKRSARDSHVPILAPLPIGFAVFMVHLATIPITGTGINPARSFGAAVIYNKKRVWDDHWI 245

Query: 655 FWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN 687
           FWVGPFVGA+AAA YHQYVLRAAA+KAL SFRSN
Sbjct: 246 FWVGPFVGALAAAAYHQYVLRAAAIKALGSFRSN 279

BLAST of Tan0012473 vs. ExPASy Swiss-Prot
Match: Q7XLR1 (Probable aquaporin PIP2-6 OS=Oryza sativa subsp. japonica OX=39947 GN=PIP2-6 PE=2 SV=2)

HSP 1 Score: 419.1 bits (1076), Expect = 9.8e-116
Identity = 207/267 (77.53%), Postives = 233/267 (87.27%), Query Frame = 0

Query: 424 KDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTG--PCSGVG 483
           KDY DPPPAP   + EL+LWSFYRA+IAEF+ATLLFLYIT+ATVIG   Q+    C GVG
Sbjct: 15  KDYTDPPPAPLFDVGELRLWSFYRALIAEFIATLLFLYITVATVIGYKVQSSADQCGGVG 74

Query: 484 LLGIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQCLGAIAG 543
            LGIAW+FG MIF+LVYCTAGISGGHINPAVTFGLLLAR+VS+IRAV YI AQCLG I G
Sbjct: 75  TLGIAWAFGGMIFILVYCTAGISGGHINPAVTFGLLLARKVSVIRAVMYIVAQCLGGIVG 134

Query: 544 VALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRNARDSH 603
           V +V+  M+H YNANGGGAN+VA+G+S GTALGAEIIGTF+LVYTVFSATDPKRNARDSH
Sbjct: 135 VGIVKGIMKHQYNANGGGANMVASGYSTGTALGAEIIGTFVLVYTVFSATDPKRNARDSH 194

Query: 604 VPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGPFVGA 663
           VPVLAPLPIG AVF+VHLATIPITGTGINPARS+GAAV++N  + W  HWIFW GPF+GA
Sbjct: 195 VPVLAPLPIGFAVFMVHLATIPITGTGINPARSIGAAVIYNQKKAWDDHWIFWAGPFIGA 254

Query: 664 MAAAVYHQYVLRAAAVKALVSFRSNRS 689
           +AAA YHQY+LRAAA+KAL SFRSN S
Sbjct: 255 LAAAAYHQYILRAAAIKALGSFRSNPS 281

BLAST of Tan0012473 vs. ExPASy Swiss-Prot
Match: Q8H5N9 (Probable aquaporin PIP2-1 OS=Oryza sativa subsp. japonica OX=39947 GN=PIP2-1 PE=2 SV=1)

HSP 1 Score: 401.7 bits (1031), Expect = 1.6e-110
Identity = 204/276 (73.91%), Postives = 228/276 (82.61%), Query Frame = 0

Query: 419 GRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQT---- 478
           G   +KDY DPPPAP +  +EL  WS YRAVIAEF+ATLLFLYIT+ATVIG   QT    
Sbjct: 14  GEFAAKDYTDPPPAPLIDAAELGSWSLYRAVIAEFIATLLFLYITVATVIGYKHQTDASA 73

Query: 479 ----GPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAY 538
                 C GVG+LGIAW+FG MIF+LVYCTAGISGGHINPAVTFGL LAR+VSL+RA+ Y
Sbjct: 74  SGADAACGGVGVLGIAWAFGGMIFILVYCTAGISGGHINPAVTFGLFLARKVSLVRAILY 133

Query: 539 IAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSA 598
           I AQCLGAI GV LV++F    +N  GGGAN +AAG+SKGT L AEIIGTF+LVYTVFSA
Sbjct: 134 IVAQCLGAICGVGLVKAFQSAYFNRYGGGANTLAAGYSKGTGLAAEIIGTFVLVYTVFSA 193

Query: 599 TDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHH 658
           TDPKRNARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS+GAAV+FNN + W +H
Sbjct: 194 TDPKRNARDSHVPVLAPLPIGFAVFMVHLATIPITGTGINPARSIGAAVIFNNEKAWHNH 253

Query: 659 WIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN 687
           WIFWVGPFVGA  AA YHQY+LRA A+KAL SFRSN
Sbjct: 254 WIFWVGPFVGAAIAAFYHQYILRAGAIKALGSFRSN 289

BLAST of Tan0012473 vs. NCBI nr
Match: XP_022146281.1 (heparan-alpha-glucosaminide N-acetyltransferase [Momordica charantia])

HSP 1 Score: 1088.9 bits (2815), Expect = 0.0e+00
Identity = 560/692 (80.92%), Postives = 601/692 (86.85%), Query Frame = 0

Query: 1   MADSQPLLKNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLH 60
           MADS+PLLKNQ ELPESS KAPRVVSLDVFRGLSVF+MM VDYGGSFLPIIAHS WNGLH
Sbjct: 1   MADSRPLLKNQLELPESSGKAPRVVSLDVFRGLSVFLMMLVDYGGSFLPIIAHSPWNGLH 60

Query: 61  LADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRALYLFLLGVLLQGGYFHGITSLT 120
           LADFVMPWFLFIAG SL LVYKEV+SKV ATR AA R LYLFLLGVLLQGGYFHGITSLT
Sbjct: 61  LADFVMPWFLFIAGVSLALVYKEVKSKVAATRIAASRGLYLFLLGVLLQGGYFHGITSLT 120

Query: 121 YGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYM 180
           YGVDMERIRWLGILQRISVGYLI ALCEIWLT  TRE   NTKSF WHW  IFFLLSLYM
Sbjct: 121 YGVDMERIRWLGILQRISVGYLIAALCEIWLTCRTREEVGNTKSFWWHWCAIFFLLSLYM 180

Query: 181 GLSYGLYVPDWDFKISATSSSLPSNGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHL 240
           GLSYGLYVPDW+FKIS T+SSLP NG++ YMVNCSVRGD GPACNS GMIDRYVLGI HL
Sbjct: 181 GLSYGLYVPDWNFKISTTTSSLPPNGSYGYMVNCSVRGDHGPACNSAGMIDRYVLGIDHL 240

Query: 241 YTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV 300
           YTKPVYRN+KECNISS GQVPETSPSWCHAPFEPEGLLSSLTA+VACIIGLQYGHIL+NV
Sbjct: 241 YTKPVYRNMKECNISSRGQVPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILSNV 300

Query: 301 QDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILT- 360
           Q+HKSRT SWFSLSLK L LGIFLVFIGIPVNKSLYTVSYMLITSASAGI+FCALYIL  
Sbjct: 301 QEHKSRTRSWFSLSLKTLVLGIFLVFIGIPVNKSLYTVSYMLITSASAGIVFCALYILVD 360

Query: 361 ---RHRIKNSLPPFFLESEGLLISIGGCPRLSTLDMYSGVDGEACFESVDRIRNHPANIK 420
                R+   L   ++    L I +     +  +    G+ G   F       N    ++
Sbjct: 361 VHGYRRLTCVLE--WMGKHALCIYVLVISNIFVI----GIQG---FYWRSPKNNIDVVME 420

Query: 421 QKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGP 480
           ++ RQQ KDYVDPPPAP +G SELKLWSF+RAVIAEFMATLLFLYIT+ATVIGNNA+ GP
Sbjct: 421 EERRQQGKDYVDPPPAPLLGFSELKLWSFHRAVIAEFMATLLFLYITLATVIGNNARPGP 480

Query: 481 CSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQCL 540
           C+GVG LGIAW+FG MIFVLVYCTAGISGGHINPAVTFGLLLAR+VS++RAVAY+AAQCL
Sbjct: 481 CTGVGPLGIAWAFGVMIFVLVYCTAGISGGHINPAVTFGLLLARKVSVVRAVAYMAAQCL 540

Query: 541 GAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRN 600
           GAI GVALV+SFM+HAYN++GGGANLVA GFS+GTALGAEIIGTFLLVYTVFSATDPKRN
Sbjct: 541 GAIVGVALVKSFMKHAYNSHGGGANLVADGFSRGTALGAEIIGTFLLVYTVFSATDPKRN 600

Query: 601 ARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVG 660
           ARDSHVPVLAPLPIG AVFVVHLATIPITGTGINPARSLGAAVM+NN R W  HWIFWVG
Sbjct: 601 ARDSHVPVLAPLPIGFAVFVVHLATIPITGTGINPARSLGAAVMYNNGRAWRDHWIFWVG 660

Query: 661 PFVGAMAAAVYHQYVLRAAAVKALVSFRSNRS 689
           PF+GA AAA+YHQYVLRAA VKAL SFRSNRS
Sbjct: 661 PFLGAAAAALYHQYVLRAAGVKALGSFRSNRS 683

BLAST of Tan0012473 vs. NCBI nr
Match: XP_034226800.1 (uncharacterized protein LOC117636424 [Prunus dulcis])

HSP 1 Score: 928.7 bits (2399), Expect = 2.9e-266
Identity = 468/703 (66.57%), Postives = 542/703 (77.10%), Query Frame = 0

Query: 1   MADSQPLLKNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLH 60
           MAD  PLL           K PR+ SLDVFRGL VF+MM VDYGGS  PIIAHS WNGLH
Sbjct: 1   MADYTPLLNGVDHPATICAKPPRIASLDVFRGLCVFLMMLVDYGGSIFPIIAHSPWNGLH 60

Query: 61  LADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRALYLFLLGVLLQGGYFHGITSLT 120
           LADFVMP+FLFIAG SL LVYK+V ++  AT  A  +AL LFLLGVLLQGGYFHG+TSLT
Sbjct: 61  LADFVMPFFLFIAGVSLALVYKKVTNRAEATWKAVFKALKLFLLGVLLQGGYFHGVTSLT 120

Query: 121 YGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYM 180
           +GVD+ERIRW GILQRI++GY++ ALCEIWL+R T +     +S+ WHW +IF L ++Y 
Sbjct: 121 FGVDIERIRWFGILQRIALGYIVAALCEIWLSRQTWDELGFFRSYYWHWCVIFSLSAIYA 180

Query: 181 GLSYGLYVPDWDFKISATSSSLPSNGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHL 240
           GL YGLYVPDW+FK    +S  PSN + VY+V CSVRGDLGPACNS GMIDR +LG+ HL
Sbjct: 181 GLLYGLYVPDWEFKALTPTSMRPSNDSFVYLVKCSVRGDLGPACNSAGMIDRCILGVDHL 240

Query: 241 YTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV 300
           Y KPVYRNLKECN+S+ G+VPE+SPSWCHAPF+PEG+LSSLTA+V CIIGLQYGHILA++
Sbjct: 241 YLKPVYRNLKECNLSADGEVPESSPSWCHAPFDPEGILSSLTAAVTCIIGLQYGHILAHI 300

Query: 301 QDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILTR 360
           +DHK R N+W   S+ I  LG FL FIGIPVNKSLYT+SYMLITSASAGI FCALY    
Sbjct: 301 EDHKGRLNAWSLFSVSIFVLGSFLAFIGIPVNKSLYTISYMLITSASAGITFCALY---- 360

Query: 361 HRIKNSLPPFFLESEGLLISIGGCPRLSTLDMYSGVDGEACFESV--------------- 420
                           LLI + G   ++++  + G+   + F  V               
Sbjct: 361 ----------------LLIDVYGYRCITSVLEWMGIHSLSIFVLVTSNLAIIAIQGLYWS 420

Query: 421 DRIRNHPANIKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIA 480
           D   N   + + +     KDYVDPPPAP +   ELK WSFYRA+IAEF+ATLLFLYIT++
Sbjct: 421 DPENNLVMSEEGQAHHHGKDYVDPPPAPLIDSDELKRWSFYRALIAEFIATLLFLYITVS 480

Query: 481 TVIGNNAQTGPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLI 540
           TVIGN  Q+GPC GVGLLGIAW+FG MIFVLVYCTAGISGGHINPAVTFGL LAR+VSLI
Sbjct: 481 TVIGNKVQSGPCDGVGLLGIAWAFGGMIFVLVYCTAGISGGHINPAVTFGLFLARKVSLI 540

Query: 541 RAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVY 600
           RAVAYI AQ LGAI GV LV++F +H YN+ GGGAN+VA+G+SKGTALGAEIIGTF+LVY
Sbjct: 541 RAVAYIVAQSLGAIVGVGLVKAFQKHNYNSQGGGANMVASGYSKGTALGAEIIGTFVLVY 600

Query: 601 TVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTR 660
           TVFSATDPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS G AV+FNN +
Sbjct: 601 TVFSATDPKRSARDSHVPVLAPLPIGFAVFIVHLATIPITGTGINPARSFGPAVIFNNEK 660

Query: 661 VWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSNRS 689
            W   WIFWVGPFVGA+AAA YHQY+LRAAA+KAL SFRSN S
Sbjct: 661 AWDDQWIFWVGPFVGALAAAAYHQYILRAAAIKALGSFRSNPS 683

BLAST of Tan0012473 vs. NCBI nr
Match: XP_020425883.1 (uncharacterized protein LOC18767296 [Prunus persica])

HSP 1 Score: 927.9 bits (2397), Expect = 4.9e-266
Identity = 467/701 (66.62%), Postives = 540/701 (77.03%), Query Frame = 0

Query: 1   MADSQPLLKNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLH 60
           MAD  PLL           K PR+ SLDVFRGL VF+MM VDYGGS  PIIAHS WNGLH
Sbjct: 1   MADYTPLLNGVDHPATICAKPPRIASLDVFRGLCVFLMMLVDYGGSIFPIIAHSPWNGLH 60

Query: 61  LADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRALYLFLLGVLLQGGYFHGITSLT 120
           LADFVMP+FLFIAG SL LVYK+V ++  AT  A  +AL LFLLGVLLQGGYFHG+TSLT
Sbjct: 61  LADFVMPFFLFIAGVSLALVYKKVTNRAEATWKAVFKALKLFLLGVLLQGGYFHGVTSLT 120

Query: 121 YGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYM 180
           +GVD+ERIRW GILQRI++GY++ ALCEIWL+R T +     KS+ WHW +IF L ++Y 
Sbjct: 121 FGVDIERIRWFGILQRIALGYIVAALCEIWLSRQTWDEVGFFKSYYWHWCVIFSLSAIYA 180

Query: 181 GLSYGLYVPDWDFKISATSSSLPSNGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHL 240
           GL YGLYVPDW+FK    +S  PS+ + VY+V CSVRGDLGPACNS GMIDR++LG+ HL
Sbjct: 181 GLLYGLYVPDWEFKALTPTSMRPSSDSFVYLVKCSVRGDLGPACNSAGMIDRFILGVDHL 240

Query: 241 YTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV 300
           Y KPVYRNLKECN+S+ G+VPE+SPSWCHAPF+PEG+LSSLTA+V CIIGLQYGHILA++
Sbjct: 241 YLKPVYRNLKECNLSADGEVPESSPSWCHAPFDPEGILSSLTAAVTCIIGLQYGHILAHI 300

Query: 301 QDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILTR 360
           +DHK R N+W   S+ I  LG FL FIGIPVNKSLYT+SYMLITSASAGI FCALY    
Sbjct: 301 EDHKGRLNAWSLFSVSIFVLGSFLAFIGIPVNKSLYTISYMLITSASAGITFCALY---- 360

Query: 361 HRIKNSLPPFFLESEGLLISIGGCPRLSTLDMYSGVDGEACFESV--------------- 420
                           LLI + G   ++++  + G+   + F  V               
Sbjct: 361 ----------------LLIDVYGYRCITSVLEWMGIHSLSIFVLVTSNLAIIAIQGLYWS 420

Query: 421 DRIRNHPANIKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIA 480
           D   N   + + +     KDYVDPPPAP +   ELK WSFYRA+IAEF+ATLLFLYIT++
Sbjct: 421 DPENNIVMSEEGQAHHHGKDYVDPPPAPLIDSDELKRWSFYRALIAEFIATLLFLYITVS 480

Query: 481 TVIGNNAQTGPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLI 540
           TVIGN  Q+GPC GVGLLGIAW+FG MIFVLVYCTAGISGGHINPAVTFGL LAR+VSLI
Sbjct: 481 TVIGNKVQSGPCDGVGLLGIAWAFGGMIFVLVYCTAGISGGHINPAVTFGLFLARKVSLI 540

Query: 541 RAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVY 600
           RAVAYI AQ LGAI GV LV++F +H YN+ GGGAN VA G+SKGTALGAEIIGTF+LVY
Sbjct: 541 RAVAYIVAQSLGAIVGVGLVKAFQKHNYNSQGGGANTVAPGYSKGTALGAEIIGTFVLVY 600

Query: 601 TVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTR 660
           TVFSATDPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS G AV+FNN +
Sbjct: 601 TVFSATDPKRSARDSHVPVLAPLPIGFAVFIVHLATIPITGTGINPARSFGPAVIFNNEK 660

Query: 661 VWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN 687
            W   WIFWVGPFVGA+AAA YHQY+LRAAA+KAL SFRSN
Sbjct: 661 AWDDQWIFWVGPFVGALAAAAYHQYILRAAAIKALGSFRSN 681

BLAST of Tan0012473 vs. NCBI nr
Match: XP_021810408.1 (uncharacterized protein LOC110753759 [Prunus avium])

HSP 1 Score: 923.7 bits (2386), Expect = 9.3e-265
Identity = 466/701 (66.48%), Postives = 541/701 (77.18%), Query Frame = 0

Query: 1   MADSQPLLKNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLH 60
           MAD  PLL           K PR+ SLDVFRGL VF+MM VDYGGS  PIIAHS WNGLH
Sbjct: 1   MADYAPLLNGVDHPATIPVKPPRIASLDVFRGLCVFLMMLVDYGGSIFPIIAHSPWNGLH 60

Query: 61  LADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRALYLFLLGVLLQGGYFHGITSLT 120
           LADFVMP+FLFIAG SL LVYK+V ++  AT  A  +AL LFLLGVLLQGGYFHG+TSLT
Sbjct: 61  LADFVMPFFLFIAGVSLALVYKKVANRTEATWKAVLKALKLFLLGVLLQGGYFHGVTSLT 120

Query: 121 YGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYM 180
           +GVD+ERIRW GILQRI++GY++ ALCEIWL+R T       +S+ WHW +IF L ++Y 
Sbjct: 121 FGVDIERIRWFGILQRIALGYIVAALCEIWLSRQTWGEVAFFRSYYWHWCVIFSLSAIYA 180

Query: 181 GLSYGLYVPDWDFKISATSSSLPSNGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHL 240
           GL YGLYVPDW+FK S  +S LPSN + VY+V CSVRGDLGPACNS GMIDR +LG+ HL
Sbjct: 181 GLLYGLYVPDWEFKASTPTSMLPSNDSFVYLVKCSVRGDLGPACNSAGMIDRCILGVDHL 240

Query: 241 YTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV 300
           Y KPVYRNLKECN+S+ G VPE+SP WCHAPF+PEG+LSSLTA+V CIIGLQYGHILA++
Sbjct: 241 YLKPVYRNLKECNLSADGGVPESSPPWCHAPFDPEGILSSLTAAVTCIIGLQYGHILAHI 300

Query: 301 QDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILTR 360
           +DH  R N+W   S+ I  LG FL FIGIPVNKSLYT+SYMLITSASAGI FCALY    
Sbjct: 301 EDHNGRLNAWSLFSVSIFVLGSFLAFIGIPVNKSLYTISYMLITSASAGITFCALY---- 360

Query: 361 HRIKNSLPPFFLESEGLLISIGGCPRLSTLDMYSGVDGEACFESV--------------- 420
                           LLI + G   ++++  + G+   + F  V               
Sbjct: 361 ----------------LLIDVYGYRCITSVLEWMGIHSLSIFVLVTSNLAIIAIQGLYWS 420

Query: 421 DRIRNHPANIKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIA 480
           D   N   + + +     KDYVDPPPAP +   ELK WSFYRA+IAEF+ATLLFLYIT++
Sbjct: 421 DPENNIVMSEEGQAHHHGKDYVDPPPAPLIDSDELKRWSFYRALIAEFIATLLFLYITVS 480

Query: 481 TVIGNNAQTGPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLI 540
           TVIG+ AQ+GPC GVGLLGIAW+FG MIFVLVYCTAGISGGHINPAVTFGL LAR+VSLI
Sbjct: 481 TVIGHKAQSGPCDGVGLLGIAWAFGGMIFVLVYCTAGISGGHINPAVTFGLFLARKVSLI 540

Query: 541 RAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVY 600
           RAVAYI AQ LGAIAGV LV++  +H YN+ GGGAN V+ G+SKGTALGAEIIGTF+LVY
Sbjct: 541 RAVAYIVAQSLGAIAGVGLVKAIQKHNYNSLGGGANSVSQGYSKGTALGAEIIGTFVLVY 600

Query: 601 TVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTR 660
           TVFSATDPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS GAAV++N+ +
Sbjct: 601 TVFSATDPKRSARDSHVPVLAPLPIGFAVFIVHLATIPITGTGINPARSFGAAVIYNHEK 660

Query: 661 VWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN 687
           VW  HWIFWVGPF+GA+AAA YHQY+LRAAA+KAL SFRSN
Sbjct: 661 VWDDHWIFWVGPFIGALAAAAYHQYILRAAAIKALGSFRSN 681

BLAST of Tan0012473 vs. NCBI nr
Match: XP_038694264.1 (uncharacterized protein LOC119991845 [Tripterygium wilfordii])

HSP 1 Score: 917.9 bits (2371), Expect = 5.1e-263
Identity = 466/695 (67.05%), Postives = 542/695 (77.99%), Query Frame = 0

Query: 1   MADSQPL--LKNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNG 60
           MAD QPL  ++ Q +   +S K  RVVS+DVFRGL VF+MM VDYGG+  P+IAHS WNG
Sbjct: 1   MADYQPLPEIEEQPQPSLTSVKPTRVVSVDVFRGLCVFLMMLVDYGGAVFPVIAHSPWNG 60

Query: 61  LHLADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRALYLFLLGVLLQGGYFHGITS 120
           L LA+FVMP+FLFIAG S  L +K+V +KV AT     RA+ LFLLGVLLQGGYFHG  S
Sbjct: 61  LRLAEFVMPFFLFIAGISTALAFKKVPNKVDATSKTVLRAVKLFLLGVLLQGGYFHGTKS 120

Query: 121 LTYGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSL 180
           LTYGVD+ +IRWLG+LQRIS+GY++ ALCEIWLT  T+   +  +S+ WHW + F + ++
Sbjct: 121 LTYGVDIVKIRWLGVLQRISIGYVVAALCEIWLTHRTQREVKFVRSYCWHWCLAFSISAI 180

Query: 181 YMGLSYGLYVPDWDFKIS-ATSSSLPSNGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGI 240
           Y GL YGLYVPDW  K+S AT  SLPSN  ++Y V CSVRGDLGPACNS GMIDR VLG+
Sbjct: 181 YAGLLYGLYVPDWQIKVSNATIYSLPSNDANLYTVICSVRGDLGPACNSAGMIDRLVLGV 240

Query: 241 HHLYTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHIL 300
            HLY KPVYRNLKECNIS+ GQVP+TSP+WCHAPF+PEGLLSSLTA+V CIIGLQYGH+L
Sbjct: 241 DHLYAKPVYRNLKECNISTSGQVPDTSPAWCHAPFDPEGLLSSLTAAVTCIIGLQYGHVL 300

Query: 301 ANVQDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYI 360
           A++QDHK R   W   S+ +L LG+FLVFIGIPVNKSLYT++YMLITSASAGI FC  Y+
Sbjct: 301 AHLQDHKGRLCHWSLFSVLLLVLGLFLVFIGIPVNKSLYTINYMLITSASAGITFCFFYL 360

Query: 361 LTRHRIKNSLPPFFLESEGL------LISIGGCPRLSTLDMYSGVDGEACFESVDRIRNH 420
           L        L  F LE  G+      ++       +S    Y  V       S D I   
Sbjct: 361 LVDVYGYRYL-TFILEWMGIHSLSIFVLVTSNLAVMSIQGFYLSVPANNIKMSKDVIEGE 420

Query: 421 PANIKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNN 480
                   R Q KDYVDPPPAP + ++ELKLWSFYRA+IAEF+ATLLFLY+T+ATVIG+ 
Sbjct: 421 --------RSQGKDYVDPPPAPLIDIAELKLWSFYRALIAEFIATLLFLYVTVATVIGHK 480

Query: 481 AQTGPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYI 540
            QTGPC GVGLLGIAW+FG MIFVLVYCTAGISGGHINPAVT GL LAR+VSLIRA+AY+
Sbjct: 481 KQTGPCDGVGLLGIAWAFGGMIFVLVYCTAGISGGHINPAVTLGLFLARKVSLIRALAYM 540

Query: 541 AAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSAT 600
            AQCLGAI GV LV++FM+H YN+ GGGAN VA G+SKGTALGAEIIGTF+LVYTVFSAT
Sbjct: 541 VAQCLGAICGVGLVKAFMKHEYNSLGGGANSVAPGYSKGTALGAEIIGTFVLVYTVFSAT 600

Query: 601 DPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHW 660
           DPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS GAAV++NN + W  HW
Sbjct: 601 DPKRSARDSHVPVLAPLPIGFAVFMVHLATIPITGTGINPARSFGAAVIYNNDKSWDDHW 660

Query: 661 IFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN 687
           IFWVGPF GA+AAA YHQY+LRA A+KAL SFRSN
Sbjct: 661 IFWVGPFAGALAAAAYHQYILRAGAIKALGSFRSN 686

BLAST of Tan0012473 vs. ExPASy TrEMBL
Match: A0A6J1CYU3 (heparan-alpha-glucosaminide N-acetyltransferase OS=Momordica charantia OX=3673 GN=LOC111015529 PE=4 SV=1)

HSP 1 Score: 1088.9 bits (2815), Expect = 0.0e+00
Identity = 560/692 (80.92%), Postives = 601/692 (86.85%), Query Frame = 0

Query: 1   MADSQPLLKNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLH 60
           MADS+PLLKNQ ELPESS KAPRVVSLDVFRGLSVF+MM VDYGGSFLPIIAHS WNGLH
Sbjct: 1   MADSRPLLKNQLELPESSGKAPRVVSLDVFRGLSVFLMMLVDYGGSFLPIIAHSPWNGLH 60

Query: 61  LADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRALYLFLLGVLLQGGYFHGITSLT 120
           LADFVMPWFLFIAG SL LVYKEV+SKV ATR AA R LYLFLLGVLLQGGYFHGITSLT
Sbjct: 61  LADFVMPWFLFIAGVSLALVYKEVKSKVAATRIAASRGLYLFLLGVLLQGGYFHGITSLT 120

Query: 121 YGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYM 180
           YGVDMERIRWLGILQRISVGYLI ALCEIWLT  TRE   NTKSF WHW  IFFLLSLYM
Sbjct: 121 YGVDMERIRWLGILQRISVGYLIAALCEIWLTCRTREEVGNTKSFWWHWCAIFFLLSLYM 180

Query: 181 GLSYGLYVPDWDFKISATSSSLPSNGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHL 240
           GLSYGLYVPDW+FKIS T+SSLP NG++ YMVNCSVRGD GPACNS GMIDRYVLGI HL
Sbjct: 181 GLSYGLYVPDWNFKISTTTSSLPPNGSYGYMVNCSVRGDHGPACNSAGMIDRYVLGIDHL 240

Query: 241 YTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV 300
           YTKPVYRN+KECNISS GQVPETSPSWCHAPFEPEGLLSSLTA+VACIIGLQYGHIL+NV
Sbjct: 241 YTKPVYRNMKECNISSRGQVPETSPSWCHAPFEPEGLLSSLTATVACIIGLQYGHILSNV 300

Query: 301 QDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILT- 360
           Q+HKSRT SWFSLSLK L LGIFLVFIGIPVNKSLYTVSYMLITSASAGI+FCALYIL  
Sbjct: 301 QEHKSRTRSWFSLSLKTLVLGIFLVFIGIPVNKSLYTVSYMLITSASAGIVFCALYILVD 360

Query: 361 ---RHRIKNSLPPFFLESEGLLISIGGCPRLSTLDMYSGVDGEACFESVDRIRNHPANIK 420
                R+   L   ++    L I +     +  +    G+ G   F       N    ++
Sbjct: 361 VHGYRRLTCVLE--WMGKHALCIYVLVISNIFVI----GIQG---FYWRSPKNNIDVVME 420

Query: 421 QKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGP 480
           ++ RQQ KDYVDPPPAP +G SELKLWSF+RAVIAEFMATLLFLYIT+ATVIGNNA+ GP
Sbjct: 421 EERRQQGKDYVDPPPAPLLGFSELKLWSFHRAVIAEFMATLLFLYITLATVIGNNARPGP 480

Query: 481 CSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQCL 540
           C+GVG LGIAW+FG MIFVLVYCTAGISGGHINPAVTFGLLLAR+VS++RAVAY+AAQCL
Sbjct: 481 CTGVGPLGIAWAFGVMIFVLVYCTAGISGGHINPAVTFGLLLARKVSVVRAVAYMAAQCL 540

Query: 541 GAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRN 600
           GAI GVALV+SFM+HAYN++GGGANLVA GFS+GTALGAEIIGTFLLVYTVFSATDPKRN
Sbjct: 541 GAIVGVALVKSFMKHAYNSHGGGANLVADGFSRGTALGAEIIGTFLLVYTVFSATDPKRN 600

Query: 601 ARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVG 660
           ARDSHVPVLAPLPIG AVFVVHLATIPITGTGINPARSLGAAVM+NN R W  HWIFWVG
Sbjct: 601 ARDSHVPVLAPLPIGFAVFVVHLATIPITGTGINPARSLGAAVMYNNGRAWRDHWIFWVG 660

Query: 661 PFVGAMAAAVYHQYVLRAAAVKALVSFRSNRS 689
           PF+GA AAA+YHQYVLRAA VKAL SFRSNRS
Sbjct: 661 PFLGAAAAALYHQYVLRAAGVKALGSFRSNRS 683

BLAST of Tan0012473 vs. ExPASy TrEMBL
Match: A0A6P5S9C4 (uncharacterized protein LOC110753759 OS=Prunus avium OX=42229 GN=LOC110753759 PE=4 SV=1)

HSP 1 Score: 923.7 bits (2386), Expect = 4.5e-265
Identity = 466/701 (66.48%), Postives = 541/701 (77.18%), Query Frame = 0

Query: 1   MADSQPLLKNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLH 60
           MAD  PLL           K PR+ SLDVFRGL VF+MM VDYGGS  PIIAHS WNGLH
Sbjct: 1   MADYAPLLNGVDHPATIPVKPPRIASLDVFRGLCVFLMMLVDYGGSIFPIIAHSPWNGLH 60

Query: 61  LADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRALYLFLLGVLLQGGYFHGITSLT 120
           LADFVMP+FLFIAG SL LVYK+V ++  AT  A  +AL LFLLGVLLQGGYFHG+TSLT
Sbjct: 61  LADFVMPFFLFIAGVSLALVYKKVANRTEATWKAVLKALKLFLLGVLLQGGYFHGVTSLT 120

Query: 121 YGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYM 180
           +GVD+ERIRW GILQRI++GY++ ALCEIWL+R T       +S+ WHW +IF L ++Y 
Sbjct: 121 FGVDIERIRWFGILQRIALGYIVAALCEIWLSRQTWGEVAFFRSYYWHWCVIFSLSAIYA 180

Query: 181 GLSYGLYVPDWDFKISATSSSLPSNGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHHL 240
           GL YGLYVPDW+FK S  +S LPSN + VY+V CSVRGDLGPACNS GMIDR +LG+ HL
Sbjct: 181 GLLYGLYVPDWEFKASTPTSMLPSNDSFVYLVKCSVRGDLGPACNSAGMIDRCILGVDHL 240

Query: 241 YTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILANV 300
           Y KPVYRNLKECN+S+ G VPE+SP WCHAPF+PEG+LSSLTA+V CIIGLQYGHILA++
Sbjct: 241 YLKPVYRNLKECNLSADGGVPESSPPWCHAPFDPEGILSSLTAAVTCIIGLQYGHILAHI 300

Query: 301 QDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILTR 360
           +DH  R N+W   S+ I  LG FL FIGIPVNKSLYT+SYMLITSASAGI FCALY    
Sbjct: 301 EDHNGRLNAWSLFSVSIFVLGSFLAFIGIPVNKSLYTISYMLITSASAGITFCALY---- 360

Query: 361 HRIKNSLPPFFLESEGLLISIGGCPRLSTLDMYSGVDGEACFESV--------------- 420
                           LLI + G   ++++  + G+   + F  V               
Sbjct: 361 ----------------LLIDVYGYRCITSVLEWMGIHSLSIFVLVTSNLAIIAIQGLYWS 420

Query: 421 DRIRNHPANIKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIA 480
           D   N   + + +     KDYVDPPPAP +   ELK WSFYRA+IAEF+ATLLFLYIT++
Sbjct: 421 DPENNIVMSEEGQAHHHGKDYVDPPPAPLIDSDELKRWSFYRALIAEFIATLLFLYITVS 480

Query: 481 TVIGNNAQTGPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLI 540
           TVIG+ AQ+GPC GVGLLGIAW+FG MIFVLVYCTAGISGGHINPAVTFGL LAR+VSLI
Sbjct: 481 TVIGHKAQSGPCDGVGLLGIAWAFGGMIFVLVYCTAGISGGHINPAVTFGLFLARKVSLI 540

Query: 541 RAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVY 600
           RAVAYI AQ LGAIAGV LV++  +H YN+ GGGAN V+ G+SKGTALGAEIIGTF+LVY
Sbjct: 541 RAVAYIVAQSLGAIAGVGLVKAIQKHNYNSLGGGANSVSQGYSKGTALGAEIIGTFVLVY 600

Query: 601 TVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTR 660
           TVFSATDPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS GAAV++N+ +
Sbjct: 601 TVFSATDPKRSARDSHVPVLAPLPIGFAVFIVHLATIPITGTGINPARSFGAAVIYNHEK 660

Query: 661 VWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN 687
           VW  HWIFWVGPF+GA+AAA YHQY+LRAAA+KAL SFRSN
Sbjct: 661 VWDDHWIFWVGPFIGALAAAAYHQYILRAAAIKALGSFRSN 681

BLAST of Tan0012473 vs. ExPASy TrEMBL
Match: A0A1S3U4S1 (uncharacterized protein LOC106761912 OS=Vigna radiata var. radiata OX=3916 GN=LOC106761912 PE=4 SV=1)

HSP 1 Score: 911.0 bits (2353), Expect = 3.0e-261
Identity = 466/703 (66.29%), Postives = 552/703 (78.52%), Query Frame = 0

Query: 1   MADSQPLLKNQQELPESSRKAP-RVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGL 60
           MAD QPLL N  E  +     P R+ SLDVFRGLSVF+M+ VDYGGS  PIIAH+ WNG+
Sbjct: 1   MADPQPLLLNDSEPTQFQNTRPTRIASLDVFRGLSVFLMILVDYGGSLFPIIAHAPWNGI 60

Query: 61  HLADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRALYLFLLGVLLQGGYFHGITSL 120
           HLAD VMP+FLFIAG SL LVYK    +  AT  A  RA+ LF+LGV+LQGGYFHGITSL
Sbjct: 61  HLADLVMPFFLFIAGISLALVYKRRPQRTQATCKAFARAVKLFVLGVVLQGGYFHGITSL 120

Query: 121 TYGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLY 180
           T+GVD+ERIRWLGILQRIS+GY++ ALCEIWL     +     K++ WH +++  LL+LY
Sbjct: 121 TFGVDLERIRWLGILQRISIGYIVAALCEIWLPPPRLKDINFVKNYYWHGFVVVILLALY 180

Query: 181 MGLSYGLYVPDWDFKISATSSSLPS-NGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIH 240
            GL YGLYVPDW F +SA++SSLP   G  +Y VNCSVRGDLGPACNS GMIDRY+LG+ 
Sbjct: 181 SGLLYGLYVPDWQFDVSASTSSLPPIGGGDIYTVNCSVRGDLGPACNSAGMIDRYILGLG 240

Query: 241 HLYTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILA 300
           HLY KPVYRNL+ECN+S  GQV + SPSWCHAPF+PEG+LSS+TA+V+CIIGLQYGH+LA
Sbjct: 241 HLYRKPVYRNLRECNMSK-GQVSDRSPSWCHAPFDPEGILSSITAAVSCIIGLQYGHVLA 300

Query: 301 NVQDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYIL 360
           ++QDHK R  +W   SL +LALG+FL   GIP+NK+LYTVSYML+TSA++G+ F ALY L
Sbjct: 301 HLQDHKERLYNWMCFSLSLLALGLFLALTGIPINKALYTVSYMLLTSAASGLTFIALYYL 360

Query: 361 T----RHRI--------KNSLPPFFLESEGL-LISIGGCPRLSTLDMYSGVDGEACFESV 420
                + R+        K+SL  F L S  + +I+I G         +S  +       V
Sbjct: 361 VDVHGQRRLTAVLEWMGKHSLSIFVLVSSNMAVIAIQGF-------YWSKPENNIINWIV 420

Query: 421 DRIRNHPANIKQ--KGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYIT 480
            R        K+  +  QQ KDY DPPPAP + L+E+KLWSFYRA+IAEF+ATLLFLY+T
Sbjct: 421 TRFALSLTMSKELTEEGQQRKDYFDPPPAPLIDLAEIKLWSFYRALIAEFIATLLFLYVT 480

Query: 481 IATVIGNNAQTGPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVS 540
           +ATVIG+  QTGPC GVGLLGIAW+FG MIFVLVYCTAGISGGHINPAVTFGL LAR+VS
Sbjct: 481 VATVIGHKKQTGPCDGVGLLGIAWAFGGMIFVLVYCTAGISGGHINPAVTFGLFLARKVS 540

Query: 541 LIRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLL 600
           LIRAV YI AQCLGAI+GV LV++FM+H YN+ GGGAN V++G+SKGTALGAEIIGTF+L
Sbjct: 541 LIRAVLYIVAQCLGAISGVGLVKAFMKHPYNSLGGGANSVSSGYSKGTALGAEIIGTFVL 600

Query: 601 VYTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNN 660
           VYTVFSATDPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS GAAV++NN
Sbjct: 601 VYTVFSATDPKRSARDSHVPVLAPLPIGFAVFIVHLATIPITGTGINPARSFGAAVIYNN 660

Query: 661 TRVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN 687
            +VW  HWIFWVGPFVGA+AAA YHQY+LRAAA+KAL SFRSN
Sbjct: 661 GKVWDDHWIFWVGPFVGALAAAAYHQYILRAAAIKALGSFRSN 695

BLAST of Tan0012473 vs. ExPASy TrEMBL
Match: A0A1S2Y9C8 (uncharacterized protein LOC101494283 OS=Cicer arietinum OX=3827 GN=LOC101494283 PE=4 SV=1)

HSP 1 Score: 901.7 bits (2329), Expect = 1.8e-258
Identity = 456/701 (65.05%), Postives = 539/701 (76.89%), Query Frame = 0

Query: 1   MADSQPLL--KNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNG 60
           MAD QPLL   N Q + + ++   RV S+DVFRGLSVF+M+ VDYG S  PII+H+ WNG
Sbjct: 1   MADPQPLLLNTNSQPIHQFNQNPRRVASVDVFRGLSVFLMILVDYGASIFPIISHAPWNG 60

Query: 61  LHLADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRALYLFLLGVLLQGGYFHGITS 120
           LHLADFVMP+FLF+AG SL LVYK    +  AT  A  RAL LF+LG+LLQGGY HG TS
Sbjct: 61  LHLADFVMPFFLFLAGISLALVYKRRPQRTQATWKALLRALQLFILGILLQGGYLHGTTS 120

Query: 121 LTYGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSL 180
           LTYGVD+ RIR  G+LQRIS+GY++ ALCEIWL     +     KS+ WHW++   LL++
Sbjct: 121 LTYGVDIGRIRLFGVLQRISIGYIVAALCEIWLPTLRWKEMGFFKSYYWHWFVATILLAI 180

Query: 181 YMGLSYGLYVPDWDFKISATSSSLPS-NGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGI 240
           Y GL YGLYVPDW F +S ++SSLP  +G ++Y VNCSVRGD GPACNS GMIDRY+LG+
Sbjct: 181 YSGLLYGLYVPDWQFDVSTSTSSLPPIDGGNIYTVNCSVRGDFGPACNSAGMIDRYILGL 240

Query: 241 HHLYTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHIL 300
            HLY KPVYRNLKECN+SS GQ+ ++SPSWCHAPF+PEG+LSS+TA+V+CIIGLQYGHIL
Sbjct: 241 DHLYKKPVYRNLKECNMSSTGQLSDSSPSWCHAPFDPEGILSSITAAVSCIIGLQYGHIL 300

Query: 301 ANVQDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYI 360
           A+++DHK R + W S S+   ALG+FL  IGIPVNKSLYTVSYML++SA++G+ F ALY+
Sbjct: 301 AHLEDHKGRLHHWLSFSVSFFALGLFLAVIGIPVNKSLYTVSYMLLSSAASGLTFMALYV 360

Query: 361 LT---RHRI---------KNSLPPFFLESEGLLISIGGCPRLSTLDMYSGVDGEACFESV 420
           L     HR          K+SL  F L S  L +                + G    +  
Sbjct: 361 LVDVYGHRRLTSVLEWMGKHSLSIFVLVSSNLAV--------------IAIQGFYWTKPE 420

Query: 421 DRIRNHPANIKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIA 480
           + I        Q      KDYVDPPPAP +  +E+KLWSFYRA+IAEF+ATLLFLY+T+A
Sbjct: 421 NNIEVSEEGHLQTHHHGGKDYVDPPPAPLLDFAEIKLWSFYRALIAEFIATLLFLYVTVA 480

Query: 481 TVIGNNAQTGPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLI 540
           TVIG+  QTGPC GVGLLGIAWSFG MIFVLVYCTAGISGGHINPAVTFGL LAR+VSLI
Sbjct: 481 TVIGHKKQTGPCDGVGLLGIAWSFGGMIFVLVYCTAGISGGHINPAVTFGLFLARKVSLI 540

Query: 541 RAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVY 600
           RAV Y+ AQCLGAI GV LV++ M+  YN  GGGAN VA+G+SKG+ALGAE+IGTF+LVY
Sbjct: 541 RAVLYMVAQCLGAICGVGLVKALMKQPYNNLGGGANSVASGYSKGSALGAEMIGTFVLVY 600

Query: 601 TVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTR 660
           TVFSATDPKRNARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS GAAV+FNN +
Sbjct: 601 TVFSATDPKRNARDSHVPVLAPLPIGFAVFMVHLATIPITGTGINPARSFGAAVIFNNAK 660

Query: 661 VWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN 687
           VW  HWIFWVGPFVGA+AAA YHQY+LRAAA+KAL SFRSN
Sbjct: 661 VWDDHWIFWVGPFVGALAAAAYHQYILRAAAIKALGSFRSN 687

BLAST of Tan0012473 vs. ExPASy TrEMBL
Match: A0A2I4GCB0 (uncharacterized protein LOC109006608 OS=Juglans regia OX=51240 GN=LOC109006608 PE=4 SV=1)

HSP 1 Score: 898.7 bits (2321), Expect = 1.6e-257
Identity = 454/702 (64.67%), Postives = 541/702 (77.07%), Query Frame = 0

Query: 1   MADSQPLLKNQQELPESSRKAPRVVSLDVFRGLSVFMMMFVDYGGSFLPIIAHSSWNGLH 60
           M D +P+   +Q+ P + R+  R+ SLDVFRGL VF+MM VDY GS  PII+HS W+G+H
Sbjct: 1   MGDYEPVPDFEQQ-PHTDRRTTRIASLDVFRGLCVFLMMLVDYAGSIFPIISHSPWDGIH 60

Query: 61  LADFVMPWFLFIAGASLGLVYKEVQSKVTATRNAACRALYLFLLGVLLQGGYFHGITSLT 120
           LADFVMP+FLF+AG S  LVYK+V ++V AT  A  RA+ LFLLGV+LQGGY HG+ SLT
Sbjct: 61  LADFVMPFFLFVAGVSPALVYKKVPNRVEATGKAVGRAVKLFLLGVVLQGGYLHGVNSLT 120

Query: 121 YGVDMERIRWLGILQRISVGYLITALCEIWLTRCTRERAQNTKSFSWHWYIIFFLLSLYM 180
           +GVD+E+IRW+GILQRIS+GY++ ALCEIWLT  T       K + WHW I F L ++Y+
Sbjct: 121 FGVDVEKIRWMGILQRISIGYIVAALCEIWLTFRTWREEGFFKGYYWHWCIAFSLSAVYL 180

Query: 181 GLSYGLYVPDWDFKISATSSSL-PSNGTHVYMVNCSVRGDLGPACNSVGMIDRYVLGIHH 240
            LSYGLYVP+W F + ++ SSL PSN ++VYMV C+VRGDLGPACNS GMIDRY+LGI H
Sbjct: 181 LLSYGLYVPNWQFTVPSSDSSLPPSNDSNVYMVKCAVRGDLGPACNSAGMIDRYILGIDH 240

Query: 241 LYTKPVYRNLKECNISSGGQVPETSPSWCHAPFEPEGLLSSLTASVACIIGLQYGHILAN 300
           LY KPVYRNLKEC IS+ GQV ++SP WCHAPF+PEG+LSSLTA+V CI GLQYGH+LAN
Sbjct: 241 LYKKPVYRNLKECKISANGQVSKSSPPWCHAPFDPEGILSSLTAAVTCIFGLQYGHVLAN 300

Query: 301 VQDHKSRTNSWFSLSLKILALGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILT 360
           +QDHK R  +W   S  +  LG+ L F GIP+NKSLYTVSYMLITSASAGI FCALY   
Sbjct: 301 LQDHKGRLKNWSLFSTSMFVLGLLLAFTGIPLNKSLYTVSYMLITSASAGITFCALY--- 360

Query: 361 RHRIKNSLPPFFLESEGLLISIGGCPRLSTLDMYSGVDGEACFESVD------------- 420
                            LL+ + G  RL+ +  + G    + F  +              
Sbjct: 361 -----------------LLVDVYGYRRLTCVLEWMGKHSLSIFVLIASNLAIIAIQGFYW 420

Query: 421 RIRNHPANIKQKG--RQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITI 480
            + ++   + ++G   Q  KDYVDPPPAP + L+ELKLWSFYRA+IAEF+ATLLFLYIT+
Sbjct: 421 TVPDNNIEVSEEGHSHQHGKDYVDPPPAPLIDLAELKLWSFYRALIAEFIATLLFLYITV 480

Query: 481 ATVIGNNAQTGPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSL 540
           ATVIG   ++GPC GVGLLGIAW+FG MIFVLVYCTAGISGGHINPAVTFGL LAR+VSL
Sbjct: 481 ATVIGYK-KSGPCDGVGLLGIAWAFGGMIFVLVYCTAGISGGHINPAVTFGLFLARKVSL 540

Query: 541 IRAVAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLV 600
           IRAV Y+ AQCLGAI GV LV++FM+H YN  GGG N VA G+SKGTALGAEIIGTF+LV
Sbjct: 541 IRAVVYMVAQCLGAICGVGLVKAFMKHDYNTLGGGTNSVAQGYSKGTALGAEIIGTFVLV 600

Query: 601 YTVFSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNT 660
           YTVFSATDPKR+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS GAAV+++N 
Sbjct: 601 YTVFSATDPKRSARDSHVPVLAPLPIGFAVFIVHLATIPITGTGINPARSFGAAVIYDND 660

Query: 661 RVWSHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN 687
           +VW  HWIFWVGPFVGA+AAA YHQY+LRAAA+KAL SFRSN
Sbjct: 661 KVWDEHWIFWVGPFVGALAAAAYHQYILRAAAIKALGSFRSN 680

BLAST of Tan0012473 vs. TAIR 10
Match: AT2G16850.1 (plasma membrane intrinsic protein 2;8 )

HSP 1 Score: 430.3 bits (1105), Expect = 3.0e-120
Identity = 214/272 (78.68%), Postives = 240/272 (88.24%), Query Frame = 0

Query: 415 IKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQT 474
           + ++GR   KDYVDPPPAP + ++ELKLWSFYRA+IAEF+ATLLFLY+T+ATVIG+  QT
Sbjct: 5   VSEEGR-HGKDYVDPPPAPLLDMAELKLWSFYRAIIAEFIATLLFLYVTVATVIGHKNQT 64

Query: 475 GPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQ 534
           GPC GVGLLGIAW+FG MIFVLVYCTAGISGGHINPAVTFGL LAR+VSL RAVAY+ AQ
Sbjct: 65  GPCGGVGLLGIAWAFGGMIFVLVYCTAGISGGHINPAVTFGLFLARKVSLPRAVAYMVAQ 124

Query: 535 CLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPK 594
           CLGAI GV LV++FM   Y   GGGAN VA G+S GTALGAEIIGTF+LVYTVFSATDPK
Sbjct: 125 CLGAICGVGLVKAFMMTPYKRLGGGANTVADGYSTGTALGAEIIGTFVLVYTVFSATDPK 184

Query: 595 RNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFW 654
           R+ARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS GAAV++NN + W  HWIFW
Sbjct: 185 RSARDSHVPVLAPLPIGFAVFMVHLATIPITGTGINPARSFGAAVIYNNEKAWDDHWIFW 244

Query: 655 VGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN 687
           VGPFVGA+AAA YHQY+LRAAA+KAL SFRSN
Sbjct: 245 VGPFVGALAAAAYHQYILRAAAIKALASFRSN 275

BLAST of Tan0012473 vs. TAIR 10
Match: AT4G35100.1 (plasma membrane intrinsic protein 3 )

HSP 1 Score: 422.5 bits (1085), Expect = 6.3e-118
Identity = 207/273 (75.82%), Postives = 239/273 (87.55%), Query Frame = 0

Query: 415 IKQKGR-QQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQ 474
           + ++G+    KDYVDPPPAP + + ELK WSFYRA+IAEF+ATLLFLY+T+ATVIG+  Q
Sbjct: 5   VSEEGKTHHGKDYVDPPPAPLLDMGELKSWSFYRALIAEFIATLLFLYVTVATVIGHKKQ 64

Query: 475 TGPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAA 534
           TGPC GVGLLGIAW+FG MIFVLVYCTAGISGGHINPAVTFGL LAR+VSL+RA+ Y+ A
Sbjct: 65  TGPCDGVGLLGIAWAFGGMIFVLVYCTAGISGGHINPAVTFGLFLARKVSLVRALGYMIA 124

Query: 535 QCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDP 594
           QCLGAI GV  V++FM+  YN  GGGAN VA G+SKGTALGAEIIGTF+LVYTVFSATDP
Sbjct: 125 QCLGAICGVGFVKAFMKTPYNTLGGGANTVADGYSKGTALGAEIIGTFVLVYTVFSATDP 184

Query: 595 KRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIF 654
           KR+ARDSH+PVLAPLPIG AVF+VHLATIPITGTGINPARS GAAV++NN + W   WIF
Sbjct: 185 KRSARDSHIPVLAPLPIGFAVFMVHLATIPITGTGINPARSFGAAVIYNNEKAWDDQWIF 244

Query: 655 WVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN 687
           WVGPF+GA+AAA YHQY+LRA+A+KAL SFRSN
Sbjct: 245 WVGPFLGALAAAAYHQYILRASAIKALGSFRSN 277

BLAST of Tan0012473 vs. TAIR 10
Match: AT4G35100.2 (plasma membrane intrinsic protein 3 )

HSP 1 Score: 422.5 bits (1085), Expect = 6.3e-118
Identity = 207/273 (75.82%), Postives = 239/273 (87.55%), Query Frame = 0

Query: 415 IKQKGR-QQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQ 474
           + ++G+    KDYVDPPPAP + + ELK WSFYRA+IAEF+ATLLFLY+T+ATVIG+  Q
Sbjct: 5   VSEEGKTHHGKDYVDPPPAPLLDMGELKSWSFYRALIAEFIATLLFLYVTVATVIGHKKQ 64

Query: 475 TGPCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAA 534
           TGPC GVGLLGIAW+FG MIFVLVYCTAGISGGHINPAVTFGL LAR+VSL+RA+ Y+ A
Sbjct: 65  TGPCDGVGLLGIAWAFGGMIFVLVYCTAGISGGHINPAVTFGLFLARKVSLVRALGYMIA 124

Query: 535 QCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDP 594
           QCLGAI GV  V++FM+  YN  GGGAN VA G+SKGTALGAEIIGTF+LVYTVFSATDP
Sbjct: 125 QCLGAICGVGFVKAFMKTPYNTLGGGANTVADGYSKGTALGAEIIGTFVLVYTVFSATDP 184

Query: 595 KRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIF 654
           KR+ARDSH+PVLAPLPIG AVF+VHLATIPITGTGINPARS GAAV++NN + W   WIF
Sbjct: 185 KRSARDSHIPVLAPLPIGFAVFMVHLATIPITGTGINPARSFGAAVIYNNEKAWDDQWIF 244

Query: 655 WVGPFVGAMAAAVYHQYVLRAAAVKALVSFRSN 687
           WVGPF+GA+AAA YHQY+LRA+A+KAL SFRSN
Sbjct: 245 WVGPFLGALAAAAYHQYILRASAIKALGSFRSN 277

BLAST of Tan0012473 vs. TAIR 10
Match: AT3G54820.1 (plasma membrane intrinsic protein 2;5 )

HSP 1 Score: 385.6 bits (989), Expect = 8.5e-107
Identity = 194/268 (72.39%), Postives = 223/268 (83.21%), Query Frame = 0

Query: 424 KDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQTGP------C 483
           KDY DPPP P    +EL  WSFYRA+IAEF+ATLLFLY+TI TVIG  +QT P      C
Sbjct: 15  KDYQDPPPEPLFDATELGKWSFYRALIAEFIATLLFLYVTIMTVIGYKSQTDPALNPDQC 74

Query: 484 SGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRAVAYIAAQCLG 543
           +GVG+LGIAW+FG MIF+LVYCTAGISGGHINPAVTFGLLLAR+V+L+RAV Y+ AQCLG
Sbjct: 75  TGVGVLGIAWAFGGMIFILVYCTAGISGGHINPAVTFGLLLARKVTLVRAVMYMVAQCLG 134

Query: 544 AIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTVFSATDPKRNA 603
           AI GVALV++F    +   GGGAN ++ G+S GT + AEIIGTF+LVYTVFSATDPKR+A
Sbjct: 135 AICGVALVKAFQSAYFTRYGGGANGLSDGYSIGTGVAAEIIGTFVLVYTVFSATDPKRSA 194

Query: 604 RDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVWSHHWIFWVGP 663
           RDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARSLGAA+++N  + W HHWIFWVGP
Sbjct: 195 RDSHVPVLAPLPIGFAVFIVHLATIPITGTGINPARSLGAAIIYNKDKAWDHHWIFWVGP 254

Query: 664 FVGAMAAAVYHQYVLRAAAVKALVSFRS 686
           F GA  AA YHQ+VLRA A+KAL SFRS
Sbjct: 255 FAGAAIAAFYHQFVLRAGAIKALGSFRS 282

BLAST of Tan0012473 vs. TAIR 10
Match: AT5G60660.1 (plasma membrane intrinsic protein 2;4 )

HSP 1 Score: 383.6 bits (984), Expect = 3.2e-106
Identity = 194/278 (69.78%), Postives = 222/278 (79.86%), Query Frame = 0

Query: 414 NIKQKGRQQSKDYVDPPPAPFMGLSELKLWSFYRAVIAEFMATLLFLYITIATVIGNNAQ 473
           ++ + G   ++DY DPPPAPF  + EL+ W  YRAVIAEF+ATLLFLY++I TVIG  AQ
Sbjct: 6   DVNESGPPAARDYKDPPPAPFFDMEELRKWPLYRAVIAEFVATLLFLYVSILTVIGYKAQ 65

Query: 474 TG------PCSGVGLLGIAWSFGAMIFVLVYCTAGISGGHINPAVTFGLLLARRVSLIRA 533
           T        C GVG+LGIAW+FG MIFVLVYCTAGISGGHINPAVT GL LAR+VSL+R 
Sbjct: 66  TDATAGGVDCGGVGILGIAWAFGGMIFVLVYCTAGISGGHINPAVTVGLFLARKVSLVRT 125

Query: 534 VAYIAAQCLGAIAGVALVESFMRHAYNANGGGANLVAAGFSKGTALGAEIIGTFLLVYTV 593
           V YI AQCLGAI G   V++F    Y   GGGAN +A G++KGT LGAEIIGTF+LVYTV
Sbjct: 126 VLYIVAQCLGAICGCGFVKAFQSSYYTRYGGGANELADGYNKGTGLGAEIIGTFVLVYTV 185

Query: 594 FSATDPKRNARDSHVPVLAPLPIGLAVFVVHLATIPITGTGINPARSLGAAVMFNNTRVW 653
           FSATDPKRNARDSHVPVLAPLPIG AVF+VHLATIPITGTGINPARS GAAV++NN + W
Sbjct: 186 FSATDPKRNARDSHVPVLAPLPIGFAVFMVHLATIPITGTGINPARSFGAAVIYNNEKAW 245

Query: 654 SHHWIFWVGPFVGAMAAAVYHQYVLRAAAVKALVSFRS 686
              WIFWVGP +GA AAA YHQ++LRAAA+KAL SF S
Sbjct: 246 DDQWIFWVGPMIGAAAAAFYHQFILRAAAIKALGSFGS 283

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9ZVX84.2e-11978.68Probable aquaporin PIP2-8 OS=Arabidopsis thaliana OX=3702 GN=PIP2-8 PE=1 SV=1[more]
P930048.9e-11775.82Aquaporin PIP2-7 OS=Arabidopsis thaliana OX=3702 GN=PIP2-7 PE=1 SV=2[more]
P427671.5e-11677.01Aquaporin PIP-type OS=Atriplex canescens OX=35922 PE=2 SV=1[more]
Q7XLR19.8e-11677.53Probable aquaporin PIP2-6 OS=Oryza sativa subsp. japonica OX=39947 GN=PIP2-6 PE=... [more]
Q8H5N91.6e-11073.91Probable aquaporin PIP2-1 OS=Oryza sativa subsp. japonica OX=39947 GN=PIP2-1 PE=... [more]
Match NameE-valueIdentityDescription
XP_022146281.10.0e+0080.92heparan-alpha-glucosaminide N-acetyltransferase [Momordica charantia][more]
XP_034226800.12.9e-26666.57uncharacterized protein LOC117636424 [Prunus dulcis][more]
XP_020425883.14.9e-26666.62uncharacterized protein LOC18767296 [Prunus persica][more]
XP_021810408.19.3e-26566.48uncharacterized protein LOC110753759 [Prunus avium][more]
XP_038694264.15.1e-26367.05uncharacterized protein LOC119991845 [Tripterygium wilfordii][more]
Match NameE-valueIdentityDescription
A0A6J1CYU30.0e+0080.92heparan-alpha-glucosaminide N-acetyltransferase OS=Momordica charantia OX=3673 G... [more]
A0A6P5S9C44.5e-26566.48uncharacterized protein LOC110753759 OS=Prunus avium OX=42229 GN=LOC110753759 PE... [more]
A0A1S3U4S13.0e-26166.29uncharacterized protein LOC106761912 OS=Vigna radiata var. radiata OX=3916 GN=LO... [more]
A0A1S2Y9C81.8e-25865.05uncharacterized protein LOC101494283 OS=Cicer arietinum OX=3827 GN=LOC101494283 ... [more]
A0A2I4GCB01.6e-25764.67uncharacterized protein LOC109006608 OS=Juglans regia OX=51240 GN=LOC109006608 P... [more]
Match NameE-valueIdentityDescription
AT2G16850.13.0e-12078.68plasma membrane intrinsic protein 2;8 [more]
AT4G35100.16.3e-11875.82plasma membrane intrinsic protein 3 [more]
AT4G35100.26.3e-11875.82plasma membrane intrinsic protein 3 [more]
AT3G54820.18.5e-10772.39plasma membrane intrinsic protein 2;5 [more]
AT5G60660.13.2e-10669.78plasma membrane intrinsic protein 2;4 [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000425Major intrinsic proteinPRINTSPR00783MINTRINSICPcoord: 447..466
score: 43.36
coord: 575..593
score: 52.05
coord: 611..633
score: 57.61
coord: 489..513
score: 61.48
coord: 526..545
score: 49.47
coord: 651..671
score: 47.53
IPR000425Major intrinsic proteinTIGRFAMTIGR00861TIGR00861coord: 451..667
e-value: 2.0E-61
score: 205.6
IPR000425Major intrinsic proteinPFAMPF00230MIPcoord: 440..668
e-value: 7.9E-78
score: 261.3
IPR000425Major intrinsic proteinCDDcd00333MIPcoord: 447..671
e-value: 3.68261E-79
score: 249.863
IPR023271Aquaporin-likeGENE3D1.20.1080.10Glycerol uptake facilitator protein.coord: 432..689
e-value: 1.6E-85
score: 288.3
IPR023271Aquaporin-likeSUPERFAMILY81338Aquaporin-likecoord: 439..673
NoneNo IPR availablePANTHERPTHR31061LD22376Pcoord: 1..359
NoneNo IPR availablePANTHERPTHR31061:SF28HEPARAN-ALPHA-GLUCOSAMINIDE N-ACETYLTRANSFERASE-LIKE ISOFORM X1coord: 1..359
IPR022357Major intrinsic protein, conserved sitePROSITEPS00221MIPcoord: 507..515

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0012473.1Tan0012473.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0055085 transmembrane transport
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0016020 membrane
molecular_function GO:0015267 channel activity
molecular_function GO:0016740 transferase activity