MS023210 (gene) Bitter gourd (TR) v1

Overview
NameMS023210
Typegene
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionTyrosine-specific transport protein
Locationscaffold78: 991862 .. 1011765 (+)
RNA-Seq ExpressionMS023210
SyntenyMS023210
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGATTTCTTCCTGTCTCCGGCTTACATTTCCTGTAATTCGAAGAAGCCTCGATTTGCCGAAGCAGAATGCGCCTTGCCTCGGGTCTTTCGAGTCACTTCGACCGCGTCGGCCCACGGTTCTTCGTCCAAGATTAAGATCTACGGTCACCTGCTTCTCGCGACGGCCGGCGGAGTCCACTGTCTCCGGAGAAGAGAAGGAAATCGTCCAGGAAGAAGAATCGCAACAGTACGAATTGGAGAGGCTGTTCTCCAACCTTAATCAAGTCACACTCAAGCGAGAACCTGGTGATTTTTCGGTTCCCTCTGCTTATTTGATTCGAGATTCTTTTGAAAATTGGCGTCGTAGTTTCATTGGAAGTTGAAAATTAATTGTAGTTTGAATTTTGATTCCTCTTGCCCTGGGTTTCTGTTCTGGTTCGGCCGGCAGGAAGCTTATCCAGCGCGATTTTCCTGGTGGCTGGGACGACAGTAAGCATTTCATTCCTGAATTTCCTTTTTCCTTCTAATATTACACTATTTTGAATAGTTAATATTTCTGAGCATGTCAAAATTATGTTATTGACTTTTTACCTAGCTTCTGAGTTCATTACTCTAATACTGACAATATTGGAAACTGTATAATTTTCTTTTTGAAAATATTTTACTTGAATCTGTTGCTTAAGAAATGTCTCTATATTTCATTAGAATAAGTTGTAGTATTTTCGTCCATAAGCTAAGCAGCTCAGCGGTAATTAGCATCTGTATCTAACCAATCTTACACCCTACTTGTTGTACTAAAAAAGAAGATGGTAGTATTTACATCCGAGGATAAAGGGAACTAAACATTCTCCTCAATGTAAGTGCAAAATGCTTTCATAAATTATAAACTAATATTAATGTACCATAATAAGTTGTGAATTCTATACCAATAAAATGATTTATTTATAACTTCTATGCTCCAAGATTTCGTGGATATGTCCTGGGGAAATACTCTTAATAATCTTTTCCTTTATATCTCTATCAAGGATTTTTTATATTATACAGATTGGTGCTGGGATCCTCGCCATTCCTGCAGTAACTCAAGAATCCGGATTTCTAGCCTCAGCTATTACGTGCACGTTTTGTTGGGTGTACATGGTAAAGCTATAGCAGCAGCTTGTATGTTTTAAAGTTTTTTTTTTTTTGTTTGTTATTTTTATCAAAATCTTATTTTAATAGTATAGTTAAAGTTTGCAATAGAATGCAGTCAGAAATCAGAATTTGATTGTGCAGTTCTATCATTTCTTTATGTTTGATTCCATAAAATTTTGTCCAATTATTGTTTCACAATGATTCTATTGATTTTTTAGTTTATTGCAAAACCTTATAAAAATGACACAATTTGTTTGAAATTTCACTCTCTTGAATTGCCTACTTGGGCAGTTGGGCTCAAAGAATGAACTACTTTAAAGTAGGAACTGTGGTATTTCTCCTGTAGTCTGTTGGTAGATTTATGTGTTGCATATCGCTTTATATATTCTAGAGCATCTCTATATTAGTTTCCTGCATTAAATGTTAATATATAAAATCTTAGCAAACTTTTCTATAATGTTGTCCATAGGTATGAAAGAGAGATTCACTATAATTAACTAAACTAAGATATAATGCAAGCAGTAAAAATCAACAAATATGAAATGTTAAAGAAAATGAAAATTAAAGAGAATTTCCTGTTGAAAGTGGTTAGGTATGTTATTAGCCCTTTGATAGTAACTTAGAAAAAAGATCCCCATCGTATATCACTTTTCTGAAAATACTTCAGACTTTAAGAGAATTATATAGTGGCGAAAAAATAGAATTTTTAGACAATTTTCAAATTCATGATAATAGATACAGTTTCTAAACTAATCTGAAAAATGGTTTTTGTAAGCATTAAACAGTGTTCACAATAAACTAAAATGGTCGGAGACTTATTTTGCATTGTTATCGTTTCCATTTATTGTTTCTTATTTCCAGTTTTATTGTTTCTCATTAGAAGTTACCGGAAGCAAATTTTTGTGCATCCCATTTGATNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNATTTTTGAATTTTTAATTTCTTTATTAAAAAAAAAAAAAACTTCAAACAAAAAAGAAAACCTTAGTAGGCAGGCCCTATGTTTCTAAAGTTCTCTAAGAATCCAGTTCAGCAATAGAGGTTTTAAGTCCAAGATTGAAGGAGATTTAGGATGACCATGAATACAAGCTATATTGTTTCTAATTCTCAATTTCTCAGCCCAATTCTTTATATGCTTAGTACAATGAACAGCCGCAGCACCACCTTAATACTGGCATAGCATTTTTTCCATATGATAAAGTTTGTTTATAACTTATGGTTACTTTCATCCCATATTTGGGGTCCTGAATAATCAGTTACATAATTGAACCATGCAGGTTGTGACAGGACTGCTAATTGCTGAAGTCAATGTGAACACCATGTGCGAACTGGGTTCTGGTGGCGTGTCTCTGGTAAGCTCTTTTCTGTTTCTAAAGTTTTGGGGTTAAATGGGAAATGTTCATTTTTGCAAGGAGAAATACATGCCCCGGCTGCTTGTTATTCAACTTGAGCATTGGGAAAAAAGTTTTATCAAATAATCTAAACTTGTTGGTTAACCATCAAGGTGATATGATATCATAAATCAAAATTCTGATAGGAAATTGCCTTCGATTATAACGTAGTAATCTGATCTTCTCCCAGCAGAAAGTAAAGTACTGTTCTTTATTATTTTGTTTGCAAAGATGTACTGTTATTCATTGTTATACATTAAATTGTGCAAATATATCTGAAGAATAAGTTATAGAAATTCTTCTTGTAAAAAATTGAGTAATGGAGTATGTGGATAAGAAAACCTACTGAAAAATAAGCATGCTAAAGTTTCTAGCATTGAGCAAAAGTAAGTTCCAGAATTTGTCCTGTGAAAGTTATTGTGAATTTTGAACGATTTTCCATTCGATTTCTATGCTGGAAAAAAGAAAGAAGGGACACATTAAACTTTAACGCATGAACTATAAAAGGAGAAACCAGATAAAAAATTTGATATTGTTACCTTTGCGAATGTATTCCATTCTCAATTTTTTCCAAGTTCCTGCTTTATGCTTATGTAGGTTGATTTGTTGCGACCATTTAATTTTATTGTTTCCTATTGCAATATCTTTAGTATGTGATAAGTCTAGAGATCCAAGTGTTTGAGTTGTGTTTTTCAGGTATCGATGGCCATGAGAACTCTTGGGACAGTTGGCGTTCAAGTTTCTTGGTAGGCAGAGATGTAATTGTTTTAAGTTCTAATGTCTATCTGTTTCCAAACATTGAATTTTGATATGTACTTGTAGAGACAAAATGTCGCAAGTATCTCTGCCAAATTTTAACTATGAGTAGCAACTTGATGCCTTACATGCTGATGATGGTTATATTATTTATTGTGCAATTTGTAATTTTTGTTCGCAACTAGATCTATTGTTATGGAGCATTTACCTTTAAAAATAATGAGTGAAACAAAATTAAAAGATTAATTCATTTTATTGTATATAGCTTATGGATGGATAAAAGGTAAGTTTGTATTAGAGACCGTAACAATGCCTTAACGCTTTACTCTTAGGCAGTGCTACTGCTAATGAAAAATAAATCATTTAAGTAGACCAATTATGACAGACGGAAACTTATTTCTGGTGACTTACACTTGTGTCAGTGAGCCCTATTAATCTTATAATGTTCATATGGATATGGAAATGATACGGGGTCAAAAAGATTAAGCATGTGGCTGCTTGTGCAATGATGTGATAAACCATTCATCACTATACTTAATATGTTTATTTATTATGGCCTCCAGCATTTTTCTTCAGTCTGTCTGTTAATTCTGACAATAATATTTTGCAACTTGCAGTTGGTCATACATTCTCATTCACTATGCGCTTCTGGTTGCTTATGTGGCTCGTTCTTCAGACATCTTGACCAATTTTCTTGGCATTCCCTTGTACGAACGAGGAGTCTAACCTTCTTAAAGTCCCAGTATTAGAACTTGTTTGCCTTTCTAATCAGTGTGATCATACAAAGAAGACTCATGCATTGTTTTCAAGCACATTTTATATGTTTTTTTTTTGGTGTTGATGCAGCATTGTAATTGCAGATGGGAGAGTGCAACCTTGTTCTCTTTGATTTTTGGAGGTATATGCTACTTTGGAAGGTTGGTACTTCACTCTCTACACTGTATTGATTCAACTTCATATTCACATATATGTGCATTTACATATTACAAATACGTCACTTCTGCTGCATATTGACTGCAAAGATGTTTTTAATTGCAGCCAACGTTTAATTGGTGCTGTCAATGGAACCCTAGTACTTGGAATCATCATTTCCTTTGCAGGTCTTGTGGTAATATGCTCACTTCACTCCAGGAATTACATGCTCCCATTTGGGCTGTTACAAGGATTGTTAACATATAGCAAGTGACCCGTATGTTGTGTTTGGTATGGATACTTGAGTTAATGTAGTAGTTAACCACATTTTCTTGAATATTCATGATAATTTTATAGTGGGAAAACCCCCTTGACCAGAGAACTTCATTGGACATTTATTGTTTGAAATTTTTTCTGCGGATATAGCTGAAATAAAGTATACTTTGCTTGCACAACTTATGAAAAATACATTTTCACAATATTAATTTGGAAGAGCATTGCATTTTACATTTTTACTGCTCTGATTATATTACGTAGGCCCATTTCTTTCAGTTTTTTTATCAGTTTACTAGTTCTAGGACTAACAAGAATGATCTTCCGTACTCAATAAAACATCACTTGATTCTCTGGTCATGGATGCTTTTTCGTCAGTTATTTCACTTGGTGCTTGATAGATGCACTAATATCAAATTTTATTTATTAGGCAGTCGCCAGTGGAGGCCTACATTGGGATGCTCTCGTTAAAGCTAATTTCGAAGCTGTTCCTATGAGTATACCTATAATTGCCCTTTCATTTGTTTACCAGGTAAATTACGGCTCTGCTATATTTCAAAGTTGTCCTGTGAATGTAATAACGTTATGTGATTTATCTTTTAAAAAAAAAAAAAAAAAACATTATGTGATTTTTTTTTTCAGAACGTGGTGCCAGTACTCTGTACGAATCTGGAAGGAAACCTGACAAAAGTAAGGTGACAAATATTTCCTTTTCCATGTTTGGCAACATTTGTAAACTGTGTATAAACTATGAAACTACCTCAAATCATTTTTTTGTTAACTTGACATGTCTAGAACCAAAAATTTCAATAGAATTTGTAATTCCTGATGTTATTAATTTGGTTCAATGCATCCCATTTTTCTCTTAACTACAGCGGATTTTATCCGAATCCCATAATCAGTTGTGTTTTGGTTCTAAGTGTTTGTATTCTGTCAGTCATCTTTGAAACAATCAAAAGTTGTTTGATACCATCTACTGGCTATTTTTTCAGGACATCCATTGTACTTGGCACTGCAATACCTCTAGTTCTATTTCTTGTCTGGAATGGTGTCATTCTTGGCACTATTTCAAGCCTTGAAATGGGATCGGATAAGATTATAGACCCATTAGAGCAATTGAGATCAGCAAATGGAGCTGTAGGAGTAAGTTTCTGGACTGCTTAGTTTATGTACTTTATAGCACTTTTATCGGATGACTTATTTGAAGCAACTATTTCTTAATGCAGCCTATAGTAGAAGTATTCTCTCTTTTGGCGATTGCAACATCCTACATCGGATTTGTTTTGGGATTATCGGATTTTCTTGCTGACTGTAAGTGACTTCTCTCATTAGTTTTTAGAATCAGAACTGTATTAGACATGTTATATAGTCTGGAGCCTGTTGTTGGGATTTAAGGGTCACAGTTTTAGGCCTAATGACTATATAGATTAATTCTGTGTGGTCACTTGAAAAATATCAATCTGTAACAAAGAATACAATACAAGTAAATGTAGGAAAACAGACACTGAATTTATGTGGAAAACCCATCAACGTAAGGACAATCTCTTAAAAACTTCCAGTATCACATAATAGGATTACAATAAATTTCATCGGTCATAATTAGAGGAAATCACAATAATAAAAAGGTTAAATTACCAAAAAAAAAAAAAAAACATCCAAAATTGTTGTCATTTAAAAAAAAATTATCCATATCTTTCAAAAGTTTCAATACTACTCTTCGTTTCTAAAGTGCTCTTTGAGCAAACATCCCTTTAATGTTTTGAACTAAAATATGACATAACAATTAACGTGGAAGTTTGCCTAGATTAAGTTTGTAAAGAGAGGAAATGTGGAAGAAATTATCGTCTTCAAAGTCCGCCAATCCGTCATTCTATTTTCATCCTCAACTTCTTCTATCATTTTTAAAAATGATAGTTGATTCCACATTACTTCTCTTTTTTTCGTTCAAATATTAACGGATGTTTGTTCAAAAGTATTTTTGGAACTTTTATCAAAGATCCAGTTTTGAAATTTTTTAAAGAATAGGGGTATTTTTAAAACAGGGCAAAGTTCAGGGATAAATTTTGTACTTTAGCCTACACTTCAGGTAAATTACCCTTTTACCTTTGTATAACAGATCCTAGAGAAATATGGCATGATCTCACTGTAGTAGACCTAGACGGCTAGAAAAAATGACTTATTTCTCTAACGTTGTGGCCTTTTCACGAATTCCTGAAAATTTCTTCTTTCTTTTCTCAAGAACTTTACAGTGGCCCTCTCCCTATATTTCTCTCTATTATGGTTGCCTTGCACCTCATCATCCGTCTCTTTATTTAAACAGTCCTTTTTTCCATTCTAATCAATAATCATCATTTTGTTGTTCACTTATCCATTTTCTGCATTCAACTTTCTTGTGGCCCATCTTCCTGCAATAATAGCATTGAATCTCTCTAGTGTTCTTGCTTGCTGATCGACCGTGATCACTACAACCCTTTCGATTTTGATTCCTCCCTCGATTTTTCTTGACCAGAACATCCGACTCCATGGCTAAAGAACCCAAGGCTATCCTCATCACTTCTTTGTTTAACAGACTACTCTTAACCATAATCGTACTCAACTTCTCACTAGAAGAGTAATCATTTAGTGATTCGACTACCGTAACCAAGTTGTCCGGCAAAGAACTAAGAAGCAGCAACGCCTGCAACTCATCATCTAAAACTATCTGCATCGATGAGAGCTGATTCATTATACTTTGCATCTCACTCAAATGATTTGCAACTGAAGTAGTAGTACCCTCCTTGTATTTTATAACATATCTTGCCTGGTTGATTTTTTAGATCCAGATAATGCGAAGCGTTTCTCTCATACAACTCTTCTAATTTCTTCCCCAAGGAGTGTGCCAATGTCTCGTTTGAGATATGGTAATACACACTGTCATCCACCCACTGCCGAATATACCCAACAACTTGTCTGTTCAATAAGTCCCAATCTTCTTTGCTCATATTCTCTGGTTTGCTTGACTCGCCCAAAATAGGAGCACGCATCTCTTTACAATAAAGATCTTCCTTTCTTCTTTTCCATAGTTGCCAATTTGAGCCGTTGAAAATGACCATCCTACTGTCCACTTCCATGACTACAACTAATCCAATGACAACAACAATGGTTGACAAGTCAGCAACATGTGGGCTCGACTTCGTTCATTTTTAGTACTTAGTTCAAGCTGGTATCGAACCAAGCTTAATTTGAATATATCTTTGAGAGAGTGGAACTTTGCCTTGAGGGGAGTGCTGAGAATCGCCGCCCGACAAAAATGTCTAAGGAAGTCCAGATCTTTCCCTGGTCAGAGCTGAGCTGAACTCTATCCTCCTCGTATTTAGAGACATGGGTACACGTGTCCGGGCTGAGGAGCGCACTAAATTTTACCCTATAACACAACAACAATAATTTTGTATCAACAGAAACGACAACCACCTTATTGACCTGACAATGTGATTCCACTGCCCAACTTTGATACCACTTGTGGGTATTGACCTGACAATGTGATTCCACTGCCCAACTTTGATACCCAACTTTGATACCATTTGTGGGGATTGATGAAAGGTCACAATTTTTGGGCGACAAAACTGGGCATAAAATATACCTTCACCTGTAATCTAATAGATAAGTTTTAGAAAATATAAAATCATCTCCTCACCGCGGTAGATGACGACAAAATAGGTCTTTTTGGTGTCTTTATATAGCTCAGAAAATTTTAGCTCTCCTCTGGAGAATTTCCCGACGGCTCTCTCACTATATTTTTCTTTATCCTCTATTGCGGATAGAACTTGGCTCCTTACTAATAAGAAATAATCTTGTCCTTACCACCATTAAATATTTGATAACTAAGAAAAAAAATGCAAAAACAAAAAATTATTTTAAGATATAATGAGTTGTCAAATATCCAATGGTTGTAAGGACAAGGTTCCTTACTAATGATGAGTCAAGTTTTATCCCTTTGTTGCGGTGGCCTTGCACTCACCTTTTGACTCCTTGCCTTTTTTTTTTTTAACTTCTCTTTATAGAGAGTAAAGCCTAAGTGCCTCCCCACCATATGCCTATAATATTTTAGGTGATTGTAAATTAAAATCAATTTGCTCTAGAGAGTATGTAGCTCAAATGATAAAGTGTTGGTGCAGAAATTTGGTCTCCTGAGATCTCGCCAAATTTGTACCAATCTTCTAAGTATGAAAGGGATCGACGTGACCTGCAAAACAAAATAGCGCTAGGATCGGTGTGGCTTTGGCCACACGGGCTCCGATGCTTAAGTTAGTATCGGAGAAGTCGAGTGAGGAAAGTAGGAGGTTTTAGCATACCTTGGCTTACTTCCCTATTTATAGAAGTTGAATCATAATCTCCCTTAGGGTTTACAGAAGTCGAACCTTAGTATCTTTAGGAGATCTGGACTTCTGGGCCCATGTGCTTCGTCGTGGATCTAATGTCAAATCAAGGGTCGAGATCAGGTGGGGTTTCGGATTGGACTGCGCGAGGATCGAGGGGAGTAGAGTTAGGGATCTCGACCTTTTGGTCGAGAACACCTCGGCGCCGAGCACTGCTCGACGCTGACCACGGGTCGATCCAGGATGGCCCGAGCCGACGGTCTTCGATGATTCTCCTTCTGTCGCTTCATCTGTTCTGTAGTTCTCGTCCCATAACAGTAGCCTCCACTCTCAAAATAGAAGCCAATAAGCATAAGTAAAATGTTAAGACTTCTATATTGAGAGTACTTGTCGCTCAGTCGTTCCATGGGTGACTGTTTTTGTGGAATAGCCCTCATACCCTTTCGTCGTGGTTGGCTATCGTTTCATGGGTCGACCCTCTTAGCCCCCATACCCTTTTGTCGTAGTTGGCTGTAGTCAAGTCTTTTGGACTTTACCGTACAGTTAATTTTAACTGTTTAGGTTTTTGTCTTGGGATCTAACTTTTAGTTGCACTAGCTGTACCACCATTGAGTTTCTAACCCGTTTGGATTTTGCCCATGCTTTTATACTCAAGGGTTTACTCTTGGGTTTCCAACCTATTTGGCTTTTGTGAATGGTTTTATATCCTGAGTCCAGTCGATATCCTTTTGTTGTTGTCGGTTTATCGTTTCATGGGTTGGCCTTCTTAGCCTCCATACCCTTTTGTCGTAATCGGATGTGATCAAGTCTCTTGGGCTTTGCTATACAGTTAATTTTAACTGTTTATGTTTTTTTTTGTTGGCCTTGGGATATAACTTTTTAGCTGGGCCAACGGTACCGCTTTTGGGTTTCTAACTTATCTGGCTTTTGCCTATGGTTTTATACCCCAGGTTTACTCTTGGGTTTCTAACCTATTTGGTTTTTGCCAATGGTTTTGTACGTTGAGTTTCATCAATACCCTTGGGTTTCTAACCTATTTTGACTTTTGTCAAATGGTTTTATACCTTGGGTTTCGTTGATTGGTTTGACATGCCTGCTTTCTGTATTAATATTAGTGGGCCTGTACGCAACCCTTTCAGCACCTTTGGGATTCTCTTCCCCGGTTTCACTGATTGACTTTTTAGCTTATCGGCCTGCTTCTAAGTCTATCCCAGTTCGAGTTCCTTGACCAACTGTTGATTAGCTTATCCATGCCCCCACTTGAGGAGTTTTTGTTTAGAACCAAAATAAATGCTCAAGTAGTAAAAGCTAATTAAATCCCAGATGCTAGAATTGATCGTTTTTGGAAACATAAAATCATACGTAATCTGAAACAAACGACATATACTACATAATATGCAAGCAATAAAAACCTATTCAATCATTAAAAAGGACACGATTTTGTGGTTTCTCAAGATGACGTAGTAGTATCCCCGCAATACTCGTTGTTCTCTCCTGACTCTTGTTCGAGCGGGACTATCGTTGGGGCATATCTTTTTTTAGGGTCTCGTTGATTTGATTGGTCTTGAGTTTTGTTGAGATCCAATTTTGAATGAAGTAGCTCTTTTGCACCCATATATTTTTGGTTGGGTTCGAGTCAATGCTTCTGAGAAAGTTGTTGGGTTTGATCTTGCTTCGTTGCCACTTTTACCTCATAACACTTGCCCCTACTCTCGAAGTGATGTATTTCTACTTTTTTATCTATCACTTTGAACCTCGGGTTTGCATCTAATGTACCCGTGAAGGACGAGTCACATTTGTCATTTCTTGATCAAGACCCACGATCGAGGTGGCCTCTTCCTCGGCTGTCGAGCCGAACCATGTTGGAATTCTTGATCGAGTTGTCACCTCACCACGATTGTCGAGCCCCGCTTTGTTGGTTCTCGTGTAGGTATGATCACATCATCGATTCCCGTAGAAGTGTTGTCATGTCATGCACTGGTTCCCGTGGAGGTGTAGTCACCTTGCAGCAGGACCCATGACCGAGGCTACCTCGGGTTTCATGGTGGCCGAGCAGGCTTCGACCTAGTGGTTTATCACCATTGGTCCGAGTCATGCTCTCATGCCTTCTGAGGCTATGTAAATTCGTTCTACCTCCTTCACATTTTACTCACCCTTCCACTTCGAGGTAATCATTCTTCCTTTCTTGCTCTTTTGTTACTTGCTTATTTCCATGGCATCTGGTAGTGATCTAAAATTTGAGTGGACTACCTCCGACGACGAGTCGTCATCCCGCCCGTCTTCTGAGTCTCGATCCTTTCGTCCTTCCTGGGTGGACAACTCTACGCTTACAGCGAAAGACTTACGTCGTCTGCGCTCAAAACACCGCATTCCCGATTCGGTCAGCCTTCGTCCCCCAAACCCCGGGGAGAGTATTGACGGATCCTACCCCGAGGAAGTCGTGTTCTATGCGGCCATGTTCAAATTTGGAGTTCGCCTACCATTGCCACTGTTCCTTCAAAACTTATTAACTTTTAACCAAGTAGCTCCTTCTCAGATCGTGCCTAATGGGTGATCTACATTGATCAGTTGTTTCATCTTGTGGTCCTATCTTGGGGGTGACCAACCCTTAACTGCGACCATCTTCATGTCGCTTTATTTCCTTAAGCATAGCCCCGACGACAACTTCCATTTTTATGTTAGCTCACGAAACCAAATACTTATTTCAGGTCCTTCCTCGGTAAAGCACTGGAAGGATGAGTGGTTCCTTATCGGTGGTGACTGGCTTGCCCGGGACGACCGTGAATTGCATTTTCCTGTCCCGATGGAGTGTAGGGAGTTAGGTAAGTAGCTTCCCAAACTCTTATCGGGACTCTCCTATATCATATTCTAAACCTTCTCTTTTTGTTTCTTTTGTTGACAGTTCCCCGTACCTGAGAACTCTCTACCGAGACATTGAACTTCTACCACACCGAAACGCTCGGACTTTTGACCTATGCCGTTCCTACTCGTCATCATCACTGTCATCCATATCATCAGGGGCATTGCATTCAGCTCGGAAACGCTCCTGAAGAAAGGAGTAATCGACGTCGTGAAGGGATGTGAATCCCGGAGCGGAAACGGATTTTCGTTGCTGATTTTCGGGATACAAAATCAAATTTTACAGAAGAAAAATACAGGGGAAATTCTACCTTTGAAGAACGTTTCTTCAACGAGTTTCCCTCGAACACCACCACTATGTCTTCCTCGCTATCCTCTTGGGTCTCGGGATCGTGCTGGTGGGAGCCACTTGTTTGGAATGTCAGGGAGAGTTAAGGGAGAGCTTGATCGAGAGAAAACTGGGAGAGTTCTTGCTATGTTCTGTATTCTGTCTTTTTGCAGAACTGCCAAGAAAACAAAACAGTGACGTGGGGTAACTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGTATCCATAGTGTTGCCAGGATAAGGTACCCAGCCTTATCCATATACTATAGATCATTTAGGTTATTCACGGGACATTATCCATTCATATATCAGGGGTATATCAAATATATACCCCGTATATCAACTATATACCCAGTATATATACTGTCCCGATTACATAAAAAATAACCTTGGATGTTTAGTTTATTGGATTTTGGACTAAGCAAAGCTATGATTCAACAACAAATTTATTAAATAAAATTTTATTAAATAGACATGTGTTTTTACAAAATACAAACTACGAGTTTAGGGCAACACACTCAACACGTCGGGGTGTTCATCGCAGACCTGACGGAGAGACTGGAGACGCCACATTCCATGACGGCATTCTGTAAATCGATGTAGTCTTGGGTCTTCTTAAACTCCTCAGTTAGGAATTCGGAGTTGGAGAGTCAAGCATTGGTCTCGTCAAGCTTCCTCCGCAGAGCTGCTAGGTCGTCTTGGGATCTCTTCCTTTTGGCCTCTAGGTCATGCTTAGCATTCGTCAGCAGGATGTTAGCTTGGTATAAAGAGGCCAGAGCTCGGTCCAGTTCATCCCGAGGGGTGTAGTGGCTACGGAGATGGTTGATCTCGTCAGCAACTCTACTCAAAAACACGAACAAGTGGTTAAAAACTTGTTAAAATCAATAACGACGTTAGGTTGAGTATGACATTGAATACCAAAAGCCAACTACGAAACTGACCTGGTAGAGATCACTAAGACAGGTATGGATGCTTTGTTGGGGGTTTGAGTAACCAAAACTCGACCCGCTTGATGACTCTCCTAGCCCTTGCCAAAGGCTGAAAGCTCTCTCTAGGTAGGCCGCCCCGTTGGCATCAGCATCATACGGACCCGAAAAAGGGAAAGCCGACACATGGGAGGAAGACTTAAGATGGCCGCGAGAGAAAGACGAGGTAGATGTCCCTTAAAGTCGAGCAGAGCGACGAGGCGCAGTAGGGGGCATCGAGGGTTGAACTTGGTCTCGAGAGAGGGGATCATCCTCCGTCAAATCAGTGATCTCGACATCAGAGGCTCGACTAGATCCCGCACCGGAGCTCCGAGTGTTTCGACGACGTTCTCTTTCGAATAAGGAAGTGGCCCGTCTCGACATTCCTACAAAGGAAGTACGAAATAAAAATCGGCTAAATGCAGATTTAAAGGAAACTCCTGATGGTTACCTGAAAGTGGAAGAGGAGGAGCCAAGCCAAAATCTATCAAGTTCTACTCGTTTACCAAGTCTGGTCCCCACCACTCGACGTCGGGTAACTGGAGGACAGTGTGATAGAAGTTCAATGCCTCGGTGGGGAGTTCACAGGTACAGGAAACTGTCAACAAAAGAAACAAAAAGAGAAGGTTTAGAATATGATATAGGAAAGTCCCGATAAGAGTTCGGGAAGCTACTTACCTAACTCCCTACACTCCATCGGGACAGGAAAATGCAATCCACGGTCGTCCCGGGCAAGTCAGTCACCACTGGCAAGGAACCACTCATCCTTCTAGTGCTTTACCGAGGAAGGACCCGAAATAAGAATTTGGTTTCATGAGCTAACGTAAAAATGGAAGTTGTCGTCGGGGCTATGCTTAAGGAAATAAAGTGACATGAAGATGGCCGCGGTTAAGGGTTGGTCACCCTCAAGATAGGACCACAAGATGAAACAACTGATCAATGTAGACTACTCATTAGGGACGATCTGGGAAAGAGCTACTTGGTTAAAAGCTAAGCAGTTTTGAAAAAACAGTGGCAATGGTAGGCGAACTCCAAATTTGAACATGGCCGCATAGAACACGACTTCCCTGGGATAGGATCCATTAATACTCTCCCTGGGGTTTGGGAGACGAAGGCTGACCGAATCGAGAATGCGGTATTTTGAGCAGAGACGGCATAAGCCTCTCGTTGAAAACGTAGAGTTGTCCACCCAGGAAGGACGAAAGGATCAGGACTCAGAAGACGGGTGGGATGACGACTCGTCGTCGGAGGTAGTCCACTCAAAGGTTAGATCACTACCAGATGCCATGGAAATAAGCAGGTAACAAAAGAGCAAGAAAGGAAGGATGATTACCTCGAAGTGAAAGAGGTGAGTGAAATGTGAAGGAGGTAGAACGGGTTTATATAGCCTCAGAAAGCATGAGAGCATGACTCAGACCAATGGTCATAAACCACTAGGCCGAAGCCTGCTCGGCCACCATGAAACTTGAGCCAGTCTCGGTCACGGGTCCTGCTGCGAGGTGACTACACCTCCACATGAACCAGTGTAGGACATGGCAACATTTCCACGAGAATGGATGATGTGATCATACCCACACGAGAACCAACAAAGTGGGGCTAGACAGTCAAGGTGACAACTCGGTCACAAATTCCAACATGGCTCAGCTCGGCAGTCGAGGAAGAGGCCACCTCGGTTGGGGTCTTGATCAAGAAATGACGGATGTGACTCGTCCTTCACGGGTCCATCAGATGCAAACCTGAGGTTGACAGCTACAACACTTCTTGTGACCCGCCAGTAAGTGTACTCTCAAAGTGATAGATAAAAAAGTGGAAATACATCATTTCGAGAGTAGGGGCAATTGTTATGAGGTAAAAGTGGCAATGAAGCAAGATCGAACTGGGTGAGTCAACCCACGAGGAATGGATCGAACCGAGCAGGGCTTGGTTCCTACCCTGCTCGGTGTCGAGCCCCGTCCTTGTTCTTCTGCTGTTGTTCCAAGACCTAGGGTTGCAACGTCCCAGCCCGTAAGCCCACAAGTATCAAGACCTAGGGTTAGCACGTTGTGAAACCCCAGGGGGTAAATCGACTATAAATAGGGAGGGGATCACCCCTTGAAGGTAAGCCAAAACGCCACTTTCAAGTGCCTTACTTGTTTAGTGATTTCTCCCATAGCTAACTTAAGCATCGGAGTGTGACAAAAATCACATCGGGTTAGGGTTGTTCGGTTTTGCAGATATCTTGTCCCTTGATATTCGAGTTTGATGCATGGAGTGGAGTGAACCGGTCTACCAGAATCCAAGCATCAACAATTGGCGAATCCTACCCCGGGGACGTCATCCTGCCCATCTTCTGCGGTCGTGTATTTCCCTCATTCGCCTGAGTATCTTTAACATCAGCTTTGATCCTTTATCCTTAATAAGATTAACCAGTTTTCTTGCAAGAGTTTGAGTTCCTTTTTATATTGTTATGAAATTATTATTGAGACCCCTTCTGAATCTAGGTGGGTTTATAAGGGTGCGTTTGATTGGTGGTCAAAAAACAAGGGTTTCAAATGAAAAATGAGTTTTATTATTAGTTTTCAGTGTTTTCAACATGTGTTTGGTAGCAAATTTGAAAATTCTCAATTTAAAAAATTGTTCAACGTGTGTTTAGCAATGTATTTGTAAACTAACCGTAGTTACCAACTAAAGATTTAATTTAATCTAAAATAGTATTAATTAATTTATTAACAATTTTATTATAAATTTTGTATAGTTATTATTTTATGATGTTGTATAATTTATAATATATTATATTTTACCTATCTTAGTTTTAACAAAAATGAAATGAGTTTATAACATGGAATCATATATTTATAATCGTGTAATTCTGCATTTTAAAATTTAAAATTTAAAATTGGAAAATACTTAAAAGATATAAAAATATTTTTTTTCATGAATTTTTACTATTTGGATCACATAATTCGAAAATAGTTTTAAAAAATAAATCTGTCGATCCAAATCTATTTGATTTACTTAGAAACAAGAAGCAAAATTCGGATTGTCCACCGAACTCTAAATTACTATAAGTGGACGGTGGCCCCGTTTATATGCATGCTTAATGTAATAAATTCTAAAATTACAAGTTTGCTGTGTCTATCAAGTTCTTATTTTTAATTTTTTTTAAAAATAGATCTTTGAACTTTTAAAGTTGTATCTATTTAATCCTTGAAATTTTTGTGTCTAATAAGGACTCGTTTGATAACATTCTCGTTTCTTGTTTCTCGTTCCCAATTGGGGCCGAGTTAAGTAAATACGGACTACCTAGTGATTTACAACCACCCTGCGATTTCAAAATAGAAAAATGGTAATCATTGAGATTTGTTTGAAAGGGGAAAAAAGAGAGAGTTCCAAACCTAAATAAACATCCAAATTGCCCATCAATCTCAATGACCAAGAATTGGCAATTGACGCGGGAAGGAACTATGGAATTGGTAATGATTCTGCTGCTATTCATCAAACGAAGCAACATCTTGGCAAACTGTTTTATATGAAAGATCTTGGTCCTCTAACTGGGATTGAAGTTTTTGGTCTCCGGGTATGCATCTAACTCAGCAGAAGTACGCCATGGAATGCTTCACAGATTGCCCTGTGTGACTCCAGCTTAATTCCAAGTTGTATTCTCATGATGGTGAGCTGCTATCAGATCCATCGGCCTATAAGAGTATGTATGGTGGGTGGTCTCCAGTACTTTACTCTGACTCGCCCAGTCGTTTGCTATTGGACTTGTAGACAAGTTTATACAGTCTCCTCGGGTTTCTCATTTAGTAGCTGATAAACGTATTTTGAGATATCTAAAAGCTTAAAAAATCTAAGTATTTTTCAACTAACGTTGATTCTTGTTTCTCCTTTTTTGAGAATACTCTCACAATTGTATTGAAATTTTGGAAACAGCCAAAACTAGTTTCTCCTTTCTTGTTCCTTTTTCCAATTTGTTCCAAAACATAAATAAAAATATTATTTTATCATACTTGAATTCTTAGAGATTGGAACAATGCAATACATTTTTCTTCATTAAAAATAAAGAAATAGAAACAAGAAACAGGAACGAAGGAGTCCTAAATGTCTGTCATTTGCTTGATTAGTTTACTCTTTTTCACTAAAAAAAAAACTAAAAATTAATCATTACATGAAAATACATTATTTTAAAATTACGTGGTAGGTCAATTTTGGCCAGGTTTAGGTGTTTATGATAGGAATGCTGTTGGAAATATTGAGGGTATTTTAGTAATTGTGTTAGGAGGTCTTGTTATAAATAGAGTTAAAAGAGAAAGGATGGATGAAGTAGACAATTAGTGAGCATGGTTTAGGGTTTAGGCTTGAGTAAGAATACTCAAAAGAGAGGGAGGATCCAAGTACCTCGAATTACTTGGAAGTTATTCTAAGTTCTTATATCTTTTATATTTCAATATATTTCTAGTTCGGGTTCCATAGTTTAATAGTGCAAACTTAAATAAACAAAGGTAACAACAAAATTTAAATATTAATGACCTATTAGAATTTTTTTACATAATTCAAAGACCAAATAGACACGTCTTAAAATTTCAAAGATCTATTGAAAATCTTTTTAAGTATGAAGCGTTAATAGACACAACTATGAAAGTTCATGAACCAAACGGAATACATAGAGATTTGTTTGTTCTCTAATTTATATGTTTTATGCAAAATCACAAGCAGTGCTAAAACTACCAACTGGGGAGAGCAGGCCTTTGCCATTCATTTTGACATTAATTCCACCCCTTATACTGTCATTGCTTGACCCAGAAATATTTTTCAAAGCACTGGACTTGGCTGGCACATATGGAGGTAAATCATTAACAATCTATTCTCGTAATTTTATTTTGGTTAAAATACTATTTTGGTCTCTGTTCTTTGAGCTTTGGTTCATCTTGGTTTCTGCGGAACTTTAAAAATGTTCATTTTGATCCTTGTATTTCGAGCTTTGATTCATTTTAGCTTCTGTATTTTCAAAATGTTTGTTTTGGTCCCTACGCTTTTACAAAGGGATCGTTTTTGTCCTTTAATTTTATTTATAATCTCATTTTTTCTACCTTAATTTCACCATGACACTACACTTAATGTATTTATTGAAATGTAACCATAATTTTATATTAAAATGTTTTCGTTAGGTATAGAATTTGTTTTAAAATTTTTCAAATACGGACTAAATGGTCACTTTTTAAAAGTATAGAACCAAAATGAACATTTTGAAAGTATAGAATCTAAAATGAACCAAAATTGAAAGTATTGAGATTTTGGAAACGAATCAAAGCTAGAGTGTAGCACCAAAATAGTATTTTAGACTTTTATTTTACCTCAACCTAAAGAAACCAAGTTTGTTTCCTGATTTATTATTTTCCGTTCCTGTTAGTGTTGCTGCTGTTTGGAATTATTCCAGCTGCAATGTCGTGGTCGGATCGGTATTCGGGTTCACCCCCGTCCGTGAAACTACCGGAGGTGGTTCCCGGAGGAAGGTTTACTCTTTCCCTCGTGATCGGAGGTGCTGGATGGGTAATTTTCTCAGAACTATTGAACAACCTTGGGCATCTA

mRNA sequence

ATGGCGATTTCTTCCTGTCTCCGGCTTACATTTCCTGTAATTCGAAGAAGCCTCGATTTGCCGAAGCAGAATGCGCCTTGCCTCGGGTCTTTCGAGTCACTTCGACCGCGTCGGCCCACGGTTCTTCGTCCAAGATTAAGATCTACGGTCACCTGCTTCTCGCGACGGCCGGCGGAGTCCACTGTCTCCGGAGAAGAGAAGGAAATCGTCCAGGAAGAAGAATCGCAACAGTACGAATTGGAGAGGCTGTTCTCCAACCTTAATCAAGTCACACTCAAGCGAGAACCTGGAAGCTTATCCAGCGCGATTTTCCTGGTGGCTGGGACGACAATTGGTGCTGGGATCCTCGCCATTCCTGCAGTAACTCAAGAATCCGGATTTCTAGCCTCAGCTATTACGTGCACGTTTTGTTGGGTGTACATGGTTGTGACAGGACTGCTAATTGCTGAAGTCAATGTGAACACCATGTGCGAACTGGGTTCTGGTGGCGTGTCTCTGGTATCGATGGCCATGAGAACTCTTGGGACAGTTGGCGTTCAAGTTTCTTGTTGGTCATACATTCTCATTCACTATGCGCTTCTGGTTGCTTATGTGGCTCGTTCTTCAGACATCTTGACCAATTTTCTTGGCATTCCCTTATGGGAGAGTGCAACCTTGTTCTCTTTGATTTTTGGAGGTATATGCTACTTTGGAAGCCAACGTTTAATTGGTGCTGTCAATGGAACCCTAGTACTTGGAATCATCATTTCCTTTGCAGGTCTTGTGGCAGTCGCCAGTGGAGGCCTACATTGGGATGCTCTCGTTAAAGCTAATTTCGAAGCTGTTCCTATGAGTATACCTATAATTGCCCTTTCATTTGTTTACCAGAACGTGGTGCCAGTACTCTGTACGAATCTGGAAGGAAACCTGACAAAAGTAAGGACATCCATTGTACTTGGCACTGCAATACCTCTAGTTCTATTTCTTGTCTGGAATGGTGTCATTCTTGGCACTATTTCAAGCCTTGAAATGGGATCGGATAAGATTATAGACCCATTAGAGCAATTGAGATCAGCAAATGGAGCTGTAGGACCTATAGTAGAAGTATTCTCTCTTTTGGCGATTGCAACATCCTACATCGGATTTGTTTTGGGATTATCGGATTTTCTTGCTGACTTGCTAAAACTACCAACTGGGGAGAGCAGGCCTTTGCCATTCATTTTGACATTAATTCCACCCCTTATACTGTCATTGCTTGACCCAGAAATATTTTTCAAAGCACTGGACTTGGCTGGCACATATGGAGTGTTGCTGCTGTTTGGAATTATTCCAGCTGCAATGTCGTGGTCGGATCGGTATTCGGGTTCACCCCCGTCCGTGAAACTACCGGAGGTGGTTCCCGGAGGAAGGTTTACTCTTTCCCTCGTGATCGGAGGTGCTGGATGGGTAATTTTCTCAGAACTATTGAACAACCTTGGGCATCTA

Coding sequence (CDS)

ATGGCGATTTCTTCCTGTCTCCGGCTTACATTTCCTGTAATTCGAAGAAGCCTCGATTTGCCGAAGCAGAATGCGCCTTGCCTCGGGTCTTTCGAGTCACTTCGACCGCGTCGGCCCACGGTTCTTCGTCCAAGATTAAGATCTACGGTCACCTGCTTCTCGCGACGGCCGGCGGAGTCCACTGTCTCCGGAGAAGAGAAGGAAATCGTCCAGGAAGAAGAATCGCAACAGTACGAATTGGAGAGGCTGTTCTCCAACCTTAATCAAGTCACACTCAAGCGAGAACCTGGAAGCTTATCCAGCGCGATTTTCCTGGTGGCTGGGACGACAATTGGTGCTGGGATCCTCGCCATTCCTGCAGTAACTCAAGAATCCGGATTTCTAGCCTCAGCTATTACGTGCACGTTTTGTTGGGTGTACATGGTTGTGACAGGACTGCTAATTGCTGAAGTCAATGTGAACACCATGTGCGAACTGGGTTCTGGTGGCGTGTCTCTGGTATCGATGGCCATGAGAACTCTTGGGACAGTTGGCGTTCAAGTTTCTTGTTGGTCATACATTCTCATTCACTATGCGCTTCTGGTTGCTTATGTGGCTCGTTCTTCAGACATCTTGACCAATTTTCTTGGCATTCCCTTATGGGAGAGTGCAACCTTGTTCTCTTTGATTTTTGGAGGTATATGCTACTTTGGAAGCCAACGTTTAATTGGTGCTGTCAATGGAACCCTAGTACTTGGAATCATCATTTCCTTTGCAGGTCTTGTGGCAGTCGCCAGTGGAGGCCTACATTGGGATGCTCTCGTTAAAGCTAATTTCGAAGCTGTTCCTATGAGTATACCTATAATTGCCCTTTCATTTGTTTACCAGAACGTGGTGCCAGTACTCTGTACGAATCTGGAAGGAAACCTGACAAAAGTAAGGACATCCATTGTACTTGGCACTGCAATACCTCTAGTTCTATTTCTTGTCTGGAATGGTGTCATTCTTGGCACTATTTCAAGCCTTGAAATGGGATCGGATAAGATTATAGACCCATTAGAGCAATTGAGATCAGCAAATGGAGCTGTAGGACCTATAGTAGAAGTATTCTCTCTTTTGGCGATTGCAACATCCTACATCGGATTTGTTTTGGGATTATCGGATTTTCTTGCTGACTTGCTAAAACTACCAACTGGGGAGAGCAGGCCTTTGCCATTCATTTTGACATTAATTCCACCCCTTATACTGTCATTGCTTGACCCAGAAATATTTTTCAAAGCACTGGACTTGGCTGGCACATATGGAGTGTTGCTGCTGTTTGGAATTATTCCAGCTGCAATGTCGTGGTCGGATCGGTATTCGGGTTCACCCCCGTCCGTGAAACTACCGGAGGTGGTTCCCGGAGGAAGGTTTACTCTTTCCCTCGTGATCGGAGGTGCTGGATGGGTAATTTTCTCAGAACTATTGAACAACCTTGGGCATCTA

Protein sequence

MAISSCLRLTFPVIRRSLDLPKQNAPCLGSFESLRPRRPTVLRPRLRSTVTCFSRRPAESTVSGEEKEIVQEEESQQYELERLFSNLNQVTLKREPGSLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMVVTGLLIAEVNVNTMCELGSGGVSLVSMAMRTLGTVGVQVSCWSYILIHYALLVAYVARSSDILTNFLGIPLWESATLFSLIFGGICYFGSQRLIGAVNGTLVLGIIISFAGLVAVASGGLHWDALVKANFEAVPMSIPIIALSFVYQNVVPVLCTNLEGNLTKVRTSIVLGTAIPLVLFLVWNGVILGTISSLEMGSDKIIDPLEQLRSANGAVGPIVEVFSLLAIATSYIGFVLGLSDFLADLLKLPTGESRPLPFILTLIPPLILSLLDPEIFFKALDLAGTYGVLLLFGIIPAAMSWSDRYSGSPPSVKLPEVVPGGRFTLSLVIGGAGWVIFSELLNNLGHL
Homology
BLAST of MS023210 vs. NCBI nr
Match: XP_022147723.1 (uncharacterized protein LOC111016587 isoform X2 [Momordica charantia])

HSP 1 Score: 924.1 bits (2387), Expect = 5.1e-265
Identity = 486/488 (99.59%), Postives = 488/488 (100.00%), Query Frame = 0

Query: 1   MAISSCLRLTFPVIRRSLDLPKQNAPCLGSFESLRPRRPTVLRPRLRSTVTCFSRRPAES 60
           MAISSCLRLTFPVIRRSLDLPKQNAPCLGSFESLRPRRPTVLRPRLRSTVTCFSRRPAES
Sbjct: 1   MAISSCLRLTFPVIRRSLDLPKQNAPCLGSFESLRPRRPTVLRPRLRSTVTCFSRRPAES 60

Query: 61  TVSGEEKEIVQEEESQQYELERLFSNLNQVTLKREPGSLSSAIFLVAGTTIGAGILAIPA 120
           TVSGEEKEIVQEEESQQYELERLFSNLNQVTLKREPGSLSSAIFLVAGTTIGAGILAIPA
Sbjct: 61  TVSGEEKEIVQEEESQQYELERLFSNLNQVTLKREPGSLSSAIFLVAGTTIGAGILAIPA 120

Query: 121 VTQESGFLASAITCTFCWVYMVVTGLLIAEVNVNTMCELGSGGVSLVSMAMRTLGTVGVQ 180
           VTQESGFLASAITCTFCWVYMVVTGLLIAEVNVNTMCELGSGGVSLVSMA+RTLGTVGVQ
Sbjct: 121 VTQESGFLASAITCTFCWVYMVVTGLLIAEVNVNTMCELGSGGVSLVSMALRTLGTVGVQ 180

Query: 181 VSCWSYILIHYALLVAYVARSSDILTNFLGIPLWESATLFSLIFGGICYFGSQRLIGAVN 240
           VSCWSYILIHYALLVAYVARSSDILTNFLGIPLWESATLFSLIFGGICYFGSQRLIGAVN
Sbjct: 181 VSCWSYILIHYALLVAYVARSSDILTNFLGIPLWESATLFSLIFGGICYFGSQRLIGAVN 240

Query: 241 GTLVLGIIISFAGLVAVASGGLHWDALVKANFEAVPMSIPIIALSFVYQNVVPVLCTNLE 300
           GTLVLGIIISFAGLVAVASGGLHWDALVKANFEAVPMSIPIIALSFVYQNVVPVLCTNLE
Sbjct: 241 GTLVLGIIISFAGLVAVASGGLHWDALVKANFEAVPMSIPIIALSFVYQNVVPVLCTNLE 300

Query: 301 GNLTKVRTSIVLGTAIPLVLFLVWNGVILGTISSLEMGSDKIIDPLEQLRSANGAVGPIV 360
           GNLTKVRTSIVLGTAIPLVLFLVWNGVILGTISSLEMGSDKIIDPL+QLRSANGAVGPIV
Sbjct: 301 GNLTKVRTSIVLGTAIPLVLFLVWNGVILGTISSLEMGSDKIIDPLQQLRSANGAVGPIV 360

Query: 361 EVFSLLAIATSYIGFVLGLSDFLADLLKLPTGESRPLPFILTLIPPLILSLLDPEIFFKA 420
           EVFSLLAIATSYIGFVLGLSDFLADLLKLPTGESRPLPFILTLIPPLILSLLDPEIFFKA
Sbjct: 361 EVFSLLAIATSYIGFVLGLSDFLADLLKLPTGESRPLPFILTLIPPLILSLLDPEIFFKA 420

Query: 421 LDLAGTYGVLLLFGIIPAAMSWSDRYSGSPPSVKLPEVVPGGRFTLSLVIGGAGWVIFSE 480
           LDLAGTYGVLLLFGIIPAAMSWSDRYSGSPPSVKLPEVVPGGRFTLSLVIGGAGWVIFSE
Sbjct: 421 LDLAGTYGVLLLFGIIPAAMSWSDRYSGSPPSVKLPEVVPGGRFTLSLVIGGAGWVIFSE 480

Query: 481 LLNNLGHL 489
           LLNNLGHL
Sbjct: 481 LLNNLGHL 488

BLAST of MS023210 vs. NCBI nr
Match: XP_022147722.1 (uncharacterized protein LOC111016587 isoform X1 [Momordica charantia])

HSP 1 Score: 917.1 bits (2369), Expect = 6.2e-263
Identity = 486/495 (98.18%), Postives = 488/495 (98.59%), Query Frame = 0

Query: 1   MAISSCLRLTFPVIRRSLDLPKQNAPCLGSFESLRPRRPTVLRPRLRSTVTCFSRRPAES 60
           MAISSCLRLTFPVIRRSLDLPKQNAPCLGSFESLRPRRPTVLRPRLRSTVTCFSRRPAES
Sbjct: 1   MAISSCLRLTFPVIRRSLDLPKQNAPCLGSFESLRPRRPTVLRPRLRSTVTCFSRRPAES 60

Query: 61  TVSGEEKEIVQEEESQQYELERLFSNLNQVTLKREPGSLSSAIFLVAGTTIGAGILAIPA 120
           TVSGEEKEIVQEEESQQYELERLFSNLNQVTLKREPGSLSSAIFLVAGTTIGAGILAIPA
Sbjct: 61  TVSGEEKEIVQEEESQQYELERLFSNLNQVTLKREPGSLSSAIFLVAGTTIGAGILAIPA 120

Query: 121 VTQESGFLASAITCTFCWVYM-------VVTGLLIAEVNVNTMCELGSGGVSLVSMAMRT 180
           VTQESGFLASAITCTFCWVYM       VVTGLLIAEVNVNTMCELGSGGVSLVSMA+RT
Sbjct: 121 VTQESGFLASAITCTFCWVYMVKLYQQLVVTGLLIAEVNVNTMCELGSGGVSLVSMALRT 180

Query: 181 LGTVGVQVSCWSYILIHYALLVAYVARSSDILTNFLGIPLWESATLFSLIFGGICYFGSQ 240
           LGTVGVQVSCWSYILIHYALLVAYVARSSDILTNFLGIPLWESATLFSLIFGGICYFGSQ
Sbjct: 181 LGTVGVQVSCWSYILIHYALLVAYVARSSDILTNFLGIPLWESATLFSLIFGGICYFGSQ 240

Query: 241 RLIGAVNGTLVLGIIISFAGLVAVASGGLHWDALVKANFEAVPMSIPIIALSFVYQNVVP 300
           RLIGAVNGTLVLGIIISFAGLVAVASGGLHWDALVKANFEAVPMSIPIIALSFVYQNVVP
Sbjct: 241 RLIGAVNGTLVLGIIISFAGLVAVASGGLHWDALVKANFEAVPMSIPIIALSFVYQNVVP 300

Query: 301 VLCTNLEGNLTKVRTSIVLGTAIPLVLFLVWNGVILGTISSLEMGSDKIIDPLEQLRSAN 360
           VLCTNLEGNLTKVRTSIVLGTAIPLVLFLVWNGVILGTISSLEMGSDKIIDPL+QLRSAN
Sbjct: 301 VLCTNLEGNLTKVRTSIVLGTAIPLVLFLVWNGVILGTISSLEMGSDKIIDPLQQLRSAN 360

Query: 361 GAVGPIVEVFSLLAIATSYIGFVLGLSDFLADLLKLPTGESRPLPFILTLIPPLILSLLD 420
           GAVGPIVEVFSLLAIATSYIGFVLGLSDFLADLLKLPTGESRPLPFILTLIPPLILSLLD
Sbjct: 361 GAVGPIVEVFSLLAIATSYIGFVLGLSDFLADLLKLPTGESRPLPFILTLIPPLILSLLD 420

Query: 421 PEIFFKALDLAGTYGVLLLFGIIPAAMSWSDRYSGSPPSVKLPEVVPGGRFTLSLVIGGA 480
           PEIFFKALDLAGTYGVLLLFGIIPAAMSWSDRYSGSPPSVKLPEVVPGGRFTLSLVIGGA
Sbjct: 421 PEIFFKALDLAGTYGVLLLFGIIPAAMSWSDRYSGSPPSVKLPEVVPGGRFTLSLVIGGA 480

Query: 481 GWVIFSELLNNLGHL 489
           GWVIFSELLNNLGHL
Sbjct: 481 GWVIFSELLNNLGHL 495

BLAST of MS023210 vs. NCBI nr
Match: XP_023530389.1 (uncharacterized protein LOC111792980 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 817.8 bits (2111), Expect = 5.1e-233
Identity = 430/492 (87.40%), Postives = 460/492 (93.50%), Query Frame = 0

Query: 1   MAISSCLRLTFPVI---RRSLDLPKQNAPCLGS-FESLRPRRPTVLRPRLRSTVTCFSRR 60
           M+ISSCLRL FP +   RRSLDL ++N  CL S FESLR RR ++LR RLR+  T FSRR
Sbjct: 1   MSISSCLRLPFPAVQSARRSLDLSQRNFSCLRSTFESLRLRRHSLLRTRLRTISTSFSRR 60

Query: 61  PAESTVSGEEKEIVQEEESQQYELERLFSNLNQVTLKREPGSLSSAIFLVAGTTIGAGIL 120
           P ES+ SG+EKEI +++ES++YELERLFSNLNQ T KREPGSLSSAIFLVAGTTIGAGIL
Sbjct: 61  PVESSASGQEKEIDKKDESEKYELERLFSNLNQATFKREPGSLSSAIFLVAGTTIGAGIL 120

Query: 121 AIPAVTQESGFLASAITCTFCWVYMVVTGLLIAEVNVNTMCELGSGGVSLVSMAMRTLGT 180
           AIPAVTQESGFLASAITCTFCW YMVVTGLLIAEVNVNTMCELGSGGVSLVSMAMRTLGT
Sbjct: 121 AIPAVTQESGFLASAITCTFCWAYMVVTGLLIAEVNVNTMCELGSGGVSLVSMAMRTLGT 180

Query: 181 VGVQVSCWSYILIHYALLVAYVARSSDILTNFLGIPLWESATLFSLIFGGICYFGSQRLI 240
           VGVQVSCWSYILIHYALL+AYVARSSDILT FLGIPLWESATLFSLIFGGICYFGSQR I
Sbjct: 181 VGVQVSCWSYILIHYALLIAYVARSSDILTTFLGIPLWESATLFSLIFGGICYFGSQRSI 240

Query: 241 GAVNGTLVLGIIISFAGLVAVASGGLHWDALVKANFEAVPMSIPIIALSFVYQNVVPVLC 300
           GA+NG LV+GIIISF GLVAVASGGLHWDAL+KANFEAVPMSIPIIALSFVYQNVVPVLC
Sbjct: 241 GAINGALVIGIIISFVGLVAVASGGLHWDALLKANFEAVPMSIPIIALSFVYQNVVPVLC 300

Query: 301 TNLEGNLTKVRTSIVLGTAIPLVLFLVWNGVILGTISSLEMGSDKIIDPLEQLRSANGAV 360
           TNLEGNL KVRTSIV+GTAIPLVLFLVWNGVILGTIS+L+MGSDKI+DPL+QLRS NGAV
Sbjct: 301 TNLEGNLAKVRTSIVIGTAIPLVLFLVWNGVILGTISNLDMGSDKILDPLQQLRSTNGAV 360

Query: 361 GPIVEVFSLLAIATSYIGFVLGLSDFLADLLKLPTGESRPLPFILTLIPPLILSLLDPEI 420
           GPIVEVFSLLAIATSYIGFVLGLSDFLADLLKLP+GE++PLPFILTL+PPLILSL+DPEI
Sbjct: 361 GPIVEVFSLLAIATSYIGFVLGLSDFLADLLKLPSGENKPLPFILTLVPPLILSLIDPEI 420

Query: 421 FFKALDLAGTYGVLLLFGIIPAAMSWSDRYSGSPPSVKLPEVVPGGRFTLSLVIGGAGWV 480
           FFK+LD+AGTYGVLLLFGIIPAAMSWSDRYS SPPSVKLP VVPGGRFTLSLVIGGAGWV
Sbjct: 421 FFKSLDVAGTYGVLLLFGIIPAAMSWSDRYSRSPPSVKLPTVVPGGRFTLSLVIGGAGWV 480

Query: 481 IFSELLNNLGHL 489
           IFSELL N GHL
Sbjct: 481 IFSELLENFGHL 492

BLAST of MS023210 vs. NCBI nr
Match: XP_022967292.1 (uncharacterized protein LOC111466855 isoform X1 [Cucurbita maxima])

HSP 1 Score: 815.1 bits (2104), Expect = 3.3e-232
Identity = 431/492 (87.60%), Postives = 459/492 (93.29%), Query Frame = 0

Query: 1   MAISSCLRLTFPVI---RRSLDLPKQNAPCLGS-FESLRPRRPTVLRPRLRSTVTCFSRR 60
           M+ISSCLRL FP +   RRSL   ++N  CL S  ESLR R  ++LR RLR+  T FSRR
Sbjct: 1   MSISSCLRLPFPAVQSARRSLGFSQRNVSCLRSTVESLRLRPYSLLRTRLRTMSTSFSRR 60

Query: 61  PAESTVSGEEKEIVQEEESQQYELERLFSNLNQVTLKREPGSLSSAIFLVAGTTIGAGIL 120
           P ES+VSG+EKEI +EEES++YELERLFSNLNQVT KREPGSLSSAIFLVAGTTIGAGIL
Sbjct: 61  PVESSVSGQEKEIDKEEESEKYELERLFSNLNQVTFKREPGSLSSAIFLVAGTTIGAGIL 120

Query: 121 AIPAVTQESGFLASAITCTFCWVYMVVTGLLIAEVNVNTMCELGSGGVSLVSMAMRTLGT 180
           AIPAVTQESGFLASAITCTFCWVYMVVTGLLIAEVNVNTMCELGSGGVSLVSMAMRTLGT
Sbjct: 121 AIPAVTQESGFLASAITCTFCWVYMVVTGLLIAEVNVNTMCELGSGGVSLVSMAMRTLGT 180

Query: 181 VGVQVSCWSYILIHYALLVAYVARSSDILTNFLGIPLWESATLFSLIFGGICYFGSQRLI 240
           VGVQVSCWSYILIHYALL+AYVARSS+ILT FLGIPLWESATLFSLIFGGICYFGSQR I
Sbjct: 181 VGVQVSCWSYILIHYALLIAYVARSSNILTTFLGIPLWESATLFSLIFGGICYFGSQRSI 240

Query: 241 GAVNGTLVLGIIISFAGLVAVASGGLHWDALVKANFEAVPMSIPIIALSFVYQNVVPVLC 300
           GA+NG LV+GIIISF GLVAVASGGLHWDAL+KANFEAVPMSIPIIALSFVYQNVVPVLC
Sbjct: 241 GAINGALVIGIIISFVGLVAVASGGLHWDALLKANFEAVPMSIPIIALSFVYQNVVPVLC 300

Query: 301 TNLEGNLTKVRTSIVLGTAIPLVLFLVWNGVILGTISSLEMGSDKIIDPLEQLRSANGAV 360
           TNLEGNL KVRTSIV+GTAIPLVLFLVWNGVILGTIS+L+MGSDKI+DPL+QLRS NGAV
Sbjct: 301 TNLEGNLAKVRTSIVIGTAIPLVLFLVWNGVILGTISNLDMGSDKILDPLQQLRSTNGAV 360

Query: 361 GPIVEVFSLLAIATSYIGFVLGLSDFLADLLKLPTGESRPLPFILTLIPPLILSLLDPEI 420
           GPIVEVFSLLAIATSYIGFVLGLSDFLADLLKLP+GE+RPLPFILTL+PPLILSL+DPEI
Sbjct: 361 GPIVEVFSLLAIATSYIGFVLGLSDFLADLLKLPSGENRPLPFILTLVPPLILSLIDPEI 420

Query: 421 FFKALDLAGTYGVLLLFGIIPAAMSWSDRYSGSPPSVKLPEVVPGGRFTLSLVIGGAGWV 480
           FFK+LD+AGTYGVLLLFGIIPAAMSWSDRYS SPPSVKLP VVPGGRFTLSLVIGGAGWV
Sbjct: 421 FFKSLDVAGTYGVLLLFGIIPAAMSWSDRYSRSPPSVKLPTVVPGGRFTLSLVIGGAGWV 480

Query: 481 IFSELLNNLGHL 489
           IFSELL N GHL
Sbjct: 481 IFSELLENFGHL 492

BLAST of MS023210 vs. NCBI nr
Match: KAA0055689.1 (tyrosine-specific transport protein [Cucumis melo var. makuwa] >TYK09938.1 tyrosine-specific transport protein [Cucumis melo var. makuwa])

HSP 1 Score: 814.7 bits (2103), Expect = 4.3e-232
Identity = 430/492 (87.40%), Postives = 460/492 (93.50%), Query Frame = 0

Query: 1   MAISSCLRLTFPVI---RRSLDLPKQNAPCLGSFESLRPRRPTVLR-PRLRSTVTCFSRR 60
           M+ISS LRL FPVI   RRS++L  QN  CL S      +RP  L+ PRL++  TCFSRR
Sbjct: 1   MSISSSLRLPFPVIQSKRRSINLSHQNVTCLWSTNKSFQQRPLKLQLPRLKAVSTCFSRR 60

Query: 61  PAESTVSGEEKEIVQEEESQQYELERLFSNLNQVTLKREPGSLSSAIFLVAGTTIGAGIL 120
           P +S+VSG+EK+I +E ES++Y LERLFSNLNQVT KREPGSLSSAIFLVAGTTIGAGIL
Sbjct: 61  PVKSSVSGQEKKIDKEVESEEYVLERLFSNLNQVTFKREPGSLSSAIFLVAGTTIGAGIL 120

Query: 121 AIPAVTQESGFLASAITCTFCWVYMVVTGLLIAEVNVNTMCELGSGGVSLVSMAMRTLGT 180
           AIPAVTQESGFLASAITCT CWVYMVVTGLLIAEVNVNTMCELGSGGVSLVSMAMRTLGT
Sbjct: 121 AIPAVTQESGFLASAITCTCCWVYMVVTGLLIAEVNVNTMCELGSGGVSLVSMAMRTLGT 180

Query: 181 VGVQVSCWSYILIHYALLVAYVARSSDILTNFLGIPLWESATLFSLIFGGICYFGSQRLI 240
           VGVQVSCWSYILIHYALLVAYVARSSDILTNFLGIPLWESATLFSLIFGG+CYFGSQR+I
Sbjct: 181 VGVQVSCWSYILIHYALLVAYVARSSDILTNFLGIPLWESATLFSLIFGGVCYFGSQRVI 240

Query: 241 GAVNGTLVLGIIISFAGLVAVASGGLHWDALVKANFEAVPMSIPIIALSFVYQNVVPVLC 300
           GA+NG LVLGII+SFAGLVAVASGGLHWDALV+ANFEAVP+SIPIIALSFVYQNVVPVLC
Sbjct: 241 GAINGALVLGIIVSFAGLVAVASGGLHWDALVRANFEAVPLSIPIIALSFVYQNVVPVLC 300

Query: 301 TNLEGNLTKVRTSIVLGTAIPLVLFLVWNGVILGTISSLEMGSDKIIDPLEQLRSANGAV 360
           TNLEGNLTKVRTSIVLGTAIPLVLFLVWNGVILGTIS+LEMGSDKI+DPL+QLRS NGAV
Sbjct: 301 TNLEGNLTKVRTSIVLGTAIPLVLFLVWNGVILGTISNLEMGSDKILDPLQQLRSTNGAV 360

Query: 361 GPIVEVFSLLAIATSYIGFVLGLSDFLADLLKLPTGESRPLPFILTLIPPLILSLLDPEI 420
           GPIVEVFSL+AIATSYIGFVLGLSDFLADLLKLP+GES+PLPF+LTL+PPLILSLLDPEI
Sbjct: 361 GPIVEVFSLMAIATSYIGFVLGLSDFLADLLKLPSGESKPLPFLLTLVPPLILSLLDPEI 420

Query: 421 FFKALDLAGTYGVLLLFGIIPAAMSWSDRYSGSPPSVKLPEVVPGGRFTLSLVIGGAGWV 480
           FFK+LDLAGTYGVLLLFGIIPAAMSWSDRYS  PPSVKLPEVVPGGRFTL+LVIGGAGWV
Sbjct: 421 FFKSLDLAGTYGVLLLFGIIPAAMSWSDRYSKPPPSVKLPEVVPGGRFTLALVIGGAGWV 480

Query: 481 IFSELLNNLGHL 489
           IFSELL NLGHL
Sbjct: 481 IFSELLENLGHL 492

BLAST of MS023210 vs. ExPASy Swiss-Prot
Match: P0AAD4 (Tyrosine-specific transport protein OS=Escherichia coli (strain K12) OX=83333 GN=tyrP PE=1 SV=1)

HSP 1 Score: 139.8 bits (351), Expect = 8.1e-32
Identity = 125/413 (30.27%), Postives = 205/413 (49.64%), Query Frame = 0

Query: 102 AIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMVVTGLLIAEVNVNTMCELGS 161
           ++F+VAGTTIGAG+LA+P      GF  + I     W  M  T LL+ EV  +   + G 
Sbjct: 8   SVFIVAGTTIGAGMLAMPLAAAGVGFSVTLILLIGLWALMCYTALLLLEVYQHVPADTGL 67

Query: 162 GGVSLVSMAMRTLGTVGVQVSCWSYILIHYALLVAYVARSSDIL----TNFLGIPLWESA 221
           G     ++A R LG  G  ++ +S + + YAL  AY++ + ++L    +++ GI +  +A
Sbjct: 68  G-----TLAKRYLGRYGQWLTGFSMMFLMYALTAAYISGAGELLASSISDWTGISMSATA 127

Query: 222 --TLFSLIFGGICYFGSQRLIGAVNGTLVLGIIISFAGLVAVASGGLHWDALVKANFEAV 281
              LF+ + GG+   G+  L+   N  L    II    ++ +    +H     K N   +
Sbjct: 128 GVLLFTFVAGGVVCVGTS-LVDLFNRFLFSAKIIFLVVMLVLLLPHIH-----KVNLLTL 187

Query: 282 PM-------SIPIIALSFVYQNVVPVLCTNLEGNLTKVRTSIVLGTAIPLVLFLVWNGVI 341
           P+       +IP+I  SF +   VP + + ++GN+ K+R   ++G+AIPLV ++ W    
Sbjct: 188 PLQQGLALSAIPVIFTSFGFHGSVPSIVSYMDGNIRKLRWVFIIGSAIPLVAYIFWQVAT 247

Query: 342 LGTISSL--------EMGSDKIIDPLEQLRSANGAVGPIVEVFSLLAIATSYIGFVLGLS 401
           LG+I S           G + ++  L ++  A+  V   V +F+ LA+ATS++G  LGL 
Sbjct: 248 LGSIDSTTFMGLLANHAGLNGLLQALREM-VASPHVELAVHLFADLALATSFLGVALGLF 307

Query: 402 DFLADLL-KLPTGESRPLPFILTLIPPLILSLLDPEIFFKALDLAGTYGVLLLFGIIPAA 461
           D+LADL  +  T   R     +T +PPL  +L  P  F  AL  AG   + +L  IIP+ 
Sbjct: 308 DYLADLFQRSNTVGGRLQTGAITFLPPLAFALFYPRGFVMALGYAGV-ALAVLALIIPSL 367

Query: 462 MSWSDRYSGSPPSVKLPEVVPGGR------FTLSLVIGGAGWVIFSELLNNLG 487
           ++W  R        +    V GGR      F   + + G  ++I + LL  +G
Sbjct: 368 LTWQSRKHNPQAGYR----VKGGRPALVVVFLCGIAVIGVQFLIAAGLLPEVG 403

BLAST of MS023210 vs. ExPASy Swiss-Prot
Match: P0AAD5 (Tyrosine-specific transport protein OS=Shigella flexneri OX=623 GN=tyrP PE=3 SV=1)

HSP 1 Score: 139.8 bits (351), Expect = 8.1e-32
Identity = 125/413 (30.27%), Postives = 205/413 (49.64%), Query Frame = 0

Query: 102 AIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMVVTGLLIAEVNVNTMCELGS 161
           ++F+VAGTTIGAG+LA+P      GF  + I     W  M  T LL+ EV  +   + G 
Sbjct: 8   SVFIVAGTTIGAGMLAMPLAAAGVGFSVTLILLIGLWALMCYTALLLLEVYQHVPADTGL 67

Query: 162 GGVSLVSMAMRTLGTVGVQVSCWSYILIHYALLVAYVARSSDIL----TNFLGIPLWESA 221
           G     ++A R LG  G  ++ +S + + YAL  AY++ + ++L    +++ GI +  +A
Sbjct: 68  G-----TLAKRYLGRYGQWLTGFSMMFLMYALTAAYISGAGELLASSISDWTGISMSATA 127

Query: 222 --TLFSLIFGGICYFGSQRLIGAVNGTLVLGIIISFAGLVAVASGGLHWDALVKANFEAV 281
              LF+ + GG+   G+  L+   N  L    II    ++ +    +H     K N   +
Sbjct: 128 GVLLFTFVAGGVVCVGTS-LVDLFNRFLFSAKIIFLVVMLVLLLPHIH-----KVNLLTL 187

Query: 282 PM-------SIPIIALSFVYQNVVPVLCTNLEGNLTKVRTSIVLGTAIPLVLFLVWNGVI 341
           P+       +IP+I  SF +   VP + + ++GN+ K+R   ++G+AIPLV ++ W    
Sbjct: 188 PLQQGLALSAIPVIFTSFGFHGSVPSIVSYMDGNIRKLRWVFIIGSAIPLVAYIFWQVAT 247

Query: 342 LGTISSL--------EMGSDKIIDPLEQLRSANGAVGPIVEVFSLLAIATSYIGFVLGLS 401
           LG+I S           G + ++  L ++  A+  V   V +F+ LA+ATS++G  LGL 
Sbjct: 248 LGSIDSTTFMGLLANHAGLNGLLQALREM-VASPHVELAVHLFADLALATSFLGVALGLF 307

Query: 402 DFLADLL-KLPTGESRPLPFILTLIPPLILSLLDPEIFFKALDLAGTYGVLLLFGIIPAA 461
           D+LADL  +  T   R     +T +PPL  +L  P  F  AL  AG   + +L  IIP+ 
Sbjct: 308 DYLADLFQRSNTVGGRLQTGAITFLPPLAFALFYPRGFVMALGYAGV-ALAVLALIIPSL 367

Query: 462 MSWSDRYSGSPPSVKLPEVVPGGR------FTLSLVIGGAGWVIFSELLNNLG 487
           ++W  R        +    V GGR      F   + + G  ++I + LL  +G
Sbjct: 368 LTWQSRKHNPQAGYR----VKGGRPALVVVFLCGIAVIGVQFLIAAGLLPEVG 403

BLAST of MS023210 vs. ExPASy Swiss-Prot
Match: P44747 (Tyrosine-specific transport protein 2 OS=Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) OX=71421 GN=tyrP-B PE=3 SV=1)

HSP 1 Score: 124.8 bits (312), Expect = 2.7e-27
Identity = 117/390 (30.00%), Postives = 192/390 (49.23%), Query Frame = 0

Query: 105 LVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMVVTGLLIAEVNVNTMCELGSGGV 164
           ++AGTTIGAG+LA+P  +   GF  + +     W  +V +GLL  EV   T  +L  G  
Sbjct: 12  IIAGTTIGAGMLAMPLTSAGMGFGYTLLLLVGLWALLVYSGLLFVEV-YQTADQLDDG-- 71

Query: 165 SLVSMAMRTLGTVGVQVSCWSYILIHYALLVAYVARSSDILTNFLGIP---------LWE 224
            + ++A +  G  G   +  S +++ YAL  AY+     +L+   G+P         L  
Sbjct: 72  -VATLAEKYFGVPGRIFATLSLLVLLYALSAAYITGGGSLLS---GLPTAFGMEAMSLKT 131

Query: 225 SATLFSLIFGGICYFGSQRLIGAVNGTLVLGIIISFAGLVAVASGGLHWDAL--VKANFE 284
           +  +F+++ G     G++ + G +   L +G +I+FA ++ +    +  D L  +  ++ 
Sbjct: 132 AIIIFTVVLGSFVVVGTKGVDG-LTRVLFIGKLIAFAFVLFMMLPKVATDNLMALPLDYA 191

Query: 285 AVPMSIPIIALSFVYQNVVPVLCTNLEGNLTKVRTSIVLGTAIPLVLFLVWNGVILGTIS 344
            V  + PI   SF +  ++  + + L G++ K R +I++GTAIPL  +LVW     G +S
Sbjct: 192 FVVSAAPIFLTSFGFHVIMASVNSYLGGSVDKFRRAILIGTAIPLAAYLVWQLATHGVLS 251

Query: 345 SLEMGSDKIIDP-----LEQLRSANGA--VGPIVEVFSLLAIATSYIGFVLGLSDFLADL 404
             E       DP     +   R   G+  +G +V VFS LA+ TS++G +LG+ + L DL
Sbjct: 252 QSEFVRILQADPTLNGLVNATREITGSHFMGEVVRVFSSLALITSFLGVMLGVFEGLGDL 311

Query: 405 LK---LPTGESRPLPFILTL---IPPLILSLLDPEIFFKALDLAGTYGVLLLFGIIPAAM 464
            K   LP        F+LT+   +PPL+ +L  PE F  AL  AG         I+P ++
Sbjct: 312 FKRYHLPNNR-----FVLTIAAFLPPLVFALFYPEGFITALSYAGLLCAFYCL-ILPISL 371

Query: 465 SWSDRYSGSPPSVKLPEVVPGGRFTLSLVI 471
           +W  R      +  LP  V GG F L L +
Sbjct: 372 AWRTRIE----NPTLPYRVAGGNFALVLAL 383

BLAST of MS023210 vs. ExPASy Swiss-Prot
Match: P44727 (Tyrosine-specific transport protein 1 OS=Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) OX=71421 GN=tyrP-A PE=3 SV=1)

HSP 1 Score: 115.2 bits (287), Expect = 2.1e-24
Identity = 105/379 (27.70%), Postives = 182/379 (48.02%), Query Frame = 0

Query: 105 LVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMVVTGLLIAEVNVNTMCELGSGGV 164
           LVAGT IGAG+LA+P  +   GF  + +     W  +  + LL  E+      + G G  
Sbjct: 10  LVAGTMIGAGMLAMPLTSAGIGFGFTLVLLLGLWALLTFSALLFVELYQTAESDAGIG-- 69

Query: 165 SLVSMAMRTLGTVGVQVSCWSYILIHYALLVAYVARSSDILTNFLGIPLWE--SATLFSL 224
              ++A +  G  G  ++    I+  YAL+ AY++    +L + L     +  S  LF++
Sbjct: 70  ---TLAEQYFGKTGRIIATAVLIIFLYALIAAYISGGGSLLKDLLPESFGDKVSVLLFTV 129

Query: 225 IFGGICYFGSQRLIGAVNGTLVLGIIISFAGLVAVASGGLHWDALVKANFE--AVPMSIP 284
           IFG     G+   +  +N  L   ++ +FA ++++    + +D L+    +   +  + P
Sbjct: 130 IFGSFIVIGTHS-VDKINRVLFFVMLAAFAVVLSLMLPEIKFDNLMATPIDKALIISASP 189

Query: 285 IIALSFVYQNVVPVLCTNLEGNLTKVRTSIVLGTAIPLVLFLVWNGVILGTISSLE---- 344
           +   +F +   +P L   L+GN+  +R SI++G+AI L  +++W     G ++  E    
Sbjct: 190 VFFTAFGFHGSIPSLNKYLDGNVKALRFSILVGSAITLCAYILWQLSTHGLLTQNEFLQI 249

Query: 345 MGSDKIIDPLEQLRSA---NGAVGPIVEVFSLLAIATSYIGFVLGLSDFLADLLK--LPT 404
           +  D  ++ L +   A   +  +   V++FS LA+ TS++G  LGL + + DLLK     
Sbjct: 250 LKEDATLNGLVKATFAITGSNVIASAVKLFSTLALITSFLGVGLGLLECIEDLLKRSFNV 309

Query: 405 GESRPLPFILTLIPPLILSLLDPEIFFKALDLAGTYGVLLLFGIIPAAMSWSDRYSGSPP 464
              R    +LT IPPL+ +L  PE F  AL  AG         ++P ++ W  R +    
Sbjct: 310 TAGRISLGLLTFIPPLVFALFYPEGFILALGYAGQMFAFYAV-VLPVSLVWKARRA---- 369

Query: 465 SVKLPEVVPGGRFTLSLVI 471
              LP  V GG  TL +V+
Sbjct: 370 HANLPYKVWGGNLTLIIVL 377

BLAST of MS023210 vs. ExPASy Swiss-Prot
Match: P44614 (Tryptophan-specific transport protein OS=Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) OX=71421 GN=mtr PE=3 SV=1)

HSP 1 Score: 107.1 bits (266), Expect = 5.8e-22
Identity = 108/423 (25.53%), Postives = 191/423 (45.15%), Query Frame = 0

Query: 92  LKREPGSLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMVVTGLLIAEV 151
           ++++  SL     ++AGT IGAG+LA P  T    F+ S +   + W  M  +GL+I E 
Sbjct: 2   IQQKSPSLLGGAMIIAGTAIGAGMLANPTSTAGVWFIGSILALIYTWFCMTTSGLMILEA 61

Query: 152 NVNTMCELGSGGVSLVSMAMRTLGTVGVQVSCWSYILIHYALLVAYVARSSDILTNFLG- 211
           N++        G S  ++    LG     ++  S   + Y L  AY+     I  N L  
Sbjct: 62  NLHY-----PTGSSFDTIVKDLLGKSWNTINGLSVAFVLYILTYAYITSGGGITQNLLNQ 121

Query: 212 ----------IPLWESATLFSLIFGGICYFGSQRLIGAVNGTLVLGIIISF----AGLVA 271
                     I     + +F LI     +  S + +      L++G++++F     GL++
Sbjct: 122 AFSSAESAVDIGRTSGSLIFCLILAAFVWL-STKAVDRFTTVLIVGMVVAFFLSTTGLLS 181

Query: 272 VASGGLHWDALVKANFEAVP---MSIPIIALSFVYQNVVPVLCTNLEGNLTKVRTSIVLG 331
                + ++ + ++    +P    ++P+  +SF +   VP L    + +  +V  SI +G
Sbjct: 182 SVKTAVLFNTVAESEQTYLPYLLTALPVCLVSFGFHGNVPSLVKYYDRDGRRVMKSIFIG 241

Query: 332 TAIPLVLFLVWNGVILGTISSLEMGSDKIIDPLEQLRSANGAVGPIVEV---------FS 391
           T + LV++++W   + G +   E     +I+    + +   A+   +EV         F+
Sbjct: 242 TGLALVIYILWQLAVQGNLPRTEFA--PVIEKGGDVSALLEALHKYIEVEYLSVALNFFA 301

Query: 392 LLAIATSYIGFVLGLSDFLADLLKLPTG-ESRPLPFILTLIPPLILSLLDPEIFFKALDL 451
            +AI+TS++G  LGL D++ADL K       R    ++T +PPL+LSL  P  F  A+  
Sbjct: 302 YMAISTSFLGVTLGLFDYIADLFKFDDSLLGRTKTTLVTFLPPLLLSLQFPYGFVIAIGY 361

Query: 452 AGTYGVLLLFGIIPAAMSWSDRYSGSPPSVKLPEVVPGGRFTLSLVIGGAGWVIFSELLN 487
           AG     +   I+PA ++ + R      S K    V GG F +  VI      I +++  
Sbjct: 362 AG-LAATIWAAIVPALLAKASRQKFPQASYK----VYGGNFMIGFVILFGILNIVAQIGA 411

BLAST of MS023210 vs. ExPASy TrEMBL
Match: A0A6J1D230 (uncharacterized protein LOC111016587 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111016587 PE=4 SV=1)

HSP 1 Score: 924.1 bits (2387), Expect = 2.5e-265
Identity = 486/488 (99.59%), Postives = 488/488 (100.00%), Query Frame = 0

Query: 1   MAISSCLRLTFPVIRRSLDLPKQNAPCLGSFESLRPRRPTVLRPRLRSTVTCFSRRPAES 60
           MAISSCLRLTFPVIRRSLDLPKQNAPCLGSFESLRPRRPTVLRPRLRSTVTCFSRRPAES
Sbjct: 1   MAISSCLRLTFPVIRRSLDLPKQNAPCLGSFESLRPRRPTVLRPRLRSTVTCFSRRPAES 60

Query: 61  TVSGEEKEIVQEEESQQYELERLFSNLNQVTLKREPGSLSSAIFLVAGTTIGAGILAIPA 120
           TVSGEEKEIVQEEESQQYELERLFSNLNQVTLKREPGSLSSAIFLVAGTTIGAGILAIPA
Sbjct: 61  TVSGEEKEIVQEEESQQYELERLFSNLNQVTLKREPGSLSSAIFLVAGTTIGAGILAIPA 120

Query: 121 VTQESGFLASAITCTFCWVYMVVTGLLIAEVNVNTMCELGSGGVSLVSMAMRTLGTVGVQ 180
           VTQESGFLASAITCTFCWVYMVVTGLLIAEVNVNTMCELGSGGVSLVSMA+RTLGTVGVQ
Sbjct: 121 VTQESGFLASAITCTFCWVYMVVTGLLIAEVNVNTMCELGSGGVSLVSMALRTLGTVGVQ 180

Query: 181 VSCWSYILIHYALLVAYVARSSDILTNFLGIPLWESATLFSLIFGGICYFGSQRLIGAVN 240
           VSCWSYILIHYALLVAYVARSSDILTNFLGIPLWESATLFSLIFGGICYFGSQRLIGAVN
Sbjct: 181 VSCWSYILIHYALLVAYVARSSDILTNFLGIPLWESATLFSLIFGGICYFGSQRLIGAVN 240

Query: 241 GTLVLGIIISFAGLVAVASGGLHWDALVKANFEAVPMSIPIIALSFVYQNVVPVLCTNLE 300
           GTLVLGIIISFAGLVAVASGGLHWDALVKANFEAVPMSIPIIALSFVYQNVVPVLCTNLE
Sbjct: 241 GTLVLGIIISFAGLVAVASGGLHWDALVKANFEAVPMSIPIIALSFVYQNVVPVLCTNLE 300

Query: 301 GNLTKVRTSIVLGTAIPLVLFLVWNGVILGTISSLEMGSDKIIDPLEQLRSANGAVGPIV 360
           GNLTKVRTSIVLGTAIPLVLFLVWNGVILGTISSLEMGSDKIIDPL+QLRSANGAVGPIV
Sbjct: 301 GNLTKVRTSIVLGTAIPLVLFLVWNGVILGTISSLEMGSDKIIDPLQQLRSANGAVGPIV 360

Query: 361 EVFSLLAIATSYIGFVLGLSDFLADLLKLPTGESRPLPFILTLIPPLILSLLDPEIFFKA 420
           EVFSLLAIATSYIGFVLGLSDFLADLLKLPTGESRPLPFILTLIPPLILSLLDPEIFFKA
Sbjct: 361 EVFSLLAIATSYIGFVLGLSDFLADLLKLPTGESRPLPFILTLIPPLILSLLDPEIFFKA 420

Query: 421 LDLAGTYGVLLLFGIIPAAMSWSDRYSGSPPSVKLPEVVPGGRFTLSLVIGGAGWVIFSE 480
           LDLAGTYGVLLLFGIIPAAMSWSDRYSGSPPSVKLPEVVPGGRFTLSLVIGGAGWVIFSE
Sbjct: 421 LDLAGTYGVLLLFGIIPAAMSWSDRYSGSPPSVKLPEVVPGGRFTLSLVIGGAGWVIFSE 480

Query: 481 LLNNLGHL 489
           LLNNLGHL
Sbjct: 481 LLNNLGHL 488

BLAST of MS023210 vs. ExPASy TrEMBL
Match: A0A6J1D360 (uncharacterized protein LOC111016587 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111016587 PE=4 SV=1)

HSP 1 Score: 917.1 bits (2369), Expect = 3.0e-263
Identity = 486/495 (98.18%), Postives = 488/495 (98.59%), Query Frame = 0

Query: 1   MAISSCLRLTFPVIRRSLDLPKQNAPCLGSFESLRPRRPTVLRPRLRSTVTCFSRRPAES 60
           MAISSCLRLTFPVIRRSLDLPKQNAPCLGSFESLRPRRPTVLRPRLRSTVTCFSRRPAES
Sbjct: 1   MAISSCLRLTFPVIRRSLDLPKQNAPCLGSFESLRPRRPTVLRPRLRSTVTCFSRRPAES 60

Query: 61  TVSGEEKEIVQEEESQQYELERLFSNLNQVTLKREPGSLSSAIFLVAGTTIGAGILAIPA 120
           TVSGEEKEIVQEEESQQYELERLFSNLNQVTLKREPGSLSSAIFLVAGTTIGAGILAIPA
Sbjct: 61  TVSGEEKEIVQEEESQQYELERLFSNLNQVTLKREPGSLSSAIFLVAGTTIGAGILAIPA 120

Query: 121 VTQESGFLASAITCTFCWVYM-------VVTGLLIAEVNVNTMCELGSGGVSLVSMAMRT 180
           VTQESGFLASAITCTFCWVYM       VVTGLLIAEVNVNTMCELGSGGVSLVSMA+RT
Sbjct: 121 VTQESGFLASAITCTFCWVYMVKLYQQLVVTGLLIAEVNVNTMCELGSGGVSLVSMALRT 180

Query: 181 LGTVGVQVSCWSYILIHYALLVAYVARSSDILTNFLGIPLWESATLFSLIFGGICYFGSQ 240
           LGTVGVQVSCWSYILIHYALLVAYVARSSDILTNFLGIPLWESATLFSLIFGGICYFGSQ
Sbjct: 181 LGTVGVQVSCWSYILIHYALLVAYVARSSDILTNFLGIPLWESATLFSLIFGGICYFGSQ 240

Query: 241 RLIGAVNGTLVLGIIISFAGLVAVASGGLHWDALVKANFEAVPMSIPIIALSFVYQNVVP 300
           RLIGAVNGTLVLGIIISFAGLVAVASGGLHWDALVKANFEAVPMSIPIIALSFVYQNVVP
Sbjct: 241 RLIGAVNGTLVLGIIISFAGLVAVASGGLHWDALVKANFEAVPMSIPIIALSFVYQNVVP 300

Query: 301 VLCTNLEGNLTKVRTSIVLGTAIPLVLFLVWNGVILGTISSLEMGSDKIIDPLEQLRSAN 360
           VLCTNLEGNLTKVRTSIVLGTAIPLVLFLVWNGVILGTISSLEMGSDKIIDPL+QLRSAN
Sbjct: 301 VLCTNLEGNLTKVRTSIVLGTAIPLVLFLVWNGVILGTISSLEMGSDKIIDPLQQLRSAN 360

Query: 361 GAVGPIVEVFSLLAIATSYIGFVLGLSDFLADLLKLPTGESRPLPFILTLIPPLILSLLD 420
           GAVGPIVEVFSLLAIATSYIGFVLGLSDFLADLLKLPTGESRPLPFILTLIPPLILSLLD
Sbjct: 361 GAVGPIVEVFSLLAIATSYIGFVLGLSDFLADLLKLPTGESRPLPFILTLIPPLILSLLD 420

Query: 421 PEIFFKALDLAGTYGVLLLFGIIPAAMSWSDRYSGSPPSVKLPEVVPGGRFTLSLVIGGA 480
           PEIFFKALDLAGTYGVLLLFGIIPAAMSWSDRYSGSPPSVKLPEVVPGGRFTLSLVIGGA
Sbjct: 421 PEIFFKALDLAGTYGVLLLFGIIPAAMSWSDRYSGSPPSVKLPEVVPGGRFTLSLVIGGA 480

Query: 481 GWVIFSELLNNLGHL 489
           GWVIFSELLNNLGHL
Sbjct: 481 GWVIFSELLNNLGHL 495

BLAST of MS023210 vs. ExPASy TrEMBL
Match: A0A6J1HQF1 (uncharacterized protein LOC111466855 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111466855 PE=4 SV=1)

HSP 1 Score: 815.1 bits (2104), Expect = 1.6e-232
Identity = 431/492 (87.60%), Postives = 459/492 (93.29%), Query Frame = 0

Query: 1   MAISSCLRLTFPVI---RRSLDLPKQNAPCLGS-FESLRPRRPTVLRPRLRSTVTCFSRR 60
           M+ISSCLRL FP +   RRSL   ++N  CL S  ESLR R  ++LR RLR+  T FSRR
Sbjct: 1   MSISSCLRLPFPAVQSARRSLGFSQRNVSCLRSTVESLRLRPYSLLRTRLRTMSTSFSRR 60

Query: 61  PAESTVSGEEKEIVQEEESQQYELERLFSNLNQVTLKREPGSLSSAIFLVAGTTIGAGIL 120
           P ES+VSG+EKEI +EEES++YELERLFSNLNQVT KREPGSLSSAIFLVAGTTIGAGIL
Sbjct: 61  PVESSVSGQEKEIDKEEESEKYELERLFSNLNQVTFKREPGSLSSAIFLVAGTTIGAGIL 120

Query: 121 AIPAVTQESGFLASAITCTFCWVYMVVTGLLIAEVNVNTMCELGSGGVSLVSMAMRTLGT 180
           AIPAVTQESGFLASAITCTFCWVYMVVTGLLIAEVNVNTMCELGSGGVSLVSMAMRTLGT
Sbjct: 121 AIPAVTQESGFLASAITCTFCWVYMVVTGLLIAEVNVNTMCELGSGGVSLVSMAMRTLGT 180

Query: 181 VGVQVSCWSYILIHYALLVAYVARSSDILTNFLGIPLWESATLFSLIFGGICYFGSQRLI 240
           VGVQVSCWSYILIHYALL+AYVARSS+ILT FLGIPLWESATLFSLIFGGICYFGSQR I
Sbjct: 181 VGVQVSCWSYILIHYALLIAYVARSSNILTTFLGIPLWESATLFSLIFGGICYFGSQRSI 240

Query: 241 GAVNGTLVLGIIISFAGLVAVASGGLHWDALVKANFEAVPMSIPIIALSFVYQNVVPVLC 300
           GA+NG LV+GIIISF GLVAVASGGLHWDAL+KANFEAVPMSIPIIALSFVYQNVVPVLC
Sbjct: 241 GAINGALVIGIIISFVGLVAVASGGLHWDALLKANFEAVPMSIPIIALSFVYQNVVPVLC 300

Query: 301 TNLEGNLTKVRTSIVLGTAIPLVLFLVWNGVILGTISSLEMGSDKIIDPLEQLRSANGAV 360
           TNLEGNL KVRTSIV+GTAIPLVLFLVWNGVILGTIS+L+MGSDKI+DPL+QLRS NGAV
Sbjct: 301 TNLEGNLAKVRTSIVIGTAIPLVLFLVWNGVILGTISNLDMGSDKILDPLQQLRSTNGAV 360

Query: 361 GPIVEVFSLLAIATSYIGFVLGLSDFLADLLKLPTGESRPLPFILTLIPPLILSLLDPEI 420
           GPIVEVFSLLAIATSYIGFVLGLSDFLADLLKLP+GE+RPLPFILTL+PPLILSL+DPEI
Sbjct: 361 GPIVEVFSLLAIATSYIGFVLGLSDFLADLLKLPSGENRPLPFILTLVPPLILSLIDPEI 420

Query: 421 FFKALDLAGTYGVLLLFGIIPAAMSWSDRYSGSPPSVKLPEVVPGGRFTLSLVIGGAGWV 480
           FFK+LD+AGTYGVLLLFGIIPAAMSWSDRYS SPPSVKLP VVPGGRFTLSLVIGGAGWV
Sbjct: 421 FFKSLDVAGTYGVLLLFGIIPAAMSWSDRYSRSPPSVKLPTVVPGGRFTLSLVIGGAGWV 480

Query: 481 IFSELLNNLGHL 489
           IFSELL N GHL
Sbjct: 481 IFSELLENFGHL 492

BLAST of MS023210 vs. ExPASy TrEMBL
Match: A0A5D3CFD4 (Tyrosine-specific transport protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold16G00650 PE=4 SV=1)

HSP 1 Score: 814.7 bits (2103), Expect = 2.1e-232
Identity = 430/492 (87.40%), Postives = 460/492 (93.50%), Query Frame = 0

Query: 1   MAISSCLRLTFPVI---RRSLDLPKQNAPCLGSFESLRPRRPTVLR-PRLRSTVTCFSRR 60
           M+ISS LRL FPVI   RRS++L  QN  CL S      +RP  L+ PRL++  TCFSRR
Sbjct: 1   MSISSSLRLPFPVIQSKRRSINLSHQNVTCLWSTNKSFQQRPLKLQLPRLKAVSTCFSRR 60

Query: 61  PAESTVSGEEKEIVQEEESQQYELERLFSNLNQVTLKREPGSLSSAIFLVAGTTIGAGIL 120
           P +S+VSG+EK+I +E ES++Y LERLFSNLNQVT KREPGSLSSAIFLVAGTTIGAGIL
Sbjct: 61  PVKSSVSGQEKKIDKEVESEEYVLERLFSNLNQVTFKREPGSLSSAIFLVAGTTIGAGIL 120

Query: 121 AIPAVTQESGFLASAITCTFCWVYMVVTGLLIAEVNVNTMCELGSGGVSLVSMAMRTLGT 180
           AIPAVTQESGFLASAITCT CWVYMVVTGLLIAEVNVNTMCELGSGGVSLVSMAMRTLGT
Sbjct: 121 AIPAVTQESGFLASAITCTCCWVYMVVTGLLIAEVNVNTMCELGSGGVSLVSMAMRTLGT 180

Query: 181 VGVQVSCWSYILIHYALLVAYVARSSDILTNFLGIPLWESATLFSLIFGGICYFGSQRLI 240
           VGVQVSCWSYILIHYALLVAYVARSSDILTNFLGIPLWESATLFSLIFGG+CYFGSQR+I
Sbjct: 181 VGVQVSCWSYILIHYALLVAYVARSSDILTNFLGIPLWESATLFSLIFGGVCYFGSQRVI 240

Query: 241 GAVNGTLVLGIIISFAGLVAVASGGLHWDALVKANFEAVPMSIPIIALSFVYQNVVPVLC 300
           GA+NG LVLGII+SFAGLVAVASGGLHWDALV+ANFEAVP+SIPIIALSFVYQNVVPVLC
Sbjct: 241 GAINGALVLGIIVSFAGLVAVASGGLHWDALVRANFEAVPLSIPIIALSFVYQNVVPVLC 300

Query: 301 TNLEGNLTKVRTSIVLGTAIPLVLFLVWNGVILGTISSLEMGSDKIIDPLEQLRSANGAV 360
           TNLEGNLTKVRTSIVLGTAIPLVLFLVWNGVILGTIS+LEMGSDKI+DPL+QLRS NGAV
Sbjct: 301 TNLEGNLTKVRTSIVLGTAIPLVLFLVWNGVILGTISNLEMGSDKILDPLQQLRSTNGAV 360

Query: 361 GPIVEVFSLLAIATSYIGFVLGLSDFLADLLKLPTGESRPLPFILTLIPPLILSLLDPEI 420
           GPIVEVFSL+AIATSYIGFVLGLSDFLADLLKLP+GES+PLPF+LTL+PPLILSLLDPEI
Sbjct: 361 GPIVEVFSLMAIATSYIGFVLGLSDFLADLLKLPSGESKPLPFLLTLVPPLILSLLDPEI 420

Query: 421 FFKALDLAGTYGVLLLFGIIPAAMSWSDRYSGSPPSVKLPEVVPGGRFTLSLVIGGAGWV 480
           FFK+LDLAGTYGVLLLFGIIPAAMSWSDRYS  PPSVKLPEVVPGGRFTL+LVIGGAGWV
Sbjct: 421 FFKSLDLAGTYGVLLLFGIIPAAMSWSDRYSKPPPSVKLPEVVPGGRFTLALVIGGAGWV 480

Query: 481 IFSELLNNLGHL 489
           IFSELL NLGHL
Sbjct: 481 IFSELLENLGHL 492

BLAST of MS023210 vs. ExPASy TrEMBL
Match: A0A1S3BQL8 (tyrosine-specific transport protein OS=Cucumis melo OX=3656 GN=LOC103492449 PE=4 SV=1)

HSP 1 Score: 812.0 bits (2096), Expect = 1.4e-231
Identity = 428/492 (86.99%), Postives = 459/492 (93.29%), Query Frame = 0

Query: 1   MAISSCLRLTFPVI---RRSLDLPKQNAPCLGSFESLRPRRPTVLR-PRLRSTVTCFSRR 60
           M+ISS LRL FPV+   RRS++L  QN  CL S      +RP  L+ PRL++  TCFSRR
Sbjct: 1   MSISSSLRLPFPVVQSKRRSINLSHQNVTCLWSTNKSFQQRPLKLQLPRLKAVSTCFSRR 60

Query: 61  PAESTVSGEEKEIVQEEESQQYELERLFSNLNQVTLKREPGSLSSAIFLVAGTTIGAGIL 120
           P +S+VSG+EK+I +E ES++Y LERLFSNLNQVT KREPGSLSSAIFLVAGTTIGAGIL
Sbjct: 61  PVKSSVSGQEKKIDKEVESEEYVLERLFSNLNQVTFKREPGSLSSAIFLVAGTTIGAGIL 120

Query: 121 AIPAVTQESGFLASAITCTFCWVYMVVTGLLIAEVNVNTMCELGSGGVSLVSMAMRTLGT 180
           AIPAVTQESGFLASAITCT CWVYMVVTGLLIAEVNVNTMCELGSGGVSLVSMAMRTLGT
Sbjct: 121 AIPAVTQESGFLASAITCTCCWVYMVVTGLLIAEVNVNTMCELGSGGVSLVSMAMRTLGT 180

Query: 181 VGVQVSCWSYILIHYALLVAYVARSSDILTNFLGIPLWESATLFSLIFGGICYFGSQRLI 240
           VGVQVSCWSYILIHYALLVAYVARSSDILTNFLGIPLWESATLFSLIFGG+CYFGSQR+I
Sbjct: 181 VGVQVSCWSYILIHYALLVAYVARSSDILTNFLGIPLWESATLFSLIFGGVCYFGSQRVI 240

Query: 241 GAVNGTLVLGIIISFAGLVAVASGGLHWDALVKANFEAVPMSIPIIALSFVYQNVVPVLC 300
           GA+NG LVLGII+SFAGLVAVASGGLHWDALV+ANFEAVP+SIPIIALSFVYQNVVPVLC
Sbjct: 241 GAINGALVLGIIVSFAGLVAVASGGLHWDALVRANFEAVPLSIPIIALSFVYQNVVPVLC 300

Query: 301 TNLEGNLTKVRTSIVLGTAIPLVLFLVWNGVILGTISSLEMGSDKIIDPLEQLRSANGAV 360
           TNLEGNLTKVRTSIVLGTAIPLVLFLVWNGVILGTIS+LEMGSDKI+DPL+QLRS NGAV
Sbjct: 301 TNLEGNLTKVRTSIVLGTAIPLVLFLVWNGVILGTISNLEMGSDKILDPLQQLRSTNGAV 360

Query: 361 GPIVEVFSLLAIATSYIGFVLGLSDFLADLLKLPTGESRPLPFILTLIPPLILSLLDPEI 420
           GPIVEVFSL+AIATSYIGFVLGLSDFLADLLKLP+GES+PLPF+LTL+PPLILSLLDPEI
Sbjct: 361 GPIVEVFSLMAIATSYIGFVLGLSDFLADLLKLPSGESKPLPFLLTLVPPLILSLLDPEI 420

Query: 421 FFKALDLAGTYGVLLLFGIIPAAMSWSDRYSGSPPSVKLPEVVPGGRFTLSLVIGGAGWV 480
           FFK+LDLAGTYGVLLLFGIIPAAMSWSDRYS  PPSVKLPEVVPGG FTL+LVIGGAGWV
Sbjct: 421 FFKSLDLAGTYGVLLLFGIIPAAMSWSDRYSKPPPSVKLPEVVPGGMFTLALVIGGAGWV 480

Query: 481 IFSELLNNLGHL 489
           IFSELL NLGHL
Sbjct: 481 IFSELLENLGHL 492

BLAST of MS023210 vs. TAIR 10
Match: AT5G19500.1 (Tryptophan/tyrosine permease )

HSP 1 Score: 605.9 bits (1561), Expect = 2.8e-173
Identity = 320/429 (74.59%), Postives = 373/429 (86.95%), Query Frame = 0

Query: 59  ESTVSGEEKEIVQEEESQQYELERLFSNLNQVTLKREPGSLSSAIFLVAGTTIGAGILAI 118
           E+ V+ EE+E  ++EE +    ERLFSNLNQ TLKRE GSLSSAIFLVAGTT+GAGILAI
Sbjct: 72  ETQVTTEEEE--EDEEEKIVVFERLFSNLNQSTLKRESGSLSSAIFLVAGTTVGAGILAI 131

Query: 119 PAVTQESGFLASAITCTFCWVYMVVTGLLIAEVNVNTMCELGSGGVSLVSMAMRTLGTVG 178
           PAVTQESGFLASA+ C  CW +MVVTGLL+AEVNVNTM ELGSGGVSLVSMA RTLG+VG
Sbjct: 132 PAVTQESGFLASAVACILCWAFMVVTGLLVAEVNVNTMSELGSGGVSLVSMAKRTLGSVG 191

Query: 179 VQVSCWSYILIHYALLVAYVARSSDILTNFLGIPLWESATLFSLIFGGICYFGSQRLIGA 238
           VQV  WSY+LIHY LLVAY+ARSS ILTNFLGIP+WESATLFSLIFGG+C+FGSQR IGA
Sbjct: 192 VQVVSWSYLLIHYTLLVAYIARSSGILTNFLGIPIWESATLFSLIFGGLCFFGSQRFIGA 251

Query: 239 VNGTLVLGIIISFAGLVAVASGGLHWDALVKANFEAVPMSIPIIALSFVYQNVVPVLCTN 298
            NG LV G+I SFA LVAVASG LHW+AL+KANFEAVPMS+PIIALSFVYQNVVPVLCT+
Sbjct: 252 ANGVLVFGVIASFAALVAVASGDLHWEALLKANFEAVPMSVPIIALSFVYQNVVPVLCTD 311

Query: 299 LEGNLTKVRTSIVLGTAIPLVLFLVWNGVILGTIS-SLEMGSDKIIDPLEQLRSANGAVG 358
           LEG+L +VRT+IVLGTAIPL LFLVW+ VILG+      +  +K++DPL+QLRS++  VG
Sbjct: 312 LEGDLPRVRTAIVLGTAIPLGLFLVWDAVILGSFPVDTGVAVEKMVDPLQQLRSSSVTVG 371

Query: 359 PIVEVFSLLAIATSYIGFVLGLSDFLADLLKLPTGESRPLPFILTLIPPLILSLLDPEIF 418
           P VE FSL AIATSYIGFVLGLSDF +DLLKLP+G+++PL ++LTL+PPL+LSLLDPEIF
Sbjct: 372 PFVEAFSLFAIATSYIGFVLGLSDFFSDLLKLPSGQNKPLLYLLTLVPPLVLSLLDPEIF 431

Query: 419 FKALDLAGTYGVLLLFGIIPAAMSWSDRYSGSPPSV-KLPEVVPGGRFTLSLVIGGAGWV 478
           FKALD AGTYGVL+LFGI+PAAMSWSDRY  S  +V +LP++VPGG+ TLSLV+G AG+V
Sbjct: 432 FKALDFAGTYGVLVLFGILPAAMSWSDRYIVSSSTVTRLPQLVPGGKLTLSLVMGAAGYV 491

Query: 479 IFSELLNNL 486
           I SE++ NL
Sbjct: 492 IISEVIENL 498

BLAST of MS023210 vs. TAIR 10
Match: AT2G33260.1 (Tryptophan/tyrosine permease )

HSP 1 Score: 119.8 bits (299), Expect = 6.2e-27
Identity = 108/425 (25.41%), Postives = 188/425 (44.24%), Query Frame = 0

Query: 89  QVTLKREPG-SLSSAIFLVAGTTIGAGILAIPAVTQESGFLASAITCTFCWVYMVVTGLL 148
           ++T + + G S  +A+ L+ GT +G G+L +PA T  SG + S I     WVY++ + LL
Sbjct: 5   EITHETKKGKSFWAAVSLIIGTAVGPGMLGLPAATIRSGSIPSTIALLCSWVYVISSILL 64

Query: 149 IAEVNVNTMCELGSGGVSLVSMAMRTLGTVGVQVSCWSYILIHYALLVAYVARSSDILTN 208
           +AE++   M E  +  VS   +A ++ G        + Y  + ++L+VA V+    I++ 
Sbjct: 65  VAELSFAAMEEDNAAEVSFTGLATKSFGNKFGVFVAFVYASLSFSLMVACVSGIGSIVSQ 124

Query: 209 -FLGIPLWESATLFSLIFGGICYFGSQRLIGAVNGTLVLGIIISFAGLVAVASGGLHWDA 268
            F  +  + +  +F L+ G +  F     I   N  L   ++ S   LVA+       + 
Sbjct: 125 WFPSMNPFLANAIFPLVSGILIGFFPFNAIDFTNRGLCFLMLFSITSLVAIGLSVARSNV 184

Query: 269 LVKA-----NFEAVPMSIPIIALSFVYQNVVPVLCTNLEGNLTKVRTSIVLGTAIPLVLF 328
           L            V  ++P++ L+  +  + P +C     +++  R +I++G  +PL + 
Sbjct: 185 LASFGQSCWKVSMVLPAVPVMVLTLGFHVITPFICNLAGDSVSDARRAILVGGVVPLAMV 244

Query: 329 LVWNGVILGTIS-SLEMGSDKIIDPLEQLRSANGAVGPIVEVFSLLAIATSYIGFVLGLS 388
           L WN ++LG    ++       IDP+  L S N +    V+ F+  A+ATS IG+ +   
Sbjct: 245 LSWNLIVLGLARITVPAAPSSTIDPISLLLSVNPSALSAVQGFAFSALATSLIGYAVSFP 304

Query: 389 DFLADLLKLPTGES------------------------------------RPLPFILTLI 448
             L D  KL + +S                                      +  +  L 
Sbjct: 305 KQLLDTWKLVSKQSNGNGRLGSVSFSSKERDRRTNGRASYNEPARARDGFEAVVMLFVLG 364

Query: 449 PPLILSLLDPEIFFKALDLAGTYGVLLLFGIIPAAMSWSDRYSGSPPSVKLPEVVPGGRF 470
            P +++   P  F +ALD AG Y    LFG++P AM+    Y         P V+PGG F
Sbjct: 365 VPALIATFFPSTFSRALDFAGVYANCFLFGVLPPAMA----YIQQSRKKLRPWVLPGGNF 424

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022147723.15.1e-26599.59uncharacterized protein LOC111016587 isoform X2 [Momordica charantia][more]
XP_022147722.16.2e-26398.18uncharacterized protein LOC111016587 isoform X1 [Momordica charantia][more]
XP_023530389.15.1e-23387.40uncharacterized protein LOC111792980 isoform X1 [Cucurbita pepo subsp. pepo][more]
XP_022967292.13.3e-23287.60uncharacterized protein LOC111466855 isoform X1 [Cucurbita maxima][more]
KAA0055689.14.3e-23287.40tyrosine-specific transport protein [Cucumis melo var. makuwa] >TYK09938.1 tyros... [more]
Match NameE-valueIdentityDescription
P0AAD48.1e-3230.27Tyrosine-specific transport protein OS=Escherichia coli (strain K12) OX=83333 GN... [more]
P0AAD58.1e-3230.27Tyrosine-specific transport protein OS=Shigella flexneri OX=623 GN=tyrP PE=3 SV=... [more]
P447472.7e-2730.00Tyrosine-specific transport protein 2 OS=Haemophilus influenzae (strain ATCC 519... [more]
P447272.1e-2427.70Tyrosine-specific transport protein 1 OS=Haemophilus influenzae (strain ATCC 519... [more]
P446145.8e-2225.53Tryptophan-specific transport protein OS=Haemophilus influenzae (strain ATCC 519... [more]
Match NameE-valueIdentityDescription
A0A6J1D2302.5e-26599.59uncharacterized protein LOC111016587 isoform X2 OS=Momordica charantia OX=3673 G... [more]
A0A6J1D3603.0e-26398.18uncharacterized protein LOC111016587 isoform X1 OS=Momordica charantia OX=3673 G... [more]
A0A6J1HQF11.6e-23287.60uncharacterized protein LOC111466855 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A5D3CFD42.1e-23287.40Tyrosine-specific transport protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5... [more]
A0A1S3BQL81.4e-23186.99tyrosine-specific transport protein OS=Cucumis melo OX=3656 GN=LOC103492449 PE=4... [more]
Match NameE-valueIdentityDescription
AT5G19500.12.8e-17374.59Tryptophan/tyrosine permease [more]
AT2G33260.16.2e-2725.41Tryptophan/tyrosine permease [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR018227Amino acid/polyamine transporter 2PFAMPF03222Trp_Tyr_permcoord: 98..473
e-value: 4.0E-72
score: 243.4
NoneNo IPR availableGENE3D1.20.1740.10Amino acid/polyamine transporter Icoord: 96..484
e-value: 7.4E-11
score: 43.3
NoneNo IPR availablePANTHERPTHR32195:SF26OS07G0662800 PROTEINcoord: 40..485
NoneNo IPR availablePANTHERPTHR32195FAMILY NOT NAMEDcoord: 40..485

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
MS023210.1MS023210.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0003333 amino acid transmembrane transport