Cp4.1LG14g00520 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG14g00520
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionSodium-bile acid cotransporter, putative
LocationCp4.1LG14: 4446070 .. 4458722 (+)
RNA-Seq ExpressionCp4.1LG14g00520
SyntenyCp4.1LG14g00520
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ACCAAACCAACACAAAATTCTTCATAATGTCTTCTATTTTTCTGCAATTCACCCCCTTCTTGTCCCAATCTCTCCACACCCACCGCAAGCTCCGCCGTTACAGACCCCCCACCCCTCTTTTCCGGCCGTCAAAGCCCCCCGGACACCTCGCCGTCCGAGCACTTCAACGGAAAACAGAGCTTCCATCTCTTCCAACGCCGCCGCAGAAACCAACCCCTGTGGCCAAATTCGTGTCCACGGCGGCAGGGCTGTTCCCTCTGTATATCACGGCTGGTGGCGTTGTTGCGTGCCTGAAACCGTCGACGTTCTCGTGGTTCGTGCGGAGAGGGCCTGGTTCTTATAGCTTGGCTCTTGGGTTGGTTATGTTGGCCATGGGACTCACTCTGGAGCTCAAGGATTTGTTCACTCTGTTCATGGAGAGGCCGCTTTCTGTGAGTTTTTGTAACATTTTTTGAATTTTGGAATATTTGGTGAATTAATGAATGATTTTGAGCTTGTGGGTTGGTTGATTTTTGGTCAGATATTATATGTATGTATAGCTCAATTCACAATTATGCCGGTGGTCGGAGCTCTCATCGGAAAATATCTTGGCCTTCCGCCGCCGCTTTCCGTCGGGTTGGTCTTGCTCGGATGTTGTCCTGGAGGGATTGCTTCGAGTGTGGTAAGTTTATCATCTAGTGTATACTTTGAGTGTCTCGTACCAATGGAGATGTATTCCCTGGCTTATAAATCTATGATCAATGTGGAATTCCCTCCTACCATCCTCCTCTCGAACCAAGTACACTATAGAGCCTCCCCTAAGGCCTATAACTTCTTTCTCTAGAGCCCTTAAACAAAATACCCAAAAATGCCTCGAACCAATCGAGATCTATTCCGTACTTATAAATTCATGATCATTCTCTTAATTAACCAACGTGGGACTCCCTTCCAATAAAGTACACTATAGAGCCTCCCTTAAGGCTCATGGAGCCCTCGAATAGCCTCCGCTTAATTGAGGTTCGACTCCTTTCTCTAGAGCCCTTGAAAAAAGTACACTCTTTGTTCGACACTTCAGTCATTTTTTACTTACACTTCCGAAGGTTCCAACTTCTTTGTTGGACATTCCAGGATTCTATTGACATGTATAAGTTAAGGGCATGACTCTGATACCATGATAATGATATTATATTATCCACTTTGAGCATAAACTCTCGTAACTTTACTTTTTGGTTTCTGCTCTATACCAAATCCATGATATTTCCTTAATTAACCAACGTGGGACTTCTTCCCAACAATCAATTATTTTTTTTGAACAGTAAATAGTGTTTCAGGTGACGTTAATAGCTCGAGGCGACGTTCCTTTATCGATTATAATGACGGTATGCACCACTCTGGGAGCCGTAATTCTCACTCCCTTTCTCACTAAAACGCTCGTTGGAGCTTTCGTTCCGGTCAACGCTTTGAAGCTGTCTCTCAGCACTCTTCAGGTCAAACAAACAAAAGAACTTTGACTAAAATCGATCCGACATAGATTTTGGTTGACTTTTTGTTTGTTGGTTTAGGTGGTGGTAGCTCCCATTTTGTTGGGTTCTTACTTGCAGAAAGCTTATCCAAAGCTTGTGAAACGAATTGTGACATTTTCGCCACTTGTTGCTGTCTTAACTTCGTCACTTCTTGCTTGCAGGTCTTTTTTCTTTGTTGTTGAACCATTTTTTGCTTTCTTAAGCTGTTTTTGAGGTGGATTTACTGATGGTTCTTGTTTTATCTTAGTGTTTTTTCAGAGAATTTCGTTCGATTTAAATCATCACTTGTTAGTTCGACGTTGGGTTCCGACGGATCTTCATGGATTGGTGTTAAAACGATATTGTCCGGAGAGTTGGGAGCTGTAATACTTTCGGTGTTTTTGTTGCACCTCGTTGGTTTCTTTGTCGGGTAAGTCGTCGTTGTATCTCTTTCATTGCTTCTTGACGTGAATTTGGTCAAATGGGTATCGAACTTCAATCCTCGTAGATTTTTTATAACAAACCAAGTCCACTGTTATCAGATATTGTCCGCTTTGACCCGTTACGTATCATCATTAGCCTCATGGTTTTAAAACGCGTTTGTTAGGGAGAGGTTTACACACCCTTATAAGAATGTTTAGTTTCTCTCACCAACCAATGTGAGATCTCACAATCCACCATCTGAAGTGCCCAACGTCCTCGTTGGCAAACTGCTCGATATAGCTCTGATACCATTTATAACAGTCCAAGTCCATTGCTATCAGATATTGTCCATGTCACACAGTTTTAAAGCACGTCTGTTAGATAGAGGTTTTCATATTCTTATAAAAAATGTTTCGTTGCTCTCTCGAACAAATGTGGGATATCACAGTCCATCCCTGAGGGGCCCACGTCTTCATTGACACACCGCCCGATATCTAACTCTAATACCATTTATAACAGCCCAAAACTACCGTTGGCCGATTTTGTCCACTTTACCCATTACTGTCAATTTCACGATTTTAAATGCGTTTATTATTGAGAGTTTTACACACCCATATAAGGAATATTTCGTTTTTCACTCTAAACAACATGAGATCTCACATTTTTTCTTAACCATGTTGATATCATTTTACTTTGCAGCTACACAATAGCCGCAATCGGTGGATTTCAAGAACGAGAACGGAGAGCAATATCCCTTGAGGTATGAATCAAAACATAAACAATCACATCTTTAAATCCAAAGCATGATATGATATGATATGATATGAACGAGAGATCGAGTTTTGGATTCAGGTCGGGATGCAAAATTCATCATTGGGAGTAGTATTGGCAACAGCCCATTTCAGTTCAGCAATGGTGGCATTGCCTCCAGCAATGTCAGCGGTGATAATGAACATGATGGGTAGCAGCCTCGGGTCTTTATGGAGAAATATTCCGCCCACTGTTGCAGAGGCTGCCCAGTGAAACCCCCACACACTGTTTGCTTCACAATATGGAGGGGCCTTTCTGGGGCATTGGTTATCAGAATTTGACGGCTGAGNCTATCTGCCAAAATTTCCAATCTTTATTTGCCACAAGTGCTAGAGGAACTTGTATGATAGTGATCTTTGCCACTGGACTAAACGTTTCCTCATAGTGNATCAGAATTTGACGGCTGAGATTCATTCGTCGGAGTTCTGTACTGACATCTGATGGCTATCAAATCAAGACTGTCGCCATGGACTGTCGTCACCTTGCACACTGACTCCACTTTGTCATCTTGCATCTCTGTCTCTCTTACAATATATGTTATTTTAATTTTTTTTTTTTNCATGAATCTATAGTCTGAGTTTTTTTATCTCACGATTGATCCATCTGGGGTACACTCTATTTTGTAAATCCACTTGCAAAAGATGGATTTGATATATTCTGGTCTTGGTACTAGTCCCAAGTTTGATTTTGCTCCAGGGCTATAATTTCTTCCTCCCTCGCTCTCTGGCAAGCCAAGTTTTGTGATGCTTCACATGTCTCTGGCTCATTAACTCCGTCTTCTACAATAGCTGTATTGGCATACTCTGGATTTAACTTTAGGATTATCTCTGAATGTCTAAGTTGTTGAGGTGTCATTTCCTNGATTAAATTAATAAAAAAATATTATTAGATTTTATATTTTGTCTCCATATATTTATTTACTTCAAAAATACATTTAAAATTTAAAAAAAAAATCATTAATACTTCCAAGTTTAAAAAAAAAATTAAAAGTATCTTTGCTGTTAATATTATATTTTAAAAAAAATACCTAACAAATTTTAGAGGGAGTATTTCAGACTCTAAACCTACTTTCGATTCATCCCGCATTGCCTGCTTACTAAAAATGGCCCACTTGGAGGTCTCGATTCCGTGGCATGGCTCAACAAAGGATTCATGTCGTCCAAACTATTTAAAGTTTGAGAATAGGTCGAGGGCGTTGTGTTGAACACAACTCTCTGTTAGAAACACTAATTGAGAAGAGATTTACAAAACACTTTGTAGAAATTACTCATTTGAATTGATCTGGACCTAACAAATCTAAATCTTTTTTCTGAAATATCTAGAACTCTCCTCTCAAATATCTAGATATCTTTTCTAAAATCTCTATGAAAATAAATATACAATGGGCTCATACAATTATTGGGCTTGACNTGTTTGGTTGAGTCACTTTTTGCTCATCAACATCAGTGTCACTCAAATCTCCAAGCACATTCGCATTTAAGCAAGTGTGTACAGTTTGCTCCCCCGTCTTTTGTGGAGGAATTTCTTCAGTGTTGTCCAAGCCTGGTAATACTTCCTTCTCTGAGGGCCACCCCTCTTCCATNTGTTGGGCTAGTGACGATATTTCAACACTCCCCAGCACTAATCCTCATTCGCTGCACCACAGCGAAAGCTCTCATGTGTGCAAATATTCAGCTCTTTTGTAGACAAGTCTACCACTTGGTCATCCGTATTCTTCTTTCAAGACTTTGTCTCTAATAAAATGGCACTGCACCTCCACATGCTTTGTTCTGCCATGAAACATTGAATTTTCTACTAAACAANCATACTTTTTCATGGACCAACAATCTATAGACATGTGGCCATTTTTCTCCAATTATAGCNGCACTGCACCTCCACATGCTTTGTTCTGCCATGAAACATTGAATTTTCTACTAAACAAATCACAGATTGGTTGTTGCATTGAAGTAGTATCGAATAGTCAATTTTCTGGTGCCAATCTTCTATCAGAAGTTTCAGCCATGTACATTCTTGAGCTGCTCCCGCCACTGCTCTGTACTCTCTGCTTCTCTAGTTGACAATGATACAGTTGGTTGTCTTTTGTTATACCAAGAAATTGTTCTCGAACAGAGCTTGATCACATACCCAGTGATTGATCTTCGAGTACCATGATCTTNGTTTCTTGGCTGGCAAACAAATTTTCGAACTCTACAAGTGATGGTTGTGTGAGCCATCATTGTACAACAATAATAAAGCTTCTATATTTCGGTTGCAATCCATGGATTATAATCCTCTTCATTCAAGATTCTCCAATGGCGGACTTCGGGTCTAGTTCAGTAATCTCCCGACAGANTTCGCTTCTTTTATACGAAGGATTGTAATTGATTGTACCTTTGACATATCTCAAGGTTCGTTGAGCCGCANCCATAAGTTGTAGCTTTGTATCGTTCTTCTTTAAGAACAACATCACGAACGTGTCACATGCTTCTTTCGGTGTCTTGTCATCCCAAATATGTTATAACATCTCTTCTCTGATTTTGGTCTTCAAGGCAAACATTGCTTTGCCTGCTTTGATTATCCATTTGCGCGAGGCGCCATTAGAATCCTCCTCTGGCGGCGTAGTTTCACACCCGCCAACGATCTCCCAAAGATCCTGTCCATGTAAATAGGACATCATGCATGTTGCCCACATGTTGTAGTAGTTGTTGTTGATTTTCTTGATTTCTCCAACGATTTGAAAATCACCCCATCGTGTTAGTCATACCTAAGTAATACCAAACAAGCTCTTTCGACACAAGATAGGAATTGGAAGTGAGATGAAAAGTATGATCCATGAATATTGATATGTATATTCCATAAAAAAAAAGTATATTTAATTCCTAATTAATTTTTCTGATTCACCAGCTCTTATCTCTTTTCAAAAGGATCGGTTAATAAAAAATTTGAATATCCAAAACTGAAATCGGATTTTCTTTTTCTAGTCTTAATTATTCTTAATTTTTTGCCATATACTTAAAATATTTCAAGTCAGGAAGTTAGAATTGTTCAAATAAGATGATATTGCAGAGTCACAGACTACTCACGACACAAAAGAAACTCACTCACTACTAATCTCAAGAAATCGTTTCTGATGTGGAAGATCCTAAGGGATCAGATAACCTACACTAACCTCGCTTTGATACCAAATTGTTGAACACAACTATCTGGAAAACTCGCTTTGATACCAAATTGTTGAACACAACTCTCTGGGAAACACTCGCACTCTTTGTTAACACCAACCGAGAAGAGATTTACAAAACACTCTATAGAAACTGTATTTAAAACCTAGTTAGGGTGGAGGGGTATACTAGAGCCTACTCAATAGACCTAACAAATCTAAATCTTTTTTTCTAAAATATGTAGATCTCTCTCCTAAAATATACCGTATCTTTCCTAAAATTTATACTAAAATAAATATACAATGGGCTCGTACAATTATTGGGCTTCACTCTACTGGGCTAGTGATGATATTTCAACACGTTGCACCCCCGATACCTCTAATCATTGGCTTTATTACATGATAGAACTCGCACGCGGGCTCCAGCTATCCTATAAAAAAAATTCAAATTTTTTTAACTGAGATGTAAATAGTCTATTAACACCTTATTATTGTTTTTTAAACATGGTACTAAGGAAAAACAATTATTTTTTTAGTCTTCAAAAAATATTTAAGTAAAAAGCTTAATCCTTCAAAAAAAAAAAAAAAATTAATCATAATTAATTCAATAATTATTATAATTTTCATGATACAATATTTAAAAAAATTTATAGAAAAAAAGGTTAAATAAGAAGAGAAGAGAGATAAAATAATACTTTGGTCAAATTATATAGCCCTCAATTATTGACTTTTATAGGAATCATGACATTGTCAACACACATCCAATATTAAGATGTCATAAGTTGAATGTGTTTTGAACATCTTGAGTAGGGTTAAGGAGCAATTCGTAGGATTGAGTGAGGTGTGATCCTTTCAACTCGTAGCTTCAAAACATTGCATGCACGAGACTTCTAGCTAGCTTGATTGCACGACATCAATATCCACCAAGTTCAAAATTCTCACTATGAAGGTATATGACAAGTCAACCGATCTGATTGAACATGCAAAGTGTTAGGATCACACAACAACGCACACACTCGATCTAGATGAACACAAAGAACATGATAGAGAAAATGCAAGGAGCACTCTTGCTAAAAGATTTTTATTGATGACTTCAAGTATGTGTACAGTAACAAAGAATACAGTACGTTTTCAAAGCTTTGTATATATATGATTTAGCTTACTATATATATTTTTACGTGATATAACCATCTACTATATATAGCAGATATAGCTTTTTACTATACATAGCTATTTACTATTTATGACAATTTACTTTATATGTCATTTACCAAATATAAATTTTTACTATATATAACTATTTACTATTTATGACAATTTACGTTTACCAAATATAGCTTTTAGATTTTACTACCAAATATATATTTAGATATAGCTTTTTACTATACATAGCTATTTACTATTTATGACATATTTACCAAATATAGCTTTACTTAACACCGCTATAAATAAGCAGCAGACTCCATAACCTAACACAAAAATCTTACGGGTCATGGATGAAACTCCATGCAGTCCGTAGATGTTGAGCATCTCCCTAACCTTCATGAGCTTGCCCAAACCTAGTTCAAGGACCATTGCCTCATTCCAAGAGTTAATTCGTGCCTATATAGATCAATTCATGTGCAACCACGACGACCAATCCAACGATTCAACTCAATTCATGTGCAACCACGACGACCAACCCAACGATTCATCTCGTAGTCTATTTGTCAAGCTTTAATCTTTTAGTCGAGGATAGATATATTGAACACAATTCTTTATTATCAATTAAGAAGAAATTACACGATACTCACAAATTACTAAACTTTGCTTTGTTTGGTATGAAAAACGGGTGGAATGGAAGATATTTTTAGCGAGGACAGACAATATATACCACACATTTTATCATTTTTTCTTAAAGTATCTCAATATCTTTCTAGATATATTTATTTCTAAAAATTTAAATATTTATTCATAAAATATCTCGTTATCTTTCTAAAACATTCTATAAAATTTATAGACAATTTGTTCATACATGGAATTGGTTATCGGCCTAGTGATAATCAACGCTCTCATCAGCCCGGGTCTTCATTCACGATACCATGCCATGTTGAGTTGATAGCAAAAGTTCTCATATTGACTGATATTTTATCTTTTTGGATGAAAGTCCCACATCGGCTAATTAGGAAATGATCATGAGTTTACAATAATAAAAAATACTATCTTCATTGGAGGCATTTTGGGAAAGTCTAAAGCAGAGTCATGAGAGCTTATGTTCAAGGTGGACAATATTACACCAAGAGAATAAAGTAGATACCTATTTCCTTCTTCCGACATAAAGTATGGTGCAAGTGAAGGTAAACAAGAGCATTAAAGTAGCAATTCGAATAACGACAAAGAACAGAAGGACTAGTTTAAATAAACTAGCTTAAATTGACTAATTCCATAAATTAGCTTAAATATGACAAAACAGTGCAACATTAGATGCATATAACCTTAACTCATACGTTAAAAAACTAGCTTAAATATGACAAAACAATGCAAGGAATGAAATGTACTTATTAATCATTCATGGACTAGTTCCATAAATTCAACTACTCATGTGTTAAAAACTAGCAATGAATGAAATGTGTAAACAATGCAATGAATTGAAAACTAGCTTAAATATGACAAATAAATATTTAAGTTATAATTAAACCGTGCAATGAATGAAATGTACTTATTAATCATTCTTGGACTAGTTTCATAAATTCAACTGAGTTCATTTATATAACCTTAACTCCTGCGTTAAAACTAGCTTAAATATAACTCATTTATATAACCTTAACTCTTGCGTTAAAACTAGCTTAAATATAACAAATAAATATTTAAATTATAATGAAACAGGGCAATGAAATAAATATACTTATTAAAGTGCATTTGTTGTAGTTTTTTGGCTCTCATCCTTGTAATTGGACCTTGTGGTATGAAAATTTCTTAGTCGTAGTTCATATCAATTCTCCTTCATATAATCTTTTAAAAAATCAAGTCTTTAATGAAAATTACTTGAATTGGTGAATATGACTTTAACTGCGGTTCAATTAAGGGGGGTTTTGGCAGAGTCGGCGGCAAGGACATGGAATATGACCCTCACCATCTGGCCCATAACTAACGGGAAAAAATCTCTCCCATTTTTCATTTAAAGAGGTATTCCCCACTCCATTTGGGCCGGATTTTAAATTAACATCTCTATTACACGTATTGACTATTTTTAATGATATTTTTCTTAGTACAACAAATGAAGTATGAGAATTTGAATGGACAGCCTCGCGTTAATTGTTCATAATATTTAACATTTTATTAAAAAAAAAAAAGCACATGAGCTTTAAAAATATAAGTAACATGATTATTAATGTTATGAAATTGTATGAAAATTATTGTCTAATTATAAATTAATATAATCAGCACATTTTTCACCATTATATTTTAAATATACTAAAACCCATTATTAAAAAATTTAAAGCAGTGTTAAAAAAATGTTTAAATCCGTCAATTTATATTCCACGTATTTTGACACGTAAGCCAAAGGATAAAAATAAATGAATAATAAATAAATAAATCTCTCTGTTCTTATTGGCAGCTTCCAAATGATTTCCGATCACAATCGCGTAGTTAGCACCACAGAAAACCATTGATTCATTCACCAATTCTTTTCTTCGCAATCGGAGCCGCCGAATTCTGCACAATGGCTTCGATTTCTCTGCAATTCACCCCCTTCATTTCCCCTCTCCAACACCACCACCGCCGCCGTAATCTCCGCCTACACAGACCCAAAATTCCCTGTCTCTTGCCGCCGAACCTCCCCAGATTGCTCGCCGTCCGATCAGTTCAACGAAATAACGAAGACCCATCTCCCCTGCCCCCGGCAAAACCCTCCGGTTTGGATGATTTTCTTTCGACGGCGGCGAGTCTGTACCCTCTCTATGTGACGGTCGGCGGCGTCGTTGCTTGTCTGAAACCGTCGACCTTCTCGTGGTTTGTGGAGAGAGGACCCACTTCGTATAGCTTGGCTCTTGGCTTGATTATGTTGGCTATGGGTCTCACTCTGGAGCTGAAGGATTTGGTTAATTTGTTCATGCAGAGGCCACTTTCTGTGAGTTTTTGGATTCTTGCTTTGTTTATTAGTGGATTTTGTGGGGTTTTGGGATTTGGGTTCTTGATTTTATGGGATTAAATTGTGGACTGCTGGTAATTTTTGTAATGATTCTGTGTTTTTTTGTTTTAGATATTGTTTGGATGTGTAGCTCAGTACACGATTATGCCGGCTGCCGGAGCTCTTATTGGCAAGTTTTTTGGGCTTTCAACGTCGCTTTCGGTTGGTTTGATCTTGCTTTCATGTTGCCCTGGAGGGACAGCCTCCAATGTGGTAATGTTCATGATCTCTGTTGCTTATATGTTCAGTTTTAATGTAGTTGTTCTTGATGTTCCACAACTATTAATCTATTGATTGCATATTATACTCAATTTGGGATGATGTTTCCCCTCTCTCGTAGATGCCTTTGTAGATGACTTTTAAAAACTTTGAGTAGAAGTCCGAAAAGAGAAGCCTAAAGAGGACAATATCTACTTGTGGTGGGCTTTTACCAATGGTATAGAGCCGTACACCAGGCGATGTGCTAGTGAGGAGGTTGAGCCTCGAAAGGGGTAGACATGAGGCGGTATGTTAGCAACCAGACACCTGACAATGTGCTAGTGAGGAGGCTGAGCCCCAAAGGGGGTGGACATGAGGCGGTGTGTCAACAAGGACGTTGGGCCACGAAGGGGGTGGACATGAGGCAATGTGTCAACAAGGACGCTGGGCACCGAAGGGGGTGGACATGAAGCGGTGTGTCAGCAAGGACGCTAGGCCTCGAAGAGGTGGATTGTGAGATTCCACATTGATTGGGGAGGAGAACGAAAACATTCTTTACAAGGGTGTGGAAACCTCTCCCTTCAAAAACTTTGAGGGGAAGCCCGAAAGAGGAAGCCCAAAAAGGATAATATCTATTAGCGATGGGCTTGGGCCGTTACAAATGGTATCAAAGCCAAACACTGGGCAATGTGTCAGTGAGGAGGTTGAGCCTTGAAGTGGTGGACATGAGGCGGAGTGTCAGCAAGGACCCTGGGCCCTGAAAGGGGTGGATTGTGAGATCCCACGTTGCTTGGAGAAGAGGTTGAGCCCTGAAAGGGGTGGACATAAGACAGTGTGTCTGCAAGGACGTTGGGTCCCCATGAGGGTGAATTGTGAGATCCCACCTCGGTTGGAGGGAAGAACGAAATATTCTTTATAAGGGTGTGGAAACCTCTCCCGAAATATTAGTCTGGAAACCTCTCTCTAGTAGAGTAGACGTGTTTAACCTTGAGGGGAAGCCCGAAAGGAAAAACTCAAAAAGGACAATATCTACTAGCGTTGAGCTTGGACCGTTACAATAATGGTGAATAACAAACATGTAGGTTCAAAATAGTATCTCAATTGAGCATAGGTTTATGAATGATATCATGAAGAATTGAAGTAGATAAGTTGATTGCTTAAGCTTTCCGTTCACCTTTACTTCAATCAAATAGTGTTTCAGGTAACCTTAATTGCTCAAGGCGACGTTCCTTTGTCTATTGTAATGACAGTATGCACCACTCTAGGAGCCGTAATTCTCACTCCATTTCTTACCAAATTACTAGCAGGAGCTTACATTCCAGTTGATGCTGCAAAGCTCTCTCTCAGCACCCTTCAGGTGAAATTTGGTTCTCATTTGCCTTGTGTTCCTGTTCCTGTTCGTGTTCCTGTTCCTGTTCATGTTCCTGTTCCCGTTCATGTTCCTGTTCACGTTCATGTTCGTGTTCATGTGTCCATGTCCATGTCCATGGTCTTGTTCTTGTTCTTGTTCTTAACTGAATTCTTACGATCGCCATCTTGTTTGTTCTTGGCTAGGTGGTGGTAGCTCCTATTCTATTAGGTTCCTACTTGCAGAAGGCGTTTCCTCAACTGGTGAAACTGGTGATACCATTTGCACCACTCGTTGCTGTCTTAACTTCGTCGCTGCTCGCTTGCAGGTATTAACTTCGTCGTCGCTTCTTAAGTGTATAAATGATCATTGTTTAATCGAAACTCAGTCGTGTTTTTGTTCATCAGTGTCTTCTCGGAGAACGTCGTTCGTTTCAAATCATCAATGGTTAATGCCTCATTAGCTTCTGATGCATCTCCATGGATGGTTATTAGAAGTATATTATCAGGAGAGTTGGGAACAGTCATACTTTCAGTGTTTTGTTTACACTTCGCGGGTTTCTTTGTCGGGTAAGTTGCATATCATTCGAGATGTTACTCGAAGTTATAACTTAACTATTGGTTTGTAATATAATGTCGATCGACACGATCTTTGTTTGCTTTGCAGCTATATAGCAGCGAGTATCGGGGGGTTTCGAGAACGGGAACGAAGAGCTATATCCATAGAGGTTGGGATGCAGAATTCATCATTGGGAGTTGTTTTGGCGAGCTCACATTTCAGCTCAGCAATGGTGGCATTACCGGCAGCGATGTCAGCTGTGATAATGAACATAATGGGTAGCACTCTTGGGCTTTGTTGGAGATATATAGAACCTGCAGCTGATGAAGTGGAAGCCACTGGTAGTGTTGCCAACTGAAATTCAAGCCCTTTTTTTTAGTGTTTAATTTATCTCTTGTAAGTGTATATTTGTTCTTAGGGTTCAATTAACCCTTTTTTCGTAAAAGAATTTGACCTTTTTATCTTTGATTCTTCTGCTCGTTAAGGTATATGTCGATCAAATGTTCAAAATATTTTGAAACGTATTCT

mRNA sequence

ACCAAACCAACACAAAATTCTTCATAATGTCTTCTATTTTTCTGCAATTCACCCCCTTCTTGTCCCAATCTCTCCACACCCACCGCAAGCTCCGCCGTTACAGACCCCCCACCCCTCTTTTCCGGCCGTCAAAGCCCCCCGGACACCTCGCCGTCCGAGCACTTCAACGGAAAACAGAGCTTCCATCTCTTCCAACGCCGCCGCAGAAACCAACCCCTGTGGCCAAATTCGTGTCCACGGCGGCAGGGCTGTTCCCTCTGTATATCACGGCTGGTGGCGTTGTTGCGTGCCTGAAACCGTCGACGTTCTCGTGGTTCGTGCGGAGAGGGCCTGGTTCTTATAGCTTGGCTCTTGGGTTGGTTATGTTGGCCATGGGACTCACTCTGGAGCTCAAGGATTTGTTCACTCTGTTCATGGAGAGGCCGCTTTCTATATTATATGTATGTATAGCTCAATTCACAATTATGCCGGTGGTCGGAGCTCTCATCGGAAAATATCTTGGCCTTCCGCCGCCGCTTTCCGTCGGGTTGGTCTTGCTCGGATCCGTAATTCTCACTCCCTTTCTCACTAAAACGCTCGTTGGAGCTTTCGTTCCGGTCAACGCTTTGAAGCTGTCTCTCAGCACTCTTCAGGTGGTGGTAGCTCCCATTTTGTTGGGTTCTTACTTGCAGAAAGCTTATCCAAAGCTTGTGAAACGAATTGTGACATTTTCGCCACTTGTTGCTGTCTTAACTTCGTCACTTCTTGCTTGCAGTGTTTTTTCAGAGAATTTCGTTCGATTTAAATCATCACTTGTTAGTTCGACGTTGGGTTCCGACGGATCTTCATGGATTGGTGTTAAAACGATATTGTCCGGAGAGTTGGGAGCTGTAATACTTTCGGTCGGGATGCAAAATTCATCATTGGGAGTAGTATTGGCAACAGCCCATTTCAGTTCAGCAATGGTGGCATTGCCTCCAGCAATGTCAGCGGTGATAATGAACATGATGGGTAGCAGCCTCGGGTCTTTATGGAGAAATATTCCGCCCACTGTTGCAGAGGCTGCCCAAAAACCATTGATTCATTCACCAATTCTTTTCTTCGCAATCGGAGCCGCCGAATTCTGCACAATGGCTTCGATTTCTCTGCAATTCACCCCCTTCATTTCCCCTCTCCAACACCACCACCGCCGCCGTAATCTCCGCCTACACAGACCCAAAATTCCCTGTCTCTTGCCGCCGAACCTCCCCAGATTGCTCGCCGTCCGATCAGTTCAACGAAATAACGAAGACCCATCTCCCCTGCCCCCGGCAAAACCCTCCGGTTTGGATGATTTTCTTTCGACGGCGGCGAGTCTGTACCCTCTCTATGTGACGGTCGGCGGCGTCGTTGCTTGTCTGAAACCGTCGACCTTCTCGTGGTTTGTGGAGAGAGGACCCACTTCGTATAGCTTGGCTCTTGGCTTGATTATGTTGGCTATGGGTCTCACTCTGGAGCTGAAGGATTTGGTTAATTTGTTCATGCAGAGGCCACTTTCTATATTGTTTGGATGTGTAGCTCAGTACACGATTATGCCGGCTGCCGGAGCTCTTATTGGCAAGTTTTTTGGGCTTTCAACGTCGCTTTCGGTTGGTTTGATCTTGCTTTCATGTTGCCCTGGAGGGACAGCCTCCAATGTGGTAACCTTAATTGCTCAAGGCGACGTTCCTTTGTCTATTGTAATGACAGTATGCACCACTCTAGGAGCCGTAATTCTCACTCCATTTCTTACCAAATTACTAGCAGGAGCTTACATTCCAGTTGATGCTGCAAAGCTCTCTCTCAGCACCCTTCAGGTGGTGGTAGCTCCTATTCTATTAGGTTCCTACTTGCAGAAGGCGTTTCCTCAACTGGTGAAACTGGTGATACCATTTGCACCACTCGTTGCTGTCTTAACTTCGTCGCTGCTCGCTTGCAGTGTCTTCTCGGAGAACGTCGTTCGTTTCAAATCATCAATGGTTAATGCCTCATTAGCTTCTGATGCATCTCCATGGATGGTTATTAGAAGTATATTATCAGGAGAGTTGGGAACAGTCATACTTTCAGTGTTTTGTTTACACTTCGCGGGTTTCTTTGTCGGCTATATAGCAGCGAGTATCGGGGGGTTTCGAGAACGGGAACGAAGAGCTATATCCATAGAGGTTGGGATGCAGAATTCATCATTGGGAGTTGTTTTGGCGAGCTCACATTTCAGCTCAGCAATGGTGGCATTACCGGCAGCGATGTCAGCTGTGATAATGAACATAATGGGTAGCACTCTTGGGCTTTGTTGGAGATATATAGAACCTGCAGCTGATGAAGTGGAAGCCACTGGTAGTGTTGCCAACTGAAATTCAAGCCCTTTTTTTTAGTGTTTAATTTATCTCTTGTAAGTGTATATTTGTTCTTAGGGTTCAATTAACCCTTTTTTCGTAAAAGAATTTGACCTTTTTATCTTTGATTCTTCTGCTCGTTAAGGTATATGTCGATCAAATGTTCAAAATATTTTGAAACGTATTCT

Coding sequence (CDS)

ATGTCTTCTATTTTTCTGCAATTCACCCCCTTCTTGTCCCAATCTCTCCACACCCACCGCAAGCTCCGCCGTTACAGACCCCCCACCCCTCTTTTCCGGCCGTCAAAGCCCCCCGGACACCTCGCCGTCCGAGCACTTCAACGGAAAACAGAGCTTCCATCTCTTCCAACGCCGCCGCAGAAACCAACCCCTGTGGCCAAATTCGTGTCCACGGCGGCAGGGCTGTTCCCTCTGTATATCACGGCTGGTGGCGTTGTTGCGTGCCTGAAACCGTCGACGTTCTCGTGGTTCGTGCGGAGAGGGCCTGGTTCTTATAGCTTGGCTCTTGGGTTGGTTATGTTGGCCATGGGACTCACTCTGGAGCTCAAGGATTTGTTCACTCTGTTCATGGAGAGGCCGCTTTCTATATTATATGTATGTATAGCTCAATTCACAATTATGCCGGTGGTCGGAGCTCTCATCGGAAAATATCTTGGCCTTCCGCCGCCGCTTTCCGTCGGGTTGGTCTTGCTCGGATCCGTAATTCTCACTCCCTTTCTCACTAAAACGCTCGTTGGAGCTTTCGTTCCGGTCAACGCTTTGAAGCTGTCTCTCAGCACTCTTCAGGTGGTGGTAGCTCCCATTTTGTTGGGTTCTTACTTGCAGAAAGCTTATCCAAAGCTTGTGAAACGAATTGTGACATTTTCGCCACTTGTTGCTGTCTTAACTTCGTCACTTCTTGCTTGCAGTGTTTTTTCAGAGAATTTCGTTCGATTTAAATCATCACTTGTTAGTTCGACGTTGGGTTCCGACGGATCTTCATGGATTGGTGTTAAAACGATATTGTCCGGAGAGTTGGGAGCTGTAATACTTTCGGTCGGGATGCAAAATTCATCATTGGGAGTAGTATTGGCAACAGCCCATTTCAGTTCAGCAATGGTGGCATTGCCTCCAGCAATGTCAGCGGTGATAATGAACATGATGGGTAGCAGCCTCGGGTCTTTATGGAGAAATATTCCGCCCACTGTTGCAGAGGCTGCCCAAAAACCATTGATTCATTCACCAATTCTTTTCTTCGCAATCGGAGCCGCCGAATTCTGCACAATGGCTTCGATTTCTCTGCAATTCACCCCCTTCATTTCCCCTCTCCAACACCACCACCGCCGCCGTAATCTCCGCCTACACAGACCCAAAATTCCCTGTCTCTTGCCGCCGAACCTCCCCAGATTGCTCGCCGTCCGATCAGTTCAACGAAATAACGAAGACCCATCTCCCCTGCCCCCGGCAAAACCCTCCGGTTTGGATGATTTTCTTTCGACGGCGGCGAGTCTGTACCCTCTCTATGTGACGGTCGGCGGCGTCGTTGCTTGTCTGAAACCGTCGACCTTCTCGTGGTTTGTGGAGAGAGGACCCACTTCGTATAGCTTGGCTCTTGGCTTGATTATGTTGGCTATGGGTCTCACTCTGGAGCTGAAGGATTTGGTTAATTTGTTCATGCAGAGGCCACTTTCTATATTGTTTGGATGTGTAGCTCAGTACACGATTATGCCGGCTGCCGGAGCTCTTATTGGCAAGTTTTTTGGGCTTTCAACGTCGCTTTCGGTTGGTTTGATCTTGCTTTCATGTTGCCCTGGAGGGACAGCCTCCAATGTGGTAACCTTAATTGCTCAAGGCGACGTTCCTTTGTCTATTGTAATGACAGTATGCACCACTCTAGGAGCCGTAATTCTCACTCCATTTCTTACCAAATTACTAGCAGGAGCTTACATTCCAGTTGATGCTGCAAAGCTCTCTCTCAGCACCCTTCAGGTGGTGGTAGCTCCTATTCTATTAGGTTCCTACTTGCAGAAGGCGTTTCCTCAACTGGTGAAACTGGTGATACCATTTGCACCACTCGTTGCTGTCTTAACTTCGTCGCTGCTCGCTTGCAGTGTCTTCTCGGAGAACGTCGTTCGTTTCAAATCATCAATGGTTAATGCCTCATTAGCTTCTGATGCATCTCCATGGATGGTTATTAGAAGTATATTATCAGGAGAGTTGGGAACAGTCATACTTTCAGTGTTTTGTTTACACTTCGCGGGTTTCTTTGTCGGCTATATAGCAGCGAGTATCGGGGGGTTTCGAGAACGGGAACGAAGAGCTATATCCATAGAGGTTGGGATGCAGAATTCATCATTGGGAGTTGTTTTGGCGAGCTCACATTTCAGCTCAGCAATGGTGGCATTACCGGCAGCGATGTCAGCTGTGATAATGAACATAATGGGTAGCACTCTTGGGCTTTGTTGGAGATATATAGAACCTGCAGCTGATGAAGTGGAAGCCACTGGTAGTGTTGCCAACTGA

Protein sequence

MSSIFLQFTPFLSQSLHTHRKLRRYRPPTPLFRPSKPPGHLAVRALQRKTELPSLPTPPQKPTPVAKFVSTAAGLFPLYITAGGVVACLKPSTFSWFVRRGPGSYSLALGLVMLAMGLTLELKDLFTLFMERPLSILYVCIAQFTIMPVVGALIGKYLGLPPPLSVGLVLLGSVILTPFLTKTLVGAFVPVNALKLSLSTLQVVVAPILLGSYLQKAYPKLVKRIVTFSPLVAVLTSSLLACSVFSENFVRFKSSLVSSTLGSDGSSWIGVKTILSGELGAVILSVGMQNSSLGVVLATAHFSSAMVALPPAMSAVIMNMMGSSLGSLWRNIPPTVAEAAQKPLIHSPILFFAIGAAEFCTMASISLQFTPFISPLQHHHRRRNLRLHRPKIPCLLPPNLPRLLAVRSVQRNNEDPSPLPPAKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLTLELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIGKFFGLSTSLSVGLILLSCCPGGTASNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQVVVAPILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASLASDASPWMVIRSILSGELGTVILSVFCLHFAGFFVGYIAASIGGFRERERRAISIEVGMQNSSLGVVLASSHFSSAMVALPAAMSAVIMNIMGSTLGLCWRYIEPAADEVEATGSVAN
Homology
BLAST of Cp4.1LG14g00520 vs. ExPASy Swiss-Prot
Match: Q5VRB2 (Probable sodium/metabolite cotransporter BASS2, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=BASS2 PE=2 SV=1)

HSP 1 Score: 218.4 bits (555), Expect = 2.8e-55
Identity = 135/343 (39.36%), Postives = 201/343 (58.60%), Query Frame = 0

Query: 417 SPLPPAKPSGLDDF---LSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGL 476
           S LP + PS  + +   +    +L+P++V +G ++   KPS  +W        +++ LG 
Sbjct: 90  SNLPESIPSEANQYEKIVELLTTLFPVWVILGTIIGIYKPSMVTWL---ETDLFTVGLGF 149

Query: 477 IMLAMGLTLELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIGKFFGLSTSLSVGLILL 536
           +ML+MGLTL  +D     M+ P ++  G +AQY I P  G  I     LS  L+ GLIL+
Sbjct: 150 LMLSMGLTLTFEDF-RRCMRNPWTVGVGFLAQYLIKPMLGFAIAMTLKLSAPLATGLILV 209

Query: 537 SCCPGGTASNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLS 596
           SCCPGG ASNV T I++G+V LS++MT C+T+GA+++TP LTKLLAG  +PVDAA L++S
Sbjct: 210 SCCPGGQASNVATYISKGNVALSVLMTTCSTIGAIVMTPLLTKLLAGQLVPVDAAGLAIS 269

Query: 597 TLQVVVAPILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNA 656
           T QVV+ P ++G    + FP+  + +I   PL+ VL ++LL                   
Sbjct: 270 TFQVVLLPTIVGVLAHEYFPKFTERIISITPLIGVLLTTLLC------------------ 329

Query: 657 SLASDASPWMVIRSILSGELGTVILSVFCLHFAGFFVGYIAASIGGFRERERRAISIEVG 716
                ASP   +  +L  + G +I+ V  LH A F +GY  + +  F E   R ISIE G
Sbjct: 330 -----ASPIGQVSEVLKAQGGQLIIPVALLHVAAFALGYWLSKVSSFGESTSRTISIECG 389

Query: 717 MQNSSLGVVLASSHFSSAMVALPAAMSAVIMNIMGSTLGLCWR 757
           MQ+S+LG +LA  HF++ +VA+P+A+S V M + GS L + WR
Sbjct: 390 MQSSALGFLLAQKHFTNPLVAVPSAVSVVCMALGGSALAVFWR 405

BLAST of Cp4.1LG14g00520 vs. ExPASy Swiss-Prot
Match: Q1EBV7 (Sodium/pyruvate cotransporter BASS2, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=BASS2 PE=1 SV=1)

HSP 1 Score: 217.6 bits (553), Expect = 4.9e-55
Identity = 140/349 (40.11%), Postives = 201/349 (57.59%), Query Frame = 0

Query: 419 LPPAKPSGLDDF---LSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIM 478
           LP + P  L  +   +    +L+PL+V +G +V   KPS  +W        ++L LG +M
Sbjct: 83  LPESTPKELSQYEKIIELLTTLFPLWVILGTLVGIFKPSLVTWL---ETDLFTLGLGFLM 142

Query: 479 LAMGLTLELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIGKFFGLSTSLSVGLILLSC 538
           L+MGLTL  +D     ++ P ++  G +AQY I P  G LI     LS  L+ GLIL+SC
Sbjct: 143 LSMGLTLTFEDF-RRCLRNPWTVGVGFLAQYMIKPILGFLIAMTLKLSAPLATGLILVSC 202

Query: 539 CPGGTASNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTL 598
           CPGG ASNV T I++G+V LS++MT C+T+GA+I+TP LTKLLAG  +PVDAA L+LST 
Sbjct: 203 CPGGQASNVATYISKGNVALSVLMTTCSTIGAIIMTPLLTKLLAGQLVPVDAAGLALSTF 262

Query: 599 QVVVAPILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASL 658
           QVV+ P ++G    + FP+    +I   PL+ V+ ++LL                     
Sbjct: 263 QVVLVPTIIGVLANEFFPKFTSKIITVTPLIGVILTTLLC-------------------- 322

Query: 659 ASDASPWMVIRSILSGELGTVILSVFCLHFAGFFVGYIAASIGGFRERERRAISIEVGMQ 718
              ASP   +  +L  +   +IL V  LH A F +GY  +    F E   R ISIE GMQ
Sbjct: 323 ---ASPIGQVADVLKTQGAQLILPVALLHAAAFAIGYWISKF-SFGESTSRTISIECGMQ 382

Query: 719 NSSLGVVLASSHFSSAMVALPAAMSAVIMNIMGSTLGLCWRYIEPAADE 765
           +S+LG +LA  HF++ +VA+P+A+S V M + GS L + WR +   AD+
Sbjct: 383 SSALGFLLAQKHFTNPLVAVPSAVSVVCMALGGSGLAVFWRNLPIPADD 403

BLAST of Cp4.1LG14g00520 vs. ExPASy Swiss-Prot
Match: Q93YR2 (Probable sodium/metabolite cotransporter BASS1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=BASS1 PE=2 SV=1)

HSP 1 Score: 206.5 bits (524), Expect = 1.1e-51
Identity = 136/356 (38.20%), Postives = 199/356 (55.90%), Query Frame = 0

Query: 409 VQRNNEDPSPLPPAKPSGLDDFL----STAASLYPLYVTVGGVVACLKPSTFSWFVERGP 468
           V R     + LP  K     +++       ++ +P++V++G ++  ++PSTF+W     P
Sbjct: 68  VPRCGISSNDLPTEKKKSFGEWVEFVGEAVSTAFPIWVSLGCLLGLMRPSTFNWVT---P 127

Query: 469 TSYSLALGLIMLAMGLTLELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIGKFFGLST 528
               + L + ML MG+TL L DL    +  P  +  G + QY++MP +   + K   L  
Sbjct: 128 NWTIVGLTITMLGMGMTLTLDDLRGA-LSMPKELFAGFLLQYSVMPLSAFFVSKLLNLPP 187

Query: 529 SLSVGLILLSCCPGGTASNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIP 588
             + GLIL+ CCPGGTASN+VT IA+G+V LS++MT  +T+ AVI+TP LT  LA  YI 
Sbjct: 188 HYAAGLILVGCCPGGTASNIVTYIARGNVALSVLMTAASTVSAVIMTPLLTAKLAKQYIT 247

Query: 589 VDAAKLSLSTLQVVVAPILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVV 648
           VDA  L +STLQVV+ P+L G++L + F +LVK V P  P +AV T ++L      +N  
Sbjct: 248 VDALGLLMSTLQVVLLPVLAGAFLNQYFKKLVKFVSPVMPPIAVGTVAILCGYAIGQNAS 307

Query: 649 RFKSSMVNASLASDASPWMVIRSILSGELGTVILSVFCLHFAGFFVGYIAASIGGFRERE 708
                                  ++SG+   V+L+   LH +GF  GY+ + I G     
Sbjct: 308 AI---------------------LMSGK--QVVLASCLLHISGFLFGYLFSRILGIDVAS 367

Query: 709 RRAISIEVGMQNSSLGVVLASSHFSSAMVALPAAMSAVIMNIMGSTLGLCWRYIEP 761
            R ISIEVGMQNS LGVVLA+ HF + + A+P A+S+V  +I+GS L   WR   P
Sbjct: 368 SRTISIEVGMQNSVLGVVLATQHFGNPLTAVPCAVSSVCHSILGSVLAGIWRRSAP 396

BLAST of Cp4.1LG14g00520 vs. ExPASy Swiss-Prot
Match: Q7XVB3 (Probable sodium/metabolite cotransporter BASS1, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=BASS1 PE=2 SV=2)

HSP 1 Score: 204.9 bits (520), Expect = 3.3e-51
Identity = 133/323 (41.18%), Postives = 184/323 (56.97%), Query Frame = 0

Query: 438 YPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLTLELKDLVNLFMQRPLS 497
           +P++V     VA  +P  F W     P +  + +   ML MG+TL L DL    +  P  
Sbjct: 106 FPVWVASACAVALWRPPAFLWV---SPMAQIVGISFTMLGMGMTLTLDDLKTALLM-PKE 165

Query: 498 ILFGCVAQYTIMPAAGALIGKFFGLSTSLSVGLILLSCCPGGTASNVVTLIAQGDVPLSI 557
           +  G + QY++MP +G LI K   L +  + GLIL+SCCPGGTASN+VT +A+G+V LS+
Sbjct: 166 LASGFLLQYSVMPLSGFLISKLLNLPSYYAAGLILVSCCPGGTASNIVTYLARGNVALSV 225

Query: 558 VMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQVVVAPILLGSYLQKAFPQLVK 617
           +MT  +T  A  LTP LT  LAG Y+ VD   L +ST QVV+AP+LLG+ L +    LV+
Sbjct: 226 LMTAASTFAAAFLTPLLTSKLAGQYVAVDPMGLFVSTSQVVLAPVLLGALLNQYCNGLVQ 285

Query: 618 LVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASLASDASPWMVIRSILSGELGTVI 677
           LV P  P +AV T ++L  +  ++N                        +ILS  L  V+
Sbjct: 286 LVSPLMPFIAVATVAVLCGNAIAQNA----------------------SAILSSGL-QVV 345

Query: 678 LSVFCLHFAGFFVGYIAASIGGFRERERRAISIEVGMQNSSLGVVLASSHFSSAMVALPA 737
           +SV  LH +GFF GY+ +   G      R ISIEVGMQNS LGVVLAS HF + + A+P 
Sbjct: 346 MSVCWLHASGFFFGYVLSRTIGIDISSSRTISIEVGMQNSVLGVVLASKHFGNPLTAVPC 401

Query: 738 AMSAVIMNIMGSTLGLCWRYIEP 761
           A+S+V  ++ GS L   WR + P
Sbjct: 406 AVSSVCHSVYGSLLAGIWRSLPP 401

BLAST of Cp4.1LG14g00520 vs. ExPASy Swiss-Prot
Match: Q6K739 (Probable sodium/metabolite cotransporter BASS3, chloroplastic OS=Oryza sativa subsp. japonica OX=39947 GN=BASS3 PE=2 SV=1)

HSP 1 Score: 159.1 bits (401), Expect = 2.0e-37
Identity = 119/341 (34.90%), Postives = 177/341 (51.91%), Query Frame = 0

Query: 418 PLPPAKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLA 477
           P+     +   D     ++L PL V    V A   P+TFSW  +     Y+ ALG IML+
Sbjct: 88  PVSSPSSAAAGDPSQALSALLPLVVAATAVAALGNPATFSWVSKE---YYAPALGGIMLS 147

Query: 478 MGLTLELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIGKFFGLSTSLSVGLILLSCCP 537
           +G+ L + D   L  +RP+ +  G +AQY + P  G LI + FG+ ++   G +L  C  
Sbjct: 148 IGIKLSIDDFA-LAFKRPVPLTIGYMAQYIVKPLMGVLIARAFGMPSAFFAGFVLTCCVS 207

Query: 538 GGTASNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQV 597
           G   S+  + +++GDV LSI++T C+T+ +V++TP LT LL G+ +PVD   ++ S LQV
Sbjct: 208 GAQLSSYASFLSKGDVALSILLTSCSTISSVVVTPVLTGLLIGSVVPVDGIAMAKSILQV 267

Query: 598 VVAPILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASLAS 657
           V+ P+ LG  L      +V ++ P  P VA+L +SL                        
Sbjct: 268 VLVPVTLGLLLNTYAKAVVNVIQPVMPFVAMLCTSLCI---------------------- 327

Query: 658 DASPWMVIRS-ILSGELGTVILSVFCLHFAGFFVGYIAASIGGFRERER--RAISIEVGM 717
             SP  + RS ILS E   ++L +   H A F VGY  + +   R+ E   R IS+  GM
Sbjct: 328 -GSPLAINRSKILSSEGFLLLLPIVTFHIAAFIVGYWISKLPMLRQEEPVCRTISVCTGM 387

Query: 718 QNSSLGVVLASSHFSSAMVALPAAMSAVIMNIMGSTLGLCW 756
           Q+S+L  +LA+    S+  A+PAA S VIM I G TL   W
Sbjct: 388 QSSTLAGLLATQFLGSSQ-AVPAACSVVIMAIFGLTLASYW 400

BLAST of Cp4.1LG14g00520 vs. NCBI nr
Match: XP_023551878.1 (probable sodium/metabolite cotransporter BASS2, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 776 bits (2005), Expect = 1.91e-276
Identity = 412/412 (100.00%), Postives = 412/412 (100.00%), Query Frame = 0

Query: 362 MASISLQFTPFISPLQHHHRRRNLRLHRPKIPCLLPPNLPRLLAVRSVQRNNEDPSPLPP 421
           MASISLQFTPFISPLQHHHRRRNLRLHRPKIPCLLPPNLPRLLAVRSVQRNNEDPSPLPP
Sbjct: 1   MASISLQFTPFISPLQHHHRRRNLRLHRPKIPCLLPPNLPRLLAVRSVQRNNEDPSPLPP 60

Query: 422 AKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLT 481
           AKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLT
Sbjct: 61  AKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLT 120

Query: 482 LELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIGKFFGLSTSLSVGLILLSCCPGGTA 541
           LELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIGKFFGLSTSLSVGLILLSCCPGGTA
Sbjct: 121 LELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIGKFFGLSTSLSVGLILLSCCPGGTA 180

Query: 542 SNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQVVVAP 601
           SNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQVVVAP
Sbjct: 181 SNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQVVVAP 240

Query: 602 ILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASLASDASP 661
           ILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASLASDASP
Sbjct: 241 ILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASLASDASP 300

Query: 662 WMVIRSILSGELGTVILSVFCLHFAGFFVGYIAASIGGFRERERRAISIEVGMQNSSLGV 721
           WMVIRSILSGELGTVILSVFCLHFAGFFVGYIAASIGGFRERERRAISIEVGMQNSSLGV
Sbjct: 301 WMVIRSILSGELGTVILSVFCLHFAGFFVGYIAASIGGFRERERRAISIEVGMQNSSLGV 360

Query: 722 VLASSHFSSAMVALPAAMSAVIMNIMGSTLGLCWRYIEPAADEVEATGSVAN 773
           VLASSHFSSAMVALPAAMSAVIMNIMGSTLGLCWRYIEPAADEVEATGSVAN
Sbjct: 361 VLASSHFSSAMVALPAAMSAVIMNIMGSTLGLCWRYIEPAADEVEATGSVAN 412

BLAST of Cp4.1LG14g00520 vs. NCBI nr
Match: XP_022929395.1 (probable sodium/metabolite cotransporter BASS2, chloroplastic [Cucurbita moschata])

HSP 1 Score: 761 bits (1966), Expect = 1.54e-270
Identity = 407/412 (98.79%), Postives = 408/412 (99.03%), Query Frame = 0

Query: 362 MASISLQFTPFISPLQHHHRRRNLRLHRPKIPCLLPPNLPRLLAVRSVQRNNEDPSPLPP 421
           MASISLQFTPFISPL HHHRR NLRLHRPKIPCLLPP LPRLLAVRSVQRNNE PSPLPP
Sbjct: 1   MASISLQFTPFISPLHHHHRR-NLRLHRPKIPCLLPPKLPRLLAVRSVQRNNEYPSPLPP 60

Query: 422 AKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLT 481
           AKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLT
Sbjct: 61  AKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLT 120

Query: 482 LELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIGKFFGLSTSLSVGLILLSCCPGGTA 541
           LELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIGKFFGLSTSLSVGLILLSCCPGGTA
Sbjct: 121 LELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIGKFFGLSTSLSVGLILLSCCPGGTA 180

Query: 542 SNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQVVVAP 601
           SNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQVVVAP
Sbjct: 181 SNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQVVVAP 240

Query: 602 ILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASLASDASP 661
           ILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASLASDASP
Sbjct: 241 ILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASLASDASP 300

Query: 662 WMVIRSILSGELGTVILSVFCLHFAGFFVGYIAASIGGFRERERRAISIEVGMQNSSLGV 721
           WMVIRSILSGELGTVILSVFCLHFAGFFVGYIAASIGGFRERERRAISIEVGMQNSSLGV
Sbjct: 301 WMVIRSILSGELGTVILSVFCLHFAGFFVGYIAASIGGFRERERRAISIEVGMQNSSLGV 360

Query: 722 VLASSHFSSAMVALPAAMSAVIMNIMGSTLGLCWRYIEPAADEVEATGSVAN 773
           VLASSHFSSAMVALPAAMSAV+MNIMGSTLGLCWRYIEPAADEVEATGSVAN
Sbjct: 361 VLASSHFSSAMVALPAAMSAVLMNIMGSTLGLCWRYIEPAADEVEATGSVAN 411

BLAST of Cp4.1LG14g00520 vs. NCBI nr
Match: KAG6577371.1 (putative sodium/metabolite cotransporter BASS1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 755 bits (1949), Expect = 5.87e-268
Identity = 405/412 (98.30%), Postives = 405/412 (98.30%), Query Frame = 0

Query: 362 MASISLQFTPFISPLQHHHRRRNLRLHRPKIPCLLPPNLPRLLAVRSVQRNNEDPSPLPP 421
           MASISLQFTPFISPL HHHRR NLRLHRPKIPCLLPP LPRLLAVRSVQRNNE PSPLPP
Sbjct: 1   MASISLQFTPFISPLHHHHRR-NLRLHRPKIPCLLPPKLPRLLAVRSVQRNNEYPSPLPP 60

Query: 422 AKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLT 481
           AKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLT
Sbjct: 61  AKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLT 120

Query: 482 LELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIGKFFGLSTSLSVGLILLSCCPGGTA 541
           LELKDLVNLF QRPLSILFGCVAQYTIMPAAGALIGK FGLSTSLSVGLILLSCCPGGTA
Sbjct: 121 LELKDLVNLFRQRPLSILFGCVAQYTIMPAAGALIGKIFGLSTSLSVGLILLSCCPGGTA 180

Query: 542 SNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQVVVAP 601
           SNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQVVVAP
Sbjct: 181 SNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQVVVAP 240

Query: 602 ILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASLASDASP 661
           ILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASLASDASP
Sbjct: 241 ILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASLASDASP 300

Query: 662 WMVIRSILSGELGTVILSVFCLHFAGFFVGYIAASIGGFRERERRAISIEVGMQNSSLGV 721
           WMVIRSILSGELGTVILSVFCLHFAGFFVGYIAASIGGFRERERRAISIEVGMQNSSLGV
Sbjct: 301 WMVIRSILSGELGTVILSVFCLHFAGFFVGYIAASIGGFRERERRAISIEVGMQNSSLGV 360

Query: 722 VLASSHFSSAMVALPAAMSAVIMNIMGSTLGLCWRYIEPAADEVEATGSVAN 773
           VLASSHFSSAMVALPAAMSAVIMNIMGSTLGLCWRYIEPAA EVEATGSVAN
Sbjct: 361 VLASSHFSSAMVALPAAMSAVIMNIMGSTLGLCWRYIEPAAHEVEATGSVAN 411

BLAST of Cp4.1LG14g00520 vs. NCBI nr
Match: XP_022985149.1 (probable sodium/metabolite cotransporter BASS2, chloroplastic [Cucurbita maxima])

HSP 1 Score: 747 bits (1929), Expect = 5.92e-265
Identity = 399/412 (96.84%), Postives = 404/412 (98.06%), Query Frame = 0

Query: 362 MASISLQFTPFISPLQHHHRRRNLRLHRPKIPCLLPPNLPRLLAVRSVQRNNEDPSPLPP 421
           MASISLQFTPFISPL H   RRNLRLHRPKIPCLLPP LPRLLAVRSVQRNNE PSPLPP
Sbjct: 1   MASISLQFTPFISPLHH---RRNLRLHRPKIPCLLPPKLPRLLAVRSVQRNNEYPSPLPP 60

Query: 422 AKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLT 481
           AKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLT
Sbjct: 61  AKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLT 120

Query: 482 LELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIGKFFGLSTSLSVGLILLSCCPGGTA 541
           LELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIGKFFGLST LSVGLILLSCCPGGTA
Sbjct: 121 LELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIGKFFGLSTPLSVGLILLSCCPGGTA 180

Query: 542 SNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQVVVAP 601
           SNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQVVVAP
Sbjct: 181 SNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQVVVAP 240

Query: 602 ILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASLASDASP 661
           ILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASLASDASP
Sbjct: 241 ILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASLASDASP 300

Query: 662 WMVIRSILSGELGTVILSVFCLHFAGFFVGYIAASIGGFRERERRAISIEVGMQNSSLGV 721
           WM+IRSILSGELG V+LSVFCLHFAGFFVGY+AASIGGFRERERRAISIEVGMQNSSLGV
Sbjct: 301 WMIIRSILSGELGMVVLSVFCLHFAGFFVGYVAASIGGFRERERRAISIEVGMQNSSLGV 360

Query: 722 VLASSHFSSAMVALPAAMSAVIMNIMGSTLGLCWRYIEPAADEVEATGSVAN 773
           VLASSHFSSAMVALPAAMSAVIMNIMGSTLGLCWRYIEP+ADEVEA+GSVAN
Sbjct: 361 VLASSHFSSAMVALPAAMSAVIMNIMGSTLGLCWRYIEPSADEVEASGSVAN 409

BLAST of Cp4.1LG14g00520 vs. NCBI nr
Match: KAG7027692.1 (putative sodium/metabolite cotransporter BASS1, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 742 bits (1915), Expect = 2.52e-262
Identity = 406/441 (92.06%), Postives = 406/441 (92.06%), Query Frame = 0

Query: 362 MASISLQFTPFISPLQHHHRRRNLRLHRPKIPCLLPPNLPRLLAVRSVQRNNEDPSPLPP 421
           MASISLQFTPFISPL HHHRR NLRLHRPKIPCLLPP LPRLLAVRSVQRNNE PSPLPP
Sbjct: 1   MASISLQFTPFISPLHHHHRR-NLRLHRPKIPCLLPPKLPRLLAVRSVQRNNEYPSPLPP 60

Query: 422 AKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLT 481
           AKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLT
Sbjct: 61  AKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLT 120

Query: 482 LELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIGKFFGLSTSLSVGLILLSCCPGGTA 541
           LELKDLVNLF QRPLSILFGCVAQYTIMPAAGALIGKFFGLSTSLSVGLILLSCCPGGTA
Sbjct: 121 LELKDLVNLFRQRPLSILFGCVAQYTIMPAAGALIGKFFGLSTSLSVGLILLSCCPGGTA 180

Query: 542 SNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQV---- 601
           SNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQV    
Sbjct: 181 SNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQVKFLR 240

Query: 602 -------------------------VVAPILLGSYLQKAFPQLVKLVIPFAPLVAVLTSS 661
                                    VVAPILLGSYLQKAFPQLVKLVIPFAPLVAVLTSS
Sbjct: 241 VPVPVRVRVHVPVHVPVHVHVHVHVVVAPILLGSYLQKAFPQLVKLVIPFAPLVAVLTSS 300

Query: 662 LLACSVFSENVVRFKSSMVNASLASDASPWMVIRSILSGELGTVILSVFCLHFAGFFVGY 721
           LLACSVFSENVVRFKSSMVNASLASDASPWMVIRSILSGELGTVILSVFCLHFAGFFVGY
Sbjct: 301 LLACSVFSENVVRFKSSMVNASLASDASPWMVIRSILSGELGTVILSVFCLHFAGFFVGY 360

Query: 722 IAASIGGFRERERRAISIEVGMQNSSLGVVLASSHFSSAMVALPAAMSAVIMNIMGSTLG 773
           IAASIGGFRERERRAISIEVGMQNSSLGVVLASSHFSSAMVALPAAMSAVIMNIMGSTLG
Sbjct: 361 IAASIGGFRERERRAISIEVGMQNSSLGVVLASSHFSSAMVALPAAMSAVIMNIMGSTLG 420

BLAST of Cp4.1LG14g00520 vs. ExPASy TrEMBL
Match: A0A6J1EUA3 (probable sodium/metabolite cotransporter BASS2, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111435979 PE=3 SV=1)

HSP 1 Score: 761 bits (1966), Expect = 7.45e-271
Identity = 407/412 (98.79%), Postives = 408/412 (99.03%), Query Frame = 0

Query: 362 MASISLQFTPFISPLQHHHRRRNLRLHRPKIPCLLPPNLPRLLAVRSVQRNNEDPSPLPP 421
           MASISLQFTPFISPL HHHRR NLRLHRPKIPCLLPP LPRLLAVRSVQRNNE PSPLPP
Sbjct: 1   MASISLQFTPFISPLHHHHRR-NLRLHRPKIPCLLPPKLPRLLAVRSVQRNNEYPSPLPP 60

Query: 422 AKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLT 481
           AKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLT
Sbjct: 61  AKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLT 120

Query: 482 LELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIGKFFGLSTSLSVGLILLSCCPGGTA 541
           LELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIGKFFGLSTSLSVGLILLSCCPGGTA
Sbjct: 121 LELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIGKFFGLSTSLSVGLILLSCCPGGTA 180

Query: 542 SNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQVVVAP 601
           SNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQVVVAP
Sbjct: 181 SNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQVVVAP 240

Query: 602 ILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASLASDASP 661
           ILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASLASDASP
Sbjct: 241 ILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASLASDASP 300

Query: 662 WMVIRSILSGELGTVILSVFCLHFAGFFVGYIAASIGGFRERERRAISIEVGMQNSSLGV 721
           WMVIRSILSGELGTVILSVFCLHFAGFFVGYIAASIGGFRERERRAISIEVGMQNSSLGV
Sbjct: 301 WMVIRSILSGELGTVILSVFCLHFAGFFVGYIAASIGGFRERERRAISIEVGMQNSSLGV 360

Query: 722 VLASSHFSSAMVALPAAMSAVIMNIMGSTLGLCWRYIEPAADEVEATGSVAN 773
           VLASSHFSSAMVALPAAMSAV+MNIMGSTLGLCWRYIEPAADEVEATGSVAN
Sbjct: 361 VLASSHFSSAMVALPAAMSAVLMNIMGSTLGLCWRYIEPAADEVEATGSVAN 411

BLAST of Cp4.1LG14g00520 vs. ExPASy TrEMBL
Match: A0A6J1JCH6 (probable sodium/metabolite cotransporter BASS2, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111483238 PE=3 SV=1)

HSP 1 Score: 747 bits (1929), Expect = 2.87e-265
Identity = 399/412 (96.84%), Postives = 404/412 (98.06%), Query Frame = 0

Query: 362 MASISLQFTPFISPLQHHHRRRNLRLHRPKIPCLLPPNLPRLLAVRSVQRNNEDPSPLPP 421
           MASISLQFTPFISPL H   RRNLRLHRPKIPCLLPP LPRLLAVRSVQRNNE PSPLPP
Sbjct: 1   MASISLQFTPFISPLHH---RRNLRLHRPKIPCLLPPKLPRLLAVRSVQRNNEYPSPLPP 60

Query: 422 AKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLT 481
           AKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLT
Sbjct: 61  AKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLT 120

Query: 482 LELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIGKFFGLSTSLSVGLILLSCCPGGTA 541
           LELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIGKFFGLST LSVGLILLSCCPGGTA
Sbjct: 121 LELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIGKFFGLSTPLSVGLILLSCCPGGTA 180

Query: 542 SNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQVVVAP 601
           SNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQVVVAP
Sbjct: 181 SNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQVVVAP 240

Query: 602 ILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASLASDASP 661
           ILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASLASDASP
Sbjct: 241 ILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASLASDASP 300

Query: 662 WMVIRSILSGELGTVILSVFCLHFAGFFVGYIAASIGGFRERERRAISIEVGMQNSSLGV 721
           WM+IRSILSGELG V+LSVFCLHFAGFFVGY+AASIGGFRERERRAISIEVGMQNSSLGV
Sbjct: 301 WMIIRSILSGELGMVVLSVFCLHFAGFFVGYVAASIGGFRERERRAISIEVGMQNSSLGV 360

Query: 722 VLASSHFSSAMVALPAAMSAVIMNIMGSTLGLCWRYIEPAADEVEATGSVAN 773
           VLASSHFSSAMVALPAAMSAVIMNIMGSTLGLCWRYIEP+ADEVEA+GSVAN
Sbjct: 361 VLASSHFSSAMVALPAAMSAVIMNIMGSTLGLCWRYIEPSADEVEASGSVAN 409

BLAST of Cp4.1LG14g00520 vs. ExPASy TrEMBL
Match: A0A7J7H4H3 (Uncharacterized protein OS=Camellia sinensis OX=4442 GN=HYC85_017339 PE=3 SV=1)

HSP 1 Score: 671 bits (1730), Expect = 1.17e-230
Identity = 404/690 (58.55%), Postives = 476/690 (68.99%), Query Frame = 0

Query: 136 ILYVCIAQFTIMPVVGALIGKYLGLPPPLSVGLVLL------------------------ 195
           IL+  +AQ+TIMP  G +I K LGLPP +SVGL+LL                        
Sbjct: 19  ILFGFVAQYTIMPSFGWIISKTLGLPPAVSVGLILLACCPGGTASNVVTLIAQGDVPLSI 78

Query: 196 --------GSVILTPFLTKTLVGAFVPVNALKLSLSTLQVVVAPILLGSYLQKAYPKLVK 255
                   G+VILTP LTK L G FVPV+A+KLS+ST+QVVVAPILLGSY+Q  +P  VK
Sbjct: 79  VMTVCTTLGAVILTPLLTKILAGTFVPVDAVKLSISTMQVVVAPILLGSYMQSTFPAAVK 138

Query: 256 RIVTFSPLVAVLTSSLLACSVFSENFVRFKSSLVSSTLGSDGSSWIGVKTILSGELGAVI 315
            +  F PL+AVLTSSLLACSVFSEN VR KSS+V +++ S+ S  +  + ILSGELG +I
Sbjct: 139 VVTPFGPLLAVLTSSLLACSVFSENVVRLKSSVVVASVSSEASPLLHARAILSGELGVII 198

Query: 316 LSV--------------------------------GMQNSSLGVVLATAHFSSAMVALPP 375
           LSV                                GMQNSSLGVVLAT+HF+S MVALPP
Sbjct: 199 LSVLLLHFAGFFVGYISAAISGFREPQRRAISIEVGMQNSSLGVVLATSHFTSPMVALPP 258

Query: 376 AMSAVIMNMMGSSLGSLWRNIPPTVAEAAQKPLIHSPILFFAIGAAEFCTMASISLQFTP 435
           A+SAV+MN+MGSSLG  WR + P+ ++   K     PI    I                 
Sbjct: 259 ALSAVLMNIMGSSLGFFWRYVDPSDSQTTPKDY-DQPIKPTTIST--------------- 318

Query: 436 FISPLQHHHRRRNLRLHRPKIPCLLPPNLPRLLAVRSVQRNNEDPSPLPPAKPSGLDDFL 495
                                    PP L  L  +RS  +N + P P+  +KP   ++ L
Sbjct: 319 -------------------------PPKLSNLPTIRSGHKNFDHP-PVATSKPRW-ENML 378

Query: 496 STAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLTLELKDLVNLF 555
           STAASLYPLYVTVG VVACL+PSTFSWFV+RGPTSYSLALG IMLAMGL+LEL +  +LF
Sbjct: 379 STAASLYPLYVTVGAVVACLRPSTFSWFVKRGPTSYSLALGFIMLAMGLSLELSEFFSLF 438

Query: 556 MQRPLSILFGCVAQYTIMPAAGALIGKFFGLSTSLSVGLILLSCCPGGTASNVVTLIAQG 615
           +QRPLSILFGCVAQ+TIMP  G +I K  GLS ++SVGLILL+CCPGGTA   VTLIAQG
Sbjct: 439 LQRPLSILFGCVAQFTIMPTFGFIISKALGLSPAVSVGLILLACCPGGTA---VTLIAQG 498

Query: 616 DVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQVVVAPILLGSYLQKA 675
           DV LSIVMT CTTLGAVILTP LTK+LAGAY+ VDA KLS+ST+QVVVAPILLGSY+Q+ 
Sbjct: 499 DVTLSIVMTACTTLGAVILTPLLTKILAGAYVHVDAVKLSISTMQVVVAPILLGSYMQRT 558

Query: 676 FPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASLASDASPWMVIRSILSG 735
           FP  VKLV PF PL+AVL +SLLACSVFSENVVR KSS V ASL+ +ASP +  R+ILSG
Sbjct: 559 FPAAVKLVTPFGPLLAVLAASLLACSVFSENVVRLKSSTVVASLSPEASPLLRARAILSG 618

Query: 736 ELGTVILSVFCLHFAGFFVGYIAASIGGFRERERRAISIEVGMQNSSLGVVLASSHFSSA 761
           ELG +ILSV  LH AGFFVGYI+A+I GFRE +RRAISIEVGMQNSSLGVVLA+SHF+S 
Sbjct: 619 ELGVIILSVLLLHAAGFFVGYISAAISGFREPQRRAISIEVGMQNSSLGVVLATSHFASP 662

BLAST of Cp4.1LG14g00520 vs. ExPASy TrEMBL
Match: A0A0A0L3A1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G106000 PE=3 SV=1)

HSP 1 Score: 615 bits (1587), Expect = 1.70e-213
Identity = 333/407 (81.82%), Postives = 362/407 (88.94%), Query Frame = 0

Query: 362 MASISLQFTPFISPLQHHHRRRNLRLHRPKIPCLLPPNLPRLLAVRSVQRNNEDPSPLPP 421
           M  ISLQ TPFISPL H   R NLRLHRP IP L PP   R L VRSVQ+NNE PSP PP
Sbjct: 1   MPPISLQLTPFISPLIH---RPNLRLHRPPIPPLSPP---RSLTVRSVQQNNEHPSPSPP 60

Query: 422 AKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLT 481
            KP+GLDDFLSTAASLYPLYVT GG+VACL+PSTFSWFV+RGP+SYSL+LGLIMLAMGLT
Sbjct: 61  PKPTGLDDFLSTAASLYPLYVTAGGIVACLEPSTFSWFVQRGPSSYSLSLGLIMLAMGLT 120

Query: 482 LELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIGKFFGLSTSLSVGLILLSCCPGGTA 541
           LE+KDL NLFMQRPLSILFGCVAQYTIMPA+  LIGK  GLS SL  GL+LL CCPGG+A
Sbjct: 121 LEIKDLFNLFMQRPLSILFGCVAQYTIMPASAVLIGKLLGLSQSLLFGLVLLGCCPGGSA 180

Query: 542 SNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQVVVAP 601
           SNVVTLIAQGDVPLSIVMTVCTTLGAVI TPFLTK L GAYIPVDAA+LSLSTLQVVVAP
Sbjct: 181 SNVVTLIAQGDVPLSIVMTVCTTLGAVIFTPFLTKFLVGAYIPVDAAQLSLSTLQVVVAP 240

Query: 602 ILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASLASDASP 661
           ILLGS LQKAFP LVKLV+PFAPLVAVLTSSLLA SVFSENV+R KSSMV+A+LASDAS 
Sbjct: 241 ILLGSCLQKAFPSLVKLVLPFAPLVAVLTSSLLASSVFSENVIRIKSSMVSATLASDASL 300

Query: 662 WMVIRSILSGELGTVILSVFCLHFAGFFVGYIAASIGGFRERERRAISIEVGMQNSSLGV 721
           W V++SILSGELG VILSVFCLHFAGFFVGYIAA+I GFRERERR IS++VGMQNSSLGV
Sbjct: 301 WTVLKSILSGELGVVILSVFCLHFAGFFVGYIAAAICGFRERERRTISMQVGMQNSSLGV 360

Query: 722 VLASSHFSSAMVALPAAMSAVIMNIMGSTLGLCWRYIEPAADEVEAT 768
           VLA+SHFSSAMVALP A+SAVIMN+MGSTLG CW+YI+P+ DEV+ +
Sbjct: 361 VLAASHFSSAMVALPPAISAVIMNMMGSTLGFCWKYIQPS-DEVKTS 400

BLAST of Cp4.1LG14g00520 vs. ExPASy TrEMBL
Match: A0A5D3BTX1 (Putative sodium/metabolite cotransporter BASS1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold18G00520 PE=3 SV=1)

HSP 1 Score: 612 bits (1578), Expect = 3.76e-212
Identity = 330/407 (81.08%), Postives = 363/407 (89.19%), Query Frame = 0

Query: 362 MASISLQFTPFISPLQHHHRRRNLRLHRPKIPCLLPPNLPRLLAVRSVQRNNEDPSPLPP 421
           M  ISLQ TPFISPL H   R NL LHRP IP L PP   R L +RSVQ+NNE PSP PP
Sbjct: 1   MPPISLQLTPFISPLLH---RPNLCLHRPPIPRLSPP---RSLTIRSVQQNNEHPSPSPP 60

Query: 422 AKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLT 481
            KP+GLDDFLSTAASLYPLYVT GG+VAC++PSTFSWFV+RGP+SYSL+LGLIMLAMGLT
Sbjct: 61  PKPTGLDDFLSTAASLYPLYVTAGGIVACVEPSTFSWFVQRGPSSYSLSLGLIMLAMGLT 120

Query: 482 LELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIGKFFGLSTSLSVGLILLSCCPGGTA 541
           LE+KDL NLFMQRPLSI+FGCVAQYTIMPA+ A++GKF GLS SL  GLILL CCPGG+A
Sbjct: 121 LEIKDLFNLFMQRPLSIMFGCVAQYTIMPASAAVVGKFLGLSQSLLSGLILLGCCPGGSA 180

Query: 542 SNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQVVVAP 601
           SNVVTLIAQGDVPLSIVMTVCTTLGAVI TPFLTK L GAYIPVDAA+LSLSTLQVVVAP
Sbjct: 181 SNVVTLIAQGDVPLSIVMTVCTTLGAVIFTPFLTKFLVGAYIPVDAAQLSLSTLQVVVAP 240

Query: 602 ILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASLASDASP 661
           ILLGS LQKAFP LVKLV+PFAPLVAVLTSSLLA SVFSENV+R KSSMV+A+LASDAS 
Sbjct: 241 ILLGSCLQKAFPSLVKLVLPFAPLVAVLTSSLLASSVFSENVIRIKSSMVSATLASDASL 300

Query: 662 WMVIRSILSGELGTVILSVFCLHFAGFFVGYIAASIGGFRERERRAISIEVGMQNSSLGV 721
           W V++SILSGELG VILSVFCLHFAGFFVGYIAA+I GF+ERERR IS++VGMQNSSLGV
Sbjct: 301 WTVLQSILSGELGVVILSVFCLHFAGFFVGYIAAAICGFQERERRTISMQVGMQNSSLGV 360

Query: 722 VLASSHFSSAMVALPAAMSAVIMNIMGSTLGLCWRYIEPAADEVEAT 768
           VLA+SHFSSAMVALP A+SAVIMNIMGSTLG CW+YI+P+ DEV+ +
Sbjct: 361 VLATSHFSSAMVALPPAISAVIMNIMGSTLGFCWKYIQPS-DEVKTS 400

BLAST of Cp4.1LG14g00520 vs. TAIR 10
Match: AT2G26900.1 (Sodium Bile acid symporter family )

HSP 1 Score: 217.6 bits (553), Expect = 3.4e-56
Identity = 140/349 (40.11%), Postives = 201/349 (57.59%), Query Frame = 0

Query: 419 LPPAKPSGLDDF---LSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIM 478
           LP + P  L  +   +    +L+PL+V +G +V   KPS  +W        ++L LG +M
Sbjct: 83  LPESTPKELSQYEKIIELLTTLFPLWVILGTLVGIFKPSLVTWL---ETDLFTLGLGFLM 142

Query: 479 LAMGLTLELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIGKFFGLSTSLSVGLILLSC 538
           L+MGLTL  +D     ++ P ++  G +AQY I P  G LI     LS  L+ GLIL+SC
Sbjct: 143 LSMGLTLTFEDF-RRCLRNPWTVGVGFLAQYMIKPILGFLIAMTLKLSAPLATGLILVSC 202

Query: 539 CPGGTASNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTL 598
           CPGG ASNV T I++G+V LS++MT C+T+GA+I+TP LTKLLAG  +PVDAA L+LST 
Sbjct: 203 CPGGQASNVATYISKGNVALSVLMTTCSTIGAIIMTPLLTKLLAGQLVPVDAAGLALSTF 262

Query: 599 QVVVAPILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASL 658
           QVV+ P ++G    + FP+    +I   PL+ V+ ++LL                     
Sbjct: 263 QVVLVPTIIGVLANEFFPKFTSKIITVTPLIGVILTTLLC-------------------- 322

Query: 659 ASDASPWMVIRSILSGELGTVILSVFCLHFAGFFVGYIAASIGGFRERERRAISIEVGMQ 718
              ASP   +  +L  +   +IL V  LH A F +GY  +    F E   R ISIE GMQ
Sbjct: 323 ---ASPIGQVADVLKTQGAQLILPVALLHAAAFAIGYWISKF-SFGESTSRTISIECGMQ 382

Query: 719 NSSLGVVLASSHFSSAMVALPAAMSAVIMNIMGSTLGLCWRYIEPAADE 765
           +S+LG +LA  HF++ +VA+P+A+S V M + GS L + WR +   AD+
Sbjct: 383 SSALGFLLAQKHFTNPLVAVPSAVSVVCMALGGSGLAVFWRNLPIPADD 403

BLAST of Cp4.1LG14g00520 vs. TAIR 10
Match: AT1G78560.1 (Sodium Bile acid symporter family )

HSP 1 Score: 206.5 bits (524), Expect = 8.0e-53
Identity = 136/356 (38.20%), Postives = 199/356 (55.90%), Query Frame = 0

Query: 409 VQRNNEDPSPLPPAKPSGLDDFL----STAASLYPLYVTVGGVVACLKPSTFSWFVERGP 468
           V R     + LP  K     +++       ++ +P++V++G ++  ++PSTF+W     P
Sbjct: 68  VPRCGISSNDLPTEKKKSFGEWVEFVGEAVSTAFPIWVSLGCLLGLMRPSTFNWVT---P 127

Query: 469 TSYSLALGLIMLAMGLTLELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIGKFFGLST 528
               + L + ML MG+TL L DL    +  P  +  G + QY++MP +   + K   L  
Sbjct: 128 NWTIVGLTITMLGMGMTLTLDDLRGA-LSMPKELFAGFLLQYSVMPLSAFFVSKLLNLPP 187

Query: 529 SLSVGLILLSCCPGGTASNVVTLIAQGDVPLSIVMTVCTTLGAVILTPFLTKLLAGAYIP 588
             + GLIL+ CCPGGTASN+VT IA+G+V LS++MT  +T+ AVI+TP LT  LA  YI 
Sbjct: 188 HYAAGLILVGCCPGGTASNIVTYIARGNVALSVLMTAASTVSAVIMTPLLTAKLAKQYIT 247

Query: 589 VDAAKLSLSTLQVVVAPILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVV 648
           VDA  L +STLQVV+ P+L G++L + F +LVK V P  P +AV T ++L      +N  
Sbjct: 248 VDALGLLMSTLQVVLLPVLAGAFLNQYFKKLVKFVSPVMPPIAVGTVAILCGYAIGQNAS 307

Query: 649 RFKSSMVNASLASDASPWMVIRSILSGELGTVILSVFCLHFAGFFVGYIAASIGGFRERE 708
                                  ++SG+   V+L+   LH +GF  GY+ + I G     
Sbjct: 308 AI---------------------LMSGK--QVVLASCLLHISGFLFGYLFSRILGIDVAS 367

Query: 709 RRAISIEVGMQNSSLGVVLASSHFSSAMVALPAAMSAVIMNIMGSTLGLCWRYIEP 761
            R ISIEVGMQNS LGVVLA+ HF + + A+P A+S+V  +I+GS L   WR   P
Sbjct: 368 SRTISIEVGMQNSVLGVVLATQHFGNPLTAVPCAVSSVCHSILGSVLAGIWRRSAP 396

BLAST of Cp4.1LG14g00520 vs. TAIR 10
Match: AT4G22840.1 (Sodium Bile acid symporter family )

HSP 1 Score: 156.4 bits (394), Expect = 9.4e-38
Identity = 127/369 (34.42%), Postives = 184/369 (49.86%), Query Frame = 0

Query: 399 NLPRLLAVRSVQRNNEDP--SPLPPAKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTF 458
           NL R  A  +      DP   P    +   + D +  A S+ P  V    ++A + P +F
Sbjct: 60  NLWRRYASDNFSEMGLDPGADPFKVIEKPSIVDRMKKANSILPHVVLASTILALIYPPSF 119

Query: 459 SWFVERGPTSYSLALGLIMLAMGLTLELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALI 518
           +WF  R    +  ALG +M A+G+    KD +  F +RP +IL G V QY + P  G + 
Sbjct: 120 TWFTSR---YFVPALGFLMFAVGINSNEKDFLEAF-KRPKAILLGYVGQYLVKPVLGFIF 179

Query: 519 G----KFFGLSTSLSVGLILLSCCPGGTASNVVTLIAQGDV-PLSIVMTVCTTLGAVILT 578
           G      F L T +  G++L+SC  G   SN  T +    + PLSIVMT  +T  AV++T
Sbjct: 180 GLAAVSLFQLPTPIGAGIMLVSCVSGAQLSNYATFLTDPALAPLSIVMTSLSTATAVLVT 239

Query: 579 PFLTKLLAGAYIPVDAAKLSLSTLQVVVAPILLGSYLQKAFPQLVKLVIPFAPLVAVLTS 638
           P L+ LL G  +PVD   +  S LQVV+API  G  L K FP++   + PF P+++VL +
Sbjct: 240 PMLSLLLIGKKLPVDVKGMISSILQVVIAPIAAGLLLNKLFPKVSNAIRPFLPILSVLDT 299

Query: 639 SLLACSVFSENVVRFKSSMVNASLASDASPWMVIRSILSGELGTVILSVFCLHFAGFFVG 698
              AC              V A LA      + I S++S    T++L V   H + F  G
Sbjct: 300 ---AC-------------CVGAPLA------LNINSVMSPFGATILLLVTMFHLSAFLAG 359

Query: 699 YIAASIGGFR-----ERERRAISIEVGMQNSSLGVVLASSHFSSAMVALPAAMSAVIMNI 756
           Y       FR     +  +R +S E GMQ+S L + LA+  F   +V +P A+S V+M++
Sbjct: 360 YFLTG-SVFRNAPDAKAMQRTLSYETGMQSSLLALALATKFFQDPLVGIPPAISTVVMSL 401

BLAST of Cp4.1LG14g00520 vs. TAIR 10
Match: AT3G25410.1 (Sodium Bile acid symporter family )

HSP 1 Score: 145.6 bits (366), Expect = 1.7e-34
Identity = 116/323 (35.91%), Postives = 167/323 (51.70%), Query Frame = 0

Query: 435 ASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLALGLIMLAMGLTLELKDLVNLFMQR 494
           ++L P  V +  V A   P +F+W        Y+ ALG IML++G+ L + D   L  +R
Sbjct: 112 SALLPFVVALTAVAALSYPPSFTWV---SKDLYAPALGGIMLSIGIQLSVDDFA-LAFKR 171

Query: 495 PLSILFGCVAQYTIMPAAGALIGKFFGLSTSLSVGLILLSCCPGGTASNVVTLIAQGDVP 554
           P+ +  G VAQY + P  G L+   FG+  +   G IL  C  G   S+  + +++ DV 
Sbjct: 172 PVPLSVGFVAQYVLKPLLGVLVANAFGMPRTFYAGFILTCCVAGAQLSSYASSLSKADVA 231

Query: 555 LSIVMTVCTTLGAVILTPFLTKLLAGAYIPVDAAKLSLSTLQVVVAPILLGSYLQKAFPQ 614
           +SI++T  TT+ +VI TP L+ LL G+ +PVDA  +S S LQVV+ PI LG  L      
Sbjct: 232 MSILLTSSTTIASVIFTPLLSGLLIGSVVPVDAVAMSKSILQVVLVPITLGLVLNTYAKP 291

Query: 615 LVKLVIPFAPLVAVLTSSLLACSVFSENVVRFKSSMVNASLASDASPWMVIRSILSGELG 674
           +V L+ P  P VA++ +SL   S  S          +N S             ILS E  
Sbjct: 292 VVTLLQPVMPFVAMVCTSLCIGSPLS----------INRS------------QILSAEGL 351

Query: 675 TVILSVFCLHFAGFFVGYIAASIGGFRERER--RAISIEVGMQNSSLGVVLASSHFSSAM 734
            +I+ +   H   F +GY  + I G R+ E   R IS+  GMQ+S+L  +LAS    S+ 
Sbjct: 352 GLIVPIVTFHAVAFALGYWFSKIPGLRQEEEVSRTISLCTGMQSSTLAGLLASQFLGSSQ 407

Query: 735 VALPAAMSAVIMNIMGSTLGLCW 756
            A+PAA S V+M IMG  L   W
Sbjct: 412 -AVPAACSVVVMAIMGLCLASFW 407

BLAST of Cp4.1LG14g00520 vs. TAIR 10
Match: AT4G12030.2 (bile acid transporter 5 )

HSP 1 Score: 138.7 bits (348), Expect = 2.0e-32
Identity = 113/354 (31.92%), Postives = 171/354 (48.31%), Query Frame = 0

Query: 412 NNEDPSPLPPAKPSGLDDFLSTAASLYPLYVTVGGVVACLKPSTFSWFVERGPTSYSLAL 471
           +  D + L   K S + + L  A S  P  + +  ++A + P +F+WF    P  +   L
Sbjct: 76  SESDSNELYHKKVSSIMETLKQAYSFIPHGILLSTILALVYPPSFTWF---KPRYFVPGL 135

Query: 472 GLIMLAMGLTLELKDLVNLFMQRPLSILFGCVAQYTIMPAAGALIG----KFFGLSTSLS 531
           G +M A+G+    +D +   ++RP +I  G + QY I P  G + G      F L TS+ 
Sbjct: 136 GFMMFAVGINSNERDFLEA-LKRPDAIFAGYIGQYLIKPLLGYIFGVIAVSLFNLPTSIG 195

Query: 532 VGLILLSCCPGGTASNVVTLIAQGDV-PLSIVMTVCTTLGAVILTPFLTKLLAGAYIPVD 591
            G++L+SC  G   SN  T +    +  LSIVMT  +T  AV++TP L+ LL G  +PVD
Sbjct: 196 AGIMLVSCVSGAQLSNYTTFLTDPSLAALSIVMTSISTATAVLVTPMLSLLLIGKKLPVD 255

Query: 592 AAKLSLSTLQVVVAPILLGSYLQKAFPQLVKLVIPFAPLVAVLTSSLLACSVFSENVVRF 651
              +  S LQVV+ PI  G  L + FP+L   + PF P + V+                 
Sbjct: 256 VFGMISSILQVVITPIAAGLLLNRLFPRLSNAIKPFLPALTVID---------------- 315

Query: 652 KSSMVNASLASDASPWMVIRSILSGELGTVILSVFCLHFAGFFVGYIAASIGGFRERE-- 711
            S  + A LA      + I SILS    T++  V   H   F  GY        +  +  
Sbjct: 316 MSCCIGAPLA------LNIDSILSPFGATILFLVITFHLLAFVAGYFFTGFFFSKAPDVK 375

Query: 712 --RRAISIEVGMQNSSLGVVLASSHFSSAMVALPAAMSAVIMNIMGSTLGLCWR 757
             +R IS E GMQ+S L + LA+  F   +V +P A+S V+M++MG +L   W+
Sbjct: 376 ALQRTISYETGMQSSLLALALATKFFQDPLVGVPPAISTVVMSLMGVSLVTIWK 403

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q5VRB22.8e-5539.36Probable sodium/metabolite cotransporter BASS2, chloroplastic OS=Oryza sativa su... [more]
Q1EBV74.9e-5540.11Sodium/pyruvate cotransporter BASS2, chloroplastic OS=Arabidopsis thaliana OX=37... [more]
Q93YR21.1e-5138.20Probable sodium/metabolite cotransporter BASS1, chloroplastic OS=Arabidopsis tha... [more]
Q7XVB33.3e-5141.18Probable sodium/metabolite cotransporter BASS1, chloroplastic OS=Oryza sativa su... [more]
Q6K7392.0e-3734.90Probable sodium/metabolite cotransporter BASS3, chloroplastic OS=Oryza sativa su... [more]
Match NameE-valueIdentityDescription
XP_023551878.11.91e-276100.00probable sodium/metabolite cotransporter BASS2, chloroplastic [Cucurbita pepo su... [more]
XP_022929395.11.54e-27098.79probable sodium/metabolite cotransporter BASS2, chloroplastic [Cucurbita moschat... [more]
KAG6577371.15.87e-26898.30putative sodium/metabolite cotransporter BASS1, chloroplastic, partial [Cucurbit... [more]
XP_022985149.15.92e-26596.84probable sodium/metabolite cotransporter BASS2, chloroplastic [Cucurbita maxima][more]
KAG7027692.12.52e-26292.06putative sodium/metabolite cotransporter BASS1, chloroplastic [Cucurbita argyros... [more]
Match NameE-valueIdentityDescription
A0A6J1EUA37.45e-27198.79probable sodium/metabolite cotransporter BASS2, chloroplastic OS=Cucurbita mosch... [more]
A0A6J1JCH62.87e-26596.84probable sodium/metabolite cotransporter BASS2, chloroplastic OS=Cucurbita maxim... [more]
A0A7J7H4H31.17e-23058.55Uncharacterized protein OS=Camellia sinensis OX=4442 GN=HYC85_017339 PE=3 SV=1[more]
A0A0A0L3A11.70e-21381.82Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G106000 PE=3 SV=1[more]
A0A5D3BTX13.76e-21281.08Putative sodium/metabolite cotransporter BASS1 OS=Cucumis melo var. makuwa OX=11... [more]
Match NameE-valueIdentityDescription
AT2G26900.13.4e-5640.11Sodium Bile acid symporter family [more]
AT1G78560.18.0e-5338.20Sodium Bile acid symporter family [more]
AT4G22840.19.4e-3834.42Sodium Bile acid symporter family [more]
AT3G25410.11.7e-3435.91Sodium Bile acid symporter family [more]
AT4G12030.22.0e-3231.92bile acid transporter 5 [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002657Bile acid:sodium symporter/arsenical resistance protein Acr3PFAMPF01758SBFcoord: 469..643
e-value: 4.3E-36
score: 124.4
coord: 107..173
e-value: 9.3E-8
score: 32.0
IPR038770Sodium/solute symporter superfamilyGENE3D1.20.1530.20coord: 63..172
e-value: 5.1E-18
score: 67.1
coord: 424..757
e-value: 5.9E-88
score: 296.9
coord: 173..331
e-value: 1.7E-27
score: 98.3
IPR004710Bile acid:sodium symporterPANTHERPTHR10361SODIUM-BILE ACID COTRANSPORTERcoord: 387..762
coord: 66..173
IPR004710Bile acid:sodium symporterPANTHERPTHR10361SODIUM-BILE ACID COTRANSPORTERcoord: 172..286
coord: 281..337
NoneNo IPR availablePANTHERPTHR10361:SF58SODIUM/METABOLITE COTRANSPORTER BASS2, CHLOROPLASTIC-RELATEDcoord: 281..337
NoneNo IPR availablePANTHERPTHR10361:SF58SODIUM/METABOLITE COTRANSPORTER BASS2, CHLOROPLASTIC-RELATEDcoord: 172..286
coord: 387..762
coord: 66..173

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG14g00520.1Cp4.1LG14g00520.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0009941 chloroplast envelope
cellular_component GO:0016021 integral component of membrane
cellular_component GO:0016020 membrane