Tan0010904.2 (mRNA) Snake gourd v1

Overview
NameTan0010904.2
TypemRNA
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionABC transporter I family member 6
LocationLG02: 41990558 .. 42012327 (-)
Sequence length947
RNA-Seq ExpressionTan0010904.2
SyntenyTan0010904.2
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AATTACTCCCGTCTTCAAGTTCTGACGCATTTGGTGTTGGTGTACATCCCCACTTTCCGCAATGGCGACGTCAATGCCATACTGCACTTCATCTTCTACCACATCCACTTCTCCTCTGGTTCGATATGCACCATCTAATCCTACTTTATCCAAGAGGCCCATCCTTCTGCCCCTTCCCTTCCGTTCCAATCGCCGCCGCTCAATTCGGATTGCAACGGCTTCTCTCTCCGCCGTCAATTCTCCGTCTACGAGTTCCTCTGGCAGTCAGAAAAATCTTTTGTTAGAAGTCAAGGATTTGACTGCTGTAATTGCAGAATCCAAGCAGGAGATTCTCAAGGGTGTTAACCTTGTTGTCCGCGAAGGAGAGGTTGGTTCAAATTAATCTCTAGTATCGATTTATGAAGTGGGTTCTTACAAGGTTTTGAGATTTTCATTGAATTTTTTTTGTTCATTCTATGATTCGGAGTAATTCTTCCTGCCACTAGTAGGTTCATGCCATCATGGGAAAGAACGGTTCTGGAAAGAGCACATTTGCAAAGGTCAGGATTGTTTGTGTTAAGTTCCTTTATCCTTATAAAAAGGTTTATGTTGCTCCTTTGAAGTAATAAGCTTAGATGGTTGGTCTTGTTCTTTTGCTTCAATTCGCTTGGAATGGAAACTTTTGCAGTCTATTTTAGTTCTTTAATATAGTGCATGAAGCTCCTCAAATTTAGCTGGAAACGAGTACTTCTTAGAGTCTATAACTATCCTAGATTGAATTTTTGGTAGGGGAAAAAGGAAAAAGAGGGAATGAGGCAGCGTTTTTAGAGTCTCAGGGCGCAAGCCCTCTGGAGCCTAGGCACTAAAAGGCACTAAAAGGCGCAAACTTTTTTTCTAAGGCGCACTATTTATAAAAAATAAAAAATATATATGTATACATACACAATAAGAAAAAACCCTTCATGGAAAGAAATTAAGTTTTAACCAAGAAAAATAGGTAAAAGAAATAAGTTTCAAGTATTAGTCTTTGAGCATAGAGTCATTAAAAAATATGACACAATGCAACACTAATAAACAATACCAGTGTTTTCAAAGCGCAAAGCGCACCTCAGGGCGACAAGCCATTTGATCGCCCGAAGGCGAGAGGCGACAAAAAGGCGTGGCCCGAGTGATGCGAGGCGCAATAATTAAATATATAAATAAATAATATATACAACATAAATTGTATAGTTAATATCATATAAAATGAAAAAACTTTTAAAACTTAATACATCAATCAAATGTCTAAAGTCATCAAGAATGAAAATTATGAAATAATGATCCAGTGAACCAACAATAAAGGTAAAACTAGTCTAAAACTCTAAATCTAAAAGACTAAAATATCAATCAAATGTCATAATCATAATTTTCCTCCCCATCTAAATCTAACTCGTCATCTCCTGCTCCTCCTTGATCATTCTCCTCTTCAAGATCCAATTCCTCTACATTGTCATCATCTTCATCACTAACCGCCTGTATAGTAGGTGTTGGTATTCGAATTCTAGAAGATGAAACTGCAGCAGTTGACTTCTCTTTAGACCTAGTGTACTTTAGAGGTTCTTTAACACCACTAGCATCTGCCACATCTCCCCATGTGAGGTCGTCATCATCAAAAACCAACTCATTATCAGCTTCAAGTTCATTATCTTCTTCTTCAAGTGTTCCAACCAACTATTCATTACTCTCATCAATATGATTAAGAGAAATTGGATCAAGTTGATCCTTCAAATCGAAGCGTTCTTTAAGTGCTTGGTTGTATTTGATATAAACCAAATCATTTAAACGCTTCTGCTCTAACCTGTTTCTCTTCTTCGAATGAATCTACAAAATTAATAATATGAATATAGAGTTAAAAAGTAATGTTTAAATTTTAAATTAATAATGTAGAAGACAAATTAACCACTCACGTGTTCAAAAACACTCCAATTGCGTTCACAACCAGAAGCACTACATGTGAGATTTAGAACTCTTATGGCCAACCTTTGTAGAATTGGAGTACTACCCCCATAAAGAGCCCACCATGATGCTGTTCACAATCACATGTTTAAAATGGTTTAGGATTAACTCTTGGAATAACTTTAGTTTAATATCCAAACATTTAAAGAACGACATTACCTGGCGTCTTAATTTTCCTCATGTTGATAGCTAAATCGATGCCAAACATGCCTTTAGCTTCAGAATACAACTCCAATTCCACCGTACATTGGTTTTGGTCATTCTTTGATGGTATTAGTCGTTGCACAACTTGAAGTAGGCCACTCATAACTTCTACGTCTTGAATAATTCTATCTTTGTGATCATAAAAAAGAGATGGATTTAAATAATAACCGGCTGCATGTAAAGGTCGATGAAGTTGACAATCCCATCTTCTGTCAATGATATCCCAAATAGGTTTGTATTTTGCTTCATTATCATTAAACCCGCTATTGATAGCCTCCTTGGCTCGATCCATAGCCTCATATATGTACCCCATAGCTAGTTTGTCACCATCAACTAATCTAAGAACACGGTTTAGTGGCCCCAATGCCTTTAATGTATAGACAACTGAGTTCCAAAACAATGGCATCAAGATAGTGTCAGCTGCTCTCTTACCCTTTGGATCCTTTGCCCATTTAGAAGTTGTCCATTTCTCAGAGATGAACATCTTTCTCAAATTCACCTTATTTTTATGAATACTCGATAGTGTCAAGAATGATGTTGCAAATCGAGTGACGACGAGTCTTACCAACTCATGTTGATTGGTGAATTCTCTCATCATGTTTAGCACATACAGGTGGTTATAAATAAAACCACCAAGTGTCACGACCTTTTTAATACACCTTTTGATTTGATCCATCTTTCCAATATCTTCTAACATCAAATCTAAACAATGAGCAACACATGGTGTCCAATATAAATGTGGTCGTTTCGCTTCTAAATATTTACCAGCTAGGACATAATTTGAGGCATTATCAGTAACAACTTGGATGACATTTTCTTCTCCAATATTCTCTACCATTCCATCCAAGAGTTCAAATACTTTTTCACCAGTTTTGATGCATGAAGATGCTGTAAGAACTCCTCCTTCAACCTGTTAATAAATCCAAGCAACGAGAGAAAACAATACAGAAACACTGAAATAAAGAAACCTACTGCTTCCATTGAGAGTTCTTCAAGGAACAGGCAATTATCTTGGATTCAACAAAGGAAAATATCACGAATAACAACTGCACCATCCGAGAGGCCTACTCTCCTTTAGAGTGACAACTCTAATCCCCAAAATACTCAAAAAACTCCCCTCACCACTCCGCCCAAAACCCTACTTAAGCCCTTTTTCAGACTGTATAACCCCACACTTTTTAACTAACTCTCCTTTCCACTCACACTCTCCCACTTCCCTTTTACCCCTCCTCTGATAAGTATGCACTATTTGGGGTCTCACAATACTCCCCCGGTTCTGAGAAGCCTTGTCCTCAAGGCTGAATTTTGGATACATCTGTTTCATATAACTTGCTGCTTCCCACTTGGTGTCAAGGTCTCCTCCCTCCTCCCATTGCACGAGCCACTCTTCTTCGTTCGTTTCTTGGTTCCATCGCATCCCGAGGATGTCTTTAGGTGTGGCCTTCCACTGCAGTAACAACTCATGTGATCCGGAACTTGAGGTTGAACCATATCCTGACTTCCAATCATTTTCTTAAGTTGAGACACATGAAATACATTGTGGATGGTAGCCTCTGGTGGTAGTTCTAGCCGGTAAGCCACCTTGCCAATTCTCTCTAGAACCTTAAAGGGGCCAAAATATTTGGGTGCTAGTTTCTCACATCTGCGCCTGGCTAACGATTTCTGTCGGTAGGGTCTAATTTTGAGATAAACCAGATCATCCATGGCAAATTCTACCTCCCTTCTCTTCCTATCAGCATACTTCTTCATTCGGCCTTGGGCCATGCTCAAGTGTTCCTTCAAGGCAACAAGTGCTTCATCCCTCTCCTTCAGCTGTTGTTCCAATGCATCATTAGTCGTTTTTTTATCTCCATAGGATATCAGTGGAGGAGGTTGTCTCCCATACACCACATTGAAAGGTGTGGTATTAATGGAGATGTGGAACGTGGTATTGTACCAATATTCCGCCCACTGAATCCATTGTACCCATTTCTTAGGGTGCTCTCCACACGAAGCATCTCAAATAATTCTCTACACACCTATTGACTCGCCTGCATTGCCCGTCCGTTTGTGGATGGAAAGCGGTGCTTCGCTTGAGTTGTGTGTTTTGCATTCTAAACATCTCTTGCCAGAAGTGACTTACAAAGATTTTATCTCGGTCAGATACAATCGATCGGGGAAAACCGTGGAGCCTCACTATCTCCCTTATGAAGGTTGCGGCCACATCTTTTGTTGAGAAAGGGTGCTTCAGAGGAACAAAGTGCCCATATTTGCTTAAGCGATCAACTATTACGAATATAGCATCGGACCCCGTTGATTTTGGTAGCCCTTCTATGAAGTCCATTGTAATATCTTCCCACACTACAGGTCAGGATTGGCGCAACATGTTGAAGTAACCTGGCCGGAGTGATCGAATCGCCTTATTCCTCCGCAAATTTGGCACTTCTCACATAATTCTTGACGTCTTGTTTCATATTCTTCCAATAGAGCTCACCGATTAATCGCTTATAGGTTCTAAGGAACCCAGAGTGGCCGCCCATCACCGAATCATGATAAGTGTGGAGGATGGTCGGAATTAAGGAGGATGTCTTGGATAAAACCAAACGTCCTTTGTAAAGCAGCTGCCCCTGCTTCAACGTGTATCTATGGGTTTCTCCCGATTGTGACTTTATCCCTTGAACGATGTTCTGTAACTCTGAGTCTTGCTCCACTTCTCTCTGAACCACTGCCACATCGATCAAGTTTGGTACAGAAATTGAATTCAGCTCTCCTTCATGGGTTACCCGAGACAGAGCATCTGCGGCTTTGTTCAGCTGTCCAGGATGGTACCTGATCTCGAAATCATAGCCCAATAATTTAGTCAACCATCTCGATACTCCAATTTGAATTTCCCTTTGATCAATAAACTCCCTGAGGCTCTTTGATCGGTTAAGACCGTGAATCTCGCCCAACAAATAGTGTCTCCATTTCTCGAATGGCTAAAACAATCGCCATTAACTCTCTTTCGTGACGACGGCTTCGCTTCAGCTCTCATCTCTCATCGCGGCTTGACTGAAGTAGGCTATTGGTCTCTGTGCTTGAGATAAACCGCCCCAACTCTCGGTGCGAGAGGCATCGGGCTCGATCACAAATGTTTTGCTGAAATCGGGCAGAGCTAAAACAGGCAATGTTACCATCGCTTGTTTCAATAATTCGAAAGCCGCGGTTGCCTCTTCCTTCCACTGAAATGATTCTTTTTTAAGCAATGCCGTCAACGGGGCGGCCATGTGACCATAGTTCCTTACAAATCGCCGGTAGTACCCTGTAAGCCCTAAGAATCCCCTCAGTTCTTTTACATTCCTGGGAATTGGCCACTGCACCATGGCCTTCACCTTCTCTTCATCCGCTCCTACTCCACTCGAGGACACCCAATGGCCTAGATACGCCACTTTTCTTGAGCAAAAACATTTTTCTTTGATTGGCATACAAGGAGTTTTCCTCAGCGTTTTCAATACAATTTCCAAATGGGTAAGGTGGGTGGTTAAATCGTATTGCACACAGGATGTCGTCAAAAAAAACAAGTATAAACCTTCTAAGGAAGGGTCTGAATACCTGATTCATTAGAGATTGGAAGGTCGCTGGAGCGTTAGTTAACCCAAAGGGCATCACCATAAATTCATAATGTCCTTCATGAGTTCTGAAAGCGGTCTTTTCTACATCCTCATCTCTCATTCGTATCTGGTGATATCCTGACTTGAGGTCTAGTTTTGAGAAGATGGTTGAGCCATGGAGTTCATCTAGCAACTCATCTATCATCGAATCGGAAATTTATCCGGAATGGTTGCCTTATTTAGAGCCACAGAAGTCAACGCAGAAACGCCACCCTCCGCCCTTCTTTTTAACCAAAGAAAACGGGGCTAGAAAAGGACTCTTTACTGGCCTAATAATTCCAGAAGTGAGCATTTGCTCGATCAGCTTTTCAATCTCATTCTTCTGATGATGGGCATATCTGTACGGTCGAACATTCACAGGAGGGTGCCCATCCTTCATGACTATTTTGTGGTCCACTTCCCTTCTACCTGGGTAAGCCTTCTAGCATATCAAACACATCCTTATAGCCGTCTAATGTGTTTGCCACCTCGCGAGGGTAGTTCTTCCATTCTTCCCATTCCACGGTTACAGCTTCCACTCCTCGCTACTTCGCAACTTTCGACTCAGCTAAATTCAATCAAAAACCCTTGATCCGTCTCTTCCCATGATTTGGCGAGATTTTCAAAGACACCTCGGCTCGAGTCGGAGAAGGATCTCCTTTAATCGTTATTTCATCGCCTCCGAATTTGAGCGACATAGTCAGTCGACGATCCACGTTCATGGTTCCCATGGTTCGCAACCATCGCATCCCTAAGATGACATCAACGCCGCCCGCTTCTCTAAATGGCGAAAATCTTCGCATATAGATAGTTCCGGAAGGTGATAATTCTTGCCCGACATACCCCTAAATTCAACATCCGATGGCGTCGAGTCGCTGGCTCCTTCCGGTCGCGCCCTATCGGGTAGGGTTACCATCGGGTTGCCGAGGACCACTCTCGCTACTCGCTTTTCCGGCCCAACTCGCGCTCGGAACATGATTGGGCTTAGGCTTGTCCGTGGTGAACCCAACGTCTTCAAGGCCTTTCATTGCTAGTTCCTGTCTTCCATGAGGCATTCACGGCCGCCACGATCTTCTCAATATTCTCTACCATTCCATCCACTGTCGCCGCCGCTCTGCTTAATGCTTAGTTCAAATAAAGATTCTTTCTTCCCGTGACTTAAACTTCCTTCATTGTGTTCTAAACCATTCCACGATGTTCATGAAGATGCATCAATGGATTCTCCCAAAACCGGCATAATGTTCCACCTCCCCCAAGTCCTACATTTGGGTGGAAGCTTCGACCTCTGCTCCATTCCTCTCTGAACTAAAAAAGCCTTTTTTTGATCAAGAAACAATTTTTACATTGATGAATGAAATAAAGGAGAAAAATCTCCAATCACCAAAGGTGAGTTACGAACGACCATCCGACCTTCCAATTAACTCTAATCCCCAAAATACTCAAAAACTCCCCTCACCACCTCGAAAATTAAGCCCTTTTTCCGGACAATGCATAACCCCACACTTTTTAACTAACTCTCCTTTCCACTCACACTCTCCCCACTTCCCTTTTACCCTCCTCTGAACTATTTGACTATAAAAATAATGCGAACTAAAAATTTTTTTGAAGGATGATATTCCAATCACCAAAGGTGAGTTACACCAAAAATAAGAAAAGCATTAAGAAACCAAAGAAAAGCATTAAGAAAAACCGAATCCAAAAACGATCAAAAGTTGAAGAAACTCCCACAACACTCTCGACCATTCCGTTCAACCCATAAAGTCCAAAAGAAAGCACGAGTGAAGGCAAACCACAAAGTTTTCTTAAGGCCATGGAAAGGATGTCCCACAAAGATTGTAGATAGGAATGCAAAACCTGTGTTTGGAAAAGGTTGACACCAACCAAAAGCTTCTAACATGTAAGACCAAAATCTCGATGCAAAGGAGCAATGCACTAACAGATGACCAACAGTTTCCAAATTAGAGCAACACATAACGCACCAAGATGGTGAAAGAGACATAAAAGGCAATCTTCTCTGAAGTCTGTCGGCAGTATTGATTGCACCCAAACTCAACTCCCACAAGAAAATTTTAATCTTTTTTGGATAACGATCCTTCCAAATCACCGCGTACAGATCCTTTGAAAATTCATCAGAATCACCCAACAGAGAAACCATAAGCGAATGGACGGTAAAAGTGCCAGACGGATCAAAGGGCCAAGACCAAGTGTCTGGGATATTCTGTAGTCTAGCTGAAGACAAAAGTTGGGATAGAGTAACCCATTCCTCAATTTCTAAATCATTTAATCCGCGTCGAAGTTGCAAATCCCAGGCATCATTAACAACATTCCAAAGGTCTACCACCAAACTATCTGGATTAGTTGTGAGTCGAAATAAACGTGGGAAAACCAGAGCACAAGGTCCACAACTGAGCCAAGAATCTTTCCAAAAGAAAGTATGACAACCATTACCCAGTCACGAGTTGCACTAGTGAAACCAAATCGACAAGTGAGAGATATAACGCCAGGGTGATTTATGAGCACATGAAGTAATAACACGTGGCCAACCTACCCCATATAACGCACCATAATATTTGGCGACAATAAGTTTTCGCCAAAGAGCATCTTGTTCATGGACAAACCTCCATACCCATTTAGCAAGAAGAGCAGAATTTCGATGCTTAAAGTTACCAATACCAAGGCCCCCCATCATCTTTGGAAGTTGACCTGATTTCCAATTAACGTTGTGAGAACCACCATCTCCTCGAGAACCTTCCCAGAAGAAGTCGCAAATGATTTTATTCATGGTTTTTAGAACTGAAGAGGGAAGTTTGAATAGAGATAAATAATATGTCGGCAAACTAGCAAGGGTAGCCTGAATAAGCGTATGTCTACCACCTTTCGAAATAAAGGCATACTTCCAGTTTTGTAATTTCTGTTTCATCTTTTCAACTACTGGAATCCAGAAAGAGCATGATTACTAACTATTGTTTTAGTAAATTCTACCTCATTTCTCAAACATGTAACTCTAGCCTCATGATAAGACGGAGGCTTCAAACCGGGACCAAATTGACCAACTGCCTCTAACATTACATGAAAACTTCTTAAGTGAATGGTGTTTAACGGGATACCTGCTTGATAAATCCAACGCATGATGAATTGAATCGTAGTATCCCTTCCCTCTTTGTTATATTTTTCTATAATGCTAGATTGCTTCAACTTCTTCATTCTATTCCTATCATCAACAACTTTGGTTGGATCGGGATAAACAAATTTATCCATGGCTCCCTTTTGTCGTGGTGGCTTCAAGTTCAATTGTGATCGACTTCCACTAGATCCTTCTCCACTGCCAAGAGCTCGCTTTCTAGATGTTCCAGTACTTGAACTCGATCCTCCAATGACATCCATCTCATCTTCTTCATCAATTGCATTCAAATCATAATCCTCAAAGTTCGTTAGTGCATTACTTTCTTCTCGCTTTGCTTTTGTTTTTGTTTGATAATCCTTCAATTCCTCTGTTACATTAGTTGGACACTTTAAACATGCTTTGACATTTCGATTCCCACCAACTAAATGATGAGTTGCTCGACAAATTTCACCTTTTGTGATTTTTCCACAAAATGTACATTTGCAAGAATTTGTGTCCTTCTCATCCTCCAAAACAACGTATTTCCAAGTGGGATGCTTCCTTTTTGTCTCAACTTCTCCTTCTTGATTCCTAAAAGTTGATCTCACGGAAGGTTAGATACTCAACCAAGCTACAATGTGAATTGCACTGCTGAAAAAGAATAGAATGTTAGATACTCAACCGAGCTCCAATGTGAATCACACTGCTGAAAAAGAATAGAAGAAGGATAATTGAAGAAGAATGAAGGTTGCAGCAGTGTTGAAGAGAAGAAGAGGAAGTGAGGAATGCTGAATGGATGAAGGTTGGTAGTCGAAGAAGGCTGCTTTCTTGGTACTGCCTTACTGGTAGTCGATGAAGCAGGAAGTCGATGAAGGCTGCTTTCTTGGTACTGCTGTTTGTCGATTAGAGTTCAATAGTTTGTCGATTGGATTCTGTTTTCATTAAATAAATAAAGAAACAACAACTCTGCATGCATGTAGGTGGGCTTGTATATGGGCTAACTATTTAGGGTTACTTGAGAATTAAGTTAAGATTTTTTTTTTTTTTAAAAAAAGCTAAACCTGCGGAAAATTGCAGGCGATCGCCTGTGTCTGAGCCGCAGCCATCGCCGGGCGCAGCAAGGCGCAACTGAGGTGCACGCCTCATTGAAGGCCATCGCCTCGTGCGATGGTGAGGCGCATACCAGCGTAATGGCCTCGCCTTGCGCCTCAGGCGAGCGCCTGGATGCGCCTCGAAATCACTGAACAATACAAGTTCAAAAGTCTTAAATATCAATATCTTCTTAACTAAATATATATAATATAAGTAAAACGGTAAGTTTTACTTTTAAAACTGAAAAAAAATAATAAATTGAAGAAACCCAGAGGCGCGTGCTTGGTGTGCCTCTTGGGCTTCATGGAAGGCGCGCGCGATAAGAAGTGAGGCGCTAAACGTGGGCCTTGAGCCTAGGCGCGCCTAGGCGCACACCCAAATGGGCTTTTTAAAACACTGCACGCGCGCACACACAAACTTATTATCATAGTCATCAAAATGTCTTGCAACAGCCTCTCTGTTATTGTTATCTGCCTGGAGATTGAATTCTTTCGATTTATCTACTAAATCTAAGTGAAGCTCTGTGGATTAAGGTACTCGTTGGACACCCAGATTATGAGGTTACAGGTGGTAGTATTCTGTTTAAAGGGAGTGACTTGCTTGATATGGAACCAGAAGAAAGGTCACTTGCAGGGCTATTTATGAGTTTCCAGTCCCCGGTTGAGATTCCAGGTGTCAGCAATATTGACTTTCTAAATATGGCTTACAATGCACGAAGAAGGAAACTTGGTCTGGCAGAGCTGGGGCCAATTGAGGTATTCTTTTTCACTTGAAGCACCCTTTTCATGATTTATCCACTCTTGGTAGAATGTATACGTAGTTATGGAATGAGAATTCACATACAAATTTTTTGCATCTACTGAAATGTTGATTTTTTTCTTCCTGTTAAACAAACTTGCTTTAGATTAATGAAAGATTACACGATAATAGGCTCAGGATCCAACACAGAGAGTACAAGACAAAAAGAAAAATTGCTTTCATTAAGATTAATAAAAGTTTGCAAGATATGTTTGAATAACCATGTATTAGGCATCGTGCAATCAATTAGCCTGTTTTTGTGCGGTCTTCTGTTTGGGCATCAGTGTCTAGGAGCGAGCAATTAAACACCAACTTAATTCTGAATAAGGCTCCGTGAATAGGTTACCTTGACTATGCACATATTCAAATTTAAAGTACTCTCCCACCTTCCAAAATCTGAGTTGTGTTTCTCTTGAAAGCAACAGTGATTCATTTTAAGAAATAATCCTACTCAAGATATTACCCTCCACCCCCCTTGAAATAATCCTACTCAAGATAACCACCACCAAGAGAAAAAGGAAATGAGAGGGATGGTCACCTTGTCTTAGGCCTCTAGAAGCGAGAATCTTTCCCCTGGCTTTTTCGTTAACCAGGATAGAGTAATTCCAGTCCAACAATATAATACATTAAATACACATATACTAATTAACTTAAGAAGGAAAATAATCAGAAATGACCAACACTCCCCCTCAAGGTGGTTTAAAGATATCTTCCATTGTCAGCTTGTTAATCATCTTGCCAAACTGTAACTTAGGGAGTCCCATTGTTAATACATCAACAATTTGCTCAGTAGTAGGAAGGTAGGGAATGCAGATTACTCCGGAATCAATCTTCTCCTTTAAGAAATGTCGATCAGCCTCGATATGCTATGTCCTATCATGAAGGACTAGATTATGAGCCATGGAAATCACTGCCTTGTTTCACAATAAATTTGCATAGGTATCTTTTGTTAGGAGTTTCTTCATGGTGTTCTCTGTCCAAGCTCTTTCATGGGTTCTCTTCATCGATCATTAGTTTAAATTGGCATGCTTTTGTGGATCCTAGTGTGTAGCTAGAATTTCTTTTTGTCCTAGGGGTTTACACAGTGTAACTGGTTTTGGGAGGGTATTATGTTTTTTGAGCAATACACAGTTTTCCTTGTGGTCTTATTGTAATCTAAGCATTAGTCTCATTTCATTATTTCAATGAAATTTTTGTTACCTTTTTCAAAAAAGAACAATAACCAGAAGTAGATCTTCTATCTGTGGTATTGCTTGCCCAGTCAGCATCAGTATAAACTTCAACATGCAGATGAACATGCTTCTTGAACGATATTCCTTTTTGTGGGGTACCTTTTTAGGTATCTTAGAATTCTATAGACTGCTTCAAAGGGGGTCGGTCCTAGAGCATGCATAAACTGACTTACCATACTGACTGCAAATGCAATGTCAGGACGTGTGTGTGACAAGTAAATAAGTTTTCCCATAAGTGTTTGATATTTTTCTTTATCTTTTACCTCTTTTTTGGTTGCAATTTGTAGTTTTAAATTTGGTTCAATGAGGGTTTCTACTATTCTGCAACCTAGTAACCCCACGTAAGTCAATAATAGTTCCTTTGATTGACAAGGATACCCTCTTTCGACCTAGCAAATTCCATTCCTAAGAAATATTTCAGGACTCCTAGGTCTTTGATTTGAAATTCATTAGCTAGTTTTTTCTTCAAGGTAGTCAATCTTTCCTCGCCATTTCCTGTAAGGATAATGTCATCAACATAAATTATTAGAATAATAATCTTGTCATTTCTAGTATGTTTATAGAAAATAGTATGATATGCTTGACTTTGGAAGAATCCTTAGCTTGTGACTGCTTTGCCAAACCTTTCAAACCACGCGGTAGGAGATTGTTAAGGTCATATAAAGATTTCTTAATCATCTTCCTCAAAACCAAGGGTAAATTCATAAATACCTCTTCTTCAAGTTCCCTATTGAGAAAGACATTCTTGACATCAAGTTGATAAAGGGGCCAATCTACATGGACGCCCAAAGTTATGATTTGACAAACTCATTGACAAGTTGGCAATGAAAGATATCTTCAAACCAACTTGAGGGGGAGTGTTGGTTTATGTGATTGATACTTTCCTTATTTGTATTTGTTTTCTATAATTACGGCTCAAAATCTTTCCTTTTATACTTTAGAGTCTTCTCCTATTTGATAGGTCTCGTGTAACCTTAACCATTATGAGAAATAGACATGTTCTATTCAATTTTTATTTTTTTTTTATAGGAAACATAGGACAATTCATTGATCAAATGAAATTGGTACAAAATGCCTATCGAGGATAATTACAAAAAACCTATTCAAGATAATTACAAAAAAGAGGTCCAGTGTGTTAGAAGTGTAGATAGACCATAATTACAAAAGGGGGAAGACAAAATACTCCATGACATTCAAAAGAGTGTAGATATCACCCGGTCTTGTGAAATTTAACCAAATGATTGCATTAAGTAGGATCATCCCAAATAGGCGGAAGCACAAGTGTTGAATAGGTGTATAAGGGATTCACTATCAACTTGGCACATGATACAACAGCTTGGAGAGATAGATAACCATGGGGATCTTTTTTGCAGGTTGTCGTAGGTGTTGATACAAGAGTGACTAAGTTCCCAAATGAAAAATTTGACTTTTTTTTGGAGTGGAACTAGACCATAATTGATTGTAGAAAGAAGAATTTGCCGAACATCCCGAAGAAACTAACTTTCTGGTTAATGATTTGGAGGAGAACATCCCTTTTTTTTCTAGGTTCCACAACCATTTATCCTCCTGATCTTTAGGGAAAAAGGTTGTTAACTTATGGGAGAAGGATGCCCATTCTTCAATTTCTGCATCAATCAAATTCCTGCGCAAATGAAGGTTCCAGGAGCTTGCTTCTGTGTTCCATAGGTCTGCAATCCTAGCTCCCTTTTTTGTTGACAGATTGTATATGGAAGGGAAACGAAGTTGAAGAGGATGCCCCCCAATCAAGTGTCTGCCCAAAAGAAAGTACCATCACCCCTTCCAACTTTAACTTTGACGTTTTTGTAAATCAGATTCTGAAAATTATGAATAGATTTCCAAGGCCTTTTGCTTTCTTCAAAGTTGAAGCTCCTGATTTGTTGTTAAGGTGTATATTTCCATATTTTGCATTGATAACTCTTCTCCACAATGCGTCTTTCTCAATGTAAAACCGCCAATGCTATTTTGCTAAAAGGGAGATATTACGGGTGTGGAGGTCAATTAACCCTAGGCCTCCTTCCTCAATAGGTTGACTATGGACCATTCTAGAAGATGGCATCCTATTTTCCAACTCCTCTCCATAAAAAAGTCCGATACAACTTTTCTATAGATTTTGTAACCTTGGTAGGTTCAAGATAGATAGAAAAGTAATAGATTGGCAAATTTGAGAGAGTTGCTTGTATGAGAGTGTGCCTACCTCCTTTGGAGATATGAGAAGAACCCCAGTTGTGCAAGCGGTTGTTCATGTTGATGATTATGAATTGAAACTACGATGGGTTTTTTTTCTCCCTTCTCTCTCCTCGTTCGTCTCTCTCCTCCCGCATCTCCTCGTCCTTCACCACCGACCATGTCGTCGTTCTTTGGTGACCCTGTCTCCGACGTCTTCCTCCGTTAAACCCTTTCGTCCTTCGTTTCCTCGCCATTTTGTCTTTCCGGCTTCTCTGTTTTTTCCTTCTGTTCCGCCATCTTCTCGTTTTCCGTCTCCCCTCCCTCTTCGCTTCTTTCTTCGTCTCCTCTTCTCTCATTCGATCGTCTTCTACCATCCGACCCTTTCGTCCTCCGTCTGTTCTTCGTCTTCCTCTCTCGCACCATTTCCCTGATCCTCCCCGTCCTCCGTCAGTCTCCCCCCTGTTCGTTGAGACCCTGGGGTGGCTCTATTTCGACTTCCTCTTCTTCCTCCATGAAGTCGATTCATCTTCCTGATTTGGTTGCGTCTGTCCCTGGCCCTCTGGTGCTTTTTTCAAGCGTGAACGTTCTATTCGAAATTTTACCCTGTCCTAGCGGTCTAGTTGAATTAGTCTGAAATGGAGAGGGGTAGTTGTTGTATTGAAGGGATAGGCTTTCGTATTTGGAAGGAGGATTCGCGGGTGATTCAAGAAGGTTTCTGGGACTCACTGATCGGGTTCACCGCCATCCAGGTGAAATGGGTTTATGGGCTTTTTATCGAACTAACTGAGAATCCAAATTCTTTTTTCTTGAGAAGAAAATATAGGGATGAAATGCTACTCTGGGCTTGTTCAAGGTGAGAACGAAGAATGAATGGATTGCGGAGGTGGTGGTCTGGCCACCGTTGGGGGGTAGGAAAATGCTTAGAATTCCTTGGGGAGTTTTCTCCATGGGATGAACAGTTTTTAGAGATTTCTTGGATGATTTCATTCACTCGTTTGGTCAGTCGTTTTCTCCCTCAAGTGATATGGATTATGCTGAAAAAGGAAGGGATTTTGAGGAAAAAGGGATATCACCTAATGAACTTGTGCCATAAGGGGTATTGGATTCGCAAGGATGATGAAGTGGTTAAAGTTAAGTTCTCCAGAGTTTGGGTGGTTAATCGACTGTTTGTACATGTGCAGTGGGTTTCTATTAAAAATGCAATGGAGTCTTACTCCAAGCGGCGATTGCAAATCAACCCTTTCATGGCTGACAAGGCTATTGTTCTATTTGAGAAGGATTTTGAAAATATTGTGGAGGGAGAGCAGTTTCTGAATGACACACGATGGCAGGTTGTTGGGGGTGTTCATTTGAAATTGGAAAAATGGGATGACAACACACATAGTTTGCCCTCAGTGATAAAAATATACGGTGCCTGGATATGAATTAAGAATTTACCTTTAAGGCTCTGGAAGAAAAGAGTTTTTCAGGCGATCGGATGGCATTTTGGGGGGTTTGAAGACATTTCATTGGAAACGTTGAACTTTACGGATATATCAGTGGCAAAAATTAAGGTAACCACAAACCTTTGTGGTTTTTGGCCGGCGAACATCCCTGTTGAAAATCCCTCTTGTGGAAGGTTTCTTCTAAGCTTTGAAGCTCTAGAAGAATTGGCCCCTCCTAGTTGTTTGGACAGAGCCCTAGTGTTGAAAGATGTTATTAATCCAATTGATTTCACCAAGATTCTACAGGTATGGGAAGACGAATTTGTGAACATCTCCTCCCATGCCCTTAAACAGACTTTGGTTTATGTTGATGGTGCTGGAATTAGAGAGGAGTATTCGGTAGGAGTTGTTGTAGGGAGTGAGCAGAGGATTGGCCATGCCCTTTCAAATGTCTTTCTTCCATTGGGAAGCATCGAATCTAGCAATAACGATAATGAGGACCTTTCATTGGCCACGGTAGAGGAAACAGAGAAGTTAATTACTGGGTAGCATGTGGTTAGGAGGGATTGGAGTTTGGATAAGGGTGTCACGATTGGGTGGGTTGAGATTGACAACTTTGCACTGAATAAATGAGGTCAGTTAATTCAATCTGATGTGGTTATGGGTCTGGACAGGGGGAAGTAAATGGAGGGGCAGAGGAGTGAGATCTTTAATAATGGTGGTAGGGGGGTATCAAAGGTTTGTGATGCGTTGGAAAATTTAGAGACCCCCTCTGATCTTAATGGGGGTGCCTCTTGTAAATCACCAATTAACCCAAAAGCTTAAGCTGATAGGTTATGACAAATTTAATTATATCAACACCAACACTCCCCCTCACTTGTGGGCTTAAATTTGGAAATCAAACACTCCCCCTAACTTGTGGGCTTGGAAATTTGAACAAAGGCCCAACAAGTACAAATCAATATTAATTGGGAGAAAACGACTTCACCAGGGTTCGAACACAGGACATCCTGCTCTGATACCATGTAAATCACCAATTAACCCAAAAGTTTAAGCTGATAGGTTATGACAAATTTAATTATATCAACACCAACACCTCTAAGGAGACATTTTCGGGTAATTTGGAAGGGGAGAAGTCTCAATCATTACTTAATGATGGAATTGATTCGAGTTATGTTGAGGAGCCTCTTTTTTCCCATGGTGTTCTTTCTAGTGTTAAAAAAGTAAAGGACACTTTTTGTGGGAAGTATTACATAAGAAAACGTGGAGGTGTAACACCTAAGAGCTCAACAGTTCGGTTAGAGGAGGTCTACCTTGATTTTCTGGAACAAACAGTTTGCTTTGATCAACATTGTCCTATGGATTCCTTTTCTGAGGAAGGGGCTCATAATGCTCCGAACATCTCGGGGGTTGTAATTGAAAAATGCCAACTTGGCTCAAAAGGGGTTTCATTCTTTAAAACCTCATTTCAGCCCGGTCTAGACTCAGGTATTAACGTAGCCCCATCTTCCCCAAAGGGTAATCGAAGTTGACAATTGATTATGATTCCGAAATTAGTCTAAGCAGTTCGAATATCAGTTTGGTGAGAAGTGTTGGGAATAAAACTTTGAAGGTTTTTTCTTCGCTAATTAACCTGGCAGGGCTTTTTGAGTCTGTAGCCCATCAATCTCCCTTGGGGGGAAAAGGAAAACCTTGAAAAGATCGTGGTAGATTGCCCACGGTGGAGTTGTCTAGTTTCAGTGATCTAATCAAGGCAAGTGGACTACAATTCAAAGCAATTGTAGCAGGAGGTAATCCGGTAAATTTGCAATGAGTGTGTTACCGTGGAACACAAGAGGGTTGGGGGATGGTTCAAAGAGAGGTACTCTTAAGAGGTATCTTCTGAAGCTTTGTCCGAAAGTGCTTATGTTTCAAGAAACAAAATGGGGGATAATGGTTGATAATGTGGTTAAATCCTTATGGAGTTGATCTAGTTAGTTTTGTTTCAAAACAAAATCTTGAAGGAAGAGTTTAGCTCAGTTGAGTGAGATCTCCACGGCTTTTTGTTGTCTATTTCGTGGTGTTGGTATTTGAAGTAGTTGCAGGGTTGTGGGGTTTTTGAAGGTTTCACCCACTGTTTGTCTTAGTTATGGAAATTTTGGGAGTTGGTAGCTTTGTTCGGGTTTATGATCGGTTCCAAGATTAGCTAATGGTTGTTGGAGGTTAGGGTGGTGCATTTAGGGGTGGTCGTTTTGTGGGCTTCGATTTGGAGGGTTCTCCCTTGCTAGCATTATTGGCTTCTTTCCTTAAGCCTCTTTCGGCTTTTGGTCTTTGTTTTTAGTTAGTTCTCGTATCTTTATGGTTATTCTTCTCTTTCTTTTGGTTTGTTCTTGGAAGAGGTGTTGTTTTCTCTCTTGTTCAAGTTTTGTCTCATGTTGTTTTGGTTATTGTCTGTTCTTTTCTTTTTGTTGTATTTTGAGCATTAGTCTCATTTCATTTTATCAATGAAATTTCATTTGTTACCTTGTCAAAAAAAAAAGATAACCTTTCTCAAAATAGTATAGTGTCGTCTGCAAATTGAAGGTGAGTAATGGAGACTGCATTAACGTTCTTTCCCACATTAAAACCTTCAATTAATCCTCTGTGTTCGACTTGTAGGAGCATCCTACTAAGACAATCAACCACAATGATGAACAAGATAGGGAGAGAGGATCACCTTGTCTAAGGCCTCGAGAAGCCTTAAATTTGCCTCTCGATTTACCATTAATAATGATAGAGACATTGGTTGTTGAGATACATCTACAGATCCATTTCCTCCAAGTTTGTCCAAAGCCTTTGGCTATGAGAATGTTTTTCAAAAAATCCCAATCCACCATATCAAAGGCTTTTTCTATGTCGAGCTTTATGATCACTCCTCTTCGTTTCATCCGAAACCATTCATCGATCACCTCATTTGCAATAAGGGAGGCATCCAAGATTTGGCGGCTAGCAACAAATGTTGATTGTTGATCAGTGATCGTGTGGGGGAGGACCCTTTTTAACCTCTCAGAGAGAACACGAGCAATAATCTTGTATAGTCAGGTAGTTAGACTGATAGGTCTATAATCACTTACTGATTTCGCATTCACAATTTTCGGAATAAGACAGATTTAAGTTTTGTTGAGGCTATCATTTATAATCTCATTCTGAAAAAAGTCATCGAACACTCTTTTGATGTCATTTTTAAGGGTGTTTCATGACTTTCTAAAGAATTCAGCTGTGAAACCATCCGACCTCGGCGCTTTATTTGAGCCCAAATCTGTGACAGCCTTGTAGATATCTTCTTCTGTAAAAGGTACCTCCAAATTCTCTCTTTGGGTTGCTGTAATAGGGCTCCAATCATCCATTTGTGGGAGGATTCTTTGGCCTGGATTTTTTGGTATAAAGCTTCTGGTAAAAGTCAATAAAGGGGCCTTCTATTTTACGGTCAGAAACCACACTGTCTCCTTCAGCTGTGAGGACTTCAACAATTGTACTTTTCCTTCGGTTAGCAGCCATCAATCTAGGGAAAAAAGTTGTATTACCATCACCTTGTTGCAGCCAAATTGGTTTACAGCTTTGTCTCCACAACATTTCTTCTTTGGCAGCAATTGAGATCAGCTCCTTTTTAATATCAGATCTCTTTTGTAAATCCATGCCATTTAGTGGGGCAATCTCCTCCTTTCTGTCAATGAGGGATAGTTCTTTTTGGAGAATTGCTTTGAGCTCCTTTTGATGACCGAAGGTGTTATGGTTCCAGATTTTTAGATCCTATTTTAGGCCTTTTAATTTCTGAATGAAACCATGTCCAGCCCATCCACACAATGGGACATTCGACCACCATGATTAATTTCTTTTGTAAATACTCCTCTCCTTATTAATATCAGTTGGTGGATACTTTTTATATTCTCAATAGGCCTGGGTTGGTAGATGTTTTATCTCTCTCTCTCTCTTCGTACAAATGCATCATCTGTTTATCTAATACAAAGTTGGTTTCTATAAACTAAATAGTGTACCTATTTTTATGAGTTTCTTTTGTTCAATAGAAGCTATTCTATTTCCTAGGATGAAGCTGCACGTTTAAAGGCTTCTCAGTGGTGTTCTATGTCTAAGTCGTTCAGGGGGTTTTCTTCTCAAGACATTTTCTTGAACTGGATTGTTTTCCTTAACCCTTTGGTGGCTAGCTTCTTGTTTTGTTTGGTTATGATTTTCTTGTTTTCTTATCTTTTCACTGCTTATTTGCCTTTTCTTTTTCCTTTCATTCATTCTTCTATGGAGTTTGTATTTTGGAGCATTAGTCTCTTTTCATTATTTCAATGAAAAGCCTTGTCTCTTGTTTCAAAAAAAAAATATTGCACCCTTATTTCAACTTGGTTCGGAAGATTCCCGATATTTGGACCTGTGTTTGTTGTAATTTTATTGGTATCAATGCATTTTTGTCGCTGGTTGGAAGAATTAAAAAAGGGGCAATGCAACCTGAGTGATGTAGTAAGACGAAGACCTTTTGTTATTCAAAATTTCTATTCAGATTTGTTGAAATCAAGTATAAAGCACTTTTCATTCAGCTTCATTCTTTATTAGATTTTCTTGTTTCATGTTAATTACTCTACCTATTGTAATTATTTAGTTCATGAAATTTAGAGACTGTTAAAAATTTAAACTTTTGGCTATTTGAATCACTTCTTCATTATCTCTCCAGTTTTATGCATACATCTTCCCCAAACTTGATCTTGTGAACATGAAGACGGATTTTCTCAATCGAAATGTCAATGAAGGGTTTAGTGGTGGAGAAAGAAAGCGTAATGAGATCTTACAACTTGCGGTAAGGTTACTACTGTTTCCTACTTTCCTGTGCTGTCTCGTCATTTCTGTTTTTTATTTTGACAAGATTTAATGCTGTCTCTGTATACTCAATGACATACCATATGTGATTTGTTTATTTATTTATCATTATTATTTTGATTAGAAAACAAATTTTCGTTAAGATGAATAAAAGTAATAAAGAGAAGACCCCTGTTAGGCCTCCAATACCATGTGTGATCTGTCCTCCCGTCCTCTAACCTTTCTCCTTGCTTTCCTTGGCTTTCTTGTGCCATCTCCCTGATAGAGGGGCCTTGGCTGTCTTCTTTTCATTCTTTTTTTTTTTTTTTTTTTTTGGGGAGGGAGGGAGAGAGAGAAGCTGGATTCTGGACTTTGGATCCGTTTTGGGGCTTCCCTTGCCCCTCCGCCTCCCACTCTGCCTTCCTCTCTCTGGAAGATGAAAATATGTTTTGTCGCAGGATTTACAGGGGAGAGCCAACACTTTGGACCGTGTCTTGAGACATTTCGAGGATTTGTATCATCTGTTGTGGGGTTGCCCATTTGCCTAGTGCCTTTGGGGTCGTTGGTTGGACTCTTTTAACTTATCTTGGGTTTGGCATAGAATTTTTTTTTTGAGGTGATGAGGAGGTTCTCCTCGGTTCTCCTTTTCATGAGAAGGGGACTTTTTTGTGGCGTGCGAGTTTCTTTGCTTTGGTGTAGGGGGTTTGGCTTGAGAGGAATAGTATAATTTTTAGAGAGGTGGAGTGGTCTTGTGGCGAGGTTTGGGATTTGGTGAGGTTTAATGCCTCATGGGTTTGGGTCACTAGGCCTTTTTGTAAGCCTTTTTGTAATTATGATCTTGGTTTGATTTTATTGGATTGAAGTCCCTTTGTGTGGTTTGTGGGGCTTGTTTTCTGCATGCTCTTGTACTTCCTTTCATTTTTCTCAATGAAAACACGATTTCTAACTAAAATATATATATTTTGATGAATCACTAAAATTTATCCCATTTTCATTTGTGTTCAAACATTTATGAGGAATATAGATTTAGATCTTGGCTTAAAAATTAACCTCTGTTACTCCTCAAAAGGCTCACTAGATGAGAAATCATATATCATCATTACACGTGAAATTCACTTTTATACATTGTACTCTCCACCACAACAGTACACTCTTCAGCCAAGAAGAAGATGCATTTCAAGTGGTCCAGTTGCATTATAAAATGGATGAAAATTTGAGAACCTAGTAGCCCCTATATGTTGGTTCAATAAAGGGAGGAAGAAAGAACTTATGCTATAGGCTTGGTAAGGCTAATACTATACAACATTATTATTGTTTCTCAAATTGTTTG

mRNA sequence

AATTACTCCCGTCTTCAAGTTCTGACGCATTTGGTGTTGGTGTACATCCCCACTTTCCGCAATGGCGACGTCAATGCCATACTGCACTTCATCTTCTACCACATCCACTTCTCCTCTGGTTCGATATGCACCATCTAATCCTACTTTATCCAAGAGGCCCATCCTTCTGCCCCTTCCCTTCCGTTCCAATCGCCGCCGCTCAATTCGGATTGCAACGGCTTCTCTCTCCGCCGTCAATTCTCCGTCTACGAGTTCCTCTGGCAGTCAGAAAAATCTTTTGTTAGAAGTCAAGGATTTGACTGCTGTAATTGCAGAATCCAAGCAGGAGATTCTCAAGGGTGTTAACCTTGTTGTCCGCGAAGGAGAGGTTCATGCCATCATGGGAAAGAACGGTTCTGGAAAGAGCACATTTGCAAAGGTACTCGTTGGACACCCAGATTATGAGGTTACAGGTGGTAGTATTCTGTTTAAAGGGAGTGACTTGCTTGATATGGAACCAGAAGAAAGGTCACTTGCAGGGCTATTTATGAGTTTCCAGTCCCCGGTTGAGATTCCAGGTGTCAGCAATATTGACTTTCTAAATATGGCTTACAATGCACGAAGAAGGAAACTTGGTCTGGCAGAGCTGGGGCCAATTGAGTTTTATGCATACATCTTCCCCAAACTTGATCTTGTGAACATGAAGACGGATTTTCTCAATCGAAATGTCAATGAAGGGTTTAGTGGTGGAGAAAGAAAGCGTAATGAGATCTTACAACTTGCGTACACTCTTCAGCCAAGAAGAAGATGCATTTCAAGTGGTCCAGTTGCATTATAAAATGGATGAAAATTTGAGAACCTAGTAGCCCCTATATGTTGGTTCAATAAAGGGAGGAAGAAAGAACTTATGCTATAGGCTTGGTAAGGCTAATACTATACAACATTATTATTGTTTCTCAAATTGTTTG

Coding sequence (CDS)

ATGGCGACGTCAATGCCATACTGCACTTCATCTTCTACCACATCCACTTCTCCTCTGGTTCGATATGCACCATCTAATCCTACTTTATCCAAGAGGCCCATCCTTCTGCCCCTTCCCTTCCGTTCCAATCGCCGCCGCTCAATTCGGATTGCAACGGCTTCTCTCTCCGCCGTCAATTCTCCGTCTACGAGTTCCTCTGGCAGTCAGAAAAATCTTTTGTTAGAAGTCAAGGATTTGACTGCTGTAATTGCAGAATCCAAGCAGGAGATTCTCAAGGGTGTTAACCTTGTTGTCCGCGAAGGAGAGGTTCATGCCATCATGGGAAAGAACGGTTCTGGAAAGAGCACATTTGCAAAGGTACTCGTTGGACACCCAGATTATGAGGTTACAGGTGGTAGTATTCTGTTTAAAGGGAGTGACTTGCTTGATATGGAACCAGAAGAAAGGTCACTTGCAGGGCTATTTATGAGTTTCCAGTCCCCGGTTGAGATTCCAGGTGTCAGCAATATTGACTTTCTAAATATGGCTTACAATGCACGAAGAAGGAAACTTGGTCTGGCAGAGCTGGGGCCAATTGAGTTTTATGCATACATCTTCCCCAAACTTGATCTTGTGAACATGAAGACGGATTTTCTCAATCGAAATGTCAATGAAGGGTTTAGTGGTGGAGAAAGAAAGCGTAATGAGATCTTACAACTTGCGTACACTCTTCAGCCAAGAAGAAGATGCATTTCAAGTGGTCCAGTTGCATTATAA

Protein sequence

MATSMPYCTSSSTTSTSPLVRYAPSNPTLSKRPILLPLPFRSNRRRSIRIATASLSAVNSPSTSSSGSQKNLLLEVKDLTAVIAESKQEILKGVNLVVREGEVHAIMGKNGSGKSTFAKVLVGHPDYEVTGGSILFKGSDLLDMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNARRRKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLAYTLQPRRRCISSGPVAL
Homology
BLAST of Tan0010904.2 vs. ExPASy Swiss-Prot
Match: Q9CAF5 (ABC transporter I family member 6, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=ABCI6 PE=1 SV=1)

HSP 1 Score: 283.9 bits (725), Expect = 1.8e-75
Identity = 148/199 (74.37%), Postives = 171/199 (85.93%), Query Frame = 0

Query: 39  PFRSNRRRSIRIATASL-SAVNSPSTSSSGSQ--KNLLLEVKDLTAVIAESKQEILKGVN 98
           P     RRS+ ++ +S+ SAV+S S         +  LLEV+DL AVIAES+QEILKGVN
Sbjct: 54  PILRTTRRSVIVSASSVSSAVDSDSLVEDRDDVGRIPLLEVRDLRAVIAESRQEILKGVN 113

Query: 99  LVVREGEVHAIMGKNGSGKSTFAKVLVGHPDYEVTGGSILFKGSDLLDMEPEERSLAGLF 158
           LVV EGEVHA+MGKNGSGKSTF+KVLVGHPDYEVTGGSI+FKG +LLDMEPE+RSLAGLF
Sbjct: 114 LVVYEGEVHAVMGKNGSGKSTFSKVLVGHPDYEVTGGSIVFKGQNLLDMEPEDRSLAGLF 173

Query: 159 MSFQSPVEIPGVSNIDFLNMAYNARRRKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRN 218
           MSFQSPVEIPGVSN+DFLNMA+NAR+RKLG  EL PI+FY+++  KL++VNMKTDFLNRN
Sbjct: 174 MSFQSPVEIPGVSNMDFLNMAFNARKRKLGQPELDPIQFYSHLVSKLEVVNMKTDFLNRN 233

Query: 219 VNEGFSGGERKRNEILQLA 235
           VNEGFSGGERKRNEILQLA
Sbjct: 234 VNEGFSGGERKRNEILQLA 252

BLAST of Tan0010904.2 vs. ExPASy Swiss-Prot
Match: P48255 (Probable ATP-dependent transporter ycf16 OS=Cyanophora paradoxa OX=2762 GN=ycf16 PE=3 SV=1)

HSP 1 Score: 217.6 bits (553), Expect = 1.6e-55
Identity = 105/167 (62.87%), Postives = 133/167 (79.64%), Query Frame = 0

Query: 68  SQKNLLLEVKDLTAVIAESKQEILKGVNLVVREGEVHAIMGKNGSGKSTFAKVLVGHPDY 127
           ++K  +LEVK+L A +     EILKGVNL +  GE+HAIMG NGSGKSTF+K+L GHP Y
Sbjct: 3   TEKTKILEVKNLKAQV--DGTEILKGVNLTINSGEIHAIMGPNGSGKSTFSKILAGHPAY 62

Query: 128 EVTGGSILFKGSDLLDMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNARRRKLGLA 187
           +VTGG ILFK  +LL++EPEER+ AG+F++FQ P+EI GVSNIDFL +AYN RR++ GL 
Sbjct: 63  QVTGGEILFKNKNLLELEPEERARAGVFLAFQYPIEIAGVSNIDFLRLAYNNRRKEEGLT 122

Query: 188 ELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA 235
           EL P+ FY+ +  KL++V M   FLNRNVNEGFSGGE+KRNEILQ+A
Sbjct: 123 ELDPLTFYSIVKEKLNVVKMDPHFLNRNVNEGFSGGEKKRNEILQMA 167

BLAST of Tan0010904.2 vs. ExPASy Swiss-Prot
Match: Q55791 (Probable ATP-dependent transporter slr0075 OS=Synechocystis sp. (strain PCC 6803 / Kazusa) OX=1111708 GN=slr0075 PE=3 SV=1)

HSP 1 Score: 208.4 bits (529), Expect = 9.6e-53
Identity = 99/162 (61.11%), Postives = 129/162 (79.63%), Query Frame = 0

Query: 73  LLEVKDLTAVIAESKQEILKGVNLVVREGEVHAIMGKNGSGKSTFAKVLVGHPDYEVTGG 132
           +L +K+LTA +     +ILKGVNL ++ GEVHAIMG+NGSGKST +KV+ GHPDYE+TGG
Sbjct: 5   ILSIKNLTASV--DGNQILKGVNLEIKAGEVHAIMGRNGSGKSTLSKVITGHPDYEITGG 64

Query: 133 SILFKGSDLLDMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNARRRKLGLAELGPI 192
            I+++G DL  +EP ER+LAG+F++FQ P+EIPGVSN+DFL +AYNA+R+ LGL EL   
Sbjct: 65  EIIYQGQDLSALEPHERALAGIFLAFQYPLEIPGVSNLDFLRIAYNAKRKHLGLEELDTF 124

Query: 193 EFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA 235
           +F   I  KLD+V M   FL R++NEGFSGGE+KRNEILQ+A
Sbjct: 125 DFEDLIQEKLDVVKMNPAFLERSLNEGFSGGEKKRNEILQMA 164

BLAST of Tan0010904.2 vs. ExPASy Swiss-Prot
Match: Q1XDP6 (Probable ATP-dependent transporter ycf16 OS=Pyropia yezoensis OX=2788 GN=ycf16 PE=3 SV=1)

HSP 1 Score: 206.5 bits (524), Expect = 3.6e-52
Identity = 100/162 (61.73%), Postives = 126/162 (77.78%), Query Frame = 0

Query: 73  LLEVKDLTAVIAESKQEILKGVNLVVREGEVHAIMGKNGSGKSTFAKVLVGHPDYEVTGG 132
           +LE+KDL A + E+   ILKGVNL +R GE+HAIMG NGSGKST +KV+ GHP Y +  G
Sbjct: 5   ILEIKDLYASVGET--TILKGVNLSIRAGEIHAIMGPNGSGKSTLSKVIAGHPAYSLISG 64

Query: 133 SILFKGSDLLDMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNARRRKLGLAELGPI 192
            ILF G  +L++EP+ER+ AG+F++FQ PVEIPGVSN DFL +A NARR+  GL+E  P+
Sbjct: 65  DILFFGQSILEIEPDERAKAGIFLAFQYPVEIPGVSNSDFLRIALNARRKFQGLSEFSPL 124

Query: 193 EFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA 235
           EF+  I  K+DLV M+  FL RNVNEGFSGGE+KRNEILQ+A
Sbjct: 125 EFFQLITEKIDLVGMQESFLTRNVNEGFSGGEKKRNEILQMA 164

BLAST of Tan0010904.2 vs. ExPASy Swiss-Prot
Match: O78474 (Probable ATP-dependent transporter ycf16 OS=Guillardia theta OX=55529 GN=ycf16 PE=3 SV=1)

HSP 1 Score: 202.6 bits (514), Expect = 5.2e-51
Identity = 97/165 (58.79%), Postives = 128/165 (77.58%), Query Frame = 0

Query: 70  KNLLLEVKDLTAVIAESKQEILKGVNLVVREGEVHAIMGKNGSGKSTFAKVLVGHPDYEV 129
           K  +LEV +L A + E K  I+KG+NLVV  GE+HAIMGKNGSGKSTFAK++ GHPDY +
Sbjct: 2   KKKILEVTNLHAAVNEIK--IVKGLNLVVNAGEIHAIMGKNGSGKSTFAKIIAGHPDYTI 61

Query: 130 TGGSILFKGSDLLDMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNARRRKLGLAEL 189
           T G I ++ + +L++ PE+R+  G+F+SFQ P+EIPGV+N DFL +A NARR   GL E+
Sbjct: 62  TNGDITYQHTSILELTPEDRAKRGIFLSFQYPIEIPGVTNADFLRLACNARRIYQGLPEM 121

Query: 190 GPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA 235
            P+EF+ YI  KL LV++K  FL R+VNEGFSGGE+KRNEILQ++
Sbjct: 122 EPLEFFEYINSKLPLVDLKPSFLTRDVNEGFSGGEKKRNEILQMS 164

BLAST of Tan0010904.2 vs. NCBI nr
Match: XP_038880111.1 (ABC transporter I family member 6, chloroplastic [Benincasa hispida])

HSP 1 Score: 414.8 bits (1065), Expect = 5.1e-112
Identity = 218/234 (93.16%), Postives = 222/234 (94.87%), Query Frame = 0

Query: 1   MATSMPYCTSSSTTSTSPLVRYAPSNPTLSKRPILLPLPFRSNRRRSIRIATASLSAVNS 60
           MA SM YC+SSSTTSTSPLVRY PSN TLS  P   PLPF SNRRRS+RIATASLSAVNS
Sbjct: 9   MAMSMAYCSSSSTTSTSPLVRYLPSNSTLSMMPTFSPLPFHSNRRRSVRIATASLSAVNS 68

Query: 61  PSTSSSGSQKNLLLEVKDLTAVIAESKQEILKGVNLVVREGEVHAIMGKNGSGKSTFAKV 120
           PSTS SGSQK+LLLEVKDLTAVIAESKQEILKGVNLVV EGEVHAIMGKNGSGKSTFAKV
Sbjct: 69  PSTSFSGSQKSLLLEVKDLTAVIAESKQEILKGVNLVVYEGEVHAIMGKNGSGKSTFAKV 128

Query: 121 LVGHPDYEVTGGSILFKGSDLLDMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNAR 180
           LVGHPDYEVTGGSILFKGSDLL+MEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNAR
Sbjct: 129 LVGHPDYEVTGGSILFKGSDLLEMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNAR 188

Query: 181 RRKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA 235
           RRKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA
Sbjct: 189 RRKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA 242

BLAST of Tan0010904.2 vs. NCBI nr
Match: KAG6588809.1 (ABC transporter I family member 6, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] >KAG7022578.1 ABC transporter I family member 6, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 412.1 bits (1058), Expect = 3.3e-111
Identity = 215/234 (91.88%), Postives = 221/234 (94.44%), Query Frame = 0

Query: 1   MATSMPYCTSSSTTSTSPLVRYAPSNPTLSKRPILLPLPFRSNRRRSIRIATASLSAVNS 60
           MA SM YCTSSST +TSPLVRY+P N T+S R    PLPFRSNRRRSIRIATASL AVNS
Sbjct: 1   MAISMAYCTSSSTAATSPLVRYSPCNSTISTRLTFFPLPFRSNRRRSIRIATASLPAVNS 60

Query: 61  PSTSSSGSQKNLLLEVKDLTAVIAESKQEILKGVNLVVREGEVHAIMGKNGSGKSTFAKV 120
           PSTSSSGSQKNLLLEVKDLTAVIAES+QEILKGVNLVV EGEVHAIMGKNGSGKSTFAKV
Sbjct: 61  PSTSSSGSQKNLLLEVKDLTAVIAESRQEILKGVNLVVNEGEVHAIMGKNGSGKSTFAKV 120

Query: 121 LVGHPDYEVTGGSILFKGSDLLDMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNAR 180
           LVGHPDYE+TGGSI FKGSDLL+MEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNAR
Sbjct: 121 LVGHPDYEITGGSIQFKGSDLLEMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNAR 180

Query: 181 RRKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA 235
           RRKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA
Sbjct: 181 RRKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA 234

BLAST of Tan0010904.2 vs. NCBI nr
Match: XP_022927723.1 (ABC transporter I family member 6, chloroplastic [Cucurbita moschata])

HSP 1 Score: 410.6 bits (1054), Expect = 9.7e-111
Identity = 214/234 (91.45%), Postives = 221/234 (94.44%), Query Frame = 0

Query: 1   MATSMPYCTSSSTTSTSPLVRYAPSNPTLSKRPILLPLPFRSNRRRSIRIATASLSAVNS 60
           MA SM YCTSSST +TSPLVRY+P N T+S R    PLPFRSNRRRSIRIATASL AVNS
Sbjct: 1   MAISMAYCTSSSTAATSPLVRYSPCNSTISTRLTFFPLPFRSNRRRSIRIATASLPAVNS 60

Query: 61  PSTSSSGSQKNLLLEVKDLTAVIAESKQEILKGVNLVVREGEVHAIMGKNGSGKSTFAKV 120
           PSTSSSGSQKNLLLEVKDLTAVIAES+QEILKGVNLVV EGEVHAIMGKNGSGKSTFAKV
Sbjct: 61  PSTSSSGSQKNLLLEVKDLTAVIAESRQEILKGVNLVVNEGEVHAIMGKNGSGKSTFAKV 120

Query: 121 LVGHPDYEVTGGSILFKGSDLLDMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNAR 180
           LVGHPDYE+TGGSI FKGSDLL+MEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNAR
Sbjct: 121 LVGHPDYEITGGSIQFKGSDLLEMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNAR 180

Query: 181 RRKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA 235
           RR+LGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA
Sbjct: 181 RRQLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA 234

BLAST of Tan0010904.2 vs. NCBI nr
Match: XP_023531475.1 (ABC transporter I family member 6, chloroplastic [Cucurbita pepo subsp. pepo])

HSP 1 Score: 410.2 bits (1053), Expect = 1.3e-110
Identity = 214/234 (91.45%), Postives = 220/234 (94.02%), Query Frame = 0

Query: 1   MATSMPYCTSSSTTSTSPLVRYAPSNPTLSKRPILLPLPFRSNRRRSIRIATASLSAVNS 60
           MA SM YCTSSST +TSPL RY+P N TLS R    PLPFRSNRRRSIRIATASL AVNS
Sbjct: 1   MAMSMAYCTSSSTAATSPLARYSPCNSTLSTRLTFFPLPFRSNRRRSIRIATASLPAVNS 60

Query: 61  PSTSSSGSQKNLLLEVKDLTAVIAESKQEILKGVNLVVREGEVHAIMGKNGSGKSTFAKV 120
           PSTSSSGSQKNLLLEVKDLTAVIAES+QEILKGVNLVV EGEVHAIMGKNGSGKSTFAKV
Sbjct: 61  PSTSSSGSQKNLLLEVKDLTAVIAESRQEILKGVNLVVNEGEVHAIMGKNGSGKSTFAKV 120

Query: 121 LVGHPDYEVTGGSILFKGSDLLDMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNAR 180
           LVGHPDYE+TGGSI FKGSDLL+MEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNAR
Sbjct: 121 LVGHPDYEITGGSIQFKGSDLLEMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNAR 180

Query: 181 RRKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA 235
           RRKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGE+KRNEILQLA
Sbjct: 181 RRKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGEKKRNEILQLA 234

BLAST of Tan0010904.2 vs. NCBI nr
Match: XP_022989219.1 (ABC transporter I family member 6, chloroplastic [Cucurbita maxima])

HSP 1 Score: 409.1 bits (1050), Expect = 2.8e-110
Identity = 214/234 (91.45%), Postives = 220/234 (94.02%), Query Frame = 0

Query: 1   MATSMPYCTSSSTTSTSPLVRYAPSNPTLSKRPILLPLPFRSNRRRSIRIATASLSAVNS 60
           MA SM YCTSSST +TSPLVRY+P N TLS R    PLPFRSNRRR IRIATASL AVNS
Sbjct: 5   MAISMAYCTSSSTAATSPLVRYSPCNSTLSTRLTFFPLPFRSNRRRLIRIATASLPAVNS 64

Query: 61  PSTSSSGSQKNLLLEVKDLTAVIAESKQEILKGVNLVVREGEVHAIMGKNGSGKSTFAKV 120
           PSTSSSGSQKNLLLEVKDLTAVIAES++EILKGVNLVV EGEVHAIMGKNGSGKSTFAKV
Sbjct: 65  PSTSSSGSQKNLLLEVKDLTAVIAESRKEILKGVNLVVNEGEVHAIMGKNGSGKSTFAKV 124

Query: 121 LVGHPDYEVTGGSILFKGSDLLDMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNAR 180
           LVGHPDYE+TGGSI FKGSDLL+MEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNAR
Sbjct: 125 LVGHPDYEITGGSIQFKGSDLLEMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNAR 184

Query: 181 RRKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA 235
           RRKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA
Sbjct: 185 RRKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA 238

BLAST of Tan0010904.2 vs. ExPASy TrEMBL
Match: A0A6J1ELT6 (ABC transporter I family member 6, chloroplastic OS=Cucurbita moschata OX=3662 GN=LOC111434534 PE=4 SV=1)

HSP 1 Score: 410.6 bits (1054), Expect = 4.7e-111
Identity = 214/234 (91.45%), Postives = 221/234 (94.44%), Query Frame = 0

Query: 1   MATSMPYCTSSSTTSTSPLVRYAPSNPTLSKRPILLPLPFRSNRRRSIRIATASLSAVNS 60
           MA SM YCTSSST +TSPLVRY+P N T+S R    PLPFRSNRRRSIRIATASL AVNS
Sbjct: 1   MAISMAYCTSSSTAATSPLVRYSPCNSTISTRLTFFPLPFRSNRRRSIRIATASLPAVNS 60

Query: 61  PSTSSSGSQKNLLLEVKDLTAVIAESKQEILKGVNLVVREGEVHAIMGKNGSGKSTFAKV 120
           PSTSSSGSQKNLLLEVKDLTAVIAES+QEILKGVNLVV EGEVHAIMGKNGSGKSTFAKV
Sbjct: 61  PSTSSSGSQKNLLLEVKDLTAVIAESRQEILKGVNLVVNEGEVHAIMGKNGSGKSTFAKV 120

Query: 121 LVGHPDYEVTGGSILFKGSDLLDMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNAR 180
           LVGHPDYE+TGGSI FKGSDLL+MEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNAR
Sbjct: 121 LVGHPDYEITGGSIQFKGSDLLEMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNAR 180

Query: 181 RRKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA 235
           RR+LGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA
Sbjct: 181 RRQLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA 234

BLAST of Tan0010904.2 vs. ExPASy TrEMBL
Match: A0A6J1JNR4 (ABC transporter I family member 6, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111486361 PE=4 SV=1)

HSP 1 Score: 409.1 bits (1050), Expect = 1.4e-110
Identity = 214/234 (91.45%), Postives = 220/234 (94.02%), Query Frame = 0

Query: 1   MATSMPYCTSSSTTSTSPLVRYAPSNPTLSKRPILLPLPFRSNRRRSIRIATASLSAVNS 60
           MA SM YCTSSST +TSPLVRY+P N TLS R    PLPFRSNRRR IRIATASL AVNS
Sbjct: 5   MAISMAYCTSSSTAATSPLVRYSPCNSTLSTRLTFFPLPFRSNRRRLIRIATASLPAVNS 64

Query: 61  PSTSSSGSQKNLLLEVKDLTAVIAESKQEILKGVNLVVREGEVHAIMGKNGSGKSTFAKV 120
           PSTSSSGSQKNLLLEVKDLTAVIAES++EILKGVNLVV EGEVHAIMGKNGSGKSTFAKV
Sbjct: 65  PSTSSSGSQKNLLLEVKDLTAVIAESRKEILKGVNLVVNEGEVHAIMGKNGSGKSTFAKV 124

Query: 121 LVGHPDYEVTGGSILFKGSDLLDMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNAR 180
           LVGHPDYE+TGGSI FKGSDLL+MEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNAR
Sbjct: 125 LVGHPDYEITGGSIQFKGSDLLEMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNAR 184

Query: 181 RRKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA 235
           RRKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA
Sbjct: 185 RRKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA 238

BLAST of Tan0010904.2 vs. ExPASy TrEMBL
Match: A0A5D3CKJ6 (ABC transporter I family member 6 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold152G00350 PE=4 SV=1)

HSP 1 Score: 386.7 bits (992), Expect = 7.2e-104
Identity = 207/233 (88.84%), Postives = 213/233 (91.42%), Query Frame = 0

Query: 2   ATSMPYCTSSSTTSTSPLVRYAPSNPTLSKRPILLPLPFRSNRRRSIRIATASLSAVNSP 61
           A SM Y +SS TTSTSPLV Y PS  TLS       LP R NRRRS+RIATASLSAVNSP
Sbjct: 10  AMSMAYGSSSFTTSTSPLVPYLPSKSTLSMMTTFSHLPLRFNRRRSVRIATASLSAVNSP 69

Query: 62  STSSSGSQKNLLLEVKDLTAVIAESKQEILKGVNLVVREGEVHAIMGKNGSGKSTFAKVL 121
           STS S SQK+LLLEVKDLTAVIAE+KQEILKGVNLVV EGEVHAIMGKNGSGKSTFAKVL
Sbjct: 70  STSFSDSQKSLLLEVKDLTAVIAETKQEILKGVNLVVYEGEVHAIMGKNGSGKSTFAKVL 129

Query: 122 VGHPDYEVTGGSILFKGSDLLDMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNARR 181
           VGHPDYEVTGGSI+FKGSDLL+MEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNARR
Sbjct: 130 VGHPDYEVTGGSIVFKGSDLLEMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNARR 189

Query: 182 RKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA 235
           RKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA
Sbjct: 190 RKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA 242

BLAST of Tan0010904.2 vs. ExPASy TrEMBL
Match: A0A5A7UXL8 (ABC transporter I family member 6 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold274G005950 PE=4 SV=1)

HSP 1 Score: 385.6 bits (989), Expect = 1.6e-103
Identity = 206/233 (88.41%), Postives = 213/233 (91.42%), Query Frame = 0

Query: 2   ATSMPYCTSSSTTSTSPLVRYAPSNPTLSKRPILLPLPFRSNRRRSIRIATASLSAVNSP 61
           A SM Y +SS TTSTSPLV Y PS  TLS       LP R NRRRS+RIATASLSAVNSP
Sbjct: 10  AMSMAYGSSSFTTSTSPLVPYLPSKSTLSMMTTFSHLPLRFNRRRSVRIATASLSAVNSP 69

Query: 62  STSSSGSQKNLLLEVKDLTAVIAESKQEILKGVNLVVREGEVHAIMGKNGSGKSTFAKVL 121
           STS S SQK++LLEVKDLTAVIAE+KQEILKGVNLVV EGEVHAIMGKNGSGKSTFAKVL
Sbjct: 70  STSFSDSQKSVLLEVKDLTAVIAETKQEILKGVNLVVYEGEVHAIMGKNGSGKSTFAKVL 129

Query: 122 VGHPDYEVTGGSILFKGSDLLDMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNARR 181
           VGHPDYEVTGGSI+FKGSDLL+MEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNARR
Sbjct: 130 VGHPDYEVTGGSIVFKGSDLLEMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNARR 189

Query: 182 RKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA 235
           RKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA
Sbjct: 190 RKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA 242

BLAST of Tan0010904.2 vs. ExPASy TrEMBL
Match: A0A0A0LXA8 (ABC transporter domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G423200 PE=4 SV=1)

HSP 1 Score: 384.4 bits (986), Expect = 3.6e-103
Identity = 206/233 (88.41%), Postives = 213/233 (91.42%), Query Frame = 0

Query: 2   ATSMPYCTSSSTTSTSPLVRYAPSNPTLSKRPILLPLPFRSNRRRSIRIATASLSAVNSP 61
           A SM Y +SS TTST+ LV Y PSN TLS       LPFR N RRS+RIATASLSAVNSP
Sbjct: 10  AMSMAYGSSSFTTSTTSLVPYLPSNSTLSMMTTFSHLPFRFNLRRSVRIATASLSAVNSP 69

Query: 62  STSSSGSQKNLLLEVKDLTAVIAESKQEILKGVNLVVREGEVHAIMGKNGSGKSTFAKVL 121
           STS S SQK+LLLEVKDLTAVIAE+KQEILKGVNLVV EGEVHAIMGKNGSGKSTFAKVL
Sbjct: 70  STSFSDSQKSLLLEVKDLTAVIAETKQEILKGVNLVVYEGEVHAIMGKNGSGKSTFAKVL 129

Query: 122 VGHPDYEVTGGSILFKGSDLLDMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNARR 181
           VGHPDYEVTGGSI+FKGSDLL+MEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNARR
Sbjct: 130 VGHPDYEVTGGSIVFKGSDLLEMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNARR 189

Query: 182 RKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA 235
           RKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA
Sbjct: 190 RKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRNVNEGFSGGERKRNEILQLA 242

BLAST of Tan0010904.2 vs. TAIR 10
Match: AT3G10670.1 (non-intrinsic ABC protein 7 )

HSP 1 Score: 283.9 bits (725), Expect = 1.3e-76
Identity = 148/199 (74.37%), Postives = 171/199 (85.93%), Query Frame = 0

Query: 39  PFRSNRRRSIRIATASL-SAVNSPSTSSSGSQ--KNLLLEVKDLTAVIAESKQEILKGVN 98
           P     RRS+ ++ +S+ SAV+S S         +  LLEV+DL AVIAES+QEILKGVN
Sbjct: 54  PILRTTRRSVIVSASSVSSAVDSDSLVEDRDDVGRIPLLEVRDLRAVIAESRQEILKGVN 113

Query: 99  LVVREGEVHAIMGKNGSGKSTFAKVLVGHPDYEVTGGSILFKGSDLLDMEPEERSLAGLF 158
           LVV EGEVHA+MGKNGSGKSTF+KVLVGHPDYEVTGGSI+FKG +LLDMEPE+RSLAGLF
Sbjct: 114 LVVYEGEVHAVMGKNGSGKSTFSKVLVGHPDYEVTGGSIVFKGQNLLDMEPEDRSLAGLF 173

Query: 159 MSFQSPVEIPGVSNIDFLNMAYNARRRKLGLAELGPIEFYAYIFPKLDLVNMKTDFLNRN 218
           MSFQSPVEIPGVSN+DFLNMA+NAR+RKLG  EL PI+FY+++  KL++VNMKTDFLNRN
Sbjct: 174 MSFQSPVEIPGVSNMDFLNMAFNARKRKLGQPELDPIQFYSHLVSKLEVVNMKTDFLNRN 233

Query: 219 VNEGFSGGERKRNEILQLA 235
           VNEGFSGGERKRNEILQLA
Sbjct: 234 VNEGFSGGERKRNEILQLA 252

BLAST of Tan0010904.2 vs. TAIR 10
Match: AT5G06530.2 (ABC-2 type transporter family protein )

HSP 1 Score: 52.4 bits (124), Expect = 6.2e-07
Identity = 47/160 (29.38%), Postives = 72/160 (45.00%), Query Frame = 0

Query: 76  VKDLTAVIAESKQEILKGVNLVVREGEVHAIMGKNGSGKSTFAKVLVGHPDYEVTGGSIL 135
           +K LT+ +   ++EIL G++  V  GEV A+MG +GSGK+T   +L G      TGGS+ 
Sbjct: 168 IKKLTSSV---EKEILTGISGSVNPGEVLALMGPSGSGKTTLLSLLAGRISQSSTGGSVT 227

Query: 136 FKGSDLLDMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNARRRKLGLAELGPIEFY 195
           +      D    +   + +    Q  V  P ++  + L  A   R  K  L      +  
Sbjct: 228 YN-----DKPYSKYLKSKIGFVTQDDVLFPHLTVKETLTYAARLRLPKT-LTREQKKQRA 287

Query: 196 AYIFPKLDLVNMKTDFLNRNVNEGFSGGERKR----NEIL 232
             +  +L L   +   +      G SGGERKR    NEI+
Sbjct: 288 LDVIQELGLERCQDTMIGGAFVRGVSGGERKRVSIGNEII 318

BLAST of Tan0010904.2 vs. TAIR 10
Match: AT5G06530.1 (ABC-2 type transporter family protein )

HSP 1 Score: 52.4 bits (124), Expect = 6.2e-07
Identity = 47/160 (29.38%), Postives = 72/160 (45.00%), Query Frame = 0

Query: 76  VKDLTAVIAESKQEILKGVNLVVREGEVHAIMGKNGSGKSTFAKVLVGHPDYEVTGGSIL 135
           +K LT+ +   ++EIL G++  V  GEV A+MG +GSGK+T   +L G      TGGS+ 
Sbjct: 168 IKKLTSSV---EKEILTGISGSVNPGEVLALMGPSGSGKTTLLSLLAGRISQSSTGGSVT 227

Query: 136 FKGSDLLDMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNARRRKLGLAELGPIEFY 195
           +      D    +   + +    Q  V  P ++  + L  A   R  K  L      +  
Sbjct: 228 YN-----DKPYSKYLKSKIGFVTQDDVLFPHLTVKETLTYAARLRLPKT-LTREQKKQRA 287

Query: 196 AYIFPKLDLVNMKTDFLNRNVNEGFSGGERKR----NEIL 232
             +  +L L   +   +      G SGGERKR    NEI+
Sbjct: 288 LDVIQELGLERCQDTMIGGAFVRGVSGGERKRVSIGNEII 318

BLAST of Tan0010904.2 vs. TAIR 10
Match: AT5G06530.3 (ABC-2 type transporter family protein )

HSP 1 Score: 52.4 bits (124), Expect = 6.2e-07
Identity = 47/160 (29.38%), Postives = 72/160 (45.00%), Query Frame = 0

Query: 76  VKDLTAVIAESKQEILKGVNLVVREGEVHAIMGKNGSGKSTFAKVLVGHPDYEVTGGSIL 135
           +K LT+ +   ++EIL G++  V  GEV A+MG +GSGK+T   +L G      TGGS+ 
Sbjct: 168 IKKLTSSV---EKEILTGISGSVNPGEVLALMGPSGSGKTTLLSLLAGRISQSSTGGSVT 227

Query: 136 FKGSDLLDMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNARRRKLGLAELGPIEFY 195
           +      D    +   + +    Q  V  P ++  + L  A   R  K  L      +  
Sbjct: 228 YN-----DKPYSKYLKSKIGFVTQDDVLFPHLTVKETLTYAARLRLPKT-LTREQKKQRA 287

Query: 196 AYIFPKLDLVNMKTDFLNRNVNEGFSGGERKR----NEIL 232
             +  +L L   +   +      G SGGERKR    NEI+
Sbjct: 288 LDVIQELGLERCQDTMIGGAFVRGVSGGERKRVSIGNEII 318

BLAST of Tan0010904.2 vs. TAIR 10
Match: AT2G37010.1 (non-intrinsic ABC protein 12 )

HSP 1 Score: 50.1 bits (118), Expect = 3.1e-06
Identity = 41/152 (26.97%), Postives = 68/152 (44.74%), Query Frame = 0

Query: 77  KDLTAVIAESKQEILKGVNLVVREGEVHAIMGKNGSGKSTFAKVLVGHPDYEVTGGSILF 136
           KDLT  +    + IL+ V   +  G V A+MG +G+GK+TF   L G        G IL 
Sbjct: 487 KDLTLTLKGKHKHILRSVTGKIMPGRVSAVMGPSGAGKTTFLSALAGKATGCTRTGLILI 546

Query: 137 KGSDLLDMEPEERSLAGLFMSFQSPVEIPGVSNIDFLNMAYNAR-RRKLGLAELGPIEFY 196
            G +  D     + + G     Q  V + G   ++  N+ ++AR R    +++   +   
Sbjct: 547 NGRN--DSINSYKKITGFVP--QDDV-VHGNLTVE-ENLRFSARCRLSAYMSKADKVLII 606

Query: 197 AYIFPKLDLVNMKTDFLNRNVNEGFSGGERKR 228
             +   L L +++   +      G SGG+RKR
Sbjct: 607 ERVIESLGLQHVRDSLVGTIEKRGISGGQRKR 632

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9CAF51.8e-7574.37ABC transporter I family member 6, chloroplastic OS=Arabidopsis thaliana OX=3702... [more]
P482551.6e-5562.87Probable ATP-dependent transporter ycf16 OS=Cyanophora paradoxa OX=2762 GN=ycf16... [more]
Q557919.6e-5361.11Probable ATP-dependent transporter slr0075 OS=Synechocystis sp. (strain PCC 6803... [more]
Q1XDP63.6e-5261.73Probable ATP-dependent transporter ycf16 OS=Pyropia yezoensis OX=2788 GN=ycf16 P... [more]
O784745.2e-5158.79Probable ATP-dependent transporter ycf16 OS=Guillardia theta OX=55529 GN=ycf16 P... [more]
Match NameE-valueIdentityDescription
XP_038880111.15.1e-11293.16ABC transporter I family member 6, chloroplastic [Benincasa hispida][more]
KAG6588809.13.3e-11191.88ABC transporter I family member 6, chloroplastic, partial [Cucurbita argyrosperm... [more]
XP_022927723.19.7e-11191.45ABC transporter I family member 6, chloroplastic [Cucurbita moschata][more]
XP_023531475.11.3e-11091.45ABC transporter I family member 6, chloroplastic [Cucurbita pepo subsp. pepo][more]
XP_022989219.12.8e-11091.45ABC transporter I family member 6, chloroplastic [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
A0A6J1ELT64.7e-11191.45ABC transporter I family member 6, chloroplastic OS=Cucurbita moschata OX=3662 G... [more]
A0A6J1JNR41.4e-11091.45ABC transporter I family member 6, chloroplastic OS=Cucurbita maxima OX=3661 GN=... [more]
A0A5D3CKJ67.2e-10488.84ABC transporter I family member 6 OS=Cucumis melo var. makuwa OX=1194695 GN=E567... [more]
A0A5A7UXL81.6e-10388.41ABC transporter I family member 6 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C2... [more]
A0A0A0LXA83.6e-10388.41ABC transporter domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G42... [more]
Match NameE-valueIdentityDescription
AT3G10670.11.3e-7674.37non-intrinsic ABC protein 7 [more]
AT5G06530.26.2e-0729.38ABC-2 type transporter family protein [more]
AT5G06530.16.2e-0729.38ABC-2 type transporter family protein [more]
AT5G06530.36.2e-0729.38ABC-2 type transporter family protein [more]
AT2G37010.13.1e-0626.97non-intrinsic ABC protein 12 [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 69..89
NoneNo IPR availablePANTHERPTHR43204:SF1ABC TRANSPORTER I FAMILY MEMBER 6, CHLOROPLASTICcoord: 57..235
IPR003439ABC transporter-like, ATP-binding domainPFAMPF00005ABC_trancoord: 91..233
e-value: 5.3E-16
score: 59.4
IPR027417P-loop containing nucleoside triphosphate hydrolaseGENE3D3.40.50.300coord: 73..242
e-value: 1.6E-31
score: 111.7
IPR027417P-loop containing nucleoside triphosphate hydrolaseSUPERFAMILY52540P-loop containing nucleoside triphosphate hydrolasescoord: 73..233
IPR010230FeS cluster assembly SUF system, ATPase SufCPANTHERPTHR43204ABC TRANSPORTER I FAMILY MEMBER 6, CHLOROPLASTICcoord: 57..235

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Tan0010904Tan0010904gene


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Tan0010904.2-exonTan0010904.2-exon-LG02:41990558..41990741exon
Tan0010904.2-exonTan0010904.2-exon-LG02:41991887..41992009exon
Tan0010904.2-exonTan0010904.2-exon-LG02:42001592..42001813exon
Tan0010904.2-exonTan0010904.2-exon-LG02:42011789..42011839exon
Tan0010904.2-exonTan0010904.2-exon-LG02:42011961..42012327exon


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Tan0010904.2-three_prime_utrTan0010904.2-three_prime_utr-LG02:41990558..41990687three_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Tan0010904.2-cdsTan0010904.2-cds-LG02:41990688..41990741CDS
Tan0010904.2-cdsTan0010904.2-cds-LG02:41991887..41992009CDS
Tan0010904.2-cdsTan0010904.2-cds-LG02:42001592..42001813CDS
Tan0010904.2-cdsTan0010904.2-cds-LG02:42011789..42011839CDS
Tan0010904.2-cdsTan0010904.2-cds-LG02:42011961..42012266CDS


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Tan0010904.2-five_prime_utrTan0010904.2-five_prime_utr-LG02:42012267..42012327five_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Tan0010904.2Tan0010904.2-proteinpolypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009793 embryo development ending in seed dormancy
biological_process GO:0010027 thylakoid membrane organization
biological_process GO:0055085 transmembrane transport
cellular_component GO:0009507 chloroplast
molecular_function GO:0005524 ATP binding
molecular_function GO:0042626 ATPase-coupled transmembrane transporter activity