HG10019907 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10019907
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: pleiotropic drug resistance protein 3-like
Location: Chr04: 26739922 .. 26757074 (-)
RNA-Seq Expression: HG10019907
Synteny: HG10019907
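
The Location field above packs chromosome, start, end, and strand into one string. As a minimal sketch, it can be parsed and the genomic span computed as follows. The helper names (`Location`, `parse_location`) are hypothetical and not part of any database API, and the length calculation assumes the coordinates are 1-based and inclusive, as in GFF-style annotation; the page itself does not state this.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Parsed genomic location (hypothetical helper type)."""
    chrom: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings such as "Chr04: 26739922 .. 26757074 (-)".
_LOC_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text: str) -> Location:
    """Split a 'Chr: start .. end (strand)' string into its parts."""
    match = _LOC_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return Location(chrom, int(start), int(end), strand)


if __name__ == "__main__":
    loc = parse_location("Chr04: 26739922 .. 26757074 (-)")
    print(loc.chrom, loc.strand, loc.length)  # Chr04 - 17153
```

Under that assumption the gene spans 17,153 bp on the minus strand of Chr04, introns included.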
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTCAAATGGTTGGGAGTGCTGAGAGAGGAAGAAGCTCCTCGATCGCGGAGGAAGACAATGATGGCGATGTTGAAGATGCTTCACTTTGGGCGACGATCGAGAGATTGCCAACATTCGAACGGTTGAGATCGTCATTGTTCGACATTAACGATGAAGGAAAGGTGGAAGAGAAAGGAAGAAGAGTTGTAGATGTTACTAAGCTTAGGAATGAGGAGCGCCATATCTTTGTTCATAAACTCATAAAAAATGTTGAGAATGATAATCTTAAGCTCTTAACTAAAGTTAGAGACAGAATCCATAAGTAAGCTATATATCTTTTTCTCTCTATTCCTCATTTTTTTTAAGTATTTGGATTAATTAATTTATAAGTTAATCATCATTGTTTGATTATTAATGAAGGGTTAATTAATTGATTTAGGGTTGGTGAGAAATTTCCAAGTGTTGAAGTGAAGTATAAAAATGTGCATATTGAAGCAGAGTGTGAGGTTGTTCATGGGAAAGCCATTCCAACACTTTGGAATTCTCTTCAAAGCAAGCTTTATGTGAGTTATTTTTTTTCCCTTTCTTTTTGGAAAATGGAGAAGAAGAATTAGAGAATAAAATAAAATTATAGAATTGAGAATGAAAGTTTCTTTGATAAATATTTGATTTTTGGTTTTTAATTTTTGAAAATTATATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAATTTTCTTAGTTACTCATCTTTCTTAAATAAACATTTAGTTTTTTTTAGTTAAATTCTAAAAATAAAAAGTAAATTTTGAAAACTACAATTCGGTTTTGATAACCATTTGATTTTTAAAAGTTGTGCTTAATATTTCACACAATTTCTTGATCATGATTTTCATAGTTTATATGTATATATATAAATTCTTAACCAAATTCTAAAAACAAAAACAAAAACAAGTTTTTAAAAACTATTTTTTTTAAACCTCTTTTGGTAAAACTATTTGGCTTTTAGTTTTTGAAAATTAAGCTTACAAACACCCTTCCTACCCTAAATGTCTTGTTTTCTTATCTACTTTTTACCTGTATATTAAAAAACCAAGCTAAAATTTGAAAACTAAAAAAAGTAGTTTTTGTTTTGGGTTGTTTTCAAATATAGAAAAATGAGCAAACTATGTACTAATATAGAAAAATAAGATGCAAACTATTATAAGAAAATGTGAGGAAATATGCTTAATTTTCAAAAACCAAAAACCAAAAACCAAAAACCAAAAACCAAAAATGAAATGGTTACCAAATGGAGCATTAATTTTTAAATTTTTGTTTGGATTTTGAAGACACTCTTAAAATGTAGACAACAAACATATAAATCAAATATTAATATACTTAAATCTTGGTTACTTTTATTAAAAATAAAAATATTTTTATGTTCAGGGTAGGGATCAATTAATTAATAGGGTAAATTGTTTTAAATAACAAAACTGTCAAAAATATTTTCAAATATAACAAAATATCATTGTCTATTAGTGACAGACATTGACATTTTGCTATATTTATAAATAAATGGACTCATTTTGCTATATTTAAAAATGTAATAATCATTCTTTCTCATTTTTTCCATTTTAAGTATACCATTCTACCTATTTTCTCATGATCTCATTTTCAAGATTCAAATGTTTTTCTTTTTTAATAAAAATTTTGCTATTGATCATACCAAAAGAAACCCTCAAAATGTAGATATGGTTGACAAGAAAATTAACTAAAAAAGAAGTAAGAAAAAGAAAATTATAGATCTCAATATAAACACGAAAAGGAAAGTTTTTTCCCCTTTTAAGATTAGTGATATTTGAATATCAATTGAAACAAAAATAAATAGTTAAGACTCAAATATTAATATACTTAAATCTTGGTTACTTTTATTAATAGGAAATTTTCACAAATAGGAAAATTTTCAAACTATTTACAGAAAATAGCAAAAAAAACACTGATAGATATTGATAAACTTCTATCAACATCTATCAATGAT
AGACTTCTATCATTTCTATTACTGATAAACCATGATAGACTTCTATCAGCATTTATCACAACTATCTAAAAATTTTACTATTTTGTGTAAATATTTTTCCTTATTTTTCTATTTTTAAAAATCTCCCTTTATTAATTTATGTATACTAAATTCCAATGTTTTTGCTTTTTTTTTTTTGGGCTATTTAATTTTTTTTATAGGACATCATCAAATTTTGTGGTGTAAAGTCCCATAAAGCTAAAATTGACATAATTGAGGATGTGAGTGGTGTTATAAATCCAGGAAGGTAGCATGCACTTACTTTAATTTTTTGATTCTTTTTACACGATGGATATTAAATAAATTACAATTAATTTACTTTATAATCAATTTTTACATACTTTTTCACCTATGCACTTTAATTCTTTCATCATCTTTTTAGCCTAAACTTTTAAGTTTGAATCAGATAAGTTTTTTAATTTAATTTTTTTTAATAAATTCTTAAAGTTTTAATTTTGTGAATAATGTAGTTTTTTAAACTTTAAAAATGCCTAGTAAATTTCCAAACTCCCGATCTTGTGTCAAATAAGTTTTGTTAAAACTTTAAACTTGTGTTCTATGTTTTTTTTTTAATTAATTTTGATTTTTTAATTTTAAAAATGTTGAATATGTCATGAAATTTTTAGATATAAAATTGAAAGTTTCTATATATTAGATATTTTTAAATTTAAGAATTTATTAGATAAAATTAAAAGTTTAATCACCTATTATGTTTTAAAACAAATATAAAAATTCAGAGATCAAGAAGGTGTAATTTAGTATAGAATAGAAACTACATATATAATTTAATCTACCATCTAAAATTTAGGAATGTAATGAAAATTACAAAATTTATAATTAGATGGCATAGATTGACAAGTTTAGAATTCCACCCAATCAAATTAAAAGCTCAAAATTGTAGTGGATTATGTTGGAAAAATTCCCACACTTTTTCGTAAAAATAATAAAAAAAAATTAAAAGAAAACAATAAAATTGGAAACACAAGAATTAACGTGAGAAAACTCAAAATTGGAGAAATAACCTAAAAATTATTACAATCATACAAAATAATTATCTCTCCTCTCGATCCCAATTACAAGAGCACTCTCTCAAAGCTTTTATACTGTTTACACCCCTTTTTCCCACTCTCAAACTAGAGAATACAAAATGAAATTTTAACTAGAGTTAGCACACCTAAGCTTAAAATATTTTTGATTGGGACGATTACATTTGAAACCAAAGGTGTAGACCATCACTTTCTTGAAATTTTTCGATATAGGACAAATGCAATTCTAATATTTTGCCAAGACTCAGCAGACTAAACTAAATAAAAGAGTGCCAAAATGAAAAATTTTCAAATTTGGACAATATTTATTTAATTTCAGGTTAACTTTGCTACTTGGTCCTCCAGGGTGTGGGAAGACCACACTACTGAAGGCCCTCTCTGGAAATCTTAACAAATCTCTCAAGGTATTTTTTTTTTTTTTTTAAATTTAAACTTAAATATATAGAGACAAAGTTGATAATAATTAACCCTTTTTTTAATGTCTTTTTTTTGTTGATTTTATTTAATCTCTTTTTTCATCCAAAAGTTTAGTGGAGAAATTTGTTACAATGGACATAAACTTGAAGAATTTGTCCCCCAAAAAACTTCAGCATATGTTAGTCAGCATGACCTACATATTCCTCAAATAACTGTGAGGGAAACCCTAGATTTTTCGGCGAGGTGTCAAGGCATTGGAAGTCGAGCAGGTGAGCTTTCGTATCTAACTTTAAGTTAAATTATAAGTTTAGTGTATGAACTTTTTAATTTCTATTTATTTTATTCCTGAACTTTTAAAAGTATCTAATAAAGTTCCTGAACTTTCATATATTTTCAACTGTGTGTCTAGCAAATTTCTTACATATTTATAACTCTTAAGAAATTGAGAAAATAACAAAAAAAAAAAAAAAAAACACTGATAGACATTGATTGACTTCT
ATCATTTCTATCGCCGATAGACCTTAATAGACTTTTATCAGCGTCTATCACAACTATCTAAAAATTTTGCTATTTTGTATAAATAGTTTTTTTTATTTTTTTATTTTTAAAAATTTTCCTATTTTTTTTTTTTTTTTAAGGAAAAAGTCATGAGTTACCACTTTATGGCTAATTTAGAAAATTTAGAAAACAATAATTTTTTAGAAACAAGATATATATTACCAAAAATAATGTTATCTCCTAAAACATTATAAACAAGAAACATGAATTGAAAGAAAACATTCCAAATCTTATTCTCCAAATTTCAATAATGCTTATTTTAAAAAATATATTAATAACATTTTTTTTGGTGGTAATTTGTTCTTGCAATGAAGATATTATGAAAGAGGTTAATAAAAAGGAGAAGGAACAAGGGATCATTCCAAATCCTGATATTGATACTTACATGAAGGTAAAAAAAATAAATTAAAAATTAAGAATATTTTCCTTTTCATTGCCTATTATTACCATTTTTCTTTATTTAATTATCTTTTTATGTTATTTTAAACAGGCAATTTCTATTGAAGGATTAAAACAAAGTCTTCAAACTGACTACATACTCAAGGTCAATTCTATTCTTTTATTGAATAATATTGAAATTATTATTATAAGTTAAAAGACAAGATAAAGTGGACAAAAAAAAAAAAAAAATAGAAATCTAGCTTGAATTTGGTTGAATATAATGGATGTTGCTTGAATATAGAAGAAAGTAAATTAACATAAGAGGCCTATACACTAAGTTTAATGTGTCTATATAAAACATATTATTGTAGGGTAAAATTGGAAAAATAAAATTTAAGTTTAAAATCCTTCTAAATCCGATCGATTTAGGATGGAAATTTTTTCTTAGAAAATATGATTTCAAACCTAAATTAATACAAATTCTTTAATTTTTTTTAAAAATCTTTCTAAATCTATGGGCTCCAAACCGGGGATAGAAAAATTTAATCCCCTCTTCAATAAAATATTTAAGGCTCCCTTTAGTAACTATTTCGTTTTTAAGTTTTTTTATTTTTAAAAATTAAGTCTATTTTCTCACATTTCTTACAATGATTTGTATCTTTCTGGAATACAAGAGTTCAATTCTTAATCAAATTCTAAAAATAAAAACAAGTTTTTAAAACTACTTTTTTTTTTTTTTAGTTTTCAAATTTTGGTTTGATTTTTTAAAATATGGGTAAAAAGTAGATAAGAAAACAAGAAATTAAGGATGAAGGGGGTGTTTGCAAGTTTAATTTTCAAAAACTAAAAATAAAAAACCAAATAGTTACAAAAAAAAAGGGTCTAAATAATTCGGTTTGAACATTTCTAAACTCACACCTAAACACATGCATAAAGTCAATTAGCTAAATGATATATATAAAAAAAGATATTTTGATGCATTCCAAGCATTCATCTTTGTGCAACCTACTCAAGTATGAAAATACAATTTTTGTCGTGTGTTCAAATTTTTTTCTTCAAATAAAGATATTTTACAAATATTTTTTCTCCTTATAGTTGAATGAAATGGGGCTGCACCCCAAATATATTTCTATTGAAATATAGTCAAATTAAACATTCAAACATAGCCACAAAAATAAACAATTACAAAAATTTCCATATCGTCATTTATTTAAGTTTAGCATCCAACACATAACGGTTACTAATTGCACCATTTATTTAATTTATTGAGTTTAGCAGGGAGATTTTAAAAATAAAAGTTTAATTCAGGAAGATAATTGTCTTAAATGACAAAACTGGTGGAAATATTTTTAAATATAGCAAAATGTTACCCGTCTCAATATCTATCACTGATAAATATCGATATTTTAGTATCTTTATAAATAAGTTGACTTATTTTACTATATAATTTATCTATACTTTTTAATTCATAAAGTCAAGAAAAATTTACAATAATTTTTTTCGATGTTCTCAGATACTTGGATTAGATATTTGTGCTGACACCTTAGTAGGAGATGCCA
TGAGAAGAGGTATTTCAGGTGGTCAAAAGAAAAGATTAACTACAGGTAATGTACAAAATACATAGACCAATTATTTCCAAATTAAGAATAATTTTATTTTTTCAAAAAGTTCTTCAATTTTGAAAAATCGTAGGGTTATTGGTTACATCATTAAAAAAAAAAAAAAACCTTTTAGAAATAGAAAATGATATATATATATATATATGTATGTATATATATATATATATATGTATGTATATATATATATATATATTTGTTGTTGAGTTTGAGATGCACCCCCACATTTTACAAACTTTGAGATATGTATGCTTTTTATTTGATTAAATAACAAATTTTGTCTGTCCTTAAATTTTGAAATCCATTCTTATAGATTTTCAAATTTTTAATTTTGTGTCGTTTTGAAAAGCACCTACTAAATTCATAAACTTTCAATATTGTATTATTGTGTATTAATAAATTAATAAATTTTTAAATATTCAAACATTTCTTCCACAAAAGTGAAAACCACCCGAAAAGAAATGGTGAGAAAATCGAACATAAATTTCAAACATTTACAAAATTAAAAACAAAATATTTAGTATCAAACATAGCATTATATAGGGCGTTCGGCTCGACATAATCCAAGTTTACATGGTAGTACATCAATTAATAATTTTTAAAAAACGTGTATATATATCATGCCGCCGAATGCACCGCATCTGAGAGTGCGGGGTCATCGGTTCAATCCATGGTGGCCACCGACCTAGAATTTAATATCCTACAAATTTTCTAGGGTCAACAGGGTTGTCGTGTGAGATTAGTCGAGGTGCATGTAAGCTAGCCCGGACACTCACAGATATAAAAAAAAACTATACTAATCTACTATACACAAAATTGAAAGTTCAGGGATCTATTAAATATTTTTAAATTGACTATATTAGACACTTTTGAAAGTTCAAGAATTATCAAATAGACAAGCAAACCTCAAGGGGTGCTTAGGCCTCAGCTTCAAGTAGGTGGAGTGAGCTATTTAACCATGTTTGGGGTTCCAATTATAATAGTTGATGTTTCCAACTATTATAACCTACAATTAACTACAATATTATTATTTCAAATCTCTTGTTATAATGTTTACTATTTCCTACTCATCCTCTTTTCTTTCTTTGTTATCGTCTTTACTATTTTTTATTCAGACTAAAATAGCGTACACCCCAAATACAAAATATTATAACTTAGACTAAAATAATCTACACCTCAAACTTAAACTTTTATAATCTATAAACTATTATAATCTACTGACTATTATAATCAACTTAGCGCTCCAAGCACCTCCTCAATATTTAGAGGTTAAACTCAGTTTAACGTACTTCTTTTCTTCTTCCAATGAAATAAAATAAAAGGTAAATGATTGAACTGAAAAAAAAAAATAATGCAGGGGAAATGATGGTTGGTCCAAACAGAGCTTTGTTCATGGATGAAATAACAAATGGATTGGACAGTTCCACTGCCTTCCAAATTGTTTCTTGCCTTCAAAATCTAGCTCATCTTACAGATGCTACAATACTTGTCTCTCTTCTTCAACCAGATCTAGAAACCTTTCGACTTTTCGACGATTTAATCTTGATGGCGCAAAAAAAAGATCGTATATCAAGGTCGTCGTGATCGAGTTCTCGAATTCTTCGAGCATTGTGGATTCAAATGCCCCAAAAGGAAAAGCATTGCAGATTTTCTTCAAGAAGTGATTTCCAAAAAAGATCAACCACAGTTTTGGTATTGTAAGCAAACTCCTTATGCATATATTTCTATTGACACATTTAGTAGAAAGTTCAAGTGTTGGAATAATTTAGGAAGAAAGATGGAAGAGGAGACTTTGAAACCTTTTGATGAACAAGAAGAATATTCCAAAAATAATGGTGTTTTCTTGAATGGGAAGAATAGTGTTTCTAAATGGCAAGTGTCTAAAGCTTGTGCATCAAGGGAATTCCTCCTCATGAGAAGAAATTCATTTGTTTATGTCTTCA
AAACAAGCCAGGTTAGTGCTTTTAATTCCATTTATATAGTTCGAGTTTGATTTTATTTTAGTTTACGTACTTTCAAAATACTTAAATTGGGTTCTTGTGTATTTTTAATAAGTCTTAAAAATAGCTCACAATATATATACGGTTAGTTTATTGTTGATATTTGTCTAAAAACACTTCGTCTAATCATTTATGAGTTGGCCTAACGATTAATAAGTGTTGTGGGTATAATAAAACTACGAGCTTAGAAATAATAGTTGACCTACATAAATTTATTATTTAATTATGACTTTTCTTAGCATCAAAATATTGTTGAATTAAGTGGGTTGTTTTGTGAGACTAGTCGATATATGCATACAAGTTGGCCTAGATATTTAGTGGGAAAAAAAACACTCCTATCTTATATGAATTTTGGAAATATATTCACATATATTATTTTTTAATTGATCAATTCGATAAAAATTAACTTTGTAGGCCAGACTTAATTAAAATTTATAAAACAAATACCGAGACTAAATTTAGTACATGAACTAAAATGTTATGAAATTCAAATTAAATTATTAAGGGACGAAAATGACTTTTTTTTTTTTTTTACCTTGAATTTAGTATTGTTGGTTTAATAAACTTTCTGTTATTGTTTTTTCTATTGGTAATAATCATATTTTTTTTATGTTATTCGTAAAATTTCTAGTTAATCTTATCCTAATTTTTAAAATCTAAACTATAATTTGATTACTAGAAAAATGAATCCTTAAAACTTGTTTAAAACTACAAAATTGTTTAGTTAAAAATTATATACAAGAAAAACTAAATAAACAAAAGGTTAATATATATGTGTGTATATATACATATATAGTTTTTTCCCACTTAGTTTCTAAAACCTAAAAATTAAACCTAAAATTCTTGTACATAAAGGAAGCATTTTGTTTAATTTTAACTAGATCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACTAGTTAAGTATATATTTTTCCACAACATTTTAAAGTGTTTTTAATATATAGAGAGAGTTGTTTTCATAAATATAGAAAAATTAGCAAAACCATTTACAAAAGAAGAATTTTAAAAATAGAAAAATAAGGGAAACTATTTACACAAAATAACAAACTTTTTAGATAGTTGATAGGCGTTGATAGAAGTCTATCAAGGTCTATCAGTGATAGAAATCTATCACGGATAAAGTCTATCAATGTTTATCCGTGTTGTTTTTGCTACTTTTTTATAAATAGTTTGACATATTTTTTATCGGTGAAAATTTGCCTTTACAAATATAGAAAAACTTTGGCTATCAACGATAGACCACGAGACAGTGAAAGTTTTCTATATTTGTAAATAGTTTGTTATTTTTTCTGTTTACAGTCATATATATGTGTGTACAAAAGCATATTTTGTGGTAAAAAAATTGCTAATATTGTGTTTCTTTGAACAGCTATTCTTCATTGCTTCAATCACCATGACAGTGTTTATAAGAACTGAAATGAAAATAGATCTTCAACATGGAAATTACTACATGGGAGCCTTATTTTATTCTCTCATTATATTACTTGTTGATGCATTACCAGAATTGTCATTGACAATTCAAAGACTTCAAGTTGTCTACAAACAGAAAGAATTACTATTTTATCCAGCTTGGGCTTATGTAATTCCAGCTGCCATTTTGAAGCTCCCTCTTTCACTTGTTCAATCCTTAGTATGGACTTCTCTTACTTATTATGTCATCGGTTACACCCCGGAGGTCTCCAGGTTAGTTTTTTCACCCTTCAACTTCCTCTCGTTATGGGTAATTATTTAGACTCCGTTTGATAACTGTTTCTTTCTCGTTAATTGAAATTTATGCATGGTTCATTTTCATATTTCTTAAAGATCAAGTATGGTTTTTCAAGTACTCTTAGTCGAGTTCTGAAAACAAAAACTACTTTTTTTTTGTTAAATTACAAATTTAGTCCCTACGGTTTGGAGAAAGT
AATTACAATTTAGTCCATATTATTTATAATTAGAATTTAGTCCTTATGATTTGAAATTTAGAATTTAGTCCTTAAACTCACACGTCTTTAAACTTTTCATGATACAAGATAAAACAATTATATTGATATGTTTGCTCACACTTAATAATAGAATTAGAAACTTTACTAAATTTAACCTAATAATAAAGTAAGTGTACTTAAAAAAAAAAGTAAGACATTATTAATTTTTTGTTCTTCTATTTTTATATAATAAATAGTCAACATTTAATTTCAGAAGCCCTATATGTTTGACTTTCAGCCTTCTGGTTTGGTGTTAAGTTTTTTTTTTTCTCGAGCTTAAGATTAATAGCACAAATTTTTGTCCGTTGATTAGTTATGCTCATATGACAATTTGGTATTAAGTTTAACACTTTGATTCATGACAATAAATAAATAAATAAAGTTAACATATTGATATTTATTTCTTAGGTTCTTTCGACATTTTCTTGTGTTATTTGCGGTGCACGTATCTTCATTATCGATGTTTCGAATGATAGCTTTAGTGAGTCAGCACGTTGTTGCTCTTACACTCAGCAGTTTTATAATATTACAAATCATGATATTTGGTGGTTTCATCATCACTCATCGTAAGTTTCTCTAAATCACTTCATTCAACTCATCTTACATAAATTATCACAAGAGACAACCCTTATACAAGTAATGTTCTGTTTGATAACAATTTTAATTTTTAAGATTATATATGCTTGTATCCTCTGCATTTAAAAACTATATATATACTTTTCAAAACTTGGTTTGAATTTTGAAAATATTGGTAGAAAGTAAAAAAAAAAAAAAACTCATTGATGAAAGTGGTGTTTTATAAGCTTAGACTCTATTTGATAACGATTTTATTTTTAATTTTCTATTTTCAAAACTTGTTCTTGTTTTCTCTCAATTTCTATATTATAATTTTTATCTATCTTAAAAATACTAGAATTCTTAGTTAGATTCTAAAAAGAAAATTAAACGAGTTTTTAAAAATTAAAAGAACTTAACTTTGATATTTGAAAACACGAGTTAAAATAAATAACAAATCATAGAAACTTATAAGTGGAAGTAATGTTTGGAAGCTTAAATTTTAAAACTAAAAGAAAAATCAAAAAATCATTAATAGCACCTTAATTTTCGAAATTTGAAAACCAAAATCAAATGCCAACTATCAAGTATGGTCTAAATTAAACTATCCATATTAATTACAAAAAAATAACACAACGAGAAGTTTTATTAAATTTAGAGCTTCCTACCATTTAAAAATATTAGGACAAATTGCAAAAACCACCGCAGAAGTATGATGGTGGTTGTAATTAGATCTCAAACTTTTAATGATAAAAATTGGATTCGCAAACTTCTACAAATGTTAAAATAGACTCTCAAATTTATAATATTTGTAGAAATTACCCTCAAATTATAAAAATTGAACCCTCAAACTTACACAATTGTTACAATTTCTACCATTATGTAAGTTTGGAGGTTCAATTTTAATGCTTGTCTAAGTTTGAAGGTATTAATTACAACTACCACCATATTTTTAGGGGTGATTTTTACAATTTACCTAAAATATTATTGTTAAATCACCACTAAACTTACACATTTAAGTTAATAATCGTATTTATTATTATCACGGTAATTATTTTAAATAGAAAACTGCTGAAAATATTTTCAAATTAAATATAGCAAAAAATGTCATCGTCTATCCATGATAGATAATAATAGACTCCTATCATCTATCACGATCTATTTATGAAGGACACCTATCATTGATAGACAATGAAAATTTGCTATATTTATAAATAATTTGACTCATTTTGTTATATTTGAAAACAACTCTCATTATTATTCCTACTATTATTATTATTATTTTGGAATTTTGCAGCCTCAATGTCACCTTGGTTGCGATGGGGATTTTGGCTTTCACCAATTAGTTATGGAGAAATTGGCCTTTCTATCAATGAATTTCTTGC
TCCAAGATGGCAAAAGGTGTGCCATTCTATTCATATAATATATAAGAAAAGGCTCTTACCATTATTTCTTAATCTTATAGTTAAAAAAAGAGTTTATATTAGGTTTATTTACAACATAATTTAGGAAGAAATATATATTCAGCCTAGTGAATCAAATTTAACATTAATTTTTTTTTGGGATTAAAACTCTAAATTTTGAATAAATAGTCACATTTCTATAAACTTTAAATAAATAGAAACATCACCACTAGAAACTAAATGGAAAAAAAAAAATTAGGTTATTTGCATTTTCTTCCTTCTCATTTCTGATTTTTTTTCATTTTCTTTTAAAAAATTTGTTTGTTTTTTTCTTTTTTTTCTTTTCATTTTGTTTTAATGAACCATTCTTCCTTTTCATAAGAAAATTAACCAACATTTTTTTTTATTTTTATTCAATTCTGATCGAGATTCTATTGAGGAAGTCTGAAAAATTGGTAATTGAATGTGATATTAAAAGAACGTTAAAGTATTTATTTGAGTATCGATATTGATCTTGACCGAGATGGACTGAGACGAGTCATTATAATAATTAAAAAAATGCTATTTGAATTTTTATTTTATTTCTCTCATTATTATTATTATTTGCTATTTCTAACGATGAGCCGAGTTGTTTGGTTGTGTAGATGCAAGGTACGAACAGTACGATAGGGCATGTAATTCTTCAAAGTCGAGGACTTGATTATCATCAATACTTCTACTGGATTTCACTCGCAGCTTTGTTTGGATTTGCTATCATTTTCAATGTTGGATTCGCCTTCGCTCTTACTTTCCTCAACGGCAAGTTCCATATTTTTTTTTTCCTTCTTTCAAGTTCTAATTTGAAGTTCTTACTGCAAAGATTCATCAACATTTTTCAATTTCATTCTACATTCAGCTCCTGGATCTTCCCCTGCTATTATATCATATGAAAAGCTCTCACAATCAAACTGCAATGGCGGTGCTAATTCTGTCCAAAACCCTCCTTCTTCTCCTAAGACCTCCATAGAATCGACCAAAGGTAACATAATCTGAATTCTGCAATTGTTATCGTTATATACATTTATTGATGGGTTTTCCTTTTGATTCTTTTGCATGATCTTGTTGCTTCAAAAAATCAGGCGGAATAGCATTGCCTTTCACACCTTTAACAGTAGTGTTTCAACATTTGCAGTATTATGTGGACATGCCATCGGTAATAGTCTCAATACTCTTTAACTATTATTACATTAGCTTGTTAACTTTTTTTTTTTTTTTTTTTTTTAAGACTTGTGATTAGTTGTTTTACTCAATTTTTGTTCTGGAAAGTGTGGCGAATTGTTTTGAGAATAGTTTCTCAGATAAGCATAAGCTGCATTTTTATGGTTGAAGCACTGAAGGCCATTGTTATCACAGTTAACTTTTTCTTTGGATCCCAGGGAATGAGAGAGAGGGGTTTTACCCAGAAGAAACTCCAGCTTCTTTCTGATATTACAGGGGCTTTAAGGCCTGGCATACTCACAGCACTGATGGGTGTCAGTGGAGCTGGGAAAACAACTTTGCTTGATGTTCTTGCAGGAAGAAAAACAAGCGGATACGTCGAAGGAGAAATCAAAATTGGTGGTTTTCCGAAAGTTCAAGAAACATTTGCTAGGATCTCTGGTTACTGTGAGCAAACTGATATACATTCTTCACAAATAACAGTGGAAGAATCTTTGATTTTCTCTGCTTGGCTCCGTTTGGCACCTAACATTGACTCAAAAACAAAAGCAGTATGGTTTTCTGTTGTTTCTCTCTTAAGCTAAAATTTAAATGGCTATTGATTGTAATCACCAAGGATAACATGTTATTTAAATTGACATTAGTTTCTTGTTTTATAGCAATTTGTGAATGAAGTCCTTGAGACCATCGAACTTGATAGCATAAAGGATTCCTTAGTTGGCATACCTGGTGTTAGTGGTCTATCAACCGAGCAGCGTAAACGGCTAACCATAGCTGTGGAGCTT
GTTTCCAACCCCTCTATCATTTTCATGGATGAGCCTACCACTGGTTTAGATGCAAGAGCAGCTGCAATTGTCATGCGGGCAGTCAAGAATGTGGCTGATACTGGAAGAACGATAGTTTGTACCATCCATCAGCCAAGTATTGACATCTTTGAATCATTTGATGAGGTAAAACTCTTACTCTTTGCTGTAATGCTTTGCTCTGACATGAGGCGTTTGTAGACATTTAAGTAAATGAGACAAACAAATTTGTTTAGTTTATGTTGACAAGATGGGAAGGAAAAAATTTGATTTTGATTTTCTTTTACAATATTATGGATTATTTTGGAGGTTTTTTTCAAAACTGATTTATGTTTTTGTTTTGAAATTGTTTACAACAACTATTATCAAGTAGCTTGTTCAAACAAAAAAGTCAAAACTACAATCACAGGTCCAAACGTTTAAGAGGTAGCATTCATAAAAACAAACAAGGGAAAACAACTTCTAAAAAGCACTAACCTGTACATGCTTACTCTCGTTAAATTTTTGTTAAGGAAGCATAAAACGTATTTTGAACATGGACTCAATTTAGGCTGTTCCGGTTTTCTATGCAGTTGATTCTTCTGAAAACTGGTGGTCATATGATCTATTATGGACCATTGGGACATGATTCAAGCAAGGTCATAGAATATTTTGAGGTCAGTCTGCTCTGCAATTAGCAATGCTGCTTGTGTCATATATTGCATTTTCTTTGTTTCTAAACATTTCATTTTTATTAATACAGCATGTTCCTGGGGTTTCAAAGATTAGGGAAAACTACAATCCTGCAACTTGGATGCTAGAGGTTACCTCCTCAGCTGCAGAAGCTAAACTTGGCATAGATTTTGCTCAAGTGTATAAGAACTCTGCTCTATATGAGTGAGTTAAGCTCTACATTTTTTTTAATTTAATCCGAGTACATAAGTTTTTTTTTTTTTAATTAAATCTTCTTTAGACAAGCAAATAAGAAGCTAATTTATTGGGTATTGAATATGAGATTTATGGACTTTTAGGAAGAAAAACATAAAGCTAGCTTTTCAACCATTCTTGTTCTGCAGGAACAACAAAGAACTTGTTAAGCAGTTGAGTGTTCCACCTCCTGGTTCGAGAGATTTGCACTTTTCGAATATCTTTGCACAAAATTTCGTGAGACAATTTGGGGCTTGCCTTTGGAAACAAAACTTGTCTTATTGGAGGAATCCTCACTATAACTTGCTGCGTATCTTGCATACTGTTGCATCATCTTTGATTTTTGGGATACTATTTTGGAAGCACGGAAAAAAGCTGTAAGTATTTTATGCATTTGTGCAATGTCTAACTTTCCTAATGTGAAAGCTTGTGGAGCCATTCCATTTCAGTTTTCAAAATATAATTAGGAGGCTCACCCGTATGTGCAACATTTTATCTTGTTTCTGTATCCATGTCTGAAGAGAAAACCAACAGAACTTATTCAACAACTTCGGCGTAATGTACTCTAGTGTCATTTTCCTCGGCATCTACAATTGCTCATCAGTCTTTCCTAATATATCAAGGGAAAGAACTGTCATGTACAGGGAAAGGTTTGCCGGAATGTATTCCTCATGGGCTTATTCACTTGCACAGGTATATATTCATCTCAACTTTTTTCGATCTTTATAATATAAACCTACAATAAAAGAAAAATGAAAGTATTAAAGTGAGGCCAAACAAAAAACCAAGCTCCAATTGTATATAAAATAAATGGTATGTGTATACTCCTAGACATTGAGACACATTATTAAAACAGGTTAAACTTCGTACTTCGTCCCAAGAACCCTCTAAGTAGTAAAGAAAGTGGCCTTACACAACTCAAATCTGCTCAATGTTTTATTTTTCTTAATGTCTTGAAGCTCTATGCTATGAAGTCCTCTCATGAGTTTAGCTTTTAAACTTCAAAAACTTTATTCCAACTCTGCAGGTGATTATTGAGGTTCCCTACATATTTGTACAAGCAGCTATCTATGTAA
TTATCACTTATCCAATGATTGGATTCTATGGCTCTGGATGGAAAATATTTTGGTGTTTCTACTCAATGTTCTGTGCACTTCTCTATTTCAAAAATCTTGGGTTGCTGCTTGTGACCATCACCCCAAACTATCATATTGCTACCATTTTGTCCTCTGCTTTCTACGTCATGTTCAATCTCTTTGCTGGCTTTCTCGTTCCGAAACCGGTAAGCCATTTCTGACTGAGGAAGGAATAGAAGTTCCTGTTTGATGTTTATTTTGAATAAAATGAGTTTGCTTTCCTTCTATGTGATTGAGAGTCTGAGAATAAATCAAATACCAACTTTAAACAGAAAGGGACTGAAGCTATAATATCTTTGAAGTATAGAAACTAAAATTTAACTTTTAAAGTACAAAGACTAAATTCTATATAAAAATTTGGGTTTAATCTTATTTTTGTTTCTAAATTTTCACATTTATTATACTTTAGTACTTATTTTTACTAGTGTTTTAGATCTTATTCCATGAGCTTTCATTCAAAATTATCCATTTTAACTCTTATTCTTAAATTTTTGTGTCTTATTATAGAATTGTGGTAATAGATCTAAATCATCTGATCCATATGACTGACAGCACAAGTCCAGGCACTGACATGTAAATGTTTAACAAAAATTTAATGATAATGACTAAAATTTTTAGATGTTAATAGATATCCATAGATGAACACTTGCACTAAAGTTTAAGGTTTGAATAGAATATAGATCTAAAAGAGAAAGGGGCTAAAGATATATTAAGTGTAGCCTATTAACCAGATAACAAAAGAGCATAAAAGCTATCCAAGAGTAAATATATGTGTTACCATAAGATTTAACCCTTGGATCAAAATTTTTCCTTCTTTATGCAGAGAATTCCAAGTTGGTGGATATGGTTTTATTATATGATCCCAACATCATGGACTTTGAATTGCTTGCTAACTTCACAATATGGAGATATAGACAAGGCAATGGTGGTGTTTGGAGAAAGAACAACAACAGTATCAACTTTCTTGAGAGATTATTTTGGGTTTCACAACAATCAACTTCCTCTTGTGAGGTTCATTCTCATCCTCTTGCCTATTGTATTTGCTTGTCTTTTTGGATTTTGTATTGGAAGATTAAACTTCCAGAAGAGATGA

mRNA sequence

ATGGCTCAAATGGTTGGGAGTGCTGAGAGAGGAAGAAGCTCCTCGATCGCGGAGGAAGACAATGATGGCGATGTTGAAGATGCTTCACTTTGGGCGACGATCGAGAGATTGCCAACATTCGAACGGTTGAGATCGTCATTGTTCGACATTAACGATGAAGGAAAGGTGGAAGAGAAAGGAAGAAGAGTTGTAGATGTTACTAAGCTTAGGAATGAGGAGCGCCATATCTTTGTTCATAAACTCATAAAAAATGTTGAGAATGATAATCTTAAGCTCTTAACTAAAGTTAGAGACAGAATCCATAAGGTTGGTGAGAAATTTCCAAGTGTTGAAGTGAAGTATAAAAATGTGCATATTGAAGCAGAGTGTGAGGTTGTTCATGGGAAAGCCATTCCAACACTTTGGAATTCTCTTCAAAGCAAGCTTTATGACATCATCAAATTTTGTGGTGTAAAGTCCCATAAAGCTAAAATTGACATAATTGAGGATGTGAGTGGTGTTATAAATCCAGGAAGGTTAACTTTGCTACTTGGTCCTCCAGGGTGTGGGAAGACCACACTACTGAAGGCCCTCTCTGGAAATCTTAACAAATCTCTCAAGTTTAGTGGAGAAATTTGTTACAATGGACATAAACTTGAAGAATTTGTCCCCCAAAAAACTTCAGCATATGTTAGTCAGCATGACCTACATATTCCTCAAATAACTGTGAGGGAAACCCTAGATTTTTCGGCGAGGTGTCAAGGCATTGGAAGTCGAGCAGATATTATGAAAGAGGTTAATAAAAAGGAGAAGGAACAAGGGATCATTCCAAATCCTGATATTGATACTTACATGAAGATACTTGGATTAGATATTTGTGCTGACACCTTAGTAGGAGATGCCATGAGAAGAGGTATTTCAGGTGGTCAAAAGAAAAGATTAACTACAGGTCGTCGTGATCGAGTTCTCGAATTCTTCGAGCATTGTGGATTCAAATGCCCCAAAAGGAAAAGCATTGCAGATTTTCTTCAAGAAGTGATTTCCAAAAAAGATCAACCACAGTTTTGGTATTGTAAGCAAACTCCTTATGCATATATTTCTATTGACACATTTAGTAGAAAGTTCAAGTGTTGGAATAATTTAGGAAGAAAGATGGAAGAGGAGACTTTGAAACCTTTTGATGAACAAGAAGAATATTCCAAAAATAATGGTGTTTTCTTGAATGGGAAGAATAGTGTTTCTAAATGGCAAGTGTCTAAAGCTTGTGCATCAAGGGAATTCCTCCTCATGAGAAGAAATTCATTTGTTTATGTCTTCAAAACAAGCCAGCTATTCTTCATTGCTTCAATCACCATGACAGTGTTTATAAGAACTGAAATGAAAATAGATCTTCAACATGGAAATTACTACATGGGAGCCTTATTTTATTCTCTCATTATATTACTTGTTGATGCATTACCAGAATTGTCATTGACAATTCAAAGACTTCAAGTTGTCTACAAACAGAAAGAATTACTATTTTATCCAGCTTGGGCTTATGTAATTCCAGCTGCCATTTTGAAGCTCCCTCTTTCACTTGTTCAATCCTTAGTATGGACTTCTCTTACTTATTATGTCATCGGTTACACCCCGGAGGTCTCCAGGTTCTTTCGACATTTTCTTGTGTTATTTGCGGTGCACGTATCTTCATTATCGATGTTTCGAATGATAGCTTTAGTGAGTCAGCACGTTGTTGCTCTTACACTCAGCAGTTTTATAATATTACAAATCATGATATTTGGTGGTTTCATCATCACTCATCCCTCAATGTCACCTTGGTTGCGATGGGGATTTTGGCTTTCACCAATTAGTTATGGAGAAATTGGCCTTTCTATCAATGAATTTCTTGCTCCAAGATGGCAAAAGATGCAAGGTACGAACAGTACGATAGGGCATGTAATTCTTCAAAGTCGAGGACTTGATTATCATCAATACTTCTACTGGATTTCACTCGCAGCTTTGTTTGGATTTGCTATCATTTT
CAATGTTGGATTCGCCTTCGCTCTTACTTTCCTCAACGGCAATTCTTACTGCAAAGATTCATCAACATTTTTCAATTTCATTCTACATTCAGCTCCTGGATCTTCCCCTGCTATTATATCATATGAAAAGCTCTCACAATCAAACTGCAATGGCGGTGCTAATTCTGTCCAAAACCCTCCTTCTTCTCCTAAGACCTCCATAGAATCGACCAAAGGCGGAATAGCATTGCCTTTCACACCTTTAACAGTAGTGTTTCAACATTTGCAGTATTATGTGGACATGCCATCGGGAATGAGAGAGAGGGGTTTTACCCAGAAGAAACTCCAGCTTCTTTCTGATATTACAGGGGCTTTAAGGCCTGGCATACTCACAGCACTGATGGGTGTCAGTGGAGCTGGGAAAACAACTTTGCTTGATGTTCTTGCAGGAAGAAAAACAAGCGGATACGTCGAAGGAGAAATCAAAATTGGTGGTTTTCCGAAAGTTCAAGAAACATTTGCTAGGATCTCTGGTTACTGTGAGCAAACTGATATACATTCTTCACAAATAACAGTGGAAGAATCTTTGATTTTCTCTGCTTGGCTCCGTTTGGCACCTAACATTGACTCAAAAACAAAAGCACAATTTGTGAATGAAGTCCTTGAGACCATCGAACTTGATAGCATAAAGGATTCCTTAGTTGGCATACCTGGTGTTAGTGGTCTATCAACCGAGCAGCGTAAACGGCTAACCATAGCTGTGGAGCTTGTTTCCAACCCCTCTATCATTTTCATGGATGAGCCTACCACTGGTTTAGATGCAAGAGCAGCTGCAATTGTCATGCGGGCAGTCAAGAATGTGGCTGATACTGGAAGAACGATAGTTTGTACCATCCATCAGCCAAGTATTGACATCTTTGAATCATTTGATGAGTTGATTCTTCTGAAAACTGGTGGTCATATGATCTATTATGGACCATTGGGACATGATTCAAGCAAGGTCATAGAATATTTTGAGCATGTTCCTGGGGTTTCAAAGATTAGGGAAAACTACAATCCTGCAACTTGGATGCTAGAGGTTACCTCCTCAGCTGCAGAAGCTAAACTTGGCATAGATTTTGCTCAAGTGTATAAGAACTCTGCTCTATATGAGAACAACAAAGAACTTGTTAAGCAGTTGAGTGTTCCACCTCCTGGTTCGAGAGATTTGCACTTTTCGAATATCTTTGCACAAAATTTCGTGAGACAATTTGGGGCTTGCCTTTGGAAACAAAACTTGTCTTATTGGAGGAATCCTCACTATAACTTGCTGCGTATCTTGCATACTGTTGCATCATCTTTGATTTTTGGGATACTATTTTGGAAGCACGGAAAAAAGCTAGAAAACCAACAGAACTTATTCAACAACTTCGGCGTAATGTACTCTAGTGTCATTTTCCTCGGCATCTACAATTGCTCATCAGTCTTTCCTAATATATCAAGGGAAAGAACTGTCATGTACAGGGAAAGGTTTGCCGGAATGTATTCCTCATGGGCTTATTCACTTGCACAGGTGATTATTGAGGTTCCCTACATATTTGTACAAGCAGCTATCTATGTAATTATCACTTATCCAATGATTGGATTCTATGGCTCTGGATGGAAAATATTTTGGTGTTTCTACTCAATGTTCTGTGCACTTCTCTATTTCAAAAATCTTGGGTTGCTGCTTGTGACCATCACCCCAAACTATCATATTGCTACCATTTTGTCCTCTGCTTTCTACGTCATGTTCAATCTCTTTGCTGGCTTTCTCGTTCCGAAACCGAGAATTCCAAGTTGGTGGATATGGTTTTATTATATGATCCCAACATCATGGACTTTGAATTGCTTGCTAACTTCACAATATGGAGATATAGACAAGGCAATGGTGGTGTTTGGAGAAAGAACAACAACAGTATCAACTTTCTTGAGAGATTATTTTGGGTTTCACAACAATCAACTTCCTCTTGTGAGGTTCATTCTCATCCTCTTGCCTATTG
TATTTGCTTGTCTTTTTGGATTTTGTATTGGAAGATTAAACTTCCAGAAGAGATGA

Coding sequence (CDS)

ATGGCTCAAATGGTTGGGAGTGCTGAGAGAGGAAGAAGCTCCTCGATCGCGGAGGAAGACAATGATGGCGATGTTGAAGATGCTTCACTTTGGGCGACGATCGAGAGATTGCCAACATTCGAACGGTTGAGATCGTCATTGTTCGACATTAACGATGAAGGAAAGGTGGAAGAGAAAGGAAGAAGAGTTGTAGATGTTACTAAGCTTAGGAATGAGGAGCGCCATATCTTTGTTCATAAACTCATAAAAAATGTTGAGAATGATAATCTTAAGCTCTTAACTAAAGTTAGAGACAGAATCCATAAGGTTGGTGAGAAATTTCCAAGTGTTGAAGTGAAGTATAAAAATGTGCATATTGAAGCAGAGTGTGAGGTTGTTCATGGGAAAGCCATTCCAACACTTTGGAATTCTCTTCAAAGCAAGCTTTATGACATCATCAAATTTTGTGGTGTAAAGTCCCATAAAGCTAAAATTGACATAATTGAGGATGTGAGTGGTGTTATAAATCCAGGAAGGTTAACTTTGCTACTTGGTCCTCCAGGGTGTGGGAAGACCACACTACTGAAGGCCCTCTCTGGAAATCTTAACAAATCTCTCAAGTTTAGTGGAGAAATTTGTTACAATGGACATAAACTTGAAGAATTTGTCCCCCAAAAAACTTCAGCATATGTTAGTCAGCATGACCTACATATTCCTCAAATAACTGTGAGGGAAACCCTAGATTTTTCGGCGAGGTGTCAAGGCATTGGAAGTCGAGCAGATATTATGAAAGAGGTTAATAAAAAGGAGAAGGAACAAGGGATCATTCCAAATCCTGATATTGATACTTACATGAAGATACTTGGATTAGATATTTGTGCTGACACCTTAGTAGGAGATGCCATGAGAAGAGGTATTTCAGGTGGTCAAAAGAAAAGATTAACTACAGGTCGTCGTGATCGAGTTCTCGAATTCTTCGAGCATTGTGGATTCAAATGCCCCAAAAGGAAAAGCATTGCAGATTTTCTTCAAGAAGTGATTTCCAAAAAAGATCAACCACAGTTTTGGTATTGTAAGCAAACTCCTTATGCATATATTTCTATTGACACATTTAGTAGAAAGTTCAAGTGTTGGAATAATTTAGGAAGAAAGATGGAAGAGGAGACTTTGAAACCTTTTGATGAACAAGAAGAATATTCCAAAAATAATGGTGTTTTCTTGAATGGGAAGAATAGTGTTTCTAAATGGCAAGTGTCTAAAGCTTGTGCATCAAGGGAATTCCTCCTCATGAGAAGAAATTCATTTGTTTATGTCTTCAAAACAAGCCAGCTATTCTTCATTGCTTCAATCACCATGACAGTGTTTATAAGAACTGAAATGAAAATAGATCTTCAACATGGAAATTACTACATGGGAGCCTTATTTTATTCTCTCATTATATTACTTGTTGATGCATTACCAGAATTGTCATTGACAATTCAAAGACTTCAAGTTGTCTACAAACAGAAAGAATTACTATTTTATCCAGCTTGGGCTTATGTAATTCCAGCTGCCATTTTGAAGCTCCCTCTTTCACTTGTTCAATCCTTAGTATGGACTTCTCTTACTTATTATGTCATCGGTTACACCCCGGAGGTCTCCAGGTTCTTTCGACATTTTCTTGTGTTATTTGCGGTGCACGTATCTTCATTATCGATGTTTCGAATGATAGCTTTAGTGAGTCAGCACGTTGTTGCTCTTACACTCAGCAGTTTTATAATATTACAAATCATGATATTTGGTGGTTTCATCATCACTCATCCCTCAATGTCACCTTGGTTGCGATGGGGATTTTGGCTTTCACCAATTAGTTATGGAGAAATTGGCCTTTCTATCAATGAATTTCTTGCTCCAAGATGGCAAAAGATGCAAGGTACGAACAGTACGATAGGGCATGTAATTCTTCAAAGTCGAGGACTTGATTATCATCAATACTTCTACTGGATTTCACTCGCAGCTTTGTTTGGATTTGCTATCATTTT
CAATGTTGGATTCGCCTTCGCTCTTACTTTCCTCAACGGCAATTCTTACTGCAAAGATTCATCAACATTTTTCAATTTCATTCTACATTCAGCTCCTGGATCTTCCCCTGCTATTATATCATATGAAAAGCTCTCACAATCAAACTGCAATGGCGGTGCTAATTCTGTCCAAAACCCTCCTTCTTCTCCTAAGACCTCCATAGAATCGACCAAAGGCGGAATAGCATTGCCTTTCACACCTTTAACAGTAGTGTTTCAACATTTGCAGTATTATGTGGACATGCCATCGGGAATGAGAGAGAGGGGTTTTACCCAGAAGAAACTCCAGCTTCTTTCTGATATTACAGGGGCTTTAAGGCCTGGCATACTCACAGCACTGATGGGTGTCAGTGGAGCTGGGAAAACAACTTTGCTTGATGTTCTTGCAGGAAGAAAAACAAGCGGATACGTCGAAGGAGAAATCAAAATTGGTGGTTTTCCGAAAGTTCAAGAAACATTTGCTAGGATCTCTGGTTACTGTGAGCAAACTGATATACATTCTTCACAAATAACAGTGGAAGAATCTTTGATTTTCTCTGCTTGGCTCCGTTTGGCACCTAACATTGACTCAAAAACAAAAGCACAATTTGTGAATGAAGTCCTTGAGACCATCGAACTTGATAGCATAAAGGATTCCTTAGTTGGCATACCTGGTGTTAGTGGTCTATCAACCGAGCAGCGTAAACGGCTAACCATAGCTGTGGAGCTTGTTTCCAACCCCTCTATCATTTTCATGGATGAGCCTACCACTGGTTTAGATGCAAGAGCAGCTGCAATTGTCATGCGGGCAGTCAAGAATGTGGCTGATACTGGAAGAACGATAGTTTGTACCATCCATCAGCCAAGTATTGACATCTTTGAATCATTTGATGAGTTGATTCTTCTGAAAACTGGTGGTCATATGATCTATTATGGACCATTGGGACATGATTCAAGCAAGGTCATAGAATATTTTGAGCATGTTCCTGGGGTTTCAAAGATTAGGGAAAACTACAATCCTGCAACTTGGATGCTAGAGGTTACCTCCTCAGCTGCAGAAGCTAAACTTGGCATAGATTTTGCTCAAGTGTATAAGAACTCTGCTCTATATGAGAACAACAAAGAACTTGTTAAGCAGTTGAGTGTTCCACCTCCTGGTTCGAGAGATTTGCACTTTTCGAATATCTTTGCACAAAATTTCGTGAGACAATTTGGGGCTTGCCTTTGGAAACAAAACTTGTCTTATTGGAGGAATCCTCACTATAACTTGCTGCGTATCTTGCATACTGTTGCATCATCTTTGATTTTTGGGATACTATTTTGGAAGCACGGAAAAAAGCTAGAAAACCAACAGAACTTATTCAACAACTTCGGCGTAATGTACTCTAGTGTCATTTTCCTCGGCATCTACAATTGCTCATCAGTCTTTCCTAATATATCAAGGGAAAGAACTGTCATGTACAGGGAAAGGTTTGCCGGAATGTATTCCTCATGGGCTTATTCACTTGCACAGGTGATTATTGAGGTTCCCTACATATTTGTACAAGCAGCTATCTATGTAATTATCACTTATCCAATGATTGGATTCTATGGCTCTGGATGGAAAATATTTTGGTGTTTCTACTCAATGTTCTGTGCACTTCTCTATTTCAAAAATCTTGGGTTGCTGCTTGTGACCATCACCCCAAACTATCATATTGCTACCATTTTGTCCTCTGCTTTCTACGTCATGTTCAATCTCTTTGCTGGCTTTCTCGTTCCGAAACCGAGAATTCCAAGTTGGTGGATATGGTTTTATTATATGATCCCAACATCATGGACTTTGAATTGCTTGCTAACTTCACAATATGGAGATATAGACAAGGCAATGGTGGTGTTTGGAGAAAGAACAACAACAGTATCAACTTTCTTGAGAGATTATTTTGGGTTTCACAACAATCAACTTCCTCTTGTGAGGTTCATTCTCATCCTCTTGCCTATTG
TATTTGCTTGTCTTTTTGGATTTTGTATTGGAAGATTAAACTTCCAGAAGAGATGA

Protein sequence

MAQMVGSAERGRSSSIAEEDNDGDVEDASLWATIERLPTFERLRSSLFDINDEGKVEEKGRRVVDVTKLRNEERHIFVHKLIKNVENDNLKLLTKVRDRIHKVGEKFPSVEVKYKNVHIEAECEVVHGKAIPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSGVINPGRLTLLLGPPGCGKTTLLKALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQITVRETLDFSARCQGIGSRADIMKEVNKKEKEQGIIPNPDIDTYMKILGLDICADTLVGDAMRRGISGGQKKRLTTGRRDRVLEFFEHCGFKCPKRKSIADFLQEVISKKDQPQFWYCKQTPYAYISIDTFSRKFKCWNNLGRKMEEETLKPFDEQEEYSKNNGVFLNGKNSVSKWQVSKACASREFLLMRRNSFVYVFKTSQLFFIASITMTVFIRTEMKIDLQHGNYYMGALFYSLIILLVDALPELSLTIQRLQVVYKQKELLFYPAWAYVIPAAILKLPLSLVQSLVWTSLTYYVIGYTPEVSRFFRHFLVLFAVHVSSLSMFRMIALVSQHVVALTLSSFIILQIMIFGGFIITHPSMSPWLRWGFWLSPISYGEIGLSINEFLAPRWQKMQGTNSTIGHVILQSRGLDYHQYFYWISLAALFGFAIIFNVGFAFALTFLNGNSYCKDSSTFFNFILHSAPGSSPAIISYEKLSQSNCNGGANSVQNPPSSPKTSIESTKGGIALPFTPLTVVFQHLQYYVDMPSGMRERGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLLDVLAGRKTSGYVEGEIKIGGFPKVQETFARISGYCEQTDIHSSQITVEESLIFSAWLRLAPNIDSKTKAQFVNEVLETIELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFMDEPTTGLDARAAAIVMRAVKNVADTGRTIVCTIHQPSIDIFESFDELILLKTGGHMIYYGPLGHDSSKVIEYFEHVPGVSKIRENYNPATWMLEVTSSAAEAKLGIDFAQVYKNSALYENNKELVKQLSVPPPGSRDLHFSNIFAQNFVRQFGACLWKQNLSYWRNPHYNLLRILHTVASSLIFGILFWKHGKKLENQQNLFNNFGVMYSSVIFLGIYNCSSVFPNISRERTVMYRERFAGMYSSWAYSLAQVIIEVPYIFVQAAIYVIITYPMIGFYGSGWKIFWCFYSMFCALLYFKNLGLLLVTITPNYHIATILSSAFYVMFNLFAGFLVPKPRIPSWWIWFYYMIPTSWTLNCLLTSQYGDIDKAMVVFGERTTTVSTFLRDYFGFHNNQLPLVRFILILLPIVFACLFGFCIGRLNFQKR
Homology
BLAST of HG10019907 vs. NCBI nr
Match: XP_008443153.1 (PREDICTED: pleiotropic drug resistance protein 3-like isoform X1 [Cucumis melo])

HSP 1 Score: 2289.6 bits (5932), Expect = 0.0e+00
Identity = 1180/1454 (81.16%), Positives = 1252/1454 (86.11%), Query Frame = 0

Query: 1    MAQMVGSAERGRSSSIAEEDNDGDVEDASLWATIERLPTFERLRSSLFDI-NDEGKVEEK 60
            MAQMV + + GRSSS  EED DGDVEDASLWA IERLPTFERLR SLFDI +DEG+V+EK
Sbjct: 1    MAQMV-ATQIGRSSSSIEEDCDGDVEDASLWAEIERLPTFERLRLSLFDISDDEGEVKEK 60

Query: 61   GRRVVDVTKLRNEERHIFVHKLIKNVENDNLKLLTKVRDRIHKVGEKFPSVEVKYKNVHI 120
             RRV DVTKL N+ER +F+ KLIKNV+ DNLKLLT+VRDRIHKVGEKFP+VEVKYKNVHI
Sbjct: 61   RRRVADVTKLSNKERRLFIEKLIKNVKKDNLKLLTEVRDRIHKVGEKFPTVEVKYKNVHI 120

Query: 121  EAECEVVHGKAIPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSGVINPGRLTLLLGP 180
            EAECEVVHGKAIPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSG+I PGRLTLLLGP
Sbjct: 121  EAECEVVHGKAIPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSGIIKPGRLTLLLGP 180

Query: 181  PGCGKTTLLKALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQITVRET 240
            PGCGKTTLLKALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQ+TVRET
Sbjct: 181  PGCGKTTLLKALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQMTVRET 240

Query: 241  LDFSARCQGIGSRADIMKEVNKKEKEQGIIPNPDIDTYM------------------KIL 300
            LDFSARCQGIGSRADIMKE+ KKEKEQGIIPNPDID YM                  KI 
Sbjct: 241  LDFSARCQGIGSRADIMKEIIKKEKEQGIIPNPDIDIYMKAISIEGLKHSLQTDYILKIF 300

Query: 301  GLDICADTLVGDAMRRGISGGQKKRLTT-------------------------------- 360
            GLD+C DTLVGDAMRRGISGGQKKRLTT                                
Sbjct: 301  GLDVCGDTLVGDAMRRGISGGQKKRLTTGEMMVGPNKALFMDEITNGLDSSTAFQIISCL 360

Query: 361  ----------------------------------------GRRDRVLEFFEHCGFKCPKR 420
                                                    GRRDRVL+FFEHCGFKCPKR
Sbjct: 361  QNLAHLTNATILISLLQPAPETFELFDDLILMAQKKIVYQGRRDRVLDFFEHCGFKCPKR 420

Query: 421  KSIADFLQEVISKKDQPQFWYCKQTPYAYISIDTFSRKFKCW---NNLGRKMEEETLKPF 480
            KS ADFLQEV+S+KDQPQFWY  QTPYAY+SIDT S KFK W   NNL RK EEE LKP+
Sbjct: 421  KSTADFLQEVLSRKDQPQFWYRNQTPYAYVSIDTLSTKFKHWNNNNNLERKAEEEILKPY 480

Query: 481  D----EQEEYSK-NNGVFLN----GKNSVSKWQVSKACASREFLLMRRNSFVYVFKTSQL 540
            D    E + YSK ++G+ LN       SVSKWQV KACASREFLLMRRNSFVYVFK SQL
Sbjct: 481  DNDDQEDQYYSKDDDGILLNIGKINNYSVSKWQVFKACASREFLLMRRNSFVYVFKISQL 540

Query: 541  FFIASITMTVFIRTEMKIDLQHGNYYMGALFYSLIILLVDALPELSLTIQRLQVVYKQKE 600
            F IASITMTVFIRTEMKID++HGNYYMGALFYSL++LLVDALPEL++TIQRL+V YKQK+
Sbjct: 541  FLIASITMTVFIRTEMKIDVEHGNYYMGALFYSLLMLLVDALPELAMTIQRLEVFYKQKQ 600

Query: 601  LLFYPAWAYVIPAAILKLPLSLVQSLVWTSLTYYVIGYTPEVSRFFRHFLVLFAVHVSSL 660
            LLFYP WAYVIP AI KLPLSL+QS VWTSLTYYVIGYTPEVSRFFRHFLVLFA+HVSSL
Sbjct: 601  LLFYPPWAYVIPPAIFKLPLSLLQSFVWTSLTYYVIGYTPEVSRFFRHFLVLFALHVSSL 660

Query: 661  SMFRMIALVSQHVVALTLSSFIILQIMIFGGFIITHPSMSPWLRWGFWLSPISYGEIGLS 720
            SMFRM+ALV+Q +VA T+SSF+ILQIMIFGGFII+H SMS WLRWGFW+SPISYGEIGLS
Sbjct: 661  SMFRMMALVNQQIVASTVSSFVILQIMIFGGFIISHSSMSAWLRWGFWVSPISYGEIGLS 720

Query: 721  INEFLAPRWQKMQGTNSTIGHVILQSRGLDYHQYFYWISLAALFGFAIIFNVGFAFALTF 780
            INEFLAPRWQK+QG+N TIGH+ILQSRGLDYHQYFYWISLAALFGFA++FN GFA ALTF
Sbjct: 721  INEFLAPRWQKIQGSNVTIGHIILQSRGLDYHQYFYWISLAALFGFALLFNFGFALALTF 780

Query: 781  LNGNSYCKDSSTFFNFILHSAPGSSPAIISYEKLSQSNCNGGANSVQNPPSSPKTSIEST 840
            LN                   PGSS AIISYEKLSQSN N  ANS Q+P SSPKTSIEST
Sbjct: 781  LN------------------PPGSSTAIISYEKLSQSNINADANSAQSPLSSPKTSIEST 840

Query: 841  KGGIALPFTPLTVVFQHLQYYVDMPSGMRERGFTQKKLQLLSDITGALRPGILTALMGVS 900
            KGGIALPF PLTVVF+ LQYYVDMPSGMRERGFTQKKLQLLSDITGA+RPGILTALMGVS
Sbjct: 841  KGGIALPFRPLTVVFRDLQYYVDMPSGMRERGFTQKKLQLLSDITGAIRPGILTALMGVS 900

Query: 901  GAGKTTLLDVLAGRKTSGYVEGEIKIGGFPKVQETFARISGYCEQTDIHSSQITVEESLI 960
            GAGKTTLLDVLAGRKTSGY+EGEIKIGGFPKVQETFAR+SGYCEQTD+HSSQITVEESL 
Sbjct: 901  GAGKTTLLDVLAGRKTSGYIEGEIKIGGFPKVQETFARVSGYCEQTDVHSSQITVEESLF 960

Query: 961  FSAWLRLAPNIDSKTKAQFVNEVLETIELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELV 1020
            FSAWLRLAP IDSKTKAQFVNEVLE IELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELV
Sbjct: 961  FSAWLRLAPEIDSKTKAQFVNEVLEIIELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELV 1020

Query: 1021 SNPSIIFMDEPTTGLDARAAAIVMRAVKNVADTGRTIVCTIHQPSIDIFESFDELILLKT 1080
            SNPSIIFMDEPTTGLDARAAAIVMRAVKNVADTGRTIVCTIHQPSI+IFESFDELILLKT
Sbjct: 1021 SNPSIIFMDEPTTGLDARAAAIVMRAVKNVADTGRTIVCTIHQPSINIFESFDELILLKT 1080

Query: 1081 GGHMIYYGPLGHDSSKVIEYFEHVPGVSKIRENYNPATWMLEVTSSAAEAKLGIDFAQVY 1140
            GG MIYYGPLG DS++VIEYFEHVPGVS+IRENYNPATW+LE+TSSAAEAKLGIDFA VY
Sbjct: 1081 GGRMIYYGPLGQDSNQVIEYFEHVPGVSRIRENYNPATWILEITSSAAEAKLGIDFALVY 1140

Query: 1141 KNSALYENNKELVKQLSVPPPGSRDLHFSNIFAQNFVRQFGACLWKQNLSYWRNPHYNLL 1200
            KNS+LYENNKELVKQLS P PGSRDLHFSN+FAQNF RQFGACLWKQNLSYWRNP YNLL
Sbjct: 1141 KNSSLYENNKELVKQLSAPSPGSRDLHFSNVFAQNFARQFGACLWKQNLSYWRNPRYNLL 1200

Query: 1201 RILHTVASSLIFGILFWKHGKKLENQQNLFNNFGVMYSSVIFLGIYNCSSVFPNISRERT 1260
            RILHTVASSLIFG+LFWK GKKLENQQ+LFNNFGVMYSSV+F+GIYNCSSVFPN+SRERT
Sbjct: 1201 RILHTVASSLIFGVLFWKKGKKLENQQDLFNNFGVMYSSVVFMGIYNCSSVFPNVSRERT 1260

Query: 1261 VMYRERFAGMYSSWAYSLAQVIIEVPYIFVQAAIYVIITYPMIGFYGSGWKIFWCFYSMF 1320
            VMYRERFAGMYS WAYSLAQVIIEVPY+FVQAAIYVIITYPMIGFYGS WKIFWCFYSMF
Sbjct: 1261 VMYRERFAGMYSPWAYSLAQVIIEVPYVFVQAAIYVIITYPMIGFYGSAWKIFWCFYSMF 1320

Query: 1321 CALLYFKNLGLLLVTITPNYHIATILSSAFYVMFNLFAGFLVPKPRIPSWWIWFYYMIPT 1352
             ALLYFK+LGLLLV+ITPNYHIATILSSAFYV FNLFAGFLVPKPRIP WWIWFYY+ PT
Sbjct: 1321 FALLYFKSLGLLLVSITPNYHIATILSSAFYVTFNLFAGFLVPKPRIPRWWIWFYYITPT 1380

BLAST of HG10019907 vs. NCBI nr
Match: XP_004136753.2 (pleiotropic drug resistance protein 3 isoform X1 [Cucumis sativus] >KGN59364.1 hypothetical protein Csa_001402 [Cucumis sativus])

HSP 1 Score: 2283.1 bits (5915), Expect = 0.0e+00
Identity = 1174/1446 (81.19%), Positives = 1245/1446 (86.10%), Query Frame = 0

Query: 11   GRSSSIAEEDNDG-DVEDASLWATIERLPTFERLRSSLFDI-NDEGKVEEKGRRVVDVTK 70
            GRSSS AEED +G DVEDASLWA IERLPTF++LRSSLFDI ND+G+V++K RRVVDVTK
Sbjct: 2    GRSSSSAEEDGNGSDVEDASLWAEIERLPTFKQLRSSLFDITNDKGEVKKKRRRVVDVTK 61

Query: 71   LRNEERHIFVHKLIKNVENDNLKLLTKVRDRIHKVGEKFPSVEVKYKNVHIEAECEVVHG 130
            L NEER +F+ KLIKN+E+DN+KLLTKVRDRIH+VGEKFP+VEVKYKNVHIE ECEVVHG
Sbjct: 62   LSNEERGLFIKKLIKNIEDDNVKLLTKVRDRIHRVGEKFPTVEVKYKNVHIEVECEVVHG 121

Query: 131  KAIPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSGVINPGRLTLLLGPPGCGKTTLL 190
            KAIPTLWNSLQSKLY+IIKFCGVKS+KAKIDIIEDVSG+I PGRLTLLLGPPGCGKTTLL
Sbjct: 122  KAIPTLWNSLQSKLYEIIKFCGVKSNKAKIDIIEDVSGIIKPGRLTLLLGPPGCGKTTLL 181

Query: 191  KALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQITVRETLDFSARCQG 250
            KALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYV QHDLHIPQ+TVRETLDFSARCQG
Sbjct: 182  KALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVGQHDLHIPQMTVRETLDFSARCQG 241

Query: 251  IGSRADIMKEVNKKEKEQGIIPNPDIDTYMK------------------ILGLDICADTL 310
            IGSRADIMKE+ KKEKEQGIIPN DID YMK                  I GLDIC DTL
Sbjct: 242  IGSRADIMKEIIKKEKEQGIIPNTDIDIYMKAISIEGLKQSLQTDYILNIFGLDICGDTL 301

Query: 311  VGDAMRRGISGGQKKRLTT----------------------------------------- 370
            VGDAMRRGISGGQKKRLTT                                         
Sbjct: 302  VGDAMRRGISGGQKKRLTTGEMMVGPNKALFMDEITNGLDSSTAFQIISCLQNLSHLTNA 361

Query: 371  -------------------------------GRRDRVLEFFEHCGFKCPKRKSIADFLQE 430
                                           GRRD+VL FFEHCGFKCPKRKSIADFLQE
Sbjct: 362  TILISLLQPAPETFELFDDLILMAQKKIVYQGRRDQVLNFFEHCGFKCPKRKSIADFLQE 421

Query: 431  VISKKDQPQFWYCKQTPYAYISIDTFSRKFKCWNNLG---RKMEEETLKPFD---EQEEY 490
            V+S+KDQPQFWY  QTPY Y+SIDT SRKFKCWNN     RK+E E LKPFD   E + Y
Sbjct: 422  VLSRKDQPQFWYRNQTPYTYVSIDTLSRKFKCWNNNNNNERKVEGENLKPFDNDREDQYY 481

Query: 491  SKN-NGVFLNGKN------SVSKWQVSKACASREFLLMRRNSFVYVFKTSQLFFIASITM 550
            SKN +G+ LN         SVSKW+V KACASREFLLMRRNSFVYVFK SQLF IASITM
Sbjct: 482  SKNDDGILLNNTGQKINNYSVSKWEVFKACASREFLLMRRNSFVYVFKISQLFLIASITM 541

Query: 551  TVFIRTEMKIDLQHGNYYMGALFYSLIILLVDALPELSLTIQRLQVVYKQKELLFYPAWA 610
            TVFIRTEMK D++HGNYYMGALFYSL +LLVDALPEL++TI RL+V YKQK+LLFYP WA
Sbjct: 542  TVFIRTEMKTDVEHGNYYMGALFYSLNMLLVDALPELAMTIHRLEVFYKQKQLLFYPPWA 601

Query: 611  YVIPAAILKLPLSLVQSLVWTSLTYYVIGYTPEVSRFFRHFLVLFAVHVSSLSMFRMIAL 670
            YVIP AILKLPLS +QS +WTSLTYYVIGYTPEVSRFFRHFLVLFA+HVSS+SMFRM+AL
Sbjct: 602  YVIPPAILKLPLSFLQSFLWTSLTYYVIGYTPEVSRFFRHFLVLFALHVSSVSMFRMMAL 661

Query: 671  VSQHVVALTLSSFIILQIMIFGGFIITHPSMSPWLRWGFWLSPISYGEIGLSINEFLAPR 730
            V+QH+VA TLSSF+ILQ MIFGGFII+HPSMS WLRWGFW+SPISYGEIGLSINEFLAPR
Sbjct: 662  VNQHIVASTLSSFVILQTMIFGGFIISHPSMSAWLRWGFWVSPISYGEIGLSINEFLAPR 721

Query: 731  WQKMQGTNSTIGHVILQSRGLDYHQYFYWISLAALFGFAIIFNVGFAFALTFLNGNSYCK 790
            WQK+QG+N TIGH+ILQSRGLDYHQYFYWISLAALFGFA+IFN GFA ALTFLN      
Sbjct: 722  WQKIQGSNVTIGHIILQSRGLDYHQYFYWISLAALFGFALIFNFGFALALTFLN------ 781

Query: 791  DSSTFFNFILHSAPGSSPAIISYEKLSQSNCNGGANSVQNPPSSPKTSIESTKGGIALPF 850
                         PGSS AIISYEKLSQSN N  ANS QNP SSPKTSIESTKGGIALPF
Sbjct: 782  ------------PPGSSTAIISYEKLSQSNINADANSAQNPLSSPKTSIESTKGGIALPF 841

Query: 851  TPLTVVFQHLQYYVDMPSGMRERGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLL 910
             PLTVVF+ LQYYVDMPSGMRERGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLL
Sbjct: 842  RPLTVVFRDLQYYVDMPSGMRERGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLL 901

Query: 911  DVLAGRKTSGYVEGEIKIGGFPKVQETFARISGYCEQTDIHSSQITVEESLIFSAWLRLA 970
            DV+AGRKTSGY+EGEIKIGGFPKVQETFARISGYCEQTD+HSSQITVEESL FSAWLRLA
Sbjct: 902  DVVAGRKTSGYIEGEIKIGGFPKVQETFARISGYCEQTDVHSSQITVEESLFFSAWLRLA 961

Query: 971  PNIDSKTKAQFVNEVLETIELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFM 1030
            P IDSKTKAQFVNEVLETIELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFM
Sbjct: 962  PEIDSKTKAQFVNEVLETIELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFM 1021

Query: 1031 DEPTTGLDARAAAIVMRAVKNVADTGRTIVCTIHQPSIDIFESFDELILLKTGGHMIYYG 1090
            DEPTTGLDARAAAIVMRAVKNVADTGRTIVCTIHQPSIDIFESFDELILLKTGG MIYYG
Sbjct: 1022 DEPTTGLDARAAAIVMRAVKNVADTGRTIVCTIHQPSIDIFESFDELILLKTGGRMIYYG 1081

Query: 1091 PLGHDSSKVIEYFEHVPGVSKIRENYNPATWMLEVTSSAAEAKLGIDFAQVYKNSALYEN 1150
            PLG DS+KVIEYFEHVPGVS+IRENYNPATW+LE+TSS AEAKLGIDFAQVYKNS+LYEN
Sbjct: 1082 PLGRDSNKVIEYFEHVPGVSRIRENYNPATWILEITSSGAEAKLGIDFAQVYKNSSLYEN 1141

Query: 1151 NKELVKQLSVPPPGSRDLHFSNIFAQNFVRQFGACLWKQNLSYWRNPHYNLLRILHTVAS 1210
            NKELVKQLS PPPGSRDL FSN+FAQNF RQFGACLWKQNLSYWRNP YNLLRILHTVAS
Sbjct: 1142 NKELVKQLSAPPPGSRDLQFSNVFAQNFARQFGACLWKQNLSYWRNPRYNLLRILHTVAS 1201

Query: 1211 SLIFGILFWKHGKKLENQQNLFNNFGVMYSSVIFLGIYNCSSVFPNISRERTVMYRERFA 1270
            SLIFG+LFWK GKKLENQQ+LFNNFGVM++SV+F+GIYNCSSVFPN+SRERTVMYRERFA
Sbjct: 1202 SLIFGVLFWKKGKKLENQQDLFNNFGVMFASVVFIGIYNCSSVFPNVSRERTVMYRERFA 1261

Query: 1271 GMYSSWAYSLAQVIIEVPYIFVQAAIYVIITYPMIGFYGSGWKIFWCFYSMFCALLYFKN 1330
            GMYSSWAYSLAQVIIEVPY+FVQAAIYVIITYPMIGFYGS WKIFWCFYSMF ALLYFKN
Sbjct: 1262 GMYSSWAYSLAQVIIEVPYVFVQAAIYVIITYPMIGFYGSAWKIFWCFYSMFFALLYFKN 1321

Query: 1331 LGLLLVTITPNYHIATILSSAFYVMFNLFAGFLVPKPRIPSWWIWFYYMIPTSWTLNCLL 1352
            LGLLLV+ITPNYHIATIL+SAFYV FNLFAGFLVPKPRIP WWIWFYYM PTSWTLNCLL
Sbjct: 1322 LGLLLVSITPNYHIATILASAFYVTFNLFAGFLVPKPRIPRWWIWFYYMSPTSWTLNCLL 1381

BLAST of HG10019907 vs. NCBI nr
Match: XP_031738520.1 (pleiotropic drug resistance protein 3 isoform X2 [Cucumis sativus])

HSP 1 Score: 2277.3 bits (5900), Expect = 0.0e+00
Identity = 1174/1446 (81.19%), Positives = 1244/1446 (86.03%), Query Frame = 0

Query: 11   GRSSSIAEEDNDG-DVEDASLWATIERLPTFERLRSSLFDI-NDEGKVEEKGRRVVDVTK 70
            GRSSS AEED +G DVEDASLWA IERLPTF++LRSSLFDI ND+G+V++K RRVVDVTK
Sbjct: 2    GRSSSSAEEDGNGSDVEDASLWAEIERLPTFKQLRSSLFDITNDKGEVKKKRRRVVDVTK 61

Query: 71   LRNEERHIFVHKLIKNVENDNLKLLTKVRDRIHKVGEKFPSVEVKYKNVHIEAECEVVHG 130
            L NEER +F+ KLIKN+E+DN+KLLTKVRDRIH+VGEKFP+VEVKYKNVHIE ECEVVHG
Sbjct: 62   LSNEERGLFIKKLIKNIEDDNVKLLTKVRDRIHRVGEKFPTVEVKYKNVHIEVECEVVHG 121

Query: 131  KAIPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSGVINPGRLTLLLGPPGCGKTTLL 190
            KAIPTLWNSLQSKLY IIKFCGVKS+KAKIDIIEDVSG+I PGRLTLLLGPPGCGKTTLL
Sbjct: 122  KAIPTLWNSLQSKLY-IIKFCGVKSNKAKIDIIEDVSGIIKPGRLTLLLGPPGCGKTTLL 181

Query: 191  KALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQITVRETLDFSARCQG 250
            KALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYV QHDLHIPQ+TVRETLDFSARCQG
Sbjct: 182  KALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVGQHDLHIPQMTVRETLDFSARCQG 241

Query: 251  IGSRADIMKEVNKKEKEQGIIPNPDIDTYMK------------------ILGLDICADTL 310
            IGSRADIMKE+ KKEKEQGIIPN DID YMK                  I GLDIC DTL
Sbjct: 242  IGSRADIMKEIIKKEKEQGIIPNTDIDIYMKAISIEGLKQSLQTDYILNIFGLDICGDTL 301

Query: 311  VGDAMRRGISGGQKKRLTT----------------------------------------- 370
            VGDAMRRGISGGQKKRLTT                                         
Sbjct: 302  VGDAMRRGISGGQKKRLTTGEMMVGPNKALFMDEITNGLDSSTAFQIISCLQNLSHLTNA 361

Query: 371  -------------------------------GRRDRVLEFFEHCGFKCPKRKSIADFLQE 430
                                           GRRD+VL FFEHCGFKCPKRKSIADFLQE
Sbjct: 362  TILISLLQPAPETFELFDDLILMAQKKIVYQGRRDQVLNFFEHCGFKCPKRKSIADFLQE 421

Query: 431  VISKKDQPQFWYCKQTPYAYISIDTFSRKFKCWNNLG---RKMEEETLKPFD---EQEEY 490
            V+S+KDQPQFWY  QTPY Y+SIDT SRKFKCWNN     RK+E E LKPFD   E + Y
Sbjct: 422  VLSRKDQPQFWYRNQTPYTYVSIDTLSRKFKCWNNNNNNERKVEGENLKPFDNDREDQYY 481

Query: 491  SKN-NGVFLNGKN------SVSKWQVSKACASREFLLMRRNSFVYVFKTSQLFFIASITM 550
            SKN +G+ LN         SVSKW+V KACASREFLLMRRNSFVYVFK SQLF IASITM
Sbjct: 482  SKNDDGILLNNTGQKINNYSVSKWEVFKACASREFLLMRRNSFVYVFKISQLFLIASITM 541

Query: 551  TVFIRTEMKIDLQHGNYYMGALFYSLIILLVDALPELSLTIQRLQVVYKQKELLFYPAWA 610
            TVFIRTEMK D++HGNYYMGALFYSL +LLVDALPEL++TI RL+V YKQK+LLFYP WA
Sbjct: 542  TVFIRTEMKTDVEHGNYYMGALFYSLNMLLVDALPELAMTIHRLEVFYKQKQLLFYPPWA 601

Query: 611  YVIPAAILKLPLSLVQSLVWTSLTYYVIGYTPEVSRFFRHFLVLFAVHVSSLSMFRMIAL 670
            YVIP AILKLPLS +QS +WTSLTYYVIGYTPEVSRFFRHFLVLFA+HVSS+SMFRM+AL
Sbjct: 602  YVIPPAILKLPLSFLQSFLWTSLTYYVIGYTPEVSRFFRHFLVLFALHVSSVSMFRMMAL 661

Query: 671  VSQHVVALTLSSFIILQIMIFGGFIITHPSMSPWLRWGFWLSPISYGEIGLSINEFLAPR 730
            V+QH+VA TLSSF+ILQ MIFGGFII+HPSMS WLRWGFW+SPISYGEIGLSINEFLAPR
Sbjct: 662  VNQHIVASTLSSFVILQTMIFGGFIISHPSMSAWLRWGFWVSPISYGEIGLSINEFLAPR 721

Query: 731  WQKMQGTNSTIGHVILQSRGLDYHQYFYWISLAALFGFAIIFNVGFAFALTFLNGNSYCK 790
            WQK+QG+N TIGH+ILQSRGLDYHQYFYWISLAALFGFA+IFN GFA ALTFLN      
Sbjct: 722  WQKIQGSNVTIGHIILQSRGLDYHQYFYWISLAALFGFALIFNFGFALALTFLN------ 781

Query: 791  DSSTFFNFILHSAPGSSPAIISYEKLSQSNCNGGANSVQNPPSSPKTSIESTKGGIALPF 850
                         PGSS AIISYEKLSQSN N  ANS QNP SSPKTSIESTKGGIALPF
Sbjct: 782  ------------PPGSSTAIISYEKLSQSNINADANSAQNPLSSPKTSIESTKGGIALPF 841

Query: 851  TPLTVVFQHLQYYVDMPSGMRERGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLL 910
             PLTVVF+ LQYYVDMPSGMRERGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLL
Sbjct: 842  RPLTVVFRDLQYYVDMPSGMRERGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLL 901

Query: 911  DVLAGRKTSGYVEGEIKIGGFPKVQETFARISGYCEQTDIHSSQITVEESLIFSAWLRLA 970
            DV+AGRKTSGY+EGEIKIGGFPKVQETFARISGYCEQTD+HSSQITVEESL FSAWLRLA
Sbjct: 902  DVVAGRKTSGYIEGEIKIGGFPKVQETFARISGYCEQTDVHSSQITVEESLFFSAWLRLA 961

Query: 971  PNIDSKTKAQFVNEVLETIELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFM 1030
            P IDSKTKAQFVNEVLETIELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFM
Sbjct: 962  PEIDSKTKAQFVNEVLETIELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFM 1021

Query: 1031 DEPTTGLDARAAAIVMRAVKNVADTGRTIVCTIHQPSIDIFESFDELILLKTGGHMIYYG 1090
            DEPTTGLDARAAAIVMRAVKNVADTGRTIVCTIHQPSIDIFESFDELILLKTGG MIYYG
Sbjct: 1022 DEPTTGLDARAAAIVMRAVKNVADTGRTIVCTIHQPSIDIFESFDELILLKTGGRMIYYG 1081

Query: 1091 PLGHDSSKVIEYFEHVPGVSKIRENYNPATWMLEVTSSAAEAKLGIDFAQVYKNSALYEN 1150
            PLG DS+KVIEYFEHVPGVS+IRENYNPATW+LE+TSS AEAKLGIDFAQVYKNS+LYEN
Sbjct: 1082 PLGRDSNKVIEYFEHVPGVSRIRENYNPATWILEITSSGAEAKLGIDFAQVYKNSSLYEN 1141

Query: 1151 NKELVKQLSVPPPGSRDLHFSNIFAQNFVRQFGACLWKQNLSYWRNPHYNLLRILHTVAS 1210
            NKELVKQLS PPPGSRDL FSN+FAQNF RQFGACLWKQNLSYWRNP YNLLRILHTVAS
Sbjct: 1142 NKELVKQLSAPPPGSRDLQFSNVFAQNFARQFGACLWKQNLSYWRNPRYNLLRILHTVAS 1201

Query: 1211 SLIFGILFWKHGKKLENQQNLFNNFGVMYSSVIFLGIYNCSSVFPNISRERTVMYRERFA 1270
            SLIFG+LFWK GKKLENQQ+LFNNFGVM++SV+F+GIYNCSSVFPN+SRERTVMYRERFA
Sbjct: 1202 SLIFGVLFWKKGKKLENQQDLFNNFGVMFASVVFIGIYNCSSVFPNVSRERTVMYRERFA 1261

Query: 1271 GMYSSWAYSLAQVIIEVPYIFVQAAIYVIITYPMIGFYGSGWKIFWCFYSMFCALLYFKN 1330
            GMYSSWAYSLAQVIIEVPY+FVQAAIYVIITYPMIGFYGS WKIFWCFYSMF ALLYFKN
Sbjct: 1262 GMYSSWAYSLAQVIIEVPYVFVQAAIYVIITYPMIGFYGSAWKIFWCFYSMFFALLYFKN 1321

Query: 1331 LGLLLVTITPNYHIATILSSAFYVMFNLFAGFLVPKPRIPSWWIWFYYMIPTSWTLNCLL 1352
            LGLLLV+ITPNYHIATIL+SAFYV FNLFAGFLVPKPRIP WWIWFYYM PTSWTLNCLL
Sbjct: 1322 LGLLLVSITPNYHIATILASAFYVTFNLFAGFLVPKPRIPRWWIWFYYMSPTSWTLNCLL 1381

BLAST of HG10019907 vs. NCBI nr
Match: XP_022154233.1 (pleiotropic drug resistance protein 3-like isoform X2 [Momordica charantia])

HSP 1 Score: 2113.6 bits (5475), Expect = 0.0e+00
Identity = 1083/1424 (76.05%), Positives = 1203/1424 (84.48%), Query Frame = 0

Query: 8    AERGRSSSIAEEDND-GDVEDASLWATIERLPTFERLRSSLF-DINDEGKVEEKGRRVVD 67
            AE  RS   +  DND  DVEDASLWA IERLPTFER+RSS+F DI+  G+V+EKGRRVVD
Sbjct: 19   AEIRRSLKSSSSDNDSNDVEDASLWAAIERLPTFERVRSSVFDDISSRGEVKEKGRRVVD 78

Query: 68   VTKLRNEERHIFVHKLIKNVENDNLKLLTKVRDRIHKVGEKFPSVEVKYKNVHIEAECEV 127
            VTKL ++ERH+FVHKLIK+VE+DNLKLL KVR+RI +VG  FPSVEVKYKNVHIEAECEV
Sbjct: 79   VTKLDDQERHLFVHKLIKHVESDNLKLLRKVRERIDRVGVTFPSVEVKYKNVHIEAECEV 138

Query: 128  VHGKAIPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSGVINPGRLTLLLGPPGCGKT 187
            VHGKAIPTLWNSL++KLYDIIKFCG KSH+AKIDIIED SGVI PGRLTLLLGPPGCGKT
Sbjct: 139  VHGKAIPTLWNSLRTKLYDIIKFCGAKSHEAKIDIIEDASGVIKPGRLTLLLGPPGCGKT 198

Query: 188  TLLKALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQITVRETLDFSAR 247
            TLLKALSGNL+KSLK SGEICYNGHKLEEFVPQKTSAY+SQ++LHI Q+TVRETLDFS R
Sbjct: 199  TLLKALSGNLDKSLKMSGEICYNGHKLEEFVPQKTSAYISQNELHIAQMTVRETLDFSTR 258

Query: 248  CQGIGSRADIMKEVNKKEKEQGIIPNPDIDTYMKILGLDICADTLVGDAMRRGISGGQKK 307
            CQGIGSRAD+MKE+ K+EKEQGIIP+ D+DTYMKILGLDICA+TL GDAMRRGISGGQKK
Sbjct: 259  CQGIGSRADLMKEIIKREKEQGIIPDSDVDTYMKILGLDICAETLTGDAMRRGISGGQKK 318

Query: 308  RLTTGR------------------------------------------------------ 367
            RLT G                                                       
Sbjct: 319  RLTIGEMIVGPKRVLLMDEITNGLDSSTAFQIVSCLQHLAHFTDATLLVSLLQPAPETFD 378

Query: 368  ------------------RDRVLEFFEHCGFKCPKRKSIADFLQEVISKKDQPQFWY-CK 427
                              RD+VL+FFE+CGFKCP+RK++ADFLQEV+SKKDQPQ+WY   
Sbjct: 379  LFDDLILMVQKKIIYHGPRDQVLQFFENCGFKCPERKNVADFLQEVVSKKDQPQYWYRHD 438

Query: 428  QTPYAYISIDTFSRKFKCWNNLGRKMEEETLKPFDEQEEYSKNNGVFLNGKNSVSKWQVS 487
            +  Y Y+S +TF R FK  ++LGRK++EE  +P+D++ + SK N     G +SVSKWQV 
Sbjct: 439  EARYTYVSNNTFCRMFKS-SSLGRKLDEEVSQPYDDKTK-SKRNVSSSLGVDSVSKWQVF 498

Query: 488  KACASREFLLMRRNSFVYVFKTSQLFFIASITMTVFIRTEMKIDLQHGNYYMGALFYSLI 547
            KACASREFLLM+RNSFVYVFKTSQLFF+ASI MTVF+R++MK+DLQH NYYMGALFY LI
Sbjct: 499  KACASREFLLMKRNSFVYVFKTSQLFFLASIAMTVFLRSQMKVDLQHANYYMGALFYGLI 558

Query: 548  ILLVDALPELSLTIQRLQVVYKQKELLFYPAWAYVIPAAILKLPLSLVQSLVWTSLTYYV 607
            +L+ +A+PEL+LT+QRL+V YKQKEL FYPAWAY IPAAILK+P SLVQ+LVWTSLTYYV
Sbjct: 559  MLVFNAVPELALTVQRLEVFYKQKELKFYPAWAYAIPAAILKIPFSLVQALVWTSLTYYV 618

Query: 608  IGYTPEVSRFFRHFLVLFAVHVSSLSMFRMIALVSQHV-VALTLSSFIILQIMIFGGFII 667
            IGYTPE SRFFRHFLVLFAV++ SLSMFR++A V + + VA ++SSF +L I+ F GFII
Sbjct: 619  IGYTPEFSRFFRHFLVLFAVNILSLSMFRLLASVIRSIDVAPSISSFTLLLILTFAGFII 678

Query: 668  THPSMSPWLRWGFWLSPISYGEIGLSINEFLAPRWQKMQGTNSTIGHVILQSRGLDYHQY 727
            TH SM  W+ WGFW+SPISYGEIGLSINEFLAPRWQK Q TN+TIGH+ILQSRGLD+HQY
Sbjct: 679  THTSMPAWMEWGFWVSPISYGEIGLSINEFLAPRWQKRQSTNTTIGHIILQSRGLDFHQY 738

Query: 728  FYWISLAALFGFAIIFNVGFAFALTFLNGNSYCKDSSTFFNFILHSAPGSSPAIISYEKL 787
            FYWISL ALFGFA++FNVGF  ALTFLN                  +PGSS AIISYEKL
Sbjct: 739  FYWISLGALFGFAVLFNVGFTLALTFLN------------------SPGSSRAIISYEKL 798

Query: 788  ----SQSNCNGGANSVQNPPSSPKTSIESTKGGIALPFTPLTVVFQHLQYYVDMPSGMRE 847
                S  +CNGGANSV+   +SPK +IES+KG IALPFTPLTVVF+ L YYVDMP  MRE
Sbjct: 799  GRAKSSEDCNGGANSVEQQAASPKAAIESSKGRIALPFTPLTVVFRDLHYYVDMPVAMRE 858

Query: 848  RGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLLDVLAGRKTSGYVEGEIKIGGFP 907
            RGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLLDVLAGRKTSGY+EGEIKIGGFP
Sbjct: 859  RGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLLDVLAGRKTSGYIEGEIKIGGFP 918

Query: 908  KVQETFARISGYCEQTDIHSSQITVEESLIFSAWLRLAPNIDSKTKAQFVNEVLETIELD 967
            KVQETFARISGYCEQTDIHSSQITVEESLIFSAWLRLA NIDSKTK QFVNEVLETIELD
Sbjct: 919  KVQETFARISGYCEQTDIHSSQITVEESLIFSAWLRLASNIDSKTKEQFVNEVLETIELD 978

Query: 968  SIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFMDEPTTGLDARAAAIVMRAVKNV 1027
            SIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFMDEPTTGLDARAAAIVMRAVKNV
Sbjct: 979  SIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFMDEPTTGLDARAAAIVMRAVKNV 1038

Query: 1028 ADTGRTIVCTIHQPSIDIFESFDELILLKTGGHMIYYGPLGHDSSKVIEYFEHVPGVSKI 1087
             DTGRTIVCTIHQPSIDIFESFDELILLKTGGH+IYYGPLG  SSKVIE+FE VPGVS I
Sbjct: 1039 VDTGRTIVCTIHQPSIDIFESFDELILLKTGGHVIYYGPLGRHSSKVIEFFEQVPGVSMI 1098

Query: 1088 RENYNPATWMLEVTSSAAEAKLGIDFAQVYKNSALYENNKELVKQLSVPPPGSRDLHFSN 1147
            REN+NPATWMLEVTSSAAEAKLGIDFAQVYKNSALY+NNKE+VKQLS PPPGSRDLHFSN
Sbjct: 1099 RENHNPATWMLEVTSSAAEAKLGIDFAQVYKNSALYKNNKEIVKQLSTPPPGSRDLHFSN 1158

Query: 1148 IFAQNFVRQFGACLWKQNLSYWRNPHYNLLRILHTVASSLIFGILFWKHGKKLENQQNLF 1207
            +FAQ+F  QF ACLWKQNLSYWRNP YNL RIL+T+ASSL+FG LFWKHGKKLENQQNLF
Sbjct: 1159 VFAQSFAGQFKACLWKQNLSYWRNPCYNLTRILYTIASSLVFGTLFWKHGKKLENQQNLF 1218

Query: 1208 NNFGVMYSSVIFLGIYNCSSVFPNISRERTVMYRERFAGMYSSWAYSLAQVIIEVPYIFV 1267
            NNFG MYSSV F+GI+NC++VFPN+SRERTVMYRERFAGMYSSWAYSLAQVIIEVPYIFV
Sbjct: 1219 NNFGSMYSSVNFIGIHNCATVFPNVSRERTVMYRERFAGMYSSWAYSLAQVIIEVPYIFV 1278

Query: 1268 QAAIYVIITYPMIGFYGSGWKIFWCFYSMFCALLYFKNLGLLLVTITPNYHIATILSSAF 1327
            QAA +VIITYPMIG+YGS  K+FWCFYSMFCALLYF  LG+LL+++TPN+HIA+IL+SAF
Sbjct: 1279 QAAAFVIITYPMIGYYGSSSKVFWCFYSMFCALLYFNYLGMLLISVTPNFHIASILASAF 1338

Query: 1328 YVMFNLFAGFLVPKPRIPSWWIWFYYMIPTSWTLNCLLTSQYGDIDKAMVVFGERTTTVS 1352
            Y  FNLFAGFLVPKPRIP WWIWFYYM PTSWTLNCLLTSQYGDI+K ++VFGER  TVS
Sbjct: 1339 YSTFNLFAGFLVPKPRIPRWWIWFYYMSPTSWTLNCLLTSQYGDINKTLMVFGER-RTVS 1398

BLAST of HG10019907 vs. NCBI nr
Match: KAA0043676.1 (pleiotropic drug resistance protein 3-like isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 2112.0 bits (5471), Expect = 0.0e+00
Identity = 1103/1379 (79.99%), Positives = 1171/1379 (84.92%), Query Frame = 0

Query: 1    MAQMVGSAERGRSSSIAEEDNDGDVEDASLWATIERLPTFERLRSSLFDI-NDEGKVEEK 60
            MAQMV + + GRSSS  EED DGDVEDASLWA IERLPTFERLR SLFDI +DEG+V+EK
Sbjct: 1    MAQMV-ATQIGRSSSSIEEDCDGDVEDASLWAEIERLPTFERLRLSLFDISDDEGEVKEK 60

Query: 61   GRRVVDVTKLRNEERHIFVHKLIKNVENDNLKLLTKVRDRIHKVGEKFPSVEVKYKNVHI 120
             RRV DVTKL N+ER +F+ KLIKNV+ DNLKLLT+VRDRIHKVGEKFP+VEVKYKNVHI
Sbjct: 61   RRRVADVTKLSNKERRLFIEKLIKNVKKDNLKLLTEVRDRIHKVGEKFPTVEVKYKNVHI 120

Query: 121  EAECEVVHGKAIPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSGVINPGRLTLLLGP 180
            EAECEVVHGKAIPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSG+I PGRLTLLLGP
Sbjct: 121  EAECEVVHGKAIPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSGIIKPGRLTLLLGP 180

Query: 181  PGCGKTTLLKALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQITVRET 240
            PGCGKTTLLKALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQ+TVRET
Sbjct: 181  PGCGKTTLLKALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQMTVRET 240

Query: 241  LDFSARCQGIGSRADIMKEVNKKEKEQGIIPNPDIDTYM------------------KIL 300
            LDFSARCQGIGSRADIMKE+ KKEKEQGIIPNPDID YM                  KI 
Sbjct: 241  LDFSARCQGIGSRADIMKEIIKKEKEQGIIPNPDIDIYMKAISIEGLKHSLQTDYILKIF 300

Query: 301  GLDICADTLVGDAMRRGISGGQKKRLTT-------------------------------- 360
            GLD+C DTLVGDAMRRGISGGQKKRLTT                                
Sbjct: 301  GLDVCGDTLVGDAMRRGISGGQKKRLTTGEMMVGPNKALFMDEITNGLDSSTAFQIISCL 360

Query: 361  ----------------------------------------GRRDRVLEFFEHCGFKCPKR 420
                                                    GRRDRVL+FFEHCGFKCPKR
Sbjct: 361  QNLAHLTNATILISLLQPAPETFELFDDLILMAQKKIVYQGRRDRVLDFFEHCGFKCPKR 420

Query: 421  KSIADFLQEVISKKDQPQFWYCKQTPYAYISIDTFSRKFKCW---NNLGRKMEEETLKPF 480
            KS ADFLQEV+S+KDQPQFWY  QTPYAY+SIDT S KFK W   NNL RK EEE LKP+
Sbjct: 421  KSTADFLQEVLSRKDQPQFWYRNQTPYAYVSIDTLSTKFKHWNNNNNLERKAEEEILKPY 480

Query: 481  D----EQEEYSK-NNGVFLN----GKNSVSKWQVSKACASREFLLMRRNSFVYVFKTS-Q 540
            D    E + YSK ++G+ LN       SVSKWQV KACASREFLLMRRNSFVYVFK S Q
Sbjct: 481  DNDDQEDQYYSKDDDGILLNIGKINNYSVSKWQVFKACASREFLLMRRNSFVYVFKISQQ 540

Query: 541  LFFIASITMTVFIRTEMKIDLQHGNYYMGALFYSLIILLVDALPELSLTIQRLQVVYKQK 600
            LF IASITMTVFIRTEMKID++HGNYYMGALFYSL++LLVDALPEL++TIQRL+V YKQK
Sbjct: 541  LFLIASITMTVFIRTEMKIDVEHGNYYMGALFYSLLMLLVDALPELAMTIQRLEVFYKQK 600

Query: 601  ELLFYPAWAYVIPAAILKLPLSLVQSLVWTSLTYYVIGYTPEVSRFFRHFLVLFAVHVSS 660
            +LLFYP WAYVIP AI KLPLSL+QS VWTSLTYYVIGYTPEVSRFFRHFLVLFA+HVSS
Sbjct: 601  QLLFYPPWAYVIPPAIFKLPLSLLQSFVWTSLTYYVIGYTPEVSRFFRHFLVLFALHVSS 660

Query: 661  LSMFRMIALVSQHVVALTLSSFIILQIMIFGGFIITHPSMSPWLRWGFWLSPISYGEIGL 720
            LSMFRM+ALV+Q +VA T+SSF+ILQIMIFGGFII+H SMS WLRWGFW+SPISYGEIGL
Sbjct: 661  LSMFRMMALVNQQIVASTVSSFVILQIMIFGGFIISHSSMSAWLRWGFWVSPISYGEIGL 720

Query: 721  SINEFLAPRWQKMQGTNSTIGHVILQSRGLDYHQYFYWISLAALFGFAIIFNVGFAFALT 780
            SINEFLAPRWQK+QG+N TIGH+ILQSRGLDYHQYFYWISLAALFGFA++FN GFA ALT
Sbjct: 721  SINEFLAPRWQKIQGSNVTIGHIILQSRGLDYHQYFYWISLAALFGFALLFNFGFALALT 780

Query: 781  FLNGNSYCKDSSTFFNFILHSAPGSSPAIISYEKLSQSNCNGGANSVQNPPSSPKTSIES 840
            FLN                   PGSS AIISYEKLSQSN N  ANS Q+P SSPKTSIES
Sbjct: 781  FLN------------------PPGSSTAIISYEKLSQSNINADANSAQSPLSSPKTSIES 840

Query: 841  TK-------------GGIALPFTPLTVVFQHLQYYVDMPSGMRERGFTQKKLQLLSDITG 900
            TK             GGIALPF PLTVVF+ LQYYVDMPSGMRERGFTQKKLQLLSDITG
Sbjct: 841  TKGNILLNFTVRLLMGGIALPFRPLTVVFRDLQYYVDMPSGMRERGFTQKKLQLLSDITG 900

Query: 901  ALRPGILTALMGVSGAGKTTLLDVLAGRKTSGYVEGEIKIGGFPKVQETFARISGYCEQT 960
            A+RPGILTALMGVSGAGKTTLLDVLAGRKTSGY+EGEIKIGGFPKVQETFAR+SGYCEQT
Sbjct: 901  AIRPGILTALMGVSGAGKTTLLDVLAGRKTSGYIEGEIKIGGFPKVQETFARVSGYCEQT 960

Query: 961  DIHSSQITVEESLIFSAWLRLAPNIDSKTKAQFVNEVLETIELDSIKDSLVGIPGVSGLS 1020
            D+HSSQITVEESL FSAWLRLAP IDSKTKAQFVNEVLE IELDSIKDSLVGIPGVSGLS
Sbjct: 961  DVHSSQITVEESLFFSAWLRLAPEIDSKTKAQFVNEVLEIIELDSIKDSLVGIPGVSGLS 1020

Query: 1021 TEQRKRLTIAVELVSNPSIIFMDEPTTGLDARAAAIVMRAVKNVADTGRTIVCTIHQPSI 1080
            TEQRKRLTIAVELVSNPSIIFMDEPTTGLDARAAAIVMRAVKNVADTGRTIVCTIHQPSI
Sbjct: 1021 TEQRKRLTIAVELVSNPSIIFMDEPTTGLDARAAAIVMRAVKNVADTGRTIVCTIHQPSI 1080

Query: 1081 DIFESFDELILLKTGGHMIYYGPLGHDSSKVIEYFEHVPGVSKIRENYNPATWMLEVTSS 1140
            +IFESFDELILLKTGG MIYYGPLG DS++VIEYFEHVPGVS+IRENYNPATW+LE+TSS
Sbjct: 1081 NIFESFDELILLKTGGRMIYYGPLGQDSNQVIEYFEHVPGVSRIRENYNPATWILEITSS 1140

Query: 1141 AAEAKLGIDFAQVYKNSALYENNKELVKQLSVPPPGSRDLHFSNIFAQNFVRQFGACLWK 1200
            AAEAKLGIDFA VYKNS+LYENNKELVKQLS P PGSRDLHFSN+FAQNF RQFGACLWK
Sbjct: 1141 AAEAKLGIDFALVYKNSSLYENNKELVKQLSAPSPGSRDLHFSNVFAQNFARQFGACLWK 1200

Query: 1201 QNLSYWRNPHYNLLRILHTVASSLIFGILFWKHGKKLENQQNLFNNFGVMYSSVIFLGIY 1260
            QNLSYWRNP YNLLRILHTVASSLIFG+LFWK GKKLENQQ+LFNNFGVMYSSV+F+GIY
Sbjct: 1201 QNLSYWRNPRYNLLRILHTVASSLIFGVLFWKKGKKLENQQDLFNNFGVMYSSVVFMGIY 1260

Query: 1261 NCSSVFPNISRERTVMYRERFAGMYSSWAYSLAQVIIEVPYIFVQAAIYVIITYPMIGFY 1263
            NCSSVFPN+SRERTVMYRERFAGMYS WAYSLAQVIIEVPY+FVQAAIYVIITYPMIGFY
Sbjct: 1261 NCSSVFPNVSRERTVMYRERFAGMYSPWAYSLAQVIIEVPYVFVQAAIYVIITYPMIGFY 1320

BLAST of HG10019907 vs. ExPASy Swiss-Prot
Match: Q5W274 (Pleiotropic drug resistance protein 3 OS=Nicotiana tabacum OX=4097 GN=PDR3 PE=2 SV=1)

HSP 1 Score: 1709.5 bits (4426), Expect = 0.0e+00
Identity = 890/1471 (60.50%), Positives = 1095/1471 (74.44%), Query Frame = 0

Query: 1    MAQMVGSAE-------------------RGRSSSI--------AEEDNDGDVEDASLWAT 60
            MAQ+VGS E                   RG+SSS         +++D+  D E+   WA 
Sbjct: 1    MAQLVGSDEIESFRMDLAEIGRSLRSSFRGQSSSFRSNSALSASQKDDAVDEENMLAWAA 60

Query: 61   IERLPTFERLRSSLFDINDEGKVEEKGRRVVDVTKLRNEERHIFVHKLIKNVENDNLKLL 120
            IERLPTF+RLRSSLF+  +      K +RV DVTKL   ERH+F+ K+IK++E+DNL+LL
Sbjct: 61   IERLPTFDRLRSSLFEEINGNDANVKRKRVTDVTKLGALERHVFIEKMIKHIEHDNLQLL 120

Query: 121  TKVRDRIHKVGEKFPSVEVKYKNVHIEAECEVVHGKAIPTLWNSLQSKLYDIIKFCGVKS 180
             K+R RI KVG + P+VEV+YKN+ IEAECE+VHGK +PTLWNSL+S   ++ +  G++S
Sbjct: 121  HKIRKRIDKVGVELPTVEVRYKNLTIEAECELVHGKPLPTLWNSLKSITMNLARLPGLQS 180

Query: 181  HKAKIDIIEDVSGVINPGRLTLLLGPPGCGKTTLLKALSGNLNKSLKFSGEICYNGHKLE 240
              AKI I+ DVSGVI PGR+TLLLGPPGCGKT+LLKALSGNL+KSLK SGEI YNG+KLE
Sbjct: 181  ELAKIKILNDVSGVIKPGRMTLLLGPPGCGKTSLLKALSGNLDKSLKVSGEISYNGYKLE 240

Query: 241  EFVPQKTSAYVSQHDLHIPQITVRETLDFSARCQGIGSRADIMKEVNKKEKEQGIIPNPD 300
            EFVPQKTSAYVSQ+DLHIP++TVRETLD+S+R QG+GSRA+IM +++++EKE G++P+PD
Sbjct: 241  EFVPQKTSAYVSQNDLHIPEMTVRETLDYSSRFQGVGSRAEIMTDLSRREKEAGVVPDPD 300

Query: 301  IDTYM------------------KILGLDICADTLVGDAMRRGISGGQKKRLTTGR---- 360
            IDTYM                  KILGLDICADTLVGDAMRRGISGGQKKRLTTG     
Sbjct: 301  IDTYMKAISIEGQKKNLQTDYILKILGLDICADTLVGDAMRRGISGGQKKRLTTGELIVG 360

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 361  PIKALFMDEISNGLDSSTTYQIVACLQQLAHITDATILVSLLQPAPETFDLFDDIILMAE 420

Query: 421  --------RDRVLEFFEHCGFKCPKRKSIADFLQEVISKKDQPQFWYCKQTPYAYISIDT 480
                    R+  LEFFE CGFKCP+RK +ADFLQEV SKKDQ Q+W+  +  Y ++S+D 
Sbjct: 421  GKILYHGPRNSALEFFESCGFKCPERKGVADFLQEVTSKKDQAQYWHGTKETYKFVSVDM 480

Query: 481  FSRKFKCWNNLGRKMEEETLKPFDEQEEYSKNNGVFLNGKNSVSKWQVSKACASREFLLM 540
             SRKFK  +   +K+ EE   P+D    + +N+  F +   S+ KW++ +AC SREFLLM
Sbjct: 481  LSRKFK-ESPYRKKLNEELSVPYDNSRSH-RNSITFRD--YSLPKWELFRACMSREFLLM 540

Query: 541  RRNSFVYVFKTSQLFFIASITMTVFIRTEMKIDLQHGNYYMGALFYSLIILLVDALPELS 600
            +RNSF+Y+FKT QL  IASITMTVF+RT M  DL H NYY+GALFY+LIILLVD  PELS
Sbjct: 541  KRNSFIYIFKTVQLAIIASITMTVFLRTRMDTDLVHANYYLGALFYALIILLVDGFPELS 600

Query: 601  LTIQRLQVVYKQKELLFYPAWAYVIPAAILKLPLSLVQSLVWTSLTYYVIGYTPEVSRFF 660
            +TI RL V YKQ EL FYPAWAY IPA ILK+PLSL++S++W S+TYYVIG++PE  RFF
Sbjct: 601  MTITRLAVFYKQSELCFYPAWAYTIPATILKIPLSLLESVIWASMTYYVIGFSPEAGRFF 660

Query: 661  RHFLVLFAVHVSSLSMFRMIALVSQHVVALTLSSFI-ILQIMIFGGFIITHPSMSPWLRW 720
            R  L+LFAVH++S+SMFR +A V + +VA T +  + IL ++ F GFII  PSM  WL+W
Sbjct: 661  RQLLLLFAVHMTSISMFRFLASVCRTIVASTAAGGLSILFVLCFSGFIIPRPSMPIWLKW 720

Query: 721  GFWLSPISYGEIGLSINEFLAPRWQKMQGTNSTIGHVILQSRGLDYHQYFYWISLAALFG 780
            GFW+SP++YGEIGL++NEFLAPRWQK   TN++IG+ +L+SRGL++  YFYWIS+ ALFG
Sbjct: 721  GFWISPLTYGEIGLAVNEFLAPRWQKTLPTNTSIGNEVLESRGLNFDGYFYWISVCALFG 780

Query: 781  FAIIFNVGFAFALTFLNGNSYCKDSSTFFNFILHSAPGSSPAIISYEKLSQSNCNGGANS 840
            F I+FN+GF  ALTFL                   APGS  AIIS +K SQ   +  +  
Sbjct: 781  FTILFNIGFTLALTFL------------------KAPGSR-AIISTDKYSQIEGSSDSID 840

Query: 841  VQNPPSSPKTSIESTK--GGIALPFTPLTVVFQHLQYYVDMPSGMRERGFTQKKLQLLSD 900
              +   + K +++S +  G + LPF PL++VFQ +QYYVD P+ M E GFTQK+LQLLSD
Sbjct: 841  KADAAENSKATMDSHERAGRMVLPFEPLSLVFQDVQYYVDTPAAMTELGFTQKRLQLLSD 900

Query: 901  ITGALRPGILTALMGVSGAGKTTLLDVLAGRKTSGYVEGEIKIGGFPKVQETFARISGYC 960
            ITGALRPGILTALMGVSGAGKTTLLDVLAGRKT+GYVEGEIK+GG+PKVQETFAR+SGYC
Sbjct: 901  ITGALRPGILTALMGVSGAGKTTLLDVLAGRKTTGYVEGEIKVGGYPKVQETFARVSGYC 960

Query: 961  EQTDIHSSQITVEESLIFSAWLRLAPNIDSKTKAQFVNEVLETIELDSIKDSLVGIPGVS 1020
            EQTDIHS QITVEES+IFSAWLRL P IDSKTK +FV EV+ETIELD IK  LVG+PGVS
Sbjct: 961  EQTDIHSPQITVEESVIFSAWLRLHPQIDSKTKYEFVKEVIETIELDGIKGMLVGMPGVS 1020

Query: 1021 GLSTEQRKRLTIAVELVSNPSIIFMDEPTTGLDARAAAIVMRAVKNVADTGRTIVCTIHQ 1080
            GLSTEQRKRLTIAVELV+NPSIIFMDEPTTGLDAR+AAIVMRAVKNVADTGRTIVCTIHQ
Sbjct: 1021 GLSTEQRKRLTIAVELVANPSIIFMDEPTTGLDARSAAIVMRAVKNVADTGRTIVCTIHQ 1080

Query: 1081 PSIDIFESFDELILLKTGGHMIYYGPLGHDSSKVIEYFEHVPGVSKIRENYNPATWMLEV 1140
            PSIDIFE+FDELILLKTGG MIY+G LG +S K+IEYFE +  V KI+ N+NPATWMLEV
Sbjct: 1081 PSIDIFEAFDELILLKTGGRMIYWGHLGRNSCKMIEYFEGISCVPKIKNNHNPATWMLEV 1140

Query: 1141 TSSAAEAKLGIDFAQVYKNSALYENNKELVKQLSVPPPGSRDLHFSNIFAQNFVRQFGAC 1200
            TS+++EA + IDFA+VYKNSAL++NN+ELVK+LS PP GS+DLHF   F+QN   QF  C
Sbjct: 1141 TSTSSEADISIDFAEVYKNSALHKNNEELVKKLSFPPAGSKDLHFPTRFSQNGWGQFKTC 1200

Query: 1201 LWKQNLSYWRNPHYNLLRILHTVASSLIFGILFWKHGKKLENQQNLFNNFGVMYSSVIFL 1260
             WKQ  SYWR+P YNL+R LH + +SL+ G+LFW  GKKL+NQQ++F+ FG M+++VIF 
Sbjct: 1201 FWKQYWSYWRSPSYNLMRSLHMLFASLVSGLLFWDKGKKLDNQQSVFSVFGAMFTAVIFC 1260

Query: 1261 GIYNCSSVFPNISRERTVMYRERFAGMYSSWAYSLAQVIIEVPYIFVQAAIYVIITYPMI 1320
            GI N SSV P ++ ER+V+YRERFAGMY+SWAY+LAQV IE+PY+  QA  + +ITYPMI
Sbjct: 1261 GINNSSSVLPYVTTERSVLYRERFAGMYASWAYALAQVAIEIPYLLAQALAFTVITYPMI 1320

Query: 1321 GFYGSGWKIFWCFYSMFCALLYFKNLGLLLVTITPNYHIATILSSAFYVMFNLFAGFLVP 1352
            G+Y S +K+FW FYSMFC LLYF  LG++LV++TP++ +A IL S+FY MFNLFAGFL+P
Sbjct: 1321 GYYWSAYKVFWYFYSMFCTLLYFTYLGMMLVSMTPSFPVAAILQSSFYTMFNLFAGFLMP 1380

BLAST of HG10019907 vs. ExPASy Swiss-Prot
Match: Q9LFH0 (ABC transporter G family member 37 OS=Arabidopsis thaliana OX=3702 GN=ABCG37 PE=1 SV=1)

HSP 1 Score: 1657.9 bits (4292), Expect = 0.0e+00
Identity = 853/1439 (59.28%), Positives = 1050/1439 (72.97%), Query Frame = 0

Query: 13   SSSIAEEDNDGDVED-----ASLWATIERLPTFERLRSSLFDINDEGKVEEKGRRVVDVT 72
            SSSI E +NDGDV D     A  WA IERLPT +R+RS+L D  DE  + EKGRRVVDVT
Sbjct: 38   SSSIYEVENDGDVNDHDAEYALQWAEIERLPTVKRMRSTLLDDGDE-SMTEKGRRVVDVT 97

Query: 73   KLRNEERHIFVHKLIKNVENDNLKLLTKVRDRIHKVGEKFPSVEVKYKNVHIEAECEVVH 132
            KL   ERH+ + KLIK++ENDNLKLL K+R RI +VG + P++EV+Y+++ + AECEVV 
Sbjct: 98   KLGAVERHLMIEKLIKHIENDNLKLLKKIRRRIDRVGMELPTIEVRYESLKVVAECEVVE 157

Query: 133  GKAIPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSGVINPGRLTLLLGPPGCGKTTL 192
            GKA+PTLWN+ +  L +++K  G K+H+AKI+II DV+G+I PGRLTLLLGPP CGKTTL
Sbjct: 158  GKALPTLWNTAKRVLSELVKLTGAKTHEAKINIINDVNGIIKPGRLTLLLGPPSCGKTTL 217

Query: 193  LKALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQITVRETLDFSARCQ 252
            LKALSGNL  +LK SGEI YNGH+L+EFVPQKTSAY+SQ+DLHI ++TVRET+DFSARCQ
Sbjct: 218  LKALSGNLENNLKCSGEISYNGHRLDEFVPQKTSAYISQYDLHIAEMTVRETVDFSARCQ 277

Query: 253  GIGSRADIMKEVNKKEKEQGIIPNPDIDTYM------------------KILGLDICADT 312
            G+GSR DIM EV+K+EKE+GIIP+ ++D YM                  KILGLDICA+ 
Sbjct: 278  GVGSRTDIMMEVSKREKEKGIIPDTEVDAYMKAISVEGLQRSLQTDYILKILGLDICAEI 337

Query: 313  LVGDAMRRGISGGQKKRLTT---------------------------------------- 372
            L+GD MRRGISGGQKKRLTT                                        
Sbjct: 338  LIGDVMRRGISGGQKKRLTTAEMIVGPTKALFMDEITNGLDSSTAFQIVKSLQQFAHISS 397

Query: 373  --------------------------------GRRDRVLEFFEHCGFKCPKRKSIADFLQ 432
                                            G R  VL FFE CGF+CP+RK +ADFLQ
Sbjct: 398  ATVLVSLLQPAPESYDLFDDIMLMAKGRIVYHGPRGEVLNFFEDCGFRCPERKGVADFLQ 457

Query: 433  EVISKKDQPQFWYCKQTPYAYISIDTFSRKFKCWNNLGRKMEEETLKPFDEQEEYSKNNG 492
            EVISKKDQ Q+W+ +  PY+++S++  S+KFK   ++G+K+E+   KP+D  + +     
Sbjct: 458  EVISKKDQAQYWWHEDLPYSFVSVEMLSKKFKDL-SIGKKIEDTLSKPYDRSKSH---KD 517

Query: 493  VFLNGKNSVSKWQVSKACASREFLLMRRNSFVYVFKTSQLFFIASITMTVFIRTEMKIDL 552
                   S+  W++  AC SRE+LLM+RN FVY+FKT+QL   A ITMTVFIRT M ID+
Sbjct: 518  ALSFSVYSLPNWELFIACISREYLLMKRNYFVYIFKTAQLVMAAFITMTVFIRTRMGIDI 577

Query: 553  QHGNYYMGALFYSLIILLVDALPELSLTIQRLQVVYKQKELLFYPAWAYVIPAAILKLPL 612
             HGN YM ALF++LIILLVD  PELS+T QRL V YKQK+L FYPAWAY IPA +LK+PL
Sbjct: 578  IHGNSYMSALFFALIILLVDGFPELSMTAQRLAVFYKQKQLCFYPAWAYAIPATVLKVPL 637

Query: 613  SLVQSLVWTSLTYYVIGYTPEVSRFFRHFLVLFAVHVSSLSMFRMIALVSQHVVA-LTLS 672
            S  +SLVWT L+YYVIGYTPE SRFF+ F++LFAVH +S+SMFR +A + Q VVA +T  
Sbjct: 638  SFFESLVWTCLSYYVIGYTPEASRFFKQFILLFAVHFTSISMFRCLAAIFQTVVASITAG 697

Query: 673  SFIILQIMIFGGFIITHPSMSPWLRWGFWLSPISYGEIGLSINEFLAPRWQKMQGTNSTI 732
            SF IL   +F GF+I  PSM  WL+WGFW +P+SYGEIGLS+NEFLAPRW +MQ  N T+
Sbjct: 698  SFGILFTFVFAGFVIPPPSMPAWLKWGFWANPLSYGEIGLSVNEFLAPRWNQMQPNNFTL 757

Query: 733  GHVILQSRGLDYHQYFYWISLAALFGFAIIFNVGFAFALTFLNGNSYCKDSSTFFNFILH 792
            G  ILQ+RG+DY+ Y YW+SL AL GF ++FN+ F  ALTFL                  
Sbjct: 758  GRTILQTRGMDYNGYMYWVSLCALLGFTVLFNIIFTLALTFL------------------ 817

Query: 793  SAPGSSPAIISYEKLSQSNCNGGANSVQNPPSSPKTSIESTK----GGIALPFTPLTVVF 852
             +P SS A+IS +KLS+    G   S ++     KT+    K      + LPF PLTV F
Sbjct: 818  KSPTSSRAMISQDKLSE--LQGTEKSTEDSSVRKKTTDSPVKTEEEDKMVLPFKPLTVTF 877

Query: 853  QHLQYYVDMPSGMRERGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLLDVLAGRK 912
            Q L Y+VDMP  MR++G+ QKKLQLLSDITGA RPGILTALMGVSGAGKTTLLDVLAGRK
Sbjct: 878  QDLNYFVDMPVEMRDQGYDQKKLQLLSDITGAFRPGILTALMGVSGAGKTTLLDVLAGRK 937

Query: 913  TSGYVEGEIKIGGFPKVQETFARISGYCEQTDIHSSQITVEESLIFSAWLRLAPNIDSKT 972
            TSGY+EG+I+I GFPKVQETFAR+SGYCEQTDIHS  ITVEES+I+SAWLRLAP ID+ T
Sbjct: 938  TSGYIEGDIRISGFPKVQETFARVSGYCEQTDIHSPNITVEESVIYSAWLRLAPEIDATT 997

Query: 973  KAQFVNEVLETIELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFMDEPTTGL 1032
            K +FV +VLETIELD IKDSLVG+ GVSGLSTEQRKRLTIAVELV+NPSIIFMDEPTTGL
Sbjct: 998  KTKFVKQVLETIELDEIKDSLVGVTGVSGLSTEQRKRLTIAVELVANPSIIFMDEPTTGL 1057

Query: 1033 DARAAAIVMRAVKNVADTGRTIVCTIHQPSIDIFESFDELILLKTGGHMIYYGPLGHDSS 1092
            DARAAAIVMRAVKNVADTGRTIVCTIHQPSIDIFE+FDEL+LLK GG MIY GPLG  S 
Sbjct: 1058 DARAAAIVMRAVKNVADTGRTIVCTIHQPSIDIFEAFDELVLLKRGGRMIYTGPLGQHSR 1117

Query: 1093 KVIEYFEHVPGVSKIRENYNPATWMLEVTSSAAEAKLGIDFAQVYKNSALYENNKELVKQ 1152
             +IEYFE VP + KI++N+NPATWML+V+S + E +LG+DFA++Y +SALY+ N ELVKQ
Sbjct: 1118 HIIEYFESVPEIPKIKDNHNPATWMLDVSSQSVEIELGVDFAKIYHDSALYKRNSELVKQ 1177

Query: 1153 LSVPPPGSRDLHFSNIFAQNFVRQFGACLWKQNLSYWRNPHYNLLRILHTVASSLIFGIL 1212
            LS P  GS D+ F   FAQ++  QF + LWK NLSYWR+P YNL+R++HT+ SSLIFG L
Sbjct: 1178 LSQPDSGSSDIQFKRTFAQSWWGQFKSILWKMNLSYWRSPSYNLMRMMHTLVSSLIFGAL 1237

Query: 1213 FWKHGKKLENQQNLFNNFGVMYSSVIFLGIYNCSSVFPNISRERTVMYRERFAGMYSSWA 1272
            FWK G+ L+ QQ++F  FG +Y  V+FLGI NC+S       ER VMYRERFAGMYS+ A
Sbjct: 1238 FWKQGQNLDTQQSMFTVFGAIYGLVLFLGINNCASALQYFETERNVMYRERFAGMYSATA 1297

Query: 1273 YSLAQVIIEVPYIFVQAAIYVIITYPMIGFYGSGWKIFWCFYSMFCALLYFKNLGLLLVT 1332
            Y+L QV+ E+PYIF+QAA +VI+TYPMIGFY S +K+FW  YSMFC+LL F  L + LV+
Sbjct: 1298 YALGQVVTEIPYIFIQAAEFVIVTYPMIGFYPSAYKVFWSLYSMFCSLLTFNYLAMFLVS 1357

Query: 1333 ITPNYHIATILSSAFYVMFNLFAGFLVPKPRIPSWWIWFYYMIPTSWTLNCLLTSQYGDI 1352
            ITPN+ +A IL S FYV FNLF+GFL+P+ ++P WWIW YY+ PTSWTLN  ++SQYGDI
Sbjct: 1358 ITPNFMVAAILQSLFYVGFNLFSGFLIPQTQVPGWWIWLYYLTPTSWTLNGFISSQYGDI 1417

BLAST of HG10019907 vs. ExPASy Swiss-Prot
Match: Q9ZUT8 (ABC transporter G family member 33 OS=Arabidopsis thaliana OX=3702 GN=ABCG33 PE=2 SV=1)

HSP 1 Score: 1648.3 bits (4267), Expect = 0.0e+00
Identity = 847/1439 (58.86%), Positives = 1041/1439 (72.34%), Query Frame = 0

Query: 5    VGSAERGRSSSIAEEDNDGDVEDASLWATIERLPTFERLRSSLFDINDEGKVEEKGRRVV 64
            +GS+ R  SS    ED   + E A  WA I+RLPTF+RLRSSL D   EG   EKG++VV
Sbjct: 1    MGSSFRSSSSRNEHEDGGDEAEHALQWAEIQRLPTFKRLRSSLVDKYGEG--TEKGKKVV 60

Query: 65   DVTKLRNEERHIFVHKLIKNVENDNLKLLTKVRDRIHKVGEKFPSVEVKYKNVHIEAECE 124
            DVTKL   ERH+ + KLIK++ENDNLKLL K+R R+ +VG +FPS+EV+Y+++ +EA CE
Sbjct: 61   DVTKLGAMERHLMIEKLIKHIENDNLKLLKKIRRRMERVGVEFPSIEVRYEHLGVEAACE 120

Query: 125  VVHGKAIPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSGVINPGRLTLLLGPPGCGK 184
            VV GKA+PTLWNSL+    D++K  GV++++A I I+ DVSG+I+PGRLTLLLGPPGCGK
Sbjct: 121  VVEGKALPTLWNSLKHVFLDLLKLSGVRTNEANIKILTDVSGIISPGRLTLLLGPPGCGK 180

Query: 185  TTLLKALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQITVRETLDFSA 244
            TTLLKALSGNL  +LK  GEI YNGH L E VPQKTSAY+SQHDLHI ++T RET+DFSA
Sbjct: 181  TTLLKALSGNLENNLKCYGEISYNGHGLNEVVPQKTSAYISQHDLHIAEMTTRETIDFSA 240

Query: 245  RCQGIGSRADIMKEVNKKEKEQGIIPNPDIDTYM------------------KILGLDIC 304
            RCQG+GSR DIM EV+K+EK+ GIIP+P+ID YM                  KILGLDIC
Sbjct: 241  RCQGVGSRTDIMMEVSKREKDGGIIPDPEIDAYMKAISVKGLKRSLQTDYILKILGLDIC 300

Query: 305  ADTLVGDAMRRGISGGQKKRLTT------------------------------------- 364
            A+TLVG+AM+RGISGGQKKRLTT                                     
Sbjct: 301  AETLVGNAMKRGISGGQKKRLTTAEMIVGPTKALFMDEITNGLDSSTAFQIIKSLQQVAH 360

Query: 365  -----------------------------------GRRDRVLEFFEHCGFKCPKRKSIAD 424
                                               G RD VL+FFE CGF+CP+RK +AD
Sbjct: 361  ITNATVFVSLLQPAPESYDLFDDIVLMAEGKIVYHGPRDDVLKFFEECGFQCPERKGVAD 420

Query: 425  FLQEVISKKDQPQFWYCKQTPYAYISIDTFSRKFKCWNNLGRKMEEETLKPFDEQEEYSK 484
            FLQEVISKKDQ Q+W  +  P++++S+DT S++FK    +GRK+EE   KP+D     SK
Sbjct: 421  FLQEVISKKDQGQYWLHQNLPHSFVSVDTLSKRFKDL-EIGRKIEEALSKPYD----ISK 480

Query: 485  NNGVFLN-GKNSVSKWQVSKACASREFLLMRRNSFVYVFKTSQLFFIASITMTVFIRTEM 544
             +   L+    S+ KW++ +AC SREFLLM+RN FVY+FKT QL   A ITMTVFIRT M
Sbjct: 481  THKDALSFNVYSLPKWELFRACISREFLLMKRNYFVYLFKTFQLVLAAIITMTVFIRTRM 540

Query: 545  KIDLQHGNYYMGALFYSLIILLVDALPELSLTIQRLQVVYKQKELLFYPAWAYVIPAAIL 604
             ID+ HGN YM  LF++ ++LLVD +PELS+T+QRL V YKQK+L FYPAWAY IPA +L
Sbjct: 541  DIDIIHGNSYMSCLFFATVVLLVDGIPELSMTVQRLSVFYKQKQLCFYPAWAYAIPATVL 600

Query: 605  KLPLSLVQSLVWTSLTYYVIGYTPEVSRFFRHFLVLFAVHVSSLSMFRMIALVSQ-HVVA 664
            K+PLS  +SLVWT LTYYVIGYTPE  RFFR F++LFAVH +S+SMFR IA + Q  V A
Sbjct: 601  KIPLSFFESLVWTCLTYYVIGYTPEPYRFFRQFMILFAVHFTSISMFRCIAAIFQTGVAA 660

Query: 665  LTLSSFIILQIMIFGGFIITHPSMSPWLRWGFWLSPISYGEIGLSINEFLAPRWQKMQGT 724
            +T  SF++L   +F GF I +  M  WL+WGFW++PISY EIGLS+NEFLAPRWQKMQ T
Sbjct: 661  MTAGSFVMLITFVFAGFAIPYTDMPGWLKWGFWVNPISYAEIGLSVNEFLAPRWQKMQPT 720

Query: 725  NSTIGHVILQSRGLDYHQYFYWISLAALFGFAIIFNVGFAFALTFLNGNSYCKDSSTFFN 784
            N T+G  IL+SRGL+Y  Y YW+SL+AL G  IIFN  F  AL+FL              
Sbjct: 721  NVTLGRTILESRGLNYDDYMYWVSLSALLGLTIIFNTIFTLALSFL-------------- 780

Query: 785  FILHSAPGSSPAIISYEKLSQSNCNGGANSVQNPPSSPKTSIESTKGGIALPFTPLTVVF 844
                 +P SS  +IS +KLS+      ++  +N P           G + LPF PLT+ F
Sbjct: 781  ----KSPTSSRPMISQDKLSELQGTKDSSVKKNKPLDSSIKTNEDPGKMILPFKPLTITF 840

Query: 845  QHLQYYVDMPSGMRERGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLLDVLAGRK 904
            Q L YYVD+P  M+ +G+ +KKLQLLS+ITGA RPG+LTALMG+SGAGKTTLLDVLAGRK
Sbjct: 841  QDLNYYVDVPVEMKGQGYNEKKLQLLSEITGAFRPGVLTALMGISGAGKTTLLDVLAGRK 900

Query: 905  TSGYVEGEIKIGGFPKVQETFARISGYCEQTDIHSSQITVEESLIFSAWLRLAPNIDSKT 964
            TSGY+EGEI+I GF KVQETFAR+SGYCEQTDIHS  ITVEESLI+SAWLRL P I+ +T
Sbjct: 901  TSGYIEGEIRISGFLKVQETFARVSGYCEQTDIHSPSITVEESLIYSAWLRLVPEINPQT 960

Query: 965  KAQFVNEVLETIELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFMDEPTTGL 1024
            K +FV +VLETIEL+ IKD+LVG+ GVSGLSTEQRKRLT+AVELV+NPSIIFMDEPTTGL
Sbjct: 961  KIRFVKQVLETIELEEIKDALVGVAGVSGLSTEQRKRLTVAVELVANPSIIFMDEPTTGL 1020

Query: 1025 DARAAAIVMRAVKNVADTGRTIVCTIHQPSIDIFESFDELILLKTGGHMIYYGPLGHDSS 1084
            DARAAAIVMRAVKNVA+TGRTIVCTIHQPSI IFE+FDEL+LLK GG MIY GPLG  SS
Sbjct: 1021 DARAAAIVMRAVKNVAETGRTIVCTIHQPSIHIFEAFDELVLLKRGGRMIYSGPLGQHSS 1080

Query: 1085 KVIEYFEHVPGVSKIRENYNPATWMLEVTSSAAEAKLGIDFAQVYKNSALYENNKELVKQ 1144
             VIEYF+++PGV+KIR+ YNPATWMLEVTS + E +L +DFA++Y  S LY+NN ELVK+
Sbjct: 1081 CVIEYFQNIPGVAKIRDKYNPATWMLEVTSESVETELDMDFAKIYNESDLYKNNSELVKE 1140

Query: 1145 LSVPPPGSRDLHFSNIFAQNFVRQFGACLWKQNLSYWRNPHYNLLRILHTVASSLIFGIL 1204
            LS P  GS DLHF   FAQN+  QF +CLWK +LSYWR+P YNL+RI HT  SS IFG+L
Sbjct: 1141 LSKPDHGSSDLHFKRTFAQNWWEQFKSCLWKMSLSYWRSPSYNLMRIGHTFISSFIFGLL 1200

Query: 1205 FWKHGKKLENQQNLFNNFGVMYSSVIFLGIYNCSSVFPNISRERTVMYRERFAGMYSSWA 1264
            FW  GKK++ QQNLF   G +Y  V+F+GI NC+S       ER VMYRERFAGMYS++A
Sbjct: 1201 FWNQGKKIDTQQNLFTVLGAIYGLVLFVGINNCTSALQYFETERNVMYRERFAGMYSAFA 1260

Query: 1265 YSLAQVIIEVPYIFVQAAIYVIITYPMIGFYGSGWKIFWCFYSMFCALLYFKNLGLLLVT 1324
            Y+LAQV+ E+PYIF+Q+A +VI+ YPMIGFY S  K+FW  Y+MFC LL F  L + L++
Sbjct: 1261 YALAQVVTEIPYIFIQSAEFVIVIYPMIGFYASFSKVFWSLYAMFCNLLCFNYLAMFLIS 1320

Query: 1325 ITPNYHIATILSSAFYVMFNLFAGFLVPKPRIPSWWIWFYYMIPTSWTLNCLLTSQYGDI 1352
            ITPN+ +A IL S F+  FN+FAGFL+PKP+IP WW+WFYY+ PTSWTLN   +SQYGDI
Sbjct: 1321 ITPNFMVAAILQSLFFTTFNIFAGFLIPKPQIPKWWVWFYYITPTSWTLNLFFSSQYGDI 1380

BLAST of HG10019907 vs. ExPASy Swiss-Prot
Match: Q7PC83 (ABC transporter G family member 41 OS=Arabidopsis thaliana OX=3702 GN=ABCG41 PE=2 SV=1)

HSP 1 Score: 1455.3 bits (3766), Expect = 0.0e+00
Identity = 759/1445 (52.53%), Positives = 997/1445 (69.00%), Query Frame = 0

Query: 1    MAQMVGSAERGRSSSIAEEDNDGDVEDASL---WATIERLPTFERLRSSLFDINDEGKVE 60
            MAQ     ++ +S  +     +G  ++  L   WAT+ERLPTF+R+ ++L    D+    
Sbjct: 1    MAQTGEDVDKAKSFQVEFACGNGVDDEEKLRSQWATVERLPTFKRVTTALLHTGDDSS-- 60

Query: 61   EKGRRVVDVTKLRNEERHIFVHKLIKNVENDNLKLLTKVRDRIHKVGEKFPSVEVKYKNV 120
                 ++DVTKL + ER + + KL+K +E DNL+LL K+R RI +VG + P+VEV++ ++
Sbjct: 61   ----DIIDVTKLEDAERRLLIEKLVKQIEADNLRLLRKIRKRIDEVGIELPTVEVRFNDL 120

Query: 121  HIEAECEVVHGKAIPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSGVINPGRLTLLL 180
             +EAEC+VVHGK IPTLWN+++  L   +  C  K  + KI I++ VSG++ PGR+TLLL
Sbjct: 121  SVEAECQVVHGKPIPTLWNTIKGSLSKFV--CSKK--ETKIGILKGVSGIVRPGRMTLLL 180

Query: 181  GPPGCGKTTLLKALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQITVR 240
            GPPGCGKTTLL+ALSG L+ S+K  G++ YNG  L EF+P+KTS+Y+SQ+DLHIP+++VR
Sbjct: 181  GPPGCGKTTLLQALSGRLSHSVKVGGKVSYNGCLLSEFIPEKTSSYISQNDLHIPELSVR 240

Query: 241  ETLDFSARCQGIGSRADIMKEVNKKEKEQGIIPNPDIDTYM------------------K 300
            ETLDFSA CQGIGSR +IMKE++++EK + I+P+PDID YM                  K
Sbjct: 241  ETLDFSACCQGIGSRMEIMKEISRREKLKEIVPDPDIDAYMKAISVEGLKNSMQTDYILK 300

Query: 301  ILGLDICADTLVGDAMRRGISGGQKKRLTTGR---------------------------- 360
            ILGLDICADT  GDA R GISGGQK+RLTTG                             
Sbjct: 301  ILGLDICADTRAGDATRPGISGGQKRRLTTGEIVVGPATTLLMDEISNGLDSSTTFQIVS 360

Query: 361  --------------------------------------------RDRVLEFFEHCGFKCP 420
                                                        R  + +FFE CGFKCP
Sbjct: 361  CLQQLAHIAGATILISLLQPAPETFELFDDVILLGEGKIIYHAPRADICKFFEGCGFKCP 420

Query: 421  KRKSIADFLQEVISKKDQPQFWYCKQTPYAYISIDTFSRKFKCWNNLGRKMEEETLKPFD 480
            +RK +ADFLQEV+S+KDQ Q+W  +  PY+YIS+D+F +KF   +NLG  ++EE  KPFD
Sbjct: 421  ERKGVADFLQEVMSRKDQEQYWCHRSKPYSYISVDSFIKKFN-ESNLGFLLKEELSKPFD 480

Query: 481  EQEEYSKNNGVFLNGKNSVSKWQVSKACASREFLLMRRNSFVYVFKTSQLFFIASITMTV 540
            + +   K++  F   K S+SKW++ KAC+ RE LLM+RNSF+Y+FK+  L F A +TMTV
Sbjct: 481  KSQT-RKDSLCF--RKYSLSKWEMLKACSRREILLMKRNSFIYLFKSGLLVFNALVTMTV 540

Query: 541  FIRTEMKIDLQHGNYYMGALFYSLIILLVDALPELSLTIQRLQVVYKQKELLFYPAWAYV 600
            F++     D +HGNY MG++F +L  LL D LPEL+LTI RL V  KQK+L FYPAWAY 
Sbjct: 541  FLQAGATRDARHGNYLMGSMFTALFRLLADGLPELTLTISRLGVFCKQKDLYFYPAWAYA 600

Query: 601  IPAAILKLPLSLVQSLVWTSLTYYVIGYTPEVSRFFRHFLVLFAVHVSSLSMFRMIALVS 660
            IP+ IL++PLS++ S +WT LTYYVIGY+PEV RFFRHF++L   H+S +SMFR IA + 
Sbjct: 601  IPSIILRIPLSVLDSFIWTVLTYYVIGYSPEVGRFFRHFIILLTFHLSCISMFRAIASIC 660

Query: 661  QHVVALTLSSFI-ILQIMIFGGFIITHPSMSPWLRWGFWLSPISYGEIGLSINEFLAPRW 720
            +  VA +++  I +L + +FGGF+I   SM  WL WGFWLSP+SY EIGL+ NEF +PRW
Sbjct: 661  RTFVACSITGAISVLLLALFGGFVIPKSSMPTWLGWGFWLSPLSYAEIGLTANEFFSPRW 720

Query: 721  QKMQGTNSTIGHVILQSRGLDYHQYFYWISLAALFGFAIIFNVGFAFALTFLNGNSYCKD 780
            +K+   N T G  +L  RGL++ ++ YW +  AL GF + FN  +  ALT+ N       
Sbjct: 721  RKLTSGNITAGEQVLDVRGLNFGRHSYWTAFGALVGFVLFFNALYTLALTYRNN------ 780

Query: 781  SSTFFNFILHSAPGSSPAIISYEKLSQSNCNGGANSVQNPPSSPKTSIESTKGGIALPFT 840
                        P  S AI+S+ K SQ        S ++    P+ +  +  G + LPF 
Sbjct: 781  ------------PQRSRAIVSHGKNSQC-------SEEDFKPCPEITSRAKTGKVILPFK 840

Query: 841  PLTVVFQHLQYYVDMPSGMRERGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLLD 900
            PLTV FQ++QYY++ P G        K  QLL DITGAL+PG+LT+LMGVSGAGKTTLLD
Sbjct: 841  PLTVTFQNVQYYIETPQG--------KTRQLLFDITGALKPGVLTSLMGVSGAGKTTLLD 900

Query: 901  VLAGRKTSGYVEGEIKIGGFPKVQETFARISGYCEQTDIHSSQITVEESLIFSAWLRLAP 960
            VL+GRKT G ++GEI++GG+PKVQETFAR+SGYCEQ DIHS  ITVEESL +SAWLRL  
Sbjct: 901  VLSGRKTRGIIKGEIRVGGYPKVQETFARVSGYCEQFDIHSPNITVEESLKYSAWLRLPY 960

Query: 961  NIDSKTKAQFVNEVLETIELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFMD 1020
            NID+KTK + V EVLET+EL+ IKDS+VG+PG+SGLSTEQRKRLTIAVELVSNPSIIF+D
Sbjct: 961  NIDAKTKNELVKEVLETVELEDIKDSMVGLPGISGLSTEQRKRLTIAVELVSNPSIIFLD 1020

Query: 1021 EPTTGLDARAAAIVMRAVKNVADTGRTIVCTIHQPSIDIFESFDELILLKTGGHMIYYGP 1080
            EPTTGLDARAAAIVMRAVKNVA+TGRT+VCTIHQPSIDIFE+FDELIL+K GG ++YYGP
Sbjct: 1021 EPTTGLDARAAAIVMRAVKNVAETGRTVVCTIHQPSIDIFETFDELILMKDGGQLVYYGP 1080

Query: 1081 LGHDSSKVIEYFEHVPGVSKIRENYNPATWMLEVTSSAAEAKLGIDFAQVYKNSALYENN 1140
            LG  SSKVI+YFE +PGV K+++N NPATWML++T  +AE +LG+DFAQ YK+S LY+ N
Sbjct: 1081 LGKHSSKVIKYFESIPGVPKVQKNCNPATWMLDITCKSAEHRLGMDFAQAYKDSTLYKEN 1140

Query: 1141 KELVKQLSVPPPGSRDLHFSNIFAQNFVRQFGACLWKQNLSYWRNPHYNLLRILHTVASS 1200
            K +V+QLS    GS  L F + ++Q    Q  ACLWKQ+ SYWRNP +NL RI+  + +S
Sbjct: 1141 KMVVEQLSSASLGSEALSFPSRYSQTGWGQLKACLWKQHCSYWRNPSHNLTRIVFILLNS 1200

Query: 1201 LIFGILFWKHGKKLENQQNLFNNFGVMYSSVIFLGIYNCSSVFPNISRERTVMYRERFAG 1260
            L+  +LFW+  K + NQQ+LF+ FG MY+ VIF GI NC++V   I+ ER V YRERFA 
Sbjct: 1201 LLCSLLFWQKAKDINNQQDLFSIFGSMYTIVIFSGINNCATVMNFIATERNVFYRERFAR 1260

Query: 1261 MYSSWAYSLAQVIIEVPYIFVQAAIYVIITYPMIGFYGSGWKIFWCFYSMFCALLYFKNL 1320
            MYSSWAYS +QV++EVPY  +Q+ +  II YPMIG++ S +K+FW  YS+FC+LL F   
Sbjct: 1261 MYSSWAYSFSQVLVEVPYSLLQSLLCTIIVYPMIGYHMSVYKMFWSLYSIFCSLLIFNYC 1320

Query: 1321 GLLLVTITPNYHIATILSSAFYVMFNLFAGFLVPKPRIPSWWIWFYYMIPTSWTLNCLLT 1352
            G+L+V +TPN H+A  L S F+ M NLFAGF++PK +IP WWIW YY+ PTSW L  LL+
Sbjct: 1321 GMLMVALTPNIHMALTLRSTFFSMVNLFAGFVMPKQKIPKWWIWMYYLSPTSWVLEGLLS 1380

BLAST of HG10019907 vs. ExPASy Swiss-Prot
Match: Q8GU83 (ABC transporter G family member 41 OS=Oryza sativa subsp. japonica OX=39947 GN=ABCG41 PE=3 SV=1)

HSP 1 Score: 1453.0 bits (3760), Expect = 0.0e+00
Identity = 758/1440 (52.64%), Positives = 988/1440 (68.61%), Query Frame = 0

Query: 13   SSSIA-EEDNDGDVEDASL-WATIERLPTFERLRSSLFDINDEGKVEEKGRRVVDVTKLR 72
            SSS+  +   D D E+A L WA IERLPT +R+R+S+                VDV +L 
Sbjct: 41   SSSLRWDHRGDDDEEEAELRWAAIERLPTLDRMRTSVL-----------SSEAVDVRRLG 100

Query: 73   NEERHIFVHKLIKNVENDNLKLLTKVRDRIHKVGEKFPSVEVKYKNVHIEAECEVVHGKA 132
              +R + V +L+ +++ DNL+LL K R R+ +VG + P+VEV+++NV +EA+C+VV GK 
Sbjct: 101  AAQRRVLVERLVADIQRDNLRLLRKQRRRMERVGVRQPTVEVRWRNVRVEADCQVVSGKP 160

Query: 133  IPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSGVINPGRLTLLLGPPGCGKTTLLKA 192
            +PTL N++ +    + +    + H A+I I+ DV+G++ P RLTLLLGPPGCGKTTLL A
Sbjct: 161  LPTLLNTVLATARGLSR----RPH-ARIPILNDVTGILKPSRLTLLLGPPGCGKTTLLLA 220

Query: 193  LSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQITVRETLDFSARCQGIG 252
            L+G L+K+LK +GE+ YNG  L  FVP+KTSAY+SQ+DLH+P++TVRETLDFSAR QG+G
Sbjct: 221  LAGKLDKNLKVTGEVEYNGANLNTFVPEKTSAYISQYDLHVPEMTVRETLDFSARFQGVG 280

Query: 253  SRADIMKEVNKKEKEQGIIPNPDIDTY------------------MKILGLDICADTLVG 312
            +RA+IMKEV ++EKE GI P+PDIDTY                  MKI+GLDICAD +VG
Sbjct: 281  TRAEIMKEVIRREKEAGITPDPDIDTYMKAISVEGLERSMQTDYIMKIMGLDICADIIVG 340

Query: 313  DAMRRGISGGQKKRLTTGR----------------------------------------- 372
            D MRRGISGG+KKRLTTG                                          
Sbjct: 341  DIMRRGISGGEKKRLTTGEMIVGPSRALFMDEISTGLDSSTTFQIVSCLQQVAHISESTI 400

Query: 373  -------------------------------RDRVLEFFEHCGFKCPKRKSIADFLQEVI 432
                                           +  ++ FFE CGFKCP+RK  ADFLQEV+
Sbjct: 401  LVSLLQPAPETYDLFDDIILMAEGKIVYHGSKSCIMNFFESCGFKCPERKGAADFLQEVL 460

Query: 433  SKKDQPQFWYCKQTPYAYISIDTFSRKFKCWNNLGRKMEEETLKPFDEQEEYSKNNGVFL 492
            SKKDQ Q+W   +  Y +++ID F  KFK  + +G+ + EE   PFD+ E Y  NN + L
Sbjct: 461  SKKDQQQYWSRTEETYNFVTIDHFCEKFKA-SQVGQNLVEELANPFDKSEVY--NNALSL 520

Query: 493  NGKNSVSKWQVSKACASREFLLMRRNSFVYVFKTSQLFFIASITMTVFIRTEMKIDLQHG 552
            N   S++KW + KAC +RE LLMRRN+F+Y+ K  QL  +A IT TVF+RT M +D  H 
Sbjct: 521  N-IYSLTKWDLLKACFAREILLMRRNAFIYITKVVQLGLLAVITGTVFLRTHMGVDRAHA 580

Query: 553  NYYMGALFYSLIILLVDALPELSLTIQRLQVVYKQKELLFYPAWAYVIPAAILKLPLSLV 612
            +YYMG+LFY+LI+LLV+  PEL++ + RL V YKQ++  FYPAWAY IP+ ILK+PLSLV
Sbjct: 581  DYYMGSLFYALILLLVNGFPELAIAVSRLPVFYKQRDYYFYPAWAYAIPSFILKIPLSLV 640

Query: 613  QSLVWTSLTYYVIGYTPEVSRFFRHFLVLFAVHVSSLSMFRMIALVSQHVVALTL-SSFI 672
            +S+ WTS++YY+IGYTPE SRFF   L+LF VH  +LS+FR +A   Q +VA ++  +  
Sbjct: 641  ESITWTSISYYLIGYTPEASRFFCQLLILFLVHTGALSLFRCVASYCQTMVASSVGGTMS 700

Query: 673  ILQIMIFGGFIITHPSMSPWLRWGFWLSPISYGEIGLSINEFLAPRWQKMQGTNSTIGHV 732
             L I++FGGFII   SM  WL+WGFW+SP+SY EIGL+ NEFLAPRW K   +  T+G  
Sbjct: 701  FLVILLFGGFIIPRLSMPNWLKWGFWISPLSYAEIGLTGNEFLAPRWLKTTTSGVTLGRR 760

Query: 733  ILQSRGLDYHQYFYWISLAALFGFAIIFNVGFAFALTFLNGNSYCKDSSTFFNFILHSAP 792
            +L  RGLD+  YFYWIS +AL GF ++ NVG+A  LT                  +    
Sbjct: 761  VLMDRGLDFSSYFYWISASALIGFILLLNVGYAIGLT------------------IKKPT 820

Query: 793  GSSPAIISYEKLSQSNCNGGANSVQNPPSSPKTSI-----ESTKGGIALPFTPLTVVFQH 852
            G+S AIIS +K S  +  G   S       PK  +      +  G + LPF+PLT+ FQ 
Sbjct: 821  GTSRAIISRDKFSTFDRRGKDMSKDMDNRMPKLQVGNALAPNKTGTMVLPFSPLTISFQD 880

Query: 853  LQYYVDMPSGMRERGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLLDVLAGRKTS 912
            + YYVD P  MRE+G+ ++KLQLL +ITGA +PG+L+ALMGV+GAGKTTLLDVLAGRKT 
Sbjct: 881  VNYYVDTPVEMREQGYKERKLQLLHNITGAFQPGVLSALMGVTGAGKTTLLDVLAGRKTG 940

Query: 913  GYVEGEIKIGGFPKVQETFARISGYCEQTDIHSSQITVEESLIFSAWLRLAPNIDSKTKA 972
            G +EG+I++GG+PK+Q+TFARISGYCEQTD+HS QITVEES+ +SAWLRL   +DSKT+ 
Sbjct: 941  GVIEGDIRVGGYPKIQQTFARISGYCEQTDVHSPQITVEESVAYSAWLRLPTEVDSKTRR 1000

Query: 973  QFVNEVLETIELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFMDEPTTGLDA 1032
            +FV+EV++TIELD I+D+LVG+PGVSGLSTEQRKRLTIAVELVSNPS+IFMDEPT+GLDA
Sbjct: 1001 EFVDEVIQTIELDDIRDALVGLPGVSGLSTEQRKRLTIAVELVSNPSVIFMDEPTSGLDA 1060

Query: 1033 RAAAIVMRAVKNVADTGRTIVCTIHQPSIDIFESFDELILLKTGGHMIYYGPLGHDSSKV 1092
            RAAAIVMRAVKNVADTGRT+VCTIHQPSI+IFE+FDEL+L+K GG +IY GPLG  S  V
Sbjct: 1061 RAAAIVMRAVKNVADTGRTVVCTIHQPSIEIFEAFDELMLMKRGGELIYAGPLGLHSCNV 1120

Query: 1093 IEYFEHVPGVSKIRENYNPATWMLEVTSSAAEAKLGIDFAQVYKNSALYENNKELVKQLS 1152
            I YFE +PGV KI++NYNP+TWMLEVT ++ EA+LG+DFAQ+Y+ S + ++   LVK LS
Sbjct: 1121 IHYFETIPGVPKIKDNYNPSTWMLEVTCASMEAQLGVDFAQIYRESTMCKDKDALVKSLS 1180

Query: 1153 VPPPGSRDLHFSNIFAQNFVRQFGACLWKQNLSYWRNPHYNLLRILHTVASSLIFGILFW 1212
             P  G+ DLHF   F Q F  Q  AC+WKQ LSYWR+P YNL+RIL    S ++FG+LFW
Sbjct: 1181 KPALGTSDLHFPTRFPQKFREQLKACIWKQCLSYWRSPSYNLVRILFITISCIVFGVLFW 1240

Query: 1213 KHG--KKLENQQNLFNNFGVMYSSVIFLGIYNCSSVFPNISRERTVMYRERFAGMYSSWA 1272
            + G    + +QQ LF   G MY + +F GI NC SV P IS ER+V+YRERFAGMYS WA
Sbjct: 1241 QQGDINHINDQQGLFTILGCMYGTTLFTGINNCQSVIPFISIERSVVYRERFAGMYSPWA 1300

Query: 1273 YSLAQVIIEVPYIFVQAAIYVIITYPMIGFYGSGWKIFWCFYSMFCALLYFKNLGLLLVT 1332
            YSLAQV +E+PY+ VQ  + + I YPMIG+  +  K FW  Y++ C LLYF   G+++V+
Sbjct: 1301 YSLAQVAMEIPYVLVQILLIMFIAYPMIGYAWTAAKFFWFMYTIACTLLYFLYFGMMIVS 1360

Query: 1333 ITPNYHIATILSSAFYVMFNLFAGFLVPKPRIPSWWIWFYYMIPTSWTLNCLLTSQYGD- 1352
            +TPN  +A+IL+S FY + NL +GF+VP P+IP WWIW YY  P SWTLN   T+Q+GD 
Sbjct: 1361 LTPNIQVASILASMFYTLQNLMSGFIVPAPQIPRWWIWLYYTSPLSWTLNVFFTTQFGDE 1420

BLAST of HG10019907 vs. ExPASy TrEMBL
Match: A0A1S3B7F0 (pleiotropic drug resistance protein 3-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103486827 PE=3 SV=1)

HSP 1 Score: 2289.6 bits (5932), Expect = 0.0e+00
Identity = 1180/1454 (81.16%), Positives = 1252/1454 (86.11%), Query Frame = 0

Query: 1    MAQMVGSAERGRSSSIAEEDNDGDVEDASLWATIERLPTFERLRSSLFDI-NDEGKVEEK 60
            MAQMV + + GRSSS  EED DGDVEDASLWA IERLPTFERLR SLFDI +DEG+V+EK
Sbjct: 1    MAQMV-ATQIGRSSSSIEEDCDGDVEDASLWAEIERLPTFERLRLSLFDISDDEGEVKEK 60

Query: 61   GRRVVDVTKLRNEERHIFVHKLIKNVENDNLKLLTKVRDRIHKVGEKFPSVEVKYKNVHI 120
             RRV DVTKL N+ER +F+ KLIKNV+ DNLKLLT+VRDRIHKVGEKFP+VEVKYKNVHI
Sbjct: 61   RRRVADVTKLSNKERRLFIEKLIKNVKKDNLKLLTEVRDRIHKVGEKFPTVEVKYKNVHI 120

Query: 121  EAECEVVHGKAIPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSGVINPGRLTLLLGP 180
            EAECEVVHGKAIPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSG+I PGRLTLLLGP
Sbjct: 121  EAECEVVHGKAIPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSGIIKPGRLTLLLGP 180

Query: 181  PGCGKTTLLKALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQITVRET 240
            PGCGKTTLLKALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQ+TVRET
Sbjct: 181  PGCGKTTLLKALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQMTVRET 240

Query: 241  LDFSARCQGIGSRADIMKEVNKKEKEQGIIPNPDIDTYM------------------KIL 300
            LDFSARCQGIGSRADIMKE+ KKEKEQGIIPNPDID YM                  KI 
Sbjct: 241  LDFSARCQGIGSRADIMKEIIKKEKEQGIIPNPDIDIYMKAISIEGLKHSLQTDYILKIF 300

Query: 301  GLDICADTLVGDAMRRGISGGQKKRLTT-------------------------------- 360
            GLD+C DTLVGDAMRRGISGGQKKRLTT                                
Sbjct: 301  GLDVCGDTLVGDAMRRGISGGQKKRLTTGEMMVGPNKALFMDEITNGLDSSTAFQIISCL 360

Query: 361  ----------------------------------------GRRDRVLEFFEHCGFKCPKR 420
                                                    GRRDRVL+FFEHCGFKCPKR
Sbjct: 361  QNLAHLTNATILISLLQPAPETFELFDDLILMAQKKIVYQGRRDRVLDFFEHCGFKCPKR 420

Query: 421  KSIADFLQEVISKKDQPQFWYCKQTPYAYISIDTFSRKFKCW---NNLGRKMEEETLKPF 480
            KS ADFLQEV+S+KDQPQFWY  QTPYAY+SIDT S KFK W   NNL RK EEE LKP+
Sbjct: 421  KSTADFLQEVLSRKDQPQFWYRNQTPYAYVSIDTLSTKFKHWNNNNNLERKAEEEILKPY 480

Query: 481  D----EQEEYSK-NNGVFLN----GKNSVSKWQVSKACASREFLLMRRNSFVYVFKTSQL 540
            D    E + YSK ++G+ LN       SVSKWQV KACASREFLLMRRNSFVYVFK SQL
Sbjct: 481  DNDDQEDQYYSKDDDGILLNIGKINNYSVSKWQVFKACASREFLLMRRNSFVYVFKISQL 540

Query: 541  FFIASITMTVFIRTEMKIDLQHGNYYMGALFYSLIILLVDALPELSLTIQRLQVVYKQKE 600
            F IASITMTVFIRTEMKID++HGNYYMGALFYSL++LLVDALPEL++TIQRL+V YKQK+
Sbjct: 541  FLIASITMTVFIRTEMKIDVEHGNYYMGALFYSLLMLLVDALPELAMTIQRLEVFYKQKQ 600

Query: 601  LLFYPAWAYVIPAAILKLPLSLVQSLVWTSLTYYVIGYTPEVSRFFRHFLVLFAVHVSSL 660
            LLFYP WAYVIP AI KLPLSL+QS VWTSLTYYVIGYTPEVSRFFRHFLVLFA+HVSSL
Sbjct: 601  LLFYPPWAYVIPPAIFKLPLSLLQSFVWTSLTYYVIGYTPEVSRFFRHFLVLFALHVSSL 660

Query: 661  SMFRMIALVSQHVVALTLSSFIILQIMIFGGFIITHPSMSPWLRWGFWLSPISYGEIGLS 720
            SMFRM+ALV+Q +VA T+SSF+ILQIMIFGGFII+H SMS WLRWGFW+SPISYGEIGLS
Sbjct: 661  SMFRMMALVNQQIVASTVSSFVILQIMIFGGFIISHSSMSAWLRWGFWVSPISYGEIGLS 720

Query: 721  INEFLAPRWQKMQGTNSTIGHVILQSRGLDYHQYFYWISLAALFGFAIIFNVGFAFALTF 780
            INEFLAPRWQK+QG+N TIGH+ILQSRGLDYHQYFYWISLAALFGFA++FN GFA ALTF
Sbjct: 721  INEFLAPRWQKIQGSNVTIGHIILQSRGLDYHQYFYWISLAALFGFALLFNFGFALALTF 780

Query: 781  LNGNSYCKDSSTFFNFILHSAPGSSPAIISYEKLSQSNCNGGANSVQNPPSSPKTSIEST 840
            LN                   PGSS AIISYEKLSQSN N  ANS Q+P SSPKTSIEST
Sbjct: 781  LN------------------PPGSSTAIISYEKLSQSNINADANSAQSPLSSPKTSIEST 840

Query: 841  KGGIALPFTPLTVVFQHLQYYVDMPSGMRERGFTQKKLQLLSDITGALRPGILTALMGVS 900
            KGGIALPF PLTVVF+ LQYYVDMPSGMRERGFTQKKLQLLSDITGA+RPGILTALMGVS
Sbjct: 841  KGGIALPFRPLTVVFRDLQYYVDMPSGMRERGFTQKKLQLLSDITGAIRPGILTALMGVS 900

Query: 901  GAGKTTLLDVLAGRKTSGYVEGEIKIGGFPKVQETFARISGYCEQTDIHSSQITVEESLI 960
            GAGKTTLLDVLAGRKTSGY+EGEIKIGGFPKVQETFAR+SGYCEQTD+HSSQITVEESL 
Sbjct: 901  GAGKTTLLDVLAGRKTSGYIEGEIKIGGFPKVQETFARVSGYCEQTDVHSSQITVEESLF 960

Query: 961  FSAWLRLAPNIDSKTKAQFVNEVLETIELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELV 1020
            FSAWLRLAP IDSKTKAQFVNEVLE IELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELV
Sbjct: 961  FSAWLRLAPEIDSKTKAQFVNEVLEIIELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELV 1020

Query: 1021 SNPSIIFMDEPTTGLDARAAAIVMRAVKNVADTGRTIVCTIHQPSIDIFESFDELILLKT 1080
            SNPSIIFMDEPTTGLDARAAAIVMRAVKNVADTGRTIVCTIHQPSI+IFESFDELILLKT
Sbjct: 1021 SNPSIIFMDEPTTGLDARAAAIVMRAVKNVADTGRTIVCTIHQPSINIFESFDELILLKT 1080

Query: 1081 GGHMIYYGPLGHDSSKVIEYFEHVPGVSKIRENYNPATWMLEVTSSAAEAKLGIDFAQVY 1140
            GG MIYYGPLG DS++VIEYFEHVPGVS+IRENYNPATW+LE+TSSAAEAKLGIDFA VY
Sbjct: 1081 GGRMIYYGPLGQDSNQVIEYFEHVPGVSRIRENYNPATWILEITSSAAEAKLGIDFALVY 1140

Query: 1141 KNSALYENNKELVKQLSVPPPGSRDLHFSNIFAQNFVRQFGACLWKQNLSYWRNPHYNLL 1200
            KNS+LYENNKELVKQLS P PGSRDLHFSN+FAQNF RQFGACLWKQNLSYWRNP YNLL
Sbjct: 1141 KNSSLYENNKELVKQLSAPSPGSRDLHFSNVFAQNFARQFGACLWKQNLSYWRNPRYNLL 1200

Query: 1201 RILHTVASSLIFGILFWKHGKKLENQQNLFNNFGVMYSSVIFLGIYNCSSVFPNISRERT 1260
            RILHTVASSLIFG+LFWK GKKLENQQ+LFNNFGVMYSSV+F+GIYNCSSVFPN+SRERT
Sbjct: 1201 RILHTVASSLIFGVLFWKKGKKLENQQDLFNNFGVMYSSVVFMGIYNCSSVFPNVSRERT 1260

Query: 1261 VMYRERFAGMYSSWAYSLAQVIIEVPYIFVQAAIYVIITYPMIGFYGSGWKIFWCFYSMF 1320
            VMYRERFAGMYS WAYSLAQVIIEVPY+FVQAAIYVIITYPMIGFYGS WKIFWCFYSMF
Sbjct: 1261 VMYRERFAGMYSPWAYSLAQVIIEVPYVFVQAAIYVIITYPMIGFYGSAWKIFWCFYSMF 1320

Query: 1321 CALLYFKNLGLLLVTITPNYHIATILSSAFYVMFNLFAGFLVPKPRIPSWWIWFYYMIPT 1352
             ALLYFK+LGLLLV+ITPNYHIATILSSAFYV FNLFAGFLVPKPRIP WWIWFYY+ PT
Sbjct: 1321 FALLYFKSLGLLLVSITPNYHIATILSSAFYVTFNLFAGFLVPKPRIPRWWIWFYYITPT 1380

BLAST of HG10019907 vs. ExPASy TrEMBL
Match: A0A0A0LEL9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G813820 PE=3 SV=1)

HSP 1 Score: 2283.1 bits (5915), Expect = 0.0e+00
Identity = 1174/1446 (81.19%), Positives = 1245/1446 (86.10%), Query Frame = 0

Query: 11   GRSSSIAEEDNDG-DVEDASLWATIERLPTFERLRSSLFDI-NDEGKVEEKGRRVVDVTK 70
            GRSSS AEED +G DVEDASLWA IERLPTF++LRSSLFDI ND+G+V++K RRVVDVTK
Sbjct: 2    GRSSSSAEEDGNGSDVEDASLWAEIERLPTFKQLRSSLFDITNDKGEVKKKRRRVVDVTK 61

Query: 71   LRNEERHIFVHKLIKNVENDNLKLLTKVRDRIHKVGEKFPSVEVKYKNVHIEAECEVVHG 130
            L NEER +F+ KLIKN+E+DN+KLLTKVRDRIH+VGEKFP+VEVKYKNVHIE ECEVVHG
Sbjct: 62   LSNEERGLFIKKLIKNIEDDNVKLLTKVRDRIHRVGEKFPTVEVKYKNVHIEVECEVVHG 121

Query: 131  KAIPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSGVINPGRLTLLLGPPGCGKTTLL 190
            KAIPTLWNSLQSKLY+IIKFCGVKS+KAKIDIIEDVSG+I PGRLTLLLGPPGCGKTTLL
Sbjct: 122  KAIPTLWNSLQSKLYEIIKFCGVKSNKAKIDIIEDVSGIIKPGRLTLLLGPPGCGKTTLL 181

Query: 191  KALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQITVRETLDFSARCQG 250
            KALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYV QHDLHIPQ+TVRETLDFSARCQG
Sbjct: 182  KALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVGQHDLHIPQMTVRETLDFSARCQG 241

Query: 251  IGSRADIMKEVNKKEKEQGIIPNPDIDTYMK------------------ILGLDICADTL 310
            IGSRADIMKE+ KKEKEQGIIPN DID YMK                  I GLDIC DTL
Sbjct: 242  IGSRADIMKEIIKKEKEQGIIPNTDIDIYMKAISIEGLKQSLQTDYILNIFGLDICGDTL 301

Query: 311  VGDAMRRGISGGQKKRLTT----------------------------------------- 370
            VGDAMRRGISGGQKKRLTT                                         
Sbjct: 302  VGDAMRRGISGGQKKRLTTGEMMVGPNKALFMDEITNGLDSSTAFQIISCLQNLSHLTNA 361

Query: 371  -------------------------------GRRDRVLEFFEHCGFKCPKRKSIADFLQE 430
                                           GRRD+VL FFEHCGFKCPKRKSIADFLQE
Sbjct: 362  TILISLLQPAPETFELFDDLILMAQKKIVYQGRRDQVLNFFEHCGFKCPKRKSIADFLQE 421

Query: 431  VISKKDQPQFWYCKQTPYAYISIDTFSRKFKCWNNLG---RKMEEETLKPFD---EQEEY 490
            V+S+KDQPQFWY  QTPY Y+SIDT SRKFKCWNN     RK+E E LKPFD   E + Y
Sbjct: 422  VLSRKDQPQFWYRNQTPYTYVSIDTLSRKFKCWNNNNNNERKVEGENLKPFDNDREDQYY 481

Query: 491  SKN-NGVFLNGKN------SVSKWQVSKACASREFLLMRRNSFVYVFKTSQLFFIASITM 550
            SKN +G+ LN         SVSKW+V KACASREFLLMRRNSFVYVFK SQLF IASITM
Sbjct: 482  SKNDDGILLNNTGQKINNYSVSKWEVFKACASREFLLMRRNSFVYVFKISQLFLIASITM 541

Query: 551  TVFIRTEMKIDLQHGNYYMGALFYSLIILLVDALPELSLTIQRLQVVYKQKELLFYPAWA 610
            TVFIRTEMK D++HGNYYMGALFYSL +LLVDALPEL++TI RL+V YKQK+LLFYP WA
Sbjct: 542  TVFIRTEMKTDVEHGNYYMGALFYSLNMLLVDALPELAMTIHRLEVFYKQKQLLFYPPWA 601

Query: 611  YVIPAAILKLPLSLVQSLVWTSLTYYVIGYTPEVSRFFRHFLVLFAVHVSSLSMFRMIAL 670
            YVIP AILKLPLS +QS +WTSLTYYVIGYTPEVSRFFRHFLVLFA+HVSS+SMFRM+AL
Sbjct: 602  YVIPPAILKLPLSFLQSFLWTSLTYYVIGYTPEVSRFFRHFLVLFALHVSSVSMFRMMAL 661

Query: 671  VSQHVVALTLSSFIILQIMIFGGFIITHPSMSPWLRWGFWLSPISYGEIGLSINEFLAPR 730
            V+QH+VA TLSSF+ILQ MIFGGFII+HPSMS WLRWGFW+SPISYGEIGLSINEFLAPR
Sbjct: 662  VNQHIVASTLSSFVILQTMIFGGFIISHPSMSAWLRWGFWVSPISYGEIGLSINEFLAPR 721

Query: 731  WQKMQGTNSTIGHVILQSRGLDYHQYFYWISLAALFGFAIIFNVGFAFALTFLNGNSYCK 790
            WQK+QG+N TIGH+ILQSRGLDYHQYFYWISLAALFGFA+IFN GFA ALTFLN      
Sbjct: 722  WQKIQGSNVTIGHIILQSRGLDYHQYFYWISLAALFGFALIFNFGFALALTFLN------ 781

Query: 791  DSSTFFNFILHSAPGSSPAIISYEKLSQSNCNGGANSVQNPPSSPKTSIESTKGGIALPF 850
                         PGSS AIISYEKLSQSN N  ANS QNP SSPKTSIESTKGGIALPF
Sbjct: 782  ------------PPGSSTAIISYEKLSQSNINADANSAQNPLSSPKTSIESTKGGIALPF 841

Query: 851  TPLTVVFQHLQYYVDMPSGMRERGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLL 910
             PLTVVF+ LQYYVDMPSGMRERGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLL
Sbjct: 842  RPLTVVFRDLQYYVDMPSGMRERGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLL 901

Query: 911  DVLAGRKTSGYVEGEIKIGGFPKVQETFARISGYCEQTDIHSSQITVEESLIFSAWLRLA 970
            DV+AGRKTSGY+EGEIKIGGFPKVQETFARISGYCEQTD+HSSQITVEESL FSAWLRLA
Sbjct: 902  DVVAGRKTSGYIEGEIKIGGFPKVQETFARISGYCEQTDVHSSQITVEESLFFSAWLRLA 961

Query: 971  PNIDSKTKAQFVNEVLETIELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFM 1030
            P IDSKTKAQFVNEVLETIELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFM
Sbjct: 962  PEIDSKTKAQFVNEVLETIELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFM 1021

Query: 1031 DEPTTGLDARAAAIVMRAVKNVADTGRTIVCTIHQPSIDIFESFDELILLKTGGHMIYYG 1090
            DEPTTGLDARAAAIVMRAVKNVADTGRTIVCTIHQPSIDIFESFDELILLKTGG MIYYG
Sbjct: 1022 DEPTTGLDARAAAIVMRAVKNVADTGRTIVCTIHQPSIDIFESFDELILLKTGGRMIYYG 1081

Query: 1091 PLGHDSSKVIEYFEHVPGVSKIRENYNPATWMLEVTSSAAEAKLGIDFAQVYKNSALYEN 1150
            PLG DS+KVIEYFEHVPGVS+IRENYNPATW+LE+TSS AEAKLGIDFAQVYKNS+LYEN
Sbjct: 1082 PLGRDSNKVIEYFEHVPGVSRIRENYNPATWILEITSSGAEAKLGIDFAQVYKNSSLYEN 1141

Query: 1151 NKELVKQLSVPPPGSRDLHFSNIFAQNFVRQFGACLWKQNLSYWRNPHYNLLRILHTVAS 1210
            NKELVKQLS PPPGSRDL FSN+FAQNF RQFGACLWKQNLSYWRNP YNLLRILHTVAS
Sbjct: 1142 NKELVKQLSAPPPGSRDLQFSNVFAQNFARQFGACLWKQNLSYWRNPRYNLLRILHTVAS 1201

Query: 1211 SLIFGILFWKHGKKLENQQNLFNNFGVMYSSVIFLGIYNCSSVFPNISRERTVMYRERFA 1270
            SLIFG+LFWK GKKLENQQ+LFNNFGVM++SV+F+GIYNCSSVFPN+SRERTVMYRERFA
Sbjct: 1202 SLIFGVLFWKKGKKLENQQDLFNNFGVMFASVVFIGIYNCSSVFPNVSRERTVMYRERFA 1261

Query: 1271 GMYSSWAYSLAQVIIEVPYIFVQAAIYVIITYPMIGFYGSGWKIFWCFYSMFCALLYFKN 1330
            GMYSSWAYSLAQVIIEVPY+FVQAAIYVIITYPMIGFYGS WKIFWCFYSMF ALLYFKN
Sbjct: 1262 GMYSSWAYSLAQVIIEVPYVFVQAAIYVIITYPMIGFYGSAWKIFWCFYSMFFALLYFKN 1321

Query: 1331 LGLLLVTITPNYHIATILSSAFYVMFNLFAGFLVPKPRIPSWWIWFYYMIPTSWTLNCLL 1352
            LGLLLV+ITPNYHIATIL+SAFYV FNLFAGFLVPKPRIP WWIWFYYM PTSWTLNCLL
Sbjct: 1322 LGLLLVSITPNYHIATILASAFYVTFNLFAGFLVPKPRIPRWWIWFYYMSPTSWTLNCLL 1381

BLAST of HG10019907 vs. ExPASy TrEMBL
Match: A0A6J1DJR4 (pleiotropic drug resistance protein 3-like isoform X2 OS=Momordica charantia OX=3673 GN=LOC111021539 PE=3 SV=1)

HSP 1 Score: 2113.6 bits (5475), Expect = 0.0e+00
Identity = 1083/1424 (76.05%), Positives = 1203/1424 (84.48%), Query Frame = 0

Query: 8    AERGRSSSIAEEDND-GDVEDASLWATIERLPTFERLRSSLF-DINDEGKVEEKGRRVVD 67
            AE  RS   +  DND  DVEDASLWA IERLPTFER+RSS+F DI+  G+V+EKGRRVVD
Sbjct: 19   AEIRRSLKSSSSDNDSNDVEDASLWAAIERLPTFERVRSSVFDDISSRGEVKEKGRRVVD 78

Query: 68   VTKLRNEERHIFVHKLIKNVENDNLKLLTKVRDRIHKVGEKFPSVEVKYKNVHIEAECEV 127
            VTKL ++ERH+FVHKLIK+VE+DNLKLL KVR+RI +VG  FPSVEVKYKNVHIEAECEV
Sbjct: 79   VTKLDDQERHLFVHKLIKHVESDNLKLLRKVRERIDRVGVTFPSVEVKYKNVHIEAECEV 138

Query: 128  VHGKAIPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSGVINPGRLTLLLGPPGCGKT 187
            VHGKAIPTLWNSL++KLYDIIKFCG KSH+AKIDIIED SGVI PGRLTLLLGPPGCGKT
Sbjct: 139  VHGKAIPTLWNSLRTKLYDIIKFCGAKSHEAKIDIIEDASGVIKPGRLTLLLGPPGCGKT 198

Query: 188  TLLKALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQITVRETLDFSAR 247
            TLLKALSGNL+KSLK SGEICYNGHKLEEFVPQKTSAY+SQ++LHI Q+TVRETLDFS R
Sbjct: 199  TLLKALSGNLDKSLKMSGEICYNGHKLEEFVPQKTSAYISQNELHIAQMTVRETLDFSTR 258

Query: 248  CQGIGSRADIMKEVNKKEKEQGIIPNPDIDTYMKILGLDICADTLVGDAMRRGISGGQKK 307
            CQGIGSRAD+MKE+ K+EKEQGIIP+ D+DTYMKILGLDICA+TL GDAMRRGISGGQKK
Sbjct: 259  CQGIGSRADLMKEIIKREKEQGIIPDSDVDTYMKILGLDICAETLTGDAMRRGISGGQKK 318

Query: 308  RLTTGR------------------------------------------------------ 367
            RLT G                                                       
Sbjct: 319  RLTIGEMIVGPKRVLLMDEITNGLDSSTAFQIVSCLQHLAHFTDATLLVSLLQPAPETFD 378

Query: 368  ------------------RDRVLEFFEHCGFKCPKRKSIADFLQEVISKKDQPQFWY-CK 427
                              RD+VL+FFE+CGFKCP+RK++ADFLQEV+SKKDQPQ+WY   
Sbjct: 379  LFDDLILMVQKKIIYHGPRDQVLQFFENCGFKCPERKNVADFLQEVVSKKDQPQYWYRHD 438

Query: 428  QTPYAYISIDTFSRKFKCWNNLGRKMEEETLKPFDEQEEYSKNNGVFLNGKNSVSKWQVS 487
            +  Y Y+S +TF R FK  ++LGRK++EE  +P+D++ + SK N     G +SVSKWQV 
Sbjct: 439  EARYTYVSNNTFCRMFKS-SSLGRKLDEEVSQPYDDKTK-SKRNVSSSLGVDSVSKWQVF 498

Query: 488  KACASREFLLMRRNSFVYVFKTSQLFFIASITMTVFIRTEMKIDLQHGNYYMGALFYSLI 547
            KACASREFLLM+RNSFVYVFKTSQLFF+ASI MTVF+R++MK+DLQH NYYMGALFY LI
Sbjct: 499  KACASREFLLMKRNSFVYVFKTSQLFFLASIAMTVFLRSQMKVDLQHANYYMGALFYGLI 558

Query: 548  ILLVDALPELSLTIQRLQVVYKQKELLFYPAWAYVIPAAILKLPLSLVQSLVWTSLTYYV 607
            +L+ +A+PEL+LT+QRL+V YKQKEL FYPAWAY IPAAILK+P SLVQ+LVWTSLTYYV
Sbjct: 559  MLVFNAVPELALTVQRLEVFYKQKELKFYPAWAYAIPAAILKIPFSLVQALVWTSLTYYV 618

Query: 608  IGYTPEVSRFFRHFLVLFAVHVSSLSMFRMIALVSQHV-VALTLSSFIILQIMIFGGFII 667
            IGYTPE SRFFRHFLVLFAV++ SLSMFR++A V + + VA ++SSF +L I+ F GFII
Sbjct: 619  IGYTPEFSRFFRHFLVLFAVNILSLSMFRLLASVIRSIDVAPSISSFTLLLILTFAGFII 678

Query: 668  THPSMSPWLRWGFWLSPISYGEIGLSINEFLAPRWQKMQGTNSTIGHVILQSRGLDYHQY 727
            TH SM  W+ WGFW+SPISYGEIGLSINEFLAPRWQK Q TN+TIGH+ILQSRGLD+HQY
Sbjct: 679  THTSMPAWMEWGFWVSPISYGEIGLSINEFLAPRWQKRQSTNTTIGHIILQSRGLDFHQY 738

Query: 728  FYWISLAALFGFAIIFNVGFAFALTFLNGNSYCKDSSTFFNFILHSAPGSSPAIISYEKL 787
            FYWISL ALFGFA++FNVGF  ALTFLN                  +PGSS AIISYEKL
Sbjct: 739  FYWISLGALFGFAVLFNVGFTLALTFLN------------------SPGSSRAIISYEKL 798

Query: 788  ----SQSNCNGGANSVQNPPSSPKTSIESTKGGIALPFTPLTVVFQHLQYYVDMPSGMRE 847
                S  +CNGGANSV+   +SPK +IES+KG IALPFTPLTVVF+ L YYVDMP  MRE
Sbjct: 799  GRAKSSEDCNGGANSVEQQAASPKAAIESSKGRIALPFTPLTVVFRDLHYYVDMPVAMRE 858

Query: 848  RGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLLDVLAGRKTSGYVEGEIKIGGFP 907
            RGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLLDVLAGRKTSGY+EGEIKIGGFP
Sbjct: 859  RGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLLDVLAGRKTSGYIEGEIKIGGFP 918

Query: 908  KVQETFARISGYCEQTDIHSSQITVEESLIFSAWLRLAPNIDSKTKAQFVNEVLETIELD 967
            KVQETFARISGYCEQTDIHSSQITVEESLIFSAWLRLA NIDSKTK QFVNEVLETIELD
Sbjct: 919  KVQETFARISGYCEQTDIHSSQITVEESLIFSAWLRLASNIDSKTKEQFVNEVLETIELD 978

Query: 968  SIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFMDEPTTGLDARAAAIVMRAVKNV 1027
            SIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFMDEPTTGLDARAAAIVMRAVKNV
Sbjct: 979  SIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFMDEPTTGLDARAAAIVMRAVKNV 1038

Query: 1028 ADTGRTIVCTIHQPSIDIFESFDELILLKTGGHMIYYGPLGHDSSKVIEYFEHVPGVSKI 1087
             DTGRTIVCTIHQPSIDIFESFDELILLKTGGH+IYYGPLG  SSKVIE+FE VPGVS I
Sbjct: 1039 VDTGRTIVCTIHQPSIDIFESFDELILLKTGGHVIYYGPLGRHSSKVIEFFEQVPGVSMI 1098

Query: 1088 RENYNPATWMLEVTSSAAEAKLGIDFAQVYKNSALYENNKELVKQLSVPPPGSRDLHFSN 1147
            REN+NPATWMLEVTSSAAEAKLGIDFAQVYKNSALY+NNKE+VKQLS PPPGSRDLHFSN
Sbjct: 1099 RENHNPATWMLEVTSSAAEAKLGIDFAQVYKNSALYKNNKEIVKQLSTPPPGSRDLHFSN 1158

Query: 1148 IFAQNFVRQFGACLWKQNLSYWRNPHYNLLRILHTVASSLIFGILFWKHGKKLENQQNLF 1207
            +FAQ+F  QF ACLWKQNLSYWRNP YNL RIL+T+ASSL+FG LFWKHGKKLENQQNLF
Sbjct: 1159 VFAQSFAGQFKACLWKQNLSYWRNPCYNLTRILYTIASSLVFGTLFWKHGKKLENQQNLF 1218

Query: 1208 NNFGVMYSSVIFLGIYNCSSVFPNISRERTVMYRERFAGMYSSWAYSLAQVIIEVPYIFV 1267
            NNFG MYSSV F+GI+NC++VFPN+SRERTVMYRERFAGMYSSWAYSLAQVIIEVPYIFV
Sbjct: 1219 NNFGSMYSSVNFIGIHNCATVFPNVSRERTVMYRERFAGMYSSWAYSLAQVIIEVPYIFV 1278

Query: 1268 QAAIYVIITYPMIGFYGSGWKIFWCFYSMFCALLYFKNLGLLLVTITPNYHIATILSSAF 1327
            QAA +VIITYPMIG+YGS  K+FWCFYSMFCALLYF  LG+LL+++TPN+HIA+IL+SAF
Sbjct: 1279 QAAAFVIITYPMIGYYGSSSKVFWCFYSMFCALLYFNYLGMLLISVTPNFHIASILASAF 1338

Query: 1328 YVMFNLFAGFLVPKPRIPSWWIWFYYMIPTSWTLNCLLTSQYGDIDKAMVVFGERTTTVS 1352
            Y  FNLFAGFLVPKPRIP WWIWFYYM PTSWTLNCLLTSQYGDI+K ++VFGER  TVS
Sbjct: 1339 YSTFNLFAGFLVPKPRIPRWWIWFYYMSPTSWTLNCLLTSQYGDINKTLMVFGER-RTVS 1398

BLAST of HG10019907 vs. ExPASy TrEMBL
Match: A0A5A7TNN3 (Pleiotropic drug resistance protein 3-like isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold236G00010 PE=3 SV=1)

HSP 1 Score: 2112.0 bits (5471), Expect = 0.0e+00
Identity = 1103/1379 (79.99%), Positives = 1171/1379 (84.92%), Query Frame = 0

Query: 1    MAQMVGSAERGRSSSIAEEDNDGDVEDASLWATIERLPTFERLRSSLFDI-NDEGKVEEK 60
            MAQMV + + GRSSS  EED DGDVEDASLWA IERLPTFERLR SLFDI +DEG+V+EK
Sbjct: 1    MAQMV-ATQIGRSSSSIEEDCDGDVEDASLWAEIERLPTFERLRLSLFDISDDEGEVKEK 60

Query: 61   GRRVVDVTKLRNEERHIFVHKLIKNVENDNLKLLTKVRDRIHKVGEKFPSVEVKYKNVHI 120
             RRV DVTKL N+ER +F+ KLIKNV+ DNLKLLT+VRDRIHKVGEKFP+VEVKYKNVHI
Sbjct: 61   RRRVADVTKLSNKERRLFIEKLIKNVKKDNLKLLTEVRDRIHKVGEKFPTVEVKYKNVHI 120

Query: 121  EAECEVVHGKAIPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSGVINPGRLTLLLGP 180
            EAECEVVHGKAIPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSG+I PGRLTLLLGP
Sbjct: 121  EAECEVVHGKAIPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSGIIKPGRLTLLLGP 180

Query: 181  PGCGKTTLLKALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQITVRET 240
            PGCGKTTLLKALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQ+TVRET
Sbjct: 181  PGCGKTTLLKALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQMTVRET 240

Query: 241  LDFSARCQGIGSRADIMKEVNKKEKEQGIIPNPDIDTYM------------------KIL 300
            LDFSARCQGIGSRADIMKE+ KKEKEQGIIPNPDID YM                  KI 
Sbjct: 241  LDFSARCQGIGSRADIMKEIIKKEKEQGIIPNPDIDIYMKAISIEGLKHSLQTDYILKIF 300

Query: 301  GLDICADTLVGDAMRRGISGGQKKRLTT-------------------------------- 360
            GLD+C DTLVGDAMRRGISGGQKKRLTT                                
Sbjct: 301  GLDVCGDTLVGDAMRRGISGGQKKRLTTGEMMVGPNKALFMDEITNGLDSSTAFQIISCL 360

Query: 361  ----------------------------------------GRRDRVLEFFEHCGFKCPKR 420
                                                    GRRDRVL+FFEHCGFKCPKR
Sbjct: 361  QNLAHLTNATILISLLQPAPETFELFDDLILMAQKKIVYQGRRDRVLDFFEHCGFKCPKR 420

Query: 421  KSIADFLQEVISKKDQPQFWYCKQTPYAYISIDTFSRKFKCW---NNLGRKMEEETLKPF 480
            KS ADFLQEV+S+KDQPQFWY  QTPYAY+SIDT S KFK W   NNL RK EEE LKP+
Sbjct: 421  KSTADFLQEVLSRKDQPQFWYRNQTPYAYVSIDTLSTKFKHWNNNNNLERKAEEEILKPY 480

Query: 481  D----EQEEYSK-NNGVFLN----GKNSVSKWQVSKACASREFLLMRRNSFVYVFKTS-Q 540
            D    E + YSK ++G+ LN       SVSKWQV KACASREFLLMRRNSFVYVFK S Q
Sbjct: 481  DNDDQEDQYYSKDDDGILLNIGKINNYSVSKWQVFKACASREFLLMRRNSFVYVFKISQQ 540

Query: 541  LFFIASITMTVFIRTEMKIDLQHGNYYMGALFYSLIILLVDALPELSLTIQRLQVVYKQK 600
            LF IASITMTVFIRTEMKID++HGNYYMGALFYSL++LLVDALPEL++TIQRL+V YKQK
Sbjct: 541  LFLIASITMTVFIRTEMKIDVEHGNYYMGALFYSLLMLLVDALPELAMTIQRLEVFYKQK 600

Query: 601  ELLFYPAWAYVIPAAILKLPLSLVQSLVWTSLTYYVIGYTPEVSRFFRHFLVLFAVHVSS 660
            +LLFYP WAYVIP AI KLPLSL+QS VWTSLTYYVIGYTPEVSRFFRHFLVLFA+HVSS
Sbjct: 601  QLLFYPPWAYVIPPAIFKLPLSLLQSFVWTSLTYYVIGYTPEVSRFFRHFLVLFALHVSS 660

Query: 661  LSMFRMIALVSQHVVALTLSSFIILQIMIFGGFIITHPSMSPWLRWGFWLSPISYGEIGL 720
            LSMFRM+ALV+Q +VA T+SSF+ILQIMIFGGFII+H SMS WLRWGFW+SPISYGEIGL
Sbjct: 661  LSMFRMMALVNQQIVASTVSSFVILQIMIFGGFIISHSSMSAWLRWGFWVSPISYGEIGL 720

Query: 721  SINEFLAPRWQKMQGTNSTIGHVILQSRGLDYHQYFYWISLAALFGFAIIFNVGFAFALT 780
            SINEFLAPRWQK+QG+N TIGH+ILQSRGLDYHQYFYWISLAALFGFA++FN GFA ALT
Sbjct: 721  SINEFLAPRWQKIQGSNVTIGHIILQSRGLDYHQYFYWISLAALFGFALLFNFGFALALT 780

Query: 781  FLNGNSYCKDSSTFFNFILHSAPGSSPAIISYEKLSQSNCNGGANSVQNPPSSPKTSIES 840
            FLN                   PGSS AIISYEKLSQSN N  ANS Q+P SSPKTSIES
Sbjct: 781  FLN------------------PPGSSTAIISYEKLSQSNINADANSAQSPLSSPKTSIES 840

Query: 841  TK-------------GGIALPFTPLTVVFQHLQYYVDMPSGMRERGFTQKKLQLLSDITG 900
            TK             GGIALPF PLTVVF+ LQYYVDMPSGMRERGFTQKKLQLLSDITG
Sbjct: 841  TKGNILLNFTVRLLMGGIALPFRPLTVVFRDLQYYVDMPSGMRERGFTQKKLQLLSDITG 900

Query: 901  ALRPGILTALMGVSGAGKTTLLDVLAGRKTSGYVEGEIKIGGFPKVQETFARISGYCEQT 960
            A+RPGILTALMGVSGAGKTTLLDVLAGRKTSGY+EGEIKIGGFPKVQETFAR+SGYCEQT
Sbjct: 901  AIRPGILTALMGVSGAGKTTLLDVLAGRKTSGYIEGEIKIGGFPKVQETFARVSGYCEQT 960

Query: 961  DIHSSQITVEESLIFSAWLRLAPNIDSKTKAQFVNEVLETIELDSIKDSLVGIPGVSGLS 1020
            D+HSSQITVEESL FSAWLRLAP IDSKTKAQFVNEVLE IELDSIKDSLVGIPGVSGLS
Sbjct: 961  DVHSSQITVEESLFFSAWLRLAPEIDSKTKAQFVNEVLEIIELDSIKDSLVGIPGVSGLS 1020

Query: 1021 TEQRKRLTIAVELVSNPSIIFMDEPTTGLDARAAAIVMRAVKNVADTGRTIVCTIHQPSI 1080
            TEQRKRLTIAVELVSNPSIIFMDEPTTGLDARAAAIVMRAVKNVADTGRTIVCTIHQPSI
Sbjct: 1021 TEQRKRLTIAVELVSNPSIIFMDEPTTGLDARAAAIVMRAVKNVADTGRTIVCTIHQPSI 1080

Query: 1081 DIFESFDELILLKTGGHMIYYGPLGHDSSKVIEYFEHVPGVSKIRENYNPATWMLEVTSS 1140
            +IFESFDELILLKTGG MIYYGPLG DS++VIEYFEHVPGVS+IRENYNPATW+LE+TSS
Sbjct: 1081 NIFESFDELILLKTGGRMIYYGPLGQDSNQVIEYFEHVPGVSRIRENYNPATWILEITSS 1140

Query: 1141 AAEAKLGIDFAQVYKNSALYENNKELVKQLSVPPPGSRDLHFSNIFAQNFVRQFGACLWK 1200
            AAEAKLGIDFA VYKNS+LYENNKELVKQLS P PGSRDLHFSN+FAQNF RQFGACLWK
Sbjct: 1141 AAEAKLGIDFALVYKNSSLYENNKELVKQLSAPSPGSRDLHFSNVFAQNFARQFGACLWK 1200

Query: 1201 QNLSYWRNPHYNLLRILHTVASSLIFGILFWKHGKKLENQQNLFNNFGVMYSSVIFLGIY 1260
            QNLSYWRNP YNLLRILHTVASSLIFG+LFWK GKKLENQQ+LFNNFGVMYSSV+F+GIY
Sbjct: 1201 QNLSYWRNPRYNLLRILHTVASSLIFGVLFWKKGKKLENQQDLFNNFGVMYSSVVFMGIY 1260

Query: 1261 NCSSVFPNISRERTVMYRERFAGMYSSWAYSLAQVIIEVPYIFVQAAIYVIITYPMIGFY 1263
            NCSSVFPN+SRERTVMYRERFAGMYS WAYSLAQVIIEVPY+FVQAAIYVIITYPMIGFY
Sbjct: 1261 NCSSVFPNVSRERTVMYRERFAGMYSPWAYSLAQVIIEVPYVFVQAAIYVIITYPMIGFY 1320

BLAST of HG10019907 vs. ExPASy TrEMBL
Match: A0A6J1DL52 (pleiotropic drug resistance protein 3-like isoform X1 OS=Momordica charantia OX=3673 GN=LOC111021539 PE=3 SV=1)

HSP 1 Score: 2102.4 bits (5446), Expect = 0.0e+00
Identity = 1083/1442 (75.10%), Positives = 1203/1442 (83.43%), Query Frame = 0

Query: 8    AERGRSSSIAEEDND-GDVEDASLWATIERLPTFERLRSSLF-DINDEGKVEEKGRRVVD 67
            AE  RS   +  DND  DVEDASLWA IERLPTFER+RSS+F DI+  G+V+EKGRRVVD
Sbjct: 19   AEIRRSLKSSSSDNDSNDVEDASLWAAIERLPTFERVRSSVFDDISSRGEVKEKGRRVVD 78

Query: 68   VTKLRNEERHIFVHKLIKNVENDNLKLLTKVRDRIHKVGEKFPSVEVKYKNVHIEAECEV 127
            VTKL ++ERH+FVHKLIK+VE+DNLKLL KVR+RI +VG  FPSVEVKYKNVHIEAECEV
Sbjct: 79   VTKLDDQERHLFVHKLIKHVESDNLKLLRKVRERIDRVGVTFPSVEVKYKNVHIEAECEV 138

Query: 128  VHGKAIPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSGVINPGRLTLLLGPPGCGKT 187
            VHGKAIPTLWNSL++KLYDIIKFCG KSH+AKIDIIED SGVI PGRLTLLLGPPGCGKT
Sbjct: 139  VHGKAIPTLWNSLRTKLYDIIKFCGAKSHEAKIDIIEDASGVIKPGRLTLLLGPPGCGKT 198

Query: 188  TLLKALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQITVRETLDFSAR 247
            TLLKALSGNL+KSLK SGEICYNGHKLEEFVPQKTSAY+SQ++LHI Q+TVRETLDFS R
Sbjct: 199  TLLKALSGNLDKSLKMSGEICYNGHKLEEFVPQKTSAYISQNELHIAQMTVRETLDFSTR 258

Query: 248  CQGIGSRADIMKEVNKKEKEQGIIPNPDIDTYM------------------KILGLDICA 307
            CQGIGSRAD+MKE+ K+EKEQGIIP+ D+DTYM                  KILGLDICA
Sbjct: 259  CQGIGSRADLMKEIIKREKEQGIIPDSDVDTYMKAISVQGLKRNLHTDYILKILGLDICA 318

Query: 308  DTLVGDAMRRGISGGQKKRLTTGR------------------------------------ 367
            +TL GDAMRRGISGGQKKRLT G                                     
Sbjct: 319  ETLTGDAMRRGISGGQKKRLTIGEMIVGPKRVLLMDEITNGLDSSTAFQIVSCLQHLAHF 378

Query: 368  ------------------------------------RDRVLEFFEHCGFKCPKRKSIADF 427
                                                RD+VL+FFE+CGFKCP+RK++ADF
Sbjct: 379  TDATLLVSLLQPAPETFDLFDDLILMVQKKIIYHGPRDQVLQFFENCGFKCPERKNVADF 438

Query: 428  LQEVISKKDQPQFWY-CKQTPYAYISIDTFSRKFKCWNNLGRKMEEETLKPFDEQEEYSK 487
            LQEV+SKKDQPQ+WY   +  Y Y+S +TF R FK  ++LGRK++EE  +P+D++ + SK
Sbjct: 439  LQEVVSKKDQPQYWYRHDEARYTYVSNNTFCRMFKS-SSLGRKLDEEVSQPYDDKTK-SK 498

Query: 488  NNGVFLNGKNSVSKWQVSKACASREFLLMRRNSFVYVFKTSQLFFIASITMTVFIRTEMK 547
             N     G +SVSKWQV KACASREFLLM+RNSFVYVFKTSQLFF+ASI MTVF+R++MK
Sbjct: 499  RNVSSSLGVDSVSKWQVFKACASREFLLMKRNSFVYVFKTSQLFFLASIAMTVFLRSQMK 558

Query: 548  IDLQHGNYYMGALFYSLIILLVDALPELSLTIQRLQVVYKQKELLFYPAWAYVIPAAILK 607
            +DLQH NYYMGALFY LI+L+ +A+PEL+LT+QRL+V YKQKEL FYPAWAY IPAAILK
Sbjct: 559  VDLQHANYYMGALFYGLIMLVFNAVPELALTVQRLEVFYKQKELKFYPAWAYAIPAAILK 618

Query: 608  LPLSLVQSLVWTSLTYYVIGYTPEVSRFFRHFLVLFAVHVSSLSMFRMIALVSQHV-VAL 667
            +P SLVQ+LVWTSLTYYVIGYTPE SRFFRHFLVLFAV++ SLSMFR++A V + + VA 
Sbjct: 619  IPFSLVQALVWTSLTYYVIGYTPEFSRFFRHFLVLFAVNILSLSMFRLLASVIRSIDVAP 678

Query: 668  TLSSFIILQIMIFGGFIITHPSMSPWLRWGFWLSPISYGEIGLSINEFLAPRWQKMQGTN 727
            ++SSF +L I+ F GFIITH SM  W+ WGFW+SPISYGEIGLSINEFLAPRWQK Q TN
Sbjct: 679  SISSFTLLLILTFAGFIITHTSMPAWMEWGFWVSPISYGEIGLSINEFLAPRWQKRQSTN 738

Query: 728  STIGHVILQSRGLDYHQYFYWISLAALFGFAIIFNVGFAFALTFLNGNSYCKDSSTFFNF 787
            +TIGH+ILQSRGLD+HQYFYWISL ALFGFA++FNVGF  ALTFLN              
Sbjct: 739  TTIGHIILQSRGLDFHQYFYWISLGALFGFAVLFNVGFTLALTFLN-------------- 798

Query: 788  ILHSAPGSSPAIISYEKL----SQSNCNGGANSVQNPPSSPKTSIESTKGGIALPFTPLT 847
                +PGSS AIISYEKL    S  +CNGGANSV+   +SPK +IES+KG IALPFTPLT
Sbjct: 799  ----SPGSSRAIISYEKLGRAKSSEDCNGGANSVEQQAASPKAAIESSKGRIALPFTPLT 858

Query: 848  VVFQHLQYYVDMPSGMRERGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLLDVLA 907
            VVF+ L YYVDMP  MRERGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLLDVLA
Sbjct: 859  VVFRDLHYYVDMPVAMRERGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLLDVLA 918

Query: 908  GRKTSGYVEGEIKIGGFPKVQETFARISGYCEQTDIHSSQITVEESLIFSAWLRLAPNID 967
            GRKTSGY+EGEIKIGGFPKVQETFARISGYCEQTDIHSSQITVEESLIFSAWLRLA NID
Sbjct: 919  GRKTSGYIEGEIKIGGFPKVQETFARISGYCEQTDIHSSQITVEESLIFSAWLRLASNID 978

Query: 968  SKTKAQFVNEVLETIELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFMDEPT 1027
            SKTK QFVNEVLETIELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFMDEPT
Sbjct: 979  SKTKEQFVNEVLETIELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFMDEPT 1038

Query: 1028 TGLDARAAAIVMRAVKNVADTGRTIVCTIHQPSIDIFESFDELILLKTGGHMIYYGPLGH 1087
            TGLDARAAAIVMRAVKNV DTGRTIVCTIHQPSIDIFESFDELILLKTGGH+IYYGPLG 
Sbjct: 1039 TGLDARAAAIVMRAVKNVVDTGRTIVCTIHQPSIDIFESFDELILLKTGGHVIYYGPLGR 1098

Query: 1088 DSSKVIEYFEHVPGVSKIRENYNPATWMLEVTSSAAEAKLGIDFAQVYKNSALYENNKEL 1147
             SSKVIE+FE VPGVS IREN+NPATWMLEVTSSAAEAKLGIDFAQVYKNSALY+NNKE+
Sbjct: 1099 HSSKVIEFFEQVPGVSMIRENHNPATWMLEVTSSAAEAKLGIDFAQVYKNSALYKNNKEI 1158

Query: 1148 VKQLSVPPPGSRDLHFSNIFAQNFVRQFGACLWKQNLSYWRNPHYNLLRILHTVASSLIF 1207
            VKQLS PPPGSRDLHFSN+FAQ+F  QF ACLWKQNLSYWRNP YNL RIL+T+ASSL+F
Sbjct: 1159 VKQLSTPPPGSRDLHFSNVFAQSFAGQFKACLWKQNLSYWRNPCYNLTRILYTIASSLVF 1218

Query: 1208 GILFWKHGKKLENQQNLFNNFGVMYSSVIFLGIYNCSSVFPNISRERTVMYRERFAGMYS 1267
            G LFWKHGKKLENQQNLFNNFG MYSSV F+GI+NC++VFPN+SRERTVMYRERFAGMYS
Sbjct: 1219 GTLFWKHGKKLENQQNLFNNFGSMYSSVNFIGIHNCATVFPNVSRERTVMYRERFAGMYS 1278

Query: 1268 SWAYSLAQVIIEVPYIFVQAAIYVIITYPMIGFYGSGWKIFWCFYSMFCALLYFKNLGLL 1327
            SWAYSLAQVIIEVPYIFVQAA +VIITYPMIG+YGS  K+FWCFYSMFCALLYF  LG+L
Sbjct: 1279 SWAYSLAQVIIEVPYIFVQAAAFVIITYPMIGYYGSSSKVFWCFYSMFCALLYFNYLGML 1338

Query: 1328 LVTITPNYHIATILSSAFYVMFNLFAGFLVPKPRIPSWWIWFYYMIPTSWTLNCLLTSQY 1352
            L+++TPN+HIA+IL+SAFY  FNLFAGFLVPKPRIP WWIWFYYM PTSWTLNCLLTSQY
Sbjct: 1339 LISVTPNFHIASILASAFYSTFNLFAGFLVPKPRIPRWWIWFYYMSPTSWTLNCLLTSQY 1398

BLAST of HG10019907 vs. TAIR 10
Match: AT3G53480.1 (pleiotropic drug resistance 9)

HSP 1 Score: 1657.9 bits (4292), Expect = 0.0e+00
Identity = 853/1439 (59.28%), Positives = 1050/1439 (72.97%), Query Frame = 0

Query: 13   SSSIAEEDNDGDVED-----ASLWATIERLPTFERLRSSLFDINDEGKVEEKGRRVVDVT 72
            SSSI E +NDGDV D     A  WA IERLPT +R+RS+L D  DE  + EKGRRVVDVT
Sbjct: 38   SSSIYEVENDGDVNDHDAEYALQWAEIERLPTVKRMRSTLLDDGDE-SMTEKGRRVVDVT 97

Query: 73   KLRNEERHIFVHKLIKNVENDNLKLLTKVRDRIHKVGEKFPSVEVKYKNVHIEAECEVVH 132
            KL   ERH+ + KLIK++ENDNLKLL K+R RI +VG + P++EV+Y+++ + AECEVV 
Sbjct: 98   KLGAVERHLMIEKLIKHIENDNLKLLKKIRRRIDRVGMELPTIEVRYESLKVVAECEVVE 157

Query: 133  GKAIPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSGVINPGRLTLLLGPPGCGKTTL 192
            GKA+PTLWN+ +  L +++K  G K+H+AKI+II DV+G+I PGRLTLLLGPP CGKTTL
Sbjct: 158  GKALPTLWNTAKRVLSELVKLTGAKTHEAKINIINDVNGIIKPGRLTLLLGPPSCGKTTL 217

Query: 193  LKALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQITVRETLDFSARCQ 252
            LKALSGNL  +LK SGEI YNGH+L+EFVPQKTSAY+SQ+DLHI ++TVRET+DFSARCQ
Sbjct: 218  LKALSGNLENNLKCSGEISYNGHRLDEFVPQKTSAYISQYDLHIAEMTVRETVDFSARCQ 277

Query: 253  GIGSRADIMKEVNKKEKEQGIIPNPDIDTYM------------------KILGLDICADT 312
            G+GSR DIM EV+K+EKE+GIIP+ ++D YM                  KILGLDICA+ 
Sbjct: 278  GVGSRTDIMMEVSKREKEKGIIPDTEVDAYMKAISVEGLQRSLQTDYILKILGLDICAEI 337

Query: 313  LVGDAMRRGISGGQKKRLTT---------------------------------------- 372
            L+GD MRRGISGGQKKRLTT                                        
Sbjct: 338  LIGDVMRRGISGGQKKRLTTAEMIVGPTKALFMDEITNGLDSSTAFQIVKSLQQFAHISS 397

Query: 373  --------------------------------GRRDRVLEFFEHCGFKCPKRKSIADFLQ 432
                                            G R  VL FFE CGF+CP+RK +ADFLQ
Sbjct: 398  ATVLVSLLQPAPESYDLFDDIMLMAKGRIVYHGPRGEVLNFFEDCGFRCPERKGVADFLQ 457

Query: 433  EVISKKDQPQFWYCKQTPYAYISIDTFSRKFKCWNNLGRKMEEETLKPFDEQEEYSKNNG 492
            EVISKKDQ Q+W+ +  PY+++S++  S+KFK   ++G+K+E+   KP+D  + +     
Sbjct: 458  EVISKKDQAQYWWHEDLPYSFVSVEMLSKKFKDL-SIGKKIEDTLSKPYDRSKSH---KD 517

Query: 493  VFLNGKNSVSKWQVSKACASREFLLMRRNSFVYVFKTSQLFFIASITMTVFIRTEMKIDL 552
                   S+  W++  AC SRE+LLM+RN FVY+FKT+QL   A ITMTVFIRT M ID+
Sbjct: 518  ALSFSVYSLPNWELFIACISREYLLMKRNYFVYIFKTAQLVMAAFITMTVFIRTRMGIDI 577

Query: 553  QHGNYYMGALFYSLIILLVDALPELSLTIQRLQVVYKQKELLFYPAWAYVIPAAILKLPL 612
             HGN YM ALF++LIILLVD  PELS+T QRL V YKQK+L FYPAWAY IPA +LK+PL
Sbjct: 578  IHGNSYMSALFFALIILLVDGFPELSMTAQRLAVFYKQKQLCFYPAWAYAIPATVLKVPL 637

Query: 613  SLVQSLVWTSLTYYVIGYTPEVSRFFRHFLVLFAVHVSSLSMFRMIALVSQHVVA-LTLS 672
            S  +SLVWT L+YYVIGYTPE SRFF+ F++LFAVH +S+SMFR +A + Q VVA +T  
Sbjct: 638  SFFESLVWTCLSYYVIGYTPEASRFFKQFILLFAVHFTSISMFRCLAAIFQTVVASITAG 697

Query: 673  SFIILQIMIFGGFIITHPSMSPWLRWGFWLSPISYGEIGLSINEFLAPRWQKMQGTNSTI 732
            SF IL   +F GF+I  PSM  WL+WGFW +P+SYGEIGLS+NEFLAPRW +MQ  N T+
Sbjct: 698  SFGILFTFVFAGFVIPPPSMPAWLKWGFWANPLSYGEIGLSVNEFLAPRWNQMQPNNFTL 757

Query: 733  GHVILQSRGLDYHQYFYWISLAALFGFAIIFNVGFAFALTFLNGNSYCKDSSTFFNFILH 792
            G  ILQ+RG+DY+ Y YW+SL AL GF ++FN+ F  ALTFL                  
Sbjct: 758  GRTILQTRGMDYNGYMYWVSLCALLGFTVLFNIIFTLALTFL------------------ 817

Query: 793  SAPGSSPAIISYEKLSQSNCNGGANSVQNPPSSPKTSIESTK----GGIALPFTPLTVVF 852
             +P SS A+IS +KLS+    G   S ++     KT+    K      + LPF PLTV F
Sbjct: 818  KSPTSSRAMISQDKLSE--LQGTEKSTEDSSVRKKTTDSPVKTEEEDKMVLPFKPLTVTF 877

Query: 853  QHLQYYVDMPSGMRERGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLLDVLAGRK 912
            Q L Y+VDMP  MR++G+ QKKLQLLSDITGA RPGILTALMGVSGAGKTTLLDVLAGRK
Sbjct: 878  QDLNYFVDMPVEMRDQGYDQKKLQLLSDITGAFRPGILTALMGVSGAGKTTLLDVLAGRK 937

Query: 913  TSGYVEGEIKIGGFPKVQETFARISGYCEQTDIHSSQITVEESLIFSAWLRLAPNIDSKT 972
            TSGY+EG+I+I GFPKVQETFAR+SGYCEQTDIHS  ITVEES+I+SAWLRLAP ID+ T
Sbjct: 938  TSGYIEGDIRISGFPKVQETFARVSGYCEQTDIHSPNITVEESVIYSAWLRLAPEIDATT 997

Query: 973  KAQFVNEVLETIELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFMDEPTTGL 1032
            K +FV +VLETIELD IKDSLVG+ GVSGLSTEQRKRLTIAVELV+NPSIIFMDEPTTGL
Sbjct: 998  KTKFVKQVLETIELDEIKDSLVGVTGVSGLSTEQRKRLTIAVELVANPSIIFMDEPTTGL 1057

Query: 1033 DARAAAIVMRAVKNVADTGRTIVCTIHQPSIDIFESFDELILLKTGGHMIYYGPLGHDSS 1092
            DARAAAIVMRAVKNVADTGRTIVCTIHQPSIDIFE+FDEL+LLK GG MIY GPLG  S 
Sbjct: 1058 DARAAAIVMRAVKNVADTGRTIVCTIHQPSIDIFEAFDELVLLKRGGRMIYTGPLGQHSR 1117

Query: 1093 KVIEYFEHVPGVSKIRENYNPATWMLEVTSSAAEAKLGIDFAQVYKNSALYENNKELVKQ 1152
             +IEYFE VP + KI++N+NPATWML+V+S + E +LG+DFA++Y +SALY+ N ELVKQ
Sbjct: 1118 HIIEYFESVPEIPKIKDNHNPATWMLDVSSQSVEIELGVDFAKIYHDSALYKRNSELVKQ 1177

Query: 1153 LSVPPPGSRDLHFSNIFAQNFVRQFGACLWKQNLSYWRNPHYNLLRILHTVASSLIFGIL 1212
            LS P  GS D+ F   FAQ++  QF + LWK NLSYWR+P YNL+R++HT+ SSLIFG L
Sbjct: 1178 LSQPDSGSSDIQFKRTFAQSWWGQFKSILWKMNLSYWRSPSYNLMRMMHTLVSSLIFGAL 1237

Query: 1213 FWKHGKKLENQQNLFNNFGVMYSSVIFLGIYNCSSVFPNISRERTVMYRERFAGMYSSWA 1272
            FWK G+ L+ QQ++F  FG +Y  V+FLGI NC+S       ER VMYRERFAGMYS+ A
Sbjct: 1238 FWKQGQNLDTQQSMFTVFGAIYGLVLFLGINNCASALQYFETERNVMYRERFAGMYSATA 1297

Query: 1273 YSLAQVIIEVPYIFVQAAIYVIITYPMIGFYGSGWKIFWCFYSMFCALLYFKNLGLLLVT 1332
            Y+L QV+ E+PYIF+QAA +VI+TYPMIGFY S +K+FW  YSMFC+LL F  L + LV+
Sbjct: 1298 YALGQVVTEIPYIFIQAAEFVIVTYPMIGFYPSAYKVFWSLYSMFCSLLTFNYLAMFLVS 1357

Query: 1333 ITPNYHIATILSSAFYVMFNLFAGFLVPKPRIPSWWIWFYYMIPTSWTLNCLLTSQYGDI 1352
            ITPN+ +A IL S FYV FNLF+GFL+P+ ++P WWIW YY+ PTSWTLN  ++SQYGDI
Sbjct: 1358 ITPNFMVAAILQSLFYVGFNLFSGFLIPQTQVPGWWIWLYYLTPTSWTLNGFISSQYGDI 1417

BLAST of HG10019907 vs. TAIR 10
Match: AT2G37280.1 (pleiotropic drug resistance 5)

HSP 1 Score: 1648.3 bits (4267), Expect = 0.0e+00
Identity = 847/1439 (58.86%), Positives = 1041/1439 (72.34%), Query Frame = 0

Query: 5    VGSAERGRSSSIAEEDNDGDVEDASLWATIERLPTFERLRSSLFDINDEGKVEEKGRRVV 64
            +GS+ R  SS    ED   + E A  WA I+RLPTF+RLRSSL D   EG   EKG++VV
Sbjct: 1    MGSSFRSSSSRNEHEDGGDEAEHALQWAEIQRLPTFKRLRSSLVDKYGEG--TEKGKKVV 60

Query: 65   DVTKLRNEERHIFVHKLIKNVENDNLKLLTKVRDRIHKVGEKFPSVEVKYKNVHIEAECE 124
            DVTKL   ERH+ + KLIK++ENDNLKLL K+R R+ +VG +FPS+EV+Y+++ +EA CE
Sbjct: 61   DVTKLGAMERHLMIEKLIKHIENDNLKLLKKIRRRMERVGVEFPSIEVRYEHLGVEAACE 120

Query: 125  VVHGKAIPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSGVINPGRLTLLLGPPGCGK 184
            VV GKA+PTLWNSL+    D++K  GV++++A I I+ DVSG+I+PGRLTLLLGPPGCGK
Sbjct: 121  VVEGKALPTLWNSLKHVFLDLLKLSGVRTNEANIKILTDVSGIISPGRLTLLLGPPGCGK 180

Query: 185  TTLLKALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQITVRETLDFSA 244
            TTLLKALSGNL  +LK  GEI YNGH L E VPQKTSAY+SQHDLHI ++T RET+DFSA
Sbjct: 181  TTLLKALSGNLENNLKCYGEISYNGHGLNEVVPQKTSAYISQHDLHIAEMTTRETIDFSA 240

Query: 245  RCQGIGSRADIMKEVNKKEKEQGIIPNPDIDTYM------------------KILGLDIC 304
            RCQG+GSR DIM EV+K+EK+ GIIP+P+ID YM                  KILGLDIC
Sbjct: 241  RCQGVGSRTDIMMEVSKREKDGGIIPDPEIDAYMKAISVKGLKRSLQTDYILKILGLDIC 300

Query: 305  ADTLVGDAMRRGISGGQKKRLTT------------------------------------- 364
            A+TLVG+AM+RGISGGQKKRLTT                                     
Sbjct: 301  AETLVGNAMKRGISGGQKKRLTTAEMIVGPTKALFMDEITNGLDSSTAFQIIKSLQQVAH 360

Query: 365  -----------------------------------GRRDRVLEFFEHCGFKCPKRKSIAD 424
                                               G RD VL+FFE CGF+CP+RK +AD
Sbjct: 361  ITNATVFVSLLQPAPESYDLFDDIVLMAEGKIVYHGPRDDVLKFFEECGFQCPERKGVAD 420

Query: 425  FLQEVISKKDQPQFWYCKQTPYAYISIDTFSRKFKCWNNLGRKMEEETLKPFDEQEEYSK 484
            FLQEVISKKDQ Q+W  +  P++++S+DT S++FK    +GRK+EE   KP+D     SK
Sbjct: 421  FLQEVISKKDQGQYWLHQNLPHSFVSVDTLSKRFKDL-EIGRKIEEALSKPYD----ISK 480

Query: 485  NNGVFLN-GKNSVSKWQVSKACASREFLLMRRNSFVYVFKTSQLFFIASITMTVFIRTEM 544
             +   L+    S+ KW++ +AC SREFLLM+RN FVY+FKT QL   A ITMTVFIRT M
Sbjct: 481  THKDALSFNVYSLPKWELFRACISREFLLMKRNYFVYLFKTFQLVLAAIITMTVFIRTRM 540

Query: 545  KIDLQHGNYYMGALFYSLIILLVDALPELSLTIQRLQVVYKQKELLFYPAWAYVIPAAIL 604
             ID+ HGN YM  LF++ ++LLVD +PELS+T+QRL V YKQK+L FYPAWAY IPA +L
Sbjct: 541  DIDIIHGNSYMSCLFFATVVLLVDGIPELSMTVQRLSVFYKQKQLCFYPAWAYAIPATVL 600

Query: 605  KLPLSLVQSLVWTSLTYYVIGYTPEVSRFFRHFLVLFAVHVSSLSMFRMIALVSQ-HVVA 664
            K+PLS  +SLVWT LTYYVIGYTPE  RFFR F++LFAVH +S+SMFR IA + Q  V A
Sbjct: 601  KIPLSFFESLVWTCLTYYVIGYTPEPYRFFRQFMILFAVHFTSISMFRCIAAIFQTGVAA 660

Query: 665  LTLSSFIILQIMIFGGFIITHPSMSPWLRWGFWLSPISYGEIGLSINEFLAPRWQKMQGT 724
            +T  SF++L   +F GF I +  M  WL+WGFW++PISY EIGLS+NEFLAPRWQKMQ T
Sbjct: 661  MTAGSFVMLITFVFAGFAIPYTDMPGWLKWGFWVNPISYAEIGLSVNEFLAPRWQKMQPT 720

Query: 725  NSTIGHVILQSRGLDYHQYFYWISLAALFGFAIIFNVGFAFALTFLNGNSYCKDSSTFFN 784
            N T+G  IL+SRGL+Y  Y YW+SL+AL G  IIFN  F  AL+FL              
Sbjct: 721  NVTLGRTILESRGLNYDDYMYWVSLSALLGLTIIFNTIFTLALSFL-------------- 780

Query: 785  FILHSAPGSSPAIISYEKLSQSNCNGGANSVQNPPSSPKTSIESTKGGIALPFTPLTVVF 844
                 +P SS  +IS +KLS+      ++  +N P           G + LPF PLT+ F
Sbjct: 781  ----KSPTSSRPMISQDKLSELQGTKDSSVKKNKPLDSSIKTNEDPGKMILPFKPLTITF 840

Query: 845  QHLQYYVDMPSGMRERGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLLDVLAGRK 904
            Q L YYVD+P  M+ +G+ +KKLQLLS+ITGA RPG+LTALMG+SGAGKTTLLDVLAGRK
Sbjct: 841  QDLNYYVDVPVEMKGQGYNEKKLQLLSEITGAFRPGVLTALMGISGAGKTTLLDVLAGRK 900

Query: 905  TSGYVEGEIKIGGFPKVQETFARISGYCEQTDIHSSQITVEESLIFSAWLRLAPNIDSKT 964
            TSGY+EGEI+I GF KVQETFAR+SGYCEQTDIHS  ITVEESLI+SAWLRL P I+ +T
Sbjct: 901  TSGYIEGEIRISGFLKVQETFARVSGYCEQTDIHSPSITVEESLIYSAWLRLVPEINPQT 960

Query: 965  KAQFVNEVLETIELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFMDEPTTGL 1024
            K +FV +VLETIEL+ IKD+LVG+ GVSGLSTEQRKRLT+AVELV+NPSIIFMDEPTTGL
Sbjct: 961  KIRFVKQVLETIELEEIKDALVGVAGVSGLSTEQRKRLTVAVELVANPSIIFMDEPTTGL 1020

Query: 1025 DARAAAIVMRAVKNVADTGRTIVCTIHQPSIDIFESFDELILLKTGGHMIYYGPLGHDSS 1084
            DARAAAIVMRAVKNVA+TGRTIVCTIHQPSI IFE+FDEL+LLK GG MIY GPLG  SS
Sbjct: 1021 DARAAAIVMRAVKNVAETGRTIVCTIHQPSIHIFEAFDELVLLKRGGRMIYSGPLGQHSS 1080

Query: 1085 KVIEYFEHVPGVSKIRENYNPATWMLEVTSSAAEAKLGIDFAQVYKNSALYENNKELVKQ 1144
             VIEYF+++PGV+KIR+ YNPATWMLEVTS + E +L +DFA++Y  S LY+NN ELVK+
Sbjct: 1081 CVIEYFQNIPGVAKIRDKYNPATWMLEVTSESVETELDMDFAKIYNESDLYKNNSELVKE 1140

Query: 1145 LSVPPPGSRDLHFSNIFAQNFVRQFGACLWKQNLSYWRNPHYNLLRILHTVASSLIFGIL 1204
            LS P  GS DLHF   FAQN+  QF +CLWK +LSYWR+P YNL+RI HT  SS IFG+L
Sbjct: 1141 LSKPDHGSSDLHFKRTFAQNWWEQFKSCLWKMSLSYWRSPSYNLMRIGHTFISSFIFGLL 1200

Query: 1205 FWKHGKKLENQQNLFNNFGVMYSSVIFLGIYNCSSVFPNISRERTVMYRERFAGMYSSWA 1264
            FW  GKK++ QQNLF   G +Y  V+F+GI NC+S       ER VMYRERFAGMYS++A
Sbjct: 1201 FWNQGKKIDTQQNLFTVLGAIYGLVLFVGINNCTSALQYFETERNVMYRERFAGMYSAFA 1260

Query: 1265 YSLAQVIIEVPYIFVQAAIYVIITYPMIGFYGSGWKIFWCFYSMFCALLYFKNLGLLLVT 1324
            Y+LAQV+ E+PYIF+Q+A +VI+ YPMIGFY S  K+FW  Y+MFC LL F  L + L++
Sbjct: 1261 YALAQVVTEIPYIFIQSAEFVIVIYPMIGFYASFSKVFWSLYAMFCNLLCFNYLAMFLIS 1320

Query: 1325 ITPNYHIATILSSAFYVMFNLFAGFLVPKPRIPSWWIWFYYMIPTSWTLNCLLTSQYGDI 1352
            ITPN+ +A IL S F+  FN+FAGFL+PKP+IP WW+WFYY+ PTSWTLN   +SQYGDI
Sbjct: 1321 ITPNFMVAAILQSLFFTTFNIFAGFLIPKPQIPKWWVWFYYITPTSWTLNLFFSSQYGDI 1380

BLAST of HG10019907 vs. TAIR 10
Match: AT4G15215.1 (pleiotropic drug resistance 13)

HSP 1 Score: 1455.3 bits (3766), Expect = 0.0e+00
Identity = 758/1438 (52.71%), Positives = 996/1438 (69.26%), Query Frame = 0

Query: 1    MAQMVGSAERGRSSSIAEEDNDGDVEDASL---WATIERLPTFERLRSSLFDINDEGKVE 60
            MAQ     ++ +S  +     +G  ++  L   WAT+ERLPTF+R+ ++L    D+    
Sbjct: 1    MAQTGEDVDKAKSFQVEFACGNGVDDEEKLRSQWATVERLPTFKRVTTALLHTGDDSS-- 60

Query: 61   EKGRRVVDVTKLRNEERHIFVHKLIKNVENDNLKLLTKVRDRIHKVGEKFPSVEVKYKNV 120
                 ++DVTKL + ER + + KL+K +E DNL+LL K+R RI +VG + P+VEV++ ++
Sbjct: 61   ----DIIDVTKLEDAERRLLIEKLVKQIEADNLRLLRKIRKRIDEVGIELPTVEVRFNDL 120

Query: 121  HIEAECEVVHGKAIPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSGVINPGRLTLLL 180
             +EAEC+VVHGK IPTLWN+++  L   +  C  K  + KI I++ VSG++ PGR+TLLL
Sbjct: 121  SVEAECQVVHGKPIPTLWNTIKGSLSKFV--CSKK--ETKIGILKGVSGIVRPGRMTLLL 180

Query: 181  GPPGCGKTTLLKALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQITVR 240
            GPPGCGKTTLL+ALSG L+ S+K  G++ YNG  L EF+P+KTS+Y+SQ+DLHIP+++VR
Sbjct: 181  GPPGCGKTTLLQALSGRLSHSVKVGGKVSYNGCLLSEFIPEKTSSYISQNDLHIPELSVR 240

Query: 241  ETLDFSARCQGIGSRADIMKEVNKKEKEQGIIPNPDIDTYM------------------K 300
            ETLDFSA CQGIGSR +IMKE++++EK + I+P+PDID YM                  K
Sbjct: 241  ETLDFSACCQGIGSRMEIMKEISRREKLKEIVPDPDIDAYMKAISVEGLKNSMQTDYILK 300

Query: 301  ILGLDICADTLVGDAMRRGISGGQKKRLTTGR---------------------------- 360
            ILGLDICADT  GDA R GISGGQK+RLTT                              
Sbjct: 301  ILGLDICADTRAGDATRPGISGGQKRRLTTATTLLMDEISNGLDSSTTFQIVSCLQQLAH 360

Query: 361  -------------------------------------RDRVLEFFEHCGFKCPKRKSIAD 420
                                                 R  + +FFE CGFKCP+RK +AD
Sbjct: 361  IAGATILISLLQPAPETFELFDDVILLGEGKIIYHAPRADICKFFEGCGFKCPERKGVAD 420

Query: 421  FLQEVISKKDQPQFWYCKQTPYAYISIDTFSRKFKCWNNLGRKMEEETLKPFDEQEEYSK 480
            FLQEV+S+KDQ Q+W  +  PY+YIS+D+F +KF   +NLG  ++EE  KPFD+ +   K
Sbjct: 421  FLQEVMSRKDQEQYWCHRSKPYSYISVDSFIKKFN-ESNLGFLLKEELSKPFDKSQT-RK 480

Query: 481  NNGVFLNGKNSVSKWQVSKACASREFLLMRRNSFVYVFKTSQLFFIASITMTVFIRTEMK 540
            ++  F   K S+SKW++ KAC+ RE LLM+RNSF+Y+FK+  L F A +TMTVF++    
Sbjct: 481  DSLCF--RKYSLSKWEMLKACSRREILLMKRNSFIYLFKSGLLVFNALVTMTVFLQAGAT 540

Query: 541  IDLQHGNYYMGALFYSLIILLVDALPELSLTIQRLQVVYKQKELLFYPAWAYVIPAAILK 600
             D +HGNY MG++F +L  LL D LPEL+LTI RL V  KQK+L FYPAWAY IP+ IL+
Sbjct: 541  RDARHGNYLMGSMFTALFRLLADGLPELTLTISRLGVFCKQKDLYFYPAWAYAIPSIILR 600

Query: 601  LPLSLVQSLVWTSLTYYVIGYTPEVSRFFRHFLVLFAVHVSSLSMFRMIALVSQHVVALT 660
            +PLS++ S +WT LTYYVIGY+PEV RFFRHF++L   H+S +SMFR IA + +  VA +
Sbjct: 601  IPLSVLDSFIWTVLTYYVIGYSPEVGRFFRHFIILLTFHLSCISMFRAIASICRTFVACS 660

Query: 661  LSSFI-ILQIMIFGGFIITHPSMSPWLRWGFWLSPISYGEIGLSINEFLAPRWQKMQGTN 720
            ++  I +L + +FGGF+I   SM  WL WGFWLSP+SY EIGL+ NEF +PRW+K+   N
Sbjct: 661  ITGAISVLLLALFGGFVIPKSSMPTWLGWGFWLSPLSYAEIGLTANEFFSPRWRKLTSGN 720

Query: 721  STIGHVILQSRGLDYHQYFYWISLAALFGFAIIFNVGFAFALTFLNGNSYCKDSSTFFNF 780
             T G  +L  RGL++ ++ YW +  AL GF + FN  +  ALT+ N              
Sbjct: 721  ITAGEQVLDVRGLNFGRHSYWTAFGALVGFVLFFNALYTLALTYRNN------------- 780

Query: 781  ILHSAPGSSPAIISYEKLSQSNCNGGANSVQNPPSSPKTSIESTKGGIALPFTPLTVVFQ 840
                 P  S AI+S+ K SQ        S ++    P+ +  +  G + LPF PLTV FQ
Sbjct: 781  -----PQRSRAIVSHGKNSQC-------SEEDFKPCPEITSRAKTGKVILPFKPLTVTFQ 840

Query: 841  HLQYYVDMPSGMRERGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLLDVLAGRKT 900
            ++QYY++ P G        K  QLL DITGAL+PG+LT+LMGVSGAGKTTLLDVL+GRKT
Sbjct: 841  NVQYYIETPQG--------KTRQLLFDITGALKPGVLTSLMGVSGAGKTTLLDVLSGRKT 900

Query: 901  SGYVEGEIKIGGFPKVQETFARISGYCEQTDIHSSQITVEESLIFSAWLRLAPNIDSKTK 960
             G ++GEI++GG+PKVQETFAR+SGYCEQ DIHS  ITVEESL +SAWLRL  NID+KTK
Sbjct: 901  RGIIKGEIRVGGYPKVQETFARVSGYCEQFDIHSPNITVEESLKYSAWLRLPYNIDAKTK 960

Query: 961  AQFVNEVLETIELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFMDEPTTGLD 1020
             + V EVLET+EL+ IKDS+VG+PG+SGLSTEQRKRLTIAVELVSNPSIIF+DEPTTGLD
Sbjct: 961  NELVKEVLETVELEDIKDSMVGLPGISGLSTEQRKRLTIAVELVSNPSIIFLDEPTTGLD 1020

Query: 1021 ARAAAIVMRAVKNVADTGRTIVCTIHQPSIDIFESFDELILLKTGGHMIYYGPLGHDSSK 1080
            ARAAAIVMRAVKNVA+TGRT+VCTIHQPSIDIFE+FDELIL+K GG ++YYGPLG  SSK
Sbjct: 1021 ARAAAIVMRAVKNVAETGRTVVCTIHQPSIDIFETFDELILMKDGGQLVYYGPLGKHSSK 1080

Query: 1081 VIEYFEHVPGVSKIRENYNPATWMLEVTSSAAEAKLGIDFAQVYKNSALYENNKELVKQL 1140
            VI+YFE +PGV K+++N NPATWML++T  +AE +LG+DFAQ YK+S LY+ NK +V+QL
Sbjct: 1081 VIKYFESIPGVPKVQKNCNPATWMLDITCKSAEHRLGMDFAQAYKDSTLYKENKMVVEQL 1140

Query: 1141 SVPPPGSRDLHFSNIFAQNFVRQFGACLWKQNLSYWRNPHYNLLRILHTVASSLIFGILF 1200
            S    GS  L F + ++Q    Q  ACLWKQ+ SYWRNP +NL RI+  + +SL+  +LF
Sbjct: 1141 SSASLGSEALSFPSRYSQTGWGQLKACLWKQHCSYWRNPSHNLTRIVFILLNSLLCSLLF 1200

Query: 1201 WKHGKKLENQQNLFNNFGVMYSSVIFLGIYNCSSVFPNISRERTVMYRERFAGMYSSWAY 1260
            W+  K + NQQ+LF+ FG MY+ VIF GI NC++V   I+ ER V YRERFA MYSSWAY
Sbjct: 1201 WQKAKDINNQQDLFSIFGSMYTIVIFSGINNCATVMNFIATERNVFYRERFARMYSSWAY 1260

Query: 1261 SLAQVIIEVPYIFVQAAIYVIITYPMIGFYGSGWKIFWCFYSMFCALLYFKNLGLLLVTI 1320
            S +QV++EVPY  +Q+ +  II YPMIG++ S +K+FW  YS+FC+LL F   G+L+V +
Sbjct: 1261 SFSQVLVEVPYSLLQSLLCTIIVYPMIGYHMSVYKMFWSLYSIFCSLLIFNYCGMLMVAL 1320

Query: 1321 TPNYHIATILSSAFYVMFNLFAGFLVPKPRIPSWWIWFYYMIPTSWTLNCLLTSQYGDID 1352
            TPN H+A  L S F+ M NLFAGF++PK +IP WWIW YY+ PTSW L  LL+SQYGD++
Sbjct: 1321 TPNIHMALTLRSTFFSMVNLFAGFVMPKQKIPKWWIWMYYLSPTSWVLEGLLSSQYGDVE 1380

BLAST of HG10019907 vs. TAIR 10
Match: AT4G15236.1 (ABC-2 and Plant PDR ABC-type transporter family protein)

HSP 1 Score: 1449.9 bits (3752), Expect = 0.0e+00
Identity = 761/1424 (53.44%), Positives = 979/1424 (68.75%), Query Frame = 0

Query: 19   EDNDGDVEDASLWATIERLPTFERLRSSLFDINDEGKVEEKGRRVVDVTKLRNEERHIFV 78
            E+ DGD +  S W  IER PT +R+ ++LF   DE + +   RRV+DV+KL + +R +F+
Sbjct: 16   ENGDGD-QVRSQWVAIERSPTCKRITTALFCKRDE-QGKRSQRRVMDVSKLEDLDRRLFI 75

Query: 79   HKLIKNVENDNLKLLTKVRDRIHKVGEKFPSVEVKYKNVHIEAECEVVHGKAIPTLWNSL 138
             +LI++VE+DN  LL K+R R  +VG   P +EV++ ++ +EAECEVVHGK IPTLWN++
Sbjct: 76   DELIRHVEDDNRVLLQKIRTRTDEVGIDLPKIEVRFSDLFVEAECEVVHGKPIPTLWNAI 135

Query: 139  QSKLYDIIKFCGVKSHKAKIDIIEDVSGVINPGRLTLLLGPPGCGKTTLLKALSGNLNKS 198
             SKL            + KI I++ VSG+I P R+TLLLGPPGCGKTTLL ALSG L+ S
Sbjct: 136  ASKLSRFT----FSKQEDKISILKGVSGIIRPKRMTLLLGPPGCGKTTLLLALSGRLDPS 195

Query: 199  LKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQITVRETLDFSARCQGIGSRADIMKE 258
            LK  GE+ YNGH   EFVP+KTS+YVSQ+DLHIP+++VRETLDFS   QG GSR ++MKE
Sbjct: 196  LKTRGEVSYNGHLFSEFVPEKTSSYVSQNDLHIPELSVRETLDFSGCFQGAGSRLEMMKE 255

Query: 259  VNKKEKEQGIIPNPDIDTYM------------------KILGLDICADTLVGDAMRRGIS 318
            ++++EK +GI+P+PDID YM                  KILGL ICADT VGDA R GIS
Sbjct: 256  ISRREKLKGIVPDPDIDAYMKAASIEGSKTNLQTDYILKILGLTICADTRVGDASRPGIS 315

Query: 319  GGQKKRLTTGR------------------------------------------------- 378
            GGQK+RLTTG                                                  
Sbjct: 316  GGQKRRLTTGEMIVGPIKTLFMDEISNGLDSSTTFQILSCLQQFARLSEGTILVSLLQPA 375

Query: 379  -----------------------RDRVLEFFEHCGFKCPKRKSIADFLQEVISKKDQPQF 438
                                   RD +  FFE CGFKCP+RKS+A+FLQEVIS+KDQ Q+
Sbjct: 376  PETFELFDDLILMGEGKIIYHGPRDFICSFFEDCGFKCPQRKSVAEFLQEVISRKDQEQY 435

Query: 439  WYCKQTPYAYISIDTFSRKFKCWNNLGRKMEEETLKPFDEQEEYSKNNGVFLNGKNSVSK 498
            W  +  PY Y+SID+F  KFK  ++LG ++++E  K +D+ +  ++ +G+ +  K S+S 
Sbjct: 436  WCHRDKPYCYVSIDSFIEKFK-KSDLGLQLQDELSKTYDKSQ--TQKDGLCIR-KYSLSN 495

Query: 499  WQVSKACASREFLLMRRNSFVYVFKTSQLFFIASITMTVFIRTEMKIDLQHGNYYMGALF 558
            W + KAC+ REFLLM+RNSFVYVFK+  L FI SI MTV++RT    D  H NY +G+LF
Sbjct: 496  WDMFKACSRREFLLMKRNSFVYVFKSGLLIFIGSIAMTVYLRTGSTRDSLHANYLLGSLF 555

Query: 559  YSLIILLVDALPELSLTIQRLQVVYKQKELLFYPAWAYVIPAAILKLPLSLVQSLVWTSL 618
            +SLI LL D LPEL+LT+ R+ V  KQKEL FYPAWAY IP+AILK+P+S ++S +WT L
Sbjct: 556  FSLIKLLADGLPELTLTVSRIAVFCKQKELYFYPAWAYAIPSAILKIPISFLESFLWTML 615

Query: 619  TYYVIGYTPEVSRFFRHFLVLFAVHVSSLSMFRMIALVSQHV-VALTLSSFIILQIMIFG 678
            TYYVIGY+PE  RF R  L+LFA+H+S +SMFR I  V +   VA T+ S  I+ + +FG
Sbjct: 616  TYYVIGYSPEAGRFIRQVLILFALHLSCISMFRAIGAVFRDFDVATTIGSISIVLLSVFG 675

Query: 679  GFIITHPSMSPWLRWGFWLSPISYGEIGLSINEFLAPRWQKMQGTNSTIGHVILQSRGLD 738
            GFI+  PSM  WL WGFWLSP+SY EIGL+ NEF AP W+KM   N T+G  +L +RGL+
Sbjct: 676  GFIVRKPSMPSWLEWGFWLSPLSYAEIGLTSNEFFAPMWRKMTSENRTLGEQVLDARGLN 735

Query: 739  YHQYFYWISLAALFGFAIIFNVGFAFALTFLNGNSYCKDSSTFFNFILHSAPGSSPAIIS 798
            +    YW +  AL GF + FN  FA ALTFL  +                    S  I+S
Sbjct: 736  FGNQSYWNAFGALIGFTLFFNTVFALALTFLKTSQ------------------RSRVIVS 795

Query: 799  YEKLSQSNCNGGANSVQNPPSSPKTSIESTKGGIALPFTPLTVVFQHLQYYVDMPSGMRE 858
            ++K              N  SS K S  ++    ALPF PLT  FQ +QY+++ P G   
Sbjct: 796  HDK--------------NTQSSEKDSKIASHSKNALPFEPLTFTFQDVQYFIETPQG--- 855

Query: 859  RGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLLDVLAGRKTSGYVEGEIKIGGFP 918
                 KKLQLLSD+TGA +PG+LTALMGVSGAGKTTLLDVL+GRKT G ++G+I++GG+ 
Sbjct: 856  -----KKLQLLSDVTGAFKPGVLTALMGVSGAGKTTLLDVLSGRKTRGDIKGQIEVGGYV 915

Query: 919  KVQETFARISGYCEQTDIHSSQITVEESLIFSAWLRLAPNIDSKTKAQFVNEVLETIELD 978
            KVQ+TF+R+SGYCEQ DIHS  +TV+ESL +SAWLRL  NI S+TK+  VNEVLETIEL+
Sbjct: 916  KVQDTFSRVSGYCEQFDIHSPNLTVQESLKYSAWLRLPCNISSETKSAIVNEVLETIELE 975

Query: 979  SIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFMDEPTTGLDARAAAIVMRAVKNV 1038
             IKDSLVG+PG+SG++ EQRKRLTIAVELVSNPSIIFMDEPTTGLDARAAAIVMRAVKN+
Sbjct: 976  EIKDSLVGVPGISGVTAEQRKRLTIAVELVSNPSIIFMDEPTTGLDARAAAIVMRAVKNI 1035

Query: 1039 ADTGRTIVCTIHQPSIDIFESFDELILLKTGGHMIYYGPLGHDSSKVIEYFEHVPGVSKI 1098
            A+TGRT+VCTIHQPSIDIFE+FDELIL+K GG +IYYGPLG  SSKVIEYF  +PGV K+
Sbjct: 1036 AETGRTVVCTIHQPSIDIFEAFDELILMKNGGKIIYYGPLGQHSSKVIEYFMSIPGVPKL 1095

Query: 1099 RENYNPATWMLEVTSSAAEAKLGIDFAQVYKNSALYENNKELVKQLSVPPPGSRDLHFSN 1158
            +EN NPATW+L++TS ++E KLG+D A +Y+ S L++ NK +++Q      GS  L  S+
Sbjct: 1096 KENSNPATWILDITSKSSEDKLGVDLAHIYEESTLFKENKMVIEQTRCTSLGSERLILSS 1155

Query: 1159 IFAQNFVRQFGACLWKQNLSYWRNPHYNLLRILHTVASSLIFGILFWKHGKKLENQQNLF 1218
             +AQ    QF ACLWKQ+LSYWRNP YNL RI+    + ++ GILF +  K++ NQQ+LF
Sbjct: 1156 RYAQTSWEQFKACLWKQHLSYWRNPSYNLTRIIFMCFTCMLCGILFLQKAKEINNQQDLF 1215

Query: 1219 NNFGVMYSSVIFLGIYNCSSVFPNISRERTVMYRERFAGMYSSWAYSLAQVIIEVPYIFV 1278
            N FG M++ V+F GI NCS+V   ++ ER V YRERF+ MY+ WAYSLAQV++E+PY   
Sbjct: 1216 NVFGSMFTVVLFSGINNCSTVIFCVATERNVFYRERFSRMYNPWAYSLAQVLVEIPYSLF 1275

Query: 1279 QAAIYVIITYPMIGFYGSGWKIFWCFYSMFCALLYFKNLGLLLVTITPNYHIATILSSAF 1338
            Q+ IYVII YPM+G++ S +K+FW FYS+FC+LL F   G+LLV +TPN HIA  L S+F
Sbjct: 1276 QSIIYVIIVYPMVGYHWSVYKVFWSFYSIFCSLLIFNYFGMLLVVVTPNVHIAFTLRSSF 1335

Query: 1339 YVMFNLFAGFLVPKPRIPSWWIWFYYMIPTSWTLNCLLTSQYGDIDKAMVVFGERTTTVS 1352
            Y + NLFAG+++PKP IP WWIW YY+ PTSW LN LLTSQYGD++K ++ FGE+   VS
Sbjct: 1336 YAIVNLFAGYVMPKPNIPRWWIWMYYLSPTSWVLNGLLTSQYGDMEKEILAFGEK-KKVS 1388

BLAST of HG10019907 vs. TAIR 10
Match: AT4G15230.1 (pleiotropic drug resistance 2 )

HSP 1 Score: 1446.8 bits (3744), Expect = 0.0e+00
Identity = 766/1442 (53.12%), Positives = 994/1442 (68.93%), Query Frame = 0

Query: 6    GSAERGRSSSIAEEDNDG----DVEDASL-WATIERLPTFERLRSSLFDINDEGKVEEKG 65
            G  +  +++S+  E   G    D E+  L WAT+ERLPTF+R+ ++L    DE  V  KG
Sbjct: 5    GEEDEEKATSLEVEFASGNGVDDEEELRLQWATVERLPTFKRVTTALL-ARDE--VSGKG 64

Query: 66   RRVVDVTKLRNEERHIFVHKLIKNVENDNLKLLTKVRDRIHKVGEKFPSVEVKYKNVHIE 125
             RV+DVT+L   ER + +  L+K +E+DNL+LL K+R RI KVG + P+VEV++ N+ +E
Sbjct: 65   -RVIDVTRLEGAERRLLIEMLVKQIEDDNLRLLRKIRKRIDKVGIELPTVEVRFNNLSVE 124

Query: 126  AECEVVHGKAIPTLWNSLQSKLYDIIKFCGVKSHKAKIDIIEDVSGVINPGRLTLLLGPP 185
            AEC+V+HGK IPTLWN+++  L + I  C  K  + KI I++ VSG++ PGR+TLLLGPP
Sbjct: 125  AECQVIHGKPIPTLWNTIKGLLSEFI--CSKK--ETKIGILKGVSGIVRPGRMTLLLGPP 184

Query: 186  GCGKTTLLKALSGNLNKSLKFSGEICYNGHKLEEFVPQKTSAYVSQHDLHIPQITVRETL 245
            GCGKTTLL+ALSG  + S+K  GE+CYNG  L EF+P+KTS+Y+SQ+DLHIP+++VRETL
Sbjct: 185  GCGKTTLLQALSGKFSDSVKVGGEVCYNGCSLSEFIPEKTSSYISQNDLHIPELSVRETL 244

Query: 246  DFSARCQGIGSRADIMKEVNKKEKEQGIIPNPDIDTYM------------------KILG 305
            DFSA CQGIGSR +IMKE+++ EK Q IIP+P +D YM                  KILG
Sbjct: 245  DFSACCQGIGSRMEIMKEISRMEKLQEIIPDPAVDAYMKATSVEGLKNNLQTDYILKILG 304

Query: 306  LDICADTLVGDAMRRGISGGQKKRLTTGR------------------------------- 365
            LDICADT VGDA R GISGG+K+RLTTG                                
Sbjct: 305  LDICADTRVGDATRPGISGGEKRRLTTGELVVGPATTLFMDEISNGLDSSTTFQIVSCLQ 364

Query: 366  -----------------------------------------RDRVLEFFEHCGFKCPKRK 425
                                                     R  +  FFE  GFKCP+RK
Sbjct: 365  QLAHIAEATILISLLQPAPETFELFDDVILMGEGKIIYHAPRADICRFFEEFGFKCPERK 424

Query: 426  SIADFLQEVISKKDQPQFWYCKQTPYAYISIDTFSRKFKCWNNLGRKMEEETLKPFDEQE 485
             +ADFLQE++SKKDQ Q+W  +  PY+YIS+D+F  KFK  +NLG  ++EE  KPF++ +
Sbjct: 425  GVADFLQEIMSKKDQEQYWCHRDKPYSYISVDSFINKFK-ESNLGLLLKEELSKPFNKSQ 484

Query: 486  EYSKNNGVFLNGKNSVSKWQVSKACASREFLLMRRNSFVYVFKTSQLFFIASITMTVFIR 545
              ++ +G+    K S+ KW++ KAC+ REFLLM+RNSF+Y+FK++ L F A +TMTVF++
Sbjct: 485  --TRKDGLCYK-KYSLGKWEMLKACSRREFLLMKRNSFIYLFKSALLVFNALVTMTVFLQ 544

Query: 546  TEMKIDLQHGNYYMGALFYSLIILLVDALPELSLTIQRLQVVYKQKELLFYPAWAYVIPA 605
                 D  HGNY MG+LF +L  LL D LPEL+LTI RL V  KQK+L FYPAWAY IP+
Sbjct: 545  VGATTDSLHGNYLMGSLFTALFRLLADGLPELTLTISRLGVFCKQKDLYFYPAWAYAIPS 604

Query: 606  AILKLPLSLVQSLVWTSLTYYVIGYTPEVSRFFRHFLVLFAVHVSSLSMFRMIALVSQHV 665
             ILK+PLS++ S +WT LTYYVIGY+PEV RFF  FL+L   ++S +SMFR IA + + +
Sbjct: 605  IILKIPLSVLDSFIWTLLTYYVIGYSPEVKRFFLQFLILSTFNLSCVSMFRAIAAIFRTI 664

Query: 666  VALTLSSFI-ILQIMIFGGFIITHPSMSPWLRWGFWLSPISYGEIGLSINEFLAPRWQKM 725
            +A T++  I IL + +FGGF+I   SM  WL WGFWLSP+SY EIGL+ NEF +PRW K+
Sbjct: 665  IASTITGAISILVLSLFGGFVIPKSSMPAWLGWGFWLSPLSYAEIGLTANEFFSPRWSKV 724

Query: 726  QGTNSTIGHVILQSRGLDYHQYFYWISLAALFGFAIIFNVGFAFALTFLNGNSYCKDSST 785
              + +T G  +L  RGL++ ++ YW +  AL GF + FN  +  ALT+ N          
Sbjct: 725  ISSKTTAGEQMLDIRGLNFGRHSYWTAFGALVGFVLFFNALYVLALTYQNN--------- 784

Query: 786  FFNFILHSAPGSSPAIISYEKLSQSNCNGGANSVQNPPSSPKTSIESTKGGIALPFTPLT 845
                     P  S AIIS+EK S+          ++    PK +  +  G I LPF PLT
Sbjct: 785  ---------PQRSRAIISHEKYSRP-------IEEDFKPCPKITSRAKTGKIILPFKPLT 844

Query: 846  VVFQHLQYYVDMPSGMRERGFTQKKLQLLSDITGALRPGILTALMGVSGAGKTTLLDVLA 905
            V FQ++QYY++ P G        K  QLLSDITGAL+PG+LT+LMGVSGAGKTTLLDVL+
Sbjct: 845  VTFQNVQYYIETPQG--------KTRQLLSDITGALKPGVLTSLMGVSGAGKTTLLDVLS 904

Query: 906  GRKTSGYVEGEIKIGGFPKVQETFARISGYCEQTDIHSSQITVEESLIFSAWLRLAPNID 965
            GRKT G ++GEIK+GG+PKVQETFAR+SGYCEQ DIHS  ITVEESL +SAWLRL  NID
Sbjct: 905  GRKTRGIIKGEIKVGGYPKVQETFARVSGYCEQFDIHSPNITVEESLKYSAWLRLPYNID 964

Query: 966  SKTKAQFVNEVLETIELDSIKDSLVGIPGVSGLSTEQRKRLTIAVELVSNPSIIFMDEPT 1025
            SKTK + V EVLET+ELD IKDS+VG+PG+SGLS EQRKRLTIAVELV+NPSIIFMDEPT
Sbjct: 965  SKTKNELVKEVLETVELDDIKDSVVGLPGISGLSIEQRKRLTIAVELVANPSIIFMDEPT 1024

Query: 1026 TGLDARAAAIVMRAVKNVADTGRTIVCTIHQPSIDIFESFDELILLKTGGHMIYYGPLGH 1085
            TGLDARAAAIVMRAVKNVA+TGRT+VCTIHQPSIDIFE+FDELIL+K GG ++YYGP G 
Sbjct: 1025 TGLDARAAAIVMRAVKNVAETGRTVVCTIHQPSIDIFETFDELILMKNGGQLVYYGPPGQ 1084

Query: 1086 DSSKVIEYFEHVPGVSKIRENYNPATWMLEVTSSAAEAKLGIDFAQVYKNSALYENNKEL 1145
            +SSKVIEYFE   G+ KI++N NPATW+L++TS +AE KLGIDF+Q YK+S LY+ NK +
Sbjct: 1085 NSSKVIEYFESFSGLPKIQKNCNPATWILDITSKSAEEKLGIDFSQSYKDSTLYKQNKMV 1144

Query: 1146 VKQLSVPPPGSRDLHFSNIFAQNFVRQFGACLWKQNLSYWRNPHYNLLRILHTVASSLIF 1205
            V+QLS    GS  L F + F+Q    Q  ACLWKQ+ SYWRNP +N+ RI+  +  S + 
Sbjct: 1145 VEQLSSASLGSEALRFPSQFSQTAWVQLKACLWKQHYSYWRNPSHNITRIVFILLDSTLC 1204

Query: 1206 GILFWKHGKKLENQQNLFNNFGVMYSSVIFLGIYNCSSVFPNISRERTVMYRERFAGMYS 1265
            G+LFW+  + + NQQ+L + FG MY+ V+F G+ NC++V   I+ ER V YRERFA MYS
Sbjct: 1205 GLLFWQKAEDINNQQDLISIFGSMYTLVVFPGMNNCAAVINFIAAERNVFYRERFARMYS 1264

Query: 1266 SWAYSLAQVIIEVPYIFVQAAIYVIITYPMIGFYGSGWKIFWCFYSMFCALLYFKNLGLL 1325
            SWAYS +QV+IEVPY  +Q+ +  II YP IG++ S +K+FW  YS+FC+LL F   G+L
Sbjct: 1265 SWAYSFSQVLIEVPYSLLQSLLCTIIVYPTIGYHMSVYKMFWSLYSIFCSLLIFNYSGML 1324

Query: 1326 LVTITPNYHIATILSSAFYVMFNLFAGFLVPKPRIPSWWIWFYYMIPTSWTLNCLLTSQY 1352
            +V +TPN H+A  L S+F+ M NLFAGF++PK +IP WWIW YY+ PTSW L  LL+SQY
Sbjct: 1325 MVALTPNIHMAVTLRSSFFSMLNLFAGFVIPKQKIPKWWIWMYYLSPTSWVLEGLLSSQY 1384

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_008443153.1 | 0.0e+00 | 81.16 | PREDICTED: pleiotropic drug resistance protein 3-like isoform X1 [Cucumis melo]
XP_004136753.2 | 0.0e+00 | 81.19 | pleiotropic drug resistance protein 3 isoform X1 [Cucumis sativus] >KGN59364.1 h...
XP_031738520.1 | 0.0e+00 | 81.19 | pleiotropic drug resistance protein 3 isoform X2 [Cucumis sativus]
XP_022154233.1 | 0.0e+00 | 76.05 | pleiotropic drug resistance protein 3-like isoform X2 [Momordica charantia]
KAA0043676.1 | 0.0e+00 | 79.99 | pleiotropic drug resistance protein 3-like isoform X1 [Cucumis melo var. makuwa]

Match Name | E-value | Identity | Description
Q5W274 | 0.0e+00 | 60.50 | Pleiotropic drug resistance protein 3 OS=Nicotiana tabacum OX=4097 GN=PDR3 PE=2 ...
Q9LFH0 | 0.0e+00 | 59.28 | ABC transporter G family member 37 OS=Arabidopsis thaliana OX=3702 GN=ABCG37 PE=...
Q9ZUT8 | 0.0e+00 | 58.86 | ABC transporter G family member 33 OS=Arabidopsis thaliana OX=3702 GN=ABCG33 PE=...
Q7PC83 | 0.0e+00 | 52.53 | ABC transporter G family member 41 OS=Arabidopsis thaliana OX=3702 GN=ABCG41 PE=...
Q8GU83 | 0.0e+00 | 52.64 | ABC transporter G family member 41 OS=Oryza sativa subsp. japonica OX=39947 GN=A...

Match Name | E-value | Identity | Description
A0A1S3B7F0 | 0.0e+00 | 81.16 | pleiotropic drug resistance protein 3-like isoform X1 OS=Cucumis melo OX=3656 GN...
A0A0A0LEL9 | 0.0e+00 | 81.19 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G813820 PE=3 SV=1
A0A6J1DJR4 | 0.0e+00 | 76.05 | pleiotropic drug resistance protein 3-like isoform X2 OS=Momordica charantia OX=...
A0A5A7TNN3 | 0.0e+00 | 79.99 | Pleiotropic drug resistance protein 3-like isoform X1 OS=Cucumis melo var. makuw...
A0A6J1DL52 | 0.0e+00 | 75.10 | pleiotropic drug resistance protein 3-like isoform X1 OS=Momordica charantia OX=...

Match Name | E-value | Identity | Description
AT3G53480.1 | 0.0e+00 | 59.28 | pleiotropic drug resistance 9
AT2G37280.1 | 0.0e+00 | 58.86 | pleiotropic drug resistance 5
AT4G15215.1 | 0.0e+00 | 52.71 | pleiotropic drug resistance 13
AT4G15236.1 | 0.0e+00 | 53.44 | ABC-2 and Plant PDR ABC-type transporter family protein
AT4G15230.1 | 0.0e+00 | 53.12 | pleiotropic drug resistance 2
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR003593 | AAA+ ATPase domain | SMART | SM00382 | AAA_5 | coord: 787..979, e-value: 9.2E-10, score: 48.4; coord: 170..323, e-value: 0.06, score: 22.5
IPR013525 | ABC-2 type transporter | PFAM | PF01061 | ABC2_membrane | coord: 1075..1288, e-value: 2.8E-56, score: 190.3; coord: 413..621, e-value: 3.8E-32, score: 111.4
IPR027417 | P-loop containing nucleoside triphosphate hydrolase | GENE3D | 3.40.50.300 |  | coord: 765..997, e-value: 1.1E-44, score: 154.8
IPR027417 | P-loop containing nucleoside triphosphate hydrolase | GENE3D | 3.40.50.300 |  | coord: 141..312, e-value: 1.1E-26, score: 96.0
IPR027417 | P-loop containing nucleoside triphosphate hydrolase | SUPERFAMILY | 52540 | P-loop containing nucleoside triphosphate hydrolases | coord: 771..987
IPR027417 | P-loop containing nucleoside triphosphate hydrolase | SUPERFAMILY | 52540 | P-loop containing nucleoside triphosphate hydrolases | coord: 146..308
IPR027417 | P-loop containing nucleoside triphosphate hydrolase | SUPERFAMILY | 52540 | P-loop containing nucleoside triphosphate hydrolases | coord: 174..495
IPR013581 | Plant PDR ABC transporter associated | PFAM | PF08370 | PDR_assoc | coord: 626..679, e-value: 2.3E-18, score: 65.6
IPR003439 | ABC transporter-like, ATP-binding domain | PFAM | PF00005 | ABC_tran | coord: 162..307, e-value: 5.8E-16, score: 59.2
IPR003439 | ABC transporter-like, ATP-binding domain | PFAM | PF00005 | ABC_tran | coord: 778..929, e-value: 5.1E-19, score: 69.1
IPR003439 | ABC transporter-like, ATP-binding domain | PROSITE | PS50893 | ABC_TRANSPORTER_2 | coord: 750..1003, score: 14.132471
IPR043926 | ABC transporter family G domain | PFAM | PF19055 | ABC2_membrane_7 | coord: 959..1022, e-value: 1.2E-6, score: 27.7
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..21
None | No IPR available | PANTHER | PTHR48040 | PLEIOTROPIC DRUG RESISTANCE PROTEIN 1-LIKE ISOFORM X1 | coord: 18..310
None | No IPR available | PANTHER | PTHR48040:SF18 | PLEIOTROPIC DRUG RESISTANCE PROTEIN 3-LIKE | coord: 310..1351; coord: 18..310
None | No IPR available | PANTHER | PTHR48040 | PLEIOTROPIC DRUG RESISTANCE PROTEIN 1-LIKE ISOFORM X1 | coord: 310..1351
IPR034003 | ATP-binding cassette transporter, PDR-like subfamily G, domain 2 | CDD | cd03232 | ABCG_PDR_domain2 | coord: 747..985, e-value: 4.46964E-101, score: 318.033
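
The InterPro coordinates above can be cross-checked against each other. The sketch below, in Python, tests whether each Pfam ABC_tran hit (PF00005) falls inside one of the Gene3D P-loop regions (3.40.50.300). It assumes the coordinates are 1-based, inclusive positions on the protein; the `Span` type and its methods are hypothetical helpers written for this illustration.

```python
from typing import NamedTuple


class Span(NamedTuple):
    """A protein region; assumed 1-based with an inclusive end."""
    start: int
    end: int

    def length(self) -> int:
        # Inclusive coordinates, so add 1.
        return self.end - self.start + 1

    def contains(self, other: "Span") -> bool:
        # True when `other` lies entirely within this span.
        return self.start <= other.start and other.end <= self.end


# Coordinates copied from the InterPro table above.
pfam_abc_tran = [Span(162, 307), Span(778, 929)]
gene3d_p_loop = [Span(141, 312), Span(765, 997)]

for hit in pfam_abc_tran:
    enclosing = [region for region in gene3d_p_loop if region.contains(hit)]
    print(hit, "length", hit.length(), "enclosed by", enclosing)
```

Under that assumption, both ABC_tran hits sit inside a P-loop region, one in each half of the protein.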

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
HG10019907.1 | HG10019907.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0055085 | transmembrane transport
cellular_component | GO:0016021 | integral component of membrane
cellular_component | GO:0016020 | membrane
molecular_function | GO:0140359 | ABC-type transporter activity
molecular_function | GO:0005524 | ATP binding
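
For downstream use, the GO assignments above are often grouped by category. A minimal sketch in Python follows; the `go_terms` list simply restates the rows above, and the variable names are illustrative.

```python
from collections import defaultdict

# (category, accession, term name), copied from the GO table above.
go_terms = [
    ("biological_process", "GO:0055085", "transmembrane transport"),
    ("cellular_component", "GO:0016021", "integral component of membrane"),
    ("cellular_component", "GO:0016020", "membrane"),
    ("molecular_function", "GO:0140359", "ABC-type transporter activity"),
    ("molecular_function", "GO:0005524", "ATP binding"),
]

# Group accessions and names under their GO category, preserving row order.
by_category = defaultdict(list)
for category, accession, name in go_terms:
    by_category[category].append((accession, name))

for category, terms in by_category.items():
    print(category, terms)
```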