HG10018299 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10018299
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionauxin transport protein BIG
LocationChr04: 2778699 .. 2801759 (+)
RNA-Seq ExpressionHG10018299
SyntenyHG10018299
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGGAGCAGAGTTTCGTCAAACTCTTAGACACTATCTTCCTTGACGATTCCAGCACCAGCGCCAATACCAGGAAGCATTTTTCTTCCTCCGATCTTCTCCAGCTTCTTCGCTCCGATGATTCTTCGATCAAACTCGGCCTTCGTCAATTCTATTCGATTCTCAAGGCCGGCCTTCGGGACCTCGGTGACGGAAACTTTGCTTTTCAGTCATGGACCGATCCTCAGATCCAAGCTGTTTGTTCAATTGCTCATGCAATTGCTTCTGCTTCTCGATCCCTGACCGGTACGCTTTGCTTCACTCACGAGCTTTTCTCGACTGAGAAATGTGTTTTTCCGTCTTTTTTTCTTTTCTTTAATCTTGATTTCTTTTGGACGCGTTTTATCACTCGTTTTTGAAATTTTTTTATGTGCTTTTGGTTTACTCGTGCAATGTTATTAAACATAATGTCTTGAATATTTTGATCCTTTGAGGACTGGAAACTCAGTTGGAAAAATATTTAAGTGCTTCTAACTCTAAACATTCTTTTTCTTCAAGAAATGAAATTACAAATGTGTAAATTGCGTAAGCCATTTGCTTCTATATAAGCTTCAAATTTATTAAAATTTAATTTCGGCTTTGCTTCTTGATTCTCGGACTTTCTTCTTCTTCTTCTTCAATATTTCTTTATTTTCTCGGGAAATGGATTTCGGTTGGTATTCAATTGATGCATGTTGTCTTTGGTGCATTTGCCTCGACAGTGGATCAAGCTGAAGCTATAGTTGTTGCGGTTATTAAAAAATCACTCGAGTTGGTTTTCTGTTACTTGGAGAAATCAGAGTTTAAGTGCGATGACTTTAGTATTCAGGTTTGTTGCTTCTTGGTAGTTTCTAAATTATGTGGGAGGGGAGGGGTTCATCATTGGTTTCTTTTACTCACCTCATACAAGTTCTGTAGGAGACCTCTGTTTCCCTGTATGGAAGTAAATCTTGAGCCATGGTTTGTTTGTGCTTAGATGTATTGTCAAATCGTATTTGCTTCTTATTTTTTAGCCTTCTCTATTCTATAGTAGTTTTTTACTTTATGTCACTTCATTTCATTTTCCTTTACTACACTTCTTGTGTATCAAATTTTCATTCCACATCCTATTGACAGATTTTGTGATGTATTCTCACGTTGTACTGAGTCAGGATATTAGAATGTTCCTCTCAATGTCATATTTCTTAGTCTTGATGAAACTTGTTATTTATTATTACTATTAGGTCAAATTATAAGCTTTGTTTTGGTTATTGTGTCTACTTAGTCTTGAACTCCTTAAATTACATACCGAGATTTAAATTATGTTTATTTTAGTCCCTACCTATCGTACTGGTTGACGAAATGTTTGTGTTACTCCATTAGATAAAATTTGGAGAAGGCGTTCTGATAAGACATCCGGTTCGACGTATTACAAACGAATGGTTAGATACAAGAATTGGTTGTGGGGTTAGACGATTTGGTTGGATGGGTTAAAATTAAGACTTAGTGTAATCGCATCAAACTCACTGCCAGATTACTACCTCATGAAAATTTTATCTAATTGGATCTTCCTTAGCGTTTGGTCAACTGGTGTTGACAACAAGGACTAAATAGGTCTATACCTCAACGTTTAGAGATATAATTTAAGCCTTTTAAGTTTCAGACAATTTTTTTTTTTGAAATAGGCTTGTATTAAATAGGCACAATGTTTAAAGTACAATAGCGAAATTTACTATTTGTCATCATCACCATCATTATTTGTTTTTGTTTTTAATTTTTTTAATTTTTTTTAAACATACGAGCTCCTTAACCTTATGCTCACGAGAAACTTAAAAAGCAAGATTTTGGTCCTGACATATTTTAAAGATTCCTTTTTATTGAGAGGTAGAAAGTAATGTTTTTACCATACAATCCATAATTGTCTTAATGAAACCTTCTGTTACTGGTAGAATGTGCTTTATTTTATCTTATTCAGCTGACCATTCTAGGGGGTTTCTCCTTTGTATCTCTTTGGATATTTCATTTATCAATGAAATTGTTTCTAGGGGTTTATTAATGAAACTATTAAAAGCTGCTTTATTTAGCTATAAAATGTTATTTTAGCACCAAATGCATCCATGAATATGAAATGATATCAATGTAAAATTACAAAAATTACAAAAATGTATTTTTCAAAAGAACGAGAGAAAGAGGGGAAAATGAAAATAAAAAGAGAGAGAGAGAATATATATGTAAAGAGAAAGGGAAGAAATGAGAGGGAAAAGTGTTATATTAGAAAAATGTATTTTAATAGCTATTTTTGAGGTTTAAATATGAGTTAGTCTTACTTTCAAAAAGTTGGTTGATGTACATTGGCAACGAGCTTGTGACTTGAGAACTCATAACAGAAAAATGGTATCAAGTTTGAGATAATGTAGTAATAATTGAACTTTATTGTTTCTTTTATTTTATTTTTTCTAATAAAAATTTGTTGCCCATCATGATCAACATTTTGTTGATATTTTCATGACATTATACACATATATTAACCACCTTGATTTATGGAAGTTTACTTCATATTCATTACGTGCCTTAATTTTAAGATTAATATTATCAATCGGCAAATTTCTCCTTTTTCTTTTTTTGATGTTTTCATTATTTTTCTTGATATTTCAAGTTACATGTATATATATATTTTTGTATTATATCAAGTACCCTTTGGCATCTTCTGCAGAATAATATGTTAATGATTCTGGAGACTATTTTGGTTGATGGGATGGATAAAGTATCAGACTTTGCACAGCTTTGTGCTAAGAAAAGTCTTATGGACTTGTTAAAATCGACTGGTGGAGACTGTGATGCTACTATTGAGTTCGATAATACCATTGAATGTGGTTCTACAGGTACCCGTTAAAATTTAAATTACTTAATTTGAACTGCCTTATGCATTAATACTAGTTTTTCCTCAATAAAATTGCGTTTGCTTTTTCTTGTATTGTTTTAGGAGTTTGTTGCTCCAGAGAAGAAAAACAGGTAGGTAGGCTTTTAATGACAATAGCTGCTGAATGCGTGCAAGCTGATCAGCTGACCTCTGAATCTGGATTCAGTCAACCGACATTCCTTGAAGATATGAACAAGTTGATTTTCCTTTGCCAACATTGGGCAGTCACGCATTTGGCATGCATTCAACATTTGATTTTGATCTGCAAAGAATTGGTAGTACTCCCGGATGCGCTTGATGAGAAGACAGGAAGTACAAGTTTTCGGAAGAGACTGTCATGTAGTTTGAGGATATTAAAGCTTCTAACCGATCTCTCAAAGAAATTTCCATATATTGAATATGATGCTAAAATGATGCAGGCATTCGCATTGTTTGCCAACTCATTGCCTTGCTTGTTTGGACTATGTTTTGAGTTTGCAAATAGTCATGCTACAGTCGAGGGTAGTTTTGAGAACACCATTTTGTTACTCCTGGAAGAATTTTTGGAGCTAGTTCAGGTTGTATTTCGCAACAGCTATGTTAGTGTGAACATCCAAACATGTGTAGTGGCTTCTATATTGGATAATTTGAGTTCTTCAGTTTGGCGGTATGATGCATCTACTGCAAACCTGAAGACTCCACTGGTTTACTTTCCACGAAGTGTTATGGTTATAATTAAACTCATTCAAGATCTAAAGGGCCATAAATATCATGCTTTCAGTTTTAAAGATCTTGAAACGCATCACACGAGCACTCTTGCTGATTTATCCGTGGACATACCTAAATGCTATGCTCGTTTGGAGATTGTTCCTTTGCATAAGAATTATAAAGTAGAAGAAATTTTGAGAATGATATTTCCTCTGTCAAAACAATGGATGGATGATTTAATGCATCTACTCTTCTTTCTTTATTCTGAAGGAGTGAGATTAAGACCAAAAATAGAGCGATCATTATCCAGTATGAAGAGCAGTAGTACGGTCGAACAAGAAACTGCTGTCTGTCATGAAGATGAAGCACTATTTGGGGACCTCTTCTCTGAGAGTGGCCGCTCTGTTGGATCTGTAGATGGATATGATTTGCAGCATCTTGCTGTCAACTCTACCTCTAGCTTTTGCAATCTGCTTCTCCAAGCTGCTAAAGAACTATTGAGCTTTATCAAGCTATGTATCTTCTCTCCTGAATGGAATGCATCTGTTTTTGATGATGGTTGCAACAAACTTAATCAGAACCATATTGATATATTACTTTCCTTACTAAACTGTGAGGGATGCTGTTCTGATGACAAGTCTTCTGCTAGTTGTCTACCTGCACATGATGAGAGGAAATCTGGCCACATCCATGAAATTTGTTACAGGTTGTTGCATGGTCTTCTCACACGCCATGCATTGCCAGATTCGCTTGAAGAGTACCTTGTGAAGAAAATTTTGAATGCTGAAAATGGAAATTTTGTTTACAATGATCAGACCCTAAGCTTACTGGCGCACACCCTTTTCCGTAGAACTGGTGTTGCTGGGACGCTGTTGAGGACCCAAATATACAGGCAATTTGTGGAATTTATCATTGAGAAGTCCAAAACTATTTCCTCAAACTATTCCAGCCTCCAGGAATTTATGGGGACTCTTCCCTCTGTCTTTCATATTGAAATTCTTCTGGTGGCATTTCACTTATCTTCTGAAGGAGAAAAGAGAGAAATTTCGAGCTTAATTTTTTCTTCCATTAGGGCAATTGATGCTCCATCCACATTTTCTAACTGTACAGAATTGTCAATGTGGGGTTTATTGGTTTCAAGGTTGATTATAGTACTTCGACACATTATTTTTCACCCGCATACATGTTCCTCTTCGTTGCTTTTTGATTTTCGATCTAAGTTGAGGGATGCTCCTGCATTTTCTTCTAGTTTGCCTTATACACTGAATGATCATTTATCATCTTGGGGTGCAAGTGTCGCTAAGAATATAATTGGTTCATCCGTGGAATCCAAGCCCTTCTTTCATAGCTTGATCAACCAGTTGATTGATATTTCTTCATTTCCTGCTTCACTACGCCAGCATGATTTGACAGTAGAATGTCCATGGTTTAATGCTGGTGATATATTTTCGACATTCTCATGGATTTTGGGGTTCTGGAATGGTAAACAAGCTGTTACGGTTGAAGACCTCATCATTGAAAGATATATTTTTGTTCTCTGCTGGGATTTTCCTTCCATGAATGCTTTATCACATGGAGGCCCATTATGGAGCGACCCAGACACACTGGACATTTCTAATACCACATGCTTCTTTTACTTCAGTTATTTACTCCTAGATCATGGTGGTGTTATTGGCGAACACATGAAGTTTCCTCAAGTTGTGATTGGTTTGCTTCAGCGTTTGCATGGTGGGAGTATCCTGGAGGACTTCAAAGCTTTGGGCTGGAATTTTTTAAGAAATGGAGCATGGCTATCTCTGGTTCTTTCCTTCCTCAGTGTTGGGATATGGAGATACTGCAGTAAGAATATGATTCCAACAGTGGGTTCTTTATTGACAGATACCACAGTTACAGATAATGAGCAGGCCAATTTTGCTGAAAGCTTAATTTCCTCGGTGATTACCGATAGCCAAGTTTCAATTTTAATCAGGGAGTTGTCATCTGTATTGAGCATGTATTTACAAGTGTATCAGAAAGCCTTTGTTGCTACTCTTAGTAGTAGTAATGATCATGCTACTGAGTTTTCCCCACTCTTGCTCTTTAAGCATTCTGAATTTGATAGCTGTGTCCAGAACAAGACCCTTGAGAACTATGGGACAACATCCTGCTTATTGGAATCTGTTTTTAACCTCATGTCTAGGTTGGATGAAATAGTAGACAAAAGAACCCTTGGGTTCTTATCAAGGTTCTGTTGGGAATCGATGTTTCATGGTTTTCCCTCTCATTTGGAAACTTCCAGTGGAATTCTACTTTCTTGTGTCCTTAGCATAGGAAGGATTATTTCTGTTTTAGCTGGGCTGCTGAGGATAGTAGATGTTAAACGTAATATCATTTTGGAGACTGAGGTAACTCGTGGGATTCTTGATGCAGTAATGACTATAAAATTTGACAAGACCTTTGAAAGTGTTCATGGCCTGTGTGAAGGTATATATCAGAGTTTGAATAAGGAATTGGATGGATGTTCTTATGGGGTTCTGTTTCTGTTGAAACAGCTTGAGGGGTACTTAAGACACATGAATATGAGGGGGGCGAGTGATAGCACTATTCATGAATTGGTAATTGTTAAAGCTACAGATATCATGGACAACCTGCGGAAAGATGTTTCGAAGTCTTCTGTTTTCCAATTCTATCTTGGTGCTGAAGTTGTACTGGAGCAGGTTAGAGAACTTTATACATTTCAACATGGTAATTTGTTGGTTCTTCTAGACTCCTTAGACAATTGTTGCTCCGAACTAGTTAACTTGAAGGTTCTTGGTTTCTTTGTGGAACTCTTGTCCGGAGAGCCATGCCCTAAACTTAAACAGGAAGTACAGAATAAATTCCTTAGTATGGATTTGCTTAGCCTATCACAATGGTTGGAAAAGAGGATTTTTGGTTTAGTAGCCGAAGATTCAAGCGGAGGCAATGTGAAGGGATCTTCTATTTCTCTTAGAGAATCATCCATGAATTTTGTATTTTGTCTCATATCATCACCCTCAGAACCTCTGGCACTTCAATTGCAGAGTCACATTTTTGAGGCTGCACTAGTATCACTTGACATGGCATTTTTGCGATTTGACATCAGTGTTTCCAAGTCCTATTTCCATTTTGTTGTTCAGCTATTGAAAGGGGACAAATCAATGAAATTACTTTTGGAGAGAATTCTCATATTGATGGGGAAATTGGCTAGCGATGAGCGCCTGCTTCCAGGGCTGAAGTACCTCTTCAGTTTTCTCGAAATGATTTTGATTGAGAGTGGATCTGGTAAGAATGTTTTTGAGAGACCTTCTGGCAAACCTCTGTCGAGGTATGCACCTGAAGTTGGACCTCTTTCTTCTAAGTCAGTGGGGCCTAGGAAGAATTCAGAGACATTGGTTCTTTCTTCCAATCAAGAAGAGGGTCCTGCATCTTTTGAATGTGATGCAACTTCTGCTGAGGAAGATGAGGATGATGGAACTTCCGATGGTGAGGTAGCTAGTCTGGACAAGGATGAGGAGGAGGACACAAACAGTGAAAGGGCACTTGCTTCAAAAGTCTGCACATTTACATCCAGTGGCAGCAATTTCATGGAACAACACTGGTACTTTTGTTATACATGTGATCTGACTGTCTCGAAGGGATGTTGTTCTGTGTGTGCAAAAGTCTGTCATCGTGGTCATCGTGTTGTTTATTCCCGGTCTAGTCGCTTCTTTTGTGACTGTGGTGCTGGGGGTGTTAGGGGCAGCAGTTGCCAATGTTTGAAGCCTCGCAAGTATACTGGACATGGCAGTGCCCCTGTACGTGGTGCCAGTAACTTCCAGTGCTTTTTGCCTTTCTCTGAGGAGGGAGATCAACTTCCTGAAAGTGAATCTGATCTGGAAGACGATGTGTCAGTTACAGACACAGATAAGTGTCTCAGACCCTCTGTTCCTAGGGAGCTTCTAGATGGTGTTTCCGTTTTACTTGAGGAACTGGATGTTGAAGGAAGGATGCTCGAGCTTTGCTCGTGTTTATTGCCTACTATAACTAACCAAAGGGACCCAGACCTTTCAAAAGACAAGAAAATTATTCTCGGTAAAGACAAGGTGCTATCATATGGGCTTGATCTCTTGCAATTGAAAAAGGCATATAAAGGTGGGTCTTTGGACCTTAAGATTAAGGCAGAATATGCAAATGCCAAGGAGCTCAAATCACATCTAGCCAGTGGTTCTCTTGTGAAATCTCTGCTCAGTGTCAGTATCAGAGGTCGTCTTGCTGTTGGTGAAGGTGATAAAGTGTCTATTTTTGACATCAGGCAGTTAATAGAACAGACTACTGTTGCCCCCATGACGGCAGACAAGACTAATGTTAAGCCACTCTCCAAAAACGTTGTTCGTTTTGAAATTGTACATCTTGCTTTTAATCCCACAGTAGAGAACTATCTTGCTGTGGCAGGCTATGAAGATTGCCAAGTCTTGACTTTGAACCACCGTGGTGAAGTTGTTGACCGTCTTGCTATTGAACTCGCTCTGCAAGGGGCTCATATTAAACGAATGGAATGGGTTCCAGGATCTCAAGTCCAATTAATGGTGGTTACAAACAGGTTTGTCAAAATATATGATCTATCCCTGGATAATATCAGTCCAATGCACTACTTCACGTTGCCAGATGACATGGTAGTTGATGCTACACTGTCCACAGCTTCACAGGGAAGGATGTTTCTTATTGTTCTTTCAGAAAATGGAAGGATATTCAGACTTGAGTTGTCAGTGCTAGGAAATGTTGGAGCTACACCCTTGAAGGAGATCATTGAGATTCAGGGCAGAGAAATGAGTGCAAAGGGATTGTCATTGTACTTTTCTTCATGTTACAAATTATTATTTCTTGCATATGCAGATGGCACCACATTGGTTGGTCAGTTAAGCCCTGATGCAACAAAATTGACCGAGATATCGGTTATATACGAAGAGGAACAAGACAGAAAACTCCGGCCTGCTGGATTACACCGTTGGAAGGAGCTGTTTGCTGGCAGTGGCTTATTTGTTTGCTTTTCAAGCGTCAAATCAAATTCGGCTTTGGCTGTATCTATGGGGGCTCATGATATTTATGCTCAAAACTTGAGACATGCGGGGGGTTCATCTTTGCCATTAGTTGGCATAACTGCATATAAGCCTTTATCCAAAGATAAAATACATTGTCTTGTACTACATGATGATGGTAGCCTTCAAATATACACACATACTGCTGTTGGAGTGGATGCTAGTGCATACGCAACTGCCGAAAAAATCAAGAAGTTGGGTTCTGGCATTCTCAATAACAAGGTTTATGCCAGTACAAATCCAGAATTCCCACTTGATTTTTTTGAGAATACTGTTTGCATCACTGCAGATGTGAGATTGGGAGGCGACGCTATTCGAAATGGTGATTCTGAAGGAGCCAAACAGAGTTTGGCATCCGAGGATGGTTTTCTTGAGAGTCCCAGTTCTTCAGGTTTCAAGGTATTGTTTCACCTTGCACATAATTATGCATTTCAATGCTTACTTTTAAAATTTAATCAACGAAATTTAGAATATGAATGATGATGTGTTTTTTTTTTTTTTTCTGTTGCTGTGATATCTTATTTTATTACTTTTATAATCATTCGGGATGCATTTTCTTTCAGATGAGATAATTACTTTCCTAGAAACACGAACAGTACTTTCTTGAATCACCTACTTTTTGACTGTCATATGGCTTCTCGGATTTGAAAATGTATCTTCTTAAGAGTTTGGATAGTTACCTCTCCTGGGAACATGAGTAGCTCACTTTTTTTCCCCTCCTGGGAAGGTCACGTCATCTGTTTGAAATGCAAGCTAAAGTCCTTTGGCTCAACTTTATTTGTGCTTTCCTTTGGAACTTGGATTAATGGAAGGTTTTTTCTTGATGAAAATATCTGTTATTGTCGGATCAAATGACTTCTATTGCTGTTTCTTGGTTTAGCTCTTCTTTATGTAACTTACAGTACAACTTCTCTGGGTTGGTTCATGATGTTCCAGATTTTAATCAGTCTGTCCTAGGAATCTTAATGGTCAAGAGGTAGAACCTTAATTCTCCAAACCTGTCATTGGCAGCATCACAGACACCAAGACAACCTGAGAAAATTTACAGCAGATTAATAATGCAGATGAGCAACGAGGAAAAGAAAGAAACTGTGTGTGTGTGTGTGTGTGTTTTTGGATATGAAACAGACAGATATATTCATATTGAAACAAAGAATAGCTTAGGGGCAAGGGGTAGAGAACCCCTTTCCAAATGGGAAAACTATTACCAAAAGCCTTCCAATTTCTAATAATCAAGGCAAGGCTATAAAAACTCTTAAAATCTTTGTGACTTGAACTCCACCAAGAAGCCAAGTGTCAGAGCCGCCCCGTGACACCAAGTGCCGTATACTACAGACAAAAAAATCTTCCAAACTGAGGCGTTCTCAAAAATTCGGGAAAAACTGAGAGATGAAATACCCTAACCACTTGCAAACAAATGAGCTGGCCCTTGAATTGATAAACCTCTGCAAAAGTAGATGTTCAGGGTTATGCTACTGTAAATGAGACCGATTGTCCTCTTAGACCCTTCTTACAATAGTTTTTCCTGTGTCCCTTGGTCCCTTTTAGCAACGCCAATTAGGATTTATGTAGATAGGATGGACTAGATGTATTGTAAATTTGCAGCAGATAGTATCCGTCAGTCCCTTTTATATACACAGTACCTGCTAAACATACAGTACTTATAATCTTTGGTATTTTTGCTTGGGACCATTGTGAAGCATCAGTTTGAAATTGAGGTCCTGAAATGCTTGGTTATGTTATAAAATCTACCTCAGGGTTAAATTGCTATTTAATTGATTAATTAATTATAATAGTAATTATTATAGTTTTTTCATTTATTATTAGGTGTATTTCTGGGTTGGCTAGGTTGGATTGAAGGGGTTTCTTAAATCAACCACGAGTATTTGGGTTATAAGTCGGTCGACTATTTAGCCACTTCAAATAACCTATGTTTGTACCTCAACTCAACCCAATCCAATCCAACATAATCTTACGCACATGCGAACTTATTAGTTTCGGTTACAATATTTAAAAACTACAAAACGTTACTTTAAAAGCTAATTTTCACCCTGGTTGTTGAATGTCAAGGAAACATTAACATTTTATTTTGAATTTTACTGATAGATTGCATAGAAAACGTCTCACTGTGGTTTAGTCCATTGGTGTATGCAAACATTAAGTTTATATATACATGTGGTTACTCTTTTCTTTTTTTGAAACGGAAATTGATGGAATAAAAAGAAGCTAATCCTCAATATATAATGAGACATAAAAAGCAAGATAAGACAAATACATCCAATATAGCCCAACCCTGGACTTACCCAGAAAATTAAAGCAAATACAAACAGATACAACCCATTGTGGATCAAATTGGCTCTTTAAGTGAAAACTTCAAACACCACTGAAATTGAATCTCTGAGAACAAGCAACCATATGATGAGGACAACTGGTAGGAGAGAAGAGCCCTCTGGTTACTCTTTTTCTGAATAATATATCTCTTCCTCATGTGAGAACACGTCAAGTTCACAAGAAATTGATTTGTATTTTTAGCTGCATAATAAAACAACAGGATGGAGTTTTGTAAATTAATAATTCTTGTCTCTGATTGAAGCCTTCTCTATCTAACTATCAACTGTTCTTTCCCTGCTCAGATCACTGTCTCTAATTCCAACCCTGATATTGTTATGGTTGGATTTCGCATCCATGTTGGTAATACGTCTGCGAACCATATACCTTCAGAGATAACTATTTTCCAGAGAGTTATAAAATTAGACGAGGGCATGCGATCATGGTATGATATACCATTTACTGTTGCTGAGTCTCTTCTTGCTGATGAAGAATTCTCTGTAACTGTTGGGCCAGCATTCAATGGTACTGCACTTCCTAGGATAGACTCTCTTGAAGTGTATGGTCGAGCAAAAGATGAATTTGGTTGGAAAGAAAAATTGGATGCTGTTCTGGACATGGAGGCACGTGCACTTGGCTCCAATTCCTTGCTTGCCAGATCTGGAAAAAAGAGGCGATCCATTCAATGTGCTCCTATTCAACAGCAGGTGTTAGCAGATGGTCTGAAGGTCTTGTCCAGTTATTATTTGCTTCGTAGATCACAAGGATGCCCAAAACTTAATGATGTGAATCAGGAGCTGACTAAACTGAAGTGCAAGCAATTATTAGAAACAATATACGAAAGTGATCGGGAGCCCTTGTTGCAGTCTGCTGCTTGTCGTGTCCTGCAAGCTATCTTCCCGAAAAAAGAGATATACTACCAAGTAATCATTTAATTGGTGTTGTTTTACTTGTATTGTGATAAATTCTTTTGAACTTACTCCTGAAATTCCCCCCTCGTAATTAGGTGAAGGACACCATGCGTCTGACTGGTGTGGTGAAATCAACATCAGTGCTCTCCTCTAGGCTTGGAGTTGGAGGTGCTGCAGGAGGATGGATTATTGAAGAATTTACATCACAAATGCGTGCAGTTTCTAAGATTGCCCTGCATCGTAGATCTAATTTGGCTTGTTTTCTGGAAAGAAATGGTATTTGTACTTCTTTTCTACTTGGTTCAATTCCTTTTATAATTGTTTAAAGTTTTTGATAATCGTGAAAAGTTGAGGTGTAGGAACTTTTTTTTTTTTTTTTTTAACATTTCATGTGAATAGTCAGCCAAAGTGGGTTGTGAAGTGTGATGGATTACATTAGAGATTATTATTATCTGCTTAAAATCAAAGGAAACCAAAAACCTTGGGTGGGTTTGATTATTAGATTGAGCAAGAGCACCCTTTTCATGCATAGTTGCTGCTCTTTTGAATCATTATGTTTAAATCCATAGAGTGGCCATGTTTTCTGTTCATTTTATTATTGGCCTTTTTTTCCAAATAAGAAACAATTTCATTGGTGTATGAAATTCAGAAAATAGATGGATAATCTACAGTGATTACAGAGGGTTTTAGAAGAAAGGGAAGCTTAACTATATAAAGCAAAGAGATAAGACAATTTACACCAAGAAATTGAACAATAAATAACAAGATCAAACACTTCAAAGGTGTGCCCATTGTTCAAGAAAACCCTTTGATATCTCTCTTTCCAAATAACCCAAAAGAAGGCCTTGTTGATGTGTACTTTTTGGTCAATTTGCTTTGTGATAACTGAGTGTAGAAACTCTAATATTTTATTGTTGTGTAAGAGCATGAAATACCATGGCAAGTTATCAATATCATTATCTGTATTTTCCTCTTATTTTGCGTTATCCATCTTATGGTGAATCTATAGTTTCAATCTATAAACTGCTATTATTTTTTGAATGTTCAAATGTGTTGGGATTCTATTGGAATGGTCTTTATTTTGTTGACATATTTGAATTAGTCTTACCTGTCGGGTTGAACTGGGAGATAGCAGTTTAGCCTCAATTTTATACTGTAGAAAAGGGTTAACTAATCAGTTGTACCATGATTTTGTACTTATTTGGTCCATGCTCCCAACCATGTTCAAACAACATCACATCTATTTCCAGGCTTTAAGTTATTGTTCCTTTTAAAGCATGGCACTTCCACTGGGTCTTCCTTCGAAAATATTGTAAAGTAGATGGATATGATATTTATGTGCATTGTTAAATTGTTCTTTAGGCCATTGATTAGAGAATCAAGTAGAATGTTTGTCATATATATTTTTTGTGGTCGCCTTATGGTTAGAATCACTAATGTAGGTTCTCAAGTGGTGGATGGACTCATGCAAATTCTGTGGGGAATTTTGGACTTGGAGCAGCCCAACACTCAGACTTTGAACAACATTGTTATTTCCTCTGTTGAACTTATTTATTGCTATGCTGAGTGCCTGGCATTGCATGGCCCAGACACTGGCAGGCACTCTGTTGCACCTGCTGTTGTGTTATTTAAGAAACTTCTGTTTTCCTCCAGTGAGGCTGTTCAGGCTTCAAGCAGGTTTTGTTTCGTCAATTTTATTCTTATTCTGAAATTTTATTCTAAGGTATTTACCTACGATTTATCCTTCCCGTTTCTTTTCAGCTTGGCTATATCTTCAAGGTTGCTTCAGGTTCCATTTCCAAAGCAAACAATGTTAGCTACTGATGATGGTGCTGATATTCCATTATCTGCACCTGTACCCACTGAAACAACTGGTACCAATCCCCAGGTCATGATTGAAGAAGACGCTGTCGCTTCTTCTGTTCAATACTGTTGTGATGGTTGCTCCACAGTTCCTATACTGAGGCGACGATGGCATTGTACAATTTGCCCTGATTTTGATTTATGTGAATCATGTTATGAGGTACTTGATGCCGACAGGCTTCCTTCCCCTCATTCTAGAGATCATCCTATGACTGCCATCCCAATTGAAGTGGACTCACTAGGAGATGGAAATGAATATCACTTCGCCACAGAAGATATCAATGATTCAAGCTTAACATCATTAATTCCAGATATTAGCGTGAAGAACCCAGTGTCATCAATTCATGTCTTGGAGCCAGCTGATTCTGGGGATTTTTCTGCCTCAGTGACTGATCCAGTTTCAATATCAGCTTCTAAACAAACCGTTAATTCCTTGCTTCTCTCTGAGCTCCTTGAACAGTTAAAAGGATGGATGGAGACAACTTCAGGTGTTCAGGCTGTTCCTGTTATGCAGCTTTTCTACAGATTATCATCCACAATGGGCGGACCTTTTATGAACAGTTTGAAATCTGAAAACTTGAACTTGGAAAGACTTATTAAATGGTTTTTGGATGAGATTAATCTCAACAAACCCTTTGAAGCAAAAACCCGTACTTCATTTGGGGAAGTTGCAATCCTTGTTTTCATGTTCTTCACTTTGATGCTAAGAAACTGGCACCAACCTGGTAGCGATGGTCCAGGTGCCAAACCTAGTACTACGACAGATACACATGACAAGAATTCTACACAGGTTGCACCATCTACTTCAGTGACTGCACAATCTTCCATGGATGATCAAGGAAAAAATGACTTTACTTCACAACTACTTCGTGCTTGTAGCTCTATTAGGCAACAATCTTTTGTAAATTATCTTATGGATGTGCTGCAGCAGCTTGTGCATGTCTTCAAGTCATCCACAATTGATTATGACAGTGGACATGGTTTTCATAATGGTTCTGGATGTGGAGCTCTGCTAACAGTTCGTAAGGATCTCCCTGCTGGCAATTTCTCTCCATTTTTTTCAGATTCTTATGCAAAAGCACACCGGACAGATCTTTTTATAGACTATCACAGGCTATTGTTAGAAAATGCTTTTCGTCTTGTATATACATTGGTTCGACCAGAAAAATATGACAAGACATTGGAGAAGGAGAAGGTTTATAAGATTTATAGCAGCAAGGATTTGAAGTTGGATGCCTATCAAGATGTTCTTTGCAGTTATATTAACAATCCAAATACTAGCTTTGTTAGAAGATATGCAAGAAGGCTTTTCCTCCACATTTGTGGCAGCAAAAGTCACTATTACAGTATTCGAGACTCTTGGCAGTTTTCGACTGAAGTAAAGAGACTTTTCAAATACATAAACAAGGTTGGTGGTTTTCAAAATCCTATGTCATACGAGAGAAGCGTAAAGATAGTGAAATGCCTAACAACTATGGCTGAAGTAGCTGCCGCAAGGCCTCGAAATTGGCAGAAATATTGCTTACGACATGCGGATGTGCTGCCATTTTTGCTGAATGGAATTTTCTACTTTGGAGAAGAGTCTGTTGTTCAAACTCTCAAACTTTTGAATCTTGCCTTCTATACTGGAAAAGATATTGGCCATTCTGTACAAAAGTCTGAAGCAGGAGATACTGGGACTAGTACAAATAAATCTGGTACACAAACTGTGGATTCAAGAAAGAAAAGAAAAGGTGAAGATGGAAATGATTCTGCGTTGGAGAAGTCCTATTTGGATATGGAGATCATGGTCAATATCTTTGTTGATAAGGGTAGTAATGTCTTGAGCCATTTCATTGATTGCTTTCTTCTTGAGTGGAATTCAAGCTCTGTCCGGGCAGAGACCAAAGGTGTTGTTTGTGGCATTTGGCATCATGGAAAGCAGACATTTAAAGAAACTCTATTGATGGCTCTCTTGCAAAAGGTTAAAACTCTTCCTATGTATGGTCTGAACATTGCTGAATATACAGAACTGGTCACATGGTTGCTGGGAAAAGTTCCTGATGTTGGTTCTAAGCAGCAGAGTTCTGAACTTCTGGATAGATGCTTGACCTCTGATGTAATTCGATCAATTTATCAAACACTTCACTCACAAAATGAGCTGTTAGCTAATCATCCAAATTCACGCATATACAATACTTTGAGTGGTTTAGTTGAGTTTGATGGTTACTATCTAGAGAGTGAACCTTGTGCGGCTTGCAGTTCTCCTGAGGTGCCTTACAGCAGGATGAAACTTGAGAGTCTTAAATCTGAAACAAAATTCACTGACAACCGCATCATTGTTAAATGTACAGGGAGTTACACAATTCAAACTGTTATAATGAATGTTCATGATGCTCGGAAGTCCAAATCTGTGAAAGTTTTGAACCTGTACTACAATAATCGGCCCGTGGCAGACTTATCAGAGTTGAAAAATAATTGGTCTTTGTGGAAGCGTGCAAAAAGTTGTCATCTTGCATTCAATCAAACTGAACTAAAAGTAGAGTTTCCCATTCCAATCACTGCATGTAATTTCATGATTGAGCTGGATTCTTTCTATGAAAATCTTCAAGCTTTGTCCCTTGAACCTTTGCAATGCCCTCGATGCAGTCGTCCAGTCACTGATAAGCATGGAATATGCAGCAATTGCCATGAAAATGCATATCAATGTAGGCAATGCCGGAACATAAATTATGAGAACCTTGACTCATTTTTATGCAACGAGTGTGGATACAGCAAGTATGGAAGATTTGAATTTAATTTTATGGCAAAGCCAAGTTTTACATTTGATAATATGGAGAACGATGAAGATATGAAGAGAGGTCTTGCTGCAATAGAATCTGAATCAGAAAATGCTCACAGGAGATATCAACAACTCTTAGGGTACAAGAAACCCCTGCTGAAAATTGTTTCAAGCATTGGTGAGAATGAAATGGACTCGCAACAGAAGGATTCTGTTCAGCAAATGATGGTCTCACTTCCAGGACCATCGTGCAAGATTAATCGTAAAATTGCTCTCCTTGGGGTTTTATATGGTGAAAAATGCAAAGCAGCCTTTGATTCTGTCAGTAAAAGTGTCCAGACGCTTCAAGGTCTTCGTCGGGTTTTAATGACGTATTTGCACCAGAAACATACTGATGACGGGTTTCCGGCTTCAAGATTTGTGATTTCTAGATCTCCTAATAATTGCTATGGCTGTGCGACTACATTTGTAACTCAATGTCTTGAGATATTGCAGGTGTTATCAAAGCATCAGAGTTCGAAGAAACAACTTGTTAGCCTGGGTATATTATCTGAGCTGTTCGAGAATAATATTCATCAAGGGCCCAAAACTGCTCGAATACAAGCTAGGGCAGTTCTTTGTTCTTTCTCCGAGGGTGACGTAAATGCAGTGAATGGACTGAACAATCTAATCCAAAAGAAAGTCATGTACTGCCTTGAACACCATCGTTCTATGGACATCGCATTGGCAACTCGAGAAGAGTTATCACTGCTCTCAGAAGTTTGTTCTTTGGCTGATGAATTCTGGGAAGCTAGATTGCGAGTTGTTTTCCAGCTGTTATTTTCATCCATTAAGTCGGGTGCCAAACATCCAGCAATTGCTGAGCACATCATTCTTCCATGTCTGAGGATCATATCTCAAGCTTGTACTCCTCCTAAATCTGATACTGTAGACAAGGAGCAGAGGATGGGAAAATTGACATCTGTTTCACAAAATAAGGATGAAAACGCTACAAATATATCTGGATCTTTCAGTGGACCTGTTAGTGGGAATAAGTCTGCACCTGAATCACTTGAACATAATTGGGATTCTTCTCATAGGACTCAGGATATTCAATTGCTGAGTTATGCAGAGTGGGAAAAGGGAGCATCATATCTTGACTTTGTTAGAAGGCAGTACAAGGTGTCTCAGGTGTGTAAAGGTACAGTTCAAAGATCTCGAACACAAAAAGGCGATTATTTGTCCCTGAAGTATGCGCTTAAGTGGAAGCGGTTTGTATGTAGAAATGCTAAAAGTGATTTGTCAGCTTTTGAGCTGGGGTCATGGGTTACAGAACTTGTGCTATGTGCCTGTTCCCAGTCTATAAGATCGGAGATGTGTATGCTGATAAGTTTGCTTTGTGCTCAAAGTTCATCAAGACGATTTCGACTATTGGATTTACTGGTGTCTCTGTTGCCAGCAACTCTTTCTGCTGGCGAGAGTGCTGCTGAATATTTTGATTTACTTTTCAAGATGGTAGATTCAGAAGATGCGCGCTTGTTTTTGACTGTTCGAGGGTGCTTACGTACAATTTGCCAATTAATTTCCCAAGAAGTGGGCAATGTTGAGTCTCTGGAGAGAAGCCTCCATATCGATATTTCCCAGGGATTTATTCTTCACAAGCTCATAGAGCTCCTTGGGAAATTTCTAGAGATCCCCAACATTAGATCAAGGTATAACTTCTTCCTTAAGCATCTGTTTTTTAGACTACTTCTGTTTAAGCTTATACAATGTGATCTTATTGTTAGATTCATGCGGGATAATCTACTCTCTGAAGTTCTTGAGGCTCTCATTGTGATTCGTGGTTTGGTAGTACAGAAGACAAAATTGATTAGCGATTGTAATCGGCTTTTGAAAGATCTCTTGGACAGCCTTCTGCTGGAAAGCAATGAGAACAAGAGGCAATTTATTCGAGCCTGCATTTGTGGCTTGCAAATCCATGGAGAGGAAAGGAAAGGGCGGACTTGTTTGGTACGATTAATCTTGAGAACTGGTCAATCAATTAAGGCCCATTTTCTCCTAGACTGTTCTGCTCCTTAGGAAACTATTTATCTTACCGAAATAAATTGAAAATATTAGCATCTTCTATGATTTATGCTATTTATACTCTAATAAATATTCAGATATCACTTTCTAAAAATATGCCCTTTCCCTTTTTAGTTCATCTATTTTTTCTGTTTGACGCCGTACGTAGATTGGCAGCTTACTTTGAAATATTATGTGCAGTTTATTCTAGAGCAGCTCTGCAATCTGATCTCTCCCTCAAAGCCAGAGCCAGTGTATCTCTTGGTTCTAAACAAGGCACACACGCAAGAAGAATTTATTAGGGGATCCATGACAAAGAATCCTTACTCCAGTGCTGAGATTGGTCCATTAATGCGTGATGTCAAAAATAAAATTTGTCACCAATTGGATTTACTTGGTTTTCTTGAAGATGATTACGGCATGGAGTTGCTAGTTGCTGGAAATATAATTTCTCTTGATTTGAGCATAGCACTAGTCTATGAGCAAGTGTGGAAGAAGTCTAACCAGTCTTCAAATGCCATATCTAATACTGCACTAATATCTACCACCGCTGCAAGAGACTCTCCTCCTATGACAGTTACTTACCGGCTTCAGGTTTGTTTAAATAATCCATGGCCAGCCTTTTTGGATAATGTTTACCTCATGTTTCTTATTTCCTTGCTATGGGAATCCCTATATTTGAGTTTTATGTTGAAGATTTCATGAAATATGGGTTAGAGAAATTTGTAGGTGAGATCCTTAGTGATACCATTAATTTTCTTGGGGGTTGGATATTATATCCAGGGGCTAGATGGTGAAGCAACGGAACCTATGATTAAGGAATTGGAAGAGGACAGAGAGGAATCGCAAGATCCAGAACTGGAATTTGCTATAGCAGGTGCAGTTCGTGAGTATGGAGGTCTGGAAATTTTGTTGGGCATGATCCAGGTGTGTAGAGCAGACCTTTAATCATCGCTTTATTTTGTTTTCATACCTTGTTTATTGAGTCTTCTAGTTTTACCAAAAATGTCGGGGCGATGTTTGTTTTAAAAGTATATTTACTATAGTTCTGTACTTTCACATTCTATTTTAGTCCAACCTTTCTTCTTCTTATTTACTTGGTGAGTAAACGCCTAAGTGAACAAGTATTTGGAGTTATATATTGTGGGTTAAGGTAGATCAAGATTTTGGTCAAACATTTACCATTAAGTAAACTACAAGGGCTCAAATTAGCTTTTCTCTTGGAATATTAAAATAGAACATTGGAAGTTTAAGAACTAAAATAGAATGATACATTGAGTACTAAAACGGAGTCAGTGTCAAAGTTCATGGTAAGCGTTCGCTCTTTCAACCTATTGAACCTAAACTTATGGCTTAAAATTTGGGTTTTAAAATTATGTTGCTTCAATTTATACCTTAATATTCTTTTTTCTTTCTTTTTCCCCTTCCAATTCACAATCCACCCAATGAAATGTACCACGCTGTATTTCATTCTTTCATGACTGCTATGACATACCCCTGCAATTTTCATTTGTTTTTTCTTTAATATGTTCTGTACTTTCATTTGAAAATTTCTCAGTTCTCTATACATGCACTATTCGTTTACCATCTACATGGGCCACGCATAGTAATTTGTTAACAAAAAACTTGGTTCTTTGTGAAGTTCTCTCTTTTTCTTTTTTCCTTTTTCCTTTTTCCTCTTCTTCTTAGGTATAAGAAATCCCCCTACTAGTTAGAGGGTCTTTCTGTAATTTCCTTTGGAAATATAGTATTTGAAAACTTATATAGACAAAAAATGTTAAGAATATGATTTTGATAATTGAATTTACTGTAATTTTTTATGCTATAGTTAATACGACTTGGTATAAATTTACTGTAATTTCACCACTATATCTTTACCATACTCATTCCCAATTCTACGAGTGGGGAAATATTAAGAATTCTACATTAATTGTAATAAATTAATCTTAACCCATTGACCTAAGCTCTTGGGTTCGGTTACCGCTAATGTTATATTTTATTAGCACACACCTTGCTTGTGGGCTTGAAAATTTGTACAAGGTCCAATTAGTGGTTTTCAACTTTATTGGGTAAGAAATTTTATATTTCAATGTTCACGGCAATTAAATAATGCTTCTAATTAGCATATTAATTGTTCAATCAAAATTATTTGTTTCATCTCTTCAATTAAAAAAAATGTCTCAAATTCTATTTACTTGATCCCCCTCCCTCCCTCCCTCCCTCTTCTTTGTTTGTACGCTTCAATCTAGTGTAGTGGTATGTTCAATGTCAACTTCCTTGCTTTGCCAACTGTATGCCTACTTCTATATTCAGTTATATGGTCTTTCCATCCTATATCTAACCACATGTGGGGTAATTGGAGTTAAGCTCGGTCCTTTTTCCTGGTCTTACTTTTTCTTCTTTATTTGAACCAGCGCATATGGGATAATTTCAAGTCAAACCAAGAGCAGTTGGTTGCAGTTCTTAATCTTCTTATGCATTGTTGCAAAATAAGAGAGAACAGGCGTGCTTTATTAAGGCTTGGAGCTCTCGGATTACTTCTAGAAACGGCAAGGCGTGCCTTCTCTGTGGATGCCATGGAGTCAGCTGAAGGCATTCTCTTGATTGTGGAGAGTCTAACAATTGAAGCGAATGAAAGTGAAAGTATTAGCATTGGACAAAGTGCTCTTACCGTCACCAGTGAACAAACTGGTACTGGTGAACAGGCCAAAAAAATTGTCCTCATGTTTCTGGAGAGATTATCTCATCCTTTTGGTTCTAAGAAATCAAACAAACAGCAGAGGAACACTGAAATGGTTGCTAGAATCTTGCCTTACTTGACCTATGGTGAACCTGCTGCTATGGATGCACTCATCCAACATTTCACTCCATATCTGAATGATTGGGATGAGTTTGACCGATTACAGAAACAGCATGAAGACAATCCAGAGGATAAGAGCATCTCTGAGCAAGCTGCCAAGCAGAGATTTACTGTGGAAAATTTCGTTAGAGTCTCAGAGTCACTGAAGACAAGTTCCTGTGGGGAGAGACTGAAGGATATTATTTTGGAAAAGGGCATTACTGGCCTTGCAATTAAGCATCTGAGAGATAGTTTTGCTGTTGCAGGACAGACCGGTTTCAGATCTAGTGTGGAATGGGCATTTGCCTTGAAACGTCCTTCCATTCCGCTTATATTGTCTATGCTAAGGGGTTTGTCAATGGGGCATTTGGCTACACAGAGATGTATTGATGAAGGAAGGATCTTACCTGTGCTTCATGCTCTGGAAAGAGTTCCGGGAGAAAATGAGATTGGGGCAAGGGCTGAAAACTTGTTAGATACCCTCTCTAACAAGGAGGGAAATGGAGATGGATTCTTGGAGGATAAAGTACGAATGTTAAGACATGCCACAAGGGATGAAATGAGGCGACTTGCTTTGAAGAACAGAGAAGACATGCTACAGGTGCTGTCTCAATGTAGTCCATGATTTAATGTTGGCCTAGTATTTTCTAATGCTTTTGATTATACTTGTATTTTCATACAGTATTTGGTTAGTTTTAGGATTTTTTCTGCTAGAGAGTTTTATCACTATACAAATCGATTTCAAAATGAATTTCCTTTTTGCAATAGAATTTGAATTAGAGCTCTATTTCTTTTCGCAATAGAATTTTCACTGGAATGCCAAAGCAAGCAAATTTTTTTAACTTTCCTTGTCATAAAAATATCATGAACATCGAAAATATGAATAGGTAACATGTTTGGGAGTGGGAAGTCTGTAACATGAAATTTGCTAGCTTTTCATTGTTTGAAAGTGTGGTTGGATAGCATTGATGTATAATAAGAATAACTCTTAAGCTCTTCCGATTTCATGATTTTGTTCTTTTGCCGTCTTCATATATATTTATAAATTCCGTGGTTTCTGATTTTAAAACTCTGTGGAGATAATTCATTTTTGGGGATAATGCAGGGACTTGGAATGCGACAGGTGGCTTCAGATGGCGGTGAACGGATCATTGTCTCTAGACCAGCTCTTGAAGGTTTGGAAGATGTAGAGGAAGAGGAAGATGGGTTGGCATGCATGGTGTGCAGAGAGGGTTATAGTTTGAGGCCTACTGACTTGCTGGGTGTCTATTCATACAGCAAAAGGGTTAACCTTGGTGTTGGGACTTCAGGAAGTACTCGTGGGGAGTGTGTATATACAACCGTGAGTTATTTCAATATCATTCATTATCAATGCCATCAAGAGGCCAAAAGAACAGATGCTGGTTTGAAAATCCCAAAGAAAGAATGGGAAGGAGCAACACTCAGGAACAACGAATCCCTTTGCAATTCCTTGTTCCCTGTGAGAGGTCCGTCTGTCCCGTTAGCACAATATATCCGATATGTTGACCAGCATTGGGACAACCTAAATGCTCTTGGTCGTGCTGATGGAAACAGACTTCGGCTCTTGACATATGATATTGTTCTGGTATGGTTATTGCCTGTTGTCAGTTTGTGA

mRNA sequence

ATGGCGGAGCAGAGTTTCGTCAAACTCTTAGACACTATCTTCCTTGACGATTCCAGCACCAGCGCCAATACCAGGAAGCATTTTTCTTCCTCCGATCTTCTCCAGCTTCTTCGCTCCGATGATTCTTCGATCAAACTCGGCCTTCGTCAATTCTATTCGATTCTCAAGGCCGGCCTTCGGGACCTCGGTGACGGAAACTTTGCTTTTCAGTCATGGACCGATCCTCAGATCCAAGCTGTTTGTTCAATTGCTCATGCAATTGCTTCTGCTTCTCGATCCCTGACCGTGGATCAAGCTGAAGCTATAGTTGTTGCGGTTATTAAAAAATCACTCGAGTTGGTTTTCTGTTACTTGGAGAAATCAGAGTTTAAGTGCGATGACTTTAGTATTCAGAATAATATGTTAATGATTCTGGAGACTATTTTGGTTGATGGGATGGATAAAGTATCAGACTTTGCACAGCTTTGTGCTAAGAAAAGTCTTATGGACTTGTTAAAATCGACTGGTGGAGACTGTGATGCTACTATTGAGTTCGATAATACCATTGAATGTGGTTCTACAGGAGTTTGTTGCTCCAGAGAAGAAAAACAGGTAGGTAGGCTTTTAATGACAATAGCTGCTGAATGCGTGCAAGCTGATCAGCTGACCTCTGAATCTGGATTCAGTCAACCGACATTCCTTGAAGATATGAACAAGTTGATTTTCCTTTGCCAACATTGGGCAGTCACGCATTTGGCATGCATTCAACATTTGATTTTGATCTGCAAAGAATTGGTAGTACTCCCGGATGCGCTTGATGAGAAGACAGGAAGTACAAGTTTTCGGAAGAGACTGTCATGTAGTTTGAGGATATTAAAGCTTCTAACCGATCTCTCAAAGAAATTTCCATATATTGAATATGATGCTAAAATGATGCAGGCATTCGCATTGTTTGCCAACTCATTGCCTTGCTTGTTTGGACTATGTTTTGAGTTTGCAAATAGTCATGCTACAGTCGAGGGTAGTTTTGAGAACACCATTTTGTTACTCCTGGAAGAATTTTTGGAGCTAGTTCAGGTTGTATTTCGCAACAGCTATGTTAGTGTGAACATCCAAACATGTGTAGTGGCTTCTATATTGGATAATTTGAGTTCTTCAGTTTGGCGGTATGATGCATCTACTGCAAACCTGAAGACTCCACTGGTTTACTTTCCACGAAGTGTTATGGTTATAATTAAACTCATTCAAGATCTAAAGGGCCATAAATATCATGCTTTCAGTTTTAAAGATCTTGAAACGCATCACACGAGCACTCTTGCTGATTTATCCGTGGACATACCTAAATGCTATGCTCGTTTGGAGATTGTTCCTTTGCATAAGAATTATAAAGTAGAAGAAATTTTGAGAATGATATTTCCTCTGTCAAAACAATGGATGGATGATTTAATGCATCTACTCTTCTTTCTTTATTCTGAAGGAGTGAGATTAAGACCAAAAATAGAGCGATCATTATCCAGTATGAAGAGCAGTAGTACGGTCGAACAAGAAACTGCTGTCTGTCATGAAGATGAAGCACTATTTGGGGACCTCTTCTCTGAGAGTGGCCGCTCTGTTGGATCTGTAGATGGATATGATTTGCAGCATCTTGCTGTCAACTCTACCTCTAGCTTTTGCAATCTGCTTCTCCAAGCTGCTAAAGAACTATTGAGCTTTATCAAGCTATGTATCTTCTCTCCTGAATGGAATGCATCTGTTTTTGATGATGGTTGCAACAAACTTAATCAGAACCATATTGATATATTACTTTCCTTACTAAACTGTGAGGGATGCTGTTCTGATGACAAGTCTTCTGCTAGTTGTCTACCTGCACATGATGAGAGGAAATCTGGCCACATCCATGAAATTTGTTACAGGTTGTTGCATGGTCTTCTCACACGCCATGCATTGCCAGATTCGCTTGAAGAGTACCTTGTGAAGAAAATTTTGAATGCTGAAAATGGAAATTTTGTTTACAATGATCAGACCCTAAGCTTACTGGCGCACACCCTTTTCCGTAGAACTGGTGTTGCTGGGACGCTGTTGAGGACCCAAATATACAGGCAATTTGTGGAATTTATCATTGAGAAGTCCAAAACTATTTCCTCAAACTATTCCAGCCTCCAGGAATTTATGGGGACTCTTCCCTCTGTCTTTCATATTGAAATTCTTCTGGTGGCATTTCACTTATCTTCTGAAGGAGAAAAGAGAGAAATTTCGAGCTTAATTTTTTCTTCCATTAGGGCAATTGATGCTCCATCCACATTTTCTAACTGTACAGAATTGTCAATGTGGGGTTTATTGGTTTCAAGGTTGATTATAGTACTTCGACACATTATTTTTCACCCGCATACATGTTCCTCTTCGTTGCTTTTTGATTTTCGATCTAAGTTGAGGGATGCTCCTGCATTTTCTTCTAGTTTGCCTTATACACTGAATGATCATTTATCATCTTGGGGTGCAAGTGTCGCTAAGAATATAATTGGTTCATCCGTGGAATCCAAGCCCTTCTTTCATAGCTTGATCAACCAGTTGATTGATATTTCTTCATTTCCTGCTTCACTACGCCAGCATGATTTGACAGTAGAATGTCCATGGTTTAATGCTGGTGATATATTTTCGACATTCTCATGGATTTTGGGGTTCTGGAATGGTAAACAAGCTGTTACGGTTGAAGACCTCATCATTGAAAGATATATTTTTGTTCTCTGCTGGGATTTTCCTTCCATGAATGCTTTATCACATGGAGGCCCATTATGGAGCGACCCAGACACACTGGACATTTCTAATACCACATGCTTCTTTTACTTCAGTTATTTACTCCTAGATCATGGTGGTGTTATTGGCGAACACATGAAGTTTCCTCAAGTTGTGATTGGTTTGCTTCAGCGTTTGCATGGTGGGAGTATCCTGGAGGACTTCAAAGCTTTGGGCTGGAATTTTTTAAGAAATGGAGCATGGCTATCTCTGGTTCTTTCCTTCCTCAGTGTTGGGATATGGAGATACTGCAGTAAGAATATGATTCCAACAGTGGGTTCTTTATTGACAGATACCACAGTTACAGATAATGAGCAGGCCAATTTTGCTGAAAGCTTAATTTCCTCGGTGATTACCGATAGCCAAGTTTCAATTTTAATCAGGGAGTTGTCATCTGTATTGAGCATGTATTTACAAGTGTATCAGAAAGCCTTTGTTGCTACTCTTAGTAGTAGTAATGATCATGCTACTGAGTTTTCCCCACTCTTGCTCTTTAAGCATTCTGAATTTGATAGCTGTGTCCAGAACAAGACCCTTGAGAACTATGGGACAACATCCTGCTTATTGGAATCTGTTTTTAACCTCATGTCTAGGTTGGATGAAATAGTAGACAAAAGAACCCTTGGGTTCTTATCAAGGTTCTGTTGGGAATCGATGTTTCATGGTTTTCCCTCTCATTTGGAAACTTCCAGTGGAATTCTACTTTCTTGTGTCCTTAGCATAGGAAGGATTATTTCTGTTTTAGCTGGGCTGCTGAGGATAGTAGATGTTAAACGTAATATCATTTTGGAGACTGAGGTAACTCGTGGGATTCTTGATGCAGTAATGACTATAAAATTTGACAAGACCTTTGAAAGTGTTCATGGCCTGTGTGAAGGTATATATCAGAGTTTGAATAAGGAATTGGATGGATGTTCTTATGGGGTTCTGTTTCTGTTGAAACAGCTTGAGGGGTACTTAAGACACATGAATATGAGGGGGGCGAGTGATAGCACTATTCATGAATTGGTAATTGTTAAAGCTACAGATATCATGGACAACCTGCGGAAAGATGTTTCGAAGTCTTCTGTTTTCCAATTCTATCTTGGTGCTGAAGTTGTACTGGAGCAGGTTAGAGAACTTTATACATTTCAACATGGTAATTTGTTGGTTCTTCTAGACTCCTTAGACAATTGTTGCTCCGAACTAGTTAACTTGAAGGTTCTTGGTTTCTTTGTGGAACTCTTGTCCGGAGAGCCATGCCCTAAACTTAAACAGGAAGTACAGAATAAATTCCTTAGTATGGATTTGCTTAGCCTATCACAATGGTTGGAAAAGAGGATTTTTGGTTTAGTAGCCGAAGATTCAAGCGGAGGCAATGTGAAGGGATCTTCTATTTCTCTTAGAGAATCATCCATGAATTTTGTATTTTGTCTCATATCATCACCCTCAGAACCTCTGGCACTTCAATTGCAGAGTCACATTTTTGAGGCTGCACTAGTATCACTTGACATGGCATTTTTGCGATTTGACATCAGTGTTTCCAAGTCCTATTTCCATTTTGTTGTTCAGCTATTGAAAGGGGACAAATCAATGAAATTACTTTTGGAGAGAATTCTCATATTGATGGGGAAATTGGCTAGCGATGAGCGCCTGCTTCCAGGGCTGAAGTACCTCTTCAGTTTTCTCGAAATGATTTTGATTGAGAGTGGATCTGGTAAGAATGTTTTTGAGAGACCTTCTGGCAAACCTCTGTCGAGGTATGCACCTGAAGTTGGACCTCTTTCTTCTAAGTCAGTGGGGCCTAGGAAGAATTCAGAGACATTGGTTCTTTCTTCCAATCAAGAAGAGGGTCCTGCATCTTTTGAATGTGATGCAACTTCTGCTGAGGAAGATGAGGATGATGGAACTTCCGATGGTGAGGTAGCTAGTCTGGACAAGGATGAGGAGGAGGACACAAACAGTGAAAGGGCACTTGCTTCAAAAGTCTGCACATTTACATCCAGTGGCAGCAATTTCATGGAACAACACTGGTACTTTTGTTATACATGTGATCTGACTGTCTCGAAGGGATGTTGTTCTGTGTGTGCAAAAGTCTGTCATCGTGGTCATCGTGTTGTTTATTCCCGGTCTAGTCGCTTCTTTTGTGACTGTGGTGCTGGGGGTGTTAGGGGCAGCAGTTGCCAATGTTTGAAGCCTCGCAAGTATACTGGACATGGCAGTGCCCCTGTACGTGGTGCCAGTAACTTCCAGTGCTTTTTGCCTTTCTCTGAGGAGGGAGATCAACTTCCTGAAAGTGAATCTGATCTGGAAGACGATGTGTCAGTTACAGACACAGATAAGTGTCTCAGACCCTCTGTTCCTAGGGAGCTTCTAGATGGTGTTTCCGTTTTACTTGAGGAACTGGATGTTGAAGGAAGGATGCTCGAGCTTTGCTCGTGTTTATTGCCTACTATAACTAACCAAAGGGACCCAGACCTTTCAAAAGACAAGAAAATTATTCTCGGTAAAGACAAGGTGCTATCATATGGGCTTGATCTCTTGCAATTGAAAAAGGCATATAAAGGTGGGTCTTTGGACCTTAAGATTAAGGCAGAATATGCAAATGCCAAGGAGCTCAAATCACATCTAGCCAGTGGTTCTCTTGTGAAATCTCTGCTCAGTGTCAGTATCAGAGGTCGTCTTGCTGTTGGTGAAGGTGATAAAGTGTCTATTTTTGACATCAGGCAGTTAATAGAACAGACTACTGTTGCCCCCATGACGGCAGACAAGACTAATGTTAAGCCACTCTCCAAAAACGTTGTTCGTTTTGAAATTGTACATCTTGCTTTTAATCCCACAGTAGAGAACTATCTTGCTGTGGCAGGCTATGAAGATTGCCAAGTCTTGACTTTGAACCACCGTGGTGAAGTTGTTGACCGTCTTGCTATTGAACTCGCTCTGCAAGGGGCTCATATTAAACGAATGGAATGGGTTCCAGGATCTCAAGTCCAATTAATGGTGGTTACAAACAGGTTTGTCAAAATATATGATCTATCCCTGGATAATATCAGTCCAATGCACTACTTCACGTTGCCAGATGACATGGTAGTTGATGCTACACTGTCCACAGCTTCACAGGGAAGGATGTTTCTTATTGTTCTTTCAGAAAATGGAAGGATATTCAGACTTGAGTTGTCAGTGCTAGGAAATGTTGGAGCTACACCCTTGAAGGAGATCATTGAGATTCAGGGCAGAGAAATGAGTGCAAAGGGATTGTCATTGTACTTTTCTTCATGTTACAAATTATTATTTCTTGCATATGCAGATGGCACCACATTGGTTGGTCAGTTAAGCCCTGATGCAACAAAATTGACCGAGATATCGGTTATATACGAAGAGGAACAAGACAGAAAACTCCGGCCTGCTGGATTACACCGTTGGAAGGAGCTGTTTGCTGGCAGTGGCTTATTTGTTTGCTTTTCAAGCGTCAAATCAAATTCGGCTTTGGCTGTATCTATGGGGGCTCATGATATTTATGCTCAAAACTTGAGACATGCGGGGGGTTCATCTTTGCCATTAGTTGGCATAACTGCATATAAGCCTTTATCCAAAGATAAAATACATTGTCTTGTACTACATGATGATGGTAGCCTTCAAATATACACACATACTGCTGTTGGAGTGGATGCTAGTGCATACGCAACTGCCGAAAAAATCAAGAAGTTGGGTTCTGGCATTCTCAATAACAAGGTTTATGCCAGTACAAATCCAGAATTCCCACTTGATTTTTTTGAGAATACTGTTTGCATCACTGCAGATGTGAGATTGGGAGGCGACGCTATTCGAAATGGTGATTCTGAAGGAGCCAAACAGAGTTTGGCATCCGAGGATGGTTTTCTTGAGAGTCCCAGTTCTTCAGGTTTCAAGATCACTGTCTCTAATTCCAACCCTGATATTGTTATGGTTGGATTTCGCATCCATGTTGGTAATACGTCTGCGAACCATATACCTTCAGAGATAACTATTTTCCAGAGAGTTATAAAATTAGACGAGGGCATGCGATCATGGTATGATATACCATTTACTGTTGCTGAGTCTCTTCTTGCTGATGAAGAATTCTCTGTAACTGTTGGGCCAGCATTCAATGGTACTGCACTTCCTAGGATAGACTCTCTTGAAGTGTATGGTCGAGCAAAAGATGAATTTGGTTGGAAAGAAAAATTGGATGCTGTTCTGGACATGGAGGCACGTGCACTTGGCTCCAATTCCTTGCTTGCCAGATCTGGAAAAAAGAGGCGATCCATTCAATGTGCTCCTATTCAACAGCAGGTGTTAGCAGATGGTCTGAAGGTCTTGTCCAGTTATTATTTGCTTCGTAGATCACAAGGATGCCCAAAACTTAATGATGTGAATCAGGAGCTGACTAAACTGAAGTGCAAGCAATTATTAGAAACAATATACGAAAGTGATCGGGAGCCCTTGTTGCAGTCTGCTGCTTGTCGTGTCCTGCAAGCTATCTTCCCGAAAAAAGAGATATACTACCAAGTGAAGGACACCATGCGTCTGACTGGTGTGGTGAAATCAACATCAGTGCTCTCCTCTAGGCTTGGAGTTGGAGGTGCTGCAGGAGGATGGATTATTGAAGAATTTACATCACAAATGCGTGCAGTTTCTAAGATTGCCCTGCATCGTAGATCTAATTTGGCTTGTTTTCTGGAAAGAAATGGTTCTCAAGTGGTGGATGGACTCATGCAAATTCTGTGGGGAATTTTGGACTTGGAGCAGCCCAACACTCAGACTTTGAACAACATTGTTATTTCCTCTGTTGAACTTATTTATTGCTATGCTGAGTGCCTGGCATTGCATGGCCCAGACACTGGCAGGCACTCTGTTGCACCTGCTGTTGTGTTATTTAAGAAACTTCTGTTTTCCTCCAGTGAGGCTGTTCAGGCTTCAAGCAGCTTGGCTATATCTTCAAGGTTGCTTCAGGTTCCATTTCCAAAGCAAACAATGTTAGCTACTGATGATGGTGCTGATATTCCATTATCTGCACCTGTACCCACTGAAACAACTGGTACCAATCCCCAGGTCATGATTGAAGAAGACGCTGTCGCTTCTTCTGTTCAATACTGTTGTGATGGTTGCTCCACAGTTCCTATACTGAGGCGACGATGGCATTGTACAATTTGCCCTGATTTTGATTTATGTGAATCATGTTATGAGGTACTTGATGCCGACAGGCTTCCTTCCCCTCATTCTAGAGATCATCCTATGACTGCCATCCCAATTGAAGTGGACTCACTAGGAGATGGAAATGAATATCACTTCGCCACAGAAGATATCAATGATTCAAGCTTAACATCATTAATTCCAGATATTAGCGTGAAGAACCCAGTGTCATCAATTCATGTCTTGGAGCCAGCTGATTCTGGGGATTTTTCTGCCTCAGTGACTGATCCAGTTTCAATATCAGCTTCTAAACAAACCGTTAATTCCTTGCTTCTCTCTGAGCTCCTTGAACAGTTAAAAGGATGGATGGAGACAACTTCAGGTGTTCAGGCTGTTCCTGTTATGCAGCTTTTCTACAGATTATCATCCACAATGGGCGGACCTTTTATGAACAGTTTGAAATCTGAAAACTTGAACTTGGAAAGACTTATTAAATGGTTTTTGGATGAGATTAATCTCAACAAACCCTTTGAAGCAAAAACCCGTACTTCATTTGGGGAAGTTGCAATCCTTGTTTTCATGTTCTTCACTTTGATGCTAAGAAACTGGCACCAACCTGGTAGCGATGGTCCAGGTGCCAAACCTAGTACTACGACAGATACACATGACAAGAATTCTACACAGGTTGCACCATCTACTTCAGTGACTGCACAATCTTCCATGGATGATCAAGGAAAAAATGACTTTACTTCACAACTACTTCGTGCTTGTAGCTCTATTAGGCAACAATCTTTTGTAAATTATCTTATGGATGTGCTGCAGCAGCTTGTGCATGTCTTCAAGTCATCCACAATTGATTATGACAGTGGACATGGTTTTCATAATGGTTCTGGATGTGGAGCTCTGCTAACAGTTCGTAAGGATCTCCCTGCTGGCAATTTCTCTCCATTTTTTTCAGATTCTTATGCAAAAGCACACCGGACAGATCTTTTTATAGACTATCACAGGCTATTGTTAGAAAATGCTTTTCGTCTTGTATATACATTGGTTCGACCAGAAAAATATGACAAGACATTGGAGAAGGAGAAGGTTTATAAGATTTATAGCAGCAAGGATTTGAAGTTGGATGCCTATCAAGATGTTCTTTGCAGTTATATTAACAATCCAAATACTAGCTTTGTTAGAAGATATGCAAGAAGGCTTTTCCTCCACATTTGTGGCAGCAAAAGTCACTATTACAGTATTCGAGACTCTTGGCAGTTTTCGACTGAAGTAAAGAGACTTTTCAAATACATAAACAAGGTTGGTGGTTTTCAAAATCCTATGTCATACGAGAGAAGCGTAAAGATAGTGAAATGCCTAACAACTATGGCTGAAGTAGCTGCCGCAAGGCCTCGAAATTGGCAGAAATATTGCTTACGACATGCGGATGTGCTGCCATTTTTGCTGAATGGAATTTTCTACTTTGGAGAAGAGTCTGTTGTTCAAACTCTCAAACTTTTGAATCTTGCCTTCTATACTGGAAAAGATATTGGCCATTCTGTACAAAAGTCTGAAGCAGGAGATACTGGGACTAGTACAAATAAATCTGGTACACAAACTGTGGATTCAAGAAAGAAAAGAAAAGGTGAAGATGGAAATGATTCTGCGTTGGAGAAGTCCTATTTGGATATGGAGATCATGGTCAATATCTTTGTTGATAAGGGTAGTAATGTCTTGAGCCATTTCATTGATTGCTTTCTTCTTGAGTGGAATTCAAGCTCTGTCCGGGCAGAGACCAAAGGTGTTGTTTGTGGCATTTGGCATCATGGAAAGCAGACATTTAAAGAAACTCTATTGATGGCTCTCTTGCAAAAGGTTAAAACTCTTCCTATGTATGGTCTGAACATTGCTGAATATACAGAACTGGTCACATGGTTGCTGGGAAAAGTTCCTGATGTTGGTTCTAAGCAGCAGAGTTCTGAACTTCTGGATAGATGCTTGACCTCTGATGTAATTCGATCAATTTATCAAACACTTCACTCACAAAATGAGCTGTTAGCTAATCATCCAAATTCACGCATATACAATACTTTGAGTGGTTTAGTTGAGTTTGATGGTTACTATCTAGAGAGTGAACCTTGTGCGGCTTGCAGTTCTCCTGAGGTGCCTTACAGCAGGATGAAACTTGAGAGTCTTAAATCTGAAACAAAATTCACTGACAACCGCATCATTGTTAAATGTACAGGGAGTTACACAATTCAAACTGTTATAATGAATGTTCATGATGCTCGGAAGTCCAAATCTGTGAAAGTTTTGAACCTGTACTACAATAATCGGCCCGTGGCAGACTTATCAGAGTTGAAAAATAATTGGTCTTTGTGGAAGCGTGCAAAAAGTTGTCATCTTGCATTCAATCAAACTGAACTAAAAGTAGAGTTTCCCATTCCAATCACTGCATGTAATTTCATGATTGAGCTGGATTCTTTCTATGAAAATCTTCAAGCTTTGTCCCTTGAACCTTTGCAATGCCCTCGATGCAGTCGTCCAGTCACTGATAAGCATGGAATATGCAGCAATTGCCATGAAAATGCATATCAATGTAGGCAATGCCGGAACATAAATTATGAGAACCTTGACTCATTTTTATGCAACGAGTGTGGATACAGCAAGTATGGAAGATTTGAATTTAATTTTATGGCAAAGCCAAGTTTTACATTTGATAATATGGAGAACGATGAAGATATGAAGAGAGGTCTTGCTGCAATAGAATCTGAATCAGAAAATGCTCACAGGAGATATCAACAACTCTTAGGGTACAAGAAACCCCTGCTGAAAATTGTTTCAAGCATTGGTGAGAATGAAATGGACTCGCAACAGAAGGATTCTGTTCAGCAAATGATGGTCTCACTTCCAGGACCATCGTGCAAGATTAATCGTAAAATTGCTCTCCTTGGGGTTTTATATGGTGAAAAATGCAAAGCAGCCTTTGATTCTGTCAGTAAAAGTGTCCAGACGCTTCAAGGTCTTCGTCGGGTTTTAATGACGTATTTGCACCAGAAACATACTGATGACGGGTTTCCGGCTTCAAGATTTGTGATTTCTAGATCTCCTAATAATTGCTATGGCTGTGCGACTACATTTGTAACTCAATGTCTTGAGATATTGCAGGTGTTATCAAAGCATCAGAGTTCGAAGAAACAACTTGTTAGCCTGGGTATATTATCTGAGCTGTTCGAGAATAATATTCATCAAGGGCCCAAAACTGCTCGAATACAAGCTAGGGCAGTTCTTTGTTCTTTCTCCGAGGGTGACGTAAATGCAGTGAATGGACTGAACAATCTAATCCAAAAGAAAGTCATGTACTGCCTTGAACACCATCGTTCTATGGACATCGCATTGGCAACTCGAGAAGAGTTATCACTGCTCTCAGAAGTTTGTTCTTTGGCTGATGAATTCTGGGAAGCTAGATTGCGAGTTGTTTTCCAGCTGTTATTTTCATCCATTAAGTCGGGTGCCAAACATCCAGCAATTGCTGAGCACATCATTCTTCCATGTCTGAGGATCATATCTCAAGCTTGTACTCCTCCTAAATCTGATACTGTAGACAAGGAGCAGAGGATGGGAAAATTGACATCTGTTTCACAAAATAAGGATGAAAACGCTACAAATATATCTGGATCTTTCAGTGGACCTGTTAGTGGGAATAAGTCTGCACCTGAATCACTTGAACATAATTGGGATTCTTCTCATAGGACTCAGGATATTCAATTGCTGAGTTATGCAGAGTGGGAAAAGGGAGCATCATATCTTGACTTTGTTAGAAGGCAGTACAAGGTGTCTCAGGTGTGTAAAGGTACAGTTCAAAGATCTCGAACACAAAAAGGCGATTATTTGTCCCTGAAGTATGCGCTTAAGTGGAAGCGGTTTGTATGTAGAAATGCTAAAAGTGATTTGTCAGCTTTTGAGCTGGGGTCATGGGTTACAGAACTTGTGCTATGTGCCTGTTCCCAGTCTATAAGATCGGAGATGTGTATGCTGATAAGTTTGCTTTGTGCTCAAAGTTCATCAAGACGATTTCGACTATTGGATTTACTGGTGTCTCTGTTGCCAGCAACTCTTTCTGCTGGCGAGAGTGCTGCTGAATATTTTGATTTACTTTTCAAGATGGTAGATTCAGAAGATGCGCGCTTGTTTTTGACTGTTCGAGGGTGCTTACGTACAATTTGCCAATTAATTTCCCAAGAAGTGGGCAATGTTGAGTCTCTGGAGAGAAGCCTCCATATCGATATTTCCCAGGGATTTATTCTTCACAAGCTCATAGAGCTCCTTGGGAAATTTCTAGAGATCCCCAACATTAGATCAAGGTATAACTTCTTCCTTAAGCATCTGTTTTTTAGACTACTTCTGTTTAAGCTTATACAATGTGATCTTATTGTTAGATTCATGCGGGATAATCTACTCTCTGAAGTTCTTGAGGCTCTCATTGTGATTCGTGGTTTGGTAGTACAGAAGACAAAATTGATTAGCGATTGTAATCGGCTTTTGAAAGATCTCTTGGACAGCCTTCTGCTGGAAAGCAATGAGAACAAGAGGCAATTTATTCGAGCCTGCATTTGTGGCTTGCAAATCCATGGAGAGGAAAGGAAAGGGCGGACTTGTTTGTTTATTCTAGAGCAGCTCTGCAATCTGATCTCTCCCTCAAAGCCAGAGCCAGTGTATCTCTTGGTTCTAAACAAGGCACACACGCAAGAAGAATTTATTAGGGGATCCATGACAAAGAATCCTTACTCCAGTGCTGAGATTGGTCCATTAATGCGTGATGTCAAAAATAAAATTTGTCACCAATTGGATTTACTTGGTTTTCTTGAAGATGATTACGGCATGGAGTTGCTAGTTGCTGGAAATATAATTTCTCTTGATTTGAGCATAGCACTAGTCTATGAGCAAGTGTGGAAGAAGTCTAACCAGTCTTCAAATGCCATATCTAATACTGCACTAATATCTACCACCGCTGCAAGAGACTCTCCTCCTATGACAGTTACTTACCGGCTTCAGGGGCTAGATGGTGAAGCAACGGAACCTATGATTAAGGAATTGGAAGAGGACAGAGAGGAATCGCAAGATCCAGAACTGGAATTTGCTATAGCAGGTGCAGTTCGTGAGTATGGAGGTCTGGAAATTTTGTTGGGCATGATCCAGCGCATATGGGATAATTTCAAGTCAAACCAAGAGCAGTTGGTTGCAGTTCTTAATCTTCTTATGCATTGTTGCAAAATAAGAGAGAACAGGCGTGCTTTATTAAGGCTTGGAGCTCTCGGATTACTTCTAGAAACGGCAAGGCGTGCCTTCTCTGTGGATGCCATGGAGTCAGCTGAAGGCATTCTCTTGATTGTGGAGAGTCTAACAATTGAAGCGAATGAAAGTGAAAGTATTAGCATTGGACAAAGTGCTCTTACCGTCACCAGTGAACAAACTGGTACTGGTGAACAGGCCAAAAAAATTGTCCTCATGTTTCTGGAGAGATTATCTCATCCTTTTGGTTCTAAGAAATCAAACAAACAGCAGAGGAACACTGAAATGGTTGCTAGAATCTTGCCTTACTTGACCTATGGTGAACCTGCTGCTATGGATGCACTCATCCAACATTTCACTCCATATCTGAATGATTGGGATGAGTTTGACCGATTACAGAAACAGCATGAAGACAATCCAGAGGATAAGAGCATCTCTGAGCAAGCTGCCAAGCAGAGATTTACTGTGGAAAATTTCGTTAGAGTCTCAGAGTCACTGAAGACAAGTTCCTGTGGGGAGAGACTGAAGGATATTATTTTGGAAAAGGGCATTACTGGCCTTGCAATTAAGCATCTGAGAGATAGTTTTGCTGTTGCAGGACAGACCGGTTTCAGATCTAGTGTGGAATGGGCATTTGCCTTGAAACGTCCTTCCATTCCGCTTATATTGTCTATGCTAAGGGGTTTGTCAATGGGGCATTTGGCTACACAGAGATGTATTGATGAAGGAAGGATCTTACCTGTGCTTCATGCTCTGGAAAGAGTTCCGGGAGAAAATGAGATTGGGGCAAGGGCTGAAAACTTGTTAGATACCCTCTCTAACAAGGAGGGAAATGGAGATGGATTCTTGGAGGATAAAGTACGAATGTTAAGACATGCCACAAGGGATGAAATGAGGCGACTTGCTTTGAAGAACAGAGAAGACATGCTACAGGGACTTGGAATGCGACAGGTGGCTTCAGATGGCGGTGAACGGATCATTGTCTCTAGACCAGCTCTTGAAGGTTTGGAAGATGTAGAGGAAGAGGAAGATGGGTTGGCATGCATGGTGTGCAGAGAGGGTTATAGTTTGAGGCCTACTGACTTGCTGGGTGTCTATTCATACAGCAAAAGGGTTAACCTTGGTGTTGGGACTTCAGGAAGTACTCGTGGGGAGTGTGTATATACAACCGTGAGTTATTTCAATATCATTCATTATCAATGCCATCAAGAGGCCAAAAGAACAGATGCTGGTTTGAAAATCCCAAAGAAAGAATGGGAAGGAGCAACACTCAGGAACAACGAATCCCTTTGCAATTCCTTGTTCCCTGTGAGAGGTCCGTCTGTCCCGTTAGCACAATATATCCGATATGTTGACCAGCATTGGGACAACCTAAATGCTCTTGGTCGTGCTGATGGAAACAGACTTCGGCTCTTGACATATGATATTGTTCTGGTATGGTTATTGCCTGTTGTCAGTTTGTGA

Coding sequence (CDS)

ATGGCGGAGCAGAGTTTCGTCAAACTCTTAGACACTATCTTCCTTGACGATTCCAGCACCAGCGCCAATACCAGGAAGCATTTTTCTTCCTCCGATCTTCTCCAGCTTCTTCGCTCCGATGATTCTTCGATCAAACTCGGCCTTCGTCAATTCTATTCGATTCTCAAGGCCGGCCTTCGGGACCTCGGTGACGGAAACTTTGCTTTTCAGTCATGGACCGATCCTCAGATCCAAGCTGTTTGTTCAATTGCTCATGCAATTGCTTCTGCTTCTCGATCCCTGACCGTGGATCAAGCTGAAGCTATAGTTGTTGCGGTTATTAAAAAATCACTCGAGTTGGTTTTCTGTTACTTGGAGAAATCAGAGTTTAAGTGCGATGACTTTAGTATTCAGAATAATATGTTAATGATTCTGGAGACTATTTTGGTTGATGGGATGGATAAAGTATCAGACTTTGCACAGCTTTGTGCTAAGAAAAGTCTTATGGACTTGTTAAAATCGACTGGTGGAGACTGTGATGCTACTATTGAGTTCGATAATACCATTGAATGTGGTTCTACAGGAGTTTGTTGCTCCAGAGAAGAAAAACAGGTAGGTAGGCTTTTAATGACAATAGCTGCTGAATGCGTGCAAGCTGATCAGCTGACCTCTGAATCTGGATTCAGTCAACCGACATTCCTTGAAGATATGAACAAGTTGATTTTCCTTTGCCAACATTGGGCAGTCACGCATTTGGCATGCATTCAACATTTGATTTTGATCTGCAAAGAATTGGTAGTACTCCCGGATGCGCTTGATGAGAAGACAGGAAGTACAAGTTTTCGGAAGAGACTGTCATGTAGTTTGAGGATATTAAAGCTTCTAACCGATCTCTCAAAGAAATTTCCATATATTGAATATGATGCTAAAATGATGCAGGCATTCGCATTGTTTGCCAACTCATTGCCTTGCTTGTTTGGACTATGTTTTGAGTTTGCAAATAGTCATGCTACAGTCGAGGGTAGTTTTGAGAACACCATTTTGTTACTCCTGGAAGAATTTTTGGAGCTAGTTCAGGTTGTATTTCGCAACAGCTATGTTAGTGTGAACATCCAAACATGTGTAGTGGCTTCTATATTGGATAATTTGAGTTCTTCAGTTTGGCGGTATGATGCATCTACTGCAAACCTGAAGACTCCACTGGTTTACTTTCCACGAAGTGTTATGGTTATAATTAAACTCATTCAAGATCTAAAGGGCCATAAATATCATGCTTTCAGTTTTAAAGATCTTGAAACGCATCACACGAGCACTCTTGCTGATTTATCCGTGGACATACCTAAATGCTATGCTCGTTTGGAGATTGTTCCTTTGCATAAGAATTATAAAGTAGAAGAAATTTTGAGAATGATATTTCCTCTGTCAAAACAATGGATGGATGATTTAATGCATCTACTCTTCTTTCTTTATTCTGAAGGAGTGAGATTAAGACCAAAAATAGAGCGATCATTATCCAGTATGAAGAGCAGTAGTACGGTCGAACAAGAAACTGCTGTCTGTCATGAAGATGAAGCACTATTTGGGGACCTCTTCTCTGAGAGTGGCCGCTCTGTTGGATCTGTAGATGGATATGATTTGCAGCATCTTGCTGTCAACTCTACCTCTAGCTTTTGCAATCTGCTTCTCCAAGCTGCTAAAGAACTATTGAGCTTTATCAAGCTATGTATCTTCTCTCCTGAATGGAATGCATCTGTTTTTGATGATGGTTGCAACAAACTTAATCAGAACCATATTGATATATTACTTTCCTTACTAAACTGTGAGGGATGCTGTTCTGATGACAAGTCTTCTGCTAGTTGTCTACCTGCACATGATGAGAGGAAATCTGGCCACATCCATGAAATTTGTTACAGGTTGTTGCATGGTCTTCTCACACGCCATGCATTGCCAGATTCGCTTGAAGAGTACCTTGTGAAGAAAATTTTGAATGCTGAAAATGGAAATTTTGTTTACAATGATCAGACCCTAAGCTTACTGGCGCACACCCTTTTCCGTAGAACTGGTGTTGCTGGGACGCTGTTGAGGACCCAAATATACAGGCAATTTGTGGAATTTATCATTGAGAAGTCCAAAACTATTTCCTCAAACTATTCCAGCCTCCAGGAATTTATGGGGACTCTTCCCTCTGTCTTTCATATTGAAATTCTTCTGGTGGCATTTCACTTATCTTCTGAAGGAGAAAAGAGAGAAATTTCGAGCTTAATTTTTTCTTCCATTAGGGCAATTGATGCTCCATCCACATTTTCTAACTGTACAGAATTGTCAATGTGGGGTTTATTGGTTTCAAGGTTGATTATAGTACTTCGACACATTATTTTTCACCCGCATACATGTTCCTCTTCGTTGCTTTTTGATTTTCGATCTAAGTTGAGGGATGCTCCTGCATTTTCTTCTAGTTTGCCTTATACACTGAATGATCATTTATCATCTTGGGGTGCAAGTGTCGCTAAGAATATAATTGGTTCATCCGTGGAATCCAAGCCCTTCTTTCATAGCTTGATCAACCAGTTGATTGATATTTCTTCATTTCCTGCTTCACTACGCCAGCATGATTTGACAGTAGAATGTCCATGGTTTAATGCTGGTGATATATTTTCGACATTCTCATGGATTTTGGGGTTCTGGAATGGTAAACAAGCTGTTACGGTTGAAGACCTCATCATTGAAAGATATATTTTTGTTCTCTGCTGGGATTTTCCTTCCATGAATGCTTTATCACATGGAGGCCCATTATGGAGCGACCCAGACACACTGGACATTTCTAATACCACATGCTTCTTTTACTTCAGTTATTTACTCCTAGATCATGGTGGTGTTATTGGCGAACACATGAAGTTTCCTCAAGTTGTGATTGGTTTGCTTCAGCGTTTGCATGGTGGGAGTATCCTGGAGGACTTCAAAGCTTTGGGCTGGAATTTTTTAAGAAATGGAGCATGGCTATCTCTGGTTCTTTCCTTCCTCAGTGTTGGGATATGGAGATACTGCAGTAAGAATATGATTCCAACAGTGGGTTCTTTATTGACAGATACCACAGTTACAGATAATGAGCAGGCCAATTTTGCTGAAAGCTTAATTTCCTCGGTGATTACCGATAGCCAAGTTTCAATTTTAATCAGGGAGTTGTCATCTGTATTGAGCATGTATTTACAAGTGTATCAGAAAGCCTTTGTTGCTACTCTTAGTAGTAGTAATGATCATGCTACTGAGTTTTCCCCACTCTTGCTCTTTAAGCATTCTGAATTTGATAGCTGTGTCCAGAACAAGACCCTTGAGAACTATGGGACAACATCCTGCTTATTGGAATCTGTTTTTAACCTCATGTCTAGGTTGGATGAAATAGTAGACAAAAGAACCCTTGGGTTCTTATCAAGGTTCTGTTGGGAATCGATGTTTCATGGTTTTCCCTCTCATTTGGAAACTTCCAGTGGAATTCTACTTTCTTGTGTCCTTAGCATAGGAAGGATTATTTCTGTTTTAGCTGGGCTGCTGAGGATAGTAGATGTTAAACGTAATATCATTTTGGAGACTGAGGTAACTCGTGGGATTCTTGATGCAGTAATGACTATAAAATTTGACAAGACCTTTGAAAGTGTTCATGGCCTGTGTGAAGGTATATATCAGAGTTTGAATAAGGAATTGGATGGATGTTCTTATGGGGTTCTGTTTCTGTTGAAACAGCTTGAGGGGTACTTAAGACACATGAATATGAGGGGGGCGAGTGATAGCACTATTCATGAATTGGTAATTGTTAAAGCTACAGATATCATGGACAACCTGCGGAAAGATGTTTCGAAGTCTTCTGTTTTCCAATTCTATCTTGGTGCTGAAGTTGTACTGGAGCAGGTTAGAGAACTTTATACATTTCAACATGGTAATTTGTTGGTTCTTCTAGACTCCTTAGACAATTGTTGCTCCGAACTAGTTAACTTGAAGGTTCTTGGTTTCTTTGTGGAACTCTTGTCCGGAGAGCCATGCCCTAAACTTAAACAGGAAGTACAGAATAAATTCCTTAGTATGGATTTGCTTAGCCTATCACAATGGTTGGAAAAGAGGATTTTTGGTTTAGTAGCCGAAGATTCAAGCGGAGGCAATGTGAAGGGATCTTCTATTTCTCTTAGAGAATCATCCATGAATTTTGTATTTTGTCTCATATCATCACCCTCAGAACCTCTGGCACTTCAATTGCAGAGTCACATTTTTGAGGCTGCACTAGTATCACTTGACATGGCATTTTTGCGATTTGACATCAGTGTTTCCAAGTCCTATTTCCATTTTGTTGTTCAGCTATTGAAAGGGGACAAATCAATGAAATTACTTTTGGAGAGAATTCTCATATTGATGGGGAAATTGGCTAGCGATGAGCGCCTGCTTCCAGGGCTGAAGTACCTCTTCAGTTTTCTCGAAATGATTTTGATTGAGAGTGGATCTGGTAAGAATGTTTTTGAGAGACCTTCTGGCAAACCTCTGTCGAGGTATGCACCTGAAGTTGGACCTCTTTCTTCTAAGTCAGTGGGGCCTAGGAAGAATTCAGAGACATTGGTTCTTTCTTCCAATCAAGAAGAGGGTCCTGCATCTTTTGAATGTGATGCAACTTCTGCTGAGGAAGATGAGGATGATGGAACTTCCGATGGTGAGGTAGCTAGTCTGGACAAGGATGAGGAGGAGGACACAAACAGTGAAAGGGCACTTGCTTCAAAAGTCTGCACATTTACATCCAGTGGCAGCAATTTCATGGAACAACACTGGTACTTTTGTTATACATGTGATCTGACTGTCTCGAAGGGATGTTGTTCTGTGTGTGCAAAAGTCTGTCATCGTGGTCATCGTGTTGTTTATTCCCGGTCTAGTCGCTTCTTTTGTGACTGTGGTGCTGGGGGTGTTAGGGGCAGCAGTTGCCAATGTTTGAAGCCTCGCAAGTATACTGGACATGGCAGTGCCCCTGTACGTGGTGCCAGTAACTTCCAGTGCTTTTTGCCTTTCTCTGAGGAGGGAGATCAACTTCCTGAAAGTGAATCTGATCTGGAAGACGATGTGTCAGTTACAGACACAGATAAGTGTCTCAGACCCTCTGTTCCTAGGGAGCTTCTAGATGGTGTTTCCGTTTTACTTGAGGAACTGGATGTTGAAGGAAGGATGCTCGAGCTTTGCTCGTGTTTATTGCCTACTATAACTAACCAAAGGGACCCAGACCTTTCAAAAGACAAGAAAATTATTCTCGGTAAAGACAAGGTGCTATCATATGGGCTTGATCTCTTGCAATTGAAAAAGGCATATAAAGGTGGGTCTTTGGACCTTAAGATTAAGGCAGAATATGCAAATGCCAAGGAGCTCAAATCACATCTAGCCAGTGGTTCTCTTGTGAAATCTCTGCTCAGTGTCAGTATCAGAGGTCGTCTTGCTGTTGGTGAAGGTGATAAAGTGTCTATTTTTGACATCAGGCAGTTAATAGAACAGACTACTGTTGCCCCCATGACGGCAGACAAGACTAATGTTAAGCCACTCTCCAAAAACGTTGTTCGTTTTGAAATTGTACATCTTGCTTTTAATCCCACAGTAGAGAACTATCTTGCTGTGGCAGGCTATGAAGATTGCCAAGTCTTGACTTTGAACCACCGTGGTGAAGTTGTTGACCGTCTTGCTATTGAACTCGCTCTGCAAGGGGCTCATATTAAACGAATGGAATGGGTTCCAGGATCTCAAGTCCAATTAATGGTGGTTACAAACAGGTTTGTCAAAATATATGATCTATCCCTGGATAATATCAGTCCAATGCACTACTTCACGTTGCCAGATGACATGGTAGTTGATGCTACACTGTCCACAGCTTCACAGGGAAGGATGTTTCTTATTGTTCTTTCAGAAAATGGAAGGATATTCAGACTTGAGTTGTCAGTGCTAGGAAATGTTGGAGCTACACCCTTGAAGGAGATCATTGAGATTCAGGGCAGAGAAATGAGTGCAAAGGGATTGTCATTGTACTTTTCTTCATGTTACAAATTATTATTTCTTGCATATGCAGATGGCACCACATTGGTTGGTCAGTTAAGCCCTGATGCAACAAAATTGACCGAGATATCGGTTATATACGAAGAGGAACAAGACAGAAAACTCCGGCCTGCTGGATTACACCGTTGGAAGGAGCTGTTTGCTGGCAGTGGCTTATTTGTTTGCTTTTCAAGCGTCAAATCAAATTCGGCTTTGGCTGTATCTATGGGGGCTCATGATATTTATGCTCAAAACTTGAGACATGCGGGGGGTTCATCTTTGCCATTAGTTGGCATAACTGCATATAAGCCTTTATCCAAAGATAAAATACATTGTCTTGTACTACATGATGATGGTAGCCTTCAAATATACACACATACTGCTGTTGGAGTGGATGCTAGTGCATACGCAACTGCCGAAAAAATCAAGAAGTTGGGTTCTGGCATTCTCAATAACAAGGTTTATGCCAGTACAAATCCAGAATTCCCACTTGATTTTTTTGAGAATACTGTTTGCATCACTGCAGATGTGAGATTGGGAGGCGACGCTATTCGAAATGGTGATTCTGAAGGAGCCAAACAGAGTTTGGCATCCGAGGATGGTTTTCTTGAGAGTCCCAGTTCTTCAGGTTTCAAGATCACTGTCTCTAATTCCAACCCTGATATTGTTATGGTTGGATTTCGCATCCATGTTGGTAATACGTCTGCGAACCATATACCTTCAGAGATAACTATTTTCCAGAGAGTTATAAAATTAGACGAGGGCATGCGATCATGGTATGATATACCATTTACTGTTGCTGAGTCTCTTCTTGCTGATGAAGAATTCTCTGTAACTGTTGGGCCAGCATTCAATGGTACTGCACTTCCTAGGATAGACTCTCTTGAAGTGTATGGTCGAGCAAAAGATGAATTTGGTTGGAAAGAAAAATTGGATGCTGTTCTGGACATGGAGGCACGTGCACTTGGCTCCAATTCCTTGCTTGCCAGATCTGGAAAAAAGAGGCGATCCATTCAATGTGCTCCTATTCAACAGCAGGTGTTAGCAGATGGTCTGAAGGTCTTGTCCAGTTATTATTTGCTTCGTAGATCACAAGGATGCCCAAAACTTAATGATGTGAATCAGGAGCTGACTAAACTGAAGTGCAAGCAATTATTAGAAACAATATACGAAAGTGATCGGGAGCCCTTGTTGCAGTCTGCTGCTTGTCGTGTCCTGCAAGCTATCTTCCCGAAAAAAGAGATATACTACCAAGTGAAGGACACCATGCGTCTGACTGGTGTGGTGAAATCAACATCAGTGCTCTCCTCTAGGCTTGGAGTTGGAGGTGCTGCAGGAGGATGGATTATTGAAGAATTTACATCACAAATGCGTGCAGTTTCTAAGATTGCCCTGCATCGTAGATCTAATTTGGCTTGTTTTCTGGAAAGAAATGGTTCTCAAGTGGTGGATGGACTCATGCAAATTCTGTGGGGAATTTTGGACTTGGAGCAGCCCAACACTCAGACTTTGAACAACATTGTTATTTCCTCTGTTGAACTTATTTATTGCTATGCTGAGTGCCTGGCATTGCATGGCCCAGACACTGGCAGGCACTCTGTTGCACCTGCTGTTGTGTTATTTAAGAAACTTCTGTTTTCCTCCAGTGAGGCTGTTCAGGCTTCAAGCAGCTTGGCTATATCTTCAAGGTTGCTTCAGGTTCCATTTCCAAAGCAAACAATGTTAGCTACTGATGATGGTGCTGATATTCCATTATCTGCACCTGTACCCACTGAAACAACTGGTACCAATCCCCAGGTCATGATTGAAGAAGACGCTGTCGCTTCTTCTGTTCAATACTGTTGTGATGGTTGCTCCACAGTTCCTATACTGAGGCGACGATGGCATTGTACAATTTGCCCTGATTTTGATTTATGTGAATCATGTTATGAGGTACTTGATGCCGACAGGCTTCCTTCCCCTCATTCTAGAGATCATCCTATGACTGCCATCCCAATTGAAGTGGACTCACTAGGAGATGGAAATGAATATCACTTCGCCACAGAAGATATCAATGATTCAAGCTTAACATCATTAATTCCAGATATTAGCGTGAAGAACCCAGTGTCATCAATTCATGTCTTGGAGCCAGCTGATTCTGGGGATTTTTCTGCCTCAGTGACTGATCCAGTTTCAATATCAGCTTCTAAACAAACCGTTAATTCCTTGCTTCTCTCTGAGCTCCTTGAACAGTTAAAAGGATGGATGGAGACAACTTCAGGTGTTCAGGCTGTTCCTGTTATGCAGCTTTTCTACAGATTATCATCCACAATGGGCGGACCTTTTATGAACAGTTTGAAATCTGAAAACTTGAACTTGGAAAGACTTATTAAATGGTTTTTGGATGAGATTAATCTCAACAAACCCTTTGAAGCAAAAACCCGTACTTCATTTGGGGAAGTTGCAATCCTTGTTTTCATGTTCTTCACTTTGATGCTAAGAAACTGGCACCAACCTGGTAGCGATGGTCCAGGTGCCAAACCTAGTACTACGACAGATACACATGACAAGAATTCTACACAGGTTGCACCATCTACTTCAGTGACTGCACAATCTTCCATGGATGATCAAGGAAAAAATGACTTTACTTCACAACTACTTCGTGCTTGTAGCTCTATTAGGCAACAATCTTTTGTAAATTATCTTATGGATGTGCTGCAGCAGCTTGTGCATGTCTTCAAGTCATCCACAATTGATTATGACAGTGGACATGGTTTTCATAATGGTTCTGGATGTGGAGCTCTGCTAACAGTTCGTAAGGATCTCCCTGCTGGCAATTTCTCTCCATTTTTTTCAGATTCTTATGCAAAAGCACACCGGACAGATCTTTTTATAGACTATCACAGGCTATTGTTAGAAAATGCTTTTCGTCTTGTATATACATTGGTTCGACCAGAAAAATATGACAAGACATTGGAGAAGGAGAAGGTTTATAAGATTTATAGCAGCAAGGATTTGAAGTTGGATGCCTATCAAGATGTTCTTTGCAGTTATATTAACAATCCAAATACTAGCTTTGTTAGAAGATATGCAAGAAGGCTTTTCCTCCACATTTGTGGCAGCAAAAGTCACTATTACAGTATTCGAGACTCTTGGCAGTTTTCGACTGAAGTAAAGAGACTTTTCAAATACATAAACAAGGTTGGTGGTTTTCAAAATCCTATGTCATACGAGAGAAGCGTAAAGATAGTGAAATGCCTAACAACTATGGCTGAAGTAGCTGCCGCAAGGCCTCGAAATTGGCAGAAATATTGCTTACGACATGCGGATGTGCTGCCATTTTTGCTGAATGGAATTTTCTACTTTGGAGAAGAGTCTGTTGTTCAAACTCTCAAACTTTTGAATCTTGCCTTCTATACTGGAAAAGATATTGGCCATTCTGTACAAAAGTCTGAAGCAGGAGATACTGGGACTAGTACAAATAAATCTGGTACACAAACTGTGGATTCAAGAAAGAAAAGAAAAGGTGAAGATGGAAATGATTCTGCGTTGGAGAAGTCCTATTTGGATATGGAGATCATGGTCAATATCTTTGTTGATAAGGGTAGTAATGTCTTGAGCCATTTCATTGATTGCTTTCTTCTTGAGTGGAATTCAAGCTCTGTCCGGGCAGAGACCAAAGGTGTTGTTTGTGGCATTTGGCATCATGGAAAGCAGACATTTAAAGAAACTCTATTGATGGCTCTCTTGCAAAAGGTTAAAACTCTTCCTATGTATGGTCTGAACATTGCTGAATATACAGAACTGGTCACATGGTTGCTGGGAAAAGTTCCTGATGTTGGTTCTAAGCAGCAGAGTTCTGAACTTCTGGATAGATGCTTGACCTCTGATGTAATTCGATCAATTTATCAAACACTTCACTCACAAAATGAGCTGTTAGCTAATCATCCAAATTCACGCATATACAATACTTTGAGTGGTTTAGTTGAGTTTGATGGTTACTATCTAGAGAGTGAACCTTGTGCGGCTTGCAGTTCTCCTGAGGTGCCTTACAGCAGGATGAAACTTGAGAGTCTTAAATCTGAAACAAAATTCACTGACAACCGCATCATTGTTAAATGTACAGGGAGTTACACAATTCAAACTGTTATAATGAATGTTCATGATGCTCGGAAGTCCAAATCTGTGAAAGTTTTGAACCTGTACTACAATAATCGGCCCGTGGCAGACTTATCAGAGTTGAAAAATAATTGGTCTTTGTGGAAGCGTGCAAAAAGTTGTCATCTTGCATTCAATCAAACTGAACTAAAAGTAGAGTTTCCCATTCCAATCACTGCATGTAATTTCATGATTGAGCTGGATTCTTTCTATGAAAATCTTCAAGCTTTGTCCCTTGAACCTTTGCAATGCCCTCGATGCAGTCGTCCAGTCACTGATAAGCATGGAATATGCAGCAATTGCCATGAAAATGCATATCAATGTAGGCAATGCCGGAACATAAATTATGAGAACCTTGACTCATTTTTATGCAACGAGTGTGGATACAGCAAGTATGGAAGATTTGAATTTAATTTTATGGCAAAGCCAAGTTTTACATTTGATAATATGGAGAACGATGAAGATATGAAGAGAGGTCTTGCTGCAATAGAATCTGAATCAGAAAATGCTCACAGGAGATATCAACAACTCTTAGGGTACAAGAAACCCCTGCTGAAAATTGTTTCAAGCATTGGTGAGAATGAAATGGACTCGCAACAGAAGGATTCTGTTCAGCAAATGATGGTCTCACTTCCAGGACCATCGTGCAAGATTAATCGTAAAATTGCTCTCCTTGGGGTTTTATATGGTGAAAAATGCAAAGCAGCCTTTGATTCTGTCAGTAAAAGTGTCCAGACGCTTCAAGGTCTTCGTCGGGTTTTAATGACGTATTTGCACCAGAAACATACTGATGACGGGTTTCCGGCTTCAAGATTTGTGATTTCTAGATCTCCTAATAATTGCTATGGCTGTGCGACTACATTTGTAACTCAATGTCTTGAGATATTGCAGGTGTTATCAAAGCATCAGAGTTCGAAGAAACAACTTGTTAGCCTGGGTATATTATCTGAGCTGTTCGAGAATAATATTCATCAAGGGCCCAAAACTGCTCGAATACAAGCTAGGGCAGTTCTTTGTTCTTTCTCCGAGGGTGACGTAAATGCAGTGAATGGACTGAACAATCTAATCCAAAAGAAAGTCATGTACTGCCTTGAACACCATCGTTCTATGGACATCGCATTGGCAACTCGAGAAGAGTTATCACTGCTCTCAGAAGTTTGTTCTTTGGCTGATGAATTCTGGGAAGCTAGATTGCGAGTTGTTTTCCAGCTGTTATTTTCATCCATTAAGTCGGGTGCCAAACATCCAGCAATTGCTGAGCACATCATTCTTCCATGTCTGAGGATCATATCTCAAGCTTGTACTCCTCCTAAATCTGATACTGTAGACAAGGAGCAGAGGATGGGAAAATTGACATCTGTTTCACAAAATAAGGATGAAAACGCTACAAATATATCTGGATCTTTCAGTGGACCTGTTAGTGGGAATAAGTCTGCACCTGAATCACTTGAACATAATTGGGATTCTTCTCATAGGACTCAGGATATTCAATTGCTGAGTTATGCAGAGTGGGAAAAGGGAGCATCATATCTTGACTTTGTTAGAAGGCAGTACAAGGTGTCTCAGGTGTGTAAAGGTACAGTTCAAAGATCTCGAACACAAAAAGGCGATTATTTGTCCCTGAAGTATGCGCTTAAGTGGAAGCGGTTTGTATGTAGAAATGCTAAAAGTGATTTGTCAGCTTTTGAGCTGGGGTCATGGGTTACAGAACTTGTGCTATGTGCCTGTTCCCAGTCTATAAGATCGGAGATGTGTATGCTGATAAGTTTGCTTTGTGCTCAAAGTTCATCAAGACGATTTCGACTATTGGATTTACTGGTGTCTCTGTTGCCAGCAACTCTTTCTGCTGGCGAGAGTGCTGCTGAATATTTTGATTTACTTTTCAAGATGGTAGATTCAGAAGATGCGCGCTTGTTTTTGACTGTTCGAGGGTGCTTACGTACAATTTGCCAATTAATTTCCCAAGAAGTGGGCAATGTTGAGTCTCTGGAGAGAAGCCTCCATATCGATATTTCCCAGGGATTTATTCTTCACAAGCTCATAGAGCTCCTTGGGAAATTTCTAGAGATCCCCAACATTAGATCAAGGTATAACTTCTTCCTTAAGCATCTGTTTTTTAGACTACTTCTGTTTAAGCTTATACAATGTGATCTTATTGTTAGATTCATGCGGGATAATCTACTCTCTGAAGTTCTTGAGGCTCTCATTGTGATTCGTGGTTTGGTAGTACAGAAGACAAAATTGATTAGCGATTGTAATCGGCTTTTGAAAGATCTCTTGGACAGCCTTCTGCTGGAAAGCAATGAGAACAAGAGGCAATTTATTCGAGCCTGCATTTGTGGCTTGCAAATCCATGGAGAGGAAAGGAAAGGGCGGACTTGTTTGTTTATTCTAGAGCAGCTCTGCAATCTGATCTCTCCCTCAAAGCCAGAGCCAGTGTATCTCTTGGTTCTAAACAAGGCACACACGCAAGAAGAATTTATTAGGGGATCCATGACAAAGAATCCTTACTCCAGTGCTGAGATTGGTCCATTAATGCGTGATGTCAAAAATAAAATTTGTCACCAATTGGATTTACTTGGTTTTCTTGAAGATGATTACGGCATGGAGTTGCTAGTTGCTGGAAATATAATTTCTCTTGATTTGAGCATAGCACTAGTCTATGAGCAAGTGTGGAAGAAGTCTAACCAGTCTTCAAATGCCATATCTAATACTGCACTAATATCTACCACCGCTGCAAGAGACTCTCCTCCTATGACAGTTACTTACCGGCTTCAGGGGCTAGATGGTGAAGCAACGGAACCTATGATTAAGGAATTGGAAGAGGACAGAGAGGAATCGCAAGATCCAGAACTGGAATTTGCTATAGCAGGTGCAGTTCGTGAGTATGGAGGTCTGGAAATTTTGTTGGGCATGATCCAGCGCATATGGGATAATTTCAAGTCAAACCAAGAGCAGTTGGTTGCAGTTCTTAATCTTCTTATGCATTGTTGCAAAATAAGAGAGAACAGGCGTGCTTTATTAAGGCTTGGAGCTCTCGGATTACTTCTAGAAACGGCAAGGCGTGCCTTCTCTGTGGATGCCATGGAGTCAGCTGAAGGCATTCTCTTGATTGTGGAGAGTCTAACAATTGAAGCGAATGAAAGTGAAAGTATTAGCATTGGACAAAGTGCTCTTACCGTCACCAGTGAACAAACTGGTACTGGTGAACAGGCCAAAAAAATTGTCCTCATGTTTCTGGAGAGATTATCTCATCCTTTTGGTTCTAAGAAATCAAACAAACAGCAGAGGAACACTGAAATGGTTGCTAGAATCTTGCCTTACTTGACCTATGGTGAACCTGCTGCTATGGATGCACTCATCCAACATTTCACTCCATATCTGAATGATTGGGATGAGTTTGACCGATTACAGAAACAGCATGAAGACAATCCAGAGGATAAGAGCATCTCTGAGCAAGCTGCCAAGCAGAGATTTACTGTGGAAAATTTCGTTAGAGTCTCAGAGTCACTGAAGACAAGTTCCTGTGGGGAGAGACTGAAGGATATTATTTTGGAAAAGGGCATTACTGGCCTTGCAATTAAGCATCTGAGAGATAGTTTTGCTGTTGCAGGACAGACCGGTTTCAGATCTAGTGTGGAATGGGCATTTGCCTTGAAACGTCCTTCCATTCCGCTTATATTGTCTATGCTAAGGGGTTTGTCAATGGGGCATTTGGCTACACAGAGATGTATTGATGAAGGAAGGATCTTACCTGTGCTTCATGCTCTGGAAAGAGTTCCGGGAGAAAATGAGATTGGGGCAAGGGCTGAAAACTTGTTAGATACCCTCTCTAACAAGGAGGGAAATGGAGATGGATTCTTGGAGGATAAAGTACGAATGTTAAGACATGCCACAAGGGATGAAATGAGGCGACTTGCTTTGAAGAACAGAGAAGACATGCTACAGGGACTTGGAATGCGACAGGTGGCTTCAGATGGCGGTGAACGGATCATTGTCTCTAGACCAGCTCTTGAAGGTTTGGAAGATGTAGAGGAAGAGGAAGATGGGTTGGCATGCATGGTGTGCAGAGAGGGTTATAGTTTGAGGCCTACTGACTTGCTGGGTGTCTATTCATACAGCAAAAGGGTTAACCTTGGTGTTGGGACTTCAGGAAGTACTCGTGGGGAGTGTGTATATACAACCGTGAGTTATTTCAATATCATTCATTATCAATGCCATCAAGAGGCCAAAAGAACAGATGCTGGTTTGAAAATCCCAAAGAAAGAATGGGAAGGAGCAACACTCAGGAACAACGAATCCCTTTGCAATTCCTTGTTCCCTGTGAGAGGTCCGTCTGTCCCGTTAGCACAATATATCCGATATGTTGACCAGCATTGGGACAACCTAAATGCTCTTGGTCGTGCTGATGGAAACAGACTTCGGCTCTTGACATATGATATTGTTCTGGTATGGTTATTGCCTGTTGTCAGTTTGTGA

Protein sequence

MAEQSFVKLLDTIFLDDSSTSANTRKHFSSSDLLQLLRSDDSSIKLGLRQFYSILKAGLRDLGDGNFAFQSWTDPQIQAVCSIAHAIASASRSLTVDQAEAIVVAVIKKSLELVFCYLEKSEFKCDDFSIQNNMLMILETILVDGMDKVSDFAQLCAKKSLMDLLKSTGGDCDATIEFDNTIECGSTGVCCSREEKQVGRLLMTIAAECVQADQLTSESGFSQPTFLEDMNKLIFLCQHWAVTHLACIQHLILICKELVVLPDALDEKTGSTSFRKRLSCSLRILKLLTDLSKKFPYIEYDAKMMQAFALFANSLPCLFGLCFEFANSHATVEGSFENTILLLLEEFLELVQVVFRNSYVSVNIQTCVVASILDNLSSSVWRYDASTANLKTPLVYFPRSVMVIIKLIQDLKGHKYHAFSFKDLETHHTSTLADLSVDIPKCYARLEIVPLHKNYKVEEILRMIFPLSKQWMDDLMHLLFFLYSEGVRLRPKIERSLSSMKSSSTVEQETAVCHEDEALFGDLFSESGRSVGSVDGYDLQHLAVNSTSSFCNLLLQAAKELLSFIKLCIFSPEWNASVFDDGCNKLNQNHIDILLSLLNCEGCCSDDKSSASCLPAHDERKSGHIHEICYRLLHGLLTRHALPDSLEEYLVKKILNAENGNFVYNDQTLSLLAHTLFRRTGVAGTLLRTQIYRQFVEFIIEKSKTISSNYSSLQEFMGTLPSVFHIEILLVAFHLSSEGEKREISSLIFSSIRAIDAPSTFSNCTELSMWGLLVSRLIIVLRHIIFHPHTCSSSLLFDFRSKLRDAPAFSSSLPYTLNDHLSSWGASVAKNIIGSSVESKPFFHSLINQLIDISSFPASLRQHDLTVECPWFNAGDIFSTFSWILGFWNGKQAVTVEDLIIERYIFVLCWDFPSMNALSHGGPLWSDPDTLDISNTTCFFYFSYLLLDHGGVIGEHMKFPQVVIGLLQRLHGGSILEDFKALGWNFLRNGAWLSLVLSFLSVGIWRYCSKNMIPTVGSLLTDTTVTDNEQANFAESLISSVITDSQVSILIRELSSVLSMYLQVYQKAFVATLSSSNDHATEFSPLLLFKHSEFDSCVQNKTLENYGTTSCLLESVFNLMSRLDEIVDKRTLGFLSRFCWESMFHGFPSHLETSSGILLSCVLSIGRIISVLAGLLRIVDVKRNIILETEVTRGILDAVMTIKFDKTFESVHGLCEGIYQSLNKELDGCSYGVLFLLKQLEGYLRHMNMRGASDSTIHELVIVKATDIMDNLRKDVSKSSVFQFYLGAEVVLEQVRELYTFQHGNLLVLLDSLDNCCSELVNLKVLGFFVELLSGEPCPKLKQEVQNKFLSMDLLSLSQWLEKRIFGLVAEDSSGGNVKGSSISLRESSMNFVFCLISSPSEPLALQLQSHIFEAALVSLDMAFLRFDISVSKSYFHFVVQLLKGDKSMKLLLERILILMGKLASDERLLPGLKYLFSFLEMILIESGSGKNVFERPSGKPLSRYAPEVGPLSSKSVGPRKNSETLVLSSNQEEGPASFECDATSAEEDEDDGTSDGEVASLDKDEEEDTNSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRGHRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRKYTGHGSAPVRGASNFQCFLPFSEEGDQLPESESDLEDDVSVTDTDKCLRPSVPRELLDGVSVLLEELDVEGRMLELCSCLLPTITNQRDPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSLVKSLLSVSIRGRLAVGEGDKVSIFDIRQLIEQTTVAPMTADKTNVKPLSKNVVRFEIVHLAFNPTVENYLAVAGYEDCQVLTLNHRGEVVDRLAIELALQGAHIKRMEWVPGSQVQLMVVTNRFVKIYDLSLDNISPMHYFTLPDDMVVDATLSTASQGRMFLIVLSENGRIFRLELSVLGNVGATPLKEIIEIQGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTEISVIYEEEQDRKLRPAGLHRWKELFAGSGLFVCFSSVKSNSALAVSMGAHDIYAQNLRHAGGSSLPLVGITAYKPLSKDKIHCLVLHDDGSLQIYTHTAVGVDASAYATAEKIKKLGSGILNNKVYASTNPEFPLDFFENTVCITADVRLGGDAIRNGDSEGAKQSLASEDGFLESPSSSGFKITVSNSNPDIVMVGFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLLADEEFSVTVGPAFNGTALPRIDSLEVYGRAKDEFGWKEKLDAVLDMEARALGSNSLLARSGKKRRSIQCAPIQQQVLADGLKVLSSYYLLRRSQGCPKLNDVNQELTKLKCKQLLETIYESDREPLLQSAACRVLQAIFPKKEIYYQVKDTMRLTGVVKSTSVLSSRLGVGGAAGGWIIEEFTSQMRAVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVISSVELIYCYAECLALHGPDTGRHSVAPAVVLFKKLLFSSSEAVQASSSLAISSRLLQVPFPKQTMLATDDGADIPLSAPVPTETTGTNPQVMIEEDAVASSVQYCCDGCSTVPILRRRWHCTICPDFDLCESCYEVLDADRLPSPHSRDHPMTAIPIEVDSLGDGNEYHFATEDINDSSLTSLIPDISVKNPVSSIHVLEPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWMETTSGVQAVPVMQLFYRLSSTMGGPFMNSLKSENLNLERLIKWFLDEINLNKPFEAKTRTSFGEVAILVFMFFTLMLRNWHQPGSDGPGAKPSTTTDTHDKNSTQVAPSTSVTAQSSMDDQGKNDFTSQLLRACSSIRQQSFVNYLMDVLQQLVHVFKSSTIDYDSGHGFHNGSGCGALLTVRKDLPAGNFSPFFSDSYAKAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKVYKIYSSKDLKLDAYQDVLCSYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTEVKRLFKYINKVGGFQNPMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRHADVLPFLLNGIFYFGEESVVQTLKLLNLAFYTGKDIGHSVQKSEAGDTGTSTNKSGTQTVDSRKKRKGEDGNDSALEKSYLDMEIMVNIFVDKGSNVLSHFIDCFLLEWNSSSVRAETKGVVCGIWHHGKQTFKETLLMALLQKVKTLPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSDVIRSIYQTLHSQNELLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLESLKSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNNWSLWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPVTDKHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNMENDEDMKRGLAAIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSLPGPSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASRFVISRSPNNCYGCATTFVTQCLEILQVLSKHQSSKKQLVSLGILSELFENNIHQGPKTARIQARAVLCSFSEGDVNAVNGLNNLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLADEFWEARLRVVFQLLFSSIKSGAKHPAIAEHIILPCLRIISQACTPPKSDTVDKEQRMGKLTSVSQNKDENATNISGSFSGPVSGNKSAPESLEHNWDSSHRTQDIQLLSYAEWEKGASYLDFVRRQYKVSQVCKGTVQRSRTQKGDYLSLKYALKWKRFVCRNAKSDLSAFELGSWVTELVLCACSQSIRSEMCMLISLLCAQSSSRRFRLLDLLVSLLPATLSAGESAAEYFDLLFKMVDSEDARLFLTVRGCLRTICQLISQEVGNVESLERSLHIDISQGFILHKLIELLGKFLEIPNIRSRYNFFLKHLFFRLLLFKLIQCDLIVRFMRDNLLSEVLEALIVIRGLVVQKTKLISDCNRLLKDLLDSLLLESNENKRQFIRACICGLQIHGEERKGRTCLFILEQLCNLISPSKPEPVYLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKICHQLDLLGFLEDDYGMELLVAGNIISLDLSIALVYEQVWKKSNQSSNAISNTALISTTAARDSPPMTVTYRLQGLDGEATEPMIKELEEDREESQDPELEFAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVLNLLMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESESISIGQSALTVTSEQTGTGEQAKKIVLMFLERLSHPFGSKKSNKQQRNTEMVARILPYLTYGEPAAMDALIQHFTPYLNDWDEFDRLQKQHEDNPEDKSISEQAAKQRFTVENFVRVSESLKTSSCGERLKDIILEKGITGLAIKHLRDSFAVAGQTGFRSSVEWAFALKRPSIPLILSMLRGLSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDKVRMLRHATRDEMRRLALKNREDMLQGLGMRQVASDGGERIIVSRPALEGLEDVEEEEDGLACMVCREGYSLRPTDLLGVYSYSKRVNLGVGTSGSTRGECVYTTVSYFNIIHYQCHQEAKRTDAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADGNRLRLLTYDIVLVWLLPVVSL
Homology
BLAST of HG10018299 vs. NCBI nr
Match: XP_038890252.1 (auxin transport protein BIG isoform X1 [Benincasa hispida])

HSP 1 Score: 9194.7 bits (23858), Expect = 0.0e+00
Identity = 4680/4873 (96.04%), Postives = 4751/4873 (97.50%), Query Frame = 0

Query: 1    MAEQSFVKLLDTIFLDDSSTSANTRKHFSSSDLLQLLRS-DDSSIKLGLRQFYSILKAGL 60
            MAEQSFVKLLDTIFLDDS+++ANT+K FSSSDLL LLRS DDSSIKLGLRQFYSIL AGL
Sbjct: 1    MAEQSFVKLLDTIFLDDSTSTANTKKPFSSSDLLHLLRSDDDSSIKLGLRQFYSILNAGL 60

Query: 61   RDLGDGNFAFQSWTDPQIQAVCSIAHAIASASRSLTVDQAEAIVVAVIKKSLELVFCYLE 120
            RDLGDGN AFQSWTDPQIQAVCSIAHAIASASRSLTVDQAEAIVVAVIKKSLELV CYLE
Sbjct: 61   RDLGDGNLAFQSWTDPQIQAVCSIAHAIASASRSLTVDQAEAIVVAVIKKSLELVLCYLE 120

Query: 121  KSEFKCDDFSIQNNMLMILETILVDGMDKVSDFAQLCAKKSLMDLLKSTGGDCDATIEFD 180
            KSEFKCDDFSIQ+NMLMILETILVDGMDKV+D AQLCAKK+L+DLLKSTGGDCDATIEF+
Sbjct: 121  KSEFKCDDFSIQSNMLMILETILVDGMDKVTDCAQLCAKKALIDLLKSTGGDCDATIEFE 180

Query: 181  NTIECGSTGVCCSREEKQVGRLLMTIAAECVQADQLTSESGFSQPTFLEDMNKLIFLCQH 240
            N IECG  GVCCSREEKQVGRLLMT+AAECVQADQLTSESGFS+PTFLEDMNKLIFL QH
Sbjct: 181  NAIECGFAGVCCSREEKQVGRLLMTVAAECVQADQLTSESGFSEPTFLEDMNKLIFLFQH 240

Query: 241  WAVTHLACIQHLILICKELVVLPDALDEKTGSTSFRKRLSCSLRILKLLTDLSKKFPYIE 300
            WAVTHLACIQ LILICKELVVLPDALDEKTGSTSFRKRLSCSLRILKLLTDLSKKFPYIE
Sbjct: 241  WAVTHLACIQRLILICKELVVLPDALDEKTGSTSFRKRLSCSLRILKLLTDLSKKFPYIE 300

Query: 301  YDAKMMQAFALFANSLPCLFGLCFEFANSHATVEGSFENTILLLLEEFLELVQVVFRNSY 360
            YDAK+MQAFALFANSLPCLFGLCFEFANSHA VEGS ENTILLLLEEFLELVQVVFRNSY
Sbjct: 301  YDAKLMQAFALFANSLPCLFGLCFEFANSHAIVEGSLENTILLLLEEFLELVQVVFRNSY 360

Query: 361  VSVNIQTCVVASILDNLSSSVWRYDASTANLKTPLVYFPRSVMVIIKLIQDLKGHKYHAF 420
            V VNIQTC+VASILDNLSSSVWRYDAS ANLK PLVYFPRSVMVIIKLIQDLKGHKYHAF
Sbjct: 361  VCVNIQTCIVASILDNLSSSVWRYDASAANLKPPLVYFPRSVMVIIKLIQDLKGHKYHAF 420

Query: 421  SFKDLETHHTSTLADLSVDIPKCYARLEIVPLHKNYKVEEILRMIFPLSKQWMDDLMHLL 480
            SFKDLE HHTSTLADLSVDIPKCYAR EIVPLHKNY V+EILRMIFPLSKQWMDDLMHLL
Sbjct: 421  SFKDLEMHHTSTLADLSVDIPKCYARSEIVPLHKNYTVDEILRMIFPLSKQWMDDLMHLL 480

Query: 481  FFLYSEGVRLRPKIERSLSSMKSSSTVEQETAVCHEDEALFGDLFSESGRSVGSVDGYDL 540
            FFLYSEGVRLRPKIERSLSSMKSSSTVEQETAVCHEDEALFGDLFSESGRSVGSVDGYDL
Sbjct: 481  FFLYSEGVRLRPKIERSLSSMKSSSTVEQETAVCHEDEALFGDLFSESGRSVGSVDGYDL 540

Query: 541  QHLAVNSTSSFCNLLLQAAKELLSFIKLCIFSPEWNASVFDDGCNKLNQNHIDILLSLLN 600
            QHLAVNSTSS CNLLLQA KELLSFIKLCIFSPEWNASVFDDGCNKLNQNH+DILLSLLN
Sbjct: 541  QHLAVNSTSSLCNLLLQATKELLSFIKLCIFSPEWNASVFDDGCNKLNQNHVDILLSLLN 600

Query: 601  CEGCCSDDKSSASCLPAHDERKSGHIHEICYRLLHGLLTRHALPDSLEEYLVKKILNAEN 660
            CEGCCSDDKSS+SCLPAHDERKSGHIHEICYRLLHGLLT HALPDSLEEYLVKKILNAEN
Sbjct: 601  CEGCCSDDKSSSSCLPAHDERKSGHIHEICYRLLHGLLTSHALPDSLEEYLVKKILNAEN 660

Query: 661  GNFVYNDQTLSLLAHTLFRRTGVAGTLLRTQIYRQFVEFIIEKSKTISSNYSSLQEFMGT 720
            GNFVYNDQTLSLLAHTLFRRTGVAGT LRTQIYRQFVEFII KSKTIS  YSSLQEFMGT
Sbjct: 661  GNFVYNDQTLSLLAHTLFRRTGVAGTQLRTQIYRQFVEFIIVKSKTISLKYSSLQEFMGT 720

Query: 721  LPSVFHIEILLVAFHLSSEGEKREISSLIFSSIRAIDAPSTFSNCTELSMWGLLVSRLII 780
            LPSVFHIEILLVAFHLSSEGEKREIS LIFSSIRAIDAPS+FS CTELSMWGLLVSRLII
Sbjct: 721  LPSVFHIEILLVAFHLSSEGEKREISCLIFSSIRAIDAPSSFSKCTELSMWGLLVSRLII 780

Query: 781  VLRHIIFHPHTCSSSLLFDFRSKLRDAPAFSSSLPYTLNDHLSSWGASVAKNIIGSSVES 840
            VLRHIIFHPHTCSSSLLFDFRSKLRDAPAFSSSLPYT+NDHLSSWGAS+AKNIIGSSVES
Sbjct: 781  VLRHIIFHPHTCSSSLLFDFRSKLRDAPAFSSSLPYTVNDHLSSWGASIAKNIIGSSVES 840

Query: 841  KPFFHSLINQLIDISSFPASLRQHDLTVECPWFNAGDIFSTFSWILGFWNGKQAVTVEDL 900
            +PFF+SLINQLIDISSFPASLRQHD T+ECPWFN  DIFSTFSWILGFWNGKQAV VEDL
Sbjct: 841  QPFFNSLINQLIDISSFPASLRQHDSTIECPWFNPSDIFSTFSWILGFWNGKQAVAVEDL 900

Query: 901  IIERYIFVLCWDFPSMNALSHGGPLWSDPDTLDISNTTCFFYFSYLLLDHGGVIGEHMKF 960
            IIERYIFVLCWDFPSMN LSHGG LWSD DTLDIS+TTCFFYFSYLLLDHG VIGE MKF
Sbjct: 901  IIERYIFVLCWDFPSMNVLSHGGTLWSDLDTLDISDTTCFFYFSYLLLDHGDVIGERMKF 960

Query: 961  PQVVIGLLQRLHGGSILEDFKALGWNFLRNGAWLSLVLSFLSVGIWRYCSKNMIPTVGSL 1020
            PQVVI LL+RLHGGS LEDFKALGWNFLRNGAWLSLVLSFLSVGIWRYC+KN IPTVGSL
Sbjct: 961  PQVVIDLLRRLHGGSTLEDFKALGWNFLRNGAWLSLVLSFLSVGIWRYCNKNTIPTVGSL 1020

Query: 1021 LTDTTVTDNEQANFAESLISSVITDSQVSILIRELSSVLSMYLQVYQKAFVATLSSSNDH 1080
             TDTTVTDNE ANFAESLISSVIT+SQVSILI ELS VLSMYLQVYQKAFVATLSSSNDH
Sbjct: 1021 WTDTTVTDNELANFAESLISSVITNSQVSILIGELSFVLSMYLQVYQKAFVATLSSSNDH 1080

Query: 1081 ATEFSPLLLFKHSEFDSCVQNKTLENYGTTSCLLESVFNLMSRLDEIVDKRTLGFLSRFC 1140
            ATEFSPLLLFKHSEFD C+QNKTLENYGTTSCLLESVFNLMSRLDEIVDK+TLGFLSR C
Sbjct: 1081 ATEFSPLLLFKHSEFDRCIQNKTLENYGTTSCLLESVFNLMSRLDEIVDKKTLGFLSRVC 1140

Query: 1141 WESMFHGFPSHLETSSGILLSCVLSIGRIISVLAGLLRIVDVKRNIILETEVTRGILDAV 1200
            WESMFHGFPSHLETSSGILLSCVLSIGRIISVLAGLLRIVDVK NIILETEVTRGILDAV
Sbjct: 1141 WESMFHGFPSHLETSSGILLSCVLSIGRIISVLAGLLRIVDVKPNIILETEVTRGILDAV 1200

Query: 1201 MTIKFDKTFESVHGLCEGIYQSLNKELDGCSYGVLFLLKQLEGYLRHMNMRGASDSTIHE 1260
            MTIKFDKTFESVHGLCEGIYQSLN ELDGCSYGVLFLLKQLEGYLRH+N RG SDSTIHE
Sbjct: 1201 MTIKFDKTFESVHGLCEGIYQSLNAELDGCSYGVLFLLKQLEGYLRHINTRGVSDSTIHE 1260

Query: 1261 LVIVKATDIMDNLRKDVSKSSVFQFYLGAEVVLEQVRELYTFQHGNLLVLLDSLDNCCSE 1320
            LVIVKATDIMD+LRKDVSKSSVFQFYLGAEVV EQVRELY FQHGNLLVLLDSLDNCCSE
Sbjct: 1261 LVIVKATDIMDSLRKDVSKSSVFQFYLGAEVVPEQVRELYAFQHGNLLVLLDSLDNCCSE 1320

Query: 1321 LVNLKVLGFFVELLSGEPCPKLKQEVQNKFLSMDLLSLSQWLEKRIFGLVAEDSSGGNVK 1380
            +VNLKVLGFF ELLSGEPC KLKQEVQNKFLSMDLLSLS+WLEKRIFGLVAEDSSG NVK
Sbjct: 1321 IVNLKVLGFFGELLSGEPCSKLKQEVQNKFLSMDLLSLSKWLEKRIFGLVAEDSSGVNVK 1380

Query: 1381 GSSISLRESSMNFVFCLISSPSEPLALQLQSHIFEAALVSLDMAFLRFDISVSKSYFHFV 1440
            GSS SLRESSMNFVFCLISSP+EPLALQLQSHIFEAALVSLDMAFLRFDISVSKSYFHFV
Sbjct: 1381 GSSTSLRESSMNFVFCLISSPTEPLALQLQSHIFEAALVSLDMAFLRFDISVSKSYFHFV 1440

Query: 1441 VQLLKGDKSMKLLLERILILMGKLASDERLLPGLKYLFSFLEMILIESGSGKNVFERPSG 1500
            VQLLKG+KSMKLLLERILILM KL SDERLLPGLKYLF FLEMILIESGSGKNVFERP+G
Sbjct: 1441 VQLLKGEKSMKLLLERILILMEKLVSDERLLPGLKYLFIFLEMILIESGSGKNVFERPTG 1500

Query: 1501 KPLSRYAPEVGPLSSKSVGPRKNSETLVLSSNQEEGPASFECDATSAEEDEDDGTSDGEV 1560
            KPLSRYAPEVGPLSSKSVGPRK+SETLV SS+QEEGPASFECDATSAEEDEDDGTSDGEV
Sbjct: 1501 KPLSRYAPEVGPLSSKSVGPRKSSETLVFSSSQEEGPASFECDATSAEEDEDDGTSDGEV 1560

Query: 1561 ASLDKDEEEDTNSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHR 1620
            ASLDKDEEEDTNSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHR
Sbjct: 1561 ASLDKDEEEDTNSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHR 1620

Query: 1621 GHRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRKYTGHGSAPVRGASNFQCFLPFSEEGDQ 1680
            GHRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRK+TGHGSAPVRGASNFQCFLPFSEEGDQ
Sbjct: 1621 GHRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRKFTGHGSAPVRGASNFQCFLPFSEEGDQ 1680

Query: 1681 LPESESDLEDDVSVTDTDKCLRPSVPRELLDGVSVLLEELDVEGRMLELCSCLLPTITNQ 1740
            LPESESDLEDDVSVTDTDKCLRPSVPRELLDG+SVLLEELDVEG MLELCS LLPTITNQ
Sbjct: 1681 LPESESDLEDDVSVTDTDKCLRPSVPRELLDGISVLLEELDVEGSMLELCSRLLPTITNQ 1740

Query: 1741 RDPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSL 1800
            RDPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSL
Sbjct: 1741 RDPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSL 1800

Query: 1801 VKSLLSVSIRGRLAVGEGDKVSIFDIRQLIEQTTVAPMTADKTNVKPLSKNVVRFEIVHL 1860
            VKSLLSVSIRGRLAVGEGDKVSIFD+RQLIEQ TVAPMTADKTNVKPLSKNVVRFEIVHL
Sbjct: 1801 VKSLLSVSIRGRLAVGEGDKVSIFDVRQLIEQATVAPMTADKTNVKPLSKNVVRFEIVHL 1860

Query: 1861 AFNPTVENYLAVAGYEDCQVLTLNHRGEVVDRLAIELALQGAHIKRMEWVPGSQVQLMVV 1920
            AFNPTVENYLAVAGYEDCQVLTLNHRGEVVDRLAIELALQGAHI+RMEWVPGSQVQLMVV
Sbjct: 1861 AFNPTVENYLAVAGYEDCQVLTLNHRGEVVDRLAIELALQGAHIRRMEWVPGSQVQLMVV 1920

Query: 1921 TNRFVKIYDLSLDNISPMHYFTLPDDMVVDATLSTASQGRMFLIVLSENGRIFRLELSVL 1980
            TN+FVKIYDLSLDNISPMHYFTLPDDMVVDATL TASQGRMFLIVLSENGRIFRLELSVL
Sbjct: 1921 TNKFVKIYDLSLDNISPMHYFTLPDDMVVDATLFTASQGRMFLIVLSENGRIFRLELSVL 1980

Query: 1981 GNVGATPLKEIIEIQGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTEIS 2040
            GNVGATPLKEII IQGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLT IS
Sbjct: 1981 GNVGATPLKEIIHIQGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTGIS 2040

Query: 2041 VIYEEEQDRKLRPAGLHRWKELFAGSGLFVCFSSVKSNSALAVSMGAHDIYAQNLRHAGG 2100
            VIYEEEQDRKLRPAGLHRWKELFAGSGLFVCFSSVKSNSAL VSMGAH+IYAQNL+HAGG
Sbjct: 2041 VIYEEEQDRKLRPAGLHRWKELFAGSGLFVCFSSVKSNSALVVSMGAHEIYAQNLKHAGG 2100

Query: 2101 SSLPLVGITAYKPLSKDKIHCLVLHDDGSLQIYTHTAVGVDASAYATAEKIKKLGSGILN 2160
            SSLPLVGITAYKPLSKDKIHC VLHDDGSLQIYTHTAVGVDASA ATAEKIKKLGSGILN
Sbjct: 2101 SSLPLVGITAYKPLSKDKIHCFVLHDDGSLQIYTHTAVGVDASANATAEKIKKLGSGILN 2160

Query: 2161 NKVYASTNPEFPLDFFENTVCITADVRLGGDAIRNGDSEGAKQSLASEDGFLESPSSSGF 2220
            NKVYASTNPEFPLDFFE TVCITADVRLGGDAIRNGDSEGAKQSLASEDGFLESPSSSGF
Sbjct: 2161 NKVYASTNPEFPLDFFEKTVCITADVRLGGDAIRNGDSEGAKQSLASEDGFLESPSSSGF 2220

Query: 2221 KITVSNSNPDIVMVGFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLL 2280
            KITVSNSNPDIVMVGFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLL
Sbjct: 2221 KITVSNSNPDIVMVGFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLL 2280

Query: 2281 ADEEFSVTVGPAFNGTALPRIDSLEVYGRAKDEFGWKEKLDAVLDMEARALGSNSLLARS 2340
            ADEEFSVTVGPAFNGTALPRIDSLEVYGRAKDEFGWKEKLDAVLDMEARALGSNSLLARS
Sbjct: 2281 ADEEFSVTVGPAFNGTALPRIDSLEVYGRAKDEFGWKEKLDAVLDMEARALGSNSLLARS 2340

Query: 2341 GKKRRSIQCAPIQQQVLADGLKVLSSYYLLRRSQGCPKLNDVNQELTKLKCKQLLETIYE 2400
            GKKRRSIQCAPIQQQVLADGLKVLSSYYLLRRSQGCPKL+D NQELTKLKCKQLLETIYE
Sbjct: 2341 GKKRRSIQCAPIQQQVLADGLKVLSSYYLLRRSQGCPKLDDANQELTKLKCKQLLETIYE 2400

Query: 2401 SDREPLLQSAACRVLQAIFPKKEIYYQVKDTMRLTGVVKSTSVLSSRLGVGGAAGGWIIE 2460
            SDREPLLQSAACRVLQAIFPKKE YYQVKDTMRLTGVVKSTSVLSSRLGVGGAAGGWIIE
Sbjct: 2401 SDREPLLQSAACRVLQAIFPKKETYYQVKDTMRLTGVVKSTSVLSSRLGVGGAAGGWIIE 2460

Query: 2461 EFTSQMRAVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVISS 2520
            EFTSQMRAVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQ LNNIVISS
Sbjct: 2461 EFTSQMRAVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQILNNIVISS 2520

Query: 2521 VELIYCYAECLALHGPDTGRHSVAPAVVLFKKLLFSSSEAVQASSSLAISSRLLQVPFPK 2580
            VELIYCYAECLALHGPDTGRHSVAPAVVLFKKLLFSSSEAVQASSSLAISSRLLQVPFPK
Sbjct: 2521 VELIYCYAECLALHGPDTGRHSVAPAVVLFKKLLFSSSEAVQASSSLAISSRLLQVPFPK 2580

Query: 2581 QTMLATDDGADIPLSAPVPTETTGTNPQVMIEEDAVASSVQYCCDGCSTVPILRRRWHCT 2640
            QTMLATDDGADIPLSAPV TETTGTNPQVMIEEDAVASSVQYCCDGCS VPILRRRWHCT
Sbjct: 2581 QTMLATDDGADIPLSAPVTTETTGTNPQVMIEEDAVASSVQYCCDGCSKVPILRRRWHCT 2640

Query: 2641 ICPDFDLCESCYEVLDADRLPSPHSRDHPMTAIPIEVDSLGDGNEYHFATEDINDSSLTS 2700
            +CPDFDLCESCYEVLDADRLPSPHSRDHPMTAIPIEV+S+GDGNEYHFATEDINDSSLTS
Sbjct: 2641 VCPDFDLCESCYEVLDADRLPSPHSRDHPMTAIPIEVESVGDGNEYHFATEDINDSSLTS 2700

Query: 2701 LI-PDISVKNPVSSIHVLEPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWME 2760
             +  DISVKNPVSSIHVLEPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWME
Sbjct: 2701 SVRADISVKNPVSSIHVLEPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWME 2760

Query: 2761 TTSGVQAVPVMQLFYRLSSTMGGPFMNSLKSENLNLERLIKWFLDEINLNKPFEAKTRTS 2820
            TTSGVQAVPVMQLFYRLSSTMGGPFMNSLKSENLNLERLIKWFLDEI+LNKPFEAK RTS
Sbjct: 2761 TTSGVQAVPVMQLFYRLSSTMGGPFMNSLKSENLNLERLIKWFLDEIDLNKPFEAKARTS 2820

Query: 2821 FGEVAILVFMFFTLMLRNWHQPGSDGPGAKPSTTTDTHDKNSTQVAPSTSVTAQSSMDDQ 2880
            FGE+AILVFMFFTLMLRNWHQPGSDGPGAKPST  DTHDK+STQVAPSTSVTAQSS+DDQ
Sbjct: 2821 FGEIAILVFMFFTLMLRNWHQPGSDGPGAKPSTAADTHDKSSTQVAPSTSVTAQSSVDDQ 2880

Query: 2881 GKNDFTSQLLRACSSIRQQSFVNYLMDVLQQLVHVFKSSTIDYDSGHGFHNGSGCGALLT 2940
            GKNDFTSQLLRACSSIRQQSFVNYLMDVLQQLVHVFKSSTIDYDSGHGF+NGSGCGALLT
Sbjct: 2881 GKNDFTSQLLRACSSIRQQSFVNYLMDVLQQLVHVFKSSTIDYDSGHGFNNGSGCGALLT 2940

Query: 2941 VRKDLPAGNFSPFFSDSYAKAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKV 3000
            VRKDLPAGNFSPFFSDSYAKAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKV
Sbjct: 2941 VRKDLPAGNFSPFFSDSYAKAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKV 3000

Query: 3001 YKIYSSKDLKLDAYQDVLCSYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTE 3060
            YKIYSSKDLKLDAYQDVLCSYINNPNTSFVRRY+RRLFLHICGSKSHYYSIRDSWQFSTE
Sbjct: 3001 YKIYSSKDLKLDAYQDVLCSYINNPNTSFVRRYSRRLFLHICGSKSHYYSIRDSWQFSTE 3060

Query: 3061 VKRLFKYINKVGGFQNPMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRHADVLPFLLN 3120
            VK+LFKYINKVGGFQNPMSYERSVKIVKCLTT+AEVAA+RPRNWQKYCLRH DVLPFLLN
Sbjct: 3061 VKKLFKYINKVGGFQNPMSYERSVKIVKCLTTLAEVAASRPRNWQKYCLRHGDVLPFLLN 3120

Query: 3121 GIFYFGEESVVQTLKLLNLAFYTGKDIGHSVQKSEAGDTGTSTNKSGTQTVDSRKKRKGE 3180
            GIFY GEESV+QTLKLLNLAFYTGKD G+S+QKSEAGD+GTSTNKSGTQ VDSRKKRKGE
Sbjct: 3121 GIFYIGEESVIQTLKLLNLAFYTGKDTGYSIQKSEAGDSGTSTNKSGTQPVDSRKKRKGE 3180

Query: 3181 DGNDSALEKSYLDMEIMVNIFVDKGSNVLSHFIDCFLLEWNSSSVRAETKGVVCGIWHHG 3240
            DGNDSALEKSYLDME MVNIF+DKGSNVLSHFIDCFLLEWNSSSVRAE KGVV GIWHHG
Sbjct: 3181 DGNDSALEKSYLDMETMVNIFIDKGSNVLSHFIDCFLLEWNSSSVRAEAKGVVYGIWHHG 3240

Query: 3241 KQTFKETLLMALLQKVKTLPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSD 3300
            KQTFKETLLMALLQKV+TLPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSD
Sbjct: 3241 KQTFKETLLMALLQKVRTLPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSD 3300

Query: 3301 VIRSIYQTLHSQNELLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLE 3360
            VIRSIYQTLHSQNELLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLE
Sbjct: 3301 VIRSIYQTLHSQNELLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLE 3360

Query: 3361 SLKSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNN 3420
            SLKSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNN
Sbjct: 3361 SLKSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNN 3420

Query: 3421 WSLWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPV 3480
            WSLWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPV
Sbjct: 3421 WSLWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPV 3480

Query: 3481 TDKHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNME 3540
            TDKHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNME
Sbjct: 3481 TDKHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNME 3540

Query: 3541 NDEDMKRGLAAIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSL 3600
            NDEDMKRGLAAIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSL
Sbjct: 3541 NDEDMKRGLAAIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSL 3600

Query: 3601 PGPSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASR 3660
            PGPSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASR
Sbjct: 3601 PGPSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASR 3660

Query: 3661 FVISRSPNNCYGCATTFVTQCLEILQVLSKHQSSKKQLVSLGILSELFENNIHQGPKTAR 3720
            FVISRSPNNCYGCATTFVTQCLEILQVLS+HQSSKKQLVSLGILSELFENNIHQGPKTAR
Sbjct: 3661 FVISRSPNNCYGCATTFVTQCLEILQVLSRHQSSKKQLVSLGILSELFENNIHQGPKTAR 3720

Query: 3721 IQARAVLCSFSEGDVNAVNGLNNLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLAD 3780
            IQARAVLCSFSEGDV+AV+GLNNLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLAD
Sbjct: 3721 IQARAVLCSFSEGDVHAVSGLNNLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLAD 3780

Query: 3781 EFWEARLRVVFQLLFSSIKSGAKHPAIAEHIILPCLRIISQACTPPKSDTVDKEQRMGKL 3840
            EFWEARLRVVFQLLFSSIKSGAKHPAIAEHIILPCLRIISQACTPPKSDTVDKEQRMGKL
Sbjct: 3781 EFWEARLRVVFQLLFSSIKSGAKHPAIAEHIILPCLRIISQACTPPKSDTVDKEQRMGKL 3840

Query: 3841 TSVSQNKDENATNISGSFSGPVSGNKSAPESLEHNWDSSHRTQDIQLLSYAEWEKGASYL 3900
            TSVSQNKDENATNISGSFSGPVSGNKSAPESLEHNWDSS RTQDIQLLSYAEWEKGASYL
Sbjct: 3841 TSVSQNKDENATNISGSFSGPVSGNKSAPESLEHNWDSSQRTQDIQLLSYAEWEKGASYL 3900

Query: 3901 DFVRRQYKVSQVCKGTVQRSRTQKGDYLSLKYALKWKRFVCRNAKSDLSAFELGSWVTEL 3960
            DFVRRQYKVSQV KGTVQRSRT KGDYLSLKYALKWKRFVCRNAKSDLS FELGSWVTEL
Sbjct: 3901 DFVRRQYKVSQVFKGTVQRSRTHKGDYLSLKYALKWKRFVCRNAKSDLSTFELGSWVTEL 3960

Query: 3961 VLCACSQSIRSEMCMLISLLCAQSSSRRFRLLDLLVSLLPATLSAGESAAEYFDLLFKMV 4020
            VLCACSQSIRSEMCMLISLLCAQSSSRRFRLLDLLVSLLPATLSAGESAAEYF+LLFKMV
Sbjct: 3961 VLCACSQSIRSEMCMLISLLCAQSSSRRFRLLDLLVSLLPATLSAGESAAEYFELLFKMV 4020

Query: 4021 DSEDARLFLTVRGCLRTICQLISQEVGNVESLERSLHIDISQGFILHKLIELLGKFLEIP 4080
            DSEDARLFLTVRGCLRTICQLISQEVGNVESLERSLHIDISQGFILHKLIELLGKFLEIP
Sbjct: 4021 DSEDARLFLTVRGCLRTICQLISQEVGNVESLERSLHIDISQGFILHKLIELLGKFLEIP 4080

Query: 4081 NIRSRYNFFLKHLFFRLLLFKLIQCDLIVRFMRDNLLSEVLEALIVIRGLVVQKTKLISD 4140
            NIRS                         RFMRDNLLSEVLEALIVIRGLVVQKTKLISD
Sbjct: 4081 NIRS-------------------------RFMRDNLLSEVLEALIVIRGLVVQKTKLISD 4140

Query: 4141 CNRLLKDLLDSLLLESNENKRQFIRACICGLQIHGEERKGRTCLFILEQLCNLISPSKPE 4200
            CNRLLKDLLDSLLLESNENKRQFIRACICGLQIHGEERKGRTCLFILEQLCNLISPSKPE
Sbjct: 4141 CNRLLKDLLDSLLLESNENKRQFIRACICGLQIHGEERKGRTCLFILEQLCNLISPSKPE 4200

Query: 4201 PVYLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKICHQLDLLGFLEDDYGMEL 4260
            PVYLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKICHQLDLL FLEDDYGMEL
Sbjct: 4201 PVYLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKICHQLDLLSFLEDDYGMEL 4260

Query: 4261 LVAGNIISLDLSIALVYEQVWKKSNQSSNAISNTALISTTAARDSPPMTVTYRLQGLDGE 4320
            LVAGNIISLDLSIALVYEQVWKKSNQSSNAISNTAL+STTAARDSPPMTVTYRLQGLDGE
Sbjct: 4261 LVAGNIISLDLSIALVYEQVWKKSNQSSNAISNTALMSTTAARDSPPMTVTYRLQGLDGE 4320

Query: 4321 ATEPMIKELEEDREESQDPELEFAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVL 4380
            ATEPMIKELEEDREESQDPELEFAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVL
Sbjct: 4321 ATEPMIKELEEDREESQDPELEFAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVL 4380

Query: 4381 NLLMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESES 4440
            NLLMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESES
Sbjct: 4381 NLLMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESES 4440

Query: 4441 ISIGQSALTVTSEQTGTGEQAKKIVLMFLERLSHPFGSKKSNKQQRNTEMVARILPYLTY 4500
            ISIGQSALTVTSEQ+GTGEQAKKIVLMFLERLSHPFG KKSNKQQRNTEMVARILPYLTY
Sbjct: 4441 ISIGQSALTVTSEQSGTGEQAKKIVLMFLERLSHPFGLKKSNKQQRNTEMVARILPYLTY 4500

Query: 4501 GEPAAMDALIQHFTPYLNDWDEFDRLQKQHEDNPEDKSISEQAAKQRFTVENFVRVSESL 4560
            GEPAAMDALIQHFTPYLN WDEFDRLQKQHEDNPEDKSISEQAAKQRFTVENFVRVSESL
Sbjct: 4501 GEPAAMDALIQHFTPYLNAWDEFDRLQKQHEDNPEDKSISEQAAKQRFTVENFVRVSESL 4560

Query: 4561 KTSSCGERLKDIILEKGITGLAIKHLRDSFAVAGQTGFRSSVEWAFALKRPSIPLILSML 4620
            KTSSCGERLKDIILEKGITGLAIKHLRDSFAVAGQTGFRSSVEWAFALKRPSIPLILSML
Sbjct: 4561 KTSSCGERLKDIILEKGITGLAIKHLRDSFAVAGQTGFRSSVEWAFALKRPSIPLILSML 4620

Query: 4621 RGLSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDK 4680
            RGLSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDK
Sbjct: 4621 RGLSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDK 4680

Query: 4681 VRMLRHATRDEMRRLALKNREDMLQGLGMRQVASDGGERIIVSRPALEGLEDVEEEEDGL 4740
            VRMLRHATRDEMRRLALKNREDMLQGLGMRQVASDGGERIIVSRPALEGLEDV EEEDGL
Sbjct: 4681 VRMLRHATRDEMRRLALKNREDMLQGLGMRQVASDGGERIIVSRPALEGLEDVLEEEDGL 4740

Query: 4741 ACMVCREGYSLRPTDLLGVYSYSKRVNLGVGTSGSTRGECVYTTVSYFNIIHYQCHQEAK 4800
            ACMVCREGYSLRPTDLLGVYSYSKRVNLGVGTSGSTRGECVYTTVSYFNIIHYQCHQEAK
Sbjct: 4741 ACMVCREGYSLRPTDLLGVYSYSKRVNLGVGTSGSTRGECVYTTVSYFNIIHYQCHQEAK 4800

Query: 4801 RTDAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADG 4860
            RTDAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADG
Sbjct: 4801 RTDAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADG 4848

Query: 4861 NRLRLLTYDIVLV 4872
            NRLRLLTYDIVL+
Sbjct: 4861 NRLRLLTYDIVLM 4848

BLAST of HG10018299 vs. NCBI nr
Match: XP_008459406.1 (PREDICTED: auxin transport protein BIG [Cucumis melo])

HSP 1 Score: 9174.7 bits (23806), Expect = 0.0e+00
Identity = 4661/4871 (95.69%), Postives = 4738/4871 (97.27%), Query Frame = 0

Query: 1    MAEQSFVKLLDTIFLDDSSTSANTRKHFSSSDLLQLLRSDDSSIKLGLRQFYSILKAGLR 60
            MAE+SFVKLLDTIFLDDSST+ NT+K FSSSDLLQLLRSDDSS+KLGLRQFYSIL+ GLR
Sbjct: 1    MAEESFVKLLDTIFLDDSSTTVNTKKPFSSSDLLQLLRSDDSSVKLGLRQFYSILEVGLR 60

Query: 61   DLGDGNFAFQSWTDPQIQAVCSIAHAIASASRSLTVDQAEAIVVAVIKKSLELVFCYLEK 120
            DLGDGNFAFQSWTDPQIQAVCSIA+AIASASRSLTVDQAEAIVVAVIKKSLE VFCYLEK
Sbjct: 61   DLGDGNFAFQSWTDPQIQAVCSIAYAIASASRSLTVDQAEAIVVAVIKKSLEFVFCYLEK 120

Query: 121  SEFKCDDFSIQNNMLMILETILVDGMDKVSDFAQLCAKKSLMDLLKSTGGDCDATIEFDN 180
            SEFKCDDFSIQN MLMILETILVDGMDKVSD AQ C KK L+DLLKS GGD DATIEF+N
Sbjct: 121  SEFKCDDFSIQNTMLMILETILVDGMDKVSDCAQHCTKKDLIDLLKSFGGDFDATIEFNN 180

Query: 181  TIECGSTGVCCSREEKQVGRLLMTIAAECVQADQLTSESGFSQPTFLEDMNKLIFLCQHW 240
            T ECG TGVCCSREEKQVGRLLMTIAAEC QAD LTSE GFS+PTF E+MNKLIFLCQHW
Sbjct: 181  TAECGFTGVCCSREEKQVGRLLMTIAAECEQADHLTSEPGFSEPTFFENMNKLIFLCQHW 240

Query: 241  AVTHLACIQHLILICKELVVLPDALDEKTGSTSFRKRLSCSLRILKLLTDLSKKFPYIEY 300
            AVTHLACIQHLILICKELVVLPDALDEKTGSTSFRKRLSCSLRILKLLTDLSKKFPYIEY
Sbjct: 241  AVTHLACIQHLILICKELVVLPDALDEKTGSTSFRKRLSCSLRILKLLTDLSKKFPYIEY 300

Query: 301  DAKMMQAFALFANSLPCLFGLCFEFANSHATVEGSFENTILLLLEEFLELVQVVFRNSYV 360
            D K+MQAFAL ANSLPCLFGLCFEFANSHAT E SFENTILLLLEEFLELVQVVFRNSY+
Sbjct: 301  DDKLMQAFALLANSLPCLFGLCFEFANSHATGESSFENTILLLLEEFLELVQVVFRNSYI 360

Query: 361  SVNIQTCVVASILDNLSSSVWRYDASTANLKTPLVYFPRSVMVIIKLIQDLKGHKYHAFS 420
             VNIQTC+VASILDNLSSSVWRYDASTANLK PLVYFPR VMVIIKLIQDLKGHKYHAFS
Sbjct: 361  CVNIQTCIVASILDNLSSSVWRYDASTANLKPPLVYFPRGVMVIIKLIQDLKGHKYHAFS 420

Query: 421  FKDLETHHTSTLADLSVDIPKCYARLEIVPLHKNYKVEEILRMIFPLSKQWMDDLMHLLF 480
            FKDLE H  STLA+LSVD+PKC+A LE VPLHKNY VEEILRMIFP S+QWMDDLMHLLF
Sbjct: 421  FKDLEMHQMSTLAELSVDLPKCHAPLETVPLHKNYTVEEILRMIFPPSRQWMDDLMHLLF 480

Query: 481  FLYSEGVRLRPKIERSLSSMKSSSTVEQETAVCHEDEALFGDLFSESGRSVGSVDGYDLQ 540
            FLYSEG+RLRPKIERSLSSMKSSSTVEQE AVCHEDEALFGDLFSESGRSVGSVDGYDLQ
Sbjct: 481  FLYSEGMRLRPKIERSLSSMKSSSTVEQEAAVCHEDEALFGDLFSESGRSVGSVDGYDLQ 540

Query: 541  HLAVNSTSSFCNLLLQAAKELLSFIKLCIFSPEWNASVFDDGCNKLNQNHIDILLSLLNC 600
            HLAVNSTSSFCNLLLQAAKELLSFIKLCIFSPEWNASVFDDGCNKLNQNHIDILLSLLNC
Sbjct: 541  HLAVNSTSSFCNLLLQAAKELLSFIKLCIFSPEWNASVFDDGCNKLNQNHIDILLSLLNC 600

Query: 601  EGCCSDDKSSASCLPAHDERKSGHIHEICYRLLHGLLTRHALPDSLEEYLVKKILNAENG 660
            EGCCSDDKSSASCLPAHDERKSGHIHEICYRLLHGLLTRHALPDSLEEYLVKKILNAENG
Sbjct: 601  EGCCSDDKSSASCLPAHDERKSGHIHEICYRLLHGLLTRHALPDSLEEYLVKKILNAENG 660

Query: 661  NFVYNDQTLSLLAHTLFRRTGVAGTLLRTQIYRQFVEFIIEKSKTISSNYSSLQEFMGTL 720
            N VYNDQTLSLLAHTLFRRTGVAGT LRTQIYRQFVEFIIEKSKTIS  YSSLQEFMGTL
Sbjct: 661  NSVYNDQTLSLLAHTLFRRTGVAGTQLRTQIYRQFVEFIIEKSKTISLKYSSLQEFMGTL 720

Query: 721  PSVFHIEILLVAFHLSSEGEKREISSLIFSSIRAIDAPSTFSNCTELSMWGLLVSRLIIV 780
            PSVFHIEILLVAFHL SEGEKREISSLIFSSIRAIDAPSTFSNCTELSMWGLLVSRLI+V
Sbjct: 721  PSVFHIEILLVAFHLFSEGEKREISSLIFSSIRAIDAPSTFSNCTELSMWGLLVSRLIVV 780

Query: 781  LRHIIFHPHTCSSSLLFDFRSKLRDAPAFSSSLPYTLNDHLSSWGASVAKNIIGSSVESK 840
            LRHIIFHPHTCSSSLLFDFRSKLRDAPAFSS LPYT+NDHLSSWGASVAK+IIGSS+ESK
Sbjct: 781  LRHIIFHPHTCSSSLLFDFRSKLRDAPAFSSHLPYTVNDHLSSWGASVAKSIIGSSMESK 840

Query: 841  PFFHSLINQLIDISSFPASLRQHDLTVECPWFNAGDIFSTFSWILGFWNGKQAVTVEDLI 900
            PF +SLINQLIDISSFPASLRQHDLT+ECPWFN  DIFSTFSWILGFWNGKQAVTVEDLI
Sbjct: 841  PFLNSLINQLIDISSFPASLRQHDLTIECPWFNPSDIFSTFSWILGFWNGKQAVTVEDLI 900

Query: 901  IERYIFVLCWDFPSMNALSHGGPLWSDPDTLDISNTTCFFYFSYLLLDHGGVIGEHMKFP 960
            IERYIFVLCWDFPS NALSHGGPLWSD D LDIS T CFFYFSYLLLDHGGVI EHMKFP
Sbjct: 901  IERYIFVLCWDFPSTNALSHGGPLWSDLDALDISKTACFFYFSYLLLDHGGVIDEHMKFP 960

Query: 961  QVVIGLLQRLHGGSILEDFKALGWNFLRNGAWLSLVLSFLSVGIWRYCSKNMIPTVGSLL 1020
            QVVIGLL+RLHGGS+LEDFKALGWNFLRNG WLSL+LSFL VGI RYCSKN IPTVGS L
Sbjct: 961  QVVIGLLRRLHGGSVLEDFKALGWNFLRNGTWLSLILSFLGVGISRYCSKNKIPTVGSFL 1020

Query: 1021 TDTTVTDNEQANFAESLISSVITDSQVSILIRELSSVLSMYLQVYQKAFVATLSSSNDHA 1080
            TDTTVTD+EQANFAESLISSVI DSQV ILIRELSSVLSMYL+VYQKA+VATLSSSNDHA
Sbjct: 1021 TDTTVTDSEQANFAESLISSVIIDSQVPILIRELSSVLSMYLRVYQKAYVATLSSSNDHA 1080

Query: 1081 TEFSPLLLFKHSEFDSCVQNKTLENYGTTSCLLESVFNLMSRLDEIVDKRTLGFLSRFCW 1140
            TEFSPLLLFKHS+FD CVQNKTLENYGTTSCLLESVFNLMSRLDEIVDKRTLGF SR CW
Sbjct: 1081 TEFSPLLLFKHSKFDRCVQNKTLENYGTTSCLLESVFNLMSRLDEIVDKRTLGFSSRVCW 1140

Query: 1141 ESMFHGFPSHLETSSGILLSCVLSIGRIISVLAGLLRIVDVKRNIILETEVTRGILDAVM 1200
            ESMFHGFPSHLETSSGILLSCVLS GRIISVLAGLLRIVDVKR++ILETEVTRGILDAVM
Sbjct: 1141 ESMFHGFPSHLETSSGILLSCVLSTGRIISVLAGLLRIVDVKRSVILETEVTRGILDAVM 1200

Query: 1201 TIKFDKTFESVHGLCEGIYQSLNKELDGCSYGVLFLLKQLEGYLRHMNMRGASDSTIHEL 1260
            T+KFDKTFESVHGLCEGIYQSLN ELDGCSYGVLFLLKQLE YLRH+NMRG SDSTIHEL
Sbjct: 1201 TVKFDKTFESVHGLCEGIYQSLNAELDGCSYGVLFLLKQLEEYLRHINMRGVSDSTIHEL 1260

Query: 1261 VIVKATDIMDNLRKDVSKSSVFQFYLGAEVVLEQVRELYTFQHGNLLVLLDSLDNCCSEL 1320
            VIVKA DIMD+LRKDVSKSSVFQFYLGAE V EQVRELY FQHGNLLVLLDSLDNCCSEL
Sbjct: 1261 VIVKAIDIMDSLRKDVSKSSVFQFYLGAEDVPEQVRELYAFQHGNLLVLLDSLDNCCSEL 1320

Query: 1321 VNLKVLGFFVELLSGEPCPKLKQEVQNKFLSMDLLSLSQWLEKRIFGLVAEDSSGGNVKG 1380
            VNLKVLGFFV+LLSGEPCPKLKQEVQNKFL MDLLSLS+WLEKRIFGLVAEDSSG NVKG
Sbjct: 1321 VNLKVLGFFVDLLSGEPCPKLKQEVQNKFLYMDLLSLSKWLEKRIFGLVAEDSSGVNVKG 1380

Query: 1381 SSISLRESSMNFVFCLISSPSEPLALQLQSHIFEAALVSLDMAFLRFDISVSKSYFHFVV 1440
            SSISLRESSMNFVFCLISSPSEPLA QLQSHIFEAALVSLDMAFLRFDISVSKSYFHFVV
Sbjct: 1381 SSISLRESSMNFVFCLISSPSEPLAHQLQSHIFEAALVSLDMAFLRFDISVSKSYFHFVV 1440

Query: 1441 QLLKGDKSMKLLLERILILMGKLASDERLLPGLKYLFSFLEMILIESGSGKNVFERPSGK 1500
            QLLKGDKSMKLLLERIL+LM KLA+DERLLPGLKYLF+FLEMILIESGSGKNVFER SGK
Sbjct: 1441 QLLKGDKSMKLLLERILVLMEKLANDERLLPGLKYLFNFLEMILIESGSGKNVFERTSGK 1500

Query: 1501 PLSRYAPEVGPLSSKSVGPRKNSETLVLSSNQEEGPASFECDATSAEEDEDDGTSDGEVA 1560
            PLSRYAPEVGPLSSKSVGPRKNSETLVLSSNQEEGPASF+CDATSAEEDEDDGTSDGEVA
Sbjct: 1501 PLSRYAPEVGPLSSKSVGPRKNSETLVLSSNQEEGPASFDCDATSAEEDEDDGTSDGEVA 1560

Query: 1561 SLDKDEEEDTNSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRG 1620
            SLDKDEEED+NSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRG
Sbjct: 1561 SLDKDEEEDSNSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRG 1620

Query: 1621 HRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRKYTGHGSAPVRGASNFQCFLPFSEEGDQL 1680
            HRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRK+TGHGSAPVRGASNFQCFLPFSEEGDQL
Sbjct: 1621 HRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRKFTGHGSAPVRGASNFQCFLPFSEEGDQL 1680

Query: 1681 PESESDLEDDVSVTDTDKCLRPSVPRELLDGVSVLLEELDVEGRMLELCSCLLPTITNQR 1740
            PESESDLEDDVSVTDTDKCLRPSVP ELLDGVSVLLEEL+VEGRMLELCSCLLPTITNQR
Sbjct: 1681 PESESDLEDDVSVTDTDKCLRPSVPMELLDGVSVLLEELNVEGRMLELCSCLLPTITNQR 1740

Query: 1741 DPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSLV 1800
            DPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSL+
Sbjct: 1741 DPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSLM 1800

Query: 1801 KSLLSVSIRGRLAVGEGDKVSIFDIRQLIEQTTVAPMTADKTNVKPLSKNVVRFEIVHLA 1860
            KSLLSVSIRGRLAVGEGDKVSIFD+RQLIEQ TVAPMTADKTNVKPLSKNVVRFEIVHLA
Sbjct: 1801 KSLLSVSIRGRLAVGEGDKVSIFDVRQLIEQATVAPMTADKTNVKPLSKNVVRFEIVHLA 1860

Query: 1861 FNPTVENYLAVAGYEDCQVLTLNHRGEVVDRLAIELALQGAHIKRMEWVPGSQVQLMVVT 1920
            FNPTVENYLAVAGYEDCQVLTLNHRGEVVDRLAIELALQGAHIKRMEWVPGSQVQLMVVT
Sbjct: 1861 FNPTVENYLAVAGYEDCQVLTLNHRGEVVDRLAIELALQGAHIKRMEWVPGSQVQLMVVT 1920

Query: 1921 NRFVKIYDLSLDNISPMHYFTLPDDMVVDATLSTASQGRMFLIVLSENGRIFRLELSVLG 1980
            NRFVKIYDLSLDNISPMHYFTLPDDMVVDATL  ASQG+MFLIVLSENGRIFRLELSVLG
Sbjct: 1921 NRFVKIYDLSLDNISPMHYFTLPDDMVVDATLFIASQGKMFLIVLSENGRIFRLELSVLG 1980

Query: 1981 NVGATPLKEIIEIQGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTEISV 2040
            N+GATPLKEII IQGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTEIS 
Sbjct: 1981 NIGATPLKEIIHIQGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTEISF 2040

Query: 2041 IYEEEQDRKLRPAGLHRWKELFAGSGLFVCFSSVKSNSALAVSMGAHDIYAQNLRHAGGS 2100
            IYEEEQD+KLRPAGLHRWKELFAGSGLFVCFSSVKSNSALAVSMGAH+IYAQNLRHAGGS
Sbjct: 2041 IYEEEQDKKLRPAGLHRWKELFAGSGLFVCFSSVKSNSALAVSMGAHEIYAQNLRHAGGS 2100

Query: 2101 SLPLVGITAYKPLSKDKIHCLVLHDDGSLQIYTHTAVGVDASAYATAEKIKKLGSGILNN 2160
            SLPLVGITAYKPLSK+KIHCLVLHDDGSLQIYTHTAVGVDASA ATAEKIKKLGSGILNN
Sbjct: 2101 SLPLVGITAYKPLSKNKIHCLVLHDDGSLQIYTHTAVGVDASANATAEKIKKLGSGILNN 2160

Query: 2161 KVYASTNPEFPLDFFENTVCITADVRLGGDAIRNGDSEGAKQSLASEDGFLESPSSSGFK 2220
            KVYASTNPEF LDFFE TVCITADVRLGGD IRNGDSEGAKQSLASEDGFLESPSSSGFK
Sbjct: 2161 KVYASTNPEFALDFFEKTVCITADVRLGGDTIRNGDSEGAKQSLASEDGFLESPSSSGFK 2220

Query: 2221 ITVSNSNPDIVMVGFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLLA 2280
            ITVSNSNPDIVMVGFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLLA
Sbjct: 2221 ITVSNSNPDIVMVGFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLLA 2280

Query: 2281 DEEFSVTVGPAFNGTALPRIDSLEVYGRAKDEFGWKEKLDAVLDMEARALGSNSLLARSG 2340
            DEEFSVTVGPAFNGTALPRIDSLEVYGRAKDEFGWKEKLDAVLDMEARA+GSNSLLARSG
Sbjct: 2281 DEEFSVTVGPAFNGTALPRIDSLEVYGRAKDEFGWKEKLDAVLDMEARAIGSNSLLARSG 2340

Query: 2341 KKRRSIQCAPIQQQVLADGLKVLSSYYLLRRSQGCPKLNDVNQELTKLKCKQLLETIYES 2400
            KKRRSIQCAPIQQQVLADGLKV+SSYYLL R QGCPKL+DVNQELTKLKCKQLLETIYES
Sbjct: 2341 KKRRSIQCAPIQQQVLADGLKVMSSYYLLCRPQGCPKLDDVNQELTKLKCKQLLETIYES 2400

Query: 2401 DREPLLQSAACRVLQAIFPKKEIYYQVKDTMRLTGVVKSTSVLSSRLGVGGAAGGWIIEE 2460
            DREPLLQSAACRVLQAIFPKKEIYYQVKDTMRLTGVVKSTSVLS+RLGVGG AGGWIIEE
Sbjct: 2401 DREPLLQSAACRVLQAIFPKKEIYYQVKDTMRLTGVVKSTSVLSTRLGVGGVAGGWIIEE 2460

Query: 2461 FTSQMRAVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVISSV 2520
            FTSQMRAVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVISSV
Sbjct: 2461 FTSQMRAVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVISSV 2520

Query: 2521 ELIYCYAECLALHGPDTGRHSVAPAVVLFKKLLFSSSEAVQASSSLAISSRLLQVPFPKQ 2580
            ELIYCYAECLALHGPDTGRHSVAPAVVLFKKLLFSSSEAVQASSSLAISSRLLQVPFPKQ
Sbjct: 2521 ELIYCYAECLALHGPDTGRHSVAPAVVLFKKLLFSSSEAVQASSSLAISSRLLQVPFPKQ 2580

Query: 2581 TMLATDDGADIPLSAPVPTETTGTNPQVMIEEDAVASSVQYCCDGCSTVPILRRRWHCTI 2640
            TMLATDDGADIPLSAPV TET GTNPQV+IEEDA+ASSVQYCCDGCS VPILRRRWHCTI
Sbjct: 2581 TMLATDDGADIPLSAPVSTETPGTNPQVVIEEDAIASSVQYCCDGCSKVPILRRRWHCTI 2640

Query: 2641 CPDFDLCESCYEVLDADRLPSPHSRDHPMTAIPIEVDSLGDGNEYHFATEDINDSSLTSL 2700
            CPDFDLCESCYEVLDADRLPSPHSRDHPMTAIPIEV+SLGDGNEYHFATEDINDSSLTS+
Sbjct: 2641 CPDFDLCESCYEVLDADRLPSPHSRDHPMTAIPIEVESLGDGNEYHFATEDINDSSLTSV 2700

Query: 2701 IPDISVKNPVSSIHVLEPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWMETT 2760
              DISVKNP SSIHVLEPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWMETT
Sbjct: 2701 RSDISVKNPASSIHVLEPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWMETT 2760

Query: 2761 SGVQAVPVMQLFYRLSSTMGGPFMNSLKSENLNLERLIKWFLDEINLNKPFEAKTRTSFG 2820
            SGVQAVPVMQLFYRLSSTMGGPFMNSLKSENLNLERLIKWFLDEINLNKPFEAKTRTSFG
Sbjct: 2761 SGVQAVPVMQLFYRLSSTMGGPFMNSLKSENLNLERLIKWFLDEINLNKPFEAKTRTSFG 2820

Query: 2821 EVAILVFMFFTLMLRNWHQPGSDGPGAKPSTTTDTHDKNSTQVAPSTSVTAQSSMDDQGK 2880
            EVAILVFMFFTLMLRNWHQPGSDGPGAK ST  DTHDKNSTQVAPSTS+TAQSSMDDQGK
Sbjct: 2821 EVAILVFMFFTLMLRNWHQPGSDGPGAKSSTAADTHDKNSTQVAPSTSLTAQSSMDDQGK 2880

Query: 2881 NDFTSQLLRACSSIRQQSFVNYLMDVLQQLVHVFKSSTIDYDSGHGFHNGSGCGALLTVR 2940
            NDFTSQLLRACSSIRQQSFVNYLMDVLQQLVHVFKSSTIDYDSGHGF+NGSGCGALLTVR
Sbjct: 2881 NDFTSQLLRACSSIRQQSFVNYLMDVLQQLVHVFKSSTIDYDSGHGFNNGSGCGALLTVR 2940

Query: 2941 KDLPAGNFSPFFSDSYAKAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKVYK 3000
            KDLPAGNFSPFFSDSYAKAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKVYK
Sbjct: 2941 KDLPAGNFSPFFSDSYAKAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKVYK 3000

Query: 3001 IYSSKDLKLDAYQDVLCSYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTEVK 3060
            IYSSKDLKLDAYQDVLCSYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTEVK
Sbjct: 3001 IYSSKDLKLDAYQDVLCSYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTEVK 3060

Query: 3061 RLFKYINKVGGFQNPMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRHADVLPFLLNGI 3120
            +LFKY+NKVGGFQNPMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRH DVLPFLLNGI
Sbjct: 3061 KLFKYVNKVGGFQNPMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRHGDVLPFLLNGI 3120

Query: 3121 FYFGEESVVQTLKLLNLAFYTGKDIGHSVQKSEAGDTGTSTNKSGTQTVDSRKKRKGEDG 3180
            FYFGEESV+QTLKLLNLAFYTGKDIGHSVQKSEAGDTGTSTNKSGTQTVD RKK+KGEDG
Sbjct: 3121 FYFGEESVIQTLKLLNLAFYTGKDIGHSVQKSEAGDTGTSTNKSGTQTVDIRKKKKGEDG 3180

Query: 3181 NDSALEKSYLDMEIMVNIFVDKGSNVLSHFIDCFLLEWNSSSVRAETKGVVCGIWHHGKQ 3240
            +DSALEKSYLDME MVNIF+DKGSNVLSHFIDCFLLEWNSSSVRAE KGVVCGIWHHGKQ
Sbjct: 3181 SDSALEKSYLDMETMVNIFIDKGSNVLSHFIDCFLLEWNSSSVRAEAKGVVCGIWHHGKQ 3240

Query: 3241 TFKETLLMALLQKVKTLPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSDVI 3300
            TFKETLLMALLQKVK LPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSDVI
Sbjct: 3241 TFKETLLMALLQKVKNLPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSDVI 3300

Query: 3301 RSIYQTLHSQNELLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLESL 3360
            RSIYQTLHSQNELLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLESL
Sbjct: 3301 RSIYQTLHSQNELLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLESL 3360

Query: 3361 KSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNNWS 3420
            KSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNNWS
Sbjct: 3361 KSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNNWS 3420

Query: 3421 LWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPVTD 3480
            LWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPVTD
Sbjct: 3421 LWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPVTD 3480

Query: 3481 KHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNMEND 3540
            KHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNMEND
Sbjct: 3481 KHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNMEND 3540

Query: 3541 EDMKRGLAAIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSLPG 3600
            EDMKRGL AIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSLPG
Sbjct: 3541 EDMKRGLTAIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSLPG 3600

Query: 3601 PSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASRFV 3660
            PSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASRFV
Sbjct: 3601 PSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASRFV 3660

Query: 3661 ISRSPNNCYGCATTFVTQCLEILQVLSKHQSSKKQLVSLGILSELFENNIHQGPKTARIQ 3720
            ISRSPNNCYGCATTFVTQCLEILQVLSKHQSSKKQLVSLGILSELFENNIHQGPKTARIQ
Sbjct: 3661 ISRSPNNCYGCATTFVTQCLEILQVLSKHQSSKKQLVSLGILSELFENNIHQGPKTARIQ 3720

Query: 3721 ARAVLCSFSEGDVNAVNGLNNLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLADEF 3780
            ARAVLCSFSEGDVNAV+GLNNLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLADEF
Sbjct: 3721 ARAVLCSFSEGDVNAVSGLNNLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLADEF 3780

Query: 3781 WEARLRVVFQLLFSSIKSGAKHPAIAEHIILPCLRIISQACTPPKSDTVDKEQRMGKLTS 3840
            WEARLRVVFQLLFSSIKSGAKHPAIAEHII PCLRIISQACTPPKS+TVDKEQR GKLTS
Sbjct: 3781 WEARLRVVFQLLFSSIKSGAKHPAIAEHIIHPCLRIISQACTPPKSETVDKEQRTGKLTS 3840

Query: 3841 VSQNKDENATNISGSFSGPVSGNKSAPESLEHNWDSSHRTQDIQLLSYAEWEKGASYLDF 3900
            VSQNKDEN TNISGSFSGPVSGNKSAPESLEHNWDSSHRTQDIQLLSYAEWEKGASYLDF
Sbjct: 3841 VSQNKDENTTNISGSFSGPVSGNKSAPESLEHNWDSSHRTQDIQLLSYAEWEKGASYLDF 3900

Query: 3901 VRRQYKVSQVCKGTVQRSRTQKGDYLSLKYALKWKRFVCRNAKSDLSAFELGSWVTELVL 3960
            VRRQYKVSQV KGTVQRSRTQKGDYLSLKYALKWKRFVCRNAKSDLSAFELGSWVTELVL
Sbjct: 3901 VRRQYKVSQVFKGTVQRSRTQKGDYLSLKYALKWKRFVCRNAKSDLSAFELGSWVTELVL 3960

Query: 3961 CACSQSIRSEMCMLISLLCAQSSSRRFRLLDLLVSLLPATLSAGESAAEYFDLLFKMVDS 4020
            CACSQSIRSEMCMLISLLC+QSSSRRFRLLDLLVSLLPATLSAGESAAEYF+LLFKMVDS
Sbjct: 3961 CACSQSIRSEMCMLISLLCSQSSSRRFRLLDLLVSLLPATLSAGESAAEYFELLFKMVDS 4020

Query: 4021 EDARLFLTVRGCLRTICQLISQEVGNVESLERSLHIDISQGFILHKLIELLGKFLEIPNI 4080
            EDARLFLTVRGCLRTICQLISQEV NVESLERSLHIDISQGFILHKLIELLGKFLEIPNI
Sbjct: 4021 EDARLFLTVRGCLRTICQLISQEVSNVESLERSLHIDISQGFILHKLIELLGKFLEIPNI 4080

Query: 4081 RSRYNFFLKHLFFRLLLFKLIQCDLIVRFMRDNLLSEVLEALIVIRGLVVQKTKLISDCN 4140
            RS                         RFMRDNLLSEVLEALIVIRGLVVQKTKLISDCN
Sbjct: 4081 RS-------------------------RFMRDNLLSEVLEALIVIRGLVVQKTKLISDCN 4140

Query: 4141 RLLKDLLDSLLLESNENKRQFIRACICGLQIHGEERKGRTCLFILEQLCNLISPSKPEPV 4200
            RLLKDLLDSLLLESNENKRQFIRACICGLQ HGEERKGRTCLFILEQLCNLISPSKPEPV
Sbjct: 4141 RLLKDLLDSLLLESNENKRQFIRACICGLQNHGEERKGRTCLFILEQLCNLISPSKPEPV 4200

Query: 4201 YLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKICHQLDLLGFLEDDYGMELLV 4260
            YLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKICHQLDLL FLEDDYGMELLV
Sbjct: 4201 YLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKICHQLDLLSFLEDDYGMELLV 4260

Query: 4261 AGNIISLDLSIALVYEQVWKKSNQSSNAISNTALISTTAARDSPPMTVTYRLQGLDGEAT 4320
            AGNIISLDLSIALVYEQVWKKSNQSSNAISNTA+ISTTAARDSPPMTVTYRLQGLDGEAT
Sbjct: 4261 AGNIISLDLSIALVYEQVWKKSNQSSNAISNTAIISTTAARDSPPMTVTYRLQGLDGEAT 4320

Query: 4321 EPMIKELEEDREESQDPELEFAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVLNL 4380
            EPMIKELEEDREESQDPELEFAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVLNL
Sbjct: 4321 EPMIKELEEDREESQDPELEFAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVLNL 4380

Query: 4381 LMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESESIS 4440
            LMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESESIS
Sbjct: 4381 LMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESESIS 4440

Query: 4441 IGQSALTVTSEQTGTGEQAKKIVLMFLERLSHPFGSKKSNKQQRNTEMVARILPYLTYGE 4500
            IGQSALTVTSEQTGTGEQAKKIVLMFLERLSHPFG KKSNKQQRNTEMVARILPYLTYGE
Sbjct: 4441 IGQSALTVTSEQTGTGEQAKKIVLMFLERLSHPFGFKKSNKQQRNTEMVARILPYLTYGE 4500

Query: 4501 PAAMDALIQHFTPYLNDWDEFDRLQKQHEDNPEDKSISEQAAKQRFTVENFVRVSESLKT 4560
            PAAMDALIQHFTPYLNDWDEFDRLQKQHEDNPEDKS+SEQAAKQRFTVENFVRVSESLKT
Sbjct: 4501 PAAMDALIQHFTPYLNDWDEFDRLQKQHEDNPEDKSLSEQAAKQRFTVENFVRVSESLKT 4560

Query: 4561 SSCGERLKDIILEKGITGLAIKHLRDSFAVAGQTGFRSSVEWAFALKRPSIPLILSMLRG 4620
            SSCGERLKDIILEKGITGLAIKHLRD+FAVAGQTGFRSSVEW FALKRPSIPLILSMLRG
Sbjct: 4561 SSCGERLKDIILEKGITGLAIKHLRDTFAVAGQTGFRSSVEWGFALKRPSIPLILSMLRG 4620

Query: 4621 LSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDKVR 4680
            LSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDKVR
Sbjct: 4621 LSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDKVR 4680

Query: 4681 MLRHATRDEMRRLALKNREDMLQGLGMRQVASDGGERIIVSRPALEGLEDVEEEEDGLAC 4740
            MLRHATRDEMRRLALKNREDMLQ LGMRQVASDGGERIIVSRPALEGLEDVEEEEDGLAC
Sbjct: 4681 MLRHATRDEMRRLALKNREDMLQRLGMRQVASDGGERIIVSRPALEGLEDVEEEEDGLAC 4740

Query: 4741 MVCREGYSLRPTDLLGVYSYSKRVNLGVGTSGSTRGECVYTTVSYFNIIHYQCHQEAKRT 4800
            MVCREGYSLRPTDLLGVYSYSKRVNLGVGTSGS+RGECVYTTVSYFNIIHYQCHQEAKRT
Sbjct: 4741 MVCREGYSLRPTDLLGVYSYSKRVNLGVGTSGSSRGECVYTTVSYFNIIHYQCHQEAKRT 4800

Query: 4801 DAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADGNR 4860
            DAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADGNR
Sbjct: 4801 DAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADGNR 4846

Query: 4861 LRLLTYDIVLV 4872
            LRLLTYDIVL+
Sbjct: 4861 LRLLTYDIVLM 4846

BLAST of HG10018299 vs. NCBI nr
Match: KAA0039419.1 (auxin transport protein BIG [Cucumis melo var. makuwa])

HSP 1 Score: 9167.4 bits (23787), Expect = 0.0e+00
Identity = 4659/4871 (95.65%), Postives = 4734/4871 (97.19%), Query Frame = 0

Query: 1    MAEQSFVKLLDTIFLDDSSTSANTRKHFSSSDLLQLLRSDDSSIKLGLRQFYSILKAGLR 60
            MAE+SFVKLLD IFLDDSST+ NT+K FSSSDLLQLLRSDDS +KLGLRQFYSIL+ GLR
Sbjct: 1    MAEESFVKLLDAIFLDDSSTTVNTKKPFSSSDLLQLLRSDDSFVKLGLRQFYSILEVGLR 60

Query: 61   DLGDGNFAFQSWTDPQIQAVCSIAHAIASASRSLTVDQAEAIVVAVIKKSLELVFCYLEK 120
            DLGDGNF+FQSWTDPQIQAVCSIA+AIASASRSLTVDQAEAIVVAVIKKSLE VFCYLEK
Sbjct: 61   DLGDGNFSFQSWTDPQIQAVCSIAYAIASASRSLTVDQAEAIVVAVIKKSLEFVFCYLEK 120

Query: 121  SEFKCDDFSIQNNMLMILETILVDGMDKVSDFAQLCAKKSLMDLLKSTGGDCDATIEFDN 180
            SEFKCDDFSIQNNMLMILETILVDGMDKVSD AQ CAKK L+DLLKS GGD DATIEF+N
Sbjct: 121  SEFKCDDFSIQNNMLMILETILVDGMDKVSDCAQHCAKKDLIDLLKSFGGDFDATIEFNN 180

Query: 181  TIECGSTGVCCSREEKQVGRLLMTIAAECVQADQLTSESGFSQPTFLEDMNKLIFLCQHW 240
            T ECG TGVCCSREEKQVGRLLMTIAAEC QAD LTSE GFS+PTF E+MNKLIFLCQHW
Sbjct: 181  TAECGFTGVCCSREEKQVGRLLMTIAAECEQADHLTSEPGFSEPTFFENMNKLIFLCQHW 240

Query: 241  AVTHLACIQHLILICKELVVLPDALDEKTGSTSFRKRLSCSLRILKLLTDLSKKFPYIEY 300
            AVTHLACIQHLILICKELVVLPDALDEKTGSTSFRKRLSCSLRILKLLTDLSKKFPYIEY
Sbjct: 241  AVTHLACIQHLILICKELVVLPDALDEKTGSTSFRKRLSCSLRILKLLTDLSKKFPYIEY 300

Query: 301  DAKMMQAFALFANSLPCLFGLCFEFANSHATVEGSFENTILLLLEEFLELVQVVFRNSYV 360
            D K+MQAFAL ANSLPCLFGLCFEFANSHAT E SFENTILLLLEEFLELVQVVFRNSY+
Sbjct: 301  DDKLMQAFALLANSLPCLFGLCFEFANSHATGESSFENTILLLLEEFLELVQVVFRNSYI 360

Query: 361  SVNIQTCVVASILDNLSSSVWRYDASTANLKTPLVYFPRSVMVIIKLIQDLKGHKYHAFS 420
             VNIQTC+VASILDNLSSSVWRYDASTANLK PLVYFPR VMVIIKLIQDLKGHKYHAFS
Sbjct: 361  CVNIQTCIVASILDNLSSSVWRYDASTANLKPPLVYFPRGVMVIIKLIQDLKGHKYHAFS 420

Query: 421  FKDLETHHTSTLADLSVDIPKCYARLEIVPLHKNYKVEEILRMIFPLSKQWMDDLMHLLF 480
            FKDLE H  STLA+LSVD+PKC+A LE VPLHKNY VEEILRMIFP S+QWMDDLMHLLF
Sbjct: 421  FKDLEMHQMSTLAELSVDLPKCHAPLETVPLHKNYTVEEILRMIFPPSRQWMDDLMHLLF 480

Query: 481  FLYSEGVRLRPKIERSLSSMKSSSTVEQETAVCHEDEALFGDLFSESGRSVGSVDGYDLQ 540
            FLYSEG+RLRPKIERSLSSMKSSSTVEQE AVCHEDEALFGDLFSESGRSVGSVDGYDLQ
Sbjct: 481  FLYSEGMRLRPKIERSLSSMKSSSTVEQEAAVCHEDEALFGDLFSESGRSVGSVDGYDLQ 540

Query: 541  HLAVNSTSSFCNLLLQAAKELLSFIKLCIFSPEWNASVFDDGCNKLNQNHIDILLSLLNC 600
            HLAVNSTSSFCNLLL AAKELLSFIKLCIFSPEWNASVFDDGCNKLNQNHIDILLSLLNC
Sbjct: 541  HLAVNSTSSFCNLLLLAAKELLSFIKLCIFSPEWNASVFDDGCNKLNQNHIDILLSLLNC 600

Query: 601  EGCCSDDKSSASCLPAHDERKSGHIHEICYRLLHGLLTRHALPDSLEEYLVKKILNAENG 660
            EGCCSDDKSSASCLPAHDERKSGHIHEICYRLLHGLLTRHALPDSLEEYLVKKILNAENG
Sbjct: 601  EGCCSDDKSSASCLPAHDERKSGHIHEICYRLLHGLLTRHALPDSLEEYLVKKILNAENG 660

Query: 661  NFVYNDQTLSLLAHTLFRRTGVAGTLLRTQIYRQFVEFIIEKSKTISSNYSSLQEFMGTL 720
            N VYNDQTLSLLAHTLFRRTGVAGT LRTQIYRQFVEFIIEKSKTIS  YSSLQEFMGTL
Sbjct: 661  NSVYNDQTLSLLAHTLFRRTGVAGTQLRTQIYRQFVEFIIEKSKTISLKYSSLQEFMGTL 720

Query: 721  PSVFHIEILLVAFHLSSEGEKREISSLIFSSIRAIDAPSTFSNCTELSMWGLLVSRLIIV 780
            PSVFHIEILLVAFHLSSEGEKREISSLIFSSIRAIDAPSTFSNCTELSMWGLLVSRLI+V
Sbjct: 721  PSVFHIEILLVAFHLSSEGEKREISSLIFSSIRAIDAPSTFSNCTELSMWGLLVSRLIVV 780

Query: 781  LRHIIFHPHTCSSSLLFDFRSKLRDAPAFSSSLPYTLNDHLSSWGASVAKNIIGSSVESK 840
            LRHIIFHPHTCSSSLLFDFRSKLRDAPAFSS LPYT+NDHLSSWGASVAK+IIGSS+ESK
Sbjct: 781  LRHIIFHPHTCSSSLLFDFRSKLRDAPAFSSHLPYTVNDHLSSWGASVAKSIIGSSMESK 840

Query: 841  PFFHSLINQLIDISSFPASLRQHDLTVECPWFNAGDIFSTFSWILGFWNGKQAVTVEDLI 900
            PF +SLINQLIDISSFPASLRQHDLT+ECPWFN  DIFSTFSWILGFWNGKQAVTVEDLI
Sbjct: 841  PFLNSLINQLIDISSFPASLRQHDLTIECPWFNPSDIFSTFSWILGFWNGKQAVTVEDLI 900

Query: 901  IERYIFVLCWDFPSMNALSHGGPLWSDPDTLDISNTTCFFYFSYLLLDHGGVIGEHMKFP 960
            IERYIFVLCWDFPS NALSHGGPLWSD D LDIS T CFFYFSYLLLDHGGVI EHMKFP
Sbjct: 901  IERYIFVLCWDFPSTNALSHGGPLWSDLDALDISKTACFFYFSYLLLDHGGVIDEHMKFP 960

Query: 961  QVVIGLLQRLHGGSILEDFKALGWNFLRNGAWLSLVLSFLSVGIWRYCSKNMIPTVGSLL 1020
            QVVIGLL+RLHGGS+LEDFKALGWNFLRNG WLSL+LSFL VGI RYCSKN IPTVGS L
Sbjct: 961  QVVIGLLRRLHGGSVLEDFKALGWNFLRNGTWLSLILSFLGVGISRYCSKNKIPTVGSFL 1020

Query: 1021 TDTTVTDNEQANFAESLISSVITDSQVSILIRELSSVLSMYLQVYQKAFVATLSSSNDHA 1080
            TDTTVTD+EQANFAESLISSVI DSQV ILIRELSSVLSMYL+VYQKA+VATLSSSNDHA
Sbjct: 1021 TDTTVTDSEQANFAESLISSVIIDSQVPILIRELSSVLSMYLRVYQKAYVATLSSSNDHA 1080

Query: 1081 TEFSPLLLFKHSEFDSCVQNKTLENYGTTSCLLESVFNLMSRLDEIVDKRTLGFLSRFCW 1140
            TEFSPLLLFKHS+FD CVQNKTLENYGTTSCLLESVFNLMSRLDEIVDKRTLGF SR CW
Sbjct: 1081 TEFSPLLLFKHSKFDRCVQNKTLENYGTTSCLLESVFNLMSRLDEIVDKRTLGFSSRVCW 1140

Query: 1141 ESMFHGFPSHLETSSGILLSCVLSIGRIISVLAGLLRIVDVKRNIILETEVTRGILDAVM 1200
            ESMFHGFPSHLETSSGILLSCVLSIGRIISVLAGLLRIVDVKR++ILETEVTRGILDAVM
Sbjct: 1141 ESMFHGFPSHLETSSGILLSCVLSIGRIISVLAGLLRIVDVKRSVILETEVTRGILDAVM 1200

Query: 1201 TIKFDKTFESVHGLCEGIYQSLNKELDGCSYGVLFLLKQLEGYLRHMNMRGASDSTIHEL 1260
            T+KFDKTFESVHGLCEGIYQSLN ELDGCSYGVLFLLKQLE YLRH+NMRG SDSTIHEL
Sbjct: 1201 TVKFDKTFESVHGLCEGIYQSLNAELDGCSYGVLFLLKQLEEYLRHINMRGVSDSTIHEL 1260

Query: 1261 VIVKATDIMDNLRKDVSKSSVFQFYLGAEVVLEQVRELYTFQHGNLLVLLDSLDNCCSEL 1320
            VIVKA DIMD+LRKDVSKSSVFQFYLGAE V EQVRELY FQHGNLLVLLDSLDNCCSEL
Sbjct: 1261 VIVKAIDIMDSLRKDVSKSSVFQFYLGAEDVPEQVRELYAFQHGNLLVLLDSLDNCCSEL 1320

Query: 1321 VNLKVLGFFVELLSGEPCPKLKQEVQNKFLSMDLLSLSQWLEKRIFGLVAEDSSGGNVKG 1380
            VNLKVLGFFV+LLSGEPCPKLKQEVQNKFL MDLLSLS+WLEKRIFGLVAEDSSG NVKG
Sbjct: 1321 VNLKVLGFFVDLLSGEPCPKLKQEVQNKFLCMDLLSLSKWLEKRIFGLVAEDSSGVNVKG 1380

Query: 1381 SSISLRESSMNFVFCLISSPSEPLALQLQSHIFEAALVSLDMAFLRFDISVSKSYFHFVV 1440
            SSISLRESSMNFVFCLISSPSEPLA QLQSHIFEAALVSLDMAFLRFDISVSKSYFHFVV
Sbjct: 1381 SSISLRESSMNFVFCLISSPSEPLAHQLQSHIFEAALVSLDMAFLRFDISVSKSYFHFVV 1440

Query: 1441 QLLKGDKSMKLLLERILILMGKLASDERLLPGLKYLFSFLEMILIESGSGKNVFERPSGK 1500
            QLLKGDKSMKLLLERILILM KLA DERLLPGLKYLF+FLEMILIESGSGKNVFER SGK
Sbjct: 1441 QLLKGDKSMKLLLERILILMEKLAKDERLLPGLKYLFNFLEMILIESGSGKNVFERTSGK 1500

Query: 1501 PLSRYAPEVGPLSSKSVGPRKNSETLVLSSNQEEGPASFECDATSAEEDEDDGTSDGEVA 1560
            PLSRYAPEVGPLSSK VGPRKNSETLVLSSNQEEGPASF+CDATSAEEDEDDGTSDGEVA
Sbjct: 1501 PLSRYAPEVGPLSSKLVGPRKNSETLVLSSNQEEGPASFDCDATSAEEDEDDGTSDGEVA 1560

Query: 1561 SLDKDEEEDTNSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRG 1620
            SLDKDEEED+NSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRG
Sbjct: 1561 SLDKDEEEDSNSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRG 1620

Query: 1621 HRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRKYTGHGSAPVRGASNFQCFLPFSEEGDQL 1680
            HRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRK+TGHGSAPVRGASNFQCFLPFSEEGDQL
Sbjct: 1621 HRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRKFTGHGSAPVRGASNFQCFLPFSEEGDQL 1680

Query: 1681 PESESDLEDDVSVTDTDKCLRPSVPRELLDGVSVLLEELDVEGRMLELCSCLLPTITNQR 1740
            PESESDLEDDVSV DTDKCLRPSVP ELLDGVSVLLEEL+VE RMLELCSCLLPTITNQR
Sbjct: 1681 PESESDLEDDVSVADTDKCLRPSVPMELLDGVSVLLEELNVERRMLELCSCLLPTITNQR 1740

Query: 1741 DPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSLV 1800
            DPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSL+
Sbjct: 1741 DPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSLM 1800

Query: 1801 KSLLSVSIRGRLAVGEGDKVSIFDIRQLIEQTTVAPMTADKTNVKPLSKNVVRFEIVHLA 1860
            KSLLSVSIRGRLAVGEGDKVSIFD+RQLIEQ TVAPMTADKTNVKPLSKNVVRFEIVHLA
Sbjct: 1801 KSLLSVSIRGRLAVGEGDKVSIFDVRQLIEQATVAPMTADKTNVKPLSKNVVRFEIVHLA 1860

Query: 1861 FNPTVENYLAVAGYEDCQVLTLNHRGEVVDRLAIELALQGAHIKRMEWVPGSQVQLMVVT 1920
            FNPTVENYLAVAGYEDCQVLTLNHRGEVVDRLAIELALQGAHIKRMEWVPGSQVQLMVVT
Sbjct: 1861 FNPTVENYLAVAGYEDCQVLTLNHRGEVVDRLAIELALQGAHIKRMEWVPGSQVQLMVVT 1920

Query: 1921 NRFVKIYDLSLDNISPMHYFTLPDDMVVDATLSTASQGRMFLIVLSENGRIFRLELSVLG 1980
            NRFVKIYDLSLDNISPMHYFTLPDDMVVDATL  ASQG+MFLIVLSENGRIFRLELSVLG
Sbjct: 1921 NRFVKIYDLSLDNISPMHYFTLPDDMVVDATLFIASQGKMFLIVLSENGRIFRLELSVLG 1980

Query: 1981 NVGATPLKEIIEIQGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTEISV 2040
            N+GATPLKEII IQGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTEIS 
Sbjct: 1981 NIGATPLKEIIHIQGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTEISF 2040

Query: 2041 IYEEEQDRKLRPAGLHRWKELFAGSGLFVCFSSVKSNSALAVSMGAHDIYAQNLRHAGGS 2100
            IYEEEQD+KLRPAGLHRWKELFAGSGLFVCFSS KSNSALAVSMGAH+IYAQNLRHAGGS
Sbjct: 2041 IYEEEQDKKLRPAGLHRWKELFAGSGLFVCFSSFKSNSALAVSMGAHEIYAQNLRHAGGS 2100

Query: 2101 SLPLVGITAYKPLSKDKIHCLVLHDDGSLQIYTHTAVGVDASAYATAEKIKKLGSGILNN 2160
            SLPLVGITAYKPLSKDKIHCLVLHDDGSLQIYTHTAVGVDASA ATAEKIKKLGSGILNN
Sbjct: 2101 SLPLVGITAYKPLSKDKIHCLVLHDDGSLQIYTHTAVGVDASANATAEKIKKLGSGILNN 2160

Query: 2161 KVYASTNPEFPLDFFENTVCITADVRLGGDAIRNGDSEGAKQSLASEDGFLESPSSSGFK 2220
            KVYASTNPEF LDFFE TVCITADVRLGGD IRNGDSEGAKQSLASEDGFLESPSSSGFK
Sbjct: 2161 KVYASTNPEFALDFFEKTVCITADVRLGGDTIRNGDSEGAKQSLASEDGFLESPSSSGFK 2220

Query: 2221 ITVSNSNPDIVMVGFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLLA 2280
            ITVSNSNPDIVMVGFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLLA
Sbjct: 2221 ITVSNSNPDIVMVGFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLLA 2280

Query: 2281 DEEFSVTVGPAFNGTALPRIDSLEVYGRAKDEFGWKEKLDAVLDMEARALGSNSLLARSG 2340
            DEEFSVTVGPAFNGTALPRIDSLEVYGRAKDEFGWKEKLDAVLDMEARA+GSNSLLARSG
Sbjct: 2281 DEEFSVTVGPAFNGTALPRIDSLEVYGRAKDEFGWKEKLDAVLDMEARAIGSNSLLARSG 2340

Query: 2341 KKRRSIQCAPIQQQVLADGLKVLSSYYLLRRSQGCPKLNDVNQELTKLKCKQLLETIYES 2400
            KKRRSIQCAPIQQQVLADGLKV+SSYYLL R QGCPKL+DVNQELTKLKCKQLLETIYES
Sbjct: 2341 KKRRSIQCAPIQQQVLADGLKVMSSYYLLCRPQGCPKLDDVNQELTKLKCKQLLETIYES 2400

Query: 2401 DREPLLQSAACRVLQAIFPKKEIYYQVKDTMRLTGVVKSTSVLSSRLGVGGAAGGWIIEE 2460
            DREPLLQSAACRVLQAIFPKKEIYYQVKDTMRLTGVVKSTSVLS+RLGVGG AGGWIIEE
Sbjct: 2401 DREPLLQSAACRVLQAIFPKKEIYYQVKDTMRLTGVVKSTSVLSTRLGVGGVAGGWIIEE 2460

Query: 2461 FTSQMRAVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVISSV 2520
            FTSQMRAVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVISSV
Sbjct: 2461 FTSQMRAVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVISSV 2520

Query: 2521 ELIYCYAECLALHGPDTGRHSVAPAVVLFKKLLFSSSEAVQASSSLAISSRLLQVPFPKQ 2580
            ELIYCYAECLALHGPDTGRHSVAPAVVLFKKLLFSSSEAVQASSSLAISSRLLQVPFPKQ
Sbjct: 2521 ELIYCYAECLALHGPDTGRHSVAPAVVLFKKLLFSSSEAVQASSSLAISSRLLQVPFPKQ 2580

Query: 2581 TMLATDDGADIPLSAPVPTETTGTNPQVMIEEDAVASSVQYCCDGCSTVPILRRRWHCTI 2640
            TMLATDDGADIPLSAPV TET GTNPQV+IEEDA+ASSVQYCCDGCS VPILRRRWHCTI
Sbjct: 2581 TMLATDDGADIPLSAPVSTETPGTNPQVVIEEDAIASSVQYCCDGCSKVPILRRRWHCTI 2640

Query: 2641 CPDFDLCESCYEVLDADRLPSPHSRDHPMTAIPIEVDSLGDGNEYHFATEDINDSSLTSL 2700
            CPDFDLCESCYEVLDADRLPSPHSRDHPMTAIPIEV+SLGDGNEYHFATEDINDSSLTS+
Sbjct: 2641 CPDFDLCESCYEVLDADRLPSPHSRDHPMTAIPIEVESLGDGNEYHFATEDINDSSLTSV 2700

Query: 2701 IPDISVKNPVSSIHVLEPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWMETT 2760
              DISVKNP SSIHVLEPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWMETT
Sbjct: 2701 RSDISVKNPASSIHVLEPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWMETT 2760

Query: 2761 SGVQAVPVMQLFYRLSSTMGGPFMNSLKSENLNLERLIKWFLDEINLNKPFEAKTRTSFG 2820
            SGVQAVPVMQLFYRLSSTMGGPFMNSLKSENLNLERLIKWFLDEINLNKPFEAKTRTSFG
Sbjct: 2761 SGVQAVPVMQLFYRLSSTMGGPFMNSLKSENLNLERLIKWFLDEINLNKPFEAKTRTSFG 2820

Query: 2821 EVAILVFMFFTLMLRNWHQPGSDGPGAKPSTTTDTHDKNSTQVAPSTSVTAQSSMDDQGK 2880
            EVAILVFMFFTLMLRNWHQPGSDGPGAK STT DTHDKNSTQVAPSTS+TAQSSMDDQGK
Sbjct: 2821 EVAILVFMFFTLMLRNWHQPGSDGPGAKSSTTADTHDKNSTQVAPSTSLTAQSSMDDQGK 2880

Query: 2881 NDFTSQLLRACSSIRQQSFVNYLMDVLQQLVHVFKSSTIDYDSGHGFHNGSGCGALLTVR 2940
            NDFTSQLLRACSSIRQQSFVNYLMDVLQQLVHVFKSSTIDYDSGHGF+NGSGCGALLTVR
Sbjct: 2881 NDFTSQLLRACSSIRQQSFVNYLMDVLQQLVHVFKSSTIDYDSGHGFNNGSGCGALLTVR 2940

Query: 2941 KDLPAGNFSPFFSDSYAKAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKVYK 3000
            KDLPAGNFSPFFSDSYAKAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKVYK
Sbjct: 2941 KDLPAGNFSPFFSDSYAKAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKVYK 3000

Query: 3001 IYSSKDLKLDAYQDVLCSYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTEVK 3060
            IYSSKDLKLDAYQDVLCSYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTEVK
Sbjct: 3001 IYSSKDLKLDAYQDVLCSYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTEVK 3060

Query: 3061 RLFKYINKVGGFQNPMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRHADVLPFLLNGI 3120
            +LFKY+NKVGGFQNPMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRH DVLPFLLNGI
Sbjct: 3061 KLFKYVNKVGGFQNPMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRHGDVLPFLLNGI 3120

Query: 3121 FYFGEESVVQTLKLLNLAFYTGKDIGHSVQKSEAGDTGTSTNKSGTQTVDSRKKRKGEDG 3180
            FYFGEESV+QTLKLLNLAFYTGKDIGHSVQKSEAGDTGTSTNKSGTQ VD RKK+KGEDG
Sbjct: 3121 FYFGEESVIQTLKLLNLAFYTGKDIGHSVQKSEAGDTGTSTNKSGTQPVDIRKKKKGEDG 3180

Query: 3181 NDSALEKSYLDMEIMVNIFVDKGSNVLSHFIDCFLLEWNSSSVRAETKGVVCGIWHHGKQ 3240
            +DSALEKSYLDME MVNIF+DKGSNVLSHFIDCFLLEWNSSSVRAE KGVVCGIWHHGKQ
Sbjct: 3181 SDSALEKSYLDMETMVNIFIDKGSNVLSHFIDCFLLEWNSSSVRAEAKGVVCGIWHHGKQ 3240

Query: 3241 TFKETLLMALLQKVKTLPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSDVI 3300
            TFKETLLMALLQKVK LPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSDVI
Sbjct: 3241 TFKETLLMALLQKVKNLPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSDVI 3300

Query: 3301 RSIYQTLHSQNELLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLESL 3360
            RSIYQTLHSQNELLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLESL
Sbjct: 3301 RSIYQTLHSQNELLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLESL 3360

Query: 3361 KSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNNWS 3420
            KSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNNWS
Sbjct: 3361 KSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNNWS 3420

Query: 3421 LWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPVTD 3480
            LWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPVTD
Sbjct: 3421 LWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPVTD 3480

Query: 3481 KHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNMEND 3540
            KHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNMEND
Sbjct: 3481 KHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNMEND 3540

Query: 3541 EDMKRGLAAIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSLPG 3600
            EDMKRGL AIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSLPG
Sbjct: 3541 EDMKRGLTAIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSLPG 3600

Query: 3601 PSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASRFV 3660
            PSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASRFV
Sbjct: 3601 PSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASRFV 3660

Query: 3661 ISRSPNNCYGCATTFVTQCLEILQVLSKHQSSKKQLVSLGILSELFENNIHQGPKTARIQ 3720
            ISRSPNNCYGCATTFVTQCLEILQVLSKHQSSKKQLVSLGILSELFENNIHQGPKTARIQ
Sbjct: 3661 ISRSPNNCYGCATTFVTQCLEILQVLSKHQSSKKQLVSLGILSELFENNIHQGPKTARIQ 3720

Query: 3721 ARAVLCSFSEGDVNAVNGLNNLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLADEF 3780
            ARAVLCSFSEGDVNAV+GLNNLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLADEF
Sbjct: 3721 ARAVLCSFSEGDVNAVSGLNNLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLADEF 3780

Query: 3781 WEARLRVVFQLLFSSIKSGAKHPAIAEHIILPCLRIISQACTPPKSDTVDKEQRMGKLTS 3840
            WEARLRVVFQLLFSSIKSGAKHPAIAEHII PCLRIISQACTPPKS+TVDKEQR GKLTS
Sbjct: 3781 WEARLRVVFQLLFSSIKSGAKHPAIAEHIIHPCLRIISQACTPPKSETVDKEQRTGKLTS 3840

Query: 3841 VSQNKDENATNISGSFSGPVSGNKSAPESLEHNWDSSHRTQDIQLLSYAEWEKGASYLDF 3900
            VSQNKDEN TNISGSFSGPVSGNKSAPESLEHNWDSSHRTQDIQLLSYAEWEKGASYLDF
Sbjct: 3841 VSQNKDENTTNISGSFSGPVSGNKSAPESLEHNWDSSHRTQDIQLLSYAEWEKGASYLDF 3900

Query: 3901 VRRQYKVSQVCKGTVQRSRTQKGDYLSLKYALKWKRFVCRNAKSDLSAFELGSWVTELVL 3960
            VRRQYKVSQV KGTVQRSRTQKGDYLSLKYALKWKRFVCRNAKSDLSAFELGSWVTELVL
Sbjct: 3901 VRRQYKVSQVFKGTVQRSRTQKGDYLSLKYALKWKRFVCRNAKSDLSAFELGSWVTELVL 3960

Query: 3961 CACSQSIRSEMCMLISLLCAQSSSRRFRLLDLLVSLLPATLSAGESAAEYFDLLFKMVDS 4020
            CACSQSIRSEMCMLISLLC+QSSSRRFRLLDLLVSLLPATLSAGESAAEYF+LLFKMVDS
Sbjct: 3961 CACSQSIRSEMCMLISLLCSQSSSRRFRLLDLLVSLLPATLSAGESAAEYFELLFKMVDS 4020

Query: 4021 EDARLFLTVRGCLRTICQLISQEVGNVESLERSLHIDISQGFILHKLIELLGKFLEIPNI 4080
            EDARLFLTVRGCLRTICQLISQEV NVESLERSLHIDISQGFILHKLIELLGKFLEIPNI
Sbjct: 4021 EDARLFLTVRGCLRTICQLISQEVSNVESLERSLHIDISQGFILHKLIELLGKFLEIPNI 4080

Query: 4081 RSRYNFFLKHLFFRLLLFKLIQCDLIVRFMRDNLLSEVLEALIVIRGLVVQKTKLISDCN 4140
            RS                         RFMRDNLLSEVLEALIVIRGLVVQKTKLISDCN
Sbjct: 4081 RS-------------------------RFMRDNLLSEVLEALIVIRGLVVQKTKLISDCN 4140

Query: 4141 RLLKDLLDSLLLESNENKRQFIRACICGLQIHGEERKGRTCLFILEQLCNLISPSKPEPV 4200
            RLLKDLLDSLLLESNENKRQFIRACICGLQ HGEERKGRTCLFILEQLCNLISPSKPEPV
Sbjct: 4141 RLLKDLLDSLLLESNENKRQFIRACICGLQNHGEERKGRTCLFILEQLCNLISPSKPEPV 4200

Query: 4201 YLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKICHQLDLLGFLEDDYGMELLV 4260
            YLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKICHQLDLL FLEDDYGMELLV
Sbjct: 4201 YLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKICHQLDLLSFLEDDYGMELLV 4260

Query: 4261 AGNIISLDLSIALVYEQVWKKSNQSSNAISNTALISTTAARDSPPMTVTYRLQGLDGEAT 4320
            AGNIISLDLSIALVYEQVWKKSNQSSNAISNTA+ISTTAARDSPPMTVTYRLQGLDGEAT
Sbjct: 4261 AGNIISLDLSIALVYEQVWKKSNQSSNAISNTAIISTTAARDSPPMTVTYRLQGLDGEAT 4320

Query: 4321 EPMIKELEEDREESQDPELEFAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVLNL 4380
            EPMIKELEEDREESQDPELEFAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVLNL
Sbjct: 4321 EPMIKELEEDREESQDPELEFAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVLNL 4380

Query: 4381 LMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESESIS 4440
            LMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESESIS
Sbjct: 4381 LMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESESIS 4440

Query: 4441 IGQSALTVTSEQTGTGEQAKKIVLMFLERLSHPFGSKKSNKQQRNTEMVARILPYLTYGE 4500
            IGQSALTVTSEQTGTGEQAKKIVLMFLERLSHPFG KKSNKQQRNTEMVARILPYLTYGE
Sbjct: 4441 IGQSALTVTSEQTGTGEQAKKIVLMFLERLSHPFGFKKSNKQQRNTEMVARILPYLTYGE 4500

Query: 4501 PAAMDALIQHFTPYLNDWDEFDRLQKQHEDNPEDKSISEQAAKQRFTVENFVRVSESLKT 4560
            PAAMDALIQHFTPYLNDWDEFDRLQKQHEDNPEDKS+SEQAAKQRFTVENFVRVSESLKT
Sbjct: 4501 PAAMDALIQHFTPYLNDWDEFDRLQKQHEDNPEDKSLSEQAAKQRFTVENFVRVSESLKT 4560

Query: 4561 SSCGERLKDIILEKGITGLAIKHLRDSFAVAGQTGFRSSVEWAFALKRPSIPLILSMLRG 4620
            SSCGERLKDIILEKGITGLAIKHLRD+FAVAGQTGFRSSVEW FALKRPSIPLILSMLRG
Sbjct: 4561 SSCGERLKDIILEKGITGLAIKHLRDTFAVAGQTGFRSSVEWGFALKRPSIPLILSMLRG 4620

Query: 4621 LSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDKVR 4680
            LSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDKVR
Sbjct: 4621 LSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDKVR 4680

Query: 4681 MLRHATRDEMRRLALKNREDMLQGLGMRQVASDGGERIIVSRPALEGLEDVEEEEDGLAC 4740
            MLRHATRDEMRRLALKNREDMLQ LGMRQVASDGGERIIVSRPALEGLEDVEEEEDGLAC
Sbjct: 4681 MLRHATRDEMRRLALKNREDMLQRLGMRQVASDGGERIIVSRPALEGLEDVEEEEDGLAC 4740

Query: 4741 MVCREGYSLRPTDLLGVYSYSKRVNLGVGTSGSTRGECVYTTVSYFNIIHYQCHQEAKRT 4800
            MVCREGYSLRPTDLLGVYSYSKRVNLGVGTSGS+RGECVYTTVSYFNIIHYQCHQEAKRT
Sbjct: 4741 MVCREGYSLRPTDLLGVYSYSKRVNLGVGTSGSSRGECVYTTVSYFNIIHYQCHQEAKRT 4800

Query: 4801 DAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADGNR 4860
            DAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADGNR
Sbjct: 4801 DAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADGNR 4846

Query: 4861 LRLLTYDIVLV 4872
            LRLLTYDIVL+
Sbjct: 4861 LRLLTYDIVLM 4846

BLAST of HG10018299 vs. NCBI nr
Match: XP_004141595.1 (auxin transport protein BIG isoform X1 [Cucumis sativus] >KAE8648950.1 hypothetical protein Csa_009391 [Cucumis sativus])

HSP 1 Score: 9129.6 bits (23689), Expect = 0.0e+00
Identity = 4641/4871 (95.28%), Postives = 4729/4871 (97.08%), Query Frame = 0

Query: 1    MAEQSFVKLLDTIFLDDSSTSANTRKHFSSSDLLQLLRSDDSSIKLGLRQFYSILKAGLR 60
            MA+QSFVKLLDTIFLDDS+T+ANT+K FSSSDLL LLRSDDSSIKLGL QFYSIL+ GLR
Sbjct: 1    MADQSFVKLLDTIFLDDSTTTANTKKPFSSSDLLHLLRSDDSSIKLGLPQFYSILQLGLR 60

Query: 61   DLGDGNFAFQSWTDPQIQAVCSIAHAIASASRSLTVDQAEAIVVAVIKKSLELVFCYLEK 120
            DLG  NFAFQSWTDPQIQAVCSIA+AIASASRSLTVDQAEAIVVAVIKKSLE VFCYLEK
Sbjct: 61   DLGHRNFAFQSWTDPQIQAVCSIAYAIASASRSLTVDQAEAIVVAVIKKSLEFVFCYLEK 120

Query: 121  SEFKCDDFSIQNNMLMILETILVDGMDKVSDFAQLCAKKSLMDLLKSTGGDCDATIEFDN 180
            SEFKCDDFSIQNNMLMILETILVDGMDKVSD AQ CAKK L+DLLKS GGD DATIEF+N
Sbjct: 121  SEFKCDDFSIQNNMLMILETILVDGMDKVSDCAQHCAKKDLIDLLKSFGGDFDATIEFNN 180

Query: 181  TIECGSTGVCCSREEKQVGRLLMTIAAECVQADQLTSESGFSQPTFLEDMNKLIFLCQHW 240
            T+ECG TGVCCSREEKQVGRLLMTIAAEC QAD LTSE GFS+PTFLE+MNKLIFLCQHW
Sbjct: 181  TVECGFTGVCCSREEKQVGRLLMTIAAECEQADNLTSEPGFSEPTFLENMNKLIFLCQHW 240

Query: 241  AVTHLACIQHLILICKELVVLPDALDEKTGSTSFRKRLSCSLRILKLLTDLSKKFPYIEY 300
            AVTHLACIQ LILICK+LVVLPDALDEKTGST FRKRLSCSLRILKLL DLSKKFPYIEY
Sbjct: 241  AVTHLACIQRLILICKDLVVLPDALDEKTGSTIFRKRLSCSLRILKLLADLSKKFPYIEY 300

Query: 301  DAKMMQAFALFANSLPCLFGLCFEFANSHATVEGSFENTILLLLEEFLELVQVVFRNSYV 360
            DAK+MQAFAL ANSLPCLFGLCFEFANSHAT E SFENTILLLLEEFLELVQ+VFRN YV
Sbjct: 301  DAKLMQAFALLANSLPCLFGLCFEFANSHATGESSFENTILLLLEEFLELVQIVFRNIYV 360

Query: 361  SVNIQTCVVASILDNLSSSVWRYDASTANLKTPLVYFPRSVMVIIKLIQDLKGHKYHAFS 420
             VNIQTC+VASILDNLSSSVWRYDASTANLK PLVYFPR VMVIIKLIQDLKGHKYHAFS
Sbjct: 361  CVNIQTCIVASILDNLSSSVWRYDASTANLKPPLVYFPRGVMVIIKLIQDLKGHKYHAFS 420

Query: 421  FKDLETHHTSTLADLSVDIPKCYARLEIVPLHKNYKVEEILRMIFPLSKQWMDDLMHLLF 480
            FKDLE HHTSTL DLSVD+PKC+ARLE VPLHKNY VEEILRMIFP S+QWMDDLMHLLF
Sbjct: 421  FKDLEMHHTSTLTDLSVDLPKCHARLEAVPLHKNYTVEEILRMIFPPSRQWMDDLMHLLF 480

Query: 481  FLYSEGVRLRPKIERSLSSMKSSSTVEQETAVCHEDEALFGDLFSESGRSVGSVDGYDLQ 540
            FLYSEG+RLRPKIERSLSSMKSSSTVEQE AVCHEDEALFGDLFSESGRSVGSVDGYDLQ
Sbjct: 481  FLYSEGMRLRPKIERSLSSMKSSSTVEQEAAVCHEDEALFGDLFSESGRSVGSVDGYDLQ 540

Query: 541  HLAVNSTSSFCNLLLQAAKELLSFIKLCIFSPEWNASVFDDGCNKLNQNHIDILLSLLNC 600
            HLAVNSTSSFCNLLLQAAKELLSFIKLCIFSPEWNASVFDDGCNKLNQNHIDILLSLLNC
Sbjct: 541  HLAVNSTSSFCNLLLQAAKELLSFIKLCIFSPEWNASVFDDGCNKLNQNHIDILLSLLNC 600

Query: 601  EGCCSDDKSSASCLPAHDERKSGHIHEICYRLLHGLLTRHALPDSLEEYLVKKILNAENG 660
            EGCCSDDKSSASCLPAHDERKSGHIHEICYRLLHGLLTRHALPDSLEEYLVKKILNAENG
Sbjct: 601  EGCCSDDKSSASCLPAHDERKSGHIHEICYRLLHGLLTRHALPDSLEEYLVKKILNAENG 660

Query: 661  NFVYNDQTLSLLAHTLFRRTGVAGTLLRTQIYRQFVEFIIEKSKTISSNYSSLQEFMGTL 720
            N VYNDQTLSLLAHTLFRRTGVAGT LRTQIYRQFVEFIIEKSKTIS  YSSLQEFMGTL
Sbjct: 661  NSVYNDQTLSLLAHTLFRRTGVAGTQLRTQIYRQFVEFIIEKSKTISLQYSSLQEFMGTL 720

Query: 721  PSVFHIEILLVAFHLSSEGEKREISSLIFSSIRAIDAPSTFSNCTELSMWGLLVSRLIIV 780
            PSVFHIEILLVAFHLSSEGEKREISSLIFSSIRAIDAPSTFSNCTELSMWGLLVSRLI+V
Sbjct: 721  PSVFHIEILLVAFHLSSEGEKREISSLIFSSIRAIDAPSTFSNCTELSMWGLLVSRLIVV 780

Query: 781  LRHIIFHPHTCSSSLLFDFRSKLRDAPAFSSSLPYTLNDHLSSWGASVAKNIIGSSVESK 840
            LRHIIFHPHTCSSSLLFDFRSKLRDAPAFSS LPYT+NDHLSSWGASVAKNIIGSS+ESK
Sbjct: 781  LRHIIFHPHTCSSSLLFDFRSKLRDAPAFSSHLPYTVNDHLSSWGASVAKNIIGSSMESK 840

Query: 841  PFFHSLINQLIDISSFPASLRQHDLTVECPWFNAGDIFSTFSWILGFWNGKQAVTVEDLI 900
            PF +SLINQLIDISSFPASLRQHDLT+ECPWFN  DIFSTFSWILGFWNGKQA+TVEDLI
Sbjct: 841  PFLNSLINQLIDISSFPASLRQHDLTIECPWFNPSDIFSTFSWILGFWNGKQALTVEDLI 900

Query: 901  IERYIFVLCWDFPSMNALSHGGPLWSDPDTLDISNTTCFFYFSYLLLDHGGVIGEHMKFP 960
            IERYIFVLCWDFPS NALS GGPLWSDPD LDIS TTCFFYFSYLLLDHG VIGEHMKF 
Sbjct: 901  IERYIFVLCWDFPSANALSRGGPLWSDPDALDISKTTCFFYFSYLLLDHGSVIGEHMKFS 960

Query: 961  QVVIGLLQRLHGGSILEDFKALGWNFLRNGAWLSLVLSFLSVGIWRYCSKNMIPTVGSLL 1020
            +VVIGLLQRLHGGS+LEDFKALGWNFLRNG WLSL+LSFLSVGI RYCSKN IPTVGS L
Sbjct: 961  RVVIGLLQRLHGGSVLEDFKALGWNFLRNGTWLSLILSFLSVGISRYCSKNTIPTVGSFL 1020

Query: 1021 TDTTVTDNEQANFAESLISSVITDSQVSILIRELSSVLSMYLQVYQKAFVATLSSSNDHA 1080
            TDTTVTD+EQANFAESLISSVIT+SQV ILIRELSSVLSMYL+VYQKA+VATLSSSNDHA
Sbjct: 1021 TDTTVTDSEQANFAESLISSVITESQVPILIRELSSVLSMYLRVYQKAYVATLSSSNDHA 1080

Query: 1081 TEFSPLLLFKHSEFDSCVQNKTLENYGTTSCLLESVFNLMSRLDEIVDKRTLGFLSRFCW 1140
            TEFSPLLLFKHSEFD CVQNKTLENYGTTSC LESV NLMSRLDEIVDKRTLGF SR CW
Sbjct: 1081 TEFSPLLLFKHSEFDKCVQNKTLENYGTTSCSLESVLNLMSRLDEIVDKRTLGFSSRVCW 1140

Query: 1141 ESMFHGFPSHLETSSGILLSCVLSIGRIISVLAGLLRIVDVKRNIILETEVTRGILDAVM 1200
            ESMFHGFPSHLETSSGILLSCVL+IGRIISVLAGLLR+VDVKR++ILETEVTRGILDAVM
Sbjct: 1141 ESMFHGFPSHLETSSGILLSCVLNIGRIISVLAGLLRLVDVKRSVILETEVTRGILDAVM 1200

Query: 1201 TIKFDKTFESVHGLCEGIYQSLNKELDGCSYGVLFLLKQLEGYLRHMNMRGASDSTIHEL 1260
            T+KFDKTFESVHGLC+GIY+SLN ELDGCSYGVLFLLKQLE YLRH+NMRG SDSTIHEL
Sbjct: 1201 TVKFDKTFESVHGLCDGIYKSLNVELDGCSYGVLFLLKQLEEYLRHINMRGVSDSTIHEL 1260

Query: 1261 VIVKATDIMDNLRKDVSKSSVFQFYLGAEVVLEQVRELYTFQHGNLLVLLDSLDNCCSEL 1320
            VIVK  DIMD+LRKDVSKSSVFQFYLG+  V EQVRELY FQHGNLLVLLDSLDNC SEL
Sbjct: 1261 VIVKVIDIMDSLRKDVSKSSVFQFYLGSADVPEQVRELYAFQHGNLLVLLDSLDNCFSEL 1320

Query: 1321 VNLKVLGFFVELLSGEPCPKLKQEVQNKFLSMDLLSLSQWLEKRIFGLVAEDSSGGNVKG 1380
            VNLKVLGFFV+LLSGEPC KLKQEVQNKFL MDL SLS+WLEKRIFGLVAEDSSG NVKG
Sbjct: 1321 VNLKVLGFFVDLLSGEPCRKLKQEVQNKFLQMDLPSLSKWLEKRIFGLVAEDSSGVNVKG 1380

Query: 1381 SSISLRESSMNFVFCLISSPSEPLALQLQSHIFEAALVSLDMAFLRFDISVSKSYFHFVV 1440
            SSISLRESSMNFVFCLISSP+EPLALQLQSHIFEAALVSLDMAF+RFDISVSKSYFHFVV
Sbjct: 1381 SSISLRESSMNFVFCLISSPTEPLALQLQSHIFEAALVSLDMAFMRFDISVSKSYFHFVV 1440

Query: 1441 QLLKGDKSMKLLLERILILMGKLASDERLLPGLKYLFSFLEMILIESGSGKNVFERPSGK 1500
            QLLKGDKSMKLLLERILILM KLA+DERLLPG+K+LF+FLEMILIESGSGKNVFER +GK
Sbjct: 1441 QLLKGDKSMKLLLERILILMEKLANDERLLPGMKFLFNFLEMILIESGSGKNVFERTAGK 1500

Query: 1501 PLSRYAPEVGPLSSKSVGPRKNSETLVLSSNQEEGPASFECDATSAEEDEDDGTSDGEVA 1560
            PLSRYAPEVGPLSSKSVGPRKNSETLVLSSNQEEGPASF+CDATSAEEDEDDGTSDGEVA
Sbjct: 1501 PLSRYAPEVGPLSSKSVGPRKNSETLVLSSNQEEGPASFDCDATSAEEDEDDGTSDGEVA 1560

Query: 1561 SLDKDEEEDTNSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRG 1620
            SLDKDEEEDTNSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRG
Sbjct: 1561 SLDKDEEEDTNSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRG 1620

Query: 1621 HRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRKYTGHGSAPVRGASNFQCFLPFSEEGDQL 1680
            HRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRK+TGHGSAPVRGASNFQCFLPFSEEGDQL
Sbjct: 1621 HRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRKFTGHGSAPVRGASNFQCFLPFSEEGDQL 1680

Query: 1681 PESESDLEDDVSVTDTDKCLRPSVPRELLDGVSVLLEELDVEGRMLELCSCLLPTITNQR 1740
            PESESDLEDDVSVTDTDKCL+PSVP ELLDGVSVLLEEL+VE RMLELCSCLLPTITNQR
Sbjct: 1681 PESESDLEDDVSVTDTDKCLKPSVPMELLDGVSVLLEELNVEERMLELCSCLLPTITNQR 1740

Query: 1741 DPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSLV 1800
            DPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSLV
Sbjct: 1741 DPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSLV 1800

Query: 1801 KSLLSVSIRGRLAVGEGDKVSIFDIRQLIEQTTVAPMTADKTNVKPLSKNVVRFEIVHLA 1860
            KSLLSVSIRGRLAVGEGDKVSIFD+RQLIEQ TVAPMTADKTNVKPLSKNVVRFEIVHLA
Sbjct: 1801 KSLLSVSIRGRLAVGEGDKVSIFDVRQLIEQATVAPMTADKTNVKPLSKNVVRFEIVHLA 1860

Query: 1861 FNPTVENYLAVAGYEDCQVLTLNHRGEVVDRLAIELALQGAHIKRMEWVPGSQVQLMVVT 1920
            FNPTVENYLAVAGYEDCQVLTLNHRGEVVDRLAIELALQGA+IKRMEWVPGSQVQLMVVT
Sbjct: 1861 FNPTVENYLAVAGYEDCQVLTLNHRGEVVDRLAIELALQGAYIKRMEWVPGSQVQLMVVT 1920

Query: 1921 NRFVKIYDLSLDNISPMHYFTLPDDMVVDATLSTASQGRMFLIVLSENGRIFRLELSVLG 1980
            NRFVKIYDLSLDNISPMHYFTLPDDMVVDATL TASQG+MFLIVLSENGRIFRLELSVLG
Sbjct: 1921 NRFVKIYDLSLDNISPMHYFTLPDDMVVDATLFTASQGKMFLIVLSENGRIFRLELSVLG 1980

Query: 1981 NVGATPLKEIIEIQGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTEISV 2040
            N+GATPLKEII IQGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTEIS 
Sbjct: 1981 NIGATPLKEIIHIQGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTEISF 2040

Query: 2041 IYEEEQDRKLRPAGLHRWKELFAGSGLFVCFSSVKSNSALAVSMGAHDIYAQNLRHAGGS 2100
            IYEEEQD+KLRPAGLHRWKELFAGSGLFVCFSSVKSNSALAVSMGAH+IYAQNLRHAGGS
Sbjct: 2041 IYEEEQDKKLRPAGLHRWKELFAGSGLFVCFSSVKSNSALAVSMGAHEIYAQNLRHAGGS 2100

Query: 2101 SLPLVGITAYKPLSKDKIHCLVLHDDGSLQIYTHTAVGVDASAYATAEKIKKLGSGILNN 2160
            SLPLVGITAYKPLSKDKIHCLVLHDDGSLQIYTHTAVGVDASA ATAEKIKKLGSGILNN
Sbjct: 2101 SLPLVGITAYKPLSKDKIHCLVLHDDGSLQIYTHTAVGVDASANATAEKIKKLGSGILNN 2160

Query: 2161 KVYASTNPEFPLDFFENTVCITADVRLGGDAIRNGDSEGAKQSLASEDGFLESPSSSGFK 2220
            KVYASTNPEF LDFFE TVCITADVRLGGD IRNGD EGAKQSLASEDGFLESPSSSGFK
Sbjct: 2161 KVYASTNPEFALDFFEKTVCITADVRLGGDTIRNGDFEGAKQSLASEDGFLESPSSSGFK 2220

Query: 2221 ITVSNSNPDIVMVGFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLLA 2280
            ITVSNSNPDIVMVGFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLLA
Sbjct: 2221 ITVSNSNPDIVMVGFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLLA 2280

Query: 2281 DEEFSVTVGPAFNGTALPRIDSLEVYGRAKDEFGWKEKLDAVLDMEARALGSNSLLARSG 2340
            DEEFSVTVGPAFNGTALPRIDSLEVYGR KDEFGWKEKLDAVLDMEARALGSNSLLARSG
Sbjct: 2281 DEEFSVTVGPAFNGTALPRIDSLEVYGRGKDEFGWKEKLDAVLDMEARALGSNSLLARSG 2340

Query: 2341 KKRRSIQCAPIQQQVLADGLKVLSSYYLLRRSQGCPKLNDVNQELTKLKCKQLLETIYES 2400
            KKRRSIQCAPIQQQVLADGLKVLSSYYLL R QGCPKL+DVNQELTKLKCKQLLETIYES
Sbjct: 2341 KKRRSIQCAPIQQQVLADGLKVLSSYYLLCRPQGCPKLDDVNQELTKLKCKQLLETIYES 2400

Query: 2401 DREPLLQSAACRVLQAIFPKKEIYYQVKDTMRLTGVVKSTSVLSSRLGVGGAAGGWIIEE 2460
            DREPLLQSAACRVLQAIFPKKEIYYQVKDTMRL GVVKSTSVLS+RLGVGGAAGGWIIEE
Sbjct: 2401 DREPLLQSAACRVLQAIFPKKEIYYQVKDTMRLAGVVKSTSVLSTRLGVGGAAGGWIIEE 2460

Query: 2461 FTSQMRAVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVISSV 2520
            FTSQMRAVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVISSV
Sbjct: 2461 FTSQMRAVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVISSV 2520

Query: 2521 ELIYCYAECLALHGPDTGRHSVAPAVVLFKKLLFSSSEAVQASSSLAISSRLLQVPFPKQ 2580
            ELIYCYAECLALHGPDTGR SVAPAV+LFKKLLFSSSEAVQASSSLAISSRLLQVPFPKQ
Sbjct: 2521 ELIYCYAECLALHGPDTGRRSVAPAVLLFKKLLFSSSEAVQASSSLAISSRLLQVPFPKQ 2580

Query: 2581 TMLATDDGADIPLSAPVPTETTGTNPQVMIEEDAVASSVQYCCDGCSTVPILRRRWHCTI 2640
            TMLATDDGADIPLSAPV TET GTNPQV+IEEDA+ASSVQYCCDGCS VPILRRRWHCTI
Sbjct: 2581 TMLATDDGADIPLSAPVSTETPGTNPQVVIEEDAIASSVQYCCDGCSKVPILRRRWHCTI 2640

Query: 2641 CPDFDLCESCYEVLDADRLPSPHSRDHPMTAIPIEVDSLGDGNEYHFATEDINDSSLTSL 2700
            CPDFDLCESCYEVLDADRLPSPHSRDH MTAIPIEV+SLGDGNEYHFATEDINDSSLTS+
Sbjct: 2641 CPDFDLCESCYEVLDADRLPSPHSRDHLMTAIPIEVESLGDGNEYHFATEDINDSSLTSV 2700

Query: 2701 IPDISVKNPVSSIHVLEPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWMETT 2760
              DI VKNP SSIHVLEPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWMETT
Sbjct: 2701 KSDIGVKNPASSIHVLEPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWMETT 2760

Query: 2761 SGVQAVPVMQLFYRLSSTMGGPFMNSLKSENLNLERLIKWFLDEINLNKPFEAKTRTSFG 2820
            SGVQAVPVMQLFYRLSSTMGGPFMNSLKSENLNLERLIKWFLDEINLNKPFEAKTRTSFG
Sbjct: 2761 SGVQAVPVMQLFYRLSSTMGGPFMNSLKSENLNLERLIKWFLDEINLNKPFEAKTRTSFG 2820

Query: 2821 EVAILVFMFFTLMLRNWHQPGSDGPGAKPSTTTDTHDKNSTQVAPSTSVTAQSSMDDQGK 2880
            EVAILVFMFFTLMLRNWHQPGSDG GAK STT D HDKNSTQVAPSTS+TAQSS+DDQGK
Sbjct: 2821 EVAILVFMFFTLMLRNWHQPGSDGTGAKSSTTADMHDKNSTQVAPSTSLTAQSSVDDQGK 2880

Query: 2881 NDFTSQLLRACSSIRQQSFVNYLMDVLQQLVHVFKSSTIDYDSGHGFHNGSGCGALLTVR 2940
            NDFTSQLLRACSSIRQQSFVNYLMDVLQQLVHVFKSSTIDYDSGHGF+NGSGCGALLTVR
Sbjct: 2881 NDFTSQLLRACSSIRQQSFVNYLMDVLQQLVHVFKSSTIDYDSGHGFNNGSGCGALLTVR 2940

Query: 2941 KDLPAGNFSPFFSDSYAKAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKVYK 3000
            KDLPAGNFSPFFSDSYAKAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKVYK
Sbjct: 2941 KDLPAGNFSPFFSDSYAKAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKVYK 3000

Query: 3001 IYSSKDLKLDAYQDVLCSYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTEVK 3060
            IYSSKDLKLDAYQDVLCSYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTEVK
Sbjct: 3001 IYSSKDLKLDAYQDVLCSYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTEVK 3060

Query: 3061 RLFKYINKVGGFQNPMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRHADVLPFLLNGI 3120
            +LFKY+NKVGGFQNPMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRH DVLPFLLNGI
Sbjct: 3061 KLFKYVNKVGGFQNPMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRHGDVLPFLLNGI 3120

Query: 3121 FYFGEESVVQTLKLLNLAFYTGKDIGHSVQKSEAGDTGTSTNKSGTQTVDSRKKRKGEDG 3180
            FYFGEESV+QTLKLLNLAFYTGKDIGHS QKSEAGDTGTSTNKSGTQTVD RKK+KGEDG
Sbjct: 3121 FYFGEESVIQTLKLLNLAFYTGKDIGHSAQKSEAGDTGTSTNKSGTQTVDVRKKKKGEDG 3180

Query: 3181 NDSALEKSYLDMEIMVNIFVDKGSNVLSHFIDCFLLEWNSSSVRAETKGVVCGIWHHGKQ 3240
            +DSALEKSYLDME MVNIFVDKGSNVLSHFIDCFLLEWNSSSVRAE KGVVCGIWHHGKQ
Sbjct: 3181 SDSALEKSYLDMETMVNIFVDKGSNVLSHFIDCFLLEWNSSSVRAEAKGVVCGIWHHGKQ 3240

Query: 3241 TFKETLLMALLQKVKTLPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSDVI 3300
            TFKETLLMALLQKVKTLPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSDVI
Sbjct: 3241 TFKETLLMALLQKVKTLPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSDVI 3300

Query: 3301 RSIYQTLHSQNELLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLESL 3360
            RSIYQTLHSQNELLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLESL
Sbjct: 3301 RSIYQTLHSQNELLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLESL 3360

Query: 3361 KSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNNWS 3420
            KSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNNWS
Sbjct: 3361 KSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNNWS 3420

Query: 3421 LWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPVTD 3480
            LWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPVTD
Sbjct: 3421 LWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPVTD 3480

Query: 3481 KHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNMEND 3540
            KHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNMEND
Sbjct: 3481 KHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNMEND 3540

Query: 3541 EDMKRGLAAIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSLPG 3600
            EDMKRGL AIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSLPG
Sbjct: 3541 EDMKRGLTAIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSLPG 3600

Query: 3601 PSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASRFV 3660
            PSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASRFV
Sbjct: 3601 PSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASRFV 3660

Query: 3661 ISRSPNNCYGCATTFVTQCLEILQVLSKHQSSKKQLVSLGILSELFENNIHQGPKTARIQ 3720
            ISRSPNNCYGCATTFVTQCLEILQVLSKHQSSKKQLVSLGILSELFENNIHQGPKTARIQ
Sbjct: 3661 ISRSPNNCYGCATTFVTQCLEILQVLSKHQSSKKQLVSLGILSELFENNIHQGPKTARIQ 3720

Query: 3721 ARAVLCSFSEGDVNAVNGLNNLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLADEF 3780
            ARAVLCSFSEGDVNAV+GLNNLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLADEF
Sbjct: 3721 ARAVLCSFSEGDVNAVSGLNNLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLADEF 3780

Query: 3781 WEARLRVVFQLLFSSIKSGAKHPAIAEHIILPCLRIISQACTPPKSDTVDKEQRMGKLTS 3840
            WEARLRVVFQLLFSSIKSGAKHPAIAEHII PCLRIISQACTPPKS+TVDKEQR GKLTS
Sbjct: 3781 WEARLRVVFQLLFSSIKSGAKHPAIAEHIIHPCLRIISQACTPPKSETVDKEQRTGKLTS 3840

Query: 3841 VSQNKDENATNISGSFSGPVSGNKSAPESLEHNWDSSHRTQDIQLLSYAEWEKGASYLDF 3900
            VSQNKDENATNISGSFSGPV GNKSAPESLEHNWDSSH+TQDIQLLSYAEWEKGASYLDF
Sbjct: 3841 VSQNKDENATNISGSFSGPVIGNKSAPESLEHNWDSSHKTQDIQLLSYAEWEKGASYLDF 3900

Query: 3901 VRRQYKVSQVCKGTVQRSRTQKGDYLSLKYALKWKRFVCRNAKSDLSAFELGSWVTELVL 3960
            VRRQYKVSQV KGTVQRSRTQKGDYLSLKYALKWKRFVCR+A SDLSAFELGSWVTELVL
Sbjct: 3901 VRRQYKVSQVFKGTVQRSRTQKGDYLSLKYALKWKRFVCRSAISDLSAFELGSWVTELVL 3960

Query: 3961 CACSQSIRSEMCMLISLLCAQSSSRRFRLLDLLVSLLPATLSAGESAAEYFDLLFKMVDS 4020
            CACSQSIRSEMCMLISLLC+QSSSRRFRLLDLLVSLLPATLSAGESAAEYF+LLFKMVDS
Sbjct: 3961 CACSQSIRSEMCMLISLLCSQSSSRRFRLLDLLVSLLPATLSAGESAAEYFELLFKMVDS 4020

Query: 4021 EDARLFLTVRGCLRTICQLISQEVGNVESLERSLHIDISQGFILHKLIELLGKFLEIPNI 4080
            EDARLFLTVRGCLRTICQLISQEV NVESLERSLHIDISQGFILHKLIELLGKFLEIPNI
Sbjct: 4021 EDARLFLTVRGCLRTICQLISQEVSNVESLERSLHIDISQGFILHKLIELLGKFLEIPNI 4080

Query: 4081 RSRYNFFLKHLFFRLLLFKLIQCDLIVRFMRDNLLSEVLEALIVIRGLVVQKTKLISDCN 4140
            RS                         RFMRDNLLSEVLEALIVIRGLVVQKTKLISDCN
Sbjct: 4081 RS-------------------------RFMRDNLLSEVLEALIVIRGLVVQKTKLISDCN 4140

Query: 4141 RLLKDLLDSLLLESNENKRQFIRACICGLQIHGEERKGRTCLFILEQLCNLISPSKPEPV 4200
            RLLKDLLDSLLLESNENKRQFIRACICGLQ HGEERKGRTCLFILEQLCNLISPSKPEPV
Sbjct: 4141 RLLKDLLDSLLLESNENKRQFIRACICGLQNHGEERKGRTCLFILEQLCNLISPSKPEPV 4200

Query: 4201 YLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKICHQLDLLGFLEDDYGMELLV 4260
            YLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKICHQLDLL FLEDDYGMELLV
Sbjct: 4201 YLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKICHQLDLLSFLEDDYGMELLV 4260

Query: 4261 AGNIISLDLSIALVYEQVWKKSNQSSNAISNTALISTTAARDSPPMTVTYRLQGLDGEAT 4320
            AGNIISLDLSIALVYEQVWKKSNQSSNAISNTA+ISTTAARDSPPMTVTYRLQGLDGEAT
Sbjct: 4261 AGNIISLDLSIALVYEQVWKKSNQSSNAISNTAIISTTAARDSPPMTVTYRLQGLDGEAT 4320

Query: 4321 EPMIKELEEDREESQDPELEFAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVLNL 4380
            EPMIKELEEDREESQDPELEFAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVLNL
Sbjct: 4321 EPMIKELEEDREESQDPELEFAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVLNL 4380

Query: 4381 LMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESESIS 4440
            LMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESESIS
Sbjct: 4381 LMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESESIS 4440

Query: 4441 IGQSALTVTSEQTGTGEQAKKIVLMFLERLSHPFGSKKSNKQQRNTEMVARILPYLTYGE 4500
            IGQSALTVTSEQTGTGEQAKKIVLMFLERLSHPFG KKSNKQQRNTEMVARILPYLTYGE
Sbjct: 4441 IGQSALTVTSEQTGTGEQAKKIVLMFLERLSHPFGFKKSNKQQRNTEMVARILPYLTYGE 4500

Query: 4501 PAAMDALIQHFTPYLNDWDEFDRLQKQHEDNPEDKSISEQAAKQRFTVENFVRVSESLKT 4560
            PAAMDALIQHFTPYLNDWDEFDRLQKQHEDNP+DKS+SEQAAKQRFTVENFVRVSESLKT
Sbjct: 4501 PAAMDALIQHFTPYLNDWDEFDRLQKQHEDNPDDKSLSEQAAKQRFTVENFVRVSESLKT 4560

Query: 4561 SSCGERLKDIILEKGITGLAIKHLRDSFAVAGQTGFRSSVEWAFALKRPSIPLILSMLRG 4620
            SSCGERLKDIILEKGITGLAIKHLRD+FAVAGQTGFRSSVEW FALKRPSIPLILSMLRG
Sbjct: 4561 SSCGERLKDIILEKGITGLAIKHLRDTFAVAGQTGFRSSVEWGFALKRPSIPLILSMLRG 4620

Query: 4621 LSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDKVR 4680
            LSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDKVR
Sbjct: 4621 LSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDKVR 4680

Query: 4681 MLRHATRDEMRRLALKNREDMLQGLGMRQVASDGGERIIVSRPALEGLEDVEEEEDGLAC 4740
            MLRHATRDEMRRLALKNREDMLQ LGMRQVASDGGERIIVSRPALEGLEDVEEEEDGLAC
Sbjct: 4681 MLRHATRDEMRRLALKNREDMLQRLGMRQVASDGGERIIVSRPALEGLEDVEEEEDGLAC 4740

Query: 4741 MVCREGYSLRPTDLLGVYSYSKRVNLGVGTSGSTRGECVYTTVSYFNIIHYQCHQEAKRT 4800
            MVCREGYSLRPTDLLGVYSYSKRVNLGVGTSGS+RGECVYTTVSYFNIIHYQCHQEAKRT
Sbjct: 4741 MVCREGYSLRPTDLLGVYSYSKRVNLGVGTSGSSRGECVYTTVSYFNIIHYQCHQEAKRT 4800

Query: 4801 DAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADGNR 4860
            DAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADGNR
Sbjct: 4801 DAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADGNR 4846

Query: 4861 LRLLTYDIVLV 4872
            LRLLTYDIVL+
Sbjct: 4861 LRLLTYDIVLM 4846

BLAST of HG10018299 vs. NCBI nr
Match: XP_038890253.1 (auxin transport protein BIG isoform X2 [Benincasa hispida])

HSP 1 Score: 8967.8 bits (23269), Expect = 0.0e+00
Identity = 4557/4739 (96.16%), Postives = 4623/4739 (97.55%), Query Frame = 0

Query: 134  MLMILETILVDGMDKVSDFAQLCAKKSLMDLLKSTGGDCDATIEFDNTIECGSTGVCCSR 193
            MLMILETILVDGMDKV+D AQLCAKK+L+DLLKSTGGDCDATIEF+N IECG  GVCCSR
Sbjct: 1    MLMILETILVDGMDKVTDCAQLCAKKALIDLLKSTGGDCDATIEFENAIECGFAGVCCSR 60

Query: 194  EEKQVGRLLMTIAAECVQADQLTSESGFSQPTFLEDMNKLIFLCQHWAVTHLACIQHLIL 253
            EEKQVGRLLMT+AAECVQADQLTSESGFS+PTFLEDMNKLIFL QHWAVTHLACIQ LIL
Sbjct: 61   EEKQVGRLLMTVAAECVQADQLTSESGFSEPTFLEDMNKLIFLFQHWAVTHLACIQRLIL 120

Query: 254  ICKELVVLPDALDEKTGSTSFRKRLSCSLRILKLLTDLSKKFPYIEYDAKMMQAFALFAN 313
            ICKELVVLPDALDEKTGSTSFRKRLSCSLRILKLLTDLSKKFPYIEYDAK+MQAFALFAN
Sbjct: 121  ICKELVVLPDALDEKTGSTSFRKRLSCSLRILKLLTDLSKKFPYIEYDAKLMQAFALFAN 180

Query: 314  SLPCLFGLCFEFANSHATVEGSFENTILLLLEEFLELVQVVFRNSYVSVNIQTCVVASIL 373
            SLPCLFGLCFEFANSHA VEGS ENTILLLLEEFLELVQVVFRNSYV VNIQTC+VASIL
Sbjct: 181  SLPCLFGLCFEFANSHAIVEGSLENTILLLLEEFLELVQVVFRNSYVCVNIQTCIVASIL 240

Query: 374  DNLSSSVWRYDASTANLKTPLVYFPRSVMVIIKLIQDLKGHKYHAFSFKDLETHHTSTLA 433
            DNLSSSVWRYDAS ANLK PLVYFPRSVMVIIKLIQDLKGHKYHAFSFKDLE HHTSTLA
Sbjct: 241  DNLSSSVWRYDASAANLKPPLVYFPRSVMVIIKLIQDLKGHKYHAFSFKDLEMHHTSTLA 300

Query: 434  DLSVDIPKCYARLEIVPLHKNYKVEEILRMIFPLSKQWMDDLMHLLFFLYSEGVRLRPKI 493
            DLSVDIPKCYAR EIVPLHKNY V+EILRMIFPLSKQWMDDLMHLLFFLYSEGVRLRPKI
Sbjct: 301  DLSVDIPKCYARSEIVPLHKNYTVDEILRMIFPLSKQWMDDLMHLLFFLYSEGVRLRPKI 360

Query: 494  ERSLSSMKSSSTVEQETAVCHEDEALFGDLFSESGRSVGSVDGYDLQHLAVNSTSSFCNL 553
            ERSLSSMKSSSTVEQETAVCHEDEALFGDLFSESGRSVGSVDGYDLQHLAVNSTSS CNL
Sbjct: 361  ERSLSSMKSSSTVEQETAVCHEDEALFGDLFSESGRSVGSVDGYDLQHLAVNSTSSLCNL 420

Query: 554  LLQAAKELLSFIKLCIFSPEWNASVFDDGCNKLNQNHIDILLSLLNCEGCCSDDKSSASC 613
            LLQA KELLSFIKLCIFSPEWNASVFDDGCNKLNQNH+DILLSLLNCEGCCSDDKSS+SC
Sbjct: 421  LLQATKELLSFIKLCIFSPEWNASVFDDGCNKLNQNHVDILLSLLNCEGCCSDDKSSSSC 480

Query: 614  LPAHDERKSGHIHEICYRLLHGLLTRHALPDSLEEYLVKKILNAENGNFVYNDQTLSLLA 673
            LPAHDERKSGHIHEICYRLLHGLLT HALPDSLEEYLVKKILNAENGNFVYNDQTLSLLA
Sbjct: 481  LPAHDERKSGHIHEICYRLLHGLLTSHALPDSLEEYLVKKILNAENGNFVYNDQTLSLLA 540

Query: 674  HTLFRRTGVAGTLLRTQIYRQFVEFIIEKSKTISSNYSSLQEFMGTLPSVFHIEILLVAF 733
            HTLFRRTGVAGT LRTQIYRQFVEFII KSKTIS  YSSLQEFMGTLPSVFHIEILLVAF
Sbjct: 541  HTLFRRTGVAGTQLRTQIYRQFVEFIIVKSKTISLKYSSLQEFMGTLPSVFHIEILLVAF 600

Query: 734  HLSSEGEKREISSLIFSSIRAIDAPSTFSNCTELSMWGLLVSRLIIVLRHIIFHPHTCSS 793
            HLSSEGEKREIS LIFSSIRAIDAPS+FS CTELSMWGLLVSRLIIVLRHIIFHPHTCSS
Sbjct: 601  HLSSEGEKREISCLIFSSIRAIDAPSSFSKCTELSMWGLLVSRLIIVLRHIIFHPHTCSS 660

Query: 794  SLLFDFRSKLRDAPAFSSSLPYTLNDHLSSWGASVAKNIIGSSVESKPFFHSLINQLIDI 853
            SLLFDFRSKLRDAPAFSSSLPYT+NDHLSSWGAS+AKNIIGSSVES+PFF+SLINQLIDI
Sbjct: 661  SLLFDFRSKLRDAPAFSSSLPYTVNDHLSSWGASIAKNIIGSSVESQPFFNSLINQLIDI 720

Query: 854  SSFPASLRQHDLTVECPWFNAGDIFSTFSWILGFWNGKQAVTVEDLIIERYIFVLCWDFP 913
            SSFPASLRQHD T+ECPWFN  DIFSTFSWILGFWNGKQAV VEDLIIERYIFVLCWDFP
Sbjct: 721  SSFPASLRQHDSTIECPWFNPSDIFSTFSWILGFWNGKQAVAVEDLIIERYIFVLCWDFP 780

Query: 914  SMNALSHGGPLWSDPDTLDISNTTCFFYFSYLLLDHGGVIGEHMKFPQVVIGLLQRLHGG 973
            SMN LSHGG LWSD DTLDIS+TTCFFYFSYLLLDHG VIGE MKFPQVVI LL+RLHGG
Sbjct: 781  SMNVLSHGGTLWSDLDTLDISDTTCFFYFSYLLLDHGDVIGERMKFPQVVIDLLRRLHGG 840

Query: 974  SILEDFKALGWNFLRNGAWLSLVLSFLSVGIWRYCSKNMIPTVGSLLTDTTVTDNEQANF 1033
            S LEDFKALGWNFLRNGAWLSLVLSFLSVGIWRYC+KN IPTVGSL TDTTVTDNE ANF
Sbjct: 841  STLEDFKALGWNFLRNGAWLSLVLSFLSVGIWRYCNKNTIPTVGSLWTDTTVTDNELANF 900

Query: 1034 AESLISSVITDSQVSILIRELSSVLSMYLQVYQKAFVATLSSSNDHATEFSPLLLFKHSE 1093
            AESLISSVIT+SQVSILI ELS VLSMYLQVYQKAFVATLSSSNDHATEFSPLLLFKHSE
Sbjct: 901  AESLISSVITNSQVSILIGELSFVLSMYLQVYQKAFVATLSSSNDHATEFSPLLLFKHSE 960

Query: 1094 FDSCVQNKTLENYGTTSCLLESVFNLMSRLDEIVDKRTLGFLSRFCWESMFHGFPSHLET 1153
            FD C+QNKTLENYGTTSCLLESVFNLMSRLDEIVDK+TLGFLSR CWESMFHGFPSHLET
Sbjct: 961  FDRCIQNKTLENYGTTSCLLESVFNLMSRLDEIVDKKTLGFLSRVCWESMFHGFPSHLET 1020

Query: 1154 SSGILLSCVLSIGRIISVLAGLLRIVDVKRNIILETEVTRGILDAVMTIKFDKTFESVHG 1213
            SSGILLSCVLSIGRIISVLAGLLRIVDVK NIILETEVTRGILDAVMTIKFDKTFESVHG
Sbjct: 1021 SSGILLSCVLSIGRIISVLAGLLRIVDVKPNIILETEVTRGILDAVMTIKFDKTFESVHG 1080

Query: 1214 LCEGIYQSLNKELDGCSYGVLFLLKQLEGYLRHMNMRGASDSTIHELVIVKATDIMDNLR 1273
            LCEGIYQSLN ELDGCSYGVLFLLKQLEGYLRH+N RG SDSTIHELVIVKATDIMD+LR
Sbjct: 1081 LCEGIYQSLNAELDGCSYGVLFLLKQLEGYLRHINTRGVSDSTIHELVIVKATDIMDSLR 1140

Query: 1274 KDVSKSSVFQFYLGAEVVLEQVRELYTFQHGNLLVLLDSLDNCCSELVNLKVLGFFVELL 1333
            KDVSKSSVFQFYLGAEVV EQVRELY FQHGNLLVLLDSLDNCCSE+VNLKVLGFF ELL
Sbjct: 1141 KDVSKSSVFQFYLGAEVVPEQVRELYAFQHGNLLVLLDSLDNCCSEIVNLKVLGFFGELL 1200

Query: 1334 SGEPCPKLKQEVQNKFLSMDLLSLSQWLEKRIFGLVAEDSSGGNVKGSSISLRESSMNFV 1393
            SGEPC KLKQEVQNKFLSMDLLSLS+WLEKRIFGLVAEDSSG NVKGSS SLRESSMNFV
Sbjct: 1201 SGEPCSKLKQEVQNKFLSMDLLSLSKWLEKRIFGLVAEDSSGVNVKGSSTSLRESSMNFV 1260

Query: 1394 FCLISSPSEPLALQLQSHIFEAALVSLDMAFLRFDISVSKSYFHFVVQLLKGDKSMKLLL 1453
            FCLISSP+EPLALQLQSHIFEAALVSLDMAFLRFDISVSKSYFHFVVQLLKG+KSMKLLL
Sbjct: 1261 FCLISSPTEPLALQLQSHIFEAALVSLDMAFLRFDISVSKSYFHFVVQLLKGEKSMKLLL 1320

Query: 1454 ERILILMGKLASDERLLPGLKYLFSFLEMILIESGSGKNVFERPSGKPLSRYAPEVGPLS 1513
            ERILILM KL SDERLLPGLKYLF FLEMILIESGSGKNVFERP+GKPLSRYAPEVGPLS
Sbjct: 1321 ERILILMEKLVSDERLLPGLKYLFIFLEMILIESGSGKNVFERPTGKPLSRYAPEVGPLS 1380

Query: 1514 SKSVGPRKNSETLVLSSNQEEGPASFECDATSAEEDEDDGTSDGEVASLDKDEEEDTNSE 1573
            SKSVGPRK+SETLV SS+QEEGPASFECDATSAEEDEDDGTSDGEVASLDKDEEEDTNSE
Sbjct: 1381 SKSVGPRKSSETLVFSSSQEEGPASFECDATSAEEDEDDGTSDGEVASLDKDEEEDTNSE 1440

Query: 1574 RALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRGHRVVYSRSSRFFC 1633
            RALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRGHRVVYSRSSRFFC
Sbjct: 1441 RALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRGHRVVYSRSSRFFC 1500

Query: 1634 DCGAGGVRGSSCQCLKPRKYTGHGSAPVRGASNFQCFLPFSEEGDQLPESESDLEDDVSV 1693
            DCGAGGVRGSSCQCLKPRK+TGHGSAPVRGASNFQCFLPFSEEGDQLPESESDLEDDVSV
Sbjct: 1501 DCGAGGVRGSSCQCLKPRKFTGHGSAPVRGASNFQCFLPFSEEGDQLPESESDLEDDVSV 1560

Query: 1694 TDTDKCLRPSVPRELLDGVSVLLEELDVEGRMLELCSCLLPTITNQRDPDLSKDKKIILG 1753
            TDTDKCLRPSVPRELLDG+SVLLEELDVEG MLELCS LLPTITNQRDPDLSKDKKIILG
Sbjct: 1561 TDTDKCLRPSVPRELLDGISVLLEELDVEGSMLELCSRLLPTITNQRDPDLSKDKKIILG 1620

Query: 1754 KDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSLVKSLLSVSIRGRLA 1813
            KDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSLVKSLLSVSIRGRLA
Sbjct: 1621 KDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSLVKSLLSVSIRGRLA 1680

Query: 1814 VGEGDKVSIFDIRQLIEQTTVAPMTADKTNVKPLSKNVVRFEIVHLAFNPTVENYLAVAG 1873
            VGEGDKVSIFD+RQLIEQ TVAPMTADKTNVKPLSKNVVRFEIVHLAFNPTVENYLAVAG
Sbjct: 1681 VGEGDKVSIFDVRQLIEQATVAPMTADKTNVKPLSKNVVRFEIVHLAFNPTVENYLAVAG 1740

Query: 1874 YEDCQVLTLNHRGEVVDRLAIELALQGAHIKRMEWVPGSQVQLMVVTNRFVKIYDLSLDN 1933
            YEDCQVLTLNHRGEVVDRLAIELALQGAHI+RMEWVPGSQVQLMVVTN+FVKIYDLSLDN
Sbjct: 1741 YEDCQVLTLNHRGEVVDRLAIELALQGAHIRRMEWVPGSQVQLMVVTNKFVKIYDLSLDN 1800

Query: 1934 ISPMHYFTLPDDMVVDATLSTASQGRMFLIVLSENGRIFRLELSVLGNVGATPLKEIIEI 1993
            ISPMHYFTLPDDMVVDATL TASQGRMFLIVLSENGRIFRLELSVLGNVGATPLKEII I
Sbjct: 1801 ISPMHYFTLPDDMVVDATLFTASQGRMFLIVLSENGRIFRLELSVLGNVGATPLKEIIHI 1860

Query: 1994 QGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTEISVIYEEEQDRKLRPA 2053
            QGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLT ISVIYEEEQDRKLRPA
Sbjct: 1861 QGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTGISVIYEEEQDRKLRPA 1920

Query: 2054 GLHRWKELFAGSGLFVCFSSVKSNSALAVSMGAHDIYAQNLRHAGGSSLPLVGITAYKPL 2113
            GLHRWKELFAGSGLFVCFSSVKSNSAL VSMGAH+IYAQNL+HAGGSSLPLVGITAYKPL
Sbjct: 1921 GLHRWKELFAGSGLFVCFSSVKSNSALVVSMGAHEIYAQNLKHAGGSSLPLVGITAYKPL 1980

Query: 2114 SKDKIHCLVLHDDGSLQIYTHTAVGVDASAYATAEKIKKLGSGILNNKVYASTNPEFPLD 2173
            SKDKIHC VLHDDGSLQIYTHTAVGVDASA ATAEKIKKLGSGILNNKVYASTNPEFPLD
Sbjct: 1981 SKDKIHCFVLHDDGSLQIYTHTAVGVDASANATAEKIKKLGSGILNNKVYASTNPEFPLD 2040

Query: 2174 FFENTVCITADVRLGGDAIRNGDSEGAKQSLASEDGFLESPSSSGFKITVSNSNPDIVMV 2233
            FFE TVCITADVRLGGDAIRNGDSEGAKQSLASEDGFLESPSSSGFKITVSNSNPDIVMV
Sbjct: 2041 FFEKTVCITADVRLGGDAIRNGDSEGAKQSLASEDGFLESPSSSGFKITVSNSNPDIVMV 2100

Query: 2234 GFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLLADEEFSVTVGPAFN 2293
            GFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLLADEEFSVTVGPAFN
Sbjct: 2101 GFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLLADEEFSVTVGPAFN 2160

Query: 2294 GTALPRIDSLEVYGRAKDEFGWKEKLDAVLDMEARALGSNSLLARSGKKRRSIQCAPIQQ 2353
            GTALPRIDSLEVYGRAKDEFGWKEKLDAVLDMEARALGSNSLLARSGKKRRSIQCAPIQQ
Sbjct: 2161 GTALPRIDSLEVYGRAKDEFGWKEKLDAVLDMEARALGSNSLLARSGKKRRSIQCAPIQQ 2220

Query: 2354 QVLADGLKVLSSYYLLRRSQGCPKLNDVNQELTKLKCKQLLETIYESDREPLLQSAACRV 2413
            QVLADGLKVLSSYYLLRRSQGCPKL+D NQELTKLKCKQLLETIYESDREPLLQSAACRV
Sbjct: 2221 QVLADGLKVLSSYYLLRRSQGCPKLDDANQELTKLKCKQLLETIYESDREPLLQSAACRV 2280

Query: 2414 LQAIFPKKEIYYQVKDTMRLTGVVKSTSVLSSRLGVGGAAGGWIIEEFTSQMRAVSKIAL 2473
            LQAIFPKKE YYQVKDTMRLTGVVKSTSVLSSRLGVGGAAGGWIIEEFTSQMRAVSKIAL
Sbjct: 2281 LQAIFPKKETYYQVKDTMRLTGVVKSTSVLSSRLGVGGAAGGWIIEEFTSQMRAVSKIAL 2340

Query: 2474 HRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVISSVELIYCYAECLALH 2533
            HRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQ LNNIVISSVELIYCYAECLALH
Sbjct: 2341 HRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQILNNIVISSVELIYCYAECLALH 2400

Query: 2534 GPDTGRHSVAPAVVLFKKLLFSSSEAVQASSSLAISSRLLQVPFPKQTMLATDDGADIPL 2593
            GPDTGRHSVAPAVVLFKKLLFSSSEAVQASSSLAISSRLLQVPFPKQTMLATDDGADIPL
Sbjct: 2401 GPDTGRHSVAPAVVLFKKLLFSSSEAVQASSSLAISSRLLQVPFPKQTMLATDDGADIPL 2460

Query: 2594 SAPVPTETTGTNPQVMIEEDAVASSVQYCCDGCSTVPILRRRWHCTICPDFDLCESCYEV 2653
            SAPV TETTGTNPQVMIEEDAVASSVQYCCDGCS VPILRRRWHCT+CPDFDLCESCYEV
Sbjct: 2461 SAPVTTETTGTNPQVMIEEDAVASSVQYCCDGCSKVPILRRRWHCTVCPDFDLCESCYEV 2520

Query: 2654 LDADRLPSPHSRDHPMTAIPIEVDSLGDGNEYHFATEDINDSSLTSLI-PDISVKNPVSS 2713
            LDADRLPSPHSRDHPMTAIPIEV+S+GDGNEYHFATEDINDSSLTS +  DISVKNPVSS
Sbjct: 2521 LDADRLPSPHSRDHPMTAIPIEVESVGDGNEYHFATEDINDSSLTSSVRADISVKNPVSS 2580

Query: 2714 IHVLEPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWMETTSGVQAVPVMQLF 2773
            IHVLEPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWMETTSGVQAVPVMQLF
Sbjct: 2581 IHVLEPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWMETTSGVQAVPVMQLF 2640

Query: 2774 YRLSSTMGGPFMNSLKSENLNLERLIKWFLDEINLNKPFEAKTRTSFGEVAILVFMFFTL 2833
            YRLSSTMGGPFMNSLKSENLNLERLIKWFLDEI+LNKPFEAK RTSFGE+AILVFMFFTL
Sbjct: 2641 YRLSSTMGGPFMNSLKSENLNLERLIKWFLDEIDLNKPFEAKARTSFGEIAILVFMFFTL 2700

Query: 2834 MLRNWHQPGSDGPGAKPSTTTDTHDKNSTQVAPSTSVTAQSSMDDQGKNDFTSQLLRACS 2893
            MLRNWHQPGSDGPGAKPST  DTHDK+STQVAPSTSVTAQSS+DDQGKNDFTSQLLRACS
Sbjct: 2701 MLRNWHQPGSDGPGAKPSTAADTHDKSSTQVAPSTSVTAQSSVDDQGKNDFTSQLLRACS 2760

Query: 2894 SIRQQSFVNYLMDVLQQLVHVFKSSTIDYDSGHGFHNGSGCGALLTVRKDLPAGNFSPFF 2953
            SIRQQSFVNYLMDVLQQLVHVFKSSTIDYDSGHGF+NGSGCGALLTVRKDLPAGNFSPFF
Sbjct: 2761 SIRQQSFVNYLMDVLQQLVHVFKSSTIDYDSGHGFNNGSGCGALLTVRKDLPAGNFSPFF 2820

Query: 2954 SDSYAKAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKVYKIYSSKDLKLDAY 3013
            SDSYAKAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKVYKIYSSKDLKLDAY
Sbjct: 2821 SDSYAKAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKVYKIYSSKDLKLDAY 2880

Query: 3014 QDVLCSYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTEVKRLFKYINKVGGF 3073
            QDVLCSYINNPNTSFVRRY+RRLFLHICGSKSHYYSIRDSWQFSTEVK+LFKYINKVGGF
Sbjct: 2881 QDVLCSYINNPNTSFVRRYSRRLFLHICGSKSHYYSIRDSWQFSTEVKKLFKYINKVGGF 2940

Query: 3074 QNPMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRHADVLPFLLNGIFYFGEESVVQTL 3133
            QNPMSYERSVKIVKCLTT+AEVAA+RPRNWQKYCLRH DVLPFLLNGIFY GEESV+QTL
Sbjct: 2941 QNPMSYERSVKIVKCLTTLAEVAASRPRNWQKYCLRHGDVLPFLLNGIFYIGEESVIQTL 3000

Query: 3134 KLLNLAFYTGKDIGHSVQKSEAGDTGTSTNKSGTQTVDSRKKRKGEDGNDSALEKSYLDM 3193
            KLLNLAFYTGKD G+S+QKSEAGD+GTSTNKSGTQ VDSRKKRKGEDGNDSALEKSYLDM
Sbjct: 3001 KLLNLAFYTGKDTGYSIQKSEAGDSGTSTNKSGTQPVDSRKKRKGEDGNDSALEKSYLDM 3060

Query: 3194 EIMVNIFVDKGSNVLSHFIDCFLLEWNSSSVRAETKGVVCGIWHHGKQTFKETLLMALLQ 3253
            E MVNIF+DKGSNVLSHFIDCFLLEWNSSSVRAE KGVV GIWHHGKQTFKETLLMALLQ
Sbjct: 3061 ETMVNIFIDKGSNVLSHFIDCFLLEWNSSSVRAEAKGVVYGIWHHGKQTFKETLLMALLQ 3120

Query: 3254 KVKTLPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSDVIRSIYQTLHSQNE 3313
            KV+TLPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSDVIRSIYQTLHSQNE
Sbjct: 3121 KVRTLPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSDVIRSIYQTLHSQNE 3180

Query: 3314 LLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLESLKSETKFTDNRII 3373
            LLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLESLKSETKFTDNRII
Sbjct: 3181 LLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLESLKSETKFTDNRII 3240

Query: 3374 VKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNNWSLWKRAKSCHLAF 3433
            VKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNNWSLWKRAKSCHLAF
Sbjct: 3241 VKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNNWSLWKRAKSCHLAF 3300

Query: 3434 NQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPVTDKHGICSNCHENA 3493
            NQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPVTDKHGICSNCHENA
Sbjct: 3301 NQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPVTDKHGICSNCHENA 3360

Query: 3494 YQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNMENDEDMKRGLAAIES 3553
            YQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNMENDEDMKRGLAAIES
Sbjct: 3361 YQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNMENDEDMKRGLAAIES 3420

Query: 3554 ESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSLPGPSCKINRKIALL 3613
            ESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSLPGPSCKINRKIALL
Sbjct: 3421 ESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSLPGPSCKINRKIALL 3480

Query: 3614 GVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASRFVISRSPNNCYGCA 3673
            GVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASRFVISRSPNNCYGCA
Sbjct: 3481 GVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASRFVISRSPNNCYGCA 3540

Query: 3674 TTFVTQCLEILQVLSKHQSSKKQLVSLGILSELFENNIHQGPKTARIQARAVLCSFSEGD 3733
            TTFVTQCLEILQVLS+HQSSKKQLVSLGILSELFENNIHQGPKTARIQARAVLCSFSEGD
Sbjct: 3541 TTFVTQCLEILQVLSRHQSSKKQLVSLGILSELFENNIHQGPKTARIQARAVLCSFSEGD 3600

Query: 3734 VNAVNGLNNLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLADEFWEARLRVVFQLL 3793
            V+AV+GLNNLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLADEFWEARLRVVFQLL
Sbjct: 3601 VHAVSGLNNLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLADEFWEARLRVVFQLL 3660

Query: 3794 FSSIKSGAKHPAIAEHIILPCLRIISQACTPPKSDTVDKEQRMGKLTSVSQNKDENATNI 3853
            FSSIKSGAKHPAIAEHIILPCLRIISQACTPPKSDTVDKEQRMGKLTSVSQNKDENATNI
Sbjct: 3661 FSSIKSGAKHPAIAEHIILPCLRIISQACTPPKSDTVDKEQRMGKLTSVSQNKDENATNI 3720

Query: 3854 SGSFSGPVSGNKSAPESLEHNWDSSHRTQDIQLLSYAEWEKGASYLDFVRRQYKVSQVCK 3913
            SGSFSGPVSGNKSAPESLEHNWDSS RTQDIQLLSYAEWEKGASYLDFVRRQYKVSQV K
Sbjct: 3721 SGSFSGPVSGNKSAPESLEHNWDSSQRTQDIQLLSYAEWEKGASYLDFVRRQYKVSQVFK 3780

Query: 3914 GTVQRSRTQKGDYLSLKYALKWKRFVCRNAKSDLSAFELGSWVTELVLCACSQSIRSEMC 3973
            GTVQRSRT KGDYLSLKYALKWKRFVCRNAKSDLS FELGSWVTELVLCACSQSIRSEMC
Sbjct: 3781 GTVQRSRTHKGDYLSLKYALKWKRFVCRNAKSDLSTFELGSWVTELVLCACSQSIRSEMC 3840

Query: 3974 MLISLLCAQSSSRRFRLLDLLVSLLPATLSAGESAAEYFDLLFKMVDSEDARLFLTVRGC 4033
            MLISLLCAQSSSRRFRLLDLLVSLLPATLSAGESAAEYF+LLFKMVDSEDARLFLTVRGC
Sbjct: 3841 MLISLLCAQSSSRRFRLLDLLVSLLPATLSAGESAAEYFELLFKMVDSEDARLFLTVRGC 3900

Query: 4034 LRTICQLISQEVGNVESLERSLHIDISQGFILHKLIELLGKFLEIPNIRSRYNFFLKHLF 4093
            LRTICQLISQEVGNVESLERSLHIDISQGFILHKLIELLGKFLEIPNIRS          
Sbjct: 3901 LRTICQLISQEVGNVESLERSLHIDISQGFILHKLIELLGKFLEIPNIRS---------- 3960

Query: 4094 FRLLLFKLIQCDLIVRFMRDNLLSEVLEALIVIRGLVVQKTKLISDCNRLLKDLLDSLLL 4153
                           RFMRDNLLSEVLEALIVIRGLVVQKTKLISDCNRLLKDLLDSLLL
Sbjct: 3961 ---------------RFMRDNLLSEVLEALIVIRGLVVQKTKLISDCNRLLKDLLDSLLL 4020

Query: 4154 ESNENKRQFIRACICGLQIHGEERKGRTCLFILEQLCNLISPSKPEPVYLLVLNKAHTQE 4213
            ESNENKRQFIRACICGLQIHGEERKGRTCLFILEQLCNLISPSKPEPVYLLVLNKAHTQE
Sbjct: 4021 ESNENKRQFIRACICGLQIHGEERKGRTCLFILEQLCNLISPSKPEPVYLLVLNKAHTQE 4080

Query: 4214 EFIRGSMTKNPYSSAEIGPLMRDVKNKICHQLDLLGFLEDDYGMELLVAGNIISLDLSIA 4273
            EFIRGSMTKNPYSSAEIGPLMRDVKNKICHQLDLL FLEDDYGMELLVAGNIISLDLSIA
Sbjct: 4081 EFIRGSMTKNPYSSAEIGPLMRDVKNKICHQLDLLSFLEDDYGMELLVAGNIISLDLSIA 4140

Query: 4274 LVYEQVWKKSNQSSNAISNTALISTTAARDSPPMTVTYRLQGLDGEATEPMIKELEEDRE 4333
            LVYEQVWKKSNQSSNAISNTAL+STTAARDSPPMTVTYRLQGLDGEATEPMIKELEEDRE
Sbjct: 4141 LVYEQVWKKSNQSSNAISNTALMSTTAARDSPPMTVTYRLQGLDGEATEPMIKELEEDRE 4200

Query: 4334 ESQDPELEFAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVLNLLMHCCKIRENRR 4393
            ESQDPELEFAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVLNLLMHCCKIRENRR
Sbjct: 4201 ESQDPELEFAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVLNLLMHCCKIRENRR 4260

Query: 4394 ALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESESISIGQSALTVTSEQ 4453
            ALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESESISIGQSALTVTSEQ
Sbjct: 4261 ALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESESISIGQSALTVTSEQ 4320

Query: 4454 TGTGEQAKKIVLMFLERLSHPFGSKKSNKQQRNTEMVARILPYLTYGEPAAMDALIQHFT 4513
            +GTGEQAKKIVLMFLERLSHPFG KKSNKQQRNTEMVARILPYLTYGEPAAMDALIQHFT
Sbjct: 4321 SGTGEQAKKIVLMFLERLSHPFGLKKSNKQQRNTEMVARILPYLTYGEPAAMDALIQHFT 4380

Query: 4514 PYLNDWDEFDRLQKQHEDNPEDKSISEQAAKQRFTVENFVRVSESLKTSSCGERLKDIIL 4573
            PYLN WDEFDRLQKQHEDNPEDKSISEQAAKQRFTVENFVRVSESLKTSSCGERLKDIIL
Sbjct: 4381 PYLNAWDEFDRLQKQHEDNPEDKSISEQAAKQRFTVENFVRVSESLKTSSCGERLKDIIL 4440

Query: 4574 EKGITGLAIKHLRDSFAVAGQTGFRSSVEWAFALKRPSIPLILSMLRGLSMGHLATQRCI 4633
            EKGITGLAIKHLRDSFAVAGQTGFRSSVEWAFALKRPSIPLILSMLRGLSMGHLATQRCI
Sbjct: 4441 EKGITGLAIKHLRDSFAVAGQTGFRSSVEWAFALKRPSIPLILSMLRGLSMGHLATQRCI 4500

Query: 4634 DEGRILPVLHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDKVRMLRHATRDEMRR 4693
            DEGRILPVLHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDKVRMLRHATRDEMRR
Sbjct: 4501 DEGRILPVLHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDKVRMLRHATRDEMRR 4560

Query: 4694 LALKNREDMLQGLGMRQVASDGGERIIVSRPALEGLEDVEEEEDGLACMVCREGYSLRPT 4753
            LALKNREDMLQGLGMRQVASDGGERIIVSRPALEGLEDV EEEDGLACMVCREGYSLRPT
Sbjct: 4561 LALKNREDMLQGLGMRQVASDGGERIIVSRPALEGLEDVLEEEDGLACMVCREGYSLRPT 4620

Query: 4754 DLLGVYSYSKRVNLGVGTSGSTRGECVYTTVSYFNIIHYQCHQEAKRTDAGLKIPKKEWE 4813
            DLLGVYSYSKRVNLGVGTSGSTRGECVYTTVSYFNIIHYQCHQEAKRTDAGLKIPKKEWE
Sbjct: 4621 DLLGVYSYSKRVNLGVGTSGSTRGECVYTTVSYFNIIHYQCHQEAKRTDAGLKIPKKEWE 4680

Query: 4814 GATLRNNESLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADGNRLRLLTYDIVLV 4872
            GATLRNNESLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADGNRLRLLTYDIVL+
Sbjct: 4681 GATLRNNESLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADGNRLRLLTYDIVLM 4714

BLAST of HG10018299 vs. ExPASy Swiss-Prot
Match: Q9SRU2 (Auxin transport protein BIG OS=Arabidopsis thaliana OX=3702 GN=BIG PE=1 SV=2)

HSP 1 Score: 6129.7 bits (15901), Expect = 0.0e+00
Identity = 3189/4894 (65.16%), Postives = 3816/4894 (77.97%), Query Frame = 0

Query: 14   FLDDSSTSANTRKHFSSSDLLQLLRSDDSSIKLGLRQFYSILKAGLRDLG------DGNF 73
            FL D +   +     SS    + LRSDD SIK GLR FY +L+ G+  +G       G  
Sbjct: 11   FLFDDTAFPSLSSSASSDLFSRRLRSDD-SIKRGLRSFYLLLRWGVAPIGGDDADSSGKL 70

Query: 74   AFQSWTDPQIQAVCSIAHAIASASRSL----------TVDQAEAIVVAVIKKSLELVFCY 133
             F++W+D Q+QA+ SI+ AI   SRSL           VDQ E IV+ VI++ +E    +
Sbjct: 71   RFETWSDSQLQALVSISQAILLLSRSLLGTDLTLNQGLVDQLEPIVLGVIQEVMEFSLSF 130

Query: 134  LEKSEFKCDDFSIQNNMLMILETILVDGMDKVSDFAQLCAKKSLMDLLKSTGGDCDATIE 193
            LEKS F+ +D  ++ NM ++LE    DG +K  D     +   + +L  +  G+ D  ++
Sbjct: 131  LEKSSFRQNDLKMEINMEILLEIASFDGSEKQYDILPDFSPAEVAELWPAFSGEHD-NMD 190

Query: 194  FDNTIECGSTGVCCSREEKQVGRLLMTIAAECVQADQLTSESGFSQPTFLEDMNKLIFLC 253
              + ++C   G  CS EEK V RLL+T+ +EC+++D + ++S    P F +D   L    
Sbjct: 191  AQSLVKCTFQGGRCSNEEKPVDRLLITLMSECIESD-VQAQSVVKSP-FQQDCGDLNPFT 250

Query: 254  QHWAVTHLACIQHLILICKELVVLPDALDEKTGSTSFRKRLSCSLRILKLLTDLSKKFPY 313
            +H AV HL C+  LI++CKELV LP+ LDEKT   +   +LS  LRILKLL  LSK    
Sbjct: 251  RHLAVVHLRCVCRLIMVCKELVQLPNMLDEKTVDQAVLDKLSFCLRILKLLGSLSKDVQS 310

Query: 314  IEYDAKMMQAFALFANSLPCLFGLCFEFANSHATVEGSFENTILLLLEEFLELVQVVFRN 373
            IE D  ++QA A F ++ P LF + FEF N H   EG+ E+  L L+E FL LVQ++F  
Sbjct: 311  IENDGSLLQAVASFTDAFPKLFRVFFEFTN-HTATEGNIESLSLALVEGFLNLVQLIFGK 370

Query: 374  SYVSVNIQTCVVASILDNLSSSVWRYDASTANLKTPLVYFPRSVMVIIKLIQDLKGHKYH 433
            S V  N+Q CV ASI+ NL SSVWRYD S+ NL  PL YFPRSV+  +KLIQDLK   YH
Sbjct: 371  SSVFQNVQACVAASIVSNLDSSVWRYDGSSCNLTPPLAYFPRSVIYTLKLIQDLKRQPYH 430

Query: 434  AFSFKDLETHHTSTLADLSVDIPKCYARLEIVPLHKNYKVEEILRMIFPLSKQWMDDLMH 493
                + LE+  T      +VD    + R E +PL K + VE+I+R+IFP S QWMD+  H
Sbjct: 431  IHDLRVLESEVTYEDVSSTVDSVYFHLRQEKIPLLKCFTVEDIMRVIFPSSSQWMDNFFH 490

Query: 494  LLFFLYSEGVRLRPKIERSLSSMKSSSTVEQETAVCHEDEALFGDLFSESGRSVGSVDGY 553
            L++FL+ EGV+LRPK+ER+ SS++S+S  E E+ + H+DEALFG+LFSE  RS+ S++  
Sbjct: 491  LVYFLHREGVKLRPKVERTYSSLRSNSFAEVESQISHDDEALFGNLFSEGSRSLCSIEPN 550

Query: 554  DLQHLAVNSTSSFCNLLLQAAKELLSFIKLCIFSPEWNASVFDDGCNKLNQNHIDILLSL 613
            D   ++V+S     NLLLQAAKELL+F++ CI   EW  S+++DGC KL+  HIDILL++
Sbjct: 551  DQPPVSVSS-----NLLLQAAKELLNFLRACILCQEWVPSIYEDGCKKLDTGHIDILLNI 610

Query: 614  LNCEGCCSDDKSSASCLPAHDERKSGHIHEICYRLLHGLLTRHALPDSLEEYLVKKILNA 673
            +   GC  +DK+S       DE + GH   + + LL  LL   AL D LE YL ++IL  
Sbjct: 611  V---GCSIEDKASDGGCMLQDEGRPGH---VAFELLLNLLRSRALSDFLESYLFQQILVV 670

Query: 674  ENGNFVYNDQTLSLLAHTLFRRTGVAGTLLRTQIYRQFVEFIIEKSKTISSNYSSLQEFM 733
            EN +F YND+TL+LLAHTL  R G+AG  LR +IY  FV F+ E+++ I +   SL+E  
Sbjct: 671  ENSDFNYNDKTLALLAHTLLCRPGLAGAQLRAKIYDGFVSFVTERARGICAEALSLKELT 730

Query: 734  GTLPSVFHIEILLVAFHLSSEGEKREISSLIFSSIRAIDAPSTFSNCTELSMWGLLVSRL 793
              LPS FHIEILL+AFHLS+E EK + S+LI S +  +D P+   +  +LS W +L+SRL
Sbjct: 731  ACLPSAFHIEILLMAFHLSNEAEKAKFSNLIASCLHKVDTPAGICDGPQLSSWAMLISRL 790

Query: 794  IIVLRHIIFHPHTCSSSLLFDFRSKLRDAPAFSSSLPYTLNDHLSSWGASVAKNIIGSSV 853
            +++L H++ HP+TC +SL+ D RSKLR+  +  S+L  T+ DHLSSW + VA+ I  S  
Sbjct: 791  LVLLHHMLLHPNTCPTSLMLDLRSKLREVRSCGSNLHVTVGDHLSSWASLVARGITDSWA 850

Query: 854  ESKPFFHSLINQLIDISSFPASLRQHDLTVECPWFNAGDIFSTFSWILGFWNGKQAVTVE 913
            E +   H L++Q+ID S  P + +    T +    + GD+ ++   +LG W GK+A  VE
Sbjct: 851  EDESVSH-LMSQMIDFSPHPPTFQNDVSTAKTLNLDYGDLSASLCRVLGLWKGKKAGKVE 910

Query: 914  DLIIERYIFVLCWDFPSMN-ALSHGGPLWSDPDTLDISNTTCFFYFSYLLLDHGGVIGEH 973
            DL++ERYIF+L  D   +N AL     L  +   +DISN+      S+LL+    V+G +
Sbjct: 911  DLLVERYIFMLSSDIARINCALDSQPSLHVNYQNVDISNSVDLISTSHLLVGDINVVGRN 970

Query: 974  MKFPQVVIGLLQRLHGG--SILEDFKALGWNFLRNGAWLSLVLSFLSVGIWRYCSKNMIP 1033
            ++   ++IG+L +L      ++ED   LGW+++R GAWLSL+L FL  G+W YC+KN   
Sbjct: 971  IELRNILIGVLNQLQAAPEQVVED---LGWDYIREGAWLSLLLYFLDGGVWDYCNKNSCS 1030

Query: 1034 TVGSLLTDTTVTDNEQANFAESLISSVITDSQVSILIRELSSVLSMYLQVYQKAFVATLS 1093
             +     + T  D +    AE ++S ++    ++ L+R LSS++  YL+VY+KAF+AT S
Sbjct: 1031 EIDPFWKECTSVDAKYVAAAEGVVSYLMKTGDIAELLRMLSSLVGKYLRVYKKAFLATFS 1090

Query: 1094 SSNDHATEFSPLLLFKHSEFDSCVQNKTLENYGTTSCLLESVFNLMSRLDEIVDKRTLGF 1153
              N H      LLL KH++F   +Q +     G  S  L+ +F L S+LD + D R  G 
Sbjct: 1091 DWNHHGHSSPSLLLLKHTQFGKSLQGE-YAKIGDNSLHLQCIFYL-SKLDSLGDGRGSGV 1150

Query: 1154 LSRFCWESMFHGFPSHLETSSGILLSCVLSIGRIISVLAGLLRIVDVKRNIILETEVTRG 1213
            L +  WE M HGFP+ L+TSS ILLSC+LSI  I+  + GLL++ + K    ++T V   
Sbjct: 1151 LWKVFWEFMVHGFPTSLQTSSAILLSCILSIRCIVLTINGLLKLGNSKEKFGVDTSVLHQ 1210

Query: 1214 ILDAVMTIKFDKTFESVHGLCEGIYQSLNKELDGCSYGVLFLLKQLEGYLRHMNMRGASD 1273
            +LD++M IKFD+ FES HG CE I+Q++   L       LFL+K +EG++R ++      
Sbjct: 1211 LLDSIMIIKFDQVFESFHGKCEEIHQNICAVLQLPDLTELFLMKDMEGFVRDISAEQIDR 1270

Query: 1274 STIHELVIVKATDIMDNLRKDVSKSSVFQFYLGAEVVLEQVRELYTFQHGNLLVLLDSLD 1333
            S + E VI K  D+MD+L KD SKS +F+FYLG + V E  RE Y  Q G+L V +DSLD
Sbjct: 1271 SQVLEGVITKIVDVMDSLSKDSSKSDIFKFYLGVDAVSEHTREFYELQRGDLSVFIDSLD 1330

Query: 1334 NCCSELVNLKVLGFFVELLSGEPCPKLKQEVQNKFLSMDLLSLSQWLEKRIFGLVAEDSS 1393
             C  E VN+KVL F V+LLS    P L++ VQ KF+ MDL+SLS WLE+R+ G   E+  
Sbjct: 1331 YCSLEPVNIKVLNFLVDLLSVAQSPDLRRRVQQKFIDMDLISLSGWLERRLLGSFVEEID 1390

Query: 1394 G-GNVKGSSISLRESSMNFVFCLISSPSEPLALQLQSHIFEAALVSLDMAFLRFDISVSK 1453
            G    KG+S+  RE++MNF+ CL+SS ++    +LQ+H+FEA L+SLD AFL FDI +S 
Sbjct: 1391 GKKTAKGNSLPFREAAMNFINCLVSSTNDLQTRELQNHLFEALLISLDTAFLSFDIHMSM 1450

Query: 1454 SYFHFVVQLLKGDKSMKLLLERILILMGKLASDERLLPGLKYLFSFLEMILIESGSGKNV 1513
            SYFHFV+QL + D  MK++L+R ++LM KLA++E+LLPGLK++F  +  +L  S    + 
Sbjct: 1451 SYFHFVLQLAREDNLMKMVLKRTIMLMEKLAAEEKLLPGLKFIFGVIGTLL--SNRSPSH 1510

Query: 1514 FERPSGKPLSRYA-PEVGPLSSKSVGPRKNSETLVLSSNQEEGPASFECDATSAEEDEDD 1573
             E   GK L+ Y     GPL  K  G  K S+TL L  +QE    S ECD TS +EDEDD
Sbjct: 1511 GESLCGKSLASYKNTATGPLVPKLSGTTKKSDTLALPVDQEGSSISLECDVTSVDEDEDD 1570

Query: 1574 GTSDGEVASLDKDEEEDTNSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSV 1633
            GTSDGEVASLDK++EED NSER LASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSV
Sbjct: 1571 GTSDGEVASLDKEDEEDANSERYLASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSV 1630

Query: 1634 CAKVCHRGHRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRKYTGHGSAPVRGASNFQCFLP 1693
            CAKVCHRGHRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRKY G+GSAP RG +NFQ FLP
Sbjct: 1631 CAKVCHRGHRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRKYNGNGSAPARGTNNFQSFLP 1690

Query: 1694 FSEEGDQLPESESDLEDDVSVTDTDKCLRPSVPRELLDGVSVLLEELDVEGRMLELCSCL 1753
             SE+ DQL ES+SD+E+D    +    L   +P+E    +S+LLEEL +E R+LEL S L
Sbjct: 1691 LSEDADQLGESDSDVEEDGFGEENHVVL--YIPKETQYKMSLLLEELGIEDRVLELFSSL 1750

Query: 1754 LPTITNQRDPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKS 1813
            LP+IT++RD  LSK+K++ LGKDKVLS+  DLLQLKKAYK GSLDLKIKA+Y N+K+LKS
Sbjct: 1751 LPSITSKRDSGLSKEKQVNLGKDKVLSFDTDLLQLKKAYKSGSLDLKIKADYTNSKDLKS 1810

Query: 1814 HLASGSLVKSLLSVSIRGRLAVGEGDKVSIFDIRQLIEQTTVAPMTADKTNVKPLSKNVV 1873
             LA+GSLVKSLLSVS+RGRLAVGEGDKV+IFD+ QLI Q T+AP+ ADK NVKPLS+N+V
Sbjct: 1811 LLANGSLVKSLLSVSVRGRLAVGEGDKVAIFDVGQLIGQATIAPINADKANVKPLSRNIV 1870

Query: 1874 RFEIVHLAFNPTVENYLAVAGYEDCQVLTLNHRGEVVDRLAIELALQGAHIKRMEWVPGS 1933
            RFEIVHL+FNP VENYLAVAG EDCQ+LTLNHRGEV+DRLA+ELALQGA I+R++WVPGS
Sbjct: 1871 RFEIVHLSFNPVVENYLAVAGLEDCQILTLNHRGEVIDRLAVELALQGAFIRRIDWVPGS 1930

Query: 1934 QVQLMVVTNRFVKIYDLSLDNISPMHYFTLPDDMVVDATLSTASQGRMFLIVLSENGRIF 1993
            QVQLMVVTN+FVKIYDLS D+ISP  YFTLP+DM+VDATL  AS+GR+FL+VLSE G ++
Sbjct: 1931 QVQLMVVTNKFVKIYDLSQDSISPTQYFTLPNDMIVDATLFVASRGRVFLLVLSEQGNLY 1990

Query: 1994 RLELSVLGNVGATPLKEIIEIQGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDA 2053
            R ELS  GN GATPLKEI++I G++++ KG S+YFS  Y+LLF++Y DG++ +G+LS DA
Sbjct: 1991 RFELSWGGNAGATPLKEIVQIMGKDVTGKGSSVYFSPTYRLLFISYHDGSSFMGRLSSDA 2050

Query: 2054 TKLTEISVIYEEEQDRKLRPAGLHRWKELFAGSGLFVCFSSVKSNSALAVSMGAHDIYAQ 2113
            T LT+ S ++EEE D K R AGLHRWKEL AGSGLF+CFSSVKSN+ LAVS+    + AQ
Sbjct: 2051 TSLTDTSGMFEEESDCKQRVAGLHRWKELLAGSGLFICFSSVKSNAVLAVSLRGDGVCAQ 2110

Query: 2114 NLRHAGGSSLPLVGITAYKPLSKDKIHCLVLHDDGSLQIYTHTAVGVDASAYATAEKIKK 2173
            NLRH  GSS P+VGITAYKPLSKD +HCLVLHDDGSLQIY+H   GVD  +  TAEK+KK
Sbjct: 2111 NLRHPTGSSSPMVGITAYKPLSKDNVHCLVLHDDGSLQIYSHVRSGVDTDSNFTAEKVKK 2170

Query: 2174 LGSGILNNKVYASTNPEFPLDFFENTVCITADVRLGGDAIRNGDSEGAKQSLASEDGFLE 2233
            LGS ILNNK YA   PEFPLDFFE   CITADVRLG DAIRNGDSEGAKQSLASEDGF+E
Sbjct: 2171 LGSKILNNKTYAGAKPEFPLDFFERAFCITADVRLGSDAIRNGDSEGAKQSLASEDGFIE 2230

Query: 2234 SPSSSGFKITVSNSNPDIVMVGFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPF 2293
            SPS  GFKI+VSN NPDIVMVG R+HVG TSA+ IPSE+TIFQR IK+DEGMR WYDIPF
Sbjct: 2231 SPSPVGFKISVSNPNPDIVMVGIRMHVGTTSASSIPSEVTIFQRSIKMDEGMRCWYDIPF 2290

Query: 2294 TVAESLLADEEFSVTVGPAFNGTALPRIDSLEVYGRAKDEFGWKEKLDAVLDMEARALGS 2353
            TVAESLLADE+  ++VGP  +GTALPRIDSLEVYGRAKDEFGWKEK+DAVLDMEAR LG 
Sbjct: 2291 TVAESLLADEDVVISVGPTTSGTALPRIDSLEVYGRAKDEFGWKEKMDAVLDMEARVLGH 2350

Query: 2354 NSLLARSGKKRRSIQCAPIQQQVLADGLKVLSSYYLLRRSQGCPKLNDVNQELTKLKCKQ 2413
              LL  S KKR   Q A +++QV+ADGLK+LS YY + R    P+   V   L++LKCKQ
Sbjct: 2351 GLLLPGSSKKRALAQSASMEEQVIADGLKLLSIYYSVCR----PRQEVV---LSELKCKQ 2410

Query: 2414 LLETIYESDREPLLQSAACRVLQAIFPKKEIYYQ-------VKDTMRLTGVVKSTSVLSS 2473
            LLETI+ESDRE LLQ+ ACRVLQ++FP+KEIYYQ       VKDTMRL GVVK TS+LSS
Sbjct: 2411 LLETIFESDRETLLQTTACRVLQSVFPRKEIYYQVMFLPNSVKDTMRLLGVVKVTSILSS 2470

Query: 2474 RLGVGGAAGGWIIEEFTSQMRAVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLE 2533
            RLG+ G  GG I+EEF +QMRAVSK+AL R+SN + FLE NGS+VVD LMQ+LWGIL+ E
Sbjct: 2471 RLGILG-TGGSIVEEFNAQMRAVSKVALTRKSNFSVFLEMNGSEVVDNLMQVLWGILESE 2530

Query: 2534 QPNTQTLNNIVISSVELIYCYAECLALHGPDTGRHSVAPAVVLFKKLLFSSSEAVQASSS 2593
              +T T+NN+V+SSVELIY YAECLA  G DTG HSVAPAV L K L+   +E+VQ SS 
Sbjct: 2531 PLDTPTMNNVVMSSVELIYSYAECLASQGKDTGVHSVAPAVQLLKALMLFPNESVQTSSR 2590

Query: 2594 ----LAISSRLLQVPFPKQTMLATDDGADIPLSAPVPTETTGTNPQVMIEEDAVASSVQY 2653
                LAISSRLLQVPFPKQTML TDD  D   +  VP  T G N  VMIEED++ SSVQY
Sbjct: 2591 CVLVLAISSRLLQVPFPKQTMLTTDDLVDNVTTPSVPIRTAGGNTHVMIEEDSITSSVQY 2650

Query: 2654 CCDGCSTVPILRRRWHCTICPDFDLCESCYEVLDADRLPSPHSRDHPMTAIPIEVDSLG- 2713
            CCDGCSTVPILRRRWHCT+CPDFDLCE+CYEVLDADRLP PH+RDHPMTAIPIEV+SLG 
Sbjct: 2651 CCDGCSTVPILRRRWHCTVCPDFDLCEACYEVLDADRLPPPHTRDHPMTAIPIEVESLGA 2710

Query: 2714 DGNEYHFATEDINDSSLTSLIPDISVKNPVSSIHVLEPADSGDFSASVTDPVSISASKQT 2773
            D NE  F+ +++  S++  ++     +    SIHVLEP +S +FSAS+TDP+SISASK+ 
Sbjct: 2711 DTNEIQFSADEVGISNMLPVVTSSIPQASTPSIHVLEPGESAEFSASLTDPISISASKRA 2770

Query: 2774 VNSLLLSELLEQLKGWMETTSGVQAVPVMQLFYRLSSTMGGPFMNSLKSENLNLERLIKW 2833
            VNSL+LSE L++L GWMET SGVQA+PVMQLFYRLSS +GG FM+S K E ++L++LIKW
Sbjct: 2771 VNSLILSEFLQELSGWMETVSGVQAIPVMQLFYRLSSAIGGAFMDSSKPEEISLDKLIKW 2830

Query: 2834 FLDEINLNKPFEAKTRTSFGEVAILVFMFFTLMLRNWHQPGSDGPGAKPSTTTDTHDKNS 2893
             L EINL+KPF A TR+S GE+ ILVFMFFTLMLR+WHQPGSDG  +K   +TD HD+  
Sbjct: 2831 LLGEINLSKPFAASTRSSLGEIVILVFMFFTLMLRSWHQPGSDGSSSKLGGSTDVHDRRI 2890

Query: 2894 TQVAPSTSVTAQSSMDDQGKNDFTSQLLRACSSIRQQSFVNYLMDVLQQLVHVFKSSTID 2953
             Q   ST V  QSS+  Q ++DF SQL+RACS +R Q FVNYLM++LQQLVHVFKS   +
Sbjct: 2891 VQ--SSTVVATQSSLHVQERDDFASQLVRACSCLRNQEFVNYLMNILQQLVHVFKSRAAN 2950

Query: 2954 YDSGHGFHNGSGCGALLTVRKDLPAGNFSPFFSDSYAKAHRTDLFIDYHRLLLENAFRLV 3013
             ++  G  +GSGCGA+LTVR+DLPAGN+SPFFSDSYAKAHR D+F+DYHRLLLEN FRLV
Sbjct: 2951 VEA-RGSSSGSGCGAMLTVRRDLPAGNYSPFFSDSYAKAHRADIFVDYHRLLLENVFRLV 3010

Query: 3014 YTLVRPEKYDKTLEKEKVYKIYSSKDLKLDAYQDVLCSYINNPNTSFVRRYARRLFLHIC 3073
            YTLVRPEK +K  EKEKVY+  SSKDLKLD +QDVLCSYINNP+T+FVRRYARRLFLH+C
Sbjct: 3011 YTLVRPEKQEKMGEKEKVYRNASSKDLKLDGFQDVLCSYINNPHTAFVRRYARRLFLHLC 3070

Query: 3074 GSKSHYYSIRDSWQFSTEVKRLFKYINKVGGFQNPMSYERSVKIVKCLTTMAEVAAARPR 3133
            GSK+ YYS+RDSWQFS EVK L+K++ K GGF+N +SYERSVKIVK L+T+AEVA ARPR
Sbjct: 3071 GSKTQYYSVRDSWQFSNEVKNLYKHVEKSGGFENNVSYERSVKIVKSLSTIAEVAVARPR 3130

Query: 3134 NWQKYCLRHADVLPFLLNGIFYFGEESVVQTLKLLNLAFYTGKDIGHSVQKSEAGDTGTS 3193
            NWQKYCLRH D L FLLNG+F+F EESV+QTLKLLNLAFY GKD+  SVQK+EA +  T 
Sbjct: 3131 NWQKYCLRHGDFLSFLLNGVFHFAEESVIQTLKLLNLAFYQGKDVSSSVQKAEATEVVTG 3190

Query: 3194 TNKSGTQTVDSRKKRKGEDGNDSALEKSYLDMEIMVNIFVDKGSNVLSHFIDCFLLEWNS 3253
            +N+SG+Q+VDS+KK+KGEDG+DS LEK Y+DME +V+IF     ++L  FID FLLEWNS
Sbjct: 3191 SNRSGSQSVDSKKKKKGEDGHDSGLEKLYVDMEGVVDIFSANCGDLLRQFIDFFLLEWNS 3250

Query: 3254 SSVRAETKGVVCGIWHHGKQTFKETLLMALLQKVKTLPMYGLNIAEYTELVTWLLGKVPD 3313
            SSVR E K V+ G+WHHG+ +FKE+LL ALLQKV+ LP YG NI EYTELV+ LL K P+
Sbjct: 3251 SSVRTEAKSVIYGLWHHGRHSFKESLLAALLQKVRYLPAYGQNIVEYTELVSLLLDKAPE 3310

Query: 3314 VGSKQQSSELLDRCLTSDVIRSIYQTLHSQNELLANHPNSRIYNTLSGLVEFDGYYLESE 3373
              SKQ  +EL+DRCL  DVIR  ++TLHSQNEL+ANHPNSRIY+TL  LVEFDGYYLESE
Sbjct: 3311 NNSKQAINELVDRCLNPDVIRCFFETLHSQNELIANHPNSRIYSTLGNLVEFDGYYLESE 3370

Query: 3374 PCAACSSPEVPYSRMKLESLKSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVL 3433
            PC ACSSP+VPYS+MKLESLKSETKFTDNRIIVKCTGSYTIQ+V MNVHDARKSKSVKVL
Sbjct: 3371 PCVACSSPDVPYSKMKLESLKSETKFTDNRIIVKCTGSYTIQSVTMNVHDARKSKSVKVL 3430

Query: 3434 NLYYNNRPVADLSELKNNWSLWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYEN 3493
            NLYYNNRPV+DLSELKNNWSLWKRAKSCHL+FNQTELKVEFPIPITACNFMIELDSFYEN
Sbjct: 3431 NLYYNNRPVSDLSELKNNWSLWKRAKSCHLSFNQTELKVEFPIPITACNFMIELDSFYEN 3490

Query: 3494 LQALSLEPLQCPRCSRPVTDKHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYG 3553
            LQALSLEPLQCPRCSRPVTDKHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYG
Sbjct: 3491 LQALSLEPLQCPRCSRPVTDKHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYG 3550

Query: 3554 RFEFNFMAKPSFTFDNMENDEDMKRGLAAIESESENAHRRYQQLLGYKKPLLKIVSSIGE 3613
            RFEFNFMAKPSF FDNMENDEDMK+GLAAIESESENAH+RYQQLLG+KKPLLKIVSSIGE
Sbjct: 3551 RFEFNFMAKPSFIFDNMENDEDMKKGLAAIESESENAHKRYQQLLGFKKPLLKIVSSIGE 3610

Query: 3614 NEMDSQQKDSVQQMMVSLPGPSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRV 3673
             EMDSQ KD+VQQMM SLPGPSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRV
Sbjct: 3611 TEMDSQHKDTVQQMMASLPGPSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRV 3670

Query: 3674 LMTYLHQKHTDDGFPASRFVISRSPNNCYGCATTFVTQCLEILQVLSKHQSSKKQLVSLG 3733
            LM+YLHQK+++    ASR V+S++PNNCYGCATTFVTQCLEILQVLSKH  S+KQLV+ G
Sbjct: 3671 LMSYLHQKNSNFSSGASRCVVSKTPNNCYGCATTFVTQCLEILQVLSKHPRSRKQLVAAG 3730

Query: 3734 ILSELFENNIHQGPKTARIQARAVLCSFSEGDVNAVNGLNNLIQKKVMYCLEHHRSMDIA 3793
            ILSELFENNIHQGPKTAR QARA L +FSEGD++AVN LNNL+QKK+MYCLEHHRSMDIA
Sbjct: 3731 ILSELFENNIHQGPKTARAQARAALSTFSEGDLSAVNELNNLVQKKIMYCLEHHRSMDIA 3790

Query: 3794 LATREELSLLSEVCSLADEFWEARLRVVFQLLFSSIKSGAKHPAIAEHIILPCLRIISQA 3853
            LATREE+ LLSEVCSL DEFWE+RLR+VFQLLFSSIK GAKHPAI+EHIILPCL+IIS A
Sbjct: 3791 LATREEMLLLSEVCSLTDEFWESRLRLVFQLLFSSIKLGAKHPAISEHIILPCLKIISVA 3850

Query: 3854 CTPPKSDTVDKEQRMGKLTSVSQNKDENATNISGSFSGPVSGNKSAPESLEHNWDSSHRT 3913
            CTPPK DT +KEQ MGK     Q KDENA  +           K + ES E+N + S +T
Sbjct: 3851 CTPPKPDTAEKEQTMGKSAPAVQEKDENAAGVI----------KYSSESEENNLNVSQKT 3910

Query: 3914 QDIQLLSYAEWEKGASYLDFVRRQYKVSQVCKGTVQRSRTQKGDYLSLKYALKWKRFVCR 3973
            +DIQL+SY EWEKGASYLDFVRRQYK SQ  +G  Q+SRT + D+L+LKY L+WKR   R
Sbjct: 3911 RDIQLVSYLEWEKGASYLDFVRRQYKASQSIRGASQKSRTHRSDFLALKYTLRWKRRSSR 3970

Query: 3974 NAKSDLSAFELGSWVTELVLCACSQSIRSEMCMLISLLCAQSSSRRFRLLDLLVSLLPAT 4033
             +K  L AFELGSWVTEL+L ACSQSIRSEMC LISLL AQSS RR+RL++LL+ LLPAT
Sbjct: 3971 TSKGGLQAFELGSWVTELILSACSQSIRSEMCTLISLLAAQSSPRRYRLINLLIGLLPAT 4030

Query: 4034 LSAGESAAEYFDLLFKMVDSEDARLFLTVRGCLRTICQLISQEVGNVESLERSLHIDISQ 4093
            L+AGES+AEYF+LLFKM++++DA LFLTVRGCL TIC+LISQEVGN+ESLERSL IDISQ
Sbjct: 4031 LAAGESSAEYFELLFKMIETQDALLFLTVRGCLTTICKLISQEVGNIESLERSLQIDISQ 4090

Query: 4094 GFILHKLIELLGKFLEIPNIRSRYNFFLKHLFFRLLLFKLIQCDLIVRFMRDNLLSEVLE 4153
            GF LHKL+ELLGKFLE+PNIRS                         RFMRDNLLS VLE
Sbjct: 4091 GFTLHKLLELLGKFLEVPNIRS-------------------------RFMRDNLLSHVLE 4150

Query: 4154 ALIVIRGLVVQKTKLISDCNRLLKDLLDSLLLESNENKRQFIRACICGLQIHGEERKGRT 4213
            ALIVIRGL+VQKTKLI+DCNR LKDLLD LLLES+ENKRQFIRAC+ GLQ H EE KGRT
Sbjct: 4151 ALIVIRGLIVQKTKLINDCNRRLKDLLDGLLLESSENKRQFIRACVSGLQTHAEENKGRT 4210

Query: 4214 CLFILEQLCNLISPSKPEPVYLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKI 4273
            CLFILEQLCNLI PSKPE VY+L+LNK+HTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKI
Sbjct: 4211 CLFILEQLCNLICPSKPEAVYMLILNKSHTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKI 4270

Query: 4274 CHQLDLLGFLEDDYGMELLVAGNIISLDLSIALVYEQVWKKSNQSSNAISNTALISTTAA 4333
            C QLDLLG LEDDYGMELLVAGNIISLDLSIA VYE VWKKSNQSS +++N+AL+++ AA
Sbjct: 4271 CQQLDLLGLLEDDYGMELLVAGNIISLDLSIAQVYELVWKKSNQSSTSLTNSALLASNAA 4330

Query: 4334 --RDSPPMTVTYRLQGLDGEATEPMIKELEEDREESQDPELEFAIAGAVREYGGLEILLG 4393
              RD PPMTVTYRLQGLDGEATEPMIKELEEDREESQDPE+EFAIAGAVREYGGLEILL 
Sbjct: 4331 PSRDCPPMTVTYRLQGLDGEATEPMIKELEEDREESQDPEIEFAIAGAVREYGGLEILLD 4390

Query: 4394 MIQRIWDNFKSNQEQLVAVLNLLMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMES 4453
            MI+ + D+FKSNQE++VAVL+LL HCCKIRENRRALLRLGAL LLLETARRAFSVDAME 
Sbjct: 4391 MIKSLQDDFKSNQEEMVAVLDLLNHCCKIRENRRALLRLGALSLLLETARRAFSVDAMEP 4450

Query: 4454 AEGILLIVESLTIEANESESISIGQSALTVTSEQTGTGEQAKKIVLMFLERLSHPFGSKK 4513
            AEGILLIVESLT+EANES+SIS  QSALTV++E+TGT EQAKKIVLMFLERLSHP G KK
Sbjct: 4451 AEGILLIVESLTLEANESDSISAAQSALTVSNEETGTWEQAKKIVLMFLERLSHPSGLKK 4510

Query: 4514 SNKQQRNTEMVARILPYLTYGEPAAMDALIQHFTPYLNDWDEFDRLQKQHEDNPEDKSIS 4573
            SNKQQRNTEMVARILPYLTYGEPAAM+ALI+HF+PYL +W EFD+LQ++HE++P+D SI+
Sbjct: 4511 SNKQQRNTEMVARILPYLTYGEPAAMEALIEHFSPYLQNWSEFDQLQQRHEEDPKDDSIA 4570

Query: 4574 EQAAKQRFTVENFVRVSESLKTSSCGERLKDIILEKGITGLAIKHLRDSFAVAGQTGFRS 4633
            +QAAKQRFTVENFVRVSESLKTSSCGERLKDI+LE GI  +A+KH+++ FA+ GQTGF+S
Sbjct: 4571 QQAAKQRFTVENFVRVSESLKTSSCGERLKDIVLENGIIAVAVKHIKEIFAITGQTGFKS 4630

Query: 4634 SVEWAFALKRPSIPLILSMLRGLSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAE 4693
            S EW  ALK PS+PLILSMLRGLSMGHL TQ CIDEG IL +LHALE V GEN+IGARAE
Sbjct: 4631 SKEWLLALKLPSVPLILSMLRGLSMGHLPTQTCIDEGGILTLLHALEGVSGENDIGARAE 4690

Query: 4694 NLLDTLSNKEGNGDGFLEDKVRMLRHATRDEMRRLALKNREDMLQGLGMRQ-VASDGGER 4753
            NLLDTL++KEG GDGFL +KVR LR AT+DEMRR AL+ RE++LQGLGMRQ ++SDGGER
Sbjct: 4691 NLLDTLADKEGKGDGFLGEKVRALRDATKDEMRRRALRKREELLQGLGMRQELSSDGGER 4750

Query: 4754 IIVSRPALEGLEDVEEEEDGLACMVCREGYSLRPTDLLGVYSYSKRVNLGVGTSGSTRGE 4813
            I+VS+P LEG EDVEEEEDGLACMVCREGY LRP+DLLGVYSYSKRVNLGVG SGS RGE
Sbjct: 4751 IVVSQPILEGFEDVEEEEDGLACMVCREGYKLRPSDLLGVYSYSKRVNLGVGNSGSARGE 4810

Query: 4814 CVYTTVSYFNIIHYQCHQEAKRTDAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVPLA 4872
            CVYTTVSYFNIIH+QCHQEAKR DA LK PKKEWEGA LRNNESLCNSLFPV+GPSVPLA
Sbjct: 4811 CVYTTVSYFNIIHFQCHQEAKRADAALKNPKKEWEGAMLRNNESLCNSLFPVKGPSVPLA 4832

BLAST of HG10018299 vs. ExPASy Swiss-Prot
Match: B9G2A8 (Auxin transport protein BIG OS=Oryza sativa subsp. japonica OX=39947 GN=Os09g0247700 PE=2 SV=1)

HSP 1 Score: 5394.7 bits (13993), Expect = 0.0e+00
Identity = 2851/4810 (59.27%), Postives = 3549/4810 (73.78%), Query Frame = 0

Query: 94   LTVDQAEAIVVAVIKKSLELVFCYLEKSEFKCDDFSIQNNMLMILETILVDGMDKVSDFA 153
            L+V+Q E+ VV ++++SLE    YLEKS + C+D+ + N +   +E +L+ G        
Sbjct: 29   LSVEQVESTVVEIVERSLEFCLLYLEKSSYACEDYGLLNEVAYFMECVLLRGTPSKVYSL 88

Query: 154  QLCAKKSLMDLLKSTGGDCDATIEFDNTIECGSTGVCCSREEKQVGRLLMTIAAECVQAD 213
            +      +++   S   D +  I       C   G  CS     + R  +T++ EC+Q D
Sbjct: 89   EPSVVNDVIEQWSSVQVDSE-RISPQEKYFCYLKGFNCSNSGDDLQRFRLTLSPECLQQD 148

Query: 214  QLTSESGFSQPTFLEDMNKLIFLCQHWAVTHLACIQHLILICKELVVLPDALDEKTGSTS 273
             + +E+  S  T     N ++ + QH+AV HL CI  L+ + ++L   P ALD     T+
Sbjct: 149  YVIAENTESSHT--ASPNGMVSIAQHFAVVHLHCIPVLLTLVQKLCQSP-ALD-VIEDTN 208

Query: 274  FRKRLSCSLRILKLLTDLSKKFPYIEYDAKMMQAFALFANSLPCLFGLCFEFANSHATVE 333
            F  RLS   RILKL+  L+ +FP    DA M+ + A   +SLP LF L F+FAN      
Sbjct: 209  FNMRLSFGQRILKLVHGLAMEFPCDASDAMMLCSVARCTDSLPVLFKLKFKFANHDRVFS 268

Query: 334  GSFENTILL-LLEEFLELVQVVFRNSYVSVNIQTCVVASILDNLSSSVWRYDASTANLKT 393
            G    T+LL +L+EFL+L+ ++F NS +   +Q C++AS+L+  S   W+YD S A L  
Sbjct: 269  GDGVGTVLLQILDEFLQLIHIIFCNSDICCTVQVCILASLLEIFSPEKWKYDRSAACLMP 328

Query: 394  PLVYFPRSVMVIIKLIQDLKGHKYHAFSFKDLETHHTSTL---ADLSVDIPKCYARLEIV 453
            PLVY P  V  ++KL+ D K       S  D +      L    +   D   C+AR + V
Sbjct: 329  PLVYSPHIVQYVLKLLNDTK----RWTSRVDRDRPGKDVLGYSCNSETDGLSCHARSKKV 388

Query: 454  PLHKNYKVEEILRMIFPLSKQWMDDLMHLLFFLYSEGVRLRPKIER-SLSSMKSSSTVEQ 513
            PL K Y  EE L++IFP  +QW+DDL+HL+FFL+ EGV+  P +E+  +S  K  +  E 
Sbjct: 389  PLLKKYTSEEYLQLIFPSEEQWLDDLVHLIFFLHEEGVKSMPLLEKPQMSCTKQVTLSEL 448

Query: 514  ETAVCHEDEALFGDLFSESGRSVGSVDGYDLQHLAVNSTSSFCNLLLQAAKELLSFIKLC 573
            E+   HE+EALFG+LF+E+ RS G  D  +      +  SS  +  +Q A +L+ F+K+ 
Sbjct: 449  ESVASHEEEALFGNLFAEA-RSTGVADSVEQPISLGSGPSSSQHGPIQLAADLICFMKMS 508

Query: 574  IFSPEWNASVFDDGCNKLNQNHIDILLSLLNCEGCCSDDKSSASCLPAHDERKSGHIHEI 633
            IFSPEW  +++ D C K + NH++  LS+L C   CSD+  + + L    E  S HI+  
Sbjct: 509  IFSPEWCTAIYVDACRKFHSNHLEQFLSILQCPAFCSDESIATTSL---SEVNSLHINTA 568

Query: 634  CYRLLHGLLTRHALPDSLEEYLVKKILNAENGNFVYNDQTLSLLAHTLFRRTGVAGTLLR 693
            C+ LL   L  H  P SL E LV K+ NAENG + YN+ TL+L+A  +     ++G    
Sbjct: 569  CFELLQMFLISHECPASLREDLVDKVFNAENGMYTYNNYTLALVARAI-----ISGA--- 628

Query: 694  TQIYRQFVEFIIEKSKTISSNYSSLQEFMGTLPSVFHIEILLVAFHLSSEGEKREISSLI 753
            + IY    +  +++++T                                           
Sbjct: 629  SSIYNLGRKVFVQENET------------------------------------------- 688

Query: 754  FSSIRAIDAPSTFSNCTELSMWGLLVSRLIIVLRHIIFHPHTCSSSLLFDFRSKLRDAPA 813
                      ST+S C      G    +    L   +      S  + ++ + +      
Sbjct: 689  ---------ASTWSKCIWTHKMGFTPVKTPACLAAYVVVSGNTSVMVAYEVKIQNEGDIL 748

Query: 814  FSSSLPYTLNDHLSSWGASVAKNIIGSSVESKPFFHSLINQLIDISSFPASLRQHDLTVE 873
                   ++ND+L S+ A V + I   +V+      SL  QLID++   A +      +E
Sbjct: 749  LKEGQSRSMNDYLPSFTAEVVEGIFADTVKEYASTSSLFPQLIDVTPAHAEIYFDKSALE 808

Query: 874  CPWFNAGDIFSTFSWILGFWNGKQAVTVEDLIIERYIFVLCWDFPSMNALSHGGPLWSDP 933
                N  ++ S  S ILG W G++A   EDLI ERY+F++CW   S    S G     +P
Sbjct: 809  ALGLNFANLGSNISEILGVWKGRKAEVAEDLIAERYLFLICWSTLSGIGYSGGYEGLLNP 868

Query: 934  DTLDISNTTCFFYFSYLLL---DHGGVIGEHMKFPQVVIGLLQRLH-----GGSILEDFK 993
            D  D++     F+ S+ L    D   ++  ++  P V+ G L+ L      G S+LE   
Sbjct: 869  DFADVN-----FFISFALSVSDDASSLLDANL--PSVIFGFLKLLQSEILCGPSVLE--- 928

Query: 994  ALGWNFLRNGAWLSLVLSFLSVGIWRYCSKNMIPTVGSLLTDTTVTDNEQANFAESLISS 1053
               W+FLR GAWLSL+LS ++ G W + +    P V  L     V D E   F +SL++ 
Sbjct: 929  --SWDFLRKGAWLSLILSLINTGFWGHQTSGK-PDV-DLQGKQVVQDAE--IFGKSLLTF 988

Query: 1054 VITDSQVSILIRELSSVLSMYLQVYQKAFVATLSSS----NDHATEFSPLLLFKHSEFDS 1113
            +  +S     +  LSS+L  YL  +++AF++ +        DH     P  L KHS FD 
Sbjct: 989  ISENS--GHCLHVLSSLLETYLHAFKEAFISFVVEKGRVCEDHC---YPSWLLKHSAFDK 1048

Query: 1114 CVQNKTLENYGTTSCLLESVFNLMSRLDEIVDKRTLGFLSRFCWESMFHGFPSHLETSSG 1173
                   E  G+   +LE + +L SR+D +  K   G    F  + + HGFP +  +++ 
Sbjct: 1049 SKHPLLFEKVGSNIGMLEPICDLSSRIDRVATKLGDGRKEYFLLKCLLHGFPVNSASNNS 1108

Query: 1174 ILLSCVLSIGRIISVLAGLLRIVDVKRNIILETEVTRGILDAVMTIKFDKTFESVHGLCE 1233
             +LSCVL I  II +L G ++I+      +++  V   +L  +MTIK D  F S+H LC+
Sbjct: 1109 AILSCVLVINEIIYMLNGCIKIMQPNDRDLVDVGVISKLLSMIMTIKSDGMFTSIHKLCD 1168

Query: 1234 GIYQSL-NKELDGCSYGVLFLLKQLEGYLRHMNMRGASDSTIHELVIVKATDIMDNLRKD 1293
             I+ SL +++ D   Y  LF+LKQLEGYL  +N +   D+ + E+++    D++++LR  
Sbjct: 1169 SIFMSLIDQKDDLAGYSDLFVLKQLEGYLADINSKEIMDNEVKEIIVFTIVDLVEDLR-- 1228

Query: 1294 VSKSSVFQFYLG-AEVVLEQVRELYTFQHGNLLVLLDSLDNCCSELVNLKVLGFFVELLS 1353
             SK++VF+F+LG AE   E+   L+  +  ++ V +D LD C SE VNLK+L  F ++L 
Sbjct: 1229 -SKTNVFKFFLGEAEGAPERANSLFALEQADMSVFIDVLDKCQSEQVNLKILNLFTDILG 1288

Query: 1354 GEPCPKLKQEVQNKFLSMDLLSLSQWLEKRIFGLVAEDSSGGNVKGSSISLRESSMNFVF 1413
               CP LKQ++Q+KF+ MD+   S WLE R  G   +  S  +      +LRE +M+F+ 
Sbjct: 1289 DGLCPDLKQKLQHKFIGMDVSCFSSWLEFRTLGHSMKIESTNSTTSGPTALRELTMDFLM 1348

Query: 1414 CLISSPSEPLALQLQSHIFEAALVSLDMAFLRFDISVSKSYFHFVVQLLKGDKSMKLLLE 1473
             L    SE LA +LQ H+F++ L+ LD AF+  D+ + K++FHF+ QL   +   K L E
Sbjct: 1349 RLTCPSSETLAKELQHHLFDSMLLLLDKAFMSCDLQIVKAHFHFIAQLSTDESHFKELFE 1408

Query: 1474 RILILMGKLASDERLLPGLKYLFSFLEMILIESGSGKNVFERPSGKPLSRYAPEVGPLSS 1533
            + L LM  +  +E LL  LK+LF+ +E +  ++GS ++  +R S K             S
Sbjct: 1409 KTLKLMENMVGNEGLLHTLKFLFTCVESVFGDAGSNRSALKRLSSKSSG------NSFGS 1468

Query: 1534 KSVGPR--KNSETLVLSSNQEEGPASFECDATSAEEDEDDGTSDGEVASLDKDEEEDTNS 1593
             S+ P+  KNS++LVL +NQE   ++ +CDA+S EEDEDDGTSDGE+ S+D+DEEED NS
Sbjct: 1469 GSLIPKQLKNSDSLVLRTNQESN-STVDCDASSGEEDEDDGTSDGELVSIDRDEEEDGNS 1528

Query: 1594 ERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRGHRVVYSRSSRFF 1653
            ERALA+KVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCH+GHRVVYSRSSRFF
Sbjct: 1529 ERALATKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHQGHRVVYSRSSRFF 1588

Query: 1654 CDCGAGGVRGSSCQCLKPRKYTGHGSAPVRGASNFQCFLPFSEEGDQLPESESDLEDDVS 1713
            CDCGAGGVRGSSCQCLKPRK+TG  S      S+FQ  LP+ E+ + + +S SD EDD+S
Sbjct: 1589 CDCGAGGVRGSSCQCLKPRKFTGTSSVSPPVTSSFQPILPYHEDVEPVADSGSDFEDDIS 1648

Query: 1714 VTDTDKCLRPSVPRELLDGVSVLLEELDVEGRMLELCSCLLPTITNQRDPDLSKDKKIIL 1773
             T+ + C++ SVP+   D + V L+ LDVE RMLELC  LLP I +QR+ +L KD+K+ L
Sbjct: 1649 -TEAENCIKLSVPKGFSDELPVFLKNLDVEVRMLELCKKLLPMILSQRELNLLKDRKVFL 1708

Query: 1774 GKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSLVKSLLSVSIRGRL 1833
            G +  +S   D+ QLKKA+K GSLDLKIKA+Y N++ELKSHLA+GSL KSLLS+SIRG+L
Sbjct: 1709 GGEMPMSQASDIFQLKKAFKSGSLDLKIKADYPNSRELKSHLANGSLTKSLLSISIRGKL 1768

Query: 1834 AVGEGDKVSIFDIRQLIEQTTVAPMTADKTNVKPLSKNVVRFEIVHLAFNPTVENYLAVA 1893
            AVGEGDKV+IFD+ Q+I Q T AP+TADKTNVKPLS+N+VRFEIVHL FNP VE+YL+VA
Sbjct: 1769 AVGEGDKVAIFDVGQIIGQPTAAPITADKTNVKPLSRNIVRFEIVHLIFNPLVEHYLSVA 1828

Query: 1894 GYEDCQVLTLNHRGEVVDRLAIELALQGAHIKRMEWVPGSQVQLMVVTNRFVKIYDLSLD 1953
            GYEDCQVLTLN RGEV DRLAIELALQGA+I+ +EWVPGSQVQLMVVTN+FVKIYDLS D
Sbjct: 1829 GYEDCQVLTLNSRGEVTDRLAIELALQGAYIRCVEWVPGSQVQLMVVTNKFVKIYDLSQD 1888

Query: 1954 NISPMHYFTLPDDMVVDATLSTASQGRMFLIVLSENGRIFRLELSVLGNVGATPLKEIIE 2013
            NISP+HYFT+ DD++VDATL  +S G++ L+VLSE G ++RL +++ G+VGA  L + + 
Sbjct: 1889 NISPLHYFTVADDIIVDATLVPSSMGKLVLLVLSEGGLLYRLNVALAGDVGAKTLTDTVL 1948

Query: 2014 IQGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTEISVIYEEEQDRKLRP 2073
            ++      KGLSLYFSS Y+LLF+++ DGTT +G+L  D++ +TE+S I E +QD K +P
Sbjct: 1949 VKDAVSMHKGLSLYFSSTYRLLFVSHQDGTTYMGRLDGDSSSITELSYICENDQDGKSKP 2008

Query: 2074 AGLHRWKELFAGSGLFVCFSSVKSNSALAVSMGAHDIYAQNLRHAGGSSLPLVGITAYKP 2133
            AGL+RW+EL AGSG   C S  KSNS LAVS+G H+++A N+RHA GS+ P+VGI AYKP
Sbjct: 2009 AGLYRWRELIAGSGALACLSKFKSNSPLAVSLGPHELFAHNMRHASGSNAPVVGIAAYKP 2068

Query: 2134 LSKDKIHCLVLHDDGSLQIYTHTAVGVDASAYATAEKIKKLGSGILNNKVYASTNPEFPL 2193
            LSKDK HCL+L+DDGSL IY+HT  G D+S   TAE+ KKLGS IL+++ YA T PEFPL
Sbjct: 2069 LSKDKAHCLLLYDDGSLNIYSHTPNGSDSSTTLTAEQTKKLGSSILSSRAYAGTKPEFPL 2128

Query: 2194 DFFENTVCITADVRLGGDAIRNGDSEGAKQSLASEDGFLESPSSSGFKITVSNSNPDIVM 2253
            DFFE T CIT DV+   D  ++ DSE  KQ L+S+DG+LES +S+GFK+T+SN NPDIVM
Sbjct: 2129 DFFEKTTCITCDVKFNSDTTKSSDSESIKQRLSSDDGYLESLTSAGFKVTISNPNPDIVM 2188

Query: 2254 VGFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLLADEEFSVTVGPAF 2313
            VG RIHVGNTSA++IPSEITIF RVIKLDEGMRSWYDIPFT AESLLADEEF++ VG  F
Sbjct: 2189 VGCRIHVGNTSASNIPSEITIFHRVIKLDEGMRSWYDIPFTTAESLLADEEFTIVVGRTF 2248

Query: 2314 NGTALPRIDSLEVYGRAKDEFGWKEKLDAVLDMEARALGSNSLLARSGKKRRSIQCAPIQ 2373
            +G+++PRIDS+EVYGRAKDEFGWKEK+DA LDMEA  LG +S   +SGKK +++Q APIQ
Sbjct: 2249 DGSSIPRIDSIEVYGRAKDEFGWKEKMDAALDMEAHVLGGSSASGKSGKKAQTMQAAPIQ 2308

Query: 2374 QQVLADGLKVLSSYYLLRRSQGCPKLNDVNQELTKLKCKQLLETIYESDREPLLQSAACR 2433
            +QVLAD L++LS  YLL +   C    D + EL  LKC+ LLETI++SDREPLL SAACR
Sbjct: 2309 EQVLADALRILSRIYLLCQPGFCTDTIDADMELNNLKCRSLLETIFQSDREPLLHSAACR 2368

Query: 2434 VLQAIFPKKEIYYQVKDTMRLTGVVKSTSVLSSRLGVGGAAGGWIIEEFTSQMRAVSKIA 2493
            VLQA+FPKKEIYY VKDTMRL GV+KS   ++SR+GVGGAA  W+ +EF +Q+  VSK+A
Sbjct: 2369 VLQAVFPKKEIYYHVKDTMRLLGVIKSLPSITSRIGVGGAASSWVTKEFIAQIHTVSKVA 2428

Query: 2494 LHRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVISSVELIYCYAECLAL 2553
            +HR+SNLA FLE +G+++VDGLMQ+ WGILDL++P+TQ +N++V+  VE IY YAECLAL
Sbjct: 2429 VHRKSNLASFLETHGTELVDGLMQVFWGILDLDRPDTQRINSLVVPCVEFIYSYAECLAL 2488

Query: 2554 HGPDTGRHSVAPAVVLFKKLLFSSSEAVQASSSLAISSRLLQVPFPKQTMLATDDGADIP 2613
            H  +    SVAPAV L KKLLF+  EAVQ SSSLAISSR LQVPFPKQTM+A DD  D  
Sbjct: 2489 HSNEKSGVSVAPAVALLKKLLFAPYEAVQTSSSLAISSRFLQVPFPKQTMIANDDAPDNH 2548

Query: 2614 LSAPVPTETTGTNPQVMIEEDAVASSVQYCCDGCSTVPILRRRWHCTICPDFDLCESCYE 2673
              A   + +T  N QVMIEED   SSVQYCCDGCSTVPILRRRWHC ICPDFDLCE+CYE
Sbjct: 2549 AKASAASNSTTGNAQVMIEEDPATSSVQYCCDGCSTVPILRRRWHCNICPDFDLCETCYE 2608

Query: 2674 VLDADRLPSPHSRDHPMTAIPIEVDSL-GDGNEYHFATEDINDSSLTSLIPDISVKNPVS 2733
            +LDADRLP+PHSRDHPM+AIPIE+D+  G+GNE HF+ +++ DSS+     D +++   S
Sbjct: 2609 ILDADRLPAPHSRDHPMSAIPIELDTFGGEGNEIHFSVDELTDSSVLQAPADRTIQTSPS 2668

Query: 2734 SIHVLEPADSGDFSASVTD--PVSISASKQTVNSLLLSELLEQLKGWMETTSGVQAVPVM 2793
            SIHVL+ ++S DF  S+T+   VSISASK+ +NSLLLS L+E+L GWMETT+G +A+P+M
Sbjct: 2669 SIHVLDASESVDFHGSMTEQRTVSISASKRAINSLLLSRLIEELSGWMETTAGTRAIPIM 2728

Query: 2794 QLFYRLSSTMGGPFMNSLKSENLNLERLIKWFLDEINLNKPFEAKTRTSFGEVAILVFMF 2853
            QLFYRLSS +GGPFM+S K ENL+LE+ +KW +DEIN++KPF AKTR SFGEV+ILVFMF
Sbjct: 2729 QLFYRLSSAVGGPFMDSTKPENLDLEKFVKWLIDEINISKPFPAKTRCSFGEVSILVFMF 2788

Query: 2854 FTLMLRNWHQPGSDGPGAKPSTTTDTHDKNSTQVAPSTSVTAQSSMDDQGKNDFTSQLLR 2913
            FTLM RNWHQPG+DG  +K   ++D  +K    V  ST+ T QSS DD  KN+F SQL+R
Sbjct: 2789 FTLMFRNWHQPGTDGSHSKSGGSSDLTEKGPVHVQVSTT-TLQSSNDDHDKNEFASQLIR 2848

Query: 2914 ACSSIRQQSFVNYLMDVLQQLVHVFKSSTIDYDSGHGFHNGSGCGALLTVRKDLPAGNFS 2973
            ACS++RQQSF+NYLMD+LQQLVHVFKSS+I   +G G  + SGCG+LLTVR++LPAGNFS
Sbjct: 2849 ACSALRQQSFLNYLMDILQQLVHVFKSSSI---NGEGGSSSSGCGSLLTVRRELPAGNFS 2908

Query: 2974 PFFSDSYAKAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKVYKIYSSKDLKL 3033
            PFFSDSYAK+H TDLF+DY++LLLEN FRLVY++VRPEK +K+ +K+K  K+ ++KDLKL
Sbjct: 2909 PFFSDSYAKSHPTDLFMDYYKLLLENTFRLVYSMVRPEK-EKSADKDKSCKVPNTKDLKL 2968

Query: 3034 DAYQDVLCSYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTEVKRLFKYINKV 3093
            D YQDVLCSYI+N +T+FVRRYARRLFLH+CGSK+HYYS+RDSWQ+S EVK+L K INK 
Sbjct: 2969 DGYQDVLCSYISNAHTTFVRRYARRLFLHLCGSKTHYYSVRDSWQYSHEVKKLHKIINKS 3028

Query: 3094 GGFQNPMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRHADVLPFLLNGIFYFGEESVV 3153
            GGF+NP+ YERSVK++KCL+T+ +VAA+RPRNWQK+CL+H D+LPFL++  +YF EE +V
Sbjct: 3029 GGFRNPVPYERSVKLIKCLSTLCDVAASRPRNWQKFCLKHTDLLPFLMDNFYYFSEECIV 3088

Query: 3154 QTLKLLNLAFYTGKDIGHSVQKSEAGDTGTSTNKSGTQTVDSRKKRKGEDGNDSALEKSY 3213
            QTLKLLNLAFY+GKD  H+ QK+E+GD G+ST ++G+Q+ DS+KKRKG+D ++ + EKS 
Sbjct: 3089 QTLKLLNLAFYSGKDANHNAQKTESGDIGSST-RTGSQSSDSKKKRKGDDSSEGSSEKSC 3148

Query: 3214 LDMEIMVNIFVDKGSNVLSHFIDCFLLEWNSSSVRAETKGVVCGIWHHGKQTFKETLLMA 3273
            +DME  V +F  K  +VL  F+D FLLEWNS+SVR E K V+ G+W+H K +FKE +L  
Sbjct: 3149 MDMEQAVVVFTGKDGDVLKRFVDTFLLEWNSTSVRHEAKSVLFGLWYHAKSSFKENMLTT 3208

Query: 3274 LLQKVKTLPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSDVIRSIYQTLHS 3333
            LLQKVK LPMYG NI EYT+L+T LLGK  D  +KQ  +ELL++CLTSDV+  I+ TLHS
Sbjct: 3209 LLQKVKYLPMYGQNIIEYTDLMTCLLGKANDSTAKQSDTELLNKCLTSDVVSCIFDTLHS 3268

Query: 3334 QNELLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLESLKSETKFTDN 3393
            QNELLANHPNSRIYNTLS LVEFDGYYLESEPC  CS P+VPYSRMKLESLKSETKFTDN
Sbjct: 3269 QNELLANHPNSRIYNTLSCLVEFDGYYLESEPCVTCSCPDVPYSRMKLESLKSETKFTDN 3328

Query: 3394 RIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNNWSLWKRAKSCH 3453
            RIIVKCTGS+TIQ+V MNV+DARKSKSVKVLNLYYNNRPV DLSELKNNWSLWKRAKSCH
Sbjct: 3329 RIIVKCTGSFTIQSVTMNVYDARKSKSVKVLNLYYNNRPVTDLSELKNNWSLWKRAKSCH 3388

Query: 3454 LAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPVTDKHGICSNCH 3513
            L FNQTELKVEFPIPITACNFMIELDSFYENLQA SLE LQCPRCSR VTDKHGICSNCH
Sbjct: 3389 LTFNQTELKVEFPIPITACNFMIELDSFYENLQASSLESLQCPRCSRSVTDKHGICSNCH 3448

Query: 3514 ENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNMENDEDMKRGLAA 3573
            ENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEF+FMAKPSF+FDNMEND+DM++GL A
Sbjct: 3449 ENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFHFMAKPSFSFDNMENDDDMRKGLTA 3508

Query: 3574 IESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSLPGPSCKINRKI 3633
            IESESENAHRRYQQL+G+KKPL+K+VSSIGE E+DSQQKD+VQQMMVSLPGP+ K+NRKI
Sbjct: 3509 IESESENAHRRYQQLMGFKKPLIKLVSSIGEQEIDSQQKDAVQQMMVSLPGPTGKVNRKI 3568

Query: 3634 ALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASRFVISRSPNNCY 3693
            ALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQK+++D        I RSP++CY
Sbjct: 3569 ALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQKNSNDTDALPACSIPRSPSSCY 3628

Query: 3694 GCATTFVTQCLEILQVLSKHQSSKKQLVSLGILSELFENNIHQGPKTARIQARAVLCSFS 3753
            GC+TTFVTQCLE+LQVLSKH +S+KQLVS GILSELFENNIHQGP+TAR  ARAVL SFS
Sbjct: 3629 GCSTTFVTQCLELLQVLSKHATSRKQLVSAGILSELFENNIHQGPRTARTLARAVLSSFS 3688

Query: 3754 EGDVNAVNGLNNLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLADEFWEARLRVVF 3813
            EGD +AV  LNNLIQKKVMYCLEHHRSMDI+ +TREEL LLSE C+L DEFWEARLRV F
Sbjct: 3689 EGDADAVQELNNLIQKKVMYCLEHHRSMDISQSTREELLLLSETCALVDEFWEARLRVAF 3748

Query: 3814 QLLFSSIKSGAKHPAIAEHIILPCLRIISQACTPPKSDTVDKEQRMGKLTSVSQNKDENA 3873
            QLLFSSIK GAKHPAI+EHIILPCLRIISQACTPPKSD+ +KE  MGK +S+ Q K+++ 
Sbjct: 3749 QLLFSSIKVGAKHPAISEHIILPCLRIISQACTPPKSDSGEKEPGMGK-SSLMQAKNDDT 3808

Query: 3874 TNISGSFSGPVSGNKSAPESLEHNWDSSHRTQDIQLLSYAEWEKGASYLDFVRRQYKVSQ 3933
                G     +S +K+  E      D S R QDI LLSY+EWE GASYLDFVRRQYKVSQ
Sbjct: 3809 V---GHSVTNLSTSKTQSELSGKIPDGSRRRQDISLLSYSEWESGASYLDFVRRQYKVSQ 3868

Query: 3934 VCKGTVQRSR--TQKGDYLSLKYALKWKRFVCR-NAKSDLSAFELGSWVTELVLCACSQS 3993
              KG +Q++R  +QK DYL LKY L+WKR  CR ++K D S F LGSWV++L+L +CSQS
Sbjct: 3869 AVKG-LQKTRHDSQKSDYLVLKYGLRWKRRACRKSSKGDFSKFALGSWVSDLILSSCSQS 3928

Query: 3994 IRSEMCMLISLLCAQSSSRRFRLLDLLVSLLPATLSAGESAAEYFDLLFKMVDSEDARLF 4053
            IRSE+C LISLLC  +SSR+F+LL+LL+SLLP TLSAGESAAEYF+LL  M+D+E +RLF
Sbjct: 3929 IRSEICTLISLLCPSNSSRQFQLLNLLMSLLPRTLSAGESAAEYFELLGTMIDTEASRLF 3988

Query: 4054 LTVRGCLRTICQLISQEVGNVESLERSLHIDISQGFILHKLIELLGKFLEIPNIRSRYNF 4113
            LTVRGCL T+C LI++EV NVES ERSL IDISQGFILHKL+ELL KFLEIPNIR+    
Sbjct: 3989 LTVRGCLTTLCSLITKEVSNVESQERSLSIDISQGFILHKLVELLNKFLEIPNIRA---- 4048

Query: 4114 FLKHLFFRLLLFKLIQCDLIVRFMRDNLLSEVLEALIVIRGLVVQKTKLISDCNRLLKDL 4173
                                 RFM DNLLS+VLEA +VIRGLVVQKTKLI+DCNRLLKDL
Sbjct: 4049 ---------------------RFMSDNLLSDVLEAFLVIRGLVVQKTKLINDCNRLLKDL 4108

Query: 4174 LDSLLLESNENKRQFIRACICGLQIHGEERKGRTCLFILEQLCNLISPSKPEPVYLLVLN 4233
            LDSLL+ES  NKRQFIRACI GLQ H +E+K RT LFILEQLCNLI P KPEPVYLL+LN
Sbjct: 4109 LDSLLVESTANKRQFIRACISGLQKHVKEKKRRTSLFILEQLCNLICPVKPEPVYLLILN 4168

Query: 4234 KAHTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKICHQLDLLGFLEDDYGMELLVAGNIIS 4293
            KAHTQEEFIRGSMT+NPYSSAEIGPLMRDVKNKICHQLDL+G LEDDYGMELLVAGNIIS
Sbjct: 4169 KAHTQEEFIRGSMTRNPYSSAEIGPLMRDVKNKICHQLDLIGLLEDDYGMELLVAGNIIS 4228

Query: 4294 LDLSIALVYEQVWKKSN-QSSNAISNTALISTTAA--RDSPPMTVTYRLQGLDGEATEPM 4353
            LDLSI+ VYEQVW+K + Q+ +++SN + +S  A+  RD PPMTVTYRLQGLDGEATEPM
Sbjct: 4229 LDLSISQVYEQVWRKHHGQTQHSLSNASQLSAAASSVRDCPPMTVTYRLQGLDGEATEPM 4288

Query: 4354 IKELEEDREESQDPELEFAIAGAVREYGGLEILLGMIQRI-WDNFKSNQEQLVAVLNLLM 4413
            IKELE++REESQDPE+EFAIAGAVRE GGLEI+L MIQ +  D  +SNQE+L +VLNLL 
Sbjct: 4289 IKELEDEREESQDPEVEFAIAGAVRECGGLEIILSMIQSLREDELRSNQEELGSVLNLLK 4348

Query: 4414 HCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESESISIG 4473
            +CCKIRENR ALLRLGALGLLLETARRAFSVDAME AEGILLIVESLT+EANES+ ISI 
Sbjct: 4349 YCCKIRENRCALLRLGALGLLLETARRAFSVDAMEPAEGILLIVESLTMEANESD-ISIA 4408

Query: 4474 QSALTVTSEQTGTGEQAKKIVLMFLERLSHPFGSKKSNKQQRNTEMVARILPYLTYGEPA 4533
            QS  T T+E+TG GE+AKKIVLMFLERL  P G+KKSNKQQRN EMVARILP LTYGEPA
Sbjct: 4409 QSVFTTTTEETGAGEEAKKIVLMFLERLCPPDGAKKSNKQQRNEEMVARILPNLTYGEPA 4468

Query: 4534 AMDALIQHFTPYLNDWDEFDRLQKQHEDNPEDKSISEQAAKQRFTVENFVRVSESLKTSS 4593
            AM+AL+ HF PYL +W EFD+LQKQHE+NP+D+++S+ A+ QR  VENFVRVSESLKTSS
Sbjct: 4469 AMEALVLHFEPYLMNWSEFDQLQKQHEENPKDETLSKNASMQRSAVENFVRVSESLKTSS 4528

Query: 4594 CGERLKDIILEKGITGLAIKHLRDSFAVAGQTGFRSSVEWAFALKRPSIPLILSMLRGLS 4653
            CGERLK+IILEKGIT  A+ HLR+SFA AGQ  FR+S EW   LK PSIPLILSML+GL+
Sbjct: 4529 CGERLKEIILEKGITKAAVGHLRESFASAGQASFRTSAEWTVGLKLPSIPLILSMLKGLA 4588

Query: 4654 MGHLATQRCIDEGRILPVLHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDKVRML 4713
             G L TQ+C+DE  ILP+LHALE VPGENEIGARAENLLDTL+NKE NGDGFL +K++ L
Sbjct: 4589 KGDLPTQKCVDEEDILPLLHALEGVPGENEIGARAENLLDTLANKENNGDGFLAEKIQEL 4648

Query: 4714 RHATRDEMRRLALKNREDMLQGLGMRQ-VASDGGERIIVSRPALEGLEDVEEEEDGLACM 4773
            RHATRDEMRR ALK RE +LQGLGMRQ  ASDGG RI+VS+P +EGL+DVEEEEDGLACM
Sbjct: 4649 RHATRDEMRRRALKKREMLLQGLGMRQEFASDGGRRIVVSQPIIEGLDDVEEEEDGLACM 4696

Query: 4774 VCREGYSLRPTDLLGVYSYSKRVNLGVGTSGSTRGECVYTTVSYFNIIHYQCHQEAKRTD 4833
            VCREGY+LRPTD+LGVY++SKRVNLG  +SGS RG+CVYTTVS+FNIIHYQCHQEAKR D
Sbjct: 4709 VCREGYTLRPTDMLGVYAFSKRVNLGATSSGSGRGDCVYTTVSHFNIIHYQCHQEAKRAD 4696

Query: 4834 AGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADGNRL 4872
            A LK PKKEW+GATLRNNE+LCN +FP+RGPSVP  QY R +DQ+WD LN+LGRADG+RL
Sbjct: 4769 AALKNPKKEWDGATLRNNETLCNCIFPLRGPSVPPGQYTRCLDQYWDQLNSLGRADGSRL 4696

BLAST of HG10018299 vs. ExPASy Swiss-Prot
Match: Q54QG5 (Probable E3 ubiquitin-protein ligase DDB_G0283893 OS=Dictyostelium discoideum OX=44689 GN=DDB_G0283893 PE=3 SV=2)

HSP 1 Score: 1485.7 bits (3845), Expect = 0.0e+00
Identity = 1161/3942 (29.45%), Postives = 1862/3942 (47.23%), Query Frame = 0

Query: 1412 IFEAALVSLDMAFLRF--DISVSKSYFHFVVQLLKGDKSMKLLLERILILMGKLASDERL 1471
            +F+  L SL+ AF  +     + K YF  +  +  G   +  L   I  L   L ++  +
Sbjct: 1869 LFDVLLSSLNNAFTVWIDQPKLLKGYFELLQFIAMGQHQLLKLFNEISNLSSPLLTNPTI 1928

Query: 1472 --LPGLKYLFSFLEMILIESGSGK----NVFERPSGKPLSRYAPEVGPLSSKSVGPRKNS 1531
              L  L  L  F+E IL  S S K    +      G   S +           V   K+S
Sbjct: 1929 DQLESLNLLVEFIENILDLSKSLKKPSSDQQHHSGGCHHSNHHHHHHHSRKDEVMVDKSS 1988

Query: 1532 ETLVLSSNQEEGPASFECDATSAEEDEDD---GTSDGEVASLDKDEEEDTNSERALASKV 1591
             T V+  +  +            EEDED+    + D +V + +++  E+ + ER L+SKV
Sbjct: 1989 ITNVVDEDILKDDVE-----VMDEEDEDELQYLSEDEKVVNGNENTGEEDDEERKLSSKV 2048

Query: 1592 CTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRGHRVVYSRSSRFFCDCGAGGV 1651
            CT+T + +++++QHWYFCYTC L  S+GCCSVC KVCH+GH+V YSR SRFFCDCGAG  
Sbjct: 2049 CTYTFTKNDYIDQHWYFCYTCGLKFSEGCCSVCVKVCHKGHQVSYSRYSRFFCDCGAGAG 2108

Query: 1652 RGSSCQCLKPRKYT--------GHGSAPVRGASNFQCFLPFSEEGDQLPESESDLEDDVS 1711
            +G+ C+ LKPR Y+             P +   + Q  +   ++  Q  E +       S
Sbjct: 2109 KGNPCKALKPRLYSPPKQLQQQQQQQQPQQPPQDQQKNVAAEQQPQQQQEEQQVASTTSS 2168

Query: 1712 VTDTDKCLRPSVPRELLDGVS--------------------------------------- 1771
             T+T+   +P +P  L    S                                       
Sbjct: 2169 ATNTND--QPIIPDSLSSSSSNSSPSFNNNNNNNNNGTTGSGNTFSSINNYFFNLPTQQD 2228

Query: 1772 -------VLLEELDVEGRMLELCSCLL------PTITNQRDPDLSKDKKIILGKDKVLSY 1831
                   +  ++ ++  ++ EL   L+       T   +   D + +K  +  +    S 
Sbjct: 2229 KEQLFNEIFNDKSNIISKLSELYPKLIEMYKKVQTDLIKPQSDSNSNKLEVFEESSTQSI 2288

Query: 1832 G-------LDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSLVKSLLSVSIRGRLA 1891
                     D+   KK  K G+ + K+K E     +LK+ L+SG + +  ++ + +G LA
Sbjct: 2289 SSREIISKTDIFTTKKVSKNGTFEAKLKLEGVEGNQLKTFLSSGPIQRKAIASTSKGLLA 2348

Query: 1892 VGEGDKVSIFDIRQLIEQTTVAPMTADKTNV-KPLSKNVVRFEIVHLAFNPTVENYLAVA 1951
            + EGD VS+F+  ++++         DK +  K LSK+ V + IV + FNP  E +LAV 
Sbjct: 2349 IAEGDSVSLFNSSKILDDDA----QLDKHSFSKALSKSSVAYPIVSMVFNPLNERFLAVV 2408

Query: 1952 GYEDCQVLTLNHRGEVVDRLAIELAL----QGAHIKRMEWVPGSQVQLMVVTNRFVKIYD 2011
            G+++ ++LT+N + E+VD+L I+L+L    +  +I ++EW+ GSQV+L +VTN F+KIYD
Sbjct: 2409 GFKEVKILTINQKDEIVDQLVIDLSLDALGETIYIIKVEWIIGSQVELAIVTNEFIKIYD 2468

Query: 2012 LSLDNISPMHYFTLPDDMVVDATLSTASQGRMFLIVLSENGRIFRLELSVLGNVGATPLK 2071
            LS DN+SP+H+F+L +D + D  L     G+  ++ LS  G ++   +    +  +  + 
Sbjct: 2469 LSKDNLSPIHFFSLLEDSIKDMCL-VQKNGKNHILALSNYGLLYFQAIEDSIDNESCIMI 2528

Query: 2072 EIIEIQGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTEISVIYEEEQDR 2131
            E +++   ++SA G+S++++    L+  +Y +G     Q++   T +T    I +  +  
Sbjct: 2529 ETLQVPINKISA-GVSVHYNIDLDLVIASYTNGECYAFQVNDTMTNVTRSFPIMDPAKKL 2588

Query: 2132 KLRPAGLHRWKELFAGSGLFVCFSSVKSNSALAVSMGAHDIYAQNLRHAGGSSLPLVGIT 2191
             +          +F    ++ C ++ +    L   MG  DI  QNL+     +  + G+T
Sbjct: 2589 PMPAQYFINLSPMF--PNVYACLAA-RGGYLLGFKMGTKDISIQNLK----LTQRVEGMT 2648

Query: 2192 AYKPLSKDKIHCLVLHDDGSLQIYT----------------------------------- 2251
                L++     L+L DDGS+  Y                                    
Sbjct: 2649 I---LNRSSPKLLILFDDGSIGRYDFNLENTIQQPLPTTSTTTDSINESDKKGLDILLYL 2708

Query: 2252 ---------------------HTAVGVDASAYATAEKIKK---LGSGILNNKVYASTNPE 2311
                                 +    V +S  +T+  IK    L S +L      +T+P 
Sbjct: 2709 KSKYQDSINKNNSGSSINSSGNNVATVSSSTTSTSAVIKSPPPLSSLLLQPTTTTTTSPV 2768

Query: 2312 FPLDFFENTVCITADVRLGGDAIRNGDSEGAKQSLASEDGFLESPSSSGFKITVSNSNPD 2371
            FP+DFFE+T CIT +V+ GGD ++    +  KQ LAS D ++   S     + + N+N +
Sbjct: 2769 FPIDFFESTECITPNVKYGGDPLQWFSQDVIKQKLASNDEYIVCQSLETLTLVIMNNNFN 2828

Query: 2372 IVMVGFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLLADEEFSVTVG 2431
              + G RI VGN S  HIP+EI +F R I+L EG R WYDIPFT  E+L + ++ S T G
Sbjct: 2829 NAICGIRILVGNASTKHIPTEIRVFNRTIQLKEGQRRWYDIPFTTEETLRSIKKVSFTAG 2888

Query: 2432 PAFNGTALPRIDSLEVYGRAKDEFGWKEKLDA-----VLDMEARALGSNSLLARSGKKRR 2491
              F     P ID +EVY + KD  G+ +  D+      +D    + G ++    SG    
Sbjct: 2889 STFTMGTSPIIDQVEVYAKNKDSLGFNDSDDSDDEFPTVDENVTSSGLSTSAGGSGGGVA 2948

Query: 2492 SIQCAPIQQQVLADGLKVLSSYYLLRRSQGCPKLNDVNQELTKLKCKQL--LETIYESDR 2551
                +   +    + + +L  +  L+        ND +Q+  +LK K L  L ++    +
Sbjct: 2949 GTNDSTADKHTPLE-IVILHCFNSLKNYFSNHVANDDSQKFIELKEKTLSILPSMITDSQ 3008

Query: 2552 EPLLQSAACRVLQAIFPKKEIYYQVKDTMRLTGVVKSTSVLSSRLGVGGAAGGWIIEEFT 2611
               ++S+  ++L+ +    E Y  +K +++L    ++ S + S  G G       +E+  
Sbjct: 3009 LSFVRSSIKKLLKILSTNSEHYQTLKHSIQLKYASQTVSSILSNSG-GTIPNSADLEKLD 3068

Query: 2612 SQMRAVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLEQ---------------- 2671
              + A+ KI+    +NL  +L ++    +  L+ I                         
Sbjct: 3069 YLVAALKKISSSNPNNLEKYLFKDHPNFLSDLLTIYRNTTSPSSSKGKSSSASASSSSST 3128

Query: 2672 --------PNTQT-LNNIVISSVELIY--CYAECLALHGPDTGRHSVAPAVVLF---KKL 2731
                     NTQ+  NN  +SS  L+Y   +   L     +  R  +    ++F   K L
Sbjct: 3129 TTATSTLPSNTQSGSNNGAVSSNALLYSDSFISNLVQLLWNCQRSKIFSTDLIFTLLKSL 3188

Query: 2732 LFSSSEAVQASSSLAISSRLLQVPFPKQTMLATDDGADIPLSAPVPTETTGTNPQVMIEE 2791
            L  S+E ++  SSL + S              +  G +  ++  +P  ++ +N  V+  +
Sbjct: 3189 LSHSNEIIRTRSSLILVS------------FISKSGNNSTVANLLPPPSSSSNENVVDND 3248

Query: 2792 DA---------VASSVQYCCDGCSTVPILRRRWHCTICPDFDLCESCYEVLDADRLPSPH 2851
            +          +   V + CD C+  PI  +RW+C+ C DFDLC  CY+  + D     H
Sbjct: 3249 NTNKENEGDIQMVDEVLFSCDLCNINPITGKRWNCSNCGDFDLCNQCYQNPEKD-----H 3308

Query: 2852 SRDHPMTAIPIEVDSLGDGNEYHFATE-----DINDSSLTSLIPDISVKNPVSSIHV--- 2911
             +DH      I+ + + DG+E     E        D  L   + D S  +    I +   
Sbjct: 3309 PKDHIFKEFIID-EPMKDGDEKESTNEPPQQQKQQDQQLQQDLQDDSEYDEELKIAISMS 3368

Query: 2912 LEPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWMET---------TSGVQ-A 2971
            L   ++ + +    D  +++ +  T N    +   E + G+++            G    
Sbjct: 3369 LNNNNNNNNNNESMDTSTLTTTTTTTNKTTPTTNEEPMVGFIKLIIEEIIVSYEKGFSFM 3428

Query: 2972 VPVMQLFYRLSSTMGGPFMNSLKSENLNLERLIKWF-LDEINLNKPFEAKTRTSFGEVAI 3031
            +P MQ+ Y          +N+ +  N  +  L+      + NL+K    K+    G+  I
Sbjct: 3429 IPYMQILYSTILHNTTFILNNNQLSNQLVTTLVNLISKHKSNLSKFLSTKSNNLEGD--I 3488

Query: 3032 LVFMFFTLMLRNWHQPGSDGPGAKPSTTTDTHDKNSTQVAPSTSVTAQSSMDDQGKNDFT 3091
            L+F   +L+L +  Q       +K +TTT      +T  A +T VT  S +     +   
Sbjct: 3489 LIFSLLSLLLDSDEQK-RQKIQSKQTTTTAAAAATTTTTATAT-VTTPSVVTT--PHSLP 3548

Query: 3092 SQLLRACSSIRQQSFVNYLMDVLQQLVHVFKSSTIDYDSGHGFHNG----SGCGALLTVR 3151
            S L+   S +    F++   D++Q L    +         HG   G    S  GALL   
Sbjct: 3549 STLIYHLSKL----FID--RDIIQLLRLWIEQLYTCISESHGLSVGEERDSPFGALLVPS 3608

Query: 3152 KD----LPAGNFSPFFSDSYAKAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKE 3211
             D    LP   F+PFF     ++  +      H LL +  F+L+ T  R E+  K++ + 
Sbjct: 3609 IDENQTLPKNRFTPFFGKYLPQSMGS-----IHLLLSKAIFKLMITFYRCERRKKSITQT 3668

Query: 3212 KVYKIYSSKDLKLDAYQDVLCSYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFS 3271
                I  S+      + +++CS+I++  T  + +Y ++L   I  +KS YYSIRD +   
Sbjct: 3669 TPTLIKPSE------WTNLICSFIHSKKTVSIVKYPKKLLFLIYQTKSSYYSIRDEFLLK 3728

Query: 3272 TEVKRLFKYINKVGGFQNPMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRHADVLPFL 3331
             +   +     K  GF + + Y+   K++  LT M EVA+ RP++WQ +C  H DVLP L
Sbjct: 3729 KKFAGILDLEGKTKGFSDEIGYDHLAKLISYLTLMLEVASDRPKSWQFFC-AHNDVLPKL 3788

Query: 3332 LNGIFYFGEESVVQTLKLLNLAFYTGKDIGHSVQKSEAGDTGTSTNKSGTQTVDSRKKRK 3391
               +F   EE     L+LL   F             E  +   S+    TQ  +S     
Sbjct: 3789 YKILFNLAEEPSSLLLELLTYVFV-----------DEIAEQPLSSTSQDTQQ-ESSNNNN 3848

Query: 3392 GEDGNDSALEKSYLDMEIMVNIFVDKGSNVLSHFIDCFLLEWNSSSVRAETKGVVCGIWH 3451
              + ND  ++      + +     ++  NVL   I   LLE NSS +R+     +  +W 
Sbjct: 3849 NNNSNDILMQDVDTKAKHISIFLQEQYFNVL---IFNILLESNSSDLRSIASSFIYYLWR 3908

Query: 3452 HGKQTFKETLLMALLQKVKTLPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLT 3511
                  +  +  +L  K+  +  YG N +E+ +L+T+ L +      K Q +E  ++   
Sbjct: 3909 SSNNEQRIFINKSLWSKLNNVASYGKNASEFMDLLTYFLNETDSQSWKDQHNEFSNK--- 3968

Query: 3512 SDVIRSIYQTLHSQNELLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMK 3571
                  + ++   QN++  NHPNS+IYN+L  ++EFDGYYLESEPC  C++PEV Y   +
Sbjct: 3969 ------LIESFKQQNQISLNHPNSQIYNSLGKILEFDGYYLESEPCLVCNNPEVQYQTSR 4028

Query: 3572 LESLKSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELK 3631
            LESLK E KF++   ++K  G Y I  +++ +HD +K K +K +NL+YNN+PVAD+ +LK
Sbjct: 4029 LESLKQEVKFSEYSQLIKFNGVYNISKIMIQLHDVKKGKMIKTINLFYNNKPVADIGDLK 4088

Query: 3632 NNWSLWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSR 3691
              ++ WK+ K  H   +QTE  V F IPI+A NFMIE   F++NLQA S E LQCPRCSR
Sbjct: 4089 GKFNQWKKLKQVHFTPSQTEKAVVFQIPISARNFMIEYFDFHDNLQAASSEKLQCPRCSR 4148

Query: 3692 PVTDKHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDN 3751
             VTDKHGIC NCHENAYQC+ CRNINYENLD+FLCNECG+ K+ +F+++F+ KP+   + 
Sbjct: 4149 IVTDKHGICKNCHENAYQCKHCRNINYENLDAFLCNECGFCKHAKFDYSFVCKPTIAIEK 4208

Query: 3752 MENDEDMKRGLAAIESESENAHRRYQQLLGYKKPLLKIVSSIGENE-------------- 3811
            +EN ED KR +  IE ESENAH++YQ+L+G+KK +  +++S    E              
Sbjct: 4209 IENQEDHKRAIQTIEKESENAHKKYQRLIGFKKVISGLINSFETQEPWSKDDLIKSSGGT 4268

Query: 3812 -----MDSQQKDSVQQMMVSLPGP------------------------------------ 3871
                   +    S  Q + S  G                                     
Sbjct: 4269 ISIANTSTNSTGSNNQSINSSSGNISTNSSSSSSSSFGISNQSSSGNGGGGVGSGGGGVI 4328

Query: 3872 ------------SCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLH--- 3931
                        + +IN+KI  L  LY  +C+  ++ +SKSVQ LQ  R  +  Y++   
Sbjct: 4329 NQSGTSNNSSFLTLRINKKIGYLSRLYERECRNIYEGLSKSVQILQTNRMEISKYMNFIS 4388

Query: 3932 -----------------QKHTDDGFPASRFVISRSPNNCYGCATTFVTQCLEILQVLSKH 3991
                             Q+ +    P S  +  R  N CYGC+ +++ Q L +L    ++
Sbjct: 4389 GGGQPSSNDKQQQQQQQQQQSSRQCPVS--IHLREENKCYGCSNSYIEQVLCLLNSFCRN 4448

Query: 3992 QS---SKKQLVSLGILSELFENNIHQGPKTARIQARAVLCSFSEGDVNAVNGLNNLIQKK 4051
                  K  L+  G+  E+F NNIH G   A+  A++ L   ++ +++  + +N  I+ K
Sbjct: 4449 SELTPIKNLLIEKGLPKEIFFNNIHHGKSIAKGWAKSSLSYLTKSNIDCTSMVNQWIKDK 4508

Query: 4052 VMYCLEHHRSMDIALATREELSLLSEVCSLADEFWEARLRVVFQLLFSSIKSGAKHPAIA 4111
            + Y L ++ S+D+      E+SLL E  SL+D  W  RL  + +L F ++ SG++ P ++
Sbjct: 4509 IYYTLCYYSSLDVPNMVSSEISLLKECSSLSDNIWPQRLSFIMELFFKALSSGSQSPVVS 4568

Query: 4112 EHIILPCLRIISQACTPPK----------------------------------------- 4171
            E+IILPCL+II   CT  +                                         
Sbjct: 4569 EYIILPCLKIIIYLCTLDRKGAFITKDAKDLEISQKLLATRFEKLKSSKKQAIALAATAT 4628

Query: 4172 ----------SDTVDK-------------------------------------------- 4231
                      + T++K                                            
Sbjct: 4629 ATATAANASLTSTIEKKIEALSSLLSPNTVNNIALSVASSLLSPQQMQLQIQQQIALQQQ 4688

Query: 4232 ---------EQRMGKLTS----------------VSQNKDENATNISGSFSG-PVSGNKS 4291
                     +Q++ +  S                V     EN    SGS SG  VSG+ S
Sbjct: 4689 QIQQQIQQQQQQLNESVSGLKILSPSSSSSSPSGVGATGSENGGGGSGSSSGSSVSGSGS 4748

Query: 4292 APESLEHN-----------------WDSSHRTQDIQLLSYAEWEKGASYLDFVRRQYKVS 4351
               S + N                 WD      D   ++ A   K  +YL        + 
Sbjct: 4749 ISSSQDPNILSIFDNDTSNAGANESWDG-----DDNPIANAWSSKYENYLSNFNVNSSIG 4808

Query: 4352 QVCKGTVQRSRTQ-KGDYLSLKYALKWKRFVCRNAKSD------LSAFEL---GSWVTEL 4411
            ++ K     S+ + K  Y    YA   KR   +  +S        S+FE+     W+ +L
Sbjct: 4809 EISKSLKPLSQDELKSKYFKRWYAQVKKRKQQQQQQSSGIGYATQSSFEVLFEEKWLEKL 4868

Query: 4412 VLCACSQSIRSEMCMLISLLCAQSSSRRFRLLDLLVSLLPATLSAGESAAEYFDLLFKMV 4471
            +  + S SIR E+  L+ +L   S+SR  + LDLL  +LP    AGE +AE+F L    +
Sbjct: 4869 LFNSTS-SIRLEIITLMGILSKNSNSRSLKFLDLLTKILPNATEAGEYSAEFFGLFNSFI 4928

Query: 4472 D-SEDARLFLTVRGCLRTICQLISQEVGNVESLERSLHIDISQGFILHKLIELLGKFLEI 4531
              S+D +++L V+G +  IC  I +E+ +++S E S   D+SQGF+L  L+ +L  FL++
Sbjct: 4929 STSQDRKIYLAVKGFIPFICDAIIKEIEHIKSKEGSFSTDVSQGFVLKTLVAILKSFLDV 4988

Query: 4532 PNIRSRYNFFLKHLFFRLLLFKLIQCDLIVRFMRDNLLSEVLEALIVIRGLVVQKTKLIS 4591
            P +++                         +  +DN+L +VL+A + +RG++VQK KL  
Sbjct: 4989 PTLKA-------------------------KMKKDNMLEKVLDAFLSLRGVIVQKNKLTE 5048

Query: 4592 DCNRLLKDLLDSLLLESNENKRQFIRACICGLQIHGEERKGRTCLFILEQLCNLISPSKP 4651
            D  R L++L+ SL  ES ++ ++F+ A I  L  +  +  GRT +FI EQLCN++ P+KP
Sbjct: 5049 DSVRYLQELMKSLNNESVQDNKKFMAANIKALAKY--QSDGRTPIFIFEQLCNIVCPTKP 5108

Query: 4652 EPVYLLVLNKAHTQEEFIRGSMTKNPYSSAEI-GPLMRDVKNKICHQLDLLGFLEDDYGM 4711
            +P+Y L+L KA +QEE+IRGSM +NPY+S    GPLMRDVKNKIC  LDL  FL+DD GM
Sbjct: 5109 DPIYQLILFKAASQEEYIRGSMNRNPYTSNTFGGPLMRDVKNKICKALDLGSFLDDDNGM 5168

Query: 4712 ELLVAGNIISLDLSIALVYEQVWKKSNQSSNAISNTALISTTAARDSPPMTVTYRLQGLD 4771
            ELLV   II LDL I  VYE VWKKS Q+      TA I+        PM V YRLQGLD
Sbjct: 5169 ELLVDNKIIKLDLPIKKVYELVWKKSPQA----LRTADINI-------PMNVVYRLQGLD 5228

Query: 4772 GEATEPMIKELEEDREESQDPELEFAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVA 4831
            GEATE +I+ L ++  E +DPE+E+ I   + E GGLE ++ MI+RI ++F   +E    
Sbjct: 5229 GEATEEIIETLNDNNSEEKDPEVEYEITSVMAECGGLESMISMIERI-NDFSIEKELAQL 5288

Query: 4832 VLNLLMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANES 4869
            V+ LL HCCKI+ NR+ LL L  +G LLE  ++AF     E +E +L+I+ES+  EAN  
Sbjct: 5289 VIKLLYHCCKIKINRQKLLTLNTVGRLLEKLKQAF--HQPELSEHLLVIIESVVSEAN-- 5348

BLAST of HG10018299 vs. ExPASy Swiss-Prot
Match: Q5T4S7 (E3 ubiquitin-protein ligase UBR4 OS=Homo sapiens OX=9606 GN=UBR4 PE=1 SV=1)

HSP 1 Score: 1195.6 bits (3092), Expect = 0.0e+00
Identity = 1022/3598 (28.40%), Postives = 1628/3598 (45.25%), Query Frame = 0

Query: 1525 TLVLSSNQEEGPASFECDATSAEEDEDDGTSDGEVASLDKDEEEDTNSERALASKVCTFT 1584
            T  LS +  +GP+    D      + D    + E+A  ++D + + + E +L +K+CTFT
Sbjct: 1607 TNALSQSNGQGPSHLSVDGEERAIEVDSDWVE-ELAVEEEDSQAEDSDEDSLCNKLCTFT 1666

Query: 1585 SSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRGHRVVYSRSSRFFCDCGAGGVRGSS 1644
             +   FM QHWY C+TC +    G C+VCAKVCH+ H + Y++   FFCDCGA      S
Sbjct: 1667 ITQKEFMNQHWYHCHTCKMVDGVGVCTVCAKVCHKDHEISYAKYGSFFCDCGA--KEDGS 1726

Query: 1645 CQCLKPRKYTGHGSAPVRGASNFQCFLPFSEEGDQLPESESDLE------DDVSVTDTDK 1704
            C  L  R     G +     S FQ     SE   +   + S  +       D  V D +K
Sbjct: 1727 CLALVKRT-PSSGMSSTMKESAFQSEPRISESLVRHASTSSPADKAKVTISDGKVADEEK 1786

Query: 1705 CLRPSVPRELLDGVSVLLEELDVEGR------MLELCSCLLPTI---------------T 1764
              + S+ R     V    EEL  +        +L++ + L+  I                
Sbjct: 1787 PKKSSLCRT----VEGCREELQNQANFSFAPLVLDMLNFLMDAIQTNFQQASAVGSSSRA 1846

Query: 1765 NQRDPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASG 1824
             Q   +L   +K +   D+++   L       + +G   ++++       + ++  +++ 
Sbjct: 1847 QQALSELHTVEKAVEMTDQLMVPTLG------SQEGAFENVRMNYSGDQGQTIRQLISAH 1906

Query: 1825 SLVKSLLSV--SIRGR---LAVG-EGDKVSIFDIRQLIEQTTVAPMTADKTNVKPLSKNV 1884
             L +  + V  S  GR   LAV  E  K+++  +  L++Q   A  +  K  +  L+   
Sbjct: 1907 VLRRVAMCVLSSPHGRRQHLAVSHEKGKITVLQLSALLKQ---ADSSKRKLTLTRLASAP 1966

Query: 1885 VRFEIVHLAFNPTVENYLAVAGYEDCQVLTLNHRGEVVDRLAIELAL-QGAHIKRMEWVP 1944
            V F ++ L  NP  E+YLAV G +DC VLT +  G V D L +   L  G  I +  W+P
Sbjct: 1967 VPFTVLSLTGNPCKEDYLAVCGLKDCHVLTFSSSGSVSDHLVLHPQLATGNFIIKAVWLP 2026

Query: 1945 GSQVQLMVVTNRFVKIYDLSLDNISPMHYFTLPDDMVVDATLSTASQGRMFLIVLSENGR 2004
            GSQ +L +VT  FVKIYDL +D +SP  YF LP   + D T     +G+  ++++S  G 
Sbjct: 2027 GSQTELAIVTADFVKIYDLCVDALSPTFYFLLPSSKIRDVTFLFNEEGKNIIVIMSSAGY 2086

Query: 2005 IFR--LELSVLGNVGATPLKEIIEI-------QGREMSAKGLSLYFSSCYKLLFLAYADG 2064
            I+   +E +     G   +  ++EI          +++  G+S+Y+S   ++LF +Y  G
Sbjct: 2087 IYTQLMEEASSAQQGPFYVTNVLEINHEDLKDSNSQVAGGGVSVYYSHVLQMLFFSYCQG 2146

Query: 2065 TTLVGQLSPDATKLTEISVIYEEEQD--RKLRPAGLHRWKELFAGSGLFVCFSSVKSNSA 2124
             +    +S    ++ ++  I  +  +   K  PA L +W E+    GL VC     +   
Sbjct: 2147 KSFAATISRTTLEVLQLFPINIKSSNGGSKTSPA-LCQWSEVMNHPGL-VCCVQQTTGVP 2206

Query: 2125 LAVSMGAHDIYAQNLR--HAGGSSLPLVGITAYKPLSKDKIHCLVLHDDGSLQIYTHTAV 2184
            L V +       Q ++   A      +V I       + +   ++L +DGSL+IY     
Sbjct: 2207 LVVMVKPDTFLIQEIKTLPAKAKIQDMVAIRHTACNEQQRTTMILLCEDGSLRIYMANVE 2266

Query: 2185 GVD---------ASAYATAEKIKKLGSGILNNKVYASTNPEFPLDFFENTVCITADVRLG 2244
                        +S  +  + ++K  +  +  +   S+   FP+DFFE+   +T DV  G
Sbjct: 2267 NTSYWLQPSLQPSSVISIMKPVRKRKTATITTR--TSSQVTFPIDFFEHNQQLT-DVEFG 2326

Query: 2245 G-DAIRNGDSEGAKQSLASEDGFLESPSSSGFKITVSNSNPDIVMVGFRIHVGNTSANHI 2304
            G D ++  +++  K  L S   ++ +    GF I +SN+N  +VM G RI +G  +    
Sbjct: 2327 GNDLLQVYNAQQIKHRLNSTGMYVANTKPGGFTIEISNNNSTMVMTGMRIQIGTQAIERA 2386

Query: 2305 PSEITIFQRVIKLDEGMRSWYDIPFTVAESLLADEEFSVTVGPAFNGTALPRIDSLEVYG 2364
            PS I IF R ++L+     W+D PFT  E+L AD++ ++ +G + +   +  ID++++YG
Sbjct: 2387 PSYIEIFGRTMQLNLSRSRWFDFPFTREEALQADKKLNLFIGASVDPAGVTMIDAVKIYG 2446

Query: 2365 RAKDEFGWKEKLDAVLDMEARALGSNSLLARSGKKRRSIQCAPIQQQVLADGLKVLSSYY 2424
            + K++FGW ++        + +    S L +S     S   AP           V+SS  
Sbjct: 2447 KTKEQFGWPDEPPEEFPSASVSNICPSNLNQSNGTGDSDSAAPTTTSGTVLERLVVSS-- 2506

Query: 2425 LLRRSQGCPKLNDVNQELTKLKCKQLLETIYESDREPL-LQSAACRVLQAIFPKKEIYYQ 2484
             L   + C  +  + ++       Q L T+  S   P  +Q  +  +L ++   +  Y+ 
Sbjct: 2507 -LEALESCFAVGPIIEKERNKNAAQELATLLLSLPAPASVQQQSKSLLASLHTSRSAYHS 2566

Query: 2485 VKDTMRLTGVVKSTSVLSSRLGVGGAAGGWIIEEFTSQMRAVSKIALHRRSNLACFLERN 2544
             KD   L+  V+  +  SS+ G          E F   +     IA+ R +NL  F E  
Sbjct: 2567 HKDQALLSKAVQCLNT-SSKEGKDLDP-----EVFQRLVITARSIAIMRPNNLVHFTESK 2626

Query: 2545 GSQV----------------------VDGLMQILWGILDLEQPNTQTLNNIVISSVELIY 2604
              Q+                      +  L+   W  L   +P    L    +  +  I 
Sbjct: 2627 LPQMETEGMDEGKEPQKQLEGDCCSFITQLVNHFWK-LHASKPKNAFLAPACLPGLTHIE 2686

Query: 2605 CYAECLA--LHGPDTGR-HSVAPAVVLFKKLLFSSSEAVQASSSLAI------SSRLLQV 2664
                 L   +HG  T     +  A  ++ ++L     AV  S   A+       ++   V
Sbjct: 2687 ATVNALVDIIHGYCTCELDCINTASKIYMQMLLCPDPAVSFSCKQALIRVLRPRNKRRHV 2746

Query: 2665 PFPKQTMLAT---------DDGADIPLSA------------------------------- 2724
              P      T         DD AD  + +                               
Sbjct: 2747 TLPSSPRSNTPMGDKDDDDDDDADEKMQSSGIPNGGHIRQESQEQSEVDHGDFEMVSESM 2806

Query: 2725 ------------PVPTE-----TTGTNPQVMIEED---------AVASSVQYCCDGCSTV 2784
                        P P E       G  P + I  D         A+A S+Q    G S+ 
Sbjct: 2807 VLETAENVNNGNPSPLEALLAGAEGFPPMLDIPPDADDETMVELAIALSLQQDQQGSSSS 2866

Query: 2785 PI-LRRRWHCTICPDFDLCESCYEVLDADRLPSPHSRDHPMTAI---------PIE---- 2844
             + L+        P     ++    L      +P S D   TA          P +    
Sbjct: 2867 ALGLQSLGLSGQAPSSSSLDA--GTLSDTTASAPASDDEGSTAATDGSTLRTSPADHGGS 2926

Query: 2845 VDSLGDGNEYHFATEDINDSSLTSLIPDI----------SVKNPVSSIHVLEPADSGDFS 2904
            V S   G+       + + S  +S   D           SV +   +I        GD S
Sbjct: 2927 VGSESGGSAVDSVAGEHSVSGRSSAYGDATAEGHPAGPGSVSSSTGAISTTTGHQEGDGS 2986

Query: 2905 -----ASVTDPVSISASKQTVNSLLLSELLEQLKGWMETTSGVQAVPVMQLFYRLSSTMG 2964
                       V  S     V  +LL  LL+ L   +    GV+A+P MQ+   L++ + 
Sbjct: 2987 EGEGEGETEGDVHTSNRLHMVRLMLLERLLQTLP-QLRNVGGVRAIPYMQVILMLTTDLD 3046

Query: 2965 GPFMNSLKSENLNLERLIKWFLDEINLNKPFEAK--TRTSFGEVAILVFMFFTLMLRNWH 3024
            G      + +   L+ L+   + E+ ++K   +K   R++  EV ++V    ++ +    
Sbjct: 3047 G----EDEKDKGALDNLLSQLIAELGMDKKDVSKKNERSALNEVHLVVMRLLSVFM---- 3106

Query: 3025 QPGSDGPGAKPSTTTDTHDKNSTQVAPSTSVTAQSSMDDQGKNDFTSQLLRACSSIRQQS 3084
                          + T   + + +  S+S+ + ++                 +++    
Sbjct: 3107 --------------SRTKSGSKSSICESSSLISSAT----------------AAALLSSG 3166

Query: 3085 FVNYLMDVLQQLVHVFKSSTIDYDSGHGFHNGSGCGALLTVRKDLPAGNFSPFFSDSYAK 3144
             V+Y + VL+ L+  +KS   D +             LL         + SPFF   Y K
Sbjct: 3167 AVDYCLHVLKSLLEYWKSQQNDEEP-------VATSQLLKPHTTSSPPDMSPFFLRQYVK 3226

Query: 3145 AHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKVYKIYSSKDLKLDAYQDVLCS 3204
             H  D+F  Y +LL E   RL Y + +    +  +       ++        ++   L  
Sbjct: 3227 GHAADVFEAYTQLLTEMVLRLPYQIKKITDTNSRIPP----PVFD------HSWFYFLSE 3286

Query: 3205 YINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTEVKRLFKYINKVGGF------ 3264
            Y+    T FVRR  R+L L ICGSK  Y  +RD     + V+ + K + + G F      
Sbjct: 3287 YLMIQQTPFVRRQVRKLLLFICGSKEKYRQLRDLHTLDSHVRGIKKLLEEQGIFLRASVV 3346

Query: 3265 ----QNPMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRHADVLPFLLNGIFYFGEESV 3324
                 + + Y+  + +++ L   AE+AA R  NWQK+C++   VL FLL   F   E   
Sbjct: 3347 TASSGSALQYDTLISLMEHLKACAEIAAQRTINWQKFCIKDDSVLYFLLQVSFLVDEGVS 3406

Query: 3325 VQTLKLLN-----------LAFYTGKDIGHSVQKSEAGDTGTSTNKSGTQTVDSRK--KR 3384
               L+LL+           LA  +G     S     A  +G +T +S + T  S+K  K 
Sbjct: 3407 PVLLQLLSCALCGSKVLAALAASSGSSSASSSSAPVAASSGQATTQSKSSTKKSKKEEKE 3466

Query: 3385 KGEDGNDSALEKSYLDMEIM--VNIFVDKGSNVLSHFIDCFLLEWNSSSVRAETKGVVCG 3444
            K +DG  S  ++  L   ++  +N F DK    L  F+ CFLLE NSSSVR +   +   
Sbjct: 3467 KEKDGETSGSQEDQLCTALVNQLNKFADK--ETLIQFLRCFLLESNSSSVRWQAHCLTLH 3526

Query: 3445 IWHHGKQTFKETLLMALLQKVKTLPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDR 3504
            I+ +  ++ +E LL  +      LP YG   A++ +L+ +   K P    K         
Sbjct: 3527 IYRNSSKSQQELLLDLMWSIWPELPAYGRKAAQFVDLLGYFSLKTPQTEKK--------- 3586

Query: 3505 CLTSDVIRSIYQTLHSQNELLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYS 3564
                +  +   + L +QN +L NHPNS IYNTLSGLVEFDGYYLES+PC  C++PEVP+ 
Sbjct: 3587 --LKEYSQKAVEILRTQNHILTNHPNSNIYNTLSGLVEFDGYYLESDPCLVCNNPEVPFC 3646

Query: 3565 RMKLESLKSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLS 3624
             +KL S+K +T++T  + +VK  GS+TI  V + + D +++K V+ +NLYYNNR V  + 
Sbjct: 3647 YIKLSSIKVDTRYTTTQQVVKLIGSHTISKVTVKIGDLKRTKMVRTINLYYNNRTVQAIV 3706

Query: 3625 ELKNNWSLWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPR 3684
            ELKN  + W +AK   L   QTE+K++ P+PI A N MIE   FYEN QA S E LQCPR
Sbjct: 3707 ELKNKPARWHKAKKVQLTPGQTEVKIDLPLPIVASNLMIEFADFYENYQA-STETLQCPR 3766

Query: 3685 CSRPVTDKHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFT 3744
            CS  V    G+C NC EN YQC +CR+INY+  D FLCN CG+ KY RF+F   AKP   
Sbjct: 3767 CSASVPANPGVCGNCGENVYQCHKCRSINYDEKDPFLCNACGFCKYARFDFMLYAKPCCA 3826

Query: 3745 FDNMENDEDMKRGLAAIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQ 3804
             D +EN+ED K+ ++ I +  + A R Y QL+G++  L  ++  + E   +  Q DS   
Sbjct: 3827 VDPIENEEDRKKAVSNINTLLDKADRVYHQLMGHRPQLENLLCKVNEAAPEKPQDDSGTA 3886

Query: 3805 MMVSLPGPSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTY-LHQKH--- 3864
              +S    S  +NR I  L   Y   CK +FD +SK +Q +   R+ L+ Y L Q+    
Sbjct: 3887 GGIS--STSASVNRYILQLAQEYCGDCKNSFDELSKIIQKVFASRKELLEYDLQQREAAT 3946

Query: 3865 ------TDDGFPASRFVI-------SRSPNNCYGCATTFVTQCLEILQVLSKHQSSKKQL 3924
                      F AS++           S   CYGCA+     C+ +L+ L+ + + +  L
Sbjct: 3947 KSSRTSVQPTFTASQYRALSVLGCGHTSSTKCYGCASAVTEHCITLLRALATNPALRHIL 4006

Query: 3925 VSLGILSELFENNIHQGPKTARIQARAVLCSFSEGDVNAVNGLNNLIQKKVMYCLE-HHR 3984
            VS G++ ELF+ N+ +G    R + R ++C  +  +  A   +N+LI  KV   L+ H  
Sbjct: 4007 VSQGLIRELFDYNLRRGAAAMREEVRQLMCLLTRDNPEATQQMNDLIIGKVSTALKGHWA 4066

Query: 3985 SMDIALATREELSLLSEVCSLADEFWEARLRVVFQLLFSSIKSGAKHPAIAEHIILPCLR 4044
            + D+A + + E+ LL++  S  D  WE RLR    L   ++    K P + E+I L CLR
Sbjct: 4067 NPDLASSLQYEMLLLTDSISKEDSCWELRLRCALSLFLMAV--NIKTPVVVENITLMCLR 4126

Query: 4045 IISQACTPP-KSDTVDKEQRMGKLTSVSQNKDENATNISGSFSGPVSGNKSAPESLEHNW 4104
            I+ +   PP  +   +K+  +  LT+V                 P      A   L    
Sbjct: 4127 ILQKLIKPPAPTSKKNKDVPVEALTTVK----------------PYCNEIHAQAQLWLKR 4186

Query: 4105 DSSHRTQDIQLLSYAEWEKGASYLDFVRRQYKVSQVCKGTVQRSRTQKGDYLSLKYALKW 4164
            D           SY  W+K                   G        +  YL+ KY  +W
Sbjct: 4187 DPK--------ASYDAWKKCLPIRGIDG---------NGKAPSKSELRHLYLTEKYVWRW 4246

Query: 4165 KRFVCRNAKSDLSAFEL----GSWVTELVLCACSQSIRSEMCMLISLLCAQSSSRRFRLL 4224
            K+F+ R  K   S  +L     +W+ +++    +Q+ R   C ++  L A   SR+ ++L
Sbjct: 4247 KQFLSRRGKR-TSPLDLKLGHNNWLRQVLFTPATQAARQAACTIVEAL-ATIPSRKQQVL 4306

Query: 4225 DLLVSLLPATLSAGESAAEYFDLLFKMVDSEDARLFLTVRGCLRTICQLISQEVGNVESL 4284
            DLL S L     AGE AAEY  L  K++ S   +++L  RG L  +  LI++E+  + +L
Sbjct: 4307 DLLTSYLDELSIAGECAAEYLALYQKLITSAHWKVYLAARGVLPYVGNLITKEIARLLAL 4366

Query: 4285 ER-SLHIDISQGFILHKLIELLGKFLEIPNIRSRYNFFLKHLFFRLLLFKLIQCDLIVRF 4344
            E  +L  D+ QG+ L  L  LL  F+E+ +I+                          R 
Sbjct: 4367 EEATLSTDLQQGYALKSLTGLLSSFVEVESIK--------------------------RH 4426

Query: 4345 MRDNLLSEVLEALIVIRGLVVQKTKLISDCNRLLKDLLDSLLLESNENKRQFIRACICGL 4404
             +  L+  VL   + +R LVVQ+TKLI +   +L ++L+ +   +    + F+  CI   
Sbjct: 4427 FKSRLVGTVLNGYLCLRKLVVQRTKLIDETQDMLLEMLEDMTTGTESETKAFMAVCIETA 4486

Query: 4405 QIHGEERKGRTCLFILEQLCNLISPSKPEPV-YLLVLNKAHTQEEFIRGSMTKNPYSSAE 4464
            + +  +   RT +FI E+LC++I P + E   + + L K   QE+F++G M  NPYSS E
Sbjct: 4487 KRYNLD-DYRTPVFIFERLCSIIYPEENEVTEFFVTLEKDPQQEDFLQGRMPGNPYSSNE 4546

Query: 4465 --IGPLMRDVKNKICHQLDLLGFLEDDYGMELLVAGNIISLDLSIALVYEQVWKKSNQSS 4524
              IGPLMRD+KNKIC   DL+  LEDD GMELLV   IISLDL +A VY++VW  +N+  
Sbjct: 4547 PGIGPLMRDIKNKICQDCDLVALLEDDSGMELLVNNKIISLDLPVAEVYKKVWCTTNEGE 4606

Query: 4525 NAISNTALISTTAARDSPPMTVTYRLQGLDGEATEPMIKELEEDREESQDPELEFAIAGA 4584
                              PM + YR++GL G+ATE  I+ L+   +E +D E  + +AG 
Sbjct: 4607 ------------------PMRIVYRMRGLLGDATEEFIESLDSTTDEEEDEEEVYKMAGV 4666

Query: 4585 VREYGGLEILLGMIQRIWDNFKSNQEQLVAVLNLLMHCCKIRENRRALLR---------L 4644
            + + GGLE +L  +  I D FK  +  L  +L L  +C K++ NR+ L++         L
Sbjct: 4667 MAQCGGLECMLNRLAGIRD-FKQGRHLLTVLLKLFSYCVKVKVNRQQLVKLEMNTLNVML 4726

Query: 4645 GALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESESISIGQSALTVTSEQTGTGE 4704
            G L L L   + +        AE +L I+E +  E+N +E +S  +  L +T +      
Sbjct: 4727 GTLNLALVAEQESKDSGGAAVAEQVLSIMEIILDESN-AEPLSEDKGNLLLTGD------ 4786

Query: 4705 QAKKIVLMFLERLSHPFGSKKSNKQQRNTEMVARILPYLTYGEPAAMDALIQHFTPYLND 4764
              K  ++M L++++  F     +  Q     + RI+PYL++GE   M  L++ F PY N 
Sbjct: 4787 --KDQLVMLLDQINSTFVRSNPSVLQG----LLRIIPYLSFGEVEKMQILVERFKPYCN- 4846

Query: 4765 WDEFDRLQKQHEDNPEDKSISEQAAKQRFTVENFVRVSESLKTSSCGERLKDIILEKGIT 4824
               FD+  + H            +   +  ++ F +++  +K +S G +LKD+IL+KGIT
Sbjct: 4847 ---FDKYDEDH------------SGDDKVFLDCFCKIAAGIKNNSNGHQLKDLILQKGIT 4906

Query: 4825 GLAIKHLRDSFAVAGQTGFRSSVEWAFALKRPSIPLILSMLRGLSMGHLATQRCIDEGRI 4874
              A+ +++     A       +  W   L RP++P IL +LRGL++ H  TQ  I    I
Sbjct: 4907 QNALDYMKKHIPSAKNL---DADIWKKFLSRPALPFILRLLRGLAIQHPGTQVLIGTDSI 4966

BLAST of HG10018299 vs. ExPASy Swiss-Prot
Match: A2AN08 (E3 ubiquitin-protein ligase UBR4 OS=Mus musculus OX=10090 GN=Ubr4 PE=1 SV=1)

HSP 1 Score: 1189.1 bits (3075), Expect = 0.0e+00
Identity = 1014/3598 (28.18%), Postives = 1624/3598 (45.14%), Query Frame = 0

Query: 1525 TLVLSSNQEEGPASFECDATSAEEDEDDGTSDGEVASLDKDEEEDTNSERALASKVCTFT 1584
            T  LS +  +GP+    D      + D    + E+A  ++D + + + E +L +K+CTFT
Sbjct: 1606 TNALSQSNGQGPSHLSVDGEERAIEVDSDWVE-ELAVEEEDSQAEDSDEDSLCNKLCTFT 1665

Query: 1585 SSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRGHRVVYSRSSRFFCDCGAGGVRGSS 1644
             +   FM QHWY C+TC +    G C+VCAKVCH+ H + Y++   FFCDCGA      S
Sbjct: 1666 ITQKEFMNQHWYHCHTCKMVDGVGVCTVCAKVCHKDHEISYAKYGSFFCDCGA--KEDGS 1725

Query: 1645 CQCLKPRKYTGHGSAPVRGASNFQCFLPFSE----EGDQLPESESDLE-DDVSVTDTDKC 1704
            C  L  R     G +     S FQ     SE         P  ++ +   D  VTD +K 
Sbjct: 1726 CLALVKRT-PSSGMSSTMKESAFQSEPRVSESLVRHASTSPADKAKVTISDGKVTDEEKP 1785

Query: 1705 LRPSVPRELLDGVSVLLEELDVEGR------MLELCSCLLPTI-TNQRDPDL---SKDKK 1764
             + S+ R     V    EEL  +        +L++ S L+  I TN +       S   +
Sbjct: 1786 KKSSLCRT----VEGCREELQNQANFSFAPLVLDMLSFLMDAIQTNFQQASAVGSSSRAQ 1845

Query: 1765 IILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSLVKSLLSVSIR 1824
              L +   +  G+++         GS +      + N + +      G  ++ L+S  + 
Sbjct: 1846 QALSELHTVDKGVEMTDQLMVPTLGSQE----GAFENVR-MNYSGDQGQTIRQLISAHVL 1905

Query: 1825 GRLAV----------------GEGDKVSIFDIRQLIEQTTVAPMTADKTNVKPLSKNVVR 1884
             R+A+                 E  K+++  +  L++Q   A  +  K  +  L+   V 
Sbjct: 1906 RRVAMCVLSSPHGRRQHLAVSHEKGKITVLQLSALLKQ---ADSSKRKLTLTRLASAPVP 1965

Query: 1885 FEIVHLAFNPTVENYLAVAGYEDCQVLTLNHRGEVVDRLAIELAL-QGAHIKRMEWVPGS 1944
            F ++ L  NP  E+YLAV G +DC VLT +  G V D L +   L  G  I +  W+PGS
Sbjct: 1966 FTVLSLTGNPCKEDYLAVCGLKDCHVLTFSSSGSVSDHLVLHPQLATGNFIIKAVWLPGS 2025

Query: 1945 QVQLMVVTNRFVKIYDLSLDNISPMHYFTLPDDMVVDATLSTASQGRMFLIVLSENGRIF 2004
            Q +L +VT  FVKIYDLS+D +SP  YF LP   + D T     +G+  ++++S  G ++
Sbjct: 2026 QTELAIVTADFVKIYDLSIDALSPTFYFLLPSSKIRDVTFLFNEEGKNIIVIMSSAGYMY 2085

Query: 2005 R--LELSVLGNVGATPLKEIIEI-------QGREMSAKGLSLYFSSCYKLLFLAYADGTT 2064
               +E +     G   +  ++EI          +++  G+S+Y+S   ++LF +Y+ G +
Sbjct: 2086 TQLMEEASSAQQGPFYVTNVLEINHEDLKDSNSQVAGGGVSVYYSHVLQMLFFSYSQGRS 2145

Query: 2065 LVGQLSPDATKLTEISVIYEEEQD--RKLRPAGLHRWKELFAGSGLFVCFSSVKS-NSAL 2124
                +S    ++ ++  I  +  +   K  PA L +W E+    GL  C          +
Sbjct: 2146 FAATVSRSTLEVLQLFPINIKSSNGGSKTSPA-LCQWSEVMNHPGLVCCVQQTTGVPLVV 2205

Query: 2125 AVSMGAHDIYAQNLRHAGGSSLPLVGITAYKPLSKDKIHCLVLHDDGSLQIYTHTAVGVD 2184
             V  G   I       A      +V I       + +   ++L +DGSL+IY        
Sbjct: 2206 MVKPGTFLIQEIKTLPAKAKIQDMVAIRHTACNEQQRTTMILLCEDGSLRIYMANVENTS 2265

Query: 2185 ---------ASAYATAEKIKKLGSGILNNKVYASTNPEFPLDFFENTVCITADVRLGG-D 2244
                     +S  +  + ++K  +  +  +   S+   FP+DFFE+   +T DV  GG D
Sbjct: 2266 YWLQPSLQPSSVISIMKPVRKRKTATITAR--TSSQVTFPIDFFEHNQQLT-DVEFGGND 2325

Query: 2245 AIRNGDSEGAKQSLASEDGFLESPSSSGFKITVSNSNPDIVMVGFRIHVGNTSANHIPSE 2304
             ++  +++  K  L S   ++ +    GF I +SN++  +VM G RI +G  +    PS 
Sbjct: 2326 LLQVYNAQQIKHRLNSTGMYVANTKPGGFTIEISNNSSTMVMTGMRIQIGTQAIERAPSY 2385

Query: 2305 ITIFQRVIKLDEGMRSWYDIPFTVAESLLADEEFSVTVGPAFNGTALPRIDSLEVYGRAK 2364
            I IF R ++L+     W+D PFT  E+L AD + S+ +G + +   +  ID++++YG+ K
Sbjct: 2386 IEIFGRTMQLNLSRSRWFDFPFTREEALQADRKLSLFIGASVDPAGVTMIDAVKIYGKTK 2445

Query: 2365 DEFGWKEK------LDAVLDMEARALGSNSLLARSGKKRRSIQCAPIQQQVLADGLKVLS 2424
            ++FGW ++        +V ++    L  ++    S     +     + ++++   L+ L 
Sbjct: 2446 EQFGWPDEPPEDFPSASVSNICPPNLNQSNGTGESDSAAPATTSGTVLERLVVSSLEALE 2505

Query: 2425 SYYLLRRSQGCPKLNDVNQELTKLKCKQLLETIYESDREPLLQSAACRVLQAIFPKKEIY 2484
            S + +           + +E  K   ++L   +        +Q  +  +L ++   +  Y
Sbjct: 2506 SCFAVGPI--------IEKERNKHAAQELATLLLSLPAPASVQQQSKSLLASLHSSRSAY 2565

Query: 2485 YQVKDTMRLTGVVKSTSVLSSRLGVGGAAGGWIIEEFTSQMRAVSKIALHRRSNLACFLE 2544
            +  KD   L+  V+  +  SS+ G          E F   +     IA+ R +NL  F E
Sbjct: 2566 HSHKDQALLSKAVQCLNT-SSKEGKDLDP-----EVFQRLVITARSIAVTRPNNLVHFTE 2625

Query: 2545 R---------------------NGSQVVDGLMQILWGILDLEQPNTQTLNNIVISSVELI 2604
                                  +G   +  L+   W  L   +P    L    +  +  I
Sbjct: 2626 SKLPQMETEGADEGKEPQKQEGDGCSFITQLVNHFWK-LHASKPKNAFLAPACLPGLTHI 2685

Query: 2605 YCYAECLA--LHGPDTGR-HSVAPAVVLFKKLLFSSSEAVQASSSLAI------SSRLLQ 2664
                  L   +HG  T     +  A  ++ ++L     AV  S   A+       ++   
Sbjct: 2686 EATVNALVDIIHGYCTCELDCINTASKIYMQMLLCPDPAVSFSCKQALIRVLRPRNKRRH 2745

Query: 2665 VPFPKQTMLAT---------DDGADIPLSA------------------------------ 2724
            V  P      T         DD AD  + +                              
Sbjct: 2746 VTLPSSPRSNTPMGDKDDDDDDDADEKMQSSGIPDGGHIRQESQEQSEVDHGDFEMVSES 2805

Query: 2725 -------------PVPTE-----TTGTNPQVMIEED---------AVASSVQYCCDGCST 2784
                         P P E       G  P + I  D         A+A S+Q    G S+
Sbjct: 2806 MVLETAENVNNGNPSPLEALLAGAEGFPPMLDIPPDADDETMVELAIALSLQQDQQGSSS 2865

Query: 2785 VPI-LRRRWHCTICPDFDLCESCYEVLDADRLPSPHSRDHPMTAI---------PIE--- 2844
              + L+        P     ++    L      +P S D   TA          P +   
Sbjct: 2866 SALGLQSLGLSGQAPSSSSLDA--GTLSDTTASAPASDDEGSTAATDGSTLRTSPADHGG 2925

Query: 2845 -VDSLGDGNEYHFATEDINDSSLTSLIPDI----------SVKNPVSSIHVLEPADSGDF 2904
             V S   G+       + + S  +S   D           SV +   +I        GD 
Sbjct: 2926 SVGSESGGSAVDSVAGEHSVSGRSSAYGDATAEGHPAGPGSVSSSTGAISTATGHQEGDG 2985

Query: 2905 S-----ASVTDPVSISASKQTVNSLLLSELLEQLKGWMETTSGVQAVPVMQLFYRLSSTM 2964
            S           V  S     V  +LL  LL+ L   +    GV+A+P MQ+   L++ +
Sbjct: 2986 SEGEGEGEAEGDVHTSNRLHMVRLMLLERLLQTLP-QLRNVGGVRAIPYMQVILMLTTDL 3045

Query: 2965 GGPFMNSLKSENLNLERLIKWFLDEINLNKPFEAK--TRTSFGEVAILVFMFFTLMLRNW 3024
             G      + +   L+ L+   + E+ ++K   +K   R++  EV ++V    ++ +   
Sbjct: 3046 DG----EDEKDKGALDNLLAQLIAELGMDKKDVSKKNERSALNEVHLVVMRLLSVFM--- 3105

Query: 3025 HQPGSDGPGAKPSTTTDTHDKNSTQVAPSTSVTAQSSMDDQGKNDFTSQLLRACSSIRQQ 3084
                           + T   + + +  S+S+ + ++                 +++   
Sbjct: 3106 ---------------SRTKSGSKSSICESSSLISSAT----------------AAALLSS 3165

Query: 3085 SFVNYLMDVLQQLVHVFKSSTIDYDSGHGFHNGSGCGALLTVRKDLPAGNFSPFFSDSYA 3144
              V+Y + VL+ L+  +KS   D +             LL         + SPFF   Y 
Sbjct: 3166 GAVDYCLHVLKSLLEYWKSQQSDEEP-------VAASQLLKPHTTSSPPDMSPFFLRQYV 3225

Query: 3145 KAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKVYKIYSSKDLKLDAYQDVLC 3204
            K H  D+F  Y +LL E   RL Y + +       +       ++        ++   L 
Sbjct: 3226 KGHAADVFEAYTQLLTEMVLRLPYQIKKIADTSSRIPP----PVFD------HSWFYFLS 3285

Query: 3205 SYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTEVKRLFKYINKVGGF----- 3264
             Y+    T FVRR  R+L L ICGSK  Y  +RD     + V+ + K + + G F     
Sbjct: 3286 EYLMIQQTPFVRRQVRKLLLFICGSKEKYRQLRDLHTLDSHVRGIKKLLEEQGIFLRASV 3345

Query: 3265 -----QNPMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRHADVLPFLLNGIFYFGEES 3324
                  + + Y+  + +++ L   AE+AA R  NWQK+C++   VL FLL   F   E  
Sbjct: 3346 VTASSGSALQYDTLISLMEHLKACAEIAAQRTINWQKFCIKDDSVLYFLLQVSFLVDEGV 3405

Query: 3325 VVQTLKLLN-----------LAFYTGKDIGHSVQKSEAGDTGTSTNKSGTQTVDSRK--K 3384
                L+LL+           LA  TG     S     A  +G +T +S + T  S+K  K
Sbjct: 3406 SPVLLQLLSCALCGSKVLAALAASTGSSSVASSSAPPAASSGQATTQSKSSTKKSKKEEK 3465

Query: 3385 RKGEDGNDSALEKSYLDMEIM--VNIFVDKGSNVLSHFIDCFLLEWNSSSVRAETKGVVC 3444
             K ++G  S  ++  L   ++  +N F DK    L  F+ CFLLE NSSSVR +   +  
Sbjct: 3466 EKEKEGESSGSQEDQLCTALVNQLNRFADK--ETLIQFLRCFLLESNSSSVRWQAHCLTL 3525

Query: 3445 GIWHHGKQTFKETLLMALLQKVKTLPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLD 3504
             I+ +  +  +E LL  +      LP YG   A++ +L+ +   K      K        
Sbjct: 3526 HIYRNSNKAQQELLLDLMWSIWPELPAYGRKAAQFVDLLGYFSLKTAQTEKK-------- 3585

Query: 3505 RCLTSDVIRSIYQTLHSQNELLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPY 3564
                 +  +   + L +QN +L NHPNS IYNTLSGLVEFDGYYLES+PC  C++PEVP+
Sbjct: 3586 ---LKEYSQKAVEILRTQNHILTNHPNSNIYNTLSGLVEFDGYYLESDPCLVCNNPEVPF 3645

Query: 3565 SRMKLESLKSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADL 3624
              +KL S+K +T++T  + +VK  GS+TI  V + + D +++K V+ +NLYYNNR V  +
Sbjct: 3646 CYIKLSSIKVDTRYTTTQQVVKLIGSHTISKVTVKIGDLKRTKMVRTINLYYNNRTVQAI 3705

Query: 3625 SELKNNWSLWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCP 3684
             ELKN  + W +AK   L   QTE+K++ P+PI A N MIE   FYEN QA S E LQCP
Sbjct: 3706 VELKNKPARWHKAKKVQLTPGQTEVKIDLPLPIVASNLMIEFADFYENYQA-STETLQCP 3765

Query: 3685 RCSRPVTDKHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSF 3744
            RCS  V    G+C NC EN YQC +CR+INY+  D FLCN CG+ KY RF+F   AKP  
Sbjct: 3766 RCSASVPANPGVCGNCGENVYQCHKCRSINYDEKDPFLCNACGFCKYARFDFMLYAKPCC 3825

Query: 3745 TFDNMENDEDMKRGLAAIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQ 3804
              D +EN+ED K+ ++ I +  + A R Y QL+G++  L  ++  + E   +  Q+DS  
Sbjct: 3826 AVDPIENEEDRKKAVSNINTLLDKADRVYHQLMGHRPQLENLLCKVNEAAPEKPQEDSGT 3885

Query: 3805 QMMVSLPGPSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTY-LHQKH-- 3864
               +S    S  +NR I  L   Y   CK +FD +SK +Q +   R+ L+ Y L Q+   
Sbjct: 3886 AGGIS--STSASVNRYILQLAQEYCGDCKNSFDELSKIIQKVFASRKELLEYDLQQREAA 3945

Query: 3865 -------TDDGFPASRFVI-------SRSPNNCYGCATTFVTQCLEILQVLSKHQSSKKQ 3924
                       F AS++           S   CYGCA+     C+ +L+ L+ + + +  
Sbjct: 3946 TKSSRTSVQPTFTASQYRALSVLGCGHTSSTKCYGCASAVTEHCITLLRALATNPALRHI 4005

Query: 3925 LVSLGILSELFENNIHQGPKTARIQARAVLCSFSEGDVNAVNGLNNLIQKKVMYCLE-HH 3984
            LVS G++ ELF+ N+ +G    R + R ++C  +  +  A   +N+LI  KV   L+ H 
Sbjct: 4006 LVSQGLIRELFDYNLRRGAAAIREEVRQLMCLLTRDNPEATQQMNDLIIGKVSTALKGHW 4065

Query: 3985 RSMDIALATREELSLLSEVCSLADEFWEARLRVVFQLLFSSIKSGAKHPAIAEHIILPCL 4044
             + D+A + + E+ LL++  S  D  WE RLR    L   ++    K P + E+I L CL
Sbjct: 4066 ANPDLASSLQYEMLLLTDSISKEDSCWELRLRCALSLFLMAV--NIKTPVVVENITLMCL 4125

Query: 4045 RIISQACTPP-KSDTVDKEQRMGKLTSVSQNKDENATNISGSFSGPVSGNKSAPESLEHN 4104
            RI+ +   PP  +   +K+  +  LT+V                 P      A   L   
Sbjct: 4126 RILQKLIKPPAPTSKKNKDVPVEALTTVK----------------PYCNEIHAQAQLWLK 4185

Query: 4105 WDSSHRTQDIQLLSYAEWEKGASYLDFVRRQYKVSQVCKGTVQRSRTQKGDYLSLKYALK 4164
             D           SY  W+K    L             K  + R       YL+ KY  +
Sbjct: 4186 RDPK--------ASYEAWKK---CLPIRGVDGNGKSPSKSELHRL------YLTEKYVWR 4245

Query: 4165 WKRFVCRNAKSDLSA-FELG--SWVTELVLCACSQSIRSEMCMLISLLCAQSSSRRFRLL 4224
            WK+F+ R  K       +LG  +W+ +++    +Q+ R   C ++  L A   SR+ ++L
Sbjct: 4246 WKQFLSRRGKRTTPLDLKLGHNNWLRQVLFTPATQAARQAACTIVEAL-ATVPSRKQQVL 4305

Query: 4225 DLLVSLLPATLSAGESAAEYFDLLFKMVDSEDARLFLTVRGCLRTICQLISQEVGNVESL 4284
            DLL S L     AGE AAEY  L  K++ S   +++L  RG L  +  LI++E+  + +L
Sbjct: 4306 DLLTSYLDELSVAGECAAEYLALYQKLIASCHWKVYLAARGVLPYVGNLITKEIARLLAL 4365

Query: 4285 ER-SLHIDISQGFILHKLIELLGKFLEIPNIRSRYNFFLKHLFFRLLLFKLIQCDLIVRF 4344
            E  +L  D+ QG+ L  L  LL  F+E+ +I+                          R 
Sbjct: 4366 EEATLSTDLQQGYALKSLTGLLSSFVEVESIK--------------------------RH 4425

Query: 4345 MRDNLLSEVLEALIVIRGLVVQKTKLISDCNRLLKDLLDSLLLESNENKRQFIRACICGL 4404
             +  L+  VL   + +R LV+Q+TKLI +   +L ++L+ +   +    + F+  CI   
Sbjct: 4426 FKSRLVGTVLNGYLCLRKLVLQRTKLIDETQDMLLEMLEDMTTGTESETKAFMAVCIETA 4485

Query: 4405 QIHGEERKGRTCLFILEQLCNLISPSKPEPV-YLLVLNKAHTQEEFIRGSMTKNPYSSAE 4464
            + +  +   RT +FI E+LC++I P + E   + + L K   QE+F++G M  NPYSS E
Sbjct: 4486 KRYNLD-DYRTPVFIFERLCSIIYPEENEVTEFFVTLEKDPQQEDFLQGRMPGNPYSSNE 4545

Query: 4465 --IGPLMRDVKNKICHQLDLLGFLEDDYGMELLVAGNIISLDLSIALVYEQVWKKSNQSS 4524
              IGPLMRD+KNKIC   DL+  LEDD GMELLV   IISLDL +A VY++VW  +N+  
Sbjct: 4546 PGIGPLMRDIKNKICQDCDLVALLEDDSGMELLVNNKIISLDLPVAEVYKKVWCATNEGE 4605

Query: 4525 NAISNTALISTTAARDSPPMTVTYRLQGLDGEATEPMIKELEEDREESQDPELEFAIAGA 4584
                              PM + YR++GL G+ATE  I+ L+   +E +D E  + +AG 
Sbjct: 4606 ------------------PMRIVYRMRGLLGDATEEFIESLDSTTDEEEDEEEVYRMAGV 4665

Query: 4585 VREYGGLEILLGMIQRIWDNFKSNQEQLVAVLNLLMHCCKIRENRRALLR---------L 4644
            + + GGL+ +L  +  + D FK  +  L  +L L  +C K++ NR+ L++         L
Sbjct: 4666 MAQCGGLQCMLNRLAGVKD-FKQGRHLLTVLLKLFSYCVKVKVNRQQLVKLETNTLNVML 4725

Query: 4645 GALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESESISIGQSALTVTSEQTGTGE 4704
            G L L L   + +        AE +L I+E +  E+N +E +S  +  L +T +      
Sbjct: 4726 GTLNLALVAEQESKDSGGAAVAEQVLSIMEIILDESN-AEPLSEDKGNLLLTGD------ 4785

Query: 4705 QAKKIVLMFLERLSHPFGSKKSNKQQRNTEMVARILPYLTYGEPAAMDALIQHFTPYLND 4764
              K  ++M L++++  F     +  Q     + RI+PYL++GE   M  L++ F PY + 
Sbjct: 4786 --KDQLVMLLDQINSTFVRSNPSVLQG----LLRIIPYLSFGEVEKMQILVERFKPYCS- 4845

Query: 4765 WDEFDRLQKQHEDNPEDKSISEQAAKQRFTVENFVRVSESLKTSSCGERLKDIILEKGIT 4824
                   +K  ED+  D  +          ++ F +++  +K +S G +LKD+IL+KGIT
Sbjct: 4846 ------FEKYDEDHSGDDKV---------FLDCFCKIAAGIKNNSNGHQLKDLILQKGIT 4905

Query: 4825 GLAIKHLRDSFAVAGQTGFRSSVEWAFALKRPSIPLILSMLRGLSMGHLATQRCIDEGRI 4874
              A+ +++     A       +  W   L RP++P IL +LRGL+M H ATQ  I    I
Sbjct: 4906 QNALDYMKKHIPSAKNL---DADIWKKFLSRPALPFILRLLRGLAMQHPATQVLIGTDSI 4963

BLAST of HG10018299 vs. ExPASy TrEMBL
Match: A0A1S3CBB8 (auxin transport protein BIG OS=Cucumis melo OX=3656 GN=LOC103498551 PE=3 SV=1)

HSP 1 Score: 9174.7 bits (23806), Expect = 0.0e+00
Identity = 4661/4871 (95.69%), Postives = 4738/4871 (97.27%), Query Frame = 0

Query: 1    MAEQSFVKLLDTIFLDDSSTSANTRKHFSSSDLLQLLRSDDSSIKLGLRQFYSILKAGLR 60
            MAE+SFVKLLDTIFLDDSST+ NT+K FSSSDLLQLLRSDDSS+KLGLRQFYSIL+ GLR
Sbjct: 1    MAEESFVKLLDTIFLDDSSTTVNTKKPFSSSDLLQLLRSDDSSVKLGLRQFYSILEVGLR 60

Query: 61   DLGDGNFAFQSWTDPQIQAVCSIAHAIASASRSLTVDQAEAIVVAVIKKSLELVFCYLEK 120
            DLGDGNFAFQSWTDPQIQAVCSIA+AIASASRSLTVDQAEAIVVAVIKKSLE VFCYLEK
Sbjct: 61   DLGDGNFAFQSWTDPQIQAVCSIAYAIASASRSLTVDQAEAIVVAVIKKSLEFVFCYLEK 120

Query: 121  SEFKCDDFSIQNNMLMILETILVDGMDKVSDFAQLCAKKSLMDLLKSTGGDCDATIEFDN 180
            SEFKCDDFSIQN MLMILETILVDGMDKVSD AQ C KK L+DLLKS GGD DATIEF+N
Sbjct: 121  SEFKCDDFSIQNTMLMILETILVDGMDKVSDCAQHCTKKDLIDLLKSFGGDFDATIEFNN 180

Query: 181  TIECGSTGVCCSREEKQVGRLLMTIAAECVQADQLTSESGFSQPTFLEDMNKLIFLCQHW 240
            T ECG TGVCCSREEKQVGRLLMTIAAEC QAD LTSE GFS+PTF E+MNKLIFLCQHW
Sbjct: 181  TAECGFTGVCCSREEKQVGRLLMTIAAECEQADHLTSEPGFSEPTFFENMNKLIFLCQHW 240

Query: 241  AVTHLACIQHLILICKELVVLPDALDEKTGSTSFRKRLSCSLRILKLLTDLSKKFPYIEY 300
            AVTHLACIQHLILICKELVVLPDALDEKTGSTSFRKRLSCSLRILKLLTDLSKKFPYIEY
Sbjct: 241  AVTHLACIQHLILICKELVVLPDALDEKTGSTSFRKRLSCSLRILKLLTDLSKKFPYIEY 300

Query: 301  DAKMMQAFALFANSLPCLFGLCFEFANSHATVEGSFENTILLLLEEFLELVQVVFRNSYV 360
            D K+MQAFAL ANSLPCLFGLCFEFANSHAT E SFENTILLLLEEFLELVQVVFRNSY+
Sbjct: 301  DDKLMQAFALLANSLPCLFGLCFEFANSHATGESSFENTILLLLEEFLELVQVVFRNSYI 360

Query: 361  SVNIQTCVVASILDNLSSSVWRYDASTANLKTPLVYFPRSVMVIIKLIQDLKGHKYHAFS 420
             VNIQTC+VASILDNLSSSVWRYDASTANLK PLVYFPR VMVIIKLIQDLKGHKYHAFS
Sbjct: 361  CVNIQTCIVASILDNLSSSVWRYDASTANLKPPLVYFPRGVMVIIKLIQDLKGHKYHAFS 420

Query: 421  FKDLETHHTSTLADLSVDIPKCYARLEIVPLHKNYKVEEILRMIFPLSKQWMDDLMHLLF 480
            FKDLE H  STLA+LSVD+PKC+A LE VPLHKNY VEEILRMIFP S+QWMDDLMHLLF
Sbjct: 421  FKDLEMHQMSTLAELSVDLPKCHAPLETVPLHKNYTVEEILRMIFPPSRQWMDDLMHLLF 480

Query: 481  FLYSEGVRLRPKIERSLSSMKSSSTVEQETAVCHEDEALFGDLFSESGRSVGSVDGYDLQ 540
            FLYSEG+RLRPKIERSLSSMKSSSTVEQE AVCHEDEALFGDLFSESGRSVGSVDGYDLQ
Sbjct: 481  FLYSEGMRLRPKIERSLSSMKSSSTVEQEAAVCHEDEALFGDLFSESGRSVGSVDGYDLQ 540

Query: 541  HLAVNSTSSFCNLLLQAAKELLSFIKLCIFSPEWNASVFDDGCNKLNQNHIDILLSLLNC 600
            HLAVNSTSSFCNLLLQAAKELLSFIKLCIFSPEWNASVFDDGCNKLNQNHIDILLSLLNC
Sbjct: 541  HLAVNSTSSFCNLLLQAAKELLSFIKLCIFSPEWNASVFDDGCNKLNQNHIDILLSLLNC 600

Query: 601  EGCCSDDKSSASCLPAHDERKSGHIHEICYRLLHGLLTRHALPDSLEEYLVKKILNAENG 660
            EGCCSDDKSSASCLPAHDERKSGHIHEICYRLLHGLLTRHALPDSLEEYLVKKILNAENG
Sbjct: 601  EGCCSDDKSSASCLPAHDERKSGHIHEICYRLLHGLLTRHALPDSLEEYLVKKILNAENG 660

Query: 661  NFVYNDQTLSLLAHTLFRRTGVAGTLLRTQIYRQFVEFIIEKSKTISSNYSSLQEFMGTL 720
            N VYNDQTLSLLAHTLFRRTGVAGT LRTQIYRQFVEFIIEKSKTIS  YSSLQEFMGTL
Sbjct: 661  NSVYNDQTLSLLAHTLFRRTGVAGTQLRTQIYRQFVEFIIEKSKTISLKYSSLQEFMGTL 720

Query: 721  PSVFHIEILLVAFHLSSEGEKREISSLIFSSIRAIDAPSTFSNCTELSMWGLLVSRLIIV 780
            PSVFHIEILLVAFHL SEGEKREISSLIFSSIRAIDAPSTFSNCTELSMWGLLVSRLI+V
Sbjct: 721  PSVFHIEILLVAFHLFSEGEKREISSLIFSSIRAIDAPSTFSNCTELSMWGLLVSRLIVV 780

Query: 781  LRHIIFHPHTCSSSLLFDFRSKLRDAPAFSSSLPYTLNDHLSSWGASVAKNIIGSSVESK 840
            LRHIIFHPHTCSSSLLFDFRSKLRDAPAFSS LPYT+NDHLSSWGASVAK+IIGSS+ESK
Sbjct: 781  LRHIIFHPHTCSSSLLFDFRSKLRDAPAFSSHLPYTVNDHLSSWGASVAKSIIGSSMESK 840

Query: 841  PFFHSLINQLIDISSFPASLRQHDLTVECPWFNAGDIFSTFSWILGFWNGKQAVTVEDLI 900
            PF +SLINQLIDISSFPASLRQHDLT+ECPWFN  DIFSTFSWILGFWNGKQAVTVEDLI
Sbjct: 841  PFLNSLINQLIDISSFPASLRQHDLTIECPWFNPSDIFSTFSWILGFWNGKQAVTVEDLI 900

Query: 901  IERYIFVLCWDFPSMNALSHGGPLWSDPDTLDISNTTCFFYFSYLLLDHGGVIGEHMKFP 960
            IERYIFVLCWDFPS NALSHGGPLWSD D LDIS T CFFYFSYLLLDHGGVI EHMKFP
Sbjct: 901  IERYIFVLCWDFPSTNALSHGGPLWSDLDALDISKTACFFYFSYLLLDHGGVIDEHMKFP 960

Query: 961  QVVIGLLQRLHGGSILEDFKALGWNFLRNGAWLSLVLSFLSVGIWRYCSKNMIPTVGSLL 1020
            QVVIGLL+RLHGGS+LEDFKALGWNFLRNG WLSL+LSFL VGI RYCSKN IPTVGS L
Sbjct: 961  QVVIGLLRRLHGGSVLEDFKALGWNFLRNGTWLSLILSFLGVGISRYCSKNKIPTVGSFL 1020

Query: 1021 TDTTVTDNEQANFAESLISSVITDSQVSILIRELSSVLSMYLQVYQKAFVATLSSSNDHA 1080
            TDTTVTD+EQANFAESLISSVI DSQV ILIRELSSVLSMYL+VYQKA+VATLSSSNDHA
Sbjct: 1021 TDTTVTDSEQANFAESLISSVIIDSQVPILIRELSSVLSMYLRVYQKAYVATLSSSNDHA 1080

Query: 1081 TEFSPLLLFKHSEFDSCVQNKTLENYGTTSCLLESVFNLMSRLDEIVDKRTLGFLSRFCW 1140
            TEFSPLLLFKHS+FD CVQNKTLENYGTTSCLLESVFNLMSRLDEIVDKRTLGF SR CW
Sbjct: 1081 TEFSPLLLFKHSKFDRCVQNKTLENYGTTSCLLESVFNLMSRLDEIVDKRTLGFSSRVCW 1140

Query: 1141 ESMFHGFPSHLETSSGILLSCVLSIGRIISVLAGLLRIVDVKRNIILETEVTRGILDAVM 1200
            ESMFHGFPSHLETSSGILLSCVLS GRIISVLAGLLRIVDVKR++ILETEVTRGILDAVM
Sbjct: 1141 ESMFHGFPSHLETSSGILLSCVLSTGRIISVLAGLLRIVDVKRSVILETEVTRGILDAVM 1200

Query: 1201 TIKFDKTFESVHGLCEGIYQSLNKELDGCSYGVLFLLKQLEGYLRHMNMRGASDSTIHEL 1260
            T+KFDKTFESVHGLCEGIYQSLN ELDGCSYGVLFLLKQLE YLRH+NMRG SDSTIHEL
Sbjct: 1201 TVKFDKTFESVHGLCEGIYQSLNAELDGCSYGVLFLLKQLEEYLRHINMRGVSDSTIHEL 1260

Query: 1261 VIVKATDIMDNLRKDVSKSSVFQFYLGAEVVLEQVRELYTFQHGNLLVLLDSLDNCCSEL 1320
            VIVKA DIMD+LRKDVSKSSVFQFYLGAE V EQVRELY FQHGNLLVLLDSLDNCCSEL
Sbjct: 1261 VIVKAIDIMDSLRKDVSKSSVFQFYLGAEDVPEQVRELYAFQHGNLLVLLDSLDNCCSEL 1320

Query: 1321 VNLKVLGFFVELLSGEPCPKLKQEVQNKFLSMDLLSLSQWLEKRIFGLVAEDSSGGNVKG 1380
            VNLKVLGFFV+LLSGEPCPKLKQEVQNKFL MDLLSLS+WLEKRIFGLVAEDSSG NVKG
Sbjct: 1321 VNLKVLGFFVDLLSGEPCPKLKQEVQNKFLYMDLLSLSKWLEKRIFGLVAEDSSGVNVKG 1380

Query: 1381 SSISLRESSMNFVFCLISSPSEPLALQLQSHIFEAALVSLDMAFLRFDISVSKSYFHFVV 1440
            SSISLRESSMNFVFCLISSPSEPLA QLQSHIFEAALVSLDMAFLRFDISVSKSYFHFVV
Sbjct: 1381 SSISLRESSMNFVFCLISSPSEPLAHQLQSHIFEAALVSLDMAFLRFDISVSKSYFHFVV 1440

Query: 1441 QLLKGDKSMKLLLERILILMGKLASDERLLPGLKYLFSFLEMILIESGSGKNVFERPSGK 1500
            QLLKGDKSMKLLLERIL+LM KLA+DERLLPGLKYLF+FLEMILIESGSGKNVFER SGK
Sbjct: 1441 QLLKGDKSMKLLLERILVLMEKLANDERLLPGLKYLFNFLEMILIESGSGKNVFERTSGK 1500

Query: 1501 PLSRYAPEVGPLSSKSVGPRKNSETLVLSSNQEEGPASFECDATSAEEDEDDGTSDGEVA 1560
            PLSRYAPEVGPLSSKSVGPRKNSETLVLSSNQEEGPASF+CDATSAEEDEDDGTSDGEVA
Sbjct: 1501 PLSRYAPEVGPLSSKSVGPRKNSETLVLSSNQEEGPASFDCDATSAEEDEDDGTSDGEVA 1560

Query: 1561 SLDKDEEEDTNSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRG 1620
            SLDKDEEED+NSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRG
Sbjct: 1561 SLDKDEEEDSNSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRG 1620

Query: 1621 HRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRKYTGHGSAPVRGASNFQCFLPFSEEGDQL 1680
            HRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRK+TGHGSAPVRGASNFQCFLPFSEEGDQL
Sbjct: 1621 HRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRKFTGHGSAPVRGASNFQCFLPFSEEGDQL 1680

Query: 1681 PESESDLEDDVSVTDTDKCLRPSVPRELLDGVSVLLEELDVEGRMLELCSCLLPTITNQR 1740
            PESESDLEDDVSVTDTDKCLRPSVP ELLDGVSVLLEEL+VEGRMLELCSCLLPTITNQR
Sbjct: 1681 PESESDLEDDVSVTDTDKCLRPSVPMELLDGVSVLLEELNVEGRMLELCSCLLPTITNQR 1740

Query: 1741 DPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSLV 1800
            DPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSL+
Sbjct: 1741 DPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSLM 1800

Query: 1801 KSLLSVSIRGRLAVGEGDKVSIFDIRQLIEQTTVAPMTADKTNVKPLSKNVVRFEIVHLA 1860
            KSLLSVSIRGRLAVGEGDKVSIFD+RQLIEQ TVAPMTADKTNVKPLSKNVVRFEIVHLA
Sbjct: 1801 KSLLSVSIRGRLAVGEGDKVSIFDVRQLIEQATVAPMTADKTNVKPLSKNVVRFEIVHLA 1860

Query: 1861 FNPTVENYLAVAGYEDCQVLTLNHRGEVVDRLAIELALQGAHIKRMEWVPGSQVQLMVVT 1920
            FNPTVENYLAVAGYEDCQVLTLNHRGEVVDRLAIELALQGAHIKRMEWVPGSQVQLMVVT
Sbjct: 1861 FNPTVENYLAVAGYEDCQVLTLNHRGEVVDRLAIELALQGAHIKRMEWVPGSQVQLMVVT 1920

Query: 1921 NRFVKIYDLSLDNISPMHYFTLPDDMVVDATLSTASQGRMFLIVLSENGRIFRLELSVLG 1980
            NRFVKIYDLSLDNISPMHYFTLPDDMVVDATL  ASQG+MFLIVLSENGRIFRLELSVLG
Sbjct: 1921 NRFVKIYDLSLDNISPMHYFTLPDDMVVDATLFIASQGKMFLIVLSENGRIFRLELSVLG 1980

Query: 1981 NVGATPLKEIIEIQGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTEISV 2040
            N+GATPLKEII IQGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTEIS 
Sbjct: 1981 NIGATPLKEIIHIQGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTEISF 2040

Query: 2041 IYEEEQDRKLRPAGLHRWKELFAGSGLFVCFSSVKSNSALAVSMGAHDIYAQNLRHAGGS 2100
            IYEEEQD+KLRPAGLHRWKELFAGSGLFVCFSSVKSNSALAVSMGAH+IYAQNLRHAGGS
Sbjct: 2041 IYEEEQDKKLRPAGLHRWKELFAGSGLFVCFSSVKSNSALAVSMGAHEIYAQNLRHAGGS 2100

Query: 2101 SLPLVGITAYKPLSKDKIHCLVLHDDGSLQIYTHTAVGVDASAYATAEKIKKLGSGILNN 2160
            SLPLVGITAYKPLSK+KIHCLVLHDDGSLQIYTHTAVGVDASA ATAEKIKKLGSGILNN
Sbjct: 2101 SLPLVGITAYKPLSKNKIHCLVLHDDGSLQIYTHTAVGVDASANATAEKIKKLGSGILNN 2160

Query: 2161 KVYASTNPEFPLDFFENTVCITADVRLGGDAIRNGDSEGAKQSLASEDGFLESPSSSGFK 2220
            KVYASTNPEF LDFFE TVCITADVRLGGD IRNGDSEGAKQSLASEDGFLESPSSSGFK
Sbjct: 2161 KVYASTNPEFALDFFEKTVCITADVRLGGDTIRNGDSEGAKQSLASEDGFLESPSSSGFK 2220

Query: 2221 ITVSNSNPDIVMVGFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLLA 2280
            ITVSNSNPDIVMVGFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLLA
Sbjct: 2221 ITVSNSNPDIVMVGFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLLA 2280

Query: 2281 DEEFSVTVGPAFNGTALPRIDSLEVYGRAKDEFGWKEKLDAVLDMEARALGSNSLLARSG 2340
            DEEFSVTVGPAFNGTALPRIDSLEVYGRAKDEFGWKEKLDAVLDMEARA+GSNSLLARSG
Sbjct: 2281 DEEFSVTVGPAFNGTALPRIDSLEVYGRAKDEFGWKEKLDAVLDMEARAIGSNSLLARSG 2340

Query: 2341 KKRRSIQCAPIQQQVLADGLKVLSSYYLLRRSQGCPKLNDVNQELTKLKCKQLLETIYES 2400
            KKRRSIQCAPIQQQVLADGLKV+SSYYLL R QGCPKL+DVNQELTKLKCKQLLETIYES
Sbjct: 2341 KKRRSIQCAPIQQQVLADGLKVMSSYYLLCRPQGCPKLDDVNQELTKLKCKQLLETIYES 2400

Query: 2401 DREPLLQSAACRVLQAIFPKKEIYYQVKDTMRLTGVVKSTSVLSSRLGVGGAAGGWIIEE 2460
            DREPLLQSAACRVLQAIFPKKEIYYQVKDTMRLTGVVKSTSVLS+RLGVGG AGGWIIEE
Sbjct: 2401 DREPLLQSAACRVLQAIFPKKEIYYQVKDTMRLTGVVKSTSVLSTRLGVGGVAGGWIIEE 2460

Query: 2461 FTSQMRAVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVISSV 2520
            FTSQMRAVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVISSV
Sbjct: 2461 FTSQMRAVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVISSV 2520

Query: 2521 ELIYCYAECLALHGPDTGRHSVAPAVVLFKKLLFSSSEAVQASSSLAISSRLLQVPFPKQ 2580
            ELIYCYAECLALHGPDTGRHSVAPAVVLFKKLLFSSSEAVQASSSLAISSRLLQVPFPKQ
Sbjct: 2521 ELIYCYAECLALHGPDTGRHSVAPAVVLFKKLLFSSSEAVQASSSLAISSRLLQVPFPKQ 2580

Query: 2581 TMLATDDGADIPLSAPVPTETTGTNPQVMIEEDAVASSVQYCCDGCSTVPILRRRWHCTI 2640
            TMLATDDGADIPLSAPV TET GTNPQV+IEEDA+ASSVQYCCDGCS VPILRRRWHCTI
Sbjct: 2581 TMLATDDGADIPLSAPVSTETPGTNPQVVIEEDAIASSVQYCCDGCSKVPILRRRWHCTI 2640

Query: 2641 CPDFDLCESCYEVLDADRLPSPHSRDHPMTAIPIEVDSLGDGNEYHFATEDINDSSLTSL 2700
            CPDFDLCESCYEVLDADRLPSPHSRDHPMTAIPIEV+SLGDGNEYHFATEDINDSSLTS+
Sbjct: 2641 CPDFDLCESCYEVLDADRLPSPHSRDHPMTAIPIEVESLGDGNEYHFATEDINDSSLTSV 2700

Query: 2701 IPDISVKNPVSSIHVLEPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWMETT 2760
              DISVKNP SSIHVLEPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWMETT
Sbjct: 2701 RSDISVKNPASSIHVLEPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWMETT 2760

Query: 2761 SGVQAVPVMQLFYRLSSTMGGPFMNSLKSENLNLERLIKWFLDEINLNKPFEAKTRTSFG 2820
            SGVQAVPVMQLFYRLSSTMGGPFMNSLKSENLNLERLIKWFLDEINLNKPFEAKTRTSFG
Sbjct: 2761 SGVQAVPVMQLFYRLSSTMGGPFMNSLKSENLNLERLIKWFLDEINLNKPFEAKTRTSFG 2820

Query: 2821 EVAILVFMFFTLMLRNWHQPGSDGPGAKPSTTTDTHDKNSTQVAPSTSVTAQSSMDDQGK 2880
            EVAILVFMFFTLMLRNWHQPGSDGPGAK ST  DTHDKNSTQVAPSTS+TAQSSMDDQGK
Sbjct: 2821 EVAILVFMFFTLMLRNWHQPGSDGPGAKSSTAADTHDKNSTQVAPSTSLTAQSSMDDQGK 2880

Query: 2881 NDFTSQLLRACSSIRQQSFVNYLMDVLQQLVHVFKSSTIDYDSGHGFHNGSGCGALLTVR 2940
            NDFTSQLLRACSSIRQQSFVNYLMDVLQQLVHVFKSSTIDYDSGHGF+NGSGCGALLTVR
Sbjct: 2881 NDFTSQLLRACSSIRQQSFVNYLMDVLQQLVHVFKSSTIDYDSGHGFNNGSGCGALLTVR 2940

Query: 2941 KDLPAGNFSPFFSDSYAKAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKVYK 3000
            KDLPAGNFSPFFSDSYAKAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKVYK
Sbjct: 2941 KDLPAGNFSPFFSDSYAKAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKVYK 3000

Query: 3001 IYSSKDLKLDAYQDVLCSYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTEVK 3060
            IYSSKDLKLDAYQDVLCSYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTEVK
Sbjct: 3001 IYSSKDLKLDAYQDVLCSYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTEVK 3060

Query: 3061 RLFKYINKVGGFQNPMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRHADVLPFLLNGI 3120
            +LFKY+NKVGGFQNPMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRH DVLPFLLNGI
Sbjct: 3061 KLFKYVNKVGGFQNPMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRHGDVLPFLLNGI 3120

Query: 3121 FYFGEESVVQTLKLLNLAFYTGKDIGHSVQKSEAGDTGTSTNKSGTQTVDSRKKRKGEDG 3180
            FYFGEESV+QTLKLLNLAFYTGKDIGHSVQKSEAGDTGTSTNKSGTQTVD RKK+KGEDG
Sbjct: 3121 FYFGEESVIQTLKLLNLAFYTGKDIGHSVQKSEAGDTGTSTNKSGTQTVDIRKKKKGEDG 3180

Query: 3181 NDSALEKSYLDMEIMVNIFVDKGSNVLSHFIDCFLLEWNSSSVRAETKGVVCGIWHHGKQ 3240
            +DSALEKSYLDME MVNIF+DKGSNVLSHFIDCFLLEWNSSSVRAE KGVVCGIWHHGKQ
Sbjct: 3181 SDSALEKSYLDMETMVNIFIDKGSNVLSHFIDCFLLEWNSSSVRAEAKGVVCGIWHHGKQ 3240

Query: 3241 TFKETLLMALLQKVKTLPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSDVI 3300
            TFKETLLMALLQKVK LPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSDVI
Sbjct: 3241 TFKETLLMALLQKVKNLPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSDVI 3300

Query: 3301 RSIYQTLHSQNELLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLESL 3360
            RSIYQTLHSQNELLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLESL
Sbjct: 3301 RSIYQTLHSQNELLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLESL 3360

Query: 3361 KSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNNWS 3420
            KSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNNWS
Sbjct: 3361 KSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNNWS 3420

Query: 3421 LWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPVTD 3480
            LWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPVTD
Sbjct: 3421 LWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPVTD 3480

Query: 3481 KHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNMEND 3540
            KHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNMEND
Sbjct: 3481 KHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNMEND 3540

Query: 3541 EDMKRGLAAIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSLPG 3600
            EDMKRGL AIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSLPG
Sbjct: 3541 EDMKRGLTAIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSLPG 3600

Query: 3601 PSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASRFV 3660
            PSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASRFV
Sbjct: 3601 PSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASRFV 3660

Query: 3661 ISRSPNNCYGCATTFVTQCLEILQVLSKHQSSKKQLVSLGILSELFENNIHQGPKTARIQ 3720
            ISRSPNNCYGCATTFVTQCLEILQVLSKHQSSKKQLVSLGILSELFENNIHQGPKTARIQ
Sbjct: 3661 ISRSPNNCYGCATTFVTQCLEILQVLSKHQSSKKQLVSLGILSELFENNIHQGPKTARIQ 3720

Query: 3721 ARAVLCSFSEGDVNAVNGLNNLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLADEF 3780
            ARAVLCSFSEGDVNAV+GLNNLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLADEF
Sbjct: 3721 ARAVLCSFSEGDVNAVSGLNNLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLADEF 3780

Query: 3781 WEARLRVVFQLLFSSIKSGAKHPAIAEHIILPCLRIISQACTPPKSDTVDKEQRMGKLTS 3840
            WEARLRVVFQLLFSSIKSGAKHPAIAEHII PCLRIISQACTPPKS+TVDKEQR GKLTS
Sbjct: 3781 WEARLRVVFQLLFSSIKSGAKHPAIAEHIIHPCLRIISQACTPPKSETVDKEQRTGKLTS 3840

Query: 3841 VSQNKDENATNISGSFSGPVSGNKSAPESLEHNWDSSHRTQDIQLLSYAEWEKGASYLDF 3900
            VSQNKDEN TNISGSFSGPVSGNKSAPESLEHNWDSSHRTQDIQLLSYAEWEKGASYLDF
Sbjct: 3841 VSQNKDENTTNISGSFSGPVSGNKSAPESLEHNWDSSHRTQDIQLLSYAEWEKGASYLDF 3900

Query: 3901 VRRQYKVSQVCKGTVQRSRTQKGDYLSLKYALKWKRFVCRNAKSDLSAFELGSWVTELVL 3960
            VRRQYKVSQV KGTVQRSRTQKGDYLSLKYALKWKRFVCRNAKSDLSAFELGSWVTELVL
Sbjct: 3901 VRRQYKVSQVFKGTVQRSRTQKGDYLSLKYALKWKRFVCRNAKSDLSAFELGSWVTELVL 3960

Query: 3961 CACSQSIRSEMCMLISLLCAQSSSRRFRLLDLLVSLLPATLSAGESAAEYFDLLFKMVDS 4020
            CACSQSIRSEMCMLISLLC+QSSSRRFRLLDLLVSLLPATLSAGESAAEYF+LLFKMVDS
Sbjct: 3961 CACSQSIRSEMCMLISLLCSQSSSRRFRLLDLLVSLLPATLSAGESAAEYFELLFKMVDS 4020

Query: 4021 EDARLFLTVRGCLRTICQLISQEVGNVESLERSLHIDISQGFILHKLIELLGKFLEIPNI 4080
            EDARLFLTVRGCLRTICQLISQEV NVESLERSLHIDISQGFILHKLIELLGKFLEIPNI
Sbjct: 4021 EDARLFLTVRGCLRTICQLISQEVSNVESLERSLHIDISQGFILHKLIELLGKFLEIPNI 4080

Query: 4081 RSRYNFFLKHLFFRLLLFKLIQCDLIVRFMRDNLLSEVLEALIVIRGLVVQKTKLISDCN 4140
            RS                         RFMRDNLLSEVLEALIVIRGLVVQKTKLISDCN
Sbjct: 4081 RS-------------------------RFMRDNLLSEVLEALIVIRGLVVQKTKLISDCN 4140

Query: 4141 RLLKDLLDSLLLESNENKRQFIRACICGLQIHGEERKGRTCLFILEQLCNLISPSKPEPV 4200
            RLLKDLLDSLLLESNENKRQFIRACICGLQ HGEERKGRTCLFILEQLCNLISPSKPEPV
Sbjct: 4141 RLLKDLLDSLLLESNENKRQFIRACICGLQNHGEERKGRTCLFILEQLCNLISPSKPEPV 4200

Query: 4201 YLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKICHQLDLLGFLEDDYGMELLV 4260
            YLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKICHQLDLL FLEDDYGMELLV
Sbjct: 4201 YLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKICHQLDLLSFLEDDYGMELLV 4260

Query: 4261 AGNIISLDLSIALVYEQVWKKSNQSSNAISNTALISTTAARDSPPMTVTYRLQGLDGEAT 4320
            AGNIISLDLSIALVYEQVWKKSNQSSNAISNTA+ISTTAARDSPPMTVTYRLQGLDGEAT
Sbjct: 4261 AGNIISLDLSIALVYEQVWKKSNQSSNAISNTAIISTTAARDSPPMTVTYRLQGLDGEAT 4320

Query: 4321 EPMIKELEEDREESQDPELEFAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVLNL 4380
            EPMIKELEEDREESQDPELEFAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVLNL
Sbjct: 4321 EPMIKELEEDREESQDPELEFAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVLNL 4380

Query: 4381 LMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESESIS 4440
            LMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESESIS
Sbjct: 4381 LMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESESIS 4440

Query: 4441 IGQSALTVTSEQTGTGEQAKKIVLMFLERLSHPFGSKKSNKQQRNTEMVARILPYLTYGE 4500
            IGQSALTVTSEQTGTGEQAKKIVLMFLERLSHPFG KKSNKQQRNTEMVARILPYLTYGE
Sbjct: 4441 IGQSALTVTSEQTGTGEQAKKIVLMFLERLSHPFGFKKSNKQQRNTEMVARILPYLTYGE 4500

Query: 4501 PAAMDALIQHFTPYLNDWDEFDRLQKQHEDNPEDKSISEQAAKQRFTVENFVRVSESLKT 4560
            PAAMDALIQHFTPYLNDWDEFDRLQKQHEDNPEDKS+SEQAAKQRFTVENFVRVSESLKT
Sbjct: 4501 PAAMDALIQHFTPYLNDWDEFDRLQKQHEDNPEDKSLSEQAAKQRFTVENFVRVSESLKT 4560

Query: 4561 SSCGERLKDIILEKGITGLAIKHLRDSFAVAGQTGFRSSVEWAFALKRPSIPLILSMLRG 4620
            SSCGERLKDIILEKGITGLAIKHLRD+FAVAGQTGFRSSVEW FALKRPSIPLILSMLRG
Sbjct: 4561 SSCGERLKDIILEKGITGLAIKHLRDTFAVAGQTGFRSSVEWGFALKRPSIPLILSMLRG 4620

Query: 4621 LSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDKVR 4680
            LSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDKVR
Sbjct: 4621 LSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDKVR 4680

Query: 4681 MLRHATRDEMRRLALKNREDMLQGLGMRQVASDGGERIIVSRPALEGLEDVEEEEDGLAC 4740
            MLRHATRDEMRRLALKNREDMLQ LGMRQVASDGGERIIVSRPALEGLEDVEEEEDGLAC
Sbjct: 4681 MLRHATRDEMRRLALKNREDMLQRLGMRQVASDGGERIIVSRPALEGLEDVEEEEDGLAC 4740

Query: 4741 MVCREGYSLRPTDLLGVYSYSKRVNLGVGTSGSTRGECVYTTVSYFNIIHYQCHQEAKRT 4800
            MVCREGYSLRPTDLLGVYSYSKRVNLGVGTSGS+RGECVYTTVSYFNIIHYQCHQEAKRT
Sbjct: 4741 MVCREGYSLRPTDLLGVYSYSKRVNLGVGTSGSSRGECVYTTVSYFNIIHYQCHQEAKRT 4800

Query: 4801 DAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADGNR 4860
            DAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADGNR
Sbjct: 4801 DAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADGNR 4846

Query: 4861 LRLLTYDIVLV 4872
            LRLLTYDIVL+
Sbjct: 4861 LRLLTYDIVLM 4846

BLAST of HG10018299 vs. ExPASy TrEMBL
Match: A0A5A7TDS6 (Auxin transport protein BIG OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold64G002230 PE=3 SV=1)

HSP 1 Score: 9167.4 bits (23787), Expect = 0.0e+00
Identity = 4659/4871 (95.65%), Postives = 4734/4871 (97.19%), Query Frame = 0

Query: 1    MAEQSFVKLLDTIFLDDSSTSANTRKHFSSSDLLQLLRSDDSSIKLGLRQFYSILKAGLR 60
            MAE+SFVKLLD IFLDDSST+ NT+K FSSSDLLQLLRSDDS +KLGLRQFYSIL+ GLR
Sbjct: 1    MAEESFVKLLDAIFLDDSSTTVNTKKPFSSSDLLQLLRSDDSFVKLGLRQFYSILEVGLR 60

Query: 61   DLGDGNFAFQSWTDPQIQAVCSIAHAIASASRSLTVDQAEAIVVAVIKKSLELVFCYLEK 120
            DLGDGNF+FQSWTDPQIQAVCSIA+AIASASRSLTVDQAEAIVVAVIKKSLE VFCYLEK
Sbjct: 61   DLGDGNFSFQSWTDPQIQAVCSIAYAIASASRSLTVDQAEAIVVAVIKKSLEFVFCYLEK 120

Query: 121  SEFKCDDFSIQNNMLMILETILVDGMDKVSDFAQLCAKKSLMDLLKSTGGDCDATIEFDN 180
            SEFKCDDFSIQNNMLMILETILVDGMDKVSD AQ CAKK L+DLLKS GGD DATIEF+N
Sbjct: 121  SEFKCDDFSIQNNMLMILETILVDGMDKVSDCAQHCAKKDLIDLLKSFGGDFDATIEFNN 180

Query: 181  TIECGSTGVCCSREEKQVGRLLMTIAAECVQADQLTSESGFSQPTFLEDMNKLIFLCQHW 240
            T ECG TGVCCSREEKQVGRLLMTIAAEC QAD LTSE GFS+PTF E+MNKLIFLCQHW
Sbjct: 181  TAECGFTGVCCSREEKQVGRLLMTIAAECEQADHLTSEPGFSEPTFFENMNKLIFLCQHW 240

Query: 241  AVTHLACIQHLILICKELVVLPDALDEKTGSTSFRKRLSCSLRILKLLTDLSKKFPYIEY 300
            AVTHLACIQHLILICKELVVLPDALDEKTGSTSFRKRLSCSLRILKLLTDLSKKFPYIEY
Sbjct: 241  AVTHLACIQHLILICKELVVLPDALDEKTGSTSFRKRLSCSLRILKLLTDLSKKFPYIEY 300

Query: 301  DAKMMQAFALFANSLPCLFGLCFEFANSHATVEGSFENTILLLLEEFLELVQVVFRNSYV 360
            D K+MQAFAL ANSLPCLFGLCFEFANSHAT E SFENTILLLLEEFLELVQVVFRNSY+
Sbjct: 301  DDKLMQAFALLANSLPCLFGLCFEFANSHATGESSFENTILLLLEEFLELVQVVFRNSYI 360

Query: 361  SVNIQTCVVASILDNLSSSVWRYDASTANLKTPLVYFPRSVMVIIKLIQDLKGHKYHAFS 420
             VNIQTC+VASILDNLSSSVWRYDASTANLK PLVYFPR VMVIIKLIQDLKGHKYHAFS
Sbjct: 361  CVNIQTCIVASILDNLSSSVWRYDASTANLKPPLVYFPRGVMVIIKLIQDLKGHKYHAFS 420

Query: 421  FKDLETHHTSTLADLSVDIPKCYARLEIVPLHKNYKVEEILRMIFPLSKQWMDDLMHLLF 480
            FKDLE H  STLA+LSVD+PKC+A LE VPLHKNY VEEILRMIFP S+QWMDDLMHLLF
Sbjct: 421  FKDLEMHQMSTLAELSVDLPKCHAPLETVPLHKNYTVEEILRMIFPPSRQWMDDLMHLLF 480

Query: 481  FLYSEGVRLRPKIERSLSSMKSSSTVEQETAVCHEDEALFGDLFSESGRSVGSVDGYDLQ 540
            FLYSEG+RLRPKIERSLSSMKSSSTVEQE AVCHEDEALFGDLFSESGRSVGSVDGYDLQ
Sbjct: 481  FLYSEGMRLRPKIERSLSSMKSSSTVEQEAAVCHEDEALFGDLFSESGRSVGSVDGYDLQ 540

Query: 541  HLAVNSTSSFCNLLLQAAKELLSFIKLCIFSPEWNASVFDDGCNKLNQNHIDILLSLLNC 600
            HLAVNSTSSFCNLLL AAKELLSFIKLCIFSPEWNASVFDDGCNKLNQNHIDILLSLLNC
Sbjct: 541  HLAVNSTSSFCNLLLLAAKELLSFIKLCIFSPEWNASVFDDGCNKLNQNHIDILLSLLNC 600

Query: 601  EGCCSDDKSSASCLPAHDERKSGHIHEICYRLLHGLLTRHALPDSLEEYLVKKILNAENG 660
            EGCCSDDKSSASCLPAHDERKSGHIHEICYRLLHGLLTRHALPDSLEEYLVKKILNAENG
Sbjct: 601  EGCCSDDKSSASCLPAHDERKSGHIHEICYRLLHGLLTRHALPDSLEEYLVKKILNAENG 660

Query: 661  NFVYNDQTLSLLAHTLFRRTGVAGTLLRTQIYRQFVEFIIEKSKTISSNYSSLQEFMGTL 720
            N VYNDQTLSLLAHTLFRRTGVAGT LRTQIYRQFVEFIIEKSKTIS  YSSLQEFMGTL
Sbjct: 661  NSVYNDQTLSLLAHTLFRRTGVAGTQLRTQIYRQFVEFIIEKSKTISLKYSSLQEFMGTL 720

Query: 721  PSVFHIEILLVAFHLSSEGEKREISSLIFSSIRAIDAPSTFSNCTELSMWGLLVSRLIIV 780
            PSVFHIEILLVAFHLSSEGEKREISSLIFSSIRAIDAPSTFSNCTELSMWGLLVSRLI+V
Sbjct: 721  PSVFHIEILLVAFHLSSEGEKREISSLIFSSIRAIDAPSTFSNCTELSMWGLLVSRLIVV 780

Query: 781  LRHIIFHPHTCSSSLLFDFRSKLRDAPAFSSSLPYTLNDHLSSWGASVAKNIIGSSVESK 840
            LRHIIFHPHTCSSSLLFDFRSKLRDAPAFSS LPYT+NDHLSSWGASVAK+IIGSS+ESK
Sbjct: 781  LRHIIFHPHTCSSSLLFDFRSKLRDAPAFSSHLPYTVNDHLSSWGASVAKSIIGSSMESK 840

Query: 841  PFFHSLINQLIDISSFPASLRQHDLTVECPWFNAGDIFSTFSWILGFWNGKQAVTVEDLI 900
            PF +SLINQLIDISSFPASLRQHDLT+ECPWFN  DIFSTFSWILGFWNGKQAVTVEDLI
Sbjct: 841  PFLNSLINQLIDISSFPASLRQHDLTIECPWFNPSDIFSTFSWILGFWNGKQAVTVEDLI 900

Query: 901  IERYIFVLCWDFPSMNALSHGGPLWSDPDTLDISNTTCFFYFSYLLLDHGGVIGEHMKFP 960
            IERYIFVLCWDFPS NALSHGGPLWSD D LDIS T CFFYFSYLLLDHGGVI EHMKFP
Sbjct: 901  IERYIFVLCWDFPSTNALSHGGPLWSDLDALDISKTACFFYFSYLLLDHGGVIDEHMKFP 960

Query: 961  QVVIGLLQRLHGGSILEDFKALGWNFLRNGAWLSLVLSFLSVGIWRYCSKNMIPTVGSLL 1020
            QVVIGLL+RLHGGS+LEDFKALGWNFLRNG WLSL+LSFL VGI RYCSKN IPTVGS L
Sbjct: 961  QVVIGLLRRLHGGSVLEDFKALGWNFLRNGTWLSLILSFLGVGISRYCSKNKIPTVGSFL 1020

Query: 1021 TDTTVTDNEQANFAESLISSVITDSQVSILIRELSSVLSMYLQVYQKAFVATLSSSNDHA 1080
            TDTTVTD+EQANFAESLISSVI DSQV ILIRELSSVLSMYL+VYQKA+VATLSSSNDHA
Sbjct: 1021 TDTTVTDSEQANFAESLISSVIIDSQVPILIRELSSVLSMYLRVYQKAYVATLSSSNDHA 1080

Query: 1081 TEFSPLLLFKHSEFDSCVQNKTLENYGTTSCLLESVFNLMSRLDEIVDKRTLGFLSRFCW 1140
            TEFSPLLLFKHS+FD CVQNKTLENYGTTSCLLESVFNLMSRLDEIVDKRTLGF SR CW
Sbjct: 1081 TEFSPLLLFKHSKFDRCVQNKTLENYGTTSCLLESVFNLMSRLDEIVDKRTLGFSSRVCW 1140

Query: 1141 ESMFHGFPSHLETSSGILLSCVLSIGRIISVLAGLLRIVDVKRNIILETEVTRGILDAVM 1200
            ESMFHGFPSHLETSSGILLSCVLSIGRIISVLAGLLRIVDVKR++ILETEVTRGILDAVM
Sbjct: 1141 ESMFHGFPSHLETSSGILLSCVLSIGRIISVLAGLLRIVDVKRSVILETEVTRGILDAVM 1200

Query: 1201 TIKFDKTFESVHGLCEGIYQSLNKELDGCSYGVLFLLKQLEGYLRHMNMRGASDSTIHEL 1260
            T+KFDKTFESVHGLCEGIYQSLN ELDGCSYGVLFLLKQLE YLRH+NMRG SDSTIHEL
Sbjct: 1201 TVKFDKTFESVHGLCEGIYQSLNAELDGCSYGVLFLLKQLEEYLRHINMRGVSDSTIHEL 1260

Query: 1261 VIVKATDIMDNLRKDVSKSSVFQFYLGAEVVLEQVRELYTFQHGNLLVLLDSLDNCCSEL 1320
            VIVKA DIMD+LRKDVSKSSVFQFYLGAE V EQVRELY FQHGNLLVLLDSLDNCCSEL
Sbjct: 1261 VIVKAIDIMDSLRKDVSKSSVFQFYLGAEDVPEQVRELYAFQHGNLLVLLDSLDNCCSEL 1320

Query: 1321 VNLKVLGFFVELLSGEPCPKLKQEVQNKFLSMDLLSLSQWLEKRIFGLVAEDSSGGNVKG 1380
            VNLKVLGFFV+LLSGEPCPKLKQEVQNKFL MDLLSLS+WLEKRIFGLVAEDSSG NVKG
Sbjct: 1321 VNLKVLGFFVDLLSGEPCPKLKQEVQNKFLCMDLLSLSKWLEKRIFGLVAEDSSGVNVKG 1380

Query: 1381 SSISLRESSMNFVFCLISSPSEPLALQLQSHIFEAALVSLDMAFLRFDISVSKSYFHFVV 1440
            SSISLRESSMNFVFCLISSPSEPLA QLQSHIFEAALVSLDMAFLRFDISVSKSYFHFVV
Sbjct: 1381 SSISLRESSMNFVFCLISSPSEPLAHQLQSHIFEAALVSLDMAFLRFDISVSKSYFHFVV 1440

Query: 1441 QLLKGDKSMKLLLERILILMGKLASDERLLPGLKYLFSFLEMILIESGSGKNVFERPSGK 1500
            QLLKGDKSMKLLLERILILM KLA DERLLPGLKYLF+FLEMILIESGSGKNVFER SGK
Sbjct: 1441 QLLKGDKSMKLLLERILILMEKLAKDERLLPGLKYLFNFLEMILIESGSGKNVFERTSGK 1500

Query: 1501 PLSRYAPEVGPLSSKSVGPRKNSETLVLSSNQEEGPASFECDATSAEEDEDDGTSDGEVA 1560
            PLSRYAPEVGPLSSK VGPRKNSETLVLSSNQEEGPASF+CDATSAEEDEDDGTSDGEVA
Sbjct: 1501 PLSRYAPEVGPLSSKLVGPRKNSETLVLSSNQEEGPASFDCDATSAEEDEDDGTSDGEVA 1560

Query: 1561 SLDKDEEEDTNSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRG 1620
            SLDKDEEED+NSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRG
Sbjct: 1561 SLDKDEEEDSNSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRG 1620

Query: 1621 HRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRKYTGHGSAPVRGASNFQCFLPFSEEGDQL 1680
            HRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRK+TGHGSAPVRGASNFQCFLPFSEEGDQL
Sbjct: 1621 HRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRKFTGHGSAPVRGASNFQCFLPFSEEGDQL 1680

Query: 1681 PESESDLEDDVSVTDTDKCLRPSVPRELLDGVSVLLEELDVEGRMLELCSCLLPTITNQR 1740
            PESESDLEDDVSV DTDKCLRPSVP ELLDGVSVLLEEL+VE RMLELCSCLLPTITNQR
Sbjct: 1681 PESESDLEDDVSVADTDKCLRPSVPMELLDGVSVLLEELNVERRMLELCSCLLPTITNQR 1740

Query: 1741 DPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSLV 1800
            DPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSL+
Sbjct: 1741 DPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSLM 1800

Query: 1801 KSLLSVSIRGRLAVGEGDKVSIFDIRQLIEQTTVAPMTADKTNVKPLSKNVVRFEIVHLA 1860
            KSLLSVSIRGRLAVGEGDKVSIFD+RQLIEQ TVAPMTADKTNVKPLSKNVVRFEIVHLA
Sbjct: 1801 KSLLSVSIRGRLAVGEGDKVSIFDVRQLIEQATVAPMTADKTNVKPLSKNVVRFEIVHLA 1860

Query: 1861 FNPTVENYLAVAGYEDCQVLTLNHRGEVVDRLAIELALQGAHIKRMEWVPGSQVQLMVVT 1920
            FNPTVENYLAVAGYEDCQVLTLNHRGEVVDRLAIELALQGAHIKRMEWVPGSQVQLMVVT
Sbjct: 1861 FNPTVENYLAVAGYEDCQVLTLNHRGEVVDRLAIELALQGAHIKRMEWVPGSQVQLMVVT 1920

Query: 1921 NRFVKIYDLSLDNISPMHYFTLPDDMVVDATLSTASQGRMFLIVLSENGRIFRLELSVLG 1980
            NRFVKIYDLSLDNISPMHYFTLPDDMVVDATL  ASQG+MFLIVLSENGRIFRLELSVLG
Sbjct: 1921 NRFVKIYDLSLDNISPMHYFTLPDDMVVDATLFIASQGKMFLIVLSENGRIFRLELSVLG 1980

Query: 1981 NVGATPLKEIIEIQGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTEISV 2040
            N+GATPLKEII IQGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTEIS 
Sbjct: 1981 NIGATPLKEIIHIQGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTEISF 2040

Query: 2041 IYEEEQDRKLRPAGLHRWKELFAGSGLFVCFSSVKSNSALAVSMGAHDIYAQNLRHAGGS 2100
            IYEEEQD+KLRPAGLHRWKELFAGSGLFVCFSS KSNSALAVSMGAH+IYAQNLRHAGGS
Sbjct: 2041 IYEEEQDKKLRPAGLHRWKELFAGSGLFVCFSSFKSNSALAVSMGAHEIYAQNLRHAGGS 2100

Query: 2101 SLPLVGITAYKPLSKDKIHCLVLHDDGSLQIYTHTAVGVDASAYATAEKIKKLGSGILNN 2160
            SLPLVGITAYKPLSKDKIHCLVLHDDGSLQIYTHTAVGVDASA ATAEKIKKLGSGILNN
Sbjct: 2101 SLPLVGITAYKPLSKDKIHCLVLHDDGSLQIYTHTAVGVDASANATAEKIKKLGSGILNN 2160

Query: 2161 KVYASTNPEFPLDFFENTVCITADVRLGGDAIRNGDSEGAKQSLASEDGFLESPSSSGFK 2220
            KVYASTNPEF LDFFE TVCITADVRLGGD IRNGDSEGAKQSLASEDGFLESPSSSGFK
Sbjct: 2161 KVYASTNPEFALDFFEKTVCITADVRLGGDTIRNGDSEGAKQSLASEDGFLESPSSSGFK 2220

Query: 2221 ITVSNSNPDIVMVGFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLLA 2280
            ITVSNSNPDIVMVGFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLLA
Sbjct: 2221 ITVSNSNPDIVMVGFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLLA 2280

Query: 2281 DEEFSVTVGPAFNGTALPRIDSLEVYGRAKDEFGWKEKLDAVLDMEARALGSNSLLARSG 2340
            DEEFSVTVGPAFNGTALPRIDSLEVYGRAKDEFGWKEKLDAVLDMEARA+GSNSLLARSG
Sbjct: 2281 DEEFSVTVGPAFNGTALPRIDSLEVYGRAKDEFGWKEKLDAVLDMEARAIGSNSLLARSG 2340

Query: 2341 KKRRSIQCAPIQQQVLADGLKVLSSYYLLRRSQGCPKLNDVNQELTKLKCKQLLETIYES 2400
            KKRRSIQCAPIQQQVLADGLKV+SSYYLL R QGCPKL+DVNQELTKLKCKQLLETIYES
Sbjct: 2341 KKRRSIQCAPIQQQVLADGLKVMSSYYLLCRPQGCPKLDDVNQELTKLKCKQLLETIYES 2400

Query: 2401 DREPLLQSAACRVLQAIFPKKEIYYQVKDTMRLTGVVKSTSVLSSRLGVGGAAGGWIIEE 2460
            DREPLLQSAACRVLQAIFPKKEIYYQVKDTMRLTGVVKSTSVLS+RLGVGG AGGWIIEE
Sbjct: 2401 DREPLLQSAACRVLQAIFPKKEIYYQVKDTMRLTGVVKSTSVLSTRLGVGGVAGGWIIEE 2460

Query: 2461 FTSQMRAVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVISSV 2520
            FTSQMRAVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVISSV
Sbjct: 2461 FTSQMRAVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVISSV 2520

Query: 2521 ELIYCYAECLALHGPDTGRHSVAPAVVLFKKLLFSSSEAVQASSSLAISSRLLQVPFPKQ 2580
            ELIYCYAECLALHGPDTGRHSVAPAVVLFKKLLFSSSEAVQASSSLAISSRLLQVPFPKQ
Sbjct: 2521 ELIYCYAECLALHGPDTGRHSVAPAVVLFKKLLFSSSEAVQASSSLAISSRLLQVPFPKQ 2580

Query: 2581 TMLATDDGADIPLSAPVPTETTGTNPQVMIEEDAVASSVQYCCDGCSTVPILRRRWHCTI 2640
            TMLATDDGADIPLSAPV TET GTNPQV+IEEDA+ASSVQYCCDGCS VPILRRRWHCTI
Sbjct: 2581 TMLATDDGADIPLSAPVSTETPGTNPQVVIEEDAIASSVQYCCDGCSKVPILRRRWHCTI 2640

Query: 2641 CPDFDLCESCYEVLDADRLPSPHSRDHPMTAIPIEVDSLGDGNEYHFATEDINDSSLTSL 2700
            CPDFDLCESCYEVLDADRLPSPHSRDHPMTAIPIEV+SLGDGNEYHFATEDINDSSLTS+
Sbjct: 2641 CPDFDLCESCYEVLDADRLPSPHSRDHPMTAIPIEVESLGDGNEYHFATEDINDSSLTSV 2700

Query: 2701 IPDISVKNPVSSIHVLEPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWMETT 2760
              DISVKNP SSIHVLEPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWMETT
Sbjct: 2701 RSDISVKNPASSIHVLEPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWMETT 2760

Query: 2761 SGVQAVPVMQLFYRLSSTMGGPFMNSLKSENLNLERLIKWFLDEINLNKPFEAKTRTSFG 2820
            SGVQAVPVMQLFYRLSSTMGGPFMNSLKSENLNLERLIKWFLDEINLNKPFEAKTRTSFG
Sbjct: 2761 SGVQAVPVMQLFYRLSSTMGGPFMNSLKSENLNLERLIKWFLDEINLNKPFEAKTRTSFG 2820

Query: 2821 EVAILVFMFFTLMLRNWHQPGSDGPGAKPSTTTDTHDKNSTQVAPSTSVTAQSSMDDQGK 2880
            EVAILVFMFFTLMLRNWHQPGSDGPGAK STT DTHDKNSTQVAPSTS+TAQSSMDDQGK
Sbjct: 2821 EVAILVFMFFTLMLRNWHQPGSDGPGAKSSTTADTHDKNSTQVAPSTSLTAQSSMDDQGK 2880

Query: 2881 NDFTSQLLRACSSIRQQSFVNYLMDVLQQLVHVFKSSTIDYDSGHGFHNGSGCGALLTVR 2940
            NDFTSQLLRACSSIRQQSFVNYLMDVLQQLVHVFKSSTIDYDSGHGF+NGSGCGALLTVR
Sbjct: 2881 NDFTSQLLRACSSIRQQSFVNYLMDVLQQLVHVFKSSTIDYDSGHGFNNGSGCGALLTVR 2940

Query: 2941 KDLPAGNFSPFFSDSYAKAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKVYK 3000
            KDLPAGNFSPFFSDSYAKAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKVYK
Sbjct: 2941 KDLPAGNFSPFFSDSYAKAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKVYK 3000

Query: 3001 IYSSKDLKLDAYQDVLCSYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTEVK 3060
            IYSSKDLKLDAYQDVLCSYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTEVK
Sbjct: 3001 IYSSKDLKLDAYQDVLCSYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTEVK 3060

Query: 3061 RLFKYINKVGGFQNPMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRHADVLPFLLNGI 3120
            +LFKY+NKVGGFQNPMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRH DVLPFLLNGI
Sbjct: 3061 KLFKYVNKVGGFQNPMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRHGDVLPFLLNGI 3120

Query: 3121 FYFGEESVVQTLKLLNLAFYTGKDIGHSVQKSEAGDTGTSTNKSGTQTVDSRKKRKGEDG 3180
            FYFGEESV+QTLKLLNLAFYTGKDIGHSVQKSEAGDTGTSTNKSGTQ VD RKK+KGEDG
Sbjct: 3121 FYFGEESVIQTLKLLNLAFYTGKDIGHSVQKSEAGDTGTSTNKSGTQPVDIRKKKKGEDG 3180

Query: 3181 NDSALEKSYLDMEIMVNIFVDKGSNVLSHFIDCFLLEWNSSSVRAETKGVVCGIWHHGKQ 3240
            +DSALEKSYLDME MVNIF+DKGSNVLSHFIDCFLLEWNSSSVRAE KGVVCGIWHHGKQ
Sbjct: 3181 SDSALEKSYLDMETMVNIFIDKGSNVLSHFIDCFLLEWNSSSVRAEAKGVVCGIWHHGKQ 3240

Query: 3241 TFKETLLMALLQKVKTLPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSDVI 3300
            TFKETLLMALLQKVK LPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSDVI
Sbjct: 3241 TFKETLLMALLQKVKNLPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSDVI 3300

Query: 3301 RSIYQTLHSQNELLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLESL 3360
            RSIYQTLHSQNELLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLESL
Sbjct: 3301 RSIYQTLHSQNELLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLESL 3360

Query: 3361 KSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNNWS 3420
            KSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNNWS
Sbjct: 3361 KSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNNWS 3420

Query: 3421 LWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPVTD 3480
            LWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPVTD
Sbjct: 3421 LWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPVTD 3480

Query: 3481 KHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNMEND 3540
            KHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNMEND
Sbjct: 3481 KHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNMEND 3540

Query: 3541 EDMKRGLAAIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSLPG 3600
            EDMKRGL AIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSLPG
Sbjct: 3541 EDMKRGLTAIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSLPG 3600

Query: 3601 PSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASRFV 3660
            PSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASRFV
Sbjct: 3601 PSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASRFV 3660

Query: 3661 ISRSPNNCYGCATTFVTQCLEILQVLSKHQSSKKQLVSLGILSELFENNIHQGPKTARIQ 3720
            ISRSPNNCYGCATTFVTQCLEILQVLSKHQSSKKQLVSLGILSELFENNIHQGPKTARIQ
Sbjct: 3661 ISRSPNNCYGCATTFVTQCLEILQVLSKHQSSKKQLVSLGILSELFENNIHQGPKTARIQ 3720

Query: 3721 ARAVLCSFSEGDVNAVNGLNNLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLADEF 3780
            ARAVLCSFSEGDVNAV+GLNNLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLADEF
Sbjct: 3721 ARAVLCSFSEGDVNAVSGLNNLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLADEF 3780

Query: 3781 WEARLRVVFQLLFSSIKSGAKHPAIAEHIILPCLRIISQACTPPKSDTVDKEQRMGKLTS 3840
            WEARLRVVFQLLFSSIKSGAKHPAIAEHII PCLRIISQACTPPKS+TVDKEQR GKLTS
Sbjct: 3781 WEARLRVVFQLLFSSIKSGAKHPAIAEHIIHPCLRIISQACTPPKSETVDKEQRTGKLTS 3840

Query: 3841 VSQNKDENATNISGSFSGPVSGNKSAPESLEHNWDSSHRTQDIQLLSYAEWEKGASYLDF 3900
            VSQNKDEN TNISGSFSGPVSGNKSAPESLEHNWDSSHRTQDIQLLSYAEWEKGASYLDF
Sbjct: 3841 VSQNKDENTTNISGSFSGPVSGNKSAPESLEHNWDSSHRTQDIQLLSYAEWEKGASYLDF 3900

Query: 3901 VRRQYKVSQVCKGTVQRSRTQKGDYLSLKYALKWKRFVCRNAKSDLSAFELGSWVTELVL 3960
            VRRQYKVSQV KGTVQRSRTQKGDYLSLKYALKWKRFVCRNAKSDLSAFELGSWVTELVL
Sbjct: 3901 VRRQYKVSQVFKGTVQRSRTQKGDYLSLKYALKWKRFVCRNAKSDLSAFELGSWVTELVL 3960

Query: 3961 CACSQSIRSEMCMLISLLCAQSSSRRFRLLDLLVSLLPATLSAGESAAEYFDLLFKMVDS 4020
            CACSQSIRSEMCMLISLLC+QSSSRRFRLLDLLVSLLPATLSAGESAAEYF+LLFKMVDS
Sbjct: 3961 CACSQSIRSEMCMLISLLCSQSSSRRFRLLDLLVSLLPATLSAGESAAEYFELLFKMVDS 4020

Query: 4021 EDARLFLTVRGCLRTICQLISQEVGNVESLERSLHIDISQGFILHKLIELLGKFLEIPNI 4080
            EDARLFLTVRGCLRTICQLISQEV NVESLERSLHIDISQGFILHKLIELLGKFLEIPNI
Sbjct: 4021 EDARLFLTVRGCLRTICQLISQEVSNVESLERSLHIDISQGFILHKLIELLGKFLEIPNI 4080

Query: 4081 RSRYNFFLKHLFFRLLLFKLIQCDLIVRFMRDNLLSEVLEALIVIRGLVVQKTKLISDCN 4140
            RS                         RFMRDNLLSEVLEALIVIRGLVVQKTKLISDCN
Sbjct: 4081 RS-------------------------RFMRDNLLSEVLEALIVIRGLVVQKTKLISDCN 4140

Query: 4141 RLLKDLLDSLLLESNENKRQFIRACICGLQIHGEERKGRTCLFILEQLCNLISPSKPEPV 4200
            RLLKDLLDSLLLESNENKRQFIRACICGLQ HGEERKGRTCLFILEQLCNLISPSKPEPV
Sbjct: 4141 RLLKDLLDSLLLESNENKRQFIRACICGLQNHGEERKGRTCLFILEQLCNLISPSKPEPV 4200

Query: 4201 YLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKICHQLDLLGFLEDDYGMELLV 4260
            YLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKICHQLDLL FLEDDYGMELLV
Sbjct: 4201 YLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKICHQLDLLSFLEDDYGMELLV 4260

Query: 4261 AGNIISLDLSIALVYEQVWKKSNQSSNAISNTALISTTAARDSPPMTVTYRLQGLDGEAT 4320
            AGNIISLDLSIALVYEQVWKKSNQSSNAISNTA+ISTTAARDSPPMTVTYRLQGLDGEAT
Sbjct: 4261 AGNIISLDLSIALVYEQVWKKSNQSSNAISNTAIISTTAARDSPPMTVTYRLQGLDGEAT 4320

Query: 4321 EPMIKELEEDREESQDPELEFAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVLNL 4380
            EPMIKELEEDREESQDPELEFAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVLNL
Sbjct: 4321 EPMIKELEEDREESQDPELEFAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVLNL 4380

Query: 4381 LMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESESIS 4440
            LMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESESIS
Sbjct: 4381 LMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESESIS 4440

Query: 4441 IGQSALTVTSEQTGTGEQAKKIVLMFLERLSHPFGSKKSNKQQRNTEMVARILPYLTYGE 4500
            IGQSALTVTSEQTGTGEQAKKIVLMFLERLSHPFG KKSNKQQRNTEMVARILPYLTYGE
Sbjct: 4441 IGQSALTVTSEQTGTGEQAKKIVLMFLERLSHPFGFKKSNKQQRNTEMVARILPYLTYGE 4500

Query: 4501 PAAMDALIQHFTPYLNDWDEFDRLQKQHEDNPEDKSISEQAAKQRFTVENFVRVSESLKT 4560
            PAAMDALIQHFTPYLNDWDEFDRLQKQHEDNPEDKS+SEQAAKQRFTVENFVRVSESLKT
Sbjct: 4501 PAAMDALIQHFTPYLNDWDEFDRLQKQHEDNPEDKSLSEQAAKQRFTVENFVRVSESLKT 4560

Query: 4561 SSCGERLKDIILEKGITGLAIKHLRDSFAVAGQTGFRSSVEWAFALKRPSIPLILSMLRG 4620
            SSCGERLKDIILEKGITGLAIKHLRD+FAVAGQTGFRSSVEW FALKRPSIPLILSMLRG
Sbjct: 4561 SSCGERLKDIILEKGITGLAIKHLRDTFAVAGQTGFRSSVEWGFALKRPSIPLILSMLRG 4620

Query: 4621 LSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDKVR 4680
            LSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDKVR
Sbjct: 4621 LSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDKVR 4680

Query: 4681 MLRHATRDEMRRLALKNREDMLQGLGMRQVASDGGERIIVSRPALEGLEDVEEEEDGLAC 4740
            MLRHATRDEMRRLALKNREDMLQ LGMRQVASDGGERIIVSRPALEGLEDVEEEEDGLAC
Sbjct: 4681 MLRHATRDEMRRLALKNREDMLQRLGMRQVASDGGERIIVSRPALEGLEDVEEEEDGLAC 4740

Query: 4741 MVCREGYSLRPTDLLGVYSYSKRVNLGVGTSGSTRGECVYTTVSYFNIIHYQCHQEAKRT 4800
            MVCREGYSLRPTDLLGVYSYSKRVNLGVGTSGS+RGECVYTTVSYFNIIHYQCHQEAKRT
Sbjct: 4741 MVCREGYSLRPTDLLGVYSYSKRVNLGVGTSGSSRGECVYTTVSYFNIIHYQCHQEAKRT 4800

Query: 4801 DAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADGNR 4860
            DAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADGNR
Sbjct: 4801 DAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADGNR 4846

Query: 4861 LRLLTYDIVLV 4872
            LRLLTYDIVL+
Sbjct: 4861 LRLLTYDIVLM 4846

BLAST of HG10018299 vs. ExPASy TrEMBL
Match: A0A0A0KVU7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G642140 PE=3 SV=1)

HSP 1 Score: 9110.0 bits (23638), Expect = 0.0e+00
Identity = 4641/4911 (94.50%), Postives = 4729/4911 (96.29%), Query Frame = 0

Query: 1    MAEQSFVKLLDTIFLDDSSTSANTRKHFSSSDLLQLLRSDDSSIKLGLRQFYSILKAGLR 60
            MA+QSFVKLLDTIFLDDS+T+ANT+K FSSSDLL LLRSDDSSIKLGL QFYSIL+ GLR
Sbjct: 1    MADQSFVKLLDTIFLDDSTTTANTKKPFSSSDLLHLLRSDDSSIKLGLPQFYSILQLGLR 60

Query: 61   DLGDGNFAFQSWTDPQIQAVCSIAHAIASASRSLTVDQAEAIVVAVIKKSLELVFCYLEK 120
            DLG  NFAFQSWTDPQIQAVCSIA+AIASASRSLTVDQAEAIVVAVIKKSLE VFCYLEK
Sbjct: 61   DLGHRNFAFQSWTDPQIQAVCSIAYAIASASRSLTVDQAEAIVVAVIKKSLEFVFCYLEK 120

Query: 121  SEFKCDDFSIQ----------------------------------------NNMLMILET 180
            SEFKCDDFSIQ                                        NNMLMILET
Sbjct: 121  SEFKCDDFSIQSLPVNTGCGNVCVTPLDKIWRRCSHTAFDWMHYKRMVRYKNNMLMILET 180

Query: 181  ILVDGMDKVSDFAQLCAKKSLMDLLKSTGGDCDATIEFDNTIECGSTGVCCSREEKQVGR 240
            ILVDGMDKVSD AQ CAKK L+DLLKS GGD DATIEF+NT+ECG TGVCCSREEKQVGR
Sbjct: 181  ILVDGMDKVSDCAQHCAKKDLIDLLKSFGGDFDATIEFNNTVECGFTGVCCSREEKQVGR 240

Query: 241  LLMTIAAECVQADQLTSESGFSQPTFLEDMNKLIFLCQHWAVTHLACIQHLILICKELVV 300
            LLMTIAAEC QAD LTSE GFS+PTFLE+MNKLIFLCQHWAVTHLACIQ LILICK+LVV
Sbjct: 241  LLMTIAAECEQADNLTSEPGFSEPTFLENMNKLIFLCQHWAVTHLACIQRLILICKDLVV 300

Query: 301  LPDALDEKTGSTSFRKRLSCSLRILKLLTDLSKKFPYIEYDAKMMQAFALFANSLPCLFG 360
            LPDALDEKTGST FRKRLSCSLRILKLL DLSKKFPYIEYDAK+MQAFAL ANSLPCLFG
Sbjct: 301  LPDALDEKTGSTIFRKRLSCSLRILKLLADLSKKFPYIEYDAKLMQAFALLANSLPCLFG 360

Query: 361  LCFEFANSHATVEGSFENTILLLLEEFLELVQVVFRNSYVSVNIQTCVVASILDNLSSSV 420
            LCFEFANSHAT E SFENTILLLLEEFLELVQ+VFRN YV VNIQTC+VASILDNLSSSV
Sbjct: 361  LCFEFANSHATGESSFENTILLLLEEFLELVQIVFRNIYVCVNIQTCIVASILDNLSSSV 420

Query: 421  WRYDASTANLKTPLVYFPRSVMVIIKLIQDLKGHKYHAFSFKDLETHHTSTLADLSVDIP 480
            WRYDASTANLK PLVYFPR VMVIIKLIQDLKGHKYHAFSFKDLE HHTSTL DLSVD+P
Sbjct: 421  WRYDASTANLKPPLVYFPRGVMVIIKLIQDLKGHKYHAFSFKDLEMHHTSTLTDLSVDLP 480

Query: 481  KCYARLEIVPLHKNYKVEEILRMIFPLSKQWMDDLMHLLFFLYSEGVRLRPKIERSLSSM 540
            KC+ARLE VPLHKNY VEEILRMIFP S+QWMDDLMHLLFFLYSEG+RLRPKIERSLSSM
Sbjct: 481  KCHARLEAVPLHKNYTVEEILRMIFPPSRQWMDDLMHLLFFLYSEGMRLRPKIERSLSSM 540

Query: 541  KSSSTVEQETAVCHEDEALFGDLFSESGRSVGSVDGYDLQHLAVNSTSSFCNLLLQAAKE 600
            KSSSTVEQE AVCHEDEALFGDLFSESGRSVGSVDGYDLQHLAVNSTSSFCNLLLQAAKE
Sbjct: 541  KSSSTVEQEAAVCHEDEALFGDLFSESGRSVGSVDGYDLQHLAVNSTSSFCNLLLQAAKE 600

Query: 601  LLSFIKLCIFSPEWNASVFDDGCNKLNQNHIDILLSLLNCEGCCSDDKSSASCLPAHDER 660
            LLSFIKLCIFSPEWNASVFDDGCNKLNQNHIDILLSLLNCEGCCSDDKSSASCLPAHDER
Sbjct: 601  LLSFIKLCIFSPEWNASVFDDGCNKLNQNHIDILLSLLNCEGCCSDDKSSASCLPAHDER 660

Query: 661  KSGHIHEICYRLLHGLLTRHALPDSLEEYLVKKILNAENGNFVYNDQTLSLLAHTLFRRT 720
            KSGHIHEICYRLLHGLLTRHALPDSLEEYLVKKILNAENGN VYNDQTLSLLAHTLFRRT
Sbjct: 661  KSGHIHEICYRLLHGLLTRHALPDSLEEYLVKKILNAENGNSVYNDQTLSLLAHTLFRRT 720

Query: 721  GVAGTLLRTQIYRQFVEFIIEKSKTISSNYSSLQEFMGTLPSVFHIEILLVAFHLSSEGE 780
            GVAGT LRTQIYRQFVEFIIEKSKTIS  YSSLQEFMGTLPSVFHIEILLVAFHLSSEGE
Sbjct: 721  GVAGTQLRTQIYRQFVEFIIEKSKTISLQYSSLQEFMGTLPSVFHIEILLVAFHLSSEGE 780

Query: 781  KREISSLIFSSIRAIDAPSTFSNCTELSMWGLLVSRLIIVLRHIIFHPHTCSSSLLFDFR 840
            KREISSLIFSSIRAIDAPSTFSNCTELSMWGLLVSRLI+VLRHIIFHPHTCSSSLLFDFR
Sbjct: 781  KREISSLIFSSIRAIDAPSTFSNCTELSMWGLLVSRLIVVLRHIIFHPHTCSSSLLFDFR 840

Query: 841  SKLRDAPAFSSSLPYTLNDHLSSWGASVAKNIIGSSVESKPFFHSLINQLIDISSFPASL 900
            SKLRDAPAFSS LPYT+NDHLSSWGASVAKNIIGSS+ESKPF +SLINQLIDISSFPASL
Sbjct: 841  SKLRDAPAFSSHLPYTVNDHLSSWGASVAKNIIGSSMESKPFLNSLINQLIDISSFPASL 900

Query: 901  RQHDLTVECPWFNAGDIFSTFSWILGFWNGKQAVTVEDLIIERYIFVLCWDFPSMNALSH 960
            RQHDLT+ECPWFN  DIFSTFSWILGFWNGKQA+TVEDLIIERYIFVLCWDFPS NALS 
Sbjct: 901  RQHDLTIECPWFNPSDIFSTFSWILGFWNGKQALTVEDLIIERYIFVLCWDFPSANALSR 960

Query: 961  GGPLWSDPDTLDISNTTCFFYFSYLLLDHGGVIGEHMKFPQVVIGLLQRLHGGSILEDFK 1020
            GGPLWSDPD LDIS TTCFFYFSYLLLDHG VIGEHMKF +VVIGLLQRLHGGS+LEDFK
Sbjct: 961  GGPLWSDPDALDISKTTCFFYFSYLLLDHGSVIGEHMKFSRVVIGLLQRLHGGSVLEDFK 1020

Query: 1021 ALGWNFLRNGAWLSLVLSFLSVGIWRYCSKNMIPTVGSLLTDTTVTDNEQANFAESLISS 1080
            ALGWNFLRNG WLSL+LSFLSVGI RYCSKN IPTVGS LTDTTVTD+EQANFAESLISS
Sbjct: 1021 ALGWNFLRNGTWLSLILSFLSVGISRYCSKNTIPTVGSFLTDTTVTDSEQANFAESLISS 1080

Query: 1081 VITDSQVSILIRELSSVLSMYLQVYQKAFVATLSSSNDHATEFSPLLLFKHSEFDSCVQN 1140
            VIT+SQV ILIRELSSVLSMYL+VYQKA+VATLSSSNDHATEFSPLLLFKHSEFD CVQN
Sbjct: 1081 VITESQVPILIRELSSVLSMYLRVYQKAYVATLSSSNDHATEFSPLLLFKHSEFDKCVQN 1140

Query: 1141 KTLENYGTTSCLLESVFNLMSRLDEIVDKRTLGFLSRFCWESMFHGFPSHLETSSGILLS 1200
            KTLENYGTTSC LESV NLMSRLDEIVDKRTLGF SR CWESMFHGFPSHLETSSGILLS
Sbjct: 1141 KTLENYGTTSCSLESVLNLMSRLDEIVDKRTLGFSSRVCWESMFHGFPSHLETSSGILLS 1200

Query: 1201 CVLSIGRIISVLAGLLRIVDVKRNIILETEVTRGILDAVMTIKFDKTFESVHGLCEGIYQ 1260
            CVL+IGRIISVLAGLLR+VDVKR++ILETEVTRGILDAVMT+KFDKTFESVHGLC+GIY+
Sbjct: 1201 CVLNIGRIISVLAGLLRLVDVKRSVILETEVTRGILDAVMTVKFDKTFESVHGLCDGIYK 1260

Query: 1261 SLNKELDGCSYGVLFLLKQLEGYLRHMNMRGASDSTIHELVIVKATDIMDNLRKDVSKSS 1320
            SLN ELDGCSYGVLFLLKQLE YLRH+NMRG SDSTIHELVIVK  DIMD+LRKDVSKSS
Sbjct: 1261 SLNVELDGCSYGVLFLLKQLEEYLRHINMRGVSDSTIHELVIVKVIDIMDSLRKDVSKSS 1320

Query: 1321 VFQFYLGAEVVLEQVRELYTFQHGNLLVLLDSLDNCCSELVNLKVLGFFVELLSGEPCPK 1380
            VFQFYLG+  V EQVRELY FQHGNLLVLLDSLDNC SELVNLKVLGFFV+LLSGEPC K
Sbjct: 1321 VFQFYLGSADVPEQVRELYAFQHGNLLVLLDSLDNCFSELVNLKVLGFFVDLLSGEPCRK 1380

Query: 1381 LKQEVQNKFLSMDLLSLSQWLEKRIFGLVAEDSSGGNVKGSSISLRESSMNFVFCLISSP 1440
            LKQEVQNKFL MDL SLS+WLEKRIFGLVAEDSSG NVKGSSISLRESSMNFVFCLISSP
Sbjct: 1381 LKQEVQNKFLQMDLPSLSKWLEKRIFGLVAEDSSGVNVKGSSISLRESSMNFVFCLISSP 1440

Query: 1441 SEPLALQLQSHIFEAALVSLDMAFLRFDISVSKSYFHFVVQLLKGDKSMKLLLERILILM 1500
            +EPLALQLQSHIFEAALVSLDMAF+RFDISVSKSYFHFVVQLLKGDKSMKLLLERILILM
Sbjct: 1441 TEPLALQLQSHIFEAALVSLDMAFMRFDISVSKSYFHFVVQLLKGDKSMKLLLERILILM 1500

Query: 1501 GKLASDERLLPGLKYLFSFLEMILIESGSGKNVFERPSGKPLSRYAPEVGPLSSKSVGPR 1560
             KLA+DERLLPG+K+LF+FLEMILIESGSGKNVFER +GKPLSRYAPEVGPLSSKSVGPR
Sbjct: 1501 EKLANDERLLPGMKFLFNFLEMILIESGSGKNVFERTAGKPLSRYAPEVGPLSSKSVGPR 1560

Query: 1561 KNSETLVLSSNQEEGPASFECDATSAEEDEDDGTSDGEVASLDKDEEEDTNSERALASKV 1620
            KNSETLVLSSNQEEGPASF+CDATSAEEDEDDGTSDGEVASLDKDEEEDTNSERALASKV
Sbjct: 1561 KNSETLVLSSNQEEGPASFDCDATSAEEDEDDGTSDGEVASLDKDEEEDTNSERALASKV 1620

Query: 1621 CTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRGHRVVYSRSSRFFCDCGAGGV 1680
            CTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRGHRVVYSRSSRFFCDCGAGGV
Sbjct: 1621 CTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRGHRVVYSRSSRFFCDCGAGGV 1680

Query: 1681 RGSSCQCLKPRKYTGHGSAPVRGASNFQCFLPFSEEGDQLPESESDLEDDVSVTDTDKCL 1740
            RGSSCQCLKPRK+TGHGSAPVRGASNFQCFLPFSEEGDQLPESESDLEDDVSVTDTDKCL
Sbjct: 1681 RGSSCQCLKPRKFTGHGSAPVRGASNFQCFLPFSEEGDQLPESESDLEDDVSVTDTDKCL 1740

Query: 1741 RPSVPRELLDGVSVLLEELDVEGRMLELCSCLLPTITNQRDPDLSKDKKIILGKDKVLSY 1800
            +PSVP ELLDGVSVLLEEL+VE RMLELCSCLLPTITNQRDPDLSKDKKIILGKDKVLSY
Sbjct: 1741 KPSVPMELLDGVSVLLEELNVEERMLELCSCLLPTITNQRDPDLSKDKKIILGKDKVLSY 1800

Query: 1801 GLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSLVKSLLSVSIRGRLAVGEGDKV 1860
            GLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSLVKSLLSVSIRGRLAVGEGDKV
Sbjct: 1801 GLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSLVKSLLSVSIRGRLAVGEGDKV 1860

Query: 1861 SIFDIRQLIEQTTVAPMTADKTNVKPLSKNVVRFEIVHLAFNPTVENYLAVAGYEDCQVL 1920
            SIFD+RQLIEQ TVAPMTADKTNVKPLSKNVVRFEIVHLAFNPTVENYLAVAGYEDCQVL
Sbjct: 1861 SIFDVRQLIEQATVAPMTADKTNVKPLSKNVVRFEIVHLAFNPTVENYLAVAGYEDCQVL 1920

Query: 1921 TLNHRGEVVDRLAIELALQGAHIKRMEWVPGSQVQLMVVTNRFVKIYDLSLDNISPMHYF 1980
            TLNHRGEVVDRLAIELALQGA+IKRMEWVPGSQVQLMVVTNRFVKIYDLSLDNISPMHYF
Sbjct: 1921 TLNHRGEVVDRLAIELALQGAYIKRMEWVPGSQVQLMVVTNRFVKIYDLSLDNISPMHYF 1980

Query: 1981 TLPDDMVVDATLSTASQGRMFLIVLSENGRIFRLELSVLGNVGATPLKEIIEIQGREMSA 2040
            TLPDDMVVDATL TASQG+MFLIVLSENGRIFRLELSVLGN+GATPLKEII IQGREMSA
Sbjct: 1981 TLPDDMVVDATLFTASQGKMFLIVLSENGRIFRLELSVLGNIGATPLKEIIHIQGREMSA 2040

Query: 2041 KGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTEISVIYEEEQDRKLRPAGLHRWKE 2100
            KGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTEIS IYEEEQD+KLRPAGLHRWKE
Sbjct: 2041 KGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTEISFIYEEEQDKKLRPAGLHRWKE 2100

Query: 2101 LFAGSGLFVCFSSVKSNSALAVSMGAHDIYAQNLRHAGGSSLPLVGITAYKPLSKDKIHC 2160
            LFAGSGLFVCFSSVKSNSALAVSMGAH+IYAQNLRHAGGSSLPLVGITAYKPLSKDKIHC
Sbjct: 2101 LFAGSGLFVCFSSVKSNSALAVSMGAHEIYAQNLRHAGGSSLPLVGITAYKPLSKDKIHC 2160

Query: 2161 LVLHDDGSLQIYTHTAVGVDASAYATAEKIKKLGSGILNNKVYASTNPEFPLDFFENTVC 2220
            LVLHDDGSLQIYTHTAVGVDASA ATAEKIKKLGSGILNNKVYASTNPEF LDFFE TVC
Sbjct: 2161 LVLHDDGSLQIYTHTAVGVDASANATAEKIKKLGSGILNNKVYASTNPEFALDFFEKTVC 2220

Query: 2221 ITADVRLGGDAIRNGDSEGAKQSLASEDGFLESPSSSGFKITVSNSNPDIVMVGFRIHVG 2280
            ITADVRLGGD IRNGD EGAKQSLASEDGFLESPSSSGFKITVSNSNPDIVMVGFRIHVG
Sbjct: 2221 ITADVRLGGDTIRNGDFEGAKQSLASEDGFLESPSSSGFKITVSNSNPDIVMVGFRIHVG 2280

Query: 2281 NTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLLADEEFSVTVGPAFNGTALPRI 2340
            NTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLLADEEFSVTVGPAFNGTALPRI
Sbjct: 2281 NTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLLADEEFSVTVGPAFNGTALPRI 2340

Query: 2341 DSLEVYGRAKDEFGWKEKLDAVLDMEARALGSNSLLARSGKKRRSIQCAPIQQQVLADGL 2400
            DSLEVYGR KDEFGWKEKLDAVLDMEARALGSNSLLARSGKKRRSIQCAPIQQQVLADGL
Sbjct: 2341 DSLEVYGRGKDEFGWKEKLDAVLDMEARALGSNSLLARSGKKRRSIQCAPIQQQVLADGL 2400

Query: 2401 KVLSSYYLLRRSQGCPKLNDVNQELTKLKCKQLLETIYESDREPLLQSAACRVLQAIFPK 2460
            KVLSSYYLL R QGCPKL+DVNQELTKLKCKQLLETIYESDREPLLQSAACRVLQAIFPK
Sbjct: 2401 KVLSSYYLLCRPQGCPKLDDVNQELTKLKCKQLLETIYESDREPLLQSAACRVLQAIFPK 2460

Query: 2461 KEIYYQVKDTMRLTGVVKSTSVLSSRLGVGGAAGGWIIEEFTSQMRAVSKIALHRRSNLA 2520
            KEIYYQVKDTMRL GVVKSTSVLS+RLGVGGAAGGWIIEEFTSQMRAVSKIALHRRSNLA
Sbjct: 2461 KEIYYQVKDTMRLAGVVKSTSVLSTRLGVGGAAGGWIIEEFTSQMRAVSKIALHRRSNLA 2520

Query: 2521 CFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVISSVELIYCYAECLALHGPDTGRH 2580
            CFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVISSVELIYCYAECLALHGPDTGR 
Sbjct: 2521 CFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVISSVELIYCYAECLALHGPDTGRR 2580

Query: 2581 SVAPAVVLFKKLLFSSSEAVQASSSLAISSRLLQVPFPKQTMLATDDGADIPLSAPVPTE 2640
            SVAPAV+LFKKLLFSSSEAVQASSSLAISSRLLQVPFPKQTMLATDDGADIPLSAPV TE
Sbjct: 2581 SVAPAVLLFKKLLFSSSEAVQASSSLAISSRLLQVPFPKQTMLATDDGADIPLSAPVSTE 2640

Query: 2641 TTGTNPQVMIEEDAVASSVQYCCDGCSTVPILRRRWHCTICPDFDLCESCYEVLDADRLP 2700
            T GTNPQV+IEEDA+ASSVQYCCDGCS VPILRRRWHCTICPDFDLCESCYEVLDADRLP
Sbjct: 2641 TPGTNPQVVIEEDAIASSVQYCCDGCSKVPILRRRWHCTICPDFDLCESCYEVLDADRLP 2700

Query: 2701 SPHSRDHPMTAIPIEVDSLGDGNEYHFATEDINDSSLTSLIPDISVKNPVSSIHVLEPAD 2760
            SPHSRDH MTAIPIEV+SLGDGNEYHFATEDINDSSLTS+  DI VKNP SSIHVLEPAD
Sbjct: 2701 SPHSRDHLMTAIPIEVESLGDGNEYHFATEDINDSSLTSVKSDIGVKNPASSIHVLEPAD 2760

Query: 2761 SGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWMETTSGVQAVPVMQLFYRLSSTMG 2820
            SGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWMETTSGVQAVPVMQLFYRLSSTMG
Sbjct: 2761 SGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWMETTSGVQAVPVMQLFYRLSSTMG 2820

Query: 2821 GPFMNSLKSENLNLERLIKWFLDEINLNKPFEAKTRTSFGEVAILVFMFFTLMLRNWHQP 2880
            GPFMNSLKSENLNLERLIKWFLDEINLNKPFEAKTRTSFGEVAILVFMFFTLMLRNWHQP
Sbjct: 2821 GPFMNSLKSENLNLERLIKWFLDEINLNKPFEAKTRTSFGEVAILVFMFFTLMLRNWHQP 2880

Query: 2881 GSDGPGAKPSTTTDTHDKNSTQVAPSTSVTAQSSMDDQGKNDFTSQLLRACSSIRQQSFV 2940
            GSDG GAK STT D HDKNSTQVAPSTS+TAQSS+DDQGKNDFTSQLLRACSSIRQQSFV
Sbjct: 2881 GSDGTGAKSSTTADMHDKNSTQVAPSTSLTAQSSVDDQGKNDFTSQLLRACSSIRQQSFV 2940

Query: 2941 NYLMDVLQQLVHVFKSSTIDYDSGHGFHNGSGCGALLTVRKDLPAGNFSPFFSDSYAKAH 3000
            NYLMDVLQQLVHVFKSSTIDYDSGHGF+NGSGCGALLTVRKDLPAGNFSPFFSDSYAKAH
Sbjct: 2941 NYLMDVLQQLVHVFKSSTIDYDSGHGFNNGSGCGALLTVRKDLPAGNFSPFFSDSYAKAH 3000

Query: 3001 RTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKVYKIYSSKDLKLDAYQDVLCSYI 3060
            RTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKVYKIYSSKDLKLDAYQDVLCSYI
Sbjct: 3001 RTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKVYKIYSSKDLKLDAYQDVLCSYI 3060

Query: 3061 NNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTEVKRLFKYINKVGGFQNPMSYER 3120
            NNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTEVK+LFKY+NKVGGFQNPMSYER
Sbjct: 3061 NNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTEVKKLFKYVNKVGGFQNPMSYER 3120

Query: 3121 SVKIVKCLTTMAEVAAARPRNWQKYCLRHADVLPFLLNGIFYFGEESVVQTLKLLNLAFY 3180
            SVKIVKCLTTMAEVAAARPRNWQKYCLRH DVLPFLLNGIFYFGEESV+QTLKLLNLAFY
Sbjct: 3121 SVKIVKCLTTMAEVAAARPRNWQKYCLRHGDVLPFLLNGIFYFGEESVIQTLKLLNLAFY 3180

Query: 3181 TGKDIGHSVQKSEAGDTGTSTNKSGTQTVDSRKKRKGEDGNDSALEKSYLDMEIMVNIFV 3240
            TGKDIGHS QKSEAGDTGTSTNKSGTQTVD RKK+KGEDG+DSALEKSYLDME MVNIFV
Sbjct: 3181 TGKDIGHSAQKSEAGDTGTSTNKSGTQTVDVRKKKKGEDGSDSALEKSYLDMETMVNIFV 3240

Query: 3241 DKGSNVLSHFIDCFLLEWNSSSVRAETKGVVCGIWHHGKQTFKETLLMALLQKVKTLPMY 3300
            DKGSNVLSHFIDCFLLEWNSSSVRAE KGVVCGIWHHGKQTFKETLLMALLQKVKTLPMY
Sbjct: 3241 DKGSNVLSHFIDCFLLEWNSSSVRAEAKGVVCGIWHHGKQTFKETLLMALLQKVKTLPMY 3300

Query: 3301 GLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSDVIRSIYQTLHSQNELLANHPNS 3360
            GLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSDVIRSIYQTLHSQNELLANHPNS
Sbjct: 3301 GLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSDVIRSIYQTLHSQNELLANHPNS 3360

Query: 3361 RIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLESLKSETKFTDNRIIVKCTGSYT 3420
            RIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLESLKSETKFTDNRIIVKCTGSYT
Sbjct: 3361 RIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLESLKSETKFTDNRIIVKCTGSYT 3420

Query: 3421 IQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNNWSLWKRAKSCHLAFNQTELKVE 3480
            IQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNNWSLWKRAKSCHLAFNQTELKVE
Sbjct: 3421 IQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNNWSLWKRAKSCHLAFNQTELKVE 3480

Query: 3481 FPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPVTDKHGICSNCHENAYQCRQCRN 3540
            FPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPVTDKHGICSNCHENAYQCRQCRN
Sbjct: 3481 FPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPVTDKHGICSNCHENAYQCRQCRN 3540

Query: 3541 INYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNMENDEDMKRGLAAIESESENAHRR 3600
            INYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNMENDEDMKRGL AIESESENAHRR
Sbjct: 3541 INYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNMENDEDMKRGLTAIESESENAHRR 3600

Query: 3601 YQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSLPGPSCKINRKIALLGVLYGEKC 3660
            YQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSLPGPSCKINRKIALLGVLYGEKC
Sbjct: 3601 YQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSLPGPSCKINRKIALLGVLYGEKC 3660

Query: 3661 KAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASRFVISRSPNNCYGCATTFVTQCL 3720
            KAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASRFVISRSPNNCYGCATTFVTQCL
Sbjct: 3661 KAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASRFVISRSPNNCYGCATTFVTQCL 3720

Query: 3721 EILQVLSKHQSSKKQLVSLGILSELFENNIHQGPKTARIQARAVLCSFSEGDVNAVNGLN 3780
            EILQVLSKHQSSKKQLVSLGILSELFENNIHQGPKTARIQARAVLCSFSEGDVNAV+GLN
Sbjct: 3721 EILQVLSKHQSSKKQLVSLGILSELFENNIHQGPKTARIQARAVLCSFSEGDVNAVSGLN 3780

Query: 3781 NLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLADEFWEARLRVVFQLLFSSIKSGA 3840
            NLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLADEFWEARLRVVFQLLFSSIKSGA
Sbjct: 3781 NLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLADEFWEARLRVVFQLLFSSIKSGA 3840

Query: 3841 KHPAIAEHIILPCLRIISQACTPPKSDTVDKEQRMGKLTSVSQNKDENATNISGSFSGPV 3900
            KHPAIAEHII PCLRIISQACTPPKS+TVDKEQR GKLTSVSQNKDENATNISGSFSGPV
Sbjct: 3841 KHPAIAEHIIHPCLRIISQACTPPKSETVDKEQRTGKLTSVSQNKDENATNISGSFSGPV 3900

Query: 3901 SGNKSAPESLEHNWDSSHRTQDIQLLSYAEWEKGASYLDFVRRQYKVSQVCKGTVQRSRT 3960
             GNKSAPESLEHNWDSSH+TQDIQLLSYAEWEKGASYLDFVRRQYKVSQV KGTVQRSRT
Sbjct: 3901 IGNKSAPESLEHNWDSSHKTQDIQLLSYAEWEKGASYLDFVRRQYKVSQVFKGTVQRSRT 3960

Query: 3961 QKGDYLSLKYALKWKRFVCRNAKSDLSAFELGSWVTELVLCACSQSIRSEMCMLISLLCA 4020
            QKGDYLSLKYALKWKRFVCR+A SDLSAFELGSWVTELVLCACSQSIRSEMCMLISLLC+
Sbjct: 3961 QKGDYLSLKYALKWKRFVCRSAISDLSAFELGSWVTELVLCACSQSIRSEMCMLISLLCS 4020

Query: 4021 QSSSRRFRLLDLLVSLLPATLSAGESAAEYFDLLFKMVDSEDARLFLTVRGCLRTICQLI 4080
            QSSSRRFRLLDLLVSLLPATLSAGESAAEYF+LLFKMVDSEDARLFLTVRGCLRTICQLI
Sbjct: 4021 QSSSRRFRLLDLLVSLLPATLSAGESAAEYFELLFKMVDSEDARLFLTVRGCLRTICQLI 4080

Query: 4081 SQEVGNVESLERSLHIDISQGFILHKLIELLGKFLEIPNIRSRYNFFLKHLFFRLLLFKL 4140
            SQEV NVESLERSLHIDISQGFILHKLIELLGKFLEIPNIRS                  
Sbjct: 4081 SQEVSNVESLERSLHIDISQGFILHKLIELLGKFLEIPNIRS------------------ 4140

Query: 4141 IQCDLIVRFMRDNLLSEVLEALIVIRGLVVQKTKLISDCNRLLKDLLDSLLLESNENKRQ 4200
                   RFMRDNLLSEVLEALIVIRGLVVQKTKLISDCNRLLKDLLDSLLLESNENKRQ
Sbjct: 4141 -------RFMRDNLLSEVLEALIVIRGLVVQKTKLISDCNRLLKDLLDSLLLESNENKRQ 4200

Query: 4201 FIRACICGLQIHGEERKGRTCLFILEQLCNLISPSKPEPVYLLVLNKAHTQEEFIRGSMT 4260
            FIRACICGLQ HGEERKGRTCLFILEQLCNLISPSKPEPVYLLVLNKAHTQEEFIRGSMT
Sbjct: 4201 FIRACICGLQNHGEERKGRTCLFILEQLCNLISPSKPEPVYLLVLNKAHTQEEFIRGSMT 4260

Query: 4261 KNPYSSAEIGPLMRDVKNKICHQLDLLGFLEDDYGMELLVAGNIISLDLSIALVYEQVWK 4320
            KNPYSSAEIGPLMRDVKNKICHQLDLL FLEDDYGMELLVAGNIISLDLSIALVYEQVWK
Sbjct: 4261 KNPYSSAEIGPLMRDVKNKICHQLDLLSFLEDDYGMELLVAGNIISLDLSIALVYEQVWK 4320

Query: 4321 KSNQSSNAISNTALISTTAARDSPPMTVTYRLQGLDGEATEPMIKELEEDREESQDPELE 4380
            KSNQSSNAISNTA+ISTTAARDSPPMTVTYRLQGLDGEATEPMIKELEEDREESQDPELE
Sbjct: 4321 KSNQSSNAISNTAIISTTAARDSPPMTVTYRLQGLDGEATEPMIKELEEDREESQDPELE 4380

Query: 4381 FAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVLNLLMHCCKIRENRRALLRLGAL 4440
            FAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVLNLLMHCCKIRENRRALLRLGAL
Sbjct: 4381 FAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVLNLLMHCCKIRENRRALLRLGAL 4440

Query: 4441 GLLLETARRAFSVDAMESAEGILLIVESLTIEANESESISIGQSALTVTSEQTGTGEQAK 4500
            GLLLETARRAFSVDAMESAEGILLIVESLTIEANESESISIGQSALTVTSEQTGTGEQAK
Sbjct: 4441 GLLLETARRAFSVDAMESAEGILLIVESLTIEANESESISIGQSALTVTSEQTGTGEQAK 4500

Query: 4501 KIVLMFLERLSHPFGSKKSNKQQRNTEMVARILPYLTYGEPAAMDALIQHFTPYLNDWDE 4560
            KIVLMFLERLSHPFG KKSNKQQRNTEMVARILPYLTYGEPAAMDALIQHFTPYLNDWDE
Sbjct: 4501 KIVLMFLERLSHPFGFKKSNKQQRNTEMVARILPYLTYGEPAAMDALIQHFTPYLNDWDE 4560

Query: 4561 FDRLQKQHEDNPEDKSISEQAAKQRFTVENFVRVSESLKTSSCGERLKDIILEKGITGLA 4620
            FDRLQKQHEDNP+DKS+SEQAAKQRFTVENFVRVSESLKTSSCGERLKDIILEKGITGLA
Sbjct: 4561 FDRLQKQHEDNPDDKSLSEQAAKQRFTVENFVRVSESLKTSSCGERLKDIILEKGITGLA 4620

Query: 4621 IKHLRDSFAVAGQTGFRSSVEWAFALKRPSIPLILSMLRGLSMGHLATQRCIDEGRILPV 4680
            IKHLRD+FAVAGQTGFRSSVEW FALKRPSIPLILSMLRGLSMGHLATQRCIDEGRILPV
Sbjct: 4621 IKHLRDTFAVAGQTGFRSSVEWGFALKRPSIPLILSMLRGLSMGHLATQRCIDEGRILPV 4680

Query: 4681 LHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDKVRMLRHATRDEMRRLALKNRED 4740
            LHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDKVRMLRHATRDEMRRLALKNRED
Sbjct: 4681 LHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDKVRMLRHATRDEMRRLALKNRED 4740

Query: 4741 MLQGLGMRQVASDGGERIIVSRPALEGLEDVEEEEDGLACMVCREGYSLRPTDLLGVYSY 4800
            MLQ LGMRQVASDGGERIIVSRPALEGLEDVEEEEDGLACMVCREGYSLRPTDLLGVYSY
Sbjct: 4741 MLQRLGMRQVASDGGERIIVSRPALEGLEDVEEEEDGLACMVCREGYSLRPTDLLGVYSY 4800

Query: 4801 SKRVNLGVGTSGSTRGECVYTTVSYFNIIHYQCHQEAKRTDAGLKIPKKEWEGATLRNNE 4860
            SKRVNLGVGTSGS+RGECVYTTVSYFNIIHYQCHQEAKRTDAGLKIPKKEWEGATLRNNE
Sbjct: 4801 SKRVNLGVGTSGSSRGECVYTTVSYFNIIHYQCHQEAKRTDAGLKIPKKEWEGATLRNNE 4860

Query: 4861 SLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADGNRLRLLTYDIVLV 4872
            SLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADGNRLRLLTYDIVL+
Sbjct: 4861 SLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADGNRLRLLTYDIVLL 4886

BLAST of HG10018299 vs. ExPASy TrEMBL
Match: A0A6J1FM06 (auxin transport protein BIG-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111445189 PE=3 SV=1)

HSP 1 Score: 8957.0 bits (23241), Expect = 0.0e+00
Identity = 4548/4872 (93.35%), Postives = 4684/4872 (96.14%), Query Frame = 0

Query: 1    MAEQSFVKLLDTIFLDDSSTSANTRKHFSSSDLLQLLRSDDSSIKLGLRQFYSILKAGLR 60
            MAEQS VKL+DTIFLDDSS+SANT+K FSSSDLLQLLRSDDSS KLGLRQFYSILKAGLR
Sbjct: 1    MAEQSLVKLIDTIFLDDSSSSANTKKPFSSSDLLQLLRSDDSSFKLGLRQFYSILKAGLR 60

Query: 61   DLGDGNFAFQSWTDPQIQAVCSIAHAIASASRSLTVDQAEAIVVAVIKKSLELVFCYLEK 120
            DLGDG  AFQSW D QIQAVCSIAHAIASASR+LTVDQAEAIVVAVIKKSLEL+FCYLEK
Sbjct: 61   DLGDGKLAFQSWNDSQIQAVCSIAHAIASASRALTVDQAEAIVVAVIKKSLELLFCYLEK 120

Query: 121  SEFKCDDFSIQNNMLMILETILVDGMDKVSDFAQLCAKKSLMDLLKSTGGDCDATIEFDN 180
            SEFKCDDFSIQNNML+I+ETILVDGMDKVSD AQLCAKK L++LLK TG DCD +IEFDN
Sbjct: 121  SEFKCDDFSIQNNMLLIMETILVDGMDKVSDCAQLCAKKGLIELLKFTGADCDVSIEFDN 180

Query: 181  TIECGSTGVCCSREEKQVGRLLMTIAAECVQADQLTSESGFSQPTFLEDMNKLIFLCQHW 240
             IECG  GVCCSREEKQVGRLLMTIAAECVQADQLT+ESGFS+PTFLED+NKLI  CQHW
Sbjct: 181  HIECGFAGVCCSREEKQVGRLLMTIAAECVQADQLTTESGFSEPTFLEDINKLILFCQHW 240

Query: 241  AVTHLACIQHLILICKELVVLPDALDEKTGSTSFRKRLSCSLRILKLLTDLSKKFPYIEY 300
            A+THLACIQ LILICKELV+LPD LDEKTGSTSFRKRLSCSLRILKLLT+LSKKFP IEY
Sbjct: 241  AITHLACIQRLILICKELVILPDVLDEKTGSTSFRKRLSCSLRILKLLTELSKKFPCIEY 300

Query: 301  DAKMMQAFALFANSLPCLFGLCFEFANSHATVEGSFENTILLLLEEFLELVQVVFRNSYV 360
            DAK+MQAFALFANSLPCLFGLCFEFANSHA VEGSFENTILLLLEE+LELV+VVFRNSYV
Sbjct: 301  DAKLMQAFALFANSLPCLFGLCFEFANSHAIVEGSFENTILLLLEEYLELVKVVFRNSYV 360

Query: 361  SVNIQTCVVASILDNLSSSVWRYDASTANLKTPLVYFPRSVMVIIKLIQDLKGHKYHAFS 420
             VNIQTC+VASILDNLSSSVWR+DAS ANLK PLVYFPRSVMVIIKLIQDLKGHKYHAFS
Sbjct: 361  CVNIQTCIVASILDNLSSSVWRHDASIANLKPPLVYFPRSVMVIIKLIQDLKGHKYHAFS 420

Query: 421  FKDLETHHTSTLADLSVDIPKCYARLEIVPLHKNYKVEEILRMIFPLSKQWMDDLMHLLF 480
            F DLE HHTSTLADLSV+IPKC+ARLEIVPL KNY VEEILRMIFP SKQWMDDLMHLLF
Sbjct: 421  FNDLEMHHTSTLADLSVEIPKCHARLEIVPLQKNYTVEEILRMIFPPSKQWMDDLMHLLF 480

Query: 481  FLYSEGVRLRPKIERSLSS-MKSSSTVEQETAVCHEDEALFGDLFSESGRSVGSVDGYDL 540
            FLYSEGVRLRPKIERSLSS MKSSSTVEQET+VCHEDEALFGDLFSESGRSVGSVDGYDL
Sbjct: 481  FLYSEGVRLRPKIERSLSSCMKSSSTVEQETSVCHEDEALFGDLFSESGRSVGSVDGYDL 540

Query: 541  QHLAVNSTSSFCNLLLQAAKELLSFIKLCIFSPEWNASVFDDGCNKLNQNHIDILLSLLN 600
             HLAVNSTSSFCNLLLQAAKELLSFIK CIFSPEW+ASVFDDGCNKL+QNHIDIL+SLLN
Sbjct: 541  HHLAVNSTSSFCNLLLQAAKELLSFIKQCIFSPEWSASVFDDGCNKLDQNHIDILISLLN 600

Query: 601  CEGCCSDDKSSASCLPAHDERKSGHIHEICYRLLHGLLTRHALPDSLEEYLVKKILNAEN 660
            CEG  SDD+SSASC+PAHDE+KSGHIHEICYRLLHGLLTRH LPD LEEYLVKKILNAEN
Sbjct: 601  CEGFYSDDRSSASCVPAHDEKKSGHIHEICYRLLHGLLTRHVLPDYLEEYLVKKILNAEN 660

Query: 661  GNFVYNDQTLSLLAHTLFRRTGVAGTLLRTQIYRQFVEFIIEKSKTISSNYSSLQEFMGT 720
            GN VYNDQTLSLLA+TLF RTG AGT LRTQIYRQFVEFI EK+KTIS + SSLQEF+GT
Sbjct: 661  GNVVYNDQTLSLLANTLFCRTGTAGTQLRTQIYRQFVEFIAEKAKTISLSNSSLQEFIGT 720

Query: 721  LPSVFHIEILLVAFHLSSEGEKREISSLIFSSIRAIDAPSTFSNCTELSMWGLLVSRLII 780
            LPSVFHIEILLVAFHLSSE EKREISSLIFSSIR IDAPSTFSN TELSMWGLLVSRLII
Sbjct: 721  LPSVFHIEILLVAFHLSSEEEKREISSLIFSSIRTIDAPSTFSNSTELSMWGLLVSRLII 780

Query: 781  VLRHIIFHPHTCSSSLLFDFRSKLRDAPAFSSSLPYTLNDHLSSWGASVAKNIIGSSVES 840
            VLR++IFHPHTCSSSLLFDFRSKLRDAPAFSSS PYT+NDHLSSWGA+VAKNIIGSS+ES
Sbjct: 781  VLRYVIFHPHTCSSSLLFDFRSKLRDAPAFSSSFPYTVNDHLSSWGANVAKNIIGSSMES 840

Query: 841  KPFFHSLINQLIDISSFPASLRQHDLTVECPWFNAGDIFSTFSWILGFWNGKQAVTVEDL 900
            +PFF+ LINQLIDISSFPASLRQHDLT+ECPWFN G+IFSTFSWILGFWNGKQAVTVEDL
Sbjct: 841  EPFFNGLINQLIDISSFPASLRQHDLTIECPWFNPGEIFSTFSWILGFWNGKQAVTVEDL 900

Query: 901  IIERYIFVLCWDFPSMNALSHGGPLWSDPDTLDISNTTCFFYFSYLLLDHGGVIGEHMKF 960
            IIERYIFVLCWDFP  NALSHGG LWSDP+TLDISNTTCFFYFSYLLLDH  +IGE MKF
Sbjct: 901  IIERYIFVLCWDFPYGNALSHGGSLWSDPETLDISNTTCFFYFSYLLLDHSDIIGESMKF 960

Query: 961  PQVVIGLLQRLHGGSILEDFKALGWNFLRNGAWLSLVLSFLSVGIWRYCSKNMIPTVGSL 1020
            PQVVIGLL+RLHGGSILEDFKALGW+FLRNGAWLSL+LSFLSVGI RYCSKN IPTVGS 
Sbjct: 961  PQVVIGLLRRLHGGSILEDFKALGWSFLRNGAWLSLILSFLSVGILRYCSKNTIPTVGSF 1020

Query: 1021 LTDTTVTDNEQANFAESLISSVITDSQVSILIRELSSVLSMYLQVYQKAFVATLSSSNDH 1080
            LTD  VTD EQAN A SLISSVITD+QVSILIRELSSVLSMYLQVYQKAFVATLSS+NDH
Sbjct: 1021 LTDIAVTDIEQANLAGSLISSVITDNQVSILIRELSSVLSMYLQVYQKAFVATLSSTNDH 1080

Query: 1081 ATEFSPLLLFKHSEFDSCVQNKTLENYGTTSCLLESVFNLMSRLDEIVDKRTLGFLSRFC 1140
            A+EFSPLLLFKHSEFD CVQNK LENYGTTSC+LESVF LMSRLDEIVDKR LGFLSR  
Sbjct: 1081 ASEFSPLLLFKHSEFDRCVQNKALENYGTTSCVLESVFKLMSRLDEIVDKRALGFLSRAS 1140

Query: 1141 WESMFHGFPSHLETSSGILLSCVLSIGRIISVLAGLLRIVDVKRNIILETEVTRGILDAV 1200
            WESMFHGFPSHLETSSGILLSCVLSIGRIISVLAGLL IVDVKRNIILETEVT GILDAV
Sbjct: 1141 WESMFHGFPSHLETSSGILLSCVLSIGRIISVLAGLLGIVDVKRNIILETEVTHGILDAV 1200

Query: 1201 MTIKFDKTFESVHGLCEGIYQSLNKELDGCSYGVLFLLKQLEGYLRHMNMRGASDSTIHE 1260
            MTIKFDKTFE VHGLCEGIYQSL+ EL+GC+YGVLFLLKQLEGYLRH+NM GASDSTIHE
Sbjct: 1201 MTIKFDKTFEKVHGLCEGIYQSLSVELEGCAYGVLFLLKQLEGYLRHINMSGASDSTIHE 1260

Query: 1261 LVIVKATDIMDNLRKDVSKSSVFQFYLGAEVVLEQVRELYTFQHGNLLVLLDSLDNCCSE 1320
             VIVKATDI+DNLRKD SKSSVFQFY GAEVV EQVRE Y  QHGNLLVLLDSLDNCCSE
Sbjct: 1261 WVIVKATDIIDNLRKDASKSSVFQFYFGAEVVPEQVREFYGSQHGNLLVLLDSLDNCCSE 1320

Query: 1321 LVNLKVLGFFVELLSGEPCPKLKQEVQNKFLSMDLLSLSQWLEKRIFGLVAEDSSGGNVK 1380
            LVNLKVLGFFVELLSGEPCPKLK E+QNKFLSMDL  LS+WLEKRI G VA+DSSG NVK
Sbjct: 1321 LVNLKVLGFFVELLSGEPCPKLKLEIQNKFLSMDLHGLSKWLEKRILGSVAKDSSGVNVK 1380

Query: 1381 GSSISLRESSMNFVFCLISSPSEPLALQLQSHIFEAALVSLDMAFLRFDISVSKSYFHFV 1440
            GSSISLRESSMNFVFCLISSPSEPLALQLQSHIFEAALVSLD+AFLRFDISV+ SYFHFV
Sbjct: 1381 GSSISLRESSMNFVFCLISSPSEPLALQLQSHIFEAALVSLDLAFLRFDISVANSYFHFV 1440

Query: 1441 VQLLKGDKSMKLLLERILILMGKLASDERLLPGLKYLFSFLEMILIESGSGKNVFERPSG 1500
            VQLL+G+KSMKLLLERIL+LM KLASDERLLPGLKYLFSFLEMILIESGSGKNVFER SG
Sbjct: 1441 VQLLRGEKSMKLLLERILVLMEKLASDERLLPGLKYLFSFLEMILIESGSGKNVFERASG 1500

Query: 1501 KPLSRYAPEVGPLSSKSVGPRKNSETLVLSSNQEEGPASFECDATSAEEDEDDGTSDGEV 1560
            KPLSRYAPE+GPLSSKSVGPRKNSETLVLSSNQE+G ASFECDATS EEDEDDGTSDGEV
Sbjct: 1501 KPLSRYAPEIGPLSSKSVGPRKNSETLVLSSNQEDGRASFECDATSGEEDEDDGTSDGEV 1560

Query: 1561 ASLDKDEEEDTNSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHR 1620
            ASLDKDE+EDTNSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHR
Sbjct: 1561 ASLDKDEDEDTNSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHR 1620

Query: 1621 GHRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRKYTGHGSAPVRGASNFQCFLPFSEEGDQ 1680
            GHRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRK+TGHGS PVRGASNFQCFLPF EEGDQ
Sbjct: 1621 GHRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRKFTGHGSTPVRGASNFQCFLPFCEEGDQ 1680

Query: 1681 LPESESDLEDDVSVTDTDKCLRPSVPRELLDGVSVLLEELDVEGRMLELCSCLLPTITNQ 1740
            LPESESDLEDDV V DTDKCL+P+VPRELLDGVSVLLE+LDVEGRMLELCSCLLP+ITNQ
Sbjct: 1681 LPESESDLEDDV-VPDTDKCLKPAVPRELLDGVSVLLEKLDVEGRMLELCSCLLPSITNQ 1740

Query: 1741 RDPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSL 1800
            RDPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSL
Sbjct: 1741 RDPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGSL 1800

Query: 1801 VKSLLSVSIRGRLAVGEGDKVSIFDIRQLIEQTTVAPMTADKTNVKPLSKNVVRFEIVHL 1860
            VKSLLSVSIRGRLAVGEGDKVSIFD+RQLIEQ TVAPMTADKTNVKPLSKNVVRFEIVHL
Sbjct: 1801 VKSLLSVSIRGRLAVGEGDKVSIFDVRQLIEQATVAPMTADKTNVKPLSKNVVRFEIVHL 1860

Query: 1861 AFNPTVENYLAVAGYEDCQVLTLNHRGEVVDRLAIELALQGAHIKRMEWVPGSQVQLMVV 1920
            AFNPT ENYLAVAG+ED QVLTLNHRGEVVDRLAIELALQGAHIKRMEWVPGSQVQLMVV
Sbjct: 1861 AFNPTTENYLAVAGFEDFQVLTLNHRGEVVDRLAIELALQGAHIKRMEWVPGSQVQLMVV 1920

Query: 1921 TNRFVKIYDLSLDNISPMHYFTLPDDMVVDATLSTASQGRMFLIVLSENGRIFRLELSVL 1980
            TN+FVKIYDLSLDNISPMHYFTLPDDMVVDATL TASQGRMFLIVLSENGRIFR ELSVL
Sbjct: 1921 TNKFVKIYDLSLDNISPMHYFTLPDDMVVDATLFTASQGRMFLIVLSENGRIFRFELSVL 1980

Query: 1981 GNVGATPLKEIIEIQGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTEIS 2040
            GNVGAT LKE I IQGREMSAKGLSLYFSSCYKLLF+AYADGTTLVGQ+SPDATKL+EIS
Sbjct: 1981 GNVGATLLKETIHIQGREMSAKGLSLYFSSCYKLLFVAYADGTTLVGQMSPDATKLSEIS 2040

Query: 2041 VIYEEEQDRKLRPAGLHRWKELFAGSGLFVCFSSVKSNSALAVSMGAHDIYAQNLRHAGG 2100
            VIYEE+QDRKLRPAGL+RWKELFAGSGLFVCFSSVKSNSALA+SMGAH+IYAQNLRHAGG
Sbjct: 2041 VIYEEDQDRKLRPAGLYRWKELFAGSGLFVCFSSVKSNSALAISMGAHEIYAQNLRHAGG 2100

Query: 2101 SSLPLVGITAYKPLSKDKIHCLVLHDDGSLQIYTHTAVGVDASAYATAEKIKKLGSGILN 2160
            SS PLVGITAYKPLSKDKIHCL+LHDDGSLQIYTHT VGVDASA ATAEKIKKLGSGILN
Sbjct: 2101 SSFPLVGITAYKPLSKDKIHCLLLHDDGSLQIYTHTTVGVDASANATAEKIKKLGSGILN 2160

Query: 2161 NKVYASTNPEFPLDFFENTVCITADVRLGGDAIRNGDSEGAKQSLASEDGFLESPSSSGF 2220
            NKVYAS NPEFPLDFFE TVCITADVRLGGDAIRNGDSEGAKQSLASEDGFLESPSSS F
Sbjct: 2161 NKVYASANPEFPLDFFEKTVCITADVRLGGDAIRNGDSEGAKQSLASEDGFLESPSSSDF 2220

Query: 2221 KITVSNSNPDIVMVGFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESLL 2280
            KITVSNSNPDIVMVGFRIHVGN SANHIPSEI+IFQRVIKLDEGMRSWYDIPFTVAESLL
Sbjct: 2221 KITVSNSNPDIVMVGFRIHVGNMSANHIPSEISIFQRVIKLDEGMRSWYDIPFTVAESLL 2280

Query: 2281 ADEEFSVTVGPAFNGTALPRIDSLEVYGRAKDEFGWKEKLDAVLDMEARALGSNSLLARS 2340
            ADEEF++TVGPAFNGTALPRIDSLEVYGRAKDEFGWKEKLDA+LDMEARALGSNSL+A+S
Sbjct: 2281 ADEEFTITVGPAFNGTALPRIDSLEVYGRAKDEFGWKEKLDAILDMEARALGSNSLVAKS 2340

Query: 2341 GKKRRSIQCAPIQQQVLADGLKVLSSYYLLRRSQGCPKLNDVNQELTKLKCKQLLETIYE 2400
            GKKRRSIQCAPIQQQVLADGLKVLSSYYLLRRSQGCPKL+DVNQEL KLKCKQLLETIYE
Sbjct: 2341 GKKRRSIQCAPIQQQVLADGLKVLSSYYLLRRSQGCPKLDDVNQELDKLKCKQLLETIYE 2400

Query: 2401 SDREPLLQSAACRVLQAIFPKKEIYYQVKDTMRLTGVVKSTSVLSSRLGVGGAAGGWIIE 2460
            SDREPLLQSAACRVLQAIFPKKEIYYQVKDTMRLTGVVKS S+LSSRLGVG AA GWIIE
Sbjct: 2401 SDREPLLQSAACRVLQAIFPKKEIYYQVKDTMRLTGVVKSASLLSSRLGVGDAAAGWIIE 2460

Query: 2461 EFTSQMRAVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVISS 2520
            EFTSQM AVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVISS
Sbjct: 2461 EFTSQMHAVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVISS 2520

Query: 2521 VELIYCYAECLALHGPDTGRHSVAPAVVLFKKLLFSSSEAVQASSSLAISSRLLQVPFPK 2580
            VELIYCYAECLALHGPDTGRHSVAPAVVLFKKLLFSSSEAVQ SSSLAISSRLLQVPFPK
Sbjct: 2521 VELIYCYAECLALHGPDTGRHSVAPAVVLFKKLLFSSSEAVQTSSSLAISSRLLQVPFPK 2580

Query: 2581 QTMLATDDGADIPLSAPVPTETTGTNPQVMIEEDAVASSVQYCCDGCSTVPILRRRWHCT 2640
            QTMLATDDGADIPLSAP+ TE TGT+PQVMIEED++ SSVQYCCDGCSTVPILRRRWHCT
Sbjct: 2581 QTMLATDDGADIPLSAPISTEATGTHPQVMIEEDSITSSVQYCCDGCSTVPILRRRWHCT 2640

Query: 2641 ICPDFDLCESCYEVLDADRLPSPHSRDHPMTAIPIEVDSLGDGNEYHFATEDINDSSLTS 2700
            ICPDFDLCESCYEVLDADRLPSPHSRDHPMTAIPIEV+SLGDGNEYHF+TEDINDSSLTS
Sbjct: 2641 ICPDFDLCESCYEVLDADRLPSPHSRDHPMTAIPIEVESLGDGNEYHFSTEDINDSSLTS 2700

Query: 2701 LIPDISVKNPVSSIHVLEPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWMET 2760
            L  DISVKNPVSSIHVL PADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWMET
Sbjct: 2701 LRSDISVKNPVSSIHVLGPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWMET 2760

Query: 2761 TSGVQAVPVMQLFYRLSSTMGGPFMNSLKSENLNLERLIKWFLDEINLNKPFEAKTRTSF 2820
            TSGVQAVP+MQLFYRLSSTMGGPFMNSLKSENL+LERLIKWFLDEINLNKPFEAKTRTSF
Sbjct: 2761 TSGVQAVPIMQLFYRLSSTMGGPFMNSLKSENLDLERLIKWFLDEINLNKPFEAKTRTSF 2820

Query: 2821 GEVAILVFMFFTLMLRNWHQPGSDGPGAKPSTTTDTHDKNSTQVAPSTSVTAQSSMDDQG 2880
            GEVAILVFMFFTLMLRNWHQPGSDGPGAKPSTT DTHDK+STQVAPSTSVTAQSS+DDQG
Sbjct: 2821 GEVAILVFMFFTLMLRNWHQPGSDGPGAKPSTTADTHDKSSTQVAPSTSVTAQSSVDDQG 2880

Query: 2881 KNDFTSQLLRACSSIRQQSFVNYLMDVLQQLVHVFKSSTIDYDSGHGFHNGSGCGALLTV 2940
            KNDFTSQLLRAC SIRQQSFVNYLMDVLQQLVHVFKSS+IDYDSGHGF NGSGCGALLTV
Sbjct: 2881 KNDFTSQLLRACGSIRQQSFVNYLMDVLQQLVHVFKSSSIDYDSGHGFTNGSGCGALLTV 2940

Query: 2941 RKDLPAGNFSPFFSDSYAKAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKVY 3000
            RKDLPAGNFSPFFSDSYAKAHRTDLF+DYHRLLLENAFRLVYTLVRPEKYDKTLEKEK +
Sbjct: 2941 RKDLPAGNFSPFFSDSYAKAHRTDLFVDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKAF 3000

Query: 3001 KIYSSKDLKLDAYQDVLCSYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTEV 3060
            KIYSSKDLKLD YQDVLCSYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTE 
Sbjct: 3001 KIYSSKDLKLDVYQDVLCSYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTEE 3060

Query: 3061 KRLFKYINKVGGFQNPMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRHADVLPFLLNG 3120
            K+LFKYINKVGGFQ PMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRH DVLPFLLNG
Sbjct: 3061 KKLFKYINKVGGFQIPMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRHGDVLPFLLNG 3120

Query: 3121 IFYFGEESVVQTLKLLNLAFYTGKDIGHSVQKSEAGDTGTSTNKSGTQTVDSRKKRKGED 3180
            +FYFGEESV+QTLKLLNLAFYTGKDIGHSVQKSEAGD GTSTNKSGTQTVDSRKKRKGED
Sbjct: 3121 VFYFGEESVIQTLKLLNLAFYTGKDIGHSVQKSEAGDAGTSTNKSGTQTVDSRKKRKGED 3180

Query: 3181 GNDSALEKSYLDMEIMVNIFVDKGSNVLSHFIDCFLLEWNSSSVRAETKGVVCGIWHHGK 3240
            GNDSALEKSYLDME MVNIFV+K +NVLSHFIDCFLLEWNSSS+RAE KGVVCG+WHHGK
Sbjct: 3181 GNDSALEKSYLDMEAMVNIFVEKDNNVLSHFIDCFLLEWNSSSIRAEAKGVVCGVWHHGK 3240

Query: 3241 QTFKETLLMALLQKVKTLPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSDV 3300
            QTFKETLLMALLQKVKTLPMYGLNIAEYTELVTWLLGKVPDV SKQQSSELLDRCLTSDV
Sbjct: 3241 QTFKETLLMALLQKVKTLPMYGLNIAEYTELVTWLLGKVPDVVSKQQSSELLDRCLTSDV 3300

Query: 3301 IRSIYQTLHSQNELLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLES 3360
            IRSIYQTLHSQNELLANHPNSR+YNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLES
Sbjct: 3301 IRSIYQTLHSQNELLANHPNSRMYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLES 3360

Query: 3361 LKSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNNW 3420
            LKSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNNW
Sbjct: 3361 LKSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNNW 3420

Query: 3421 SLWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPVT 3480
            SLWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPVT
Sbjct: 3421 SLWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPVT 3480

Query: 3481 DKHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNMEN 3540
            D+HGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNMEN
Sbjct: 3481 DRHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNMEN 3540

Query: 3541 DEDMKRGLAAIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSLP 3600
            DEDMKRGLAAIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSLP
Sbjct: 3541 DEDMKRGLAAIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSLP 3600

Query: 3601 GPSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASRF 3660
            GPSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRR LMTYLHQKHTDDGFPASRF
Sbjct: 3601 GPSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRALMTYLHQKHTDDGFPASRF 3660

Query: 3661 VISRSPNNCYGCATTFVTQCLEILQVLSKHQSSKKQLVSLGILSELFENNIHQGPKTARI 3720
            VISRSPNNCYGCATTFVTQCLEILQVLSKH+SSKKQLV+LGILSELFENNIHQGPKTARI
Sbjct: 3661 VISRSPNNCYGCATTFVTQCLEILQVLSKHRSSKKQLVNLGILSELFENNIHQGPKTARI 3720

Query: 3721 QARAVLCSFSEGDVNAVNGLNNLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLADE 3780
            QARAVLCSFSE DVNAV+GLN+LIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLADE
Sbjct: 3721 QARAVLCSFSESDVNAVSGLNDLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLADE 3780

Query: 3781 FWEARLRVVFQLLFSSIKSGAKHPAIAEHIILPCLRIISQACTPPKSDTVDKEQRMGKLT 3840
            FWEARLRVVFQLLFSSIKSGAKHPAIAEHIILPCLRIISQACTPPKSDTVDKEQRM KLT
Sbjct: 3781 FWEARLRVVFQLLFSSIKSGAKHPAIAEHIILPCLRIISQACTPPKSDTVDKEQRMAKLT 3840

Query: 3841 SVSQNKDENATNISGSFSGPVSGNKSAPESLEHNWDSSHRTQDIQLLSYAEWEKGASYLD 3900
            SVSQNKDEN+  ISGS SGPVSG+KSA ESLEHNWDSS RTQDIQLLSYAEWEKGASYLD
Sbjct: 3841 SVSQNKDENSKKISGSSSGPVSGSKSASESLEHNWDSSQRTQDIQLLSYAEWEKGASYLD 3900

Query: 3901 FVRRQYKVSQVCKGTVQRSRTQKGDYLSLKYALKWKRFVCRNAKSDLSAFELGSWVTELV 3960
            FVRRQYKVSQV KGTVQR RTQKGDYL LKYALKWKRFVCRNAKSDLSAFELGSWVTELV
Sbjct: 3901 FVRRQYKVSQVFKGTVQRHRTQKGDYLCLKYALKWKRFVCRNAKSDLSAFELGSWVTELV 3960

Query: 3961 LCACSQSIRSEMCMLISLLCAQSSSRRFRLLDLLVSLLPATLSAGESAAEYFDLLFKMVD 4020
            LCACSQSIRSEMCMLISLLCAQSSSRRFRLLDLLVSLLPATLSAGES AEYF+LLFKMVD
Sbjct: 3961 LCACSQSIRSEMCMLISLLCAQSSSRRFRLLDLLVSLLPATLSAGESVAEYFELLFKMVD 4020

Query: 4021 SEDARLFLTVRGCLRTICQLISQEVGNVESLERSLHIDISQGFILHKLIELLGKFLEIPN 4080
            SEDARLFLTVRGCLRTICQLISQEV NVESLERSLHIDISQGFILHKLIELLGKFLEIPN
Sbjct: 4021 SEDARLFLTVRGCLRTICQLISQEVSNVESLERSLHIDISQGFILHKLIELLGKFLEIPN 4080

Query: 4081 IRSRYNFFLKHLFFRLLLFKLIQCDLIVRFMRDNLLSEVLEALIVIRGLVVQKTKLISDC 4140
            IRS                         RFMRDNLLSEVLEALIVIRGLVVQKTKLISDC
Sbjct: 4081 IRS-------------------------RFMRDNLLSEVLEALIVIRGLVVQKTKLISDC 4140

Query: 4141 NRLLKDLLDSLLLESNENKRQFIRACICGLQIHGEERKGRTCLFILEQLCNLISPSKPEP 4200
            N+LLKDLLDSLLLESNENK+QFIRACICGLQIHG ERKGRTCLFILEQLCNLISPSKPEP
Sbjct: 4141 NQLLKDLLDSLLLESNENKKQFIRACICGLQIHGAERKGRTCLFILEQLCNLISPSKPEP 4200

Query: 4201 VYLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKICHQLDLLGFLEDDYGMELL 4260
            VYLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRD+KNKICHQLDLLGFLEDDYGMELL
Sbjct: 4201 VYLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRDIKNKICHQLDLLGFLEDDYGMELL 4260

Query: 4261 VAGNIISLDLSIALVYEQVWKKSNQSSNAISNTALISTTAARDSPPMTVTYRLQGLDGEA 4320
            VAGNII+LDLSIALVYEQVWKKSNQSS+AISNTALIS TAARDS PMTVTYRLQGLDGEA
Sbjct: 4261 VAGNIIALDLSIALVYEQVWKKSNQSSSAISNTALISATAARDSSPMTVTYRLQGLDGEA 4320

Query: 4321 TEPMIKELEEDREESQDPELEFAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVLN 4380
            TEPMIKELEEDREESQDPELEFAIAGAVREYGGLEILL MIQRIWDNFKSNQEQLVAVLN
Sbjct: 4321 TEPMIKELEEDREESQDPELEFAIAGAVREYGGLEILLAMIQRIWDNFKSNQEQLVAVLN 4380

Query: 4381 LLMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESESI 4440
            LLMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESESI
Sbjct: 4381 LLMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESESI 4440

Query: 4441 SIGQSALTVTSEQTGTGEQAKKIVLMFLERLSHPFGSKKSNKQQRNTEMVARILPYLTYG 4500
            SIGQSALT+TSEQTGTGEQAKKIVLMFLERLSHPFG KKSNKQQRNTEMVARILPYLTYG
Sbjct: 4441 SIGQSALTITSEQTGTGEQAKKIVLMFLERLSHPFGYKKSNKQQRNTEMVARILPYLTYG 4500

Query: 4501 EPAAMDALIQHFTPYLNDWDEFDRLQKQHEDNPEDKSISEQAAKQRFTVENFVRVSESLK 4560
            EPAAMDALIQHFTPYLNDWDEFDRLQKQHEDNPEDKSISEQAAKQRFTVENFVRVSESLK
Sbjct: 4501 EPAAMDALIQHFTPYLNDWDEFDRLQKQHEDNPEDKSISEQAAKQRFTVENFVRVSESLK 4560

Query: 4561 TSSCGERLKDIILEKGITGLAIKHLRDSFAVAGQTGFRSSVEWAFALKRPSIPLILSMLR 4620
            TSSCGERLKDIILEKGITGLA+KHLRDSFAVAGQ  FRSS+EWAF+LKRPSIPLILSMLR
Sbjct: 4561 TSSCGERLKDIILEKGITGLAVKHLRDSFAVAGQASFRSSMEWAFSLKRPSIPLILSMLR 4620

Query: 4621 GLSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDKV 4680
            GLSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAENLLDTLS KEGNGDGFLEDKV
Sbjct: 4621 GLSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAENLLDTLSKKEGNGDGFLEDKV 4680

Query: 4681 RMLRHATRDEMRRLALKNREDMLQGLGMRQVASDGGERIIVSRPALEGLEDVEEEEDGLA 4740
            RMLRHAT+DEMRRLALKNRE MLQ LGM  VASDGGERI+VSRPALEGLEDV+EEEDGLA
Sbjct: 4681 RMLRHATKDEMRRLALKNREHMLQVLGMELVASDGGERIVVSRPALEGLEDVKEEEDGLA 4740

Query: 4741 CMVCREGYSLRPTDLLGVYSYSKRVNLGVGTSGSTRGECVYTTVSYFNIIHYQCHQEAKR 4800
            CMVCREGYSLRPTDLLGVYSYSKRVNLGVG SGS+RGECVYTTVSYFNIIHYQCHQEAKR
Sbjct: 4741 CMVCREGYSLRPTDLLGVYSYSKRVNLGVGISGSSRGECVYTTVSYFNIIHYQCHQEAKR 4800

Query: 4801 TDAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADGN 4860
            TDAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVP AQY+RYVDQHWDNLNALGRADGN
Sbjct: 4801 TDAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVPSAQYMRYVDQHWDNLNALGRADGN 4846

Query: 4861 RLRLLTYDIVLV 4872
            RLRLLTYDIVL+
Sbjct: 4861 RLRLLTYDIVLM 4846

BLAST of HG10018299 vs. ExPASy TrEMBL
Match: A0A6J1FGF6 (auxin transport protein BIG-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111445189 PE=3 SV=1)

HSP 1 Score: 8953.2 bits (23231), Expect = 0.0e+00
Identity = 4548/4873 (93.33%), Postives = 4685/4873 (96.14%), Query Frame = 0

Query: 1    MAEQSFVKLLDTIFLDDSSTSANTRKHFSSSDLLQLLRSDDSSIKLGLRQFYSILKAGLR 60
            MAEQS VKL+DTIFLDDSS+SANT+K FSSSDLLQLLRSDDSS KLGLRQFYSILKAGLR
Sbjct: 1    MAEQSLVKLIDTIFLDDSSSSANTKKPFSSSDLLQLLRSDDSSFKLGLRQFYSILKAGLR 60

Query: 61   DLGDGNFAFQSWTDPQIQAVCSIAHAIASASRSLTVDQAEAIVVAVIKKSLELVFCYLEK 120
            DLGDG  AFQSW D QIQAVCSIAHAIASASR+LTVDQAEAIVVAVIKKSLEL+FCYLEK
Sbjct: 61   DLGDGKLAFQSWNDSQIQAVCSIAHAIASASRALTVDQAEAIVVAVIKKSLELLFCYLEK 120

Query: 121  SEFKCDDFSIQNNMLMILETILVDGMDKVSDFAQLCAKKSLMDLLKSTGGDCDATIEFDN 180
            SEFKCDDFSIQNNML+I+ETILVDGMDKVSD AQLCAKK L++LLK TG DCD +IEFDN
Sbjct: 121  SEFKCDDFSIQNNMLLIMETILVDGMDKVSDCAQLCAKKGLIELLKFTGADCDVSIEFDN 180

Query: 181  TIECG-STGVCCSREEKQVGRLLMTIAAECVQADQLTSESGFSQPTFLEDMNKLIFLCQH 240
             IECG + GVCCSREEKQVGRLLMTIAAECVQADQLT+ESGFS+PTFLED+NKLI  CQH
Sbjct: 181  HIECGFAEGVCCSREEKQVGRLLMTIAAECVQADQLTTESGFSEPTFLEDINKLILFCQH 240

Query: 241  WAVTHLACIQHLILICKELVVLPDALDEKTGSTSFRKRLSCSLRILKLLTDLSKKFPYIE 300
            WA+THLACIQ LILICKELV+LPD LDEKTGSTSFRKRLSCSLRILKLLT+LSKKFP IE
Sbjct: 241  WAITHLACIQRLILICKELVILPDVLDEKTGSTSFRKRLSCSLRILKLLTELSKKFPCIE 300

Query: 301  YDAKMMQAFALFANSLPCLFGLCFEFANSHATVEGSFENTILLLLEEFLELVQVVFRNSY 360
            YDAK+MQAFALFANSLPCLFGLCFEFANSHA VEGSFENTILLLLEE+LELV+VVFRNSY
Sbjct: 301  YDAKLMQAFALFANSLPCLFGLCFEFANSHAIVEGSFENTILLLLEEYLELVKVVFRNSY 360

Query: 361  VSVNIQTCVVASILDNLSSSVWRYDASTANLKTPLVYFPRSVMVIIKLIQDLKGHKYHAF 420
            V VNIQTC+VASILDNLSSSVWR+DAS ANLK PLVYFPRSVMVIIKLIQDLKGHKYHAF
Sbjct: 361  VCVNIQTCIVASILDNLSSSVWRHDASIANLKPPLVYFPRSVMVIIKLIQDLKGHKYHAF 420

Query: 421  SFKDLETHHTSTLADLSVDIPKCYARLEIVPLHKNYKVEEILRMIFPLSKQWMDDLMHLL 480
            SF DLE HHTSTLADLSV+IPKC+ARLEIVPL KNY VEEILRMIFP SKQWMDDLMHLL
Sbjct: 421  SFNDLEMHHTSTLADLSVEIPKCHARLEIVPLQKNYTVEEILRMIFPPSKQWMDDLMHLL 480

Query: 481  FFLYSEGVRLRPKIERSLSS-MKSSSTVEQETAVCHEDEALFGDLFSESGRSVGSVDGYD 540
            FFLYSEGVRLRPKIERSLSS MKSSSTVEQET+VCHEDEALFGDLFSESGRSVGSVDGYD
Sbjct: 481  FFLYSEGVRLRPKIERSLSSCMKSSSTVEQETSVCHEDEALFGDLFSESGRSVGSVDGYD 540

Query: 541  LQHLAVNSTSSFCNLLLQAAKELLSFIKLCIFSPEWNASVFDDGCNKLNQNHIDILLSLL 600
            L HLAVNSTSSFCNLLLQAAKELLSFIK CIFSPEW+ASVFDDGCNKL+QNHIDIL+SLL
Sbjct: 541  LHHLAVNSTSSFCNLLLQAAKELLSFIKQCIFSPEWSASVFDDGCNKLDQNHIDILISLL 600

Query: 601  NCEGCCSDDKSSASCLPAHDERKSGHIHEICYRLLHGLLTRHALPDSLEEYLVKKILNAE 660
            NCEG  SDD+SSASC+PAHDE+KSGHIHEICYRLLHGLLTRH LPD LEEYLVKKILNAE
Sbjct: 601  NCEGFYSDDRSSASCVPAHDEKKSGHIHEICYRLLHGLLTRHVLPDYLEEYLVKKILNAE 660

Query: 661  NGNFVYNDQTLSLLAHTLFRRTGVAGTLLRTQIYRQFVEFIIEKSKTISSNYSSLQEFMG 720
            NGN VYNDQTLSLLA+TLF RTG AGT LRTQIYRQFVEFI EK+KTIS + SSLQEF+G
Sbjct: 661  NGNVVYNDQTLSLLANTLFCRTGTAGTQLRTQIYRQFVEFIAEKAKTISLSNSSLQEFIG 720

Query: 721  TLPSVFHIEILLVAFHLSSEGEKREISSLIFSSIRAIDAPSTFSNCTELSMWGLLVSRLI 780
            TLPSVFHIEILLVAFHLSSE EKREISSLIFSSIR IDAPSTFSN TELSMWGLLVSRLI
Sbjct: 721  TLPSVFHIEILLVAFHLSSEEEKREISSLIFSSIRTIDAPSTFSNSTELSMWGLLVSRLI 780

Query: 781  IVLRHIIFHPHTCSSSLLFDFRSKLRDAPAFSSSLPYTLNDHLSSWGASVAKNIIGSSVE 840
            IVLR++IFHPHTCSSSLLFDFRSKLRDAPAFSSS PYT+NDHLSSWGA+VAKNIIGSS+E
Sbjct: 781  IVLRYVIFHPHTCSSSLLFDFRSKLRDAPAFSSSFPYTVNDHLSSWGANVAKNIIGSSME 840

Query: 841  SKPFFHSLINQLIDISSFPASLRQHDLTVECPWFNAGDIFSTFSWILGFWNGKQAVTVED 900
            S+PFF+ LINQLIDISSFPASLRQHDLT+ECPWFN G+IFSTFSWILGFWNGKQAVTVED
Sbjct: 841  SEPFFNGLINQLIDISSFPASLRQHDLTIECPWFNPGEIFSTFSWILGFWNGKQAVTVED 900

Query: 901  LIIERYIFVLCWDFPSMNALSHGGPLWSDPDTLDISNTTCFFYFSYLLLDHGGVIGEHMK 960
            LIIERYIFVLCWDFP  NALSHGG LWSDP+TLDISNTTCFFYFSYLLLDH  +IGE MK
Sbjct: 901  LIIERYIFVLCWDFPYGNALSHGGSLWSDPETLDISNTTCFFYFSYLLLDHSDIIGESMK 960

Query: 961  FPQVVIGLLQRLHGGSILEDFKALGWNFLRNGAWLSLVLSFLSVGIWRYCSKNMIPTVGS 1020
            FPQVVIGLL+RLHGGSILEDFKALGW+FLRNGAWLSL+LSFLSVGI RYCSKN IPTVGS
Sbjct: 961  FPQVVIGLLRRLHGGSILEDFKALGWSFLRNGAWLSLILSFLSVGILRYCSKNTIPTVGS 1020

Query: 1021 LLTDTTVTDNEQANFAESLISSVITDSQVSILIRELSSVLSMYLQVYQKAFVATLSSSND 1080
             LTD  VTD EQAN A SLISSVITD+QVSILIRELSSVLSMYLQVYQKAFVATLSS+ND
Sbjct: 1021 FLTDIAVTDIEQANLAGSLISSVITDNQVSILIRELSSVLSMYLQVYQKAFVATLSSTND 1080

Query: 1081 HATEFSPLLLFKHSEFDSCVQNKTLENYGTTSCLLESVFNLMSRLDEIVDKRTLGFLSRF 1140
            HA+EFSPLLLFKHSEFD CVQNK LENYGTTSC+LESVF LMSRLDEIVDKR LGFLSR 
Sbjct: 1081 HASEFSPLLLFKHSEFDRCVQNKALENYGTTSCVLESVFKLMSRLDEIVDKRALGFLSRA 1140

Query: 1141 CWESMFHGFPSHLETSSGILLSCVLSIGRIISVLAGLLRIVDVKRNIILETEVTRGILDA 1200
             WESMFHGFPSHLETSSGILLSCVLSIGRIISVLAGLL IVDVKRNIILETEVT GILDA
Sbjct: 1141 SWESMFHGFPSHLETSSGILLSCVLSIGRIISVLAGLLGIVDVKRNIILETEVTHGILDA 1200

Query: 1201 VMTIKFDKTFESVHGLCEGIYQSLNKELDGCSYGVLFLLKQLEGYLRHMNMRGASDSTIH 1260
            VMTIKFDKTFE VHGLCEGIYQSL+ EL+GC+YGVLFLLKQLEGYLRH+NM GASDSTIH
Sbjct: 1201 VMTIKFDKTFEKVHGLCEGIYQSLSVELEGCAYGVLFLLKQLEGYLRHINMSGASDSTIH 1260

Query: 1261 ELVIVKATDIMDNLRKDVSKSSVFQFYLGAEVVLEQVRELYTFQHGNLLVLLDSLDNCCS 1320
            E VIVKATDI+DNLRKD SKSSVFQFY GAEVV EQVRE Y  QHGNLLVLLDSLDNCCS
Sbjct: 1261 EWVIVKATDIIDNLRKDASKSSVFQFYFGAEVVPEQVREFYGSQHGNLLVLLDSLDNCCS 1320

Query: 1321 ELVNLKVLGFFVELLSGEPCPKLKQEVQNKFLSMDLLSLSQWLEKRIFGLVAEDSSGGNV 1380
            ELVNLKVLGFFVELLSGEPCPKLK E+QNKFLSMDL  LS+WLEKRI G VA+DSSG NV
Sbjct: 1321 ELVNLKVLGFFVELLSGEPCPKLKLEIQNKFLSMDLHGLSKWLEKRILGSVAKDSSGVNV 1380

Query: 1381 KGSSISLRESSMNFVFCLISSPSEPLALQLQSHIFEAALVSLDMAFLRFDISVSKSYFHF 1440
            KGSSISLRESSMNFVFCLISSPSEPLALQLQSHIFEAALVSLD+AFLRFDISV+ SYFHF
Sbjct: 1381 KGSSISLRESSMNFVFCLISSPSEPLALQLQSHIFEAALVSLDLAFLRFDISVANSYFHF 1440

Query: 1441 VVQLLKGDKSMKLLLERILILMGKLASDERLLPGLKYLFSFLEMILIESGSGKNVFERPS 1500
            VVQLL+G+KSMKLLLERIL+LM KLASDERLLPGLKYLFSFLEMILIESGSGKNVFER S
Sbjct: 1441 VVQLLRGEKSMKLLLERILVLMEKLASDERLLPGLKYLFSFLEMILIESGSGKNVFERAS 1500

Query: 1501 GKPLSRYAPEVGPLSSKSVGPRKNSETLVLSSNQEEGPASFECDATSAEEDEDDGTSDGE 1560
            GKPLSRYAPE+GPLSSKSVGPRKNSETLVLSSNQE+G ASFECDATS EEDEDDGTSDGE
Sbjct: 1501 GKPLSRYAPEIGPLSSKSVGPRKNSETLVLSSNQEDGRASFECDATSGEEDEDDGTSDGE 1560

Query: 1561 VASLDKDEEEDTNSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCH 1620
            VASLDKDE+EDTNSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCH
Sbjct: 1561 VASLDKDEDEDTNSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCH 1620

Query: 1621 RGHRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRKYTGHGSAPVRGASNFQCFLPFSEEGD 1680
            RGHRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRK+TGHGS PVRGASNFQCFLPF EEGD
Sbjct: 1621 RGHRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRKFTGHGSTPVRGASNFQCFLPFCEEGD 1680

Query: 1681 QLPESESDLEDDVSVTDTDKCLRPSVPRELLDGVSVLLEELDVEGRMLELCSCLLPTITN 1740
            QLPESESDLEDDV V DTDKCL+P+VPRELLDGVSVLLE+LDVEGRMLELCSCLLP+ITN
Sbjct: 1681 QLPESESDLEDDV-VPDTDKCLKPAVPRELLDGVSVLLEKLDVEGRMLELCSCLLPSITN 1740

Query: 1741 QRDPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGS 1800
            QRDPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGS
Sbjct: 1741 QRDPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKSHLASGS 1800

Query: 1801 LVKSLLSVSIRGRLAVGEGDKVSIFDIRQLIEQTTVAPMTADKTNVKPLSKNVVRFEIVH 1860
            LVKSLLSVSIRGRLAVGEGDKVSIFD+RQLIEQ TVAPMTADKTNVKPLSKNVVRFEIVH
Sbjct: 1801 LVKSLLSVSIRGRLAVGEGDKVSIFDVRQLIEQATVAPMTADKTNVKPLSKNVVRFEIVH 1860

Query: 1861 LAFNPTVENYLAVAGYEDCQVLTLNHRGEVVDRLAIELALQGAHIKRMEWVPGSQVQLMV 1920
            LAFNPT ENYLAVAG+ED QVLTLNHRGEVVDRLAIELALQGAHIKRMEWVPGSQVQLMV
Sbjct: 1861 LAFNPTTENYLAVAGFEDFQVLTLNHRGEVVDRLAIELALQGAHIKRMEWVPGSQVQLMV 1920

Query: 1921 VTNRFVKIYDLSLDNISPMHYFTLPDDMVVDATLSTASQGRMFLIVLSENGRIFRLELSV 1980
            VTN+FVKIYDLSLDNISPMHYFTLPDDMVVDATL TASQGRMFLIVLSENGRIFR ELSV
Sbjct: 1921 VTNKFVKIYDLSLDNISPMHYFTLPDDMVVDATLFTASQGRMFLIVLSENGRIFRFELSV 1980

Query: 1981 LGNVGATPLKEIIEIQGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDATKLTEI 2040
            LGNVGAT LKE I IQGREMSAKGLSLYFSSCYKLLF+AYADGTTLVGQ+SPDATKL+EI
Sbjct: 1981 LGNVGATLLKETIHIQGREMSAKGLSLYFSSCYKLLFVAYADGTTLVGQMSPDATKLSEI 2040

Query: 2041 SVIYEEEQDRKLRPAGLHRWKELFAGSGLFVCFSSVKSNSALAVSMGAHDIYAQNLRHAG 2100
            SVIYEE+QDRKLRPAGL+RWKELFAGSGLFVCFSSVKSNSALA+SMGAH+IYAQNLRHAG
Sbjct: 2041 SVIYEEDQDRKLRPAGLYRWKELFAGSGLFVCFSSVKSNSALAISMGAHEIYAQNLRHAG 2100

Query: 2101 GSSLPLVGITAYKPLSKDKIHCLVLHDDGSLQIYTHTAVGVDASAYATAEKIKKLGSGIL 2160
            GSS PLVGITAYKPLSKDKIHCL+LHDDGSLQIYTHT VGVDASA ATAEKIKKLGSGIL
Sbjct: 2101 GSSFPLVGITAYKPLSKDKIHCLLLHDDGSLQIYTHTTVGVDASANATAEKIKKLGSGIL 2160

Query: 2161 NNKVYASTNPEFPLDFFENTVCITADVRLGGDAIRNGDSEGAKQSLASEDGFLESPSSSG 2220
            NNKVYAS NPEFPLDFFE TVCITADVRLGGDAIRNGDSEGAKQSLASEDGFLESPSSS 
Sbjct: 2161 NNKVYASANPEFPLDFFEKTVCITADVRLGGDAIRNGDSEGAKQSLASEDGFLESPSSSD 2220

Query: 2221 FKITVSNSNPDIVMVGFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPFTVAESL 2280
            FKITVSNSNPDIVMVGFRIHVGN SANHIPSEI+IFQRVIKLDEGMRSWYDIPFTVAESL
Sbjct: 2221 FKITVSNSNPDIVMVGFRIHVGNMSANHIPSEISIFQRVIKLDEGMRSWYDIPFTVAESL 2280

Query: 2281 LADEEFSVTVGPAFNGTALPRIDSLEVYGRAKDEFGWKEKLDAVLDMEARALGSNSLLAR 2340
            LADEEF++TVGPAFNGTALPRIDSLEVYGRAKDEFGWKEKLDA+LDMEARALGSNSL+A+
Sbjct: 2281 LADEEFTITVGPAFNGTALPRIDSLEVYGRAKDEFGWKEKLDAILDMEARALGSNSLVAK 2340

Query: 2341 SGKKRRSIQCAPIQQQVLADGLKVLSSYYLLRRSQGCPKLNDVNQELTKLKCKQLLETIY 2400
            SGKKRRSIQCAPIQQQVLADGLKVLSSYYLLRRSQGCPKL+DVNQEL KLKCKQLLETIY
Sbjct: 2341 SGKKRRSIQCAPIQQQVLADGLKVLSSYYLLRRSQGCPKLDDVNQELDKLKCKQLLETIY 2400

Query: 2401 ESDREPLLQSAACRVLQAIFPKKEIYYQVKDTMRLTGVVKSTSVLSSRLGVGGAAGGWII 2460
            ESDREPLLQSAACRVLQAIFPKKEIYYQVKDTMRLTGVVKS S+LSSRLGVG AA GWII
Sbjct: 2401 ESDREPLLQSAACRVLQAIFPKKEIYYQVKDTMRLTGVVKSASLLSSRLGVGDAAAGWII 2460

Query: 2461 EEFTSQMRAVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVIS 2520
            EEFTSQM AVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVIS
Sbjct: 2461 EEFTSQMHAVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLEQPNTQTLNNIVIS 2520

Query: 2521 SVELIYCYAECLALHGPDTGRHSVAPAVVLFKKLLFSSSEAVQASSSLAISSRLLQVPFP 2580
            SVELIYCYAECLALHGPDTGRHSVAPAVVLFKKLLFSSSEAVQ SSSLAISSRLLQVPFP
Sbjct: 2521 SVELIYCYAECLALHGPDTGRHSVAPAVVLFKKLLFSSSEAVQTSSSLAISSRLLQVPFP 2580

Query: 2581 KQTMLATDDGADIPLSAPVPTETTGTNPQVMIEEDAVASSVQYCCDGCSTVPILRRRWHC 2640
            KQTMLATDDGADIPLSAP+ TE TGT+PQVMIEED++ SSVQYCCDGCSTVPILRRRWHC
Sbjct: 2581 KQTMLATDDGADIPLSAPISTEATGTHPQVMIEEDSITSSVQYCCDGCSTVPILRRRWHC 2640

Query: 2641 TICPDFDLCESCYEVLDADRLPSPHSRDHPMTAIPIEVDSLGDGNEYHFATEDINDSSLT 2700
            TICPDFDLCESCYEVLDADRLPSPHSRDHPMTAIPIEV+SLGDGNEYHF+TEDINDSSLT
Sbjct: 2641 TICPDFDLCESCYEVLDADRLPSPHSRDHPMTAIPIEVESLGDGNEYHFSTEDINDSSLT 2700

Query: 2701 SLIPDISVKNPVSSIHVLEPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWME 2760
            SL  DISVKNPVSSIHVL PADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWME
Sbjct: 2701 SLRSDISVKNPVSSIHVLGPADSGDFSASVTDPVSISASKQTVNSLLLSELLEQLKGWME 2760

Query: 2761 TTSGVQAVPVMQLFYRLSSTMGGPFMNSLKSENLNLERLIKWFLDEINLNKPFEAKTRTS 2820
            TTSGVQAVP+MQLFYRLSSTMGGPFMNSLKSENL+LERLIKWFLDEINLNKPFEAKTRTS
Sbjct: 2761 TTSGVQAVPIMQLFYRLSSTMGGPFMNSLKSENLDLERLIKWFLDEINLNKPFEAKTRTS 2820

Query: 2821 FGEVAILVFMFFTLMLRNWHQPGSDGPGAKPSTTTDTHDKNSTQVAPSTSVTAQSSMDDQ 2880
            FGEVAILVFMFFTLMLRNWHQPGSDGPGAKPSTT DTHDK+STQVAPSTSVTAQSS+DDQ
Sbjct: 2821 FGEVAILVFMFFTLMLRNWHQPGSDGPGAKPSTTADTHDKSSTQVAPSTSVTAQSSVDDQ 2880

Query: 2881 GKNDFTSQLLRACSSIRQQSFVNYLMDVLQQLVHVFKSSTIDYDSGHGFHNGSGCGALLT 2940
            GKNDFTSQLLRAC SIRQQSFVNYLMDVLQQLVHVFKSS+IDYDSGHGF NGSGCGALLT
Sbjct: 2881 GKNDFTSQLLRACGSIRQQSFVNYLMDVLQQLVHVFKSSSIDYDSGHGFTNGSGCGALLT 2940

Query: 2941 VRKDLPAGNFSPFFSDSYAKAHRTDLFIDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKV 3000
            VRKDLPAGNFSPFFSDSYAKAHRTDLF+DYHRLLLENAFRLVYTLVRPEKYDKTLEKEK 
Sbjct: 2941 VRKDLPAGNFSPFFSDSYAKAHRTDLFVDYHRLLLENAFRLVYTLVRPEKYDKTLEKEKA 3000

Query: 3001 YKIYSSKDLKLDAYQDVLCSYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTE 3060
            +KIYSSKDLKLD YQDVLCSYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTE
Sbjct: 3001 FKIYSSKDLKLDVYQDVLCSYINNPNTSFVRRYARRLFLHICGSKSHYYSIRDSWQFSTE 3060

Query: 3061 VKRLFKYINKVGGFQNPMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRHADVLPFLLN 3120
             K+LFKYINKVGGFQ PMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRH DVLPFLLN
Sbjct: 3061 EKKLFKYINKVGGFQIPMSYERSVKIVKCLTTMAEVAAARPRNWQKYCLRHGDVLPFLLN 3120

Query: 3121 GIFYFGEESVVQTLKLLNLAFYTGKDIGHSVQKSEAGDTGTSTNKSGTQTVDSRKKRKGE 3180
            G+FYFGEESV+QTLKLLNLAFYTGKDIGHSVQKSEAGD GTSTNKSGTQTVDSRKKRKGE
Sbjct: 3121 GVFYFGEESVIQTLKLLNLAFYTGKDIGHSVQKSEAGDAGTSTNKSGTQTVDSRKKRKGE 3180

Query: 3181 DGNDSALEKSYLDMEIMVNIFVDKGSNVLSHFIDCFLLEWNSSSVRAETKGVVCGIWHHG 3240
            DGNDSALEKSYLDME MVNIFV+K +NVLSHFIDCFLLEWNSSS+RAE KGVVCG+WHHG
Sbjct: 3181 DGNDSALEKSYLDMEAMVNIFVEKDNNVLSHFIDCFLLEWNSSSIRAEAKGVVCGVWHHG 3240

Query: 3241 KQTFKETLLMALLQKVKTLPMYGLNIAEYTELVTWLLGKVPDVGSKQQSSELLDRCLTSD 3300
            KQTFKETLLMALLQKVKTLPMYGLNIAEYTELVTWLLGKVPDV SKQQSSELLDRCLTSD
Sbjct: 3241 KQTFKETLLMALLQKVKTLPMYGLNIAEYTELVTWLLGKVPDVVSKQQSSELLDRCLTSD 3300

Query: 3301 VIRSIYQTLHSQNELLANHPNSRIYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLE 3360
            VIRSIYQTLHSQNELLANHPNSR+YNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLE
Sbjct: 3301 VIRSIYQTLHSQNELLANHPNSRMYNTLSGLVEFDGYYLESEPCAACSSPEVPYSRMKLE 3360

Query: 3361 SLKSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNN 3420
            SLKSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNN
Sbjct: 3361 SLKSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVLNLYYNNRPVADLSELKNN 3420

Query: 3421 WSLWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPV 3480
            WSLWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPV
Sbjct: 3421 WSLWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYENLQALSLEPLQCPRCSRPV 3480

Query: 3481 TDKHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNME 3540
            TD+HGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNME
Sbjct: 3481 TDRHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYGRFEFNFMAKPSFTFDNME 3540

Query: 3541 NDEDMKRGLAAIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSL 3600
            NDEDMKRGLAAIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSL
Sbjct: 3541 NDEDMKRGLAAIESESENAHRRYQQLLGYKKPLLKIVSSIGENEMDSQQKDSVQQMMVSL 3600

Query: 3601 PGPSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRVLMTYLHQKHTDDGFPASR 3660
            PGPSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRR LMTYLHQKHTDDGFPASR
Sbjct: 3601 PGPSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRALMTYLHQKHTDDGFPASR 3660

Query: 3661 FVISRSPNNCYGCATTFVTQCLEILQVLSKHQSSKKQLVSLGILSELFENNIHQGPKTAR 3720
            FVISRSPNNCYGCATTFVTQCLEILQVLSKH+SSKKQLV+LGILSELFENNIHQGPKTAR
Sbjct: 3661 FVISRSPNNCYGCATTFVTQCLEILQVLSKHRSSKKQLVNLGILSELFENNIHQGPKTAR 3720

Query: 3721 IQARAVLCSFSEGDVNAVNGLNNLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLAD 3780
            IQARAVLCSFSE DVNAV+GLN+LIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLAD
Sbjct: 3721 IQARAVLCSFSESDVNAVSGLNDLIQKKVMYCLEHHRSMDIALATREELSLLSEVCSLAD 3780

Query: 3781 EFWEARLRVVFQLLFSSIKSGAKHPAIAEHIILPCLRIISQACTPPKSDTVDKEQRMGKL 3840
            EFWEARLRVVFQLLFSSIKSGAKHPAIAEHIILPCLRIISQACTPPKSDTVDKEQRM KL
Sbjct: 3781 EFWEARLRVVFQLLFSSIKSGAKHPAIAEHIILPCLRIISQACTPPKSDTVDKEQRMAKL 3840

Query: 3841 TSVSQNKDENATNISGSFSGPVSGNKSAPESLEHNWDSSHRTQDIQLLSYAEWEKGASYL 3900
            TSVSQNKDEN+  ISGS SGPVSG+KSA ESLEHNWDSS RTQDIQLLSYAEWEKGASYL
Sbjct: 3841 TSVSQNKDENSKKISGSSSGPVSGSKSASESLEHNWDSSQRTQDIQLLSYAEWEKGASYL 3900

Query: 3901 DFVRRQYKVSQVCKGTVQRSRTQKGDYLSLKYALKWKRFVCRNAKSDLSAFELGSWVTEL 3960
            DFVRRQYKVSQV KGTVQR RTQKGDYL LKYALKWKRFVCRNAKSDLSAFELGSWVTEL
Sbjct: 3901 DFVRRQYKVSQVFKGTVQRHRTQKGDYLCLKYALKWKRFVCRNAKSDLSAFELGSWVTEL 3960

Query: 3961 VLCACSQSIRSEMCMLISLLCAQSSSRRFRLLDLLVSLLPATLSAGESAAEYFDLLFKMV 4020
            VLCACSQSIRSEMCMLISLLCAQSSSRRFRLLDLLVSLLPATLSAGES AEYF+LLFKMV
Sbjct: 3961 VLCACSQSIRSEMCMLISLLCAQSSSRRFRLLDLLVSLLPATLSAGESVAEYFELLFKMV 4020

Query: 4021 DSEDARLFLTVRGCLRTICQLISQEVGNVESLERSLHIDISQGFILHKLIELLGKFLEIP 4080
            DSEDARLFLTVRGCLRTICQLISQEV NVESLERSLHIDISQGFILHKLIELLGKFLEIP
Sbjct: 4021 DSEDARLFLTVRGCLRTICQLISQEVSNVESLERSLHIDISQGFILHKLIELLGKFLEIP 4080

Query: 4081 NIRSRYNFFLKHLFFRLLLFKLIQCDLIVRFMRDNLLSEVLEALIVIRGLVVQKTKLISD 4140
            NIRS                         RFMRDNLLSEVLEALIVIRGLVVQKTKLISD
Sbjct: 4081 NIRS-------------------------RFMRDNLLSEVLEALIVIRGLVVQKTKLISD 4140

Query: 4141 CNRLLKDLLDSLLLESNENKRQFIRACICGLQIHGEERKGRTCLFILEQLCNLISPSKPE 4200
            CN+LLKDLLDSLLLESNENK+QFIRACICGLQIHG ERKGRTCLFILEQLCNLISPSKPE
Sbjct: 4141 CNQLLKDLLDSLLLESNENKKQFIRACICGLQIHGAERKGRTCLFILEQLCNLISPSKPE 4200

Query: 4201 PVYLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKICHQLDLLGFLEDDYGMEL 4260
            PVYLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRD+KNKICHQLDLLGFLEDDYGMEL
Sbjct: 4201 PVYLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRDIKNKICHQLDLLGFLEDDYGMEL 4260

Query: 4261 LVAGNIISLDLSIALVYEQVWKKSNQSSNAISNTALISTTAARDSPPMTVTYRLQGLDGE 4320
            LVAGNII+LDLSIALVYEQVWKKSNQSS+AISNTALIS TAARDS PMTVTYRLQGLDGE
Sbjct: 4261 LVAGNIIALDLSIALVYEQVWKKSNQSSSAISNTALISATAARDSSPMTVTYRLQGLDGE 4320

Query: 4321 ATEPMIKELEEDREESQDPELEFAIAGAVREYGGLEILLGMIQRIWDNFKSNQEQLVAVL 4380
            ATEPMIKELEEDREESQDPELEFAIAGAVREYGGLEILL MIQRIWDNFKSNQEQLVAVL
Sbjct: 4321 ATEPMIKELEEDREESQDPELEFAIAGAVREYGGLEILLAMIQRIWDNFKSNQEQLVAVL 4380

Query: 4381 NLLMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESES 4440
            NLLMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESES
Sbjct: 4381 NLLMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMESAEGILLIVESLTIEANESES 4440

Query: 4441 ISIGQSALTVTSEQTGTGEQAKKIVLMFLERLSHPFGSKKSNKQQRNTEMVARILPYLTY 4500
            ISIGQSALT+TSEQTGTGEQAKKIVLMFLERLSHPFG KKSNKQQRNTEMVARILPYLTY
Sbjct: 4441 ISIGQSALTITSEQTGTGEQAKKIVLMFLERLSHPFGYKKSNKQQRNTEMVARILPYLTY 4500

Query: 4501 GEPAAMDALIQHFTPYLNDWDEFDRLQKQHEDNPEDKSISEQAAKQRFTVENFVRVSESL 4560
            GEPAAMDALIQHFTPYLNDWDEFDRLQKQHEDNPEDKSISEQAAKQRFTVENFVRVSESL
Sbjct: 4501 GEPAAMDALIQHFTPYLNDWDEFDRLQKQHEDNPEDKSISEQAAKQRFTVENFVRVSESL 4560

Query: 4561 KTSSCGERLKDIILEKGITGLAIKHLRDSFAVAGQTGFRSSVEWAFALKRPSIPLILSML 4620
            KTSSCGERLKDIILEKGITGLA+KHLRDSFAVAGQ  FRSS+EWAF+LKRPSIPLILSML
Sbjct: 4561 KTSSCGERLKDIILEKGITGLAVKHLRDSFAVAGQASFRSSMEWAFSLKRPSIPLILSML 4620

Query: 4621 RGLSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAENLLDTLSNKEGNGDGFLEDK 4680
            RGLSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAENLLDTLS KEGNGDGFLEDK
Sbjct: 4621 RGLSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAENLLDTLSKKEGNGDGFLEDK 4680

Query: 4681 VRMLRHATRDEMRRLALKNREDMLQGLGMRQVASDGGERIIVSRPALEGLEDVEEEEDGL 4740
            VRMLRHAT+DEMRRLALKNRE MLQ LGM  VASDGGERI+VSRPALEGLEDV+EEEDGL
Sbjct: 4681 VRMLRHATKDEMRRLALKNREHMLQVLGMELVASDGGERIVVSRPALEGLEDVKEEEDGL 4740

Query: 4741 ACMVCREGYSLRPTDLLGVYSYSKRVNLGVGTSGSTRGECVYTTVSYFNIIHYQCHQEAK 4800
            ACMVCREGYSLRPTDLLGVYSYSKRVNLGVG SGS+RGECVYTTVSYFNIIHYQCHQEAK
Sbjct: 4741 ACMVCREGYSLRPTDLLGVYSYSKRVNLGVGISGSSRGECVYTTVSYFNIIHYQCHQEAK 4800

Query: 4801 RTDAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVPLAQYIRYVDQHWDNLNALGRADG 4860
            RTDAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVP AQY+RYVDQHWDNLNALGRADG
Sbjct: 4801 RTDAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVPSAQYMRYVDQHWDNLNALGRADG 4847

Query: 4861 NRLRLLTYDIVLV 4872
            NRLRLLTYDIVL+
Sbjct: 4861 NRLRLLTYDIVLM 4847

BLAST of HG10018299 vs. TAIR 10
Match: AT3G02260.1 (auxin transport protein (BIG) )

HSP 1 Score: 6129.7 bits (15901), Expect = 0.0e+00
Identity = 3189/4894 (65.16%), Postives = 3816/4894 (77.97%), Query Frame = 0

Query: 14   FLDDSSTSANTRKHFSSSDLLQLLRSDDSSIKLGLRQFYSILKAGLRDLG------DGNF 73
            FL D +   +     SS    + LRSDD SIK GLR FY +L+ G+  +G       G  
Sbjct: 11   FLFDDTAFPSLSSSASSDLFSRRLRSDD-SIKRGLRSFYLLLRWGVAPIGGDDADSSGKL 70

Query: 74   AFQSWTDPQIQAVCSIAHAIASASRSL----------TVDQAEAIVVAVIKKSLELVFCY 133
             F++W+D Q+QA+ SI+ AI   SRSL           VDQ E IV+ VI++ +E    +
Sbjct: 71   RFETWSDSQLQALVSISQAILLLSRSLLGTDLTLNQGLVDQLEPIVLGVIQEVMEFSLSF 130

Query: 134  LEKSEFKCDDFSIQNNMLMILETILVDGMDKVSDFAQLCAKKSLMDLLKSTGGDCDATIE 193
            LEKS F+ +D  ++ NM ++LE    DG +K  D     +   + +L  +  G+ D  ++
Sbjct: 131  LEKSSFRQNDLKMEINMEILLEIASFDGSEKQYDILPDFSPAEVAELWPAFSGEHD-NMD 190

Query: 194  FDNTIECGSTGVCCSREEKQVGRLLMTIAAECVQADQLTSESGFSQPTFLEDMNKLIFLC 253
              + ++C   G  CS EEK V RLL+T+ +EC+++D + ++S    P F +D   L    
Sbjct: 191  AQSLVKCTFQGGRCSNEEKPVDRLLITLMSECIESD-VQAQSVVKSP-FQQDCGDLNPFT 250

Query: 254  QHWAVTHLACIQHLILICKELVVLPDALDEKTGSTSFRKRLSCSLRILKLLTDLSKKFPY 313
            +H AV HL C+  LI++CKELV LP+ LDEKT   +   +LS  LRILKLL  LSK    
Sbjct: 251  RHLAVVHLRCVCRLIMVCKELVQLPNMLDEKTVDQAVLDKLSFCLRILKLLGSLSKDVQS 310

Query: 314  IEYDAKMMQAFALFANSLPCLFGLCFEFANSHATVEGSFENTILLLLEEFLELVQVVFRN 373
            IE D  ++QA A F ++ P LF + FEF N H   EG+ E+  L L+E FL LVQ++F  
Sbjct: 311  IENDGSLLQAVASFTDAFPKLFRVFFEFTN-HTATEGNIESLSLALVEGFLNLVQLIFGK 370

Query: 374  SYVSVNIQTCVVASILDNLSSSVWRYDASTANLKTPLVYFPRSVMVIIKLIQDLKGHKYH 433
            S V  N+Q CV ASI+ NL SSVWRYD S+ NL  PL YFPRSV+  +KLIQDLK   YH
Sbjct: 371  SSVFQNVQACVAASIVSNLDSSVWRYDGSSCNLTPPLAYFPRSVIYTLKLIQDLKRQPYH 430

Query: 434  AFSFKDLETHHTSTLADLSVDIPKCYARLEIVPLHKNYKVEEILRMIFPLSKQWMDDLMH 493
                + LE+  T      +VD    + R E +PL K + VE+I+R+IFP S QWMD+  H
Sbjct: 431  IHDLRVLESEVTYEDVSSTVDSVYFHLRQEKIPLLKCFTVEDIMRVIFPSSSQWMDNFFH 490

Query: 494  LLFFLYSEGVRLRPKIERSLSSMKSSSTVEQETAVCHEDEALFGDLFSESGRSVGSVDGY 553
            L++FL+ EGV+LRPK+ER+ SS++S+S  E E+ + H+DEALFG+LFSE  RS+ S++  
Sbjct: 491  LVYFLHREGVKLRPKVERTYSSLRSNSFAEVESQISHDDEALFGNLFSEGSRSLCSIEPN 550

Query: 554  DLQHLAVNSTSSFCNLLLQAAKELLSFIKLCIFSPEWNASVFDDGCNKLNQNHIDILLSL 613
            D   ++V+S     NLLLQAAKELL+F++ CI   EW  S+++DGC KL+  HIDILL++
Sbjct: 551  DQPPVSVSS-----NLLLQAAKELLNFLRACILCQEWVPSIYEDGCKKLDTGHIDILLNI 610

Query: 614  LNCEGCCSDDKSSASCLPAHDERKSGHIHEICYRLLHGLLTRHALPDSLEEYLVKKILNA 673
            +   GC  +DK+S       DE + GH   + + LL  LL   AL D LE YL ++IL  
Sbjct: 611  V---GCSIEDKASDGGCMLQDEGRPGH---VAFELLLNLLRSRALSDFLESYLFQQILVV 670

Query: 674  ENGNFVYNDQTLSLLAHTLFRRTGVAGTLLRTQIYRQFVEFIIEKSKTISSNYSSLQEFM 733
            EN +F YND+TL+LLAHTL  R G+AG  LR +IY  FV F+ E+++ I +   SL+E  
Sbjct: 671  ENSDFNYNDKTLALLAHTLLCRPGLAGAQLRAKIYDGFVSFVTERARGICAEALSLKELT 730

Query: 734  GTLPSVFHIEILLVAFHLSSEGEKREISSLIFSSIRAIDAPSTFSNCTELSMWGLLVSRL 793
              LPS FHIEILL+AFHLS+E EK + S+LI S +  +D P+   +  +LS W +L+SRL
Sbjct: 731  ACLPSAFHIEILLMAFHLSNEAEKAKFSNLIASCLHKVDTPAGICDGPQLSSWAMLISRL 790

Query: 794  IIVLRHIIFHPHTCSSSLLFDFRSKLRDAPAFSSSLPYTLNDHLSSWGASVAKNIIGSSV 853
            +++L H++ HP+TC +SL+ D RSKLR+  +  S+L  T+ DHLSSW + VA+ I  S  
Sbjct: 791  LVLLHHMLLHPNTCPTSLMLDLRSKLREVRSCGSNLHVTVGDHLSSWASLVARGITDSWA 850

Query: 854  ESKPFFHSLINQLIDISSFPASLRQHDLTVECPWFNAGDIFSTFSWILGFWNGKQAVTVE 913
            E +   H L++Q+ID S  P + +    T +    + GD+ ++   +LG W GK+A  VE
Sbjct: 851  EDESVSH-LMSQMIDFSPHPPTFQNDVSTAKTLNLDYGDLSASLCRVLGLWKGKKAGKVE 910

Query: 914  DLIIERYIFVLCWDFPSMN-ALSHGGPLWSDPDTLDISNTTCFFYFSYLLLDHGGVIGEH 973
            DL++ERYIF+L  D   +N AL     L  +   +DISN+      S+LL+    V+G +
Sbjct: 911  DLLVERYIFMLSSDIARINCALDSQPSLHVNYQNVDISNSVDLISTSHLLVGDINVVGRN 970

Query: 974  MKFPQVVIGLLQRLHGG--SILEDFKALGWNFLRNGAWLSLVLSFLSVGIWRYCSKNMIP 1033
            ++   ++IG+L +L      ++ED   LGW+++R GAWLSL+L FL  G+W YC+KN   
Sbjct: 971  IELRNILIGVLNQLQAAPEQVVED---LGWDYIREGAWLSLLLYFLDGGVWDYCNKNSCS 1030

Query: 1034 TVGSLLTDTTVTDNEQANFAESLISSVITDSQVSILIRELSSVLSMYLQVYQKAFVATLS 1093
             +     + T  D +    AE ++S ++    ++ L+R LSS++  YL+VY+KAF+AT S
Sbjct: 1031 EIDPFWKECTSVDAKYVAAAEGVVSYLMKTGDIAELLRMLSSLVGKYLRVYKKAFLATFS 1090

Query: 1094 SSNDHATEFSPLLLFKHSEFDSCVQNKTLENYGTTSCLLESVFNLMSRLDEIVDKRTLGF 1153
              N H      LLL KH++F   +Q +     G  S  L+ +F L S+LD + D R  G 
Sbjct: 1091 DWNHHGHSSPSLLLLKHTQFGKSLQGE-YAKIGDNSLHLQCIFYL-SKLDSLGDGRGSGV 1150

Query: 1154 LSRFCWESMFHGFPSHLETSSGILLSCVLSIGRIISVLAGLLRIVDVKRNIILETEVTRG 1213
            L +  WE M HGFP+ L+TSS ILLSC+LSI  I+  + GLL++ + K    ++T V   
Sbjct: 1151 LWKVFWEFMVHGFPTSLQTSSAILLSCILSIRCIVLTINGLLKLGNSKEKFGVDTSVLHQ 1210

Query: 1214 ILDAVMTIKFDKTFESVHGLCEGIYQSLNKELDGCSYGVLFLLKQLEGYLRHMNMRGASD 1273
            +LD++M IKFD+ FES HG CE I+Q++   L       LFL+K +EG++R ++      
Sbjct: 1211 LLDSIMIIKFDQVFESFHGKCEEIHQNICAVLQLPDLTELFLMKDMEGFVRDISAEQIDR 1270

Query: 1274 STIHELVIVKATDIMDNLRKDVSKSSVFQFYLGAEVVLEQVRELYTFQHGNLLVLLDSLD 1333
            S + E VI K  D+MD+L KD SKS +F+FYLG + V E  RE Y  Q G+L V +DSLD
Sbjct: 1271 SQVLEGVITKIVDVMDSLSKDSSKSDIFKFYLGVDAVSEHTREFYELQRGDLSVFIDSLD 1330

Query: 1334 NCCSELVNLKVLGFFVELLSGEPCPKLKQEVQNKFLSMDLLSLSQWLEKRIFGLVAEDSS 1393
             C  E VN+KVL F V+LLS    P L++ VQ KF+ MDL+SLS WLE+R+ G   E+  
Sbjct: 1331 YCSLEPVNIKVLNFLVDLLSVAQSPDLRRRVQQKFIDMDLISLSGWLERRLLGSFVEEID 1390

Query: 1394 G-GNVKGSSISLRESSMNFVFCLISSPSEPLALQLQSHIFEAALVSLDMAFLRFDISVSK 1453
            G    KG+S+  RE++MNF+ CL+SS ++    +LQ+H+FEA L+SLD AFL FDI +S 
Sbjct: 1391 GKKTAKGNSLPFREAAMNFINCLVSSTNDLQTRELQNHLFEALLISLDTAFLSFDIHMSM 1450

Query: 1454 SYFHFVVQLLKGDKSMKLLLERILILMGKLASDERLLPGLKYLFSFLEMILIESGSGKNV 1513
            SYFHFV+QL + D  MK++L+R ++LM KLA++E+LLPGLK++F  +  +L  S    + 
Sbjct: 1451 SYFHFVLQLAREDNLMKMVLKRTIMLMEKLAAEEKLLPGLKFIFGVIGTLL--SNRSPSH 1510

Query: 1514 FERPSGKPLSRYA-PEVGPLSSKSVGPRKNSETLVLSSNQEEGPASFECDATSAEEDEDD 1573
             E   GK L+ Y     GPL  K  G  K S+TL L  +QE    S ECD TS +EDEDD
Sbjct: 1511 GESLCGKSLASYKNTATGPLVPKLSGTTKKSDTLALPVDQEGSSISLECDVTSVDEDEDD 1570

Query: 1574 GTSDGEVASLDKDEEEDTNSERALASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSV 1633
            GTSDGEVASLDK++EED NSER LASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSV
Sbjct: 1571 GTSDGEVASLDKEDEEDANSERYLASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSV 1630

Query: 1634 CAKVCHRGHRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRKYTGHGSAPVRGASNFQCFLP 1693
            CAKVCHRGHRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRKY G+GSAP RG +NFQ FLP
Sbjct: 1631 CAKVCHRGHRVVYSRSSRFFCDCGAGGVRGSSCQCLKPRKYNGNGSAPARGTNNFQSFLP 1690

Query: 1694 FSEEGDQLPESESDLEDDVSVTDTDKCLRPSVPRELLDGVSVLLEELDVEGRMLELCSCL 1753
             SE+ DQL ES+SD+E+D    +    L   +P+E    +S+LLEEL +E R+LEL S L
Sbjct: 1691 LSEDADQLGESDSDVEEDGFGEENHVVL--YIPKETQYKMSLLLEELGIEDRVLELFSSL 1750

Query: 1754 LPTITNQRDPDLSKDKKIILGKDKVLSYGLDLLQLKKAYKGGSLDLKIKAEYANAKELKS 1813
            LP+IT++RD  LSK+K++ LGKDKVLS+  DLLQLKKAYK GSLDLKIKA+Y N+K+LKS
Sbjct: 1751 LPSITSKRDSGLSKEKQVNLGKDKVLSFDTDLLQLKKAYKSGSLDLKIKADYTNSKDLKS 1810

Query: 1814 HLASGSLVKSLLSVSIRGRLAVGEGDKVSIFDIRQLIEQTTVAPMTADKTNVKPLSKNVV 1873
             LA+GSLVKSLLSVS+RGRLAVGEGDKV+IFD+ QLI Q T+AP+ ADK NVKPLS+N+V
Sbjct: 1811 LLANGSLVKSLLSVSVRGRLAVGEGDKVAIFDVGQLIGQATIAPINADKANVKPLSRNIV 1870

Query: 1874 RFEIVHLAFNPTVENYLAVAGYEDCQVLTLNHRGEVVDRLAIELALQGAHIKRMEWVPGS 1933
            RFEIVHL+FNP VENYLAVAG EDCQ+LTLNHRGEV+DRLA+ELALQGA I+R++WVPGS
Sbjct: 1871 RFEIVHLSFNPVVENYLAVAGLEDCQILTLNHRGEVIDRLAVELALQGAFIRRIDWVPGS 1930

Query: 1934 QVQLMVVTNRFVKIYDLSLDNISPMHYFTLPDDMVVDATLSTASQGRMFLIVLSENGRIF 1993
            QVQLMVVTN+FVKIYDLS D+ISP  YFTLP+DM+VDATL  AS+GR+FL+VLSE G ++
Sbjct: 1931 QVQLMVVTNKFVKIYDLSQDSISPTQYFTLPNDMIVDATLFVASRGRVFLLVLSEQGNLY 1990

Query: 1994 RLELSVLGNVGATPLKEIIEIQGREMSAKGLSLYFSSCYKLLFLAYADGTTLVGQLSPDA 2053
            R ELS  GN GATPLKEI++I G++++ KG S+YFS  Y+LLF++Y DG++ +G+LS DA
Sbjct: 1991 RFELSWGGNAGATPLKEIVQIMGKDVTGKGSSVYFSPTYRLLFISYHDGSSFMGRLSSDA 2050

Query: 2054 TKLTEISVIYEEEQDRKLRPAGLHRWKELFAGSGLFVCFSSVKSNSALAVSMGAHDIYAQ 2113
            T LT+ S ++EEE D K R AGLHRWKEL AGSGLF+CFSSVKSN+ LAVS+    + AQ
Sbjct: 2051 TSLTDTSGMFEEESDCKQRVAGLHRWKELLAGSGLFICFSSVKSNAVLAVSLRGDGVCAQ 2110

Query: 2114 NLRHAGGSSLPLVGITAYKPLSKDKIHCLVLHDDGSLQIYTHTAVGVDASAYATAEKIKK 2173
            NLRH  GSS P+VGITAYKPLSKD +HCLVLHDDGSLQIY+H   GVD  +  TAEK+KK
Sbjct: 2111 NLRHPTGSSSPMVGITAYKPLSKDNVHCLVLHDDGSLQIYSHVRSGVDTDSNFTAEKVKK 2170

Query: 2174 LGSGILNNKVYASTNPEFPLDFFENTVCITADVRLGGDAIRNGDSEGAKQSLASEDGFLE 2233
            LGS ILNNK YA   PEFPLDFFE   CITADVRLG DAIRNGDSEGAKQSLASEDGF+E
Sbjct: 2171 LGSKILNNKTYAGAKPEFPLDFFERAFCITADVRLGSDAIRNGDSEGAKQSLASEDGFIE 2230

Query: 2234 SPSSSGFKITVSNSNPDIVMVGFRIHVGNTSANHIPSEITIFQRVIKLDEGMRSWYDIPF 2293
            SPS  GFKI+VSN NPDIVMVG R+HVG TSA+ IPSE+TIFQR IK+DEGMR WYDIPF
Sbjct: 2231 SPSPVGFKISVSNPNPDIVMVGIRMHVGTTSASSIPSEVTIFQRSIKMDEGMRCWYDIPF 2290

Query: 2294 TVAESLLADEEFSVTVGPAFNGTALPRIDSLEVYGRAKDEFGWKEKLDAVLDMEARALGS 2353
            TVAESLLADE+  ++VGP  +GTALPRIDSLEVYGRAKDEFGWKEK+DAVLDMEAR LG 
Sbjct: 2291 TVAESLLADEDVVISVGPTTSGTALPRIDSLEVYGRAKDEFGWKEKMDAVLDMEARVLGH 2350

Query: 2354 NSLLARSGKKRRSIQCAPIQQQVLADGLKVLSSYYLLRRSQGCPKLNDVNQELTKLKCKQ 2413
              LL  S KKR   Q A +++QV+ADGLK+LS YY + R    P+   V   L++LKCKQ
Sbjct: 2351 GLLLPGSSKKRALAQSASMEEQVIADGLKLLSIYYSVCR----PRQEVV---LSELKCKQ 2410

Query: 2414 LLETIYESDREPLLQSAACRVLQAIFPKKEIYYQ-------VKDTMRLTGVVKSTSVLSS 2473
            LLETI+ESDRE LLQ+ ACRVLQ++FP+KEIYYQ       VKDTMRL GVVK TS+LSS
Sbjct: 2411 LLETIFESDRETLLQTTACRVLQSVFPRKEIYYQVMFLPNSVKDTMRLLGVVKVTSILSS 2470

Query: 2474 RLGVGGAAGGWIIEEFTSQMRAVSKIALHRRSNLACFLERNGSQVVDGLMQILWGILDLE 2533
            RLG+ G  GG I+EEF +QMRAVSK+AL R+SN + FLE NGS+VVD LMQ+LWGIL+ E
Sbjct: 2471 RLGILG-TGGSIVEEFNAQMRAVSKVALTRKSNFSVFLEMNGSEVVDNLMQVLWGILESE 2530

Query: 2534 QPNTQTLNNIVISSVELIYCYAECLALHGPDTGRHSVAPAVVLFKKLLFSSSEAVQASSS 2593
              +T T+NN+V+SSVELIY YAECLA  G DTG HSVAPAV L K L+   +E+VQ SS 
Sbjct: 2531 PLDTPTMNNVVMSSVELIYSYAECLASQGKDTGVHSVAPAVQLLKALMLFPNESVQTSSR 2590

Query: 2594 ----LAISSRLLQVPFPKQTMLATDDGADIPLSAPVPTETTGTNPQVMIEEDAVASSVQY 2653
                LAISSRLLQVPFPKQTML TDD  D   +  VP  T G N  VMIEED++ SSVQY
Sbjct: 2591 CVLVLAISSRLLQVPFPKQTMLTTDDLVDNVTTPSVPIRTAGGNTHVMIEEDSITSSVQY 2650

Query: 2654 CCDGCSTVPILRRRWHCTICPDFDLCESCYEVLDADRLPSPHSRDHPMTAIPIEVDSLG- 2713
            CCDGCSTVPILRRRWHCT+CPDFDLCE+CYEVLDADRLP PH+RDHPMTAIPIEV+SLG 
Sbjct: 2651 CCDGCSTVPILRRRWHCTVCPDFDLCEACYEVLDADRLPPPHTRDHPMTAIPIEVESLGA 2710

Query: 2714 DGNEYHFATEDINDSSLTSLIPDISVKNPVSSIHVLEPADSGDFSASVTDPVSISASKQT 2773
            D NE  F+ +++  S++  ++     +    SIHVLEP +S +FSAS+TDP+SISASK+ 
Sbjct: 2711 DTNEIQFSADEVGISNMLPVVTSSIPQASTPSIHVLEPGESAEFSASLTDPISISASKRA 2770

Query: 2774 VNSLLLSELLEQLKGWMETTSGVQAVPVMQLFYRLSSTMGGPFMNSLKSENLNLERLIKW 2833
            VNSL+LSE L++L GWMET SGVQA+PVMQLFYRLSS +GG FM+S K E ++L++LIKW
Sbjct: 2771 VNSLILSEFLQELSGWMETVSGVQAIPVMQLFYRLSSAIGGAFMDSSKPEEISLDKLIKW 2830

Query: 2834 FLDEINLNKPFEAKTRTSFGEVAILVFMFFTLMLRNWHQPGSDGPGAKPSTTTDTHDKNS 2893
             L EINL+KPF A TR+S GE+ ILVFMFFTLMLR+WHQPGSDG  +K   +TD HD+  
Sbjct: 2831 LLGEINLSKPFAASTRSSLGEIVILVFMFFTLMLRSWHQPGSDGSSSKLGGSTDVHDRRI 2890

Query: 2894 TQVAPSTSVTAQSSMDDQGKNDFTSQLLRACSSIRQQSFVNYLMDVLQQLVHVFKSSTID 2953
             Q   ST V  QSS+  Q ++DF SQL+RACS +R Q FVNYLM++LQQLVHVFKS   +
Sbjct: 2891 VQ--SSTVVATQSSLHVQERDDFASQLVRACSCLRNQEFVNYLMNILQQLVHVFKSRAAN 2950

Query: 2954 YDSGHGFHNGSGCGALLTVRKDLPAGNFSPFFSDSYAKAHRTDLFIDYHRLLLENAFRLV 3013
             ++  G  +GSGCGA+LTVR+DLPAGN+SPFFSDSYAKAHR D+F+DYHRLLLEN FRLV
Sbjct: 2951 VEA-RGSSSGSGCGAMLTVRRDLPAGNYSPFFSDSYAKAHRADIFVDYHRLLLENVFRLV 3010

Query: 3014 YTLVRPEKYDKTLEKEKVYKIYSSKDLKLDAYQDVLCSYINNPNTSFVRRYARRLFLHIC 3073
            YTLVRPEK +K  EKEKVY+  SSKDLKLD +QDVLCSYINNP+T+FVRRYARRLFLH+C
Sbjct: 3011 YTLVRPEKQEKMGEKEKVYRNASSKDLKLDGFQDVLCSYINNPHTAFVRRYARRLFLHLC 3070

Query: 3074 GSKSHYYSIRDSWQFSTEVKRLFKYINKVGGFQNPMSYERSVKIVKCLTTMAEVAAARPR 3133
            GSK+ YYS+RDSWQFS EVK L+K++ K GGF+N +SYERSVKIVK L+T+AEVA ARPR
Sbjct: 3071 GSKTQYYSVRDSWQFSNEVKNLYKHVEKSGGFENNVSYERSVKIVKSLSTIAEVAVARPR 3130

Query: 3134 NWQKYCLRHADVLPFLLNGIFYFGEESVVQTLKLLNLAFYTGKDIGHSVQKSEAGDTGTS 3193
            NWQKYCLRH D L FLLNG+F+F EESV+QTLKLLNLAFY GKD+  SVQK+EA +  T 
Sbjct: 3131 NWQKYCLRHGDFLSFLLNGVFHFAEESVIQTLKLLNLAFYQGKDVSSSVQKAEATEVVTG 3190

Query: 3194 TNKSGTQTVDSRKKRKGEDGNDSALEKSYLDMEIMVNIFVDKGSNVLSHFIDCFLLEWNS 3253
            +N+SG+Q+VDS+KK+KGEDG+DS LEK Y+DME +V+IF     ++L  FID FLLEWNS
Sbjct: 3191 SNRSGSQSVDSKKKKKGEDGHDSGLEKLYVDMEGVVDIFSANCGDLLRQFIDFFLLEWNS 3250

Query: 3254 SSVRAETKGVVCGIWHHGKQTFKETLLMALLQKVKTLPMYGLNIAEYTELVTWLLGKVPD 3313
            SSVR E K V+ G+WHHG+ +FKE+LL ALLQKV+ LP YG NI EYTELV+ LL K P+
Sbjct: 3251 SSVRTEAKSVIYGLWHHGRHSFKESLLAALLQKVRYLPAYGQNIVEYTELVSLLLDKAPE 3310

Query: 3314 VGSKQQSSELLDRCLTSDVIRSIYQTLHSQNELLANHPNSRIYNTLSGLVEFDGYYLESE 3373
              SKQ  +EL+DRCL  DVIR  ++TLHSQNEL+ANHPNSRIY+TL  LVEFDGYYLESE
Sbjct: 3311 NNSKQAINELVDRCLNPDVIRCFFETLHSQNELIANHPNSRIYSTLGNLVEFDGYYLESE 3370

Query: 3374 PCAACSSPEVPYSRMKLESLKSETKFTDNRIIVKCTGSYTIQTVIMNVHDARKSKSVKVL 3433
            PC ACSSP+VPYS+MKLESLKSETKFTDNRIIVKCTGSYTIQ+V MNVHDARKSKSVKVL
Sbjct: 3371 PCVACSSPDVPYSKMKLESLKSETKFTDNRIIVKCTGSYTIQSVTMNVHDARKSKSVKVL 3430

Query: 3434 NLYYNNRPVADLSELKNNWSLWKRAKSCHLAFNQTELKVEFPIPITACNFMIELDSFYEN 3493
            NLYYNNRPV+DLSELKNNWSLWKRAKSCHL+FNQTELKVEFPIPITACNFMIELDSFYEN
Sbjct: 3431 NLYYNNRPVSDLSELKNNWSLWKRAKSCHLSFNQTELKVEFPIPITACNFMIELDSFYEN 3490

Query: 3494 LQALSLEPLQCPRCSRPVTDKHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYG 3553
            LQALSLEPLQCPRCSRPVTDKHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYG
Sbjct: 3491 LQALSLEPLQCPRCSRPVTDKHGICSNCHENAYQCRQCRNINYENLDSFLCNECGYSKYG 3550

Query: 3554 RFEFNFMAKPSFTFDNMENDEDMKRGLAAIESESENAHRRYQQLLGYKKPLLKIVSSIGE 3613
            RFEFNFMAKPSF FDNMENDEDMK+GLAAIESESENAH+RYQQLLG+KKPLLKIVSSIGE
Sbjct: 3551 RFEFNFMAKPSFIFDNMENDEDMKKGLAAIESESENAHKRYQQLLGFKKPLLKIVSSIGE 3610

Query: 3614 NEMDSQQKDSVQQMMVSLPGPSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRV 3673
             EMDSQ KD+VQQMM SLPGPSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRV
Sbjct: 3611 TEMDSQHKDTVQQMMASLPGPSCKINRKIALLGVLYGEKCKAAFDSVSKSVQTLQGLRRV 3670

Query: 3674 LMTYLHQKHTDDGFPASRFVISRSPNNCYGCATTFVTQCLEILQVLSKHQSSKKQLVSLG 3733
            LM+YLHQK+++    ASR V+S++PNNCYGCATTFVTQCLEILQVLSKH  S+KQLV+ G
Sbjct: 3671 LMSYLHQKNSNFSSGASRCVVSKTPNNCYGCATTFVTQCLEILQVLSKHPRSRKQLVAAG 3730

Query: 3734 ILSELFENNIHQGPKTARIQARAVLCSFSEGDVNAVNGLNNLIQKKVMYCLEHHRSMDIA 3793
            ILSELFENNIHQGPKTAR QARA L +FSEGD++AVN LNNL+QKK+MYCLEHHRSMDIA
Sbjct: 3731 ILSELFENNIHQGPKTARAQARAALSTFSEGDLSAVNELNNLVQKKIMYCLEHHRSMDIA 3790

Query: 3794 LATREELSLLSEVCSLADEFWEARLRVVFQLLFSSIKSGAKHPAIAEHIILPCLRIISQA 3853
            LATREE+ LLSEVCSL DEFWE+RLR+VFQLLFSSIK GAKHPAI+EHIILPCL+IIS A
Sbjct: 3791 LATREEMLLLSEVCSLTDEFWESRLRLVFQLLFSSIKLGAKHPAISEHIILPCLKIISVA 3850

Query: 3854 CTPPKSDTVDKEQRMGKLTSVSQNKDENATNISGSFSGPVSGNKSAPESLEHNWDSSHRT 3913
            CTPPK DT +KEQ MGK     Q KDENA  +           K + ES E+N + S +T
Sbjct: 3851 CTPPKPDTAEKEQTMGKSAPAVQEKDENAAGVI----------KYSSESEENNLNVSQKT 3910

Query: 3914 QDIQLLSYAEWEKGASYLDFVRRQYKVSQVCKGTVQRSRTQKGDYLSLKYALKWKRFVCR 3973
            +DIQL+SY EWEKGASYLDFVRRQYK SQ  +G  Q+SRT + D+L+LKY L+WKR   R
Sbjct: 3911 RDIQLVSYLEWEKGASYLDFVRRQYKASQSIRGASQKSRTHRSDFLALKYTLRWKRRSSR 3970

Query: 3974 NAKSDLSAFELGSWVTELVLCACSQSIRSEMCMLISLLCAQSSSRRFRLLDLLVSLLPAT 4033
             +K  L AFELGSWVTEL+L ACSQSIRSEMC LISLL AQSS RR+RL++LL+ LLPAT
Sbjct: 3971 TSKGGLQAFELGSWVTELILSACSQSIRSEMCTLISLLAAQSSPRRYRLINLLIGLLPAT 4030

Query: 4034 LSAGESAAEYFDLLFKMVDSEDARLFLTVRGCLRTICQLISQEVGNVESLERSLHIDISQ 4093
            L+AGES+AEYF+LLFKM++++DA LFLTVRGCL TIC+LISQEVGN+ESLERSL IDISQ
Sbjct: 4031 LAAGESSAEYFELLFKMIETQDALLFLTVRGCLTTICKLISQEVGNIESLERSLQIDISQ 4090

Query: 4094 GFILHKLIELLGKFLEIPNIRSRYNFFLKHLFFRLLLFKLIQCDLIVRFMRDNLLSEVLE 4153
            GF LHKL+ELLGKFLE+PNIRS                         RFMRDNLLS VLE
Sbjct: 4091 GFTLHKLLELLGKFLEVPNIRS-------------------------RFMRDNLLSHVLE 4150

Query: 4154 ALIVIRGLVVQKTKLISDCNRLLKDLLDSLLLESNENKRQFIRACICGLQIHGEERKGRT 4213
            ALIVIRGL+VQKTKLI+DCNR LKDLLD LLLES+ENKRQFIRAC+ GLQ H EE KGRT
Sbjct: 4151 ALIVIRGLIVQKTKLINDCNRRLKDLLDGLLLESSENKRQFIRACVSGLQTHAEENKGRT 4210

Query: 4214 CLFILEQLCNLISPSKPEPVYLLVLNKAHTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKI 4273
            CLFILEQLCNLI PSKPE VY+L+LNK+HTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKI
Sbjct: 4211 CLFILEQLCNLICPSKPEAVYMLILNKSHTQEEFIRGSMTKNPYSSAEIGPLMRDVKNKI 4270

Query: 4274 CHQLDLLGFLEDDYGMELLVAGNIISLDLSIALVYEQVWKKSNQSSNAISNTALISTTAA 4333
            C QLDLLG LEDDYGMELLVAGNIISLDLSIA VYE VWKKSNQSS +++N+AL+++ AA
Sbjct: 4271 CQQLDLLGLLEDDYGMELLVAGNIISLDLSIAQVYELVWKKSNQSSTSLTNSALLASNAA 4330

Query: 4334 --RDSPPMTVTYRLQGLDGEATEPMIKELEEDREESQDPELEFAIAGAVREYGGLEILLG 4393
              RD PPMTVTYRLQGLDGEATEPMIKELEEDREESQDPE+EFAIAGAVREYGGLEILL 
Sbjct: 4331 PSRDCPPMTVTYRLQGLDGEATEPMIKELEEDREESQDPEIEFAIAGAVREYGGLEILLD 4390

Query: 4394 MIQRIWDNFKSNQEQLVAVLNLLMHCCKIRENRRALLRLGALGLLLETARRAFSVDAMES 4453
            MI+ + D+FKSNQE++VAVL+LL HCCKIRENRRALLRLGAL LLLETARRAFSVDAME 
Sbjct: 4391 MIKSLQDDFKSNQEEMVAVLDLLNHCCKIRENRRALLRLGALSLLLETARRAFSVDAMEP 4450

Query: 4454 AEGILLIVESLTIEANESESISIGQSALTVTSEQTGTGEQAKKIVLMFLERLSHPFGSKK 4513
            AEGILLIVESLT+EANES+SIS  QSALTV++E+TGT EQAKKIVLMFLERLSHP G KK
Sbjct: 4451 AEGILLIVESLTLEANESDSISAAQSALTVSNEETGTWEQAKKIVLMFLERLSHPSGLKK 4510

Query: 4514 SNKQQRNTEMVARILPYLTYGEPAAMDALIQHFTPYLNDWDEFDRLQKQHEDNPEDKSIS 4573
            SNKQQRNTEMVARILPYLTYGEPAAM+ALI+HF+PYL +W EFD+LQ++HE++P+D SI+
Sbjct: 4511 SNKQQRNTEMVARILPYLTYGEPAAMEALIEHFSPYLQNWSEFDQLQQRHEEDPKDDSIA 4570

Query: 4574 EQAAKQRFTVENFVRVSESLKTSSCGERLKDIILEKGITGLAIKHLRDSFAVAGQTGFRS 4633
            +QAAKQRFTVENFVRVSESLKTSSCGERLKDI+LE GI  +A+KH+++ FA+ GQTGF+S
Sbjct: 4571 QQAAKQRFTVENFVRVSESLKTSSCGERLKDIVLENGIIAVAVKHIKEIFAITGQTGFKS 4630

Query: 4634 SVEWAFALKRPSIPLILSMLRGLSMGHLATQRCIDEGRILPVLHALERVPGENEIGARAE 4693
            S EW  ALK PS+PLILSMLRGLSMGHL TQ CIDEG IL +LHALE V GEN+IGARAE
Sbjct: 4631 SKEWLLALKLPSVPLILSMLRGLSMGHLPTQTCIDEGGILTLLHALEGVSGENDIGARAE 4690

Query: 4694 NLLDTLSNKEGNGDGFLEDKVRMLRHATRDEMRRLALKNREDMLQGLGMRQ-VASDGGER 4753
            NLLDTL++KEG GDGFL +KVR LR AT+DEMRR AL+ RE++LQGLGMRQ ++SDGGER
Sbjct: 4691 NLLDTLADKEGKGDGFLGEKVRALRDATKDEMRRRALRKREELLQGLGMRQELSSDGGER 4750

Query: 4754 IIVSRPALEGLEDVEEEEDGLACMVCREGYSLRPTDLLGVYSYSKRVNLGVGTSGSTRGE 4813
            I+VS+P LEG EDVEEEEDGLACMVCREGY LRP+DLLGVYSYSKRVNLGVG SGS RGE
Sbjct: 4751 IVVSQPILEGFEDVEEEEDGLACMVCREGYKLRPSDLLGVYSYSKRVNLGVGNSGSARGE 4810

Query: 4814 CVYTTVSYFNIIHYQCHQEAKRTDAGLKIPKKEWEGATLRNNESLCNSLFPVRGPSVPLA 4872
            CVYTTVSYFNIIH+QCHQEAKR DA LK PKKEWEGA LRNNESLCNSLFPV+GPSVPLA
Sbjct: 4811 CVYTTVSYFNIIHFQCHQEAKRADAALKNPKKEWEGAMLRNNESLCNSLFPVKGPSVPLA 4832

BLAST of HG10018299 vs. TAIR 10
Match: AT1G55970.1 (histone acetyltransferase of the CBP family 4 )

HSP 1 Score: 53.5 bits (127), Expect = 5.4e-06
Identity = 21/70 (30.00%), Postives = 38/70 (54.29%), Query Frame = 0

Query: 2610 IEEDAVASSVQYCCDGCSTVPILRRRWHCTICPDFDLCESCYEV----LDADRLPSPHSR 2669
            ++ED +   +Q+CC  C+T+ +   RW C  C +F +C+ CYEV    ++ +R P     
Sbjct: 1153 MKEDFIMVHLQHCCKHCTTLMVSGNRWVCNHCKNFQICDKCYEVEQNRINIERHPINQKE 1212

Query: 2670 DHPMTAIPIE 2676
             H +  + I+
Sbjct: 1213 KHALFPVAIK 1222

BLAST of HG10018299 vs. TAIR 10
Match: AT1G16710.1 (histone acetyltransferase of the CBP family 12 )

HSP 1 Score: 51.2 bits (121), Expect = 2.7e-05
Identity = 22/69 (31.88%), Postives = 35/69 (50.72%), Query Frame = 0

Query: 2610 IEEDAVASSVQYCCDGCSTVPILRRRWHCTICPDFDLCESCYEVLD--ADRLPSPHSRDH 2669
            ++ED +   +Q+ C  C T+ +   RW C+ C DF LC+ CYE      DR   P ++  
Sbjct: 1400 MKEDFIMVHLQHSCTHCCTLMVTGNRWVCSQCKDFQLCDGCYEAEQKREDRERHPVNQKD 1459

Query: 2670 PMTAIPIEV 2677
                 P+E+
Sbjct: 1460 KHNIFPVEI 1468

BLAST of HG10018299 vs. TAIR 10
Match: AT1G16710.2 (histone acetyltransferase of the CBP family 12 )

HSP 1 Score: 51.2 bits (121), Expect = 2.7e-05
Identity = 22/69 (31.88%), Postives = 35/69 (50.72%), Query Frame = 0

Query: 2610 IEEDAVASSVQYCCDGCSTVPILRRRWHCTICPDFDLCESCYEVLD--ADRLPSPHSRDH 2669
            ++ED +   +Q+ C  C T+ +   RW C+ C DF LC+ CYE      DR   P ++  
Sbjct: 1371 MKEDFIMVHLQHSCTHCCTLMVTGNRWVCSQCKDFQLCDGCYEAEQKREDRERHPVNQKD 1430

Query: 2670 PMTAIPIEV 2677
                 P+E+
Sbjct: 1431 KHNIFPVEI 1439

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038890252.10.0e+0096.04auxin transport protein BIG isoform X1 [Benincasa hispida][more]
XP_008459406.10.0e+0095.69PREDICTED: auxin transport protein BIG [Cucumis melo][more]
KAA0039419.10.0e+0095.65auxin transport protein BIG [Cucumis melo var. makuwa][more]
XP_004141595.10.0e+0095.28auxin transport protein BIG isoform X1 [Cucumis sativus] >KAE8648950.1 hypotheti... [more]
XP_038890253.10.0e+0096.16auxin transport protein BIG isoform X2 [Benincasa hispida][more]
Match NameE-valueIdentityDescription
Q9SRU20.0e+0065.16Auxin transport protein BIG OS=Arabidopsis thaliana OX=3702 GN=BIG PE=1 SV=2[more]
B9G2A80.0e+0059.27Auxin transport protein BIG OS=Oryza sativa subsp. japonica OX=39947 GN=Os09g024... [more]
Q54QG50.0e+0029.45Probable E3 ubiquitin-protein ligase DDB_G0283893 OS=Dictyostelium discoideum OX... [more]
Q5T4S70.0e+0028.40E3 ubiquitin-protein ligase UBR4 OS=Homo sapiens OX=9606 GN=UBR4 PE=1 SV=1[more]
A2AN080.0e+0028.18E3 ubiquitin-protein ligase UBR4 OS=Mus musculus OX=10090 GN=Ubr4 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A1S3CBB80.0e+0095.69auxin transport protein BIG OS=Cucumis melo OX=3656 GN=LOC103498551 PE=3 SV=1[more]
A0A5A7TDS60.0e+0095.65Auxin transport protein BIG OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaf... [more]
A0A0A0KVU70.0e+0094.50Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G642140 PE=3 SV=1[more]
A0A6J1FM060.0e+0093.35auxin transport protein BIG-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC... [more]
A0A6J1FGF60.0e+0093.33auxin transport protein BIG-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC... [more]
Match NameE-valueIdentityDescription
AT3G02260.10.0e+0065.16auxin transport protein (BIG) [more]
AT1G55970.15.4e-0630.00histone acetyltransferase of the CBP family 4 [more]
AT1G16710.12.7e-0531.88histone acetyltransferase of the CBP family 12 [more]
AT1G16710.22.7e-0531.88histone acetyltransferase of the CBP family 12 [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR003126Zinc finger, UBR-typeSMARTSM00396push_1coord: 1579..1651
e-value: 1.0E-17
score: 74.9
IPR003126Zinc finger, UBR-typePROSITEPS51157ZF_UBRcoord: 1579..1650
score: 8.491323
IPR000433Zinc finger, ZZ-typeSMARTSM00291zz_5coord: 2617..2660
e-value: 1.4E-10
score: 51.2
IPR000433Zinc finger, ZZ-typePFAMPF00569ZZcoord: 2620..2652
e-value: 6.0E-10
score: 38.8
IPR000433Zinc finger, ZZ-typePROSITEPS01357ZF_ZZ_1coord: 2623..2650
IPR000433Zinc finger, ZZ-typePROSITEPS50135ZF_ZZ_2coord: 2617..2651
score: 12.027581
IPR043145Zinc finger, ZZ-type superfamilyGENE3D3.30.60.90coord: 2613..2677
e-value: 1.5E-14
score: 55.5
IPR025704E3 ubiquitin ligase, UBR4PFAMPF13764E3_UbLigase_R4coord: 4219..4871
e-value: 1.5E-264
score: 880.0
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 2845..2878
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 3152..3167
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 3168..3183
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 2841..2878
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 3825..3875
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1516..1538
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1505..1571
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 3835..3870
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1543..1566
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 3149..3183
NoneNo IPR availablePANTHERPTHR21725E3 UBIQUITIN-PROTEIN LIGASE UBR4coord: 230..4871
NoneNo IPR availablePANTHERPTHR21725:SF1E3 UBIQUITIN-PROTEIN LIGASE UBR4coord: 230..4871
NoneNo IPR availableCDDcd02249ZZcoord: 2621..2672
e-value: 1.31229E-14
score: 68.6151
NoneNo IPR availableSUPERFAMILY57850RING/U-boxcoord: 2608..2680
IPR036322WD40-repeat-containing domain superfamilySUPERFAMILY50978WD40 repeat-likecoord: 1811..1976
IPR016024Armadillo-type foldSUPERFAMILY48371ARM repeatcoord: 2881..4204

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10018299.1HG10018299.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0005515 protein binding
molecular_function GO:0008270 zinc ion binding