Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTTCCGTTGATTCCCGACCTTCAACCTTGATTCCATTAGAAGACGCCGGCGAAGGAGAACAAATTGTAAGGAACGGTTTCTACTTCCAAAAGATCAGCAAACCTGTTACCGTCAAGCTCTGCGACTCCATCTTTTATCCCGAAACCCCTCCCTCTCAGCCTCTTGCTCTCTCCGAGAGTTTCGGTCTCGTATTAGGTTTCCAAAACAATAACAAACCTAAATACTACTAACCTAGAGCCCATTGGGCTTTGACTTGATAGAAACATGTAAGTTTTAGAGATTTAACCTTCACAAACTTTATTTTACGAATATTCTAAAATTGGAAAATATTTCAAGATTTTGGCCTAATTTTCAAACAAATTTAACCTTAAGCCTCGTGGTCTCTCTGATTAAAATTTCAATTTTTTTATAATTTAATATATCATAAATAATCATCTAAAATTTATTTAGACCTTATAAAATTGGTTCTTTCACAATTAATGTTGTACATATTATAATTAATATTGGGATATTATTTAATGAGTATTATTTTAGTCTTTCATAATTATTGCTTCAATTTAGAAATAAAATAGGGTTACTCTTTATGTGTTTTGAATTGTATTATGCGTTTATGTAGTACTATAGAAAGTATACTTTTATTGATTTTTGTTTTATCAATATATACTATGTTATATTAATTGTTCAATCTTCTACCATGATATGCATTGTGTTTCTTCATATATATTGTGTGCTATTTTTTTTTTTTTAATATTTTCATATGTAACTAGTTTTTAATAGTACGTTTCACAAATGTAACTACATTGTCATGTATACACATGTATATATTTGAAATTGAGTTATATATTATGCATATTTTGTCATATATACACATGTTTGTTATACTAAGATAAATGATATCTTCATATTATTTGAAATAAACTCACATATATTTGAAATGGAAAACAAAATGTACAAATAAACCTCATATATACTATGTATTATAACCCATGTGACTTTCATGTGCGTGTTTTTGAAAGAAAAAAAAAATTAATGTGCGTAAATATGATATATTTTTTAAAGATTTGATTTTGAAAATAAAACGGTTTTGAAAAAAAAAATCCATTTAATGCATGTGAATCTGATATATTTTCTTTTCCATATGTGTCTTTTCTTATTTCTATTTTTTCTTATTATTATTTTTCAAAATTTCTAAAATTTGTGGAGGATGATATGGACATTTGAGCCATATTTTTATGTAAAATATTTCATTTCATTAAATTATTCATAAAAAAAACAAAAATAAGAATAAGTTTAAAACAATTTTGTAATTAAACTTATTTAATTAGGTTATATGTGTAGAGAGCTCAAGACATATAGCCTAAACTAAATCAACATACTATGAAATACTATTACATTAAATAGTATTTACTCTAACACTGTAGACATGAGAAGGGATGTGGGTTTGTTCTGAAAGAGGAGATGGTAATGGAGCTTGATTTCAATGGGGTTTGATTGTGAGCTTTGGCAATGGCGATGGATGGAAGAGATGAGATGCAAACCAGTTGTGAACACATTTCCCTCTCTTTCTTCTATTTTGGAATTAGGATTCTCGAGAGTACTTCATTTAATTTAAATGTTTTCGAGGGTTAGGGTTGCTCATTCAAACTACAAGAATTTAATCGTTTTTAAAAACATAATCAAACTTAATAAAAAATTTAGTGAATTCCATCCACTTTGAAGAAATTTTGAAATTGAATTATTGAGTTTCAATTTCGAAACCATAGTGAACCCAACTTTTTTTTTTATCTATATAAATATCACTAATTTAAATACTTTATTTGTTAAATACAAATATGTTAAAGAAAAATTGCAAGAGTCATCTTTGACTTTTGATTTAATCCATCATTTCCCTAAACTATATGGTTTGCTACATCCATCAACTATTAATTCGATCATCATTTCCCTAAACTATATGGTTTGCTACACCCATCAACTATTAATTCGATCAATTAGCCTCTGTATTTTTTACTTGTTGCAATACATATTAATATTGATATTGATATATTTTCAAGATTAAAATTTTATTGGGGTGCAGAGTTGTGTTGAGTTGAATTGAGTTAGAAAATCTATGTTTAGGTGTGTTTGGGTGCCGATTTCAGTTAAATTGGAGTATATGTTACAATTTTTTTAAACAAATTGATTCTATATATATATATATGTTTTTTTTCTTTTTTTTTCTTTTCCTTTTTTATCTCCAAGTATGCCCATTTGAATAAAAATTTAACACTAGAAAATGATTAACTTTTTTCTATTCTTAAAACATAACAAATTCGTACAAAGTAAAAATAAACTATTGGTCCCTAAACTTTCAAAGGTAACGATTTAGTCTGTGAAGTTATTAACTAGTACCAATTTGGTCCAACTTTCTTTAGTTTACCGATAGATTTATGAAATGGCGCCATGTGTCATCTTAATTATAAAAAATAAAAGCATTGATGTATTAGAGAAACCAAAGAGGAATTTTTTTCTTTCTTCTTTTTTTTTTCTTTTTTTTTTTTGTGATTCCCCCTCTATCTCTTTCTTCTTCTTCTTCTTCTCTTGCTTTTTCTTCGTTTCCTCCTTTTCTTTATTCAACCTTTTTTTTTTAAAAAAAAAAATAAATTCTCCTGCATCTTCTTGTTCTCCGACTTCTTCTCTAACTTTTTCTTCTTCTCCAACATGAAGAGTCAAAACTCCTACACGACCAACCCGATGAGCGAGCGAGAGAGAGTGAGAGCGAGGGTGTGAGAGAGAGAAAGAGATAGGGTGATTGTGAGTGACGAAAGGAAGCGAGAGCGAGAGGGATAGAATGATTGTGAGTGAAAGTGAGAATGAGAGCGAAAGAGAGTGAGAGTGAGAGCGAAAGTGTGAGAAAGAGCGAGAGAGATAGAGTGATTGTGAGGAAAAGAAGCGAGAGTGAGAGTTAGAGATGCATGAGGGAGAATAATTATGGGGGTTGGGGATAATTAAAACGCCTGGAAAATTGTGTCGATAAAATGGTGGGTAAAACAACGGGTTATCGTTTTTCTTTTATTGTCAAACAAAGGTGGGTTTCATTTTAAAAACCCACTCAACCCGTTGGGTTATCAAACAACCCCAAAGTTTAAACTTTGTGAAGCCATTTAAAAACAAAGATGCTTTAACATTTTGCTCAATTATTGACAAAAAGAACATGAAAAAGCATATACAAAAGAGTAATCATTTAGCTAAAATGGGGTTGTAGAAACAAAAATATGACCAAAACAAAATTATTGTTCGCGTACAAAAAAAAAAGTACGCAATCTCTAACCTTAGGTGCGTGATTGTTAAGTTCTTGCTTTGCAATTTTGTTTTGGTCATAACTCCCTCGATAAAATTTAGTTTTAGCTAAATAACCACTTGTTTTCACATGGTTTTTTGTGTTATTTATAACAATGCTTATAAAAAGAGCAAAATTTAAAAGAATATTGAATATAAATGACCTGAAAACATTTAAACGTAGATAAAATTATATTTTGGAGTATTAAAGTTAAAGATTGTTTAAGTTTTTGTTCAGTTTACAAGCATGGATGGAAATAATGTAAAAAAAATTATGAAAACGAGTGACCATTTAGCTAAATGGAAGTTGTATTGATGGAGATATGACCAAAACAAAATTGTCAAGGGAGAACTAAGCGAATGCGTACATGGATGTCAATGATTGCACTTTTTTTTTTGCCCAAACGTAATCTTGTTTTGGTTGTATATTTGTTGATGCAATTTCGTTTTAACTAAATAACTACTCGTTTTCATATGTTTTTTTTTATGTTCTTTTCAAACATGTTTATAAATTAAGCTAAATGTTAAAACATATTTTCTTTTAAATGACTTCATAGAGTTTAAACTTGATCTCACAGAGTTTAAACTTGATCATTTTTGCTATGTGATGTTATGGTATCATAAGAAGAAACACTAATGGAAGAAACTTAGAAATAAAAAAAGGGAGAGAAAAAAAATTTGTTTAAAACTGACGACAAAATGAATATTTCACATTTTTAAAAAGTAAAAAGACAGAAAAAAAAAAAAAAGAAAAGAAAAAGGAAAAAGAAGAAGATCAATTACATATAAGTTTTTGGATTTTGAGTTAGTATGGGGAATTTTCCACAAACAAATGACAATATTTTGAATATAGTTTGAAAACAACTTTATGCCTATTATTTTTAAACCTACAATATTTATCTTCTTAACCCGCCCGAAGGCTGTTGGAGTGCGATGCTCCTAAAACCCCAGGCGGAACAACATTGGTTGCTTCCAAAAACCTTCTGCAGAGCTTCTCTCTCGCTTCAGAGAGAAGCCTTTTGCAATTCATGGCTTCCGTTGATTCCCGACCTTCAACCTTGATTCCATTAGAAGACGCCGGCGAAGGAGAACAAATTGTAAGGAACGATTTCTACTTCCAGAAGATCGGCAAACCTGTCCCGGTCAAGCTCTGCGACTCCATTTTTGATCCCCAGACCCCTCCCTCTCAGCCTCTTGCTCTCTCCGAGAGTTTCGGTCTCATCTTCGTTGCGCATTTGTCTGGTTGGTAATTTCAATTACTTCCCCATTGTTGTGAATACCATTGAATTTTCTATGATTTTATTTGATATTGAAAAAAATTTTGTTTCCTAAGGGTTTTTTGTGGTGAGGACCAAGGATGTAATTGCTTCGGCCGAGGAGATAAAAAACGGGGGAACTGGTTCTTCTGTCCAGGATTTAAGCATAGTGGATATTTCCATCGGAAAAGTTCACATTCTAACTCTTTCCACGGATGATTCCATTCTTGCTGCCATCGTAGCTGGTGATATTCATCTTTTTTCAGTCCAGTCCCTGCTTGATAAGGTAGTGCTTTTCGCTGGAGCTTGAATATCATAGTATAGTTTCCCTGAAACGTCAATTCGGATATTTGCATCAACGGGTAAAATGTCGCAATCATTAGTTATATTTATTGCACGCCATTCCTTATATGTCATTACTGTTATGCAGGCAAAAACACCCTCTTCTTCTTGTTCATTAACTGATTCCAGTTTCATCAAAGACTTCAAATGGACCAGAAAGTTGGAAGATTCTTATCTGGTTCTTTCAAAGCATGGACAGTTATATCAAGGATCGGCGAATGGGCCTCCTACACATGTGATGCACGATATTGATGCTGGTAGGCTACATACTTTTGTGTAGTAACTTGTGTATAATTCTTATTGACAATTTTAAGTGACGTTTGTTAAAGTTCTTACATTTTTTTTTTAAAAATTACTGTTTACTTCAAGAAAATGGATCTTTTTGCTACATATCTGCATGGGGAGATGCTGTGTTTCCCTGTCTAATTCGCCATTTCTGAGTGCCCCTTCATTAAACTTAAACGATCTCTCTAAAACCTTTCTAGTCAATTCTTCCAGTTACTACATCTATTGTGAAGACCTCTCTTTTCCTTGTATATTTCATTTAATCATTTAAACTTTGTTTCTTATAAAAGGACCCTTTGTTCTATTCATGAGTGTAAGAACGGTGTCTTGGAATGGAAATTTGAAAACAAGGAATCGCTACAACTTAGACCTGGTTGTTATTGGCATTAGGCCTTAGAGAAACAGCAAAGGGCCATTATTGCTGAGACTCAATTGAGGGCTTAGAAATTTTAGGAAGGATGACGAGGGAACACTGCTCATTAGCACAATAATTAGGGTAGCTAGGCCAGAGATAGGAATGTGATTGAAGATAGGATAGTCATAATTGTTGGGCTCTGTTAATTTAACTATTTATCTTAGAGATCTTTATTTGTTCAGGAATTGAAACTCATTTTTATTTTTTGATAGTGAACAATTTTAATATTGCATGTTTTCCCCCCCCTCTTGCATGTACCATGCTCCATGCCAGTGTTTTTGGCATATATTTTCTTCTTCTTTTGTATTCCCTTGCCCCTAATTTTCTGCCATTTTCCCCCCAAAGCGATCTATTAGCATGGATTATTGTCAGCAAAACTTTCTATATGTGGAGCTACTGTTTTTGAAATTCTTTTAAGTCTTATTTTGGATTTTAGTGGAATGCAGTGTGAAAGGGAAATTCATTGCTGTGGCTAAAAAGGATACTCTTACCATTTTCTCACACAAATTCAAAGAACGACTATCCATGTCACTCTTGCTGAGTTCAGGGAATGGTGAAACTGATACGGACTTTACAGTGAAGGGTTCTCTCTCTCTCTCTCTCTCTCTCTCTTCTGTGTGTGTGTAAAAATGATATTTTGCAGAATGTTGAAACATGAAGGTTTAATAATTTTCGTTATTCATTTTTCTATTTGGTTAGGGAGGGGATTTTTCTTGTATTTTACTTTTGTGATGTATCATGAGGTGTCAATCAGGTTGGCTATGAGTTGAGTTTGAGTTGTTAGCTTGGAGTGAAGTACAGTTGGTTTCACCATAGTAATCCCAATAGATACTTTTGCACATTTTAACCCTATGTAGTTTGCTTCCTATGGCAGATAGATTTGATTAGCAATGGTTTTTGTGGTTGAAGAAAATTCTAGATTTGGGTAATTATTATTTTTATATGGGTTATTTGCCTGGAGAGAAATCGTGGAACTTTTGTATCTATTGCTAATTCATACCATGCAAGAGATTGAATACTAGCAAGCATATAAAATCTAACACAAAGGAGCCAAAATAAATCTCTAATTGCTGGGTTCTTGGCTAGTTTGGGAATTTGCCTCTTCCTTCCCCTTCTTTAATTCTTTGTCATTACTATATTATTCCTTTTCCAGAATAAAAGCAATAAAATAACAATTAAATGAGGATTACAAGAGAACAAGGAATTCTAAATTCTCATTAATTCTAAATAATCCAGAAAACTTAGTATAATTAAAAAAAATGTCTGAAAACTGAAATAAATTGGAAACAATAGTACTACGTACCATACAGAACTGCTATTTTTGATCTCAATTACAAGCTTACGTTGGTGATTCTAGAAAGTACTTATGGTTACTATCTTTTAACTTGATTTGTGGTGATTGCTCAGTTGACTGTATCAAGTGGGTTCGTGCTGATTGTATCATCATAGGATGCTTTCAAGTGACTGCAACAGGTGATGAAGAAGATTACTTTGTCCTAGTTATCAGAAGTAAAGATGGAAAAATCACTGACGTGAGTAGTTGTGAATTTCTTCCTTTTCCCCCTCGCATTCCATTTCTATTTTCACGATGATGGTTGTGTTCTTTATCATGCTTGTACTTATAGATGAGGGTGATGATAATGTTGATGACGAGGGTGATGATAAGGGTTGCTTTTGACTTGAGAGGAATGAGAGAAACTTTAGAGGCTTTAAGAGGTATTGGAAGGAGGTGTAGACTCTTCCAAAATTATTGCCTCTCTGCAGATATTTGTAACTTAAAGGACCTTTGTAATTATCAATTATCATCCTCTAGGCCTTGTTCTTTTGGACAGTTCTTTTTCTTGTTAGGTCCATTCTTGTTTGACACAGTTTTTTGGTTGTTTTTTACAAACCCTTCTGTGTTCTTTCATCTATCTCTTATTGAAAGCTCGGTTTCTTGATGAAAAGACAACTTTGTACATGGGCTTTATTGATGTTTTGAATTCTATTGTGCAAATTATTTATGAGGCATTAGTTTGTTTATCAATTGCCTTGAGTATATCTCTTTGGCAAATTATTGTGGGATATTTGGGGGGAGAGAAACAACAAAGTGTGTAGAGGTATGGAGATGTTTGGTCCTTGACGAGGTTCCGTGTTTCTCTTTGGGATTTGGTTTCGAAGATGTTTTGTAATTATTCTCATCTTGCTTAGTTGGAAACCCTTTCTTTAGAAGGGTTTTTGGGGATTGGTTTTTTGTATGCTCTTGTATTCTTTTCATTTTTTCTCAATGAAATCAATTATTTCTATTAAAAAAATCTCTTTGAGAAATACAAACATTTAGAGGGGAAATAAATCCTTGGACGTGCAATGCAAATATTGAGCCTTAGAAGATTTGTCTTTAAAAATAAACCATTGTTATTGATGTGGTAAAAAAAAAATGGTGCACTCTCTCTCCCTTACCTTTTGCTCAACATAGTGCTCGTCAGAACCCCCCAATTCTCTGTTAGTCACAATTTAGACTCCTTTGCTTCAACTCATTATTTGTCATAGGTATTTTTTTCTGAAAAAAAAAAACAAGATATCTTAGATTTGATGAAAAGAGACTAATAATGTAAGGTCAGGTGGGTTGTCCCATGAGATTGGTCAAGGTGCGCGTAAGCTGGTCCAAACACTCACAGATATTTAAAAAAAAAAAGAAAAGAGAATAATGCTCAAAATACAATGACAAAACAAAAAGACGAAGAGACAATTCCATCTACCATAATACATGAGAACTATGCAGAAAAAAATTTAAACATAAATCTTGACTAGAGTATCGAGCAAAAGACTTGGAGAGAGAACTAGGAGGCGAGGGAGTTATCTGAGCAAACTCAAACCTATCAAACCAAGGAGAGGACTTGTCTTGAAAATACGCTGATTTCTTTCAAACCAAATCTTCATGAGAATAGCTTTAACCACATTTGACCATAACAAGACATGGGTATCACAACAACCTTTAGAAAGATTGATCACATAATATTTACTTGGGGTTTTTGACCAAAGAAAGAAAGGAAGATGTTGCAAGACTATGAAAGACATTTGGAAAGGCTCTTTGACATCTTCGTTGGAGAAGCAATTGTTGTGCAGTTATTTGACGCATTTGAGGATCTCCTGTTTGCTGCCTACTCCTATAAGTTATCTTGAAGAACAAACGACAAAGATTGCTCAATATGGATTCAAAAAATCACTAACAAGAGAGGTACTTTCATCGAAATCACTAAGGGGTGTTTGGGCCAAGGAGTTGGAAAGTGTGGAGTTGTGAACTCCACTACTTGTTCTGCTCAGAGTTTGTAGGTCCCACTACTAGAACTCAAGGACACTGTTGTACAACCTATAGAACAGTTCCTGTTCAAGTAAGGTCGTTTGTGGGTCTCACTACTAAAAAGCATCAATTTTATGTCTTATTAACTCCTTATATCGTGGGCCCTAGGAGTTCACAACTCCCTACACTTCATAACTCCTTGGACTTCACAAGTCCATTCCTTGCCCCAAACGCCCCCTAAAGTACAAAGCAGGGGAAGAAATCTAATTATCCCTTTTGCTAATAAAACAATGGTTGGAAAGTTTTCAGAGAGCTCTTATCTGATTTCTTAATTGAACCACAAAAGGGAGAGGAAGCTTTATCGAAGAAAACTAAAACATAAAGGAAGTCCGTCCTTTGTAGAGGTAGTGAAGAAAAGTAATCAACCTCAAGATAGAAGCGGGGACCTTTTGAGGTGTGATGACTTGTTTGTCCCAATGATGTAGAGAGAAAGGAGTTAAAGGGCATTTCAATGGAACAAGGTACTGGTTATTACAAGAAGAGTCTTCCATGATGATTGGGTAAAAATTATCTTCACATTGTTTGTTTTTGAAAGGGGCTGTTTGGATCTGTTTTATTGACTATCTTTTGCGTTTTCTTGCTGCTTTTGTTTGGAGGCCCTCCACTTTTTTGCTTTTATCTTTCTTTTTTGGCTGTACATCTTTTATTTTGGTGCTATCTTCCCTCACGTTTCGGCTTCATCTGTATTTCTGATGTGTCTCCTTATTGTACTCATTGAATTATTTCATTCATCAATGAAATGTTTCTTATAAATTTTTTTTTTTTTTGGAGTCCCTCGAAAGATAACTAGAAGAAGTCTTTGCCAATAGACCTTTTCCACCTAGATAAATGCTTCTTTTGTTTCCATTCATGGATTGTGCAAAGTTGTGACCAATGAATGTGGGGTGGGTCACTTTTGGTGCTTTCACCATAAATAGGAGAGATAGGACGCTTAGGAACACTGAAAAACTTATGCTATCCGTTCATGCTTAGAAGTGGCGATCAAAGTACAAGGAAACTATTGTAGATGTATTCCATTGGACACGAGAATATTGGATGGAGATCAAGCCATCAATTCTTATGTTTGTACTTTTCAAGACAGTGCTTTACTTGCTGGCAGAAAAGTGGAGTTTTACCATGGTTTCTCATCGACTCCAACTGAAATTTTTTTATGGTTCTGGTGGTAGGTTGAGTCTCAACCTGATAGACTTGTCTCAGTTGCACGATGGAATTAGTCTCCCCACGATAAATCTTTTTACCATGAACTTCCATTTCATAGGAAAAGGATACGAGCTTTCATGTTCCAGAAGGGCATCTTGTCAAAGAGTTAGATTTAAGGAGCCATCAATTAATGACTCATATGTATTCCAATTTTTGTCTATGACCATAGAGCCCTTAACTCTTTGCCCCCTTCTTTGAATCAGATTTTATGCCATGCAACCATTTCCTTTTTTAAATGTCAAACAAGACGAAAAGGTTGTTTAAACATCTCTTGAAGCTGCTTTCCCAAATATTTTCTCTATCATTTAAATAGGAAGCTTTGGTGGCAGAATGTTGGGATGCAACTTAGGGAACTTGGGCTTTGGGAATTAGACGAAGGTTGTTTGATTGCGAGATGTAGGCCTAGGCTGAGTAGTTGGAGGGCTTTCGGTTGGGTCAAAGAAAGGAAGGAATTAGATGGTCCTTTTATGCCTCAAGCTCTTCCTCCACCCTAAGAAGTTGGAGGGCTTTCGATTGTGTCAAGGAGAGGATGGAATTTAGGTGGTCCTTGGATGCCTCAAGCTTTTTCTCCACCAAATCCTTGTATTTTGAATTTGATTGGCTCGCCATATTTTAACATGCACTATGATAAAGGTGCTTTGGGATTTCAAAATTTCAAAAGGTGAAGGTTTTCCTTTGGTCCCTAGCACATAGGAGCCTAAATACTCAGGAGAGAATGCAACCACAAAGTGTCCTCGTACCACCTTCACACATCTATTTGCCATTTTTGTTTGGCCTGTGGTGGAGTTTGGAGTCCCTAGACCACACCTTCTTAGACGGCCTTTTGCTAGACAAAATTGGGATGTCTTTTTGGCCTTTTTGACTTGCATGCTTGTCTTCCCAAGTGGGTGGATGGGTGGTTACCTGAATCCCTCAACAGTTGGAGCTTGAAAGGAAAGTGTTAAATCAAAAGCTAAAGTTGATGGGTGTAGGTAAATCTAATATTATATCATTTAACGCTCCCCTCACTTGTCGGCGTGGAATATGTAGAAGACTCAATAAATGGAAATCAATGTTATGGGGAGAAAATAACATTGTAGGGGTTTGAACATATGATCTCCTGACCACCTTCTTCGATACCATGTCTAATCACCAATTGACCCAAAAGCTTAAGCCAATGAGCGAAGGTAAATTTAATATTATATCATCTAATAGAAAGGCTTTATATGAAGATTTGCTTTTTGATCCCTCTTATGGGGTTTATGGCTTGAATGTAACAAGTAAATTTTGAAGATAAGCCCACTTTTGCCTCTTCTTTTTCCCCTCTTTGGGGTTACTATATTTTGAGCATTAGTCTTTTTTCATTATTTCGATGAACAGTCTCATTTCTTTAAAAGAAAGATAAGTCCACTTTTTCTACTTTTTGCGAATTTGTACAACTAGATCCTTCATGATTGGAAGGCTTTTTCTCCGTAGCTTCTTGAGTGGGGAGCCCTCTTTCCCCAGCCTTTACGTCGTTTTGTTCCTTTCTTTCTTTATTTCTTTTGTTACAATCAAACCAAGGAAGGGATTTATCATGAAATATCCTTTGATTCCTGTCAAACCAAATTTTGGAAAGGAGAGCTTTGATTGCATTGATCCTAGTTGAGCTTTATTAGATAGACCATGTGCAGTCTTATTGGTAGAAAAGGAGCAGTTTAGACTGAAACTAGAGTGGATTGGCAGATGGTTGTATGAGGGGAACTTTAGTAGTCTTTTATTCAAGGGAATGATTATTAAGGTTGCGACTAAACTCCTTCATGGATTAAGAAATTGGAAAGATCAGGCGTGGATTTTATTTATTTAATTTTTTTGTCTTGACAAAATCTTATATTTATCTGTGAATCATGTATTATGTTATTATTATTAGTGCCCTAGAATCAAGGGCAATCATATTGGATTAATTAAGCCCTCACCTTCCTTTTCTTGGTTGTAAATTTGTTTTTATTATTCTTGATCTTCTGAATGAGAATGATGTAACTTTATCTGAAATCGACAATGAAAGTTGAGAATTATCACAGTATCATAAACTGATCAACATGATGTAGTTATTATGTTATTTATTTATTTTTTATGTTTAGTAGATTAATTTGTGTAACCCTCTTTTGTAGGTTTCTTCAAACAAAGTTTTGTTATCATTCCGTGATATACATTCAGGTTTCACTCGTGACATTTTGCCTGGTGATATTGGGCCTTGTTTACTGTTGAGTTATTTGGATAAATGGTATGCAGAAATGACTTGATCTGATTTAAGCTCTGAATGTTTGAAATTGATTTTTTCTTGAAATTCCTTATTTTTTTACTACTTTTTTTTCATTTAATAAACTTTTGGTTATCACTTCTTTGAGTTTTTTTTCTTCTTTCTGTTACCGTGGGATTATTTTCTAATTATGAAACTACTTGTCAAGTTTATTTATTTTCTCTTTCTTGATTTGGATGCAGCAAGCTCGCAATTGTTGCAAATAGGCTCTATATGGAAGAGCATATTGTGTTGCTTGGTTTGTTGCAAGAGGTTGAGAACGAAGTTGCAGTTATTAATATTGATAGAAATACCTCTCTCCCGAAGATTGAGCTTCAAGGTTAGTGATTTTTTGATTATTGTTATGAAAGAATTCTGAAACTTTATTAAAAAATAATGCTGATTTCTGAATAGTGAATATAACCTAAAGTTAGCTATATAAAGCATACAAGTCAGCCTAACAGCTTGTATTTAGCTCAAAAAATAACTAATCTAGCTGACCTATTCGTTTAGAATGAAATATAACTGGCTCATCAGGAGAAGAGTATCTTTTTATAGCTTGCTGAGTGTGTGCTACCTTATGGGTTTTTGAGGGGAGTAGAATAGTAGGGGTGTTTAGAAGGGTGGAGAGGAACCCTAGTGAGGTTTGATCTCTTGTTCACTTTCATGTTTCTTCGTGGGATTCGATTTTGAAGACCTTTTGTAATTATTCTATAGGCGTTGTTTTCCATGATCTTCCTCTCCTTAATGGTGAATACATGTGGTCCTGGGATAGACATCCCCATTCACTCATTCATTGACAGATTCCTCATCACTAAAGGCTGCGTAGATAAATATGGGGAAGGCTACAGAGAGTTACATCTGACCACTTCCCTATCCAACTAACTCTTGATAAGCAGAAATGTGGTCCTACTTATTTCAAATTCCACAATGATTGGATGGAACACTAGGGTGCCTCAGACACGATTTTATTTAGAAACCACCAGGAATTAAAAAAGTGGTTAAAGAATGTGGTATAGAACCCAATTCTTGGAGCAATCTTGAAGAGAGAATTTTATTGAAGTGAAATAATGAAGACAAGCTCCCCTTAAATAGACCCAAGGGGAAAGGGTTGGTTCACCCCTAAACTAACTAGTTAACTAATCTAATCATAATCTTCCCTTAAATAGACTCAAGGGGAAAAGGTTGGTTCACACCTATTCTAAAATTATTAAAATAAAAATACAATTGATTAAATCAAAGGGAATTTCCCAAAATACCCTACATCAAAATTTTCAGGTGTAAGGGAGAACTACATTTGTTTGATCCAAAAAAAGGAAGACGCCACCCTAGTGAAGGATTTCAGACCAATCAACCTTACCACTTCAGTTTATAAGATTGTAGCTAAGGTTTAGCGGAAAGAATGAAGAAAGTTATGCCAAGAATAATTGCTCCTACTCAGAGTGCTTTTATTGGGGGAAGACAAATTCTTGATCAGGTCCTCATAGCTAATGAAGTGGCCGAGGAATATAGAATAAAAAAGAAGAAAGGTTGGTTGTTAAAGCTTGACCTTGAAAAAGCTTTTGGCCATGTAAACTTTCTTGAAAAAGTCCTAGTTGGAAAGAATTTCGACCCTAGATGGATCTATTGGATTATGGGATGTGTGTCGAACCCAAAGTTCTCAATATTTATCAATAGAAGAGCAAGGGGAAGAATACAAGCCTCTAGAGGCATTAGGCAAGGGGATCCTCTCTCACCCTTCCACTTTCTACTTGTTAGTGAGGTACTTAGTGGTTCATTATCAAGGCTACATGATAAGGGCAAATATGAGGAATTTATTGTTGGAAAGGATGCTGTCCATGTTTCTTTGCTACAATTTGCAGATGATACTTTGTTATTTTGCAAATATGACGACGATATGATAGAAAATTTAAGAAAGACCATAGAACTTTTTTAGTGTTGTTCGGGGCAAAAAGTTTATTGGGAGAAATCAGCACTTTGTGGGATAAATATCGAAGATAGCAAGTTGATGTCAGTGGCAGCAAAACTCAACTGTAAAGTTGACTACCTCCCTATCATGTACCTCGGTTTACCTCTAGGAGGATACCCCAAAAAAGAAGCTTTCTGGCAGCCGATCATTGGGAAATTTCAAGATAAATTAGGTAAGTGGAAGAGATACAACTTGTCAAGGGGTGGCGTGTTACTCATTGCAAATCAGTCCTTTCACACCTCCCCACCTATTATATGTCCATCTTCTTAATGCCGGAGAAGTTGATCTCAACCATTGAACGCACAATGAGGAACTTCTTTTGGGAGGGACACAAAGGAGGTAAGTTGAATCACTTAGTGAAATGGGAAGTGACTACTAGAACCCAATCTGAGGGTGGCCTTGGAATCAGTGGCTTGAAATCGAAGAATATTGCTCTCTTGGCTAAATGGGGCTGGCGGTTTATGAAGGAAGAAGACTCCCTTTGGTGTCAAGTAGTGCGAAGCATTCATGGAAGAAGCCTGTTCGGTTGGCACACAAGTGAATAGGTCAAGAACAGTCTTCGTAGCCCATGGAATAGCATCTTAAGATCTTGGTTAAAAGTAGAAGCTTTGGCCATCTACATATCAAGTCTTCGTAGCCCATGGAATAACATCTCAAGGTCTTGGTTAAAAGTAGAAGCTTTGGCTGTCTACATATCAAGTCTCTTAGTGATCTTTGAGGGAGCATGGCATAAGGAAAAATGATAAGTAGGCATGTTGGAGAGAGTAGCTTGAATAAGCGTATGCCAAGTGTCTTGAGCATGTTTGTCCAGGGAGTGTTTGCCTATGGCCGCATGCCCAAACCACCCTCAACCACCTTGTATTTATGTTTTGCAAGTAGCTTAGGAAGTTGTTTTCATTTTCATGTTGTTTTCCTTTCGAGTGACGTTTCGCCAGTTTTGATATGCAAGGCTGGCAGACTAGTTTTTGCTTAGAATGTACTATAGGGGACCCCTACTAGTTGACTCCCCGACCAAGTATTGTTGTACTCTTTGCAAACAAGCTTTCTTGCAATTGTCAGTCCTCAATGCTTTATTCAATGATGTTTTCAAAGCTTTTCCTCTCTCGTTTTATAAAACCATTTTCTCTCAAAACGTGAGTAGAGGCTACATCATACCCCGAGTTTACCCGCGACCCACGGTTTTTGTACGGCTGACTTGCTCGCCGGGTTAGAGCAGTGCCGCACAATCGCTGTCTCGTATTCTAAAGCGAGCGATTGTGACAAGTGGTATCACAGCCTAGTTAGCTCCTTGGCTATCAAGTACCGCTATCATGTCGACTGTAAAGCAACTGAGCAAGTCGCAGTCTGAGCGACTTGCTGAGATAGAAGAGCAATGGCTATATCTAAGGGAAGTTCCCGACGATGTGAGATTCATCCAAGCCCGACTGGAAGGGTTAGATGAGAAGGTCAGAGAAATTAATGTGTTAAACGCTTGAGTAGACGCGCTACCGATCACTGAGTTGACGCTAAGGGTCGACTCTTTGGAACACAAAACTACACGTCCTGGTAGTTTCGAACGTGGAGATAGCTCCACAAGCTCTGTCGCACACATGGAAGAGCGTGTCGAAGAGCTCGATAGCTCCCACAAGGTCATGTTAAAGCTGTTTAATGACTTGACCGACGATTTCAGAGTAACAGTGGAAGCCATCAGGGCTGAGATGACCGAGATGAAGACTCAAGTCAACTTAACGATGCGAGCTGTGGGGAACCAAACCCCGAATCAAACGCTTGCTATGCCCAACAAATTCAAGATTCCGGAACCCAAGGCCTTTAGTGGGAACCGCGACACTAAAGAATTGGAGAACTTCATCTTTGACATGGATCAGTATTTTAAAAGCGAGTGGAACCGTGTCAGAGGAAGTCAAGGTCACGTTAGCCTCCATGCATATCTCCGACGATGCAAAATTGTGGTGGAGATCCAAAGTTAATGATGTCGAAGATGGTCGATGCACCATCGATACGTGGGGAGATTTTAAGAAAGAGTTGAGGGCTCAGTTCTTCCCCGAAAACGTGGAGTTCATAGCGAGGAGGAAGCTAAGGGAACTCAAACACACGAGAACCATCCGTGAGTACGTGAAACAGTTCTCCGCTGTTATGTTAGACATCAGAGATATGTCCGAGAAGGATAAGGTGTTCTGTTTTGTCGAGGGCTTAAAGCCATGGGCCAGGACTAAACTCTACGAGCAGAAAGTACAAGACCTGGCCTCCGCCCTGGCTGCTTCCGAGAGACTGTTAGATTACAGTGGTGATCAGACGCCACAAAAGAAGAACACGGCGCCCCCGAATACTGGGTACAAGGCCACAAAATCCAACCCTCCGAAGAGCTTCAATTCGGAAAGGAAGTCGCAAGCCCCAGGAACAGGCCCCTCCCGGAGGCCCTATCCAGCCGGCCAGACCAACCCAGACCCATCTCTTGCTTCTTATGCAAAGGCCCCCACCGAGTAGTCGAGTGTCCGCATCGGGGCGCCTTGACGGCCTTACAAGCCTCAGTTCAAAGTTGTAACGAACCCGAGACGGAAGCGGAGGCAGAAAGAGAAGAAGATGAGGAAACCCCTAGAATGGGAGCATTGAAGTTCTTGTTTGCAATTCAGAAGAAAACTGCCCGCCAGAAAGATACAGTAGAAAAGGGGCTTATGTTTGTTAACGCGAGCATCAACTCCAAACTTGCTAGAGGCATCCTGGTAGACTCTGGTGCAACACACAACTTCATTTCTGAATAGGAAGCCCGTCGCTTAGAACTCAAAATTGAGAAAGACACGGGTAAGATGAAAGCCGTTAATTCGGAAGCCCTGCCAATCGTCGGGGTGTCCAAGAGGGTACCTTTACAGCTAGCTGGGTGGATTGGGAATGTCGATCTGGTCTTGGTACGCATGGACGACTTTGACGTCGTGCTCGGAATGGAGTTCCTCTTGGAACACAAAGTCATCCCTATGCCCTTGGCCAAGTGCCTAATTGTCACCAGCAGTAACCCCACAGTGGTCTTGGCAGATGTAAAGCAGCCTAGTGGAGTAAGAATGATTTCTACCCTGCAATTGAAGACAGGTCTCGGTCGAGAGGAACCCACCTTTATGGCTATCCCGGTGGTCGACGAGATCACCGGGATCGAGTTCGTCCCTCCAGAAATCCAAGAGATCCTGAACGAGTATGTTGATGTTATGCCACAAAGTTTACCAAAGTCTTTACCTCCGCGGCGTGGGATTGATCACGAGATTGAACTTGTCCCTGGAGCCAAACATCCAGCGAAGAACGCATACAGGATGGCCCCCCCGAACTAGCTGAGCTCAGGAAGCAGTTGGATGAGTTGCTGAATGCGGGATTCATTCGCCCTGCAAAAGCACCATATGGGGCCCCAGTGCTGTTCCAGAAGAAGAAGGATGGAACACTACGACTGTGTATCGACTATAGGGCTCTAAACAAAGTAACAGTTCGAAACAAGTACCCCTTACCGATCATCACTGATCTGTTTGACCAGTTGAACGGAGCCCGATACTTTATGAAACTTGACCTCAGGTCAGGGTACTACCAAGTTCGCATCGCCCAGGGGGATGAACCGAAGACCACCTGTGTGACTAGATATGGGGCCTTCGAATTCCTCGTGATGCCCTTTGGGCTCACCAATGCGCCAGCCACGTTTTGTACACTGATGAACCAAGTGTTCCACGAATATCTCGATCAGTTTGTAGTGGTCTACCTCGACGACATTGTTGTGTATAGCCCCGCTTTGGAAGAGCACAAGGTTCACCTCCGATTAGTCTTTGATAAACTACGGCAGAATCAACTCTATGTGAAGAAAGAGAAGTGTGCTTTCGCCCAAAAGCGTATTAACTTCTTGGGCCAAGTGATCGAACATGGGAAGATCAGGATGGACAGTGACAAGTTGAAAGCCATCCAGGAGTGGAGAGTTCCTGTCTCTGTGCCCGATTTGCGCTCTTTCTTGGGTCTAGCAAACTACTACCGTCGTTTCGTCGAAGGATTTTCAAGAAGAGCTGCCCCCTTGACTGAGTTGTTGAAGAAAGACACCACCTGGCAATGGTCGGCCGAATGTCAGTCGGCCTTCGAAGAGCTTAAAGCGACTATGACGAGGGGCCCTGTCCTCGGTCTAGTTGACGTCACTAAACCGTTTGAAGTAGAGACAGACGCTTTAGATTATGCCCTCGGAGGTGGCCTTCTTGGCGTCCTTCTCCAAGAAGGCCACCCCATAGCTTACGAAAGTCGGAAACTCAATAGTGCAGAAAGAAGATACACGGTCTCTGAGGAAGAAATGCTGGTCGTGGTCCACTGCCTTAGAGTCTAGAGGCAATACTTGCTGGGATCATGTTTCGTAGTTAAGACAGACGATACTGCGATCTGCCATTTTTTTAGCCAGCCAAAATTAACATCAAAGCAAGCGCGGTGGCAAGAGTTTCTAGCCTTTCGAACACAAGACGGGGAGAAGCAATCAAGCTGTCGATGCCCTTAGCCGAAAAGGCGAGCATGCAGCCATGTGCGTGTTGGGCCATATTCAATCAAGCAAGGTCAATGGATCGATGCGGGAGATCATCAAAGAATTTTTGCAGAAAGACCCTTCTGCTCAGGCCGTAGTATCCTTAGCCAAAGCTGGCAAGACCAGACAATTCTGGGTCGAGGGAGACCTGCTGTTGACAAAAGGGAACCGGTTGTATGTTCCTAGAACAGGAGACCTGAGGAAGAAGTTGTTACACGAGTGTCATGACACCTTGTGGGCAGGCCATCCAGGATGGCAAAGAACCTATGCACTACTGAAGAAGGGTTACTTCTGGCCCAGTATGCGAGACGATGTCATACAGTACACCAAGACCTGCCTCATTTGCCAACAAGACAAGGTTGAGAAAGCGAAAATTGCCGGGCTCCTTGAACCCCTACCAGTGCCATCTAGACCATGGGAGATTGTGTCCATGGATTTCATCACCCATCTGCCTAAGGTGGGCGAGCACGAAGCCATCTTAGTTATCGTTGATCGGTTTTCGAAGTATGCCACCTTCATAGCTACCCCAAAACTATGCTCTGCTGAAATGACAGCCCAATTGTTCTTCAAACACGTGGTGAAGCTATGGGGAATTCCAGCCAGTATTGTCAGCGACAGAGATGGCAGGTTCATAGGCACCTTCTGGACCGAACTATTTTCATTCTTAGGGACAAGCCTGAACATATCCTCGAGTTACCACCACCAAACAGACGGCCAGACAGAAAGGCTTAACTGCCTGCTAGAAGAATATCTGCGTCACTTTGTCGACGCCCGGCAAAAGAATTGGGTGCAGTTGTTAGATGTGGCCCAGTTCTGCTTCAATTGCCAAACAAGCTCATCAACAGGGAAGAGTCCCTTTGAAGTTGTAAGCGGAAGACAACCCTTGTTGCCCCACATTATTGATCATCCGTACGCAGGAAAGAACCCCCAAGCCCGCAGCTTTACAAAGGAGTGGAAACAGACCATTGAGATTGCACGAGCCTACTTGGAAAAAGCCTCCAAACACATGAAGAAGTGGGCGGACAAGAAGCGACGCCCCCTTGAATTTCGAGTAGGGGATCAGGTTCTGATCAAGCTGAGACCCGAACAGATTCGATTCTGAAGTCGAAAGGACCAACGCCTCGTCAGGAGGTATGAAGGTCCAGTAGAAGTCCTGAAGAAGGTCGGGAACGCTTCATATAGGGTGGTGTTGCCCCCATGGATGAAAATTCACCCAGTAATTCATGTAAGCAACCTCAAATCCTATCACCCCAACCCCAATGATGGTAGCCGCAATGTCACTGTTTGGCCAGATATCGACCTCAAGCACACATACGAGAAGGAAGTTGAAGAGATCCTCGCAGACAGAGTCAGGAAAGTTGGAAGACCTGTTCGGAGAGTCCTCGAGTTCCTTGTCAAGTGGAAAAATCTCCCCGCAGAAGAAACGAGTTGGGAACGCCTTGAAGATTTGGAAGCGTGGAAGCCGAAGATCGAAGAGTTCAAGCTCCACCAGCCGCAGAGACGTCAACTGATTAAGTGGGGGAGAGTGTCTTGAGCATGCTTGTCCAGGGAGTGTTTGCCTATGGCCGCATGCCCAAACCACCCTCAACCACCTTGTATTTATGTTTTGCAAGTAGCTTAGGAAGTTGTTTTCATTTTCATGTTGTTTTCCTTTCGAGTGACGTTTCGCCAGTTTTGATATGTAAGGCTGGCAGACTAGTTTTTGCTTAGAATGTACTATAGGGGACCCCCACTAGTTGACTCCCCAACCAAGTATTGTTGTACTCTTTGCAAACAAGCTTTCTTGCAATTGTCAGTCCTCAATGCTTTATTCAATGATGTTTTCAAAGCTTTTCCTCTCTCGTTTTATAAAACCATTTTCTCTCAAAACGTGAGTAGAGGTTACATCATACCCCGAGTTTACCCGCGACCCACGGTTTTTGTACGGCTGACTTGCTCGCCGGGTTAGAGCAGTGCCGCACAATCGCCGTCTCGTATTCTAAAGCGAGCGATTGTGACACCAAGCACCCTTAGAGATGAAAGAATATTTCCAATTATGAAATTTTTGATGAATTCGTTCAAAAACCGGTAGCTAAAAGCCAATAGACTTGGAATTTCCACCCAACTGAAAACCAAGATATGTTGCAGGCTAATTAGCTCTTTTACAACCAAAAGTATTAAGGATCCGATCGAATCCGATTCATTGATATTGGTGTCCAACAACACTGTTTCTCATAAAAAAAAAGATATGAATGCTCAACAAGATTAATTTTCAAACTAGAAGCACACTAAAAAATATGAACTATCTCATAGATGACCAAACTTTATTCTCTTTTGTCATCATCTTTGAAGCTTTCCCCTAATGCAGCCAAGTTATAATTTCCTAATGCAGCCAAGTTATTTCATAAGTAATAGTAACTTTACTTCATTAATAAGATAGAACATGAGACCTCACAGATGAAAGATGATTGACCAATAGGATGAGAACCAACAAGAATCAATGAACTATGCTTCATTAGACGACTCATACGATCTGCCACTAGAATAAATAAAAATGGGGAAAGCAGGTCACCTTGTTTGATATCATGAGAAGGAGTAATATTTCCCCTCGGTTGACCATTGATAATGATAGAGAAGTTAACACTTAATTTGAGTCATATCTTTGGTAATTATAGTTTTCTTTGACCTTCCATCCAATTGAAATTCGTTATTCTAATCTAATGGGTATGATCTGATCATTTTATAATTTCATTTTATCAATGAAATGCTTCTGCTTCCTTTCAAAATAAATAAATAAATAAATAAAATAACATTAAAGATTAGGAAACTTTGGACTCTCTATATCACATAATATAGCTGTTTCTTGTTTGTTAGGCTGGTTCCTGTTCGACTCCTTTTATGCTATTGTTTCTGTAGGCCCACTTCTTGTTTTTTGGTCAATGTTGGCCCTCTTGTATTCTTCCAATTTTTTTTTTTCTTAATTAAAGTTGGGCTGTTTTAAAATATAGGAAAATGAACTAAAATATTTACAAATATAGCAAAAAAAAAAAAAAAAAAAATCAATCCTTAAACCTTTTCCCAAGCATTTTTCTAGAAAAAAGAAATCAATCCTTAATGGAATTCTCATTTTAAGCCACTGCCTCAATCGGAGATTCCGGCACATTTAAAGTCTATAATAGCAGAATGTGGACTGGTTCTGGGTTAATCCATCTATCCAAATCATTTTAGTTGTGACTCTTGGCTCTCCTTGGATCAAAGTTTAGCACAGTGGTTTGAATAGGTCTTGTTCTCTTTCATCAGCTTCAGATAATTGAAGACAGCCTCTTCTCCTTGGTTGACTTTTCTACCAACAGATATCATATTATATTTATCTTGATATGGCAGCTATGGAGTTGGTGATCCTAAGGACTGTTGTAAATTTCTTGGGCTGTTTTGTTTTGCATTTGTTCCATTTTGATGGGATTGGTACTTGTCTCTTTTGATACTGTACTTGCATTCTTTTGGTTGTGTTAAGGGTGATATTTTCTTACTTGTATTGTACTATGAGATATTATCTCATTTCATTATACTAATGAAAGAGACCGTTTCCTTTAAAAATAATAAAAAAAAATATCCTTGTATTCTTAAAAAAAATATAGCAAAATTTTATTGTTTATCTACAATAGACCATAACAGATCGCGATAGACTACTATTTCTATCTATATGTTTCATGATAGATATAGATAGTAATTTATCGCGGTCTATCTGCTGATATTTTGCTATTATTTGTAAATATTTAACATTTTTGCCATTTAAAATAATTTCCCATTAAAGTTTGATTTTTTGATTAAAAAAAGTTCCAATTTGGTAGTTTGAGTTATAATTTGAGACATGACATTATCTGATTATTTTTTATCAATTCTTCTAAGAGTTGGTTGCATTATGTTTTCTTTTAGTGTTTCTCTGTCTATTTGATTTTCACGGTTTACTCATGTGCACCCAAAGTATTTTTGTTAGAATGTTGGAGTTCGGTTAGGCAGTCAATTTTTATGTACACATTTATTCATGCTCATTGATTAGGTTTTTATTTTTGTATTCTGCAGCATCTCTTCTTCTGTGATATAATTCTATCTCATTCTTGTTTTGTTCCAGCGAATGGTGATGATAATTTGGTAATGGGGCTGTGCATTGATCGAGTTTCTCTTCCTGGGAAGGTGTTAGTTAGAGTTGGATTTGAAGATATGAGAGAAGTTTCGCCATATTGCATTCTCGTGTGTCTTACTTTAGAGGGAGAGCTCATTATGTTTCAATTTTCTAGGTACTGCCTCTTATGTTTTAAAACCTTTTTTTGTTATTGTACCCAAGGCCGAAGCCTATTTTTCCTTGAATAGATGTTTCTGTCTTTAGCTTTTAATAGCAGCCTGAATACTATGATTATTGCTCCTGATTTCTTTTTACTATGCGTTTTCAACTTTTCATGCAGTGTCAATGAAACTGAAGCTCCACATGAGACTGTTTCTGCTTGTGATGAGGAGGAAGATGATATAATAGTGCCTGCTGATGATCGGTCTCAACTCTTTTCTGGATCAAAGAAAGAGTTTAGAGAAGATGATCTTAAGATGCAGGTTACGGAAAAACTTGCAATCAGTAGTGAGATTCCTCAGGAAAAAATTAAAATCTCAAATGACATTAAGTCTTCTAATAATGATCAAAGTCCAGTATCTAAAATAGATGAGAGTGCAACTGTTGGTGCAGAGAGTAATACTAAAAGCCAGAAAGCCGATTCTTTCATTTATTCACAATCATTAGAGTCTTCTGTCCTGGAGAGACCCAACTATGAGATTGGGAACTTTGATAAGCCTGTTCAAAAATTTGGTCTCGGGTCTGTTTCTATTTCAGGTAAGTCTGCGGACGTGCATAGCCAGCCCTTTCCCAATGTAAAAGAATCAACAAAAAGAATGGTGTCAACTGGCTTGTTGGCTGCATCTGAGTTATCCAGTGATAAAGCAATGTTTTTAAATAAAATCGATCCTATATCTTCAGTCCTAACTCCGAATTCTTTTCAAAGCAGCAAGACTGAGAATTATGGGCCAAGTTTTGGTACAGCGAATGCTTTTGCAGGTTTTTCTGGAAAACCTTTTCAACCGAAGGATGTTCCAAGTACATTAACACAAAGTGGGAAACAAGTAACGGGAGGTGCTGGTAAAATTGAATCTTTACCAGTGTTACGTAGCTCACAAATATCATTGCAAGACAACTTGTCGGCGAAAATTTCTAATGAGAAACATGATGGTTCAGACCGAAATTACAGCAATGCCCCCCTGGCAAAACCAGTAAGTTCTGAACGAAATTTATTCAAACAATTTGCCAATGTGAACGTACTTGAGTCTAATATAGGAATGACTTGGATTAAAATTTGCCATTCTTTGGCTTGCTCATACACCTCCAATCCATCGCCTTTGGAAAAGAAATCTTTATAAAGAAAATACACATCCACTCCATTTTTACTAGTCTAAAGGGAAAAGTGGATAAAATACTTGCATGAGCTTTCCTGTACAGATATATTCCTCTTAGAAACATCTTGAATTAGAAAATGGGATATCCCCTTTGTATTTTTATGCTTGAATGATGGTCCCTGTATGTGACTAACAAAATAACGAGGCCCATGCGTTTTCATATATGACTTCATATTGTATATTTACCATATAATTAGTCAACTAACATCTCTCTGTTTCGTTTACTCCGGATAGTTATTGGTTCTGACAATTTTTGATAGCTGTGTTACCCTTTTGTAAATACATAGGCAACTAAAAAGCGCCCCACCCCCATGTGTTTCCCCTTGTAATCTCTTAAGATTTTTTCTAGTTTCTTCAATACTCCCTTTTCCCTACAGATGAAAGAAATGTGTGAAGGATTGGACATGCTTCTGGAGTCTATAGAAGAGCCGGGTGGGTTCTTGGATGCCTGCACTGCTTTCCAGAAAAGCTCTGTTGAAGCTTTGGAGCGTGGCTTAGCCAGTCTTTCAGACGAATGTCAAATATGGAAGGTAATTTGTCAATTGTTATTTTATATTTTCAGTTTGTTAAGTATTTTCTCCTTTATTTTGTTAGACTGAGAATATAGTTATTAATTAGTATTGCCTCATTATTTATTGCTAAAAGATTCAGAAAAAGGAGAAAATGTGAAAAAGAAATGTATGGAGTAGAAAGTGGCTGCAGGGCTTCTCATCCTAATAGCAATGGCGTGTGTAAGACATTAGAAGAGGAAAGAATAGGGAAATGACAATAGCTTGATGTAAGCATGACATAAAAGAAAATACTGGAAAAAGAAAATGTATGCTATGAGTGCCAACTGATAATCAACCAACGATGTTTGACTAGAAAATTTAAGAAAGAGCTGATAAAAGGTAATAGGTAATTGATTTACAAAGTACTTGATCTTAGAAAACATCAAAAGTATAATTCAGGTTTCAAGTCTTTTCTAGGAATGGAACTCATTGAAAATTTATTGCTATGATTAGTTGCTTGAATTTCTTTATTGATTATTCACTCAGTTTGCTGTTTTTGTGGATTTAATGGGGGTTCTGTAGATATCTAATGTTGCATTATTCTTTTGCTTATTATCCTTGGAATAATTTGTTGTTTTGGATTAATTCTATCAGTAGTCACTGTAGTGATTAATTCTTGTTTGAGGAGATTTGGTTCGCACTCAGGTATAGCAGATAAATATTGTGGACCCTGGCTAGATCTATCCTTTTTTGAGCGTTAGTGTCAAATTCTTTTTGAAATATCCTTTAGGTTTCATTTTGCTTGATTGGACTCCCTTTCTTTAGTGGCACTCTCCCTTTTTGTGGGCTTAGCATTTTTTGGATGCCTTTGTTTTTGTCTTGTTTTTTATTTTTATTTTTTGTTTTTGTTTTTTCATTTTTCTTCAATGGAAGTTTTGTTCTTTATAAAAAAAAATATGTGTTTGAATGCCCTCCCTTTTCTCCCTCAAAGTTTAGGTTATTTTTAATTCTTGCAATTTGAATTGTATGTTTTATTTTTGCGTTCTTTGTTTAGGGTCCTTTTTCTTTTCTTCTTTACTATATCCTGTTCCTCGCTATCAATACAGTTATCTAACATGCGAATTTCATTTTCCTTCTCGTTGTAGAGCACAATGAATGAGCGTTCACAGGAGGTACAAAATCTCTTTGACAAAATGGTACAAGGTATTGGAACCTCAATTTCTTGTTCGTTTATCCTACATTATTGATGGTGACCCTGTTGCATGAAACGCTCCTTGCCGATCTAGCTAATCTTGGAAAAGTGCCTTTGTTTTTTAATGCATTGAATATTTGATCGTCTCAAAGAAATAGAACTATAAAGTTCCACGTATATCCCTTCCAAACATTCAAGCTAAAATTTATCTCTAAAAATTCTTATGGGTTATCATGGGAATTTGATTGAGATTTGCAAATAATAAGGTGGGAGATCAGGAGTACTGGGATTTGGTTGAAATACGTAGCGATTATGTCAGTCCACTCATCAAGTTGACTTGGTTAAATATCAAATTGTAAATCATAATCAATTAATTTCTGGTCACAATTTCATATATATTTTACATTCTCAGTTTATTTGGTTTCAGTTTTGTCAAAGAAGACGTACATTGAAGGTATTGTGATGCAAGCTTCTGACAGCAAGTACTGGGAACAATGGGATCGTCAAAAGTTGAGTTCAGAATTAGAGTTAAAGCGACAACACATCTTAAAGATGAATCAGGTAGTCTATTCTTGTTCCTAAGACCAATGAATTTCCCCAATTTCAGATGTGATTTATCCATGCCTTTCTTCTCTTTTCCAGAATATGACTAACCAGTTAATTGAGTTAGAAAGACATTTTAATGGCCTTGAATTGAATAAGTTTGGTGGAAATGAGGAAAGTCAAGCGAGTGAAAGAGCTCTTCAAAGGAAATTTGGTTATTCGAGGTACTTCCAATCTTTCTTTTCTGGCTCTTAAATGTAGAGTCTTCTCAATTCTTGTTCTCTCTTTGTTTTTTGGTTGTATTGTTGTTGATTATTTGGGGATTATTTTCCTTATTGTTAAATATTTTATCTTTGTATTTATTGTAAAGATATTTACTATATTTATCTCCTTCCTTATTTGTAATAGATTTTCTTTTATATAAGAAAACCCTTGTTTAACAAAGTATGTAGAGAGATAAAATATTTTAGCAATTGTTTTGCTTCTATGCTTTTATTTACTCACTCATTTCCAGAGTTTGTATCTGTTGAGCAATAGTTTCTTTTCATTATATCAATGAAAAGTTTTATTCCTAATGTTGAAATCTCAACTGTCTTCATGATTATTGAAATGAATTAATTTAGGGCTGTTTTCAATTATAGGAAAATGAGCCAACTTATTTACAAATATAACAAAATGTCACTATCTATCAGTGACAAATGGTGATAGACACTGATAGATAGTGACAGTTTGCTATATTTGAAAATATTTCCAGCAATTTTGCCATTTAAACCAATTACCTATTAATTTATTGTCTCTGTTGTCTTTCACTTTTCAGCTGATATTTCCTTGAGTAGCCTGACACGGGTTTCTTCATCAAAATTTGTTGCAGGCATAGTCATTCATTACATAGTTTGAATAACATAATGGGATCTCAATTAGCAGCAGCTCAACTTCTTTCTGAAAGTCTATCAAAACAATTGGCTGCACTCAATATAGAATCACCCTCTTTAAAAAGGCAGAGAGTCACGAAGGAATTGTTTGAGACTATTGGACTTACTTATGATGCTTCTTTCGGTTCTCCAAATGTGAACAAAATTGCAGAAACTTCTAGCAAGAAGCTTTTACTTTCTGCTGATTCTTTTTCAAGTAAAGATACACTGAGAAGAAAACAGCGGAGTGGAAGGAAAAATTCTGAAGCAGAAACTGGGAGAAGGAGAAGAGATTCACTCGACAGGGTACTGTTTCGTAGAACTTCTATTTATCTATTTTTACTTGTGGTTCATATAGGATTTTTTTTGTTAATTGAAAATGAGATTCATGTTACATGAATAATATAAATTTGGAAAGTGCCATACACGGAAGATGGTCCAGAACTTGGTAAGCTCTCTGATCACTTTTCTTCATATGTCTGCCCATGGGCTTCTTTCATAGAATGTAGTTAGAACAGTGGTTGGAGGATGCGTATGGTCCACATTTTCCTAGATGGTAACTAATGAGTTCTTGTATGAAAAGTATCAGAGTGGGGTCACTCTTGCTGACAAGAAACCCTTTACAAGATTGCTTTTAGTTATATTTAGGGCTTCCCCTTCTAACATAGTCACATAAGATTTAAGAGAACAATTCCCAAAAGCCCTGAGCTACTTGCAAGAGTTACTATTGTACTATTATGTTTTTGAGTTTTGCATTCGTCATGTTGTCATGTTTATTAGTCGTTGGGTCTGAAAATACAAGTATACTACTAACCATCTACTAGTCAAATTTAAAATATTTGCATCTAAGAATGGCCATCTGTAATAGCAGCATCTACATGAAGAATGTTTAAGAACCCCCAATCCTTAAGATGTATCAAAGAAGGGGTAAAAAGGGAAGTAAGGGAGAGAATTGCATTAAGGAGAAAGAGAATTGTATTAAGGAGGAAGTGGGGGAAGTTAGTTAGAGGAGTGACCGCTCATTGTTGGGTGGGCTATGGCCCATGGGAATGGAGAGGTTTTGCGTGACAGGGGAGGGGAGACTGATTTTGGTGGTAAATTGCAAGCCTGACTTGTAGGAGAGGATTTCTAGCCCTCCATAAAGTGTTGGAGTGATATTTCTCTTTTTCATCTTCCTTCTTGTTTTCATGAGAACTTCTGTTGTGAATTTCCCTTGAATTATACATAAATAAGATCAGTTGAGCTTTGCTCTGTTGCTTGGGAATTCTTGTTAGGAAAGTCTATTGCAGGTGCGGTTTCCTAACAAATTGGTATTAGAGCCATTGTTTTTTCTTTTTCATCCTGGGAAGAACGCATAGTGAGACGAGGCGAGAAATGGAGGAGAAGATTGATAGTCATTCCAAGAGTCTCGTGGAGTTGAAGGAATGGATGATTGAAATGGCAAAGATAGTGGAATGTGTGGATAAGATGGTCCAGATGAATCGAAGATCGGATCTGTGAAGATGATGAAGGAGAAGGAAGGAGAGGAAGGCGAGAGTTTGCAAGCTAAAGCGAATGACGGAGGTGTTGATCGCAACAAGTATAAACGGTTGGAGATGCTGATTTTTTCTTGAGAACACTTAGATTCGAGGGTTTATAGAGTTGAGCACTACTTCGAGATTCATGAATTATCTGGCACTGAGAAGATCAAAGTAGTGGTAATAGCCTTTGTCAAACGTGGTTGATTGGTTTTAATGGGCTCATCAGCGAAAATCGATCAGATCCTGCGAAGCTCTAGTGCATAGGATGTTTGAAAGGTTTCGATCGTCTCAAGAAGGGTTCCCTGCTGTCTCGATTGATGTGAATTAAATAGGAAGGGACGTTTGAAGAGTATTGGAAGAAATTTGAGTCATATGCGGCCCCAATTCTGGAGACGGTAGAAAATGTGTTACAGGAGGCATTCATGAATGGGTTGTCGCTAGAAATAAAGGCAGAGGTAACGAGTAGACATCCTGTGGGCCTTGATGAGTGCATGGTGGAGGCCCAAGCTGTGAGCGATCGTAATCTGGAAATGAAGTTGGCTGAAGAAGAATTGGGCCTAAGTAAGCTTGTGGCCCAACAACAAACAGGTAAACAAGTCGAGGTAGGGGGAGCGAAGACATAAGGTAAAACCCAACCCGGGGGATGACGAAAAATGTCCTACCTGAAAAGGGGGAATCTAGAAAAAAGAACAACCCTACTGAAAACTATTGGATTTTGAGATTTGAGAGAGAAGGGGCTATGCTTTTGGTGCGATGAAAGATACTTTCACGACCACAAGTGTAAGACGAAAGAGAAGCGAGAGCTAAATTTATTGATCGTGCACAACGAAGACGAAATCGACAAATTAGAAGTGAAGGAGACCGAGGAGGAAGAACCAGAGGTGAAAGTCATGGAAGTACCGAACAACATTGAGATCGTGTTACGCTCGATTCTGGGATTTTCTACTAAGGGAATAATGAAATTAAAAGGTTTGATAGTCGGTAGAGAAGTGATAGTGATGATCGACTGTGGTGCCACACACAATTTCATACATCAAAAATTGGTGGATGAACCAATTTTGCCTCATGCTAACAACAAAGTATGGGGTGGTAGTAGGTAATGGAAAGGCAAATCGGGGAAAAGGCATTTGCCAGGCGGTTGTTGTGGTATTACTCGAATTAATGGTGACAGAGGATTTATTGCCATTTGATTTAGGAAGGGTGGATATAATTTTAGGAATCATTTGGTTGTGCAATATGGGATACATGGAAGTCCATTGGCCTAGTTTGACTAGGACGTTCACGATTGGAGATAGGAAGATAACTTTGAAGGGGGATGCTTCATTGACAGCAACAGAGGTCACTCTTAAAACGTTGACTCATAGATGAGAGGAAGAAGATATGGGGTCCCTTGTAGAATTCCAACACATGGAACCAGAGATTGAAGAAGAAAAGAGACAAGTTCCAAACGGTAGTGAATAACAACCACCGCCAATTAATTCAATGCTTATTGGACGTGTATGAGGATGTCTTTGAGTTTCCTATTGCTCTACCACCCAAAAGGGTGGTGGATTATCGAATTAAGTTAGAAGAAGATGTGAAACCGGTCAACGTGCGACCTTATAGATACAGACATACACGGAAGGATGAGATCAAAAAACTGGTAAATGAGATGTTAGTAGCAAAGATTATACGTCCAAGCCATCGTCCTTACTCGAGCCCTATTTTATTGGTCAAGAAGAAAGACGAGGGATGGCGTTTTTGTTACTGGAAGTTAAATCAATCAACCATAACCGACAAGTTCCCAATTCCTGTAATAAAGGAATTGATTGATGAATTGCACAGGTCGGTAATCTTTTTGAAGTTGGATTTGAAGATCGGTTATCATCAAATACGAATGCATGAGCCATGTATAGAGAAGACAACCTTCCGTACGCATGAGGGGCATTATGAATTCTCAGTAATGCCGTTTGGATTAACCAATGCTCCGGTGACCTTCTAATCAATAAGGAATCAAGTATTCCGCCCTTTTTTGAGGAGACATGTTTTGGTTTTCTTTTGAGGATGTTTTAGTGTATAGTCTCGATGAAGATACCCATGTCAAGCATCTGGGAATGGTGTTAAATGTACTGCGTGACAATAAACTCTACGCAAGTAAAAAGAAATGTGTGTTTGGGCAAGAACGGATTCATTACTTGGGGCACCGGGTATCAACTCATGTAGTGGAAGCAGATAGAGATAAGATTCAAGAGCGACGATACGATGGCCAATACCGAAAACGGTATCTGAATTGAGGGGATTCTTAGGACTCACTGGATATTTTTGAAGATTTGTAAAGGATTACGGATTGATTGTGGCTCCTCTGACGAAATTGTAACATAAAGATGCCTTTAAGTGGGATGATCAAGACATTGAGGCTTTCGATTCTCTATAAAACGCCATGGTAACTCTTCCTGTACTTGCTCTACCAAATTTCGATCTTCCTTTCGTGATAGAAACAAATGCATCAGGCTTTGGTTTGAGGGCTATGTTGATGCAAAGGGAGAGACCGATAGCTTACTTTAGTCAGACATTGTCTATGCGCGCCCAAGGAAAGTCTATTTATGAACGTGAGCTGATGGTAGTGGTGTTGTCCCCACAAAAATGGAGGCTTTACCTTTTGGGAAGGAAATTCACAATGATATCGGATCAGAAGGCGTTGAAATTTTTATTAGAACAGCGTGAAGTGCAACTGCAGTTCCAGAAATGGTCGACTAAACTCCTCGGATATAATTTTGACATTGAATGTAACTAAGTTCTTAGTGGAGCAAGGGGTGTTATACTACAAAGGAAGGTTAGTGCTATCTAAATCTTCTTCCCGCATTCCAACCTTGTTGCGGACTTTTCATGACTCGGTACTAGGAGGCCATTTGGGGTATTTACAAACATATAAAAGAATGTTGGGAGAGCTTTATTGGAAGGAGATGAAAGAGGATGTAAAGAAATATGTGGCTGAATGTGTAATTTGCCAGAGGAACAAGAGTGAGTCAGTGTTGCCAGCAAGTCTTTTGCAGCTTTTACCAATACCGGATAGAGTTTGGGAAGACATTTCAATGGATTTCATGGAAGGACTTCCAAGATCAAAAGGGTATAATGCCCTAATGGTGGTGGTTGACAGATTAAGCAAATATGGGCACTTTATTCCGTTGAAACATCCCTTCACAACCAAAACAGTTGTCGAAGAGTTCATTCGTGAAGTGGTGAGCATCATGGATTCCCAAAACAGATGTCAAAGGGTTCGGTCGCGATAAAATTTTTGTAAGCAATTTTTGGGTTGAACTATATGCTGTTCATGGGACGATGCTCAAATGGAATACAACATTCCATCCTCAGACGGATGACCAGACTAAGAGAGTCAATTGGTGTATAGAGACATACCTTCCGTGTTTTTGCAACGAGCAACCCACCAGTTGGTTTCAAATGGATTCCATGGGCTGAGTATTGGTATAATACAACCTTTCAAAGTTCAATACACATGAGTCCCTATCAGGTGCTAAATGGACGGCCTATTCCTGCACTAGTGTCATATGGTGATAGGAGGATTACTAATGATACCCTGGAACAGAAGCTAGTGGATAGAGATCGAGCACTGATAGCTTTAAAAGAGCATCTGGTACTGGCCTAAGAAAGGATGAGGAAATACGCCGATTAGAAGAGGCAGGATGTTCAGCTTGAAATGGATGACATGGTTTTCTTGAAATTCCGACCTTACAGACAGTAGACACAGGCCTGAAGACGATGTGAAAAATTGGCTCCTCGATTGTATGGACCATTCAAGGTAATTGAGAAGGTGGGGGAGGTTGTGTATAAATTGAAGCTTTCGGAAGATGCAAAAAGACATAATGTCTTTCATGTTTTCGCAACTCAAAAAGTATGTGGGGTCAACTACCCGAGTACAAGCTACTCCTCCAGACTTTATCAAATGATTTCGAATTACAAATGGTTCCCGAGAAGAATTTGGGTGTTCGTTGGAATAATGACATGGTGAAAGAAGAGTGGCTTATTAAATGGCAGAAATATCCAAAGAGTGAAGCAACGTGGGAGATTGCGGGTTGGCTGAAACAGTAGTTTCCAACTTTTCACCTTGAGGACAAGTGTGAATGACAACCCGGGAGGTATTATAAGACCTCCAATCCTTCAGACGTATAAAAGAAGGGGTAAAAAGGGAAGTAAGGGAGAGAGTTGTATTAAGGAGAAAGAGTGGGAAGTTAGTTAGAGGAGGGACCACTCATTGTTGGGTGGGATATGGCCCATGAGAGTGAAGAGGTTTAAGAGAGGTGTCGCGTGGTAGGGGAGGCTGATTTTGGTGGTGAATTGCAAGCTTGGCTTGTAGGAGAGGATTTCCAGCCCTCTGTAAAGTGCTGGAGTGATATTTCTCTTTTTCGTCTTCCTTCTTGTTTTCATGAGAACTTCTGTTGTGAATTTCCCTTGAATTATACATAAATAAGATCAGTTGAGCTTTGCTCTGATTCTTGGGAATGCTTGTTAGGAAAGTCTTTTGCAGGTGTGGTTTCCTAACAGAATGATATTCATGTACTTCATAAAAAGCCCATGGTCTCTGAAGTCAGGTGTTGCGCTCAGGAATGCACGAACTGAGTGTTCTTTGAATTTTTTATCTCCATCCTCCCAAGTTGATAATTCTCCTATCTGTTCTTAGCAGGTTCCATCTGCATGGTTCTGAGTACTTGTAGGATTATTCGGAGTATTGGCTTTTCAGAAACTCTGGATAGAAAGGAATTACCCCATTTCTTAAATGATACAAGTTTGCAGTGGCTCTTCATTTTATTTTTCTTTTGAATGATACAAGGCAGTTGGAGAATCTTTTTTATGTTTATTTTTGCTTGGACGGAGAGTATACCTTATCCCTTCGGGACACTTTTGCATATTAATGTACTGATATTTCCCTGGTAATATATCTTCATATCGCACTTACTTAATCTTCTGCACTTCCAAGATCAAAATCAACACGTGAGGTTTGGAAGCATCCAAGGGATCGGATATCTAGGGATTCAAAATGGGATAGTTTATATAGTACTTTCTCTTTAAGAGGATTTCCTTCTTGCCATTCAGTTTTTTATTTTTATTTTTATTTTTTTTACAATTAATAATAATAATAACTTTTAAAAGAAAAGTAACGTCAAGTTCATTGTCTATGTGGTTATAAGCAATTATACTTGCTCCTTGATAGATTTTTATAACTAACATATCTTGCAATGAAGTGCACTATCTGACATGGCTGATTCTGTTTTGTACTCAGTCTAATAATTAAGCAGACGAAGTGCACTATCTGACATGGCTGATTCTGTTTTGTACTCAGTCTAATAATTAAGCAGACTTATATGCTTTTGTGTCCAAACATTTATTCCCAATAAAATTGATGCAGAACCTGGCTAGTGTTGAACCTCCAAAAACAACTGTTAAGAGGATGCTTTTGCAAGGAATACCCTCCTCTGATGAGAAACTATTTCGGTCTCGCACACCTGAAGGGGCAGCAACAGTTGCAGGGCCAGCTAGTCGATTAACATCATCTAAGTCATCATCATCATCCAAAAATGCAGGTAATCCCTAAGATGAAGCATAAGCAGAAGGTTTTTATAGTCATTTGCTTGCCCGCAGCTTTCACAAATTCTATAAAATCACTTGTACCTTTTGAGTTTTGAATGTTTCAGTTTGAGGGTATTAGTAGGAGATTTCTTGGTAACAGTAAAAGGATGGTAATAAGGGCATAGTGATAATTAGCTGGAGGAGCTTTGGTTATAAATAAGGGAGGTTGTGCACCTTAGGAGCGGGGGGGATCAGTTTGGTATCCTTTGTATATTTGTTGAGGGAGAGACATAGTCTTCTTGAATGGCTATTGATATTGCAATAAAGTTATCTTTGATGTTTTCATATATTTTCTGTGTTTTGGTAACCTGACATCCAACTTATATTTATTTTACATATCCAGGACATGACTCTGAGAATCCAGCAACTCCTTTTATGTGGGCTAGCCCTTTACAACCATCAAATACCTCCCGTCAGAAATCTCAACCATTGCCAAAAACTAATACTACAGCGCCATCTCCACTGTCAGTATTCCAATCATCACATGAAATGCTGAAAAAAAGTAATAATGAAGCTTTCAATGTGACTTCAGAAAACAAATTTATTGAAAAGTCGAAAGCTTCTGATTTCTTCTCAGTCACTAGGAGCGACTCTGTCCAGAAATCTAATATAAACCTTGATCAGAAATCATCCATCTTTACGATATCATCTAAGCAGACGCCCACACTGAAAGATTCTATTAATACCTCTAATTTGGACAATCAGAAGACTGCTAACGCAAAAGAGAGGCATACAACTACAAGTCCACTTTTTGGATCTGCAAATAAACCTGAATCTGCATCTCTTGGTACAATGTCTTCTCTGGTTCCTACTGTTAATGAAGCAAGAAAGACTGAAGAAAAAAGATCGCTGACAACGATTTCACCATCAGTTCCAGCATCAGCACAGTTAAATACTCCAAGTTCATCAACTTTATTTTTAGGATTTGCTGTAAGCAAACCTCTTCCAAGTTCTGCTGCTGTTATAGATCTCAATCAACCTGTGTCAACATCAACCCAATTGAACTTCTCCACCCCAGTTGTTTCTGTTTCTGATTCCCTATTTCAGGCACCTAAGATGATATCAACATCATCTACTCTGTCTTTGAATCCTTCATTGGAGTCCTCGAAAAAAGAGTTACCTGTTTCAAAATCAGATGATGATACTGAAAAGCAAACACCAGCTTCAAAGCCTGAGTCTTATGAACTGAAATTTCAACCTTCTGTAACACCTGATAAAAAGCATGTAGAGCCAACTTCTAAAACCCACACAGTTTCCAAAGATGTTGGAGGACAGGTTCCAAATGTAATAGGGGATGCTCAACCACAACAGCCATCTGTTGCTTTTGCTCCATTACCTTCACCAAACTTAACTCCTAAGATTTTTGGTAATGTAAGAAATGAAACTTCAAACGTGACGGCTACTCAGGATGATGATATGGACGAAGAGGCTCCAGAGACGAATAACAACATCGAGTTTAGTTTGAGCAGCTTGGGAGGATTTGGAAATAGCTCCACCCCTATGTCAGGTGCTCCTAAACCAAATCCATTTGGTGGTCCATTCGGTAATGTGAATGCAACCTCAATGACCTCTTCCTTTACTATGGCATCTCCTCCAAGTGGAGAGCTGTTTCGGCCTGCATCGTTTAGCTTCCAATCTCCATTGGCTTCACAAGCAGCATCACAACCCACAAATTCAGTTGCATTCTCTGGTGGCTTTGGCTCTGCAATGGCTACTCAAGCCCCGTCGCAAGGTGGGTTCGGTCAGCCTGCTCAGATTGGAGTAGGGCAGCAAGCTCTTGGTAATGTTCTTGGTTCATTCGGACAATCAAGACAGCTTGGTCCTAGTCTCCCTGGAACTGGTTCAGGATCCCCTGGCGGTTTTGGTGGTGGGTTTACCAGTATGAAACCGGTTGGTGGGTTTGCCAGTGTTGGTTCAAGTGGTGGTGGTAGTGGTGGGTTCGCTGGTGTTGGTTCAGGGGGTGGTGGTGGGTTTGGTGGTGTTGGTTCGAATGGTGGTGGTTTCGCTGGCACAGTCCCAACCGGTGGTGGATTTGCTGGTGCTTCCTCTACAACGGGAGGTTTTGCTGGTGCTGCAGGCGGGGGTTTTGCAGGTGCCGCAGGTGGATTTGGGGCTTTCGGCAGCCAGCAAGGAAGCGGGGGTTTCTCTGCTTTTTGTGTTGCTGCTGGTGGAGCTGGAGGAACTGGAAAACCTCCTGAGCTTTTCACCCAGATGAGAAAGTAG
mRNA sequence
ATGGCTTCCGTTGATTCCCGACCTTCAACCTTGATTCCATTAGAAGACGCCGGCGAAGGAGAACAAATTGTAAGGAACGGTTTCTACTTCCAAAAGATCAGCAAACCTGTTACCGTCAAGCTCTGCGACTCCATCTTTTATCCCGAAACCCCTCCCTCTCAGCCTCTTGCTCTCTCCGAGAGTTTCGGCGGAACAACATTGGTTGCTTCCAAAAACCTTCTGCAGAGCTTCTCTCTCGCTTCAGAGAGAAGCCTTTTGCAATTCATGGCTTCCGTTGATTCCCGACCTTCAACCTTGATTCCATTAGAAGACGCCGGCGAAGGAGAACAAATTGTAAGGAACGATTTCTACTTCCAGAAGATCGGCAAACCTGTCCCGGTCAAGCTCTGCGACTCCATTTTTGATCCCCAGACCCCTCCCTCTCAGCCTCTTGCTCTCTCCGAGAGTTTCGGTCTCATCTTCGTTGCGCATTTGTCTGGTTGGACCAAGGATGTAATTGCTTCGGCCGAGGAGATAAAAAACGGGGGAACTGGTTCTTCTGTCCAGGATTTAAGCATAGTGGATATTTCCATCGGAAAAGTTCACATTCTAACTCTTTCCACGGATGATTCCATTCTTGCTGCCATCGTAGCTGGTGATATTCATCTTTTTTCAGTCCAGTCCCTGCTTGATAAGGCAAAAACACCCTCTTCTTCTTGTTCATTAACTGATTCCAGTTTCATCAAAGACTTCAAATGGACCAGAAAGTTGGAAGATTCTTATCTGGTTCTTTCAAAGCATGGACAGTTATATCAAGGATCGGCGAATGGGCCTCCTACACATGTGATGCACGATATTGATGCTGTTGACTGTATCAAGTGGGTTCGTGCTGATTGTATCATCATAGGATGCTTTCAAGTGACTGCAACAGGTGATGAAGAAGATTACTTTGTCCTAGTTATCAGAAGTAAAGATGGAAAAATCACTGACGTTTCTTCAAACAAAGTTTTGTTATCATTCCGTGATATACATTCAGGTTTCACTCGTGACATTTTGCCTGGTGATATTGGGCCTTGTTTACTGTTGAGTTATTTGGATAAATGCAAGCTCGCAATTGTTGCAAATAGGCTCTATATGGAAGAGCATATTGTGTTGCTTGGTTTGTTGCAAGAGGTTGAGAACGAAGTTGCAGTTATTAATATTGATAGAAATACCTCTCTCCCGAAGATTGAGCTTCAAGCGAATGGTGATGATAATTTGGTAATGGGGCTGTGCATTGATCGAGTTTCTCTTCCTGGGAAGGTGTTAGTTAGAGTTGGATTTGAAGATATGAGAGAAGTTTCGCCATATTGCATTCTCGTGTGTCTTACTTTAGAGGGAGAGCTCATTATGTTTCAATTTTCTAGTGTCAATGAAACTGAAGCTCCACATGAGACTGTTTCTGCTTGTGATGAGGAGGAAGATGATATAATAGTGCCTGCTGATGATCGGTCTCAACTCTTTTCTGGATCAAAGAAAGAGTTTAGAGAAGATGATCTTAAGATGCAGGTTACGGAAAAACTTGCAATCAGTAGTGAGATTCCTCAGGAAAAAATTAAAATCTCAAATGACATTAAGTCTTCTAATAATGATCAAAGTCCAGTATCTAAAATAGATGAGAGTGCAACTGTTGGTGCAGAGAGTAATACTAAAAGCCAGAAAGCCGATTCTTTCATTTATTCACAATCATTAGAGTCTTCTGTCCTGGAGAGACCCAACTATGAGATTGGGAACTTTGATAAGCCTGTTCAAAAATTTGGTCTCGGGTCTGTTTCTATTTCAGGTAAGTCTGCGGACGTGCATAGCCAGCCCTTTCCCAATGTAAAAGAATCAACAAAAAGAATGGTGTCAACTGGCTTGTTGGCTGCATCTGAGTTATCCAGTGATAAAGCAATGTTTTTAAATAAAATCGATCCTATATCTTCAGTCCTAACTCCGAATTCTTTTCAAAGCAGCAAGACTGAGAATTATGGGCCAAGTTTTGGTACAGCGAATGCTTTTGCAGGTTTTTCTGGAAAACCTTTTCAACCGAAGGATGTTCCAAGTACATTAACACAAAGTGGGAAACAAGTAACGGGAGGTGCTGGTAAAATTGAATCTTTACCAGTGTTACGTAGCTCACAAATATCATTGCAAGACAACTTGTCGGCGAAAATTTCTAATGAGAAACATGATGGTTCAGACCGAAATTACAGCAATGCCCCCCTGGCAAAACCAATGAAAGAAATGTGTGAAGGATTGGACATGCTTCTGGAGTCTATAGAAGAGCCGGGTGGGTTCTTGGATGCCTGCACTGCTTTCCAGAAAAGCTCTGTTGAAGCTTTGGAGCGTGGCTTAGCCAGTCTTTCAGACGAATGTCAAATATGGAAGAGCACAATGAATGAGCGTTCACAGGAGGTACAAAATCTCTTTGACAAAATGGTACAAGTTTTGTCAAAGAAGACGTACATTGAAGGTATTGTGATGCAAGCTTCTGACAGCAAGTACTGGGAACAATGGGATCGTCAAAAGTTGAGTTCAGAATTAGAGTTAAAGCGACAACACATCTTAAAGATGAATCAGAATATGACTAACCAGTTAATTGAGTTAGAAAGACATTTTAATGGCCTTGAATTGAATAAGTTTGGTGGAAATGAGGAAAGTCAAGCGAGTGAAAGAGCTCTTCAAAGGAAATTTGGTTATTCGAGGCATAGTCATTCATTACATAGTTTGAATAACATAATGGGATCTCAATTAGCAGCAGCTCAACTTCTTTCTGAAAGTCTATCAAAACAATTGGCTGCACTCAATATAGAATCACCCTCTTTAAAAAGGCAGAGAGTCACGAAGGAATTGTTTGAGACTATTGGACTTACTTATGATGCTTCTTTCGGTTCTCCAAATGTGAACAAAATTGCAGAAACTTCTAGCAAGAAGCTTTTACTTTCTGCTGATTCTTTTTCAAGTAAAGATACACTGAGAAGAAAACAGCGGAGTGGAAGGAAAAATTCTGAAGCAGAAACTGGGAGAAGGAGAAGAGATTCACTCGACAGGAACCTGGCTAGTGTTGAACCTCCAAAAACAACTGTTAAGAGGATGCTTTTGCAAGGAATACCCTCCTCTGATGAGAAACTATTTCGGTCTCGCACACCTGAAGGGGCAGCAACAGTTGCAGGGCCAGCTAGTCGATTAACATCATCTAAGTCATCATCATCATCCAAAAATGCAGCAACTCCTTTTATGTGGGCTAGCCCTTTACAACCATCAAATACCTCCCGTCAGAAATCTCAACCATTGCCAAAAACTAATACTACAGCGCCATCTCCACTGTCAGTATTCCAATCATCACATGAAATGCTGAAAAAAAGTAATAATGAAGCTTTCAATGTGACTTCAGAAAACAAATTTATTGAAAAGTCGAAAGCTTCTGATTTCTTCTCAGTCACTAGGAGCGACTCTGTCCAGAAATCTAATATAAACCTTGATCAGAAATCATCCATCTTTACGATATCATCTAAGCAGACGCCCACACTGAAAGATTCTATTAATACCTCTAATTTGGACAATCAGAAGACTGCTAACGCAAAAGAGAGGCATACAACTACAAGTCCACTTTTTGGATCTGCAAATAAACCTGAATCTGCATCTCTTGGTACAATGTCTTCTCTGGTTCCTACTGTTAATGAAGCAAGAAAGACTGAAGAAAAAAGATCGCTGACAACGATTTCACCATCAGTTCCAGCATCAGCACAGTTAAATACTCCAAGTTCATCAACTTTATTTTTAGGATTTGCTGTAAGCAAACCTCTTCCAAGTTCTGCTGCTGTTATAGATCTCAATCAACCTGTGTCAACATCAACCCAATTGAACTTCTCCACCCCAGTTGTTTCTGTTTCTGATTCCCTATTTCAGGCACCTAAGATGATATCAACATCATCTACTCTGTCTTTGAATCCTTCATTGGAGTCCTCGAAAAAAGAGTTACCTGTTTCAAAATCAGATGATGATACTGAAAAGCAAACACCAGCTTCAAAGCCTGAGTCTTATGAACTGAAATTTCAACCTTCTGTAACACCTGATAAAAAGCATGTAGAGCCAACTTCTAAAACCCACACAGTTTCCAAAGATGTTGGAGGACAGGTTCCAAATGTAATAGGGGATGCTCAACCACAACAGCCATCTGTTGCTTTTGCTCCATTACCTTCACCAAACTTAACTCCTAAGATTTTTGGTAATGTAAGAAATGAAACTTCAAACGTGACGGCTACTCAGGATGATGATATGGACGAAGAGGCTCCAGAGACGAATAACAACATCGAGTTTAGTTTGAGCAGCTTGGGAGGATTTGGAAATAGCTCCACCCCTATGTCAGGTGCTCCTAAACCAAATCCATTTGGTGGTCCATTCGGTAATGTGAATGCAACCTCAATGACCTCTTCCTTTACTATGGCATCTCCTCCAAGTGGAGAGCTGTTTCGGCCTGCATCGTTTAGCTTCCAATCTCCATTGGCTTCACAAGCAGCATCACAACCCACAAATTCAGTTGCATTCTCTGGTGGCTTTGGCTCTGCAATGGCTACTCAAGCCCCGTCGCAAGGTGGGTTCGGTCAGCCTGCTCAGATTGGAGTAGGGCAGCAAGCTCTTGGTAATGTTCTTGGTTCATTCGGACAATCAAGACAGCTTGGTCCTAGTCTCCCTGGAACTGGTTCAGGATCCCCTGGCGGTTTTGGTGGTGGGTTTACCAGTATGAAACCGGTTGGTGGGTTTGCCAGTGTTGGTTCAAGTGGTGGTGGTAGTGGTGGGTTCGCTGGTGTTGGTTCAGGGGGTGGTGGTGGGTTTGGTGGTGTTGGTTCGAATGGTGGTGGTTTCGCTGGCACAGTCCCAACCGGTGGTGGATTTGCTGGTGCTTCCTCTACAACGGGAGGTTTTGCTGGTGCTGCAGGCGGGGGTTTTGCAGGTGCCGCAGGTGGATTTGGGGCTTTCGGCAGCCAGCAAGGAAGCGGGGGTTTCTCTGCTTTTTGTGTTGCTGCTGGTGGAGCTGGAGGAACTGGAAAACCTCCTGAGCTTTTCACCCAGATGAGAAAGTAG
Coding sequence (CDS)
ATGGCTTCCGTTGATTCCCGACCTTCAACCTTGATTCCATTAGAAGACGCCGGCGAAGGAGAACAAATTGTAAGGAACGGTTTCTACTTCCAAAAGATCAGCAAACCTGTTACCGTCAAGCTCTGCGACTCCATCTTTTATCCCGAAACCCCTCCCTCTCAGCCTCTTGCTCTCTCCGAGAGTTTCGGCGGAACAACATTGGTTGCTTCCAAAAACCTTCTGCAGAGCTTCTCTCTCGCTTCAGAGAGAAGCCTTTTGCAATTCATGGCTTCCGTTGATTCCCGACCTTCAACCTTGATTCCATTAGAAGACGCCGGCGAAGGAGAACAAATTGTAAGGAACGATTTCTACTTCCAGAAGATCGGCAAACCTGTCCCGGTCAAGCTCTGCGACTCCATTTTTGATCCCCAGACCCCTCCCTCTCAGCCTCTTGCTCTCTCCGAGAGTTTCGGTCTCATCTTCGTTGCGCATTTGTCTGGTTGGACCAAGGATGTAATTGCTTCGGCCGAGGAGATAAAAAACGGGGGAACTGGTTCTTCTGTCCAGGATTTAAGCATAGTGGATATTTCCATCGGAAAAGTTCACATTCTAACTCTTTCCACGGATGATTCCATTCTTGCTGCCATCGTAGCTGGTGATATTCATCTTTTTTCAGTCCAGTCCCTGCTTGATAAGGCAAAAACACCCTCTTCTTCTTGTTCATTAACTGATTCCAGTTTCATCAAAGACTTCAAATGGACCAGAAAGTTGGAAGATTCTTATCTGGTTCTTTCAAAGCATGGACAGTTATATCAAGGATCGGCGAATGGGCCTCCTACACATGTGATGCACGATATTGATGCTGTTGACTGTATCAAGTGGGTTCGTGCTGATTGTATCATCATAGGATGCTTTCAAGTGACTGCAACAGGTGATGAAGAAGATTACTTTGTCCTAGTTATCAGAAGTAAAGATGGAAAAATCACTGACGTTTCTTCAAACAAAGTTTTGTTATCATTCCGTGATATACATTCAGGTTTCACTCGTGACATTTTGCCTGGTGATATTGGGCCTTGTTTACTGTTGAGTTATTTGGATAAATGCAAGCTCGCAATTGTTGCAAATAGGCTCTATATGGAAGAGCATATTGTGTTGCTTGGTTTGTTGCAAGAGGTTGAGAACGAAGTTGCAGTTATTAATATTGATAGAAATACCTCTCTCCCGAAGATTGAGCTTCAAGCGAATGGTGATGATAATTTGGTAATGGGGCTGTGCATTGATCGAGTTTCTCTTCCTGGGAAGGTGTTAGTTAGAGTTGGATTTGAAGATATGAGAGAAGTTTCGCCATATTGCATTCTCGTGTGTCTTACTTTAGAGGGAGAGCTCATTATGTTTCAATTTTCTAGTGTCAATGAAACTGAAGCTCCACATGAGACTGTTTCTGCTTGTGATGAGGAGGAAGATGATATAATAGTGCCTGCTGATGATCGGTCTCAACTCTTTTCTGGATCAAAGAAAGAGTTTAGAGAAGATGATCTTAAGATGCAGGTTACGGAAAAACTTGCAATCAGTAGTGAGATTCCTCAGGAAAAAATTAAAATCTCAAATGACATTAAGTCTTCTAATAATGATCAAAGTCCAGTATCTAAAATAGATGAGAGTGCAACTGTTGGTGCAGAGAGTAATACTAAAAGCCAGAAAGCCGATTCTTTCATTTATTCACAATCATTAGAGTCTTCTGTCCTGGAGAGACCCAACTATGAGATTGGGAACTTTGATAAGCCTGTTCAAAAATTTGGTCTCGGGTCTGTTTCTATTTCAGGTAAGTCTGCGGACGTGCATAGCCAGCCCTTTCCCAATGTAAAAGAATCAACAAAAAGAATGGTGTCAACTGGCTTGTTGGCTGCATCTGAGTTATCCAGTGATAAAGCAATGTTTTTAAATAAAATCGATCCTATATCTTCAGTCCTAACTCCGAATTCTTTTCAAAGCAGCAAGACTGAGAATTATGGGCCAAGTTTTGGTACAGCGAATGCTTTTGCAGGTTTTTCTGGAAAACCTTTTCAACCGAAGGATGTTCCAAGTACATTAACACAAAGTGGGAAACAAGTAACGGGAGGTGCTGGTAAAATTGAATCTTTACCAGTGTTACGTAGCTCACAAATATCATTGCAAGACAACTTGTCGGCGAAAATTTCTAATGAGAAACATGATGGTTCAGACCGAAATTACAGCAATGCCCCCCTGGCAAAACCAATGAAAGAAATGTGTGAAGGATTGGACATGCTTCTGGAGTCTATAGAAGAGCCGGGTGGGTTCTTGGATGCCTGCACTGCTTTCCAGAAAAGCTCTGTTGAAGCTTTGGAGCGTGGCTTAGCCAGTCTTTCAGACGAATGTCAAATATGGAAGAGCACAATGAATGAGCGTTCACAGGAGGTACAAAATCTCTTTGACAAAATGGTACAAGTTTTGTCAAAGAAGACGTACATTGAAGGTATTGTGATGCAAGCTTCTGACAGCAAGTACTGGGAACAATGGGATCGTCAAAAGTTGAGTTCAGAATTAGAGTTAAAGCGACAACACATCTTAAAGATGAATCAGAATATGACTAACCAGTTAATTGAGTTAGAAAGACATTTTAATGGCCTTGAATTGAATAAGTTTGGTGGAAATGAGGAAAGTCAAGCGAGTGAAAGAGCTCTTCAAAGGAAATTTGGTTATTCGAGGCATAGTCATTCATTACATAGTTTGAATAACATAATGGGATCTCAATTAGCAGCAGCTCAACTTCTTTCTGAAAGTCTATCAAAACAATTGGCTGCACTCAATATAGAATCACCCTCTTTAAAAAGGCAGAGAGTCACGAAGGAATTGTTTGAGACTATTGGACTTACTTATGATGCTTCTTTCGGTTCTCCAAATGTGAACAAAATTGCAGAAACTTCTAGCAAGAAGCTTTTACTTTCTGCTGATTCTTTTTCAAGTAAAGATACACTGAGAAGAAAACAGCGGAGTGGAAGGAAAAATTCTGAAGCAGAAACTGGGAGAAGGAGAAGAGATTCACTCGACAGGAACCTGGCTAGTGTTGAACCTCCAAAAACAACTGTTAAGAGGATGCTTTTGCAAGGAATACCCTCCTCTGATGAGAAACTATTTCGGTCTCGCACACCTGAAGGGGCAGCAACAGTTGCAGGGCCAGCTAGTCGATTAACATCATCTAAGTCATCATCATCATCCAAAAATGCAGCAACTCCTTTTATGTGGGCTAGCCCTTTACAACCATCAAATACCTCCCGTCAGAAATCTCAACCATTGCCAAAAACTAATACTACAGCGCCATCTCCACTGTCAGTATTCCAATCATCACATGAAATGCTGAAAAAAAGTAATAATGAAGCTTTCAATGTGACTTCAGAAAACAAATTTATTGAAAAGTCGAAAGCTTCTGATTTCTTCTCAGTCACTAGGAGCGACTCTGTCCAGAAATCTAATATAAACCTTGATCAGAAATCATCCATCTTTACGATATCATCTAAGCAGACGCCCACACTGAAAGATTCTATTAATACCTCTAATTTGGACAATCAGAAGACTGCTAACGCAAAAGAGAGGCATACAACTACAAGTCCACTTTTTGGATCTGCAAATAAACCTGAATCTGCATCTCTTGGTACAATGTCTTCTCTGGTTCCTACTGTTAATGAAGCAAGAAAGACTGAAGAAAAAAGATCGCTGACAACGATTTCACCATCAGTTCCAGCATCAGCACAGTTAAATACTCCAAGTTCATCAACTTTATTTTTAGGATTTGCTGTAAGCAAACCTCTTCCAAGTTCTGCTGCTGTTATAGATCTCAATCAACCTGTGTCAACATCAACCCAATTGAACTTCTCCACCCCAGTTGTTTCTGTTTCTGATTCCCTATTTCAGGCACCTAAGATGATATCAACATCATCTACTCTGTCTTTGAATCCTTCATTGGAGTCCTCGAAAAAAGAGTTACCTGTTTCAAAATCAGATGATGATACTGAAAAGCAAACACCAGCTTCAAAGCCTGAGTCTTATGAACTGAAATTTCAACCTTCTGTAACACCTGATAAAAAGCATGTAGAGCCAACTTCTAAAACCCACACAGTTTCCAAAGATGTTGGAGGACAGGTTCCAAATGTAATAGGGGATGCTCAACCACAACAGCCATCTGTTGCTTTTGCTCCATTACCTTCACCAAACTTAACTCCTAAGATTTTTGGTAATGTAAGAAATGAAACTTCAAACGTGACGGCTACTCAGGATGATGATATGGACGAAGAGGCTCCAGAGACGAATAACAACATCGAGTTTAGTTTGAGCAGCTTGGGAGGATTTGGAAATAGCTCCACCCCTATGTCAGGTGCTCCTAAACCAAATCCATTTGGTGGTCCATTCGGTAATGTGAATGCAACCTCAATGACCTCTTCCTTTACTATGGCATCTCCTCCAAGTGGAGAGCTGTTTCGGCCTGCATCGTTTAGCTTCCAATCTCCATTGGCTTCACAAGCAGCATCACAACCCACAAATTCAGTTGCATTCTCTGGTGGCTTTGGCTCTGCAATGGCTACTCAAGCCCCGTCGCAAGGTGGGTTCGGTCAGCCTGCTCAGATTGGAGTAGGGCAGCAAGCTCTTGGTAATGTTCTTGGTTCATTCGGACAATCAAGACAGCTTGGTCCTAGTCTCCCTGGAACTGGTTCAGGATCCCCTGGCGGTTTTGGTGGTGGGTTTACCAGTATGAAACCGGTTGGTGGGTTTGCCAGTGTTGGTTCAAGTGGTGGTGGTAGTGGTGGGTTCGCTGGTGTTGGTTCAGGGGGTGGTGGTGGGTTTGGTGGTGTTGGTTCGAATGGTGGTGGTTTCGCTGGCACAGTCCCAACCGGTGGTGGATTTGCTGGTGCTTCCTCTACAACGGGAGGTTTTGCTGGTGCTGCAGGCGGGGGTTTTGCAGGTGCCGCAGGTGGATTTGGGGCTTTCGGCAGCCAGCAAGGAAGCGGGGGTTTCTCTGCTTTTTGTGTTGCTGCTGGTGGAGCTGGAGGAACTGGAAAACCTCCTGAGCTTTTCACCCAGATGAGAAAGTAG
Protein sequence
MASVDSRPSTLIPLEDAGEGEQIVRNGFYFQKISKPVTVKLCDSIFYPETPPSQPLALSESFGGTTLVASKNLLQSFSLASERSLLQFMASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSESFGLIFVAHLSGWTKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDSILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLYQGSANGPPTHVMHDIDAVDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRDIHSGFTRDILPGDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINIDRNTSLPKIELQANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLEGELIMFQFSSVNETEAPHETVSACDEEEDDIIVPADDRSQLFSGSKKEFREDDLKMQVTEKLAISSEIPQEKIKISNDIKSSNNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLESSVLERPNYEIGNFDKPVQKFGLGSVSISGKSADVHSQPFPNVKESTKRMVSTGLLAASELSSDKAMFLNKIDPISSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLTQSGKQVTGGAGKIESLPVLRSSQISLQDNLSAKISNEKHDGSDRNYSNAPLAKPMKEMCEGLDMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERSQEVQNLFDKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELERHFNGLELNKFGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSKQLAALNIESPSLKRQRVTKELFETIGLTYDASFGSPNVNKIAETSSKKLLLSADSFSSKDTLRRKQRSGRKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSRTPEGAATVAGPASRLTSSKSSSSSKNAATPFMWASPLQPSNTSRQKSQPLPKTNTTAPSPLSVFQSSHEMLKKSNNEAFNVTSENKFIEKSKASDFFSVTRSDSVQKSNINLDQKSSIFTISSKQTPTLKDSINTSNLDNQKTANAKERHTTTSPLFGSANKPESASLGTMSSLVPTVNEARKTEEKRSLTTISPSVPASAQLNTPSSSTLFLGFAVSKPLPSSAAVIDLNQPVSTSTQLNFSTPVVSVSDSLFQAPKMISTSSTLSLNPSLESSKKELPVSKSDDDTEKQTPASKPESYELKFQPSVTPDKKHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSVAFAPLPSPNLTPKIFGNVRNETSNVTATQDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPMSGAPKPNPFGGPFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGGFGSAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSPGGFGGGFTSMKPVGGFASVGSSGGGSGGFAGVGSGGGGGFGGVGSNGGGFAGTVPTGGGFAGASSTTGGFAGAAGGGFAGAAGGFGAFGSQQGSGGFSAFCVAAGGAGGTGKPPELFTQMRK
Homology
BLAST of CcUC11G220380 vs. NCBI nr
Match:
XP_038892124.1 (nuclear pore complex protein NUP214 isoform X2 [Benincasa hispida])
HSP 1 Score: 2620.5 bits (6791), Expect = 0.0e+00
Identity = 1466/1683 (87.11%), Postives = 1520/1683 (90.31%), Query Frame = 0
Query: 89 MASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSE 148
MASVDSRPSTLIPLEDAGEGEQ+VRNDFYFQKIG+PVPVKL DSIFDP+TPPSQPLALSE
Sbjct: 1 MASVDSRPSTLIPLEDAGEGEQVVRNDFYFQKIGRPVPVKLGDSIFDPETPPSQPLALSE 60
Query: 149 SFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDS 208
S GLIFVAHLSG+ TKDVIASAEEIKNGGTGSSVQDLSIVD+SIGKVHIL LSTDDS
Sbjct: 61 SSGLIFVAHLSGFFVVRTKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILALSTDDS 120
Query: 209 ILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLY 268
ILAA+VA DIHLFSVQSLLDKA+TPSSSCS+TDSS IKDFKWTRKLEDSYLVLSKHGQLY
Sbjct: 121 ILAAVVARDIHLFSVQSLLDKAETPSSSCSITDSSCIKDFKWTRKLEDSYLVLSKHGQLY 180
Query: 269 QGSANGPPTHVMHDIDA------------------------------------------- 328
QGSANG THVMHDIDA
Sbjct: 181 QGSANGSLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSSGNGETD 240
Query: 329 ------VDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 388
VDCIKWVRADCII+GCFQ+TATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD
Sbjct: 241 TDFTVKVDCIKWVRADCIIMGCFQMTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 300
Query: 389 IHSGFTRDILPGDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINID 448
IHSGFTRDILPGD GPCLLLSYLDKCKLAIVANRLYME+HIVLLGLLQEVENEVAVINID
Sbjct: 301 IHSGFTRDILPGDSGPCLLLSYLDKCKLAIVANRLYMEDHIVLLGLLQEVENEVAVINID 360
Query: 449 RNTSLPKIELQANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLEGEL 508
RNTSLPKIELQANGDDNLVMGLCIDRVSLPGKV+VRVGFEDMREVSPYCILVCLTLEG+L
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCIDRVSLPGKVIVRVGFEDMREVSPYCILVCLTLEGDL 420
Query: 509 IMFQFSSVNETEAPHETVSACDEEEDDIIVPADDRSQLFSGSKKEFRE--DDLKMQVTEK 568
IMFQFSSVNETEAPHETV ACDEEEDDIIVPADDRSQL S SKKEFRE +DLKMQV EK
Sbjct: 421 IMFQFSSVNETEAPHETVPACDEEEDDIIVPADDRSQLSSESKKEFREANNDLKMQVMEK 480
Query: 569 LAISSEIPQEKIKISNDIKSSNNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLESS 628
+AISSEIP EKIKISNDIKSSNNDQS VSKI ESATVGAESNTKS+KADSFIYSQSL+SS
Sbjct: 481 IAISSEIPGEKIKISNDIKSSNNDQSLVSKIGESATVGAESNTKSRKADSFIYSQSLKSS 540
Query: 629 VLERPNYEIGNFDKPVQKFGLGSVSISGKSADVHSQPFPNVKESTKRMVSTGLLAASELS 688
VLER NYEIGNFDKPVQKFGLG VSISGKS DVHSQPFPNVKESTK++ STGLLAASELS
Sbjct: 541 VLERSNYEIGNFDKPVQKFGLGPVSISGKSVDVHSQPFPNVKESTKKLGSTGLLAASELS 600
Query: 689 SDKAMFLNKIDPISSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLTQS 748
SDKA+FLNKIDP+SSVL PNSFQSSKTENY PSFGTAN FAGF+GKPFQPKDVPSTLTQS
Sbjct: 601 SDKAIFLNKIDPVSSVLIPNSFQSSKTENYVPSFGTANCFAGFAGKPFQPKDVPSTLTQS 660
Query: 749 GKQVTGGAGKIESLPVLRSSQISLQDNLSAKISNEKHDGSDRNYSNAPLAKPMKEMCEGL 808
G+QV GGAGKIESLPV+RSSQISLQDNL KISNEKHDGSDR+YSNAPLAKPMKEMCE L
Sbjct: 661 GRQVMGGAGKIESLPVIRSSQISLQDNLPGKISNEKHDGSDRSYSNAPLAKPMKEMCEAL 720
Query: 809 DMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERSQEVQNLFDKM 868
DMLLESIEEPGGFLDACTAFQKSSVEALE GLASL DECQIWKSTMNER+QEVQNLFDKM
Sbjct: 721 DMLLESIEEPGGFLDACTAFQKSSVEALELGLASLLDECQIWKSTMNERAQEVQNLFDKM 780
Query: 869 VQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELERHF 928
+QVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELERHF
Sbjct: 781 IQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELERHF 840
Query: 929 NGLELNKFGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSKQLA 988
NGLELNKFGGNEESQ +ERALQRKFG SRHSHSLHSLNNIMGSQLAAAQLLSESLSKQLA
Sbjct: 841 NGLELNKFGGNEESQVNERALQRKFGSSRHSHSLHSLNNIMGSQLAAAQLLSESLSKQLA 900
Query: 989 ALNIESPSLKRQRVTKELFETIGLTYDASFGSPNVNKIAETSSKKLLLSADSFSSKDTLR 1048
ALNIESPSLKRQ VTKELFETIGLTYDASF SPNVNKIAETSSKKLLLSADSFS KDT R
Sbjct: 901 ALNIESPSLKRQSVTKELFETIGLTYDASFSSPNVNKIAETSSKKLLLSADSFSHKDTSR 960
Query: 1049 RKQRSGRKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSRTPEGA 1108
RKQ SG KNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLL GIPSSDEKLFRS TP+GA
Sbjct: 961 RKQWSGTKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLHGIPSSDEKLFRSHTPDGA 1020
Query: 1109 ATVAGPASRLTSSKSSSSSKNA-------ATPFMWASPLQPSNTSRQKSQPLPKTNTTAP 1168
ATVA PASRLTSS SSSSSKNA ATPFMWASPLQPSN SRQKSQPL KTN TAP
Sbjct: 1021 ATVAWPASRLTSSMSSSSSKNAGHDSENPATPFMWASPLQPSNISRQKSQPLQKTNATAP 1080
Query: 1169 SPLSVFQSSHEMLKKSNNEAFNVTSENKFIEKSKASDFFSVTRSDSVQKSNINLDQKSSI 1228
S LSVFQSSHEMLKKSNNEAF+VTSENKF EKSKASDFFSVTR+DSVQKSN NLD+K SI
Sbjct: 1081 S-LSVFQSSHEMLKKSNNEAFSVTSENKFTEKSKASDFFSVTRTDSVQKSNTNLDKKPSI 1140
Query: 1229 FTISSKQTPTLKDSINTSNLDNQKTANAKERHTTTSPLFGSANKPESASLGTMSSLVPTV 1288
FTISSKQ T KD I+TSNLDNQKTAN+KERHTTTSPLFGSANKPESAS+GTMSSLVPTV
Sbjct: 1141 FTISSKQMATPKDFIDTSNLDNQKTANSKERHTTTSPLFGSANKPESASVGTMSSLVPTV 1200
Query: 1289 NEARKTEEKRSLTTISPSVPA--SAQLNTP-SSSTLFLGFAVSKPLPSSAAVIDLNQPVS 1348
+EARK EKRSL TISPSVPA A+ N+P SSSTLF GFAVSKPLPSSAA IDLNQP+S
Sbjct: 1201 DEARK--EKRSLKTISPSVPAPTPARFNSPSSSSTLFSGFAVSKPLPSSAAAIDLNQPLS 1260
Query: 1349 TSTQLNFSTPVVSVSDSLFQAPKMISTSSTL-SLNPSLESSKKELPVSKSDDDTEKQTPA 1408
TSTQLNFS+PVVSVSDSLFQA KM+STSSTL SLNP LESSKKELPVSKS+ DTEK+TPA
Sbjct: 1261 TSTQLNFSSPVVSVSDSLFQATKMVSTSSTLSSLNPILESSKKELPVSKSEGDTEKKTPA 1320
Query: 1409 SKPESYELKFQPSVTP-DKKHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSVAFAPLPS 1468
SKPES+ELKFQPS+TP +K H+EPTSKT TV KDVGGQ+PNVIGDAQPQ PSVAFA LPS
Sbjct: 1321 SKPESHELKFQPSITPANKNHLEPTSKTQTVPKDVGGQIPNVIGDAQPQPPSVAFASLPS 1380
Query: 1469 PNLTPKIFGNVRNETSNVTATQDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPMSGAPKP 1528
PNLT K GN RNETSNVT TQDDDMDEEAPET NN+EF+LS LGGFGNSS+PMSGAPKP
Sbjct: 1381 PNLTSKTSGNGRNETSNVTVTQDDDMDEEAPETINNVEFNLSGLGGFGNSSSPMSGAPKP 1440
Query: 1529 NPFGGPFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGGFG 1588
NPFGG FGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAAS PTNSVAFSGGFG
Sbjct: 1441 NPFGGSFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASLPTNSVAFSGGFG 1500
Query: 1589 SAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSPGGFGGGFTS 1648
SAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGS G F GGFT
Sbjct: 1501 SAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSLGSFSGGFTG 1560
Query: 1649 MKP--VGGFASVGSSGGGSGGFAGVGSGGGGGFGGVGSNGGGFAGTVPTGGGFAGASSTT 1695
MKP VGGFA VGSS GGSGGFAGVGSGGGGGFGGVGS GGFAGT+ TGGGFAGAS+TT
Sbjct: 1561 MKPVAVGGFAGVGSS-GGSGGFAGVGSGGGGGFGGVGSAAGGFAGTISTGGGFAGASATT 1620
BLAST of CcUC11G220380 vs. NCBI nr
Match:
XP_038892123.1 (nuclear pore complex protein NUP214 isoform X1 [Benincasa hispida])
HSP 1 Score: 2614.3 bits (6775), Expect = 0.0e+00
Identity = 1466/1688 (86.85%), Postives = 1520/1688 (90.05%), Query Frame = 0
Query: 89 MASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSE 148
MASVDSRPSTLIPLEDAGEGEQ+VRNDFYFQKIG+PVPVKL DSIFDP+TPPSQPLALSE
Sbjct: 1 MASVDSRPSTLIPLEDAGEGEQVVRNDFYFQKIGRPVPVKLGDSIFDPETPPSQPLALSE 60
Query: 149 SFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDS 208
S GLIFVAHLSG+ TKDVIASAEEIKNGGTGSSVQDLSIVD+SIGKVHIL LSTDDS
Sbjct: 61 SSGLIFVAHLSGFFVVRTKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILALSTDDS 120
Query: 209 ILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLY 268
ILAA+VA DIHLFSVQSLLDKA+TPSSSCS+TDSS IKDFKWTRKLEDSYLVLSKHGQLY
Sbjct: 121 ILAAVVARDIHLFSVQSLLDKAETPSSSCSITDSSCIKDFKWTRKLEDSYLVLSKHGQLY 180
Query: 269 QGSANGPPTHVMHDIDA------------------------------------------- 328
QGSANG THVMHDIDA
Sbjct: 181 QGSANGSLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSSGNGETD 240
Query: 329 ------VDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 388
VDCIKWVRADCII+GCFQ+TATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD
Sbjct: 241 TDFTVKVDCIKWVRADCIIMGCFQMTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 300
Query: 389 IHSGFTRDILPGDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINID 448
IHSGFTRDILPGD GPCLLLSYLDKCKLAIVANRLYME+HIVLLGLLQEVENEVAVINID
Sbjct: 301 IHSGFTRDILPGDSGPCLLLSYLDKCKLAIVANRLYMEDHIVLLGLLQEVENEVAVINID 360
Query: 449 RNTSLPKIELQANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLEGEL 508
RNTSLPKIELQANGDDNLVMGLCIDRVSLPGKV+VRVGFEDMREVSPYCILVCLTLEG+L
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCIDRVSLPGKVIVRVGFEDMREVSPYCILVCLTLEGDL 420
Query: 509 IMFQFSSVNETEAPHETVSACDEEEDDIIVPADDRSQLFSGSKKEFRE--DDLKMQVTEK 568
IMFQFSSVNETEAPHETV ACDEEEDDIIVPADDRSQL S SKKEFRE +DLKMQV EK
Sbjct: 421 IMFQFSSVNETEAPHETVPACDEEEDDIIVPADDRSQLSSESKKEFREANNDLKMQVMEK 480
Query: 569 LAISSEIPQEKIKISNDIKSSNNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLESS 628
+AISSEIP EKIKISNDIKSSNNDQS VSKI ESATVGAESNTKS+KADSFIYSQSL+SS
Sbjct: 481 IAISSEIPGEKIKISNDIKSSNNDQSLVSKIGESATVGAESNTKSRKADSFIYSQSLKSS 540
Query: 629 VLERPNYEIGNFDKPVQKFGLGSVSISGKSADVHSQPFPNVKESTKRMVSTGLLAASELS 688
VLER NYEIGNFDKPVQKFGLG VSISGKS DVHSQPFPNVKESTK++ STGLLAASELS
Sbjct: 541 VLERSNYEIGNFDKPVQKFGLGPVSISGKSVDVHSQPFPNVKESTKKLGSTGLLAASELS 600
Query: 689 SDKAMFLNKIDPISSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLTQS 748
SDKA+FLNKIDP+SSVL PNSFQSSKTENY PSFGTAN FAGF+GKPFQPKDVPSTLTQS
Sbjct: 601 SDKAIFLNKIDPVSSVLIPNSFQSSKTENYVPSFGTANCFAGFAGKPFQPKDVPSTLTQS 660
Query: 749 GKQVTGGAGKIESLPVLRSSQISLQDNLSAKISNEKHDGSDRNYSNAPLAKPMKEMCEGL 808
G+QV GGAGKIESLPV+RSSQISLQDNL KISNEKHDGSDR+YSNAPLAKPMKEMCE L
Sbjct: 661 GRQVMGGAGKIESLPVIRSSQISLQDNLPGKISNEKHDGSDRSYSNAPLAKPMKEMCEAL 720
Query: 809 DMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERSQEVQNLFDKM 868
DMLLESIEEPGGFLDACTAFQKSSVEALE GLASL DECQIWKSTMNER+QEVQNLFDKM
Sbjct: 721 DMLLESIEEPGGFLDACTAFQKSSVEALELGLASLLDECQIWKSTMNERAQEVQNLFDKM 780
Query: 869 VQ-----VLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIE 928
+Q VLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIE
Sbjct: 781 IQVYLVSVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIE 840
Query: 929 LERHFNGLELNKFGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESL 988
LERHFNGLELNKFGGNEESQ +ERALQRKFG SRHSHSLHSLNNIMGSQLAAAQLLSESL
Sbjct: 841 LERHFNGLELNKFGGNEESQVNERALQRKFGSSRHSHSLHSLNNIMGSQLAAAQLLSESL 900
Query: 989 SKQLAALNIESPSLKRQRVTKELFETIGLTYDASFGSPNVNKIAETSSKKLLLSADSFSS 1048
SKQLAALNIESPSLKRQ VTKELFETIGLTYDASF SPNVNKIAETSSKKLLLSADSFS
Sbjct: 901 SKQLAALNIESPSLKRQSVTKELFETIGLTYDASFSSPNVNKIAETSSKKLLLSADSFSH 960
Query: 1049 KDTLRRKQRSGRKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSR 1108
KDT RRKQ SG KNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLL GIPSSDEKLFRS
Sbjct: 961 KDTSRRKQWSGTKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLHGIPSSDEKLFRSH 1020
Query: 1109 TPEGAATVAGPASRLTSSKSSSSSKNA-------ATPFMWASPLQPSNTSRQKSQPLPKT 1168
TP+GAATVA PASRLTSS SSSSSKNA ATPFMWASPLQPSN SRQKSQPL KT
Sbjct: 1021 TPDGAATVAWPASRLTSSMSSSSSKNAGHDSENPATPFMWASPLQPSNISRQKSQPLQKT 1080
Query: 1169 NTTAPSPLSVFQSSHEMLKKSNNEAFNVTSENKFIEKSKASDFFSVTRSDSVQKSNINLD 1228
N TAPS LSVFQSSHEMLKKSNNEAF+VTSENKF EKSKASDFFSVTR+DSVQKSN NLD
Sbjct: 1081 NATAPS-LSVFQSSHEMLKKSNNEAFSVTSENKFTEKSKASDFFSVTRTDSVQKSNTNLD 1140
Query: 1229 QKSSIFTISSKQTPTLKDSINTSNLDNQKTANAKERHTTTSPLFGSANKPESASLGTMSS 1288
+K SIFTISSKQ T KD I+TSNLDNQKTAN+KERHTTTSPLFGSANKPESAS+GTMSS
Sbjct: 1141 KKPSIFTISSKQMATPKDFIDTSNLDNQKTANSKERHTTTSPLFGSANKPESASVGTMSS 1200
Query: 1289 LVPTVNEARKTEEKRSLTTISPSVPA--SAQLNTP-SSSTLFLGFAVSKPLPSSAAVIDL 1348
LVPTV+EARK EKRSL TISPSVPA A+ N+P SSSTLF GFAVSKPLPSSAA IDL
Sbjct: 1201 LVPTVDEARK--EKRSLKTISPSVPAPTPARFNSPSSSSTLFSGFAVSKPLPSSAAAIDL 1260
Query: 1349 NQPVSTSTQLNFSTPVVSVSDSLFQAPKMISTSSTL-SLNPSLESSKKELPVSKSDDDTE 1408
NQP+STSTQLNFS+PVVSVSDSLFQA KM+STSSTL SLNP LESSKKELPVSKS+ DTE
Sbjct: 1261 NQPLSTSTQLNFSSPVVSVSDSLFQATKMVSTSSTLSSLNPILESSKKELPVSKSEGDTE 1320
Query: 1409 KQTPASKPESYELKFQPSVTP-DKKHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSVAF 1468
K+TPASKPES+ELKFQPS+TP +K H+EPTSKT TV KDVGGQ+PNVIGDAQPQ PSVAF
Sbjct: 1321 KKTPASKPESHELKFQPSITPANKNHLEPTSKTQTVPKDVGGQIPNVIGDAQPQPPSVAF 1380
Query: 1469 APLPSPNLTPKIFGNVRNETSNVTATQDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPMS 1528
A LPSPNLT K GN RNETSNVT TQDDDMDEEAPET NN+EF+LS LGGFGNSS+PMS
Sbjct: 1381 ASLPSPNLTSKTSGNGRNETSNVTVTQDDDMDEEAPETINNVEFNLSGLGGFGNSSSPMS 1440
Query: 1529 GAPKPNPFGGPFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAF 1588
GAPKPNPFGG FGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAAS PTNSVAF
Sbjct: 1441 GAPKPNPFGGSFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASLPTNSVAF 1500
Query: 1589 SGGFGSAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSPGGFG 1648
SGGFGSAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGS G F
Sbjct: 1501 SGGFGSAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSLGSFS 1560
Query: 1649 GGFTSMKP--VGGFASVGSSGGGSGGFAGVGSGGGGGFGGVGSNGGGFAGTVPTGGGFAG 1695
GGFT MKP VGGFA VGSS GGSGGFAGVGSGGGGGFGGVGS GGFAGT+ TGGGFAG
Sbjct: 1561 GGFTGMKPVAVGGFAGVGSS-GGSGGFAGVGSGGGGGFGGVGSAAGGFAGTISTGGGFAG 1620
BLAST of CcUC11G220380 vs. NCBI nr
Match:
XP_031741375.1 (nuclear pore complex protein NUP214 isoform X2 [Cucumis sativus])
HSP 1 Score: 2410.2 bits (6245), Expect = 0.0e+00
Identity = 1358/1673 (81.17%), Postives = 1441/1673 (86.13%), Query Frame = 0
Query: 89 MASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSE 148
MASVDS PS+LIPLEDAGEGEQIVRND YFQKIGKPVPVKL DSIFDP++PPSQPLALSE
Sbjct: 1 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 60
Query: 149 SFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDS 208
S GLIFVAHLSG+ KDVIASAEEIKNGGTGSSVQDLSIVD+SIGKVHIL +STD+S
Sbjct: 61 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 120
Query: 209 ILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLY 268
+LAA+VAGD+H+FSVQSLLDKA+ P SSCS+TDSSFIKDFKWTRKLE++YLVLSKHGQLY
Sbjct: 121 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180
Query: 269 QGSANGPPTHVMHDIDA---------------------------------------VDCI 328
QGSANGP THVMHDIDA VDCI
Sbjct: 181 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNVDCI 240
Query: 329 KWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRDIHSGFTRDIL 388
KWVRADCIIIGCFQVTATGDEEDY V VIRSKDGKITDVSSNKVLLSF DIHSGFTRDIL
Sbjct: 241 KWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCDIHSGFTRDIL 300
Query: 389 PGDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINIDRNTSLPKIEL 448
PG+ GPCLLLSYLD CKLAIVANRLY+E+HI LLGLL EVENEVAV+NIDRNTSLPKIEL
Sbjct: 301 PGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNIDRNTSLPKIEL 360
Query: 449 QANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLEGELIMFQFSSVNE 508
QANGDDNLVMGLCIDRVSL GKV+V+VGFEDMREVSPYCILVCLTLEGELIMFQFSSVNE
Sbjct: 361 QANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGELIMFQFSSVNE 420
Query: 509 TEAPHETVSACDEEEDDIIVPADDRSQLFSGSKKEFRED--DLKMQVTEKLAISSEIPQE 568
TEAPHETVSACD+EEDDI VP DDRS+ KE RE D +MQVTEK+AISSEIP+E
Sbjct: 421 TEAPHETVSACDDEEDDITVPTDDRSE-----SKESREANIDHRMQVTEKIAISSEIPRE 480
Query: 569 KIKISNDIKSSNNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLESSVLER-PNYEI 628
K K SNDIKSS NDQS V IDESA V E NTKSQK DSFIYSQSL+SS ER P+YEI
Sbjct: 481 KGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPERPPHYEI 540
Query: 629 GNFDKPVQKF-GLGSVSISGKSADVHSQPFPNVKESTKRMVSTGLLAASELSSDKAMFLN 688
GNFDKPV KF GLGS SISGKS DV SQPFPNVKESTKR+ STGL+AASELSS+KAM
Sbjct: 541 GNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSEKAMSFK 600
Query: 689 KIDPISSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLTQSGKQVTGGA 748
KIDP+ SV T NS QSS TENYGPSFGTANAF GF+GKPFQPKDVPSTLTQSG+Q TGGA
Sbjct: 601 KIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGRQATGGA 660
Query: 749 GKIESLPVLRSSQISLQDNLSA-KISNEKHDGSDRNYSNAPLAKPMKEMCEGLDMLLESI 808
GKIESLPV+RSSQISLQD S+ KISNEKHDGS+R YSN+PLAKPMKEMCEGLD LLESI
Sbjct: 661 GKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLDTLLESI 720
Query: 809 EEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERSQEVQNLFDKMVQVLSKK 868
EE GGF+DACTAFQKSSVEALE GLASLSD CQIW+STMNERSQEVQNLFDKMVQVLSKK
Sbjct: 721 EESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQVLSKK 780
Query: 869 TYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELERHFNGLELNK 928
TYIEGIVMQ+SDSKYWEQWDRQKLSSELELKRQHILKMNQN+TNQLIELERHFNGLELNK
Sbjct: 781 TYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNGLELNK 840
Query: 929 FGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSKQLAALNIESP 988
FGGNEESQ SERALQRKFG SRHSHS+HSLNNIMGSQLA AQLLSESLSKQLAALN+ESP
Sbjct: 841 FGGNEESQVSERALQRKFGSSRHSHSVHSLNNIMGSQLATAQLLSESLSKQLAALNMESP 900
Query: 989 SLKRQRVTKELFETIGLTYDASFGSPNVNKIAETSSKKLLLSADSFSSKDTLRRKQRSGR 1048
SLKRQ TKELFE+IGLTYDASF SPNVNKIAETSSKKLLLS+DSFSSK T RRKQ+SG
Sbjct: 901 SLKRQSATKELFESIGLTYDASFSSPNVNKIAETSSKKLLLSSDSFSSKGTSRRKQQSGT 960
Query: 1049 KNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSRTPEGAATVAGPA 1108
KNSEAETGRRRRDSLDRNLASV+PPKTTVKRMLLQGIPSS+EK F SRTPEGAATVA PA
Sbjct: 961 KNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGIPSSEEKQFCSRTPEGAATVARPA 1020
Query: 1109 SRLTSSKSSSS------SKNAATPFMWASPLQPSNTSRQKSQPLPKTNTTAPSPLSVFQS 1168
SR+TSS SSSS S+N TPFMW SPLQPSNTSRQKS PL K N T PSP VFQS
Sbjct: 1021 SRITSSISSSSKNAGHDSENPETPFMWNSPLQPSNTSRQKSLPLQKINVTPPSPPPVFQS 1080
Query: 1169 SHEMLKKSNNEAFNVTSENKFI-----EKSKASDFFSVTRSDSVQKSNINLDQKSSIFTI 1228
SH+MLKK NNEA +VTSENKF EKSKASDFFS TRSDSVQKSNIN+DQKSSIFTI
Sbjct: 1081 SHDMLKKKNNEAHSVTSENKFTDVACPEKSKASDFFSATRSDSVQKSNINVDQKSSIFTI 1140
Query: 1229 SSKQTPTLKDSINTSNLDNQKTANAKERHTTTSPLFGSANKPESASLGTMSSLVPTVNEA 1288
SSKQ PT DSI TSN+DNQKTAN KERHTTTSP FGSANKPES +G+M SLVPTV+ +
Sbjct: 1141 SSKQMPTPIDSIATSNVDNQKTANVKERHTTTSPFFGSANKPESPFVGSMPSLVPTVDGS 1200
Query: 1289 RKTEEKRSLTTISPSVPASAQLNTPSS-STLFLGFAVSKPLPSSAAVIDLNQPVSTSTQL 1348
RKTEEK+S+TTIS SV A A LNT SS STLF GFAVSK LPSSAAVIDLNQP STSTQL
Sbjct: 1201 RKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKALPSSAAVIDLNQPPSTSTQL 1260
Query: 1349 NFSTPVVSVSDSLFQAPKMISTSSTL-SLNPSLESSKKELPVSKSDDDTEKQTPASKPES 1408
NFS+PVVS S+SLFQAPK++ TS TL SLNP+LESSK EL V KS+DD E+Q +SKP S
Sbjct: 1261 NFSSPVVSSSNSLFQAPKIVPTSPTLSSLNPTLESSKTELSVPKSNDDAEEQILSSKPGS 1320
Query: 1409 YELKFQPSVTP-DKKHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSVAFAPLPSPNLTP 1468
+ELKFQPS+TP DK HVEPTSKT TV KDVGGQ NV+G+AQPQQPSVAFA +PSPNLT
Sbjct: 1321 HELKFQPSITPADKNHVEPTSKTQTVFKDVGGQDSNVVGNAQPQQPSVAFASIPSPNLTS 1380
Query: 1469 KIFGNVRNETSNVTATQDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPMSGAPKPNPFGG 1528
KIF N RNETSN TQDDDMDEEAPETNNN+EF+LSSLGGFGNSSTP+SG PKPNPFGG
Sbjct: 1381 KIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPISGGPKPNPFGG 1440
Query: 1529 PFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGGFGSAMAT 1588
PFGNVNA SMTSSF MASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSG FGSA+ T
Sbjct: 1441 PFGNVNAASMTSSFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGAFGSAVPT 1500
Query: 1589 QAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSPGGFGGGFTSMKPVG 1648
Q PSQGGFGQP+QIGVGQQALGNVLGSFGQSRQLGP++ GTGSGSPGGF GGFT+ KPV
Sbjct: 1501 QPPSQGGFGQPSQIGVGQQALGNVLGSFGQSRQLGPTVHGTGSGSPGGFSGGFTNAKPV- 1560
Query: 1649 GFASVGSSGGGSGGFAGVGSGGGGGFGGVGSNGGGFAGTVPTGGGFAGASSTTGGFAGAA 1695
G GGFAGVGSGGGGGFGGV GGFAG TGGGFAGASST GGFAGAA
Sbjct: 1561 ----------GVGGFAGVGSGGGGGFGGV----GGFAGAASTGGGFAGASSTAGGFAGAA 1620
BLAST of CcUC11G220380 vs. NCBI nr
Match:
XP_031741374.1 (nuclear pore complex protein NUP214 isoform X1 [Cucumis sativus] >KGN52214.2 hypothetical protein Csa_008316 [Cucumis sativus])
HSP 1 Score: 2406.3 bits (6235), Expect = 0.0e+00
Identity = 1358/1683 (80.69%), Postives = 1441/1683 (85.62%), Query Frame = 0
Query: 89 MASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSE 148
MASVDS PS+LIPLEDAGEGEQIVRND YFQKIGKPVPVKL DSIFDP++PPSQPLALSE
Sbjct: 1 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 60
Query: 149 SFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDS 208
S GLIFVAHLSG+ KDVIASAEEIKNGGTGSSVQDLSIVD+SIGKVHIL +STD+S
Sbjct: 61 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 120
Query: 209 ILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLY 268
+LAA+VAGD+H+FSVQSLLDKA+ P SSCS+TDSSFIKDFKWTRKLE++YLVLSKHGQLY
Sbjct: 121 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180
Query: 269 QGSANGPPTHVMHDIDA------------------------------------------- 328
QGSANGP THVMHDIDA
Sbjct: 181 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD 240
Query: 329 ------VDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 388
VDCIKWVRADCIIIGCFQVTATGDEEDY V VIRSKDGKITDVSSNKVLLSF D
Sbjct: 241 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 300
Query: 389 IHSGFTRDILPGDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINID 448
IHSGFTRDILPG+ GPCLLLSYLD CKLAIVANRLY+E+HI LLGLL EVENEVAV+NID
Sbjct: 301 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360
Query: 449 RNTSLPKIELQANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLEGEL 508
RNTSLPKIELQANGDDNLVMGLCIDRVSL GKV+V+VGFEDMREVSPYCILVCLTLEGEL
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420
Query: 509 IMFQFSSVNETEAPHETVSACDEEEDDIIVPADDRSQLFSGSKKEFRED--DLKMQVTEK 568
IMFQFSSVNETEAPHETVSACD+EEDDI VP DDRS+ KE RE D +MQVTEK
Sbjct: 421 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSE-----SKESREANIDHRMQVTEK 480
Query: 569 LAISSEIPQEKIKISNDIKSSNNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLESS 628
+AISSEIP+EK K SNDIKSS NDQS V IDESA V E NTKSQK DSFIYSQSL+SS
Sbjct: 481 IAISSEIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSS 540
Query: 629 VLER-PNYEIGNFDKPVQKF-GLGSVSISGKSADVHSQPFPNVKESTKRMVSTGLLAASE 688
ER P+YEIGNFDKPV KF GLGS SISGKS DV SQPFPNVKESTKR+ STGL+AASE
Sbjct: 541 APERPPHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASE 600
Query: 689 LSSDKAMFLNKIDPISSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLT 748
LSS+KAM KIDP+ SV T NS QSS TENYGPSFGTANAF GF+GKPFQPKDVPSTLT
Sbjct: 601 LSSEKAMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLT 660
Query: 749 QSGKQVTGGAGKIESLPVLRSSQISLQDNLSA-KISNEKHDGSDRNYSNAPLAKPMKEMC 808
QSG+Q TGGAGKIESLPV+RSSQISLQD S+ KISNEKHDGS+R YSN+PLAKPMKEMC
Sbjct: 661 QSGRQATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMC 720
Query: 809 EGLDMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERSQEVQNLF 868
EGLD LLESIEE GGF+DACTAFQKSSVEALE GLASLSD CQIW+STMNERSQEVQNLF
Sbjct: 721 EGLDTLLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLF 780
Query: 869 DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELE 928
DKMVQVLSKKTYIEGIVMQ+SDSKYWEQWDRQKLSSELELKRQHILKMNQN+TNQLIELE
Sbjct: 781 DKMVQVLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELE 840
Query: 929 RHFNGLELNKFGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSK 988
RHFNGLELNKFGGNEESQ SERALQRKFG SRHSHS+HSLNNIMGSQLA AQLLSESLSK
Sbjct: 841 RHFNGLELNKFGGNEESQVSERALQRKFGSSRHSHSVHSLNNIMGSQLATAQLLSESLSK 900
Query: 989 QLAALNIESPSLKRQRVTKELFETIGLTYDASFGSPNVNKIAETSSKKLLLSADSFSSKD 1048
QLAALN+ESPSLKRQ TKELFE+IGLTYDASF SPNVNKIAETSSKKLLLS+DSFSSK
Sbjct: 901 QLAALNMESPSLKRQSATKELFESIGLTYDASFSSPNVNKIAETSSKKLLLSSDSFSSKG 960
Query: 1049 TLRRKQRSGRKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSRTP 1108
T RRKQ+SG KNSEAETGRRRRDSLDRNLASV+PPKTTVKRMLLQGIPSS+EK F SRTP
Sbjct: 961 TSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGIPSSEEKQFCSRTP 1020
Query: 1109 EGAATVAGPASRLTSSKSSSS------SKNAATPFMWASPLQPSNTSRQKSQPLPKTNTT 1168
EGAATVA PASR+TSS SSSS S+N TPFMW SPLQPSNTSRQKS PL K N T
Sbjct: 1021 EGAATVARPASRITSSISSSSKNAGHDSENPETPFMWNSPLQPSNTSRQKSLPLQKINVT 1080
Query: 1169 APSPLSVFQSSHEMLKKSNNEAFNVTSENKFI-----EKSKASDFFSVTRSDSVQKSNIN 1228
PSP VFQSSH+MLKK NNEA +VTSENKF EKSKASDFFS TRSDSVQKSNIN
Sbjct: 1081 PPSPPPVFQSSHDMLKKKNNEAHSVTSENKFTDVACPEKSKASDFFSATRSDSVQKSNIN 1140
Query: 1229 LDQKSSIFTISSKQTPTLKDSINTSNLDNQKTANAKERHTTTSPLFGSANKPESASLGTM 1288
+DQKSSIFTISSKQ PT DSI TSN+DNQKTAN KERHTTTSP FGSANKPES +G+M
Sbjct: 1141 VDQKSSIFTISSKQMPTPIDSIATSNVDNQKTANVKERHTTTSPFFGSANKPESPFVGSM 1200
Query: 1289 SSLVPTVNEARKTEEKRSLTTISPSVPASAQLNTPSS-STLFLGFAVSKPLPSSAAVIDL 1348
SLVPTV+ +RKTEEK+S+TTIS SV A A LNT SS STLF GFAVSK LPSSAAVIDL
Sbjct: 1201 PSLVPTVDGSRKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKALPSSAAVIDL 1260
Query: 1349 NQPVSTSTQLNFSTPVVSVSDSLFQAPKMISTSSTL-SLNPSLESSKKELPVSKSDDDTE 1408
NQP STSTQLNFS+PVVS S+SLFQAPK++ TS TL SLNP+LESSK EL V KS+DD E
Sbjct: 1261 NQPPSTSTQLNFSSPVVSSSNSLFQAPKIVPTSPTLSSLNPTLESSKTELSVPKSNDDAE 1320
Query: 1409 KQTPASKPESYELKFQPSVTP-DKKHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSVAF 1468
+Q +SKP S+ELKFQPS+TP DK HVEPTSKT TV KDVGGQ NV+G+AQPQQPSVAF
Sbjct: 1321 EQILSSKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQDSNVVGNAQPQQPSVAF 1380
Query: 1469 APLPSPNLTPKIFGNVRNETSNVTATQDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPMS 1528
A +PSPNLT KIF N RNETSN TQDDDMDEEAPETNNN+EF+LSSLGGFGNSSTP+S
Sbjct: 1381 ASIPSPNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPIS 1440
Query: 1529 GAPKPNPFGGPFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAF 1588
G PKPNPFGGPFGNVNA SMTSSF MASPPSGELFRPASFSFQSPLASQAASQPTNSVAF
Sbjct: 1441 GGPKPNPFGGPFGNVNAASMTSSFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVAF 1500
Query: 1589 SGGFGSAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSPGGFG 1648
SG FGSA+ TQ PSQGGFGQP+QIGVGQQALGNVLGSFGQSRQLGP++ GTGSGSPGGF
Sbjct: 1501 SGAFGSAVPTQPPSQGGFGQPSQIGVGQQALGNVLGSFGQSRQLGPTVHGTGSGSPGGFS 1560
Query: 1649 GGFTSMKPVGGFASVGSSGGGSGGFAGVGSGGGGGFGGVGSNGGGFAGTVPTGGGFAGAS 1695
GGFT+ KPV G GGFAGVGSGGGGGFGGV GGFAG TGGGFAGAS
Sbjct: 1561 GGFTNAKPV-----------GVGGFAGVGSGGGGGFGGV----GGFAGAASTGGGFAGAS 1620
BLAST of CcUC11G220380 vs. NCBI nr
Match:
KAA0034115.1 (nuclear pore complex protein NUP214 [Cucumis melo var. makuwa])
HSP 1 Score: 2399.4 bits (6217), Expect = 0.0e+00
Identity = 1362/1707 (79.79%), Postives = 1448/1707 (84.83%), Query Frame = 0
Query: 89 MASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSE 148
MASVDS S LIPLEDAGEGEQIVRNDFYFQKIGKPVPVKL DSIFDP++PPSQP+ALSE
Sbjct: 1 MASVDSGSSPLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLGDSIFDPESPPSQPIALSE 60
Query: 149 SFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDS 208
S GLIFVAHLSG+ KDVIASA+EIKNGGT SSVQDLSIVD+SIGKVHIL +STD+S
Sbjct: 61 SSGLIFVAHLSGFFVVRIKDVIASAQEIKNGGTCSSVQDLSIVDVSIGKVHILAVSTDNS 120
Query: 209 ILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLY 268
+LAA+VAGD+H+FSVQSLLDKA+ P SSCS+TDSSFIKDFKWTRKLE++YLVLSKHGQLY
Sbjct: 121 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180
Query: 269 QGSANGPPTHVMHDIDA------------------------------------------- 328
QGS NGP THVMHDIDA
Sbjct: 181 QGSVNGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHRFKERLSMSLLPSLGNGETD 240
Query: 329 ------VDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 388
VDCIKWVRADCIIIGCFQVTATGDEEDY VLVI+SKDGKITDVSSNKVLLSF D
Sbjct: 241 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVLVIKSKDGKITDVSSNKVLLSFCD 300
Query: 389 IHSGFTRDILPGDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINID 448
IHSGFTRDILPG+ GPCLLLSYLD CKLAIVANRLY+E+HI LLGLL EVENEVAV+NID
Sbjct: 301 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360
Query: 449 RNTSLPKIELQANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLEGEL 508
RNTSLPKIELQANGDDNLVMGLC+DRVSLPGKV+V+VGFEDMREVSPYCILVCLTLEGEL
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCVDRVSLPGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420
Query: 509 IMFQFSSVNETEAPHETVSACDEEEDDIIVPADDRSQLFSGSKKEFRED--DLKMQVTEK 568
IMFQFSSVNETEAPHETVSACD+EEDDI VP DDRS+ SKKE RE DLKMQVTEK
Sbjct: 421 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSE----SKKESREANVDLKMQVTEK 480
Query: 569 LAISSEIPQEKIKISNDIKSSNNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLESS 628
+ ISSEIP+EK+K SNDIKSSNND+SPVS IDESA V E NTKSQK DSFI+SQSL+SS
Sbjct: 481 ITISSEIPREKVKTSNDIKSSNNDRSPVSNIDESAIVSPEGNTKSQKVDSFIHSQSLKSS 540
Query: 629 VLER-PNYEIGNFDKPVQKF-GLGSVSISGKSADVHSQPFPNVKESTKRMVSTGLLAASE 688
ER PN EIGNFDKPV KF GLGSVSISGK DV SQPFPNVKES KR+ STGL+AASE
Sbjct: 541 APERPPNNEIGNFDKPVLKFTGLGSVSISGKPEDVPSQPFPNVKESQKRLGSTGLVAASE 600
Query: 689 LSSDKAMFLNKIDPISSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLT 748
LSS+K MF KIDP+SSVLT NS QSS TENYGPSFGTANAF GF+GKPFQPKDVPSTLT
Sbjct: 601 LSSEKTMFFKKIDPVSSVLTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLT 660
Query: 749 QSGKQVTGGAGKIESLPVLRSSQISLQDNLSA-KISNEKHDGSDRNYSNAPLAKPMKEMC 808
QSG+QVTGGAGKIESLPV+RSSQISLQD S+ KISNEKHDGS+R YSN+PLAKPMKEMC
Sbjct: 661 QSGRQVTGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMC 720
Query: 809 EGLDMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERSQEVQNLF 868
EGLD LLESIEE GGF+DACTAFQKSSVEALE GLASLSDECQIW+STMNER QEVQNLF
Sbjct: 721 EGLDTLLESIEESGGFMDACTAFQKSSVEALELGLASLSDECQIWRSTMNERVQEVQNLF 780
Query: 869 DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMN----------- 928
DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMN
Sbjct: 781 DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQVGLFLFQKPL 840
Query: 929 --------------QNMTNQLIELERHFNGLELNKFGGNEESQASERALQRKFGYSRHSH 988
QN+TNQLIELERHFNGLELNKFGGNEESQ SERALQRKFG SRHSH
Sbjct: 841 NFSNFRCYLYSSFFQNITNQLIELERHFNGLELNKFGGNEESQVSERALQRKFGSSRHSH 900
Query: 989 SLHSLNNIMGSQLAAAQLLSESLSKQLAALNIESPSLKRQRVTKELFETIGLTYDASFGS 1048
SLHSLNNIMGSQLA AQLLSESLSKQLAALN+ESP LKRQ TKELFETIGLTYDASF S
Sbjct: 901 SLHSLNNIMGSQLATAQLLSESLSKQLAALNMESPPLKRQSATKELFETIGLTYDASFSS 960
Query: 1049 PNVNKIAETSSKKLLLSADSFSSKDTLRRKQRSGRKNSEAETGRRRRDSLDRNLASVEPP 1108
PNVNKIA+TSSKKLLLS+DSFSSK T RRKQ+SG KNSEAETGRRRRDSLDRNLASV+PP
Sbjct: 961 PNVNKIADTSSKKLLLSSDSFSSKGTSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPP 1020
Query: 1109 KTTVKRMLLQGIPSSDEKLFRSRTPEGAATVAGPASRLTSSKSSSS------SKNAATPF 1168
KTTVKRMLLQG PSS+EK FRSRTPEGAATV PASR+TSS SSSS S+N ATPF
Sbjct: 1021 KTTVKRMLLQGTPSSEEKQFRSRTPEGAATVERPASRITSSISSSSKNAGHDSENPATPF 1080
Query: 1169 MWASPLQPSNTSRQKSQPLPKTNTTAPSPLSVFQSSHEMLKKSNNEAFNVTSENKFI--- 1228
MWAS LQPSNTSRQKS PL KTN TAPSP VFQSSH+MLKK+NN A + TSENKF
Sbjct: 1081 MWASVLQPSNTSRQKSLPLQKTNATAPSPPPVFQSSHDMLKKNNNAAHSATSENKFTDMA 1140
Query: 1229 --EKSKASDFFSVTRSDSVQKSNINLDQKSSIFTISSKQTPTLKDSINTSNLDNQKTANA 1288
EKSKASDFFS TRSDSVQKS IN+DQKSSIFTISSKQTP +DSI TSN+DNQKTAN
Sbjct: 1141 CPEKSKASDFFSATRSDSVQKSKINVDQKSSIFTISSKQTPPPEDSIGTSNVDNQKTANV 1200
Query: 1289 KERHTTTSPLFGSANKPESASLGTMSSLVPTVNEARKTEEKRSLTTISPSVPASAQLNTP 1348
KERHTTTS LFGSANKPES +GTM SLVPTV+ ARKTEEK+S+TTIS SV A A LNT
Sbjct: 1201 KERHTTTSQLFGSANKPESPFVGTMPSLVPTVDGARKTEEKKSVTTISQSVSAPAPLNTS 1260
Query: 1349 SS-STLFLGFAVSKPLPSS---AAVIDLNQPVSTSTQLNFSTPVVSVSDSLFQAPKMIST 1408
SS STLF GFAVSK LPSS AAV+DLNQP STSTQLNFS PVVS S+SLFQAPK+ ++
Sbjct: 1261 SSASTLFSGFAVSKSLPSSAAVAAVVDLNQPQSTSTQLNFS-PVVSGSNSLFQAPKVPTS 1320
Query: 1409 SSTLSLNPSLESSKKELPVSKSDDDTEKQTPASKPESYELKFQPSVTP-DKKHVEPTSKT 1468
+ SLNP++ESSK EL V KS+DD EKQT +SKP S+ELKFQPS+TP DK HVEPTSKT
Sbjct: 1321 PTLSSLNPTMESSKTELSVLKSNDDAEKQTLSSKPGSHELKFQPSITPADKNHVEPTSKT 1380
Query: 1469 HTVSKDVGGQVPNVIGDAQPQQPSVAFAPLPSPNLTPKIFGNVRNETSNVTATQDDDMDE 1528
TV KDVGGQVPNV+GDAQ QQPSVAFA +PS NLT KIF N RNETSN TQDDDMDE
Sbjct: 1381 QTVFKDVGGQVPNVVGDAQAQQPSVAFASIPSQNLTSKIFANSRNETSNAVVTQDDDMDE 1440
Query: 1529 EAPETNNNIEFSLSSLGGFGNSSTPMSGAPKPNPFGGPFGNVNATSMTSSFTMASPPSGE 1588
EAPETNNN+EF+LSSLGGFGNSSTP+SGAPKPNPFGGPFGNVNA S+T+SF MASPPSGE
Sbjct: 1441 EAPETNNNVEFNLSSLGGFGNSSTPISGAPKPNPFGGPFGNVNAASVTTSFNMASPPSGE 1500
Query: 1589 LFRPASFSFQSPLASQAASQPTNSVAFSGGFGSAMATQAPSQGGFGQPAQIGVGQQALGN 1648
LFRPASFSFQSPLASQAASQPTNSVAFSG FGSA+ATQAP QGGFGQPAQIGVGQQALGN
Sbjct: 1501 LFRPASFSFQSPLASQAASQPTNSVAFSGAFGSAVATQAPPQGGFGQPAQIGVGQQALGN 1560
Query: 1649 VLGSFGQSRQLGPSLPGTGSGSPGGFGGGFTSMKPVGGFASVGSSGGGSGGFAGVGSGGG 1695
VLGSFGQSRQLGP+LPGTGSGSPGGF GGFT+ KPV G GGFAGVGSGGG
Sbjct: 1561 VLGSFGQSRQLGPTLPGTGSGSPGGFSGGFTNAKPV-----------GVGGFAGVGSGGG 1620
BLAST of CcUC11G220380 vs. ExPASy Swiss-Prot
Match:
F4I1T7 (Nuclear pore complex protein NUP214 OS=Arabidopsis thaliana OX=3702 GN=NUP214 PE=1 SV=1)
HSP 1 Score: 788.5 bits (2035), Expect = 1.5e-226
Identity = 691/1852 (37.31%), Postives = 939/1852 (50.70%), Query Frame = 0
Query: 100 IPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSESFGLIFVAHLS 159
+ +E+ EG++I ND+YF++IG+P+ +K D+ +D + PPSQPLA+SE ++FVAH S
Sbjct: 4 VEIEEDTEGDRISTNDYYFERIGEPISIKEDDAQYDLENPPSQPLAISERHAVLFVAHSS 63
Query: 160 GW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDSILAAIVAGDIH 219
G+ T DVI++++ G +QDLS+VD+ +G V IL+LS DDSILA VA DIH
Sbjct: 64 GFFVGRTNDVISASKNSNGNGDKVFIQDLSLVDVPVGDVRILSLSADDSILAVTVAADIH 123
Query: 220 LFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLYQGSANGPPTHV 279
FSV SLL K PS S S +S F+KDF+W R + SYLVLS G+L+ G N PP HV
Sbjct: 124 FFSVDSLLKKDAKPSFSYSPDESGFVKDFRWRRNDKHSYLVLSNTGKLFHGIDNAPPRHV 183
Query: 280 MHDIDA-------------------------------------------------VDCIK 339
M +DA VD I+
Sbjct: 184 MDAVDAVEWSSKGSYIAVAQDNSLRIFSSKFNEKRCIALSFDSWIGDSDEDCFVKVDSIR 243
Query: 340 WVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRDIHSGFTRDILP 399
WVR +CI++GCFQ+ G EE+Y V VIRS DGKI+D S+N V LSF D+ D++P
Sbjct: 244 WVRNNCILLGCFQL-IEGREENYLVQVIRSPDGKISDGSTNLVALSFSDLFPCSMDDLVP 303
Query: 400 GDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQ-EVENEVAVINIDRNTSLPKIEL 459
+GP LL SY+D+CKLA+ ANR ++EHIVLL + ++ V+V++IDR T LP+I L
Sbjct: 304 VGVGPHLLFSYIDQCKLAVTANRKSIDEHIVLLDWSSGDDKSAVSVVDIDRETFLPRIGL 363
Query: 460 QANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLEGELIMFQFSSVNE 519
Q N DDN VMGLCIDRVS+ G V VR G ++++E+ PY +LVCLTLEG+L+MF +SV
Sbjct: 364 QENNDDNTVMGLCIDRVSIEGTVNVRSGDDELKELQPYFVLVCLTLEGKLVMFNVASVAG 423
Query: 520 TEAPHETVSACDEEEDDIIVP--ADDRSQLFSGSKKEFR---EDDLKMQVTEKLAISSEI 579
A +T A + +D P DD S+ S ++ ++D K TEK + +
Sbjct: 424 RPASSDTDLASSSDIEDAYTPLIEDDLSKQSSEKHQQLNIAVQNDQKHLNTEKFSTEQRL 483
Query: 580 PQEKI--KISNDIKSS-----NNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLESS 639
P E I K +KSS N Q P ++ +S SF L S
Sbjct: 484 PNENIFSKEFESVKSSVSGDNNKKQEPYAEKPLQVEDAQQSMIPRLSGTSF---GQLPMS 543
Query: 640 VLERPNYEIGNFDKPVQKFGLGSVSISGKSADVHSQPFPNVKESTKRMVSTGLLAA---- 699
+ N + F + I +S +H Q K + S GL A
Sbjct: 544 LGYDTN-KFAGFGPALPVSEKLQKDIFAQSNSMHLQANVESKSTAAFFGSPGLQNAILQS 603
Query: 700 -SELSSDKAMFLNKIDPISSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPS 759
SS + P V P F S + + S + G+ P KD
Sbjct: 604 PQNTSSQPWSSGKSVSPPDFVSGP--FPSMRDTQHKQS---VQSGTGYVNPPMSIKDKSV 663
Query: 760 TLTQSGK---------------QVTGGAGKIESLPVLRSSQISLQ--DNLSAKISNEKHD 819
+ ++G+ G KIE +P +R+SQ+S Q + S+++H
Sbjct: 664 QVIETGRVSALSNLSPLLGQNQDTNEGVEKIEPIPSIRASQLSQQVKSSFEKSASHQQHK 723
Query: 820 GS--------DRNYSNAPLAKPMKEMCEGLDMLLESIEEPGGFLDACTAFQKSSVEALER 879
+ N SN P + EM +D LL+SIE PGGF D+C KS+VE LE+
Sbjct: 724 TPLSTGPLRLEHNMSNQP--SNINEMAREMDTLLQSIEGPGGFKDSCAFILKSNVEELEQ 783
Query: 880 GLASLSDECQIWKSTMNERSQEVQNLFDKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQK 939
GL SL+ +CQ WKST++E+ E+Q+L DK +QVL+KKTY+EG+ Q +D++YW+ W+RQK
Sbjct: 784 GLESLAGKCQTWKSTIHEQQAEIQHLLDKTIQVLAKKTYMEGMYKQTADNQYWQLWNRQK 843
Query: 940 LSSELELKRQHILKMNQNMTNQLIELERHFNGLELNKFGGNEESQASERALQRKFGYSRH 999
L+ ELE KRQHI+K+N+++T+QLIELER+FN LEL+++ + + R + + SR
Sbjct: 844 LNPELEAKRQHIMKLNKDLTHQLIELERYFNRLELDRYNEDGGHPVARRGVPNRSAPSRR 903
Query: 1000 SHSLHSLNNIMGSQLAAAQLLSESLSKQLAALNIESPSLKRQRVTKELFETIGLTYDASF 1059
SLHSL+N M SQLAAA+ LSE LSKQ+ L I+SP ++ V +ELFETIG+ YDASF
Sbjct: 904 VQSLHSLHNTMSSQLAAAEQLSECLSKQMTYLKIDSP--VKKNVKQELFETIGIPYDASF 963
Query: 1060 GSPNVNKIAETSS-KKLLLSADSFSSKDTLRRKQRSGRKNSEAETGRRRRDSLDR---NL 1119
SP+ K SS K LLLS+ S R++Q S KNS+ ET RRRR+SLDR N
Sbjct: 964 SSPDAVKAKNASSAKNLLLSSIPASINQQSRQRQSSAMKNSDPETARRRRESLDRVIFNW 1023
Query: 1120 ASVEPPKTTVKRMLLQ---------------GIPSSDEKLFRS--RTPEGAATVAGPASR 1179
A+ EPPKTTVKRMLLQ + S++ RS + A+ V
Sbjct: 1024 AAFEPPKTTVKRMLLQEQQKTGMNQQTVLSERLRSANNTQDRSLLHVKDHASPVVSSNKG 1083
Query: 1180 LTSSKSSSSSKNAATPFMWASPLQPSN-----------------TSRQKSQPLPKTNTTA 1239
+ S +S+ +TPF P+ SN + + S +A
Sbjct: 1084 IMESFQQDTSEAQSTPFKTRPPMPQSNSPFTISPISASKPSFNWSGNKSSNTTSYAEESA 1143
Query: 1240 PSPL----SVFQSS---------------HEMLKKSNNEAFNVTSENKFIE--------- 1299
PS + +V Q + KK+ F+ N F+E
Sbjct: 1144 PSQIKDTRTVSQPGGSSFLPKRPVASTVLEQTEKKAGEFKFSEAKANAFVETAAGSVQRL 1203
Query: 1300 --KSKASDFFS--------------VTRSDSVQKSNINLDQKSSI----FTISSKQTP-- 1359
S SDF S S KS + SSI FT + P
Sbjct: 1204 STTSSGSDFESSKGFGAQFSTMSSGAPASSFSSKSLFGFNSSSSIPGDKFTFPAVTAPLS 1263
Query: 1360 -TLKDSINT-SNLDNQKTANAKERHTTTSPLFGSANKPESASLGTMSSLVPT-------- 1419
T DS +T + +++ + S SA P++ S+ + S++ T
Sbjct: 1264 GTPLDSTSTLFTASSAPVSSSSQDPVPASIPISSAPVPQTFSVTSTSTVSATGFNVPFGK 1323
Query: 1420 --------VNEARKTEEKRS----------LTTISPSVPASAQLNTPSSSTLFLGFA--- 1479
+N+A + S L +SPS P +T SS LF A
Sbjct: 1324 PLTSVKVDLNQAAPSTPSPSPGPTAGFTFNLPALSPSSPEMVSSSTGQSS-LFPPSAPTS 1383
Query: 1480 -VSKPLPSSAAVIDLNQPVSTSTQLNFSTPVVSVSDSLFQAPKMISTSSTLSLNPSLESS 1539
VS S+ + + + + +ST L+ STP ++ D+ FQ+P++ + SS + + +
Sbjct: 1384 QVSSDQASATSSLTDSSRLFSSTSLS-STPPITPPDA-FQSPQVSTPSSAVPITEPVSEP 1443
Query: 1540 KKELPVSKSDDDTEKQTP----ASKPESYELKFQPSVTPDKKHVEPTSKTHTVSKDVGGQ 1599
KK S S T+ A+K ++ L + ++ V P S + +S G
Sbjct: 1444 KKPEAQSSSILSTQSTVDSVANATKTQNEPLPVKSEISNPGTTVTPVSSSGFLSGFSSGT 1503
Query: 1600 VPNVI----------GDAQPQQPSVAFAPLPSPNLTPKIFGNVRNETSNVTATQDDDMDE 1659
++ G +QPQQ S AP P+ + T + E ++ TQ+D+MDE
Sbjct: 1504 QSSLASMAAPSFSWPGSSQPQQLSSTPAPFPASSPTS---ASPFGEKKDIVDTQEDEMDE 1563
Query: 1660 EAPETNNNIEFSLSSLGGFGNSSTPMSGAPKPNPFGGPFGNVNATSMTSSFTMASPPSGE 1695
EAPE + E S+ S GGFG STP GAPK NPFGGPFG NAT+ TS+ + PSGE
Sbjct: 1564 EAPEASQTTELSMGSFGGFGLGSTPNPGAPKTNPFGGPFG--NATTTTSNPFNMTVPSGE 1623
BLAST of CcUC11G220380 vs. ExPASy TrEMBL
Match:
A0A5A7SY34 (Nuclear pore complex protein NUP214 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold65G001060 PE=4 SV=1)
HSP 1 Score: 2399.4 bits (6217), Expect = 0.0e+00
Identity = 1362/1707 (79.79%), Postives = 1448/1707 (84.83%), Query Frame = 0
Query: 89 MASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSE 148
MASVDS S LIPLEDAGEGEQIVRNDFYFQKIGKPVPVKL DSIFDP++PPSQP+ALSE
Sbjct: 1 MASVDSGSSPLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLGDSIFDPESPPSQPIALSE 60
Query: 149 SFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDS 208
S GLIFVAHLSG+ KDVIASA+EIKNGGT SSVQDLSIVD+SIGKVHIL +STD+S
Sbjct: 61 SSGLIFVAHLSGFFVVRIKDVIASAQEIKNGGTCSSVQDLSIVDVSIGKVHILAVSTDNS 120
Query: 209 ILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLY 268
+LAA+VAGD+H+FSVQSLLDKA+ P SSCS+TDSSFIKDFKWTRKLE++YLVLSKHGQLY
Sbjct: 121 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180
Query: 269 QGSANGPPTHVMHDIDA------------------------------------------- 328
QGS NGP THVMHDIDA
Sbjct: 181 QGSVNGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHRFKERLSMSLLPSLGNGETD 240
Query: 329 ------VDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 388
VDCIKWVRADCIIIGCFQVTATGDEEDY VLVI+SKDGKITDVSSNKVLLSF D
Sbjct: 241 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVLVIKSKDGKITDVSSNKVLLSFCD 300
Query: 389 IHSGFTRDILPGDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINID 448
IHSGFTRDILPG+ GPCLLLSYLD CKLAIVANRLY+E+HI LLGLL EVENEVAV+NID
Sbjct: 301 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360
Query: 449 RNTSLPKIELQANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLEGEL 508
RNTSLPKIELQANGDDNLVMGLC+DRVSLPGKV+V+VGFEDMREVSPYCILVCLTLEGEL
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCVDRVSLPGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420
Query: 509 IMFQFSSVNETEAPHETVSACDEEEDDIIVPADDRSQLFSGSKKEFRED--DLKMQVTEK 568
IMFQFSSVNETEAPHETVSACD+EEDDI VP DDRS+ SKKE RE DLKMQVTEK
Sbjct: 421 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSE----SKKESREANVDLKMQVTEK 480
Query: 569 LAISSEIPQEKIKISNDIKSSNNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLESS 628
+ ISSEIP+EK+K SNDIKSSNND+SPVS IDESA V E NTKSQK DSFI+SQSL+SS
Sbjct: 481 ITISSEIPREKVKTSNDIKSSNNDRSPVSNIDESAIVSPEGNTKSQKVDSFIHSQSLKSS 540
Query: 629 VLER-PNYEIGNFDKPVQKF-GLGSVSISGKSADVHSQPFPNVKESTKRMVSTGLLAASE 688
ER PN EIGNFDKPV KF GLGSVSISGK DV SQPFPNVKES KR+ STGL+AASE
Sbjct: 541 APERPPNNEIGNFDKPVLKFTGLGSVSISGKPEDVPSQPFPNVKESQKRLGSTGLVAASE 600
Query: 689 LSSDKAMFLNKIDPISSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLT 748
LSS+K MF KIDP+SSVLT NS QSS TENYGPSFGTANAF GF+GKPFQPKDVPSTLT
Sbjct: 601 LSSEKTMFFKKIDPVSSVLTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLT 660
Query: 749 QSGKQVTGGAGKIESLPVLRSSQISLQDNLSA-KISNEKHDGSDRNYSNAPLAKPMKEMC 808
QSG+QVTGGAGKIESLPV+RSSQISLQD S+ KISNEKHDGS+R YSN+PLAKPMKEMC
Sbjct: 661 QSGRQVTGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMC 720
Query: 809 EGLDMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERSQEVQNLF 868
EGLD LLESIEE GGF+DACTAFQKSSVEALE GLASLSDECQIW+STMNER QEVQNLF
Sbjct: 721 EGLDTLLESIEESGGFMDACTAFQKSSVEALELGLASLSDECQIWRSTMNERVQEVQNLF 780
Query: 869 DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMN----------- 928
DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMN
Sbjct: 781 DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQVGLFLFQKPL 840
Query: 929 --------------QNMTNQLIELERHFNGLELNKFGGNEESQASERALQRKFGYSRHSH 988
QN+TNQLIELERHFNGLELNKFGGNEESQ SERALQRKFG SRHSH
Sbjct: 841 NFSNFRCYLYSSFFQNITNQLIELERHFNGLELNKFGGNEESQVSERALQRKFGSSRHSH 900
Query: 989 SLHSLNNIMGSQLAAAQLLSESLSKQLAALNIESPSLKRQRVTKELFETIGLTYDASFGS 1048
SLHSLNNIMGSQLA AQLLSESLSKQLAALN+ESP LKRQ TKELFETIGLTYDASF S
Sbjct: 901 SLHSLNNIMGSQLATAQLLSESLSKQLAALNMESPPLKRQSATKELFETIGLTYDASFSS 960
Query: 1049 PNVNKIAETSSKKLLLSADSFSSKDTLRRKQRSGRKNSEAETGRRRRDSLDRNLASVEPP 1108
PNVNKIA+TSSKKLLLS+DSFSSK T RRKQ+SG KNSEAETGRRRRDSLDRNLASV+PP
Sbjct: 961 PNVNKIADTSSKKLLLSSDSFSSKGTSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPP 1020
Query: 1109 KTTVKRMLLQGIPSSDEKLFRSRTPEGAATVAGPASRLTSSKSSSS------SKNAATPF 1168
KTTVKRMLLQG PSS+EK FRSRTPEGAATV PASR+TSS SSSS S+N ATPF
Sbjct: 1021 KTTVKRMLLQGTPSSEEKQFRSRTPEGAATVERPASRITSSISSSSKNAGHDSENPATPF 1080
Query: 1169 MWASPLQPSNTSRQKSQPLPKTNTTAPSPLSVFQSSHEMLKKSNNEAFNVTSENKFI--- 1228
MWAS LQPSNTSRQKS PL KTN TAPSP VFQSSH+MLKK+NN A + TSENKF
Sbjct: 1081 MWASVLQPSNTSRQKSLPLQKTNATAPSPPPVFQSSHDMLKKNNNAAHSATSENKFTDMA 1140
Query: 1229 --EKSKASDFFSVTRSDSVQKSNINLDQKSSIFTISSKQTPTLKDSINTSNLDNQKTANA 1288
EKSKASDFFS TRSDSVQKS IN+DQKSSIFTISSKQTP +DSI TSN+DNQKTAN
Sbjct: 1141 CPEKSKASDFFSATRSDSVQKSKINVDQKSSIFTISSKQTPPPEDSIGTSNVDNQKTANV 1200
Query: 1289 KERHTTTSPLFGSANKPESASLGTMSSLVPTVNEARKTEEKRSLTTISPSVPASAQLNTP 1348
KERHTTTS LFGSANKPES +GTM SLVPTV+ ARKTEEK+S+TTIS SV A A LNT
Sbjct: 1201 KERHTTTSQLFGSANKPESPFVGTMPSLVPTVDGARKTEEKKSVTTISQSVSAPAPLNTS 1260
Query: 1349 SS-STLFLGFAVSKPLPSS---AAVIDLNQPVSTSTQLNFSTPVVSVSDSLFQAPKMIST 1408
SS STLF GFAVSK LPSS AAV+DLNQP STSTQLNFS PVVS S+SLFQAPK+ ++
Sbjct: 1261 SSASTLFSGFAVSKSLPSSAAVAAVVDLNQPQSTSTQLNFS-PVVSGSNSLFQAPKVPTS 1320
Query: 1409 SSTLSLNPSLESSKKELPVSKSDDDTEKQTPASKPESYELKFQPSVTP-DKKHVEPTSKT 1468
+ SLNP++ESSK EL V KS+DD EKQT +SKP S+ELKFQPS+TP DK HVEPTSKT
Sbjct: 1321 PTLSSLNPTMESSKTELSVLKSNDDAEKQTLSSKPGSHELKFQPSITPADKNHVEPTSKT 1380
Query: 1469 HTVSKDVGGQVPNVIGDAQPQQPSVAFAPLPSPNLTPKIFGNVRNETSNVTATQDDDMDE 1528
TV KDVGGQVPNV+GDAQ QQPSVAFA +PS NLT KIF N RNETSN TQDDDMDE
Sbjct: 1381 QTVFKDVGGQVPNVVGDAQAQQPSVAFASIPSQNLTSKIFANSRNETSNAVVTQDDDMDE 1440
Query: 1529 EAPETNNNIEFSLSSLGGFGNSSTPMSGAPKPNPFGGPFGNVNATSMTSSFTMASPPSGE 1588
EAPETNNN+EF+LSSLGGFGNSSTP+SGAPKPNPFGGPFGNVNA S+T+SF MASPPSGE
Sbjct: 1441 EAPETNNNVEFNLSSLGGFGNSSTPISGAPKPNPFGGPFGNVNAASVTTSFNMASPPSGE 1500
Query: 1589 LFRPASFSFQSPLASQAASQPTNSVAFSGGFGSAMATQAPSQGGFGQPAQIGVGQQALGN 1648
LFRPASFSFQSPLASQAASQPTNSVAFSG FGSA+ATQAP QGGFGQPAQIGVGQQALGN
Sbjct: 1501 LFRPASFSFQSPLASQAASQPTNSVAFSGAFGSAVATQAPPQGGFGQPAQIGVGQQALGN 1560
Query: 1649 VLGSFGQSRQLGPSLPGTGSGSPGGFGGGFTSMKPVGGFASVGSSGGGSGGFAGVGSGGG 1695
VLGSFGQSRQLGP+LPGTGSGSPGGF GGFT+ KPV G GGFAGVGSGGG
Sbjct: 1561 VLGSFGQSRQLGPTLPGTGSGSPGGFSGGFTNAKPV-----------GVGGFAGVGSGGG 1620
BLAST of CcUC11G220380 vs. ExPASy TrEMBL
Match:
A0A0A0KV45 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G583270 PE=4 SV=1)
HSP 1 Score: 2390.5 bits (6194), Expect = 0.0e+00
Identity = 1358/1724 (78.77%), Postives = 1441/1724 (83.58%), Query Frame = 0
Query: 89 MASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSE 148
MASVDS PS+LIPLEDAGEGEQIVRND YFQKIGKPVPVKL DSIFDP++PPSQPLALSE
Sbjct: 1 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 60
Query: 149 SFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDS 208
S GLIFVAHLSG+ KDVIASAEEIKNGGTGSSVQDLSIVD+SIGKVHIL +STD+S
Sbjct: 61 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 120
Query: 209 ILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLY 268
+LAA+VAGD+H+FSVQSLLDKA+ P SSCS+TDSSFIKDFKWTRKLE++YLVLSKHGQLY
Sbjct: 121 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180
Query: 269 QGSANGPPTHVMHDIDA------------------------------------------- 328
QGSANGP THVMHDIDA
Sbjct: 181 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD 240
Query: 329 ------VDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 388
VDCIKWVRADCIIIGCFQVTATGDEEDY V VIRSKDGKITDVSSNKVLLSF D
Sbjct: 241 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 300
Query: 389 IHSGFTRDILPGDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINID 448
IHSGFTRDILPG+ GPCLLLSYLD CKLAIVANRLY+E+HI LLGLL EVENEVAV+NID
Sbjct: 301 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360
Query: 449 RNTSLPKIELQANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLEGEL 508
RNTSLPKIELQANGDDNLVMGLCIDRVSL GKV+V+VGFEDMREVSPYCILVCLTLEGEL
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420
Query: 509 IMFQFSSVNETEAPHETVSACDEEEDDIIVPADDRSQLFSGSKKEFRED--DLKMQVTEK 568
IMFQFSSVNETEAPHETVSACD+EEDDI VP DDRS+ KE RE D +MQVTEK
Sbjct: 421 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSE-----SKESREANIDHRMQVTEK 480
Query: 569 LAISSEIPQEKIKISNDIKSSNNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLESS 628
+AISSEIP+EK K SNDIKSS NDQS V IDESA V E NTKSQK DSFIYSQSL+SS
Sbjct: 481 IAISSEIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSS 540
Query: 629 VLER-PNYEIGNFDKPVQKF-GLGSVSISGKSADVHSQPFPNVKESTKRMVSTGLLAASE 688
ER P+YEIGNFDKPV KF GLGS SISGKS DV SQPFPNVKESTKR+ STGL+AASE
Sbjct: 541 APERPPHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASE 600
Query: 689 LSSDKAMFLNKIDPISSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLT 748
LSS+KAM KIDP+ SV T NS QSS TENYGPSFGTANAF GF+GKPFQPKDVPSTLT
Sbjct: 601 LSSEKAMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLT 660
Query: 749 QSGKQVTGGAGKIESLPVLRSSQISLQDNLSA-KISNEKHDGSDRNYSNAPLAKPMKEMC 808
QSG+Q TGGAGKIESLPV+RSSQISLQD S+ KISNEKHDGS+R YSN+PLAKPMKEMC
Sbjct: 661 QSGRQATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMC 720
Query: 809 EGLDMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERSQEVQNLF 868
EGLD LLESIEE GGF+DACTAFQKSSVEALE GLASLSD CQIW+STMNERSQEVQNLF
Sbjct: 721 EGLDTLLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLF 780
Query: 869 DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELE 928
DKMVQVLSKKTYIEGIVMQ+SDSKYWEQWDRQKLSSELELKRQHILKMNQN+TNQLIELE
Sbjct: 781 DKMVQVLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELE 840
Query: 929 RHFNGLELNKFGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSK 988
RHFNGLELNKFGGNEESQ SERALQRKFG SRHSHS+HSLNNIMGSQLA AQLLSESLSK
Sbjct: 841 RHFNGLELNKFGGNEESQVSERALQRKFGSSRHSHSVHSLNNIMGSQLATAQLLSESLSK 900
Query: 989 QLAALNIESPSLKRQRVTKELFETIGLTYDASFGSPNVNKIAETSSKKLLLSADSFSSKD 1048
QLAALN+ESPSLKRQ TKELFE+IGLTYDASF SPNVNKIAETSSKKLLLS+DSFSSK
Sbjct: 901 QLAALNMESPSLKRQSATKELFESIGLTYDASFSSPNVNKIAETSSKKLLLSSDSFSSKG 960
Query: 1049 TLRRKQRSGRKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSRTP 1108
T RRKQ+SG KNSEAETGRRRRDSLDRNLASV+PPKTTVKRMLLQGIPSS+EK F SRTP
Sbjct: 961 TSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGIPSSEEKQFCSRTP 1020
Query: 1109 EGAATVAGPASRLTSSKSSSS------SKNAATPFMWASPLQPSNTSRQKSQPLPKTNTT 1168
EGAATVA PASR+TSS SSSS S+N TPFMW SPLQPSNTSRQKS PL K N T
Sbjct: 1021 EGAATVARPASRITSSISSSSKNAGHDSENPETPFMWNSPLQPSNTSRQKSLPLQKINVT 1080
Query: 1169 APSPLSVFQSSHEMLKKSNNEAFNVTSENKFI-----EKSKASDFFSVTRSDSVQKSNIN 1228
PSP VFQSSH+MLKK NNEA +VTSENKF EKSKASDFFS TRSDSVQKSNIN
Sbjct: 1081 PPSPPPVFQSSHDMLKKKNNEAHSVTSENKFTDVACPEKSKASDFFSATRSDSVQKSNIN 1140
Query: 1229 LDQKSSIFTISSKQTPTLKDSINTSNLDNQKTANAKERHTTTSPLFGSANKPESASLGTM 1288
+DQKSSIFTISSKQ PT DSI TSN+DNQKTAN KERHTTTSP FGSANKPES +G+M
Sbjct: 1141 VDQKSSIFTISSKQMPTPIDSIATSNVDNQKTANVKERHTTTSPFFGSANKPESPFVGSM 1200
Query: 1289 SSLVPTVNEARKTEEKRSLTTISPSVPASAQLNTPSS-STLFLGFAVSKPLPSSAAVIDL 1348
SLVPTV+ +RKTEEK+S+TTIS SV A A LNT SS STLF GFAVSK LPSSAAVIDL
Sbjct: 1201 PSLVPTVDGSRKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKALPSSAAVIDL 1260
Query: 1349 NQPVSTSTQLNFSTPVVSVSDSLFQAPKMISTSSTL-SLNPSLESSKKELPVSKSDDDTE 1408
NQP STSTQLNFS+PVVS S+SLFQAPK++ TS TL SLNP+LESSK EL V KS+DD E
Sbjct: 1261 NQPPSTSTQLNFSSPVVSSSNSLFQAPKIVPTSPTLSSLNPTLESSKTELSVPKSNDDAE 1320
Query: 1409 KQTPASKPESYELKFQPSVTP-DKKHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSVAF 1468
+Q +SKP S+ELKFQPS+TP DK HVEPTSKT TV KDVGGQ NV+G+AQPQQPSVAF
Sbjct: 1321 EQILSSKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQDSNVVGNAQPQQPSVAF 1380
Query: 1469 APLPSPNLTPKIFGNVRNETSNVTATQDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPMS 1528
A +PSPNLT KIF N RNETSN TQDDDMDEEAPETNNN+EF+LSSLGGFGNSSTP+S
Sbjct: 1381 ASIPSPNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPIS 1440
Query: 1529 GAPKPNPFGGPFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAF 1588
G PKPNPFGGPFGNVNA SMTSSF MASPPSGELFRPASFSFQSPLASQAASQPTNSVAF
Sbjct: 1441 GGPKPNPFGGPFGNVNAASMTSSFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVAF 1500
Query: 1589 SGGFGSAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSPGGFG 1648
SG FGSA+ TQ PSQGGFGQP+QIGVGQQALGNVLGSFGQSRQLGP++ GTGSGSPGGF
Sbjct: 1501 SGAFGSAVPTQPPSQGGFGQPSQIGVGQQALGNVLGSFGQSRQLGPTVHGTGSGSPGGFS 1560
Query: 1649 GGFTSMKPVGGFASVGSSGGGSGGFAGVGSGGGGGFGGVGSNGGGFAGTVPTGGGFAGAS 1695
GGFT+ KPV G GGFAGVGSGGGGGFGGV GGFAG TGGGFAGAS
Sbjct: 1561 GGFTNAKPV-----------GVGGFAGVGSGGGGGFGGV----GGFAGAASTGGGFAGAS 1620
BLAST of CcUC11G220380 vs. ExPASy TrEMBL
Match:
A0A1S3BDU8 (LOW QUALITY PROTEIN: nuclear pore complex protein NUP214 OS=Cucumis melo OX=3656 GN=LOC103488807 PE=4 SV=1)
HSP 1 Score: 2377.8 bits (6161), Expect = 0.0e+00
Identity = 1350/1681 (80.31%), Postives = 1436/1681 (85.43%), Query Frame = 0
Query: 89 MASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSE 148
MASVDS S LIPLEDAGEGEQIVRNDFYFQKIGKPVPVKL DSIFDP++PPSQP+ALSE
Sbjct: 1 MASVDSGSSPLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLGDSIFDPESPPSQPIALSE 60
Query: 149 SFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDS 208
S GLIFVAHLSG+ KDVIASA+EIKNGGT SSVQDLSIVD+SIGKVHIL +STD+S
Sbjct: 61 SSGLIFVAHLSGFFVVRIKDVIASAQEIKNGGTCSSVQDLSIVDVSIGKVHILAVSTDNS 120
Query: 209 ILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLY 268
+LAA+VAGD+H+FSVQSLLDKA+ P SSCS+TDSSFIKDFKWTRKLE++YLVLSKHGQLY
Sbjct: 121 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180
Query: 269 QGSANGPPTHVMHDIDA------------------------------------------- 328
QGS NGP THVMHDIDA
Sbjct: 181 QGSVNGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHRFKERLSMSLLPSLGNGETD 240
Query: 329 ------VDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 388
VDCIKWVRADCIIIGCFQVTATGDEEDY VLVI+SKDGKITDVSSNKVLLSF D
Sbjct: 241 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVLVIKSKDGKITDVSSNKVLLSFCD 300
Query: 389 IHSGFTRDILPGDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINID 448
IHSGFTRDILPG+ GPCLLLSYLD CKLAIVANRLY+E+HI LLGLL EVENEVAV+NID
Sbjct: 301 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360
Query: 449 RNTSLPKIELQANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLEGEL 508
RNTSLPKIELQANGDDNLVMGLC+DRVSLPGKV+V+VGFEDMREVSPYCILVCLTLEGEL
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCVDRVSLPGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420
Query: 509 IMFQFSSVNETEAPHETVSACDEEEDDIIVPADDRSQLFSGSKKEFRED--DLKMQVTEK 568
IMFQFSSVNETEAPHETVSACD+EEDDI VP DDRS+ SKKE RE DLKMQVTEK
Sbjct: 421 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSE----SKKESREANVDLKMQVTEK 480
Query: 569 LAISSEIPQEKIKISNDIKSSNNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLESS 628
+ ISSEIP+EK+K SNDIKSSNND+SPVS IDESA V E NTKSQK DSFI+SQSL+SS
Sbjct: 481 ITISSEIPREKVKTSNDIKSSNNDRSPVSNIDESAIVSPEGNTKSQKVDSFIHSQSLKSS 540
Query: 629 VLER-PNYEIGNFDKPVQKF-GLGSVSISGKSADVHSQPFPNVKESTKRMVSTGLLAASE 688
ER PN EIGNFDKPV KF GLGSVSISGK DV SQPFPNVKES KR+ STGL+AASE
Sbjct: 541 APERPPNNEIGNFDKPVLKFTGLGSVSISGKPEDVPSQPFPNVKESQKRLGSTGLVAASE 600
Query: 689 LSSDKAMFLNKIDPISSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLT 748
LSS+K MF K+ +SSVLT NS QSS TENYGPSFGTANAF GF+GKPFQPKDVPSTLT
Sbjct: 601 LSSEKTMFFKKL-IVSSVLTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLT 660
Query: 749 QSGKQVTGGAGKIESLPVLRSSQISLQDNLSA-KISNEKHDGSDRNYSNAPLAKPMKEMC 808
QSG+QVTGGAGKIESLPV+RSSQISLQD S+ KISNEKHDGS+R YSN+PLAKPMKEMC
Sbjct: 661 QSGRQVTGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMC 720
Query: 809 EGLDMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERSQEVQNLF 868
EGLD LLESIEE GGF+DACTAFQKSSVEALE GLASLSDECQIW+STMNER QEVQNLF
Sbjct: 721 EGLDTLLESIEESGGFMDACTAFQKSSVEALELGLASLSDECQIWRSTMNERVQEVQNLF 780
Query: 869 DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELE 928
DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQN+TNQLIELE
Sbjct: 781 DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELE 840
Query: 929 RHFNGLELNKFGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSK 988
RHFNGLELNKFGGNEESQ SERALQRKFG SRHSHSLHSLNNIMGSQLA AQLLSESLSK
Sbjct: 841 RHFNGLELNKFGGNEESQVSERALQRKFGSSRHSHSLHSLNNIMGSQLATAQLLSESLSK 900
Query: 989 QLAALNIESPSLKRQRVTKELFETIGLTYDASFGSPNVNKIAETSSKKLLLSADSFSSKD 1048
QLAALN+ESP LKRQ TKELFETIGLTYDASF SPNVNKIA+TSSKKLLLS+DSFSSK
Sbjct: 901 QLAALNMESPPLKRQSATKELFETIGLTYDASFSSPNVNKIADTSSKKLLLSSDSFSSKG 960
Query: 1049 TLRRKQRSGRKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSRTP 1108
T RRKQ+SG KNSEAETGRRRRDSLDRNLASV+PPKTTVKRMLLQG PSS+EK FRSRTP
Sbjct: 961 TSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGTPSSEEKQFRSRTP 1020
Query: 1109 EGAATVAGPASRLTSSKSSSS------SKNAATPFMWASPLQPSNTSRQKSQPLPKTNTT 1168
EGAATV PASR+TSS SSSS S+N ATPFMWAS LQPSNTSRQKS PL KTN T
Sbjct: 1021 EGAATVERPASRITSSISSSSKNAGHDSENPATPFMWASVLQPSNTSRQKSLPLQKTNAT 1080
Query: 1169 APSPLSVFQSSHEMLKKSNNEAF----NVTSENKFIEKSKASDFFSVTRSDSVQKSNINL 1228
APSP VFQSSH+MLKK + EKSKASDFFS TRSDSVQKS IN+
Sbjct: 1081 APSPPPVFQSSHDMLKKIIMQLTVRLQKTNLRTWHPEKSKASDFFSATRSDSVQKSKINV 1140
Query: 1229 DQKSSIFTISSKQTPTLKDSINTSNLDNQKTANAKERHTTTSPLFGSANKPESASLGTMS 1288
DQKSSIFTISSKQTP +DSI TSN+DNQKTAN KERHTTTS LFGSANKPES +GTM
Sbjct: 1141 DQKSSIFTISSKQTPPPEDSIGTSNVDNQKTANVKERHTTTSQLFGSANKPESPFVGTMP 1200
Query: 1289 SLVPTVNEARKTEEKRSLTTISPSVPASAQLNTPSS-STLFLGFAVSKPLPSS---AAVI 1348
SLVPTV+ ARKTEEK+S+TTIS SV A A LNT SS STLF GFAVSK LPSS AAV+
Sbjct: 1201 SLVPTVDGARKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKSLPSSAAVAAVV 1260
Query: 1349 DLNQPVSTSTQLNFSTPVVSVSDSLFQAPKMISTSSTLSLNPSLESSKKELPVSKSDDDT 1408
DLNQP STSTQLNFS PVVS S+SLFQAPK+ ++ + SLNP++ESSK EL V KS+DD
Sbjct: 1261 DLNQPQSTSTQLNFS-PVVSGSNSLFQAPKVPTSPTLSSLNPTMESSKTELSVLKSNDDA 1320
Query: 1409 EKQTPASKPESYELKFQPSVTP-DKKHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSVA 1468
EKQT +SKP S+ELKFQPS+TP DK HVEPTSKT TV KDVGGQVPNV+GDAQ QQPSVA
Sbjct: 1321 EKQTLSSKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQVPNVVGDAQAQQPSVA 1380
Query: 1469 FAPLPSPNLTPKIFGNVRNETSNVTATQDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPM 1528
FA +PS NLT KIF N RNETSN TQDDDMDEEAPETNNN+EF+LSSLGGFGNSSTP+
Sbjct: 1381 FASIPSQNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPI 1440
Query: 1529 SGAPKPNPFGGPFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVA 1588
SGAPKPNPFGGPFGNVNA S+T+SF MASPPSGELFRPASFSFQSPLASQAASQPTNSVA
Sbjct: 1441 SGAPKPNPFGGPFGNVNAASVTTSFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVA 1500
Query: 1589 FSGGFGSAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSPGGF 1648
FSG FGSA+ATQAP QGGFGQPAQIGVGQQALGNVLGSFGQSRQLGP+LPGTGSGSPGGF
Sbjct: 1501 FSGAFGSAVATQAPPQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPTLPGTGSGSPGGF 1560
Query: 1649 GGGFTSMKPVGGFASVGSSGGGSGGFAGVGSGGGGGFGGVGSNGGGFAGTVPTGGGFAGA 1695
GGFT+ KPV G GGFAGVGSGGGGGFGGV GGFAG TGGGFAGA
Sbjct: 1561 SGGFTNAKPV-----------GVGGFAGVGSGGGGGFGGV----GGFAGAASTGGGFAGA 1620
BLAST of CcUC11G220380 vs. ExPASy TrEMBL
Match:
A0A6J1CBF2 (nuclear pore complex protein NUP214 OS=Momordica charantia OX=3673 GN=LOC111010057 PE=4 SV=1)
HSP 1 Score: 2144.4 bits (5555), Expect = 0.0e+00
Identity = 1247/1713 (72.80%), Postives = 1360/1713 (79.39%), Query Frame = 0
Query: 86 LQFMASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLA 145
LQ S ST I E+A EGE + D+YF+KIG+PVPVKL DSIFD ++PPSQPLA
Sbjct: 4 LQDSTPSTSSTSTPIRFEEAEEGEHVESTDYYFEKIGEPVPVKLHDSIFDSESPPSQPLA 63
Query: 146 LSESFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLST 205
+SESFGLIFVAHLSG+ T+DVIASA+EIKNGGTGSSVQDLSI+D+S+G+VHIL LS
Sbjct: 64 VSESFGLIFVAHLSGFFVARTEDVIASAKEIKNGGTGSSVQDLSIMDVSVGRVHILALSA 123
Query: 206 DDSILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHG 265
D S +AA+VA DIHLFSV SLLDKA P SCS+TDSS IKDFKW RKLE SYLVLSKHG
Sbjct: 124 DSSTIAAVVAADIHLFSVHSLLDKAAKPFYSCSITDSSCIKDFKWIRKLESSYLVLSKHG 183
Query: 266 QLYQGSANGPPTHVMHDIDA---------------------------------------- 325
QLYQGSANG HVMHD DA
Sbjct: 184 QLYQGSANGTLKHVMHDTDAVECSVKGRFIAVAKKDTLTIFSSKFKERLSMSLLPSDADS 243
Query: 326 -----VDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRDI 385
VDCIKWVRADCII+GCF+VTA GDEE+YFV VIRSKDGKITDVSSN+VLLSF+ I
Sbjct: 244 NFIVKVDCIKWVRADCIILGCFEVTAIGDEENYFVQVIRSKDGKITDVSSNRVLLSFQYI 303
Query: 386 HSGFTRDILPGDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINIDR 445
H GFTRDILP GPCL SYL KCKLAIVANR ++HIVLLG L EVEN+VAVI+I+R
Sbjct: 304 HPGFTRDILPVGSGPCLFSSYLGKCKLAIVANRNNTDQHIVLLGWLPEVENQVAVIDIER 363
Query: 446 NTSLPKIELQANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLEGELI 505
+TSLP+IELQ NGDDNLVMGLCIDRVSLP KV ++VG EDMREVSPYCIL+CLTLEG+L+
Sbjct: 364 DTSLPRIELQENGDDNLVMGLCIDRVSLPAKVKIQVGVEDMREVSPYCILLCLTLEGKLV 423
Query: 506 MFQFSSVNETEAPHETVSAC-DEEEDDIIVPADDRSQLFSGSKKEFREDDL-KMQVTEKL 565
MF SS+NETE PHETVSAC DEEEDD IVP DD+ Q+ S S+KE RE + +M T+K+
Sbjct: 424 MFHLSSINETETPHETVSACEDEEEDDTIVPIDDQPQVSSESRKELREAMVGQMHDTDKI 483
Query: 566 AISSEIPQEKIKISNDIKSSNNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLESSV 625
SSEIP+EKI ISNDIK S+ DQSPVS ID+SA V ESN+KS+K SFIYSQ L+SS+
Sbjct: 484 TTSSEIPEEKINISNDIKPSDIDQSPVSYIDKSAIVSRESNSKSEKVGSFIYSQPLKSSI 543
Query: 626 LERPNYEIGNFDKPVQKF-GLGSVSISGKSADVHSQPFPNVKESTKRMVSTGLLAASELS 685
LE+PN EIGNF KPVQKF GLGSV+ SG+SADV SQPF N KEST R+ STGL ASELS
Sbjct: 544 LEKPNSEIGNFGKPVQKFTGLGSVAFSGQSADVPSQPFLNAKESTLRLGSTGLQDASELS 603
Query: 686 SDKAMFLNKIDPISSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLTQS 745
SD+AMFLNKIDP SSVL NS QS+KT+N GPSFG ANAF F+G+ FQ KDV STLTQ
Sbjct: 604 SDRAMFLNKIDPASSVLPLNSLQSTKTDNLGPSFGAANAFTAFTGRSFQTKDVSSTLTQI 663
Query: 746 GKQVTGGAGKIESLPVLRSSQISLQDNLS-AKISNEKHDGSDRNYSNAPLAKPMKEMCEG 805
G+QVT GAGKIESLP +RSSQ+ LQDN S K SNEKH S+RNYSN PLAKPMKEMC+G
Sbjct: 664 GRQVTAGAGKIESLPPMRSSQVPLQDNFSLGKTSNEKHSRSERNYSNVPLAKPMKEMCDG 723
Query: 806 LDMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERSQEVQNLFDK 865
LDMLLESIEEPGGF DACTA QKSS+EALE GLA+LSD+CQIW TMNER+QE+QNLFDK
Sbjct: 724 LDMLLESIEEPGGFWDACTASQKSSIEALELGLATLSDQCQIWGRTMNERAQEIQNLFDK 783
Query: 866 MV-QVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELER 925
V QV+ KKTYIEGIV QAS S YWE WDRQ+LSSELELKRQHILK NQNMTNQLIELER
Sbjct: 784 TVNQVMPKKTYIEGIVKQASHSHYWEHWDRQRLSSELELKRQHILKTNQNMTNQLIELER 843
Query: 926 HFNGLELNKFGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSKQ 985
HFNGLELNKFGGN+ESQ SERALQRKFG SRHSHS HSLNNI GSQLAAAQLLSESLSKQ
Sbjct: 844 HFNGLELNKFGGNDESQVSERALQRKFGSSRHSHSFHSLNNITGSQLAAAQLLSESLSKQ 903
Query: 986 LAALNIESPSLKRQRVTKELFETIGLTYDASFGSPNVNKIAETSSKKLLLSADSFSSKDT 1045
+AALNIESPS KRQ VTKELFETIG+TYDASF SPNVNKIAETSSKKLLLSADSFSSKD+
Sbjct: 904 MAALNIESPSSKRQSVTKELFETIGITYDASFSSPNVNKIAETSSKKLLLSADSFSSKDS 963
Query: 1046 LRRKQRSGRKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSRTPE 1105
RRK RSG KNSEAETGRRRR+SLDRNLASVEPPKTTVKRMLL+GIP +DEK FRS TPE
Sbjct: 964 SRRKLRSGMKNSEAETGRRRRESLDRNLASVEPPKTTVKRMLLEGIPLADEKHFRSPTPE 1023
Query: 1106 GAATVAGPASRLTSSKSSSSSKNA-------ATPFMWASPLQPSNTSRQKSQPLPKTNTT 1165
G ATV PASR+ SS SSSSKNA ATPFMW+SP Q SN SRQKSQPL KTN T
Sbjct: 1024 GTATVTRPASRIASSMLSSSSKNAEHSSENPATPFMWSSPSQSSNISRQKSQPLKKTNAT 1083
Query: 1166 APSPLS-VFQSSHEMLKKSNNEAFNVTSENKFI-----EKSKASDFFSVTRSDSVQKSNI 1225
APSPL V+QSSHEM KKSN EA++VTS+NKF EKSK+SDF S+TRSDSVQKSNI
Sbjct: 1084 APSPLPVVYQSSHEMPKKSNTEAYSVTSDNKFTEATYPEKSKSSDFLSLTRSDSVQKSNI 1143
Query: 1226 NLDQKSSIFTISSKQTPTLKDSINTSNLDNQKTANAKERHTTTSPLFGSANKPESASLGT 1285
NLDQKSSIF IS+ Q PTLKDSINTSNL+ QKTAN KERHT S LF SANKPESA +GT
Sbjct: 1144 NLDQKSSIFKISNNQMPTLKDSINTSNLNGQKTANVKERHTPKSSLFESANKPESAFVGT 1203
Query: 1286 MSSLVPTVNEARKTEEKRSLTTISPSVPASAQLNTPSS-STLFLGFAVSKPLPSSAAVID 1345
S+ VPTV ARKTEEK SLT SPSVPA A LNTPSS STLF GF+V+K L +S A +D
Sbjct: 1204 ASTPVPTVLGARKTEEKTSLTAFSPSVPAPALLNTPSSASTLFSGFSVTKSLTNSTAHVD 1263
Query: 1346 LNQPVSTSTQLNFSTPVVSVSDSLFQAPKMISTSSTLSLNPSLESSKKELPVSKSDDDTE 1405
LN+P+ST TQ NFS+P VSVSDSLFQAPKM+S S P+ SKKELP KSD DT
Sbjct: 1264 LNKPLSTFTQSNFSSPAVSVSDSLFQAPKMVSPS------PTTLESKKELPGPKSDADTP 1323
Query: 1406 KQTPASK-PESYELKFQPSVTP-DKKHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSVA 1465
K P SK PES+ELK QPSVTP DK HVEPTS + TV KDVGG VPNV+ QQ S A
Sbjct: 1324 KPAPDSKPPESHELKLQPSVTPADKNHVEPTSGSQTVPKDVGGLVPNVL-----QQSSAA 1383
Query: 1466 FAPLPSPNLTPKIFGNVRNETSNVTATQDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPM 1525
F PLP+ NLT K N +NETS+ TQDDDMDEEAPET NN+EFSLSSLGGFGNSSTP+
Sbjct: 1384 FVPLPTLNLTSKSSTNGKNETSDAALTQDDDMDEEAPET-NNVEFSLSSLGGFGNSSTPI 1443
Query: 1526 SGAPKPNPFGGPFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVA 1585
S APK NPFGGPFGNVNATSM SSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVA
Sbjct: 1444 SSAPKSNPFGGPFGNVNATSMNSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVA 1503
Query: 1586 FSGGFGSAMAT--QAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSPG 1645
FSGGFGS MAT Q SQGGFGQPAQIGVGQQALG VLG+FG+SRQLGPSLPGT SGSP
Sbjct: 1504 FSGGFGSGMATQPQTSSQGGFGQPAQIGVGQQALGTVLGAFGRSRQLGPSLPGTASGSPS 1563
Query: 1646 GFGGGFTSMKPVGGFASVGSSGGG---------SGGFAGVGSGGGGGFGGVGSN------ 1695
GF GGFT +KP+GGFA VGS GG GGF GVGSG GGGFG VGS+
Sbjct: 1564 GFSGGFTGVKPIGGFAGVGSGSGGGFGGVGSVSGGGFGGVGSGSGGGFGAVGSSSGGGFG 1623
BLAST of CcUC11G220380 vs. ExPASy TrEMBL
Match:
A0A6J1HNV2 (nuclear pore complex protein NUP214 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111466378 PE=4 SV=1)
HSP 1 Score: 2040.0 bits (5284), Expect = 0.0e+00
Identity = 1193/1721 (69.32%), Postives = 1314/1721 (76.35%), Query Frame = 0
Query: 89 MASVDSR---PSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLA 148
MASVDSR ST IPLED+ EGE + ND+YF+KIG+PVPVKL DSIFDP +PPSQPLA
Sbjct: 1 MASVDSRHSTSSTPIPLEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA 60
Query: 149 LSESFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLST 208
+SESFGLIFVAHLSG+ TKDV+ASA+E+KNGGTGSS+QDLSIVD+S+GKVH+L LS
Sbjct: 61 VSESFGLIFVAHLSGFFAVRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN 120
Query: 209 DDSILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHG 268
D+S LAA+VAGD+HLF V SLLDK + PS SCS TDSS IKDFKWTRK E+SYLVLSKHG
Sbjct: 121 DNSFLAAVVAGDVHLFLVDSLLDKGEEPSFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG 180
Query: 269 QLYQGSANGPPTHVMHDIDAVDC------------------------------------- 328
+LYQGSA+GP H+MHDIDAV+C
Sbjct: 181 KLYQGSASGPFKHIMHDIDAVECSVKGKFIAVAKKDTLTVFSYKFKERLSMSLLPSLGNG 240
Query: 329 ------------IKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLS 388
IKWVRADCIIIGCFQVTATGDEEDYFV VIRSKDGKITDVSSNKVLLS
Sbjct: 241 DTDTDFAVKVDSIKWVRADCIIIGCFQVTATGDEEDYFVQVIRSKDGKITDVSSNKVLLS 300
Query: 389 FRDIHSGFTRDILPGDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVI 448
F DI+SGFT DILP + GPCLLLSYLDKCKLAIVANR ++HIVLLG LQEVENEVAVI
Sbjct: 301 FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI 360
Query: 449 NIDRNTSLPKIELQANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLE 508
+I+R+ SLP+IELQ NGDDNLVMGLCIDRVSLPGKV V+VG E++REVSPYC L+CLTLE
Sbjct: 361 DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE 420
Query: 509 GELIMFQFSSVNETEAPHETVSACD-EEEDDIIVPADDRSQLFSGSKKEFREDDLKMQVT 568
G+LI+F FSS NE+EA ETVSACD EEED+ +VP DD+ QLF
Sbjct: 421 GKLILFHFSSANESEASDETVSACDEEEEDETVVPTDDQPQLF----------------- 480
Query: 569 EKLAISSEIPQEKIKISNDIKSSNNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLE 628
SN DQ PVSK+D S + ESN KSQ+ DS +SQ L+
Sbjct: 481 ----------------------SNIDQRPVSKVDGSPVITRESNAKSQQMDSLAFSQPLK 540
Query: 629 SSVLERPNYEIGNFDKPVQKF-GLGSVSISGKSADVHSQPFP------------NVKEST 688
S LERPN EIGNF KPV+ F GLGSV+ SG+S DV SQP N +
Sbjct: 541 PSTLERPNNEIGNFAKPVKNFTGLGSVAFSGQSVDVPSQPLKSSILERPNNEIGNFNKPF 600
Query: 689 KRMVSTGLLAASELSSD------KAMFL--------------NKIDPISSV--------L 748
+ G +A S S D K FL K + SV +
Sbjct: 601 HKFTGLGSVAFSGQSVDMPNESLKPSFLERPNNQIGNFDKPVQKFTGLGSVAFSEQSVDV 660
Query: 749 TPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLTQSGKQVTGGAGKIESLPVL 808
+ F + K S G ANAF GF+GKPFQPKDVPSTLTQSG+QV+ GAGKIESLPV+
Sbjct: 661 PSHPFLNVKESTIKHSSGAANAFTGFAGKPFQPKDVPSTLTQSGRQVSAGAGKIESLPVI 720
Query: 809 RSSQISLQDNLS-AKISNEKHDGSDRNYSNAPLAKPMKEMCEGLDMLLESIEEPGGFLDA 868
+SSQ+SLQDN S KISN+K DGS+RNY N PLAKPM EMCEGLDMLLESIEEPGGFLDA
Sbjct: 721 QSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMNEMCEGLDMLLESIEEPGGFLDA 780
Query: 869 CTAFQKSSVEALERGLASLSDECQIWKSTMNERSQEVQNLFDKMVQVLSKKTYIEGIVMQ 928
CT FQKSSVEAL GLA+LSD+CQIW+ TM ER+QEVQNLFD+ V+VLSKKTYIEGIV Q
Sbjct: 781 CTTFQKSSVEALGLGLATLSDQCQIWRRTMTERAQEVQNLFDRTVEVLSKKTYIEGIVTQ 840
Query: 929 ASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELERHFNGLELNKFGGNEESQA 988
ASDS YW+ WDRQKLSSELELKRQ IL+MNQNMTNQLIELERHFNGLELN FGGNEE Q
Sbjct: 841 ASDSNYWDHWDRQKLSSELELKRQRILQMNQNMTNQLIELERHFNGLELNTFGGNEEIQV 900
Query: 989 SERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSKQLAALNIESPSLKRQRVTK 1048
+ER LQRKFG SR SHSLHSLNNIMGSQLAAAQLLS++LSKQ+A LNI+SPS KRQ +TK
Sbjct: 901 NERTLQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNLSKQIATLNIKSPSSKRQSITK 960
Query: 1049 ELFETIGLTYDASFGSPNVNKIAETSSKKLLLSADSFSSKDTLRRKQRSGRKNSEAETGR 1108
ELFETIG+TYDASF SPNVNKI ETSSKKLLLSADSFSSKDT RRKQRSG K SE ETGR
Sbjct: 961 ELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSSKDTSRRKQRSGAKISETETGR 1020
Query: 1109 RRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSRTPEGAATVAGPASRLTSSKSS 1168
RRRDSLDRNLAS++PPKTTVKRM+LQG P S+EK FRS T EG ATVA PA R+ SS S
Sbjct: 1021 RRRDSLDRNLASIQPPKTTVKRMILQGTPLSNEKQFRSPTLEGPATVARPAGRIPSSMLS 1080
Query: 1169 SSSKNA-------ATPFMWASPLQPSNTSRQKSQPLPKTNTTAPSPLSVFQSSHEMLKKS 1228
SSSKNA ATPF WASP RQK QPL KTN TAPSPL V+QSSHEM+KKS
Sbjct: 1081 SSSKNAEQGSENPATPFSWASP------PRQKFQPLQKTNGTAPSPLPVYQSSHEMVKKS 1140
Query: 1229 NNEAFNVTSENKFI-----EKSKASDFFSVTRSDSVQKSNINLDQKSSIFTISSKQTPTL 1288
N+EA++ SENKF EKSKASDFFS+ RSDSVQKSN+N +QKSS F SSK T
Sbjct: 1141 NSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQKSSFFVTSSKPMSTP 1200
Query: 1289 KDSINTSNLDNQKTANAKERHTTTSPLFGSANKPESASLGTMSSLVPTVNEARKTEEKRS 1348
KDSI T N ++QKTAN KER TT SPLFG+ANKPE AS+GT SSLVPTV+E RKTEEK+
Sbjct: 1201 KDSIETLNPNSQKTANVKERLTTPSPLFGAANKPEPASVGTTSSLVPTVDELRKTEEKKP 1260
Query: 1349 LTTISPSVPASAQLNTPSS-STLFLGFAVSKPLPSSAAVIDLNQPVSTSTQLNFSTPVVS 1408
T SPSVPAS +NTPSS STLF G +SK PS AAV+DLN+P+STSTQ +F++PVVS
Sbjct: 1261 PTVFSPSVPASVPVNTPSSASTLFSGSPLSKSFPSPAAVVDLNKPLSTSTQSSFASPVVS 1320
Query: 1409 VSDSLFQAPKMISTSSTL-SLNPSLESSKKELPVSKSDDDTEKQTPASKPESYELKFQPS 1468
VSDSLFQAPKM+S STL SLNPSL SS KE P+ KSD DTEKQ ASKPE ELK QPS
Sbjct: 1321 VSDSLFQAPKMVSPPSTLSSLNPSLVSSSKEQPIPKSDADTEKQAQASKPEFRELKLQPS 1380
Query: 1469 VT-PDKKHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSVAFAPLPSPNLTPKIFGNVRN 1528
VT HVEPTS T TVSKDVGG VP+VI DAQPQQ S AF PLPSPN TPK+ N ++
Sbjct: 1381 VTLAVGNHVEPTSVTQTVSKDVGGHVPSVIADAQPQQSSAAFVPLPSPNSTPKVSANGKS 1440
Query: 1529 ETSNVTATQDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPMSGAPKPNPFGGPFGNVNAT 1588
ETS+ TQDDDMDEEAPET NN+EFSLSSLGGFG +STPMS APKPNPFGG FGN NAT
Sbjct: 1441 ETSDALITQDDDMDEEAPET-NNVEFSLSSLGGFGTTSTPMSNAPKPNPFGGSFGNANAT 1500
Query: 1589 SMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGGFGSAMATQAPSQGGF 1648
SM SSFT ASPPSGELFRPASFSFQSPLASQAASQPTNSVAFS FGS MATQAP+QGGF
Sbjct: 1501 SMNSSFTTASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSSSFGSGMATQAPTQGGF 1560
Query: 1649 GQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSPGGF-GGGFTSMKPVGGFASVGS 1695
GQPAQIGVGQQALG VLGSFGQSRQLGPSLPGT SGSPGGF GGGFTS+KPVG
Sbjct: 1561 GQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGFTSVKPVG------- 1620
BLAST of CcUC11G220380 vs. TAIR 10
Match:
AT1G55540.1 (Nuclear pore complex protein )
HSP 1 Score: 793.9 bits (2049), Expect = 2.6e-229
Identity = 691/1849 (37.37%), Postives = 939/1849 (50.78%), Query Frame = 0
Query: 100 IPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSESFGLIFVAHLS 159
+ +E+ EG++I ND+YF++IG+P+ +K D+ +D + PPSQPLA+SE ++FVAH S
Sbjct: 4 VEIEEDTEGDRISTNDYYFERIGEPISIKEDDAQYDLENPPSQPLAISERHAVLFVAHSS 63
Query: 160 GW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDSILAAIVAGDIH 219
G+ T DVI++++ G +QDLS+VD+ +G V IL+LS DDSILA VA DIH
Sbjct: 64 GFFVGRTNDVISASKNSNGNGDKVFIQDLSLVDVPVGDVRILSLSADDSILAVTVAADIH 123
Query: 220 LFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLYQGSANGPPTHV 279
FSV SLL K PS S S +S F+KDF+W R + SYLVLS G+L+ G N PP HV
Sbjct: 124 FFSVDSLLKKDAKPSFSYSPDESGFVKDFRWRRNDKHSYLVLSNTGKLFHGIDNAPPRHV 183
Query: 280 MHDIDA-------------------------------------------------VDCIK 339
M +DA VD I+
Sbjct: 184 MDAVDAVEWSSKGSYIAVAQDNSLRIFSSKFNEKRCIALSFDSWIGDSDEDCFVKVDSIR 243
Query: 340 WVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRDIHSGFTRDILP 399
WVR +CI++GCFQ+ G EE+Y V VIRS DGKI+D S+N V LSF D+ D++P
Sbjct: 244 WVRNNCILLGCFQL-IEGREENYLVQVIRSPDGKISDGSTNLVALSFSDLFPCSMDDLVP 303
Query: 400 GDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQ-EVENEVAVINIDRNTSLPKIEL 459
+GP LL SY+D+CKLA+ ANR ++EHIVLL + ++ V+V++IDR T LP+I L
Sbjct: 304 VGVGPHLLFSYIDQCKLAVTANRKSIDEHIVLLDWSSGDDKSAVSVVDIDRETFLPRIGL 363
Query: 460 QANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLEGELIMFQFSSVNE 519
Q N DDN VMGLCIDRVS+ G V VR G ++++E+ PY +LVCLTLEG+L+MF +SV
Sbjct: 364 QENNDDNTVMGLCIDRVSIEGTVNVRSGDDELKELQPYFVLVCLTLEGKLVMFNVASVAG 423
Query: 520 TEAPHETVSACDEEEDDIIVP--ADDRSQLFSGSKKEFR---EDDLKMQVTEKLAISSEI 579
A +T A + +D P DD S+ S ++ ++D K TEK + +
Sbjct: 424 RPASSDTDLASSSDIEDAYTPLIEDDLSKQSSEKHQQLNIAVQNDQKHLNTEKFSTEQRL 483
Query: 580 PQEKI--KISNDIKSS-----NNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLESS 639
P E I K +KSS N Q P ++ +S SF L S
Sbjct: 484 PNENIFSKEFESVKSSVSGDNNKKQEPYAEKPLQVEDAQQSMIPRLSGTSF---GQLPMS 543
Query: 640 VLERPNYEIGNFDKPVQKFGLGSVSISGKSADVHSQPFPNVKESTKRMVSTGLLAA---- 699
+ N + F + I +S +H Q K + S GL A
Sbjct: 544 LGYDTN-KFAGFGPALPVSEKLQKDIFAQSNSMHLQANVESKSTAAFFGSPGLQNAILQS 603
Query: 700 -SELSSDKAMFLNKIDPISSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPS 759
SS + P V P F S + + S + G+ P KD
Sbjct: 604 PQNTSSQPWSSGKSVSPPDFVSGP--FPSMRDTQHKQS---VQSGTGYVNPPMSIKDKSV 663
Query: 760 TLTQSGK---------------QVTGGAGKIESLPVLRSSQISLQ--DNLSAKISNEKHD 819
+ ++G+ G KIE +P +R+SQ+S Q + S+++H
Sbjct: 664 QVIETGRVSALSNLSPLLGQNQDTNEGVEKIEPIPSIRASQLSQQVKSSFEKSASHQQHK 723
Query: 820 GS--------DRNYSNAPLAKPMKEMCEGLDMLLESIEEPGGFLDACTAFQKSSVEALER 879
+ N SN P + EM +D LL+SIE PGGF D+C KS+VE LE+
Sbjct: 724 TPLSTGPLRLEHNMSNQP--SNINEMAREMDTLLQSIEGPGGFKDSCAFILKSNVEELEQ 783
Query: 880 GLASLSDECQIWKSTMNERSQEVQNLFDKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQK 939
GL SL+ +CQ WKST++E+ E+Q+L DK +QVL+KKTY+EG+ Q +D++YW+ W+RQK
Sbjct: 784 GLESLAGKCQTWKSTIHEQQAEIQHLLDKTIQVLAKKTYMEGMYKQTADNQYWQLWNRQK 843
Query: 940 LSSELELKRQHILKMNQNMTNQLIELERHFNGLELNKFGGNEESQASERALQRKFGYSRH 999
L+ ELE KRQHI+K+N+++T+QLIELER+FN LEL+++ + + R + + SR
Sbjct: 844 LNPELEAKRQHIMKLNKDLTHQLIELERYFNRLELDRYNEDGGHPVARRGVPNRSAPSRR 903
Query: 1000 SHSLHSLNNIMGSQLAAAQLLSESLSKQLAALNIESPSLKRQRVTKELFETIGLTYDASF 1059
SLHSL+N M SQLAAA+ LSE LSKQ+ L I+SP ++ V +ELFETIG+ YDASF
Sbjct: 904 VQSLHSLHNTMSSQLAAAEQLSECLSKQMTYLKIDSP--VKKNVKQELFETIGIPYDASF 963
Query: 1060 GSPNVNKIAETSS-KKLLLSADSFSSKDTLRRKQRSGRKNSEAETGRRRRDSLDRNLASV 1119
SP+ K SS K LLLS+ S R++Q S KNS+ ET RRRR+SLDRN A+
Sbjct: 964 SSPDAVKAKNASSAKNLLLSSIPASINQQSRQRQSSAMKNSDPETARRRRESLDRNWAAF 1023
Query: 1120 EPPKTTVKRMLLQ---------------GIPSSDEKLFRS--RTPEGAATVAGPASRLTS 1179
EPPKTTVKRMLLQ + S++ RS + A+ V +
Sbjct: 1024 EPPKTTVKRMLLQEQQKTGMNQQTVLSERLRSANNTQDRSLLHVKDHASPVVSSNKGIME 1083
Query: 1180 SKSSSSSKNAATPFMWASPLQPSN-----------------TSRQKSQPLPKTNTTAPSP 1239
S +S+ +TPF P+ SN + + S +APS
Sbjct: 1084 SFQQDTSEAQSTPFKTRPPMPQSNSPFTISPISASKPSFNWSGNKSSNTTSYAEESAPSQ 1143
Query: 1240 L----SVFQSS---------------HEMLKKSNNEAFNVTSENKFIE-----------K 1299
+ +V Q + KK+ F+ N F+E
Sbjct: 1144 IKDTRTVSQPGGSSFLPKRPVASTVLEQTEKKAGEFKFSEAKANAFVETAAGSVQRLSTT 1203
Query: 1300 SKASDFFS--------------VTRSDSVQKSNINLDQKSSI----FTISSKQTP---TL 1359
S SDF S S KS + SSI FT + P T
Sbjct: 1204 SSGSDFESSKGFGAQFSTMSSGAPASSFSSKSLFGFNSSSSIPGDKFTFPAVTAPLSGTP 1263
Query: 1360 KDSINT-SNLDNQKTANAKERHTTTSPLFGSANKPESASLGTMSSLVPT----------- 1419
DS +T + +++ + S SA P++ S+ + S++ T
Sbjct: 1264 LDSTSTLFTASSAPVSSSSQDPVPASIPISSAPVPQTFSVTSTSTVSATGFNVPFGKPLT 1323
Query: 1420 -----VNEARKTEEKRS----------LTTISPSVPASAQLNTPSSSTLFLGFA----VS 1479
+N+A + S L +SPS P +T SS LF A VS
Sbjct: 1324 SVKVDLNQAAPSTPSPSPGPTAGFTFNLPALSPSSPEMVSSSTGQSS-LFPPSAPTSQVS 1383
Query: 1480 KPLPSSAAVIDLNQPVSTSTQLNFSTPVVSVSDSLFQAPKMISTSSTLSLNPSLESSKKE 1539
S+ + + + + +ST L+ STP ++ D+ FQ+P++ + SS + + + KK
Sbjct: 1384 SDQASATSSLTDSSRLFSSTSLS-STPPITPPDA-FQSPQVSTPSSAVPITEPVSEPKKP 1443
Query: 1540 LPVSKSDDDTEKQTP----ASKPESYELKFQPSVTPDKKHVEPTSKTHTVSKDVGGQVPN 1599
S S T+ A+K ++ L + ++ V P S + +S G +
Sbjct: 1444 EAQSSSILSTQSTVDSVANATKTQNEPLPVKSEISNPGTTVTPVSSSGFLSGFSSGTQSS 1503
Query: 1600 VI----------GDAQPQQPSVAFAPLPSPNLTPKIFGNVRNETSNVTATQDDDMDEEAP 1659
+ G +QPQQ S AP P+ + T + E ++ TQ+D+MDEEAP
Sbjct: 1504 LASMAAPSFSWPGSSQPQQLSSTPAPFPASSPTS---ASPFGEKKDIVDTQEDEMDEEAP 1563
Query: 1660 ETNNNIEFSLSSLGGFGNSSTPMSGAPKPNPFGGPFGNVNATSMTSSFTMASPPSGELFR 1695
E + E S+ S GGFG STP GAPK NPFGGPFG NAT+ TS+ + PSGELF+
Sbjct: 1564 EASQTTELSMGSFGGFGLGSTPNPGAPKTNPFGGPFG--NATTTTSNPFNMTVPSGELFK 1623
BLAST of CcUC11G220380 vs. TAIR 10
Match:
AT1G55540.2 (Nuclear pore complex protein )
HSP 1 Score: 788.5 bits (2035), Expect = 1.1e-227
Identity = 691/1852 (37.31%), Postives = 939/1852 (50.70%), Query Frame = 0
Query: 100 IPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSESFGLIFVAHLS 159
+ +E+ EG++I ND+YF++IG+P+ +K D+ +D + PPSQPLA+SE ++FVAH S
Sbjct: 4 VEIEEDTEGDRISTNDYYFERIGEPISIKEDDAQYDLENPPSQPLAISERHAVLFVAHSS 63
Query: 160 GW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDSILAAIVAGDIH 219
G+ T DVI++++ G +QDLS+VD+ +G V IL+LS DDSILA VA DIH
Sbjct: 64 GFFVGRTNDVISASKNSNGNGDKVFIQDLSLVDVPVGDVRILSLSADDSILAVTVAADIH 123
Query: 220 LFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLYQGSANGPPTHV 279
FSV SLL K PS S S +S F+KDF+W R + SYLVLS G+L+ G N PP HV
Sbjct: 124 FFSVDSLLKKDAKPSFSYSPDESGFVKDFRWRRNDKHSYLVLSNTGKLFHGIDNAPPRHV 183
Query: 280 MHDIDA-------------------------------------------------VDCIK 339
M +DA VD I+
Sbjct: 184 MDAVDAVEWSSKGSYIAVAQDNSLRIFSSKFNEKRCIALSFDSWIGDSDEDCFVKVDSIR 243
Query: 340 WVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRDIHSGFTRDILP 399
WVR +CI++GCFQ+ G EE+Y V VIRS DGKI+D S+N V LSF D+ D++P
Sbjct: 244 WVRNNCILLGCFQL-IEGREENYLVQVIRSPDGKISDGSTNLVALSFSDLFPCSMDDLVP 303
Query: 400 GDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQ-EVENEVAVINIDRNTSLPKIEL 459
+GP LL SY+D+CKLA+ ANR ++EHIVLL + ++ V+V++IDR T LP+I L
Sbjct: 304 VGVGPHLLFSYIDQCKLAVTANRKSIDEHIVLLDWSSGDDKSAVSVVDIDRETFLPRIGL 363
Query: 460 QANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLEGELIMFQFSSVNE 519
Q N DDN VMGLCIDRVS+ G V VR G ++++E+ PY +LVCLTLEG+L+MF +SV
Sbjct: 364 QENNDDNTVMGLCIDRVSIEGTVNVRSGDDELKELQPYFVLVCLTLEGKLVMFNVASVAG 423
Query: 520 TEAPHETVSACDEEEDDIIVP--ADDRSQLFSGSKKEFR---EDDLKMQVTEKLAISSEI 579
A +T A + +D P DD S+ S ++ ++D K TEK + +
Sbjct: 424 RPASSDTDLASSSDIEDAYTPLIEDDLSKQSSEKHQQLNIAVQNDQKHLNTEKFSTEQRL 483
Query: 580 PQEKI--KISNDIKSS-----NNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLESS 639
P E I K +KSS N Q P ++ +S SF L S
Sbjct: 484 PNENIFSKEFESVKSSVSGDNNKKQEPYAEKPLQVEDAQQSMIPRLSGTSF---GQLPMS 543
Query: 640 VLERPNYEIGNFDKPVQKFGLGSVSISGKSADVHSQPFPNVKESTKRMVSTGLLAA---- 699
+ N + F + I +S +H Q K + S GL A
Sbjct: 544 LGYDTN-KFAGFGPALPVSEKLQKDIFAQSNSMHLQANVESKSTAAFFGSPGLQNAILQS 603
Query: 700 -SELSSDKAMFLNKIDPISSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPS 759
SS + P V P F S + + S + G+ P KD
Sbjct: 604 PQNTSSQPWSSGKSVSPPDFVSGP--FPSMRDTQHKQS---VQSGTGYVNPPMSIKDKSV 663
Query: 760 TLTQSGK---------------QVTGGAGKIESLPVLRSSQISLQ--DNLSAKISNEKHD 819
+ ++G+ G KIE +P +R+SQ+S Q + S+++H
Sbjct: 664 QVIETGRVSALSNLSPLLGQNQDTNEGVEKIEPIPSIRASQLSQQVKSSFEKSASHQQHK 723
Query: 820 GS--------DRNYSNAPLAKPMKEMCEGLDMLLESIEEPGGFLDACTAFQKSSVEALER 879
+ N SN P + EM +D LL+SIE PGGF D+C KS+VE LE+
Sbjct: 724 TPLSTGPLRLEHNMSNQP--SNINEMAREMDTLLQSIEGPGGFKDSCAFILKSNVEELEQ 783
Query: 880 GLASLSDECQIWKSTMNERSQEVQNLFDKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQK 939
GL SL+ +CQ WKST++E+ E+Q+L DK +QVL+KKTY+EG+ Q +D++YW+ W+RQK
Sbjct: 784 GLESLAGKCQTWKSTIHEQQAEIQHLLDKTIQVLAKKTYMEGMYKQTADNQYWQLWNRQK 843
Query: 940 LSSELELKRQHILKMNQNMTNQLIELERHFNGLELNKFGGNEESQASERALQRKFGYSRH 999
L+ ELE KRQHI+K+N+++T+QLIELER+FN LEL+++ + + R + + SR
Sbjct: 844 LNPELEAKRQHIMKLNKDLTHQLIELERYFNRLELDRYNEDGGHPVARRGVPNRSAPSRR 903
Query: 1000 SHSLHSLNNIMGSQLAAAQLLSESLSKQLAALNIESPSLKRQRVTKELFETIGLTYDASF 1059
SLHSL+N M SQLAAA+ LSE LSKQ+ L I+SP ++ V +ELFETIG+ YDASF
Sbjct: 904 VQSLHSLHNTMSSQLAAAEQLSECLSKQMTYLKIDSP--VKKNVKQELFETIGIPYDASF 963
Query: 1060 GSPNVNKIAETSS-KKLLLSADSFSSKDTLRRKQRSGRKNSEAETGRRRRDSLDR---NL 1119
SP+ K SS K LLLS+ S R++Q S KNS+ ET RRRR+SLDR N
Sbjct: 964 SSPDAVKAKNASSAKNLLLSSIPASINQQSRQRQSSAMKNSDPETARRRRESLDRVIFNW 1023
Query: 1120 ASVEPPKTTVKRMLLQ---------------GIPSSDEKLFRS--RTPEGAATVAGPASR 1179
A+ EPPKTTVKRMLLQ + S++ RS + A+ V
Sbjct: 1024 AAFEPPKTTVKRMLLQEQQKTGMNQQTVLSERLRSANNTQDRSLLHVKDHASPVVSSNKG 1083
Query: 1180 LTSSKSSSSSKNAATPFMWASPLQPSN-----------------TSRQKSQPLPKTNTTA 1239
+ S +S+ +TPF P+ SN + + S +A
Sbjct: 1084 IMESFQQDTSEAQSTPFKTRPPMPQSNSPFTISPISASKPSFNWSGNKSSNTTSYAEESA 1143
Query: 1240 PSPL----SVFQSS---------------HEMLKKSNNEAFNVTSENKFIE--------- 1299
PS + +V Q + KK+ F+ N F+E
Sbjct: 1144 PSQIKDTRTVSQPGGSSFLPKRPVASTVLEQTEKKAGEFKFSEAKANAFVETAAGSVQRL 1203
Query: 1300 --KSKASDFFS--------------VTRSDSVQKSNINLDQKSSI----FTISSKQTP-- 1359
S SDF S S KS + SSI FT + P
Sbjct: 1204 STTSSGSDFESSKGFGAQFSTMSSGAPASSFSSKSLFGFNSSSSIPGDKFTFPAVTAPLS 1263
Query: 1360 -TLKDSINT-SNLDNQKTANAKERHTTTSPLFGSANKPESASLGTMSSLVPT-------- 1419
T DS +T + +++ + S SA P++ S+ + S++ T
Sbjct: 1264 GTPLDSTSTLFTASSAPVSSSSQDPVPASIPISSAPVPQTFSVTSTSTVSATGFNVPFGK 1323
Query: 1420 --------VNEARKTEEKRS----------LTTISPSVPASAQLNTPSSSTLFLGFA--- 1479
+N+A + S L +SPS P +T SS LF A
Sbjct: 1324 PLTSVKVDLNQAAPSTPSPSPGPTAGFTFNLPALSPSSPEMVSSSTGQSS-LFPPSAPTS 1383
Query: 1480 -VSKPLPSSAAVIDLNQPVSTSTQLNFSTPVVSVSDSLFQAPKMISTSSTLSLNPSLESS 1539
VS S+ + + + + +ST L+ STP ++ D+ FQ+P++ + SS + + +
Sbjct: 1384 QVSSDQASATSSLTDSSRLFSSTSLS-STPPITPPDA-FQSPQVSTPSSAVPITEPVSEP 1443
Query: 1540 KKELPVSKSDDDTEKQTP----ASKPESYELKFQPSVTPDKKHVEPTSKTHTVSKDVGGQ 1599
KK S S T+ A+K ++ L + ++ V P S + +S G
Sbjct: 1444 KKPEAQSSSILSTQSTVDSVANATKTQNEPLPVKSEISNPGTTVTPVSSSGFLSGFSSGT 1503
Query: 1600 VPNVI----------GDAQPQQPSVAFAPLPSPNLTPKIFGNVRNETSNVTATQDDDMDE 1659
++ G +QPQQ S AP P+ + T + E ++ TQ+D+MDE
Sbjct: 1504 QSSLASMAAPSFSWPGSSQPQQLSSTPAPFPASSPTS---ASPFGEKKDIVDTQEDEMDE 1563
Query: 1660 EAPETNNNIEFSLSSLGGFGNSSTPMSGAPKPNPFGGPFGNVNATSMTSSFTMASPPSGE 1695
EAPE + E S+ S GGFG STP GAPK NPFGGPFG NAT+ TS+ + PSGE
Sbjct: 1564 EAPEASQTTELSMGSFGGFGLGSTPNPGAPKTNPFGGPFG--NATTTTSNPFNMTVPSGE 1623
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038892124.1 | 0.0e+00 | 87.11 | nuclear pore complex protein NUP214 isoform X2 [Benincasa hispida] | [more] |
XP_038892123.1 | 0.0e+00 | 86.85 | nuclear pore complex protein NUP214 isoform X1 [Benincasa hispida] | [more] |
XP_031741375.1 | 0.0e+00 | 81.17 | nuclear pore complex protein NUP214 isoform X2 [Cucumis sativus] | [more] |
XP_031741374.1 | 0.0e+00 | 80.69 | nuclear pore complex protein NUP214 isoform X1 [Cucumis sativus] >KGN52214.2 hyp... | [more] |
KAA0034115.1 | 0.0e+00 | 79.79 | nuclear pore complex protein NUP214 [Cucumis melo var. makuwa] | [more] |
Match Name | E-value | Identity | Description | |
F4I1T7 | 1.5e-226 | 37.31 | Nuclear pore complex protein NUP214 OS=Arabidopsis thaliana OX=3702 GN=NUP214 PE... | [more] |
Match Name | E-value | Identity | Description | |
A0A5A7SY34 | 0.0e+00 | 79.79 | Nuclear pore complex protein NUP214 OS=Cucumis melo var. makuwa OX=1194695 GN=E6... | [more] |
A0A0A0KV45 | 0.0e+00 | 78.77 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G583270 PE=4 SV=1 | [more] |
A0A1S3BDU8 | 0.0e+00 | 80.31 | LOW QUALITY PROTEIN: nuclear pore complex protein NUP214 OS=Cucumis melo OX=3656... | [more] |
A0A6J1CBF2 | 0.0e+00 | 72.80 | nuclear pore complex protein NUP214 OS=Momordica charantia OX=3673 GN=LOC1110100... | [more] |
A0A6J1HNV2 | 0.0e+00 | 69.32 | nuclear pore complex protein NUP214 isoform X3 OS=Cucurbita maxima OX=3661 GN=LO... | [more] |