Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, CDS, polypeptide
ATGGCTTCCGTTGATTCCCGACCTTCAACCTTGATTCCATTAGAAAACGCCGGCGAAGGAGAACAAATTGTAAGGAACGATTTCTACTTCCAAAAGATCAGCAAACCTGTTACCGTCAAGCTCTGCGACTCCATCTTTTATCCCGAAACCCCTCCCTCTCAGCCTCTTGCTCTCTCCGAGAGTTTCGGTCTCGTATTAGGTTTCCAAAACAATAACAAACCTATACTGCAAACCAAGAGCCCATTGGGCTTTGACTTAATAGAAACATGTAAGTTTTAGAGATTTAACCTTCACAAACTTTATTTTACGAATATTCTAAAATTGGAAAATATTTCAAGATTTTGGCTTAATTTTCAAACAAATTTAACCTTAAGCCTCATGGTCTCTCCGATTAAAATTTCAATTTTTTATAATTTAATATATCATAAATAATCATCTAAAATTCATTTAGACCTTATAAAATTGGTTCTTTCACAATTAATGTTGTACATATTATAATTAATGTTGGCATATTATTTAATGAGTATTTTTTTAGTCTTTCATAATTATCGCTTCAATTTATGAATAAAATAGGTTTACTCTTTATGTGTTTTGAATTGTATTATGCGTTTATGTAGTATTATACAAAGTATATTTTTATTGATTTTTGTTTTATCAATATATACTATGTTATATTAATTGTTCAATCTTGTACCATGATATGCATTGTGTTTCTTCATATATCTTGTGTGCTATTTTTTAAAAATATTTTCATATGTAACTAGTTTTTAATAGTATATTTCACAAATGTAATTACATTGTCATGTATATACATGTATATATTTGAAATTGAGTTATATATTCTGCATATTTTGTCATATATACACATGCTTGTTATATTAAGATAAATGATATCTTCATATTATTTGAAATAAACTCACATATATTTGAAATGGAAAACAAAATGTATAAATAAACCTCACATATACTGTGTATTATAACCCGCATGACTTTCATGTGCGAGTTTTTGAAAAAAAAAATTTAATGCGCGTAAATATGATATATTTTTTAAAGATTTGATTTTAAAAATAAAACAGTTTTGAAAAAAAAAAAATCATTTAATGCTGTGAATCTGATATATTTTATTTTTCATATGTGTCTTTTCTTATTTCTATTTTTTCTTATTATTATTTTTCAAAATTTTTAAAATTTGTGGAGGATGATATGGACATTGAACCATATTTTTTTGGTAATATTTAATTTCATTAAACTATTCATTAAATAATAATAAGTTTAAAACAATTTTGTAATTAAACTTATTTAATTAGGTTATATGTGTATATGTACCTAAACTAAATTAACATACTATGAAATACTATTACATTAAATAGTATTTACTCTAACACTGTAGACATGAGAGGGGATGTGGGTTCGTTCCGAAAGAGGAGATAGTAATGGAGCTTGATTTCAATGGGGTTTGATTGTGAGCTTTGGCAATGGCGATGGATGGAAGAGTCATGAGATGCGAACCAGCTGTGAACATATTTCCCTCTCTTTCTTCTATTTTGGAATTAGGATTCTCGAGAATACTTCATTTAATTTAAATGTTTTCGAGGGCTAGGGTTGCTCATTCAAACTAAAAAATTTAATCATTTTTAAAAACATAATCAAACTTAATAAAAAATTTTGTGAAATCAAAAGGTGAATTCTATCCGCTTTGAAGAGATTTTAAAACCGAATTATTGAGTTTCAATTTTGAAAAGCATAGTGAACCCAACTTTTTTTTTATTTATATAAATATTACTGATTTAAATACTTTATTTGTTAAATACAAATATGTTAAAGAAAAATTGCAAGAGTCACCTTTGACTTTTGATTTAATCCATCATTTCCCTAAACTATATGGTTTGCTTACACCCATCAACTATTTAATTCGATCAATTAGCCTCTGTATTTTTTACTTGTTGCAATAAAGCATCTATATTAATATTGATATTGATATAGTTTCAAGATTAAAATTT
TATTGGGTGCAGAGTTGTGTTGAGTTGAATTGAAATGTCTGTGATTCTGGTGATAAGTTGAGTTGAGTTAGAAAATCTGTGTTTGGGTGCTGATTTGAGTTGAATTGGGGTATATACTACAATTTTTTTAAACAAATGGATCTTATATATATGTTTTTTTTTTTCCTTTTTGGTATTTTTTTTTTTTATTTCTGATTGTTTTTTTTCCTTTTCCTTTTTTTATCTCCAAGTATGCCCATATGAATAAAAAATTTAACACTAGAAAATGATTAACTTTTTTCCATTAAAACATAACAAATTCGTACAAAGTAAAATTACACTATTGGTTCGCAAACTTTAGTAAAAGTAACGATTTGGTCCCTGAACTTTCAAATGTAACGATTTAGTCGGTGAAGTTATTAACTGGTACCAATTTGGTCCAACTCTCTTAAAAAATTCTGTTAGTTTGCCGATAGAATTATGAAATGATGCCATGTATCATCCTAAGTATAAAAAATAAAAGCATTGATGTATTAGAGAAACCAAAAAGGACTATTTATTATTTATTTATTTATTATTATTTTTTTAAATTCTCCTACATCTTCTTGTTCTCCGACTTCTTCTCTAGCTCTTTCTTCTTCTCCAACACGAAGAGTCAAAACTCTAACACGACCAACCCGATGAGCGAGAGAGAGTGAGAGCGAGGGTGTGAGAGAGCGAAAGAGATAGGGTGATTGTGAGTGACGAAAGGAAGCGAGACCGAGACGGATAAAATGATTGTGAGTGAAAATGAGAACGAGAGCGAAAAAGAGTGAGAGTGAGAGCGAAAGTGTGAGAAAGAGCGAGAGAGATAGAGTGATTGTGAGGAAAGGAAGCCAGAGTGGGAGCTAGAGATGCACGAGGGAGAATAATTATGGGGGTTGGGGATAATTAAAACACTTGGAAAATTGTGTCGATAAAATGGTGGGTAAAAGAATGGGTTATCATTTTCTTTTATTGTCAAACAAAGGTAGGTTTCATTTTAAAAATCCACCCCAACCCGTTGGGTTATCAAACAGCCCCTAAGTTTAAACTTTGTGAAGCCATTTAAAAGCAAAAATGCTTTAACATTTTGCTCAATTATTGACAAAAAAAACATGAAAAAGCATATACAAAGGAGTGGTCATTTAGCTAAAATGGGGTTGTAGCAACAAAAATATGACCAAAACAAAATTATTGTTCGCATACAAAAAAAAAAAAGTACGCGATCTCTAACCTTAGGTGCGTGATTGTTGAGTTCTTGCTTTGCAATTTTGTTTTGGTCATAACTCCCCAATAAAATTTAGTTTTAGCTAAATAACCACTTGTTTTCACATGGTTTTTTGTGTTATTTATAACAATGCTTATAAAAAGAATAAAATTTAAAAGAATATTGAATATAAATGATCTGACAACATTTAAACGTAGATAAAATTATATTTTGGACTATTAAAGTTAAAGATTGCTTAAGTTTTTGTTAAGTTTACAAGCCTGGATGGAAATAATGTAAAAAAAAATATGAAAACGAGTGACCATTTAGCTAAATCGAAGTTGTATCGATGGAGATATGACCTCAAAATTGTCAAGGGAGAACTAAGCGATTGCATACATGGATGTCAATGATTGCATTTTTTTTTTTTTTTTTTTGCGCAAATGATAATTTTGTTTTGGTTGTATATTTGTTGATACAATTTCGTTTTAGCTAAATAACTACTCGTTTTCATATGTTTTTTTTATGTTCTTTTCAAACATGCTTATAAATTAATCTAAATGTTAAAGCATAAGAAACACTAATAGAAGAAACTCGAAATAAAAAAAACGAAGAGAAAAGAAATTTGTTTAAAACTGACGACAAAATGAATATTCACATTTTTAAAAAGTAAAAAGACAGAATAATGAGCAAGGAATGTCCGACTTTCTGCTAAATAGGATGTATTGAACACATAATTCATTAGATATTTTTTATTAAAAAGAAAAAGAAAAGAAAAAGAAAAAAAAGAAGA
AGATCAATTACATATAAGTTTTTGGATTTTGAGTTAGTATGGGGAATTTTCCACAAACAAATGACACTATTTTGAATAGTTTGAAAACAACTTTATGCCAATTATTTTTAAACCTATAATATTAATCTTCTTAACCCGCCCGAAGGCACTTGGAGTGCGATGCTCCTAAAACCCCAGGCGGAACAACATCGGTTGCTTCCAAAAACCTTCTGCAGAGCTTCTCTCTCGCTTCAGAGAGAAGCCTTTTGCAATTCATGGCTTCCGTTGATTCCCGACCTTCCACCTTGATTCCATTAGAAGACGCCGGCGAAGGAGAACAAATAGTAAGGAACGATTTCTACTTCCAGAAGATCGGCAAACCTGTCCCGGTCAAGCTCTGCGACTCCATTTTTGATCCCCAAACCCCTCCCTCTCAGCCTCTTGCTCTCTCCGAGAGTTTCGGTCTCATCTTCGTTGCGCATTTGTCTGGTTGGTAATTTCAATTACTTCCCCCATTGTTGTGAATACCATTGAATTTTCTATGATTTTATTTTATATTGAAGAAAATTTTGTTTCCTAAGGGTTTTTTGTGGTGAGGACCAAGGATGTAATTGCTTCGGCCGAGGAGATAAAAAACGGGGGAACTGGTTCTTCTGTCCAGGATTTAAGCATAGTGGATATTTCCATCGGAAAAGTTCACATTCTAACTCTTTCCACGGATGATTCCATTCTTGCTGCCATCGTAGCTGGTGATATTCATCTTTTTTCAGTCCAGTCGCTGCTTGATAAGGTAGTGCTTTTCGCTGGAGCTTGAATATCATAGTATAGTTTCCCTGAAACGTCAATTCGGATATTTGCATCAACGGGTAAAATGTCGCAATCATTAGTTATATCTATTGCACGCCACTCCTCATATGTCATTACTGTTATGCAGGCAAAAACACCCTCTTCTTCTTGTTCATTAACTGATTCCAGTTTCATCAAAGACTTCAAATGGACCAGAAAGTTGGAAGATTCTTATCTGGTTCTTTCAAAGCATGGACAGTTATATCAAGGATCGGCGAATGGGCCTCCTACACATGTGATGCACGATATTGATGCCGGTAGGCTACATACTTTTGTATAGTAACTTGTGTATAATTCTTATTGACAATTTTAAGTGACGTTTGTTAAAGTTCTTACATTTTTTTAAAAATTACTGTATACTTCAAGAAAATGGATCTTTTTGCTACATATCTGCATGGGGAGATGCTGTGTTTCCCTGTCTAATTCGCCATTTCTGAGTGCCCCTTCATTAAACTTAAACGATCTCTCTAAAACCTTTCTAGTCAATTCTTCCAGTTACTACATCTATTGTGAAGACCTCTCTTTTCCTTGTATATTTCATTTAATCATTTAAACTTTGTTTCTTATAAAAGGACCCTTTGTTCTATTCATGAGTGTAAGAACGGTGTCTTGGAATGGAAATTTGAAAACAGGGAATTGCTACAACTTATACCTGGTTGTTATTGGCATTAGGCCTTAGAGAAACAGCAAAGGGCCATTATTGAACTGAGACTCAATTGAGGGCTTAGAAATTTTAGGAAGGATGACGAGGGAACACTGCTCATTAGCACAATAATTAGGGTAGCTAGGCCAGAGATAGGAATGTGATTGAAGATAGGATATTCATAATTGTTGGGCTCTGTTAATTTAACTATTTATCTTAGAGATCTTTATTTGTTCAGGAATTGAAGCTCATCTTTATTTTTTGATAGTGAACAATTTTAATATTGCATGTTTTTCCCCCCTCTCTTGCATGGACCGTGCTCCATGCCAGTGTTTTTGGCATATATTTTCTTCTTCTTTTGTATTCCCTTGCCCCTAATTTTCTGCCATTTTCTCCCCAAAGCGATCTATTAGCATGGGTTATTGTCAGCAAAACTTAGTATATGTGGAGCTACTGTTTTTGAAATTCTTTTAAGTCTTATTTTGGATTTTAGTGGAATGCAGTGTGAAAGGGAAATTCATTGCGGTGGCTA
AAAAGGATACTCTTACCATTTTCTCACACAAATTCAAAGAACGACTATCCATGTCACTCTTGCTGAGTTCAGGGAATGGTGAAACTGATACGGACTTTACAGTGAAGGGTTCTCTCTCTCTCTCTCTCTCTCTCTCTCCCTCTCTCTCTCTCTCTCTCTCTCTCTTCTGTGTGTGTGTAAAAATGATATTTGCAGAATGTTGAAACATGAAGGTTTAATAATTTTCGTTATTCATTTTTCTGTTTGGTTGGGGAGGGGATTTTCTTGTATTTTACTTTTGTGATGTATCATGAGGTGTCAATCAGGTTGGCTATGAGTTGAGTTTGAGTTGTTAGCTTGGAGTGAAGTACAGTTGGTTTCACCATAGTAATCCCAATAGATATTTTTGCACATTTTAACCCTATGTAGTTTGCTTCCTGTGGCAGATAGATTTGATTAGCAATGGTTTTTGCGGTTGAAGAAAATTCTAGATTTGGGCAATTATTATTTTATATGGGTTATTTGCCTAGAGAGAAATTGTGGAACTTTTGTATCTATTGCTAATTCATACCATGCAAGAGATTGAATACTAGCAAGCATATAAAATCTAACACAAAGGAGCCAAAATAAATCTCTAATTGCTGGGTTCTTGGCTAGTTTGGGAATTTGCCTCTTCCTTCCCCTTCTTTAATTCTCTGTCATTACTATATTATTCCTTTTCCAGAATAAAAGCAATAAAAAAACAATTAAATGAGGATTACAAGAGAACAAGGAATTCTCTAAATTCTCATTAATTCTAAATAATCCAGAAAATTTAGTATAATTAAAAAAAATGTCTGAAAACTGAAATAAATTGGAAACAATAGTACTACATACCATACAGAACTGCTATTTTTGACCTCAATAACAAGCTTACGTTGGCGATTCTAGAAAGTACTTATGGTTACTATCTTTTAACTTGATTTGTGGTGATTGCTCAGTTGACTGTATCAAGTGGGTTCGTGCTGATTGTATCATCATAGGATGCTTTCAAGTGACTGCAACAGGTGATGAAGAAGATTACTTTGTCCTAGTTATCAGAAGTAAAGATGGAAAAATCACTGACGTGAGTAGTTGTGAATTTCTTCCTTTTCCCCCTCGCATTCCATTTCTATTTTCATGATGATGGTTGTGTTCTTTATCATGCTTGTACTTATAGATGAGGGTGATGATAATGATGATGACGAGGGTGATGATAAGGGTTGCTTTTGACTTGAGAGGAATGAGAGAAACTTTAGAGGCTTTAAGAGGTATTGGAAGGAGGTGTAGACTCTTGCCAAATATATTGCCTCTCTGCAGATATTTGTAACTTAAAGGACCTTTGTAATTATCCATTATCATCCTCTAGGCCTTGTTCTTTTGGACAGTTCTTTTTCTTGTTAGGTCCATTCTTGTTTGACACAGTTTTTTGGTTGTTTTTTACAAACCCTTTTGTATTCTTTCATCTATCTCTTATTGAAAGCTTGGTTTCTTGATGAAAAGACAACTTTGTACATGGGCTTTATTGATGTTTTGAATTCTATTGTGCAAATTATTTGCAAGGCATTAGTTTGTTTATCAATTGCCTTGACTATATCTCTTTGACAAATTGTTGTGGGATATTTGGGGGGAGAGAAACAACAAATTGTGTAGAGGTATGGAGATGTTTGGTCCTTGACGAGGTTCCATGTTTCTCCTTGGGATTTGGTTTCAATTGTAATTATTCTCATCTTGCTTAGTTGGAAACCCTTTCTTTAGAAGGGTTTTTGGGGATTGGTTTTTTGTATGCTCTTGTATTCTTTCATTTTTTTCTCACTGAAACCAATTATTTCTATTAAAAAAATCTCTTTGAGAAATACAAACATTTAGAGGGGAAATAAATCCTTGGACGGGCAATGCAAATATTGAGCCTTAGAAGATTTGTCTTTAAAAATAAACCATTGTTATTAATGCGGTAAAAAACATGGTGCACTTTCTCTCCCTTACCTTTTGCTCAACATAGT
GCTCGTCAGAACCCCCCAATTCTCTGTTAGTCACAATTTAGACTCCTGAGCTTCAACTCATTATTTGTCATGGGTATTTCTTTTTGAAAAAAAAAACAAGATTTCTTAGATTTGATGAGAAGAGACTAATAATGTTGTAAGGTCAGGTGGGTTGTCCCATGAGATTAGTTAAGGTGCGCGTAAGCTGGTCCGAACACTCACGGATAGTTAAAAAAAAAAAAGAAAAGAGAATAATGCTCAAAATACAATGACAAAACAAAAAGACGAAGAGACAAATCCATCTACCATAATACATGAGAACTATGCAGAAAAAACAAAGACATTCTATTTTAAACATAAATCATGAATAGAGTATCGAGCAAAAGACTTGGAGAGAGAACCAGGAGGCGAGGGAGTTATCTGAGCAAACTCAAACCGATCAAACCAAGGAATAGACTTGTCTTGAAAATACGCTGATTTCTTTCAAACCAAATCTTCATGAGAATAGCTTTAACCACATTTGACCGTAACTAGACATGGGTATCACAACAACCTTTAGAAAGATTGATCACATAATATTTACTCGGGGTTTTTGACCAAAGAAAGAAAGGAAAATGTTGCAAGATTATGGAAAGACATTTGGAAAGGCTCTTTGACATCTTCGTTGGAGAAGCAATTGTTGTGCAGTTATTTGACACATTTGAGGATCTCCTGTTTGCTGCCTACTCCTATAAGTTATCTCGAAGAACAAACGACAAAGATTGCTCAATATGGATTCAAAAAATCACTAACAAGAGAGGTACTTTCATCGAAATCACTAAGGGGTGTTTGGGCCAAGGAGTTGGAAAGTGTGGAGTTGTGAACTCCACTACTTGTTCCGCTCAGAGTTTGTAGGTCCCACTACTAGAACTCAAGGACACTGTTGTACAACCTATAGGACAGTTCCTGTTCAAGTAAGGTCGTTTGTGGGTCTCACTACTAAAAAGTATCAATTTTATGTCTTATTAACTCCTTATATCGTGAGCCCTATGAGTTCACAACTTCCTATACTTCATAACTCCTTGGACTTCACAACTCCACTCCTTGCCCCAAACACCCCCTAAAGTACAAAGCAGTTGGAGAAATCTAATTATCCCTTTTGCTAATAAAACAATGGTTGGAAAGTTTTCAGAGAGCTCCTATCTGATTTCTTAATTGAACCACAAAAGGGAGAGGAAGCTTTATCGAAGAAAACTAAAACATAAAGGAAGTCCGTCCTTTGCAAAGGCAGTGAAGGAAAGTAATCAACCTCAAGATAGAAGTGGGGGCCTTTTGAGGTGTGATGACTTGTTTGTCCCAACGATGTAGAGAGAAAGATGTAGAGAGAAAGGAGTTAAAGGGCATTTCAATGGAACAAGGTACTGGTTATTACAAGAAGAGTCTTCCATGATGATTGGGTAAAAATTATCTTCACATTGTTTGTTTTTGAAAGGGGCTGTTTGGATCTGTTTTATTGACTATCTTTTGCGTTTTCTTGCTGCTTTTGTTTGGAGGCCCTCCGCTTTTTTGCTTTTATCTTTCTTTTTTGGCTGTACATCTTTTATTTTGGTGCTATCTTCCCTCACGTTTGGGCTTCATCTGTATTTCTGATGTGTCTCCTTATTGTACTCATTGGATTATTTTATTCATCAATGAAATGTTTCTTATAAATTTTTTTTTTTTTTGGAGTCCCTCGAAAGATAACTAGAAGAAGTGTTTGCCAATAGACCTTTTCCACCTAGATAAATGCTTCTTTTGTTTCCATTCATGGATTGTGCAAAGTTGTGACCAATGAATGTGGGATGGGTCACTTTTGGTGCTTTCACCATAAATAGGAGAGATAGGACGCTTAGGAACACTGAAAAACTTATGCTATCCGTTCATGCTTAGAAGTGGCGATCAAAGTACAAGGAAACTATTGTAGATTTATTCCATTTGACACGAGAATATTGGATGGAGATCAAGCCATCAATTCTTATGTTTGTACTTTTCAAGATAGTGCT
TTACTTGCTGGCGGAAAAGTGGAGTTTTACCATGGTTTCTCATCGACTCCGACTGAAATTTTTTTATGGTTTTGGTGGTAGGTTGAGTCTCAACCTGATAGACTTGTCTCAGTTGCACGATGGAATTAGTCTCCCCACGATAAATCTTTTTACCATGAACTTCCATTTCATAGGAAAAGGACACGAGCTTTCCCAGAAGGGCATCTTGTCAAAGAGTTAGATTTAAGGAGCCATCAATTAATGACTCATATGTATTCCAATTTTTGTCTATGACCATAGAGCCCTTAACTCTTTGCCCCCTTCTTTGAATCAGATTTTATGTCATGCAACCATTTCCTTTTTTAAATGTCAAACAAGACGAAAAGGTTGTTTAAACATGTCTTGAAGCTGCTTTCCCAAATATTTTCTCTATCATTTAAATAGGAAGCTTTGATGGCAAAATGTTGGGATGCAACTTAGGGAACTTGGGATTTGGGAATTAGACGAAGGTTGTTTGATTGCGAGATGTAGGCCTAGGCTGAGCAGTGGGAGGGCTTTTGGTTGGGTCAAAGAAAGGAAGGAATTAGATGGTCCTTTGATGCCTCTAGCTCTTTCTCCACCCTAAGAAGTTGGAGGGCTTTCGATTGTGTCAAGGAGAGGATGGAATTTAGGTGGTCCTTGGATGCCTCAAGCTTTTTGTCCACCAAATCCTTGTATTTTGAATTTGACTGGCTCGCCATATTTTAACATGCACTATGACAAAGGTGCTTTGGGATTTCAAAATTTCAATAAGTGAAGGTTTTCCTTTGGTCCCTAGCACACAGGAGCCTAAATACTCAGGAGAGAATGCAACCACAAAGTGTCCTCGTACCACCTTTACACATCTATTTGCCATTTTTGTTTGGCATGTGGTGGAGTCCAGAGTCCCTAGACCACACCTTCTTACACGGCCTTTTGCTAGACAAAATTGGGATGTCTTTTTGGCCTTTTTGACTTGCATGCTTGTCTTCCCAAGTGGGTGGATGGGTGGTTACCTGAATCCCTCAACAGTTGGAGCTTGAAAGGAAAGTGTTAAATCATTGATTAACTCAAAAGCTTAAGTTGATGGGTGTAGGTAAATCTAATATTATATCATTTAACGCTCCCCTCACTTGTCGGCGTGGAATATGTAGAAGACCCAACAAATGGAAATCAATGTTATGGGGAGAAAATAACATTGTAGGGGTTTGAACATATGATCTCCTGACCACCTTCTTCGATACCATGTCTAATTACCAATTGACCCAAAAGCTTAAGCCGATGAGCGAAGGTAAATTTAATATTATATCATCTAATAGAAAGGCTTTATATGAAGATTTGCTTTTTGATCCCTCTTATGGGGTTTATGGCTTGAATGTAACAAGTAAATTTTGAAGATAAGCCCACTTTTGCCTCTTCTTTTTCCCCTCTTTGGGGTTACTGTATTTTGAGCATTAGTCTTTTTTCATTATTTCAATGAACAGTTTCATTTCTTTAAAAGAAAGATAAGTCCACTTTTTCTACTTTTTGCGAATTTGTACAACTAGATCCTTCACGATTGGAAGGCTTTTTGTCCGTAGCTTCTTGAGTGGGGAGCCCTCTTTCCCCAGCCTTTACGTTGTTTTGTTCCTTTCTTTCTTTCTTTCTTTTGTTACAATCAAACCAAGGAAGGGATTTATCATGAAATATCCTATGATTCCTGTCAGAACCAAATTTTGGAAAGGAGAGCTTTGATTGCATCAATCCTAGTTGAGCTTTATTAGATAGACCATGTGCAGTCTTATTGGTAGAAAAGGAGTAGTTTAGACTGAAACTAGAGTGGATTGGCAGATGGTTGTATGAGGGGAACTTTAGTAGTCTTCTATTCAAGGGAATGATTGTTAAGGTTGCGACTAAACTCCTTCATGGATTAAGAAATGGGAAAGATCAGGCGTGGTTTTTATTTATTTAATTTTTTTGTCTTGACAAAATCTTATATTTATCTGTGAATCATGTATTATGTT
ATTATTATTAGTGCCCTAGAATCAAGGGCAATCATATTGGATTAATTAAACCCTCACCTTCCTTTTCTTGGTTGTAAATTTGTTTTTATTATTCTTGATCTTCTGAATGAGAATGATGTAACTTTATCTGAAATCGACAGTGAAAGTTGAGAATTATCACAGTATCATAAACTGATCAACATGATGTAGTTATTATGTTATTTATTTATTTTTTACGTTTAGTAGATTAATTTTTGTAACCCTCTTTTGTAGGTTTCTTCAAACAAAGTTTTGTTATCATTCCGTGATATACATTCAGGTTTCACTCGTGACATTTTGCCTGGTGATAGTGGGCCTTGTTTACTGTTGAGTTATTTGGATAAATGGTATGCAGAAATGACTTGATCTGATTTAAGCTCTGAATGTTTGAAATTGATTTTTTCTTGAAATTCCTTATTTTTTTTACTACTTTTTCTTCATTTAATAAACTTTTGGTTATCACTTCTTTGAGTTCTTTTTCTCTTTCTGTTACCGTGGGATTATTTTTTAATTATGAACCTACTTGTCAAGTTTATTTATTTTCTCTTTCTTGATTTGGATGCAGCAAGCTCGCAATTGTTGCAAATAGGCTCTATATGGAAGAGCATATTGTGTTGCTTGGTTTGTTGCAAGAGGTTGAGAACGAAGTTGCAGTTATTAATATTGATAGAAATACCTCTCTCCCGAAGATTGAGCTTCAAGGTTAGATCTTGTGATTATTGTTATGAAAGAATTCTGAAACTTTATTAAAAAATGATGCTGATTGCTGAATAGTGAATATAACCTAAAGTTAGATATATAAAGCATACAAGTCAGCCTAACAGCTTGTATTTAGCTCAAAAAATAACTAATCTAGCAGACCTATTCGTTTAGAATGAAATATAACTGGCTCATCAGGAGAAGAGTATCTTTTTATAGCTTGCTGAGTGTGTGCAACCTTATGGGTTTTTGAGGGGAGTAGAATAGTAGGGGTGTTTAGAAGGGTGGAGAGGAACCCTAATGAGGTTTGATCTCTTGTTCACTTTCATGTTTCTTTGTGGGATTCGATTTTGAAGACCTTTTGTAATTATTCTATAGGTGTTGTTTTCCATGATCTTCCTCTCCTTAACGGTGAATACATGTGGTCCTGGGATAGACATCCCCATTCACTCATTGATTGACAGATTCCTCATCACTAAAGGCTGCGTAGATAAATATGGGGAAGGCTACAGAGAGTTACATCTGACCACTTCCCTATCCAACTAACTCTTGATAAGCAGAAATGTGGTCCTACTTATTTCAAATTCCACAATGATTGGATGGAACACTAGGCTGCCTCGGACACGATTTTATTTAGAAACTACCAGGAATTAAAAAAGTGGTTAAAGAATGTGGTATAGAACCCAATTCTTGGAGCAATCTTGAAGAGAGAATTTTATTGAAGTTGTGAAATAATGAAGACAAGCTCCCCTTAAATAGACCCAAGGGGAAAGGGTTGGTTCACCCCTAAACTAACTAGTTAACTAATCTAATCATAATCTTCCCTTAAATAGACTCAAGGGAAAAGGGTTGGTTCACACCTATTCCAAAATTATTAAAATAAAAATACAATTGATTAAATCAAAGGGAATTTCCCAAAATACCCTACATCAAATTTTTCATGTGTAAGGGAGAACTACATTTGTTTGATCAAAAAAAAGGAAGGCGCCACCCTAGTGAAGGATTTCAGACCAATCAGCCTTACCACTTCAGTTTATAAGATTGTAGCTAAGGTGTTAGCAGAAAGAATGAAGAAAGTTATGCCAAGAATAATTGCTCCTACTCAGAGTGCTTTTATTGGGGGAAGACAAATTCTTGATCCGGTCCTCATAGCTAATGAAGTGGCCGAGGAATATAGAATCAAAAAGAAGAAAAGTTGGTTGTTAAAGCTTGACCTTGAAAAAACTTTTGGCCGTGTAAACTTCCTTGAAAAAGTCCTAGTTGGAAAGAATTTCGACCCTAGA
TGGATCTATTGGATTATGGGATGTGTGTCGAACCCAAATTTCTCAATATTTATCAATGGAAAACCAAGGGGAAGAATACAAGCCTCTAGAGGCATTAGGCAAGGGGATCCTCTCTCACCCTTCCTCTTTCTACTTGTTAGTGAGGTACTTAGTGGTTTATTATCAAGGCTACATGATAAGGGCAAATATGAGGGATTTATTGTTGGAAAGGATGCTGTCCATGTTTCTTTGCTACAATTTGCGGATGATACTTGTTATTTTGCAAATATGACAACGATATGATAGAAAATTTAAGAAAGACCATAGAACTTTTTGAGTGTTGTTCGGGGCAAAAAGTTTATTGGGAGAAATCAACACTTTGTGGGATAAATATCGAAGATAGCAAGCTGATGTCAGTGGCAGCAAAACTCAACTGTAAAGTTGACTACCTCCCTATCATGTACCTCGGTTTACCTCTAGGAGGATACCCCAAAAAAGAAGCTTTCTGGCAGCCGATCACTGGGAAAATTCAAGATAAATTAGATAAGTGGAAGAGATACAACTTGTCAAGGGGTGGGCGTGTTACTCTTTACAGATCAGTCCTTTCACACCTCCCCACCTATTATATGTCCATCTTCTTAATGCCAGAGAAGTTGATCTCAACCATTGAACGCGCGATGAGGAACTTCTTTTGGGAGAGACACAAAGGAGGTAAGTTGAATCACTTAGTGAAATGGGAAGTGACTACTAGAACCCAATCTGAGGGTGGCCTTGGAATCGGTGGCTTGAAATCGAAGAATATTGCTCTCTTGGCTAAATGGGGCTGGCGGTTTATGAAGGAAGAAGACTCCCTTTGGTGTCAAGTAGTGCGAAGCATTCATGGAAGAAGCTTGTTCGGTTGGCACACAAGTGGAGAGGTCAAGAACAGTCTTCGTAGCCCATGGAATAGCATCTCAAGGTCTTGGTTAAAAGTAGAAGCTTTCGCCATCTACATATCAAGTCTTCGTAGCCCATGGAATAGCTTCTCAAGGTCTTGGTTAAAAGTAGAAGCTTTGGCTGTCTACATATCAAGTCTCTTAGTGATCTCTGAGGGAGCACGGTATAAGGAAAAGTGACAAGTAGGCATGTTGGAGAGAGTAACTTGAATAAGCGTATGCCAAGCACCCTTAGAGATGAAAGAATATTTCCAATTATGAAATTTTTGATGAATTCGTTCAAAAACCGGTAGCTAAAAGCCAATAGACTTGGAATTTCCACCCAACTGAAAACCAAGATATGTTGCAGGCTAATTAGCTCTTTGACAACCAAAAGTATTAAGGATCCGATCGAATCCGATTCATTGATTTTAGTGTCTAACAACACTGTTTCTCATAAAAAAAAAGATATGAATGCCCAACAAGATTAATTTTCAAACTAGAAGCACACTAAAAAATATGAACTACCTCATAGATGAACCAAACTTTATTCTCTTTTGTCATCATCTTTGAAGCTTTCCCCTAATGCAGCCAAGTTATAATTTCATAAGTAATAGTAACTTTACTTCATTAATAAGATAGAACATGAGACCTCACAGATGAAAGAGGATTGACCAATAGGATGAGAACCAATAAGAACCGATGAACTATGCTTCATTAGACGACTCATACGATCTGCCACTAGAATAAATAAAAAAGGGGAAAGTAGGTCACCTTGTTTGATACCATGAGAAGGAGTAATATTTCCCCTTGGTTGACCATTGATAATGATAGAGAAGTTAACACTTAATTTGAGTCATATCTTTGGTAATTATAGTTTTCTTTGACCTTCCATCCAATTGAAATTCGTTATTCTAATCTAGTGGGTATGATCTGATCATTTTATAATTTCATTTTATCAATGAAATGCTTCTGCTTCCTTTCAAAATAAATAAATAAATAAGTAAATAAAATAACATAAAAGATTAGGAAACTTTGGACTCTCTATATCACATAATATAGCTGTTTCTTGTTTGTTCGGCTGGTTCCTGTTCGACTCCTTTTATG
CTATTGTTTCTGTAGGCCCACTTCTTGTTTTTTGGTCAATGTTGGCCCTCTTGTATTCTTCCAATTTTTCTTTTTCTTAATTAAAGTTGGGCTGTTTTCAAATATAGGAAAATGAACCAAAATATTTACAAATATAGCAAAAAAAAAAAAAAAAAAAAAAACCACCCCCCCCCCCCCCCCCCCCCCCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAGCAAAAAAAAAAAAAAAAAAAAAAACCAACCCCCCCCCACCCACCCACCACATTTGTTGGAAGATTGCTGCACCAAAGCTTTAGTCCCATCTCCTCCCTCGAAGCCAGCTCCTCATTCCTCCTCTTCTCACCACTTCAAGCATTCATTATTCCACATCCCTAATCACAATACCAATTTTATTCGAGGCTCTCGGCAATCCTCCCCAATTTGTCGTCAGAAAAAGGATTACAATTTAGACATTGATTCAATAGTAAGGGCTAGTAGTGAAGAGTTAGAAAGCTTTGGGGAAGAAGATATCCAAGTTTTCTCAGATCAAGTTGATAATTTTGCGGAAGAGCTTAATTATTTGTTCCAAATTGACAAAAGAAATCAATCAGAGGAAGAAGTTTGGAATTCTCATTTTAAGCCACTGCCTCAATCGGAGATTCCGGCACATTTAAAGTCTATAATAGCAGAATGTGGACTGGTTCTAGGTTAATCCATCTATCCAAATCATTTTAGTTGTGACTCTTGGCTCTCCTTGGATCAAAGTTTAGCACAGTGGTTTGAATAGGTTTTGTTCTCTTTCATCAGCTTCAGATAATTGAAGACAGCCTCTTCTCCTTGGTTGACTTTTCTACCAACAGATATCATATTATATTTGTCTTGATATGGCAGCTATGGAGTTGGTGATCCTAAGGACTGTTGTAAATTTCTTGGGCTGTTTTGTTTCGCATTTCTTCCATTTTGGTGGGATTGGTACTTGTCTCTTTTGATACTGTACTTGCGTTCTTTTGGTTGTGTTGAGGGTGAGATTTTCTTACTTGTATTGTACTATGAGATATTATCTCATTTCATTATATTAATGAAAGAGACCGTTTCCTTTAAAAAAAAAAAAAAAAAATCCTTGTATTCTTAAAAAAAATATAGCAAAATTTTATTGTTTATCTACAATAGACCATGACAGATCACGATAGACTACTATTTGTATCTATCTGTTTCATGATAGATATAGATAGTAATTTATCGCGGTCTATCTGCTGATATTTTACTATTATTTGTAAATATTGAACATTTTTGCCATTTAAAATAATTTCCCATTAAAGTTTGATTTTTCG
ATTAAAAAAAGTTTCAATTTGGTAGTTTGAGTTATAATTTGAGACATTACATTATCTGATTATTTTTTATCAATTCTTCTAAGAGTTGGTTGCATTATGTTTTCTTTTAGTGTTTCTCTATCTATTTGATTTTCACGGTTTACTCATGTGCACCCAAAGTATTTTTGTTAGAATGTCGGAGTTCGGTTAGGCAGTCAATTTTTATGTACACATTTATTCATGCTCATAGATTAGGTTTTTATTTTTGTATTCTGCAGCATCTCTTCTTCTGTGATATAATTCTATCTCATTCTTGTTTTGTTCCAGCGAATGGTGATGATAATTTGGTAATGGGGCTGTCCATTGATCGAGTTTCTCTTCCTGGGAAGGTGTTAGTTAGAGTTGGATTTGAAGATACGAGAGAAGTTTCGCCATATTGCATTCTCGTGTGTCTTACTTTAGAGGGAGAGCTCATTATGTTTCAATTTTCTAGGTACTGCCTTTTATGTTTTAAAACTTTTTTTTGTTATTGTACCCAAGGCCGAAGTCTATTTTTCCTTGAATAGATGTTTCTGTCTTTAGCTTTTAATAGCAGCCTGAATACTATGATTATTGCTCCTGATTTCTTTTTACTATGCGTTTTCAACTTTTCATGCAGTGTCAATGAAACTGAAGCTCCACATGAGACTGTTTCTGCTTGTGATGAGGAGGAAGATGATATAATAGTGCCTGCTGACGATCGGTCTCAACTCTTTTCTGAATCAAAGAAAGAGTTTAGAGAAGATGATCTTAAGATGCAGGTTACGGAAAAACTTGCAATCAGTAGTGAGATTCCTCGGGAAAAAATTAAAATCTCAAATGACATTAAGTCTTCTAATAATGATCAAAGTCCAGTATCTAAAATAGATGAGAGTGCAACTGTTAGTGCAGAGAGTAATACTAAAAGCCAGAAAGCGGATTCTTTCATTTATTCACAATCATTAAAGTCTTCTGTCCTGGAGAGACCCAACTATGAGATTGGGAACTTTGATAAGTCTGTTCAAAAATTTGGTCTCGGGTCTGTTTCTATTTCAGGTAAGCCTGCGGATGTGCATAGCCAGCCCTTTCCCAATGTAAAAGAATCAACGAAAAGATTGGTGTCAACTGGCTTGTTGGCTGCATCTGAGTTATCCAGTGATAAAGCAATGTTTTTAAATAAAATCGATCCTGTATCTTCAGTCCTAACTCCGAATTCTTTTCAAAGCAGCAAGACTGAGAATTATGGGCCAAGTTTTGGTACAGCGAATGCTTTTGCAGGTTTTTCTGGAAAACCTTTTCAACCGAAGGATGTTCCAAGTACATTAACACAAAGTGGGAAACAAGTAACGGGAGGTGCTGGTAAAATTGAATCTTTACCAGTGATACGTAGCTCACAAATATCATTGCAAGACAACTTGTCGGCGAAAATTTCTAATGAGAAACATGATGGTTCAGACCGAAATTACAGCAATGCCCCCCTGGCAAAACCAGTAAGTTCTGAATGAAATTTATTCAAACAATTTGCCAATGTGAACATACTTGAGTCTAATACAGGAATGACTTGGTTTAAAATTTGCCATTCTTTGGCTTGCTCATACACCTCCAATCCATCGTCTTTGGAAAAGAAATATTTATAAAAAAAATACACATCCACTCCATTTTTACTAGTATAAAGGGAAAAGTGGATAAAATACTTGCATGAGCTTTCCTGTACAGATATATTCCTCTTAGAAACATCTTGAATTAGAAAATGGGATATCCCCTTTGTATTTTTATGCTTTGGTTGAATGATGGTCCCTGTATGTGACTAACAAAATAACGAGGCCCATGCGTTTTCATATATGACTTCATATTGTATAACTAACATCTCTCTGTTTCGTTTACTCTGGATAGTTATTGGTTATGACAATTTTTGATAGCTGTGTTACCCTTTTGTAAATACATAGGCAACTAAAAAGCACCCCACCCCCATGTGTTTCCCCTTGTAATCTCTTAAGATTTTT
TCTAGTTTCTTCAATACTCCCTTTTCCCTACAGATGAAAGAAATGTGTGAAGGATTGGACATGCTTCTGGAGTCTATAGAAGAGCCGGGTGGGTTCTTGGATGCCTGCACCGCTTTCCAGAAAAGCTCTGTTGAAGCTTTGGAGCGTGGCTTAGCCAGTCTTTCAGACGAATGTCAAATATGGAAGGTAATTGTCAATTGTTATTTTATATTTTCAGTTTGTTAAGTATTTTCTCCTTTATTTTGTTAGACTGAGAATATAGTTATTAATTAGTATAGCCTCATTATTTATGGCTAAAAGTACTAGATACAGAAAAATGAGAAAATGTGAAAAAGAATGTATGGAGTAGAAAGTGGCTGCAGGGCTTCTCATCCTAATAGCAATGGTGTGTGTAAGACATAGAAGAGGAAAGAATAGGGAAATGACAATAGCTTGATGTAAGCATGACATAAAAGAAAATACTGGAAAAAGAAAATGTATGCTATGAGATGCCAACTGATAATCAACCAACGATGTTTGACTAGAAAATTTAAGAAAGAGCTGATAAAAGGTAATAGGTAATTGATTTACAAAGTACTTGATCTTAGAAAACATCAAAAGTATAATTCAGGTTTCAAGTCTTTTCTAGGAATGGGACTCATTGAAGATTTATTGCTATGATTAGTTGCTTGAATTTCTTTATTGATTATTCACTCAGTTTGCTGTTTTGGTGGATTTAATGGGGTTTCTGTAGATATCTAATGTTGCATTTTTCTTTTGCTTATTATCCTTGGAATAATTCGTTGTTTTGGATTAATTCTATCAGTAGTCACTGTAGTGATTAATTCTTGTCAGAGGAGATTTGGTTCGCACTCAGGTATAGCAGATAAATATTGTGGACCCTGGCTAGATCTATCCCTTTTTGAGCGTTAGTGTCAAATTCTTTTTGAAATTATCCTTTAGGTTTCATTTTGCTTGATTGGACTCCCTTTCTTTAGTGGCACTCTTCCTTTTTGTGGGCTAAGCATTTTTTGGATGCCTTTGTTTTTGTCTTGTTTTTTATTTTTGTTTTTGTTTTTGTTTTTTCATTTTTCTTCAATGGAAGTTTTGTTCTTTATTAAAAAAAATGTGTTTGAATGCCCTCCCTTTCCTCCCTCAAAGTTTAGGTTATTTTTAATTCTTGCAATTTGAATTATATGCTTTAATTTTGGGTTGTTTGTTTAGGGCCCTTTTTCTTTTCTTCTTTACTATATCCCTTTCCTCGCTATCAATACAGTTATCTAACATGCGAATTTCATTTTCCTTCTCGTTGTAGAGCACAATGAATGAGCGTGCACAGGAGGTACAAAATCTCTTTGACAAAATGGTACAAGGTATTGGAACCTCAATTTCTTGTTTGTTTATCCTACATTATTGATGGTGACCCTGTTGCATGAAACGCTCCTTGCCGATCTAGCTAATCTTGGAAAAGTGCCTTTGTTTTTTAATGCATTGAATATCTGATCGTCTCAAAGAAATAGAACTATAAAGTTCCACGTATATCCCTTCCAAATATTCAAGCTAAAATTTATCTCTAAAAATTCTTATGGGTTATCATGGGAATTTGATTGAGATTTGCAAATAATAAGGTGGGAGATCAGGAGTACTGGGATTTGGTTGAAATACGTAGCGATTATGTCAGTCCACTCATCAAGTTGACTTGGTTAAACATTAAATTGTAAATCATAATCAATTAATTTCTGGTCACAATTTCATATATATTTTACATTCTCAGTTTATTTGGTTTCAGTTTTGTCAAAGAAGACGTACATTGAAGGTATTGTGATGCAAGCTTCTGACAGCAAGTATTGGGAACAATGGGATCGTCAAAAGTTGAGTTCAGAATTAGAGTTAAAGCGACAACACATCTTAAAGATGAATCAGGTAGTCTATTCTTGTTCCGAAGGCCAATGAATTTCCCCGATTTCAGATGTGATTTATCCAGGCCTTTCTTCTCTTTTCCAGAATATGACTAACCAGTT
AATTGAGTTAGAAAGACATTTCAATGGCCTTGAATTGAATAAGTTTGGTGGAAATGAGGAAAGTCAAGCGAGTGAAAGAGCTCTTCAAAGGAAATTTGGTTATTCGAGGTACTTCCAATCTTTCTTTTCTGGCTCTTAGATGTAGAGTCTTCTCAATTCTTGTTCTCTCTTTGTTCTTTGGTTCTATTGTTTTGCTTCTATGCTTTTATTTACTCACTCATTTCCAGAGTTTGTATCTGTTGAGCAATAGTTTCTTTTCATTATATCAATGAAAAGTTTTATTCCTAATGTTGAAATCTCAACTGTCTTCATGATTATTGAAATGAATTAATTTGGCTGTTTTTAATTATAGGAAAATGAGCCAACTTATTTACAAATATAGCAAAATGTCACTATCTATCAGTGACAAATGGTGATAGACACTGATAGATAGTGTCAATGATAGATAGTGACATTCTGCTATATTTGAAAATATTTCCAGCAATTTTGCCATTTAAACCAATTACCTATTAATTTATTGTCTCTGTTGTCTTTCACTTTTCAGCTGATTTTTCCTTGAGTAGCCTGACACAGGTTTCTTCATCAAAATTTGTTGCAGGCATAGTCATTCATTACATAGTTTGAATAACATAATGGGATCTCAATTAGCAGCAGCTCAACTTCTTTCTGAAAGTCTATCAAAACAATTGGCTGCACTCAATATAGAATCACCCTCTTTAAAAAGGCAGAGTGTCACGAAGGAATTGTTTGAGACTATTGGACTTACTTATGATGCTTCTTTCGGTTCTCCAAATGTGAACAAAATTGCAGAAGCTTCTAGCAAGAAGCTTTTACTTTCTGCTGATTCTTTTTCAAGTAAAGATACACCGAGAAGAAAACAGCAGAGTGGAAGGAAAAATTCTGAAGCAGAAACTGGGAGAAGGAGAAGAGATTCACTCGACAGGGTACTGTTTTGTTGAACTTCTATTTATCTAGTTTTACTTGTGGTTCATATAGGATTTTTTTTGTTAATTGAAAATGAGATTCATGTTACATGAATAATATAAATTTGGAAAGTGCCATACACGGAAGATGGTCCAGAACTTGGTAAGCTCTCTGATCACTTTTCTTCATGTCTGCCCATGGGCTTCTTTCATAGAATGTAGTTAGAACAGTGGTTAGAGGATGCGTATGGTCCACATTTTCCTAGATGGTAACTAATGAGTTCTTGTATAAAAAGTATCAGAGTGGGGTCACTCTTGCTGACAAGAAACCCTTTACAAGATTGCTTTTAGCTATATTTAGGGCTTCCCCTTCTAACATAGTCACATAAGATTTAAGAGAACAATTCCCAAAAGCCCTGAGCTACTTGCAAGAGTTACTATTGTACTATTATGTTTTTGAGTTTTGCATTCGTCATGTTGTCATGTTTATTAGTCGTTGGGTCTGAAAATACAAGTATACTACCAACCATCTACTAGTCAAATTTAAAATATTTGCATCTAAGAATGGCCATCTGTAATAGCAGCATCTACATGAAGAATGTGTAAGACCCCCAATCCTTAAGATGTATCAAAGAAGGGGTAAAAAGGGAAGTAAGGGAGAGAATTGCATCAAGGAGAAAGAGAATTGTATTAAGGAGAAAGAGGGGGAAGTTAATTAGAGGAGGGACCACTCATTGTTGGGTGGGCTATATGGCCCATGGGAATGGAGAGGTTTTGCATGACAGGGGAGGGGAGGCTGATTTTGGTGGTGAATTGCAAGCTTGGCTTGTAGGAGAGGATTTCTAGCCCTCCGTGAAGTGTTGGAGTGATATTTCTCTTTTTCATCTTCCTTCTTGTTTTCATGAGAACTTCTGTTGTGAATTTCCCTTGAATTATACATAAATAAGATCAGTTGAGCTTTGCTCTGTTTCTTGGGAATTCTTGTTAGGAAAGTCTGTTGCAGGTGCGGTTTCCTAACAAATTGGTATTAGAGCCATTGTTTTTTCTTTTTTATCTTGGGAAGAATGCATAGAGAGA
CGAGGCGAGAAATGGAGGAGAAGATTGATAGTCATTCCAAGAGTCTTGTGGGGTTGAAGGAATGGATGATTGAAATGGAAAAGATAGTGGAACGCGTGGATAAGATGGTCTAGATGAATCTAAGATCGGATCTGTGAAGATGATGAAGGAGAAGGTGTTAATCGCAACAAGTATAAATGGTTGGAGATGCCCATTTTTTCTTGAGAACACTCAGATTCGTGGGTTTATAGAGCTGAGCACTACTTCGAGATTGATGAATTATCTGGCACTGAGAAGATCAAAGTAGCGGTAATAGCCTTTGCCAAATGTGGTTGATTGGTTTCAATGGGCTCATCAGCGAAAATCGATCAGATCCTGCGAAGCTTTAGTGCATAGGATGTTTGAAAGATTTCGATCGTCTCAAGAAGGGTTCCCTGCTGTCTCGATTGATGTGAATTAAATAGGAAGGGACGTATGAAGAGTATTGGAAGAAATTTGAGTCGTATGCGGCCCCAATTCTGGAGACGGTAGAAAATGTGTTACAGGAGGCATTCATGAATGGGTTGTCGCTAGAAATAAAGGCAGAGGTGATGAGCAGACATTCGGTGGGCCTTGATGAGTGCATGGTGGAGGCCCAAGTTGTGAGCGATCGTAATCTGGCACTGAAGTTGGATGAAGAAGAATTGGGCCTAAGTAAGCTTGTGGCCCAACAACAAACAGGTAAACAAGTCGAGGTAGGGGGAGCGAAGACACAAGGTAAAACCCAACTCGGGGGATGACGAAAAATGTCCTCCTACCTGAAAAATGGGGAATCTAGAAAAAAGAACAACCCTACTGAAAACTATTGGATTCTGAGATTTGAGAGAGAAGGGGCTATGCTTTTGGTGCGACGAAAGATACTTCCACCACAAGCGTAAGACGAAAGAGAAGCGAGAGCTAAATTTATTGATCGTGCACGACGAAGACGAAACCGACAAATCAGAAGTGAAGGAGACCGAGGAGGAAGAACCAGAGGTGAAAGTCATGGAAGTAGCGAACAACGTTGAGATCGCGTTACGCTCGATTCTGGGATTTTCTACTAAGGGAATAATGAAATTAAAAGGGTTGATAGTCGGTAGAGAAGTGATAGCGATGATCGATTGTGGTGCCACACACAATTTCATACATCAAAAATTGGTGGATGAACCAAATTTGCCTCTCACGCTAACATCAAAACTATGGGGTGGTAGTAGGTAATGGAAAGGCATATCGGGGAAAAGGCATTTGCCAGGCAGTTGTTGTGGTATTGCTCGAATTAATGGTGACAGAGGATTTATTGCCATTTGAATTAGGAAGGGTGGATATAATTTTAGGAATCATTTGGTTGTGCAATATGGGATACATGGAAGTCCATTGGCCTAGTTTGACCATGACGTTCATGATCAGAGATAGGAAGATAACTTTGAAGGGGGATGCTTCATTGACAGCAACAGAGGTCACTCTTAAAACGTTGACTCATAGATGGGAGGAAGAAGATATGGGGTCCCTTGTAGAATTCCAACACATGGAACCAGAGATAGAAGAAGAAAAGAGACAAGTTCCAATCGGTAATGAATAACAACCACCGCCGATAGGAATTCAATGCTTATTGGACGAGTATAAGGATGTCTTTGAGTTTCCTATGGCTCTACCACCCAAAAGGGTGGTGGATCATTGAATTAAGTTAGAAGATGTGAAACCAGTCAACGTGCGACCTTATAGATACAGGCATACACAGAAGGATGAGATCAAAAAACTAGTAAATGAGATGTTAGCAGCAAAGATTACCAAGCCATCGTCCTTACTTGAGCCCTATTTTATTGGTCAAGAAGAAAGACGAGGGATGGCGTTTTTGTTACTGGAAGTTAAATCAATCAACCATAGCCGACAAGTTCCCAATTCCTGTAATAAACGAATTGATTGATGAATTGCACGGGTCGGTAATTTTTTTGAAGTTGGATTTGAAGATTGGTTATCATCAAATACGAATGTATGAGCCATGT
ATAGAGAAGACAACCTTCTGTACGCATGAGGGGCATTATGAATTCTTAGTAATGCCGTTTGGATTAACCAATGCTCCGGTGACCTTCCAATCAATAATGAATCAGGTATTCCGCCCTTTTTTGAGGAGACGTGTTTTGGTTTTCTTTGATGATGTTTTAGTATATAGTCCCGATTAAGATACCCATGTCAAACATCTGGGAATGGTGTTAAATGTACTGCGTGACAATAAACTCTACGCAAGTAAAAAGAAATGTGTGTTTGGGCAAGAATGAATTCATTACTTGGGGCACTGGGTATCAACTCATGTAGTGGAAGCAGATGGAGATAAGATTCAAGAGCGGCGATACGATGGCCAATACCGAAAACAGTATCTGAATTGAGGGGATTCTTAGGACTCACTAGATATTATTGAAGATTTGTAAAGGATTACGGATTGATTGTGGCTCCTCTGACGAAATTGTAACATAAAGACGCCTTTAAGTGGGATGATCAAGACATTGAGGCTTCCTGTACTTACTCTACCAAATTTCGATCTTCCTTTTGTGATAGAAACAAATGCATCAGGCTTTGGTTTGAGGGCTATGTGCAAGGGGAGAGACCGATAGCTTACTTTAGTCAGACATTGTCTATGCGCGCCCAAGGAAAGTCTATTTATGAACGTGAGCTGATGGTAGTGGTGTTGTGGCACAAAAATGGACGCTTTACCTTTTGGGAAGGAAATTCACAGTGATATCGGATCAGAAGGCGTTGAAATTTTTATTAGAACAGTGTGAAGTACAACTGCAGTTCCAGAAATGGTTGACTAAACTCCTCGGATATAATTTTGACATTGAATGTAACTAAGTTCTTAGTGGAGCAAGGGGTGTTATACTACAAAGGAAGGTTAGTGCTATCTAAATCTTCTTCCTCATTCCAACCTTGTTGCAGACTTTTCATGACTCGGTACTAGGAGGCCATTTGGGGTATTTACGAACATATAAAAGAATGTCGGGAGAGCTTTATTGGAAAGAGATAAAAGAGGATGTAAAGAAATATGTGGCCGAATGTGTAATTTGCTAGAGGAACAAGAGTGAGTCAGTGTTGCTAGCAAGTCTTTTGCAGCCTTTACCAATACCGGATAGAGTTTGGGAAGACATTTCAATGGATTTCATGGAAGGACTTCCAAGATCAAAAGGGTATAATGCCCTAATGGTGGTGGTTGACAGATTAAGCAAATATGGGCACTTTATTCCAATGAAACATCCCTTCACAACCAAAACAGTTGTCAAAGAGTTCATTCGTGAAGTGGTGAGGCATCATGGATTCCCAAAATAGTTGTCAAAGGGTTCATTCGCGATAAAATTTTTGTAAACAGTTTTTGGGTTGAACTATGTGCTGTTCATGGGACGGTGCTCAAACGGAGTACAACATTCCATCCTCAAACGGATGACCAGACAAAGAGAGTCAACTGGTGTGTAGAGACATACCTTCCGTGTTTTTGCAACGAGCAACCCACCAGTTGGTTCAAATGGATTCCATGGGCTGAGTATTGGTATAATACAACCTTTCAAAGTTCAATACACATGAGTCCCTATCAAGTGCTATATGGACGGCCTATTCCTGCACTAGTGTCATATGGTGATAGGAGGATTACTAATGATACCCTGGAACAGAAGCTTGTGGATCGAGATCGAGCACTGATAGCTCTAAAAGAGCATCTGGTACTGGCCTAAGAAAGGATGAGGAAATACGCCGATCAGAAGAGGCAGGATGTTCAGCTTGAATTGGATGACATGGTTTTCTTGAAACTCCGACCTTATAGACAGTAGACACTGGCTCGAAGACGATGTGAAAAATTGGCTCCTCGATTCTATGGACCATTCAAGGTAATTGAGAAGGTGGGGGAGGTCGTGTATAAATTGAAGCTTTCGGAAGATGCAAAAAGACATAATGTCTTTCATGTTTTCGCAACTCAAAAAGTATGTGGGGTCAACTACCCGAGTACAAGCTACACCTCCAG
ATTTTATCGAATGATTTTGAATTACAGATGGTTCCCGAGAAGAATTTGGGTGTTCATTGGAATAATGACATGGTGAAAGAAGAGTGGCTTATTAAATGGCAGAAATATCCAAAGAGTGAAGCAACGTGGGAGATTGCGGATTGGCTGAAACAGTAGTTTCCAACTTTTCACCTTGAGGACAAGGTGAATGACAACCCGGGAGGTATTGTAAGACCTCCAATCCTTCAGACGTATAAAAGAAGGGGTAAAAAGGGAAGTAAGGGAGAGAGTTGTATTAAGGAGAAAGAGTGGGAAGTTAGAGGAGGGACCACTCATTGTTGGGTGGGCTATGGCCCATGAGAGTGAAGAGGTTTAAGAGAGGTGTCACGTGGTAGGGGAGGCTGATTTTGGTGGTGAATTGCAAGCTTGGCTTGTAGGAGAGGATTTCCAGCCCTCTGTAAAGTGCTGGAGTGATATTTCTCTTTTTCCATCTTCCTTCTTGTTTTCATGATAACTTCTGTTGTGAATTTCCCTTGAATTATACATAAATAAGATCAGTTGAGCTTTGCTCTGATTCTTGGGAATTCTTGTTAGGAAAGTCTGTTGTAGGTGTGGTTTCCTAACAGAATGATATTCATGTACTTCATAAAAAGCCCATGGTCTCTGAAGTCATGTGTTGCGCTCAGGAATGCACGAACTGAGTGTTCTTTGAATTTTTTATCTCCATTCTCCCAAGTTGATAATTCACCTATCTGTTCTTAGCAGGTTCCATCTGCATGGTTCTGAGTACTTGTAGGATTATTCGGAGTATTGGCTTTTCAGAAACTCTGGATAGAAAGGAATTACCCCATTTCTTAAATGATACAAGTTTGCAGTGGCTCTTCATTTTATTTTTCTTTTGAATGATACAAGGCAGTTGGAGAATCTTTTTTATGTTTATTTTTGCTTGGACGGAGAGTATACCTTATCCCTTCGGGACACTTTTGCATATTAATGTACCGATATTTCCTTGATAATATATCTTCATATCACACTTACTTAATCTTCTGAATTTCCAAGATCAAAATCGACACATAAGATTTGGAAGCATCCAAGGGATCAGATATCTAGGGATTCAAAATGGGATAGTTTATCTAGTACTTTCTCTTTAAGATGATTTCCTTCTTGCCATTCAGTTTTTTATTTTTATTTTTATTTTTATTTTTTTTACAATTAATAATAATAATAATAACTTTTAAAAGAAAAGTAACGTCAAGTTCATTGTCTATGTGGTTATAAGCAATTATTCTTGCTCCTTGATAGATTTTTATAACTAACATATCTTGCAACGAAGTGCACTATCTGACATGGCTGATTCTGTTTTGTACTCAGTCTAATAATTAAGCAGACTTATATGCTTTTGTGTCCTAACATTTATTCCCAATAAAATTGATGCAGAACCTGGCTAGTGTTGAACCTCCAAAAACAACTGTTAAGAGGATGCTTTTGCAAGGAATACCCTCCTCTGATGAGAAACTATTTCGGTCTCGCACACCTGAAGGGGCAGCAACAGTTGCATGGCCAGCTAGTCGATTAACATCATCTATGTCATCATCATCATCCAAAAATGCAGGTAATCCCTAAGATGAAGCATAAGCAGAAGGTTTTTATAGTCATTTGCTTGCCCGCAGCTTTCACAAATTCTATAAAATCACTTGTACCTTTTGAGTTTTGAATGTTTCAGTTTGAGGGTATTAGTAGGAGATTTCTTGGTAACAGTAAAAGGATGGTAATAAGGGCATAGTGATAATTAGCTAGAGGAGCTTTGGTTATAAATAAGGGAAGTTGTGCACCTTAGGAGCGGGGGGGATCAGTTTGGTATCCTTTGTATATTTGTTGAGGGAGAGACATAGTCTTCTTGAATGGCTATTGATATTGCAATAAAGATCTTTGATGTTTTCATATATTTTCTGTGTTTTGGTAACCTGACATCCAACTTATATTTATTTTACATGTCCAGGACATGACTCTGAGAATCCAGC
AACTCCTTTCATGTGGGCTAGTCCTTTACAACCATCAAATACCTCCCGTCAGAAATCTCAGCCATCGCCAAAAACTAATACTACAGCGCCATCTCCACTGTCAGTATTCCAATCATCACATGAAATGCTGAAAAAAAGTAATAATGAAGCTTTCAGTGTGACTTCAGAAAACAAATTTATTGAAAAGTCGAAAGCTTCTGATTTCTTCTCAGTCACTAGGAGCGACTCTGTCCAGAAATCTAATATAAACCTTGATCAGAAATCATCCATCTTTACGATATCATCTAAGCAGACGCCCACACTGAAAGATTCTATTAATACCTCTAATTCGGACAATCAGAAGACTGCTAACCCAAAAGAGAGGCATACAACTACAAGTCCACTTTTTGGATCTGCAAATAAACCTGAATCTGCATCTGTTGGTACAATGTCTTCTCTGGTTCCTACTGTTAATGAAGCAAGAAAGACTGAAGAAAAAAGATCGCCGACAATGATTTCACCATCAGTTCCAGCACCAGCACGGTTAAATACTCCAAGTTCATCAACTTTATTTTCAGGATTTGCTGTAAGCAAACCTCTTCCAAGTTCTGCTGCTGTTATAGATCTCAATCAACCTGTGTCAACATCAACCCAATTGAACTTCTCCTCCCCAGTTGTTTCTGTTTCTGATTCCCTATTTCAGGCACCTAAGATGATATCAACATCATCTACTCTGTCTTCCTTGAATCCTTCATTGGAGTCCTCGAAAAAAGAGTTACCTGTTTCAAAATCAGATGATGATACTGAAAAGCAAACACCAGCTTCAAAGCCTGAGTCTTATGAACTGAAATTTCAACCTTCTGTAACACCTGATAAAAATCATGTAGAGCCAACTTCTAAAACCCACACAGTTTCCAAAGATGTTGGAGGACAGGTTCCAAATGTAATAGGGGATGCTCAACCACAACAGCCATCTGTTGCTTTTGCTCCATTACCTTCATCAAACTTAACTCCTAAGATTTTTGGTAATGTTAGAAATGAAACTTCAAACGTGACGGTTACTCCGGATGATGATATGGACGAAGAGGCTCCAGAGACGAATAACAACATTGAGTTTAGTTTGAGCAGCTTGGGAGGATTTGGAAATAGCTCCACCCCTATGTCAGGTGCTCCTAAAACAAATCCATTTGGTGGTCCATTTGGTAATGTGAATGCAACCTCAATGACCTCTTCCTTTACTATGGCATCTCCTCCAAGTGGAGAGTTGTTTCGGCCTGCATCGTTTAGCTTCCAATCTCCATTGGCTTCACAAGCAGCATCACAACCCACAAATTCAGTTGCATTCTCTGGTGGCTTTGGCTCTGCAATGGCTACTCAAGCCCCGTCGCAAGGTGGGTTCGGTCAGCCTGCTCAGATTGGAGTAGGGCAGCAAGCTCTTGGTAATGTTCTTGGTTCATTCGGACAATCAAGACAGCTTGGTCCTAGTCTCCCTGGAACTGGTTCAGGATCCCCTGGCGGTTTTGGTGGTGGGTTTACCAGTATGAAACCGGTTGGTGGGTTTGCCAGTGTTGGTTCAAGTGGTGGTGGTAGTGGTGGGTTCGCTGGTGTTGGTTCAGGGGGTGGTGGTGGGTTTGGTGGTGTTGGTTCGAATGGTGGTGGTTTCGCTGGCACAGTCCCAACCGGTGGTGGATTTGCTGGTGCTTCCTCTACAACGGGAGGTTTTGCTGGTGCTGCAGGCGGGGGTTTTGCAGGTGCCGCAGGTGGATTTGGGGCTTTCGGCAGCCAGCAAGGAAGCGGGGGTTTCTCTGCTTTTGGTGTTGCTGCTGGTGGAGCTGGAGGAACTGGAAAACCTCCTGAGCTTTTCACCCAGATGAGAAAGTAG
mRNA sequence
ATGGCTTCCGTTGATTCCCGACCTTCAACCTTGATTCCATTAGAAAACGCCGGCGAAGGAGAACAAATTGTAAGGAACGATTTCTACTTCCAAAAGATCAGCAAACCTGTTACCGTCAAGCTCTGCGACTCCATCTTTTATCCCGAAACCCCTCCCTCTCAGCCTCTTGCTCTCTCCGAGAGTTTCGGCGGAACAACATCGGTTGCTTCCAAAAACCTTCTGCAGAGCTTCTCTCTCGCTTCAGAGAGAAGCCTTTTGCAATTCATGGCTTCCGTTGATTCCCGACCTTCCACCTTGATTCCATTAGAAGACGCCGGCGAAGGAGAACAAATAGTAAGGAACGATTTCTACTTCCAGAAGATCGGCAAACCTGTCCCGGTCAAGCTCTGCGACTCCATTTTTGATCCCCAAACCCCTCCCTCTCAGCCTCTTGCTCTCTCCGAGAGTTTCGGTCTCATCTTCGTTGCGCATTTGTCTGGTTGGACCAAGGATGTAATTGCTTCGGCCGAGGAGATAAAAAACGGGGGAACTGGTTCTTCTGTCCAGGATTTAAGCATAGTGGATATTTCCATCGGAAAAGTTCACATTCTAACTCTTTCCACGGATGATTCCATTCTTGCTGCCATCGTAGCTGGTGATATTCATCTTTTTTCAGTCCAGTCGCTGCTTGATAAGGCAAAAACACCCTCTTCTTCTTGTTCATTAACTGATTCCAGTTTCATCAAAGACTTCAAATGGACCAGAAAGTTGGAAGATTCTTATCTGGTTCTTTCAAAGCATGGACAGTTATATCAAGGATCGGCGAATGGGCCTCCTACACATGTGATGCACGATATTGATGCCGTTGACTGTATCAAGTGGGTTCGTGCTGATTGTATCATCATAGGATGCTTTCAAGTGACTGCAACAGGTGATGAAGAAGATTACTTTGTCCTAGTTATCAGAAGTAAAGATGGAAAAATCACTGACGTTTCTTCAAACAAAGTTTTGTTATCATTCCGTGATATACATTCAGGTTTCACTCGTGACATTTTGCCTGGTGATAGTGGGCCTTGTTTACTGTTGAGTTATTTGGATAAATGCAAGCTCGCAATTGTTGCAAATAGGCTCTATATGGAAGAGCATATTGTGTTGCTTGGTTTGTTGCAAGAGGTTGAGAACGAAGTTGCAGTTATTAATATTGATAGAAATACCTCTCTCCCGAAGATTGAGCTTCAAGCGAATGGTGATGATAATTTGGTAATGGGGCTGTCCATTGATCGAGTTTCTCTTCCTGGGAAGGTGTTAGTTAGAGTTGGATTTGAAGATACGAGAGAAGTTTCGCCATATTGCATTCTCGTGTGTCTTACTTTAGAGGGAGAGCTCATTATGTTTCAATTTTCTAGTGTCAATGAAACTGAAGCTCCACATGAGACTGTTTCTGCTTGTGATGAGGAGGAAGATGATATAATAGTGCCTGCTGACGATCGGTCTCAACTCTTTTCTGAATCAAAGAAAGAGTTTAGAGAAGATGATCTTAAGATGCAGGTTACGGAAAAACTTGCAATCAGTAGTGAGATTCCTCGGGAAAAAATTAAAATCTCAAATGACATTAAGTCTTCTAATAATGATCAAAGTCCAGTATCTAAAATAGATGAGAGTGCAACTGTTAGTGCAGAGAGTAATACTAAAAGCCAGAAAGCGGATTCTTTCATTTATTCACAATCATTAAAGTCTTCTGTCCTGGAGAGACCCAACTATGAGATTGGGAACTTTGATAAGTCTGTTCAAAAATTTGGTCTCGGGTCTGTTTCTATTTCAGGTAAGCCTGCGGATGTGCATAGCCAGCCCTTTCCCAATGTAAAAGAATCAACGAAAAGATTGGTGTCAACTGGCTTGTTGGCTGCATCTGAGTTATCCAGTGATAAAGCAATGTTTTTAAATAAAATCGATCCTGTATCTTCAGTCCTAACTCCGAATTCTTTTCAAAGCAGCAAGACTGAGAATTATGGGCCAAGTTT
TGGTACAGCGAATGCTTTTGCAGGTTTTTCTGGAAAACCTTTTCAACCGAAGGATGTTCCAAGTACATTAACACAAAGTGGGAAACAAGTAACGGGAGGTGCTGGTAAAATTGAATCTTTACCAGTGATACGTAGCTCACAAATATCATTGCAAGACAACTTGTCGGCGAAAATTTCTAATGAGAAACATGATGGTTCAGACCGAAATTACAGCAATGCCCCCCTGGCAAAACCAATGAAAGAAATGTGTGAAGGATTGGACATGCTTCTGGAGTCTATAGAAGAGCCGGGTGGGTTCTTGGATGCCTGCACCGCTTTCCAGAAAAGCTCTGTTGAAGCTTTGGAGCGTGGCTTAGCCAGTCTTTCAGACGAATGTCAAATATGGAAGAGCACAATGAATGAGCGTGCACAGGAGGTACAAAATCTCTTTGACAAAATGGTACAAGTTTTGTCAAAGAAGACGTACATTGAAGGTATTGTGATGCAAGCTTCTGACAGCAAGTATTGGGAACAATGGGATCGTCAAAAGTTGAGTTCAGAATTAGAGTTAAAGCGACAACACATCTTAAAGATGAATCAGAATATGACTAACCAGTTAATTGAGTTAGAAAGACATTTCAATGGCCTTGAATTGAATAAGTTTGGTGGAAATGAGGAAAGTCAAGCGAGTGAAAGAGCTCTTCAAAGGAAATTTGGTTATTCGAGGCATAGTCATTCATTACATAGTTTGAATAACATAATGGGATCTCAATTAGCAGCAGCTCAACTTCTTTCTGAAAGTCTATCAAAACAATTGGCTGCACTCAATATAGAATCACCCTCTTTAAAAAGGCAGAGTGTCACGAAGGAATTGTTTGAGACTATTGGACTTACTTATGATGCTTCTTTCGGTTCTCCAAATGTGAACAAAATTGCAGAAGCTTCTAGCAAGAAGCTTTTACTTTCTGCTGATTCTTTTTCAAGTAAAGATACACCGAGAAGAAAACAGCAGAGTGGAAGGAAAAATTCTGAAGCAGAAACTGGGAGAAGGAGAAGAGATTCACTCGACAGGAACCTGGCTAGTGTTGAACCTCCAAAAACAACTGTTAAGAGGATGCTTTTGCAAGGAATACCCTCCTCTGATGAGAAACTATTTCGGTCTCGCACACCTGAAGGGGCAGCAACAGTTGCATGGCCAGCTAGTCGATTAACATCATCTATGTCATCATCATCATCCAAAAATGCAGCAACTCCTTTCATGTGGGCTAGTCCTTTACAACCATCAAATACCTCCCGTCAGAAATCTCAGCCATCGCCAAAAACTAATACTACAGCGCCATCTCCACTGTCAGTATTCCAATCATCACATGAAATGCTGAAAAAAAGTAATAATGAAGCTTTCAGTGTGACTTCAGAAAACAAATTTATTGAAAAGTCGAAAGCTTCTGATTTCTTCTCAGTCACTAGGAGCGACTCTGTCCAGAAATCTAATATAAACCTTGATCAGAAATCATCCATCTTTACGATATCATCTAAGCAGACGCCCACACTGAAAGATTCTATTAATACCTCTAATTCGGACAATCAGAAGACTGCTAACCCAAAAGAGAGGCATACAACTACAAGTCCACTTTTTGGATCTGCAAATAAACCTGAATCTGCATCTGTTGGTACAATGTCTTCTCTGGTTCCTACTGTTAATGAAGCAAGAAAGACTGAAGAAAAAAGATCGCCGACAATGATTTCACCATCAGTTCCAGCACCAGCACGGTTAAATACTCCAAGTTCATCAACTTTATTTTCAGGATTTGCTGTAAGCAAACCTCTTCCAAGTTCTGCTGCTGTTATAGATCTCAATCAACCTGTGTCAACATCAACCCAATTGAACTTCTCCTCCCCAGTTGTTTCTGTTTCTGATTCCCTATTTCAGGCACCTAAGATGATATCAACATCATCTACTCTGTCTTCCTTGAATCCTTCATTGGAGTCCTCGAAAAAAGAGTTACCTGTTTCAAAATCAG
ATGATGATACTGAAAAGCAAACACCAGCTTCAAAGCCTGAGTCTTATGAACTGAAATTTCAACCTTCTGTAACACCTGATAAAAATCATGTAGAGCCAACTTCTAAAACCCACACAGTTTCCAAAGATGTTGGAGGACAGGTTCCAAATGTAATAGGGGATGCTCAACCACAACAGCCATCTGTTGCTTTTGCTCCATTACCTTCATCAAACTTAACTCCTAAGATTTTTGGTAATGTTAGAAATGAAACTTCAAACGTGACGGTTACTCCGGATGATGATATGGACGAAGAGGCTCCAGAGACGAATAACAACATTGAGTTTAGTTTGAGCAGCTTGGGAGGATTTGGAAATAGCTCCACCCCTATGTCAGGTGCTCCTAAAACAAATCCATTTGGTGGTCCATTTGGTAATGTGAATGCAACCTCAATGACCTCTTCCTTTACTATGGCATCTCCTCCAAGTGGAGAGTTGTTTCGGCCTGCATCGTTTAGCTTCCAATCTCCATTGGCTTCACAAGCAGCATCACAACCCACAAATTCAGTTGCATTCTCTGGTGGCTTTGGCTCTGCAATGGCTACTCAAGCCCCGTCGCAAGGTGGGTTCGGTCAGCCTGCTCAGATTGGAGTAGGGCAGCAAGCTCTTGGTAATGTTCTTGGTTCATTCGGACAATCAAGACAGCTTGGTCCTAGTCTCCCTGGAACTGGTTCAGGATCCCCTGGCGGTTTTGGTGGTGGGTTTACCAGTATGAAACCGGTTGGTGGGTTTGCCAGTGTTGGTTCAAGTGGTGGTGGTAGTGGTGGGTTCGCTGGTGTTGGTTCAGGGGGTGGTGGTGGGTTTGGTGGTGTTGGTTCGAATGGTGGTGGTTTCGCTGGCACAGTCCCAACCGGTGGTGGATTTGCTGGTGCTTCCTCTACAACGGGAGGTTTTGCTGGTGCTGCAGGCGGGGGTTTTGCAGGTGCCGCAGGTGGATTTGGGGCTTTCGGCAGCCAGCAAGGAAGCGGGGGTTTCTCTGCTTTTGGTGTTGCTGCTGGTGGAGCTGGAGGAACTGGAAAACCTCCTGAGCTTTTCACCCAGATGAGAAAGTAG
Coding sequence (CDS)
ATGGCTTCCGTTGATTCCCGACCTTCAACCTTGATTCCATTAGAAAACGCCGGCGAAGGAGAACAAATTGTAAGGAACGATTTCTACTTCCAAAAGATCAGCAAACCTGTTACCGTCAAGCTCTGCGACTCCATCTTTTATCCCGAAACCCCTCCCTCTCAGCCTCTTGCTCTCTCCGAGAGTTTCGGCGGAACAACATCGGTTGCTTCCAAAAACCTTCTGCAGAGCTTCTCTCTCGCTTCAGAGAGAAGCCTTTTGCAATTCATGGCTTCCGTTGATTCCCGACCTTCCACCTTGATTCCATTAGAAGACGCCGGCGAAGGAGAACAAATAGTAAGGAACGATTTCTACTTCCAGAAGATCGGCAAACCTGTCCCGGTCAAGCTCTGCGACTCCATTTTTGATCCCCAAACCCCTCCCTCTCAGCCTCTTGCTCTCTCCGAGAGTTTCGGTCTCATCTTCGTTGCGCATTTGTCTGGTTGGACCAAGGATGTAATTGCTTCGGCCGAGGAGATAAAAAACGGGGGAACTGGTTCTTCTGTCCAGGATTTAAGCATAGTGGATATTTCCATCGGAAAAGTTCACATTCTAACTCTTTCCACGGATGATTCCATTCTTGCTGCCATCGTAGCTGGTGATATTCATCTTTTTTCAGTCCAGTCGCTGCTTGATAAGGCAAAAACACCCTCTTCTTCTTGTTCATTAACTGATTCCAGTTTCATCAAAGACTTCAAATGGACCAGAAAGTTGGAAGATTCTTATCTGGTTCTTTCAAAGCATGGACAGTTATATCAAGGATCGGCGAATGGGCCTCCTACACATGTGATGCACGATATTGATGCCGTTGACTGTATCAAGTGGGTTCGTGCTGATTGTATCATCATAGGATGCTTTCAAGTGACTGCAACAGGTGATGAAGAAGATTACTTTGTCCTAGTTATCAGAAGTAAAGATGGAAAAATCACTGACGTTTCTTCAAACAAAGTTTTGTTATCATTCCGTGATATACATTCAGGTTTCACTCGTGACATTTTGCCTGGTGATAGTGGGCCTTGTTTACTGTTGAGTTATTTGGATAAATGCAAGCTCGCAATTGTTGCAAATAGGCTCTATATGGAAGAGCATATTGTGTTGCTTGGTTTGTTGCAAGAGGTTGAGAACGAAGTTGCAGTTATTAATATTGATAGAAATACCTCTCTCCCGAAGATTGAGCTTCAAGCGAATGGTGATGATAATTTGGTAATGGGGCTGTCCATTGATCGAGTTTCTCTTCCTGGGAAGGTGTTAGTTAGAGTTGGATTTGAAGATACGAGAGAAGTTTCGCCATATTGCATTCTCGTGTGTCTTACTTTAGAGGGAGAGCTCATTATGTTTCAATTTTCTAGTGTCAATGAAACTGAAGCTCCACATGAGACTGTTTCTGCTTGTGATGAGGAGGAAGATGATATAATAGTGCCTGCTGACGATCGGTCTCAACTCTTTTCTGAATCAAAGAAAGAGTTTAGAGAAGATGATCTTAAGATGCAGGTTACGGAAAAACTTGCAATCAGTAGTGAGATTCCTCGGGAAAAAATTAAAATCTCAAATGACATTAAGTCTTCTAATAATGATCAAAGTCCAGTATCTAAAATAGATGAGAGTGCAACTGTTAGTGCAGAGAGTAATACTAAAAGCCAGAAAGCGGATTCTTTCATTTATTCACAATCATTAAAGTCTTCTGTCCTGGAGAGACCCAACTATGAGATTGGGAACTTTGATAAGTCTGTTCAAAAATTTGGTCTCGGGTCTGTTTCTATTTCAGGTAAGCCTGCGGATGTGCATAGCCAGCCCTTTCCCAATGTAAAAGAATCAACGAAAAGATTGGTGTCAACTGGCTTGTTGGCTGCATCTGAGTTATCCAGTGATAAAGCAATGTTTTTAAATAAAATCGATCCTGTATCTTCAGTCCTAACTCCGAATTCTTTTCAAAGCAGCAAGACTGAGAATTATGGGCCAAGTTT
TGGTACAGCGAATGCTTTTGCAGGTTTTTCTGGAAAACCTTTTCAACCGAAGGATGTTCCAAGTACATTAACACAAAGTGGGAAACAAGTAACGGGAGGTGCTGGTAAAATTGAATCTTTACCAGTGATACGTAGCTCACAAATATCATTGCAAGACAACTTGTCGGCGAAAATTTCTAATGAGAAACATGATGGTTCAGACCGAAATTACAGCAATGCCCCCCTGGCAAAACCAATGAAAGAAATGTGTGAAGGATTGGACATGCTTCTGGAGTCTATAGAAGAGCCGGGTGGGTTCTTGGATGCCTGCACCGCTTTCCAGAAAAGCTCTGTTGAAGCTTTGGAGCGTGGCTTAGCCAGTCTTTCAGACGAATGTCAAATATGGAAGAGCACAATGAATGAGCGTGCACAGGAGGTACAAAATCTCTTTGACAAAATGGTACAAGTTTTGTCAAAGAAGACGTACATTGAAGGTATTGTGATGCAAGCTTCTGACAGCAAGTATTGGGAACAATGGGATCGTCAAAAGTTGAGTTCAGAATTAGAGTTAAAGCGACAACACATCTTAAAGATGAATCAGAATATGACTAACCAGTTAATTGAGTTAGAAAGACATTTCAATGGCCTTGAATTGAATAAGTTTGGTGGAAATGAGGAAAGTCAAGCGAGTGAAAGAGCTCTTCAAAGGAAATTTGGTTATTCGAGGCATAGTCATTCATTACATAGTTTGAATAACATAATGGGATCTCAATTAGCAGCAGCTCAACTTCTTTCTGAAAGTCTATCAAAACAATTGGCTGCACTCAATATAGAATCACCCTCTTTAAAAAGGCAGAGTGTCACGAAGGAATTGTTTGAGACTATTGGACTTACTTATGATGCTTCTTTCGGTTCTCCAAATGTGAACAAAATTGCAGAAGCTTCTAGCAAGAAGCTTTTACTTTCTGCTGATTCTTTTTCAAGTAAAGATACACCGAGAAGAAAACAGCAGAGTGGAAGGAAAAATTCTGAAGCAGAAACTGGGAGAAGGAGAAGAGATTCACTCGACAGGAACCTGGCTAGTGTTGAACCTCCAAAAACAACTGTTAAGAGGATGCTTTTGCAAGGAATACCCTCCTCTGATGAGAAACTATTTCGGTCTCGCACACCTGAAGGGGCAGCAACAGTTGCATGGCCAGCTAGTCGATTAACATCATCTATGTCATCATCATCATCCAAAAATGCAGCAACTCCTTTCATGTGGGCTAGTCCTTTACAACCATCAAATACCTCCCGTCAGAAATCTCAGCCATCGCCAAAAACTAATACTACAGCGCCATCTCCACTGTCAGTATTCCAATCATCACATGAAATGCTGAAAAAAAGTAATAATGAAGCTTTCAGTGTGACTTCAGAAAACAAATTTATTGAAAAGTCGAAAGCTTCTGATTTCTTCTCAGTCACTAGGAGCGACTCTGTCCAGAAATCTAATATAAACCTTGATCAGAAATCATCCATCTTTACGATATCATCTAAGCAGACGCCCACACTGAAAGATTCTATTAATACCTCTAATTCGGACAATCAGAAGACTGCTAACCCAAAAGAGAGGCATACAACTACAAGTCCACTTTTTGGATCTGCAAATAAACCTGAATCTGCATCTGTTGGTACAATGTCTTCTCTGGTTCCTACTGTTAATGAAGCAAGAAAGACTGAAGAAAAAAGATCGCCGACAATGATTTCACCATCAGTTCCAGCACCAGCACGGTTAAATACTCCAAGTTCATCAACTTTATTTTCAGGATTTGCTGTAAGCAAACCTCTTCCAAGTTCTGCTGCTGTTATAGATCTCAATCAACCTGTGTCAACATCAACCCAATTGAACTTCTCCTCCCCAGTTGTTTCTGTTTCTGATTCCCTATTTCAGGCACCTAAGATGATATCAACATCATCTACTCTGTCTTCCTTGAATCCTTCATTGGAGTCCTCGAAAAAAGAGTTACCTGTTTCAAAATCAG
ATGATGATACTGAAAAGCAAACACCAGCTTCAAAGCCTGAGTCTTATGAACTGAAATTTCAACCTTCTGTAACACCTGATAAAAATCATGTAGAGCCAACTTCTAAAACCCACACAGTTTCCAAAGATGTTGGAGGACAGGTTCCAAATGTAATAGGGGATGCTCAACCACAACAGCCATCTGTTGCTTTTGCTCCATTACCTTCATCAAACTTAACTCCTAAGATTTTTGGTAATGTTAGAAATGAAACTTCAAACGTGACGGTTACTCCGGATGATGATATGGACGAAGAGGCTCCAGAGACGAATAACAACATTGAGTTTAGTTTGAGCAGCTTGGGAGGATTTGGAAATAGCTCCACCCCTATGTCAGGTGCTCCTAAAACAAATCCATTTGGTGGTCCATTTGGTAATGTGAATGCAACCTCAATGACCTCTTCCTTTACTATGGCATCTCCTCCAAGTGGAGAGTTGTTTCGGCCTGCATCGTTTAGCTTCCAATCTCCATTGGCTTCACAAGCAGCATCACAACCCACAAATTCAGTTGCATTCTCTGGTGGCTTTGGCTCTGCAATGGCTACTCAAGCCCCGTCGCAAGGTGGGTTCGGTCAGCCTGCTCAGATTGGAGTAGGGCAGCAAGCTCTTGGTAATGTTCTTGGTTCATTCGGACAATCAAGACAGCTTGGTCCTAGTCTCCCTGGAACTGGTTCAGGATCCCCTGGCGGTTTTGGTGGTGGGTTTACCAGTATGAAACCGGTTGGTGGGTTTGCCAGTGTTGGTTCAAGTGGTGGTGGTAGTGGTGGGTTCGCTGGTGTTGGTTCAGGGGGTGGTGGTGGGTTTGGTGGTGTTGGTTCGAATGGTGGTGGTTTCGCTGGCACAGTCCCAACCGGTGGTGGATTTGCTGGTGCTTCCTCTACAACGGGAGGTTTTGCTGGTGCTGCAGGCGGGGGTTTTGCAGGTGCCGCAGGTGGATTTGGGGCTTTCGGCAGCCAGCAAGGAAGCGGGGGTTTCTCTGCTTTTGGTGTTGCTGCTGGTGGAGCTGGAGGAACTGGAAAACCTCCTGAGCTTTTCACCCAGATGAGAAAGTAG
Protein sequence
MASVDSRPSTLIPLENAGEGEQIVRNDFYFQKISKPVTVKLCDSIFYPETPPSQPLALSESFGGTTSVASKNLLQSFSLASERSLLQFMASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSESFGLIFVAHLSGWTKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDSILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLYQGSANGPPTHVMHDIDAVDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRDIHSGFTRDILPGDSGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINIDRNTSLPKIELQANGDDNLVMGLSIDRVSLPGKVLVRVGFEDTREVSPYCILVCLTLEGELIMFQFSSVNETEAPHETVSACDEEEDDIIVPADDRSQLFSESKKEFREDDLKMQVTEKLAISSEIPREKIKISNDIKSSNNDQSPVSKIDESATVSAESNTKSQKADSFIYSQSLKSSVLERPNYEIGNFDKSVQKFGLGSVSISGKPADVHSQPFPNVKESTKRLVSTGLLAASELSSDKAMFLNKIDPVSSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLTQSGKQVTGGAGKIESLPVIRSSQISLQDNLSAKISNEKHDGSDRNYSNAPLAKPMKEMCEGLDMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERAQEVQNLFDKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELERHFNGLELNKFGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSKQLAALNIESPSLKRQSVTKELFETIGLTYDASFGSPNVNKIAEASSKKLLLSADSFSSKDTPRRKQQSGRKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSRTPEGAATVAWPASRLTSSMSSSSSKNAATPFMWASPLQPSNTSRQKSQPSPKTNTTAPSPLSVFQSSHEMLKKSNNEAFSVTSENKFIEKSKASDFFSVTRSDSVQKSNINLDQKSSIFTISSKQTPTLKDSINTSNSDNQKTANPKERHTTTSPLFGSANKPESASVGTMSSLVPTVNEARKTEEKRSPTMISPSVPAPARLNTPSSSTLFSGFAVSKPLPSSAAVIDLNQPVSTSTQLNFSSPVVSVSDSLFQAPKMISTSSTLSSLNPSLESSKKELPVSKSDDDTEKQTPASKPESYELKFQPSVTPDKNHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSVAFAPLPSSNLTPKIFGNVRNETSNVTVTPDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPMSGAPKTNPFGGPFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGGFGSAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSPGGFGGGFTSMKPVGGFASVGSSGGGSGGFAGVGSGGGGGFGGVGSNGGGFAGTVPTGGGFAGASSTTGGFAGAAGGGFAGAAGGFGAFGSQQGSGGFSAFGVAAGGAGGTGKPPELFTQMRK
Homology
BLAST of ClCG11G012960 vs. NCBI nr
Match:
XP_038892124.1 (nuclear pore complex protein NUP214 isoform X2 [Benincasa hispida])
HSP 1 Score: 2622.8 bits (6797), Expect = 0.0e+00
Identity = 1473/1683 (87.52%), Positives = 1517/1683 (90.14%), Query Frame = 0
Query: 89 MASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSE 148
MASVDSRPSTLIPLEDAGEGEQ+VRNDFYFQKIG+PVPVKL DSIFDP+TPPSQPLALSE
Sbjct: 1 MASVDSRPSTLIPLEDAGEGEQVVRNDFYFQKIGRPVPVKLGDSIFDPETPPSQPLALSE 60
Query: 149 SFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDS 208
S GLIFVAHLSG+ TKDVIASAEEIKNGGTGSSVQDLSIVD+SIGKVHIL LSTDDS
Sbjct: 61 SSGLIFVAHLSGFFVVRTKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILALSTDDS 120
Query: 209 ILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLY 268
ILAA+VA DIHLFSVQSLLDKA+TPSSSCS+TDSS IKDFKWTRKLEDSYLVLSKHGQLY
Sbjct: 121 ILAAVVARDIHLFSVQSLLDKAETPSSSCSITDSSCIKDFKWTRKLEDSYLVLSKHGQLY 180
Query: 269 QGSANGPPTHVMHDIDA------------------------------------------- 328
QGSANG THVMHDIDA
Sbjct: 181 QGSANGSLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSSGNGETD 240
Query: 329 ------VDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 388
VDCIKWVRADCII+GCFQ+TATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD
Sbjct: 241 TDFTVKVDCIKWVRADCIIMGCFQMTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 300
Query: 389 IHSGFTRDILPGDSGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINID 448
IHSGFTRDILPGDSGPCLLLSYLDKCKLAIVANRLYME+HIVLLGLLQEVENEVAVINID
Sbjct: 301 IHSGFTRDILPGDSGPCLLLSYLDKCKLAIVANRLYMEDHIVLLGLLQEVENEVAVINID 360
Query: 449 RNTSLPKIELQANGDDNLVMGLSIDRVSLPGKVLVRVGFEDTREVSPYCILVCLTLEGEL 508
RNTSLPKIELQANGDDNLVMGL IDRVSLPGKV+VRVGFED REVSPYCILVCLTLEG+L
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCIDRVSLPGKVIVRVGFEDMREVSPYCILVCLTLEGDL 420
Query: 509 IMFQFSSVNETEAPHETVSACDEEEDDIIVPADDRSQLFSESKKEFRE--DDLKMQVTEK 568
IMFQFSSVNETEAPHETV ACDEEEDDIIVPADDRSQL SESKKEFRE +DLKMQV EK
Sbjct: 421 IMFQFSSVNETEAPHETVPACDEEEDDIIVPADDRSQLSSESKKEFREANNDLKMQVMEK 480
Query: 569 LAISSEIPREKIKISNDIKSSNNDQSPVSKIDESATVSAESNTKSQKADSFIYSQSLKSS 628
+AISSEIP EKIKISNDIKSSNNDQS VSKI ESATV AESNTKS+KADSFIYSQSLKSS
Sbjct: 481 IAISSEIPGEKIKISNDIKSSNNDQSLVSKIGESATVGAESNTKSRKADSFIYSQSLKSS 540
Query: 629 VLERPNYEIGNFDKSVQKFGLGSVSISGKPADVHSQPFPNVKESTKRLVSTGLLAASELS 688
VLER NYEIGNFDK VQKFGLG VSISGK DVHSQPFPNVKESTK+L STGLLAASELS
Sbjct: 541 VLERSNYEIGNFDKPVQKFGLGPVSISGKSVDVHSQPFPNVKESTKKLGSTGLLAASELS 600
Query: 689 SDKAMFLNKIDPVSSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLTQS 748
SDKA+FLNKIDPVSSVL PNSFQSSKTENY PSFGTAN FAGF+GKPFQPKDVPSTLTQS
Sbjct: 601 SDKAIFLNKIDPVSSVLIPNSFQSSKTENYVPSFGTANCFAGFAGKPFQPKDVPSTLTQS 660
Query: 749 GKQVTGGAGKIESLPVIRSSQISLQDNLSAKISNEKHDGSDRNYSNAPLAKPMKEMCEGL 808
G+QV GGAGKIESLPVIRSSQISLQDNL KISNEKHDGSDR+YSNAPLAKPMKEMCE L
Sbjct: 661 GRQVMGGAGKIESLPVIRSSQISLQDNLPGKISNEKHDGSDRSYSNAPLAKPMKEMCEAL 720
Query: 809 DMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERAQEVQNLFDKM 868
DMLLESIEEPGGFLDACTAFQKSSVEALE GLASL DECQIWKSTMNERAQEVQNLFDKM
Sbjct: 721 DMLLESIEEPGGFLDACTAFQKSSVEALELGLASLLDECQIWKSTMNERAQEVQNLFDKM 780
Query: 869 VQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELERHF 928
+QVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELERHF
Sbjct: 781 IQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELERHF 840
Query: 929 NGLELNKFGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSKQLA 988
NGLELNKFGGNEESQ +ERALQRKFG SRHSHSLHSLNNIMGSQLAAAQLLSESLSKQLA
Sbjct: 841 NGLELNKFGGNEESQVNERALQRKFGSSRHSHSLHSLNNIMGSQLAAAQLLSESLSKQLA 900
Query: 989 ALNIESPSLKRQSVTKELFETIGLTYDASFGSPNVNKIAEASSKKLLLSADSFSSKDTPR 1048
ALNIESPSLKRQSVTKELFETIGLTYDASF SPNVNKIAE SSKKLLLSADSFS KDT R
Sbjct: 901 ALNIESPSLKRQSVTKELFETIGLTYDASFSSPNVNKIAETSSKKLLLSADSFSHKDTSR 960
Query: 1049 RKQQSGRKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSRTPEGA 1108
RKQ SG KNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLL GIPSSDEKLFRS TP+GA
Sbjct: 961 RKQWSGTKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLHGIPSSDEKLFRSHTPDGA 1020
Query: 1109 ATVAWPASRLTSSMSSSSSKNA-------ATPFMWASPLQPSNTSRQKSQPSPKTNTTAP 1168
ATVAWPASRLTSSMSSSSSKNA ATPFMWASPLQPSN SRQKSQP KTN TAP
Sbjct: 1021 ATVAWPASRLTSSMSSSSSKNAGHDSENPATPFMWASPLQPSNISRQKSQPLQKTNATAP 1080
Query: 1169 SPLSVFQSSHEMLKKSNNEAFSVTSENKFIEKSKASDFFSVTRSDSVQKSNINLDQKSSI 1228
S LSVFQSSHEMLKKSNNEAFSVTSENKF EKSKASDFFSVTR+DSVQKSN NLD+K SI
Sbjct: 1081 S-LSVFQSSHEMLKKSNNEAFSVTSENKFTEKSKASDFFSVTRTDSVQKSNTNLDKKPSI 1140
Query: 1229 FTISSKQTPTLKDSINTSNSDNQKTANPKERHTTTSPLFGSANKPESASVGTMSSLVPTV 1288
FTISSKQ T KD I+TSN DNQKTAN KERHTTTSPLFGSANKPESASVGTMSSLVPTV
Sbjct: 1141 FTISSKQMATPKDFIDTSNLDNQKTANSKERHTTTSPLFGSANKPESASVGTMSSLVPTV 1200
Query: 1289 NEARKTEEKRSPTMISPSVPA--PARLNTP-SSSTLFSGFAVSKPLPSSAAVIDLNQPVS 1348
+EARK EKRS ISPSVPA PAR N+P SSSTLFSGFAVSKPLPSSAA IDLNQP+S
Sbjct: 1201 DEARK--EKRSLKTISPSVPAPTPARFNSPSSSSTLFSGFAVSKPLPSSAAAIDLNQPLS 1260
Query: 1349 TSTQLNFSSPVVSVSDSLFQAPKMISTSSTLSSLNPSLESSKKELPVSKSDDDTEKQTPA 1408
TSTQLNFSSPVVSVSDSLFQA KM+STSSTLSSLNP LESSKKELPVSKS+ DTEK+TPA
Sbjct: 1261 TSTQLNFSSPVVSVSDSLFQATKMVSTSSTLSSLNPILESSKKELPVSKSEGDTEKKTPA 1320
Query: 1409 SKPESYELKFQPSVTP-DKNHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSVAFAPLPS 1468
SKPES+ELKFQPS+TP +KNH+EPTSKT TV KDVGGQ+PNVIGDAQPQ PSVAFA LPS
Sbjct: 1321 SKPESHELKFQPSITPANKNHLEPTSKTQTVPKDVGGQIPNVIGDAQPQPPSVAFASLPS 1380
Query: 1469 SNLTPKIFGNVRNETSNVTVTPDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPMSGAPKT 1528
NLT K GN RNETSNVTVT DDDMDEEAPET NN+EF+LS LGGFGNSS+PMSGAPK
Sbjct: 1381 PNLTSKTSGNGRNETSNVTVTQDDDMDEEAPETINNVEFNLSGLGGFGNSSSPMSGAPKP 1440
Query: 1529 NPFGGPFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGGFG 1588
NPFGG FGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAAS PTNSVAFSGGFG
Sbjct: 1441 NPFGGSFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASLPTNSVAFSGGFG 1500
Query: 1589 SAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSPGGFGGGFTS 1648
SAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGS G F GGFT
Sbjct: 1501 SAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSLGSFSGGFTG 1560
Query: 1649 MKP--VGGFASVGSSGGGSGGFAGVGSGGGGGFGGVGSNGGGFAGTVPTGGGFAGASSTT 1696
MKP VGGFA VGSS GGSGGFAGVGSGGGGGFGGVGS GGFAGT+ TGGGFAGAS+TT
Sbjct: 1561 MKPVAVGGFAGVGSS-GGSGGFAGVGSGGGGGFGGVGSAAGGFAGTISTGGGFAGASATT 1620
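The percentages in each HSP summary line are the match counts divided by the alignment length (for the hit above, 1473 of 1683 aligned columns are identical). The sketch below shows one way to recompute them from a summary line. The function name `hsp_percentages` and the regular expression are illustrative assumptions, not part of any BLAST tool or of this database.

```python
import re

def hsp_percentages(summary_line):
    """Recompute percentages from 'Label = matches/length' fields.

    Hypothetical helper for illustration: it looks only at fields of the
    form 'Name = N/M' and ignores everything else (e.g. 'Query Frame = 0').
    """
    result = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", summary_line):
        # Percentage of aligned columns, rounded to two decimals as in the report.
        result[label] = round(100 * int(matches) / int(length), 2)
    return result

line = "Identity = 1473/1683 (87.52%), Positives = 1517/1683 (90.14%), Query Frame = 0"
print(hsp_percentages(line))
```

The denominator is the alignment length including gap columns, which is why it can exceed the number of query residues covered by the HSP.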
BLAST of ClCG11G012960 vs. NCBI nr
Match:
XP_038892123.1 (nuclear pore complex protein NUP214 isoform X1 [Benincasa hispida])
HSP 1 Score: 2616.6 bits (6781), Expect = 0.0e+00
Identity = 1473/1688 (87.26%), Positives = 1517/1688 (89.87%), Query Frame = 0
Query: 89 MASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSE 148
MASVDSRPSTLIPLEDAGEGEQ+VRNDFYFQKIG+PVPVKL DSIFDP+TPPSQPLALSE
Sbjct: 1 MASVDSRPSTLIPLEDAGEGEQVVRNDFYFQKIGRPVPVKLGDSIFDPETPPSQPLALSE 60
Query: 149 SFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDS 208
S GLIFVAHLSG+ TKDVIASAEEIKNGGTGSSVQDLSIVD+SIGKVHIL LSTDDS
Sbjct: 61 SSGLIFVAHLSGFFVVRTKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILALSTDDS 120
Query: 209 ILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLY 268
ILAA+VA DIHLFSVQSLLDKA+TPSSSCS+TDSS IKDFKWTRKLEDSYLVLSKHGQLY
Sbjct: 121 ILAAVVARDIHLFSVQSLLDKAETPSSSCSITDSSCIKDFKWTRKLEDSYLVLSKHGQLY 180
Query: 269 QGSANGPPTHVMHDIDA------------------------------------------- 328
QGSANG THVMHDIDA
Sbjct: 181 QGSANGSLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSSGNGETD 240
Query: 329 ------VDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 388
VDCIKWVRADCII+GCFQ+TATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD
Sbjct: 241 TDFTVKVDCIKWVRADCIIMGCFQMTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 300
Query: 389 IHSGFTRDILPGDSGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINID 448
IHSGFTRDILPGDSGPCLLLSYLDKCKLAIVANRLYME+HIVLLGLLQEVENEVAVINID
Sbjct: 301 IHSGFTRDILPGDSGPCLLLSYLDKCKLAIVANRLYMEDHIVLLGLLQEVENEVAVINID 360
Query: 449 RNTSLPKIELQANGDDNLVMGLSIDRVSLPGKVLVRVGFEDTREVSPYCILVCLTLEGEL 508
RNTSLPKIELQANGDDNLVMGL IDRVSLPGKV+VRVGFED REVSPYCILVCLTLEG+L
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCIDRVSLPGKVIVRVGFEDMREVSPYCILVCLTLEGDL 420
Query: 509 IMFQFSSVNETEAPHETVSACDEEEDDIIVPADDRSQLFSESKKEFRE--DDLKMQVTEK 568
IMFQFSSVNETEAPHETV ACDEEEDDIIVPADDRSQL SESKKEFRE +DLKMQV EK
Sbjct: 421 IMFQFSSVNETEAPHETVPACDEEEDDIIVPADDRSQLSSESKKEFREANNDLKMQVMEK 480
Query: 569 LAISSEIPREKIKISNDIKSSNNDQSPVSKIDESATVSAESNTKSQKADSFIYSQSLKSS 628
+AISSEIP EKIKISNDIKSSNNDQS VSKI ESATV AESNTKS+KADSFIYSQSLKSS
Sbjct: 481 IAISSEIPGEKIKISNDIKSSNNDQSLVSKIGESATVGAESNTKSRKADSFIYSQSLKSS 540
Query: 629 VLERPNYEIGNFDKSVQKFGLGSVSISGKPADVHSQPFPNVKESTKRLVSTGLLAASELS 688
VLER NYEIGNFDK VQKFGLG VSISGK DVHSQPFPNVKESTK+L STGLLAASELS
Sbjct: 541 VLERSNYEIGNFDKPVQKFGLGPVSISGKSVDVHSQPFPNVKESTKKLGSTGLLAASELS 600
Query: 689 SDKAMFLNKIDPVSSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLTQS 748
SDKA+FLNKIDPVSSVL PNSFQSSKTENY PSFGTAN FAGF+GKPFQPKDVPSTLTQS
Sbjct: 601 SDKAIFLNKIDPVSSVLIPNSFQSSKTENYVPSFGTANCFAGFAGKPFQPKDVPSTLTQS 660
Query: 749 GKQVTGGAGKIESLPVIRSSQISLQDNLSAKISNEKHDGSDRNYSNAPLAKPMKEMCEGL 808
G+QV GGAGKIESLPVIRSSQISLQDNL KISNEKHDGSDR+YSNAPLAKPMKEMCE L
Sbjct: 661 GRQVMGGAGKIESLPVIRSSQISLQDNLPGKISNEKHDGSDRSYSNAPLAKPMKEMCEAL 720
Query: 809 DMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERAQEVQNLFDKM 868
DMLLESIEEPGGFLDACTAFQKSSVEALE GLASL DECQIWKSTMNERAQEVQNLFDKM
Sbjct: 721 DMLLESIEEPGGFLDACTAFQKSSVEALELGLASLLDECQIWKSTMNERAQEVQNLFDKM 780
Query: 869 VQ-----VLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIE 928
+Q VLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIE
Sbjct: 781 IQVYLVSVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIE 840
Query: 929 LERHFNGLELNKFGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESL 988
LERHFNGLELNKFGGNEESQ +ERALQRKFG SRHSHSLHSLNNIMGSQLAAAQLLSESL
Sbjct: 841 LERHFNGLELNKFGGNEESQVNERALQRKFGSSRHSHSLHSLNNIMGSQLAAAQLLSESL 900
Query: 989 SKQLAALNIESPSLKRQSVTKELFETIGLTYDASFGSPNVNKIAEASSKKLLLSADSFSS 1048
SKQLAALNIESPSLKRQSVTKELFETIGLTYDASF SPNVNKIAE SSKKLLLSADSFS
Sbjct: 901 SKQLAALNIESPSLKRQSVTKELFETIGLTYDASFSSPNVNKIAETSSKKLLLSADSFSH 960
Query: 1049 KDTPRRKQQSGRKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSR 1108
KDT RRKQ SG KNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLL GIPSSDEKLFRS
Sbjct: 961 KDTSRRKQWSGTKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLHGIPSSDEKLFRSH 1020
Query: 1109 TPEGAATVAWPASRLTSSMSSSSSKNA-------ATPFMWASPLQPSNTSRQKSQPSPKT 1168
TP+GAATVAWPASRLTSSMSSSSSKNA ATPFMWASPLQPSN SRQKSQP KT
Sbjct: 1021 TPDGAATVAWPASRLTSSMSSSSSKNAGHDSENPATPFMWASPLQPSNISRQKSQPLQKT 1080
Query: 1169 NTTAPSPLSVFQSSHEMLKKSNNEAFSVTSENKFIEKSKASDFFSVTRSDSVQKSNINLD 1228
N TAPS LSVFQSSHEMLKKSNNEAFSVTSENKF EKSKASDFFSVTR+DSVQKSN NLD
Sbjct: 1081 NATAPS-LSVFQSSHEMLKKSNNEAFSVTSENKFTEKSKASDFFSVTRTDSVQKSNTNLD 1140
Query: 1229 QKSSIFTISSKQTPTLKDSINTSNSDNQKTANPKERHTTTSPLFGSANKPESASVGTMSS 1288
+K SIFTISSKQ T KD I+TSN DNQKTAN KERHTTTSPLFGSANKPESASVGTMSS
Sbjct: 1141 KKPSIFTISSKQMATPKDFIDTSNLDNQKTANSKERHTTTSPLFGSANKPESASVGTMSS 1200
Query: 1289 LVPTVNEARKTEEKRSPTMISPSVPA--PARLNTP-SSSTLFSGFAVSKPLPSSAAVIDL 1348
LVPTV+EARK EKRS ISPSVPA PAR N+P SSSTLFSGFAVSKPLPSSAA IDL
Sbjct: 1201 LVPTVDEARK--EKRSLKTISPSVPAPTPARFNSPSSSSTLFSGFAVSKPLPSSAAAIDL 1260
Query: 1349 NQPVSTSTQLNFSSPVVSVSDSLFQAPKMISTSSTLSSLNPSLESSKKELPVSKSDDDTE 1408
NQP+STSTQLNFSSPVVSVSDSLFQA KM+STSSTLSSLNP LESSKKELPVSKS+ DTE
Sbjct: 1261 NQPLSTSTQLNFSSPVVSVSDSLFQATKMVSTSSTLSSLNPILESSKKELPVSKSEGDTE 1320
Query: 1409 KQTPASKPESYELKFQPSVTP-DKNHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSVAF 1468
K+TPASKPES+ELKFQPS+TP +KNH+EPTSKT TV KDVGGQ+PNVIGDAQPQ PSVAF
Sbjct: 1321 KKTPASKPESHELKFQPSITPANKNHLEPTSKTQTVPKDVGGQIPNVIGDAQPQPPSVAF 1380
Query: 1469 APLPSSNLTPKIFGNVRNETSNVTVTPDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPMS 1528
A LPS NLT K GN RNETSNVTVT DDDMDEEAPET NN+EF+LS LGGFGNSS+PMS
Sbjct: 1381 ASLPSPNLTSKTSGNGRNETSNVTVTQDDDMDEEAPETINNVEFNLSGLGGFGNSSSPMS 1440
Query: 1529 GAPKTNPFGGPFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAF 1588
GAPK NPFGG FGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAAS PTNSVAF
Sbjct: 1441 GAPKPNPFGGSFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASLPTNSVAF 1500
Query: 1589 SGGFGSAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSPGGFG 1648
SGGFGSAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGS G F
Sbjct: 1501 SGGFGSAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSLGSFS 1560
Query: 1649 GGFTSMKP--VGGFASVGSSGGGSGGFAGVGSGGGGGFGGVGSNGGGFAGTVPTGGGFAG 1696
GGFT MKP VGGFA VGSS GGSGGFAGVGSGGGGGFGGVGS GGFAGT+ TGGGFAG
Sbjct: 1561 GGFTGMKPVAVGGFAGVGSS-GGSGGFAGVGSGGGGGFGGVGSAAGGFAGTISTGGGFAG 1620
BLAST of ClCG11G012960 vs. NCBI nr
Match:
XP_031741375.1 (nuclear pore complex protein NUP214 isoform X2 [Cucumis sativus])
HSP 1 Score: 2408.3 bits (6240), Expect = 0.0e+00
Identity = 1365/1673 (81.59%), Positives = 1439/1673 (86.01%), Query Frame = 0
Query: 89 MASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSE 148
MASVDS PS+LIPLEDAGEGEQIVRND YFQKIGKPVPVKL DSIFDP++PPSQPLALSE
Sbjct: 1 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 60
Query: 149 SFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDS 208
S GLIFVAHLSG+ KDVIASAEEIKNGGTGSSVQDLSIVD+SIGKVHIL +STD+S
Sbjct: 61 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 120
Query: 209 ILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLY 268
+LAA+VAGD+H+FSVQSLLDKA+ P SSCS+TDSSFIKDFKWTRKLE++YLVLSKHGQLY
Sbjct: 121 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180
Query: 269 QGSANGPPTHVMHDIDA---------------------------------------VDCI 328
QGSANGP THVMHDIDA VDCI
Sbjct: 181 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNVDCI 240
Query: 329 KWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRDIHSGFTRDIL 388
KWVRADCIIIGCFQVTATGDEEDY V VIRSKDGKITDVSSNKVLLSF DIHSGFTRDIL
Sbjct: 241 KWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCDIHSGFTRDIL 300
Query: 389 PGDSGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINIDRNTSLPKIEL 448
PG+SGPCLLLSYLD CKLAIVANRLY+E+HI LLGLL EVENEVAV+NIDRNTSLPKIEL
Sbjct: 301 PGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNIDRNTSLPKIEL 360
Query: 449 QANGDDNLVMGLSIDRVSLPGKVLVRVGFEDTREVSPYCILVCLTLEGELIMFQFSSVNE 508
QANGDDNLVMGL IDRVSL GKV+V+VGFED REVSPYCILVCLTLEGELIMFQFSSVNE
Sbjct: 361 QANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGELIMFQFSSVNE 420
Query: 509 TEAPHETVSACDEEEDDIIVPADDRSQLFSESKKEFRED--DLKMQVTEKLAISSEIPRE 568
TEAPHETVSACD+EEDDI VP DDRS+ KE RE D +MQVTEK+AISSEIPRE
Sbjct: 421 TEAPHETVSACDDEEDDITVPTDDRSE-----SKESREANIDHRMQVTEKIAISSEIPRE 480
Query: 569 KIKISNDIKSSNNDQSPVSKIDESATVSAESNTKSQKADSFIYSQSLKSSVLER-PNYEI 628
K K SNDIKSS NDQS V IDESA VS E NTKSQK DSFIYSQSLKSS ER P+YEI
Sbjct: 481 KGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPERPPHYEI 540
Query: 629 GNFDKSVQKF-GLGSVSISGKPADVHSQPFPNVKESTKRLVSTGLLAASELSSDKAMFLN 688
GNFDK V KF GLGS SISGK DV SQPFPNVKESTKRL STGL+AASELSS+KAM
Sbjct: 541 GNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSEKAMSFK 600
Query: 689 KIDPVSSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLTQSGKQVTGGA 748
KIDPV SV T NS QSS TENYGPSFGTANAF GF+GKPFQPKDVPSTLTQSG+Q TGGA
Sbjct: 601 KIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGRQATGGA 660
Query: 749 GKIESLPVIRSSQISLQDNLSA-KISNEKHDGSDRNYSNAPLAKPMKEMCEGLDMLLESI 808
GKIESLPVIRSSQISLQD S+ KISNEKHDGS+R YSN+PLAKPMKEMCEGLD LLESI
Sbjct: 661 GKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLDTLLESI 720
Query: 809 EEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERAQEVQNLFDKMVQVLSKK 868
EE GGF+DACTAFQKSSVEALE GLASLSD CQIW+STMNER+QEVQNLFDKMVQVLSKK
Sbjct: 721 EESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQVLSKK 780
Query: 869 TYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELERHFNGLELNK 928
TYIEGIVMQ+SDSKYWEQWDRQKLSSELELKRQHILKMNQN+TNQLIELERHFNGLELNK
Sbjct: 781 TYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNGLELNK 840
Query: 929 FGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSKQLAALNIESP 988
FGGNEESQ SERALQRKFG SRHSHS+HSLNNIMGSQLA AQLLSESLSKQLAALN+ESP
Sbjct: 841 FGGNEESQVSERALQRKFGSSRHSHSVHSLNNIMGSQLATAQLLSESLSKQLAALNMESP 900
Query: 989 SLKRQSVTKELFETIGLTYDASFGSPNVNKIAEASSKKLLLSADSFSSKDTPRRKQQSGR 1048
SLKRQS TKELFE+IGLTYDASF SPNVNKIAE SSKKLLLS+DSFSSK T RRKQQSG
Sbjct: 901 SLKRQSATKELFESIGLTYDASFSSPNVNKIAETSSKKLLLSSDSFSSKGTSRRKQQSGT 960
Query: 1049 KNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSRTPEGAATVAWPA 1108
KNSEAETGRRRRDSLDRNLASV+PPKTTVKRMLLQGIPSS+EK F SRTPEGAATVA PA
Sbjct: 961 KNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGIPSSEEKQFCSRTPEGAATVARPA 1020
Query: 1109 SRLTSSMSSSS------SKNAATPFMWASPLQPSNTSRQKSQPSPKTNTTAPSPLSVFQS 1168
SR+TSS+SSSS S+N TPFMW SPLQPSNTSRQKS P K N T PSP VFQS
Sbjct: 1021 SRITSSISSSSKNAGHDSENPETPFMWNSPLQPSNTSRQKSLPLQKINVTPPSPPPVFQS 1080
Query: 1169 SHEMLKKSNNEAFSVTSENKFI-----EKSKASDFFSVTRSDSVQKSNINLDQKSSIFTI 1228
SH+MLKK NNEA SVTSENKF EKSKASDFFS TRSDSVQKSNIN+DQKSSIFTI
Sbjct: 1081 SHDMLKKKNNEAHSVTSENKFTDVACPEKSKASDFFSATRSDSVQKSNINVDQKSSIFTI 1140
Query: 1229 SSKQTPTLKDSINTSNSDNQKTANPKERHTTTSPLFGSANKPESASVGTMSSLVPTVNEA 1288
SSKQ PT DSI TSN DNQKTAN KERHTTTSP FGSANKPES VG+M SLVPTV+ +
Sbjct: 1141 SSKQMPTPIDSIATSNVDNQKTANVKERHTTTSPFFGSANKPESPFVGSMPSLVPTVDGS 1200
Query: 1289 RKTEEKRSPTMISPSVPAPARLNTPSS-STLFSGFAVSKPLPSSAAVIDLNQPVSTSTQL 1348
RKTEEK+S T IS SV APA LNT SS STLFSGFAVSK LPSSAAVIDLNQP STSTQL
Sbjct: 1201 RKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKALPSSAAVIDLNQPPSTSTQL 1260
Query: 1349 NFSSPVVSVSDSLFQAPKMISTSSTLSSLNPSLESSKKELPVSKSDDDTEKQTPASKPES 1408
NFSSPVVS S+SLFQAPK++ TS TLSSLNP+LESSK EL V KS+DD E+Q +SKP S
Sbjct: 1261 NFSSPVVSSSNSLFQAPKIVPTSPTLSSLNPTLESSKTELSVPKSNDDAEEQILSSKPGS 1320
Query: 1409 YELKFQPSVTP-DKNHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSVAFAPLPSSNLTP 1468
+ELKFQPS+TP DKNHVEPTSKT TV KDVGGQ NV+G+AQPQQPSVAFA +PS NLT
Sbjct: 1321 HELKFQPSITPADKNHVEPTSKTQTVFKDVGGQDSNVVGNAQPQQPSVAFASIPSPNLTS 1380
Query: 1469 KIFGNVRNETSNVTVTPDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPMSGAPKTNPFGG 1528
KIF N RNETSN VT DDDMDEEAPETNNN+EF+LSSLGGFGNSSTP+SG PK NPFGG
Sbjct: 1381 KIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPISGGPKPNPFGG 1440
Query: 1529 PFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGGFGSAMAT 1588
PFGNVNA SMTSSF MASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSG FGSA+ T
Sbjct: 1441 PFGNVNAASMTSSFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGAFGSAVPT 1500
Query: 1589 QAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSPGGFGGGFTSMKPVG 1648
Q PSQGGFGQP+QIGVGQQALGNVLGSFGQSRQLGP++ GTGSGSPGGF GGFT+ KPV
Sbjct: 1501 QPPSQGGFGQPSQIGVGQQALGNVLGSFGQSRQLGPTVHGTGSGSPGGFSGGFTNAKPV- 1560
Query: 1649 GFASVGSSGGGSGGFAGVGSGGGGGFGGVGSNGGGFAGTVPTGGGFAGASSTTGGFAGAA 1696
G GGFAGVGSGGGGGFGGV GGFAG TGGGFAGASST GGFAGAA
Sbjct: 1561 ----------GVGGFAGVGSGGGGGFGGV----GGFAGAASTGGGFAGASSTAGGFAGAA 1620
BLAST of ClCG11G012960 vs. NCBI nr
Match:
KAA0034115.1 (nuclear pore complex protein NUP214 [Cucumis melo var. makuwa])
HSP 1 Score: 2407.5 bits (6238), Expect = 0.0e+00
Identity = 1377/1708 (80.62%), Positives = 1450/1708 (84.89%), Query Frame = 0
Query: 89 MASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSE 148
MASVDS S LIPLEDAGEGEQIVRNDFYFQKIGKPVPVKL DSIFDP++PPSQP+ALSE
Sbjct: 1 MASVDSGSSPLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLGDSIFDPESPPSQPIALSE 60
Query: 149 SFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDS 208
S GLIFVAHLSG+ KDVIASA+EIKNGGT SSVQDLSIVD+SIGKVHIL +STD+S
Sbjct: 61 SSGLIFVAHLSGFFVVRIKDVIASAQEIKNGGTCSSVQDLSIVDVSIGKVHILAVSTDNS 120
Query: 209 ILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLY 268
+LAA+VAGD+H+FSVQSLLDKA+ P SSCS+TDSSFIKDFKWTRKLE++YLVLSKHGQLY
Sbjct: 121 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180
Query: 269 QGSANGPPTHVMHDIDA------------------------------------------- 328
QGS NGP THVMHDIDA
Sbjct: 181 QGSVNGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHRFKERLSMSLLPSLGNGETD 240
Query: 329 ------VDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 388
VDCIKWVRADCIIIGCFQVTATGDEEDY VLVI+SKDGKITDVSSNKVLLSF D
Sbjct: 241 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVLVIKSKDGKITDVSSNKVLLSFCD 300
Query: 389 IHSGFTRDILPGDSGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINID 448
IHSGFTRDILPG+SGPCLLLSYLD CKLAIVANRLY+E+HI LLGLL EVENEVAV+NID
Sbjct: 301 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360
Query: 449 RNTSLPKIELQANGDDNLVMGLSIDRVSLPGKVLVRVGFEDTREVSPYCILVCLTLEGEL 508
RNTSLPKIELQANGDDNLVMGL +DRVSLPGKV+V+VGFED REVSPYCILVCLTLEGEL
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCVDRVSLPGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420
Query: 509 IMFQFSSVNETEAPHETVSACDEEEDDIIVPADDRSQLFSESKKEFRED--DLKMQVTEK 568
IMFQFSSVNETEAPHETVSACD+EEDDI VP DDR SESKKE RE DLKMQVTEK
Sbjct: 421 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDR----SESKKESREANVDLKMQVTEK 480
Query: 569 LAISSEIPREKIKISNDIKSSNNDQSPVSKIDESATVSAESNTKSQKADSFIYSQSLKSS 628
+ ISSEIPREK+K SNDIKSSNND+SPVS IDESA VS E NTKSQK DSFI+SQSLKSS
Sbjct: 481 ITISSEIPREKVKTSNDIKSSNNDRSPVSNIDESAIVSPEGNTKSQKVDSFIHSQSLKSS 540
Query: 629 VLER-PNYEIGNFDKSVQKF-GLGSVSISGKPADVHSQPFPNVKESTKRLVSTGLLAASE 688
ER PN EIGNFDK V KF GLGSVSISGKP DV SQPFPNVKES KRL STGL+AASE
Sbjct: 541 APERPPNNEIGNFDKPVLKFTGLGSVSISGKPEDVPSQPFPNVKESQKRLGSTGLVAASE 600
Query: 689 LSSDKAMFLNKIDPVSSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLT 748
LSS+K MF KIDPVSSVLT NS QSS TENYGPSFGTANAF GF+GKPFQPKDVPSTLT
Sbjct: 601 LSSEKTMFFKKIDPVSSVLTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLT 660
Query: 749 QSGKQVTGGAGKIESLPVIRSSQISLQDNLSA-KISNEKHDGSDRNYSNAPLAKPMKEMC 808
QSG+QVTGGAGKIESLPVIRSSQISLQD S+ KISNEKHDGS+R YSN+PLAKPMKEMC
Sbjct: 661 QSGRQVTGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMC 720
Query: 809 EGLDMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERAQEVQNLF 868
EGLD LLESIEE GGF+DACTAFQKSSVEALE GLASLSDECQIW+STMNER QEVQNLF
Sbjct: 721 EGLDTLLESIEESGGFMDACTAFQKSSVEALELGLASLSDECQIWRSTMNERVQEVQNLF 780
Query: 869 DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMN----------- 928
DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMN
Sbjct: 781 DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQVGLFLFQKPL 840
Query: 929 --------------QNMTNQLIELERHFNGLELNKFGGNEESQASERALQRKFGYSRHSH 988
QN+TNQLIELERHFNGLELNKFGGNEESQ SERALQRKFG SRHSH
Sbjct: 841 NFSNFRCYLYSSFFQNITNQLIELERHFNGLELNKFGGNEESQVSERALQRKFGSSRHSH 900
Query: 989 SLHSLNNIMGSQLAAAQLLSESLSKQLAALNIESPSLKRQSVTKELFETIGLTYDASFGS 1048
SLHSLNNIMGSQLA AQLLSESLSKQLAALN+ESP LKRQS TKELFETIGLTYDASF S
Sbjct: 901 SLHSLNNIMGSQLATAQLLSESLSKQLAALNMESPPLKRQSATKELFETIGLTYDASFSS 960
Query: 1049 PNVNKIAEASSKKLLLSADSFSSKDTPRRKQQSGRKNSEAETGRRRRDSLDRNLASVEPP 1108
PNVNKIA+ SSKKLLLS+DSFSSK T RRKQQSG KNSEAETGRRRRDSLDRNLASV+PP
Sbjct: 961 PNVNKIADTSSKKLLLSSDSFSSKGTSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPP 1020
Query: 1109 KTTVKRMLLQGIPSSDEKLFRSRTPEGAATVAWPASRLTSSMSSSS------SKNAATPF 1168
KTTVKRMLLQG PSS+EK FRSRTPEGAATV PASR+TSS+SSSS S+N ATPF
Sbjct: 1021 KTTVKRMLLQGTPSSEEKQFRSRTPEGAATVERPASRITSSISSSSKNAGHDSENPATPF 1080
Query: 1169 MWASPLQPSNTSRQKSQPSPKTNTTAPSPLSVFQSSHEMLKKSNNEAFSVTSENKFI--- 1228
MWAS LQPSNTSRQKS P KTN TAPSP VFQSSH+MLKK+NN A S TSENKF
Sbjct: 1081 MWASVLQPSNTSRQKSLPLQKTNATAPSPPPVFQSSHDMLKKNNNAAHSATSENKFTDMA 1140
Query: 1229 --EKSKASDFFSVTRSDSVQKSNINLDQKSSIFTISSKQTPTLKDSINTSNSDNQKTANP 1288
EKSKASDFFS TRSDSVQKS IN+DQKSSIFTISSKQTP +DSI TSN DNQKTAN
Sbjct: 1141 CPEKSKASDFFSATRSDSVQKSKINVDQKSSIFTISSKQTPPPEDSIGTSNVDNQKTANV 1200
Query: 1289 KERHTTTSPLFGSANKPESASVGTMSSLVPTVNEARKTEEKRSPTMISPSVPAPARLNTP 1348
KERHTTTS LFGSANKPES VGTM SLVPTV+ ARKTEEK+S T IS SV APA LNT
Sbjct: 1201 KERHTTTSQLFGSANKPESPFVGTMPSLVPTVDGARKTEEKKSVTTISQSVSAPAPLNTS 1260
Query: 1349 SS-STLFSGFAVSKPLPSS---AAVIDLNQPVSTSTQLNFSSPVVSVSDSLFQAPKMIST 1408
SS STLFSGFAVSK LPSS AAV+DLNQP STSTQLNF SPVVS S+SLFQAPK + T
Sbjct: 1261 SSASTLFSGFAVSKSLPSSAAVAAVVDLNQPQSTSTQLNF-SPVVSGSNSLFQAPK-VPT 1320
Query: 1409 SSTLSSLNPSLESSKKELPVSKSDDDTEKQTPASKPESYELKFQPSVTP-DKNHVEPTSK 1468
S TLSSLNP++ESSK EL V KS+DD EKQT +SKP S+ELKFQPS+TP DKNHVEPTSK
Sbjct: 1321 SPTLSSLNPTMESSKTELSVLKSNDDAEKQTLSSKPGSHELKFQPSITPADKNHVEPTSK 1380
Query: 1469 THTVSKDVGGQVPNVIGDAQPQQPSVAFAPLPSSNLTPKIFGNVRNETSNVTVTPDDDMD 1528
T TV KDVGGQVPNV+GDAQ QQPSVAFA +PS NLT KIF N RNETSN VT DDDMD
Sbjct: 1381 TQTVFKDVGGQVPNVVGDAQAQQPSVAFASIPSQNLTSKIFANSRNETSNAVVTQDDDMD 1440
Query: 1529 EEAPETNNNIEFSLSSLGGFGNSSTPMSGAPKTNPFGGPFGNVNATSMTSSFTMASPPSG 1588
EEAPETNNN+EF+LSSLGGFGNSSTP+SGAPK NPFGGPFGNVNA S+T+SF MASPPSG
Sbjct: 1441 EEAPETNNNVEFNLSSLGGFGNSSTPISGAPKPNPFGGPFGNVNAASVTTSFNMASPPSG 1500
Query: 1589 ELFRPASFSFQSPLASQAASQPTNSVAFSGGFGSAMATQAPSQGGFGQPAQIGVGQQALG 1648
ELFRPASFSFQSPLASQAASQPTNSVAFSG FGSA+ATQAP QGGFGQPAQIGVGQQALG
Sbjct: 1501 ELFRPASFSFQSPLASQAASQPTNSVAFSGAFGSAVATQAPPQGGFGQPAQIGVGQQALG 1560
Query: 1649 NVLGSFGQSRQLGPSLPGTGSGSPGGFGGGFTSMKPVGGFASVGSSGGGSGGFAGVGSGG 1696
NVLGSFGQSRQLGP+LPGTGSGSPGGF GGFT+ KPV G GGFAGVGSGG
Sbjct: 1561 NVLGSFGQSRQLGPTLPGTGSGSPGGFSGGFTNAKPV-----------GVGGFAGVGSGG 1620
BLAST of ClCG11G012960 vs. NCBI nr
Match:
XP_031741374.1 (nuclear pore complex protein NUP214 isoform X1 [Cucumis sativus] >KGN52214.2 hypothetical protein Csa_008316 [Cucumis sativus])
HSP 1 Score: 2404.4 bits (6230), Expect = 0.0e+00
Identity = 1365/1683 (81.11%), Positives = 1439/1683 (85.50%), Query Frame = 0
Query: 89 MASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSE 148
MASVDS PS+LIPLEDAGEGEQIVRND YFQKIGKPVPVKL DSIFDP++PPSQPLALSE
Sbjct: 1 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 60
Query: 149 SFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDS 208
S GLIFVAHLSG+ KDVIASAEEIKNGGTGSSVQDLSIVD+SIGKVHIL +STD+S
Sbjct: 61 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 120
Query: 209 ILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLY 268
+LAA+VAGD+H+FSVQSLLDKA+ P SSCS+TDSSFIKDFKWTRKLE++YLVLSKHGQLY
Sbjct: 121 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180
Query: 269 QGSANGPPTHVMHDIDA------------------------------------------- 328
QGSANGP THVMHDIDA
Sbjct: 181 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD 240
Query: 329 ------VDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 388
VDCIKWVRADCIIIGCFQVTATGDEEDY V VIRSKDGKITDVSSNKVLLSF D
Sbjct: 241 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 300
Query: 389 IHSGFTRDILPGDSGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINID 448
IHSGFTRDILPG+SGPCLLLSYLD CKLAIVANRLY+E+HI LLGLL EVENEVAV+NID
Sbjct: 301 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360
Query: 449 RNTSLPKIELQANGDDNLVMGLSIDRVSLPGKVLVRVGFEDTREVSPYCILVCLTLEGEL 508
RNTSLPKIELQANGDDNLVMGL IDRVSL GKV+V+VGFED REVSPYCILVCLTLEGEL
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420
Query: 509 IMFQFSSVNETEAPHETVSACDEEEDDIIVPADDRSQLFSESKKEFRED--DLKMQVTEK 568
IMFQFSSVNETEAPHETVSACD+EEDDI VP DDRS+ KE RE D +MQVTEK
Sbjct: 421 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSE-----SKESREANIDHRMQVTEK 480
Query: 569 LAISSEIPREKIKISNDIKSSNNDQSPVSKIDESATVSAESNTKSQKADSFIYSQSLKSS 628
+AISSEIPREK K SNDIKSS NDQS V IDESA VS E NTKSQK DSFIYSQSLKSS
Sbjct: 481 IAISSEIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSS 540
Query: 629 VLER-PNYEIGNFDKSVQKF-GLGSVSISGKPADVHSQPFPNVKESTKRLVSTGLLAASE 688
ER P+YEIGNFDK V KF GLGS SISGK DV SQPFPNVKESTKRL STGL+AASE
Sbjct: 541 APERPPHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASE 600
Query: 689 LSSDKAMFLNKIDPVSSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLT 748
LSS+KAM KIDPV SV T NS QSS TENYGPSFGTANAF GF+GKPFQPKDVPSTLT
Sbjct: 601 LSSEKAMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLT 660
Query: 749 QSGKQVTGGAGKIESLPVIRSSQISLQDNLSA-KISNEKHDGSDRNYSNAPLAKPMKEMC 808
QSG+Q TGGAGKIESLPVIRSSQISLQD S+ KISNEKHDGS+R YSN+PLAKPMKEMC
Sbjct: 661 QSGRQATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMC 720
Query: 809 EGLDMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERAQEVQNLF 868
EGLD LLESIEE GGF+DACTAFQKSSVEALE GLASLSD CQIW+STMNER+QEVQNLF
Sbjct: 721 EGLDTLLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLF 780
Query: 869 DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELE 928
DKMVQVLSKKTYIEGIVMQ+SDSKYWEQWDRQKLSSELELKRQHILKMNQN+TNQLIELE
Sbjct: 781 DKMVQVLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELE 840
Query: 929 RHFNGLELNKFGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSK 988
RHFNGLELNKFGGNEESQ SERALQRKFG SRHSHS+HSLNNIMGSQLA AQLLSESLSK
Sbjct: 841 RHFNGLELNKFGGNEESQVSERALQRKFGSSRHSHSVHSLNNIMGSQLATAQLLSESLSK 900
Query: 989 QLAALNIESPSLKRQSVTKELFETIGLTYDASFGSPNVNKIAEASSKKLLLSADSFSSKD 1048
QLAALN+ESPSLKRQS TKELFE+IGLTYDASF SPNVNKIAE SSKKLLLS+DSFSSK
Sbjct: 901 QLAALNMESPSLKRQSATKELFESIGLTYDASFSSPNVNKIAETSSKKLLLSSDSFSSKG 960
Query: 1049 TPRRKQQSGRKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSRTP 1108
T RRKQQSG KNSEAETGRRRRDSLDRNLASV+PPKTTVKRMLLQGIPSS+EK F SRTP
Sbjct: 961 TSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGIPSSEEKQFCSRTP 1020
Query: 1109 EGAATVAWPASRLTSSMSSSS------SKNAATPFMWASPLQPSNTSRQKSQPSPKTNTT 1168
EGAATVA PASR+TSS+SSSS S+N TPFMW SPLQPSNTSRQKS P K N T
Sbjct: 1021 EGAATVARPASRITSSISSSSKNAGHDSENPETPFMWNSPLQPSNTSRQKSLPLQKINVT 1080
Query: 1169 APSPLSVFQSSHEMLKKSNNEAFSVTSENKFI-----EKSKASDFFSVTRSDSVQKSNIN 1228
PSP VFQSSH+MLKK NNEA SVTSENKF EKSKASDFFS TRSDSVQKSNIN
Sbjct: 1081 PPSPPPVFQSSHDMLKKKNNEAHSVTSENKFTDVACPEKSKASDFFSATRSDSVQKSNIN 1140
Query: 1229 LDQKSSIFTISSKQTPTLKDSINTSNSDNQKTANPKERHTTTSPLFGSANKPESASVGTM 1288
+DQKSSIFTISSKQ PT DSI TSN DNQKTAN KERHTTTSP FGSANKPES VG+M
Sbjct: 1141 VDQKSSIFTISSKQMPTPIDSIATSNVDNQKTANVKERHTTTSPFFGSANKPESPFVGSM 1200
Query: 1289 SSLVPTVNEARKTEEKRSPTMISPSVPAPARLNTPSS-STLFSGFAVSKPLPSSAAVIDL 1348
SLVPTV+ +RKTEEK+S T IS SV APA LNT SS STLFSGFAVSK LPSSAAVIDL
Sbjct: 1201 PSLVPTVDGSRKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKALPSSAAVIDL 1260
Query: 1349 NQPVSTSTQLNFSSPVVSVSDSLFQAPKMISTSSTLSSLNPSLESSKKELPVSKSDDDTE 1408
NQP STSTQLNFSSPVVS S+SLFQAPK++ TS TLSSLNP+LESSK EL V KS+DD E
Sbjct: 1261 NQPPSTSTQLNFSSPVVSSSNSLFQAPKIVPTSPTLSSLNPTLESSKTELSVPKSNDDAE 1320
Query: 1409 KQTPASKPESYELKFQPSVTP-DKNHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSVAF 1468
+Q +SKP S+ELKFQPS+TP DKNHVEPTSKT TV KDVGGQ NV+G+AQPQQPSVAF
Sbjct: 1321 EQILSSKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQDSNVVGNAQPQQPSVAF 1380
Query: 1469 APLPSSNLTPKIFGNVRNETSNVTVTPDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPMS 1528
A +PS NLT KIF N RNETSN VT DDDMDEEAPETNNN+EF+LSSLGGFGNSSTP+S
Sbjct: 1381 ASIPSPNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPIS 1440
Query: 1529 GAPKTNPFGGPFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAF 1588
G PK NPFGGPFGNVNA SMTSSF MASPPSGELFRPASFSFQSPLASQAASQPTNSVAF
Sbjct: 1441 GGPKPNPFGGPFGNVNAASMTSSFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVAF 1500
Query: 1589 SGGFGSAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSPGGFG 1648
SG FGSA+ TQ PSQGGFGQP+QIGVGQQALGNVLGSFGQSRQLGP++ GTGSGSPGGF
Sbjct: 1501 SGAFGSAVPTQPPSQGGFGQPSQIGVGQQALGNVLGSFGQSRQLGPTVHGTGSGSPGGFS 1560
Query: 1649 GGFTSMKPVGGFASVGSSGGGSGGFAGVGSGGGGGFGGVGSNGGGFAGTVPTGGGFAGAS 1696
GGFT+ KPV G GGFAGVGSGGGGGFGGV GGFAG TGGGFAGAS
Sbjct: 1561 GGFTNAKPV-----------GVGGFAGVGSGGGGGFGGV----GGFAGAASTGGGFAGAS 1620
BLAST of ClCG11G012960 vs. ExPASy Swiss-Prot
Match:
F4I1T7 (Nuclear pore complex protein NUP214 OS=Arabidopsis thaliana OX=3702 GN=NUP214 PE=1 SV=1)
HSP 1 Score: 791.6 bits (2043), Expect = 1.8e-227
Identity = 700/1885 (37.14%), Positives = 959/1885 (50.88%), Query Frame = 0
Query: 100 IPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSESFGLIFVAHLS 159
+ +E+ EG++I ND+YF++IG+P+ +K D+ +D + PPSQPLA+SE ++FVAH S
Sbjct: 4 VEIEEDTEGDRISTNDYYFERIGEPISIKEDDAQYDLENPPSQPLAISERHAVLFVAHSS 63
Query: 160 GW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDSILAAIVAGDIH 219
G+ T DVI++++ G +QDLS+VD+ +G V IL+LS DDSILA VA DIH
Sbjct: 64 GFFVGRTNDVISASKNSNGNGDKVFIQDLSLVDVPVGDVRILSLSADDSILAVTVAADIH 123
Query: 220 LFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLYQGSANGPPTHV 279
FSV SLL K PS S S +S F+KDF+W R + SYLVLS G+L+ G N PP HV
Sbjct: 124 FFSVDSLLKKDAKPSFSYSPDESGFVKDFRWRRNDKHSYLVLSNTGKLFHGIDNAPPRHV 183
Query: 280 MHDIDA-------------------------------------------------VDCIK 339
M +DA VD I+
Sbjct: 184 MDAVDAVEWSSKGSYIAVAQDNSLRIFSSKFNEKRCIALSFDSWIGDSDEDCFVKVDSIR 243
Query: 340 WVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRDIHSGFTRDILP 399
WVR +CI++GCFQ+ G EE+Y V VIRS DGKI+D S+N V LSF D+ D++P
Sbjct: 244 WVRNNCILLGCFQL-IEGREENYLVQVIRSPDGKISDGSTNLVALSFSDLFPCSMDDLVP 303
Query: 400 GDSGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQ-EVENEVAVINIDRNTSLPKIEL 459
GP LL SY+D+CKLA+ ANR ++EHIVLL + ++ V+V++IDR T LP+I L
Sbjct: 304 VGVGPHLLFSYIDQCKLAVTANRKSIDEHIVLLDWSSGDDKSAVSVVDIDRETFLPRIGL 363
Query: 460 QANGDDNLVMGLSIDRVSLPGKVLVRVGFEDTREVSPYCILVCLTLEGELIMFQFSSVNE 519
Q N DDN VMGL IDRVS+ G V VR G ++ +E+ PY +LVCLTLEG+L+MF +SV
Sbjct: 364 QENNDDNTVMGLCIDRVSIEGTVNVRSGDDELKELQPYFVLVCLTLEGKLVMFNVASVAG 423
Query: 520 TEAPHETVSACDEEEDDIIVP--ADDRSQLFSESKKEFR---EDDLKMQVTEKLAISSEI 579
A +T A + +D P DD S+ SE ++ ++D K TEK + +
Sbjct: 424 RPASSDTDLASSSDIEDAYTPLIEDDLSKQSSEKHQQLNIAVQNDQKHLNTEKFSTEQRL 483
Query: 580 PREKI------KISNDIKSSNNDQS----------------------------------- 639
P E I + + + NN +
Sbjct: 484 PNENIFSKEFESVKSSVSGDNNKKQEPYAEKPLQVEDAQQSMIPRLSGTSFGQLPMSLGY 543
Query: 640 ------------PVSK------IDESATVSAESNTKSQKADSFIYSQSLKSSVLERPNYE 699
PVS+ +S ++ ++N +S+ +F S L++++L+ P
Sbjct: 544 DTNKFAGFGPALPVSEKLQKDIFAQSNSMHLQANVESKSTAAFFGSPGLQNAILQSPQ-- 603
Query: 700 IGNFDKSVQKFGLGSVSISGKPADVHSQPFPNVKEST-KRLVSTGLLAASELSSDKAMFL 759
+ S Q + G S P D S PFP+++++ K+ V +G
Sbjct: 604 ----NTSSQPWSSGK---SVSPPDFVSGPFPSMRDTQHKQSVQSG--------------T 663
Query: 760 NKIDPVSSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLTQSGKQVTGG 819
++P S+ S Q +T G +A + S L + G
Sbjct: 664 GYVNPPMSI-KDKSVQVIET-------GRVSALSNL-----------SPLLGQNQDTNEG 723
Query: 820 AGKIESLPVIRSSQISLQ--DNLSAKISNEKHDGS--------DRNYSNAPLAKPMKEMC 879
KIE +P IR+SQ+S Q + S+++H + N SN P + EM
Sbjct: 724 VEKIEPIPSIRASQLSQQVKSSFEKSASHQQHKTPLSTGPLRLEHNMSNQP--SNINEMA 783
Query: 880 EGLDMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERAQEVQNLF 939
+D LL+SIE PGGF D+C KS+VE LE+GL SL+ +CQ WKST++E+ E+Q+L
Sbjct: 784 REMDTLLQSIEGPGGFKDSCAFILKSNVEELEQGLESLAGKCQTWKSTIHEQQAEIQHLL 843
Query: 940 DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELE 999
DK +QVL+KKTY+EG+ Q +D++YW+ W+RQKL+ ELE KRQHI+K+N+++T+QLIELE
Sbjct: 844 DKTIQVLAKKTYMEGMYKQTADNQYWQLWNRQKLNPELEAKRQHIMKLNKDLTHQLIELE 903
Query: 1000 RHFNGLELNKFGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSK 1059
R+FN LEL+++ + + R + + SR SLHSL+N M SQLAAA+ LSE LSK
Sbjct: 904 RYFNRLELDRYNEDGGHPVARRGVPNRSAPSRRVQSLHSLHNTMSSQLAAAEQLSECLSK 963
Query: 1060 QLAALNIESPSLKRQSVTKELFETIGLTYDASFGSPNVNKIAEASS-KKLLLSADSFSSK 1119
Q+ L I+SP +++V +ELFETIG+ YDASF SP+ K ASS K LLLS+ S
Sbjct: 964 QMTYLKIDSP--VKKNVKQELFETIGIPYDASFSSPDAVKAKNASSAKNLLLSSIPASIN 1023
Query: 1120 DTPRRKQQSGRKNSEAETGRRRRDSLDR---NLASVEPPKTTVKRMLLQ----------- 1179
R++Q S KNS+ ET RRRR+SLDR N A+ EPPKTTVKRMLLQ
Sbjct: 1024 QQSRQRQSSAMKNSDPETARRRRESLDRVIFNWAAFEPPKTTVKRMLLQEQQKTGMNQQT 1083
Query: 1180 ----GIPSSDEKLFRS--RTPEGAATVAWPASRLTSSMSSSSSKNAATPFMWASPLQPSN 1239
+ S++ RS + A+ V + S +S+ +TPF P+ SN
Sbjct: 1084 VLSERLRSANNTQDRSLLHVKDHASPVVSSNKGIMESFQQDTSEAQSTPFKTRPPMPQSN 1143
Query: 1240 -----TSRQKSQPS------PKTNTT------APSPL----SVFQSS------------- 1299
+ S+PS +NTT APS + +V Q
Sbjct: 1144 SPFTISPISASKPSFNWSGNKSSNTTSYAEESAPSQIKDTRTVSQPGGSSFLPKRPVAST 1203
Query: 1300 --HEMLKKSNNEAFSVTSENKFIE-----------KSKASDFFS--------------VT 1359
+ KK+ FS N F+E S SDF S
Sbjct: 1204 VLEQTEKKAGEFKFSEAKANAFVETAAGSVQRLSTTSSGSDFESSKGFGAQFSTMSSGAP 1263
Query: 1360 RSDSVQKSNINLDQKSSI----FTISSKQTP---TLKDSINT-SNSDNQKTANPKERHTT 1419
S KS + SSI FT + P T DS +T + + ++ +
Sbjct: 1264 ASSFSSKSLFGFNSSSSIPGDKFTFPAVTAPLSGTPLDSTSTLFTASSAPVSSSSQDPVP 1323
Query: 1420 TSPLFGSANKPESASVGTMSSLVPT----------------VNEARKTEEKRSP------ 1479
S SA P++ SV + S++ T +N+A + SP
Sbjct: 1324 ASIPISSAPVPQTFSVTSTSTVSATGFNVPFGKPLTSVKVDLNQAAPSTPSPSPGPTAGF 1383
Query: 1480 ----TMISPSVPAPARLNTPSSSTLFSGFA----VSKPLPSSAAVIDLNQPVSTSTQLNF 1539
+SPS P +T SS LF A VS S+ + + + + +ST L+
Sbjct: 1384 TFNLPALSPSSPEMVSSSTGQSS-LFPPSAPTSQVSSDQASATSSLTDSSRLFSSTSLS- 1443
Query: 1540 SSPVVSVSDSLFQAPKMISTSSTLSSLNPSLESSKKELPVS---KSDDDTEKQTPASKPE 1599
S+P ++ D+ FQ+P++ + SS + P E K E S + + A+K +
Sbjct: 1444 STPPITPPDA-FQSPQVSTPSSAVPITEPVSEPKKPEAQSSSILSTQSTVDSVANATKTQ 1503
Query: 1600 SYELKFQPSVTPDKNHVEPTSKTHTVSKDVGGQVPNVI----------GDAQPQQPSVAF 1659
+ L + ++ V P S + +S G ++ G +QPQQ S
Sbjct: 1504 NEPLPVKSEISNPGTTVTPVSSSGFLSGFSSGTQSSLASMAAPSFSWPGSSQPQQLSSTP 1563
Query: 1660 APLPSSNLTPKIFGNVRNETSNVTVTPDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPMS 1696
AP P+S+ T + E ++ T +D+MDEEAPE + E S+ S GGFG STP
Sbjct: 1564 APFPASSPTS---ASPFGEKKDIVDTQEDEMDEEAPEASQTTELSMGSFGGFGLGSTPNP 1623
BLAST of ClCG11G012960 vs. ExPASy TrEMBL
Match:
A0A5A7SY34 (Nuclear pore complex protein NUP214 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold65G001060 PE=4 SV=1)
HSP 1 Score: 2407.5 bits (6238), Expect = 0.0e+00
Identity = 1377/1708 (80.62%), Positives = 1450/1708 (84.89%), Query Frame = 0
Query: 89 MASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSE 148
MASVDS S LIPLEDAGEGEQIVRNDFYFQKIGKPVPVKL DSIFDP++PPSQP+ALSE
Sbjct: 1 MASVDSGSSPLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLGDSIFDPESPPSQPIALSE 60
Query: 149 SFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDS 208
S GLIFVAHLSG+ KDVIASA+EIKNGGT SSVQDLSIVD+SIGKVHIL +STD+S
Sbjct: 61 SSGLIFVAHLSGFFVVRIKDVIASAQEIKNGGTCSSVQDLSIVDVSIGKVHILAVSTDNS 120
Query: 209 ILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLY 268
+LAA+VAGD+H+FSVQSLLDKA+ P SSCS+TDSSFIKDFKWTRKLE++YLVLSKHGQLY
Sbjct: 121 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180
Query: 269 QGSANGPPTHVMHDIDA------------------------------------------- 328
QGS NGP THVMHDIDA
Sbjct: 181 QGSVNGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHRFKERLSMSLLPSLGNGETD 240
Query: 329 ------VDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 388
VDCIKWVRADCIIIGCFQVTATGDEEDY VLVI+SKDGKITDVSSNKVLLSF D
Sbjct: 241 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVLVIKSKDGKITDVSSNKVLLSFCD 300
Query: 389 IHSGFTRDILPGDSGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINID 448
IHSGFTRDILPG+SGPCLLLSYLD CKLAIVANRLY+E+HI LLGLL EVENEVAV+NID
Sbjct: 301 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360
Query: 449 RNTSLPKIELQANGDDNLVMGLSIDRVSLPGKVLVRVGFEDTREVSPYCILVCLTLEGEL 508
RNTSLPKIELQANGDDNLVMGL +DRVSLPGKV+V+VGFED REVSPYCILVCLTLEGEL
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCVDRVSLPGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420
Query: 509 IMFQFSSVNETEAPHETVSACDEEEDDIIVPADDRSQLFSESKKEFRED--DLKMQVTEK 568
IMFQFSSVNETEAPHETVSACD+EEDDI VP DDR SESKKE RE DLKMQVTEK
Sbjct: 421 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDR----SESKKESREANVDLKMQVTEK 480
Query: 569 LAISSEIPREKIKISNDIKSSNNDQSPVSKIDESATVSAESNTKSQKADSFIYSQSLKSS 628
+ ISSEIPREK+K SNDIKSSNND+SPVS IDESA VS E NTKSQK DSFI+SQSLKSS
Sbjct: 481 ITISSEIPREKVKTSNDIKSSNNDRSPVSNIDESAIVSPEGNTKSQKVDSFIHSQSLKSS 540
Query: 629 VLER-PNYEIGNFDKSVQKF-GLGSVSISGKPADVHSQPFPNVKESTKRLVSTGLLAASE 688
ER PN EIGNFDK V KF GLGSVSISGKP DV SQPFPNVKES KRL STGL+AASE
Sbjct: 541 APERPPNNEIGNFDKPVLKFTGLGSVSISGKPEDVPSQPFPNVKESQKRLGSTGLVAASE 600
Query: 689 LSSDKAMFLNKIDPVSSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLT 748
LSS+K MF KIDPVSSVLT NS QSS TENYGPSFGTANAF GF+GKPFQPKDVPSTLT
Sbjct: 601 LSSEKTMFFKKIDPVSSVLTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLT 660
Query: 749 QSGKQVTGGAGKIESLPVIRSSQISLQDNLSA-KISNEKHDGSDRNYSNAPLAKPMKEMC 808
QSG+QVTGGAGKIESLPVIRSSQISLQD S+ KISNEKHDGS+R YSN+PLAKPMKEMC
Sbjct: 661 QSGRQVTGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMC 720
Query: 809 EGLDMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERAQEVQNLF 868
EGLD LLESIEE GGF+DACTAFQKSSVEALE GLASLSDECQIW+STMNER QEVQNLF
Sbjct: 721 EGLDTLLESIEESGGFMDACTAFQKSSVEALELGLASLSDECQIWRSTMNERVQEVQNLF 780
Query: 869 DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMN----------- 928
DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMN
Sbjct: 781 DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQVGLFLFQKPL 840
Query: 929 --------------QNMTNQLIELERHFNGLELNKFGGNEESQASERALQRKFGYSRHSH 988
QN+TNQLIELERHFNGLELNKFGGNEESQ SERALQRKFG SRHSH
Sbjct: 841 NFSNFRCYLYSSFFQNITNQLIELERHFNGLELNKFGGNEESQVSERALQRKFGSSRHSH 900
Query: 989 SLHSLNNIMGSQLAAAQLLSESLSKQLAALNIESPSLKRQSVTKELFETIGLTYDASFGS 1048
SLHSLNNIMGSQLA AQLLSESLSKQLAALN+ESP LKRQS TKELFETIGLTYDASF S
Sbjct: 901 SLHSLNNIMGSQLATAQLLSESLSKQLAALNMESPPLKRQSATKELFETIGLTYDASFSS 960
Query: 1049 PNVNKIAEASSKKLLLSADSFSSKDTPRRKQQSGRKNSEAETGRRRRDSLDRNLASVEPP 1108
PNVNKIA+ SSKKLLLS+DSFSSK T RRKQQSG KNSEAETGRRRRDSLDRNLASV+PP
Sbjct: 961 PNVNKIADTSSKKLLLSSDSFSSKGTSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPP 1020
Query: 1109 KTTVKRMLLQGIPSSDEKLFRSRTPEGAATVAWPASRLTSSMSSSS------SKNAATPF 1168
KTTVKRMLLQG PSS+EK FRSRTPEGAATV PASR+TSS+SSSS S+N ATPF
Sbjct: 1021 KTTVKRMLLQGTPSSEEKQFRSRTPEGAATVERPASRITSSISSSSKNAGHDSENPATPF 1080
Query: 1169 MWASPLQPSNTSRQKSQPSPKTNTTAPSPLSVFQSSHEMLKKSNNEAFSVTSENKFI--- 1228
MWAS LQPSNTSRQKS P KTN TAPSP VFQSSH+MLKK+NN A S TSENKF
Sbjct: 1081 MWASVLQPSNTSRQKSLPLQKTNATAPSPPPVFQSSHDMLKKNNNAAHSATSENKFTDMA 1140
Query: 1229 --EKSKASDFFSVTRSDSVQKSNINLDQKSSIFTISSKQTPTLKDSINTSNSDNQKTANP 1288
EKSKASDFFS TRSDSVQKS IN+DQKSSIFTISSKQTP +DSI TSN DNQKTAN
Sbjct: 1141 CPEKSKASDFFSATRSDSVQKSKINVDQKSSIFTISSKQTPPPEDSIGTSNVDNQKTANV 1200
Query: 1289 KERHTTTSPLFGSANKPESASVGTMSSLVPTVNEARKTEEKRSPTMISPSVPAPARLNTP 1348
KERHTTTS LFGSANKPES VGTM SLVPTV+ ARKTEEK+S T IS SV APA LNT
Sbjct: 1201 KERHTTTSQLFGSANKPESPFVGTMPSLVPTVDGARKTEEKKSVTTISQSVSAPAPLNTS 1260
Query: 1349 SS-STLFSGFAVSKPLPSS---AAVIDLNQPVSTSTQLNFSSPVVSVSDSLFQAPKMIST 1408
SS STLFSGFAVSK LPSS AAV+DLNQP STSTQLNF SPVVS S+SLFQAPK + T
Sbjct: 1261 SSASTLFSGFAVSKSLPSSAAVAAVVDLNQPQSTSTQLNF-SPVVSGSNSLFQAPK-VPT 1320
Query: 1409 SSTLSSLNPSLESSKKELPVSKSDDDTEKQTPASKPESYELKFQPSVTP-DKNHVEPTSK 1468
S TLSSLNP++ESSK EL V KS+DD EKQT +SKP S+ELKFQPS+TP DKNHVEPTSK
Sbjct: 1321 SPTLSSLNPTMESSKTELSVLKSNDDAEKQTLSSKPGSHELKFQPSITPADKNHVEPTSK 1380
Query: 1469 THTVSKDVGGQVPNVIGDAQPQQPSVAFAPLPSSNLTPKIFGNVRNETSNVTVTPDDDMD 1528
T TV KDVGGQVPNV+GDAQ QQPSVAFA +PS NLT KIF N RNETSN VT DDDMD
Sbjct: 1381 TQTVFKDVGGQVPNVVGDAQAQQPSVAFASIPSQNLTSKIFANSRNETSNAVVTQDDDMD 1440
Query: 1529 EEAPETNNNIEFSLSSLGGFGNSSTPMSGAPKTNPFGGPFGNVNATSMTSSFTMASPPSG 1588
EEAPETNNN+EF+LSSLGGFGNSSTP+SGAPK NPFGGPFGNVNA S+T+SF MASPPSG
Sbjct: 1441 EEAPETNNNVEFNLSSLGGFGNSSTPISGAPKPNPFGGPFGNVNAASVTTSFNMASPPSG 1500
Query: 1589 ELFRPASFSFQSPLASQAASQPTNSVAFSGGFGSAMATQAPSQGGFGQPAQIGVGQQALG 1648
ELFRPASFSFQSPLASQAASQPTNSVAFSG FGSA+ATQAP QGGFGQPAQIGVGQQALG
Sbjct: 1501 ELFRPASFSFQSPLASQAASQPTNSVAFSGAFGSAVATQAPPQGGFGQPAQIGVGQQALG 1560
Query: 1649 NVLGSFGQSRQLGPSLPGTGSGSPGGFGGGFTSMKPVGGFASVGSSGGGSGGFAGVGSGG 1696
NVLGSFGQSRQLGP+LPGTGSGSPGGF GGFT+ KPV G GGFAGVGSGG
Sbjct: 1561 NVLGSFGQSRQLGPTLPGTGSGSPGGFSGGFTNAKPV-----------GVGGFAGVGSGG 1620
BLAST of ClCG11G012960 vs. ExPASy TrEMBL
Match:
A0A0A0KV45 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G583270 PE=4 SV=1)
HSP 1 Score: 2388.6 bits (6189), Expect = 0.0e+00
Identity = 1365/1724 (79.18%), Positives = 1439/1724 (83.47%), Query Frame = 0
Query: 89 MASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSE 148
MASVDS PS+LIPLEDAGEGEQIVRND YFQKIGKPVPVKL DSIFDP++PPSQPLALSE
Sbjct: 1 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 60
Query: 149 SFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDS 208
S GLIFVAHLSG+ KDVIASAEEIKNGGTGSSVQDLSIVD+SIGKVHIL +STD+S
Sbjct: 61 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 120
Query: 209 ILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLY 268
+LAA+VAGD+H+FSVQSLLDKA+ P SSCS+TDSSFIKDFKWTRKLE++YLVLSKHGQLY
Sbjct: 121 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180
Query: 269 QGSANGPPTHVMHDIDA------------------------------------------- 328
QGSANGP THVMHDIDA
Sbjct: 181 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD 240
Query: 329 ------VDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 388
VDCIKWVRADCIIIGCFQVTATGDEEDY V VIRSKDGKITDVSSNKVLLSF D
Sbjct: 241 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 300
Query: 389 IHSGFTRDILPGDSGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINID 448
IHSGFTRDILPG+SGPCLLLSYLD CKLAIVANRLY+E+HI LLGLL EVENEVAV+NID
Sbjct: 301 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360
Query: 449 RNTSLPKIELQANGDDNLVMGLSIDRVSLPGKVLVRVGFEDTREVSPYCILVCLTLEGEL 508
RNTSLPKIELQANGDDNLVMGL IDRVSL GKV+V+VGFED REVSPYCILVCLTLEGEL
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420
Query: 509 IMFQFSSVNETEAPHETVSACDEEEDDIIVPADDRSQLFSESKKEFRED--DLKMQVTEK 568
IMFQFSSVNETEAPHETVSACD+EEDDI VP DDRS+ KE RE D +MQVTEK
Sbjct: 421 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSE-----SKESREANIDHRMQVTEK 480
Query: 569 LAISSEIPREKIKISNDIKSSNNDQSPVSKIDESATVSAESNTKSQKADSFIYSQSLKSS 628
+AISSEIPREK K SNDIKSS NDQS V IDESA VS E NTKSQK DSFIYSQSLKSS
Sbjct: 481 IAISSEIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSS 540
Query: 629 VLER-PNYEIGNFDKSVQKF-GLGSVSISGKPADVHSQPFPNVKESTKRLVSTGLLAASE 688
ER P+YEIGNFDK V KF GLGS SISGK DV SQPFPNVKESTKRL STGL+AASE
Sbjct: 541 APERPPHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASE 600
Query: 689 LSSDKAMFLNKIDPVSSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLT 748
LSS+KAM KIDPV SV T NS QSS TENYGPSFGTANAF GF+GKPFQPKDVPSTLT
Sbjct: 601 LSSEKAMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLT 660
Query: 749 QSGKQVTGGAGKIESLPVIRSSQISLQDNLSA-KISNEKHDGSDRNYSNAPLAKPMKEMC 808
QSG+Q TGGAGKIESLPVIRSSQISLQD S+ KISNEKHDGS+R YSN+PLAKPMKEMC
Sbjct: 661 QSGRQATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMC 720
Query: 809 EGLDMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERAQEVQNLF 868
EGLD LLESIEE GGF+DACTAFQKSSVEALE GLASLSD CQIW+STMNER+QEVQNLF
Sbjct: 721 EGLDTLLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLF 780
Query: 869 DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELE 928
DKMVQVLSKKTYIEGIVMQ+SDSKYWEQWDRQKLSSELELKRQHILKMNQN+TNQLIELE
Sbjct: 781 DKMVQVLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELE 840
Query: 929 RHFNGLELNKFGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSK 988
RHFNGLELNKFGGNEESQ SERALQRKFG SRHSHS+HSLNNIMGSQLA AQLLSESLSK
Sbjct: 841 RHFNGLELNKFGGNEESQVSERALQRKFGSSRHSHSVHSLNNIMGSQLATAQLLSESLSK 900
Query: 989 QLAALNIESPSLKRQSVTKELFETIGLTYDASFGSPNVNKIAEASSKKLLLSADSFSSKD 1048
QLAALN+ESPSLKRQS TKELFE+IGLTYDASF SPNVNKIAE SSKKLLLS+DSFSSK
Sbjct: 901 QLAALNMESPSLKRQSATKELFESIGLTYDASFSSPNVNKIAETSSKKLLLSSDSFSSKG 960
Query: 1049 TPRRKQQSGRKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSRTP 1108
T RRKQQSG KNSEAETGRRRRDSLDRNLASV+PPKTTVKRMLLQGIPSS+EK F SRTP
Sbjct: 961 TSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGIPSSEEKQFCSRTP 1020
Query: 1109 EGAATVAWPASRLTSSMSSSS------SKNAATPFMWASPLQPSNTSRQKSQPSPKTNTT 1168
EGAATVA PASR+TSS+SSSS S+N TPFMW SPLQPSNTSRQKS P K N T
Sbjct: 1021 EGAATVARPASRITSSISSSSKNAGHDSENPETPFMWNSPLQPSNTSRQKSLPLQKINVT 1080
Query: 1169 APSPLSVFQSSHEMLKKSNNEAFSVTSENKFI-----EKSKASDFFSVTRSDSVQKSNIN 1228
PSP VFQSSH+MLKK NNEA SVTSENKF EKSKASDFFS TRSDSVQKSNIN
Sbjct: 1081 PPSPPPVFQSSHDMLKKKNNEAHSVTSENKFTDVACPEKSKASDFFSATRSDSVQKSNIN 1140
Query: 1229 LDQKSSIFTISSKQTPTLKDSINTSNSDNQKTANPKERHTTTSPLFGSANKPESASVGTM 1288
+DQKSSIFTISSKQ PT DSI TSN DNQKTAN KERHTTTSP FGSANKPES VG+M
Sbjct: 1141 VDQKSSIFTISSKQMPTPIDSIATSNVDNQKTANVKERHTTTSPFFGSANKPESPFVGSM 1200
Query: 1289 SSLVPTVNEARKTEEKRSPTMISPSVPAPARLNTPSS-STLFSGFAVSKPLPSSAAVIDL 1348
SLVPTV+ +RKTEEK+S T IS SV APA LNT SS STLFSGFAVSK LPSSAAVIDL
Sbjct: 1201 PSLVPTVDGSRKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKALPSSAAVIDL 1260
Query: 1349 NQPVSTSTQLNFSSPVVSVSDSLFQAPKMISTSSTLSSLNPSLESSKKELPVSKSDDDTE 1408
NQP STSTQLNFSSPVVS S+SLFQAPK++ TS TLSSLNP+LESSK EL V KS+DD E
Sbjct: 1261 NQPPSTSTQLNFSSPVVSSSNSLFQAPKIVPTSPTLSSLNPTLESSKTELSVPKSNDDAE 1320
Query: 1409 KQTPASKPESYELKFQPSVTP-DKNHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSVAF 1468
+Q +SKP S+ELKFQPS+TP DKNHVEPTSKT TV KDVGGQ NV+G+AQPQQPSVAF
Sbjct: 1321 EQILSSKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQDSNVVGNAQPQQPSVAF 1380
Query: 1469 APLPSSNLTPKIFGNVRNETSNVTVTPDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPMS 1528
A +PS NLT KIF N RNETSN VT DDDMDEEAPETNNN+EF+LSSLGGFGNSSTP+S
Sbjct: 1381 ASIPSPNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPIS 1440
Query: 1529 GAPKTNPFGGPFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAF 1588
G PK NPFGGPFGNVNA SMTSSF MASPPSGELFRPASFSFQSPLASQAASQPTNSVAF
Sbjct: 1441 GGPKPNPFGGPFGNVNAASMTSSFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVAF 1500
Query: 1589 SGGFGSAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSPGGFG 1648
SG FGSA+ TQ PSQGGFGQP+QIGVGQQALGNVLGSFGQSRQLGP++ GTGSGSPGGF
Sbjct: 1501 SGAFGSAVPTQPPSQGGFGQPSQIGVGQQALGNVLGSFGQSRQLGPTVHGTGSGSPGGFS 1560
Query: 1649 GGFTSMKPVGGFASVGSSGGGSGGFAGVGSGGGGGFGGVGSNGGGFAGTVPTGGGFAGAS 1696
GGFT+ KPV G GGFAGVGSGGGGGFGGV GGFAG TGGGFAGAS
Sbjct: 1561 GGFTNAKPV-----------GVGGFAGVGSGGGGGFGGV----GGFAGAASTGGGFAGAS 1620
BLAST of ClCG11G012960 vs. ExPASy TrEMBL
Match:
A0A1S3BDU8 (LOW QUALITY PROTEIN: nuclear pore complex protein NUP214 OS=Cucumis melo OX=3656 GN=LOC103488807 PE=4 SV=1)
HSP 1 Score: 2385.1 bits (6180), Expect = 0.0e+00
Identity = 1364/1682 (81.09%), Positives = 1439/1682 (85.55%), Query Frame = 0
Query: 89 MASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSE 148
MASVDS S LIPLEDAGEGEQIVRNDFYFQKIGKPVPVKL DSIFDP++PPSQP+ALSE
Sbjct: 1 MASVDSGSSPLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLGDSIFDPESPPSQPIALSE 60
Query: 149 SFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDS 208
S GLIFVAHLSG+ KDVIASA+EIKNGGT SSVQDLSIVD+SIGKVHIL +STD+S
Sbjct: 61 SSGLIFVAHLSGFFVVRIKDVIASAQEIKNGGTCSSVQDLSIVDVSIGKVHILAVSTDNS 120
Query: 209 ILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLY 268
+LAA+VAGD+H+FSVQSLLDKA+ P SSCS+TDSSFIKDFKWTRKLE++YLVLSKHGQLY
Sbjct: 121 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180
Query: 269 QGSANGPPTHVMHDIDA------------------------------------------- 328
QGS NGP THVMHDIDA
Sbjct: 181 QGSVNGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHRFKERLSMSLLPSLGNGETD 240
Query: 329 ------VDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 388
VDCIKWVRADCIIIGCFQVTATGDEEDY VLVI+SKDGKITDVSSNKVLLSF D
Sbjct: 241 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVLVIKSKDGKITDVSSNKVLLSFCD 300
Query: 389 IHSGFTRDILPGDSGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINID 448
IHSGFTRDILPG+SGPCLLLSYLD CKLAIVANRLY+E+HI LLGLL EVENEVAV+NID
Sbjct: 301 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360
Query: 449 RNTSLPKIELQANGDDNLVMGLSIDRVSLPGKVLVRVGFEDTREVSPYCILVCLTLEGEL 508
RNTSLPKIELQANGDDNLVMGL +DRVSLPGKV+V+VGFED REVSPYCILVCLTLEGEL
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCVDRVSLPGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420
Query: 509 IMFQFSSVNETEAPHETVSACDEEEDDIIVPADDRSQLFSESKKEFRED--DLKMQVTEK 568
IMFQFSSVNETEAPHETVSACD+EEDDI VP DDR SESKKE RE DLKMQVTEK
Sbjct: 421 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDR----SESKKESREANVDLKMQVTEK 480
Query: 569 LAISSEIPREKIKISNDIKSSNNDQSPVSKIDESATVSAESNTKSQKADSFIYSQSLKSS 628
+ ISSEIPREK+K SNDIKSSNND+SPVS IDESA VS E NTKSQK DSFI+SQSLKSS
Sbjct: 481 ITISSEIPREKVKTSNDIKSSNNDRSPVSNIDESAIVSPEGNTKSQKVDSFIHSQSLKSS 540
Query: 629 VLER-PNYEIGNFDKSVQKF-GLGSVSISGKPADVHSQPFPNVKESTKRLVSTGLLAASE 688
ER PN EIGNFDK V KF GLGSVSISGKP DV SQPFPNVKES KRL STGL+AASE
Sbjct: 541 APERPPNNEIGNFDKPVLKFTGLGSVSISGKPEDVPSQPFPNVKESQKRLGSTGLVAASE 600
Query: 689 LSSDKAMFLNKIDPVSSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLT 748
LSS+K MF K+ VSSVLT NS QSS TENYGPSFGTANAF GF+GKPFQPKDVPSTLT
Sbjct: 601 LSSEKTMFFKKL-IVSSVLTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLT 660
Query: 749 QSGKQVTGGAGKIESLPVIRSSQISLQDNLSA-KISNEKHDGSDRNYSNAPLAKPMKEMC 808
QSG+QVTGGAGKIESLPVIRSSQISLQD S+ KISNEKHDGS+R YSN+PLAKPMKEMC
Sbjct: 661 QSGRQVTGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMC 720
Query: 809 EGLDMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERAQEVQNLF 868
EGLD LLESIEE GGF+DACTAFQKSSVEALE GLASLSDECQIW+STMNER QEVQNLF
Sbjct: 721 EGLDTLLESIEESGGFMDACTAFQKSSVEALELGLASLSDECQIWRSTMNERVQEVQNLF 780
Query: 869 DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELE 928
DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQN+TNQLIELE
Sbjct: 781 DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELE 840
Query: 929 RHFNGLELNKFGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSK 988
RHFNGLELNKFGGNEESQ SERALQRKFG SRHSHSLHSLNNIMGSQLA AQLLSESLSK
Sbjct: 841 RHFNGLELNKFGGNEESQVSERALQRKFGSSRHSHSLHSLNNIMGSQLATAQLLSESLSK 900
Query: 989 QLAALNIESPSLKRQSVTKELFETIGLTYDASFGSPNVNKIAEASSKKLLLSADSFSSKD 1048
QLAALN+ESP LKRQS TKELFETIGLTYDASF SPNVNKIA+ SSKKLLLS+DSFSSK
Sbjct: 901 QLAALNMESPPLKRQSATKELFETIGLTYDASFSSPNVNKIADTSSKKLLLSSDSFSSKG 960
Query: 1049 TPRRKQQSGRKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSRTP 1108
T RRKQQSG KNSEAETGRRRRDSLDRNLASV+PPKTTVKRMLLQG PSS+EK FRSRTP
Sbjct: 961 TSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGTPSSEEKQFRSRTP 1020
Query: 1109 EGAATVAWPASRLTSSMSSSS------SKNAATPFMWASPLQPSNTSRQKSQPSPKTNTT 1168
EGAATV PASR+TSS+SSSS S+N ATPFMWAS LQPSNTSRQKS P KTN T
Sbjct: 1021 EGAATVERPASRITSSISSSSKNAGHDSENPATPFMWASVLQPSNTSRQKSLPLQKTNAT 1080
Query: 1169 APSPLSVFQSSHEMLKKSNNEAFSVTSENKF----IEKSKASDFFSVTRSDSVQKSNINL 1228
APSP VFQSSH+MLKK + + EKSKASDFFS TRSDSVQKS IN+
Sbjct: 1081 APSPPPVFQSSHDMLKKIIMQLTVRLQKTNLRTWHPEKSKASDFFSATRSDSVQKSKINV 1140
Query: 1229 DQKSSIFTISSKQTPTLKDSINTSNSDNQKTANPKERHTTTSPLFGSANKPESASVGTMS 1288
DQKSSIFTISSKQTP +DSI TSN DNQKTAN KERHTTTS LFGSANKPES VGTM
Sbjct: 1141 DQKSSIFTISSKQTPPPEDSIGTSNVDNQKTANVKERHTTTSQLFGSANKPESPFVGTMP 1200
Query: 1289 SLVPTVNEARKTEEKRSPTMISPSVPAPARLNTPSS-STLFSGFAVSKPLPSS---AAVI 1348
SLVPTV+ ARKTEEK+S T IS SV APA LNT SS STLFSGFAVSK LPSS AAV+
Sbjct: 1201 SLVPTVDGARKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKSLPSSAAVAAVV 1260
Query: 1349 DLNQPVSTSTQLNFSSPVVSVSDSLFQAPKMISTSSTLSSLNPSLESSKKELPVSKSDDD 1408
DLNQP STSTQLNF SPVVS S+SLFQAPK + TS TLSSLNP++ESSK EL V KS+DD
Sbjct: 1261 DLNQPQSTSTQLNF-SPVVSGSNSLFQAPK-VPTSPTLSSLNPTMESSKTELSVLKSNDD 1320
Query: 1409 TEKQTPASKPESYELKFQPSVTP-DKNHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSV 1468
EKQT +SKP S+ELKFQPS+TP DKNHVEPTSKT TV KDVGGQVPNV+GDAQ QQPSV
Sbjct: 1321 AEKQTLSSKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQVPNVVGDAQAQQPSV 1380
Query: 1469 AFAPLPSSNLTPKIFGNVRNETSNVTVTPDDDMDEEAPETNNNIEFSLSSLGGFGNSSTP 1528
AFA +PS NLT KIF N RNETSN VT DDDMDEEAPETNNN+EF+LSSLGGFGNSSTP
Sbjct: 1381 AFASIPSQNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTP 1440
Query: 1529 MSGAPKTNPFGGPFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSV 1588
+SGAPK NPFGGPFGNVNA S+T+SF MASPPSGELFRPASFSFQSPLASQAASQPTNSV
Sbjct: 1441 ISGAPKPNPFGGPFGNVNAASVTTSFNMASPPSGELFRPASFSFQSPLASQAASQPTNSV 1500
Query: 1589 AFSGGFGSAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSPGG 1648
AFSG FGSA+ATQAP QGGFGQPAQIGVGQQALGNVLGSFGQSRQLGP+LPGTGSGSPGG
Sbjct: 1501 AFSGAFGSAVATQAPPQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPTLPGTGSGSPGG 1560
Query: 1649 FGGGFTSMKPVGGFASVGSSGGGSGGFAGVGSGGGGGFGGVGSNGGGFAGTVPTGGGFAG 1696
F GGFT+ KPV G GGFAGVGSGGGGGFGGV GGFAG TGGGFAG
Sbjct: 1561 FSGGFTNAKPV-----------GVGGFAGVGSGGGGGFGGV----GGFAGAASTGGGFAG 1620
BLAST of ClCG11G012960 vs. ExPASy TrEMBL
Match:
A0A6J1CBF2 (nuclear pore complex protein NUP214 OS=Momordica charantia OX=3673 GN=LOC111010057 PE=4 SV=1)
HSP 1 Score: 2142.5 bits (5550), Expect = 0.0e+00
Identity = 1252/1714 (73.05%), Positives = 1361/1714 (79.40%), Query Frame = 0
Query: 86 LQFMASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLA 145
LQ S ST I E+A EGE + D+YF+KIG+PVPVKL DSIFD ++PPSQPLA
Sbjct: 4 LQDSTPSTSSTSTPIRFEEAEEGEHVESTDYYFEKIGEPVPVKLHDSIFDSESPPSQPLA 63
Query: 146 LSESFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLST 205
+SESFGLIFVAHLSG+ T+DVIASA+EIKNGGTGSSVQDLSI+D+S+G+VHIL LS
Sbjct: 64 VSESFGLIFVAHLSGFFVARTEDVIASAKEIKNGGTGSSVQDLSIMDVSVGRVHILALSA 123
Query: 206 DDSILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHG 265
D S +AA+VA DIHLFSV SLLDKA P SCS+TDSS IKDFKW RKLE SYLVLSKHG
Sbjct: 124 DSSTIAAVVAADIHLFSVHSLLDKAAKPFYSCSITDSSCIKDFKWIRKLESSYLVLSKHG 183
Query: 266 QLYQGSANGPPTHVMHDIDA---------------------------------------- 325
QLYQGSANG HVMHD DA
Sbjct: 184 QLYQGSANGTLKHVMHDTDAVECSVKGRFIAVAKKDTLTIFSSKFKERLSMSLLPSDADS 243
Query: 326 -----VDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRDI 385
VDCIKWVRADCII+GCF+VTA GDEE+YFV VIRSKDGKITDVSSN+VLLSF+ I
Sbjct: 244 NFIVKVDCIKWVRADCIILGCFEVTAIGDEENYFVQVIRSKDGKITDVSSNRVLLSFQYI 303
Query: 386 HSGFTRDILPGDSGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINIDR 445
H GFTRDILP SGPCL SYL KCKLAIVANR ++HIVLLG L EVEN+VAVI+I+R
Sbjct: 304 HPGFTRDILPVGSGPCLFSSYLGKCKLAIVANRNNTDQHIVLLGWLPEVENQVAVIDIER 363
Query: 446 NTSLPKIELQANGDDNLVMGLSIDRVSLPGKVLVRVGFEDTREVSPYCILVCLTLEGELI 505
+TSLP+IELQ NGDDNLVMGL IDRVSLP KV ++VG ED REVSPYCIL+CLTLEG+L+
Sbjct: 364 DTSLPRIELQENGDDNLVMGLCIDRVSLPAKVKIQVGVEDMREVSPYCILLCLTLEGKLV 423
Query: 506 MFQFSSVNETEAPHETVSAC-DEEEDDIIVPADDRSQLFSESKKEFREDDL-KMQVTEKL 565
MF SS+NETE PHETVSAC DEEEDD IVP DD+ Q+ SES+KE RE + +M T+K+
Sbjct: 424 MFHLSSINETETPHETVSACEDEEEDDTIVPIDDQPQVSSESRKELREAMVGQMHDTDKI 483
Query: 566 AISSEIPREKIKISNDIKSSNNDQSPVSKIDESATVSAESNTKSQKADSFIYSQSLKSSV 625
SSEIP EKI ISNDIK S+ DQSPVS ID+SA VS ESN+KS+K SFIYSQ LKSS+
Sbjct: 484 TTSSEIPEEKINISNDIKPSDIDQSPVSYIDKSAIVSRESNSKSEKVGSFIYSQPLKSSI 543
Query: 626 LERPNYEIGNFDKSVQKF-GLGSVSISGKPADVHSQPFPNVKESTKRLVSTGLLAASELS 685
LE+PN EIGNF K VQKF GLGSV+ SG+ ADV SQPF N KEST RL STGL ASELS
Sbjct: 544 LEKPNSEIGNFGKPVQKFTGLGSVAFSGQSADVPSQPFLNAKESTLRLGSTGLQDASELS 603
Query: 686 SDKAMFLNKIDPVSSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLTQS 745
SD+AMFLNKIDP SSVL NS QS+KT+N GPSFG ANAF F+G+ FQ KDV STLTQ
Sbjct: 604 SDRAMFLNKIDPASSVLPLNSLQSTKTDNLGPSFGAANAFTAFTGRSFQTKDVSSTLTQI 663
Query: 746 GKQVTGGAGKIESLPVIRSSQISLQDNLS-AKISNEKHDGSDRNYSNAPLAKPMKEMCEG 805
G+QVT GAGKIESLP +RSSQ+ LQDN S K SNEKH S+RNYSN PLAKPMKEMC+G
Sbjct: 664 GRQVTAGAGKIESLPPMRSSQVPLQDNFSLGKTSNEKHSRSERNYSNVPLAKPMKEMCDG 723
Query: 806 LDMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERAQEVQNLFDK 865
LDMLLESIEEPGGF DACTA QKSS+EALE GLA+LSD+CQIW TMNERAQE+QNLFDK
Sbjct: 724 LDMLLESIEEPGGFWDACTASQKSSIEALELGLATLSDQCQIWGRTMNERAQEIQNLFDK 783
Query: 866 MV-QVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELER 925
V QV+ KKTYIEGIV QAS S YWE WDRQ+LSSELELKRQHILK NQNMTNQLIELER
Sbjct: 784 TVNQVMPKKTYIEGIVKQASHSHYWEHWDRQRLSSELELKRQHILKTNQNMTNQLIELER 843
Query: 926 HFNGLELNKFGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSKQ 985
HFNGLELNKFGGN+ESQ SERALQRKFG SRHSHS HSLNNI GSQLAAAQLLSESLSKQ
Sbjct: 844 HFNGLELNKFGGNDESQVSERALQRKFGSSRHSHSFHSLNNITGSQLAAAQLLSESLSKQ 903
Query: 986 LAALNIESPSLKRQSVTKELFETIGLTYDASFGSPNVNKIAEASSKKLLLSADSFSSKDT 1045
+AALNIESPS KRQSVTKELFETIG+TYDASF SPNVNKIAE SSKKLLLSADSFSSKD+
Sbjct: 904 MAALNIESPSSKRQSVTKELFETIGITYDASFSSPNVNKIAETSSKKLLLSADSFSSKDS 963
Query: 1046 PRRKQQSGRKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSRTPE 1105
RRK +SG KNSEAETGRRRR+SLDRNLASVEPPKTTVKRMLL+GIP +DEK FRS TPE
Sbjct: 964 SRRKLRSGMKNSEAETGRRRRESLDRNLASVEPPKTTVKRMLLEGIPLADEKHFRSPTPE 1023
Query: 1106 GAATVAWPASRLTSSMSSSSSKNA-------ATPFMWASPLQPSNTSRQKSQPSPKTNTT 1165
G ATV PASR+ SSM SSSSKNA ATPFMW+SP Q SN SRQKSQP KTN T
Sbjct: 1024 GTATVTRPASRIASSMLSSSSKNAEHSSENPATPFMWSSPSQSSNISRQKSQPLKKTNAT 1083
Query: 1166 APSPLS-VFQSSHEMLKKSNNEAFSVTSENKFI-----EKSKASDFFSVTRSDSVQKSNI 1225
APSPL V+QSSHEM KKSN EA+SVTS+NKF EKSK+SDF S+TRSDSVQKSNI
Sbjct: 1084 APSPLPVVYQSSHEMPKKSNTEAYSVTSDNKFTEATYPEKSKSSDFLSLTRSDSVQKSNI 1143
Query: 1226 NLDQKSSIFTISSKQTPTLKDSINTSNSDNQKTANPKERHTTTSPLFGSANKPESASVGT 1285
NLDQKSSIF IS+ Q PTLKDSINTSN + QKTAN KERHT S LF SANKPESA VGT
Sbjct: 1144 NLDQKSSIFKISNNQMPTLKDSINTSNLNGQKTANVKERHTPKSSLFESANKPESAFVGT 1203
Query: 1286 MSSLVPTVNEARKTEEKRSPTMISPSVPAPARLNTPSS-STLFSGFAVSKPLPSSAAVID 1345
S+ VPTV ARKTEEK S T SPSVPAPA LNTPSS STLFSGF+V+K L +S A +D
Sbjct: 1204 ASTPVPTVLGARKTEEKTSLTAFSPSVPAPALLNTPSSASTLFSGFSVTKSLTNSTAHVD 1263
Query: 1346 LNQPVSTSTQLNFSSPVVSVSDSLFQAPKMISTSSTLSSLNPSLESSKKELPVSKSDDDT 1405
LN+P+ST TQ NFSSP VSVSDSLFQAPKM+S S P+ SKKELP KSD DT
Sbjct: 1264 LNKPLSTFTQSNFSSPAVSVSDSLFQAPKMVSPS-------PTTLESKKELPGPKSDADT 1323
Query: 1406 EKQTPASK-PESYELKFQPSVTP-DKNHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSV 1465
K P SK PES+ELK QPSVTP DKNHVEPTS + TV KDVGG VPNV+ QQ S
Sbjct: 1324 PKPAPDSKPPESHELKLQPSVTPADKNHVEPTSGSQTVPKDVGGLVPNVL-----QQSSA 1383
Query: 1466 AFAPLPSSNLTPKIFGNVRNETSNVTVTPDDDMDEEAPETNNNIEFSLSSLGGFGNSSTP 1525
AF PLP+ NLT K N +NETS+ +T DDDMDEEAPET NN+EFSLSSLGGFGNSSTP
Sbjct: 1384 AFVPLPTLNLTSKSSTNGKNETSDAALTQDDDMDEEAPET-NNVEFSLSSLGGFGNSSTP 1443
Query: 1526 MSGAPKTNPFGGPFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSV 1585
+S APK+NPFGGPFGNVNATSM SSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSV
Sbjct: 1444 ISSAPKSNPFGGPFGNVNATSMNSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSV 1503
Query: 1586 AFSGGFGSAMAT--QAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSP 1645
AFSGGFGS MAT Q SQGGFGQPAQIGVGQQALG VLG+FG+SRQLGPSLPGT SGSP
Sbjct: 1504 AFSGGFGSGMATQPQTSSQGGFGQPAQIGVGQQALGTVLGAFGRSRQLGPSLPGTASGSP 1563
Query: 1646 GGFGGGFTSMKPVGGFASVGSSGGG---------SGGFAGVGSGGGGGFGGVGSN----- 1696
GF GGFT +KP+GGFA VGS GG GGF GVGSG GGGFG VGS+
Sbjct: 1564 SGFSGGFTGVKPIGGFAGVGSGSGGGFGGVGSVSGGGFGGVGSGSGGGFGAVGSSSGGGF 1623
BLAST of ClCG11G012960 vs. ExPASy TrEMBL
Match:
A0A6J1HNV2 (nuclear pore complex protein NUP214 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111466378 PE=4 SV=1)
HSP 1 Score: 2041.5 bits (5288), Expect = 0.0e+00
Identity = 1198/1721 (69.61%), Positives = 1317/1721 (76.53%), Query Frame = 0
Query: 89 MASVDSR---PSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLA 148
MASVDSR ST IPLED+ EGE + ND+YF+KIG+PVPVKL DSIFDP +PPSQPLA
Sbjct: 1 MASVDSRHSTSSTPIPLEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA 60
Query: 149 LSESFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLST 208
+SESFGLIFVAHLSG+ TKDV+ASA+E+KNGGTGSS+QDLSIVD+S+GKVH+L LS
Sbjct: 61 VSESFGLIFVAHLSGFFAVRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN 120
Query: 209 DDSILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHG 268
D+S LAA+VAGD+HLF V SLLDK + PS SCS TDSS IKDFKWTRK E+SYLVLSKHG
Sbjct: 121 DNSFLAAVVAGDVHLFLVDSLLDKGEEPSFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG 180
Query: 269 QLYQGSANGPPTHVMHDIDAVDC------------------------------------- 328
+LYQGSA+GP H+MHDIDAV+C
Sbjct: 181 KLYQGSASGPFKHIMHDIDAVECSVKGKFIAVAKKDTLTVFSYKFKERLSMSLLPSLGNG 240
Query: 329 ------------IKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLS 388
IKWVRADCIIIGCFQVTATGDEEDYFV VIRSKDGKITDVSSNKVLLS
Sbjct: 241 DTDTDFAVKVDSIKWVRADCIIIGCFQVTATGDEEDYFVQVIRSKDGKITDVSSNKVLLS 300
Query: 389 FRDIHSGFTRDILPGDSGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVI 448
F DI+SGFT DILP ++GPCLLLSYLDKCKLAIVANR ++HIVLLG LQEVENEVAVI
Sbjct: 301 FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI 360
Query: 449 NIDRNTSLPKIELQANGDDNLVMGLSIDRVSLPGKVLVRVGFEDTREVSPYCILVCLTLE 508
+I+R+ SLP+IELQ NGDDNLVMGL IDRVSLPGKV V+VG E+ REVSPYC L+CLTLE
Sbjct: 361 DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE 420
Query: 509 GELIMFQFSSVNETEAPHETVSACD-EEEDDIIVPADDRSQLFSESKKEFREDDLKMQVT 568
G+LI+F FSS NE+EA ETVSACD EEED+ +VP DD+ QLF
Sbjct: 421 GKLILFHFSSANESEASDETVSACDEEEEDETVVPTDDQPQLF----------------- 480
Query: 569 EKLAISSEIPREKIKISNDIKSSNNDQSPVSKIDESATVSAESNTKSQKADSFIYSQSLK 628
SN DQ PVSK+D S ++ ESN KSQ+ DS +SQ LK
Sbjct: 481 ----------------------SNIDQRPVSKVDGSPVITRESNAKSQQMDSLAFSQPLK 540
Query: 629 SSVLERPNYEIGNFDKSVQKF-GLGSVSISGKPADVHSQPFP------------NVKEST 688
S LERPN EIGNF K V+ F GLGSV+ SG+ DV SQP N +
Sbjct: 541 PSTLERPNNEIGNFAKPVKNFTGLGSVAFSGQSVDVPSQPLKSSILERPNNEIGNFNKPF 600
Query: 689 KRLVSTGLLAASELSSD------KAMFLNKID--------PV-------------SSVLT 748
+ G +A S S D K FL + + PV SV
Sbjct: 601 HKFTGLGSVAFSGQSVDMPNESLKPSFLERPNNQIGNFDKPVQKFTGLGSVAFSEQSVDV 660
Query: 749 P-NSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLTQSGKQVTGGAGKIESLPVI 808
P + F + K S G ANAF GF+GKPFQPKDVPSTLTQSG+QV+ GAGKIESLPVI
Sbjct: 661 PSHPFLNVKESTIKHSSGAANAFTGFAGKPFQPKDVPSTLTQSGRQVSAGAGKIESLPVI 720
Query: 809 RSSQISLQDNLS-AKISNEKHDGSDRNYSNAPLAKPMKEMCEGLDMLLESIEEPGGFLDA 868
+SSQ+SLQDN S KISN+K DGS+RNY N PLAKPM EMCEGLDMLLESIEEPGGFLDA
Sbjct: 721 QSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMNEMCEGLDMLLESIEEPGGFLDA 780
Query: 869 CTAFQKSSVEALERGLASLSDECQIWKSTMNERAQEVQNLFDKMVQVLSKKTYIEGIVMQ 928
CT FQKSSVEAL GLA+LSD+CQIW+ TM ERAQEVQNLFD+ V+VLSKKTYIEGIV Q
Sbjct: 781 CTTFQKSSVEALGLGLATLSDQCQIWRRTMTERAQEVQNLFDRTVEVLSKKTYIEGIVTQ 840
Query: 929 ASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELERHFNGLELNKFGGNEESQA 988
ASDS YW+ WDRQKLSSELELKRQ IL+MNQNMTNQLIELERHFNGLELN FGGNEE Q
Sbjct: 841 ASDSNYWDHWDRQKLSSELELKRQRILQMNQNMTNQLIELERHFNGLELNTFGGNEEIQV 900
Query: 989 SERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSKQLAALNIESPSLKRQSVTK 1048
+ER LQRKFG SR SHSLHSLNNIMGSQLAAAQLLS++LSKQ+A LNI+SPS KRQS+TK
Sbjct: 901 NERTLQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNLSKQIATLNIKSPSSKRQSITK 960
Query: 1049 ELFETIGLTYDASFGSPNVNKIAEASSKKLLLSADSFSSKDTPRRKQQSGRKNSEAETGR 1108
ELFETIG+TYDASF SPNVNKI E SSKKLLLSADSFSSKDT RRKQ+SG K SE ETGR
Sbjct: 961 ELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSSKDTSRRKQRSGAKISETETGR 1020
Query: 1109 RRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSRTPEGAATVAWPASRLTSSMSS 1168
RRRDSLDRNLAS++PPKTTVKRM+LQG P S+EK FRS T EG ATVA PA R+ SSM S
Sbjct: 1021 RRRDSLDRNLASIQPPKTTVKRMILQGTPLSNEKQFRSPTLEGPATVARPAGRIPSSMLS 1080
Query: 1169 SSSKNA-------ATPFMWASPLQPSNTSRQKSQPSPKTNTTAPSPLSVFQSSHEMLKKS 1228
SSSKNA ATPF WASP RQK QP KTN TAPSPL V+QSSHEM+KKS
Sbjct: 1081 SSSKNAEQGSENPATPFSWASP------PRQKFQPLQKTNGTAPSPLPVYQSSHEMVKKS 1140
Query: 1229 NNEAFSVTSENKFI-----EKSKASDFFSVTRSDSVQKSNINLDQKSSIFTISSKQTPTL 1288
N+EA+S SENKF EKSKASDFFS+ RSDSVQKSN+N +QKSS F SSK T
Sbjct: 1141 NSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQKSSFFVTSSKPMSTP 1200
Query: 1289 KDSINTSNSDNQKTANPKERHTTTSPLFGSANKPESASVGTMSSLVPTVNEARKTEEKRS 1348
KDSI T N ++QKTAN KER TT SPLFG+ANKPE ASVGT SSLVPTV+E RKTEEK+
Sbjct: 1201 KDSIETLNPNSQKTANVKERLTTPSPLFGAANKPEPASVGTTSSLVPTVDELRKTEEKKP 1260
Query: 1349 PTMISPSVPAPARLNTPSS-STLFSGFAVSKPLPSSAAVIDLNQPVSTSTQLNFSSPVVS 1408
PT+ SPSVPA +NTPSS STLFSG +SK PS AAV+DLN+P+STSTQ +F+SPVVS
Sbjct: 1261 PTVFSPSVPASVPVNTPSSASTLFSGSPLSKSFPSPAAVVDLNKPLSTSTQSSFASPVVS 1320
Query: 1409 VSDSLFQAPKMISTSSTLSSLNPSLESSKKELPVSKSDDDTEKQTPASKPESYELKFQPS 1468
VSDSLFQAPKM+S STLSSLNPSL SS KE P+ KSD DTEKQ ASKPE ELK QPS
Sbjct: 1321 VSDSLFQAPKMVSPPSTLSSLNPSLVSSSKEQPIPKSDADTEKQAQASKPEFRELKLQPS 1380
Query: 1469 VT-PDKNHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSVAFAPLPSSNLTPKIFGNVRN 1528
VT NHVEPTS T TVSKDVGG VP+VI DAQPQQ S AF PLPS N TPK+ N ++
Sbjct: 1381 VTLAVGNHVEPTSVTQTVSKDVGGHVPSVIADAQPQQSSAAFVPLPSPNSTPKVSANGKS 1440
Query: 1529 ETSNVTVTPDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPMSGAPKTNPFGGPFGNVNAT 1588
ETS+ +T DDDMDEEAPET NN+EFSLSSLGGFG +STPMS APK NPFGG FGN NAT
Sbjct: 1441 ETSDALITQDDDMDEEAPET-NNVEFSLSSLGGFGTTSTPMSNAPKPNPFGGSFGNANAT 1500
Query: 1589 SMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGGFGSAMATQAPSQGGF 1648
SM SSFT ASPPSGELFRPASFSFQSPLASQAASQPTNSVAFS FGS MATQAP+QGGF
Sbjct: 1501 SMNSSFTTASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSSSFGSGMATQAPTQGGF 1560
Query: 1649 GQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSPGGF-GGGFTSMKPVGGFASVGS 1696
GQPAQIGVGQQALG VLGSFGQSRQLGPSLPGT SGSPGGF GGGFTS+KPVG
Sbjct: 1561 GQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGFTSVKPVG------- 1620
BLAST of ClCG11G012960 vs. TAIR 10
Match:
AT1G55540.1 (Nuclear pore complex protein)
HSP 1 Score: 797.0 bits (2057), Expect = 3.0e-230
Identity = 700/1882 (37.19%), Positives = 959/1882 (50.96%), Query Frame = 0
Query: 100 IPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSESFGLIFVAHLS 159
+ +E+ EG++I ND+YF++IG+P+ +K D+ +D + PPSQPLA+SE ++FVAH S
Sbjct: 4 VEIEEDTEGDRISTNDYYFERIGEPISIKEDDAQYDLENPPSQPLAISERHAVLFVAHSS 63
Query: 160 GW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDSILAAIVAGDIH 219
G+ T DVI++++ G +QDLS+VD+ +G V IL+LS DDSILA VA DIH
Sbjct: 64 GFFVGRTNDVISASKNSNGNGDKVFIQDLSLVDVPVGDVRILSLSADDSILAVTVAADIH 123
Query: 220 LFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLYQGSANGPPTHV 279
FSV SLL K PS S S +S F+KDF+W R + SYLVLS G+L+ G N PP HV
Sbjct: 124 FFSVDSLLKKDAKPSFSYSPDESGFVKDFRWRRNDKHSYLVLSNTGKLFHGIDNAPPRHV 183
Query: 280 MHDIDA-------------------------------------------------VDCIK 339
M +DA VD I+
Sbjct: 184 MDAVDAVEWSSKGSYIAVAQDNSLRIFSSKFNEKRCIALSFDSWIGDSDEDCFVKVDSIR 243
Query: 340 WVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRDIHSGFTRDILP 399
WVR +CI++GCFQ+ G EE+Y V VIRS DGKI+D S+N V LSF D+ D++P
Sbjct: 244 WVRNNCILLGCFQL-IEGREENYLVQVIRSPDGKISDGSTNLVALSFSDLFPCSMDDLVP 303
Query: 400 GDSGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQ-EVENEVAVINIDRNTSLPKIEL 459
GP LL SY+D+CKLA+ ANR ++EHIVLL + ++ V+V++IDR T LP+I L
Sbjct: 304 VGVGPHLLFSYIDQCKLAVTANRKSIDEHIVLLDWSSGDDKSAVSVVDIDRETFLPRIGL 363
Query: 460 QANGDDNLVMGLSIDRVSLPGKVLVRVGFEDTREVSPYCILVCLTLEGELIMFQFSSVNE 519
Q N DDN VMGL IDRVS+ G V VR G ++ +E+ PY +LVCLTLEG+L+MF +SV
Sbjct: 364 QENNDDNTVMGLCIDRVSIEGTVNVRSGDDELKELQPYFVLVCLTLEGKLVMFNVASVAG 423
Query: 520 TEAPHETVSACDEEEDDIIVP--ADDRSQLFSESKKEFR---EDDLKMQVTEKLAISSEI 579
A +T A + +D P DD S+ SE ++ ++D K TEK + +
Sbjct: 424 RPASSDTDLASSSDIEDAYTPLIEDDLSKQSSEKHQQLNIAVQNDQKHLNTEKFSTEQRL 483
Query: 580 PREKI------KISNDIKSSNNDQS----------------------------------- 639
P E I + + + NN +
Sbjct: 484 PNENIFSKEFESVKSSVSGDNNKKQEPYAEKPLQVEDAQQSMIPRLSGTSFGQLPMSLGY 543
Query: 640 ------------PVSK------IDESATVSAESNTKSQKADSFIYSQSLKSSVLERPNYE 699
PVS+ +S ++ ++N +S+ +F S L++++L+ P
Sbjct: 544 DTNKFAGFGPALPVSEKLQKDIFAQSNSMHLQANVESKSTAAFFGSPGLQNAILQSPQ-- 603
Query: 700 IGNFDKSVQKFGLGSVSISGKPADVHSQPFPNVKEST-KRLVSTGLLAASELSSDKAMFL 759
+ S Q + G S P D S PFP+++++ K+ V +G
Sbjct: 604 ----NTSSQPWSSGK---SVSPPDFVSGPFPSMRDTQHKQSVQSG--------------T 663
Query: 760 NKIDPVSSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLTQSGKQVTGG 819
++P S+ S Q +T G +A + S L + G
Sbjct: 664 GYVNPPMSI-KDKSVQVIET-------GRVSALSNL-----------SPLLGQNQDTNEG 723
Query: 820 AGKIESLPVIRSSQISLQ--DNLSAKISNEKHDGS--------DRNYSNAPLAKPMKEMC 879
KIE +P IR+SQ+S Q + S+++H + N SN P + EM
Sbjct: 724 VEKIEPIPSIRASQLSQQVKSSFEKSASHQQHKTPLSTGPLRLEHNMSNQP--SNINEMA 783
Query: 880 EGLDMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERAQEVQNLF 939
+D LL+SIE PGGF D+C KS+VE LE+GL SL+ +CQ WKST++E+ E+Q+L
Sbjct: 784 REMDTLLQSIEGPGGFKDSCAFILKSNVEELEQGLESLAGKCQTWKSTIHEQQAEIQHLL 843
Query: 940 DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELE 999
DK +QVL+KKTY+EG+ Q +D++YW+ W+RQKL+ ELE KRQHI+K+N+++T+QLIELE
Sbjct: 844 DKTIQVLAKKTYMEGMYKQTADNQYWQLWNRQKLNPELEAKRQHIMKLNKDLTHQLIELE 903
Query: 1000 RHFNGLELNKFGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSK 1059
R+FN LEL+++ + + R + + SR SLHSL+N M SQLAAA+ LSE LSK
Sbjct: 904 RYFNRLELDRYNEDGGHPVARRGVPNRSAPSRRVQSLHSLHNTMSSQLAAAEQLSECLSK 963
Query: 1060 QLAALNIESPSLKRQSVTKELFETIGLTYDASFGSPNVNKIAEASS-KKLLLSADSFSSK 1119
Q+ L I+SP +++V +ELFETIG+ YDASF SP+ K ASS K LLLS+ S
Sbjct: 964 QMTYLKIDSP--VKKNVKQELFETIGIPYDASFSSPDAVKAKNASSAKNLLLSSIPASIN 1023
Query: 1120 DTPRRKQQSGRKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLQ-------------- 1179
R++Q S KNS+ ET RRRR+SLDRN A+ EPPKTTVKRMLLQ
Sbjct: 1024 QQSRQRQSSAMKNSDPETARRRRESLDRNWAAFEPPKTTVKRMLLQEQQKTGMNQQTVLS 1083
Query: 1180 -GIPSSDEKLFRS--RTPEGAATVAWPASRLTSSMSSSSSKNAATPFMWASPLQPSN--- 1239
+ S++ RS + A+ V + S +S+ +TPF P+ SN
Sbjct: 1084 ERLRSANNTQDRSLLHVKDHASPVVSSNKGIMESFQQDTSEAQSTPFKTRPPMPQSNSPF 1143
Query: 1240 --TSRQKSQPS------PKTNTT------APSPL----SVFQSS---------------H 1299
+ S+PS +NTT APS + +V Q
Sbjct: 1144 TISPISASKPSFNWSGNKSSNTTSYAEESAPSQIKDTRTVSQPGGSSFLPKRPVASTVLE 1203
Query: 1300 EMLKKSNNEAFSVTSENKFIE-----------KSKASDFFS--------------VTRSD 1359
+ KK+ FS N F+E S SDF S S
Sbjct: 1204 QTEKKAGEFKFSEAKANAFVETAAGSVQRLSTTSSGSDFESSKGFGAQFSTMSSGAPASS 1263
Query: 1360 SVQKSNINLDQKSSI----FTISSKQTP---TLKDSINT-SNSDNQKTANPKERHTTTSP 1419
KS + SSI FT + P T DS +T + + ++ + S
Sbjct: 1264 FSSKSLFGFNSSSSIPGDKFTFPAVTAPLSGTPLDSTSTLFTASSAPVSSSSQDPVPASI 1323
Query: 1420 LFGSANKPESASVGTMSSLVPT----------------VNEARKTEEKRSP--------- 1479
SA P++ SV + S++ T +N+A + SP
Sbjct: 1324 PISSAPVPQTFSVTSTSTVSATGFNVPFGKPLTSVKVDLNQAAPSTPSPSPGPTAGFTFN 1383
Query: 1480 -TMISPSVPAPARLNTPSSSTLFSGFA----VSKPLPSSAAVIDLNQPVSTSTQLNFSSP 1539
+SPS P +T SS LF A VS S+ + + + + +ST L+ S+P
Sbjct: 1384 LPALSPSSPEMVSSSTGQSS-LFPPSAPTSQVSSDQASATSSLTDSSRLFSSTSLS-STP 1443
Query: 1540 VVSVSDSLFQAPKMISTSSTLSSLNPSLESSKKELPVS---KSDDDTEKQTPASKPESYE 1599
++ D+ FQ+P++ + SS + P E K E S + + A+K ++
Sbjct: 1444 PITPPDA-FQSPQVSTPSSAVPITEPVSEPKKPEAQSSSILSTQSTVDSVANATKTQNEP 1503
Query: 1600 LKFQPSVTPDKNHVEPTSKTHTVSKDVGGQVPNVI----------GDAQPQQPSVAFAPL 1659
L + ++ V P S + +S G ++ G +QPQQ S AP
Sbjct: 1504 LPVKSEISNPGTTVTPVSSSGFLSGFSSGTQSSLASMAAPSFSWPGSSQPQQLSSTPAPF 1563
Query: 1660 PSSNLTPKIFGNVRNETSNVTVTPDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPMSGAP 1696
P+S+ T + E ++ T +D+MDEEAPE + E S+ S GGFG STP GAP
Sbjct: 1564 PASSPTS---ASPFGEKKDIVDTQEDEMDEEAPEASQTTELSMGSFGGFGLGSTPNPGAP 1623
BLAST of ClCG11G012960 vs. TAIR 10
Match:
AT1G55540.2 (Nuclear pore complex protein)
HSP 1 Score: 791.6 bits (2043), Expect = 1.3e-228
Identity = 700/1885 (37.14%), Positives = 959/1885 (50.88%), Query Frame = 0
Query: 100 IPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSESFGLIFVAHLS 159
+ +E+ EG++I ND+YF++IG+P+ +K D+ +D + PPSQPLA+SE ++FVAH S
Sbjct: 4 VEIEEDTEGDRISTNDYYFERIGEPISIKEDDAQYDLENPPSQPLAISERHAVLFVAHSS 63
Query: 160 GW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDSILAAIVAGDIH 219
G+ T DVI++++ G +QDLS+VD+ +G V IL+LS DDSILA VA DIH
Sbjct: 64 GFFVGRTNDVISASKNSNGNGDKVFIQDLSLVDVPVGDVRILSLSADDSILAVTVAADIH 123
Query: 220 LFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLYQGSANGPPTHV 279
FSV SLL K PS S S +S F+KDF+W R + SYLVLS G+L+ G N PP HV
Sbjct: 124 FFSVDSLLKKDAKPSFSYSPDESGFVKDFRWRRNDKHSYLVLSNTGKLFHGIDNAPPRHV 183
Query: 280 MHDIDA-------------------------------------------------VDCIK 339
M +DA VD I+
Sbjct: 184 MDAVDAVEWSSKGSYIAVAQDNSLRIFSSKFNEKRCIALSFDSWIGDSDEDCFVKVDSIR 243
Query: 340 WVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRDIHSGFTRDILP 399
WVR +CI++GCFQ+ G EE+Y V VIRS DGKI+D S+N V LSF D+ D++P
Sbjct: 244 WVRNNCILLGCFQL-IEGREENYLVQVIRSPDGKISDGSTNLVALSFSDLFPCSMDDLVP 303
Query: 400 GDSGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQ-EVENEVAVINIDRNTSLPKIEL 459
GP LL SY+D+CKLA+ ANR ++EHIVLL + ++ V+V++IDR T LP+I L
Sbjct: 304 VGVGPHLLFSYIDQCKLAVTANRKSIDEHIVLLDWSSGDDKSAVSVVDIDRETFLPRIGL 363
Query: 460 QANGDDNLVMGLSIDRVSLPGKVLVRVGFEDTREVSPYCILVCLTLEGELIMFQFSSVNE 519
Q N DDN VMGL IDRVS+ G V VR G ++ +E+ PY +LVCLTLEG+L+MF +SV
Sbjct: 364 QENNDDNTVMGLCIDRVSIEGTVNVRSGDDELKELQPYFVLVCLTLEGKLVMFNVASVAG 423
Query: 520 TEAPHETVSACDEEEDDIIVP--ADDRSQLFSESKKEFR---EDDLKMQVTEKLAISSEI 579
A +T A + +D P DD S+ SE ++ ++D K TEK + +
Sbjct: 424 RPASSDTDLASSSDIEDAYTPLIEDDLSKQSSEKHQQLNIAVQNDQKHLNTEKFSTEQRL 483
Query: 580 PREKI------KISNDIKSSNNDQS----------------------------------- 639
P E I + + + NN +
Sbjct: 484 PNENIFSKEFESVKSSVSGDNNKKQEPYAEKPLQVEDAQQSMIPRLSGTSFGQLPMSLGY 543
Query: 640 ------------PVSK------IDESATVSAESNTKSQKADSFIYSQSLKSSVLERPNYE 699
PVS+ +S ++ ++N +S+ +F S L++++L+ P
Sbjct: 544 DTNKFAGFGPALPVSEKLQKDIFAQSNSMHLQANVESKSTAAFFGSPGLQNAILQSPQ-- 603
Query: 700 IGNFDKSVQKFGLGSVSISGKPADVHSQPFPNVKEST-KRLVSTGLLAASELSSDKAMFL 759
+ S Q + G S P D S PFP+++++ K+ V +G
Sbjct: 604 ----NTSSQPWSSGK---SVSPPDFVSGPFPSMRDTQHKQSVQSG--------------T 663
Query: 760 NKIDPVSSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLTQSGKQVTGG 819
++P S+ S Q +T G +A + S L + G
Sbjct: 664 GYVNPPMSI-KDKSVQVIET-------GRVSALSNL-----------SPLLGQNQDTNEG 723
Query: 820 AGKIESLPVIRSSQISLQ--DNLSAKISNEKHDGS--------DRNYSNAPLAKPMKEMC 879
KIE +P IR+SQ+S Q + S+++H + N SN P + EM
Sbjct: 724 VEKIEPIPSIRASQLSQQVKSSFEKSASHQQHKTPLSTGPLRLEHNMSNQP--SNINEMA 783
Query: 880 EGLDMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERAQEVQNLF 939
+D LL+SIE PGGF D+C KS+VE LE+GL SL+ +CQ WKST++E+ E+Q+L
Sbjct: 784 REMDTLLQSIEGPGGFKDSCAFILKSNVEELEQGLESLAGKCQTWKSTIHEQQAEIQHLL 843
Query: 940 DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELE 999
DK +QVL+KKTY+EG+ Q +D++YW+ W+RQKL+ ELE KRQHI+K+N+++T+QLIELE
Sbjct: 844 DKTIQVLAKKTYMEGMYKQTADNQYWQLWNRQKLNPELEAKRQHIMKLNKDLTHQLIELE 903
Query: 1000 RHFNGLELNKFGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSK 1059
R+FN LEL+++ + + R + + SR SLHSL+N M SQLAAA+ LSE LSK
Sbjct: 904 RYFNRLELDRYNEDGGHPVARRGVPNRSAPSRRVQSLHSLHNTMSSQLAAAEQLSECLSK 963
Query: 1060 QLAALNIESPSLKRQSVTKELFETIGLTYDASFGSPNVNKIAEASS-KKLLLSADSFSSK 1119
Q+ L I+SP +++V +ELFETIG+ YDASF SP+ K ASS K LLLS+ S
Sbjct: 964 QMTYLKIDSP--VKKNVKQELFETIGIPYDASFSSPDAVKAKNASSAKNLLLSSIPASIN 1023
Query: 1120 DTPRRKQQSGRKNSEAETGRRRRDSLDR---NLASVEPPKTTVKRMLLQ----------- 1179
R++Q S KNS+ ET RRRR+SLDR N A+ EPPKTTVKRMLLQ
Sbjct: 1024 QQSRQRQSSAMKNSDPETARRRRESLDRVIFNWAAFEPPKTTVKRMLLQEQQKTGMNQQT 1083
Query: 1180 ----GIPSSDEKLFRS--RTPEGAATVAWPASRLTSSMSSSSSKNAATPFMWASPLQPSN 1239
+ S++ RS + A+ V + S +S+ +TPF P+ SN
Sbjct: 1084 VLSERLRSANNTQDRSLLHVKDHASPVVSSNKGIMESFQQDTSEAQSTPFKTRPPMPQSN 1143
Query: 1240 -----TSRQKSQPS------PKTNTT------APSPL----SVFQSS------------- 1299
+ S+PS +NTT APS + +V Q
Sbjct: 1144 SPFTISPISASKPSFNWSGNKSSNTTSYAEESAPSQIKDTRTVSQPGGSSFLPKRPVAST 1203
Query: 1300 --HEMLKKSNNEAFSVTSENKFIE-----------KSKASDFFS--------------VT 1359
+ KK+ FS N F+E S SDF S
Sbjct: 1204 VLEQTEKKAGEFKFSEAKANAFVETAAGSVQRLSTTSSGSDFESSKGFGAQFSTMSSGAP 1263
Query: 1360 RSDSVQKSNINLDQKSSI----FTISSKQTP---TLKDSINT-SNSDNQKTANPKERHTT 1419
S KS + SSI FT + P T DS +T + + ++ +
Sbjct: 1264 ASSFSSKSLFGFNSSSSIPGDKFTFPAVTAPLSGTPLDSTSTLFTASSAPVSSSSQDPVP 1323
Query: 1420 TSPLFGSANKPESASVGTMSSLVPT----------------VNEARKTEEKRSP------ 1479
S SA P++ SV + S++ T +N+A + SP
Sbjct: 1324 ASIPISSAPVPQTFSVTSTSTVSATGFNVPFGKPLTSVKVDLNQAAPSTPSPSPGPTAGF 1383
Query: 1480 ----TMISPSVPAPARLNTPSSSTLFSGFA----VSKPLPSSAAVIDLNQPVSTSTQLNF 1539
+SPS P +T SS LF A VS S+ + + + + +ST L+
Sbjct: 1384 TFNLPALSPSSPEMVSSSTGQSS-LFPPSAPTSQVSSDQASATSSLTDSSRLFSSTSLS- 1443
Query: 1540 SSPVVSVSDSLFQAPKMISTSSTLSSLNPSLESSKKELPVS---KSDDDTEKQTPASKPE 1599
S+P ++ D+ FQ+P++ + SS + P E K E S + + A+K +
Sbjct: 1444 STPPITPPDA-FQSPQVSTPSSAVPITEPVSEPKKPEAQSSSILSTQSTVDSVANATKTQ 1503
Query: 1600 SYELKFQPSVTPDKNHVEPTSKTHTVSKDVGGQVPNVI----------GDAQPQQPSVAF 1659
+ L + ++ V P S + +S G ++ G +QPQQ S
Sbjct: 1504 NEPLPVKSEISNPGTTVTPVSSSGFLSGFSSGTQSSLASMAAPSFSWPGSSQPQQLSSTP 1563
Query: 1660 APLPSSNLTPKIFGNVRNETSNVTVTPDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPMS 1696
AP P+S+ T + E ++ T +D+MDEEAPE + E S+ S GGFG STP
Sbjct: 1564 APFPASSPTS---ASPFGEKKDIVDTQEDEMDEEAPEASQTTELSMGSFGGFGLGSTPNP 1623
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038892124.1 | 0.0e+00 | 87.52 | nuclear pore complex protein NUP214 isoform X2 [Benincasa hispida] | [more] |
XP_038892123.1 | 0.0e+00 | 87.26 | nuclear pore complex protein NUP214 isoform X1 [Benincasa hispida] | [more] |
XP_031741375.1 | 0.0e+00 | 81.59 | nuclear pore complex protein NUP214 isoform X2 [Cucumis sativus] | [more] |
KAA0034115.1 | 0.0e+00 | 80.62 | nuclear pore complex protein NUP214 [Cucumis melo var. makuwa] | [more] |
XP_031741374.1 | 0.0e+00 | 81.11 | nuclear pore complex protein NUP214 isoform X1 [Cucumis sativus] >KGN52214.2 hyp... | [more] |
Match Name | E-value | Identity | Description | |
F4I1T7 | 1.8e-227 | 37.14 | Nuclear pore complex protein NUP214 OS=Arabidopsis thaliana OX=3702 GN=NUP214 PE... | [more] |
Match Name | E-value | Identity | Description | |
A0A5A7SY34 | 0.0e+00 | 80.62 | Nuclear pore complex protein NUP214 OS=Cucumis melo var. makuwa OX=1194695 GN=E6... | [more] |
A0A0A0KV45 | 0.0e+00 | 79.18 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G583270 PE=4 SV=1 | [more] |
A0A1S3BDU8 | 0.0e+00 | 81.09 | LOW QUALITY PROTEIN: nuclear pore complex protein NUP214 OS=Cucumis melo OX=3656... | [more] |
A0A6J1CBF2 | 0.0e+00 | 73.05 | nuclear pore complex protein NUP214 OS=Momordica charantia OX=3673 GN=LOC1110100... | [more] |
A0A6J1HNV2 | 0.0e+00 | 69.61 | nuclear pore complex protein NUP214 isoform X3 OS=Cucurbita maxima OX=3661 GN=LO... | [more] |
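The hit tables above are pipe-delimited, so they can be read programmatically. The following is a minimal sketch, assuming each row carries the four fields Match Name, E-value, Identity and Description in that order; the names `Hit` and `parse_hit_row` and the 70% identity cutoff are illustrative choices, not part of the database or of BLAST.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    """One row of a hit summary table (hypothetical container)."""
    name: str
    evalue: float
    identity: float  # percent identity as printed in the table
    description: str


def parse_hit_row(row: str) -> Hit:
    """Split a pipe-delimited row and keep the first four fields.

    Any extra trailing cells (for example a link column) are ignored.
    """
    fields = [f.strip() for f in row.split("|")]
    name, evalue, identity, description = fields[:4]
    return Hit(name, float(evalue), float(identity), description)


# Two rows copied from the tables above (descriptions are truncated in the source).
rows = [
    "A0A6J1CBF2 | 0.0e+00 | 73.05 | nuclear pore complex protein NUP214 "
    "OS=Momordica charantia OX=3673 GN=LOC1110100...",
    "F4I1T7 | 1.8e-227 | 37.14 | Nuclear pore complex protein NUP214 "
    "OS=Arabidopsis thaliana OX=3702 GN=NUP214 PE...",
]

hits = [parse_hit_row(r) for r in rows]

# Illustrative filter: keep hits at or above an assumed 70% identity cutoff.
strong = [h.name for h in hits if h.identity >= 70.0]
print(strong)
```

Splitting on `|` is enough here because none of the listed descriptions contains a pipe character; a description that did would need a stricter parser.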