CcUC11G220380 (gene) Watermelon (PI 537277) v1

Overview
Name: CcUC11G220380
Type: gene
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: Nuclear pore complex protein NUP214
Location: CicolChr11: 24823798 .. 24860679 (+)
RNA-Seq Expression: CcUC11G220380
Synteny: CcUC11G220380
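The Location field above packs the sequence ID, start, end, and strand into one string. The following is a minimal sketch of how such a string could be parsed and the gene span computed. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Split a 'SeqID: start .. end (strand)' string into its parts.

    Hypothetical helper for illustration; not part of any database API.
    """
    pattern = r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "seqid": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        # Assumption: coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }

loc = parse_location("CicolChr11: 24823798 .. 24860679 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the span is 36,882 bp on the plus strand of CicolChr11. That figure can be checked against the length of the gene sequence listed below.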
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTTCCGTTGATTCCCGACCTTCAACCTTGATTCCATTAGAAGACGCCGGCGAAGGAGAACAAATTGTAAGGAACGGTTTCTACTTCCAAAAGATCAGCAAACCTGTTACCGTCAAGCTCTGCGACTCCATCTTTTATCCCGAAACCCCTCCCTCTCAGCCTCTTGCTCTCTCCGAGAGTTTCGGTCTCGTATTAGGTTTCCAAAACAATAACAAACCTAAATACTACTAACCTAGAGCCCATTGGGCTTTGACTTGATAGAAACATGTAAGTTTTAGAGATTTAACCTTCACAAACTTTATTTTACGAATATTCTAAAATTGGAAAATATTTCAAGATTTTGGCCTAATTTTCAAACAAATTTAACCTTAAGCCTCGTGGTCTCTCTGATTAAAATTTCAATTTTTTTATAATTTAATATATCATAAATAATCATCTAAAATTTATTTAGACCTTATAAAATTGGTTCTTTCACAATTAATGTTGTACATATTATAATTAATATTGGGATATTATTTAATGAGTATTATTTTAGTCTTTCATAATTATTGCTTCAATTTAGAAATAAAATAGGGTTACTCTTTATGTGTTTTGAATTGTATTATGCGTTTATGTAGTACTATAGAAAGTATACTTTTATTGATTTTTGTTTTATCAATATATACTATGTTATATTAATTGTTCAATCTTCTACCATGATATGCATTGTGTTTCTTCATATATATTGTGTGCTATTTTTTTTTTTTTAATATTTTCATATGTAACTAGTTTTTAATAGTACGTTTCACAAATGTAACTACATTGTCATGTATACACATGTATATATTTGAAATTGAGTTATATATTATGCATATTTTGTCATATATACACATGTTTGTTATACTAAGATAAATGATATCTTCATATTATTTGAAATAAACTCACATATATTTGAAATGGAAAACAAAATGTACAAATAAACCTCATATATACTATGTATTATAACCCATGTGACTTTCATGTGCGTGTTTTTGAAAGAAAAAAAAAATTAATGTGCGTAAATATGATATATTTTTTAAAGATTTGATTTTGAAAATAAAACGGTTTTGAAAAAAAAAATCCATTTAATGCATGTGAATCTGATATATTTTCTTTTCCATATGTGTCTTTTCTTATTTCTATTTTTTCTTATTATTATTTTTCAAAATTTCTAAAATTTGTGGAGGATGATATGGACATTTGAGCCATATTTTTATGTAAAATATTTCATTTCATTAAATTATTCATAAAAAAAACAAAAATAAGAATAAGTTTAAAACAATTTTGTAATTAAACTTATTTAATTAGGTTATATGTGTAGAGAGCTCAAGACATATAGCCTAAACTAAATCAACATACTATGAAATACTATTACATTAAATAGTATTTACTCTAACACTGTAGACATGAGAAGGGATGTGGGTTTGTTCTGAAAGAGGAGATGGTAATGGAGCTTGATTTCAATGGGGTTTGATTGTGAGCTTTGGCAATGGCGATGGATGGAAGAGATGAGATGCAAACCAGTTGTGAACACATTTCCCTCTCTTTCTTCTATTTTGGAATTAGGATTCTCGAGAGTACTTCATTTAATTTAAATGTTTTCGAGGGTTAGGGTTGCTCATTCAAACTACAAGAATTTAATCGTTTTTAAAAACATAATCAAACTTAATAAAAAATTTAGTGAATTCCATCCACTTTGAAGAAATTTTGAAATTGAATTATTGAGTTTCAATTTCGAAACCATAGTGAACCCAACTTTTTTTTTTATCTATATAAATATCACTAATTTAAATACTTTATTTGTTAAATACAAATATGTTAAAGAAAAATTGCAAGAGTCATCTTTGACTTTTGATTTAATCCATCATTTCCCTAAACTATATGGTTTGCTACATCCATCAACTATTAATTCGATCATCATTTCCCTAAACTATATGGTTTGCTACACCCATCAACTATTAATTCGATCAATTAGCCTC
TGTATTTTTTACTTGTTGCAATACATATTAATATTGATATTGATATATTTTCAAGATTAAAATTTTATTGGGGTGCAGAGTTGTGTTGAGTTGAATTGAGTTAGAAAATCTATGTTTAGGTGTGTTTGGGTGCCGATTTCAGTTAAATTGGAGTATATGTTACAATTTTTTTAAACAAATTGATTCTATATATATATATATGTTTTTTTTCTTTTTTTTTCTTTTCCTTTTTTATCTCCAAGTATGCCCATTTGAATAAAAATTTAACACTAGAAAATGATTAACTTTTTTCTATTCTTAAAACATAACAAATTCGTACAAAGTAAAAATAAACTATTGGTCCCTAAACTTTCAAAGGTAACGATTTAGTCTGTGAAGTTATTAACTAGTACCAATTTGGTCCAACTTTCTTTAGTTTACCGATAGATTTATGAAATGGCGCCATGTGTCATCTTAATTATAAAAAATAAAAGCATTGATGTATTAGAGAAACCAAAGAGGAATTTTTTTCTTTCTTCTTTTTTTTTTCTTTTTTTTTTTTGTGATTCCCCCTCTATCTCTTTCTTCTTCTTCTTCTTCTCTTGCTTTTTCTTCGTTTCCTCCTTTTCTTTATTCAACCTTTTTTTTTTAAAAAAAAAAATAAATTCTCCTGCATCTTCTTGTTCTCCGACTTCTTCTCTAACTTTTTCTTCTTCTCCAACATGAAGAGTCAAAACTCCTACACGACCAACCCGATGAGCGAGCGAGAGAGAGTGAGAGCGAGGGTGTGAGAGAGAGAAAGAGATAGGGTGATTGTGAGTGACGAAAGGAAGCGAGAGCGAGAGGGATAGAATGATTGTGAGTGAAAGTGAGAATGAGAGCGAAAGAGAGTGAGAGTGAGAGCGAAAGTGTGAGAAAGAGCGAGAGAGATAGAGTGATTGTGAGGAAAAGAAGCGAGAGTGAGAGTTAGAGATGCATGAGGGAGAATAATTATGGGGGTTGGGGATAATTAAAACGCCTGGAAAATTGTGTCGATAAAATGGTGGGTAAAACAACGGGTTATCGTTTTTCTTTTATTGTCAAACAAAGGTGGGTTTCATTTTAAAAACCCACTCAACCCGTTGGGTTATCAAACAACCCCAAAGTTTAAACTTTGTGAAGCCATTTAAAAACAAAGATGCTTTAACATTTTGCTCAATTATTGACAAAAAGAACATGAAAAAGCATATACAAAAGAGTAATCATTTAGCTAAAATGGGGTTGTAGAAACAAAAATATGACCAAAACAAAATTATTGTTCGCGTACAAAAAAAAAAGTACGCAATCTCTAACCTTAGGTGCGTGATTGTTAAGTTCTTGCTTTGCAATTTTGTTTTGGTCATAACTCCCTCGATAAAATTTAGTTTTAGCTAAATAACCACTTGTTTTCACATGGTTTTTTGTGTTATTTATAACAATGCTTATAAAAAGAGCAAAATTTAAAAGAATATTGAATATAAATGACCTGAAAACATTTAAACGTAGATAAAATTATATTTTGGAGTATTAAAGTTAAAGATTGTTTAAGTTTTTGTTCAGTTTACAAGCATGGATGGAAATAATGTAAAAAAAATTATGAAAACGAGTGACCATTTAGCTAAATGGAAGTTGTATTGATGGAGATATGACCAAAACAAAATTGTCAAGGGAGAACTAAGCGAATGCGTACATGGATGTCAATGATTGCACTTTTTTTTTTGCCCAAACGTAATCTTGTTTTGGTTGTATATTTGTTGATGCAATTTCGTTTTAACTAAATAACTACTCGTTTTCATATGTTTTTTTTTATGTTCTTTTCAAACATGTTTATAAATTAAGCTAAATGTTAAAACATATTTTCTTTTAAATGACTTCATAGAGTTTAAACTTGATCTCACAGAGTTTAAACTTGATCATTTTTGCTATGTGATGTTATGGTATCATAAGAAGAAACACTAATGGAAGAAACTTAGAAATAAAAAAAGGGAGAGAAAAAAAATTTG
TTTAAAACTGACGACAAAATGAATATTTCACATTTTTAAAAAGTAAAAAGACAGAAAAAAAAAAAAAAGAAAAGAAAAAGGAAAAAGAAGAAGATCAATTACATATAAGTTTTTGGATTTTGAGTTAGTATGGGGAATTTTCCACAAACAAATGACAATATTTTGAATATAGTTTGAAAACAACTTTATGCCTATTATTTTTAAACCTACAATATTTATCTTCTTAACCCGCCCGAAGGCTGTTGGAGTGCGATGCTCCTAAAACCCCAGGCGGAACAACATTGGTTGCTTCCAAAAACCTTCTGCAGAGCTTCTCTCTCGCTTCAGAGAGAAGCCTTTTGCAATTCATGGCTTCCGTTGATTCCCGACCTTCAACCTTGATTCCATTAGAAGACGCCGGCGAAGGAGAACAAATTGTAAGGAACGATTTCTACTTCCAGAAGATCGGCAAACCTGTCCCGGTCAAGCTCTGCGACTCCATTTTTGATCCCCAGACCCCTCCCTCTCAGCCTCTTGCTCTCTCCGAGAGTTTCGGTCTCATCTTCGTTGCGCATTTGTCTGGTTGGTAATTTCAATTACTTCCCCATTGTTGTGAATACCATTGAATTTTCTATGATTTTATTTGATATTGAAAAAAATTTTGTTTCCTAAGGGTTTTTTGTGGTGAGGACCAAGGATGTAATTGCTTCGGCCGAGGAGATAAAAAACGGGGGAACTGGTTCTTCTGTCCAGGATTTAAGCATAGTGGATATTTCCATCGGAAAAGTTCACATTCTAACTCTTTCCACGGATGATTCCATTCTTGCTGCCATCGTAGCTGGTGATATTCATCTTTTTTCAGTCCAGTCCCTGCTTGATAAGGTAGTGCTTTTCGCTGGAGCTTGAATATCATAGTATAGTTTCCCTGAAACGTCAATTCGGATATTTGCATCAACGGGTAAAATGTCGCAATCATTAGTTATATTTATTGCACGCCATTCCTTATATGTCATTACTGTTATGCAGGCAAAAACACCCTCTTCTTCTTGTTCATTAACTGATTCCAGTTTCATCAAAGACTTCAAATGGACCAGAAAGTTGGAAGATTCTTATCTGGTTCTTTCAAAGCATGGACAGTTATATCAAGGATCGGCGAATGGGCCTCCTACACATGTGATGCACGATATTGATGCTGGTAGGCTACATACTTTTGTGTAGTAACTTGTGTATAATTCTTATTGACAATTTTAAGTGACGTTTGTTAAAGTTCTTACATTTTTTTTTTAAAAATTACTGTTTACTTCAAGAAAATGGATCTTTTTGCTACATATCTGCATGGGGAGATGCTGTGTTTCCCTGTCTAATTCGCCATTTCTGAGTGCCCCTTCATTAAACTTAAACGATCTCTCTAAAACCTTTCTAGTCAATTCTTCCAGTTACTACATCTATTGTGAAGACCTCTCTTTTCCTTGTATATTTCATTTAATCATTTAAACTTTGTTTCTTATAAAAGGACCCTTTGTTCTATTCATGAGTGTAAGAACGGTGTCTTGGAATGGAAATTTGAAAACAAGGAATCGCTACAACTTAGACCTGGTTGTTATTGGCATTAGGCCTTAGAGAAACAGCAAAGGGCCATTATTGCTGAGACTCAATTGAGGGCTTAGAAATTTTAGGAAGGATGACGAGGGAACACTGCTCATTAGCACAATAATTAGGGTAGCTAGGCCAGAGATAGGAATGTGATTGAAGATAGGATAGTCATAATTGTTGGGCTCTGTTAATTTAACTATTTATCTTAGAGATCTTTATTTGTTCAGGAATTGAAACTCATTTTTATTTTTTGATAGTGAACAATTTTAATATTGCATGTTTTCCCCCCCCTCTTGCATGTACCATGCTCCATGCCAGTGTTTTTGGCATATATTTTCTTCTTCTTTTGTATTCCCTTGCCCCTAATTTTCTGCCATTTTCCCCCCAAAGCGATCTATTAGCATGGATTATTGTCAGCAAAACTTTCT
ATATGTGGAGCTACTGTTTTTGAAATTCTTTTAAGTCTTATTTTGGATTTTAGTGGAATGCAGTGTGAAAGGGAAATTCATTGCTGTGGCTAAAAAGGATACTCTTACCATTTTCTCACACAAATTCAAAGAACGACTATCCATGTCACTCTTGCTGAGTTCAGGGAATGGTGAAACTGATACGGACTTTACAGTGAAGGGTTCTCTCTCTCTCTCTCTCTCTCTCTCTTCTGTGTGTGTGTAAAAATGATATTTTGCAGAATGTTGAAACATGAAGGTTTAATAATTTTCGTTATTCATTTTTCTATTTGGTTAGGGAGGGGATTTTTCTTGTATTTTACTTTTGTGATGTATCATGAGGTGTCAATCAGGTTGGCTATGAGTTGAGTTTGAGTTGTTAGCTTGGAGTGAAGTACAGTTGGTTTCACCATAGTAATCCCAATAGATACTTTTGCACATTTTAACCCTATGTAGTTTGCTTCCTATGGCAGATAGATTTGATTAGCAATGGTTTTTGTGGTTGAAGAAAATTCTAGATTTGGGTAATTATTATTTTTATATGGGTTATTTGCCTGGAGAGAAATCGTGGAACTTTTGTATCTATTGCTAATTCATACCATGCAAGAGATTGAATACTAGCAAGCATATAAAATCTAACACAAAGGAGCCAAAATAAATCTCTAATTGCTGGGTTCTTGGCTAGTTTGGGAATTTGCCTCTTCCTTCCCCTTCTTTAATTCTTTGTCATTACTATATTATTCCTTTTCCAGAATAAAAGCAATAAAATAACAATTAAATGAGGATTACAAGAGAACAAGGAATTCTAAATTCTCATTAATTCTAAATAATCCAGAAAACTTAGTATAATTAAAAAAAATGTCTGAAAACTGAAATAAATTGGAAACAATAGTACTACGTACCATACAGAACTGCTATTTTTGATCTCAATTACAAGCTTACGTTGGTGATTCTAGAAAGTACTTATGGTTACTATCTTTTAACTTGATTTGTGGTGATTGCTCAGTTGACTGTATCAAGTGGGTTCGTGCTGATTGTATCATCATAGGATGCTTTCAAGTGACTGCAACAGGTGATGAAGAAGATTACTTTGTCCTAGTTATCAGAAGTAAAGATGGAAAAATCACTGACGTGAGTAGTTGTGAATTTCTTCCTTTTCCCCCTCGCATTCCATTTCTATTTTCACGATGATGGTTGTGTTCTTTATCATGCTTGTACTTATAGATGAGGGTGATGATAATGTTGATGACGAGGGTGATGATAAGGGTTGCTTTTGACTTGAGAGGAATGAGAGAAACTTTAGAGGCTTTAAGAGGTATTGGAAGGAGGTGTAGACTCTTCCAAAATTATTGCCTCTCTGCAGATATTTGTAACTTAAAGGACCTTTGTAATTATCAATTATCATCCTCTAGGCCTTGTTCTTTTGGACAGTTCTTTTTCTTGTTAGGTCCATTCTTGTTTGACACAGTTTTTTGGTTGTTTTTTACAAACCCTTCTGTGTTCTTTCATCTATCTCTTATTGAAAGCTCGGTTTCTTGATGAAAAGACAACTTTGTACATGGGCTTTATTGATGTTTTGAATTCTATTGTGCAAATTATTTATGAGGCATTAGTTTGTTTATCAATTGCCTTGAGTATATCTCTTTGGCAAATTATTGTGGGATATTTGGGGGGAGAGAAACAACAAAGTGTGTAGAGGTATGGAGATGTTTGGTCCTTGACGAGGTTCCGTGTTTCTCTTTGGGATTTGGTTTCGAAGATGTTTTGTAATTATTCTCATCTTGCTTAGTTGGAAACCCTTTCTTTAGAAGGGTTTTTGGGGATTGGTTTTTTGTATGCTCTTGTATTCTTTTCATTTTTTCTCAATGAAATCAATTATTTCTATTAAAAAAATCTCTTTGAGAAATACAAACATTTAGAGGGGAAATAAATCCTTGGACGTGCAATGCAAATATTGAGCCTTAGAAGATTTGTCTTTAAAA
ATAAACCATTGTTATTGATGTGGTAAAAAAAAAATGGTGCACTCTCTCTCCCTTACCTTTTGCTCAACATAGTGCTCGTCAGAACCCCCCAATTCTCTGTTAGTCACAATTTAGACTCCTTTGCTTCAACTCATTATTTGTCATAGGTATTTTTTTCTGAAAAAAAAAAACAAGATATCTTAGATTTGATGAAAAGAGACTAATAATGTAAGGTCAGGTGGGTTGTCCCATGAGATTGGTCAAGGTGCGCGTAAGCTGGTCCAAACACTCACAGATATTTAAAAAAAAAAAGAAAAGAGAATAATGCTCAAAATACAATGACAAAACAAAAAGACGAAGAGACAATTCCATCTACCATAATACATGAGAACTATGCAGAAAAAAATTTAAACATAAATCTTGACTAGAGTATCGAGCAAAAGACTTGGAGAGAGAACTAGGAGGCGAGGGAGTTATCTGAGCAAACTCAAACCTATCAAACCAAGGAGAGGACTTGTCTTGAAAATACGCTGATTTCTTTCAAACCAAATCTTCATGAGAATAGCTTTAACCACATTTGACCATAACAAGACATGGGTATCACAACAACCTTTAGAAAGATTGATCACATAATATTTACTTGGGGTTTTTGACCAAAGAAAGAAAGGAAGATGTTGCAAGACTATGAAAGACATTTGGAAAGGCTCTTTGACATCTTCGTTGGAGAAGCAATTGTTGTGCAGTTATTTGACGCATTTGAGGATCTCCTGTTTGCTGCCTACTCCTATAAGTTATCTTGAAGAACAAACGACAAAGATTGCTCAATATGGATTCAAAAAATCACTAACAAGAGAGGTACTTTCATCGAAATCACTAAGGGGTGTTTGGGCCAAGGAGTTGGAAAGTGTGGAGTTGTGAACTCCACTACTTGTTCTGCTCAGAGTTTGTAGGTCCCACTACTAGAACTCAAGGACACTGTTGTACAACCTATAGAACAGTTCCTGTTCAAGTAAGGTCGTTTGTGGGTCTCACTACTAAAAAGCATCAATTTTATGTCTTATTAACTCCTTATATCGTGGGCCCTAGGAGTTCACAACTCCCTACACTTCATAACTCCTTGGACTTCACAAGTCCATTCCTTGCCCCAAACGCCCCCTAAAGTACAAAGCAGGGGAAGAAATCTAATTATCCCTTTTGCTAATAAAACAATGGTTGGAAAGTTTTCAGAGAGCTCTTATCTGATTTCTTAATTGAACCACAAAAGGGAGAGGAAGCTTTATCGAAGAAAACTAAAACATAAAGGAAGTCCGTCCTTTGTAGAGGTAGTGAAGAAAAGTAATCAACCTCAAGATAGAAGCGGGGACCTTTTGAGGTGTGATGACTTGTTTGTCCCAATGATGTAGAGAGAAAGGAGTTAAAGGGCATTTCAATGGAACAAGGTACTGGTTATTACAAGAAGAGTCTTCCATGATGATTGGGTAAAAATTATCTTCACATTGTTTGTTTTTGAAAGGGGCTGTTTGGATCTGTTTTATTGACTATCTTTTGCGTTTTCTTGCTGCTTTTGTTTGGAGGCCCTCCACTTTTTTGCTTTTATCTTTCTTTTTTGGCTGTACATCTTTTATTTTGGTGCTATCTTCCCTCACGTTTCGGCTTCATCTGTATTTCTGATGTGTCTCCTTATTGTACTCATTGAATTATTTCATTCATCAATGAAATGTTTCTTATAAATTTTTTTTTTTTTTGGAGTCCCTCGAAAGATAACTAGAAGAAGTCTTTGCCAATAGACCTTTTCCACCTAGATAAATGCTTCTTTTGTTTCCATTCATGGATTGTGCAAAGTTGTGACCAATGAATGTGGGGTGGGTCACTTTTGGTGCTTTCACCATAAATAGGAGAGATAGGACGCTTAGGAACACTGAAAAACTTATGCTATCCGTTCATGCTTAGAAGTGGCGATCAAAGTACAAGGAAACTATTGTAGATGTATTCCATTGGACACGAGAATATTGGATGGAGAT
CAAGCCATCAATTCTTATGTTTGTACTTTTCAAGACAGTGCTTTACTTGCTGGCAGAAAAGTGGAGTTTTACCATGGTTTCTCATCGACTCCAACTGAAATTTTTTTATGGTTCTGGTGGTAGGTTGAGTCTCAACCTGATAGACTTGTCTCAGTTGCACGATGGAATTAGTCTCCCCACGATAAATCTTTTTACCATGAACTTCCATTTCATAGGAAAAGGATACGAGCTTTCATGTTCCAGAAGGGCATCTTGTCAAAGAGTTAGATTTAAGGAGCCATCAATTAATGACTCATATGTATTCCAATTTTTGTCTATGACCATAGAGCCCTTAACTCTTTGCCCCCTTCTTTGAATCAGATTTTATGCCATGCAACCATTTCCTTTTTTAAATGTCAAACAAGACGAAAAGGTTGTTTAAACATCTCTTGAAGCTGCTTTCCCAAATATTTTCTCTATCATTTAAATAGGAAGCTTTGGTGGCAGAATGTTGGGATGCAACTTAGGGAACTTGGGCTTTGGGAATTAGACGAAGGTTGTTTGATTGCGAGATGTAGGCCTAGGCTGAGTAGTTGGAGGGCTTTCGGTTGGGTCAAAGAAAGGAAGGAATTAGATGGTCCTTTTATGCCTCAAGCTCTTCCTCCACCCTAAGAAGTTGGAGGGCTTTCGATTGTGTCAAGGAGAGGATGGAATTTAGGTGGTCCTTGGATGCCTCAAGCTTTTTCTCCACCAAATCCTTGTATTTTGAATTTGATTGGCTCGCCATATTTTAACATGCACTATGATAAAGGTGCTTTGGGATTTCAAAATTTCAAAAGGTGAAGGTTTTCCTTTGGTCCCTAGCACATAGGAGCCTAAATACTCAGGAGAGAATGCAACCACAAAGTGTCCTCGTACCACCTTCACACATCTATTTGCCATTTTTGTTTGGCCTGTGGTGGAGTTTGGAGTCCCTAGACCACACCTTCTTAGACGGCCTTTTGCTAGACAAAATTGGGATGTCTTTTTGGCCTTTTTGACTTGCATGCTTGTCTTCCCAAGTGGGTGGATGGGTGGTTACCTGAATCCCTCAACAGTTGGAGCTTGAAAGGAAAGTGTTAAATCAAAAGCTAAAGTTGATGGGTGTAGGTAAATCTAATATTATATCATTTAACGCTCCCCTCACTTGTCGGCGTGGAATATGTAGAAGACTCAATAAATGGAAATCAATGTTATGGGGAGAAAATAACATTGTAGGGGTTTGAACATATGATCTCCTGACCACCTTCTTCGATACCATGTCTAATCACCAATTGACCCAAAAGCTTAAGCCAATGAGCGAAGGTAAATTTAATATTATATCATCTAATAGAAAGGCTTTATATGAAGATTTGCTTTTTGATCCCTCTTATGGGGTTTATGGCTTGAATGTAACAAGTAAATTTTGAAGATAAGCCCACTTTTGCCTCTTCTTTTTCCCCTCTTTGGGGTTACTATATTTTGAGCATTAGTCTTTTTTCATTATTTCGATGAACAGTCTCATTTCTTTAAAAGAAAGATAAGTCCACTTTTTCTACTTTTTGCGAATTTGTACAACTAGATCCTTCATGATTGGAAGGCTTTTTCTCCGTAGCTTCTTGAGTGGGGAGCCCTCTTTCCCCAGCCTTTACGTCGTTTTGTTCCTTTCTTTCTTTATTTCTTTTGTTACAATCAAACCAAGGAAGGGATTTATCATGAAATATCCTTTGATTCCTGTCAAACCAAATTTTGGAAAGGAGAGCTTTGATTGCATTGATCCTAGTTGAGCTTTATTAGATAGACCATGTGCAGTCTTATTGGTAGAAAAGGAGCAGTTTAGACTGAAACTAGAGTGGATTGGCAGATGGTTGTATGAGGGGAACTTTAGTAGTCTTTTATTCAAGGGAATGATTATTAAGGTTGCGACTAAACTCCTTCATGGATTAAGAAATTGGAAAGATCAGGCGTGGATTTTATTTATTTAATTTTTTTGTCTTGACAAA
ATCTTATATTTATCTGTGAATCATGTATTATGTTATTATTATTAGTGCCCTAGAATCAAGGGCAATCATATTGGATTAATTAAGCCCTCACCTTCCTTTTCTTGGTTGTAAATTTGTTTTTATTATTCTTGATCTTCTGAATGAGAATGATGTAACTTTATCTGAAATCGACAATGAAAGTTGAGAATTATCACAGTATCATAAACTGATCAACATGATGTAGTTATTATGTTATTTATTTATTTTTTATGTTTAGTAGATTAATTTGTGTAACCCTCTTTTGTAGGTTTCTTCAAACAAAGTTTTGTTATCATTCCGTGATATACATTCAGGTTTCACTCGTGACATTTTGCCTGGTGATATTGGGCCTTGTTTACTGTTGAGTTATTTGGATAAATGGTATGCAGAAATGACTTGATCTGATTTAAGCTCTGAATGTTTGAAATTGATTTTTTCTTGAAATTCCTTATTTTTTTACTACTTTTTTTTCATTTAATAAACTTTTGGTTATCACTTCTTTGAGTTTTTTTTCTTCTTTCTGTTACCGTGGGATTATTTTCTAATTATGAAACTACTTGTCAAGTTTATTTATTTTCTCTTTCTTGATTTGGATGCAGCAAGCTCGCAATTGTTGCAAATAGGCTCTATATGGAAGAGCATATTGTGTTGCTTGGTTTGTTGCAAGAGGTTGAGAACGAAGTTGCAGTTATTAATATTGATAGAAATACCTCTCTCCCGAAGATTGAGCTTCAAGGTTAGTGATTTTTTGATTATTGTTATGAAAGAATTCTGAAACTTTATTAAAAAATAATGCTGATTTCTGAATAGTGAATATAACCTAAAGTTAGCTATATAAAGCATACAAGTCAGCCTAACAGCTTGTATTTAGCTCAAAAAATAACTAATCTAGCTGACCTATTCGTTTAGAATGAAATATAACTGGCTCATCAGGAGAAGAGTATCTTTTTATAGCTTGCTGAGTGTGTGCTACCTTATGGGTTTTTGAGGGGAGTAGAATAGTAGGGGTGTTTAGAAGGGTGGAGAGGAACCCTAGTGAGGTTTGATCTCTTGTTCACTTTCATGTTTCTTCGTGGGATTCGATTTTGAAGACCTTTTGTAATTATTCTATAGGCGTTGTTTTCCATGATCTTCCTCTCCTTAATGGTGAATACATGTGGTCCTGGGATAGACATCCCCATTCACTCATTCATTGACAGATTCCTCATCACTAAAGGCTGCGTAGATAAATATGGGGAAGGCTACAGAGAGTTACATCTGACCACTTCCCTATCCAACTAACTCTTGATAAGCAGAAATGTGGTCCTACTTATTTCAAATTCCACAATGATTGGATGGAACACTAGGGTGCCTCAGACACGATTTTATTTAGAAACCACCAGGAATTAAAAAAGTGGTTAAAGAATGTGGTATAGAACCCAATTCTTGGAGCAATCTTGAAGAGAGAATTTTATTGAAGTGAAATAATGAAGACAAGCTCCCCTTAAATAGACCCAAGGGGAAAGGGTTGGTTCACCCCTAAACTAACTAGTTAACTAATCTAATCATAATCTTCCCTTAAATAGACTCAAGGGGAAAAGGTTGGTTCACACCTATTCTAAAATTATTAAAATAAAAATACAATTGATTAAATCAAAGGGAATTTCCCAAAATACCCTACATCAAAATTTTCAGGTGTAAGGGAGAACTACATTTGTTTGATCCAAAAAAAGGAAGACGCCACCCTAGTGAAGGATTTCAGACCAATCAACCTTACCACTTCAGTTTATAAGATTGTAGCTAAGGTTTAGCGGAAAGAATGAAGAAAGTTATGCCAAGAATAATTGCTCCTACTCAGAGTGCTTTTATTGGGGGAAGACAAATTCTTGATCAGGTCCTCATAGCTAATGAAGTGGCCGAGGAATATAGAATAAAAAAGAAGAAAGGTTGGTTGTTAAAGCTTGACCTTGAAAAAGCTTTTGGCCATGTAAACTTTCTTGAAA
AAGTCCTAGTTGGAAAGAATTTCGACCCTAGATGGATCTATTGGATTATGGGATGTGTGTCGAACCCAAAGTTCTCAATATTTATCAATAGAAGAGCAAGGGGAAGAATACAAGCCTCTAGAGGCATTAGGCAAGGGGATCCTCTCTCACCCTTCCACTTTCTACTTGTTAGTGAGGTACTTAGTGGTTCATTATCAAGGCTACATGATAAGGGCAAATATGAGGAATTTATTGTTGGAAAGGATGCTGTCCATGTTTCTTTGCTACAATTTGCAGATGATACTTTGTTATTTTGCAAATATGACGACGATATGATAGAAAATTTAAGAAAGACCATAGAACTTTTTTAGTGTTGTTCGGGGCAAAAAGTTTATTGGGAGAAATCAGCACTTTGTGGGATAAATATCGAAGATAGCAAGTTGATGTCAGTGGCAGCAAAACTCAACTGTAAAGTTGACTACCTCCCTATCATGTACCTCGGTTTACCTCTAGGAGGATACCCCAAAAAAGAAGCTTTCTGGCAGCCGATCATTGGGAAATTTCAAGATAAATTAGGTAAGTGGAAGAGATACAACTTGTCAAGGGGTGGCGTGTTACTCATTGCAAATCAGTCCTTTCACACCTCCCCACCTATTATATGTCCATCTTCTTAATGCCGGAGAAGTTGATCTCAACCATTGAACGCACAATGAGGAACTTCTTTTGGGAGGGACACAAAGGAGGTAAGTTGAATCACTTAGTGAAATGGGAAGTGACTACTAGAACCCAATCTGAGGGTGGCCTTGGAATCAGTGGCTTGAAATCGAAGAATATTGCTCTCTTGGCTAAATGGGGCTGGCGGTTTATGAAGGAAGAAGACTCCCTTTGGTGTCAAGTAGTGCGAAGCATTCATGGAAGAAGCCTGTTCGGTTGGCACACAAGTGAATAGGTCAAGAACAGTCTTCGTAGCCCATGGAATAGCATCTTAAGATCTTGGTTAAAAGTAGAAGCTTTGGCCATCTACATATCAAGTCTTCGTAGCCCATGGAATAACATCTCAAGGTCTTGGTTAAAAGTAGAAGCTTTGGCTGTCTACATATCAAGTCTCTTAGTGATCTTTGAGGGAGCATGGCATAAGGAAAAATGATAAGTAGGCATGTTGGAGAGAGTAGCTTGAATAAGCGTATGCCAAGTGTCTTGAGCATGTTTGTCCAGGGAGTGTTTGCCTATGGCCGCATGCCCAAACCACCCTCAACCACCTTGTATTTATGTTTTGCAAGTAGCTTAGGAAGTTGTTTTCATTTTCATGTTGTTTTCCTTTCGAGTGACGTTTCGCCAGTTTTGATATGCAAGGCTGGCAGACTAGTTTTTGCTTAGAATGTACTATAGGGGACCCCTACTAGTTGACTCCCCGACCAAGTATTGTTGTACTCTTTGCAAACAAGCTTTCTTGCAATTGTCAGTCCTCAATGCTTTATTCAATGATGTTTTCAAAGCTTTTCCTCTCTCGTTTTATAAAACCATTTTCTCTCAAAACGTGAGTAGAGGCTACATCATACCCCGAGTTTACCCGCGACCCACGGTTTTTGTACGGCTGACTTGCTCGCCGGGTTAGAGCAGTGCCGCACAATCGCTGTCTCGTATTCTAAAGCGAGCGATTGTGACAAGTGGTATCACAGCCTAGTTAGCTCCTTGGCTATCAAGTACCGCTATCATGTCGACTGTAAAGCAACTGAGCAAGTCGCAGTCTGAGCGACTTGCTGAGATAGAAGAGCAATGGCTATATCTAAGGGAAGTTCCCGACGATGTGAGATTCATCCAAGCCCGACTGGAAGGGTTAGATGAGAAGGTCAGAGAAATTAATGTGTTAAACGCTTGAGTAGACGCGCTACCGATCACTGAGTTGACGCTAAGGGTCGACTCTTTGGAACACAAAACTACACGTCCTGGTAGTTTCGAACGTGGAGATAGCTCCACAAGCTCTGTCGCACACATGGAAGAGCGTGTCGAAGAGCTCGATA
GCTCCCACAAGGTCATGTTAAAGCTGTTTAATGACTTGACCGACGATTTCAGAGTAACAGTGGAAGCCATCAGGGCTGAGATGACCGAGATGAAGACTCAAGTCAACTTAACGATGCGAGCTGTGGGGAACCAAACCCCGAATCAAACGCTTGCTATGCCCAACAAATTCAAGATTCCGGAACCCAAGGCCTTTAGTGGGAACCGCGACACTAAAGAATTGGAGAACTTCATCTTTGACATGGATCAGTATTTTAAAAGCGAGTGGAACCGTGTCAGAGGAAGTCAAGGTCACGTTAGCCTCCATGCATATCTCCGACGATGCAAAATTGTGGTGGAGATCCAAAGTTAATGATGTCGAAGATGGTCGATGCACCATCGATACGTGGGGAGATTTTAAGAAAGAGTTGAGGGCTCAGTTCTTCCCCGAAAACGTGGAGTTCATAGCGAGGAGGAAGCTAAGGGAACTCAAACACACGAGAACCATCCGTGAGTACGTGAAACAGTTCTCCGCTGTTATGTTAGACATCAGAGATATGTCCGAGAAGGATAAGGTGTTCTGTTTTGTCGAGGGCTTAAAGCCATGGGCCAGGACTAAACTCTACGAGCAGAAAGTACAAGACCTGGCCTCCGCCCTGGCTGCTTCCGAGAGACTGTTAGATTACAGTGGTGATCAGACGCCACAAAAGAAGAACACGGCGCCCCCGAATACTGGGTACAAGGCCACAAAATCCAACCCTCCGAAGAGCTTCAATTCGGAAAGGAAGTCGCAAGCCCCAGGAACAGGCCCCTCCCGGAGGCCCTATCCAGCCGGCCAGACCAACCCAGACCCATCTCTTGCTTCTTATGCAAAGGCCCCCACCGAGTAGTCGAGTGTCCGCATCGGGGCGCCTTGACGGCCTTACAAGCCTCAGTTCAAAGTTGTAACGAACCCGAGACGGAAGCGGAGGCAGAAAGAGAAGAAGATGAGGAAACCCCTAGAATGGGAGCATTGAAGTTCTTGTTTGCAATTCAGAAGAAAACTGCCCGCCAGAAAGATACAGTAGAAAAGGGGCTTATGTTTGTTAACGCGAGCATCAACTCCAAACTTGCTAGAGGCATCCTGGTAGACTCTGGTGCAACACACAACTTCATTTCTGAATAGGAAGCCCGTCGCTTAGAACTCAAAATTGAGAAAGACACGGGTAAGATGAAAGCCGTTAATTCGGAAGCCCTGCCAATCGTCGGGGTGTCCAAGAGGGTACCTTTACAGCTAGCTGGGTGGATTGGGAATGTCGATCTGGTCTTGGTACGCATGGACGACTTTGACGTCGTGCTCGGAATGGAGTTCCTCTTGGAACACAAAGTCATCCCTATGCCCTTGGCCAAGTGCCTAATTGTCACCAGCAGTAACCCCACAGTGGTCTTGGCAGATGTAAAGCAGCCTAGTGGAGTAAGAATGATTTCTACCCTGCAATTGAAGACAGGTCTCGGTCGAGAGGAACCCACCTTTATGGCTATCCCGGTGGTCGACGAGATCACCGGGATCGAGTTCGTCCCTCCAGAAATCCAAGAGATCCTGAACGAGTATGTTGATGTTATGCCACAAAGTTTACCAAAGTCTTTACCTCCGCGGCGTGGGATTGATCACGAGATTGAACTTGTCCCTGGAGCCAAACATCCAGCGAAGAACGCATACAGGATGGCCCCCCCGAACTAGCTGAGCTCAGGAAGCAGTTGGATGAGTTGCTGAATGCGGGATTCATTCGCCCTGCAAAAGCACCATATGGGGCCCCAGTGCTGTTCCAGAAGAAGAAGGATGGAACACTACGACTGTGTATCGACTATAGGGCTCTAAACAAAGTAACAGTTCGAAACAAGTACCCCTTACCGATCATCACTGATCTGTTTGACCAGTTGAACGGAGCCCGATACTTTATGAAACTTGACCTCAGGTCAGGGTACTACCAAGTTCGCATCGCCCAGGGGGATGAACCGAAGACCACCTGTGTGACTAGATATG
GGGCCTTCGAATTCCTCGTGATGCCCTTTGGGCTCACCAATGCGCCAGCCACGTTTTGTACACTGATGAACCAAGTGTTCCACGAATATCTCGATCAGTTTGTAGTGGTCTACCTCGACGACATTGTTGTGTATAGCCCCGCTTTGGAAGAGCACAAGGTTCACCTCCGATTAGTCTTTGATAAACTACGGCAGAATCAACTCTATGTGAAGAAAGAGAAGTGTGCTTTCGCCCAAAAGCGTATTAACTTCTTGGGCCAAGTGATCGAACATGGGAAGATCAGGATGGACAGTGACAAGTTGAAAGCCATCCAGGAGTGGAGAGTTCCTGTCTCTGTGCCCGATTTGCGCTCTTTCTTGGGTCTAGCAAACTACTACCGTCGTTTCGTCGAAGGATTTTCAAGAAGAGCTGCCCCCTTGACTGAGTTGTTGAAGAAAGACACCACCTGGCAATGGTCGGCCGAATGTCAGTCGGCCTTCGAAGAGCTTAAAGCGACTATGACGAGGGGCCCTGTCCTCGGTCTAGTTGACGTCACTAAACCGTTTGAAGTAGAGACAGACGCTTTAGATTATGCCCTCGGAGGTGGCCTTCTTGGCGTCCTTCTCCAAGAAGGCCACCCCATAGCTTACGAAAGTCGGAAACTCAATAGTGCAGAAAGAAGATACACGGTCTCTGAGGAAGAAATGCTGGTCGTGGTCCACTGCCTTAGAGTCTAGAGGCAATACTTGCTGGGATCATGTTTCGTAGTTAAGACAGACGATACTGCGATCTGCCATTTTTTTAGCCAGCCAAAATTAACATCAAAGCAAGCGCGGTGGCAAGAGTTTCTAGCCTTTCGAACACAAGACGGGGAGAAGCAATCAAGCTGTCGATGCCCTTAGCCGAAAAGGCGAGCATGCAGCCATGTGCGTGTTGGGCCATATTCAATCAAGCAAGGTCAATGGATCGATGCGGGAGATCATCAAAGAATTTTTGCAGAAAGACCCTTCTGCTCAGGCCGTAGTATCCTTAGCCAAAGCTGGCAAGACCAGACAATTCTGGGTCGAGGGAGACCTGCTGTTGACAAAAGGGAACCGGTTGTATGTTCCTAGAACAGGAGACCTGAGGAAGAAGTTGTTACACGAGTGTCATGACACCTTGTGGGCAGGCCATCCAGGATGGCAAAGAACCTATGCACTACTGAAGAAGGGTTACTTCTGGCCCAGTATGCGAGACGATGTCATACAGTACACCAAGACCTGCCTCATTTGCCAACAAGACAAGGTTGAGAAAGCGAAAATTGCCGGGCTCCTTGAACCCCTACCAGTGCCATCTAGACCATGGGAGATTGTGTCCATGGATTTCATCACCCATCTGCCTAAGGTGGGCGAGCACGAAGCCATCTTAGTTATCGTTGATCGGTTTTCGAAGTATGCCACCTTCATAGCTACCCCAAAACTATGCTCTGCTGAAATGACAGCCCAATTGTTCTTCAAACACGTGGTGAAGCTATGGGGAATTCCAGCCAGTATTGTCAGCGACAGAGATGGCAGGTTCATAGGCACCTTCTGGACCGAACTATTTTCATTCTTAGGGACAAGCCTGAACATATCCTCGAGTTACCACCACCAAACAGACGGCCAGACAGAAAGGCTTAACTGCCTGCTAGAAGAATATCTGCGTCACTTTGTCGACGCCCGGCAAAAGAATTGGGTGCAGTTGTTAGATGTGGCCCAGTTCTGCTTCAATTGCCAAACAAGCTCATCAACAGGGAAGAGTCCCTTTGAAGTTGTAAGCGGAAGACAACCCTTGTTGCCCCACATTATTGATCATCCGTACGCAGGAAAGAACCCCCAAGCCCGCAGCTTTACAAAGGAGTGGAAACAGACCATTGAGATTGCACGAGCCTACTTGGAAAAAGCCTCCAAACACATGAAGAAGTGGGCGGACAAGAAGCGACGCCCCCTTGAATTTCGAGTAGGGGATCAGGTTCTGATCAAGCTGAGACCCGAACAGATTCG
ATTCTGAAGTCGAAAGGACCAACGCCTCGTCAGGAGGTATGAAGGTCCAGTAGAAGTCCTGAAGAAGGTCGGGAACGCTTCATATAGGGTGGTGTTGCCCCCATGGATGAAAATTCACCCAGTAATTCATGTAAGCAACCTCAAATCCTATCACCCCAACCCCAATGATGGTAGCCGCAATGTCACTGTTTGGCCAGATATCGACCTCAAGCACACATACGAGAAGGAAGTTGAAGAGATCCTCGCAGACAGAGTCAGGAAAGTTGGAAGACCTGTTCGGAGAGTCCTCGAGTTCCTTGTCAAGTGGAAAAATCTCCCCGCAGAAGAAACGAGTTGGGAACGCCTTGAAGATTTGGAAGCGTGGAAGCCGAAGATCGAAGAGTTCAAGCTCCACCAGCCGCAGAGACGTCAACTGATTAAGTGGGGGAGAGTGTCTTGAGCATGCTTGTCCAGGGAGTGTTTGCCTATGGCCGCATGCCCAAACCACCCTCAACCACCTTGTATTTATGTTTTGCAAGTAGCTTAGGAAGTTGTTTTCATTTTCATGTTGTTTTCCTTTCGAGTGACGTTTCGCCAGTTTTGATATGTAAGGCTGGCAGACTAGTTTTTGCTTAGAATGTACTATAGGGGACCCCCACTAGTTGACTCCCCAACCAAGTATTGTTGTACTCTTTGCAAACAAGCTTTCTTGCAATTGTCAGTCCTCAATGCTTTATTCAATGATGTTTTCAAAGCTTTTCCTCTCTCGTTTTATAAAACCATTTTCTCTCAAAACGTGAGTAGAGGTTACATCATACCCCGAGTTTACCCGCGACCCACGGTTTTTGTACGGCTGACTTGCTCGCCGGGTTAGAGCAGTGCCGCACAATCGCCGTCTCGTATTCTAAAGCGAGCGATTGTGACACCAAGCACCCTTAGAGATGAAAGAATATTTCCAATTATGAAATTTTTGATGAATTCGTTCAAAAACCGGTAGCTAAAAGCCAATAGACTTGGAATTTCCACCCAACTGAAAACCAAGATATGTTGCAGGCTAATTAGCTCTTTTACAACCAAAAGTATTAAGGATCCGATCGAATCCGATTCATTGATATTGGTGTCCAACAACACTGTTTCTCATAAAAAAAAAGATATGAATGCTCAACAAGATTAATTTTCAAACTAGAAGCACACTAAAAAATATGAACTATCTCATAGATGACCAAACTTTATTCTCTTTTGTCATCATCTTTGAAGCTTTCCCCTAATGCAGCCAAGTTATAATTTCCTAATGCAGCCAAGTTATTTCATAAGTAATAGTAACTTTACTTCATTAATAAGATAGAACATGAGACCTCACAGATGAAAGATGATTGACCAATAGGATGAGAACCAACAAGAATCAATGAACTATGCTTCATTAGACGACTCATACGATCTGCCACTAGAATAAATAAAAATGGGGAAAGCAGGTCACCTTGTTTGATATCATGAGAAGGAGTAATATTTCCCCTCGGTTGACCATTGATAATGATAGAGAAGTTAACACTTAATTTGAGTCATATCTTTGGTAATTATAGTTTTCTTTGACCTTCCATCCAATTGAAATTCGTTATTCTAATCTAATGGGTATGATCTGATCATTTTATAATTTCATTTTATCAATGAAATGCTTCTGCTTCCTTTCAAAATAAATAAATAAATAAATAAAATAACATTAAAGATTAGGAAACTTTGGACTCTCTATATCACATAATATAGCTGTTTCTTGTTTGTTAGGCTGGTTCCTGTTCGACTCCTTTTATGCTATTGTTTCTGTAGGCCCACTTCTTGTTTTTTGGTCAATGTTGGCCCTCTTGTATTCTTCCAATTTTTTTTTTTCTTAATTAAAGTTGGGCTGTTTTAAAATATAGGAAAATGAACTAAAATATTTACAAATATAGCAAAAAAAAAAAAAAAAAAAATCAATCCTTAAACCTTTTCCCAAGCATTTTTCTAGAAAAAAGAAATCAATCCTTAAT
GGAATTCTCATTTTAAGCCACTGCCTCAATCGGAGATTCCGGCACATTTAAAGTCTATAATAGCAGAATGTGGACTGGTTCTGGGTTAATCCATCTATCCAAATCATTTTAGTTGTGACTCTTGGCTCTCCTTGGATCAAAGTTTAGCACAGTGGTTTGAATAGGTCTTGTTCTCTTTCATCAGCTTCAGATAATTGAAGACAGCCTCTTCTCCTTGGTTGACTTTTCTACCAACAGATATCATATTATATTTATCTTGATATGGCAGCTATGGAGTTGGTGATCCTAAGGACTGTTGTAAATTTCTTGGGCTGTTTTGTTTTGCATTTGTTCCATTTTGATGGGATTGGTACTTGTCTCTTTTGATACTGTACTTGCATTCTTTTGGTTGTGTTAAGGGTGATATTTTCTTACTTGTATTGTACTATGAGATATTATCTCATTTCATTATACTAATGAAAGAGACCGTTTCCTTTAAAAATAATAAAAAAAAATATCCTTGTATTCTTAAAAAAAATATAGCAAAATTTTATTGTTTATCTACAATAGACCATAACAGATCGCGATAGACTACTATTTCTATCTATATGTTTCATGATAGATATAGATAGTAATTTATCGCGGTCTATCTGCTGATATTTTGCTATTATTTGTAAATATTTAACATTTTTGCCATTTAAAATAATTTCCCATTAAAGTTTGATTTTTTGATTAAAAAAAGTTCCAATTTGGTAGTTTGAGTTATAATTTGAGACATGACATTATCTGATTATTTTTTATCAATTCTTCTAAGAGTTGGTTGCATTATGTTTTCTTTTAGTGTTTCTCTGTCTATTTGATTTTCACGGTTTACTCATGTGCACCCAAAGTATTTTTGTTAGAATGTTGGAGTTCGGTTAGGCAGTCAATTTTTATGTACACATTTATTCATGCTCATTGATTAGGTTTTTATTTTTGTATTCTGCAGCATCTCTTCTTCTGTGATATAATTCTATCTCATTCTTGTTTTGTTCCAGCGAATGGTGATGATAATTTGGTAATGGGGCTGTGCATTGATCGAGTTTCTCTTCCTGGGAAGGTGTTAGTTAGAGTTGGATTTGAAGATATGAGAGAAGTTTCGCCATATTGCATTCTCGTGTGTCTTACTTTAGAGGGAGAGCTCATTATGTTTCAATTTTCTAGGTACTGCCTCTTATGTTTTAAAACCTTTTTTTGTTATTGTACCCAAGGCCGAAGCCTATTTTTCCTTGAATAGATGTTTCTGTCTTTAGCTTTTAATAGCAGCCTGAATACTATGATTATTGCTCCTGATTTCTTTTTACTATGCGTTTTCAACTTTTCATGCAGTGTCAATGAAACTGAAGCTCCACATGAGACTGTTTCTGCTTGTGATGAGGAGGAAGATGATATAATAGTGCCTGCTGATGATCGGTCTCAACTCTTTTCTGGATCAAAGAAAGAGTTTAGAGAAGATGATCTTAAGATGCAGGTTACGGAAAAACTTGCAATCAGTAGTGAGATTCCTCAGGAAAAAATTAAAATCTCAAATGACATTAAGTCTTCTAATAATGATCAAAGTCCAGTATCTAAAATAGATGAGAGTGCAACTGTTGGTGCAGAGAGTAATACTAAAAGCCAGAAAGCCGATTCTTTCATTTATTCACAATCATTAGAGTCTTCTGTCCTGGAGAGACCCAACTATGAGATTGGGAACTTTGATAAGCCTGTTCAAAAATTTGGTCTCGGGTCTGTTTCTATTTCAGGTAAGTCTGCGGACGTGCATAGCCAGCCCTTTCCCAATGTAAAAGAATCAACAAAAAGAATGGTGTCAACTGGCTTGTTGGCTGCATCTGAGTTATCCAGTGATAAAGCAATGTTTTTAAATAAAATCGATCCTATATCTTCAGTCCTAACTCCGAATTCTTTTCAAAGCAGCAAGACTGAGAATTATGGGCCAAGTTTTGGTACAGCGAATGCTTTTGCAGGTTTTTCTGGAAAAC
CTTTTCAACCGAAGGATGTTCCAAGTACATTAACACAAAGTGGGAAACAAGTAACGGGAGGTGCTGGTAAAATTGAATCTTTACCAGTGTTACGTAGCTCACAAATATCATTGCAAGACAACTTGTCGGCGAAAATTTCTAATGAGAAACATGATGGTTCAGACCGAAATTACAGCAATGCCCCCCTGGCAAAACCAGTAAGTTCTGAACGAAATTTATTCAAACAATTTGCCAATGTGAACGTACTTGAGTCTAATATAGGAATGACTTGGATTAAAATTTGCCATTCTTTGGCTTGCTCATACACCTCCAATCCATCGCCTTTGGAAAAGAAATCTTTATAAAGAAAATACACATCCACTCCATTTTTACTAGTCTAAAGGGAAAAGTGGATAAAATACTTGCATGAGCTTTCCTGTACAGATATATTCCTCTTAGAAACATCTTGAATTAGAAAATGGGATATCCCCTTTGTATTTTTATGCTTGAATGATGGTCCCTGTATGTGACTAACAAAATAACGAGGCCCATGCGTTTTCATATATGACTTCATATTGTATATTTACCATATAATTAGTCAACTAACATCTCTCTGTTTCGTTTACTCCGGATAGTTATTGGTTCTGACAATTTTTGATAGCTGTGTTACCCTTTTGTAAATACATAGGCAACTAAAAAGCGCCCCACCCCCATGTGTTTCCCCTTGTAATCTCTTAAGATTTTTTCTAGTTTCTTCAATACTCCCTTTTCCCTACAGATGAAAGAAATGTGTGAAGGATTGGACATGCTTCTGGAGTCTATAGAAGAGCCGGGTGGGTTCTTGGATGCCTGCACTGCTTTCCAGAAAAGCTCTGTTGAAGCTTTGGAGCGTGGCTTAGCCAGTCTTTCAGACGAATGTCAAATATGGAAGGTAATTTGTCAATTGTTATTTTATATTTTCAGTTTGTTAAGTATTTTCTCCTTTATTTTGTTAGACTGAGAATATAGTTATTAATTAGTATTGCCTCATTATTTATTGCTAAAAGATTCAGAAAAAGGAGAAAATGTGAAAAAGAAATGTATGGAGTAGAAAGTGGCTGCAGGGCTTCTCATCCTAATAGCAATGGCGTGTGTAAGACATTAGAAGAGGAAAGAATAGGGAAATGACAATAGCTTGATGTAAGCATGACATAAAAGAAAATACTGGAAAAAGAAAATGTATGCTATGAGTGCCAACTGATAATCAACCAACGATGTTTGACTAGAAAATTTAAGAAAGAGCTGATAAAAGGTAATAGGTAATTGATTTACAAAGTACTTGATCTTAGAAAACATCAAAAGTATAATTCAGGTTTCAAGTCTTTTCTAGGAATGGAACTCATTGAAAATTTATTGCTATGATTAGTTGCTTGAATTTCTTTATTGATTATTCACTCAGTTTGCTGTTTTTGTGGATTTAATGGGGGTTCTGTAGATATCTAATGTTGCATTATTCTTTTGCTTATTATCCTTGGAATAATTTGTTGTTTTGGATTAATTCTATCAGTAGTCACTGTAGTGATTAATTCTTGTTTGAGGAGATTTGGTTCGCACTCAGGTATAGCAGATAAATATTGTGGACCCTGGCTAGATCTATCCTTTTTTGAGCGTTAGTGTCAAATTCTTTTTGAAATATCCTTTAGGTTTCATTTTGCTTGATTGGACTCCCTTTCTTTAGTGGCACTCTCCCTTTTTGTGGGCTTAGCATTTTTTGGATGCCTTTGTTTTTGTCTTGTTTTTTATTTTTATTTTTTGTTTTTGTTTTTTCATTTTTCTTCAATGGAAGTTTTGTTCTTTATAAAAAAAAATATGTGTTTGAATGCCCTCCCTTTTCTCCCTCAAAGTTTAGGTTATTTTTAATTCTTGCAATTTGAATTGTATGTTTTATTTTTGCGTTCTTTGTTTAGGGTCCTTTTTCTTTTCTTCTTTACTATATCCTGTTCCTCGCTATCAATACAGTTATCTAACATGCGAATTTCATT
TTCCTTCTCGTTGTAGAGCACAATGAATGAGCGTTCACAGGAGGTACAAAATCTCTTTGACAAAATGGTACAAGGTATTGGAACCTCAATTTCTTGTTCGTTTATCCTACATTATTGATGGTGACCCTGTTGCATGAAACGCTCCTTGCCGATCTAGCTAATCTTGGAAAAGTGCCTTTGTTTTTTAATGCATTGAATATTTGATCGTCTCAAAGAAATAGAACTATAAAGTTCCACGTATATCCCTTCCAAACATTCAAGCTAAAATTTATCTCTAAAAATTCTTATGGGTTATCATGGGAATTTGATTGAGATTTGCAAATAATAAGGTGGGAGATCAGGAGTACTGGGATTTGGTTGAAATACGTAGCGATTATGTCAGTCCACTCATCAAGTTGACTTGGTTAAATATCAAATTGTAAATCATAATCAATTAATTTCTGGTCACAATTTCATATATATTTTACATTCTCAGTTTATTTGGTTTCAGTTTTGTCAAAGAAGACGTACATTGAAGGTATTGTGATGCAAGCTTCTGACAGCAAGTACTGGGAACAATGGGATCGTCAAAAGTTGAGTTCAGAATTAGAGTTAAAGCGACAACACATCTTAAAGATGAATCAGGTAGTCTATTCTTGTTCCTAAGACCAATGAATTTCCCCAATTTCAGATGTGATTTATCCATGCCTTTCTTCTCTTTTCCAGAATATGACTAACCAGTTAATTGAGTTAGAAAGACATTTTAATGGCCTTGAATTGAATAAGTTTGGTGGAAATGAGGAAAGTCAAGCGAGTGAAAGAGCTCTTCAAAGGAAATTTGGTTATTCGAGGTACTTCCAATCTTTCTTTTCTGGCTCTTAAATGTAGAGTCTTCTCAATTCTTGTTCTCTCTTTGTTTTTTGGTTGTATTGTTGTTGATTATTTGGGGATTATTTTCCTTATTGTTAAATATTTTATCTTTGTATTTATTGTAAAGATATTTACTATATTTATCTCCTTCCTTATTTGTAATAGATTTTCTTTTATATAAGAAAACCCTTGTTTAACAAAGTATGTAGAGAGATAAAATATTTTAGCAATTGTTTTGCTTCTATGCTTTTATTTACTCACTCATTTCCAGAGTTTGTATCTGTTGAGCAATAGTTTCTTTTCATTATATCAATGAAAAGTTTTATTCCTAATGTTGAAATCTCAACTGTCTTCATGATTATTGAAATGAATTAATTTAGGGCTGTTTTCAATTATAGGAAAATGAGCCAACTTATTTACAAATATAACAAAATGTCACTATCTATCAGTGACAAATGGTGATAGACACTGATAGATAGTGACAGTTTGCTATATTTGAAAATATTTCCAGCAATTTTGCCATTTAAACCAATTACCTATTAATTTATTGTCTCTGTTGTCTTTCACTTTTCAGCTGATATTTCCTTGAGTAGCCTGACACGGGTTTCTTCATCAAAATTTGTTGCAGGCATAGTCATTCATTACATAGTTTGAATAACATAATGGGATCTCAATTAGCAGCAGCTCAACTTCTTTCTGAAAGTCTATCAAAACAATTGGCTGCACTCAATATAGAATCACCCTCTTTAAAAAGGCAGAGAGTCACGAAGGAATTGTTTGAGACTATTGGACTTACTTATGATGCTTCTTTCGGTTCTCCAAATGTGAACAAAATTGCAGAAACTTCTAGCAAGAAGCTTTTACTTTCTGCTGATTCTTTTTCAAGTAAAGATACACTGAGAAGAAAACAGCGGAGTGGAAGGAAAAATTCTGAAGCAGAAACTGGGAGAAGGAGAAGAGATTCACTCGACAGGGTACTGTTTCGTAGAACTTCTATTTATCTATTTTTACTTGTGGTTCATATAGGATTTTTTTTGTTAATTGAAAATGAGATTCATGTTACATGAATAATATAAATTTGGAAAGTGCCATACACGGAAGATGGTCCAGAACTTGGTAAGCTCTCTGATCACTTTTCTTCATATGTCTGC
CCATGGGCTTCTTTCATAGAATGTAGTTAGAACAGTGGTTGGAGGATGCGTATGGTCCACATTTTCCTAGATGGTAACTAATGAGTTCTTGTATGAAAAGTATCAGAGTGGGGTCACTCTTGCTGACAAGAAACCCTTTACAAGATTGCTTTTAGTTATATTTAGGGCTTCCCCTTCTAACATAGTCACATAAGATTTAAGAGAACAATTCCCAAAAGCCCTGAGCTACTTGCAAGAGTTACTATTGTACTATTATGTTTTTGAGTTTTGCATTCGTCATGTTGTCATGTTTATTAGTCGTTGGGTCTGAAAATACAAGTATACTACTAACCATCTACTAGTCAAATTTAAAATATTTGCATCTAAGAATGGCCATCTGTAATAGCAGCATCTACATGAAGAATGTTTAAGAACCCCCAATCCTTAAGATGTATCAAAGAAGGGGTAAAAAGGGAAGTAAGGGAGAGAATTGCATTAAGGAGAAAGAGAATTGTATTAAGGAGGAAGTGGGGGAAGTTAGTTAGAGGAGTGACCGCTCATTGTTGGGTGGGCTATGGCCCATGGGAATGGAGAGGTTTTGCGTGACAGGGGAGGGGAGACTGATTTTGGTGGTAAATTGCAAGCCTGACTTGTAGGAGAGGATTTCTAGCCCTCCATAAAGTGTTGGAGTGATATTTCTCTTTTTCATCTTCCTTCTTGTTTTCATGAGAACTTCTGTTGTGAATTTCCCTTGAATTATACATAAATAAGATCAGTTGAGCTTTGCTCTGTTGCTTGGGAATTCTTGTTAGGAAAGTCTATTGCAGGTGCGGTTTCCTAACAAATTGGTATTAGAGCCATTGTTTTTTCTTTTTCATCCTGGGAAGAACGCATAGTGAGACGAGGCGAGAAATGGAGGAGAAGATTGATAGTCATTCCAAGAGTCTCGTGGAGTTGAAGGAATGGATGATTGAAATGGCAAAGATAGTGGAATGTGTGGATAAGATGGTCCAGATGAATCGAAGATCGGATCTGTGAAGATGATGAAGGAGAAGGAAGGAGAGGAAGGCGAGAGTTTGCAAGCTAAAGCGAATGACGGAGGTGTTGATCGCAACAAGTATAAACGGTTGGAGATGCTGATTTTTTCTTGAGAACACTTAGATTCGAGGGTTTATAGAGTTGAGCACTACTTCGAGATTCATGAATTATCTGGCACTGAGAAGATCAAAGTAGTGGTAATAGCCTTTGTCAAACGTGGTTGATTGGTTTTAATGGGCTCATCAGCGAAAATCGATCAGATCCTGCGAAGCTCTAGTGCATAGGATGTTTGAAAGGTTTCGATCGTCTCAAGAAGGGTTCCCTGCTGTCTCGATTGATGTGAATTAAATAGGAAGGGACGTTTGAAGAGTATTGGAAGAAATTTGAGTCATATGCGGCCCCAATTCTGGAGACGGTAGAAAATGTGTTACAGGAGGCATTCATGAATGGGTTGTCGCTAGAAATAAAGGCAGAGGTAACGAGTAGACATCCTGTGGGCCTTGATGAGTGCATGGTGGAGGCCCAAGCTGTGAGCGATCGTAATCTGGAAATGAAGTTGGCTGAAGAAGAATTGGGCCTAAGTAAGCTTGTGGCCCAACAACAAACAGGTAAACAAGTCGAGGTAGGGGGAGCGAAGACATAAGGTAAAACCCAACCCGGGGGATGACGAAAAATGTCCTACCTGAAAAGGGGGAATCTAGAAAAAAGAACAACCCTACTGAAAACTATTGGATTTTGAGATTTGAGAGAGAAGGGGCTATGCTTTTGGTGCGATGAAAGATACTTTCACGACCACAAGTGTAAGACGAAAGAGAAGCGAGAGCTAAATTTATTGATCGTGCACAACGAAGACGAAATCGACAAATTAGAAGTGAAGGAGACCGAGGAGGAAGAACCAGAGGTGAAAGTCATGGAAGTACCGAACAACATTGAGATCGTGTTACGCTCGATTCTGGGATTTTCTACTAAGGGAATAATGAAAT
TAAAAGGTTTGATAGTCGGTAGAGAAGTGATAGTGATGATCGACTGTGGTGCCACACACAATTTCATACATCAAAAATTGGTGGATGAACCAATTTTGCCTCATGCTAACAACAAAGTATGGGGTGGTAGTAGGTAATGGAAAGGCAAATCGGGGAAAAGGCATTTGCCAGGCGGTTGTTGTGGTATTACTCGAATTAATGGTGACAGAGGATTTATTGCCATTTGATTTAGGAAGGGTGGATATAATTTTAGGAATCATTTGGTTGTGCAATATGGGATACATGGAAGTCCATTGGCCTAGTTTGACTAGGACGTTCACGATTGGAGATAGGAAGATAACTTTGAAGGGGGATGCTTCATTGACAGCAACAGAGGTCACTCTTAAAACGTTGACTCATAGATGAGAGGAAGAAGATATGGGGTCCCTTGTAGAATTCCAACACATGGAACCAGAGATTGAAGAAGAAAAGAGACAAGTTCCAAACGGTAGTGAATAACAACCACCGCCAATTAATTCAATGCTTATTGGACGTGTATGAGGATGTCTTTGAGTTTCCTATTGCTCTACCACCCAAAAGGGTGGTGGATTATCGAATTAAGTTAGAAGAAGATGTGAAACCGGTCAACGTGCGACCTTATAGATACAGACATACACGGAAGGATGAGATCAAAAAACTGGTAAATGAGATGTTAGTAGCAAAGATTATACGTCCAAGCCATCGTCCTTACTCGAGCCCTATTTTATTGGTCAAGAAGAAAGACGAGGGATGGCGTTTTTGTTACTGGAAGTTAAATCAATCAACCATAACCGACAAGTTCCCAATTCCTGTAATAAAGGAATTGATTGATGAATTGCACAGGTCGGTAATCTTTTTGAAGTTGGATTTGAAGATCGGTTATCATCAAATACGAATGCATGAGCCATGTATAGAGAAGACAACCTTCCGTACGCATGAGGGGCATTATGAATTCTCAGTAATGCCGTTTGGATTAACCAATGCTCCGGTGACCTTCTAATCAATAAGGAATCAAGTATTCCGCCCTTTTTTGAGGAGACATGTTTTGGTTTTCTTTTGAGGATGTTTTAGTGTATAGTCTCGATGAAGATACCCATGTCAAGCATCTGGGAATGGTGTTAAATGTACTGCGTGACAATAAACTCTACGCAAGTAAAAAGAAATGTGTGTTTGGGCAAGAACGGATTCATTACTTGGGGCACCGGGTATCAACTCATGTAGTGGAAGCAGATAGAGATAAGATTCAAGAGCGACGATACGATGGCCAATACCGAAAACGGTATCTGAATTGAGGGGATTCTTAGGACTCACTGGATATTTTTGAAGATTTGTAAAGGATTACGGATTGATTGTGGCTCCTCTGACGAAATTGTAACATAAAGATGCCTTTAAGTGGGATGATCAAGACATTGAGGCTTTCGATTCTCTATAAAACGCCATGGTAACTCTTCCTGTACTTGCTCTACCAAATTTCGATCTTCCTTTCGTGATAGAAACAAATGCATCAGGCTTTGGTTTGAGGGCTATGTTGATGCAAAGGGAGAGACCGATAGCTTACTTTAGTCAGACATTGTCTATGCGCGCCCAAGGAAAGTCTATTTATGAACGTGAGCTGATGGTAGTGGTGTTGTCCCCACAAAAATGGAGGCTTTACCTTTTGGGAAGGAAATTCACAATGATATCGGATCAGAAGGCGTTGAAATTTTTATTAGAACAGCGTGAAGTGCAACTGCAGTTCCAGAAATGGTCGACTAAACTCCTCGGATATAATTTTGACATTGAATGTAACTAAGTTCTTAGTGGAGCAAGGGGTGTTATACTACAAAGGAAGGTTAGTGCTATCTAAATCTTCTTCCCGCATTCCAACCTTGTTGCGGACTTTTCATGACTCGGTACTAGGAGGCCATTTGGGGTATTTACAAACATATAAAAGAATGTTGGGAGAGCTTTATTGGAAGGAGATGAAAGAGGATGTAAAGAAA
TATGTGGCTGAATGTGTAATTTGCCAGAGGAACAAGAGTGAGTCAGTGTTGCCAGCAAGTCTTTTGCAGCTTTTACCAATACCGGATAGAGTTTGGGAAGACATTTCAATGGATTTCATGGAAGGACTTCCAAGATCAAAAGGGTATAATGCCCTAATGGTGGTGGTTGACAGATTAAGCAAATATGGGCACTTTATTCCGTTGAAACATCCCTTCACAACCAAAACAGTTGTCGAAGAGTTCATTCGTGAAGTGGTGAGCATCATGGATTCCCAAAACAGATGTCAAAGGGTTCGGTCGCGATAAAATTTTTGTAAGCAATTTTTGGGTTGAACTATATGCTGTTCATGGGACGATGCTCAAATGGAATACAACATTCCATCCTCAGACGGATGACCAGACTAAGAGAGTCAATTGGTGTATAGAGACATACCTTCCGTGTTTTTGCAACGAGCAACCCACCAGTTGGTTTCAAATGGATTCCATGGGCTGAGTATTGGTATAATACAACCTTTCAAAGTTCAATACACATGAGTCCCTATCAGGTGCTAAATGGACGGCCTATTCCTGCACTAGTGTCATATGGTGATAGGAGGATTACTAATGATACCCTGGAACAGAAGCTAGTGGATAGAGATCGAGCACTGATAGCTTTAAAAGAGCATCTGGTACTGGCCTAAGAAAGGATGAGGAAATACGCCGATTAGAAGAGGCAGGATGTTCAGCTTGAAATGGATGACATGGTTTTCTTGAAATTCCGACCTTACAGACAGTAGACACAGGCCTGAAGACGATGTGAAAAATTGGCTCCTCGATTGTATGGACCATTCAAGGTAATTGAGAAGGTGGGGGAGGTTGTGTATAAATTGAAGCTTTCGGAAGATGCAAAAAGACATAATGTCTTTCATGTTTTCGCAACTCAAAAAGTATGTGGGGTCAACTACCCGAGTACAAGCTACTCCTCCAGACTTTATCAAATGATTTCGAATTACAAATGGTTCCCGAGAAGAATTTGGGTGTTCGTTGGAATAATGACATGGTGAAAGAAGAGTGGCTTATTAAATGGCAGAAATATCCAAAGAGTGAAGCAACGTGGGAGATTGCGGGTTGGCTGAAACAGTAGTTTCCAACTTTTCACCTTGAGGACAAGTGTGAATGACAACCCGGGAGGTATTATAAGACCTCCAATCCTTCAGACGTATAAAAGAAGGGGTAAAAAGGGAAGTAAGGGAGAGAGTTGTATTAAGGAGAAAGAGTGGGAAGTTAGTTAGAGGAGGGACCACTCATTGTTGGGTGGGATATGGCCCATGAGAGTGAAGAGGTTTAAGAGAGGTGTCGCGTGGTAGGGGAGGCTGATTTTGGTGGTGAATTGCAAGCTTGGCTTGTAGGAGAGGATTTCCAGCCCTCTGTAAAGTGCTGGAGTGATATTTCTCTTTTTCGTCTTCCTTCTTGTTTTCATGAGAACTTCTGTTGTGAATTTCCCTTGAATTATACATAAATAAGATCAGTTGAGCTTTGCTCTGATTCTTGGGAATGCTTGTTAGGAAAGTCTTTTGCAGGTGTGGTTTCCTAACAGAATGATATTCATGTACTTCATAAAAAGCCCATGGTCTCTGAAGTCAGGTGTTGCGCTCAGGAATGCACGAACTGAGTGTTCTTTGAATTTTTTATCTCCATCCTCCCAAGTTGATAATTCTCCTATCTGTTCTTAGCAGGTTCCATCTGCATGGTTCTGAGTACTTGTAGGATTATTCGGAGTATTGGCTTTTCAGAAACTCTGGATAGAAAGGAATTACCCCATTTCTTAAATGATACAAGTTTGCAGTGGCTCTTCATTTTATTTTTCTTTTGAATGATACAAGGCAGTTGGAGAATCTTTTTTATGTTTATTTTTGCTTGGACGGAGAGTATACCTTATCCCTTCGGGACACTTTTGCATATTAATGTACTGATATTTCCCTGGTAATATATCTTCATATCGCACTTACTTAATCTTCTGC
ACTTCCAAGATCAAAATCAACACGTGAGGTTTGGAAGCATCCAAGGGATCGGATATCTAGGGATTCAAAATGGGATAGTTTATATAGTACTTTCTCTTTAAGAGGATTTCCTTCTTGCCATTCAGTTTTTTATTTTTATTTTTATTTTTTTTACAATTAATAATAATAATAACTTTTAAAAGAAAAGTAACGTCAAGTTCATTGTCTATGTGGTTATAAGCAATTATACTTGCTCCTTGATAGATTTTTATAACTAACATATCTTGCAATGAAGTGCACTATCTGACATGGCTGATTCTGTTTTGTACTCAGTCTAATAATTAAGCAGACGAAGTGCACTATCTGACATGGCTGATTCTGTTTTGTACTCAGTCTAATAATTAAGCAGACTTATATGCTTTTGTGTCCAAACATTTATTCCCAATAAAATTGATGCAGAACCTGGCTAGTGTTGAACCTCCAAAAACAACTGTTAAGAGGATGCTTTTGCAAGGAATACCCTCCTCTGATGAGAAACTATTTCGGTCTCGCACACCTGAAGGGGCAGCAACAGTTGCAGGGCCAGCTAGTCGATTAACATCATCTAAGTCATCATCATCATCCAAAAATGCAGGTAATCCCTAAGATGAAGCATAAGCAGAAGGTTTTTATAGTCATTTGCTTGCCCGCAGCTTTCACAAATTCTATAAAATCACTTGTACCTTTTGAGTTTTGAATGTTTCAGTTTGAGGGTATTAGTAGGAGATTTCTTGGTAACAGTAAAAGGATGGTAATAAGGGCATAGTGATAATTAGCTGGAGGAGCTTTGGTTATAAATAAGGGAGGTTGTGCACCTTAGGAGCGGGGGGGATCAGTTTGGTATCCTTTGTATATTTGTTGAGGGAGAGACATAGTCTTCTTGAATGGCTATTGATATTGCAATAAAGTTATCTTTGATGTTTTCATATATTTTCTGTGTTTTGGTAACCTGACATCCAACTTATATTTATTTTACATATCCAGGACATGACTCTGAGAATCCAGCAACTCCTTTTATGTGGGCTAGCCCTTTACAACCATCAAATACCTCCCGTCAGAAATCTCAACCATTGCCAAAAACTAATACTACAGCGCCATCTCCACTGTCAGTATTCCAATCATCACATGAAATGCTGAAAAAAAGTAATAATGAAGCTTTCAATGTGACTTCAGAAAACAAATTTATTGAAAAGTCGAAAGCTTCTGATTTCTTCTCAGTCACTAGGAGCGACTCTGTCCAGAAATCTAATATAAACCTTGATCAGAAATCATCCATCTTTACGATATCATCTAAGCAGACGCCCACACTGAAAGATTCTATTAATACCTCTAATTTGGACAATCAGAAGACTGCTAACGCAAAAGAGAGGCATACAACTACAAGTCCACTTTTTGGATCTGCAAATAAACCTGAATCTGCATCTCTTGGTACAATGTCTTCTCTGGTTCCTACTGTTAATGAAGCAAGAAAGACTGAAGAAAAAAGATCGCTGACAACGATTTCACCATCAGTTCCAGCATCAGCACAGTTAAATACTCCAAGTTCATCAACTTTATTTTTAGGATTTGCTGTAAGCAAACCTCTTCCAAGTTCTGCTGCTGTTATAGATCTCAATCAACCTGTGTCAACATCAACCCAATTGAACTTCTCCACCCCAGTTGTTTCTGTTTCTGATTCCCTATTTCAGGCACCTAAGATGATATCAACATCATCTACTCTGTCTTTGAATCCTTCATTGGAGTCCTCGAAAAAAGAGTTACCTGTTTCAAAATCAGATGATGATACTGAAAAGCAAACACCAGCTTCAAAGCCTGAGTCTTATGAACTGAAATTTCAACCTTCTGTAACACCTGATAAAAAGCATGTAGAGCCAACTTCTAAAACCCACACAGTTTCCAAAGATGTTGGAGGACAGGTTCCAAATGTAATAGGGGATGCTCAACCACAACAGCCATCTGTTGCTTTTGCTCCATTACCTTCA
CCAAACTTAACTCCTAAGATTTTTGGTAATGTAAGAAATGAAACTTCAAACGTGACGGCTACTCAGGATGATGATATGGACGAAGAGGCTCCAGAGACGAATAACAACATCGAGTTTAGTTTGAGCAGCTTGGGAGGATTTGGAAATAGCTCCACCCCTATGTCAGGTGCTCCTAAACCAAATCCATTTGGTGGTCCATTCGGTAATGTGAATGCAACCTCAATGACCTCTTCCTTTACTATGGCATCTCCTCCAAGTGGAGAGCTGTTTCGGCCTGCATCGTTTAGCTTCCAATCTCCATTGGCTTCACAAGCAGCATCACAACCCACAAATTCAGTTGCATTCTCTGGTGGCTTTGGCTCTGCAATGGCTACTCAAGCCCCGTCGCAAGGTGGGTTCGGTCAGCCTGCTCAGATTGGAGTAGGGCAGCAAGCTCTTGGTAATGTTCTTGGTTCATTCGGACAATCAAGACAGCTTGGTCCTAGTCTCCCTGGAACTGGTTCAGGATCCCCTGGCGGTTTTGGTGGTGGGTTTACCAGTATGAAACCGGTTGGTGGGTTTGCCAGTGTTGGTTCAAGTGGTGGTGGTAGTGGTGGGTTCGCTGGTGTTGGTTCAGGGGGTGGTGGTGGGTTTGGTGGTGTTGGTTCGAATGGTGGTGGTTTCGCTGGCACAGTCCCAACCGGTGGTGGATTTGCTGGTGCTTCCTCTACAACGGGAGGTTTTGCTGGTGCTGCAGGCGGGGGTTTTGCAGGTGCCGCAGGTGGATTTGGGGCTTTCGGCAGCCAGCAAGGAAGCGGGGGTTTCTCTGCTTTTTGTGTTGCTGCTGGTGGAGCTGGAGGAACTGGAAAACCTCCTGAGCTTTTCACCCAGATGAGAAAGTAG

mRNA sequence

ATGGCTTCCGTTGATTCCCGACCTTCAACCTTGATTCCATTAGAAGACGCCGGCGAAGGAGAACAAATTGTAAGGAACGGTTTCTACTTCCAAAAGATCAGCAAACCTGTTACCGTCAAGCTCTGCGACTCCATCTTTTATCCCGAAACCCCTCCCTCTCAGCCTCTTGCTCTCTCCGAGAGTTTCGGCGGAACAACATTGGTTGCTTCCAAAAACCTTCTGCAGAGCTTCTCTCTCGCTTCAGAGAGAAGCCTTTTGCAATTCATGGCTTCCGTTGATTCCCGACCTTCAACCTTGATTCCATTAGAAGACGCCGGCGAAGGAGAACAAATTGTAAGGAACGATTTCTACTTCCAGAAGATCGGCAAACCTGTCCCGGTCAAGCTCTGCGACTCCATTTTTGATCCCCAGACCCCTCCCTCTCAGCCTCTTGCTCTCTCCGAGAGTTTCGGTCTCATCTTCGTTGCGCATTTGTCTGGTTGGACCAAGGATGTAATTGCTTCGGCCGAGGAGATAAAAAACGGGGGAACTGGTTCTTCTGTCCAGGATTTAAGCATAGTGGATATTTCCATCGGAAAAGTTCACATTCTAACTCTTTCCACGGATGATTCCATTCTTGCTGCCATCGTAGCTGGTGATATTCATCTTTTTTCAGTCCAGTCCCTGCTTGATAAGGCAAAAACACCCTCTTCTTCTTGTTCATTAACTGATTCCAGTTTCATCAAAGACTTCAAATGGACCAGAAAGTTGGAAGATTCTTATCTGGTTCTTTCAAAGCATGGACAGTTATATCAAGGATCGGCGAATGGGCCTCCTACACATGTGATGCACGATATTGATGCTGTTGACTGTATCAAGTGGGTTCGTGCTGATTGTATCATCATAGGATGCTTTCAAGTGACTGCAACAGGTGATGAAGAAGATTACTTTGTCCTAGTTATCAGAAGTAAAGATGGAAAAATCACTGACGTTTCTTCAAACAAAGTTTTGTTATCATTCCGTGATATACATTCAGGTTTCACTCGTGACATTTTGCCTGGTGATATTGGGCCTTGTTTACTGTTGAGTTATTTGGATAAATGCAAGCTCGCAATTGTTGCAAATAGGCTCTATATGGAAGAGCATATTGTGTTGCTTGGTTTGTTGCAAGAGGTTGAGAACGAAGTTGCAGTTATTAATATTGATAGAAATACCTCTCTCCCGAAGATTGAGCTTCAAGCGAATGGTGATGATAATTTGGTAATGGGGCTGTGCATTGATCGAGTTTCTCTTCCTGGGAAGGTGTTAGTTAGAGTTGGATTTGAAGATATGAGAGAAGTTTCGCCATATTGCATTCTCGTGTGTCTTACTTTAGAGGGAGAGCTCATTATGTTTCAATTTTCTAGTGTCAATGAAACTGAAGCTCCACATGAGACTGTTTCTGCTTGTGATGAGGAGGAAGATGATATAATAGTGCCTGCTGATGATCGGTCTCAACTCTTTTCTGGATCAAAGAAAGAGTTTAGAGAAGATGATCTTAAGATGCAGGTTACGGAAAAACTTGCAATCAGTAGTGAGATTCCTCAGGAAAAAATTAAAATCTCAAATGACATTAAGTCTTCTAATAATGATCAAAGTCCAGTATCTAAAATAGATGAGAGTGCAACTGTTGGTGCAGAGAGTAATACTAAAAGCCAGAAAGCCGATTCTTTCATTTATTCACAATCATTAGAGTCTTCTGTCCTGGAGAGACCCAACTATGAGATTGGGAACTTTGATAAGCCTGTTCAAAAATTTGGTCTCGGGTCTGTTTCTATTTCAGGTAAGTCTGCGGACGTGCATAGCCAGCCCTTTCCCAATGTAAAAGAATCAACAAAAAGAATGGTGTCAACTGGCTTGTTGGCTGCATCTGAGTTATCCAGTGATAAAGCAATGTTTTTAAATAAAATCGATCCTATATCTTCAGTCCTAACTCCGAATTCTTTTCAAAGCAGCAAGACTGAGAATTATGGGCCAAGTTT
TGGTACAGCGAATGCTTTTGCAGGTTTTTCTGGAAAACCTTTTCAACCGAAGGATGTTCCAAGTACATTAACACAAAGTGGGAAACAAGTAACGGGAGGTGCTGGTAAAATTGAATCTTTACCAGTGTTACGTAGCTCACAAATATCATTGCAAGACAACTTGTCGGCGAAAATTTCTAATGAGAAACATGATGGTTCAGACCGAAATTACAGCAATGCCCCCCTGGCAAAACCAATGAAAGAAATGTGTGAAGGATTGGACATGCTTCTGGAGTCTATAGAAGAGCCGGGTGGGTTCTTGGATGCCTGCACTGCTTTCCAGAAAAGCTCTGTTGAAGCTTTGGAGCGTGGCTTAGCCAGTCTTTCAGACGAATGTCAAATATGGAAGAGCACAATGAATGAGCGTTCACAGGAGGTACAAAATCTCTTTGACAAAATGGTACAAGTTTTGTCAAAGAAGACGTACATTGAAGGTATTGTGATGCAAGCTTCTGACAGCAAGTACTGGGAACAATGGGATCGTCAAAAGTTGAGTTCAGAATTAGAGTTAAAGCGACAACACATCTTAAAGATGAATCAGAATATGACTAACCAGTTAATTGAGTTAGAAAGACATTTTAATGGCCTTGAATTGAATAAGTTTGGTGGAAATGAGGAAAGTCAAGCGAGTGAAAGAGCTCTTCAAAGGAAATTTGGTTATTCGAGGCATAGTCATTCATTACATAGTTTGAATAACATAATGGGATCTCAATTAGCAGCAGCTCAACTTCTTTCTGAAAGTCTATCAAAACAATTGGCTGCACTCAATATAGAATCACCCTCTTTAAAAAGGCAGAGAGTCACGAAGGAATTGTTTGAGACTATTGGACTTACTTATGATGCTTCTTTCGGTTCTCCAAATGTGAACAAAATTGCAGAAACTTCTAGCAAGAAGCTTTTACTTTCTGCTGATTCTTTTTCAAGTAAAGATACACTGAGAAGAAAACAGCGGAGTGGAAGGAAAAATTCTGAAGCAGAAACTGGGAGAAGGAGAAGAGATTCACTCGACAGGAACCTGGCTAGTGTTGAACCTCCAAAAACAACTGTTAAGAGGATGCTTTTGCAAGGAATACCCTCCTCTGATGAGAAACTATTTCGGTCTCGCACACCTGAAGGGGCAGCAACAGTTGCAGGGCCAGCTAGTCGATTAACATCATCTAAGTCATCATCATCATCCAAAAATGCAGCAACTCCTTTTATGTGGGCTAGCCCTTTACAACCATCAAATACCTCCCGTCAGAAATCTCAACCATTGCCAAAAACTAATACTACAGCGCCATCTCCACTGTCAGTATTCCAATCATCACATGAAATGCTGAAAAAAAGTAATAATGAAGCTTTCAATGTGACTTCAGAAAACAAATTTATTGAAAAGTCGAAAGCTTCTGATTTCTTCTCAGTCACTAGGAGCGACTCTGTCCAGAAATCTAATATAAACCTTGATCAGAAATCATCCATCTTTACGATATCATCTAAGCAGACGCCCACACTGAAAGATTCTATTAATACCTCTAATTTGGACAATCAGAAGACTGCTAACGCAAAAGAGAGGCATACAACTACAAGTCCACTTTTTGGATCTGCAAATAAACCTGAATCTGCATCTCTTGGTACAATGTCTTCTCTGGTTCCTACTGTTAATGAAGCAAGAAAGACTGAAGAAAAAAGATCGCTGACAACGATTTCACCATCAGTTCCAGCATCAGCACAGTTAAATACTCCAAGTTCATCAACTTTATTTTTAGGATTTGCTGTAAGCAAACCTCTTCCAAGTTCTGCTGCTGTTATAGATCTCAATCAACCTGTGTCAACATCAACCCAATTGAACTTCTCCACCCCAGTTGTTTCTGTTTCTGATTCCCTATTTCAGGCACCTAAGATGATATCAACATCATCTACTCTGTCTTTGAATCCTTCATTGGAGTCCTCGAAAAAAGAGTTACCTGTTTCAAAATCAGATG
ATGATACTGAAAAGCAAACACCAGCTTCAAAGCCTGAGTCTTATGAACTGAAATTTCAACCTTCTGTAACACCTGATAAAAAGCATGTAGAGCCAACTTCTAAAACCCACACAGTTTCCAAAGATGTTGGAGGACAGGTTCCAAATGTAATAGGGGATGCTCAACCACAACAGCCATCTGTTGCTTTTGCTCCATTACCTTCACCAAACTTAACTCCTAAGATTTTTGGTAATGTAAGAAATGAAACTTCAAACGTGACGGCTACTCAGGATGATGATATGGACGAAGAGGCTCCAGAGACGAATAACAACATCGAGTTTAGTTTGAGCAGCTTGGGAGGATTTGGAAATAGCTCCACCCCTATGTCAGGTGCTCCTAAACCAAATCCATTTGGTGGTCCATTCGGTAATGTGAATGCAACCTCAATGACCTCTTCCTTTACTATGGCATCTCCTCCAAGTGGAGAGCTGTTTCGGCCTGCATCGTTTAGCTTCCAATCTCCATTGGCTTCACAAGCAGCATCACAACCCACAAATTCAGTTGCATTCTCTGGTGGCTTTGGCTCTGCAATGGCTACTCAAGCCCCGTCGCAAGGTGGGTTCGGTCAGCCTGCTCAGATTGGAGTAGGGCAGCAAGCTCTTGGTAATGTTCTTGGTTCATTCGGACAATCAAGACAGCTTGGTCCTAGTCTCCCTGGAACTGGTTCAGGATCCCCTGGCGGTTTTGGTGGTGGGTTTACCAGTATGAAACCGGTTGGTGGGTTTGCCAGTGTTGGTTCAAGTGGTGGTGGTAGTGGTGGGTTCGCTGGTGTTGGTTCAGGGGGTGGTGGTGGGTTTGGTGGTGTTGGTTCGAATGGTGGTGGTTTCGCTGGCACAGTCCCAACCGGTGGTGGATTTGCTGGTGCTTCCTCTACAACGGGAGGTTTTGCTGGTGCTGCAGGCGGGGGTTTTGCAGGTGCCGCAGGTGGATTTGGGGCTTTCGGCAGCCAGCAAGGAAGCGGGGGTTTCTCTGCTTTTTGTGTTGCTGCTGGTGGAGCTGGAGGAACTGGAAAACCTCCTGAGCTTTTCACCCAGATGAGAAAGTAG

Coding sequence (CDS)

ATGGCTTCCGTTGATTCCCGACCTTCAACCTTGATTCCATTAGAAGACGCCGGCGAAGGAGAACAAATTGTAAGGAACGGTTTCTACTTCCAAAAGATCAGCAAACCTGTTACCGTCAAGCTCTGCGACTCCATCTTTTATCCCGAAACCCCTCCCTCTCAGCCTCTTGCTCTCTCCGAGAGTTTCGGCGGAACAACATTGGTTGCTTCCAAAAACCTTCTGCAGAGCTTCTCTCTCGCTTCAGAGAGAAGCCTTTTGCAATTCATGGCTTCCGTTGATTCCCGACCTTCAACCTTGATTCCATTAGAAGACGCCGGCGAAGGAGAACAAATTGTAAGGAACGATTTCTACTTCCAGAAGATCGGCAAACCTGTCCCGGTCAAGCTCTGCGACTCCATTTTTGATCCCCAGACCCCTCCCTCTCAGCCTCTTGCTCTCTCCGAGAGTTTCGGTCTCATCTTCGTTGCGCATTTGTCTGGTTGGACCAAGGATGTAATTGCTTCGGCCGAGGAGATAAAAAACGGGGGAACTGGTTCTTCTGTCCAGGATTTAAGCATAGTGGATATTTCCATCGGAAAAGTTCACATTCTAACTCTTTCCACGGATGATTCCATTCTTGCTGCCATCGTAGCTGGTGATATTCATCTTTTTTCAGTCCAGTCCCTGCTTGATAAGGCAAAAACACCCTCTTCTTCTTGTTCATTAACTGATTCCAGTTTCATCAAAGACTTCAAATGGACCAGAAAGTTGGAAGATTCTTATCTGGTTCTTTCAAAGCATGGACAGTTATATCAAGGATCGGCGAATGGGCCTCCTACACATGTGATGCACGATATTGATGCTGTTGACTGTATCAAGTGGGTTCGTGCTGATTGTATCATCATAGGATGCTTTCAAGTGACTGCAACAGGTGATGAAGAAGATTACTTTGTCCTAGTTATCAGAAGTAAAGATGGAAAAATCACTGACGTTTCTTCAAACAAAGTTTTGTTATCATTCCGTGATATACATTCAGGTTTCACTCGTGACATTTTGCCTGGTGATATTGGGCCTTGTTTACTGTTGAGTTATTTGGATAAATGCAAGCTCGCAATTGTTGCAAATAGGCTCTATATGGAAGAGCATATTGTGTTGCTTGGTTTGTTGCAAGAGGTTGAGAACGAAGTTGCAGTTATTAATATTGATAGAAATACCTCTCTCCCGAAGATTGAGCTTCAAGCGAATGGTGATGATAATTTGGTAATGGGGCTGTGCATTGATCGAGTTTCTCTTCCTGGGAAGGTGTTAGTTAGAGTTGGATTTGAAGATATGAGAGAAGTTTCGCCATATTGCATTCTCGTGTGTCTTACTTTAGAGGGAGAGCTCATTATGTTTCAATTTTCTAGTGTCAATGAAACTGAAGCTCCACATGAGACTGTTTCTGCTTGTGATGAGGAGGAAGATGATATAATAGTGCCTGCTGATGATCGGTCTCAACTCTTTTCTGGATCAAAGAAAGAGTTTAGAGAAGATGATCTTAAGATGCAGGTTACGGAAAAACTTGCAATCAGTAGTGAGATTCCTCAGGAAAAAATTAAAATCTCAAATGACATTAAGTCTTCTAATAATGATCAAAGTCCAGTATCTAAAATAGATGAGAGTGCAACTGTTGGTGCAGAGAGTAATACTAAAAGCCAGAAAGCCGATTCTTTCATTTATTCACAATCATTAGAGTCTTCTGTCCTGGAGAGACCCAACTATGAGATTGGGAACTTTGATAAGCCTGTTCAAAAATTTGGTCTCGGGTCTGTTTCTATTTCAGGTAAGTCTGCGGACGTGCATAGCCAGCCCTTTCCCAATGTAAAAGAATCAACAAAAAGAATGGTGTCAACTGGCTTGTTGGCTGCATCTGAGTTATCCAGTGATAAAGCAATGTTTTTAAATAAAATCGATCCTATATCTTCAGTCCTAACTCCGAATTCTTTTCAAAGCAGCAAGACTGAGAATTATGGGCCAAGTTT
TGGTACAGCGAATGCTTTTGCAGGTTTTTCTGGAAAACCTTTTCAACCGAAGGATGTTCCAAGTACATTAACACAAAGTGGGAAACAAGTAACGGGAGGTGCTGGTAAAATTGAATCTTTACCAGTGTTACGTAGCTCACAAATATCATTGCAAGACAACTTGTCGGCGAAAATTTCTAATGAGAAACATGATGGTTCAGACCGAAATTACAGCAATGCCCCCCTGGCAAAACCAATGAAAGAAATGTGTGAAGGATTGGACATGCTTCTGGAGTCTATAGAAGAGCCGGGTGGGTTCTTGGATGCCTGCACTGCTTTCCAGAAAAGCTCTGTTGAAGCTTTGGAGCGTGGCTTAGCCAGTCTTTCAGACGAATGTCAAATATGGAAGAGCACAATGAATGAGCGTTCACAGGAGGTACAAAATCTCTTTGACAAAATGGTACAAGTTTTGTCAAAGAAGACGTACATTGAAGGTATTGTGATGCAAGCTTCTGACAGCAAGTACTGGGAACAATGGGATCGTCAAAAGTTGAGTTCAGAATTAGAGTTAAAGCGACAACACATCTTAAAGATGAATCAGAATATGACTAACCAGTTAATTGAGTTAGAAAGACATTTTAATGGCCTTGAATTGAATAAGTTTGGTGGAAATGAGGAAAGTCAAGCGAGTGAAAGAGCTCTTCAAAGGAAATTTGGTTATTCGAGGCATAGTCATTCATTACATAGTTTGAATAACATAATGGGATCTCAATTAGCAGCAGCTCAACTTCTTTCTGAAAGTCTATCAAAACAATTGGCTGCACTCAATATAGAATCACCCTCTTTAAAAAGGCAGAGAGTCACGAAGGAATTGTTTGAGACTATTGGACTTACTTATGATGCTTCTTTCGGTTCTCCAAATGTGAACAAAATTGCAGAAACTTCTAGCAAGAAGCTTTTACTTTCTGCTGATTCTTTTTCAAGTAAAGATACACTGAGAAGAAAACAGCGGAGTGGAAGGAAAAATTCTGAAGCAGAAACTGGGAGAAGGAGAAGAGATTCACTCGACAGGAACCTGGCTAGTGTTGAACCTCCAAAAACAACTGTTAAGAGGATGCTTTTGCAAGGAATACCCTCCTCTGATGAGAAACTATTTCGGTCTCGCACACCTGAAGGGGCAGCAACAGTTGCAGGGCCAGCTAGTCGATTAACATCATCTAAGTCATCATCATCATCCAAAAATGCAGCAACTCCTTTTATGTGGGCTAGCCCTTTACAACCATCAAATACCTCCCGTCAGAAATCTCAACCATTGCCAAAAACTAATACTACAGCGCCATCTCCACTGTCAGTATTCCAATCATCACATGAAATGCTGAAAAAAAGTAATAATGAAGCTTTCAATGTGACTTCAGAAAACAAATTTATTGAAAAGTCGAAAGCTTCTGATTTCTTCTCAGTCACTAGGAGCGACTCTGTCCAGAAATCTAATATAAACCTTGATCAGAAATCATCCATCTTTACGATATCATCTAAGCAGACGCCCACACTGAAAGATTCTATTAATACCTCTAATTTGGACAATCAGAAGACTGCTAACGCAAAAGAGAGGCATACAACTACAAGTCCACTTTTTGGATCTGCAAATAAACCTGAATCTGCATCTCTTGGTACAATGTCTTCTCTGGTTCCTACTGTTAATGAAGCAAGAAAGACTGAAGAAAAAAGATCGCTGACAACGATTTCACCATCAGTTCCAGCATCAGCACAGTTAAATACTCCAAGTTCATCAACTTTATTTTTAGGATTTGCTGTAAGCAAACCTCTTCCAAGTTCTGCTGCTGTTATAGATCTCAATCAACCTGTGTCAACATCAACCCAATTGAACTTCTCCACCCCAGTTGTTTCTGTTTCTGATTCCCTATTTCAGGCACCTAAGATGATATCAACATCATCTACTCTGTCTTTGAATCCTTCATTGGAGTCCTCGAAAAAAGAGTTACCTGTTTCAAAATCAGATG
ATGATACTGAAAAGCAAACACCAGCTTCAAAGCCTGAGTCTTATGAACTGAAATTTCAACCTTCTGTAACACCTGATAAAAAGCATGTAGAGCCAACTTCTAAAACCCACACAGTTTCCAAAGATGTTGGAGGACAGGTTCCAAATGTAATAGGGGATGCTCAACCACAACAGCCATCTGTTGCTTTTGCTCCATTACCTTCACCAAACTTAACTCCTAAGATTTTTGGTAATGTAAGAAATGAAACTTCAAACGTGACGGCTACTCAGGATGATGATATGGACGAAGAGGCTCCAGAGACGAATAACAACATCGAGTTTAGTTTGAGCAGCTTGGGAGGATTTGGAAATAGCTCCACCCCTATGTCAGGTGCTCCTAAACCAAATCCATTTGGTGGTCCATTCGGTAATGTGAATGCAACCTCAATGACCTCTTCCTTTACTATGGCATCTCCTCCAAGTGGAGAGCTGTTTCGGCCTGCATCGTTTAGCTTCCAATCTCCATTGGCTTCACAAGCAGCATCACAACCCACAAATTCAGTTGCATTCTCTGGTGGCTTTGGCTCTGCAATGGCTACTCAAGCCCCGTCGCAAGGTGGGTTCGGTCAGCCTGCTCAGATTGGAGTAGGGCAGCAAGCTCTTGGTAATGTTCTTGGTTCATTCGGACAATCAAGACAGCTTGGTCCTAGTCTCCCTGGAACTGGTTCAGGATCCCCTGGCGGTTTTGGTGGTGGGTTTACCAGTATGAAACCGGTTGGTGGGTTTGCCAGTGTTGGTTCAAGTGGTGGTGGTAGTGGTGGGTTCGCTGGTGTTGGTTCAGGGGGTGGTGGTGGGTTTGGTGGTGTTGGTTCGAATGGTGGTGGTTTCGCTGGCACAGTCCCAACCGGTGGTGGATTTGCTGGTGCTTCCTCTACAACGGGAGGTTTTGCTGGTGCTGCAGGCGGGGGTTTTGCAGGTGCCGCAGGTGGATTTGGGGCTTTCGGCAGCCAGCAAGGAAGCGGGGGTTTCTCTGCTTTTTGTGTTGCTGCTGGTGGAGCTGGAGGAACTGGAAAACCTCCTGAGCTTTTCACCCAGATGAGAAAGTAG

Protein sequence

MASVDSRPSTLIPLEDAGEGEQIVRNGFYFQKISKPVTVKLCDSIFYPETPPSQPLALSESFGGTTLVASKNLLQSFSLASERSLLQFMASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSESFGLIFVAHLSGWTKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDSILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLYQGSANGPPTHVMHDIDAVDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRDIHSGFTRDILPGDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINIDRNTSLPKIELQANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLEGELIMFQFSSVNETEAPHETVSACDEEEDDIIVPADDRSQLFSGSKKEFREDDLKMQVTEKLAISSEIPQEKIKISNDIKSSNNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLESSVLERPNYEIGNFDKPVQKFGLGSVSISGKSADVHSQPFPNVKESTKRMVSTGLLAASELSSDKAMFLNKIDPISSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLTQSGKQVTGGAGKIESLPVLRSSQISLQDNLSAKISNEKHDGSDRNYSNAPLAKPMKEMCEGLDMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERSQEVQNLFDKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELERHFNGLELNKFGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSKQLAALNIESPSLKRQRVTKELFETIGLTYDASFGSPNVNKIAETSSKKLLLSADSFSSKDTLRRKQRSGRKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSRTPEGAATVAGPASRLTSSKSSSSSKNAATPFMWASPLQPSNTSRQKSQPLPKTNTTAPSPLSVFQSSHEMLKKSNNEAFNVTSENKFIEKSKASDFFSVTRSDSVQKSNINLDQKSSIFTISSKQTPTLKDSINTSNLDNQKTANAKERHTTTSPLFGSANKPESASLGTMSSLVPTVNEARKTEEKRSLTTISPSVPASAQLNTPSSSTLFLGFAVSKPLPSSAAVIDLNQPVSTSTQLNFSTPVVSVSDSLFQAPKMISTSSTLSLNPSLESSKKELPVSKSDDDTEKQTPASKPESYELKFQPSVTPDKKHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSVAFAPLPSPNLTPKIFGNVRNETSNVTATQDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPMSGAPKPNPFGGPFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGGFGSAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSPGGFGGGFTSMKPVGGFASVGSSGGGSGGFAGVGSGGGGGFGGVGSNGGGFAGTVPTGGGFAGASSTTGGFAGAAGGGFAGAAGGFGAFGSQQGSGGFSAFCVAAGGAGGTGKPPELFTQMRK
Homology
BLAST of CcUC11G220380 vs. NCBI nr
Match: XP_038892124.1 (nuclear pore complex protein NUP214 isoform X2 [Benincasa hispida])

HSP 1 Score: 2620.5 bits (6791), Expect = 0.0e+00
Identity = 1466/1683 (87.11%), Positives = 1520/1683 (90.31%), Query Frame = 0

Query: 89   MASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSE 148
            MASVDSRPSTLIPLEDAGEGEQ+VRNDFYFQKIG+PVPVKL DSIFDP+TPPSQPLALSE
Sbjct: 1    MASVDSRPSTLIPLEDAGEGEQVVRNDFYFQKIGRPVPVKLGDSIFDPETPPSQPLALSE 60

Query: 149  SFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDS 208
            S GLIFVAHLSG+    TKDVIASAEEIKNGGTGSSVQDLSIVD+SIGKVHIL LSTDDS
Sbjct: 61   SSGLIFVAHLSGFFVVRTKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILALSTDDS 120

Query: 209  ILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLY 268
            ILAA+VA DIHLFSVQSLLDKA+TPSSSCS+TDSS IKDFKWTRKLEDSYLVLSKHGQLY
Sbjct: 121  ILAAVVARDIHLFSVQSLLDKAETPSSSCSITDSSCIKDFKWTRKLEDSYLVLSKHGQLY 180

Query: 269  QGSANGPPTHVMHDIDA------------------------------------------- 328
            QGSANG  THVMHDIDA                                           
Sbjct: 181  QGSANGSLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSSGNGETD 240

Query: 329  ------VDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 388
                  VDCIKWVRADCII+GCFQ+TATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD
Sbjct: 241  TDFTVKVDCIKWVRADCIIMGCFQMTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 300

Query: 389  IHSGFTRDILPGDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINID 448
            IHSGFTRDILPGD GPCLLLSYLDKCKLAIVANRLYME+HIVLLGLLQEVENEVAVINID
Sbjct: 301  IHSGFTRDILPGDSGPCLLLSYLDKCKLAIVANRLYMEDHIVLLGLLQEVENEVAVINID 360

Query: 449  RNTSLPKIELQANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLEGEL 508
            RNTSLPKIELQANGDDNLVMGLCIDRVSLPGKV+VRVGFEDMREVSPYCILVCLTLEG+L
Sbjct: 361  RNTSLPKIELQANGDDNLVMGLCIDRVSLPGKVIVRVGFEDMREVSPYCILVCLTLEGDL 420

Query: 509  IMFQFSSVNETEAPHETVSACDEEEDDIIVPADDRSQLFSGSKKEFRE--DDLKMQVTEK 568
            IMFQFSSVNETEAPHETV ACDEEEDDIIVPADDRSQL S SKKEFRE  +DLKMQV EK
Sbjct: 421  IMFQFSSVNETEAPHETVPACDEEEDDIIVPADDRSQLSSESKKEFREANNDLKMQVMEK 480

Query: 569  LAISSEIPQEKIKISNDIKSSNNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLESS 628
            +AISSEIP EKIKISNDIKSSNNDQS VSKI ESATVGAESNTKS+KADSFIYSQSL+SS
Sbjct: 481  IAISSEIPGEKIKISNDIKSSNNDQSLVSKIGESATVGAESNTKSRKADSFIYSQSLKSS 540

Query: 629  VLERPNYEIGNFDKPVQKFGLGSVSISGKSADVHSQPFPNVKESTKRMVSTGLLAASELS 688
            VLER NYEIGNFDKPVQKFGLG VSISGKS DVHSQPFPNVKESTK++ STGLLAASELS
Sbjct: 541  VLERSNYEIGNFDKPVQKFGLGPVSISGKSVDVHSQPFPNVKESTKKLGSTGLLAASELS 600

Query: 689  SDKAMFLNKIDPISSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLTQS 748
            SDKA+FLNKIDP+SSVL PNSFQSSKTENY PSFGTAN FAGF+GKPFQPKDVPSTLTQS
Sbjct: 601  SDKAIFLNKIDPVSSVLIPNSFQSSKTENYVPSFGTANCFAGFAGKPFQPKDVPSTLTQS 660

Query: 749  GKQVTGGAGKIESLPVLRSSQISLQDNLSAKISNEKHDGSDRNYSNAPLAKPMKEMCEGL 808
            G+QV GGAGKIESLPV+RSSQISLQDNL  KISNEKHDGSDR+YSNAPLAKPMKEMCE L
Sbjct: 661  GRQVMGGAGKIESLPVIRSSQISLQDNLPGKISNEKHDGSDRSYSNAPLAKPMKEMCEAL 720

Query: 809  DMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERSQEVQNLFDKM 868
            DMLLESIEEPGGFLDACTAFQKSSVEALE GLASL DECQIWKSTMNER+QEVQNLFDKM
Sbjct: 721  DMLLESIEEPGGFLDACTAFQKSSVEALELGLASLLDECQIWKSTMNERAQEVQNLFDKM 780

Query: 869  VQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELERHF 928
            +QVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELERHF
Sbjct: 781  IQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELERHF 840

Query: 929  NGLELNKFGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSKQLA 988
            NGLELNKFGGNEESQ +ERALQRKFG SRHSHSLHSLNNIMGSQLAAAQLLSESLSKQLA
Sbjct: 841  NGLELNKFGGNEESQVNERALQRKFGSSRHSHSLHSLNNIMGSQLAAAQLLSESLSKQLA 900

Query: 989  ALNIESPSLKRQRVTKELFETIGLTYDASFGSPNVNKIAETSSKKLLLSADSFSSKDTLR 1048
            ALNIESPSLKRQ VTKELFETIGLTYDASF SPNVNKIAETSSKKLLLSADSFS KDT R
Sbjct: 901  ALNIESPSLKRQSVTKELFETIGLTYDASFSSPNVNKIAETSSKKLLLSADSFSHKDTSR 960

Query: 1049 RKQRSGRKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSRTPEGA 1108
            RKQ SG KNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLL GIPSSDEKLFRS TP+GA
Sbjct: 961  RKQWSGTKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLHGIPSSDEKLFRSHTPDGA 1020

Query: 1109 ATVAGPASRLTSSKSSSSSKNA-------ATPFMWASPLQPSNTSRQKSQPLPKTNTTAP 1168
            ATVA PASRLTSS SSSSSKNA       ATPFMWASPLQPSN SRQKSQPL KTN TAP
Sbjct: 1021 ATVAWPASRLTSSMSSSSSKNAGHDSENPATPFMWASPLQPSNISRQKSQPLQKTNATAP 1080

Query: 1169 SPLSVFQSSHEMLKKSNNEAFNVTSENKFIEKSKASDFFSVTRSDSVQKSNINLDQKSSI 1228
            S LSVFQSSHEMLKKSNNEAF+VTSENKF EKSKASDFFSVTR+DSVQKSN NLD+K SI
Sbjct: 1081 S-LSVFQSSHEMLKKSNNEAFSVTSENKFTEKSKASDFFSVTRTDSVQKSNTNLDKKPSI 1140

Query: 1229 FTISSKQTPTLKDSINTSNLDNQKTANAKERHTTTSPLFGSANKPESASLGTMSSLVPTV 1288
            FTISSKQ  T KD I+TSNLDNQKTAN+KERHTTTSPLFGSANKPESAS+GTMSSLVPTV
Sbjct: 1141 FTISSKQMATPKDFIDTSNLDNQKTANSKERHTTTSPLFGSANKPESASVGTMSSLVPTV 1200

Query: 1289 NEARKTEEKRSLTTISPSVPA--SAQLNTP-SSSTLFLGFAVSKPLPSSAAVIDLNQPVS 1348
            +EARK  EKRSL TISPSVPA   A+ N+P SSSTLF GFAVSKPLPSSAA IDLNQP+S
Sbjct: 1201 DEARK--EKRSLKTISPSVPAPTPARFNSPSSSSTLFSGFAVSKPLPSSAAAIDLNQPLS 1260

Query: 1349 TSTQLNFSTPVVSVSDSLFQAPKMISTSSTL-SLNPSLESSKKELPVSKSDDDTEKQTPA 1408
            TSTQLNFS+PVVSVSDSLFQA KM+STSSTL SLNP LESSKKELPVSKS+ DTEK+TPA
Sbjct: 1261 TSTQLNFSSPVVSVSDSLFQATKMVSTSSTLSSLNPILESSKKELPVSKSEGDTEKKTPA 1320

Query: 1409 SKPESYELKFQPSVTP-DKKHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSVAFAPLPS 1468
            SKPES+ELKFQPS+TP +K H+EPTSKT TV KDVGGQ+PNVIGDAQPQ PSVAFA LPS
Sbjct: 1321 SKPESHELKFQPSITPANKNHLEPTSKTQTVPKDVGGQIPNVIGDAQPQPPSVAFASLPS 1380

Query: 1469 PNLTPKIFGNVRNETSNVTATQDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPMSGAPKP 1528
            PNLT K  GN RNETSNVT TQDDDMDEEAPET NN+EF+LS LGGFGNSS+PMSGAPKP
Sbjct: 1381 PNLTSKTSGNGRNETSNVTVTQDDDMDEEAPETINNVEFNLSGLGGFGNSSSPMSGAPKP 1440

Query: 1529 NPFGGPFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGGFG 1588
            NPFGG FGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAAS PTNSVAFSGGFG
Sbjct: 1441 NPFGGSFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASLPTNSVAFSGGFG 1500

Query: 1589 SAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSPGGFGGGFTS 1648
            SAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGS G F GGFT 
Sbjct: 1501 SAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSLGSFSGGFTG 1560

Query: 1649 MKP--VGGFASVGSSGGGSGGFAGVGSGGGGGFGGVGSNGGGFAGTVPTGGGFAGASSTT 1695
            MKP  VGGFA VGSS GGSGGFAGVGSGGGGGFGGVGS  GGFAGT+ TGGGFAGAS+TT
Sbjct: 1561 MKPVAVGGFAGVGSS-GGSGGFAGVGSGGGGGFGGVGSAAGGFAGTISTGGGFAGASATT 1620

BLAST of CcUC11G220380 vs. NCBI nr
Match: XP_038892123.1 (nuclear pore complex protein NUP214 isoform X1 [Benincasa hispida])

HSP 1 Score: 2614.3 bits (6775), Expect = 0.0e+00
Identity = 1466/1688 (86.85%), Positives = 1520/1688 (90.05%), Query Frame = 0

Query: 89   MASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSE 148
            MASVDSRPSTLIPLEDAGEGEQ+VRNDFYFQKIG+PVPVKL DSIFDP+TPPSQPLALSE
Sbjct: 1    MASVDSRPSTLIPLEDAGEGEQVVRNDFYFQKIGRPVPVKLGDSIFDPETPPSQPLALSE 60

Query: 149  SFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDS 208
            S GLIFVAHLSG+    TKDVIASAEEIKNGGTGSSVQDLSIVD+SIGKVHIL LSTDDS
Sbjct: 61   SSGLIFVAHLSGFFVVRTKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILALSTDDS 120

Query: 209  ILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLY 268
            ILAA+VA DIHLFSVQSLLDKA+TPSSSCS+TDSS IKDFKWTRKLEDSYLVLSKHGQLY
Sbjct: 121  ILAAVVARDIHLFSVQSLLDKAETPSSSCSITDSSCIKDFKWTRKLEDSYLVLSKHGQLY 180

Query: 269  QGSANGPPTHVMHDIDA------------------------------------------- 328
            QGSANG  THVMHDIDA                                           
Sbjct: 181  QGSANGSLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSSGNGETD 240

Query: 329  ------VDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 388
                  VDCIKWVRADCII+GCFQ+TATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD
Sbjct: 241  TDFTVKVDCIKWVRADCIIMGCFQMTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 300

Query: 389  IHSGFTRDILPGDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINID 448
            IHSGFTRDILPGD GPCLLLSYLDKCKLAIVANRLYME+HIVLLGLLQEVENEVAVINID
Sbjct: 301  IHSGFTRDILPGDSGPCLLLSYLDKCKLAIVANRLYMEDHIVLLGLLQEVENEVAVINID 360

Query: 449  RNTSLPKIELQANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLEGEL 508
            RNTSLPKIELQANGDDNLVMGLCIDRVSLPGKV+VRVGFEDMREVSPYCILVCLTLEG+L
Sbjct: 361  RNTSLPKIELQANGDDNLVMGLCIDRVSLPGKVIVRVGFEDMREVSPYCILVCLTLEGDL 420

Query: 509  IMFQFSSVNETEAPHETVSACDEEEDDIIVPADDRSQLFSGSKKEFRE--DDLKMQVTEK 568
            IMFQFSSVNETEAPHETV ACDEEEDDIIVPADDRSQL S SKKEFRE  +DLKMQV EK
Sbjct: 421  IMFQFSSVNETEAPHETVPACDEEEDDIIVPADDRSQLSSESKKEFREANNDLKMQVMEK 480

Query: 569  LAISSEIPQEKIKISNDIKSSNNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLESS 628
            +AISSEIP EKIKISNDIKSSNNDQS VSKI ESATVGAESNTKS+KADSFIYSQSL+SS
Sbjct: 481  IAISSEIPGEKIKISNDIKSSNNDQSLVSKIGESATVGAESNTKSRKADSFIYSQSLKSS 540

Query: 629  VLERPNYEIGNFDKPVQKFGLGSVSISGKSADVHSQPFPNVKESTKRMVSTGLLAASELS 688
            VLER NYEIGNFDKPVQKFGLG VSISGKS DVHSQPFPNVKESTK++ STGLLAASELS
Sbjct: 541  VLERSNYEIGNFDKPVQKFGLGPVSISGKSVDVHSQPFPNVKESTKKLGSTGLLAASELS 600

Query: 689  SDKAMFLNKIDPISSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLTQS 748
            SDKA+FLNKIDP+SSVL PNSFQSSKTENY PSFGTAN FAGF+GKPFQPKDVPSTLTQS
Sbjct: 601  SDKAIFLNKIDPVSSVLIPNSFQSSKTENYVPSFGTANCFAGFAGKPFQPKDVPSTLTQS 660

Query: 749  GKQVTGGAGKIESLPVLRSSQISLQDNLSAKISNEKHDGSDRNYSNAPLAKPMKEMCEGL 808
            G+QV GGAGKIESLPV+RSSQISLQDNL  KISNEKHDGSDR+YSNAPLAKPMKEMCE L
Sbjct: 661  GRQVMGGAGKIESLPVIRSSQISLQDNLPGKISNEKHDGSDRSYSNAPLAKPMKEMCEAL 720

Query: 809  DMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERSQEVQNLFDKM 868
            DMLLESIEEPGGFLDACTAFQKSSVEALE GLASL DECQIWKSTMNER+QEVQNLFDKM
Sbjct: 721  DMLLESIEEPGGFLDACTAFQKSSVEALELGLASLLDECQIWKSTMNERAQEVQNLFDKM 780

Query: 869  VQ-----VLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIE 928
            +Q     VLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIE
Sbjct: 781  IQVYLVSVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIE 840

Query: 929  LERHFNGLELNKFGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESL 988
            LERHFNGLELNKFGGNEESQ +ERALQRKFG SRHSHSLHSLNNIMGSQLAAAQLLSESL
Sbjct: 841  LERHFNGLELNKFGGNEESQVNERALQRKFGSSRHSHSLHSLNNIMGSQLAAAQLLSESL 900

Query: 989  SKQLAALNIESPSLKRQRVTKELFETIGLTYDASFGSPNVNKIAETSSKKLLLSADSFSS 1048
            SKQLAALNIESPSLKRQ VTKELFETIGLTYDASF SPNVNKIAETSSKKLLLSADSFS 
Sbjct: 901  SKQLAALNIESPSLKRQSVTKELFETIGLTYDASFSSPNVNKIAETSSKKLLLSADSFSH 960

Query: 1049 KDTLRRKQRSGRKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSR 1108
            KDT RRKQ SG KNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLL GIPSSDEKLFRS 
Sbjct: 961  KDTSRRKQWSGTKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLHGIPSSDEKLFRSH 1020

Query: 1109 TPEGAATVAGPASRLTSSKSSSSSKNA-------ATPFMWASPLQPSNTSRQKSQPLPKT 1168
            TP+GAATVA PASRLTSS SSSSSKNA       ATPFMWASPLQPSN SRQKSQPL KT
Sbjct: 1021 TPDGAATVAWPASRLTSSMSSSSSKNAGHDSENPATPFMWASPLQPSNISRQKSQPLQKT 1080

Query: 1169 NTTAPSPLSVFQSSHEMLKKSNNEAFNVTSENKFIEKSKASDFFSVTRSDSVQKSNINLD 1228
            N TAPS LSVFQSSHEMLKKSNNEAF+VTSENKF EKSKASDFFSVTR+DSVQKSN NLD
Sbjct: 1081 NATAPS-LSVFQSSHEMLKKSNNEAFSVTSENKFTEKSKASDFFSVTRTDSVQKSNTNLD 1140

Query: 1229 QKSSIFTISSKQTPTLKDSINTSNLDNQKTANAKERHTTTSPLFGSANKPESASLGTMSS 1288
            +K SIFTISSKQ  T KD I+TSNLDNQKTAN+KERHTTTSPLFGSANKPESAS+GTMSS
Sbjct: 1141 KKPSIFTISSKQMATPKDFIDTSNLDNQKTANSKERHTTTSPLFGSANKPESASVGTMSS 1200

Query: 1289 LVPTVNEARKTEEKRSLTTISPSVPA--SAQLNTP-SSSTLFLGFAVSKPLPSSAAVIDL 1348
            LVPTV+EARK  EKRSL TISPSVPA   A+ N+P SSSTLF GFAVSKPLPSSAA IDL
Sbjct: 1201 LVPTVDEARK--EKRSLKTISPSVPAPTPARFNSPSSSSTLFSGFAVSKPLPSSAAAIDL 1260

Query: 1349 NQPVSTSTQLNFSTPVVSVSDSLFQAPKMISTSSTL-SLNPSLESSKKELPVSKSDDDTE 1408
            NQP+STSTQLNFS+PVVSVSDSLFQA KM+STSSTL SLNP LESSKKELPVSKS+ DTE
Sbjct: 1261 NQPLSTSTQLNFSSPVVSVSDSLFQATKMVSTSSTLSSLNPILESSKKELPVSKSEGDTE 1320

Query: 1409 KQTPASKPESYELKFQPSVTP-DKKHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSVAF 1468
            K+TPASKPES+ELKFQPS+TP +K H+EPTSKT TV KDVGGQ+PNVIGDAQPQ PSVAF
Sbjct: 1321 KKTPASKPESHELKFQPSITPANKNHLEPTSKTQTVPKDVGGQIPNVIGDAQPQPPSVAF 1380

Query: 1469 APLPSPNLTPKIFGNVRNETSNVTATQDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPMS 1528
            A LPSPNLT K  GN RNETSNVT TQDDDMDEEAPET NN+EF+LS LGGFGNSS+PMS
Sbjct: 1381 ASLPSPNLTSKTSGNGRNETSNVTVTQDDDMDEEAPETINNVEFNLSGLGGFGNSSSPMS 1440

Query: 1529 GAPKPNPFGGPFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAF 1588
            GAPKPNPFGG FGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAAS PTNSVAF
Sbjct: 1441 GAPKPNPFGGSFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASLPTNSVAF 1500

Query: 1589 SGGFGSAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSPGGFG 1648
            SGGFGSAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGS G F 
Sbjct: 1501 SGGFGSAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSLGSFS 1560

Query: 1649 GGFTSMKP--VGGFASVGSSGGGSGGFAGVGSGGGGGFGGVGSNGGGFAGTVPTGGGFAG 1695
            GGFT MKP  VGGFA VGSS GGSGGFAGVGSGGGGGFGGVGS  GGFAGT+ TGGGFAG
Sbjct: 1561 GGFTGMKPVAVGGFAGVGSS-GGSGGFAGVGSGGGGGFGGVGSAAGGFAGTISTGGGFAG 1620

BLAST of CcUC11G220380 vs. NCBI nr
Match: XP_031741375.1 (nuclear pore complex protein NUP214 isoform X2 [Cucumis sativus])

HSP 1 Score: 2410.2 bits (6245), Expect = 0.0e+00
Identity = 1358/1673 (81.17%), Positives = 1441/1673 (86.13%), Query Frame = 0

Query: 89   MASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSE 148
            MASVDS PS+LIPLEDAGEGEQIVRND YFQKIGKPVPVKL DSIFDP++PPSQPLALSE
Sbjct: 1    MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 60

Query: 149  SFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDS 208
            S GLIFVAHLSG+     KDVIASAEEIKNGGTGSSVQDLSIVD+SIGKVHIL +STD+S
Sbjct: 61   SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 120

Query: 209  ILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLY 268
            +LAA+VAGD+H+FSVQSLLDKA+ P SSCS+TDSSFIKDFKWTRKLE++YLVLSKHGQLY
Sbjct: 121  VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180

Query: 269  QGSANGPPTHVMHDIDA---------------------------------------VDCI 328
            QGSANGP THVMHDIDA                                       VDCI
Sbjct: 181  QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNVDCI 240

Query: 329  KWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRDIHSGFTRDIL 388
            KWVRADCIIIGCFQVTATGDEEDY V VIRSKDGKITDVSSNKVLLSF DIHSGFTRDIL
Sbjct: 241  KWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCDIHSGFTRDIL 300

Query: 389  PGDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINIDRNTSLPKIEL 448
            PG+ GPCLLLSYLD CKLAIVANRLY+E+HI LLGLL EVENEVAV+NIDRNTSLPKIEL
Sbjct: 301  PGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNIDRNTSLPKIEL 360

Query: 449  QANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLEGELIMFQFSSVNE 508
            QANGDDNLVMGLCIDRVSL GKV+V+VGFEDMREVSPYCILVCLTLEGELIMFQFSSVNE
Sbjct: 361  QANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGELIMFQFSSVNE 420

Query: 509  TEAPHETVSACDEEEDDIIVPADDRSQLFSGSKKEFRED--DLKMQVTEKLAISSEIPQE 568
            TEAPHETVSACD+EEDDI VP DDRS+      KE RE   D +MQVTEK+AISSEIP+E
Sbjct: 421  TEAPHETVSACDDEEDDITVPTDDRSE-----SKESREANIDHRMQVTEKIAISSEIPRE 480

Query: 569  KIKISNDIKSSNNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLESSVLER-PNYEI 628
            K K SNDIKSS NDQS V  IDESA V  E NTKSQK DSFIYSQSL+SS  ER P+YEI
Sbjct: 481  KGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPERPPHYEI 540

Query: 629  GNFDKPVQKF-GLGSVSISGKSADVHSQPFPNVKESTKRMVSTGLLAASELSSDKAMFLN 688
            GNFDKPV KF GLGS SISGKS DV SQPFPNVKESTKR+ STGL+AASELSS+KAM   
Sbjct: 541  GNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSEKAMSFK 600

Query: 689  KIDPISSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLTQSGKQVTGGA 748
            KIDP+ SV T NS QSS TENYGPSFGTANAF GF+GKPFQPKDVPSTLTQSG+Q TGGA
Sbjct: 601  KIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGRQATGGA 660

Query: 749  GKIESLPVLRSSQISLQDNLSA-KISNEKHDGSDRNYSNAPLAKPMKEMCEGLDMLLESI 808
            GKIESLPV+RSSQISLQD  S+ KISNEKHDGS+R YSN+PLAKPMKEMCEGLD LLESI
Sbjct: 661  GKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLDTLLESI 720

Query: 809  EEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERSQEVQNLFDKMVQVLSKK 868
            EE GGF+DACTAFQKSSVEALE GLASLSD CQIW+STMNERSQEVQNLFDKMVQVLSKK
Sbjct: 721  EESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQVLSKK 780

Query: 869  TYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELERHFNGLELNK 928
            TYIEGIVMQ+SDSKYWEQWDRQKLSSELELKRQHILKMNQN+TNQLIELERHFNGLELNK
Sbjct: 781  TYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNGLELNK 840

Query: 929  FGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSKQLAALNIESP 988
            FGGNEESQ SERALQRKFG SRHSHS+HSLNNIMGSQLA AQLLSESLSKQLAALN+ESP
Sbjct: 841  FGGNEESQVSERALQRKFGSSRHSHSVHSLNNIMGSQLATAQLLSESLSKQLAALNMESP 900

Query: 989  SLKRQRVTKELFETIGLTYDASFGSPNVNKIAETSSKKLLLSADSFSSKDTLRRKQRSGR 1048
            SLKRQ  TKELFE+IGLTYDASF SPNVNKIAETSSKKLLLS+DSFSSK T RRKQ+SG 
Sbjct: 901  SLKRQSATKELFESIGLTYDASFSSPNVNKIAETSSKKLLLSSDSFSSKGTSRRKQQSGT 960

Query: 1049 KNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSRTPEGAATVAGPA 1108
            KNSEAETGRRRRDSLDRNLASV+PPKTTVKRMLLQGIPSS+EK F SRTPEGAATVA PA
Sbjct: 961  KNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGIPSSEEKQFCSRTPEGAATVARPA 1020

Query: 1109 SRLTSSKSSSS------SKNAATPFMWASPLQPSNTSRQKSQPLPKTNTTAPSPLSVFQS 1168
            SR+TSS SSSS      S+N  TPFMW SPLQPSNTSRQKS PL K N T PSP  VFQS
Sbjct: 1021 SRITSSISSSSKNAGHDSENPETPFMWNSPLQPSNTSRQKSLPLQKINVTPPSPPPVFQS 1080

Query: 1169 SHEMLKKSNNEAFNVTSENKFI-----EKSKASDFFSVTRSDSVQKSNINLDQKSSIFTI 1228
            SH+MLKK NNEA +VTSENKF      EKSKASDFFS TRSDSVQKSNIN+DQKSSIFTI
Sbjct: 1081 SHDMLKKKNNEAHSVTSENKFTDVACPEKSKASDFFSATRSDSVQKSNINVDQKSSIFTI 1140

Query: 1229 SSKQTPTLKDSINTSNLDNQKTANAKERHTTTSPLFGSANKPESASLGTMSSLVPTVNEA 1288
            SSKQ PT  DSI TSN+DNQKTAN KERHTTTSP FGSANKPES  +G+M SLVPTV+ +
Sbjct: 1141 SSKQMPTPIDSIATSNVDNQKTANVKERHTTTSPFFGSANKPESPFVGSMPSLVPTVDGS 1200

Query: 1289 RKTEEKRSLTTISPSVPASAQLNTPSS-STLFLGFAVSKPLPSSAAVIDLNQPVSTSTQL 1348
            RKTEEK+S+TTIS SV A A LNT SS STLF GFAVSK LPSSAAVIDLNQP STSTQL
Sbjct: 1201 RKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKALPSSAAVIDLNQPPSTSTQL 1260

Query: 1349 NFSTPVVSVSDSLFQAPKMISTSSTL-SLNPSLESSKKELPVSKSDDDTEKQTPASKPES 1408
            NFS+PVVS S+SLFQAPK++ TS TL SLNP+LESSK EL V KS+DD E+Q  +SKP S
Sbjct: 1261 NFSSPVVSSSNSLFQAPKIVPTSPTLSSLNPTLESSKTELSVPKSNDDAEEQILSSKPGS 1320

Query: 1409 YELKFQPSVTP-DKKHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSVAFAPLPSPNLTP 1468
            +ELKFQPS+TP DK HVEPTSKT TV KDVGGQ  NV+G+AQPQQPSVAFA +PSPNLT 
Sbjct: 1321 HELKFQPSITPADKNHVEPTSKTQTVFKDVGGQDSNVVGNAQPQQPSVAFASIPSPNLTS 1380

Query: 1469 KIFGNVRNETSNVTATQDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPMSGAPKPNPFGG 1528
            KIF N RNETSN   TQDDDMDEEAPETNNN+EF+LSSLGGFGNSSTP+SG PKPNPFGG
Sbjct: 1381 KIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPISGGPKPNPFGG 1440

Query: 1529 PFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGGFGSAMAT 1588
            PFGNVNA SMTSSF MASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSG FGSA+ T
Sbjct: 1441 PFGNVNAASMTSSFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGAFGSAVPT 1500

Query: 1589 QAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSPGGFGGGFTSMKPVG 1648
            Q PSQGGFGQP+QIGVGQQALGNVLGSFGQSRQLGP++ GTGSGSPGGF GGFT+ KPV 
Sbjct: 1501 QPPSQGGFGQPSQIGVGQQALGNVLGSFGQSRQLGPTVHGTGSGSPGGFSGGFTNAKPV- 1560

Query: 1649 GFASVGSSGGGSGGFAGVGSGGGGGFGGVGSNGGGFAGTVPTGGGFAGASSTTGGFAGAA 1695
                      G GGFAGVGSGGGGGFGGV    GGFAG   TGGGFAGASST GGFAGAA
Sbjct: 1561 ----------GVGGFAGVGSGGGGGFGGV----GGFAGAASTGGGFAGASSTAGGFAGAA 1620

BLAST of CcUC11G220380 vs. NCBI nr
Match: XP_031741374.1 (nuclear pore complex protein NUP214 isoform X1 [Cucumis sativus] >KGN52214.2 hypothetical protein Csa_008316 [Cucumis sativus])

HSP 1 Score: 2406.3 bits (6235), Expect = 0.0e+00
Identity = 1358/1683 (80.69%), Positives = 1441/1683 (85.62%), Query Frame = 0

Query: 89   MASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSE 148
            MASVDS PS+LIPLEDAGEGEQIVRND YFQKIGKPVPVKL DSIFDP++PPSQPLALSE
Sbjct: 1    MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 60

Query: 149  SFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDS 208
            S GLIFVAHLSG+     KDVIASAEEIKNGGTGSSVQDLSIVD+SIGKVHIL +STD+S
Sbjct: 61   SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 120

Query: 209  ILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLY 268
            +LAA+VAGD+H+FSVQSLLDKA+ P SSCS+TDSSFIKDFKWTRKLE++YLVLSKHGQLY
Sbjct: 121  VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180

Query: 269  QGSANGPPTHVMHDIDA------------------------------------------- 328
            QGSANGP THVMHDIDA                                           
Sbjct: 181  QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD 240

Query: 329  ------VDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 388
                  VDCIKWVRADCIIIGCFQVTATGDEEDY V VIRSKDGKITDVSSNKVLLSF D
Sbjct: 241  TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 300

Query: 389  IHSGFTRDILPGDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINID 448
            IHSGFTRDILPG+ GPCLLLSYLD CKLAIVANRLY+E+HI LLGLL EVENEVAV+NID
Sbjct: 301  IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360

Query: 449  RNTSLPKIELQANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLEGEL 508
            RNTSLPKIELQANGDDNLVMGLCIDRVSL GKV+V+VGFEDMREVSPYCILVCLTLEGEL
Sbjct: 361  RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420

Query: 509  IMFQFSSVNETEAPHETVSACDEEEDDIIVPADDRSQLFSGSKKEFRED--DLKMQVTEK 568
            IMFQFSSVNETEAPHETVSACD+EEDDI VP DDRS+      KE RE   D +MQVTEK
Sbjct: 421  IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSE-----SKESREANIDHRMQVTEK 480

Query: 569  LAISSEIPQEKIKISNDIKSSNNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLESS 628
            +AISSEIP+EK K SNDIKSS NDQS V  IDESA V  E NTKSQK DSFIYSQSL+SS
Sbjct: 481  IAISSEIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSS 540

Query: 629  VLER-PNYEIGNFDKPVQKF-GLGSVSISGKSADVHSQPFPNVKESTKRMVSTGLLAASE 688
              ER P+YEIGNFDKPV KF GLGS SISGKS DV SQPFPNVKESTKR+ STGL+AASE
Sbjct: 541  APERPPHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASE 600

Query: 689  LSSDKAMFLNKIDPISSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLT 748
            LSS+KAM   KIDP+ SV T NS QSS TENYGPSFGTANAF GF+GKPFQPKDVPSTLT
Sbjct: 601  LSSEKAMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLT 660

Query: 749  QSGKQVTGGAGKIESLPVLRSSQISLQDNLSA-KISNEKHDGSDRNYSNAPLAKPMKEMC 808
            QSG+Q TGGAGKIESLPV+RSSQISLQD  S+ KISNEKHDGS+R YSN+PLAKPMKEMC
Sbjct: 661  QSGRQATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMC 720

Query: 809  EGLDMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERSQEVQNLF 868
            EGLD LLESIEE GGF+DACTAFQKSSVEALE GLASLSD CQIW+STMNERSQEVQNLF
Sbjct: 721  EGLDTLLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLF 780

Query: 869  DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELE 928
            DKMVQVLSKKTYIEGIVMQ+SDSKYWEQWDRQKLSSELELKRQHILKMNQN+TNQLIELE
Sbjct: 781  DKMVQVLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELE 840

Query: 929  RHFNGLELNKFGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSK 988
            RHFNGLELNKFGGNEESQ SERALQRKFG SRHSHS+HSLNNIMGSQLA AQLLSESLSK
Sbjct: 841  RHFNGLELNKFGGNEESQVSERALQRKFGSSRHSHSVHSLNNIMGSQLATAQLLSESLSK 900

Query: 989  QLAALNIESPSLKRQRVTKELFETIGLTYDASFGSPNVNKIAETSSKKLLLSADSFSSKD 1048
            QLAALN+ESPSLKRQ  TKELFE+IGLTYDASF SPNVNKIAETSSKKLLLS+DSFSSK 
Sbjct: 901  QLAALNMESPSLKRQSATKELFESIGLTYDASFSSPNVNKIAETSSKKLLLSSDSFSSKG 960

Query: 1049 TLRRKQRSGRKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSRTP 1108
            T RRKQ+SG KNSEAETGRRRRDSLDRNLASV+PPKTTVKRMLLQGIPSS+EK F SRTP
Sbjct: 961  TSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGIPSSEEKQFCSRTP 1020

Query: 1109 EGAATVAGPASRLTSSKSSSS------SKNAATPFMWASPLQPSNTSRQKSQPLPKTNTT 1168
            EGAATVA PASR+TSS SSSS      S+N  TPFMW SPLQPSNTSRQKS PL K N T
Sbjct: 1021 EGAATVARPASRITSSISSSSKNAGHDSENPETPFMWNSPLQPSNTSRQKSLPLQKINVT 1080

Query: 1169 APSPLSVFQSSHEMLKKSNNEAFNVTSENKFI-----EKSKASDFFSVTRSDSVQKSNIN 1228
             PSP  VFQSSH+MLKK NNEA +VTSENKF      EKSKASDFFS TRSDSVQKSNIN
Sbjct: 1081 PPSPPPVFQSSHDMLKKKNNEAHSVTSENKFTDVACPEKSKASDFFSATRSDSVQKSNIN 1140

Query: 1229 LDQKSSIFTISSKQTPTLKDSINTSNLDNQKTANAKERHTTTSPLFGSANKPESASLGTM 1288
            +DQKSSIFTISSKQ PT  DSI TSN+DNQKTAN KERHTTTSP FGSANKPES  +G+M
Sbjct: 1141 VDQKSSIFTISSKQMPTPIDSIATSNVDNQKTANVKERHTTTSPFFGSANKPESPFVGSM 1200

Query: 1289 SSLVPTVNEARKTEEKRSLTTISPSVPASAQLNTPSS-STLFLGFAVSKPLPSSAAVIDL 1348
             SLVPTV+ +RKTEEK+S+TTIS SV A A LNT SS STLF GFAVSK LPSSAAVIDL
Sbjct: 1201 PSLVPTVDGSRKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKALPSSAAVIDL 1260

Query: 1349 NQPVSTSTQLNFSTPVVSVSDSLFQAPKMISTSSTL-SLNPSLESSKKELPVSKSDDDTE 1408
            NQP STSTQLNFS+PVVS S+SLFQAPK++ TS TL SLNP+LESSK EL V KS+DD E
Sbjct: 1261 NQPPSTSTQLNFSSPVVSSSNSLFQAPKIVPTSPTLSSLNPTLESSKTELSVPKSNDDAE 1320

Query: 1409 KQTPASKPESYELKFQPSVTP-DKKHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSVAF 1468
            +Q  +SKP S+ELKFQPS+TP DK HVEPTSKT TV KDVGGQ  NV+G+AQPQQPSVAF
Sbjct: 1321 EQILSSKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQDSNVVGNAQPQQPSVAF 1380

Query: 1469 APLPSPNLTPKIFGNVRNETSNVTATQDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPMS 1528
            A +PSPNLT KIF N RNETSN   TQDDDMDEEAPETNNN+EF+LSSLGGFGNSSTP+S
Sbjct: 1381 ASIPSPNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPIS 1440

Query: 1529 GAPKPNPFGGPFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAF 1588
            G PKPNPFGGPFGNVNA SMTSSF MASPPSGELFRPASFSFQSPLASQAASQPTNSVAF
Sbjct: 1441 GGPKPNPFGGPFGNVNAASMTSSFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVAF 1500

Query: 1589 SGGFGSAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSPGGFG 1648
            SG FGSA+ TQ PSQGGFGQP+QIGVGQQALGNVLGSFGQSRQLGP++ GTGSGSPGGF 
Sbjct: 1501 SGAFGSAVPTQPPSQGGFGQPSQIGVGQQALGNVLGSFGQSRQLGPTVHGTGSGSPGGFS 1560

Query: 1649 GGFTSMKPVGGFASVGSSGGGSGGFAGVGSGGGGGFGGVGSNGGGFAGTVPTGGGFAGAS 1695
            GGFT+ KPV           G GGFAGVGSGGGGGFGGV    GGFAG   TGGGFAGAS
Sbjct: 1561 GGFTNAKPV-----------GVGGFAGVGSGGGGGFGGV----GGFAGAASTGGGFAGAS 1620
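
The identity and positive percentages in an HSP header are the matching counts divided by the alignment length. The sketch below recomputes them from the header of the XP_031741374.1 hit above. It is a minimal illustration, not part of the database's own tooling: the names `parse_hsp_stats` and `percent` are hypothetical, and the regular expression assumes the header layout used on this page (it also accepts the spelling "Postives", which appears in some exports).

```python
import re

# Matches the "Identity = a/b (x%), Positives = c/d (y%)" part of an HSP header.
# "Posi?tives" tolerates the "Postives" spelling seen in some exports.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one HSP header line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("line does not look like an HSP statistics header")
    identical, length, identity_pct, positives, pos_length, positives_pct = match.groups()
    return {
        "identical": int(identical),
        "positives": int(positives),
        "length": int(length),
        "positives_length": int(pos_length),
        "reported_identity_pct": float(identity_pct),
        "reported_positives_pct": float(positives_pct),
    }


def percent(count, length):
    """Percentage of aligned columns, rounded to two decimals as in the report."""
    return round(100.0 * count / length, 2)


if __name__ == "__main__":
    header = "Identity = 1358/1683 (80.69%), Positives = 1441/1683 (85.62%), Query Frame = 0"
    stats = parse_hsp_stats(header)
    print(percent(stats["identical"], stats["length"]))
    print(percent(stats["positives"], stats["length"]))
```

The alignment length (1683 here) counts every aligned column, including gap columns, which is why the same 1358 identical residues give a lower percentage in a hit whose alignment is longer.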

BLAST of CcUC11G220380 vs. NCBI nr
Match: KAA0034115.1 (nuclear pore complex protein NUP214 [Cucumis melo var. makuwa])

HSP 1 Score: 2399.4 bits (6217), Expect = 0.0e+00
Identity = 1362/1707 (79.79%), Positives = 1448/1707 (84.83%), Query Frame = 0

Query: 89   MASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSE 148
            MASVDS  S LIPLEDAGEGEQIVRNDFYFQKIGKPVPVKL DSIFDP++PPSQP+ALSE
Sbjct: 1    MASVDSGSSPLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLGDSIFDPESPPSQPIALSE 60

Query: 149  SFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDS 208
            S GLIFVAHLSG+     KDVIASA+EIKNGGT SSVQDLSIVD+SIGKVHIL +STD+S
Sbjct: 61   SSGLIFVAHLSGFFVVRIKDVIASAQEIKNGGTCSSVQDLSIVDVSIGKVHILAVSTDNS 120

Query: 209  ILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLY 268
            +LAA+VAGD+H+FSVQSLLDKA+ P SSCS+TDSSFIKDFKWTRKLE++YLVLSKHGQLY
Sbjct: 121  VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180

Query: 269  QGSANGPPTHVMHDIDA------------------------------------------- 328
            QGS NGP THVMHDIDA                                           
Sbjct: 181  QGSVNGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHRFKERLSMSLLPSLGNGETD 240

Query: 329  ------VDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 388
                  VDCIKWVRADCIIIGCFQVTATGDEEDY VLVI+SKDGKITDVSSNKVLLSF D
Sbjct: 241  TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVLVIKSKDGKITDVSSNKVLLSFCD 300

Query: 389  IHSGFTRDILPGDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINID 448
            IHSGFTRDILPG+ GPCLLLSYLD CKLAIVANRLY+E+HI LLGLL EVENEVAV+NID
Sbjct: 301  IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360

Query: 449  RNTSLPKIELQANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLEGEL 508
            RNTSLPKIELQANGDDNLVMGLC+DRVSLPGKV+V+VGFEDMREVSPYCILVCLTLEGEL
Sbjct: 361  RNTSLPKIELQANGDDNLVMGLCVDRVSLPGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420

Query: 509  IMFQFSSVNETEAPHETVSACDEEEDDIIVPADDRSQLFSGSKKEFRED--DLKMQVTEK 568
            IMFQFSSVNETEAPHETVSACD+EEDDI VP DDRS+    SKKE RE   DLKMQVTEK
Sbjct: 421  IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSE----SKKESREANVDLKMQVTEK 480

Query: 569  LAISSEIPQEKIKISNDIKSSNNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLESS 628
            + ISSEIP+EK+K SNDIKSSNND+SPVS IDESA V  E NTKSQK DSFI+SQSL+SS
Sbjct: 481  ITISSEIPREKVKTSNDIKSSNNDRSPVSNIDESAIVSPEGNTKSQKVDSFIHSQSLKSS 540

Query: 629  VLER-PNYEIGNFDKPVQKF-GLGSVSISGKSADVHSQPFPNVKESTKRMVSTGLLAASE 688
              ER PN EIGNFDKPV KF GLGSVSISGK  DV SQPFPNVKES KR+ STGL+AASE
Sbjct: 541  APERPPNNEIGNFDKPVLKFTGLGSVSISGKPEDVPSQPFPNVKESQKRLGSTGLVAASE 600

Query: 689  LSSDKAMFLNKIDPISSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLT 748
            LSS+K MF  KIDP+SSVLT NS QSS TENYGPSFGTANAF GF+GKPFQPKDVPSTLT
Sbjct: 601  LSSEKTMFFKKIDPVSSVLTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLT 660

Query: 749  QSGKQVTGGAGKIESLPVLRSSQISLQDNLSA-KISNEKHDGSDRNYSNAPLAKPMKEMC 808
            QSG+QVTGGAGKIESLPV+RSSQISLQD  S+ KISNEKHDGS+R YSN+PLAKPMKEMC
Sbjct: 661  QSGRQVTGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMC 720

Query: 809  EGLDMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERSQEVQNLF 868
            EGLD LLESIEE GGF+DACTAFQKSSVEALE GLASLSDECQIW+STMNER QEVQNLF
Sbjct: 721  EGLDTLLESIEESGGFMDACTAFQKSSVEALELGLASLSDECQIWRSTMNERVQEVQNLF 780

Query: 869  DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMN----------- 928
            DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMN           
Sbjct: 781  DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQVGLFLFQKPL 840

Query: 929  --------------QNMTNQLIELERHFNGLELNKFGGNEESQASERALQRKFGYSRHSH 988
                          QN+TNQLIELERHFNGLELNKFGGNEESQ SERALQRKFG SRHSH
Sbjct: 841  NFSNFRCYLYSSFFQNITNQLIELERHFNGLELNKFGGNEESQVSERALQRKFGSSRHSH 900

Query: 989  SLHSLNNIMGSQLAAAQLLSESLSKQLAALNIESPSLKRQRVTKELFETIGLTYDASFGS 1048
            SLHSLNNIMGSQLA AQLLSESLSKQLAALN+ESP LKRQ  TKELFETIGLTYDASF S
Sbjct: 901  SLHSLNNIMGSQLATAQLLSESLSKQLAALNMESPPLKRQSATKELFETIGLTYDASFSS 960

Query: 1049 PNVNKIAETSSKKLLLSADSFSSKDTLRRKQRSGRKNSEAETGRRRRDSLDRNLASVEPP 1108
            PNVNKIA+TSSKKLLLS+DSFSSK T RRKQ+SG KNSEAETGRRRRDSLDRNLASV+PP
Sbjct: 961  PNVNKIADTSSKKLLLSSDSFSSKGTSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPP 1020

Query: 1109 KTTVKRMLLQGIPSSDEKLFRSRTPEGAATVAGPASRLTSSKSSSS------SKNAATPF 1168
            KTTVKRMLLQG PSS+EK FRSRTPEGAATV  PASR+TSS SSSS      S+N ATPF
Sbjct: 1021 KTTVKRMLLQGTPSSEEKQFRSRTPEGAATVERPASRITSSISSSSKNAGHDSENPATPF 1080

Query: 1169 MWASPLQPSNTSRQKSQPLPKTNTTAPSPLSVFQSSHEMLKKSNNEAFNVTSENKFI--- 1228
            MWAS LQPSNTSRQKS PL KTN TAPSP  VFQSSH+MLKK+NN A + TSENKF    
Sbjct: 1081 MWASVLQPSNTSRQKSLPLQKTNATAPSPPPVFQSSHDMLKKNNNAAHSATSENKFTDMA 1140

Query: 1229 --EKSKASDFFSVTRSDSVQKSNINLDQKSSIFTISSKQTPTLKDSINTSNLDNQKTANA 1288
              EKSKASDFFS TRSDSVQKS IN+DQKSSIFTISSKQTP  +DSI TSN+DNQKTAN 
Sbjct: 1141 CPEKSKASDFFSATRSDSVQKSKINVDQKSSIFTISSKQTPPPEDSIGTSNVDNQKTANV 1200

Query: 1289 KERHTTTSPLFGSANKPESASLGTMSSLVPTVNEARKTEEKRSLTTISPSVPASAQLNTP 1348
            KERHTTTS LFGSANKPES  +GTM SLVPTV+ ARKTEEK+S+TTIS SV A A LNT 
Sbjct: 1201 KERHTTTSQLFGSANKPESPFVGTMPSLVPTVDGARKTEEKKSVTTISQSVSAPAPLNTS 1260

Query: 1349 SS-STLFLGFAVSKPLPSS---AAVIDLNQPVSTSTQLNFSTPVVSVSDSLFQAPKMIST 1408
            SS STLF GFAVSK LPSS   AAV+DLNQP STSTQLNFS PVVS S+SLFQAPK+ ++
Sbjct: 1261 SSASTLFSGFAVSKSLPSSAAVAAVVDLNQPQSTSTQLNFS-PVVSGSNSLFQAPKVPTS 1320

Query: 1409 SSTLSLNPSLESSKKELPVSKSDDDTEKQTPASKPESYELKFQPSVTP-DKKHVEPTSKT 1468
             +  SLNP++ESSK EL V KS+DD EKQT +SKP S+ELKFQPS+TP DK HVEPTSKT
Sbjct: 1321 PTLSSLNPTMESSKTELSVLKSNDDAEKQTLSSKPGSHELKFQPSITPADKNHVEPTSKT 1380

Query: 1469 HTVSKDVGGQVPNVIGDAQPQQPSVAFAPLPSPNLTPKIFGNVRNETSNVTATQDDDMDE 1528
             TV KDVGGQVPNV+GDAQ QQPSVAFA +PS NLT KIF N RNETSN   TQDDDMDE
Sbjct: 1381 QTVFKDVGGQVPNVVGDAQAQQPSVAFASIPSQNLTSKIFANSRNETSNAVVTQDDDMDE 1440

Query: 1529 EAPETNNNIEFSLSSLGGFGNSSTPMSGAPKPNPFGGPFGNVNATSMTSSFTMASPPSGE 1588
            EAPETNNN+EF+LSSLGGFGNSSTP+SGAPKPNPFGGPFGNVNA S+T+SF MASPPSGE
Sbjct: 1441 EAPETNNNVEFNLSSLGGFGNSSTPISGAPKPNPFGGPFGNVNAASVTTSFNMASPPSGE 1500

Query: 1589 LFRPASFSFQSPLASQAASQPTNSVAFSGGFGSAMATQAPSQGGFGQPAQIGVGQQALGN 1648
            LFRPASFSFQSPLASQAASQPTNSVAFSG FGSA+ATQAP QGGFGQPAQIGVGQQALGN
Sbjct: 1501 LFRPASFSFQSPLASQAASQPTNSVAFSGAFGSAVATQAPPQGGFGQPAQIGVGQQALGN 1560

Query: 1649 VLGSFGQSRQLGPSLPGTGSGSPGGFGGGFTSMKPVGGFASVGSSGGGSGGFAGVGSGGG 1695
            VLGSFGQSRQLGP+LPGTGSGSPGGF GGFT+ KPV           G GGFAGVGSGGG
Sbjct: 1561 VLGSFGQSRQLGPTLPGTGSGSPGGFSGGFTNAKPV-----------GVGGFAGVGSGGG 1620

BLAST of CcUC11G220380 vs. ExPASy Swiss-Prot
Match: F4I1T7 (Nuclear pore complex protein NUP214 OS=Arabidopsis thaliana OX=3702 GN=NUP214 PE=1 SV=1)

HSP 1 Score: 788.5 bits (2035), Expect = 1.5e-226
Identity = 691/1852 (37.31%), Positives = 939/1852 (50.70%), Query Frame = 0

Query: 100  IPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSESFGLIFVAHLS 159
            + +E+  EG++I  ND+YF++IG+P+ +K  D+ +D + PPSQPLA+SE   ++FVAH S
Sbjct: 4    VEIEEDTEGDRISTNDYYFERIGEPISIKEDDAQYDLENPPSQPLAISERHAVLFVAHSS 63

Query: 160  GW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDSILAAIVAGDIH 219
            G+    T DVI++++     G    +QDLS+VD+ +G V IL+LS DDSILA  VA DIH
Sbjct: 64   GFFVGRTNDVISASKNSNGNGDKVFIQDLSLVDVPVGDVRILSLSADDSILAVTVAADIH 123

Query: 220  LFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLYQGSANGPPTHV 279
             FSV SLL K   PS S S  +S F+KDF+W R  + SYLVLS  G+L+ G  N PP HV
Sbjct: 124  FFSVDSLLKKDAKPSFSYSPDESGFVKDFRWRRNDKHSYLVLSNTGKLFHGIDNAPPRHV 183

Query: 280  MHDIDA-------------------------------------------------VDCIK 339
            M  +DA                                                 VD I+
Sbjct: 184  MDAVDAVEWSSKGSYIAVAQDNSLRIFSSKFNEKRCIALSFDSWIGDSDEDCFVKVDSIR 243

Query: 340  WVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRDIHSGFTRDILP 399
            WVR +CI++GCFQ+   G EE+Y V VIRS DGKI+D S+N V LSF D+      D++P
Sbjct: 244  WVRNNCILLGCFQL-IEGREENYLVQVIRSPDGKISDGSTNLVALSFSDLFPCSMDDLVP 303

Query: 400  GDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQ-EVENEVAVINIDRNTSLPKIEL 459
              +GP LL SY+D+CKLA+ ANR  ++EHIVLL     + ++ V+V++IDR T LP+I L
Sbjct: 304  VGVGPHLLFSYIDQCKLAVTANRKSIDEHIVLLDWSSGDDKSAVSVVDIDRETFLPRIGL 363

Query: 460  QANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLEGELIMFQFSSVNE 519
            Q N DDN VMGLCIDRVS+ G V VR G ++++E+ PY +LVCLTLEG+L+MF  +SV  
Sbjct: 364  QENNDDNTVMGLCIDRVSIEGTVNVRSGDDELKELQPYFVLVCLTLEGKLVMFNVASVAG 423

Query: 520  TEAPHETVSACDEEEDDIIVP--ADDRSQLFSGSKKEFR---EDDLKMQVTEKLAISSEI 579
              A  +T  A   + +D   P   DD S+  S   ++     ++D K   TEK +    +
Sbjct: 424  RPASSDTDLASSSDIEDAYTPLIEDDLSKQSSEKHQQLNIAVQNDQKHLNTEKFSTEQRL 483

Query: 580  PQEKI--KISNDIKSS-----NNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLESS 639
            P E I  K    +KSS     N  Q P ++         +S        SF     L  S
Sbjct: 484  PNENIFSKEFESVKSSVSGDNNKKQEPYAEKPLQVEDAQQSMIPRLSGTSF---GQLPMS 543

Query: 640  VLERPNYEIGNFDKPVQKFGLGSVSISGKSADVHSQPFPNVKESTKRMVSTGLLAA---- 699
            +    N +   F   +         I  +S  +H Q     K +     S GL  A    
Sbjct: 544  LGYDTN-KFAGFGPALPVSEKLQKDIFAQSNSMHLQANVESKSTAAFFGSPGLQNAILQS 603

Query: 700  -SELSSDKAMFLNKIDPISSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPS 759
                SS        + P   V  P  F S +   +  S     +  G+   P   KD   
Sbjct: 604  PQNTSSQPWSSGKSVSPPDFVSGP--FPSMRDTQHKQS---VQSGTGYVNPPMSIKDKSV 663

Query: 760  TLTQSGK---------------QVTGGAGKIESLPVLRSSQISLQ--DNLSAKISNEKHD 819
             + ++G+                   G  KIE +P +R+SQ+S Q   +     S+++H 
Sbjct: 664  QVIETGRVSALSNLSPLLGQNQDTNEGVEKIEPIPSIRASQLSQQVKSSFEKSASHQQHK 723

Query: 820  GS--------DRNYSNAPLAKPMKEMCEGLDMLLESIEEPGGFLDACTAFQKSSVEALER 879
                      + N SN P    + EM   +D LL+SIE PGGF D+C    KS+VE LE+
Sbjct: 724  TPLSTGPLRLEHNMSNQP--SNINEMAREMDTLLQSIEGPGGFKDSCAFILKSNVEELEQ 783

Query: 880  GLASLSDECQIWKSTMNERSQEVQNLFDKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQK 939
            GL SL+ +CQ WKST++E+  E+Q+L DK +QVL+KKTY+EG+  Q +D++YW+ W+RQK
Sbjct: 784  GLESLAGKCQTWKSTIHEQQAEIQHLLDKTIQVLAKKTYMEGMYKQTADNQYWQLWNRQK 843

Query: 940  LSSELELKRQHILKMNQNMTNQLIELERHFNGLELNKFGGNEESQASERALQRKFGYSRH 999
            L+ ELE KRQHI+K+N+++T+QLIELER+FN LEL+++  +     + R +  +   SR 
Sbjct: 844  LNPELEAKRQHIMKLNKDLTHQLIELERYFNRLELDRYNEDGGHPVARRGVPNRSAPSRR 903

Query: 1000 SHSLHSLNNIMGSQLAAAQLLSESLSKQLAALNIESPSLKRQRVTKELFETIGLTYDASF 1059
              SLHSL+N M SQLAAA+ LSE LSKQ+  L I+SP   ++ V +ELFETIG+ YDASF
Sbjct: 904  VQSLHSLHNTMSSQLAAAEQLSECLSKQMTYLKIDSP--VKKNVKQELFETIGIPYDASF 963

Query: 1060 GSPNVNKIAETSS-KKLLLSADSFSSKDTLRRKQRSGRKNSEAETGRRRRDSLDR---NL 1119
             SP+  K    SS K LLLS+   S     R++Q S  KNS+ ET RRRR+SLDR   N 
Sbjct: 964  SSPDAVKAKNASSAKNLLLSSIPASINQQSRQRQSSAMKNSDPETARRRRESLDRVIFNW 1023

Query: 1120 ASVEPPKTTVKRMLLQ---------------GIPSSDEKLFRS--RTPEGAATVAGPASR 1179
            A+ EPPKTTVKRMLLQ                + S++    RS     + A+ V      
Sbjct: 1024 AAFEPPKTTVKRMLLQEQQKTGMNQQTVLSERLRSANNTQDRSLLHVKDHASPVVSSNKG 1083

Query: 1180 LTSSKSSSSSKNAATPFMWASPLQPSN-----------------TSRQKSQPLPKTNTTA 1239
            +  S    +S+  +TPF    P+  SN                 +  + S        +A
Sbjct: 1084 IMESFQQDTSEAQSTPFKTRPPMPQSNSPFTISPISASKPSFNWSGNKSSNTTSYAEESA 1143

Query: 1240 PSPL----SVFQSS---------------HEMLKKSNNEAFNVTSENKFIE--------- 1299
            PS +    +V Q                  +  KK+    F+    N F+E         
Sbjct: 1144 PSQIKDTRTVSQPGGSSFLPKRPVASTVLEQTEKKAGEFKFSEAKANAFVETAAGSVQRL 1203

Query: 1300 --KSKASDFFS--------------VTRSDSVQKSNINLDQKSSI----FTISSKQTP-- 1359
               S  SDF S                 S    KS    +  SSI    FT  +   P  
Sbjct: 1204 STTSSGSDFESSKGFGAQFSTMSSGAPASSFSSKSLFGFNSSSSIPGDKFTFPAVTAPLS 1263

Query: 1360 -TLKDSINT-SNLDNQKTANAKERHTTTSPLFGSANKPESASLGTMSSLVPT-------- 1419
             T  DS +T     +   +++ +     S    SA  P++ S+ + S++  T        
Sbjct: 1264 GTPLDSTSTLFTASSAPVSSSSQDPVPASIPISSAPVPQTFSVTSTSTVSATGFNVPFGK 1323

Query: 1420 --------VNEARKTEEKRS----------LTTISPSVPASAQLNTPSSSTLFLGFA--- 1479
                    +N+A  +    S          L  +SPS P     +T  SS LF   A   
Sbjct: 1324 PLTSVKVDLNQAAPSTPSPSPGPTAGFTFNLPALSPSSPEMVSSSTGQSS-LFPPSAPTS 1383

Query: 1480 -VSKPLPSSAAVIDLNQPVSTSTQLNFSTPVVSVSDSLFQAPKMISTSSTLSLNPSLESS 1539
             VS    S+ + +  +  + +ST L+ STP ++  D+ FQ+P++ + SS + +   +   
Sbjct: 1384 QVSSDQASATSSLTDSSRLFSSTSLS-STPPITPPDA-FQSPQVSTPSSAVPITEPVSEP 1443

Query: 1540 KKELPVSKSDDDTEKQTP----ASKPESYELKFQPSVTPDKKHVEPTSKTHTVSKDVGGQ 1599
            KK    S S   T+        A+K ++  L  +  ++     V P S +  +S    G 
Sbjct: 1444 KKPEAQSSSILSTQSTVDSVANATKTQNEPLPVKSEISNPGTTVTPVSSSGFLSGFSSGT 1503

Query: 1600 VPNVI----------GDAQPQQPSVAFAPLPSPNLTPKIFGNVRNETSNVTATQDDDMDE 1659
              ++           G +QPQQ S   AP P+ + T     +   E  ++  TQ+D+MDE
Sbjct: 1504 QSSLASMAAPSFSWPGSSQPQQLSSTPAPFPASSPTS---ASPFGEKKDIVDTQEDEMDE 1563

Query: 1660 EAPETNNNIEFSLSSLGGFGNSSTPMSGAPKPNPFGGPFGNVNATSMTSSFTMASPPSGE 1695
            EAPE +   E S+ S GGFG  STP  GAPK NPFGGPFG  NAT+ TS+    + PSGE
Sbjct: 1564 EAPEASQTTELSMGSFGGFGLGSTPNPGAPKTNPFGGPFG--NATTTTSNPFNMTVPSGE 1623

BLAST of CcUC11G220380 vs. ExPASy TrEMBL
Match: A0A5A7SY34 (Nuclear pore complex protein NUP214 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold65G001060 PE=4 SV=1)

HSP 1 Score: 2399.4 bits (6217), Expect = 0.0e+00
Identity = 1362/1707 (79.79%), Positives = 1448/1707 (84.83%), Query Frame = 0

Query: 89   MASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSE 148
            MASVDS  S LIPLEDAGEGEQIVRNDFYFQKIGKPVPVKL DSIFDP++PPSQP+ALSE
Sbjct: 1    MASVDSGSSPLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLGDSIFDPESPPSQPIALSE 60

Query: 149  SFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDS 208
            S GLIFVAHLSG+     KDVIASA+EIKNGGT SSVQDLSIVD+SIGKVHIL +STD+S
Sbjct: 61   SSGLIFVAHLSGFFVVRIKDVIASAQEIKNGGTCSSVQDLSIVDVSIGKVHILAVSTDNS 120

Query: 209  ILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLY 268
            +LAA+VAGD+H+FSVQSLLDKA+ P SSCS+TDSSFIKDFKWTRKLE++YLVLSKHGQLY
Sbjct: 121  VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180

Query: 269  QGSANGPPTHVMHDIDA------------------------------------------- 328
            QGS NGP THVMHDIDA                                           
Sbjct: 181  QGSVNGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHRFKERLSMSLLPSLGNGETD 240

Query: 329  ------VDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 388
                  VDCIKWVRADCIIIGCFQVTATGDEEDY VLVI+SKDGKITDVSSNKVLLSF D
Sbjct: 241  TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVLVIKSKDGKITDVSSNKVLLSFCD 300

Query: 389  IHSGFTRDILPGDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINID 448
            IHSGFTRDILPG+ GPCLLLSYLD CKLAIVANRLY+E+HI LLGLL EVENEVAV+NID
Sbjct: 301  IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360

Query: 449  RNTSLPKIELQANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLEGEL 508
            RNTSLPKIELQANGDDNLVMGLC+DRVSLPGKV+V+VGFEDMREVSPYCILVCLTLEGEL
Sbjct: 361  RNTSLPKIELQANGDDNLVMGLCVDRVSLPGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420

Query: 509  IMFQFSSVNETEAPHETVSACDEEEDDIIVPADDRSQLFSGSKKEFRED--DLKMQVTEK 568
            IMFQFSSVNETEAPHETVSACD+EEDDI VP DDRS+    SKKE RE   DLKMQVTEK
Sbjct: 421  IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSE----SKKESREANVDLKMQVTEK 480

Query: 569  LAISSEIPQEKIKISNDIKSSNNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLESS 628
            + ISSEIP+EK+K SNDIKSSNND+SPVS IDESA V  E NTKSQK DSFI+SQSL+SS
Sbjct: 481  ITISSEIPREKVKTSNDIKSSNNDRSPVSNIDESAIVSPEGNTKSQKVDSFIHSQSLKSS 540

Query: 629  VLER-PNYEIGNFDKPVQKF-GLGSVSISGKSADVHSQPFPNVKESTKRMVSTGLLAASE 688
              ER PN EIGNFDKPV KF GLGSVSISGK  DV SQPFPNVKES KR+ STGL+AASE
Sbjct: 541  APERPPNNEIGNFDKPVLKFTGLGSVSISGKPEDVPSQPFPNVKESQKRLGSTGLVAASE 600

Query: 689  LSSDKAMFLNKIDPISSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLT 748
            LSS+K MF  KIDP+SSVLT NS QSS TENYGPSFGTANAF GF+GKPFQPKDVPSTLT
Sbjct: 601  LSSEKTMFFKKIDPVSSVLTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLT 660

Query: 749  QSGKQVTGGAGKIESLPVLRSSQISLQDNLSA-KISNEKHDGSDRNYSNAPLAKPMKEMC 808
            QSG+QVTGGAGKIESLPV+RSSQISLQD  S+ KISNEKHDGS+R YSN+PLAKPMKEMC
Sbjct: 661  QSGRQVTGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMC 720

Query: 809  EGLDMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERSQEVQNLF 868
            EGLD LLESIEE GGF+DACTAFQKSSVEALE GLASLSDECQIW+STMNER QEVQNLF
Sbjct: 721  EGLDTLLESIEESGGFMDACTAFQKSSVEALELGLASLSDECQIWRSTMNERVQEVQNLF 780

Query: 869  DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMN----------- 928
            DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMN           
Sbjct: 781  DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQVGLFLFQKPL 840

Query: 929  --------------QNMTNQLIELERHFNGLELNKFGGNEESQASERALQRKFGYSRHSH 988
                          QN+TNQLIELERHFNGLELNKFGGNEESQ SERALQRKFG SRHSH
Sbjct: 841  NFSNFRCYLYSSFFQNITNQLIELERHFNGLELNKFGGNEESQVSERALQRKFGSSRHSH 900

Query: 989  SLHSLNNIMGSQLAAAQLLSESLSKQLAALNIESPSLKRQRVTKELFETIGLTYDASFGS 1048
            SLHSLNNIMGSQLA AQLLSESLSKQLAALN+ESP LKRQ  TKELFETIGLTYDASF S
Sbjct: 901  SLHSLNNIMGSQLATAQLLSESLSKQLAALNMESPPLKRQSATKELFETIGLTYDASFSS 960

Query: 1049 PNVNKIAETSSKKLLLSADSFSSKDTLRRKQRSGRKNSEAETGRRRRDSLDRNLASVEPP 1108
            PNVNKIA+TSSKKLLLS+DSFSSK T RRKQ+SG KNSEAETGRRRRDSLDRNLASV+PP
Sbjct: 961  PNVNKIADTSSKKLLLSSDSFSSKGTSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPP 1020

Query: 1109 KTTVKRMLLQGIPSSDEKLFRSRTPEGAATVAGPASRLTSSKSSSS------SKNAATPF 1168
            KTTVKRMLLQG PSS+EK FRSRTPEGAATV  PASR+TSS SSSS      S+N ATPF
Sbjct: 1021 KTTVKRMLLQGTPSSEEKQFRSRTPEGAATVERPASRITSSISSSSKNAGHDSENPATPF 1080

Query: 1169 MWASPLQPSNTSRQKSQPLPKTNTTAPSPLSVFQSSHEMLKKSNNEAFNVTSENKFI--- 1228
            MWAS LQPSNTSRQKS PL KTN TAPSP  VFQSSH+MLKK+NN A + TSENKF    
Sbjct: 1081 MWASVLQPSNTSRQKSLPLQKTNATAPSPPPVFQSSHDMLKKNNNAAHSATSENKFTDMA 1140

Query: 1229 --EKSKASDFFSVTRSDSVQKSNINLDQKSSIFTISSKQTPTLKDSINTSNLDNQKTANA 1288
              EKSKASDFFS TRSDSVQKS IN+DQKSSIFTISSKQTP  +DSI TSN+DNQKTAN 
Sbjct: 1141 CPEKSKASDFFSATRSDSVQKSKINVDQKSSIFTISSKQTPPPEDSIGTSNVDNQKTANV 1200

Query: 1289 KERHTTTSPLFGSANKPESASLGTMSSLVPTVNEARKTEEKRSLTTISPSVPASAQLNTP 1348
            KERHTTTS LFGSANKPES  +GTM SLVPTV+ ARKTEEK+S+TTIS SV A A LNT 
Sbjct: 1201 KERHTTTSQLFGSANKPESPFVGTMPSLVPTVDGARKTEEKKSVTTISQSVSAPAPLNTS 1260

Query: 1349 SS-STLFLGFAVSKPLPSS---AAVIDLNQPVSTSTQLNFSTPVVSVSDSLFQAPKMIST 1408
            SS STLF GFAVSK LPSS   AAV+DLNQP STSTQLNFS PVVS S+SLFQAPK+ ++
Sbjct: 1261 SSASTLFSGFAVSKSLPSSAAVAAVVDLNQPQSTSTQLNFS-PVVSGSNSLFQAPKVPTS 1320

Query: 1409 SSTLSLNPSLESSKKELPVSKSDDDTEKQTPASKPESYELKFQPSVTP-DKKHVEPTSKT 1468
             +  SLNP++ESSK EL V KS+DD EKQT +SKP S+ELKFQPS+TP DK HVEPTSKT
Sbjct: 1321 PTLSSLNPTMESSKTELSVLKSNDDAEKQTLSSKPGSHELKFQPSITPADKNHVEPTSKT 1380

Query: 1469 HTVSKDVGGQVPNVIGDAQPQQPSVAFAPLPSPNLTPKIFGNVRNETSNVTATQDDDMDE 1528
             TV KDVGGQVPNV+GDAQ QQPSVAFA +PS NLT KIF N RNETSN   TQDDDMDE
Sbjct: 1381 QTVFKDVGGQVPNVVGDAQAQQPSVAFASIPSQNLTSKIFANSRNETSNAVVTQDDDMDE 1440

Query: 1529 EAPETNNNIEFSLSSLGGFGNSSTPMSGAPKPNPFGGPFGNVNATSMTSSFTMASPPSGE 1588
            EAPETNNN+EF+LSSLGGFGNSSTP+SGAPKPNPFGGPFGNVNA S+T+SF MASPPSGE
Sbjct: 1441 EAPETNNNVEFNLSSLGGFGNSSTPISGAPKPNPFGGPFGNVNAASVTTSFNMASPPSGE 1500

Query: 1589 LFRPASFSFQSPLASQAASQPTNSVAFSGGFGSAMATQAPSQGGFGQPAQIGVGQQALGN 1648
            LFRPASFSFQSPLASQAASQPTNSVAFSG FGSA+ATQAP QGGFGQPAQIGVGQQALGN
Sbjct: 1501 LFRPASFSFQSPLASQAASQPTNSVAFSGAFGSAVATQAPPQGGFGQPAQIGVGQQALGN 1560

Query: 1649 VLGSFGQSRQLGPSLPGTGSGSPGGFGGGFTSMKPVGGFASVGSSGGGSGGFAGVGSGGG 1695
            VLGSFGQSRQLGP+LPGTGSGSPGGF GGFT+ KPV           G GGFAGVGSGGG
Sbjct: 1561 VLGSFGQSRQLGPTLPGTGSGSPGGFSGGFTNAKPV-----------GVGGFAGVGSGGG 1620

BLAST of CcUC11G220380 vs. ExPASy TrEMBL
Match: A0A0A0KV45 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G583270 PE=4 SV=1)

HSP 1 Score: 2390.5 bits (6194), Expect = 0.0e+00
Identity = 1358/1724 (78.77%), Positives = 1441/1724 (83.58%), Query Frame = 0

Query: 89   MASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSE 148
            MASVDS PS+LIPLEDAGEGEQIVRND YFQKIGKPVPVKL DSIFDP++PPSQPLALSE
Sbjct: 1    MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 60

Query: 149  SFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDS 208
            S GLIFVAHLSG+     KDVIASAEEIKNGGTGSSVQDLSIVD+SIGKVHIL +STD+S
Sbjct: 61   SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 120

Query: 209  ILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLY 268
            +LAA+VAGD+H+FSVQSLLDKA+ P SSCS+TDSSFIKDFKWTRKLE++YLVLSKHGQLY
Sbjct: 121  VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180

Query: 269  QGSANGPPTHVMHDIDA------------------------------------------- 328
            QGSANGP THVMHDIDA                                           
Sbjct: 181  QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD 240

Query: 329  ------VDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 388
                  VDCIKWVRADCIIIGCFQVTATGDEEDY V VIRSKDGKITDVSSNKVLLSF D
Sbjct: 241  TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 300

Query: 389  IHSGFTRDILPGDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINID 448
            IHSGFTRDILPG+ GPCLLLSYLD CKLAIVANRLY+E+HI LLGLL EVENEVAV+NID
Sbjct: 301  IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360

Query: 449  RNTSLPKIELQANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLEGEL 508
            RNTSLPKIELQANGDDNLVMGLCIDRVSL GKV+V+VGFEDMREVSPYCILVCLTLEGEL
Sbjct: 361  RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420

Query: 509  IMFQFSSVNETEAPHETVSACDEEEDDIIVPADDRSQLFSGSKKEFRED--DLKMQVTEK 568
            IMFQFSSVNETEAPHETVSACD+EEDDI VP DDRS+      KE RE   D +MQVTEK
Sbjct: 421  IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSE-----SKESREANIDHRMQVTEK 480

Query: 569  LAISSEIPQEKIKISNDIKSSNNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLESS 628
            +AISSEIP+EK K SNDIKSS NDQS V  IDESA V  E NTKSQK DSFIYSQSL+SS
Sbjct: 481  IAISSEIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSS 540

Query: 629  VLER-PNYEIGNFDKPVQKF-GLGSVSISGKSADVHSQPFPNVKESTKRMVSTGLLAASE 688
              ER P+YEIGNFDKPV KF GLGS SISGKS DV SQPFPNVKESTKR+ STGL+AASE
Sbjct: 541  APERPPHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASE 600

Query: 689  LSSDKAMFLNKIDPISSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLT 748
            LSS+KAM   KIDP+ SV T NS QSS TENYGPSFGTANAF GF+GKPFQPKDVPSTLT
Sbjct: 601  LSSEKAMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLT 660

Query: 749  QSGKQVTGGAGKIESLPVLRSSQISLQDNLSA-KISNEKHDGSDRNYSNAPLAKPMKEMC 808
            QSG+Q TGGAGKIESLPV+RSSQISLQD  S+ KISNEKHDGS+R YSN+PLAKPMKEMC
Sbjct: 661  QSGRQATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMC 720

Query: 809  EGLDMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERSQEVQNLF 868
            EGLD LLESIEE GGF+DACTAFQKSSVEALE GLASLSD CQIW+STMNERSQEVQNLF
Sbjct: 721  EGLDTLLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLF 780

Query: 869  DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELE 928
            DKMVQVLSKKTYIEGIVMQ+SDSKYWEQWDRQKLSSELELKRQHILKMNQN+TNQLIELE
Sbjct: 781  DKMVQVLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELE 840

Query: 929  RHFNGLELNKFGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSK 988
            RHFNGLELNKFGGNEESQ SERALQRKFG SRHSHS+HSLNNIMGSQLA AQLLSESLSK
Sbjct: 841  RHFNGLELNKFGGNEESQVSERALQRKFGSSRHSHSVHSLNNIMGSQLATAQLLSESLSK 900

Query: 989  QLAALNIESPSLKRQRVTKELFETIGLTYDASFGSPNVNKIAETSSKKLLLSADSFSSKD 1048
            QLAALN+ESPSLKRQ  TKELFE+IGLTYDASF SPNVNKIAETSSKKLLLS+DSFSSK 
Sbjct: 901  QLAALNMESPSLKRQSATKELFESIGLTYDASFSSPNVNKIAETSSKKLLLSSDSFSSKG 960

Query: 1049 TLRRKQRSGRKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSRTP 1108
            T RRKQ+SG KNSEAETGRRRRDSLDRNLASV+PPKTTVKRMLLQGIPSS+EK F SRTP
Sbjct: 961  TSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGIPSSEEKQFCSRTP 1020

Query: 1109 EGAATVAGPASRLTSSKSSSS------SKNAATPFMWASPLQPSNTSRQKSQPLPKTNTT 1168
            EGAATVA PASR+TSS SSSS      S+N  TPFMW SPLQPSNTSRQKS PL K N T
Sbjct: 1021 EGAATVARPASRITSSISSSSKNAGHDSENPETPFMWNSPLQPSNTSRQKSLPLQKINVT 1080

Query: 1169 APSPLSVFQSSHEMLKKSNNEAFNVTSENKFI-----EKSKASDFFSVTRSDSVQKSNIN 1228
             PSP  VFQSSH+MLKK NNEA +VTSENKF      EKSKASDFFS TRSDSVQKSNIN
Sbjct: 1081 PPSPPPVFQSSHDMLKKKNNEAHSVTSENKFTDVACPEKSKASDFFSATRSDSVQKSNIN 1140

Query: 1229 LDQKSSIFTISSKQTPTLKDSINTSNLDNQKTANAKERHTTTSPLFGSANKPESASLGTM 1288
            +DQKSSIFTISSKQ PT  DSI TSN+DNQKTAN KERHTTTSP FGSANKPES  +G+M
Sbjct: 1141 VDQKSSIFTISSKQMPTPIDSIATSNVDNQKTANVKERHTTTSPFFGSANKPESPFVGSM 1200

Query: 1289 SSLVPTVNEARKTEEKRSLTTISPSVPASAQLNTPSS-STLFLGFAVSKPLPSSAAVIDL 1348
             SLVPTV+ +RKTEEK+S+TTIS SV A A LNT SS STLF GFAVSK LPSSAAVIDL
Sbjct: 1201 PSLVPTVDGSRKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKALPSSAAVIDL 1260

Query: 1349 NQPVSTSTQLNFSTPVVSVSDSLFQAPKMISTSSTL-SLNPSLESSKKELPVSKSDDDTE 1408
            NQP STSTQLNFS+PVVS S+SLFQAPK++ TS TL SLNP+LESSK EL V KS+DD E
Sbjct: 1261 NQPPSTSTQLNFSSPVVSSSNSLFQAPKIVPTSPTLSSLNPTLESSKTELSVPKSNDDAE 1320

Query: 1409 KQTPASKPESYELKFQPSVTP-DKKHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSVAF 1468
            +Q  +SKP S+ELKFQPS+TP DK HVEPTSKT TV KDVGGQ  NV+G+AQPQQPSVAF
Sbjct: 1321 EQILSSKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQDSNVVGNAQPQQPSVAF 1380

Query: 1469 APLPSPNLTPKIFGNVRNETSNVTATQDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPMS 1528
            A +PSPNLT KIF N RNETSN   TQDDDMDEEAPETNNN+EF+LSSLGGFGNSSTP+S
Sbjct: 1381 ASIPSPNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPIS 1440

Query: 1529 GAPKPNPFGGPFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAF 1588
            G PKPNPFGGPFGNVNA SMTSSF MASPPSGELFRPASFSFQSPLASQAASQPTNSVAF
Sbjct: 1441 GGPKPNPFGGPFGNVNAASMTSSFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVAF 1500

Query: 1589 SGGFGSAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSPGGFG 1648
            SG FGSA+ TQ PSQGGFGQP+QIGVGQQALGNVLGSFGQSRQLGP++ GTGSGSPGGF 
Sbjct: 1501 SGAFGSAVPTQPPSQGGFGQPSQIGVGQQALGNVLGSFGQSRQLGPTVHGTGSGSPGGFS 1560

Query: 1649 GGFTSMKPVGGFASVGSSGGGSGGFAGVGSGGGGGFGGVGSNGGGFAGTVPTGGGFAGAS 1695
            GGFT+ KPV           G GGFAGVGSGGGGGFGGV    GGFAG   TGGGFAGAS
Sbjct: 1561 GGFTNAKPV-----------GVGGFAGVGSGGGGGFGGV----GGFAGAASTGGGFAGAS 1620

BLAST of CcUC11G220380 vs. ExPASy TrEMBL
Match: A0A1S3BDU8 (LOW QUALITY PROTEIN: nuclear pore complex protein NUP214 OS=Cucumis melo OX=3656 GN=LOC103488807 PE=4 SV=1)

HSP 1 Score: 2377.8 bits (6161), Expect = 0.0e+00
Identity = 1350/1681 (80.31%), Positives = 1436/1681 (85.43%), Query Frame = 0

Query: 89   MASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSE 148
            MASVDS  S LIPLEDAGEGEQIVRNDFYFQKIGKPVPVKL DSIFDP++PPSQP+ALSE
Sbjct: 1    MASVDSGSSPLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLGDSIFDPESPPSQPIALSE 60

Query: 149  SFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDS 208
            S GLIFVAHLSG+     KDVIASA+EIKNGGT SSVQDLSIVD+SIGKVHIL +STD+S
Sbjct: 61   SSGLIFVAHLSGFFVVRIKDVIASAQEIKNGGTCSSVQDLSIVDVSIGKVHILAVSTDNS 120

Query: 209  ILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLY 268
            +LAA+VAGD+H+FSVQSLLDKA+ P SSCS+TDSSFIKDFKWTRKLE++YLVLSKHGQLY
Sbjct: 121  VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180

Query: 269  QGSANGPPTHVMHDIDA------------------------------------------- 328
            QGS NGP THVMHDIDA                                           
Sbjct: 181  QGSVNGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHRFKERLSMSLLPSLGNGETD 240

Query: 329  ------VDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 388
                  VDCIKWVRADCIIIGCFQVTATGDEEDY VLVI+SKDGKITDVSSNKVLLSF D
Sbjct: 241  TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVLVIKSKDGKITDVSSNKVLLSFCD 300

Query: 389  IHSGFTRDILPGDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINID 448
            IHSGFTRDILPG+ GPCLLLSYLD CKLAIVANRLY+E+HI LLGLL EVENEVAV+NID
Sbjct: 301  IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360

Query: 449  RNTSLPKIELQANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLEGEL 508
            RNTSLPKIELQANGDDNLVMGLC+DRVSLPGKV+V+VGFEDMREVSPYCILVCLTLEGEL
Sbjct: 361  RNTSLPKIELQANGDDNLVMGLCVDRVSLPGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420

Query: 509  IMFQFSSVNETEAPHETVSACDEEEDDIIVPADDRSQLFSGSKKEFRED--DLKMQVTEK 568
            IMFQFSSVNETEAPHETVSACD+EEDDI VP DDRS+    SKKE RE   DLKMQVTEK
Sbjct: 421  IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSE----SKKESREANVDLKMQVTEK 480

Query: 569  LAISSEIPQEKIKISNDIKSSNNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLESS 628
            + ISSEIP+EK+K SNDIKSSNND+SPVS IDESA V  E NTKSQK DSFI+SQSL+SS
Sbjct: 481  ITISSEIPREKVKTSNDIKSSNNDRSPVSNIDESAIVSPEGNTKSQKVDSFIHSQSLKSS 540

Query: 629  VLER-PNYEIGNFDKPVQKF-GLGSVSISGKSADVHSQPFPNVKESTKRMVSTGLLAASE 688
              ER PN EIGNFDKPV KF GLGSVSISGK  DV SQPFPNVKES KR+ STGL+AASE
Sbjct: 541  APERPPNNEIGNFDKPVLKFTGLGSVSISGKPEDVPSQPFPNVKESQKRLGSTGLVAASE 600

Query: 689  LSSDKAMFLNKIDPISSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLT 748
            LSS+K MF  K+  +SSVLT NS QSS TENYGPSFGTANAF GF+GKPFQPKDVPSTLT
Sbjct: 601  LSSEKTMFFKKL-IVSSVLTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLT 660

Query: 749  QSGKQVTGGAGKIESLPVLRSSQISLQDNLSA-KISNEKHDGSDRNYSNAPLAKPMKEMC 808
            QSG+QVTGGAGKIESLPV+RSSQISLQD  S+ KISNEKHDGS+R YSN+PLAKPMKEMC
Sbjct: 661  QSGRQVTGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMC 720

Query: 809  EGLDMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERSQEVQNLF 868
            EGLD LLESIEE GGF+DACTAFQKSSVEALE GLASLSDECQIW+STMNER QEVQNLF
Sbjct: 721  EGLDTLLESIEESGGFMDACTAFQKSSVEALELGLASLSDECQIWRSTMNERVQEVQNLF 780

Query: 869  DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELE 928
            DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQN+TNQLIELE
Sbjct: 781  DKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELE 840

Query: 929  RHFNGLELNKFGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSK 988
            RHFNGLELNKFGGNEESQ SERALQRKFG SRHSHSLHSLNNIMGSQLA AQLLSESLSK
Sbjct: 841  RHFNGLELNKFGGNEESQVSERALQRKFGSSRHSHSLHSLNNIMGSQLATAQLLSESLSK 900

Query: 989  QLAALNIESPSLKRQRVTKELFETIGLTYDASFGSPNVNKIAETSSKKLLLSADSFSSKD 1048
            QLAALN+ESP LKRQ  TKELFETIGLTYDASF SPNVNKIA+TSSKKLLLS+DSFSSK 
Sbjct: 901  QLAALNMESPPLKRQSATKELFETIGLTYDASFSSPNVNKIADTSSKKLLLSSDSFSSKG 960

Query: 1049 TLRRKQRSGRKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSRTP 1108
            T RRKQ+SG KNSEAETGRRRRDSLDRNLASV+PPKTTVKRMLLQG PSS+EK FRSRTP
Sbjct: 961  TSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGTPSSEEKQFRSRTP 1020

Query: 1109 EGAATVAGPASRLTSSKSSSS------SKNAATPFMWASPLQPSNTSRQKSQPLPKTNTT 1168
            EGAATV  PASR+TSS SSSS      S+N ATPFMWAS LQPSNTSRQKS PL KTN T
Sbjct: 1021 EGAATVERPASRITSSISSSSKNAGHDSENPATPFMWASVLQPSNTSRQKSLPLQKTNAT 1080

Query: 1169 APSPLSVFQSSHEMLKKSNNEAF----NVTSENKFIEKSKASDFFSVTRSDSVQKSNINL 1228
            APSP  VFQSSH+MLKK   +               EKSKASDFFS TRSDSVQKS IN+
Sbjct: 1081 APSPPPVFQSSHDMLKKIIMQLTVRLQKTNLRTWHPEKSKASDFFSATRSDSVQKSKINV 1140

Query: 1229 DQKSSIFTISSKQTPTLKDSINTSNLDNQKTANAKERHTTTSPLFGSANKPESASLGTMS 1288
            DQKSSIFTISSKQTP  +DSI TSN+DNQKTAN KERHTTTS LFGSANKPES  +GTM 
Sbjct: 1141 DQKSSIFTISSKQTPPPEDSIGTSNVDNQKTANVKERHTTTSQLFGSANKPESPFVGTMP 1200

Query: 1289 SLVPTVNEARKTEEKRSLTTISPSVPASAQLNTPSS-STLFLGFAVSKPLPSS---AAVI 1348
            SLVPTV+ ARKTEEK+S+TTIS SV A A LNT SS STLF GFAVSK LPSS   AAV+
Sbjct: 1201 SLVPTVDGARKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKSLPSSAAVAAVV 1260

Query: 1349 DLNQPVSTSTQLNFSTPVVSVSDSLFQAPKMISTSSTLSLNPSLESSKKELPVSKSDDDT 1408
            DLNQP STSTQLNFS PVVS S+SLFQAPK+ ++ +  SLNP++ESSK EL V KS+DD 
Sbjct: 1261 DLNQPQSTSTQLNFS-PVVSGSNSLFQAPKVPTSPTLSSLNPTMESSKTELSVLKSNDDA 1320

Query: 1409 EKQTPASKPESYELKFQPSVTP-DKKHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSVA 1468
            EKQT +SKP S+ELKFQPS+TP DK HVEPTSKT TV KDVGGQVPNV+GDAQ QQPSVA
Sbjct: 1321 EKQTLSSKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQVPNVVGDAQAQQPSVA 1380

Query: 1469 FAPLPSPNLTPKIFGNVRNETSNVTATQDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPM 1528
            FA +PS NLT KIF N RNETSN   TQDDDMDEEAPETNNN+EF+LSSLGGFGNSSTP+
Sbjct: 1381 FASIPSQNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPI 1440

Query: 1529 SGAPKPNPFGGPFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVA 1588
            SGAPKPNPFGGPFGNVNA S+T+SF MASPPSGELFRPASFSFQSPLASQAASQPTNSVA
Sbjct: 1441 SGAPKPNPFGGPFGNVNAASVTTSFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVA 1500

Query: 1589 FSGGFGSAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSPGGF 1648
            FSG FGSA+ATQAP QGGFGQPAQIGVGQQALGNVLGSFGQSRQLGP+LPGTGSGSPGGF
Sbjct: 1501 FSGAFGSAVATQAPPQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPTLPGTGSGSPGGF 1560

Query: 1649 GGGFTSMKPVGGFASVGSSGGGSGGFAGVGSGGGGGFGGVGSNGGGFAGTVPTGGGFAGA 1695
             GGFT+ KPV           G GGFAGVGSGGGGGFGGV    GGFAG   TGGGFAGA
Sbjct: 1561 SGGFTNAKPV-----------GVGGFAGVGSGGGGGFGGV----GGFAGAASTGGGFAGA 1620

BLAST of CcUC11G220380 vs. ExPASy TrEMBL
Match: A0A6J1CBF2 (nuclear pore complex protein NUP214 OS=Momordica charantia OX=3673 GN=LOC111010057 PE=4 SV=1)

HSP 1 Score: 2144.4 bits (5555), Expect = 0.0e+00
Identity = 1247/1713 (72.80%), Positives = 1360/1713 (79.39%), Query Frame = 0

Query: 86   LQFMASVDSRPSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLA 145
            LQ      S  ST I  E+A EGE +   D+YF+KIG+PVPVKL DSIFD ++PPSQPLA
Sbjct: 4    LQDSTPSTSSTSTPIRFEEAEEGEHVESTDYYFEKIGEPVPVKLHDSIFDSESPPSQPLA 63

Query: 146  LSESFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLST 205
            +SESFGLIFVAHLSG+    T+DVIASA+EIKNGGTGSSVQDLSI+D+S+G+VHIL LS 
Sbjct: 64   VSESFGLIFVAHLSGFFVARTEDVIASAKEIKNGGTGSSVQDLSIMDVSVGRVHILALSA 123

Query: 206  DDSILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHG 265
            D S +AA+VA DIHLFSV SLLDKA  P  SCS+TDSS IKDFKW RKLE SYLVLSKHG
Sbjct: 124  DSSTIAAVVAADIHLFSVHSLLDKAAKPFYSCSITDSSCIKDFKWIRKLESSYLVLSKHG 183

Query: 266  QLYQGSANGPPTHVMHDIDA---------------------------------------- 325
            QLYQGSANG   HVMHD DA                                        
Sbjct: 184  QLYQGSANGTLKHVMHDTDAVECSVKGRFIAVAKKDTLTIFSSKFKERLSMSLLPSDADS 243

Query: 326  -----VDCIKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRDI 385
                 VDCIKWVRADCII+GCF+VTA GDEE+YFV VIRSKDGKITDVSSN+VLLSF+ I
Sbjct: 244  NFIVKVDCIKWVRADCIILGCFEVTAIGDEENYFVQVIRSKDGKITDVSSNRVLLSFQYI 303

Query: 386  HSGFTRDILPGDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVINIDR 445
            H GFTRDILP   GPCL  SYL KCKLAIVANR   ++HIVLLG L EVEN+VAVI+I+R
Sbjct: 304  HPGFTRDILPVGSGPCLFSSYLGKCKLAIVANRNNTDQHIVLLGWLPEVENQVAVIDIER 363

Query: 446  NTSLPKIELQANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLEGELI 505
            +TSLP+IELQ NGDDNLVMGLCIDRVSLP KV ++VG EDMREVSPYCIL+CLTLEG+L+
Sbjct: 364  DTSLPRIELQENGDDNLVMGLCIDRVSLPAKVKIQVGVEDMREVSPYCILLCLTLEGKLV 423

Query: 506  MFQFSSVNETEAPHETVSAC-DEEEDDIIVPADDRSQLFSGSKKEFREDDL-KMQVTEKL 565
            MF  SS+NETE PHETVSAC DEEEDD IVP DD+ Q+ S S+KE RE  + +M  T+K+
Sbjct: 424  MFHLSSINETETPHETVSACEDEEEDDTIVPIDDQPQVSSESRKELREAMVGQMHDTDKI 483

Query: 566  AISSEIPQEKIKISNDIKSSNNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLESSV 625
              SSEIP+EKI ISNDIK S+ DQSPVS ID+SA V  ESN+KS+K  SFIYSQ L+SS+
Sbjct: 484  TTSSEIPEEKINISNDIKPSDIDQSPVSYIDKSAIVSRESNSKSEKVGSFIYSQPLKSSI 543

Query: 626  LERPNYEIGNFDKPVQKF-GLGSVSISGKSADVHSQPFPNVKESTKRMVSTGLLAASELS 685
            LE+PN EIGNF KPVQKF GLGSV+ SG+SADV SQPF N KEST R+ STGL  ASELS
Sbjct: 544  LEKPNSEIGNFGKPVQKFTGLGSVAFSGQSADVPSQPFLNAKESTLRLGSTGLQDASELS 603

Query: 686  SDKAMFLNKIDPISSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLTQS 745
            SD+AMFLNKIDP SSVL  NS QS+KT+N GPSFG ANAF  F+G+ FQ KDV STLTQ 
Sbjct: 604  SDRAMFLNKIDPASSVLPLNSLQSTKTDNLGPSFGAANAFTAFTGRSFQTKDVSSTLTQI 663

Query: 746  GKQVTGGAGKIESLPVLRSSQISLQDNLS-AKISNEKHDGSDRNYSNAPLAKPMKEMCEG 805
            G+QVT GAGKIESLP +RSSQ+ LQDN S  K SNEKH  S+RNYSN PLAKPMKEMC+G
Sbjct: 664  GRQVTAGAGKIESLPPMRSSQVPLQDNFSLGKTSNEKHSRSERNYSNVPLAKPMKEMCDG 723

Query: 806  LDMLLESIEEPGGFLDACTAFQKSSVEALERGLASLSDECQIWKSTMNERSQEVQNLFDK 865
            LDMLLESIEEPGGF DACTA QKSS+EALE GLA+LSD+CQIW  TMNER+QE+QNLFDK
Sbjct: 724  LDMLLESIEEPGGFWDACTASQKSSIEALELGLATLSDQCQIWGRTMNERAQEIQNLFDK 783

Query: 866  MV-QVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELER 925
             V QV+ KKTYIEGIV QAS S YWE WDRQ+LSSELELKRQHILK NQNMTNQLIELER
Sbjct: 784  TVNQVMPKKTYIEGIVKQASHSHYWEHWDRQRLSSELELKRQHILKTNQNMTNQLIELER 843

Query: 926  HFNGLELNKFGGNEESQASERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSKQ 985
            HFNGLELNKFGGN+ESQ SERALQRKFG SRHSHS HSLNNI GSQLAAAQLLSESLSKQ
Sbjct: 844  HFNGLELNKFGGNDESQVSERALQRKFGSSRHSHSFHSLNNITGSQLAAAQLLSESLSKQ 903

Query: 986  LAALNIESPSLKRQRVTKELFETIGLTYDASFGSPNVNKIAETSSKKLLLSADSFSSKDT 1045
            +AALNIESPS KRQ VTKELFETIG+TYDASF SPNVNKIAETSSKKLLLSADSFSSKD+
Sbjct: 904  MAALNIESPSSKRQSVTKELFETIGITYDASFSSPNVNKIAETSSKKLLLSADSFSSKDS 963

Query: 1046 LRRKQRSGRKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSRTPE 1105
             RRK RSG KNSEAETGRRRR+SLDRNLASVEPPKTTVKRMLL+GIP +DEK FRS TPE
Sbjct: 964  SRRKLRSGMKNSEAETGRRRRESLDRNLASVEPPKTTVKRMLLEGIPLADEKHFRSPTPE 1023

Query: 1106 GAATVAGPASRLTSSKSSSSSKNA-------ATPFMWASPLQPSNTSRQKSQPLPKTNTT 1165
            G ATV  PASR+ SS  SSSSKNA       ATPFMW+SP Q SN SRQKSQPL KTN T
Sbjct: 1024 GTATVTRPASRIASSMLSSSSKNAEHSSENPATPFMWSSPSQSSNISRQKSQPLKKTNAT 1083

Query: 1166 APSPLS-VFQSSHEMLKKSNNEAFNVTSENKFI-----EKSKASDFFSVTRSDSVQKSNI 1225
            APSPL  V+QSSHEM KKSN EA++VTS+NKF      EKSK+SDF S+TRSDSVQKSNI
Sbjct: 1084 APSPLPVVYQSSHEMPKKSNTEAYSVTSDNKFTEATYPEKSKSSDFLSLTRSDSVQKSNI 1143

Query: 1226 NLDQKSSIFTISSKQTPTLKDSINTSNLDNQKTANAKERHTTTSPLFGSANKPESASLGT 1285
            NLDQKSSIF IS+ Q PTLKDSINTSNL+ QKTAN KERHT  S LF SANKPESA +GT
Sbjct: 1144 NLDQKSSIFKISNNQMPTLKDSINTSNLNGQKTANVKERHTPKSSLFESANKPESAFVGT 1203

Query: 1286 MSSLVPTVNEARKTEEKRSLTTISPSVPASAQLNTPSS-STLFLGFAVSKPLPSSAAVID 1345
             S+ VPTV  ARKTEEK SLT  SPSVPA A LNTPSS STLF GF+V+K L +S A +D
Sbjct: 1204 ASTPVPTVLGARKTEEKTSLTAFSPSVPAPALLNTPSSASTLFSGFSVTKSLTNSTAHVD 1263

Query: 1346 LNQPVSTSTQLNFSTPVVSVSDSLFQAPKMISTSSTLSLNPSLESSKKELPVSKSDDDTE 1405
            LN+P+ST TQ NFS+P VSVSDSLFQAPKM+S S      P+   SKKELP  KSD DT 
Sbjct: 1264 LNKPLSTFTQSNFSSPAVSVSDSLFQAPKMVSPS------PTTLESKKELPGPKSDADTP 1323

Query: 1406 KQTPASK-PESYELKFQPSVTP-DKKHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSVA 1465
            K  P SK PES+ELK QPSVTP DK HVEPTS + TV KDVGG VPNV+     QQ S A
Sbjct: 1324 KPAPDSKPPESHELKLQPSVTPADKNHVEPTSGSQTVPKDVGGLVPNVL-----QQSSAA 1383

Query: 1466 FAPLPSPNLTPKIFGNVRNETSNVTATQDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPM 1525
            F PLP+ NLT K   N +NETS+   TQDDDMDEEAPET NN+EFSLSSLGGFGNSSTP+
Sbjct: 1384 FVPLPTLNLTSKSSTNGKNETSDAALTQDDDMDEEAPET-NNVEFSLSSLGGFGNSSTPI 1443

Query: 1526 SGAPKPNPFGGPFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVA 1585
            S APK NPFGGPFGNVNATSM SSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVA
Sbjct: 1444 SSAPKSNPFGGPFGNVNATSMNSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVA 1503

Query: 1586 FSGGFGSAMAT--QAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSPG 1645
            FSGGFGS MAT  Q  SQGGFGQPAQIGVGQQALG VLG+FG+SRQLGPSLPGT SGSP 
Sbjct: 1504 FSGGFGSGMATQPQTSSQGGFGQPAQIGVGQQALGTVLGAFGRSRQLGPSLPGTASGSPS 1563

Query: 1646 GFGGGFTSMKPVGGFASVGSSGGG---------SGGFAGVGSGGGGGFGGVGSN------ 1695
            GF GGFT +KP+GGFA VGS  GG          GGF GVGSG GGGFG VGS+      
Sbjct: 1564 GFSGGFTGVKPIGGFAGVGSGSGGGFGGVGSVSGGGFGGVGSGSGGGFGAVGSSSGGGFG 1623

BLAST of CcUC11G220380 vs. ExPASy TrEMBL
Match: A0A6J1HNV2 (nuclear pore complex protein NUP214 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111466378 PE=4 SV=1)

HSP 1 Score: 2040.0 bits (5284), Expect = 0.0e+00
Identity = 1193/1721 (69.32%), Positives = 1314/1721 (76.35%), Query Frame = 0

Query: 89   MASVDSR---PSTLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLA 148
            MASVDSR    ST IPLED+ EGE +  ND+YF+KIG+PVPVKL DSIFDP +PPSQPLA
Sbjct: 1    MASVDSRHSTSSTPIPLEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA 60

Query: 149  LSESFGLIFVAHLSGW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLST 208
            +SESFGLIFVAHLSG+    TKDV+ASA+E+KNGGTGSS+QDLSIVD+S+GKVH+L LS 
Sbjct: 61   VSESFGLIFVAHLSGFFAVRTKDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN 120

Query: 209  DDSILAAIVAGDIHLFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHG 268
            D+S LAA+VAGD+HLF V SLLDK + PS SCS TDSS IKDFKWTRK E+SYLVLSKHG
Sbjct: 121  DNSFLAAVVAGDVHLFLVDSLLDKGEEPSFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG 180

Query: 269  QLYQGSANGPPTHVMHDIDAVDC------------------------------------- 328
            +LYQGSA+GP  H+MHDIDAV+C                                     
Sbjct: 181  KLYQGSASGPFKHIMHDIDAVECSVKGKFIAVAKKDTLTVFSYKFKERLSMSLLPSLGNG 240

Query: 329  ------------IKWVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLS 388
                        IKWVRADCIIIGCFQVTATGDEEDYFV VIRSKDGKITDVSSNKVLLS
Sbjct: 241  DTDTDFAVKVDSIKWVRADCIIIGCFQVTATGDEEDYFVQVIRSKDGKITDVSSNKVLLS 300

Query: 389  FRDIHSGFTRDILPGDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQEVENEVAVI 448
            F DI+SGFT DILP + GPCLLLSYLDKCKLAIVANR   ++HIVLLG LQEVENEVAVI
Sbjct: 301  FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI 360

Query: 449  NIDRNTSLPKIELQANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLE 508
            +I+R+ SLP+IELQ NGDDNLVMGLCIDRVSLPGKV V+VG E++REVSPYC L+CLTLE
Sbjct: 361  DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE 420

Query: 509  GELIMFQFSSVNETEAPHETVSACD-EEEDDIIVPADDRSQLFSGSKKEFREDDLKMQVT 568
            G+LI+F FSS NE+EA  ETVSACD EEED+ +VP DD+ QLF                 
Sbjct: 421  GKLILFHFSSANESEASDETVSACDEEEEDETVVPTDDQPQLF----------------- 480

Query: 569  EKLAISSEIPQEKIKISNDIKSSNNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLE 628
                                  SN DQ PVSK+D S  +  ESN KSQ+ DS  +SQ L+
Sbjct: 481  ----------------------SNIDQRPVSKVDGSPVITRESNAKSQQMDSLAFSQPLK 540

Query: 629  SSVLERPNYEIGNFDKPVQKF-GLGSVSISGKSADVHSQPFP------------NVKEST 688
             S LERPN EIGNF KPV+ F GLGSV+ SG+S DV SQP              N  +  
Sbjct: 541  PSTLERPNNEIGNFAKPVKNFTGLGSVAFSGQSVDVPSQPLKSSILERPNNEIGNFNKPF 600

Query: 689  KRMVSTGLLAASELSSD------KAMFL--------------NKIDPISSV--------L 748
             +    G +A S  S D      K  FL               K   + SV        +
Sbjct: 601  HKFTGLGSVAFSGQSVDMPNESLKPSFLERPNNQIGNFDKPVQKFTGLGSVAFSEQSVDV 660

Query: 749  TPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPSTLTQSGKQVTGGAGKIESLPVL 808
              + F + K      S G ANAF GF+GKPFQPKDVPSTLTQSG+QV+ GAGKIESLPV+
Sbjct: 661  PSHPFLNVKESTIKHSSGAANAFTGFAGKPFQPKDVPSTLTQSGRQVSAGAGKIESLPVI 720

Query: 809  RSSQISLQDNLS-AKISNEKHDGSDRNYSNAPLAKPMKEMCEGLDMLLESIEEPGGFLDA 868
            +SSQ+SLQDN S  KISN+K DGS+RNY N PLAKPM EMCEGLDMLLESIEEPGGFLDA
Sbjct: 721  QSSQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMNEMCEGLDMLLESIEEPGGFLDA 780

Query: 869  CTAFQKSSVEALERGLASLSDECQIWKSTMNERSQEVQNLFDKMVQVLSKKTYIEGIVMQ 928
            CT FQKSSVEAL  GLA+LSD+CQIW+ TM ER+QEVQNLFD+ V+VLSKKTYIEGIV Q
Sbjct: 781  CTTFQKSSVEALGLGLATLSDQCQIWRRTMTERAQEVQNLFDRTVEVLSKKTYIEGIVTQ 840

Query: 929  ASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELERHFNGLELNKFGGNEESQA 988
            ASDS YW+ WDRQKLSSELELKRQ IL+MNQNMTNQLIELERHFNGLELN FGGNEE Q 
Sbjct: 841  ASDSNYWDHWDRQKLSSELELKRQRILQMNQNMTNQLIELERHFNGLELNTFGGNEEIQV 900

Query: 989  SERALQRKFGYSRHSHSLHSLNNIMGSQLAAAQLLSESLSKQLAALNIESPSLKRQRVTK 1048
            +ER LQRKFG SR SHSLHSLNNIMGSQLAAAQLLS++LSKQ+A LNI+SPS KRQ +TK
Sbjct: 901  NERTLQRKFGSSRQSHSLHSLNNIMGSQLAAAQLLSDNLSKQIATLNIKSPSSKRQSITK 960

Query: 1049 ELFETIGLTYDASFGSPNVNKIAETSSKKLLLSADSFSSKDTLRRKQRSGRKNSEAETGR 1108
            ELFETIG+TYDASF SPNVNKI ETSSKKLLLSADSFSSKDT RRKQRSG K SE ETGR
Sbjct: 961  ELFETIGITYDASFSSPNVNKIPETSSKKLLLSADSFSSKDTSRRKQRSGAKISETETGR 1020

Query: 1109 RRRDSLDRNLASVEPPKTTVKRMLLQGIPSSDEKLFRSRTPEGAATVAGPASRLTSSKSS 1168
            RRRDSLDRNLAS++PPKTTVKRM+LQG P S+EK FRS T EG ATVA PA R+ SS  S
Sbjct: 1021 RRRDSLDRNLASIQPPKTTVKRMILQGTPLSNEKQFRSPTLEGPATVARPAGRIPSSMLS 1080

Query: 1169 SSSKNA-------ATPFMWASPLQPSNTSRQKSQPLPKTNTTAPSPLSVFQSSHEMLKKS 1228
            SSSKNA       ATPF WASP       RQK QPL KTN TAPSPL V+QSSHEM+KKS
Sbjct: 1081 SSSKNAEQGSENPATPFSWASP------PRQKFQPLQKTNGTAPSPLPVYQSSHEMVKKS 1140

Query: 1229 NNEAFNVTSENKFI-----EKSKASDFFSVTRSDSVQKSNINLDQKSSIFTISSKQTPTL 1288
            N+EA++  SENKF      EKSKASDFFS+ RSDSVQKSN+N +QKSS F  SSK   T 
Sbjct: 1141 NSEAYSAASENKFAEVTYPEKSKASDFFSLARSDSVQKSNMNFEQKSSFFVTSSKPMSTP 1200

Query: 1289 KDSINTSNLDNQKTANAKERHTTTSPLFGSANKPESASLGTMSSLVPTVNEARKTEEKRS 1348
            KDSI T N ++QKTAN KER TT SPLFG+ANKPE AS+GT SSLVPTV+E RKTEEK+ 
Sbjct: 1201 KDSIETLNPNSQKTANVKERLTTPSPLFGAANKPEPASVGTTSSLVPTVDELRKTEEKKP 1260

Query: 1349 LTTISPSVPASAQLNTPSS-STLFLGFAVSKPLPSSAAVIDLNQPVSTSTQLNFSTPVVS 1408
             T  SPSVPAS  +NTPSS STLF G  +SK  PS AAV+DLN+P+STSTQ +F++PVVS
Sbjct: 1261 PTVFSPSVPASVPVNTPSSASTLFSGSPLSKSFPSPAAVVDLNKPLSTSTQSSFASPVVS 1320

Query: 1409 VSDSLFQAPKMISTSSTL-SLNPSLESSKKELPVSKSDDDTEKQTPASKPESYELKFQPS 1468
            VSDSLFQAPKM+S  STL SLNPSL SS KE P+ KSD DTEKQ  ASKPE  ELK QPS
Sbjct: 1321 VSDSLFQAPKMVSPPSTLSSLNPSLVSSSKEQPIPKSDADTEKQAQASKPEFRELKLQPS 1380

Query: 1469 VT-PDKKHVEPTSKTHTVSKDVGGQVPNVIGDAQPQQPSVAFAPLPSPNLTPKIFGNVRN 1528
            VT     HVEPTS T TVSKDVGG VP+VI DAQPQQ S AF PLPSPN TPK+  N ++
Sbjct: 1381 VTLAVGNHVEPTSVTQTVSKDVGGHVPSVIADAQPQQSSAAFVPLPSPNSTPKVSANGKS 1440

Query: 1529 ETSNVTATQDDDMDEEAPETNNNIEFSLSSLGGFGNSSTPMSGAPKPNPFGGPFGNVNAT 1588
            ETS+   TQDDDMDEEAPET NN+EFSLSSLGGFG +STPMS APKPNPFGG FGN NAT
Sbjct: 1441 ETSDALITQDDDMDEEAPET-NNVEFSLSSLGGFGTTSTPMSNAPKPNPFGGSFGNANAT 1500

Query: 1589 SMTSSFTMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGGFGSAMATQAPSQGGF 1648
            SM SSFT ASPPSGELFRPASFSFQSPLASQAASQPTNSVAFS  FGS MATQAP+QGGF
Sbjct: 1501 SMNSSFTTASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSSSFGSGMATQAPTQGGF 1560

Query: 1649 GQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSPGGF-GGGFTSMKPVGGFASVGS 1695
            GQPAQIGVGQQALG VLGSFGQSRQLGPSLPGT SGSPGGF GGGFTS+KPVG       
Sbjct: 1561 GQPAQIGVGQQALGTVLGSFGQSRQLGPSLPGTASGSPGGFNGGGFTSVKPVG------- 1620

BLAST of CcUC11G220380 vs. TAIR 10
Match: AT1G55540.1 (Nuclear pore complex protein)

HSP 1 Score: 793.9 bits (2049), Expect = 2.6e-229
Identity = 691/1849 (37.37%), Positives = 939/1849 (50.78%), Query Frame = 0

Query: 100  IPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSESFGLIFVAHLS 159
            + +E+  EG++I  ND+YF++IG+P+ +K  D+ +D + PPSQPLA+SE   ++FVAH S
Sbjct: 4    VEIEEDTEGDRISTNDYYFERIGEPISIKEDDAQYDLENPPSQPLAISERHAVLFVAHSS 63

Query: 160  GW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDSILAAIVAGDIH 219
            G+    T DVI++++     G    +QDLS+VD+ +G V IL+LS DDSILA  VA DIH
Sbjct: 64   GFFVGRTNDVISASKNSNGNGDKVFIQDLSLVDVPVGDVRILSLSADDSILAVTVAADIH 123

Query: 220  LFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLYQGSANGPPTHV 279
             FSV SLL K   PS S S  +S F+KDF+W R  + SYLVLS  G+L+ G  N PP HV
Sbjct: 124  FFSVDSLLKKDAKPSFSYSPDESGFVKDFRWRRNDKHSYLVLSNTGKLFHGIDNAPPRHV 183

Query: 280  MHDIDA-------------------------------------------------VDCIK 339
            M  +DA                                                 VD I+
Sbjct: 184  MDAVDAVEWSSKGSYIAVAQDNSLRIFSSKFNEKRCIALSFDSWIGDSDEDCFVKVDSIR 243

Query: 340  WVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRDIHSGFTRDILP 399
            WVR +CI++GCFQ+   G EE+Y V VIRS DGKI+D S+N V LSF D+      D++P
Sbjct: 244  WVRNNCILLGCFQL-IEGREENYLVQVIRSPDGKISDGSTNLVALSFSDLFPCSMDDLVP 303

Query: 400  GDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQ-EVENEVAVINIDRNTSLPKIEL 459
              +GP LL SY+D+CKLA+ ANR  ++EHIVLL     + ++ V+V++IDR T LP+I L
Sbjct: 304  VGVGPHLLFSYIDQCKLAVTANRKSIDEHIVLLDWSSGDDKSAVSVVDIDRETFLPRIGL 363

Query: 460  QANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLEGELIMFQFSSVNE 519
            Q N DDN VMGLCIDRVS+ G V VR G ++++E+ PY +LVCLTLEG+L+MF  +SV  
Sbjct: 364  QENNDDNTVMGLCIDRVSIEGTVNVRSGDDELKELQPYFVLVCLTLEGKLVMFNVASVAG 423

Query: 520  TEAPHETVSACDEEEDDIIVP--ADDRSQLFSGSKKEFR---EDDLKMQVTEKLAISSEI 579
              A  +T  A   + +D   P   DD S+  S   ++     ++D K   TEK +    +
Sbjct: 424  RPASSDTDLASSSDIEDAYTPLIEDDLSKQSSEKHQQLNIAVQNDQKHLNTEKFSTEQRL 483

Query: 580  PQEKI--KISNDIKSS-----NNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLESS 639
            P E I  K    +KSS     N  Q P ++         +S        SF     L  S
Sbjct: 484  PNENIFSKEFESVKSSVSGDNNKKQEPYAEKPLQVEDAQQSMIPRLSGTSF---GQLPMS 543

Query: 640  VLERPNYEIGNFDKPVQKFGLGSVSISGKSADVHSQPFPNVKESTKRMVSTGLLAA---- 699
            +    N +   F   +         I  +S  +H Q     K +     S GL  A    
Sbjct: 544  LGYDTN-KFAGFGPALPVSEKLQKDIFAQSNSMHLQANVESKSTAAFFGSPGLQNAILQS 603

Query: 700  -SELSSDKAMFLNKIDPISSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPS 759
                SS        + P   V  P  F S +   +  S     +  G+   P   KD   
Sbjct: 604  PQNTSSQPWSSGKSVSPPDFVSGP--FPSMRDTQHKQS---VQSGTGYVNPPMSIKDKSV 663

Query: 760  TLTQSGK---------------QVTGGAGKIESLPVLRSSQISLQ--DNLSAKISNEKHD 819
             + ++G+                   G  KIE +P +R+SQ+S Q   +     S+++H 
Sbjct: 664  QVIETGRVSALSNLSPLLGQNQDTNEGVEKIEPIPSIRASQLSQQVKSSFEKSASHQQHK 723

Query: 820  GS--------DRNYSNAPLAKPMKEMCEGLDMLLESIEEPGGFLDACTAFQKSSVEALER 879
                      + N SN P    + EM   +D LL+SIE PGGF D+C    KS+VE LE+
Sbjct: 724  TPLSTGPLRLEHNMSNQP--SNINEMAREMDTLLQSIEGPGGFKDSCAFILKSNVEELEQ 783

Query: 880  GLASLSDECQIWKSTMNERSQEVQNLFDKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQK 939
            GL SL+ +CQ WKST++E+  E+Q+L DK +QVL+KKTY+EG+  Q +D++YW+ W+RQK
Sbjct: 784  GLESLAGKCQTWKSTIHEQQAEIQHLLDKTIQVLAKKTYMEGMYKQTADNQYWQLWNRQK 843

Query: 940  LSSELELKRQHILKMNQNMTNQLIELERHFNGLELNKFGGNEESQASERALQRKFGYSRH 999
            L+ ELE KRQHI+K+N+++T+QLIELER+FN LEL+++  +     + R +  +   SR 
Sbjct: 844  LNPELEAKRQHIMKLNKDLTHQLIELERYFNRLELDRYNEDGGHPVARRGVPNRSAPSRR 903

Query: 1000 SHSLHSLNNIMGSQLAAAQLLSESLSKQLAALNIESPSLKRQRVTKELFETIGLTYDASF 1059
              SLHSL+N M SQLAAA+ LSE LSKQ+  L I+SP   ++ V +ELFETIG+ YDASF
Sbjct: 904  VQSLHSLHNTMSSQLAAAEQLSECLSKQMTYLKIDSP--VKKNVKQELFETIGIPYDASF 963

Query: 1060 GSPNVNKIAETSS-KKLLLSADSFSSKDTLRRKQRSGRKNSEAETGRRRRDSLDRNLASV 1119
             SP+  K    SS K LLLS+   S     R++Q S  KNS+ ET RRRR+SLDRN A+ 
Sbjct: 964  SSPDAVKAKNASSAKNLLLSSIPASINQQSRQRQSSAMKNSDPETARRRRESLDRNWAAF 1023

Query: 1120 EPPKTTVKRMLLQ---------------GIPSSDEKLFRS--RTPEGAATVAGPASRLTS 1179
            EPPKTTVKRMLLQ                + S++    RS     + A+ V      +  
Sbjct: 1024 EPPKTTVKRMLLQEQQKTGMNQQTVLSERLRSANNTQDRSLLHVKDHASPVVSSNKGIME 1083

Query: 1180 SKSSSSSKNAATPFMWASPLQPSN-----------------TSRQKSQPLPKTNTTAPSP 1239
            S    +S+  +TPF    P+  SN                 +  + S        +APS 
Sbjct: 1084 SFQQDTSEAQSTPFKTRPPMPQSNSPFTISPISASKPSFNWSGNKSSNTTSYAEESAPSQ 1143

Query: 1240 L----SVFQSS---------------HEMLKKSNNEAFNVTSENKFIE-----------K 1299
            +    +V Q                  +  KK+    F+    N F+E            
Sbjct: 1144 IKDTRTVSQPGGSSFLPKRPVASTVLEQTEKKAGEFKFSEAKANAFVETAAGSVQRLSTT 1203

Query: 1300 SKASDFFS--------------VTRSDSVQKSNINLDQKSSI----FTISSKQTP---TL 1359
            S  SDF S                 S    KS    +  SSI    FT  +   P   T 
Sbjct: 1204 SSGSDFESSKGFGAQFSTMSSGAPASSFSSKSLFGFNSSSSIPGDKFTFPAVTAPLSGTP 1263

Query: 1360 KDSINT-SNLDNQKTANAKERHTTTSPLFGSANKPESASLGTMSSLVPT----------- 1419
             DS +T     +   +++ +     S    SA  P++ S+ + S++  T           
Sbjct: 1264 LDSTSTLFTASSAPVSSSSQDPVPASIPISSAPVPQTFSVTSTSTVSATGFNVPFGKPLT 1323

Query: 1420 -----VNEARKTEEKRS----------LTTISPSVPASAQLNTPSSSTLFLGFA----VS 1479
                 +N+A  +    S          L  +SPS P     +T  SS LF   A    VS
Sbjct: 1324 SVKVDLNQAAPSTPSPSPGPTAGFTFNLPALSPSSPEMVSSSTGQSS-LFPPSAPTSQVS 1383

Query: 1480 KPLPSSAAVIDLNQPVSTSTQLNFSTPVVSVSDSLFQAPKMISTSSTLSLNPSLESSKKE 1539
                S+ + +  +  + +ST L+ STP ++  D+ FQ+P++ + SS + +   +   KK 
Sbjct: 1384 SDQASATSSLTDSSRLFSSTSLS-STPPITPPDA-FQSPQVSTPSSAVPITEPVSEPKKP 1443

Query: 1540 LPVSKSDDDTEKQTP----ASKPESYELKFQPSVTPDKKHVEPTSKTHTVSKDVGGQVPN 1599
               S S   T+        A+K ++  L  +  ++     V P S +  +S    G   +
Sbjct: 1444 EAQSSSILSTQSTVDSVANATKTQNEPLPVKSEISNPGTTVTPVSSSGFLSGFSSGTQSS 1503

Query: 1600 VI----------GDAQPQQPSVAFAPLPSPNLTPKIFGNVRNETSNVTATQDDDMDEEAP 1659
            +           G +QPQQ S   AP P+ + T     +   E  ++  TQ+D+MDEEAP
Sbjct: 1504 LASMAAPSFSWPGSSQPQQLSSTPAPFPASSPTS---ASPFGEKKDIVDTQEDEMDEEAP 1563

Query: 1660 ETNNNIEFSLSSLGGFGNSSTPMSGAPKPNPFGGPFGNVNATSMTSSFTMASPPSGELFR 1695
            E +   E S+ S GGFG  STP  GAPK NPFGGPFG  NAT+ TS+    + PSGELF+
Sbjct: 1564 EASQTTELSMGSFGGFGLGSTPNPGAPKTNPFGGPFG--NATTTTSNPFNMTVPSGELFK 1623

BLAST of CcUC11G220380 vs. TAIR 10
Match: AT1G55540.2 (Nuclear pore complex protein)

HSP 1 Score: 788.5 bits (2035), Expect = 1.1e-227
Identity = 691/1852 (37.31%), Positives = 939/1852 (50.70%), Query Frame = 0

Query: 100  IPLEDAGEGEQIVRNDFYFQKIGKPVPVKLCDSIFDPQTPPSQPLALSESFGLIFVAHLS 159
            + +E+  EG++I  ND+YF++IG+P+ +K  D+ +D + PPSQPLA+SE   ++FVAH S
Sbjct: 4    VEIEEDTEGDRISTNDYYFERIGEPISIKEDDAQYDLENPPSQPLAISERHAVLFVAHSS 63

Query: 160  GW----TKDVIASAEEIKNGGTGSSVQDLSIVDISIGKVHILTLSTDDSILAAIVAGDIH 219
            G+    T DVI++++     G    +QDLS+VD+ +G V IL+LS DDSILA  VA DIH
Sbjct: 64   GFFVGRTNDVISASKNSNGNGDKVFIQDLSLVDVPVGDVRILSLSADDSILAVTVAADIH 123

Query: 220  LFSVQSLLDKAKTPSSSCSLTDSSFIKDFKWTRKLEDSYLVLSKHGQLYQGSANGPPTHV 279
             FSV SLL K   PS S S  +S F+KDF+W R  + SYLVLS  G+L+ G  N PP HV
Sbjct: 124  FFSVDSLLKKDAKPSFSYSPDESGFVKDFRWRRNDKHSYLVLSNTGKLFHGIDNAPPRHV 183

Query: 280  MHDIDA-------------------------------------------------VDCIK 339
            M  +DA                                                 VD I+
Sbjct: 184  MDAVDAVEWSSKGSYIAVAQDNSLRIFSSKFNEKRCIALSFDSWIGDSDEDCFVKVDSIR 243

Query: 340  WVRADCIIIGCFQVTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRDIHSGFTRDILP 399
            WVR +CI++GCFQ+   G EE+Y V VIRS DGKI+D S+N V LSF D+      D++P
Sbjct: 244  WVRNNCILLGCFQL-IEGREENYLVQVIRSPDGKISDGSTNLVALSFSDLFPCSMDDLVP 303

Query: 400  GDIGPCLLLSYLDKCKLAIVANRLYMEEHIVLLGLLQ-EVENEVAVINIDRNTSLPKIEL 459
              +GP LL SY+D+CKLA+ ANR  ++EHIVLL     + ++ V+V++IDR T LP+I L
Sbjct: 304  VGVGPHLLFSYIDQCKLAVTANRKSIDEHIVLLDWSSGDDKSAVSVVDIDRETFLPRIGL 363

Query: 460  QANGDDNLVMGLCIDRVSLPGKVLVRVGFEDMREVSPYCILVCLTLEGELIMFQFSSVNE 519
            Q N DDN VMGLCIDRVS+ G V VR G ++++E+ PY +LVCLTLEG+L+MF  +SV  
Sbjct: 364  QENNDDNTVMGLCIDRVSIEGTVNVRSGDDELKELQPYFVLVCLTLEGKLVMFNVASVAG 423

Query: 520  TEAPHETVSACDEEEDDIIVP--ADDRSQLFSGSKKEFR---EDDLKMQVTEKLAISSEI 579
              A  +T  A   + +D   P   DD S+  S   ++     ++D K   TEK +    +
Sbjct: 424  RPASSDTDLASSSDIEDAYTPLIEDDLSKQSSEKHQQLNIAVQNDQKHLNTEKFSTEQRL 483

Query: 580  PQEKI--KISNDIKSS-----NNDQSPVSKIDESATVGAESNTKSQKADSFIYSQSLESS 639
            P E I  K    +KSS     N  Q P ++         +S        SF     L  S
Sbjct: 484  PNENIFSKEFESVKSSVSGDNNKKQEPYAEKPLQVEDAQQSMIPRLSGTSF---GQLPMS 543

Query: 640  VLERPNYEIGNFDKPVQKFGLGSVSISGKSADVHSQPFPNVKESTKRMVSTGLLAA---- 699
            +    N +   F   +         I  +S  +H Q     K +     S GL  A    
Sbjct: 544  LGYDTN-KFAGFGPALPVSEKLQKDIFAQSNSMHLQANVESKSTAAFFGSPGLQNAILQS 603

Query: 700  -SELSSDKAMFLNKIDPISSVLTPNSFQSSKTENYGPSFGTANAFAGFSGKPFQPKDVPS 759
                SS        + P   V  P  F S +   +  S     +  G+   P   KD   
Sbjct: 604  PQNTSSQPWSSGKSVSPPDFVSGP--FPSMRDTQHKQS---VQSGTGYVNPPMSIKDKSV 663

Query: 760  TLTQSGK---------------QVTGGAGKIESLPVLRSSQISLQ--DNLSAKISNEKHD 819
             + ++G+                   G  KIE +P +R+SQ+S Q   +     S+++H 
Sbjct: 664  QVIETGRVSALSNLSPLLGQNQDTNEGVEKIEPIPSIRASQLSQQVKSSFEKSASHQQHK 723

Query: 820  GS--------DRNYSNAPLAKPMKEMCEGLDMLLESIEEPGGFLDACTAFQKSSVEALER 879
                      + N SN P    + EM   +D LL+SIE PGGF D+C    KS+VE LE+
Sbjct: 724  TPLSTGPLRLEHNMSNQP--SNINEMAREMDTLLQSIEGPGGFKDSCAFILKSNVEELEQ 783

Query: 880  GLASLSDECQIWKSTMNERSQEVQNLFDKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQK 939
            GL SL+ +CQ WKST++E+  E+Q+L DK +QVL+KKTY+EG+  Q +D++YW+ W+RQK
Sbjct: 784  GLESLAGKCQTWKSTIHEQQAEIQHLLDKTIQVLAKKTYMEGMYKQTADNQYWQLWNRQK 843

Query: 940  LSSELELKRQHILKMNQNMTNQLIELERHFNGLELNKFGGNEESQASERALQRKFGYSRH 999
            L+ ELE KRQHI+K+N+++T+QLIELER+FN LEL+++  +     + R +  +   SR 
Sbjct: 844  LNPELEAKRQHIMKLNKDLTHQLIELERYFNRLELDRYNEDGGHPVARRGVPNRSAPSRR 903

Query: 1000 SHSLHSLNNIMGSQLAAAQLLSESLSKQLAALNIESPSLKRQRVTKELFETIGLTYDASF 1059
              SLHSL+N M SQLAAA+ LSE LSKQ+  L I+SP   ++ V +ELFETIG+ YDASF
Sbjct: 904  VQSLHSLHNTMSSQLAAAEQLSECLSKQMTYLKIDSP--VKKNVKQELFETIGIPYDASF 963

Query: 1060 GSPNVNKIAETSS-KKLLLSADSFSSKDTLRRKQRSGRKNSEAETGRRRRDSLDR---NL 1119
             SP+  K    SS K LLLS+   S     R++Q S  KNS+ ET RRRR+SLDR   N 
Sbjct: 964  SSPDAVKAKNASSAKNLLLSSIPASINQQSRQRQSSAMKNSDPETARRRRESLDRVIFNW 1023

Query: 1120 ASVEPPKTTVKRMLLQ---------------GIPSSDEKLFRS--RTPEGAATVAGPASR 1179
            A+ EPPKTTVKRMLLQ                + S++    RS     + A+ V      
Sbjct: 1024 AAFEPPKTTVKRMLLQEQQKTGMNQQTVLSERLRSANNTQDRSLLHVKDHASPVVSSNKG 1083

Query: 1180 LTSSKSSSSSKNAATPFMWASPLQPSN-----------------TSRQKSQPLPKTNTTA 1239
            +  S    +S+  +TPF    P+  SN                 +  + S        +A
Sbjct: 1084 IMESFQQDTSEAQSTPFKTRPPMPQSNSPFTISPISASKPSFNWSGNKSSNTTSYAEESA 1143

Query: 1240 PSPL----SVFQSS---------------HEMLKKSNNEAFNVTSENKFIE--------- 1299
            PS +    +V Q                  +  KK+    F+    N F+E         
Sbjct: 1144 PSQIKDTRTVSQPGGSSFLPKRPVASTVLEQTEKKAGEFKFSEAKANAFVETAAGSVQRL 1203

Query: 1300 --KSKASDFFS--------------VTRSDSVQKSNINLDQKSSI----FTISSKQTP-- 1359
               S  SDF S                 S    KS    +  SSI    FT  +   P  
Sbjct: 1204 STTSSGSDFESSKGFGAQFSTMSSGAPASSFSSKSLFGFNSSSSIPGDKFTFPAVTAPLS 1263

Query: 1360 -TLKDSINT-SNLDNQKTANAKERHTTTSPLFGSANKPESASLGTMSSLVPT-------- 1419
             T  DS +T     +   +++ +     S    SA  P++ S+ + S++  T        
Sbjct: 1264 GTPLDSTSTLFTASSAPVSSSSQDPVPASIPISSAPVPQTFSVTSTSTVSATGFNVPFGK 1323

Query: 1420 --------VNEARKTEEKRS----------LTTISPSVPASAQLNTPSSSTLFLGFA--- 1479
                    +N+A  +    S          L  +SPS P     +T  SS LF   A   
Sbjct: 1324 PLTSVKVDLNQAAPSTPSPSPGPTAGFTFNLPALSPSSPEMVSSSTGQSS-LFPPSAPTS 1383

Query: 1480 -VSKPLPSSAAVIDLNQPVSTSTQLNFSTPVVSVSDSLFQAPKMISTSSTLSLNPSLESS 1539
             VS    S+ + +  +  + +ST L+ STP ++  D+ FQ+P++ + SS + +   +   
Sbjct: 1384 QVSSDQASATSSLTDSSRLFSSTSLS-STPPITPPDA-FQSPQVSTPSSAVPITEPVSEP 1443

Query: 1540 KKELPVSKSDDDTEKQTP----ASKPESYELKFQPSVTPDKKHVEPTSKTHTVSKDVGGQ 1599
            KK    S S   T+        A+K ++  L  +  ++     V P S +  +S    G 
Sbjct: 1444 KKPEAQSSSILSTQSTVDSVANATKTQNEPLPVKSEISNPGTTVTPVSSSGFLSGFSSGT 1503

Query: 1600 VPNVI----------GDAQPQQPSVAFAPLPSPNLTPKIFGNVRNETSNVTATQDDDMDE 1659
              ++           G +QPQQ S   AP P+ + T     +   E  ++  TQ+D+MDE
Sbjct: 1504 QSSLASMAAPSFSWPGSSQPQQLSSTPAPFPASSPTS---ASPFGEKKDIVDTQEDEMDE 1563

Query: 1660 EAPETNNNIEFSLSSLGGFGNSSTPMSGAPKPNPFGGPFGNVNATSMTSSFTMASPPSGE 1695
            EAPE +   E S+ S GGFG  STP  GAPK NPFGGPFG  NAT+ TS+    + PSGE
Sbjct: 1564 EAPEASQTTELSMGSFGGFGLGSTPNPGAPKTNPFGGPFG--NATTTTSNPFNMTVPSGE 1623

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_038892124.1	0.0e+00	87.11	nuclear pore complex protein NUP214 isoform X2 [Benincasa hispida]
XP_038892123.1	0.0e+00	86.85	nuclear pore complex protein NUP214 isoform X1 [Benincasa hispida]
XP_031741375.1	0.0e+00	81.17	nuclear pore complex protein NUP214 isoform X2 [Cucumis sativus]
XP_031741374.1	0.0e+00	80.69	nuclear pore complex protein NUP214 isoform X1 [Cucumis sativus] >KGN52214.2 hyp...
KAA0034115.1	0.0e+00	79.79	nuclear pore complex protein NUP214 [Cucumis melo var. makuwa]

Match Name	E-value	Identity	Description
F4I1T7	1.5e-226	37.31	Nuclear pore complex protein NUP214 OS=Arabidopsis thaliana OX=3702 GN=NUP214 PE...

Match Name	E-value	Identity	Description
A0A5A7SY34	0.0e+00	79.79	Nuclear pore complex protein NUP214 OS=Cucumis melo var. makuwa OX=1194695 GN=E6...
A0A0A0KV45	0.0e+00	78.77	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G583270 PE=4 SV=1
A0A1S3BDU8	0.0e+00	80.31	LOW QUALITY PROTEIN: nuclear pore complex protein NUP214 OS=Cucumis melo OX=3656...
A0A6J1CBF2	0.0e+00	72.80	nuclear pore complex protein NUP214 OS=Momordica charantia OX=3673 GN=LOC1110100...
A0A6J1HNV2	0.0e+00	69.32	nuclear pore complex protein NUP214 isoform X3 OS=Cucurbita maxima OX=3661 GN=LO...

Match Name	E-value	Identity	Description
AT1G55540.1	2.6e-229	37.37	Nuclear pore complex protein
AT1G55540.2	1.1e-227	37.31	Nuclear pore complex protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (PI 537277) v1
Date Performed: 2022-01-31
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	COILS	Coil	Coil	coord: 859..879
None	No IPR available	COILS	Coil	Coil	coord: 918..938
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1312..1379
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1446..1470
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 989..1019
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1553..1577
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1059..1113
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1326..1344
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 989..1113
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 527..556
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1179..1217
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1355..1369
None	No IPR available	SUPERFAMILY	117289	Nucleoporin domain	coord: 106..460
IPR044694	Nuclear pore complex protein NUP214	PANTHER	PTHR34418	NUCLEAR PORE COMPLEX PROTEIN NUP214 ISOFORM X1	coord: 12..63; coord: 100..1694
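
Several of the MOBIDB_LITE disorder predictions listed above overlap or nest (for example 989..1019 and 1059..1113 both fall inside 989..1113), so summing the rows directly would count some residues more than once. The sketch below is an illustration only, assuming 1-based inclusive coordinates as printed in the table; `merge_intervals` is a hypothetical helper name. It collapses the intervals into non-overlapping regions and counts the residues they cover.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent 1-based inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps (or touches) the previous region: extend it if needed.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(region) for region in merged]


# MOBIDB_LITE coordinates copied from the InterPro table above.
disorder = [
    (1312, 1379), (1446, 1470), (989, 1019), (1553, 1577), (1059, 1113),
    (1326, 1344), (989, 1113), (527, 556), (1179, 1217), (1355, 1369),
]

regions = merge_intervals(disorder)
# Inclusive coordinates, so each region spans end - start + 1 residues.
covered = sum(end - start + 1 for start, end in regions)

for start, end in regions:
    print("%d..%d" % (start, end))
print("residues in predicted disordered regions:", covered)
```

Under these assumptions the ten rows reduce to six distinct regions. Treating adjacent intervals as one region is a design choice; drop the `+ 1` in the comparison to keep touching regions separate.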

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CcUC11G220380.1	CcUC11G220380.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0009793	embryo development ending in seed dormancy
biological_process	GO:0006405	RNA export from nucleus
biological_process	GO:0010070	zygote asymmetric cell division
molecular_function	GO:0017056	structural constituent of nuclear pore