Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGACAGAGCGCGAGGAGATTCGTTACGAAGGAGGAAGAGGTGGGAAGTTTCAGAAACGGCCTCTCAGAAGGTCGCATACGACGCCGTATGATCGGCCGCCGACTGCTCTGAGAAACTCTGCTGGAAACGGATGGCTCTCTAAGCTTGTCGATCCGGCGCAGAAGCTCATCACCTCCAGTGCACATAGGCTGTTTTCTAGCGTGTTTCGTAAACGTATTCCCCCTCCGCCGCCGTCATTGCCCGTTTCTCGAGGTTTTTTGCTTCTTCTTTTTTTGTTTGTTTCCTGGTGTTTTTTGGTTTTTAATTGGAATGCTCTCTTAACGATTGTTAGAATGGTTTATGGGTTTTGTTTCTCGCATTCTGGTACTCGTTCGTTTATTTTGGAGATGTTGGTTGAAGATGGCTGAAAAAAAACGTGGATTTTTATATTTTATATTTTGCTTTATTCGTGAGGGAGTTACTATAGCTGTGGGGTTTTCTTTTTGGCATAGAGGAACCTCGAATTTATGATTTTGTATGGGTGTTTAGGTTTGTTGTGGCTCAATTGTTTAAATGTCATTAATTTGCGTGATTCCCTGCTGTATATTCCTCATGTTGAATTGCAAGTTCTCTAAAGCTTAATAGTTGAACAATATTGAGTCTGCTTGGAATGTTTTTTTGTTTTCGTTGCTTGGATGTCACATATATTAAAATTTGTTCTCCAGACAATAGGGGTAGCCCGAGTGTGATTTCTATCACTTTTTAAAACGACCACTTATATAAATTTTAAGGATTGGTTTTAATTTTTATTAAGAACTTCGTGAAAATATGTATTTTGTCCTTGTATTCATGTATTTGAGCTCAAAGCCAAGGCCAAGGTCCGATATTGTAATGCTGTTCGTGCCACCCCCATTGGACTATCTGTTTTGAGAACCCCAGAAGGATTTTTGGAAAGCATAACTTTTTTAGTCTCCATTTGGTTCTCTATATGTGTTTATATATATATGTATATATATATATATATTTACAGTATTTTTTTTACTGTTACCCTTTGGCTCTGTTTCGTTTGTATTTTACCATTGAATAGGACATTTTTGCTTCCCATTAATAATGTTAATCCAAAAAATGAATATGTTTAAAGTAAGTTATTCTGAGTAGTAATCAATAAAAAATGTATCACCCGTGGTTGAGAAGACTGCTTACAAGAACACATGTAAACAACAGAATTCTATTTCTTATTCTGTTGGCTGTAACGTCATTTTAATATTTAAAACTGCAAATTGTTTCTAAGTATGTAACGACTTTACTTGAACATGACCTTTTCTATAGGTTGTACTACTGTGTTCTAAAACATTGCCCTCAGACAACTTGATGTCATAATTTTGTTTTTCTACGAACGTGACATCTTAGATTTTGAAATATACCAATGTTGATACCCATTTAGGAAGCATGTTTTCATTATTCTATTATGTTTTTTGGGCTATACATTGTTGTGCTCGCATTAACTTTTGGCATGGGAGCTTGGTTGAAGTTTTCGCATTGACTTTTCTGGTTATTCTTTTACTGTTTTGTTCTTGCATTTTATACTTCCAGAGGCTAATGATGAGATGGGAAATGAGAACCATGAAGAAGTTGCAGCTGTAAGTTTTATCTCATATGGGTTTTTGCTTATCATTAACTTGTGCATGTGTATCTTTTTTGTTTTTTTTTTTTTAAATCCAATTCTGTTATTAATTGAAAACCGTTACATCATCACTGAGTTTTTTTCTCCCTCCATGTACATATTTTTGTTATGCTTTAACATTTTCATAATTGCAGATATTGTGAGATTTTCTTTAATTAAAAAACTGCATTATTTTCTTTTTTCTTTCTGATTTACAAATTGAAAAGGCTATTTTAAAATGTTGTTAAAAAAAAAAAAAAAGGACATGATGCCAAAAGTTAGCATTAGTGAGTGACCTATTTTATTTTCATGAATGCATTAATTTCCGTATCTGTTGAGGAGAAACTGCTTGCAAAAATCAGTTCCACAAAAATCTGGTATTTTTGCGTTTCTACAGTCTGAATTTTTATGTCAACATTTTCTGATTTTTTTTCTTAATCTTTAGGATCTTCCTGGAACTCAAGAAGGGACGAACCGCGATTTTGGTCCGAGTATCAAAACTGATAACACACATGGGGTGACTGACCTTGAGCAAATTTTGAAGGAGAAGACCTTTACAAGGCAAGTATTTGAGTGTTTGTTGGTTAAATTATTATTTTTTTTTCTCCTATCTCAATTCTATATGAAGAGTTGCCTTGTTTTGGTTGATATTTAATTGAGTGAATTATATGGAAATTTGAACATGGTTATATTGTAATTAGTAACTTAAAACATGTAAGAAGTTATGAAATAGTATGAATGCTCTTGCATGAAATGTATAAGCCACAATGTTGCTACAAAATACCGTGTCATTTTTTCTCCATCAGGATAGAAAATCTGGGTTTGCTTTTATATATTGGTTTTAGCAGGATATTGATTTGATATTTAATACGTCAAATGCTAACCACTCTACAGATTTGAGATTGATCGCTTGACTGAACTTTTGAAATCAAGAGTTGTTGATGTTCCAAGTGGGGCTGAAGAGAGGAAATTTGAACAGGCCCCTTCTACGCCTGTTATCAGTTATGAAATACAGGAAGGATCTCCAAAATTTCGAGCTCAAGATGGAATTAGCTCTCACGTGGTTCCAACTCAAGTTATGAGAGCAAATGTAGGAATTATTTTATTTTATTTTAATTTTTTTGTTGGTTTAGTTCCTGCGTTGGTTAATGTTTAGTGCTGGTTTTCCGTAGTTTGCTCACACTGAAAACTTTGTTTGCATTTTTGGCTTTACTGTCACTTGTATAATTGTTTCAGGTCCTTGATGAGGATGTTGCTTCACCTGCAGAGATTGCAAAAGCATACATGGGCAGCAGGCCTCCAAAATCAACTCCGTTGAGTATGGCGTCCCATAGTCACAAGTTTGGGGATAGTTTTGCTTCAGAAAATCTCTCGAAATCCTCTGCTTTAACTCTTGTGCCGAGATCTCCTGGAAATTTTGATGTTATTGAAAATGGTTTTGTCACCCCAAGATCTCGAGGCAGATCTGCTCTGTACAATATGGCTCGAATGCCATATTCCGGAGTTAGTGCAACCCATAGCATAAAGGTTTGTACTCTAATGAAATACATCTGCTTTATTGTTCAAGTCCTTGGTACCTTCCTTATCAGTTTTTTTTACCCCTCTGGCTCATACTTCAGAATAGTGTAGCGACAACAGATGCTTACAGGGCAACAGGGTCCTCATCATCTCAGTTAGCATGGGAGCGAGGGAGAGTTCTGGGGTCTAAACAAGGGGTACGAGTTATGAAAAGGCTATTTTATAGCATATTTTATTATTGAGTAACCTCCTAGATCATGTTATTTTATTTAATTTCCCTTTCTGTAGGCTTTAAAACGCAGAAGCTCAGTTTTAGATGATGAAATGGGACATGTTGGTCCTATTCGTAGACTTCGTCACAAATCCAATCTCCTTTACCCGACAGGTTTGAGTTTACCAAGCAGTTCCACTTCTATTCCAGTAAGTGGAATTGGTTCTGAGAATGCCCAGCAGTTTCAGTCTACAAAAGTGCATCCATTTTCATCTTCTTCTGGGAAGGGACTTTATTCGCGTAGTTTGTCCAAACGGTCTGCAGAGTCCGAAAATGATGTGAAACCTAGTTCAAGTTTTTCTCAAATTCCTCTCAGGTCAAGTGAGATGGCCTCAAAAATACTCGAGCAGCTTGAGAAATTGACCCCTCCAAAGGATAAGTCTTCGAAACTAAAGCTGCTTAGTGTGACGAATAACTCACCCACAAAGTTGTCGCCATCAATGTTGCATGGGCCAGCTCTTAGAAGCCTGGAAGATGTGGATTCATCTAAGTATTTGGAAAATGTTGAAGACATTCAGTCTAATGATGCCCGGGAGCTTACTTCTCAAAAGAACAACAAGGTTGAGGAAAGCAGTTCATTAAAATATAAATTACCCATTAATAAGGAAATTTCTGCAGGTGATGGATTAGGTTCTCCAGTTCCTACGAAGGACACCGTACTTAGTTCTCGTCCGCAAGTTTCATTCGTTGGTGCTTCCCCACAAACAAAATGTGCTTTCCAAATGAGTGCACATGAGGTGTGTGGTTTGCTTCCTTCTAGTTCATACAATAGATCAATCCTGTTGTGTTTCTATCTTTTAATTTATCAATCTTCCTGCGTTCCCTCCCAACTAAATAAATGAAAAGAATACTAATGAGGTGAGAATATTTTAATTTTAGCTTCTTGCATTTTCATTTTTAAACGAGGGAAATTACTCTTCAATCATCAAACGTCTTTCTATTTCCGTCCAAGTTTTTCATTTTCTTTTATTTCCCTCACACACCCAAACCTGAAAAAGAAGGTGGCTAAGTATATCTGTGTGTGTCCGTTATAAAACTTCTAAGCTTTAATATCTTCTTTATAATGCCTATTCTTTTGCTAGAATTTTGTGAATATCGATGCCCTCTATATGCATGTCCCTTATGAACACCCCCCCCCCCCCCCCCCCCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCCCCCCCCCAATTTGCAATATTTACGGGTCGTCTGTCTCCTTTTGTTAGGATTTTGTAGATATTGATGAAGAAGGATGTTCTAATGGGCCAGTGACTGATATATCGTTTGATAGGCGAGAAAAAATGGATGCCTCATTAGTGGCAATGAGCAAGCCGAGTGATACTGAAGCCATCACAGTAGATAAGCCTCAGGCTTCAGCTGAGGTTAAACCATCCACTGTATCTGAACTGAACAAAATAAATGGCCAGAGAAAATCTGATGTTCCTGTGACTGCTGAGAAGAGCCCCATCTTTTCCTTTGCTACAGCATCTCCACCTAGCATTACAACCAATGCAAAAGATCCTGAATCAACCTTGAGACCTGAAAAAAATATTTCACCTGAGGCACCAAAGCCCGCTAATGCTCCTATATTTGGCTTTGGAGATAAGCTACCATCACAGAAGGAATCATTTTCTTCTGCTCCCACCTTTGCGTTTGGAAACAAGTTTGCCCCCTCAACGAATGAACAAAATGCTGTTCCTGTTGCAACTTCTGAAAGCAATGTTGCACCAGGAAAAGCTACATTTCCTATTCCTGCAAATGCTGCCACAGAAAATGGAAATAAGAATACAGGGTCACCGTTTAAATTTGCATCGCCTTTAGTCAATGAAAAAGAAAGTGCTAAAGTGGGCAGCTCTTCGGTTTTTAAAGCCGAGAGTAATAGCAGCAGGTCAGTTTGATACTTTCGTTGTTTACCTTGGACTTGGAGAATGTTTGCTTTGTTTATACTTCTATCCATAGGGTTATGTTTCAATATTTAGTTTTTATGTGAAATGCATGATAAATGGGATTTTAATTATTTTGTGAAAACCTAGATATCTATTATCTACAACCCATATAGACTCCAACGAGGCCCCACTTACTATCTTAGATTTTGGCTTTATAGTACTTGCTATCTATTAAAGTACTCTGAATGATGTTCAAGAAGTACATTATTCTATCTAATGCAGTTGTCCCGAGGATTTCGAAACTATGGCCAGTCCTTGATTTCTACCACCATTTAAGCTTTTTGACTTTTCGATGACTGAATCTTTCCCTCTTTTGTGTTTGAGCTTTTGTGAGGACGGCACATACATGAATTTATATGAAGCTAATTGATTCTGTTGCATTTTTAAAGTTTGTAATCATCAATGGATGAAGCTGTATTCTTGTCGTTATTTTTCCTTAACTTATTCAAGAAATTACTTTGGTTTATCATTTTAGATTCTGTTAACAACTTTGTATCGTAGTTTTTTCGTTTTTTTAACAAGATTATATGTTTTTTGCGGGCTGAGAAAAGCGTTGTGCATCATAATCGACGTGTTTGTGCTTTGTATGGGCTTAGGACACTATGAAAAATTCGCATTCTTTTAATTGGAATTTCCCTCTTTCACCTCCCAAGATCTGGTCGCCTAGTCTGTTTCTAGGTTGTTGATACTCGTGATGCTTTATGAAAATTACTAGTGACTAATGTTTTAGACGGTTTTTATTGCAGCATCCTGTCATTTGGAGTTCCAAAAGAGTCCATGTCAGACAAAGCTGGTGATAAGAGTTTGAGTGCCGGTCTCTCGGTTGGTACATCTGAGAATTTGTTTTCGTCATCTGTCTCGACATCGACATCAGCTCCCAGCTTATTTTCCTTCAGCTCCCCTAGCACTAATTCAAATCTTAATAATGGATCTCTTGTTTCTACCCCATCTATATTTTCCTCCCCAGCCACCACCTTTTCGAATAATATAACAAATCAAAATCCATCCATCAAACCGTCCCTCACTACTGCCCCTAGCAACAGCGAACCCGCCACCACTACTAGCCTTTCTATGCCTTCTCCTGTGCCATCTTTTTCAGCTGCACCTATTTTCAAATTTGGGAGCCCTAGTGTTCCTTCATCTTCCGCTCCAGCTCTATCAGCAGCGGAAACCAAGACCAAGCAAGAGACAACCTTTGGTAATTTAAGTGGCATTCCTCCAAGCGACACGTCTGCTGCTAAAGTATCTAGTACTGGAGGCAGCGTTTTTCAATTCGGAGCTGCAGCTACTACAGATTCTAATAAACGACCAGAAAATTCGACTTCTGCTCCAGGCAATGTCCCTACATTTGGTGCTCCGGTTTTTCCTGCTAATAGTGGGGTTGCATCTTCTACTCAGAGTACACCTGTTTTGCCATTCAGTTCATCATCTACATCATTTGGTTTGGCTGCGAATACGGGTTTGTCTTCTGGCAGTTCTCTGTTTGGTTCTTCAGCTCCAGCGTCGAATTTGTTCTCTTCTGGCACGACGTTTGGGCTTACTGGTACAAGTTCTTCTGCTAATAATTCTGTCAGCTCGGGTGCTGGTACCAGCTCTAGCTTCTTTAACTGGCAGACATCCTCCACGCCATCATTTTCCACTGGATTCAGCTCAACTCCAACTGGAGGGTTCCCTTTTGGTCTTTCATCTTCTTCCGCTGCTTCTAATAGTTCACCTATGCTCTTTGGGTCATCAACAACTAGTGCATCGACAACGTCAATGTTCTCATTTACTTCCGCTGCGTCGTCACAGCCTGCTTTCAGTAATTCTAATCATGGCTTCACTTTTGGTTCAACACCCCCTGCCAATAATGATCAAGCAAATATGGAGGATAGCATGGCTGAGGATACCGTCCAGGCAGTCACGCTGCCTACGCCTACTTTTGGGCAACAGCCCCTTACACCACCTCCATCATCAGGGTTTATGTTTGGTTCAGCGGCCCCTCCTCCGGTAGCAGCAAGTCCTTTCCAGTTCGGCAGCCAGCAAAATGCACCTACCCCACAAAATCCCTCTCCATTTCATGCTTCTGGTAGCTTAGACTTCAATGCCAGTGCTGGAGGGAGCTTCTCATTAGGCGCGGGTGGCGGTGACAAAGCGAACCGAAAATACGTGAAAGTTAAAAGCAAATCACGAAAGAAGTAGGAGCAATCGAGAGAGAAGGTATTATTCGTTTCTTTTTGTGTTTGTATACCCCAGTTATTTTTCATCTAGGAAAAAGTTATATTTCCATCACCGTCACGAACATCGGTTGGATTGAGGAGGGGTGGAGGGAGCGGGATATCATTTGTCACCTGGATTGTTTGATTTCGACATAATACGAGGAAACGTCGAAGCCAAGAATTGTTGGTTGTGTTTTCTCATTTAAGTGGCAAATTATTTTGTGGTTTTGTTGTTGGTAGCACAAGTTCACAAATTTTGTATTAGGTTTAGCAAAGTGGAAGAACACTCCTTGTACTGTAATAGTAAGTAGTAGTCTGTGAATGTTTCAGGTAAAACATCAACCCTGTGCTATGGACTCAGTTTTCCTGTAATTTGTTATTTGTTGTAACATAAGAAAAGGGAAATATATGCTGTTGTTCAAGGAAGC
mRNA sequence
ATGGCGACAGAGCGCGAGGAGATTCGTTACGAAGGAGGAAGAGGTGGGAAGTTTCAGAAACGGCCTCTCAGAAGGTCGCATACGACGCCGTATGATCGGCCGCCGACTGCTCTGAGAAACTCTGCTGGAAACGGATGGCTCTCTAAGCTTGTCGATCCGGCGCAGAAGCTCATCACCTCCAGTGCACATAGGCTGTTTTCTAGCGTGTTTCGTAAACGTATTCCCCCTCCGCCGCCGTCATTGCCCGTTTCTCGAGGTTTTTTGCTTCTTCTTTTTTTGTTTGTTTCCTGGTGTTTTTTGGTTTTTAATTGGAATGCTCTCTTAACGATTGTTAGAATGGTTTATGGGTTTTGTTTCTCGCATTCTGGTACTCAGGCTAATGATGAGATGGGAAATGAGAACCATGAAGAAGTTGCAGCTGATCTTCCTGGAACTCAAGAAGGGACGAACCGCGATTTTGGTCCGAGTATCAAAACTGATAACACACATGGGGTGACTGACCTTGAGCAAATTTTGAAGGAGAAGACCTTTACAAGGCAAGTCCTTGATGAGGATGTTGCTTCACCTGCAGAGATTGCAAAAGCATACATGGGCAGCAGGCCTCCAAAATCAACTCCGTTGAGTATGGCGTCCCATAGTCACAAGTTTGGGGATAGTTTTGCTTCAGAAAATCTCTCGAAATCCTCTGCTTTAACTCTTGTGCCGAGATCTCCTGGAAATTTTGATGTTATTGAAAATGGTTTTGTCACCCCAAGATCTCGAGGCAGATCTGCTCTGTACAATATGGCTCGAATGCCATATTCCGGAGTTAGTGCAACCCATAGCATAAAGAATAGTGTAGCGACAACAGATGCTTACAGGGCAACAGGGTCCTCATCATCTCAGTTAGCATGGGAGCGAGGGAGAGTTCTGGGGTCTAAACAAGGGGCTTTAAAACGCAGAAGCTCAGTTTTAGATGATGAAATGGGACATGTTGGTCCTATTCGTAGACTTCGTCACAAATCCAATCTCCTTTACCCGACAGGTTTGAGTTTACCAAGCAGTTCCACTTCTATTCCAGTAAGTGGAATTGGTTCTGAGAATGCCCAGCAGTTTCAGTCTACAAAAGTGCATCCATTTTCATCTTCTTCTGGGAAGGGACTTTATTCGCGTAGTTTGTCCAAACGGTCTGCAGAGTCCGAAAATGATGTGAAACCTAGTTCAAGTTTTTCTCAAATTCCTCTCAGGTCAAGTGAGATGGCCTCAAAAATACTCGAGCAGCTTGAGAAATTGACCCCTCCAAAGGATAAGTCTTCGAAACTAAAGCTGCTTAGTGTGACGAATAACTCACCCACAAAGTTGTCGCCATCAATGTTGCATGGGCCAGCTCTTAGAAGCCTGGAAGATGTGGATTCATCTAAGTATTTGGAAAATGTTGAAGACATTCAGTCTAATGATGCCCGGGAGCTTACTTCTCAAAAGAACAACAAGGTTGAGGAAAGCAGTTCATTAAAATATAAATTACCCATTAATAAGGAAATTTCTGCAGGTGATGGATTAGGTTCTCCAGTTCCTACGAAGGACACCGTACTTAGTTCTCGTCCGCAAGTTTCATTCGTTGGTGCTTCCCCACAAACAAAATGTGCTTTCCAAATGAGTGCACATGAGGATTTTGTAGATATTGATGAAGAAGGATGTTCTAATGGGCCAGTGACTGATATATCGTTTGATAGGCGAGAAAAAATGGATGCCTCATTAGTGGCAATGAGCAAGCCGAGTGATACTGAAGCCATCACAGTAGATAAGCCTCAGGCTTCAGCTGAGGTTAAACCATCCACTGTATCTGAACTGAACAAAATAAATGGCCAGAGAAAATCTGATGTTCCTGTGACTGCTGAGAAGAGCCCCATCTTTTCCTTTGCTACAGCATCTCCACCTAGCATTACAACCAATGCAAAAGATCCTGAATCAACCTTGAGACCTGAAAAAAATATTTCACCTGAGGCACCAAAGCCCGCTAATGCTCCTATATTTGGCTTTGGAGATAAGCTACCATCACAGAAGGAATCATTTTCTTCTGCTCCCACCTTTGCGTTTGGAAACAAGTTTGCCCCCTCAACGAATGAACAAAATGCTGTTCCTGTTGCAACTTCTGAAAGCAATGTTGCACCAGGAAAAGCTACATTTCCTATTCCTGCAAATGCTGCCACAGAAAATGGAAATAAGAATACAGGGTCACCGTTTAAATTTGCATCGCCTTTAGTCAATGAAAAAGAAAGTGCTAAAGTGGGCAGCTCTTCGGTTTTTAAAGCCGAGAGTAATAGCAGCAGCATCCTGTCATTTGGAGTTCCAAAAGAGTCCATGTCAGACAAAGCTGGTGATAAGAGTTTGAGTGCCGGTCTCTCGGTTGGTACATCTGAGAATTTGTTTTCGTCATCTGTCTCGACATCGACATCAGCTCCCAGCTTATTTTCCTTCAGCTCCCCTAGCACTAATTCAAATCTTAATAATGGATCTCTTGTTTCTACCCCATCTATATTTTCCTCCCCAGCCACCACCTTTTCGAATAATATAACAAATCAAAATCCATCCATCAAACCGTCCCTCACTACTGCCCCTAGCAACAGCGAACCCGCCACCACTACTAGCCTTTCTATGCCTTCTCCTGTGCCATCTTTTTCAGCTGCACCTATTTTCAAATTTGGGAGCCCTAGTGTTCCTTCATCTTCCGCTCCAGCTCTATCAGCAGCGGAAACCAAGACCAAGCAAGAGACAACCTTTGGTAATTTAAGTGGCATTCCTCCAAGCGACACGTCTGCTGCTAAAGTATCTAGTACTGGAGGCAGCGTTTTTCAATTCGGAGCTGCAGCTACTACAGATTCTAATAAACGACCAGAAAATTCGACTTCTGCTCCAGGCAATGTCCCTACATTTGGTGCTCCGGTTTTTCCTGCTAATAGTGGGGTTGCATCTTCTACTCAGAGTACACCTGTTTTGCCATTCAGTTCATCATCTACATCATTTGGTTTGGCTGCGAATACGGGTTTGTCTTCTGGCAGTTCTCTGTTTGGTTCTTCAGCTCCAGCGTCGAATTTGTTCTCTTCTGGCACGACGTTTGGGCTTACTGGTACAAGTTCTTCTGCTAATAATTCTGTCAGCTCGGGTGCTGGTACCAGCTCTAGCTTCTTTAACTGGCAGACATCCTCCACGCCATCATTTTCCACTGGATTCAGCTCAACTCCAACTGGAGGGTTCCCTTTTGGTCTTTCATCTTCTTCCGCTGCTTCTAATAGTTCACCTATGCTCTTTGGGTCATCAACAACTAGTGCATCGACAACGTCAATGTTCTCATTTACTTCCGCTGCGTCGTCACAGCCTGCTTTCAGTAATTCTAATCATGGCTTCACTTTTGGTTCAACACCCCCTGCCAATAATGATCAAGCAAATATGGAGGATAGCATGGCTGAGGATACCGTCCAGGCAGTCACGCTGCCTACGCCTACTTTTGGGCAACAGCCCCTTACACCACCTCCATCATCAGGGTTTATGTTTGGTTCAGCGGCCCCTCCTCCGGTAGCAGCAAGTCCTTTCCAGTTCGGCAGCCAGCAAAATGCACCTACCCCACAAAATCCCTCTCCATTTCATGCTTCTGGTAGCTTAGACTTCAATGCCAGTGCTGGAGGGAGCTTCTCATTAGGCGCGGGTGGCGGTGACAAAGCGAACCGAAAATACGTGAAAGTTAAAAGCAAATCACGAAAGAAGTAGGAGCAATCGAGAGAGAAGGTATTATTCGTTTCTTTTTGTGTTTGTATACCCCAGTTATTTTTCATCTAGGAAAAAGTTATATTTCCATCACCGTCACGAACATCGGTTGGATTGAGGAGGGGTGGAGGGAGCGGGATATCATTTGTCACCTGGATTGTTTGATTTCGACATAATACGAGGAAACGTCGAAGCCAAGAATTGTTGGTTGTGTTTTCTCATTTAAGTGGCAAATTATTTTGTGGTTTTGTTGTTGGTAGCACAAGTTCACAAATTTTGTATTAGGTTTAGCAAAGTGGAAGAACACTCCTTGTACTGTAATAGTAAGTAGTAGTCTGTGAATGTTTCAGGTAAAACATCAACCCTGTGCTATGGACTCAGTTTTCCTGTAATTTGTTATTTGTTGTAACATAAGAAAAGGGAAATATATGCTGTTGTTCAAGGAAGC
Coding sequence (CDS)
ATGGCGACAGAGCGCGAGGAGATTCGTTACGAAGGAGGAAGAGGTGGGAAGTTTCAGAAACGGCCTCTCAGAAGGTCGCATACGACGCCGTATGATCGGCCGCCGACTGCTCTGAGAAACTCTGCTGGAAACGGATGGCTCTCTAAGCTTGTCGATCCGGCGCAGAAGCTCATCACCTCCAGTGCACATAGGCTGTTTTCTAGCGTGTTTCGTAAACGTATTCCCCCTCCGCCGCCGTCATTGCCCGTTTCTCGAGGTTTTTTGCTTCTTCTTTTTTTGTTTGTTTCCTGGTGTTTTTTGGTTTTTAATTGGAATGCTCTCTTAACGATTGTTAGAATGGTTTATGGGTTTTGTTTCTCGCATTCTGGTACTCAGGCTAATGATGAGATGGGAAATGAGAACCATGAAGAAGTTGCAGCTGATCTTCCTGGAACTCAAGAAGGGACGAACCGCGATTTTGGTCCGAGTATCAAAACTGATAACACACATGGGGTGACTGACCTTGAGCAAATTTTGAAGGAGAAGACCTTTACAAGGCAAGTCCTTGATGAGGATGTTGCTTCACCTGCAGAGATTGCAAAAGCATACATGGGCAGCAGGCCTCCAAAATCAACTCCGTTGAGTATGGCGTCCCATAGTCACAAGTTTGGGGATAGTTTTGCTTCAGAAAATCTCTCGAAATCCTCTGCTTTAACTCTTGTGCCGAGATCTCCTGGAAATTTTGATGTTATTGAAAATGGTTTTGTCACCCCAAGATCTCGAGGCAGATCTGCTCTGTACAATATGGCTCGAATGCCATATTCCGGAGTTAGTGCAACCCATAGCATAAAGAATAGTGTAGCGACAACAGATGCTTACAGGGCAACAGGGTCCTCATCATCTCAGTTAGCATGGGAGCGAGGGAGAGTTCTGGGGTCTAAACAAGGGGCTTTAAAACGCAGAAGCTCAGTTTTAGATGATGAAATGGGACATGTTGGTCCTATTCGTAGACTTCGTCACAAATCCAATCTCCTTTACCCGACAGGTTTGAGTTTACCAAGCAGTTCCACTTCTATTCCAGTAAGTGGAATTGGTTCTGAGAATGCCCAGCAGTTTCAGTCTACAAAAGTGCATCCATTTTCATCTTCTTCTGGGAAGGGACTTTATTCGCGTAGTTTGTCCAAACGGTCTGCAGAGTCCGAAAATGATGTGAAACCTAGTTCAAGTTTTTCTCAAATTCCTCTCAGGTCAAGTGAGATGGCCTCAAAAATACTCGAGCAGCTTGAGAAATTGACCCCTCCAAAGGATAAGTCTTCGAAACTAAAGCTGCTTAGTGTGACGAATAACTCACCCACAAAGTTGTCGCCATCAATGTTGCATGGGCCAGCTCTTAGAAGCCTGGAAGATGTGGATTCATCTAAGTATTTGGAAAATGTTGAAGACATTCAGTCTAATGATGCCCGGGAGCTTACTTCTCAAAAGAACAACAAGGTTGAGGAAAGCAGTTCATTAAAATATAAATTACCCATTAATAAGGAAATTTCTGCAGGTGATGGATTAGGTTCTCCAGTTCCTACGAAGGACACCGTACTTAGTTCTCGTCCGCAAGTTTCATTCGTTGGTGCTTCCCCACAAACAAAATGTGCTTTCCAAATGAGTGCACATGAGGATTTTGTAGATATTGATGAAGAAGGATGTTCTAATGGGCCAGTGACTGATATATCGTTTGATAGGCGAGAAAAAATGGATGCCTCATTAGTGGCAATGAGCAAGCCGAGTGATACTGAAGCCATCACAGTAGATAAGCCTCAGGCTTCAGCTGAGGTTAAACCATCCACTGTATCTGAACTGAACAAAATAAATGGCCAGAGAAAATCTGATGTTCCTGTGACTGCTGAGAAGAGCCCCATCTTTTCCTTTGCTACAGCATCTCCACCTAGCATTACAACCAATGCAAAAGATCCTGAATCAACCTTGAGACCTGAAAAAAATATTTCACCTGAGGCACCAAAGCCCGCTAATGCTCCTATATTTGGCTTTGGAGATAAGCTACCATCACAGAAGGAATCATTTTCTTCTGCTCCCACCTTTGCGTTTGGAAACAAGTTTGCCCCCTCAACGAATGAACAAAATGCTGTTCCTGTTGCAACTTCTGAAAGCAATGTTGCACCAGGAAAAGCTACATTTCCTATTCCTGCAAATGCTGCCACAGAAAATGGAAATAAGAATACAGGGTCACCGTTTAAATTTGCATCGCCTTTAGTCAATGAAAAAGAAAGTGCTAAAGTGGGCAGCTCTTCGGTTTTTAAAGCCGAGAGTAATAGCAGCAGCATCCTGTCATTTGGAGTTCCAAAAGAGTCCATGTCAGACAAAGCTGGTGATAAGAGTTTGAGTGCCGGTCTCTCGGTTGGTACATCTGAGAATTTGTTTTCGTCATCTGTCTCGACATCGACATCAGCTCCCAGCTTATTTTCCTTCAGCTCCCCTAGCACTAATTCAAATCTTAATAATGGATCTCTTGTTTCTACCCCATCTATATTTTCCTCCCCAGCCACCACCTTTTCGAATAATATAACAAATCAAAATCCATCCATCAAACCGTCCCTCACTACTGCCCCTAGCAACAGCGAACCCGCCACCACTACTAGCCTTTCTATGCCTTCTCCTGTGCCATCTTTTTCAGCTGCACCTATTTTCAAATTTGGGAGCCCTAGTGTTCCTTCATCTTCCGCTCCAGCTCTATCAGCAGCGGAAACCAAGACCAAGCAAGAGACAACCTTTGGTAATTTAAGTGGCATTCCTCCAAGCGACACGTCTGCTGCTAAAGTATCTAGTACTGGAGGCAGCGTTTTTCAATTCGGAGCTGCAGCTACTACAGATTCTAATAAACGACCAGAAAATTCGACTTCTGCTCCAGGCAATGTCCCTACATTTGGTGCTCCGGTTTTTCCTGCTAATAGTGGGGTTGCATCTTCTACTCAGAGTACACCTGTTTTGCCATTCAGTTCATCATCTACATCATTTGGTTTGGCTGCGAATACGGGTTTGTCTTCTGGCAGTTCTCTGTTTGGTTCTTCAGCTCCAGCGTCGAATTTGTTCTCTTCTGGCACGACGTTTGGGCTTACTGGTACAAGTTCTTCTGCTAATAATTCTGTCAGCTCGGGTGCTGGTACCAGCTCTAGCTTCTTTAACTGGCAGACATCCTCCACGCCATCATTTTCCACTGGATTCAGCTCAACTCCAACTGGAGGGTTCCCTTTTGGTCTTTCATCTTCTTCCGCTGCTTCTAATAGTTCACCTATGCTCTTTGGGTCATCAACAACTAGTGCATCGACAACGTCAATGTTCTCATTTACTTCCGCTGCGTCGTCACAGCCTGCTTTCAGTAATTCTAATCATGGCTTCACTTTTGGTTCAACACCCCCTGCCAATAATGATCAAGCAAATATGGAGGATAGCATGGCTGAGGATACCGTCCAGGCAGTCACGCTGCCTACGCCTACTTTTGGGCAACAGCCCCTTACACCACCTCCATCATCAGGGTTTATGTTTGGTTCAGCGGCCCCTCCTCCGGTAGCAGCAAGTCCTTTCCAGTTCGGCAGCCAGCAAAATGCACCTACCCCACAAAATCCCTCTCCATTTCATGCTTCTGGTAGCTTAGACTTCAATGCCAGTGCTGGAGGGAGCTTCTCATTAGGCGCGGGTGGCGGTGACAAAGCGAACCGAAAATACGTGAAAGTTAAAAGCAAATCACGAAAGAAGTAG
Protein sequence
MATEREEIRYEGGRGGKFQKRPLRRSHTTPYDRPPTALRNSAGNGWLSKLVDPAQKLITSSAHRLFSSVFRKRIPPPPPSLPVSRGFLLLLFLFVSWCFLVFNWNALLTIVRMVYGFCFSHSGTQANDEMGNENHEEVAADLPGTQEGTNRDFGPSIKTDNTHGVTDLEQILKEKTFTRQVLDEDVASPAEIAKAYMGSRPPKSTPLSMASHSHKFGDSFASENLSKSSALTLVPRSPGNFDVIENGFVTPRSRGRSALYNMARMPYSGVSATHSIKNSVATTDAYRATGSSSSQLAWERGRVLGSKQGALKRRSSVLDDEMGHVGPIRRLRHKSNLLYPTGLSLPSSSTSIPVSGIGSENAQQFQSTKVHPFSSSSGKGLYSRSLSKRSAESENDVKPSSSFSQIPLRSSEMASKILEQLEKLTPPKDKSSKLKLLSVTNNSPTKLSPSMLHGPALRSLEDVDSSKYLENVEDIQSNDARELTSQKNNKVEESSSLKYKLPINKEISAGDGLGSPVPTKDTVLSSRPQVSFVGASPQTKCAFQMSAHEDFVDIDEEGCSNGPVTDISFDRREKMDASLVAMSKPSDTEAITVDKPQASAEVKPSTVSELNKINGQRKSDVPVTAEKSPIFSFATASPPSITTNAKDPESTLRPEKNISPEAPKPANAPIFGFGDKLPSQKESFSSAPTFAFGNKFAPSTNEQNAVPVATSESNVAPGKATFPIPANAATENGNKNTGSPFKFASPLVNEKESAKVGSSSVFKAESNSSSILSFGVPKESMSDKAGDKSLSAGLSVGTSENLFSSSVSTSTSAPSLFSFSSPSTNSNLNNGSLVSTPSIFSSPATTFSNNITNQNPSIKPSLTTAPSNSEPATTTSLSMPSPVPSFSAAPIFKFGSPSVPSSSAPALSAAETKTKQETTFGNLSGIPPSDTSAAKVSSTGGSVFQFGAAATTDSNKRPENSTSAPGNVPTFGAPVFPANSGVASSTQSTPVLPFSSSSTSFGLAANTGLSSGSSLFGSSAPASNLFSSGTTFGLTGTSSSANNSVSSGAGTSSSFFNWQTSSTPSFSTGFSSTPTGGFPFGLSSSSAASNSSPMLFGSSTTSASTTSMFSFTSAASSQPAFSNSNHGFTFGSTPPANNDQANMEDSMAEDTVQAVTLPTPTFGQQPLTPPPSSGFMFGSAAPPPVAASPFQFGSQQNAPTPQNPSPFHASGSLDFNASAGGSFSLGAGGGDKANRKYVKVKSKSRKK
Homology
BLAST of CmoCh04G026210 vs. ExPASy Swiss-Prot
Match:
Q9CAF4 (Nuclear pore complex protein NUP1 OS=Arabidopsis thaliana OX=3702 GN=NUP1 PE=1 SV=1)
HSP 1 Score: 332.0 bits (850), Expect = 2.8e-89
Identity = 455/1376 (33.07%), Postives = 643/1376 (46.73%), Query Frame = 0
Query: 13 GRGGKFQKRPLRRSHTTPYDRPPTALRNSA-------GNGWLSKLVDPAQKLITSSAHRL 72
G GGKF+K RRS TPYDRP T++RN+ G GWLSKLVDPAQ+LIT SA RL
Sbjct: 18 GTGGKFRKPTARRSQKTPYDRPTTSVRNAGLGGGDVRGGGWLSKLVDPAQRLITYSAQRL 77
Query: 73 FSSVFRKRI---------PPPPPSLP---------VSRGFLLLLFLFVSWCFLVFNWNA- 132
F S+ RKR+ P LP V + + + + NA
Sbjct: 78 FGSLSRKRLGSGETPLQSPEQQKQLPERGVNQETKVGHKEDVSNLSMKNGLIRMEDTNAS 137
Query: 133 -------LLTIVRMVYGFCFSHSGTQANDEMGNENHEEVAADLPGTQEGTNRDFG----- 192
+ +++ G F+ S + AAD E + G
Sbjct: 138 VDPPKDGFTDLEKILQGKTFTRSEVDRLTTLLRSK----AADSSTMNEEQRNEVGMVVRH 197
Query: 193 -PSIKTDNTHGVT-DLEQILKEKTFTRQVLDEDVASPAEIAKAYMGSRPPKSTP--LSMA 252
PS + D TH + ++ + + LDE +ASPA++AKAYMGSRP + TP L +
Sbjct: 198 PPSHERDRTHPDNGSMNTLVSTPPGSLRTLDECIASPAQLAKAYMGSRPSEVTPSMLGLR 257
Query: 253 SHSHKFGDSFASEN--LSKSSALTLVPRSPGNFDVIENGFVTPRSRGRSALYNMARMPYS 312
+ + F + KS ++LV + P +ENGFVTPRSRGRSA+Y+MAR PYS
Sbjct: 258 GQAGREDSVFLNRTPFPQKSPTMSLVTK-PSGQRPLENGFVTPRSRGRSAVYSMARTPYS 317
Query: 313 GVSATHSIKNSVATTDAYRATGSSSSQLAWERGRVLGSKQG---ALKRRSSVLDDEMGHV 372
++SV ++A+ S WE GS+QG LKRRSSVLD+++G V
Sbjct: 318 ------RPQSSVKIGSLFQASPSK-----WEESLPSGSRQGFQSGLKRRSSVLDNDIGSV 377
Query: 373 GPIRRLRHKSNLLYPTGLSLPSSSTSIPVSGIGSENAQQFQSTKVHPFSSSSGKGLYSRS 432
GP+RR+R KSN L S S ++PVS P S + G +
Sbjct: 378 GPVRRIRQKSN--------LSSRSLALPVS--------------ESPLSVRANGGEKTTH 437
Query: 433 LSKRSAESENDVKPSSSFSQIPLRSSEMASKILEQLEKLTPPKDKSSKLKLLSVTNNSPT 492
SK SAE D+ P SSF+ +P +SSEMASKIL+QL+KL ++K SP+
Sbjct: 438 TSKDSAE---DI-PGSSFNLVPTKSSEMASKILQQLDKLVSTREK------------SPS 497
Query: 493 KLSPSMLHGPALRSLEDVDSSKYLENVEDIQSNDARELTSQKNNKVEESSSLKYKLPINK 552
KLSPSML GPAL+SL++V++ K+L N+ + ++N + + + QK ES S + K
Sbjct: 498 KLSPSMLRGPALKSLQNVEAPKFLGNLPEKKAN-SPDSSYQKQEISRESVSREVLAQSEK 557
Query: 553 EISAGDGLGSPVPTKDTVLSSR----PQVSFVGASPQTKCAFQMSAHEDFVDIDEE-GCS 612
A DG +KD + + P + + P K +F+MSAHEDF+++D++ G +
Sbjct: 558 TGDAVDGTSKTGSSKDQDMRGKGVYMPLTNSLEEHPPKKRSFRMSAHEDFLELDDDLGAA 617
Query: 613 NGPVTDISFDRREKMDASLVAMSKPSDTEAITVDKPQASAEVKPSTVSELNKINGQRKSD 672
+ P +++ S ++M +KP +E PST N Q S+
Sbjct: 618 STPCEVAEKQNAFEVEKSHISMP--------IGEKPLTPSEAMPSTSYISNGDASQGTSN 677
Query: 673 VPVTAEKSPIFSFA--------TASPPS---ITTNAKDPESTLRP---EKNISPEAPKPA 732
+ E++ +F AS P+ I K S+ +P EK I E PK
Sbjct: 678 GSLETERNKFVAFPIEAVQQSNMASEPTSKFIQGTEKSSISSGKPTSEEKRIPLEEPKKP 737
Query: 733 NAPIFGFGDKLPSQ-------------KESFSSAPTFAFGNKFAPSTNEQ----NAVPVA 792
A P+ K +S+ F +A T + N+ A
Sbjct: 738 AAVFPNISFSPPATGLLNQNSGASADIKLEKTSSTAFGVSEAWAKPTESKKTFSNSASGA 797
Query: 793 TSESNVAP--GKATFPIPANAAT---ENGNKNTGSPFKFASPLVNEKESAKVGS--SSV- 852
S ++ AP + F ANA T NG+ T SP F + N VG S+V
Sbjct: 798 ESSTSAAPTLNGSIFSAGANAVTPPPSNGSL-TSSP-SFPPSISNIPSDNSVGDMPSTVQ 857
Query: 853 -FKAESNSSSILSFGVPKESMSDKAGDKSLSAGLSVGTSENLFSSSVSTSTSAPSLFSFS 912
F A NSSSI FG K S+ + +S SA TS F + ++ S
Sbjct: 858 SFAATHNSSSI--FG--KLPTSNDSNSQSTSASPLSSTSPFKFGQPAAPFSAPAVSESSG 917
Query: 913 SPSTNSNLNNGSLVSTPSI-FSSPATTFSNNITNQNPSIKPSLTTAPSNSEP-------- 972
S + + N + +T + F A+ +Q+ I +A + S P
Sbjct: 918 QISKETEVKNATFGNTSTFKFGGMAS------ADQSTGIVFGAKSAENKSRPGFVFGSSS 977
Query: 973 ----ATTTSLSMPSPVPSFSAAPIFKFGSPSVPSSSAPALSAAETKTKQETTFGNLSGIP 1032
+T + + P S + IF S S P + +SA+ T + S
Sbjct: 978 VVGGSTLNPSTAAASAPESSGSLIFGVTSSSTPGTETSKISASSAATNTGNSVFGTSSFA 1037
Query: 1033 PSDTSAAKV----SSTGGSVFQFGA---AATTDSNKRPENSTSAPGNVPTFGAPVFPANS 1092
+ + ++ V +STG SVF F A A+ T S + N A GN T S
Sbjct: 1038 FTSSGSSMVGGVSASTGSSVFGFNAVSSASATSSQSQASNLFGA-GNAQTGN-----TGS 1097
Query: 1093 GVASSTQSTPV-LPFSSSSTSFGLAANTGLSSGSSLFGSSAPASNLFSSGTTFGLTGTSS 1152
G +STQS P S S+ SFGL+ N+ L+S SS FG S +F+S +T L+ T+S
Sbjct: 1098 GTTTSTQSIPFQFGSSPSAPSFGLSGNSSLASNSSPFGFSKSEPAVFTSVSTPQLSSTNS 1157
Query: 1153 SANNSVSSGA---GTSSSFFNWQTSSTPSFSTGF--SSTPTGGFPFGLSSSSAASNSSPM 1212
SA++S + + GTS N +S P FS+ F SSTPT F FG SS++ S+++
Sbjct: 1158 SASSSSTMSSPLFGTSWQAPNSSPNSGPVFSSSFTTSSTPT-TFSFGGSSAATVSSTTTP 1217
Query: 1213 LFGSSTTSA-STTSMFSF--------------TSAASSQPAFSNSNHGFTFGSTPPA--- 1248
+FG+ST + S + +F F S SQ F NS GF FG+
Sbjct: 1218 IFGASTNNTPSPSPIFGFGSTPPTTPQQPVFGNSGTPSQSLFGNSTPGFAFGAPNNGNGI 1277
BLAST of CmoCh04G026210 vs. ExPASy TrEMBL
Match:
A0A6J1GY88 (nuclear pore complex protein NUP1-like OS=Cucurbita moschata OX=3662 GN=LOC111457932 PE=4 SV=1)
HSP 1 Score: 2167.9 bits (5616), Expect = 0.0e+00
Identity = 1206/1311 (91.99%), Postives = 1207/1311 (92.07%), Query Frame = 0
Query: 1 MATEREEIRYEGGRGGKFQKRPLRRSHTTPYDRPPTALRNSAGNGWLSKLVDPAQKLITS 60
MATEREEIRYEGGRGGKFQKRPLRRSHTTPYDRPPTALRNSAGNGWLSKLVDPAQKLITS
Sbjct: 1 MATEREEIRYEGGRGGKFQKRPLRRSHTTPYDRPPTALRNSAGNGWLSKLVDPAQKLITS 60
Query: 61 SAHRLFSSVFRKRIPPPPPSLPVSRGFLLLLFLFVSWCFLVFNWNALLTIVRMVYGFCFS 120
SAHRLFSSVFRKRIPPPPPSLPVSR
Sbjct: 61 SAHRLFSSVFRKRIPPPPPSLPVSR----------------------------------- 120
Query: 121 HSGTQANDEMGNENHEEVAADLPGTQEGTNRDFGPSIKTDNTHGVTDLEQILKEKTFTR- 180
+ANDEMGNENHEEVAADLPGTQEGTNRDFGPSIKTDNTHGVTDLEQILKEKTFTR
Sbjct: 121 ----EANDEMGNENHEEVAADLPGTQEGTNRDFGPSIKTDNTHGVTDLEQILKEKTFTRF 180
Query: 181 ------------------------------------------------------------ 240
Sbjct: 181 EIDRLTELLKSRVVDVPSGAEERKFEQAPSTPVISYEIQEGSPKFRAQDGISSHVVPTQV 240
Query: 241 ---QVLDEDVASPAEIAKAYMGSRPPKSTPLSMASHSHKFGDSFASENLSKSSALTLVPR 300
VLDEDVASPAEIAKAYMGSRPPKSTPLSMASHSHKFGDSFASENLSKSSALTLVPR
Sbjct: 241 MRANVLDEDVASPAEIAKAYMGSRPPKSTPLSMASHSHKFGDSFASENLSKSSALTLVPR 300
Query: 301 SPGNFDVIENGFVTPRSRGRSALYNMARMPYSGVSATHSIKNSVATTDAYRATGSSSSQL 360
SPGNFDVIENGFVTPRSRGRSALYNMARMPYSGVSATHSIKNSVATTDAYRATGSSSSQL
Sbjct: 301 SPGNFDVIENGFVTPRSRGRSALYNMARMPYSGVSATHSIKNSVATTDAYRATGSSSSQL 360
Query: 361 AWERGRVLGSKQGALKRRSSVLDDEMGHVGPIRRLRHKSNLLYPTGLSLPSSSTSIPVSG 420
AWERGRVLGSKQGALKRRSSVLDDEMGHVGPIRRLRHKSNLLYPTGLSLPSSSTSIPVSG
Sbjct: 361 AWERGRVLGSKQGALKRRSSVLDDEMGHVGPIRRLRHKSNLLYPTGLSLPSSSTSIPVSG 420
Query: 421 IGSENAQQFQSTKVHPFSSSSGKGLYSRSLSKRSAESENDVKPSSSFSQIPLRSSEMASK 480
IGSENAQQFQSTKVHPFSSSSGKGLYSRSLSKRSAESENDVKPSSSFSQIPLRSSEMASK
Sbjct: 421 IGSENAQQFQSTKVHPFSSSSGKGLYSRSLSKRSAESENDVKPSSSFSQIPLRSSEMASK 480
Query: 481 ILEQLEKLTPPKDKSSKLKLLSVTNNSPTKLSPSMLHGPALRSLEDVDSSKYLENVEDIQ 540
ILEQLEKLTPPKDKSSKLKLLSVTNNSPTKLSPSMLHGPALRSLEDVDSSKYLENVEDIQ
Sbjct: 481 ILEQLEKLTPPKDKSSKLKLLSVTNNSPTKLSPSMLHGPALRSLEDVDSSKYLENVEDIQ 540
Query: 541 SNDARELTSQKNNKVEESSSLKYKLPINKEISAGDGLGSPVPTKDTVLSSRPQVSFVGAS 600
SNDARELTSQKNNKVEESSSLKYKLPINKEISAGDGLGSPVPTKDTVLSSRPQVSFVGAS
Sbjct: 541 SNDARELTSQKNNKVEESSSLKYKLPINKEISAGDGLGSPVPTKDTVLSSRPQVSFVGAS 600
Query: 601 PQTKCAFQMSAHEDFVDIDEEGCSNGPVTDISFDRREKMDASLVAMSKPSDTEAITVDKP 660
PQTKCAFQMSAHEDFVDIDEEGCSNGPVTDISFDRREKMDASLVAMSKPSDTEAITVDKP
Sbjct: 601 PQTKCAFQMSAHEDFVDIDEEGCSNGPVTDISFDRREKMDASLVAMSKPSDTEAITVDKP 660
Query: 661 QASAEVKPSTVSELNKINGQRKSDVPVTAEKSPIFSFATASPPSITTNAKDPESTLRPEK 720
QASAEVKPSTVSELNKINGQRKSDVPVTAEKSPIFSFATASPPSITTNAKDPESTLRPEK
Sbjct: 661 QASAEVKPSTVSELNKINGQRKSDVPVTAEKSPIFSFATASPPSITTNAKDPESTLRPEK 720
Query: 721 NISPEAPKPANAPIFGFGDKLPSQKESFSSAPTFAFGNKFAPSTNEQNAVPVATSESNVA 780
NISPEAPKPANAPIFGFGDKLPSQKESFSSAPTFAFGNKFAPSTNEQNAVPVATSESNVA
Sbjct: 721 NISPEAPKPANAPIFGFGDKLPSQKESFSSAPTFAFGNKFAPSTNEQNAVPVATSESNVA 780
Query: 781 PGKATFPIPANAATENGNKNTGSPFKFASPLVNEKESAKVGSSSVFKAESNSSSILSFGV 840
PGKATFPIPANAATENGNKNTGSPFKFASPLVNEKESAKVGSSSVFKAESNSSSILSFGV
Sbjct: 781 PGKATFPIPANAATENGNKNTGSPFKFASPLVNEKESAKVGSSSVFKAESNSSSILSFGV 840
Query: 841 PKESMSDKAGDKSLSAGLSVGTSENLFSSSVSTSTSAPSLFSFSSPSTNSNLNNGSLVST 900
PKESMSDKAGDKSLSAGLSVGTSENLFSSSVSTSTSAPSLFSFSSPSTNSNLNNGSLVST
Sbjct: 841 PKESMSDKAGDKSLSAGLSVGTSENLFSSSVSTSTSAPSLFSFSSPSTNSNLNNGSLVST 900
Query: 901 PSIFSSPATTFSNNITNQNPSIKPSLTTAPSNSEPATTTSLSMPSPVPSFSAAPIFKFGS 960
PSIFSSPATTFSNNITNQNPSIKPSLTTAPSNSEPATTTSLSMPSPVPSFSAAPIFKFGS
Sbjct: 901 PSIFSSPATTFSNNITNQNPSIKPSLTTAPSNSEPATTTSLSMPSPVPSFSAAPIFKFGS 960
Query: 961 PSVPSSSAPALSAAETKTKQETTFGNLSGIPPSDTSAAKVSSTGGSVFQFGAAATTDSNK 1020
PSVPSSSAPALSAAETKTKQETTFGNLSGIPPSDTSAAKVSSTGGSVFQFGAAATTDSNK
Sbjct: 961 PSVPSSSAPALSAAETKTKQETTFGNLSGIPPSDTSAAKVSSTGGSVFQFGAAATTDSNK 1020
Query: 1021 RPENSTSAPGNVPTFGAPVFPANSGVASSTQSTPVLPFSSSSTSFGLAANTGLSSGSSLF 1080
RPENSTSAPGNVPTFGAPVFPANSGVASSTQSTPVLPFSSSSTSFGLAANTGLSSGSSLF
Sbjct: 1021 RPENSTSAPGNVPTFGAPVFPANSGVASSTQSTPVLPFSSSSTSFGLAANTGLSSGSSLF 1080
Query: 1081 GSSAPASNLFSSGTTFGLTGTSSSANNSVSSGAGTSSSFFNWQTSSTPSFSTGFSSTPTG 1140
GSSAPASNLFSSGTTFGLTGTSSSANNSVSSGAGTSSSFFNWQTSSTPSFSTGFSSTPTG
Sbjct: 1081 GSSAPASNLFSSGTTFGLTGTSSSANNSVSSGAGTSSSFFNWQTSSTPSFSTGFSSTPTG 1140
Query: 1141 GFPFGLSSSSAASNSSPMLFGSSTTSASTTSMFSFTSAASSQPAFSNSNHGFTFGSTPPA 1200
GFPFGLSSSSAASNSSPMLFGSSTTSASTTSMFSFTSAASSQPAFSNSNHGFTFGSTPPA
Sbjct: 1141 GFPFGLSSSSAASNSSPMLFGSSTTSASTTSMFSFTSAASSQPAFSNSNHGFTFGSTPPA 1200
Query: 1201 NNDQANMEDSMAEDTVQAVTLPTPTFGQQPLTPPPSSGFMFGSAAPPPVAASPFQFGSQQ 1248
NNDQANMEDSMAEDTVQAVTLPTPTFGQQPLTPPPSSGFMFGSAAPPPVAASPFQFGSQQ
Sbjct: 1201 NNDQANMEDSMAEDTVQAVTLPTPTFGQQPLTPPPSSGFMFGSAAPPPVAASPFQFGSQQ 1260
BLAST of CmoCh04G026210 vs. ExPASy TrEMBL
Match:
A0A6J1JSL2 (nuclear pore complex protein NUP1-like OS=Cucurbita maxima OX=3661 GN=LOC111487168 PE=4 SV=1)
HSP 1 Score: 2072.7 bits (5369), Expect = 0.0e+00
Identity = 1171/1314 (89.12%), Postives = 1179/1314 (89.73%), Query Frame = 0
Query: 1 MATEREEIRYEGGRGGKFQKRPLRRSHTTPYDRPPTALRNSAGNGWLSKLVDPAQKLITS 60
MATEREEIRYEGGRGGKFQKRPLRRSHTTPYDRPPTALRNSAGNGWLSKLVDPAQKLITS
Sbjct: 1 MATEREEIRYEGGRGGKFQKRPLRRSHTTPYDRPPTALRNSAGNGWLSKLVDPAQKLITS 60
Query: 61 SAHRLFSSVFRKRIPPPPPSLPVSRGFLLLLFLFVSWCFLVFNWNALLTIVRMVYGFCFS 120
SAHRLFSSVFRKRIPPPPPSLPVSR
Sbjct: 61 SAHRLFSSVFRKRIPPPPPSLPVSR----------------------------------- 120
Query: 121 HSGTQANDEMGNENHEEVAADLPGTQEGTNRDFGPSIKTDNTHGVTDLEQILKEKTFTR- 180
+ANDEMGNENHEEVAADL GTQEG N DFGPSIKTDNTHGVTDLEQILKEKTFTR
Sbjct: 121 ----EANDEMGNENHEEVAADLSGTQEG-NCDFGPSIKTDNTHGVTDLEQILKEKTFTRF 180
Query: 181 ------------------------------------------------------------ 240
Sbjct: 181 EIDRLTELLKSRVVDVPSGVEERKFEHASSTPVISYEIQEGSPKFRAQDGISSHVVPTQV 240
Query: 241 ---QVLDEDVASPAEIAKAYMGSRPPKSTPLSMASHSHKFGDSFASENLSKSSALTLVPR 300
VLDEDVASPAEIAKA+MGSRPPK+TPLSMASHSHKF D+FASENLSKSSALTLVPR
Sbjct: 241 MRANVLDEDVASPAEIAKAFMGSRPPKATPLSMASHSHKFRDNFASENLSKSSALTLVPR 300
Query: 301 SPGNFDVIENGFVTPRSRGRSALYNMARMPYSGVSATHSIKNSVATTDAYRATGSSSSQL 360
SPGNFDVIENGFVTPRSRGRSALYNMARMPYSGVSATHSIKNSVATTDAYRATGSSSSQL
Sbjct: 301 SPGNFDVIENGFVTPRSRGRSALYNMARMPYSGVSATHSIKNSVATTDAYRATGSSSSQL 360
Query: 361 AWERGRVLGSKQGALKRRSSVLDDEMGHVGPIRRLRHKSNLLYPTGLSLPSSSTSIPVSG 420
A ERGRVLGSKQGALKRRSSVLDDEMGHVGPIRRLRHKSNLLYPTGLSLPSSSTSIPVSG
Sbjct: 361 ALERGRVLGSKQGALKRRSSVLDDEMGHVGPIRRLRHKSNLLYPTGLSLPSSSTSIPVSG 420
Query: 421 IGSENAQQFQSTKVHPFSSSSGKGLYSRSLSKRSAESENDVKPSSSFSQIPLRSSEMASK 480
IGSENAQQFQSTKVHPF SSSGKGLYSRSLSKRSAESENDVKPSSSFSQIPLRSSEMASK
Sbjct: 421 IGSENAQQFQSTKVHPF-SSSGKGLYSRSLSKRSAESENDVKPSSSFSQIPLRSSEMASK 480
Query: 481 ILEQLEKLTPPKDKSSKLKLLSVTNNSPTKLSPSMLHGPALRSLEDVDSSKYLENVEDIQ 540
ILEQLEKLTPPKDKS KLKLLSVTNNSPTKLSPSMLHGPALRSLEDVDSSKYLENVEDIQ
Sbjct: 481 ILEQLEKLTPPKDKSLKLKLLSVTNNSPTKLSPSMLHGPALRSLEDVDSSKYLENVEDIQ 540
Query: 541 SNDARELTSQKNNKVEESSSLKYKLPINKEISAGDGLGSPVPTKDTVLSSRPQVSFVGAS 600
SNDARELTSQK NKVEESSSLKYKLPINK ISAGDGLGSPVPTKDTVLSSRPQVSFVGAS
Sbjct: 541 SNDARELTSQK-NKVEESSSLKYKLPINKAISAGDGLGSPVPTKDTVLSSRPQVSFVGAS 600
Query: 601 PQTKCAFQMSAHEDFVDIDEEGCSNGPVTDISFDRREKMDASLVAMSKPSDTEAITVDKP 660
PQTKCAFQMSAHEDFVDIDEEGCSNGPVTDISFDRREKMD SLVAMSKP+DTEAITVDKP
Sbjct: 601 PQTKCAFQMSAHEDFVDIDEEGCSNGPVTDISFDRREKMDGSLVAMSKPNDTEAITVDKP 660
Query: 661 QASAEVKPSTVSELNKINGQRKSDVPVTAEKSPIFSFATASPPSITTNAKDPESTLRPEK 720
QASAEVKPST SELNKINGQ KSDVPVTAEKSPI SFATASPPSIT NAKDPES LRPEK
Sbjct: 661 QASAEVKPSTESELNKINGQSKSDVPVTAEKSPILSFATASPPSITANAKDPESILRPEK 720
Query: 721 NISPEAPKPANAPIFGFGDKLPSQKESFSSAPTFAFGNKFAPSTNEQNAVPVATSESNVA 780
NIS EAPK ANAPIFGFGDKLPSQKESFSSAPTFAFGNKFAPSTNEQNAVPVATSESNVA
Sbjct: 721 NISSEAPKAANAPIFGFGDKLPSQKESFSSAPTFAFGNKFAPSTNEQNAVPVATSESNVA 780
Query: 781 PGKATFPIPANAATENGNKNTGSPFKFASPLVNEKESAKVGSSSVFKAESNSSSILSFGV 840
PGKATFPIPANAATENGNKNTG+PFKFASPLVNEKES KVGSSSVFKAESNS SILSFG
Sbjct: 781 PGKATFPIPANAATENGNKNTGAPFKFASPLVNEKESTKVGSSSVFKAESNSGSILSFGF 840
Query: 841 PKESMSDKAGDKSLSAGLSVGTSENLFSSSVSTSTSAPSLFSFSSPSTNSNLNNGSLVST 900
P+ESMSDKAGDK SAGLSVGTSENLFSSSVSTSTSAPSLFSFSSPSTNSNLNNGSLVST
Sbjct: 841 PQESMSDKAGDKRSSAGLSVGTSENLFSSSVSTSTSAPSLFSFSSPSTNSNLNNGSLVST 900
Query: 901 PSIFSSPATTFSNNITNQNPSIKPSLTTAPSNSEPATTTSLSMPSPVPSFSAAPIFKFGS 960
PSIFSSPATTFSNNITNQNPSIKPSLTTAPSNSEPATTTSLSMPSPVPSFSAAPIFKFGS
Sbjct: 901 PSIFSSPATTFSNNITNQNPSIKPSLTTAPSNSEPATTTSLSMPSPVPSFSAAPIFKFGS 960
Query: 961 PSVPSSSAPALSAAETKTKQETTFGNLSGIPPSDTSAAKVSSTGGSVFQFGAAATTDSNK 1020
PSVPSSSAPALSAAETKTKQETTFGNLSGIPPSDTSAAKVSS GGSVFQFGAAATTDSNK
Sbjct: 961 PSVPSSSAPALSAAETKTKQETTFGNLSGIPPSDTSAAKVSSIGGSVFQFGAAATTDSNK 1020
Query: 1021 RPENSTSAPGNVPTFGAPVFPANSGVASSTQSTPVLPFSSSSTSFGLAANTGLSSGSSLF 1080
+PEN TSAPGNVPTFGAPVFPANSGVASSTQSTPVLPFSSSSTSFGLAANTGLSSGSSLF
Sbjct: 1021 QPENLTSAPGNVPTFGAPVFPANSGVASSTQSTPVLPFSSSSTSFGLAANTGLSSGSSLF 1080
Query: 1081 GSSAPASNLFSSGTTFGLTGTSSSANNSVSSGAGTSSSFFNWQTSSTPSFSTGFSSTPTG 1140
GSSAPASNLFSSGTTFGLTGTSSSANNSVSSGAGTSSSFFNWQTSSTPSFSTGFSSTPTG
Sbjct: 1081 GSSAPASNLFSSGTTFGLTGTSSSANNSVSSGAGTSSSFFNWQTSSTPSFSTGFSSTPTG 1140
Query: 1141 GFPFGLSSSSAASNSSPMLFGSSTTSASTTSMFSFTSAASSQPAFSNSNHGFTFGSTPPA 1200
GF FGLSSSSAASNSSPMLFGSSTTSASTTSMFSFTS ASSQPAF NSNHGFTFGSTP A
Sbjct: 1141 GFSFGLSSSSAASNSSPMLFGSSTTSASTTSMFSFTSTASSQPAFGNSNHGFTFGSTPLA 1200
Query: 1201 NNDQANMEDSMAEDTVQAVTLPTPTFG---QQPLTPPPSSGFMFGSAAPPPVAASPFQFG 1248
NNDQANMEDSMAEDTVQAVTLPTPTFG QQPLTPPPSSGFMFGSAAPPPVAASPFQFG
Sbjct: 1201 NNDQANMEDSMAEDTVQAVTLPTPTFGQQPQQPLTPPPSSGFMFGSAAPPPVAASPFQFG 1260
BLAST of CmoCh04G026210 vs. ExPASy TrEMBL
Match:
A0A6J1FKS9 (nuclear pore complex protein NUP1-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111444902 PE=4 SV=1)
HSP 1 Score: 1613.6 bits (4177), Expect = 0.0e+00
Identity = 962/1339 (71.84%), Postives = 1046/1339 (78.12%), Query Frame = 0
Query: 1 MATEREEIRYEGGRGGKFQKRPLRRSHTTPYDRPPTALRNSAGNGWLSKLVDPAQKLITS 60
MATERE + YEGGRGGKFQKRPLRRSHTTPYDRPP ALRNS G GWLSKLVDPAQKLITS
Sbjct: 1 MATEREGVHYEGGRGGKFQKRPLRRSHTTPYDRPPIALRNSNGKGWLSKLVDPAQKLITS 60
Query: 61 SAHRLFSSVFRKRIPPPPPSLPVSRGFLLLLFLFVSWCFLVFNWNALLTIVRMVYGFCFS 120
SAHRLFSSVFRKR+PPPPPSLP+SR
Sbjct: 61 SAHRLFSSVFRKRLPPPPPSLPISR----------------------------------- 120
Query: 121 HSGTQANDEMGNENHEEVAADLPGTQEGTNRDFGPSIKTDNTHGVTDLEQILKEKTFTR- 180
+ANDEM +N EEVAAD PGTQEGTN DF PSI ++N HGV+DLE+ILKEKTFTR
Sbjct: 121 ----EANDEMEIKNQEEVAADPPGTQEGTNNDFVPSINSNNIHGVSDLEKILKEKTFTRF 180
Query: 181 ------------------------------------------------------------ 240
Sbjct: 181 EIDRLTELLKSRVADVPSGVESRKFEMVSSTPVISYDIQEGSPKFPAQEGVRPHMVPTHV 240
Query: 241 ---QVLDEDVASPAEIAKAYMGSRPPKSTPLSMASHSHKFGDSFASENLSKSSALTLVPR 300
V DEDVASPAEIAKA+MGSRPPK+TPLSM +HS KFGD+FA N SKSS L+LVPR
Sbjct: 241 LNANVPDEDVASPAEIAKAFMGSRPPKATPLSMVAHSQKFGDTFALGNPSKSSTLSLVPR 300
Query: 301 SPGNFDVIENGFVTPRSRGRSALYNMARMPYSGVSATHSIKNSVATTDAYRATGSSSSQL 360
SPGNFDV EN FVTPRSRGRSALY+MARMPYS V AT SIKNSVATTD+YRAT +SSSQ
Sbjct: 301 SPGNFDV-ENDFVTPRSRGRSALYSMARMPYSRVRATPSIKNSVATTDSYRATVTSSSQS 360
Query: 361 AWERGRVLGSKQGALKRRSSVLDDEMGHVGPIRRLRHKSNLLYPTGLSLPSSSTSIPVSG 420
AWE+GR+L S QGALKRRSSVLDDE+G VGPIRR+RHKSNLL+P GLSLPSSSTSIPVSG
Sbjct: 361 AWEQGRLLESNQGALKRRSSVLDDEIGSVGPIRRIRHKSNLLFPKGLSLPSSSTSIPVSG 420
Query: 421 IGSENAQQFQSTKVHPFSSSSGKGLYS----RSLSKRSAESENDVKPSSSFSQIPLRSSE 480
IGSE +Q QSTKVHPFSS +GK YS R+LSK SAESEND PSSSF QIPLRSSE
Sbjct: 421 IGSETSQHLQSTKVHPFSSPAGKAPYSSETKRNLSKMSAESENDRTPSSSFPQIPLRSSE 480
Query: 481 MASKILEQLEKLTPPKDKSSKLKLLSVTNNSPTKLSPSMLHGPALRSLEDVDSSKYLENV 540
MA KILEQL+KLTPPK+KSS+LKL SV NNSP KLSPSMLHGPALRSLEDVDS+KYLENV
Sbjct: 481 MALKILEQLDKLTPPKEKSSELKLHSVRNNSPMKLSPSMLHGPALRSLEDVDSAKYLENV 540
Query: 541 EDIQSNDARELTSQKNNKVEESSSLKYKLPINKEISAGDGLGSPVPTKDTVLSSRPQVSF 600
EDI+SND R+LTS+K +K E+SS LK K+P +K IS G G+GS VP+KDTV SS QVSF
Sbjct: 541 EDIRSNDGRDLTSKKKDKFEDSSLLKSKVPSDKSISTGGGVGSSVPSKDTVSSSGLQVSF 600
Query: 601 VGASPQTKCAFQMSAHEDFVDIDEEGCSNGPVTDISFDRREKMDASLVAMSKPSDTEAIT 660
VG S TKCAFQMS EDFVD+D+E SNGPV+ SF+RREK+D SLVA+ KPSDTEAIT
Sbjct: 601 VGPSSLTKCAFQMSVQEDFVDMDDEEYSNGPVSAKSFERREKVDDSLVAVGKPSDTEAIT 660
Query: 661 VDKPQASAEVKPSTVSELNKINGQRKSDVPVTAEKSPIFSFATASPPSITTNAKDPESTL 720
VDKPQAS + KPS VSE+ KIN Q KSDVPVT EKS IFSF TASP S T N +PEST
Sbjct: 661 VDKPQASIQAKPSPVSEMKKINDQAKSDVPVTTEKSSIFSFPTASPSSTTANVIEPESTT 720
Query: 721 RPEKNISPEAPKPANAPIFGFGDKLPSQKESFSSAPTFAFGNKFAPSTNEQNAVPVATSE 780
RPEK S E PK A APIFGFG+KLPSQK+ S+PTF FGNK STNEQNAVP TSE
Sbjct: 721 RPEKIASSEVPKAAAAPIFGFGEKLPSQKDPVFSSPTFTFGNKVTTSTNEQNAVPAVTSE 780
Query: 781 SNVAP-------------GKATFPIPANAATENGNKNTGSPFKFASPLVNEKESAKVGSS 840
NVAP KATFPIPA+ ATENGN GSPFKFAS LVNEKE AK GS+
Sbjct: 781 GNVAPTLQASAPTTFKFGDKATFPIPASTATENGNSEAGSPFKFASSLVNEKEGAKAGSA 840
Query: 841 SVFKAESNSSSILSFGVPKESMSDKAGD-KSLSAGLSVGTSENLFSSSVSTSTSAPSLFS 900
SVFK+ES+SSS LSFGVPKESMS+KAGD KS SAGLSVGTS NL SSVS ST PSLFS
Sbjct: 841 SVFKSESSSSSTLSFGVPKESMSEKAGDKKSSSAGLSVGTSGNLLLSSVS-STPTPSLFS 900
Query: 901 FSSPSTNSNLNNGSLVSTPSIFSSPATTFSNNITNQNPSIKPSLTTAPSNSEPATTTSLS 960
FSSP+TNSNL NGSL STPS F SP+ TF +NITNQN SIKPSL A SNSEP TTTSLS
Sbjct: 901 FSSPTTNSNLINGSLGSTPSTFPSPSNTFPSNITNQNSSIKPSLNAATSNSEPVTTTSLS 960
Query: 961 MPSPVPSFSAAPIFKFGSPSVPSSSAPA-LSAAETKTKQETT-FGNLSGIPPSDTSAAKV 1020
SP+PSFSAAPIFKFGS SVPSSSAP+ + + ETKTKQETT FGN+SGI PSDTSAAKV
Sbjct: 961 TSSPMPSFSAAPIFKFGSSSVPSSSAPSGVGSVETKTKQETTPFGNVSGISPSDTSAAKV 1020
Query: 1021 SSTGGSVFQFGAAATT-DSNKRPENSTSAPGNVPTFGAPVFPANSGVASSTQSTPVLPFS 1080
STG SVFQFGAA+TT DSNK+PE ST AP +VP+FGAPV PA+SGVASSTQSTPV PFS
Sbjct: 1021 FSTGSSVFQFGAASTTSDSNKQPEKSTFAPVSVPSFGAPVLPASSGVASSTQSTPVSPFS 1080
Query: 1081 SSSTSFGLAANTGLSSGSSLFGSSAPASNLFSSGTTFGLTGTSSSANNSVSSGAGTSSSF 1140
SSSTSFGL NTGL+SG+SL GSSAPASNLF+SG TFG G+SSSANNSVSSGAGTSSSF
Sbjct: 1081 SSSTSFGLTGNTGLASGNSLVGSSAPASNLFTSGATFGF-GSSSSANNSVSSGAGTSSSF 1140
Query: 1141 FNWQTSSTPSFSTGFSSTPTGGFPFGLSSSSAASNSSPMLFGSSTT-SASTTSMFSFTSA 1200
FNWQ SS PSFS+GF STPTGGF FGL+SSSAAS+SSPMLFGSSTT +ASTTSMFSFTSA
Sbjct: 1141 FNWQASSAPSFSSGFGSTPTGGFSFGLASSSAASSSSPMLFGSSTTGAASTTSMFSFTSA 1200
Query: 1201 AS---SQPAFSNSNHGFTFGSTPPANNDQANMEDSMAEDTVQAVTLPT--PTFGQQPLTP 1248
A+ SQPAF SNHGFTFGSTPPANND ANMEDSMAEDTVQ V PT P+FGQQPLTP
Sbjct: 1201 ATAAPSQPAFGTSNHGFTFGSTPPANNDHANMEDSMAEDTVQTVASPTPMPSFGQQPLTP 1260
BLAST of CmoCh04G026210 vs. ExPASy TrEMBL
Match:
A0A6J1FF52 (nuclear pore complex protein NUP1-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111444902 PE=4 SV=1)
HSP 1 Score: 1608.2 bits (4163), Expect = 0.0e+00
Identity = 962/1343 (71.63%), Postives = 1046/1343 (77.89%), Query Frame = 0
Query: 1 MATEREEIRYEGGRGGKFQKRPLRRSHTTPYDRPPTALRNSAGNGWLSKLVDPAQKLITS 60
MATERE + YEGGRGGKFQKRPLRRSHTTPYDRPP ALRNS G GWLSKLVDPAQKLITS
Sbjct: 1 MATEREGVHYEGGRGGKFQKRPLRRSHTTPYDRPPIALRNSNGKGWLSKLVDPAQKLITS 60
Query: 61 SAHRLFSSVFRKRIPPPPPSLPVSRGFLLLLFLFVSWCFLVFNWNALLTIVRMVYGFCFS 120
SAHRLFSSVFRKR+PPPPPSLP+SR
Sbjct: 61 SAHRLFSSVFRKRLPPPPPSLPISR----------------------------------- 120
Query: 121 HSGTQANDEMGNENHEEVAADLPGTQEGTNRDFGPSIKTDNTHGVTDLEQILKEKTFTR- 180
+ANDEM +N EEVAAD PGTQEGTN DF PSI ++N HGV+DLE+ILKEKTFTR
Sbjct: 121 ----EANDEMEIKNQEEVAADPPGTQEGTNNDFVPSINSNNIHGVSDLEKILKEKTFTRF 180
Query: 181 ------------------------------------------------------------ 240
Sbjct: 181 EIDRLTELLKSRVADVPSGVESRKFEMVSSTPVISYDIQEGSPKFPAQEGVRPHMVPTHV 240
Query: 241 ---QVLDEDVASPAEIAKAYMGSRPPKSTPLSMASHSHKFGDSFASENLSKSSALTLVPR 300
V DEDVASPAEIAKA+MGSRPPK+TPLSM +HS KFGD+FA N SKSS L+LVPR
Sbjct: 241 LNANVPDEDVASPAEIAKAFMGSRPPKATPLSMVAHSQKFGDTFALGNPSKSSTLSLVPR 300
Query: 301 SPGNFDVIENGFVTPRSRGRSALYNMARMPYSGVSATHSIKNSVATTDAYRATGSSSSQL 360
SPGNFDV EN FVTPRSRGRSALY+MARMPYS V AT SIKNSVATTD+YRAT +SSSQ
Sbjct: 301 SPGNFDV-ENDFVTPRSRGRSALYSMARMPYSRVRATPSIKNSVATTDSYRATVTSSSQS 360
Query: 361 AWERGRVLGSKQGALKRRSSVLDDEMGHVGPIRRLRHKSNLLYPTGLSLPSSSTSIPVSG 420
AWE+GR+L S QGALKRRSSVLDDE+G VGPIRR+RHKSNLL+P GLSLPSSSTSIPVSG
Sbjct: 361 AWEQGRLLESNQGALKRRSSVLDDEIGSVGPIRRIRHKSNLLFPKGLSLPSSSTSIPVSG 420
Query: 421 IGSENAQQFQSTKVHPFSSSSGKGLYS----RSLSKRSAESENDVKPSSSFSQIPLRSSE 480
IGSE +Q QSTKVHPFSS +GK YS R+LSK SAESEND PSSSF QIPLRSSE
Sbjct: 421 IGSETSQHLQSTKVHPFSSPAGKAPYSSETKRNLSKMSAESENDRTPSSSFPQIPLRSSE 480
Query: 481 MASKILEQLEKLTPPKDKSSKLKLLSVTNNSPTKLSPSMLHGPALRSLEDVDSSKYLENV 540
MA KILEQL+KLTPPK+KSS+LKL SV NNSP KLSPSMLHGPALRSLEDVDS+KYLENV
Sbjct: 481 MALKILEQLDKLTPPKEKSSELKLHSVRNNSPMKLSPSMLHGPALRSLEDVDSAKYLENV 540
Query: 541 EDIQSNDARELTSQKNNKVEESSSLKYKLPINKEISAGDGLGSPVPTKDTVLSSRPQVSF 600
EDI+SND R+LTS+K +K E+SS LK K+P +K IS G G+GS VP+KDTV SS QVSF
Sbjct: 541 EDIRSNDGRDLTSKKKDKFEDSSLLKSKVPSDKSISTGGGVGSSVPSKDTVSSSGLQVSF 600
Query: 601 VGASPQTKCAFQMSAHEDFVDIDEEGCSNGPVTDISFDRREKMDASLVAMSKPSDTEAIT 660
VG S TKCAFQMS EDFVD+D+E SNGPV+ SF+RREK+D SLVA+ KPSDTEAIT
Sbjct: 601 VGPSSLTKCAFQMSVQEDFVDMDDEEYSNGPVSAKSFERREKVDDSLVAVGKPSDTEAIT 660
Query: 661 VDKPQASAEVKPSTVSELNKINGQRKSDVPVTAEKSPIFSFATASPPSITTNAKDPESTL 720
VDKPQAS + KPS VSE+ KIN Q KSDVPVT EKS IFSF TASP S T N +PEST
Sbjct: 661 VDKPQASIQAKPSPVSEMKKINDQAKSDVPVTTEKSSIFSFPTASPSSTTANVIEPESTT 720
Query: 721 RPEKNISPEAPKPANAPIFGFGDKLPSQKESFSSAPTFAFGNKFAPSTNEQNAVPVATSE 780
RPEK S E PK A APIFGFG+KLPSQK+ S+PTF FGNK STNEQNAVP TSE
Sbjct: 721 RPEKIASSEVPKAAAAPIFGFGEKLPSQKDPVFSSPTFTFGNKVTTSTNEQNAVPAVTSE 780
Query: 781 SNVAP-------------GKATFPIPANAATENGNKNTGSPFKFASPLVNEKESAKVGSS 840
NVAP KATFPIPA+ ATENGN GSPFKFAS LVNEKE AK GS+
Sbjct: 781 GNVAPTLQASAPTTFKFGDKATFPIPASTATENGNSEAGSPFKFASSLVNEKEGAKAGSA 840
Query: 841 SVFKAESNSSSI----LSFGVPKESMSDKAGD-KSLSAGLSVGTSENLFSSSVSTSTSAP 900
SVFK+ES+SSS LSFGVPKESMS+KAGD KS SAGLSVGTS NL SSVS ST P
Sbjct: 841 SVFKSESSSSSFNCSTLSFGVPKESMSEKAGDKKSSSAGLSVGTSGNLLLSSVS-STPTP 900
Query: 901 SLFSFSSPSTNSNLNNGSLVSTPSIFSSPATTFSNNITNQNPSIKPSLTTAPSNSEPATT 960
SLFSFSSP+TNSNL NGSL STPS F SP+ TF +NITNQN SIKPSL A SNSEP TT
Sbjct: 901 SLFSFSSPTTNSNLINGSLGSTPSTFPSPSNTFPSNITNQNSSIKPSLNAATSNSEPVTT 960
Query: 961 TSLSMPSPVPSFSAAPIFKFGSPSVPSSSAPA-LSAAETKTKQETT-FGNLSGIPPSDTS 1020
TSLS SP+PSFSAAPIFKFGS SVPSSSAP+ + + ETKTKQETT FGN+SGI PSDTS
Sbjct: 961 TSLSTSSPMPSFSAAPIFKFGSSSVPSSSAPSGVGSVETKTKQETTPFGNVSGISPSDTS 1020
Query: 1021 AAKVSSTGGSVFQFGAAATT-DSNKRPENSTSAPGNVPTFGAPVFPANSGVASSTQSTPV 1080
AAKV STG SVFQFGAA+TT DSNK+PE ST AP +VP+FGAPV PA+SGVASSTQSTPV
Sbjct: 1021 AAKVFSTGSSVFQFGAASTTSDSNKQPEKSTFAPVSVPSFGAPVLPASSGVASSTQSTPV 1080
Query: 1081 LPFSSSSTSFGLAANTGLSSGSSLFGSSAPASNLFSSGTTFGLTGTSSSANNSVSSGAGT 1140
PFSSSSTSFGL NTGL+SG+SL GSSAPASNLF+SG TFG G+SSSANNSVSSGAGT
Sbjct: 1081 SPFSSSSTSFGLTGNTGLASGNSLVGSSAPASNLFTSGATFGF-GSSSSANNSVSSGAGT 1140
Query: 1141 SSSFFNWQTSSTPSFSTGFSSTPTGGFPFGLSSSSAASNSSPMLFGSSTT-SASTTSMFS 1200
SSSFFNWQ SS PSFS+GF STPTGGF FGL+SSSAAS+SSPMLFGSSTT +ASTTSMFS
Sbjct: 1141 SSSFFNWQASSAPSFSSGFGSTPTGGFSFGLASSSAASSSSPMLFGSSTTGAASTTSMFS 1200
Query: 1201 FTSAAS---SQPAFSNSNHGFTFGSTPPANNDQANMEDSMAEDTVQAVTLPT--PTFGQQ 1248
FTSAA+ SQPAF SNHGFTFGSTPPANND ANMEDSMAEDTVQ V PT P+FGQQ
Sbjct: 1201 FTSAATAAPSQPAFGTSNHGFTFGSTPPANNDHANMEDSMAEDTVQTVASPTPMPSFGQQ 1260
BLAST of CmoCh04G026210 vs. ExPASy TrEMBL
Match:
A0A6J1K059 (nuclear pore complex protein NUP1-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111488998 PE=4 SV=1)
HSP 1 Score: 1603.6 bits (4151), Expect = 0.0e+00
Identity = 956/1339 (71.40%), Postives = 1041/1339 (77.74%), Query Frame = 0
Query: 1 MATEREEIRYEGGRGGKFQKRPLRRSHTTPYDRPPTALRNSAGNGWLSKLVDPAQKLITS 60
MATEREE+RYEGGRGGKFQKRPLRR+HTTPYDRPP ALRNSAG GWLSKLVDPAQKLITS
Sbjct: 1 MATEREEVRYEGGRGGKFQKRPLRRAHTTPYDRPPAALRNSAGKGWLSKLVDPAQKLITS 60
Query: 61 SAHRLFSSVFRKRIPPPPPSLPVSRGFLLLLFLFVSWCFLVFNWNALLTIVRMVYGFCFS 120
SAHRLFSSVFRKR+PPPPPSLP+ R
Sbjct: 61 SAHRLFSSVFRKRLPPPPPSLPIYR----------------------------------- 120
Query: 121 HSGTQANDEMGNENHEEVAADLPGTQEGTNRDFGPSIKTDNTHGVTDLEQILKEKTFTR- 180
+ANDEM +N EEVAAD PGTQEGTN DF PSI ++N HGV+DLE+ILKEKTFTR
Sbjct: 121 ----EANDEMEIKNQEEVAADSPGTQEGTNNDFVPSINSNNIHGVSDLEKILKEKTFTRF 180
Query: 181 ------------------------------------------------------------ 240
Sbjct: 181 EIDRLTELLKSRVADVPSGVESRKFEKVPSTPVISYDIQEGSPKFPAQERVRPHMVPNHV 240
Query: 241 ---QVLDEDVASPAEIAKAYMGSRPPKSTPLSMASHSHKFGDSFASENLSKSSALTLVPR 300
V DEDVASPAEIAKA+MGSRPPK+TPLSM +HS KFGD+FA N SKSS L+LVPR
Sbjct: 241 VNANVPDEDVASPAEIAKAFMGSRPPKATPLSMVAHSQKFGDTFALGNPSKSSTLSLVPR 300
Query: 301 SPGNFDVIENGFVTPRSRGRSALYNMARMPYSGVSATHSIKNSVATTDAYRATGSSSSQL 360
SPGNFDV EN FVTPRSRGRSALY+MARMPYS V AT SIKNSVATTD+YRA+ +SSSQ
Sbjct: 301 SPGNFDV-ENDFVTPRSRGRSALYSMARMPYSRVRATPSIKNSVATTDSYRASVTSSSQS 360
Query: 361 AWERGRVLGSKQGALKRRSSVLDDEMGHVGPIRRLRHKSNLLYPTGLSLPSSSTSIPVSG 420
AWE+GR+L S QGALKRRSSVLDDE+G VGPIRR+RHKSNLL+P GLSLPSSSTSIPVSG
Sbjct: 361 AWEQGRLLESNQGALKRRSSVLDDEIGSVGPIRRIRHKSNLLFPKGLSLPSSSTSIPVSG 420
Query: 421 IGSENAQQFQSTKVHPFSSSSGKGLYS----RSLSKRSAESENDVKPSSSFSQIPLRSSE 480
IGSE Q QSTKVHPFSS++GK YS R+LSK SAESEND PSSSF QIPLRSSE
Sbjct: 421 IGSETYQHLQSTKVHPFSSTAGKAPYSSETKRNLSKMSAESENDRTPSSSFPQIPLRSSE 480
Query: 481 MASKILEQLEKLTPPKDKSSKLKLLSVTNNSPTKLSPSMLHGPALRSLEDVDSSKYLENV 540
MA KILEQL+KLTPPK+KSS+LKL SV NNSP KLSPSMLHGPALRSLEDVDS+KYLENV
Sbjct: 481 MALKILEQLDKLTPPKEKSSELKLHSVRNNSPMKLSPSMLHGPALRSLEDVDSAKYLENV 540
Query: 541 EDIQSNDARELTSQKNNKVEESSSLKYKLPINKEISAGDGLGSPVPTKDTVLSSRPQVSF 600
EDI+SND R+LTS+KN+K E+SS LK ++P +K IS G G+GS VP+KDTV SS QVSF
Sbjct: 541 EDIRSNDGRDLTSKKNDKFEDSSLLKSEVPSDKSISTGGGVGSSVPSKDTVSSSGLQVSF 600
Query: 601 VGASPQTKCAFQMSAHEDFVDIDEEGCSNGPVTDISFDRREKMDASLVAMSKPSDTEAIT 660
VG S TKCAFQMS EDFVD+D+E SNGPV SF+RREK+D SLVA+ KPSD EAI
Sbjct: 601 VGPSSLTKCAFQMSVQEDFVDMDDEEYSNGPVAAKSFERREKVDDSLVAVGKPSDNEAII 660
Query: 661 VDKPQASAEVKPSTVSELNKINGQRKSDVPVTAEKSPIFSFATASPPSITTNAKDPESTL 720
VDKPQAS + KPSTVSE+ KIN Q KSD+PVT EKS IFSF TASP S T +PEST
Sbjct: 661 VDKPQASIQAKPSTVSEMKKINDQAKSDIPVTTEKSSIFSFPTASPSSTTATVIEPESTT 720
Query: 721 RPEKNISPEAPKPANAPIFGFGDKLPSQKESFSSAPTFAFGNKFAPSTNEQNAVPVATSE 780
RPEK E PK A APIFGFG+K PSQK+ S+PTF FGNK STNEQNAVP TSE
Sbjct: 721 RPEKIAFSEVPKAAVAPIFGFGEKFPSQKDPVFSSPTFTFGNKVTTSTNEQNAVPAVTSE 780
Query: 781 SNVAP-------------GKATFPIPANAATENGNKNTGSPFKFASPLVNEKESAKVGSS 840
NVAP KATFPIPA+ TENGN GSPFKFAS LVNEKE AK GS+
Sbjct: 781 GNVAPTLQASAPTTFKFGDKATFPIPASTTTENGNSEAGSPFKFASSLVNEKEGAKAGSA 840
Query: 841 SVFKAESNSSSILSFGVPKESMSDKAGD-KSLSAGLSVGTSENLFSSSVSTSTSAPSLFS 900
SVFK+ES+SSSILSFGVPKE MS+KAGD KS SAGLSVGTS NL SSVS ST PSLFS
Sbjct: 841 SVFKSESSSSSILSFGVPKELMSEKAGDKKSSSAGLSVGTSGNLLLSSVS-STPTPSLFS 900
Query: 901 FSSPSTNSNLNNGSLVSTPSIFSSPATTFSNNITNQNPSIKPSLTTAPSNSEPATTTSLS 960
FSSP+TNSNL NGSL STPS F SP+ TF +NITNQN SIKPSL A SNSEP TTTSLS
Sbjct: 901 FSSPTTNSNLINGSLGSTPSTFPSPSNTFPSNITNQNSSIKPSLNAATSNSEPVTTTSLS 960
Query: 961 MPSPVPSFSAAPIFKFGSPSVPSSSAP-ALSAAETKTKQETT-FGNLSGIPPSDTSAAKV 1020
SP+PSFSAAPIFKFGS SVPS+SAP + ETKTKQETT FGN+SGI PSDTSAAKV
Sbjct: 961 TSSPMPSFSAAPIFKFGSSSVPSTSAPNGAGSVETKTKQETTPFGNVSGISPSDTSAAKV 1020
Query: 1021 SSTGGSVFQFGAAATT-DSNKRPENSTSAPGNVPTFGAPVFPANSGVASSTQSTPVLPFS 1080
STG SVFQFGAA+TT DSNK PE ST AP +VP+FGAPV PA+SGVASSTQSTPV PFS
Sbjct: 1021 FSTGSSVFQFGAASTTSDSNKEPEKSTFAPVSVPSFGAPVLPASSGVASSTQSTPVSPFS 1080
Query: 1081 SSSTSFGLAANTGLSSGSSLFGSSAPASNLFSSGTTFGLTGTSSSANNSVSSGAGTSSSF 1140
SSSTSFGL NTGL+SG+SL GSSAPASNLF+SG TFG G+SSSANNSVSSGAGTSSSF
Sbjct: 1081 SSSTSFGLTGNTGLASGNSLVGSSAPASNLFASGATFGF-GSSSSANNSVSSGAGTSSSF 1140
Query: 1141 FNWQTSSTPSFSTGFSSTPTGGFPFGLSSSSAASNSSPMLFGSSTT-SASTTSMFSFTS- 1200
FNWQ SS PSFS+GF STPTGGF FGL+SSSAAS+S+PMLFGSS+T +ASTTSMFSFTS
Sbjct: 1141 FNWQASSGPSFSSGFGSTPTGGFSFGLASSSAASSSAPMLFGSSSTGAASTTSMFSFTSA 1200
Query: 1201 --AASSQPAFSNSNHGFTFGSTPPANNDQANMEDSMAEDTVQAVTLPT--PTFGQQPLTP 1248
AASSQPAF NSNHGFTFGSTPPANND ANMEDSMAEDTVQ V LPT P+FGQQPLTP
Sbjct: 1201 ATAASSQPAFGNSNHGFTFGSTPPANNDHANMEDSMAEDTVQTVALPTPMPSFGQQPLTP 1260
BLAST of CmoCh04G026210 vs. NCBI nr
Match:
XP_022956154.1 (nuclear pore complex protein NUP1-like [Cucurbita moschata])
HSP 1 Score: 2167.9 bits (5616), Expect = 0.0e+00
Identity = 1206/1311 (91.99%), Postives = 1207/1311 (92.07%), Query Frame = 0
Query: 1 MATEREEIRYEGGRGGKFQKRPLRRSHTTPYDRPPTALRNSAGNGWLSKLVDPAQKLITS 60
MATEREEIRYEGGRGGKFQKRPLRRSHTTPYDRPPTALRNSAGNGWLSKLVDPAQKLITS
Sbjct: 1 MATEREEIRYEGGRGGKFQKRPLRRSHTTPYDRPPTALRNSAGNGWLSKLVDPAQKLITS 60
Query: 61 SAHRLFSSVFRKRIPPPPPSLPVSRGFLLLLFLFVSWCFLVFNWNALLTIVRMVYGFCFS 120
SAHRLFSSVFRKRIPPPPPSLPVSR
Sbjct: 61 SAHRLFSSVFRKRIPPPPPSLPVSR----------------------------------- 120
Query: 121 HSGTQANDEMGNENHEEVAADLPGTQEGTNRDFGPSIKTDNTHGVTDLEQILKEKTFTR- 180
+ANDEMGNENHEEVAADLPGTQEGTNRDFGPSIKTDNTHGVTDLEQILKEKTFTR
Sbjct: 121 ----EANDEMGNENHEEVAADLPGTQEGTNRDFGPSIKTDNTHGVTDLEQILKEKTFTRF 180
Query: 181 ------------------------------------------------------------ 240
Sbjct: 181 EIDRLTELLKSRVVDVPSGAEERKFEQAPSTPVISYEIQEGSPKFRAQDGISSHVVPTQV 240
Query: 241 ---QVLDEDVASPAEIAKAYMGSRPPKSTPLSMASHSHKFGDSFASENLSKSSALTLVPR 300
VLDEDVASPAEIAKAYMGSRPPKSTPLSMASHSHKFGDSFASENLSKSSALTLVPR
Sbjct: 241 MRANVLDEDVASPAEIAKAYMGSRPPKSTPLSMASHSHKFGDSFASENLSKSSALTLVPR 300
Query: 301 SPGNFDVIENGFVTPRSRGRSALYNMARMPYSGVSATHSIKNSVATTDAYRATGSSSSQL 360
SPGNFDVIENGFVTPRSRGRSALYNMARMPYSGVSATHSIKNSVATTDAYRATGSSSSQL
Sbjct: 301 SPGNFDVIENGFVTPRSRGRSALYNMARMPYSGVSATHSIKNSVATTDAYRATGSSSSQL 360
Query: 361 AWERGRVLGSKQGALKRRSSVLDDEMGHVGPIRRLRHKSNLLYPTGLSLPSSSTSIPVSG 420
AWERGRVLGSKQGALKRRSSVLDDEMGHVGPIRRLRHKSNLLYPTGLSLPSSSTSIPVSG
Sbjct: 361 AWERGRVLGSKQGALKRRSSVLDDEMGHVGPIRRLRHKSNLLYPTGLSLPSSSTSIPVSG 420
Query: 421 IGSENAQQFQSTKVHPFSSSSGKGLYSRSLSKRSAESENDVKPSSSFSQIPLRSSEMASK 480
IGSENAQQFQSTKVHPFSSSSGKGLYSRSLSKRSAESENDVKPSSSFSQIPLRSSEMASK
Sbjct: 421 IGSENAQQFQSTKVHPFSSSSGKGLYSRSLSKRSAESENDVKPSSSFSQIPLRSSEMASK 480
Query: 481 ILEQLEKLTPPKDKSSKLKLLSVTNNSPTKLSPSMLHGPALRSLEDVDSSKYLENVEDIQ 540
ILEQLEKLTPPKDKSSKLKLLSVTNNSPTKLSPSMLHGPALRSLEDVDSSKYLENVEDIQ
Sbjct: 481 ILEQLEKLTPPKDKSSKLKLLSVTNNSPTKLSPSMLHGPALRSLEDVDSSKYLENVEDIQ 540
Query: 541 SNDARELTSQKNNKVEESSSLKYKLPINKEISAGDGLGSPVPTKDTVLSSRPQVSFVGAS 600
SNDARELTSQKNNKVEESSSLKYKLPINKEISAGDGLGSPVPTKDTVLSSRPQVSFVGAS
Sbjct: 541 SNDARELTSQKNNKVEESSSLKYKLPINKEISAGDGLGSPVPTKDTVLSSRPQVSFVGAS 600
Query: 601 PQTKCAFQMSAHEDFVDIDEEGCSNGPVTDISFDRREKMDASLVAMSKPSDTEAITVDKP 660
PQTKCAFQMSAHEDFVDIDEEGCSNGPVTDISFDRREKMDASLVAMSKPSDTEAITVDKP
Sbjct: 601 PQTKCAFQMSAHEDFVDIDEEGCSNGPVTDISFDRREKMDASLVAMSKPSDTEAITVDKP 660
Query: 661 QASAEVKPSTVSELNKINGQRKSDVPVTAEKSPIFSFATASPPSITTNAKDPESTLRPEK 720
QASAEVKPSTVSELNKINGQRKSDVPVTAEKSPIFSFATASPPSITTNAKDPESTLRPEK
Sbjct: 661 QASAEVKPSTVSELNKINGQRKSDVPVTAEKSPIFSFATASPPSITTNAKDPESTLRPEK 720
Query: 721 NISPEAPKPANAPIFGFGDKLPSQKESFSSAPTFAFGNKFAPSTNEQNAVPVATSESNVA 780
NISPEAPKPANAPIFGFGDKLPSQKESFSSAPTFAFGNKFAPSTNEQNAVPVATSESNVA
Sbjct: 721 NISPEAPKPANAPIFGFGDKLPSQKESFSSAPTFAFGNKFAPSTNEQNAVPVATSESNVA 780
Query: 781 PGKATFPIPANAATENGNKNTGSPFKFASPLVNEKESAKVGSSSVFKAESNSSSILSFGV 840
PGKATFPIPANAATENGNKNTGSPFKFASPLVNEKESAKVGSSSVFKAESNSSSILSFGV
Sbjct: 781 PGKATFPIPANAATENGNKNTGSPFKFASPLVNEKESAKVGSSSVFKAESNSSSILSFGV 840
Query: 841 PKESMSDKAGDKSLSAGLSVGTSENLFSSSVSTSTSAPSLFSFSSPSTNSNLNNGSLVST 900
PKESMSDKAGDKSLSAGLSVGTSENLFSSSVSTSTSAPSLFSFSSPSTNSNLNNGSLVST
Sbjct: 841 PKESMSDKAGDKSLSAGLSVGTSENLFSSSVSTSTSAPSLFSFSSPSTNSNLNNGSLVST 900
Query: 901 PSIFSSPATTFSNNITNQNPSIKPSLTTAPSNSEPATTTSLSMPSPVPSFSAAPIFKFGS 960
PSIFSSPATTFSNNITNQNPSIKPSLTTAPSNSEPATTTSLSMPSPVPSFSAAPIFKFGS
Sbjct: 901 PSIFSSPATTFSNNITNQNPSIKPSLTTAPSNSEPATTTSLSMPSPVPSFSAAPIFKFGS 960
Query: 961 PSVPSSSAPALSAAETKTKQETTFGNLSGIPPSDTSAAKVSSTGGSVFQFGAAATTDSNK 1020
PSVPSSSAPALSAAETKTKQETTFGNLSGIPPSDTSAAKVSSTGGSVFQFGAAATTDSNK
Sbjct: 961 PSVPSSSAPALSAAETKTKQETTFGNLSGIPPSDTSAAKVSSTGGSVFQFGAAATTDSNK 1020
Query: 1021 RPENSTSAPGNVPTFGAPVFPANSGVASSTQSTPVLPFSSSSTSFGLAANTGLSSGSSLF 1080
RPENSTSAPGNVPTFGAPVFPANSGVASSTQSTPVLPFSSSSTSFGLAANTGLSSGSSLF
Sbjct: 1021 RPENSTSAPGNVPTFGAPVFPANSGVASSTQSTPVLPFSSSSTSFGLAANTGLSSGSSLF 1080
Query: 1081 GSSAPASNLFSSGTTFGLTGTSSSANNSVSSGAGTSSSFFNWQTSSTPSFSTGFSSTPTG 1140
GSSAPASNLFSSGTTFGLTGTSSSANNSVSSGAGTSSSFFNWQTSSTPSFSTGFSSTPTG
Sbjct: 1081 GSSAPASNLFSSGTTFGLTGTSSSANNSVSSGAGTSSSFFNWQTSSTPSFSTGFSSTPTG 1140
Query: 1141 GFPFGLSSSSAASNSSPMLFGSSTTSASTTSMFSFTSAASSQPAFSNSNHGFTFGSTPPA 1200
GFPFGLSSSSAASNSSPMLFGSSTTSASTTSMFSFTSAASSQPAFSNSNHGFTFGSTPPA
Sbjct: 1141 GFPFGLSSSSAASNSSPMLFGSSTTSASTTSMFSFTSAASSQPAFSNSNHGFTFGSTPPA 1200
Query: 1201 NNDQANMEDSMAEDTVQAVTLPTPTFGQQPLTPPPSSGFMFGSAAPPPVAASPFQFGSQQ 1248
NNDQANMEDSMAEDTVQAVTLPTPTFGQQPLTPPPSSGFMFGSAAPPPVAASPFQFGSQQ
Sbjct: 1201 NNDQANMEDSMAEDTVQAVTLPTPTFGQQPLTPPPSSGFMFGSAAPPPVAASPFQFGSQQ 1260
BLAST of CmoCh04G026210 vs. NCBI nr
Match:
KAG7032922.1 (Nuclear pore complex protein NUP1 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 2135.9 bits (5533), Expect = 0.0e+00
Identity = 1192/1320 (90.30%), Postives = 1195/1320 (90.53%), Query Frame = 0
Query: 1 MATEREEIRYEGGRGGKFQKRPLRRSHTTPYDRPPTALRNSAGNGWLSKLVDPAQKLITS 60
MATEREEIRYEGGRGGKFQKRPLRRSHTTPYDRPPTALRNSAGNGWLSKLVDPAQKLITS
Sbjct: 1 MATEREEIRYEGGRGGKFQKRPLRRSHTTPYDRPPTALRNSAGNGWLSKLVDPAQKLITS 60
Query: 61 SAHRLFSSVFRKRIPPPPPSLPVSRGFLLLLFLFVSWCFLVFNWNALLTIVRMVYGFCFS 120
SAHRLFSSVFRKRIPPPPPSLPVSR
Sbjct: 61 SAHRLFSSVFRKRIPPPPPSLPVSR----------------------------------- 120
Query: 121 HSGTQANDEMGNENHEEVAADLPGTQEGTNRDFGPSIKTDNTHGVTDLEQILKEKTFTRQ 180
+ANDEMGNENHEEVAADLPGTQEGTN DFGPSIKTDNTHGVTDLEQILKEKTFTRQ
Sbjct: 121 ----EANDEMGNENHEEVAADLPGTQEGTNHDFGPSIKTDNTHGVTDLEQILKEKTFTRQ 180
Query: 181 ------------------------------------------------------------ 240
Sbjct: 181 VFECLLVKFEIDRLTELLKSRVVDVPSGAEERKFEQAPSTPVISYEIQEGSPKFRAQDGI 240
Query: 241 -------------VLDEDVASPAEIAKAYMGSRPPKSTPLSMASHSHKFGDSFASENLSK 300
VLDEDVASPAEIAKAYMGSRPPK+TPLSMASHSHKFGDSFASENLSK
Sbjct: 241 SSHVVPTQVMRANVLDEDVASPAEIAKAYMGSRPPKATPLSMASHSHKFGDSFASENLSK 300
Query: 301 SSALTLVPRSPGNFDVIENGFVTPRSRGRSALYNMARMPYSGVSATHSIKNSVATTDAYR 360
SSALTLVPRSPGNFDVIENGFVTPRSRGRSALYNMARMPYSGVSATHSIKNSVATTDAYR
Sbjct: 301 SSALTLVPRSPGNFDVIENGFVTPRSRGRSALYNMARMPYSGVSATHSIKNSVATTDAYR 360
Query: 361 ATGSSSSQLAWERGRVLGSKQGALKRRSSVLDDEMGHVGPIRRLRHKSNLLYPTGLSLPS 420
ATGSSSSQLAWERGRVLGSKQGALKRRSSVLDDEMGHVGPIRRLRHKSNLLYPTGLSLPS
Sbjct: 361 ATGSSSSQLAWERGRVLGSKQGALKRRSSVLDDEMGHVGPIRRLRHKSNLLYPTGLSLPS 420
Query: 421 SSTSIPVSGIGSENAQQFQSTKVHPFSSSSGKGLYSRSLSKRSAESENDVKPSSSFSQIP 480
SSTSIPVSGIGSENAQQFQSTKVHPFSSSSGKGLYSRSLSKRSAESENDVKPSSSFSQIP
Sbjct: 421 SSTSIPVSGIGSENAQQFQSTKVHPFSSSSGKGLYSRSLSKRSAESENDVKPSSSFSQIP 480
Query: 481 LRSSEMASKILEQLEKLTPPKDKSSKLKLLSVTNNSPTKLSPSMLHGPALRSLEDVDSSK 540
LRSSEMASKILEQLEKLTPPKDKSSKLKLLSVTNNSPTKLSPSMLHGPALRSLEDVDSSK
Sbjct: 481 LRSSEMASKILEQLEKLTPPKDKSSKLKLLSVTNNSPTKLSPSMLHGPALRSLEDVDSSK 540
Query: 541 YLENVEDIQSNDARELTSQKNNKVEESSSLKYKLPINKEISAGDGLGSPVPTKDTVLSSR 600
YLENVEDIQSNDARELTSQKNNKVEESSSLKYKLPINKEISAGDGLGSPVPTKDTVLSSR
Sbjct: 541 YLENVEDIQSNDARELTSQKNNKVEESSSLKYKLPINKEISAGDGLGSPVPTKDTVLSSR 600
Query: 601 PQVSFVGASPQTKCAFQMSAHEDFVDIDEEGCSNGPVTDISFDRREKMDASLVAMSKPSD 660
PQVSFVGASPQTKCAFQMSAHEDFVDIDEEGCSNGPVTDISFDRREKMD SLVAMSKPSD
Sbjct: 601 PQVSFVGASPQTKCAFQMSAHEDFVDIDEEGCSNGPVTDISFDRREKMDGSLVAMSKPSD 660
Query: 661 TEAITVDKPQASAEVKPSTVSELNKINGQRKSDVPVTAEKSPIFSFATASPPSITTNAKD 720
TEAITVDKPQASAEVKPSTVSELNKINGQRKSDVPVTA KSPIFSFATASPPSITTNAKD
Sbjct: 661 TEAITVDKPQASAEVKPSTVSELNKINGQRKSDVPVTAVKSPIFSFATASPPSITTNAKD 720
Query: 721 PESTLRPEKNISPEAPKPANAPIFGFGDKLPSQKESFSSAPTFAFGNKFAPSTNEQNAVP 780
PESTLRPEKNISPEAPKPANAPIFGFGDKLPSQKESFSSAPTFAFGNKFAPSTNEQNAVP
Sbjct: 721 PESTLRPEKNISPEAPKPANAPIFGFGDKLPSQKESFSSAPTFAFGNKFAPSTNEQNAVP 780
Query: 781 VATSESNVAPGKATFPIPANAATENGNKNTGSPFKFASPLVNEKESAKVGSSSVFKAESN 840
VATSESNVAPGKATFPIPANAATENGNKNTGSPFKFASPLVNEKESAK GSSSV KAESN
Sbjct: 781 VATSESNVAPGKATFPIPANAATENGNKNTGSPFKFASPLVNEKESAKGGSSSVSKAESN 840
Query: 841 SSSILSFGVPKESMSDKAGDKSLSAGLSVGTSENLFSSSVSTSTSAPSLFSFSSPSTNSN 900
SSSILSFGVPKESMSDKAGDKS SAGLSVGTSENLFSSSVSTSTSAPSLFSFSSPSTNSN
Sbjct: 841 SSSILSFGVPKESMSDKAGDKSSSAGLSVGTSENLFSSSVSTSTSAPSLFSFSSPSTNSN 900
Query: 901 LNNGSLVSTPSIFSSPATTFSNNITNQNPSIKPSLTTAPSNSEPATTTSLSMPSPVPSFS 960
LNNGSLVSTPSIFSSPATTFSNNITNQNPSIKPSLT APSN EPATTTSLSMPSPVPSFS
Sbjct: 901 LNNGSLVSTPSIFSSPATTFSNNITNQNPSIKPSLTAAPSNGEPATTTSLSMPSPVPSFS 960
Query: 961 AAPIFKFGSPSVPSSSAPALSAAETKTKQETTFGNLSGIPPSDTSAAKVSSTGGSVFQFG 1020
AAPIFKFGSPSVPSSSAPALSAAETKTKQETTFGNLSGIPPSDTSAAKVSSTGGSVFQFG
Sbjct: 961 AAPIFKFGSPSVPSSSAPALSAAETKTKQETTFGNLSGIPPSDTSAAKVSSTGGSVFQFG 1020
Query: 1021 AAATTDSNKRPENSTSAPGNVPTFGAPVFPANSGVASSTQSTPVLPFSSSSTSFGLAANT 1080
AAATTDSNKRPENSTSA GNVPTFGAPVFPANSGVASSTQSTPVLPFSSSSTSFGLAANT
Sbjct: 1021 AAATTDSNKRPENSTSASGNVPTFGAPVFPANSGVASSTQSTPVLPFSSSSTSFGLAANT 1080
Query: 1081 GLSSGSSLFGSSAPASNLFSSGTTFGLTGTSSSANNSVSSGAGTSSSFFNWQTSSTPSFS 1140
GLSSGSSLFGSSAPASNLFSSGTTFGLTGTSSSANNSVSSGAGTSSSFFNWQTSSTPSFS
Sbjct: 1081 GLSSGSSLFGSSAPASNLFSSGTTFGLTGTSSSANNSVSSGAGTSSSFFNWQTSSTPSFS 1140
Query: 1141 TGFSSTPTGGFPFGLSSSSAASNSSPMLFGSSTTSASTTSMFSFTSAASSQPAFSNSNHG 1200
TGF STPTGGFPFGLSSSSAASNSSPMLFGSSTTSASTTSMFSFTSAASSQPAF NSNHG
Sbjct: 1141 TGFGSTPTGGFPFGLSSSSAASNSSPMLFGSSTTSASTTSMFSFTSAASSQPAFGNSNHG 1200
Query: 1201 FTFGSTPPANNDQANMEDSMAEDTVQAVTLPTPTFGQQPLTPPPSSGFMFGSAAPPPVAA 1248
FTFGSTPPANNDQANMEDSMAEDT+QAVTLPTPTFGQQPLTPPPSSGFMFGSAAPPPVAA
Sbjct: 1201 FTFGSTPPANNDQANMEDSMAEDTIQAVTLPTPTFGQQPLTPPPSSGFMFGSAAPPPVAA 1260
BLAST of CmoCh04G026210 vs. NCBI nr
Match:
KAG6602242.1 (Nuclear pore complex protein NUP1, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 2121.7 bits (5496), Expect = 0.0e+00
Identity = 1186/1311 (90.47%), Postives = 1190/1311 (90.77%), Query Frame = 0
Query: 1 MATEREEIRYEGGRGGKFQKRPLRRSHTTPYDRPPTALRNSAGNGWLSKLVDPAQKLITS 60
MATEREEIRYEGGRGGKFQKRPLRRSHTTPYDRPPTALRNSAGNGWLSKLVDPAQKLITS
Sbjct: 1 MATEREEIRYEGGRGGKFQKRPLRRSHTTPYDRPPTALRNSAGNGWLSKLVDPAQKLITS 60
Query: 61 SAHRLFSSVFRKRIPPPPPSLPVSRGFLLLLFLFVSWCFLVFNWNALLTIVRMVYGFCFS 120
SAHRLFSSVFRKRIPPPPPSLPVSR
Sbjct: 61 SAHRLFSSVFRKRIPPPPPSLPVSR----------------------------------- 120
Query: 121 HSGTQANDEMGNENHEEVAADLPGTQEGTNRDFGPSIKTDNTHGVTDLEQILKEKTFTR- 180
+ANDEMGNENHEEVAADLPGTQEGTN DFGPSIKTDNTHGVTDLEQILKEKTFTR
Sbjct: 121 ----EANDEMGNENHEEVAADLPGTQEGTNHDFGPSIKTDNTHGVTDLEQILKEKTFTRF 180
Query: 181 ------------------------------------------------------------ 240
Sbjct: 181 EIDRLTELLKSRVVDVPSGAEERKFEQAPSTPVISYEIQEGSPKFRAQDGISSHVVPTQV 240
Query: 241 ---QVLDEDVASPAEIAKAYMGSRPPKSTPLSMASHSHKFGDSFASENLSKSSALTLVPR 300
VLDEDVASPAEIAKAYMGSRPPK+TPLSMASHSHKFGDSFASENLSKSSALTLVPR
Sbjct: 241 MRANVLDEDVASPAEIAKAYMGSRPPKATPLSMASHSHKFGDSFASENLSKSSALTLVPR 300
Query: 301 SPGNFDVIENGFVTPRSRGRSALYNMARMPYSGVSATHSIKNSVATTDAYRATGSSSSQL 360
SPGNFDVIENGFVTPRSRGRSALYNMARMPYSGVSATHSIKNSVATTDAYRATGSSSSQ
Sbjct: 301 SPGNFDVIENGFVTPRSRGRSALYNMARMPYSGVSATHSIKNSVATTDAYRATGSSSSQF 360
Query: 361 AWERGRVLGSKQGALKRRSSVLDDEMGHVGPIRRLRHKSNLLYPTGLSLPSSSTSIPVSG 420
AWER RVLGSKQGALKRRSSVLDDEMGHVGPIRRLRHKSNLLYPTGLSLPSSSTSIPVSG
Sbjct: 361 AWERVRVLGSKQGALKRRSSVLDDEMGHVGPIRRLRHKSNLLYPTGLSLPSSSTSIPVSG 420
Query: 421 IGSENAQQFQSTKVHPFSSSSGKGLYSRSLSKRSAESENDVKPSSSFSQIPLRSSEMASK 480
IGSENAQQFQSTKVHPFSSSSGKGLYSRSLSKRSAESENDVKPSSSFSQIPLRSSEMASK
Sbjct: 421 IGSENAQQFQSTKVHPFSSSSGKGLYSRSLSKRSAESENDVKPSSSFSQIPLRSSEMASK 480
Query: 481 ILEQLEKLTPPKDKSSKLKLLSVTNNSPTKLSPSMLHGPALRSLEDVDSSKYLENVEDIQ 540
ILEQLEKLTPPKDKSSKLKLLSVTNNSPTKLSPSMLHGPALRSLEDVDSSKYLENVEDIQ
Sbjct: 481 ILEQLEKLTPPKDKSSKLKLLSVTNNSPTKLSPSMLHGPALRSLEDVDSSKYLENVEDIQ 540
Query: 541 SNDARELTSQKNNKVEESSSLKYKLPINKEISAGDGLGSPVPTKDTVLSSRPQVSFVGAS 600
SNDARELTSQKNNKVEESSSLKYKLPINKEISAGDGLGSPVPTKDTVLSSRPQVSFVGAS
Sbjct: 541 SNDARELTSQKNNKVEESSSLKYKLPINKEISAGDGLGSPVPTKDTVLSSRPQVSFVGAS 600
Query: 601 PQTKCAFQMSAHEDFVDIDEEGCSNGPVTDISFDRREKMDASLVAMSKPSDTEAITVDKP 660
PQTKCAFQMSAHEDFVDIDEEGCSNGPVTDISFDRREKMD SLVAMSKPSDTEAITVDKP
Sbjct: 601 PQTKCAFQMSAHEDFVDIDEEGCSNGPVTDISFDRREKMDGSLVAMSKPSDTEAITVDKP 660
Query: 661 QASAEVKPSTVSELNKINGQRKSDVPVTAEKSPIFSFATASPPSITTNAKDPESTLRPEK 720
QASAEVKPSTVSELNKINGQRKSDVPVTA KSPIFSFATASPPSITTNAKDPESTLRPEK
Sbjct: 661 QASAEVKPSTVSELNKINGQRKSDVPVTAVKSPIFSFATASPPSITTNAKDPESTLRPEK 720
Query: 721 NISPEAPKPANAPIFGFGDKLPSQKESFSSAPTFAFGNKFAPSTNEQNAVPVATSESNVA 780
NISPEAPKPANAPIFGFGDKLPSQKESFSSAPTFAFGNKFAPSTNEQN VPVATSESNVA
Sbjct: 721 NISPEAPKPANAPIFGFGDKLPSQKESFSSAPTFAFGNKFAPSTNEQNDVPVATSESNVA 780
Query: 781 PGKATFPIPANAATENGNKNTGSPFKFASPLVNEKESAKVGSSSVFKAESNSSSILSFGV 840
PGKATFPIPANAATENGNKNTGSPFKFASPLVNEKESAKVGSSSV KAESNSSSILSFGV
Sbjct: 781 PGKATFPIPANAATENGNKNTGSPFKFASPLVNEKESAKVGSSSVSKAESNSSSILSFGV 840
Query: 841 PKESMSDKAGDKSLSAGLSVGTSENLFSSSVSTSTSAPSLFSFSSPSTNSNLNNGSLVST 900
PKESMSDKAGDKS SAGLSVGTSENLFSSSVSTSTSAPSLFSFSSPSTNSNLNNGSLVST
Sbjct: 841 PKESMSDKAGDKSSSAGLSVGTSENLFSSSVSTSTSAPSLFSFSSPSTNSNLNNGSLVST 900
Query: 901 PSIFSSPATTFSNNITNQNPSIKPSLTTAPSNSEPATTTSLSMPSPVPSFSAAPIFKFGS 960
PSIFSSPATTFSNNIT QNPSIKPSLT APSN EPATTTSLSMPSPVPSFSAAPIFKFGS
Sbjct: 901 PSIFSSPATTFSNNIT-QNPSIKPSLTAAPSNGEPATTTSLSMPSPVPSFSAAPIFKFGS 960
Query: 961 PSVPSSSAPALSAAETKTKQETTFGNLSGIPPSDTSAAKVSSTGGSVFQFGAAATTDSNK 1020
PSVPSSSAPALSAAETKTKQETTFGNLSGIPPSDTSAAKVSSTGGSVFQFGAAATTDSNK
Sbjct: 961 PSVPSSSAPALSAAETKTKQETTFGNLSGIPPSDTSAAKVSSTGGSVFQFGAAATTDSNK 1020
Query: 1021 RPENSTSAPGNVPTFGAPVFPANSGVASSTQSTPVLPFSSSSTSFGLAANTGLSSGSSLF 1080
RPENSTSA GNVPTFGAPVFPANSGVASSTQSTPVLPFSSSSTSFGLAANT LSSGSSLF
Sbjct: 1021 RPENSTSASGNVPTFGAPVFPANSGVASSTQSTPVLPFSSSSTSFGLAANTVLSSGSSLF 1080
Query: 1081 GSSAPASNLFSSGTTFGLTGTSSSANNSVSSGAGTSSSFFNWQTSSTPSFSTGFSSTPTG 1140
GSSAPASNLFSSGTTFGLTGT+SSANNSVSSGAGTSSSFFNWQTSSTPSFSTGF STPTG
Sbjct: 1081 GSSAPASNLFSSGTTFGLTGTNSSANNSVSSGAGTSSSFFNWQTSSTPSFSTGFGSTPTG 1140
Query: 1141 GFPFGLSSSSAASNSSPMLFGSSTTSASTTSMFSFTSAASSQPAFSNSNHGFTFGSTPPA 1200
GFPFGLSSSSAASNSSPMLFGSSTTSASTTSMFSFTSAASSQPAF NSNHGFTFGSTPPA
Sbjct: 1141 GFPFGLSSSSAASNSSPMLFGSSTTSASTTSMFSFTSAASSQPAFGNSNHGFTFGSTPPA 1200
Query: 1201 NNDQANMEDSMAEDTVQAVTLPTPTFGQQPLTPPPSSGFMFGSAAPPPVAASPFQFGSQQ 1248
NNDQANMEDSMAEDT+QAVTLPTPTFGQQPLTPPPSSGFMFGSAAPPPVAASPFQFGSQQ
Sbjct: 1201 NNDQANMEDSMAEDTIQAVTLPTPTFGQQPLTPPPSSGFMFGSAAPPPVAASPFQFGSQQ 1260
BLAST of CmoCh04G026210 vs. NCBI nr
Match:
XP_023527371.1 (nuclear pore complex protein NUP1-like [Cucurbita pepo subsp. pepo])
HSP 1 Score: 2119.0 bits (5489), Expect = 0.0e+00
Identity = 1182/1312 (90.09%), Postives = 1190/1312 (90.70%), Query Frame = 0
Query: 1 MATEREEIRYEGGRGGKFQKRPLRRSHTTPYDRPPTALRNSAGNGWLSKLVDPAQKLITS 60
MATEREEIRYEGGRGGKFQKRPLRRSHTTPYDRPPTALRNSAGNGWLSKLVDPAQKLITS
Sbjct: 1 MATEREEIRYEGGRGGKFQKRPLRRSHTTPYDRPPTALRNSAGNGWLSKLVDPAQKLITS 60
Query: 61 SAHRLFSSVFRKRIPPPPPSLPVSRGFLLLLFLFVSWCFLVFNWNALLTIVRMVYGFCFS 120
SAHRLFSSVFRKRIPPPPPSLPVSR
Sbjct: 61 SAHRLFSSVFRKRIPPPPPSLPVSR----------------------------------- 120
Query: 121 HSGTQANDEMGNENHEEVAADLPGTQEGTNRDFGPSIKTDNTHGVTDLEQILKEKTFTR- 180
+ANDEMGNENHEEVAADLPGTQEGTNRDFGPSIKTDNTHGVTDLEQILKEKTFTR
Sbjct: 121 ----EANDEMGNENHEEVAADLPGTQEGTNRDFGPSIKTDNTHGVTDLEQILKEKTFTRF 180
Query: 181 ------------------------------------------------------------ 240
Sbjct: 181 EIDRLTELLKSRVVDVPSGVEERKFEQAPSTPVISYEIQEGSPKFRAQDGISSHVVPTQV 240
Query: 241 ---QVLDEDVASPAEIAKAYMGSRPPKSTPLSMASHSHKFGDSFASENLSKSSALTLVPR 300
VLDEDVASPAEIAKAYMGSRPPK+TPLSMASHSHKFGDSFASENLSKSSALTLVPR
Sbjct: 241 MRANVLDEDVASPAEIAKAYMGSRPPKATPLSMASHSHKFGDSFASENLSKSSALTLVPR 300
Query: 301 SPGNFDVIENGFVTPRSRGRSALYNMARMPYSGVSATHSIKNSVATTDAYRATGSSSSQL 360
SPGNFDVIENGFVTPRSRGRSALYNMARMPYSGVSATHSIKNSVATTDAYRATGSSSSQL
Sbjct: 301 SPGNFDVIENGFVTPRSRGRSALYNMARMPYSGVSATHSIKNSVATTDAYRATGSSSSQL 360
Query: 361 AWERGRVLGSKQGALKRRSSVLDDEMGHVGPIRRLRHKSNLLYPTGLSLPSSSTSIPVSG 420
AWERGRVLGSKQGALKRRSSVLDDEMGHVGPIRRLRHKSNLLYPTGLSLPSSSTSIPVSG
Sbjct: 361 AWERGRVLGSKQGALKRRSSVLDDEMGHVGPIRRLRHKSNLLYPTGLSLPSSSTSIPVSG 420
Query: 421 IGSENAQQFQSTKVHPFSSSSGKGLYSRSLSKRSAESENDVKPSSSFSQIPLRSSEMASK 480
IGSENAQQFQSTKVHPFSSSSGK LYSRSLSKRSAESENDVKPSSSFSQIPLRSSEMASK
Sbjct: 421 IGSENAQQFQSTKVHPFSSSSGKALYSRSLSKRSAESENDVKPSSSFSQIPLRSSEMASK 480
Query: 481 ILEQLEKLTPPKDKSSKLKLLSVTNNSPTKLSPSMLHGPALRSLEDVDSSKYLENVEDIQ 540
ILEQLEKLTPPKD+SSKLKLLSVTNNSPTKLSPSMLHGPALRSLEDVDSSKYLENVEDIQ
Sbjct: 481 ILEQLEKLTPPKDRSSKLKLLSVTNNSPTKLSPSMLHGPALRSLEDVDSSKYLENVEDIQ 540
Query: 541 SNDARELTSQKNNKVEESSSLKYKLPINKEISAGDGLGSPVPTKDTVLSSRPQVSFVGAS 600
SNDARELTSQKNNKVEESSSLKYKLPI+K ISAGDGLGS VPTKDTVLSSRPQV+FVGAS
Sbjct: 541 SNDARELTSQKNNKVEESSSLKYKLPIDKAISAGDGLGSLVPTKDTVLSSRPQVAFVGAS 600
Query: 601 PQTKCAFQMSAHEDFVDIDEEGCSNGPVTDISFDRREKMDASLVAMSKPSDTEAITVDKP 660
PQTKCAFQMSAHEDFVDIDEEGCSNGPVTDISFDRREKMD SL AMSKP+DTEAITVDKP
Sbjct: 601 PQTKCAFQMSAHEDFVDIDEEGCSNGPVTDISFDRREKMDGSLAAMSKPNDTEAITVDKP 660
Query: 661 QASAEVKPSTVSELNKINGQRKSDVPVTAEKSPIFSFATASPPSITTNAKDPESTLRPEK 720
QASAEVKPST SELNK+NGQRKSDVPV AEKSPIFSFATASPPSI NAKDPESTLRPEK
Sbjct: 661 QASAEVKPSTASELNKMNGQRKSDVPVIAEKSPIFSFATASPPSIIANAKDPESTLRPEK 720
Query: 721 NISPEAPKPANAPIFGFGDKLPSQKESFSSAPTFAFGNKFAPSTNEQNAVPVATSESNVA 780
NISPEAPKPANAPIFGFGDKLPSQKESFSSAPTFAFGNKFAPSTNEQNAVPVA SESNVA
Sbjct: 721 NISPEAPKPANAPIFGFGDKLPSQKESFSSAPTFAFGNKFAPSTNEQNAVPVANSESNVA 780
Query: 781 PGKATFPIPANAATENGNKNTGSPFKFASPLVNEKESAKVGSSSVFKAESNSSSILSFGV 840
PGKATFPIPANAATENGNKNTGSPFKFASPLVNEKE AKVGSSSVFKAESNSSSILSFGV
Sbjct: 781 PGKATFPIPANAATENGNKNTGSPFKFASPLVNEKERAKVGSSSVFKAESNSSSILSFGV 840
Query: 841 PKESMSDKAGDKSLSAGLSVGTSENLFSSSVSTSTSAPSLFSFSSPSTNSNLNNGSLVST 900
PKESMSDKAGDKS SAGLSVGTSENLFSSSVSTSTS+PSLFSFSSPSTNSNLNNGSLVST
Sbjct: 841 PKESMSDKAGDKSSSAGLSVGTSENLFSSSVSTSTSSPSLFSFSSPSTNSNLNNGSLVST 900
Query: 901 PSIFSSPATTFSNNITNQNPSIKPSLTTAPSNSEPA-TTTSLSMPSPVPSFSAAPIFKFG 960
PSIFSSPATTFSNNITNQNPSIKPSLTTAPSNSEPA TTTSLSMPSPVPSFSAAPIFKFG
Sbjct: 901 PSIFSSPATTFSNNITNQNPSIKPSLTTAPSNSEPATTTTSLSMPSPVPSFSAAPIFKFG 960
Query: 961 SPSVPSSSAPALSAAETKTKQETTFGNLSGIPPSDTSAAKVSSTGGSVFQFGAAATTDSN 1020
SPSVPSSSAPALSAAETKTKQETTFGNLSGIPPSDTSAAKVSSTGGSVFQFGAAATTDSN
Sbjct: 961 SPSVPSSSAPALSAAETKTKQETTFGNLSGIPPSDTSAAKVSSTGGSVFQFGAAATTDSN 1020
Query: 1021 KRPENSTSAPGNVPTFGAPVFPANSGVASSTQSTPVLPFSSSSTSFGLAANTGLSSGSSL 1080
KRPENSTSAPGNVPTFGAPVFPANSGVASSTQSTPVLPFSSSSTSFGLAANTGLSSGS L
Sbjct: 1021 KRPENSTSAPGNVPTFGAPVFPANSGVASSTQSTPVLPFSSSSTSFGLAANTGLSSGSYL 1080
Query: 1081 FGSSAPASNLFSSGTTFGLTGTSSSANNSVSSGAGTSSSFFNWQTSSTPSFSTGFSSTPT 1140
FGSSAPASNLFS GTTFGLT TSSSANNSVSSGAGTSSSFFNWQTSSTPSFSTGFSSTPT
Sbjct: 1081 FGSSAPASNLFSPGTTFGLTATSSSANNSVSSGAGTSSSFFNWQTSSTPSFSTGFSSTPT 1140
Query: 1141 GGFPFGLSSSSAASNSSPMLFGSSTTSASTTSMFSFTSAASSQPAFSNSNHGFTFGSTPP 1200
GGFPFGLSSSSAASNSSPMLFGSSTTSASTTSMFSFTSA SSQPAF NSNHGFTFGSTPP
Sbjct: 1141 GGFPFGLSSSSAASNSSPMLFGSSTTSASTTSMFSFTSAVSSQPAFGNSNHGFTFGSTPP 1200
Query: 1201 ANNDQANMEDSMAEDTVQAVTLPTPTFGQQPLTPPPSSGFMFGSAAPPPVAASPFQFGSQ 1248
ANNDQANMEDSMAEDTVQAVTLPTPTFGQQPLTPPPSSGFMFGSAAPPPVAASPFQFGSQ
Sbjct: 1201 ANNDQANMEDSMAEDTVQAVTLPTPTFGQQPLTPPPSSGFMFGSAAPPPVAASPFQFGSQ 1260
BLAST of CmoCh04G026210 vs. NCBI nr
Match:
XP_022990208.1 (nuclear pore complex protein NUP1-like [Cucurbita maxima])
HSP 1 Score: 2072.7 bits (5369), Expect = 0.0e+00
Identity = 1171/1314 (89.12%), Postives = 1179/1314 (89.73%), Query Frame = 0
Query: 1 MATEREEIRYEGGRGGKFQKRPLRRSHTTPYDRPPTALRNSAGNGWLSKLVDPAQKLITS 60
MATEREEIRYEGGRGGKFQKRPLRRSHTTPYDRPPTALRNSAGNGWLSKLVDPAQKLITS
Sbjct: 1 MATEREEIRYEGGRGGKFQKRPLRRSHTTPYDRPPTALRNSAGNGWLSKLVDPAQKLITS 60
Query: 61 SAHRLFSSVFRKRIPPPPPSLPVSRGFLLLLFLFVSWCFLVFNWNALLTIVRMVYGFCFS 120
SAHRLFSSVFRKRIPPPPPSLPVSR
Sbjct: 61 SAHRLFSSVFRKRIPPPPPSLPVSR----------------------------------- 120
Query: 121 HSGTQANDEMGNENHEEVAADLPGTQEGTNRDFGPSIKTDNTHGVTDLEQILKEKTFTR- 180
+ANDEMGNENHEEVAADL GTQEG N DFGPSIKTDNTHGVTDLEQILKEKTFTR
Sbjct: 121 ----EANDEMGNENHEEVAADLSGTQEG-NCDFGPSIKTDNTHGVTDLEQILKEKTFTRF 180
Query: 181 ------------------------------------------------------------ 240
Sbjct: 181 EIDRLTELLKSRVVDVPSGVEERKFEHASSTPVISYEIQEGSPKFRAQDGISSHVVPTQV 240
Query: 241 ---QVLDEDVASPAEIAKAYMGSRPPKSTPLSMASHSHKFGDSFASENLSKSSALTLVPR 300
VLDEDVASPAEIAKA+MGSRPPK+TPLSMASHSHKF D+FASENLSKSSALTLVPR
Sbjct: 241 MRANVLDEDVASPAEIAKAFMGSRPPKATPLSMASHSHKFRDNFASENLSKSSALTLVPR 300
Query: 301 SPGNFDVIENGFVTPRSRGRSALYNMARMPYSGVSATHSIKNSVATTDAYRATGSSSSQL 360
SPGNFDVIENGFVTPRSRGRSALYNMARMPYSGVSATHSIKNSVATTDAYRATGSSSSQL
Sbjct: 301 SPGNFDVIENGFVTPRSRGRSALYNMARMPYSGVSATHSIKNSVATTDAYRATGSSSSQL 360
Query: 361 AWERGRVLGSKQGALKRRSSVLDDEMGHVGPIRRLRHKSNLLYPTGLSLPSSSTSIPVSG 420
A ERGRVLGSKQGALKRRSSVLDDEMGHVGPIRRLRHKSNLLYPTGLSLPSSSTSIPVSG
Sbjct: 361 ALERGRVLGSKQGALKRRSSVLDDEMGHVGPIRRLRHKSNLLYPTGLSLPSSSTSIPVSG 420
Query: 421 IGSENAQQFQSTKVHPFSSSSGKGLYSRSLSKRSAESENDVKPSSSFSQIPLRSSEMASK 480
IGSENAQQFQSTKVHPF SSSGKGLYSRSLSKRSAESENDVKPSSSFSQIPLRSSEMASK
Sbjct: 421 IGSENAQQFQSTKVHPF-SSSGKGLYSRSLSKRSAESENDVKPSSSFSQIPLRSSEMASK 480
Query: 481 ILEQLEKLTPPKDKSSKLKLLSVTNNSPTKLSPSMLHGPALRSLEDVDSSKYLENVEDIQ 540
ILEQLEKLTPPKDKS KLKLLSVTNNSPTKLSPSMLHGPALRSLEDVDSSKYLENVEDIQ
Sbjct: 481 ILEQLEKLTPPKDKSLKLKLLSVTNNSPTKLSPSMLHGPALRSLEDVDSSKYLENVEDIQ 540
Query: 541 SNDARELTSQKNNKVEESSSLKYKLPINKEISAGDGLGSPVPTKDTVLSSRPQVSFVGAS 600
SNDARELTSQK NKVEESSSLKYKLPINK ISAGDGLGSPVPTKDTVLSSRPQVSFVGAS
Sbjct: 541 SNDARELTSQK-NKVEESSSLKYKLPINKAISAGDGLGSPVPTKDTVLSSRPQVSFVGAS 600
Query: 601 PQTKCAFQMSAHEDFVDIDEEGCSNGPVTDISFDRREKMDASLVAMSKPSDTEAITVDKP 660
PQTKCAFQMSAHEDFVDIDEEGCSNGPVTDISFDRREKMD SLVAMSKP+DTEAITVDKP
Sbjct: 601 PQTKCAFQMSAHEDFVDIDEEGCSNGPVTDISFDRREKMDGSLVAMSKPNDTEAITVDKP 660
Query: 661 QASAEVKPSTVSELNKINGQRKSDVPVTAEKSPIFSFATASPPSITTNAKDPESTLRPEK 720
QASAEVKPST SELNKINGQ KSDVPVTAEKSPI SFATASPPSIT NAKDPES LRPEK
Sbjct: 661 QASAEVKPSTESELNKINGQSKSDVPVTAEKSPILSFATASPPSITANAKDPESILRPEK 720
Query: 721 NISPEAPKPANAPIFGFGDKLPSQKESFSSAPTFAFGNKFAPSTNEQNAVPVATSESNVA 780
NIS EAPK ANAPIFGFGDKLPSQKESFSSAPTFAFGNKFAPSTNEQNAVPVATSESNVA
Sbjct: 721 NISSEAPKAANAPIFGFGDKLPSQKESFSSAPTFAFGNKFAPSTNEQNAVPVATSESNVA 780
Query: 781 PGKATFPIPANAATENGNKNTGSPFKFASPLVNEKESAKVGSSSVFKAESNSSSILSFGV 840
PGKATFPIPANAATENGNKNTG+PFKFASPLVNEKES KVGSSSVFKAESNS SILSFG
Sbjct: 781 PGKATFPIPANAATENGNKNTGAPFKFASPLVNEKESTKVGSSSVFKAESNSGSILSFGF 840
Query: 841 PKESMSDKAGDKSLSAGLSVGTSENLFSSSVSTSTSAPSLFSFSSPSTNSNLNNGSLVST 900
P+ESMSDKAGDK SAGLSVGTSENLFSSSVSTSTSAPSLFSFSSPSTNSNLNNGSLVST
Sbjct: 841 PQESMSDKAGDKRSSAGLSVGTSENLFSSSVSTSTSAPSLFSFSSPSTNSNLNNGSLVST 900
Query: 901 PSIFSSPATTFSNNITNQNPSIKPSLTTAPSNSEPATTTSLSMPSPVPSFSAAPIFKFGS 960
PSIFSSPATTFSNNITNQNPSIKPSLTTAPSNSEPATTTSLSMPSPVPSFSAAPIFKFGS
Sbjct: 901 PSIFSSPATTFSNNITNQNPSIKPSLTTAPSNSEPATTTSLSMPSPVPSFSAAPIFKFGS 960
Query: 961 PSVPSSSAPALSAAETKTKQETTFGNLSGIPPSDTSAAKVSSTGGSVFQFGAAATTDSNK 1020
PSVPSSSAPALSAAETKTKQETTFGNLSGIPPSDTSAAKVSS GGSVFQFGAAATTDSNK
Sbjct: 961 PSVPSSSAPALSAAETKTKQETTFGNLSGIPPSDTSAAKVSSIGGSVFQFGAAATTDSNK 1020
Query: 1021 RPENSTSAPGNVPTFGAPVFPANSGVASSTQSTPVLPFSSSSTSFGLAANTGLSSGSSLF 1080
+PEN TSAPGNVPTFGAPVFPANSGVASSTQSTPVLPFSSSSTSFGLAANTGLSSGSSLF
Sbjct: 1021 QPENLTSAPGNVPTFGAPVFPANSGVASSTQSTPVLPFSSSSTSFGLAANTGLSSGSSLF 1080
Query: 1081 GSSAPASNLFSSGTTFGLTGTSSSANNSVSSGAGTSSSFFNWQTSSTPSFSTGFSSTPTG 1140
GSSAPASNLFSSGTTFGLTGTSSSANNSVSSGAGTSSSFFNWQTSSTPSFSTGFSSTPTG
Sbjct: 1081 GSSAPASNLFSSGTTFGLTGTSSSANNSVSSGAGTSSSFFNWQTSSTPSFSTGFSSTPTG 1140
Query: 1141 GFPFGLSSSSAASNSSPMLFGSSTTSASTTSMFSFTSAASSQPAFSNSNHGFTFGSTPPA 1200
GF FGLSSSSAASNSSPMLFGSSTTSASTTSMFSFTS ASSQPAF NSNHGFTFGSTP A
Sbjct: 1141 GFSFGLSSSSAASNSSPMLFGSSTTSASTTSMFSFTSTASSQPAFGNSNHGFTFGSTPLA 1200
Query: 1201 NNDQANMEDSMAEDTVQAVTLPTPTFG---QQPLTPPPSSGFMFGSAAPPPVAASPFQFG 1248
NNDQANMEDSMAEDTVQAVTLPTPTFG QQPLTPPPSSGFMFGSAAPPPVAASPFQFG
Sbjct: 1201 NNDQANMEDSMAEDTVQAVTLPTPTFGQQPQQPLTPPPSSGFMFGSAAPPPVAASPFQFG 1260
BLAST of CmoCh04G026210 vs. TAIR 10
Match:
AT3G10650.1 (BEST Arabidopsis thaliana protein match is: nucleoporin-related (TAIR:AT5G20200.1); Has 61042 Blast hits to 31782 proteins in 2093 species: Archae - 202; Bacteria - 16480; Metazoa - 16017; Fungi - 12552; Plants - 1653; Viruses - 629; Other Eukaryotes - 13509 (source: NCBI BLink). )
HSP 1 Score: 332.0 bits (850), Expect = 2.0e-90
Identity = 455/1376 (33.07%), Postives = 643/1376 (46.73%), Query Frame = 0
Query: 13 GRGGKFQKRPLRRSHTTPYDRPPTALRNSA-------GNGWLSKLVDPAQKLITSSAHRL 72
G GGKF+K RRS TPYDRP T++RN+ G GWLSKLVDPAQ+LIT SA RL
Sbjct: 18 GTGGKFRKPTARRSQKTPYDRPTTSVRNAGLGGGDVRGGGWLSKLVDPAQRLITYSAQRL 77
Query: 73 FSSVFRKRI---------PPPPPSLP---------VSRGFLLLLFLFVSWCFLVFNWNA- 132
F S+ RKR+ P LP V + + + + NA
Sbjct: 78 FGSLSRKRLGSGETPLQSPEQQKQLPERGVNQETKVGHKEDVSNLSMKNGLIRMEDTNAS 137
Query: 133 -------LLTIVRMVYGFCFSHSGTQANDEMGNENHEEVAADLPGTQEGTNRDFG----- 192
+ +++ G F+ S + AAD E + G
Sbjct: 138 VDPPKDGFTDLEKILQGKTFTRSEVDRLTTLLRSK----AADSSTMNEEQRNEVGMVVRH 197
Query: 193 -PSIKTDNTHGVT-DLEQILKEKTFTRQVLDEDVASPAEIAKAYMGSRPPKSTP--LSMA 252
PS + D TH + ++ + + LDE +ASPA++AKAYMGSRP + TP L +
Sbjct: 198 PPSHERDRTHPDNGSMNTLVSTPPGSLRTLDECIASPAQLAKAYMGSRPSEVTPSMLGLR 257
Query: 253 SHSHKFGDSFASEN--LSKSSALTLVPRSPGNFDVIENGFVTPRSRGRSALYNMARMPYS 312
+ + F + KS ++LV + P +ENGFVTPRSRGRSA+Y+MAR PYS
Sbjct: 258 GQAGREDSVFLNRTPFPQKSPTMSLVTK-PSGQRPLENGFVTPRSRGRSAVYSMARTPYS 317
Query: 313 GVSATHSIKNSVATTDAYRATGSSSSQLAWERGRVLGSKQG---ALKRRSSVLDDEMGHV 372
++SV ++A+ S WE GS+QG LKRRSSVLD+++G V
Sbjct: 318 ------RPQSSVKIGSLFQASPSK-----WEESLPSGSRQGFQSGLKRRSSVLDNDIGSV 377
Query: 373 GPIRRLRHKSNLLYPTGLSLPSSSTSIPVSGIGSENAQQFQSTKVHPFSSSSGKGLYSRS 432
GP+RR+R KSN L S S ++PVS P S + G +
Sbjct: 378 GPVRRIRQKSN--------LSSRSLALPVS--------------ESPLSVRANGGEKTTH 437
Query: 433 LSKRSAESENDVKPSSSFSQIPLRSSEMASKILEQLEKLTPPKDKSSKLKLLSVTNNSPT 492
SK SAE D+ P SSF+ +P +SSEMASKIL+QL+KL ++K SP+
Sbjct: 438 TSKDSAE---DI-PGSSFNLVPTKSSEMASKILQQLDKLVSTREK------------SPS 497
Query: 493 KLSPSMLHGPALRSLEDVDSSKYLENVEDIQSNDARELTSQKNNKVEESSSLKYKLPINK 552
KLSPSML GPAL+SL++V++ K+L N+ + ++N + + + QK ES S + K
Sbjct: 498 KLSPSMLRGPALKSLQNVEAPKFLGNLPEKKAN-SPDSSYQKQEISRESVSREVLAQSEK 557
Query: 553 EISAGDGLGSPVPTKDTVLSSR----PQVSFVGASPQTKCAFQMSAHEDFVDIDEE-GCS 612
A DG +KD + + P + + P K +F+MSAHEDF+++D++ G +
Sbjct: 558 TGDAVDGTSKTGSSKDQDMRGKGVYMPLTNSLEEHPPKKRSFRMSAHEDFLELDDDLGAA 617
Query: 613 NGPVTDISFDRREKMDASLVAMSKPSDTEAITVDKPQASAEVKPSTVSELNKINGQRKSD 672
+ P +++ S ++M +KP +E PST N Q S+
Sbjct: 618 STPCEVAEKQNAFEVEKSHISMP--------IGEKPLTPSEAMPSTSYISNGDASQGTSN 677
Query: 673 VPVTAEKSPIFSFA--------TASPPS---ITTNAKDPESTLRP---EKNISPEAPKPA 732
+ E++ +F AS P+ I K S+ +P EK I E PK
Sbjct: 678 GSLETERNKFVAFPIEAVQQSNMASEPTSKFIQGTEKSSISSGKPTSEEKRIPLEEPKKP 737
Query: 733 NAPIFGFGDKLPSQ-------------KESFSSAPTFAFGNKFAPSTNEQ----NAVPVA 792
A P+ K +S+ F +A T + N+ A
Sbjct: 738 AAVFPNISFSPPATGLLNQNSGASADIKLEKTSSTAFGVSEAWAKPTESKKTFSNSASGA 797
Query: 793 TSESNVAP--GKATFPIPANAAT---ENGNKNTGSPFKFASPLVNEKESAKVGS--SSV- 852
S ++ AP + F ANA T NG+ T SP F + N VG S+V
Sbjct: 798 ESSTSAAPTLNGSIFSAGANAVTPPPSNGSL-TSSP-SFPPSISNIPSDNSVGDMPSTVQ 857
Query: 853 -FKAESNSSSILSFGVPKESMSDKAGDKSLSAGLSVGTSENLFSSSVSTSTSAPSLFSFS 912
F A NSSSI FG K S+ + +S SA TS F + ++ S
Sbjct: 858 SFAATHNSSSI--FG--KLPTSNDSNSQSTSASPLSSTSPFKFGQPAAPFSAPAVSESSG 917
Query: 913 SPSTNSNLNNGSLVSTPSI-FSSPATTFSNNITNQNPSIKPSLTTAPSNSEP-------- 972
S + + N + +T + F A+ +Q+ I +A + S P
Sbjct: 918 QISKETEVKNATFGNTSTFKFGGMAS------ADQSTGIVFGAKSAENKSRPGFVFGSSS 977
Query: 973 ----ATTTSLSMPSPVPSFSAAPIFKFGSPSVPSSSAPALSAAETKTKQETTFGNLSGIP 1032
+T + + P S + IF S S P + +SA+ T + S
Sbjct: 978 VVGGSTLNPSTAAASAPESSGSLIFGVTSSSTPGTETSKISASSAATNTGNSVFGTSSFA 1037
Query: 1033 PSDTSAAKV----SSTGGSVFQFGA---AATTDSNKRPENSTSAPGNVPTFGAPVFPANS 1092
+ + ++ V +STG SVF F A A+ T S + N A GN T S
Sbjct: 1038 FTSSGSSMVGGVSASTGSSVFGFNAVSSASATSSQSQASNLFGA-GNAQTGN-----TGS 1097
Query: 1093 GVASSTQSTPV-LPFSSSSTSFGLAANTGLSSGSSLFGSSAPASNLFSSGTTFGLTGTSS 1152
G +STQS P S S+ SFGL+ N+ L+S SS FG S +F+S +T L+ T+S
Sbjct: 1098 GTTTSTQSIPFQFGSSPSAPSFGLSGNSSLASNSSPFGFSKSEPAVFTSVSTPQLSSTNS 1157
Query: 1153 SANNSVSSGA---GTSSSFFNWQTSSTPSFSTGF--SSTPTGGFPFGLSSSSAASNSSPM 1212
SA++S + + GTS N +S P FS+ F SSTPT F FG SS++ S+++
Sbjct: 1158 SASSSSTMSSPLFGTSWQAPNSSPNSGPVFSSSFTTSSTPT-TFSFGGSSAATVSSTTTP 1217
Query: 1213 LFGSSTTSA-STTSMFSF--------------TSAASSQPAFSNSNHGFTFGSTPPA--- 1248
+FG+ST + S + +F F S SQ F NS GF FG+
Sbjct: 1218 IFGASTNNTPSPSPIFGFGSTPPTTPQQPVFGNSGTPSQSLFGNSTPGFAFGAPNNGNGI 1277
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q9CAF4 | 2.8e-89 | 33.07 | Nuclear pore complex protein NUP1 OS=Arabidopsis thaliana OX=3702 GN=NUP1 PE=1 S... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1GY88 | 0.0e+00 | 91.99 | nuclear pore complex protein NUP1-like OS=Cucurbita moschata OX=3662 GN=LOC11145... | [more] |
A0A6J1JSL2 | 0.0e+00 | 89.12 | nuclear pore complex protein NUP1-like OS=Cucurbita maxima OX=3661 GN=LOC1114871... | [more] |
A0A6J1FKS9 | 0.0e+00 | 71.84 | nuclear pore complex protein NUP1-like isoform X2 OS=Cucurbita moschata OX=3662 ... | [more] |
A0A6J1FF52 | 0.0e+00 | 71.63 | nuclear pore complex protein NUP1-like isoform X1 OS=Cucurbita moschata OX=3662 ... | [more] |
A0A6J1K059 | 0.0e+00 | 71.40 | nuclear pore complex protein NUP1-like isoform X2 OS=Cucurbita maxima OX=3661 GN... | [more] |
Match Name | E-value | Identity | Description | |
XP_022956154.1 | 0.0e+00 | 91.99 | nuclear pore complex protein NUP1-like [Cucurbita moschata] | [more] |
KAG7032922.1 | 0.0e+00 | 90.30 | Nuclear pore complex protein NUP1 [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
KAG6602242.1 | 0.0e+00 | 90.47 | Nuclear pore complex protein NUP1, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_023527371.1 | 0.0e+00 | 90.09 | nuclear pore complex protein NUP1-like [Cucurbita pepo subsp. pepo] | [more] |
XP_022990208.1 | 0.0e+00 | 89.12 | nuclear pore complex protein NUP1-like [Cucurbita maxima] | [more] |
Match Name | E-value | Identity | Description | |
AT3G10650.1 | 2.0e-90 | 33.07 | BEST Arabidopsis thaliana protein match is: nucleoporin-related (TAIR:AT5G20200.... | [more] |