Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
AAATCTATTTCACTATAGTTTTCATTTCTTTCCCCCACAAGAAAATGCTAAAGGAAAGGAGAAAGAGAAATACATTTCATTTGCTTACCGGTTACCGCCCAAAGACACTTTGGAGTGCGATGCTCCTAAAACCCCAGGTGGAAGAACATCGGTTGGTTCCAAACCCCTTCGGCTGAGCTTCTCTATCGATTCAGAGAGGAGCATTTTGCAATTCATGGCTTCCGTTGATTCGGGACCTTCATCCTTGATTCCACTGGAGGACGCCGGCGAAGGAGAGCAAATTGTAAGGAACGATCTCTACTTCCAAAAGATCGGCAAACCTGTTCCCGTCAAGCTCGGCGACTCCATTTTTGACCCCGAAAGTCCTCCCTCTCAACCCCTTGCTCTTTCAGAGAGTTCCGGTCTCATCTTCGTTGCTCATTTGTCTGGTTGGTAATTTCAATTACTTTCCTCATTGTTGTGATTACCATTGGTTTTTCTATGGTTTTGAAGAAAATATTTCTTGTTAAGGTTTTTTTGTGGTGAGAATCAAGGATGTAATTGCTTCGGCCGAGGAGATAAAAAATGGGGGAACGGGTTCTTCTGTCCAGGACTTAAGCATTGTGGACGTTTCCATTGGAAAAGTTCACATTCTAGCAGTTTCCACTGACAACTCAGTTCTTGCTGCCGTCGTAGCTGGTGATGTTCATATTTTTTCAGTCCAGTCGCTGCTTGATAAGGTAATTATTTTCGCTGGAGCTCGTCGTTAGACCCCTAATATCATACCGTAGTTTATTTGAAACGTCAATTCAGGTATTTACATCGACGGGTAAAATGAGGCAGTCATTAGTTATATCTCTTGCGCTCCATTCTCTCATGTCATTGCTATTATGCAGGCAGAAAAACCCTATTCTTCTTGTTCAATAACTGATTCCAGTTTTATTAAAGACTTCAAATGGACCAGAAAGTTGGAAAATACTTATCTGGTTCTTTCAAAGCACGGACAGTTATATCAAGGATCGGCGAATGGGCCTCTTACACATGTAATGCACGATATTGATGCTGGTATGCTATTTATGAATACAGTAACTTGTGTATAATTCTCAGCAACAATATCAAGTGGCATTTGTTAAGTTTCTTATGTTTTATATTAAATTACTGTATACTTCAAGAAAGTGGATCTTTTTGTAACATACTTGCATGGGGAGATGCTGTCTTTCCCTGCACATTTTTCCTTCCATAATTCTCCATTTCTGAGTGCCTCTTCATAAAAGTTAAACGAGCTCTCTAAAATCTTTCTAGTCAACTCTTACAGTTACTACATCTATTGTGAAGAGCTGTTGTCCGTGTATATTTCATTTAATCATTTCAACTTTGTTTCTTATTGAAGGACCTTCTGTTCTATTTATAATTGTAAGGACGGTGACTTGTAACTGTAATTTGTAAATGAGGAATCATTGCAACCTAGGCCCGTGGTTGTTATTAGCATTCGCCCTTAGGGAAACAGCAAAGGACCATCATTGAATTGAGACTCAATTCACTGCTTGGAAATGTTGGAAGGATGACATGGGAACACTGCTCATTGGCACAAAAGTTAGGGGAGCTAGGTCAGAGATAGGAATGTGACTGAAGATAGGATATTCATAGATTGTTGGACTTAGTTAGTTTGACTATTTATCTTAGAGATCTTTATTTTTCCAGGAATTAAAACTCATTTTTACTTTTTGATACTGAACAATTTTAATTATTGCATTTTTCCCTTTCTCTTGCATGTACCCTGCTCTATGCAGTGTCTTTGGCGTATATTTTCTTCTTCTTTTGTATTCTCTTGACCCTGGTTTTCTTACTTTTTTTCCCAAAGCAATCCATTAAAATGAATTATTGTCAGCAAAACTTTGTTTGTGTAGAGCTACTATTTTTGAACTTCTTTTAAATCTTATTTTGGATTTAGTTGAATGCAGTGTGAAAGGGAAATTCATTGCTGTGGCTAAAAAGGACACTCTTACCATTTTCTCACACAAATTCAAAGAACGACTATCCATGTCACTCTTGCCGAGTTTAGGGAATGGTGAAACTGATACAGACTTTACAGTGAAGGGTTCTCCCTCTCTCTATCTCTCCTGTGTGTGTGTGTGTGTGTGTGTGTGAAAATGATATTTATAGAATATTGAAATGTTTAACAATTTTCATCATTCATTTTTTATTTGGTTGGGGGGGGGGGGGGGGGGGGATATTTCTTGTATTTTACCTTTGTGATGTGTCATATGGGGTATGCGAGCTCAGTATTTTTAGGTTGGCTACAAGTTGAGTTTGTGTTGTTAGCTTGGACTGAAGTACAAGTTGAGTTTGAGTTGTTAGCTCGGTATTTTTATAATTATGATAAACGAAAATGACCATTGAGAAGAGGGAGAGAATACTGGTGAGCATATATATTCTAACACATAAGGAGCCAAAATAAATCTCCTATTACTGGGTTTTCGGTTCATCATGGCTAGTTTGGAAATTTGCCACTTCCTTCCCCTTTTGTAATTCTTTTTCATTACCACATTATTCCATTTCCTAAATAAAATCATAGGGAAACAATTAAATGGGCATTTCAAGAGAACAAGGGATTCTAAATTCTAATTAACTCTAAATAATCCAAAAAAATTAATATAATTAAAAAAGAAATTTCTGAAAACTAAAATAAATTGGAAACAATAGTAGTGCGTACCATACAACAAAACTGATATATTTGACTTCAATTATAATCTTACATTGGTTATTCTAGAAAGTAGTTATGGTTACTATCTTTTAACTTGATTTATGGTGATTTCTCAGTTGACTGTATCAAGTGGGTCCGTGCTGATTGTATCATCATAGGATGCTTTCAAGTGACCGCAACAGGTGATGAAGAAGATTACCTTGTCCAAGTTATCAGAAGTAAAGATGGAAAAATCACTGATGTGAGTAGTTGTGATTTTTTTCCTTTCCCCTCTCCTATTCCATTTCTATTTTCATGATAATGATTGTGTTCTTTATCATGCTTGTATATATTTTTTTTAATATTTGTGAGTGTCTAGACCAACTTACGTGCTCCTCGACTAATCTCATGGAACAACCTGCTTAACCTTACAACATTAAAGTTGTAGGATATTAAATCCTAGGTAGGTGACCTACATTTCTATTATCATGCTTGTACTTAGAGATAGGAGAGACAATAAGGAGGATGATAAGGGTTGCATTTGACTTGAGATGGTGAGAGAAATTTTAGAGGATTTAAGAGGTGTAGACTCATGCTAAACTTAATGCCTTTTCAAACATTTGTGACTTAAGGACCTTTGTAATTATCACGTATCATCCTCTAGGCCCTGTTCTTTTAGACTCCTTTTCCTTGTTATGTCCATTGTTGTTTGATTCAGTTTTTTGGTTGTTTTTTTTACAAGCCATTTGTATTCTTTCATCTCTCTCTCAAGCTTGGTTTCTTGATGAAAAGACAATCTTGTACATGGGTCTTATTGATGTTTATAATTAATTGTGAAACCTCAGTTTGTTTATCAATTTCCTTGAGTTTATCTCTTTGGCAAATCATTGTGGGATATTTGGGGTGGGGGGCAGAAACAACAAAGTGTTTAGAGGTTTGGAGAGGGAGGGAGTTTCTAATGCTGTTTAGCTCTTGGTGAGGTTCCATGTTTCTCTCTCGAATTTGGTTTCAAAAACTTTTCATAATTATCTTCATCTTACTAGTTGAAAACCATTTCTTTAAAAGGCTTTTTGTGGGCTTGGTTTTTTGTATGATCTTGTATTCTTTCTTTTTTGAAAGCAATTGTTTCTATGTAATTTTTTTTTTTGGGAAATACAAACATTTCGTGGGGAAATAACTCCTTGGATGTTGAACGCAAATAATGAGCCTGATTATTGCATATGGTAATTTACAGTGGACTTCACCAGATGTACAAGATATAGAAAATTTGTCTTTAAAAACAAACCATTGTTATTGATGAGAAGAGACTAATGCTCAAAATACAATGATATAAAAAAAACAATAAAGAGACAAATCCTTCTACCATATTGTAATTTACTCAATGAATGAGACTTGTTTCTGAAAAAATGCTTCTTTTGTGTACATTCGTGGATCGTGCAAGGTTGTTACCAATGAATGTGGGGTGGTTCACTTTTGGAGCTTTTATTGTTAAACGGGAGAGATGGGATGCTTAGGAACACTGAAAAACTTATGCTATCCATTCATGCTTATGAATGGTGATAAAGTACAAGGAAACTATTTCAAATTTATTTCATTTGAGACGGGAAGAGTGGATGGAGATCACAGCTTCAATGCCTACATTTGTACTTTTCTAGATAGGGCTTTACTTATTGGCAGAAAAGTGGAATTTCACGATGGTTTCTCGTTGACTACGGCTAAAATCTTTTAAGGGTCGGGTGGCAGGCCAGGTTTCAACTTGATAAACTTATCTTAGTTGCGGCACGATGGAATTAGCCTCCCCACAATAATTTTTTTTTACCACAACCTTCCATTTCATTCGAAAACGACATGAGCTTTCTTGTTCTAGAAGGGCATCTTGTCAAAAAGTCCAATTTAAGGAGCCCTCAATTAATGAATTATATGTATTCCAATGTTTGCCAACAATCATGGAACCCTCAACTCTATCCCTCCTTTGAATCAAATTTTATGCCTTGCAAACATTTCCCTTTTTAAATGTCAATCAAGAGGGAAAAGTTGTTTAAACATCTCTTGAAGCTAGTTTCCCAAAGATTTTCTCTATTCTATAAAAAGGGAGCTTCGGTGGCAGATTGGTGGGATACAACTTAGATGGAACCTGGGATGGCTTTCGGTTGGGTCAAAGAAACGATGGAATTAGATGGTCCTTTGATGTCTCAAGCTTATTCTCCACCCTAAGAAGTTGAAGGGTTTTCGGTTGGGTCTGAGAAAGTACGGAATTAGATGATCCTTGGATGGCACAAGTTTTTTGTCCACCAAATCCTTGTTTTTTCACTGTCTCACCATTTTTTAACATGACCATGACAAAGTTGCTTTGGGAGTTTGAAATTTCAAAAAGTGATGATTTTCCTTTGGTTGCTAACATAGGAGATTAAATACTCAGGAGAAAATGCAAGCACAAAGTGTCCTTGAATTGCCTTCAACACTTCTATTTGGGATCTTTGTTTGGTATGTGCCTAGTCTTTGGACCATATCTCCTTACACTTCCTCTTGCGCCTTTTTGATTTGCATATCTGTCTATCCAAGTGGGTGGATTGGTGGTTACCTAAATCCCTCAACAATTGGAGTTTGGAAGGAACGTGTTAAATCAATCATAACTCAAAAGCTTAAATTGACGAATAAAGGTAAATTTTATATTATCAACACTCTCTTCACTTGTGGGCGTGAAATATGTTGAATACCCAACAAGTGGAAATCAATCTTTGTTAAATTTGATAGTATGAGGTTTCATCTCAAAATCAATTGGCAATGAGAGGAGTAACCCATCTATCTTATCTGGAGTATGAGTCCCTTTGGTTTTTTCAATGTGGGACTCCCAACTTCTCCAATATGCCCCCTCAAGATGGTGCCTCTTTGGGTTCACCTATCTTAGACCGAATACCTTTGGGTTTAATGGGCTCTGATACCATGTTAAATTTGAGAGTATGGGGTTCCATCTCAAAACCAATTGACAATGAGAGAAGTAGTCCATCTATCTTATATGGAGTATGAGTCCCCTTGGTTTTTCCAATTTGGGACTCCCAACTTCTCCAATAATCTTAATTGAAAAGGAAATGACATTGCATAGATTTGAATACATGACCTTGTGACCACCTTCTTTGATACCATGTTAAATCACTGGTTGACCCAAAAATGTAATTGATGGATGAAAGGTAAATGTTATATCATCTAATAGAAAAACTATATATGTAGATTACTTCGGGATCCTTTTTATGGGGTTTATGGCTTGAATTTAACAAGAGGATTTTTGAAAATCAGTCGACCTTTTGCTTCTTTCTTTCCCCACTTTGGAGTTATTGTATTTTGAGTATTAGCCTCGTTTCATCATTTCGATAAAAAGTTTTGTTTCCTTGGAAAAAATGAAGACTAGTCCACTACTTTTTCTACTTTTTGAGATTTTGTACAACTAGATGCTTCATGGTGGAGACATAGTTACGGCAAATTTTTCTGTTATGGATCCATTCCATGATCCTTCATGATTGGAAGGCTTTTTGCCGGTACCTTTCTTGAGTGGTGAAGCTTTCTTTTCCCAGCCTTTACATTTTTTTGTTTCCTTTTTTCTTTTTGTTACCTTCAAACCGTTAACTGAAGAAGGGATTTATCATGAAATATCCTATGATTCCTGCCAAACCAAATTTTGGAAAAGAGAGCTTTGATTGCATCAACTATAGTTGAGCTTTATTAGATAAACCATGTGCGGCCTTATTAGTAGAAAAGGAGTAGTTTAGGCTGAAACTTGAGTGGGATTGGCAGATGGTTGTGTGAGGGAACTTTAGCAGTCTATTGTTCAAGGGAATGATAGTTAAGTGTTTTAATATTTTCTTTTTGTGTTCATTGAATCTTTCATTTACCTGTGAATCATGTATCGTGTTATTATTATTAGTGCCCTATAAAAGTTAGTGTCCTAGAATCAAGGTCAATCATCTTGGATTAAACCGTCACCTTCTTTTTTTGGTTGTAAATTTGTTATTACTATTCTGATCCTCTGAATGAGAATGATGCAACTTTATCTAAAATTGACGGTGAAAGTTGTGAACTATCGCAGTTTTCATATATTTTTCACTAAATCGTGTTAAATGTATGCAACTTGTTTAAAACAATACAATCATAGAACAGTGTCATAAACTGATCGTCATGATGAATTGAATTTGTTATTTATTTATTTATTTTATATGTTTAGTAAATTAATTTTTGTAACTCTCTTTTGTAGGTTTCTTCAAACAAAGTTTTGTTATCATTCTGTGATATACATTCAGGTTTCACTCGTGACATTTTGCCTGGTGAAAGTGGGCCTTGTTTATTGTTGAGCTATTTGGATACATGGTATGCAGAAATTACTTGATTTGATTTCGATGCTTGAAAGTGATTTTTTCTTGAAATTCCTTATTTTTTATTACTACCTTTTCTTCCTTTAAACTTTTGGTTATCATTTCATCAATTTTTCTTTCTTCTTTTTGTTGCTGTGGGATTTTAATTATAAACCCACATATCAACTTTATTTTTTTTCTCTTTCTTGATATGGATGCAGCAAGCTTGCAATTGTTGCAAATAGGCTCTATGTGGAAGATCATATTGCATTGCTTGGTTTGTTGCTAGAGGTTGAGAATGAAGTTGCAGTTGTTAATATTGATAGGAATACCTCTCTCCCGAAGATTGAGCTTCAAGGTTAGTGATCTCGTGATTAGTGTTATGAAAGACTTCTGAAACTTTAAAATGATGTTGATTGCTGAATAGTGAATATAACCTTGAGTTAGCTATCTAAAGGCTACAGTCAGCCTGACAGCTTGTGTAGTTCAAAAGTAACTAATTTAGCATACCTTAAGTTTAAATATAAATTCTGATTTGTGAATATAAATAGTAATAAATCAAAGAAACTGATCCTAATTCTAATGAATTAAGGATTTGACCATTATATCTTAACCCTACTACATCACGTTTTATGCTATTGTTTTTGTAAGATAGTTCTACTCGTTTTTTAGTCAATGTTGGCCCTCTTGTATTTCGATTTTTTTTCTTAATGAAAGTTTGGTTTCTTGATTAGAAAAAAATAATAGTTACAATTTGGTAAATCTGAGTTATAATTTGAGACATTACATTATGTGATTATTTTATATCGATTCTTCTAATAGTTGCTTGGATTTTGTTTTCTTTTAGGGTTTATCTATTTGATTTTCCTGGTTTACTCATGCATACACAAAGTAAAATGATCATTTTTTGTTAGAAAGTTGGGGTTGGGTTGTGTAGTCAATTTTTACATACACATTTATGCATATGCTCATTGATTAGGTTTTTATTTTTAGATTCTGCAGCATCTCTTCTTCTGTGAGATGATTCTATCTCATTCTTGTTTTGTTTCAGCGAATGGAGATGATAATTTGGTTATGGGGCTGTGTATTGATCGAGTTTCTCTTCTGGGGAAGGTGATAGTTAAGGTTGGATTTGAAGATATGAGAGAAGTCTCTCCATATTGCATTCTCGTGTGTCTTACTTTAGAGGGAGAGCTCATTATGTTTCAATTTTCTAGGTAATGACTTTTATGTTTTAAACCTTGTTTGTTATTGTACCAGAGGCCCAAGTCTATTTTTACCTTAAATAGATGTTTCTATCTTTTGCTTTTTATAGCAGCCTGGATATTATGATTATTGCTCTTGATTCCCTTTTACTATTGGTGTTCAACTTTTCATGCAGTGTCAACGAAACTGAAGCTCCACATGAGACTGTTTCTGCTTGTGATGATGAGGAAGATGATATAACAGTGCCCACTGATGATCGTTCTGAATCAAAGGAGTCTAGAGAAGCAAACATAGATCATAGGATGCAAGTTACGGAAAAAATCGCAATCAGTAGTGAGATTCCTAGGGAAAAGGGTAAAACTTCAAATGACATTAAGTCTTCTAGGAATGATCAAAGTCTAGTATATAACATAGATGAGAGTGCAATTGTTAGCCCAGAGGGTAATACTAAAAGTCAGAAAGTGGATTCTTTCATTTATTCACAATCATTGAAGTCTTCTGCCCCGGAGAGACCACCTCACTATGAGATTGGGAACTTTGATAAGCCAGTTCTAAAATTTACTGGTCTCGGGTCTGCTTCTATTTCAGGAAAGTCTGAGGATGTGCCTAGCCAGCCCTTTCCCAATGTAAAAGAATCAACAAAAAGATTGGGGTCAACTGGCTTGATGGCTGCTTCTGAGTTATCCAGTGAGAAAGCAATGTCTTTTAAAAAAATTGATCCAGTACCTTCAGTCTTTACTTCAAATTCTCTTCAAAGCAGCAACACTGAGAATTATGGACCAAGTTTTGGTACAGCAAATGCTTTCACAGGTTTTGCTGGAAAACCTTTTCAACCAAAGGATGTTCCAAGTACATTAACACAAAGTGGGAGACAAGCAACGGGAGGTGCTGGTAAAATTGAATCTTTACCAGTGATACGTAGCTCACAAATATCATTGCAAGATAAGTTCTCGTCAGGGAAAATTTCTAATGAGAAACATGATGGTTCAGAGCGATATTACAGCAATTCCCCCCTGGCAAAACCAGTAAGTTCTGGACGAATTTTATTCATACAATTTGTCAATGTGAAGATATTTGAGTCTAATATAGGAATGGCTTGGTTTAAAATCGGCCATTCTTTGGCTTGCTCATACTCCTCCAATCCATCGTCTTTGGAAAAGAATCTTTGTAAAGAAAATACAGACTGTCTCCATTTTTACTAGTATAAAAGGAAAAGTGGATAAAACACTTGCATGAGCTTTCCTCTCCCTATATATATACCTCTTAGGAACATCTTGAATTAGAAAATGAGATTTCCTTTGGTTGAATGATGACCCGTGTGTGACTAGCGAAATCATGAGGCCCATGTGTTTTTATATGTGAGTTCATATTGTATTTTTGCCGTTTAATTAGTCAACAAACATCTCTCTGTTTGTGTAATTATTGGTCGTGACTAGGTAACTAAAAAACATCCCCCCCCACCATAGACTTGTTTTATTTTTTTGATAGTTGTGTTTCTCTTCTATTCAAGACATTTGTCTCAATGGGAATACCTTTATTTTTCTGTTATAATTTATTATGTTTTGAAAATTTTTGTGTCGATTTTTATTAAGATTCTTTTCATCACATCAATGAATTTATTTTATTTTTAAAATTTTATTTCCTTTTAAAACCAAGGAGAAAAGGTGTCCCACCCTCAAACTGCAGTCTCCAGCCAGCTTTAATATTGTTTCTTAATAACTTTCTCCCTTCAATGCTCCCTTTTCCCTACAGATGAAGGAAATGTGCGAAGGGTTGGACACACTTCTCGAATCTATTGAAGAGTCGGGTGGGTTCATGGATGCCTGCACTGCTTTCCAGAAAAGCTCCGTTGAAGCTTTGGAACTTGGCTTAGCCAGTCTTTCAGATGGATGCCAAATATGGAGGGTAACTGTCAATTGTTATTTTATATTTTTCAGTTTGTTAATAATTATCTCCTGTATTTTGTTAGACGGGGAATATGTTTGTTAATTAATATAGCCTTATTATTTATTTATTATTTAAAGTACTAGATTCAGGAAAGGGAGAAAAATGTAAAAAGGAAATTTATGGAGTAGACAGTGACTGCTGGGCTTCTCATCTAAGTAGTAATGGTCTGTTTAAGAGATTAGCAGTGGAAAGAACAGAGAAATGACGTTAGCTTGATGATGGAAGCATCTCATAAGAGAAATACTTGAAAAAGAAAATGTATGATATGAGAAAAGATGGCCAGTACTCCTTTCCTAGGGACTGATGGCCGTTCCTCGATACTAACTGAGAATCAACGAACATGTTTGACTAGAAAATTTAAGAAAGAGCTGAGAAACATTAGGTAATTGATTTACAAATGACTCATTCTATAAAATATTGAAAGTATAATTCAGGTTTCAAGTCTTTTCTAGGGATGTGACTAATTGAAGATTCATTACATGATTAGTTGCTTGAGTTTCTTCATGATTGTTTCCTCAGATTTGCCATTTATGTGGATTTAATGGCGGTTTCTGTAGATATATGTTGGGATAATTGTTTTAACTAGCAAAACTGCTTGAAATTTCAAATATAGCAAAATGTCACCGTCTATTAGTGATAGACGACCCTCACTAGATTTCTCTGGTGTCAAATTCTTTATGAAATTATCCTTTAGGTTTTGTGTTGCTTGCAAGGACTCTCTTTTGTGGCGATACTCCTTTTTGTTGCCTTAGCATTTTTATGGATGTCCTTTTAATATTATTTATATTATTTTTTATTTTTATACTTTTTTTTACATTTTTTCTCAATGGAAGTTTGGTTCTTTATATAAAAAATGTGTTTAAATAATGCCCTGCCCTCTCTTACCTTCCCCAAAGTTTACATTTTTTTAAATTCTTGCAATTGGAATCATATGCTTTTATTTTGGGTTCTTTGTTTAGGGCTCCTTTTCTTTTCTTTACCCTATTTTTTTCCTCGCTATCAATATAGTTATCTAACACGGGGATTTCTTTTTCCTTCTTTTCATAGAGCACAATGAATGAGCGTTCGCAGGAGGTACAAAATCTCTTTGACAAAATGGTTCAAGGTATTAGAAGCTCAATTTCTTGTTTGTTTATACATTATTGATGGTGACCCTGTTGGTGTAGAATGTTCCTTCCTGATCTATCTGATATTGGAAAAAAGCCTTTATTTAGAATGCATTGAATGTTTGAACTCAAAGAAATAGAACTCTGGAATTTCATTTACATCCTTTTCAAACATTCAGTCTAGATGTTTATAAAAAATCTTATTGGTTATCACGGGAATTTGATTGACATTTGCAAATAACAAGGCTGGAGATCTGGAGTACTGGGATTTTGTTGAAATAAGTAGCAATTATGTCAGTCCATTCATCAAGTTGACTTGGTGAAATATTAGATTGAAAATCATAATTGATTAATCTCTGGTGAAAATTTCATAGATATTTTACATTCTCAGTCTATTTGGTTTCAGTTTTGTCAAAGAAGACATACATTGAAGGTATTGTGATGCAATCTTCTGACAGCAAGTATTGGGAACAATGGGATCGTCAAAAGTTAAGTTCTGAATTAGAGCTGAAGAGACAACATATCTTAAAGATGAATCAGGTAGTCTATTCTTGTTTCGAAAGTCATTGAATTTCCCCAATTTTAGACGTTATTCATATTCTTCTCTTTTCCAGAATATAACCAACCAGTTAATTGAGTTAGAAAGACATTTTAATGGTCTTGAGCTGAATAAGTTTGGTGGAAATGAGGAAAGTCAAGTAAGTGAAAGAGCTCTTCAAAGGAAATTTGGTTCGTCAAGGTACTTCCATTTTCTTTTTCTGGCTCTTAGATGTAAGACTTTCGTAATCTTTGTTCTTTTGTTGTTTTTAATTGTATTGCTTTACTTCTATTCTTATACTTCTTTCACTACTTTCTGGAGTTTGTATCTTTTGAGCAAATCTCTTTTCATTATATCGTTGCAAAGTTTTGTTTCTAATTTTGAACATGTGAGCAAGAAAATCCTGTCTTCATGATGATTGAAATGAAGTAATTTATTATCTCTGTTATCATTCACTTTTTAGCTGGTTTTTTCTTGAGTAGCTTGACACTGGTTTCTTCATCAAAATTTGTTACAGGCATAGTCATTCAGTACATAGTTTGAATAACATTATGGGGTCTCAATTAGCAACAGCTCAACTTCTTTCTGAAAGTCTATCAAAACAATTGGCTGCACTCAATATGGAATCACCCTCTTTGAAAAGGCAGAGTGCCACGAAGGAATTGTTTGAGAGTATTGGACTTACTTATGATGCTTCTTTCAGTTCTCCAAATGTGAACAAAATTGCAGAAACTTCTAGCAAGAAGCTTTTACTTTCTTCTGATTCTTTTTCAAGTAAAGGTACATCGAGAAGAAAACAACAGAGTGGAACTAAAAATTCTGAAGCAGAGACTGGGAGAAGGAGAAGAGATTCACTTGACAGGGTACTGTTTCATTGAACGTCTATTTTTCTACTCTTACCTTTGTTTCATATGGAATTTCTTTTTGTTAATTGAAAATGATATTTATATTACATGATTAAAATAAATTTGGACTAGATGTGCCATACACAATAAAATAAAATGATGAAGTTGTTCGCTGACCACTTTTGCTAGGAAATAAATGTGCCATATCCATCCTCTAATTTTGCTATGCTTTTGTGTCCTAATATTTATTCCCTGTAAAATTTGTGTAGAACTTGGCTAGTGTTGACCCTCCAAAAACAACTGTCAAGAGGATGCTTTTGCAAGGAATTCCTTCCTCTGAGGAGAAACAATTTTGTTCTCGCACACCTGAAGGAGCTGCAACAGTTGCACGGCCTGCTAGTCGCATAACATCATCTATTTCATCGTCATCCAAAAATGCAGGTAATCTCTAAGATAAAGCATAAGCAGTAGGTTTTTATAGTCATTTGCTTGCAACTTTCACAAATTCTATAAGATCTCCTTTTGAGTTTTGAATGTTTCAGTTCGAGGGTATTAATAGGAGATTTAATAAGGGCATAGTGATAATTAGCTACCGGTGCTTTGGTTATAAATAAGATAAGTTATGTACTTGAGGAGTGGGAATCATTTTGGTATCCTTGGTAGATTTGTTGAGGGAGAGACATAGTCTTCTTGAATGGCTATTGATATTGCAAAAAGTTAATCTTTGATGTTTTCAATGTATTTTCTGTGTTTTGGTTACCTAACATCCAGCTTATATTTATTCTACATATGCAGGACATGACTCTGAGAACCCGGAGACTCCTTTTATGTGGAATAGCCCTTTACAACCATCTAATACCTCTAGACAGAAATCTCTACCTTTGCAAAAAATTAATGTGACACCACCATCTCCACCTCCAGTATTCCAATCATCACATGATATGCTGAAAAAAAAAAATAATGAAGCTCACAGTGTGACTTCAGAAAACAAATTTACAGACGTGGCGTGTCCTGAAAAGTCAAAAGCTTCTGATTTCTTCTCAGCCACTAGGAGCGACTCTGTCCAGAAATCTAATATAAACGTTGATCAGAAATCATCCATCTTTACGATATCATCTAAGCAGATGCCCACACCGATAGATTCTATTGCTACCTCTAATGTGGACAATCAGAAGACTGCTAACGTAAAGGAGAGGCATACAACTACAAGTCCATTTTTTGGATCTGCAAATAAGCCCGAATCTCCATTTGTTGGTTCGATGCCTTCTCTGGTTCCTACTGTTGATGGATCAAGAAAGACTGAAGAAAAAAAATCAGTGACAACAATTTCACAATCAGTTTCAGCACCGGCTCCGTTAAATACTTCTTCAAGTGCATCGACTTTATTTTCAGGATTTGCTGTAAGCAAAGCTCTTCCAAGTTCTGCTGCTGTTATAGATCTCAATCAACCTCCGTCAACATCAACCCAATTGAACTTCTCCTCTCCGGTTGTTTCTAGTTCTAATTCCCTATTTCAGGCACCTAAGATTGTACCAACATCACCCACTCTATCTTCTTTGAATCCTACATTGGAGTCCTCGAAAACAGAGCTATCGGTTCCGAAATCAAATGATGATGCTGAAGAGCAAATACTATCTTCGAAGCCTGGGTCTCATGAACTGAAATTTCAACCTTCTATAACACCTGCCGATAAAAATCATGTAGAGCCGACTTCTAAAACCCAGACTGTTTTCAAAGATGTGGGAGGACAGGATTCAAATGTAGTAGGGAATGCTCAACCACAACAGCCATCTGTTGCTTTTGCTTCAATACCTTCACCAAACTTAACTTCTAAGATTTTTGCAAATAGTAGAAATGAAACTTCAAATGCAGTGGTTACTCAGGACGATGATATGGACGAGGAGGCCCCAGAGACGAATAACAATGTTGAGTTTAATTTGAGCAGCTTGGGAGGATTTGGAAATAGCTCTACCCCTATATCAGGCGGTCCTAAACCAAATCCATTTGGTGGTCCATTTGGTAATGTGAATGCAGCCTCAATGACCTCTTCCTTTAATATGGCATCTCCTCCAAGTGGAGAGCTGTTCCGGCCTGCATCGTTTAGCTTCCAATCTCCATTGGCTTCACAAGCAGCATCACAACCCACAAATTCTGTTGCATTCTCTGGTGCCTTTGGCTCTGCAGTGCCTACTCAACCGCCTTCGCAAGGTGGGTTCGGTCAGCCTTCTCAGATTGGAGTAGGACAGCAAGCACTTGGTAATGTTCTTGGTTCATTTGGACAATCAAGACAGCTTGGTCCTACTGTCCATGGAACTGGTTCAGGATCACCTGGGGGTTTTAGTGGTGGCTTTACGAATGCAAAACCTGTTGGAGTTGGAGGTTTTGCAGGTGTCGGTTCAGGAGGTGGCGGCGGTTTTGGAGGTGTTGGTGGTTTTGCTGGTGCAGCCTCAACCGGTGGAGGATTTGCCGGTGCTTCCTCTACGGCAGGAGGTTTTGCAGGTGCTGCAGGTGGAGGCTTTGGAGGTACTGCAGGTGGATTTGGGGCGTTTGGCAGCCAGCAAGTAAGCGGCGGTTTCTCTGCTTTTGGTGCTGCTGCTGCTGCTGCTGCTGCTGGTGGAGCCGGAGTAACTGGAAAACCTCCTGAGCTTTTCACCCAAATTAGAAAGTAGAGTTTAGATCCACTATTTATTGATTGGGTTCATAAGCAGAGAGTAGAATGTAGATTATTTGTGTTGCACCCTTTTTCAGTTTTTAACTCTTTTTGGATCTTAGATAACCTTCACATTATTGATTAACTTCACATAATAGATTAAGAGATTAAGGCATATATATATATGTTTTTTCCCCTTTCAATATTATACTTTTTGAGGTTATTTTGACTGTTTATCTTTGACATGTACATTCAAGATGAAGTCATTTTTATTATTAGGATTTGTAGTGCAAGGATGC
Coding sequence (CDS)
AATCTATTTCACTATAGTTTTCATTTCTTTCCCCCACAAGAAAATGCTAAAGGAAAGGAGAAAGAGAAATACATTTCATTTGCTTACCGGTTACCGCCCAAAGACACTTTGGAGTGCGATGCTCCTAAAACCCCAGGTGGAAGAACATCGGTTGGTTCCAAACCCCTTCGGCTGAGCTTCTCTATCGATTCAGAGAGGAGCATTTTGCAATTCATGGCTTCCGTTGATTCGGGACCTTCATCCTTGATTCCACTGGAGGACGCCGGCGAAGGAGAGCAAATTGTAAGGAACGATCTCTACTTCCAAAAGATCGGCAAACCTGTTCCCGTCAAGCTCGGCGACTCCATTTTTGACCCCGAAAGTCCTCCCTCTCAACCCCTTGCTCTTTCAGAGAGTTCCGGTCTCATCTTCGTTGCTCATTTGTCTGGTTTTTTTGTGGTGAGAATCAAGGATGTAATTGCTTCGGCCGAGGAGATAAAAAATGGGGGAACGGGTTCTTCTGTCCAGGACTTAAGCATTGTGGACGTTTCCATTGGAAAAGTTCACATTCTAGCAGTTTCCACTGACAACTCAGTTCTTGCTGCCGTCGTAGCTGGTGATGTTCATATTTTTTCAGTCCAGTCGCTGCTTGATAAGGCAGAAAAACCCTATTCTTCTTGTTCAATAACTGATTCCAGTTTTATTAAAGACTTCAAATGGACCAGAAAGTTGGAAAATACTTATCTGGTTCTTTCAAAGCACGGACAGTTATATCAAGGATCGGCGAATGGGCCTCTTACACATGTAATGCACGATATTGATGCTGTTGAATGCAGTGTGAAAGGGAAATTCATTGCTGTGGCTAAAAAGGACACTCTTACCATTTTCTCACACAAATTCAAAGAACGACTATCCATGTCACTCTTGCCGAGTTTAGGGAATGGTGAAACTGATACAGACTTTACAGTGAAGGTTGACTGTATCAAGTGGGTCCGTGCTGATTGTATCATCATAGGATGCTTTCAAGTGACCGCAACAGGTGATGAAGAAGATTACCTTGTCCAAGTTATCAGAAGTAAAGATGGAAAAATCACTGATGTTTCTTCAAACAAAGTTTTGTTATCATTCTGTGATATACATTCAGGTTTCACTCGTGACATTTTGCCTGGTGAAAGTGGGCCTTGTTTATTGTTGAGCTATTTGGATACATGCAAGCTTGCAATTGTTGCAAATAGGCTCTATGTGGAAGATCATATTGCATTGCTTGGTTTGTTGCTAGAGGTTGAGAATGAAGTTGCAGTTGTTAATATTGATAGGAATACCTCTCTCCCGAAGATTGAGCTTCAAGCGAATGGAGATGATAATTTGGTTATGGGGCTGTGTATTGATCGAGTTTCTCTTCTGGGGAAGGTGATAGTTAAGGTTGGATTTGAAGATATGAGAGAAGTCTCTCCATATTGCATTCTCGTGTGTCTTACTTTAGAGGGAGAGCTCATTATGTTTCAATTTTCTAGTGTCAACGAAACTGAAGCTCCACATGAGACTGTTTCTGCTTGTGATGATGAGGAAGATGATATAACAGTGCCCACTGATGATCGTTCTGAATCAAAGGAGTCTAGAGAAGCAAACATAGATCATAGGATGCAAGTTACGGAAAAAATCGCAATCAGTAGTGAGATTCCTAGGGAAAAGGGTAAAACTTCAAATGACATTAAGTCTTCTAGGAATGATCAAAGTCTAGTATATAACATAGATGAGAGTGCAATTGTTAGCCCAGAGGGTAATACTAAAAGTCAGAAAGTGGATTCTTTCATTTATTCACAATCATTGAAGTCTTCTGCCCCGGAGAGACCACCTCACTATGAGATTGGGAACTTTGATAAGCCAGTTCTAAAATTTACTGGTCTCGGGTCTGCTTCTATTTCAGGAAAGTCTGAGGATGTGCCTAGCCAGCCCTTTCCCAATGTAAAAGAATCAACAAAAAGATTGGGGTCAACTGGCTTGATGGCTGCTTCTGAGTTATCCAGTGAGAAAGCAATGTCTTTTAAAAAAATTGATCCAGTACCTTCAGTCTTTACTTCAAATTCTCTTCAAAGCAGCAACACTGAGAATTATGGACCAAGTTTTGGTACAGCAAATGCTTTCACAGGTTTTGCTGGAAAACCTTTTCAACCAAAGGATGTTCCAAGTACATTAACACAAAGTGGGAGACAAGCAACGGGAGGTGCTGGTAAAATTGAATCTTTACCAGTGATACGTAGCTCACAAATATCATTGCAAGATAAGTTCTCGTCAGGGAAAATTTCTAATGAGAAACATGATGGTTCAGAGCGATATTACAGCAATTCCCCCCTGGCAAAACCAATGAAGGAAATGTGCGAAGGGTTGGACACACTTCTCGAATCTATTGAAGAGTCGGGTGGGTTCATGGATGCCTGCACTGCTTTCCAGAAAAGCTCCGTTGAAGCTTTGGAACTTGGCTTAGCCAGTCTTTCAGATGGATGCCAAATATGGAGGAGCACAATGAATGAGCGTTCGCAGGAGGTACAAAATCTCTTTGACAAAATGGTTCAAGTTTTGTCAAAGAAGACATACATTGAAGGTATTGTGATGCAATCTTCTGACAGCAAGTATTGGGAACAATGGGATCGTCAAAAGTTAAGTTCTGAATTAGAGCTGAAGAGACAACATATCTTAAAGATGAATCAGAATATAACCAACCAGTTAATTGAGTTAGAAAGACATTTTAATGGTCTTGAGCTGAATAAGTTTGGTGGAAATGAGGAAAGTCAAGTAAGTGAAAGAGCTCTTCAAAGGAAATTTGGTTCGTCAAGGCATAGTCATTCAGTACATAGTTTGAATAACATTATGGGGTCTCAATTAGCAACAGCTCAACTTCTTTCTGAAAGTCTATCAAAACAATTGGCTGCACTCAATATGGAATCACCCTCTTTGAAAAGGCAGAGTGCCACGAAGGAATTGTTTGAGAGTATTGGACTTACTTATGATGCTTCTTTCAGTTCTCCAAATGTGAACAAAATTGCAGAAACTTCTAGCAAGAAGCTTTTACTTTCTTCTGATTCTTTTTCAAGTAAAGGTACATCGAGAAGAAAACAACAGAGTGGAACTAAAAATTCTGAAGCAGAGACTGGGAGAAGGAGAAGAGATTCACTTGACAGGAACTTGGCTAGTGTTGACCCTCCAAAAACAACTGTCAAGAGGATGCTTTTGCAAGGAATTCCTTCCTCTGAGGAGAAACAATTTTGTTCTCGCACACCTGAAGGAGCTGCAACAGTTGCACGGCCTGCTAGTCGCATAACATCATCTATTTCATCGTCATCCAAAAATGCAGGACATGACTCTGAGAACCCGGAGACTCCTTTTATGTGGAATAGCCCTTTACAACCATCTAATACCTCTAGACAGAAATCTCTACCTTTGCAAAAAATTAATGTGACACCACCATCTCCACCTCCAGTATTCCAATCATCACATGATATGCTGAAAAAAAAAAATAATGAAGCTCACAGTGTGACTTCAGAAAACAAATTTACAGACGTGGCGTGTCCTGAAAAGTCAAAAGCTTCTGATTTCTTCTCAGCCACTAGGAGCGACTCTGTCCAGAAATCTAATATAAACGTTGATCAGAAATCATCCATCTTTACGATATCATCTAAGCAGATGCCCACACCGATAGATTCTATTGCTACCTCTAATGTGGACAATCAGAAGACTGCTAACGTAAAGGAGAGGCATACAACTACAAGTCCATTTTTTGGATCTGCAAATAAGCCCGAATCTCCATTTGTTGGTTCGATGCCTTCTCTGGTTCCTACTGTTGATGGATCAAGAAAGACTGAAGAAAAAAAATCAGTGACAACAATTTCACAATCAGTTTCAGCACCGGCTCCGTTAAATACTTCTTCAAGTGCATCGACTTTATTTTCAGGATTTGCTGTAAGCAAAGCTCTTCCAAGTTCTGCTGCTGTTATAGATCTCAATCAACCTCCGTCAACATCAACCCAATTGAACTTCTCCTCTCCGGTTGTTTCTAGTTCTAATTCCCTATTTCAGGCACCTAAGATTGTACCAACATCACCCACTCTATCTTCTTTGAATCCTACATTGGAGTCCTCGAAAACAGAGCTATCGGTTCCGAAATCAAATGATGATGCTGAAGAGCAAATACTATCTTCGAAGCCTGGGTCTCATGAACTGAAATTTCAACCTTCTATAACACCTGCCGATAAAAATCATGTAGAGCCGACTTCTAAAACCCAGACTGTTTTCAAAGATGTGGGAGGACAGGATTCAAATGTAGTAGGGAATGCTCAACCACAACAGCCATCTGTTGCTTTTGCTTCAATACCTTCACCAAACTTAACTTCTAAGATTTTTGCAAATAGTAGAAATGAAACTTCAAATGCAGTGGTTACTCAGGACGATGATATGGACGAGGAGGCCCCAGAGACGAATAACAATGTTGAGTTTAATTTGAGCAGCTTGGGAGGATTTGGAAATAGCTCTACCCCTATATCAGGCGGTCCTAAACCAAATCCATTTGGTGGTCCATTTGGTAATGTGAATGCAGCCTCAATGACCTCTTCCTTTAATATGGCATCTCCTCCAAGTGGAGAGCTGTTCCGGCCTGCATCGTTTAGCTTCCAATCTCCATTGGCTTCACAAGCAGCATCACAACCCACAAATTCTGTTGCATTCTCTGGTGCCTTTGGCTCTGCAGTGCCTACTCAACCGCCTTCGCAAGGTGGGTTCGGTCAGCCTTCTCAGATTGGAGTAGGACAGCAAGCACTTGGTAATGTTCTTGGTTCATTTGGACAATCAAGACAGCTTGGTCCTACTGTCCATGGAACTGGTTCAGGATCACCTGGGGGTTTTAGTGGTGGCTTTACGAATGCAAAACCTGTTGGAGTTGGAGGTTTTGCAGGTGTCGGTTCAGGAGGTGGCGGCGGTTTTGGAGGTGTTGGTGGTTTTGCTGGTGCAGCCTCAACCGGTGGAGGATTTGCCGGTGCTTCCTCTACGGCAGGAGGTTTTGCAGGTGCTGCAGGTGGAGGCTTTGGAGGTACTGCAGGTGGATTTGGGGCGTTTGGCAGCCAGCAAGTAAGCGGCGGTTTCTCTGCTTTTGGTGCTGCTGCTGCTGCTGCTGCTGCTGGTGGAGCCGGAGTAACTGGAAAACCTCCTGAGCTTTTCACCCAAATTAGAAAGTAG
Protein sequence
NLFHYSFHFFPPQENAKGKEKEKYISFAYRLPPKDTLECDAPKTPGGRTSVGSKPLRLSFSIDSERSILQFMASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSESSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNSVLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLYQGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETDTDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCDIHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNIDRNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGELIMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKESREANIDHRMQVTEKIAISSEIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPERPPHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSEKAMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGRQATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLDTLLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQVLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNGLELNKFGGNEESQVSERALQRKFGSSRHSHSVHSLNNIMGSQLATAQLLSESLSKQLAALNMESPSLKRQSATKELFESIGLTYDASFSSPNVNKIAETSSKKLLLSSDSFSSKGTSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGIPSSEEKQFCSRTPEGAATVARPASRITSSISSSSKNAGHDSENPETPFMWNSPLQPSNTSRQKSLPLQKINVTPPSPPPVFQSSHDMLKKKNNEAHSVTSENKFTDVACPEKSKASDFFSATRSDSVQKSNINVDQKSSIFTISSKQMPTPIDSIATSNVDNQKTANVKERHTTTSPFFGSANKPESPFVGSMPSLVPTVDGSRKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKALPSSAAVIDLNQPPSTSTQLNFSSPVVSSSNSLFQAPKIVPTSPTLSSLNPTLESSKTELSVPKSNDDAEEQILSSKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQDSNVVGNAQPQQPSVAFASIPSPNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPISGGPKPNPFGGPFGNVNAASMTSSFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGAFGSAVPTQPPSQGGFGQPSQIGVGQQALGNVLGSFGQSRQLGPTVHGTGSGSPGGFSGGFTNAKPVGVGGFAGVGSGGGGGFGGVGGFAGAASTGGGFAGASSTAGGFAGAAGGGFGGTAGGFGAFGSQQVSGGFSAFGAAAAAAAAGGAGVTGKPPELFTQIRK
Homology
BLAST of Cucsat.G9615.T1 vs. ExPASy Swiss-Prot
Match:
F4I1T7 (Nuclear pore complex protein NUP214 OS=Arabidopsis thaliana OX=3702 GN=NUP214 PE=1 SV=1)
HSP 1 Score: 841.3 bits (2172), Expect = 2.0e-242
Identity = 715/1860 (38.44%), Postives = 971/1860 (52.20%), Query Frame = 0
Query: 81 SLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSESSGLIFVAH 140
S + +E+ EG++I ND YF++IG+P+ +K D+ +D E+PPSQPLA+SE ++FVAH
Sbjct: 2 SRVEIEEDTEGDRISTNDYYFERIGEPISIKEDDAQYDLENPPSQPLAISERHAVLFVAH 61
Query: 141 LSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNSVLAAVVAGD 200
SGFFV R DVI++++ G +QDLS+VDV +G V IL++S D+S+LA VA D
Sbjct: 62 SSGFFVGRTNDVISASKNSNGNGDKVFIQDLSLVDVPVGDVRILSLSADDSILAVTVAAD 121
Query: 201 VHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLYQGSANGPLT 260
+H FSV SLL K KP S S +S F+KDF+W R +++YLVLS G+L+ G N P
Sbjct: 122 IHFFSVDSLLKKDAKPSFSYSPDESGFVKDFRWRRNDKHSYLVLSNTGKLFHGIDNAPPR 181
Query: 261 HVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETDTDFTVKVDC 320
HVM +DAVE S KG +IAVA+ ++L IFS KF E+ ++L G++D D VKVD
Sbjct: 182 HVMDAVDAVEWSSKGSYIAVAQDNSLRIFSSKFNEKRCIALSFDSWIGDSDEDCFVKVDS 241
Query: 321 IKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCDIHSGFTRDI 380
I+WVR +CI++GCFQ+ G EE+YLVQVIRS DGKI+D S+N V LSF D+ D+
Sbjct: 242 IRWVRNNCILLGCFQL-IEGREENYLVQVIRSPDGKISDGSTNLVALSFSDLFPCSMDDL 301
Query: 381 LPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLL-EVENEVAVVNIDRNTSLPKI 440
+P GP LL SY+D CKLA+ ANR +++HI LL + ++ V+VV+IDR T LP+I
Sbjct: 302 VPVGVGPHLLFSYIDQCKLAVTANRKSIDEHIVLLDWSSGDDKSAVSVVDIDRETFLPRI 361
Query: 441 ELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGELIMFQFSSV 500
LQ N DDN VMGLCIDRVS+ G V V+ G ++++E+ PY +LVCLTLEG+L+MF +SV
Sbjct: 362 GLQENNDDNTVMGLCIDRVSIEGTVNVRSGDDELKELQPYFVLVCLTLEGKLVMFNVASV 421
Query: 501 NETEAPHETVSACDDEEDDITVPTDDRSESKESRE----ANI----DHRMQVTEKIAISS 560
A +T A + +D P + SK+S E NI D + TEK +
Sbjct: 422 AGRPASSDTDLASSSDIEDAYTPLIEDDLSKQSSEKHQQLNIAVQNDQKHLNTEKFSTEQ 481
Query: 561 EIPREK--GKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQK-VDSFIYSQSLKSSAP 620
+P E K +KSS VS + N K + + + + + S
Sbjct: 482 RLPNENIFSKEFESVKSS---------------VSGDNNKKQEPYAEKPLQVEDAQQSMI 541
Query: 621 ERPPHYEIGNFDKPV----LKFTGLGSA---------SISGKSEDVPSQPFPNVKESTKR 680
R G + KF G G A I +S + Q K +
Sbjct: 542 PRLSGTSFGQLPMSLGYDTNKFAGFGPALPVSEKLQKDIFAQSNSMHLQANVESKSTAAF 601
Query: 681 LGSTGLMAA-----SELSSEKAMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTG 740
GS GL A SS+ S K + P F S S + S + TG
Sbjct: 602 FGSPGLQNAILQSPQNTSSQPWSSGKSVSPPD--FVSGPFPSMRDTQHKQS---VQSGTG 661
Query: 741 FAGKPFQPKDVPSTLTQSGR---------------QATGGAGKIESLPVIRSSQISLQDK 800
+ P KD + ++GR G KIE +P IR+SQ+S Q K
Sbjct: 662 YVNPPMSIKDKSVQVIETGRVSALSNLSPLLGQNQDTNEGVEKIEPIPSIRASQLSQQVK 721
Query: 801 FSSGK-ISNEKHDGS--------ERYYSNSPLAKPMKEMCEGLDTLLESIEESGGFMDAC 860
S K S+++H E SN P + EM +DTLL+SIE GGF D+C
Sbjct: 722 SSFEKSASHQQHKTPLSTGPLRLEHNMSNQP--SNINEMAREMDTLLQSIEGPGGFKDSC 781
Query: 861 TAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQVLSKKTYIEGIVMQS 920
KS+VE LE GL SL+ CQ W+ST++E+ E+Q+L DK +QVL+KKTY+EG+ Q+
Sbjct: 782 AFILKSNVEELEQGLESLAGKCQTWKSTIHEQQAEIQHLLDKTIQVLAKKTYMEGMYKQT 841
Query: 921 SDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNGLELNKFGGNEESQVS 980
+D++YW+ W+RQKL+ ELE KRQHI+K+N+++T+QLIELER+FN LEL+++ + V+
Sbjct: 842 ADNQYWQLWNRQKLNPELEAKRQHIMKLNKDLTHQLIELERYFNRLELDRYNEDGGHPVA 901
Query: 981 ERALQRKFGSSRHSHSVHSLNNIMGSQLATAQLLSESLSKQLAALNMESPSLKRQSATKE 1040
R + + SR S+HSL+N M SQLA A+ LSE LSKQ+ L ++SP +++ +E
Sbjct: 902 RRGVPNRSAPSRRVQSLHSLHNTMSSQLAAAEQLSECLSKQMTYLKIDSP--VKKNVKQE 961
Query: 1041 LFESIGLTYDASFSSPNVNKIAETSS-KKLLLSSDSFSSKGTSRRKQQSGTKNSEAETGR 1100
LFE+IG+ YDASFSSP+ K SS K LLLSS S SR++Q S KNS+ ET R
Sbjct: 962 LFETIGIPYDASFSSPDAVKAKNASSAKNLLLSSIPASINQQSRQRQSSAMKNSDPETAR 1021
Query: 1101 RRRDSLDR---NLASVDPPKTTVKRMLLQ-----GIPS----SEEKQFCSRTPEGAAT-V 1160
RRR+SLDR N A+ +PPKTTVKRMLLQ G+ SE + + T + + V
Sbjct: 1022 RRRESLDRVIFNWAAFEPPKTTVKRMLLQEQQKTGMNQQTVLSERLRSANNTQDRSLLHV 1081
Query: 1161 ARPASRITSSISSSSKNAGHD-SENPETPFMWNSPLQPSNTS------------------ 1220
AS + SS ++ D SE TPF P+ SN+
Sbjct: 1082 KDHASPVVSSNKGIMESFQQDTSEAQSTPFKTRPPMPQSNSPFTISPISASKPSFNWSGN 1141
Query: 1221 --------RQKSLPLQKINVTPPSPP---------PVFQSSHDMLKKKNNE-AHSVTSEN 1280
++S P Q + S P PV + + +KK E S N
Sbjct: 1142 KSSNTTSYAEESAPSQIKDTRTVSQPGGSSFLPKRPVASTVLEQTEKKAGEFKFSEAKAN 1201
Query: 1281 KFTDVAC------PEKSKASDFFS--------------ATRSDSVQKSNINVDQKSSI-- 1340
F + A S SDF S A S KS + SSI
Sbjct: 1202 AFVETAAGSVQRLSTTSSGSDFESSKGFGAQFSTMSSGAPASSFSSKSLFGFNSSSSIPG 1261
Query: 1341 --FTISSKQMP---TPIDSIAT-SNVDNQKTANVKERHTTTSPFFGSANKPESPFVGSMP 1400
FT + P TP+DS +T + ++ + S SA P++ V S
Sbjct: 1262 DKFTFPAVTAPLSGTPLDSTSTLFTASSAPVSSSSQDPVPASIPISSAPVPQTFSVTSTS 1321
Query: 1401 SLVPT----VDGSRKTEEKKSVTTISQSVSAPAPLNTS-----------SSASTLFSGFA 1460
++ T G T K + + S +P+P T+ SS + S
Sbjct: 1322 TVSATGFNVPFGKPLTSVKVDLNQAAPSTPSPSPGPTAGFTFNLPALSPSSPEMVSSSTG 1381
Query: 1461 VSKALPSSAAVIDLNQPPSTSTQLNFSSPVVSSSNSLFQAPKIVP----TSPTLSSLNPT 1520
S P SA ++ +++T S + SS SL P I P SP +S+ +
Sbjct: 1382 QSSLFPPSAPTSQVSSDQASATSSLTDSSRLFSSTSLSSTPPITPPDAFQSPQVSTPSSA 1441
Query: 1521 LESSKTELSVPKSNDDAEEQILSSKPG----SHELKFQPSITPADKNHVEP--------- 1580
+ ++ +S PK + ILS++ ++ K Q P P
Sbjct: 1442 VPITE-PVSEPKKPEAQSSSILSTQSTVDSVANATKTQNEPLPVKSEISNPGTTVTPVSS 1501
Query: 1581 -------TSKTQTVFKDVGGQDSNVVGNAQPQQPSVAFASIPSPNLTSKIFANSRNETSN 1640
+S TQ+ + + G++QPQQ S A P+ + TS A+ E +
Sbjct: 1502 SGFLSGFSSGTQSSLASMAAPSFSWPGSSQPQQLSSTPAPFPASSPTS---ASPFGEKKD 1561
Query: 1641 AVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPISGGPKPNPFGGPFGNVNAASMTS 1700
V TQ+D+MDEEAPE + E ++ S GGFG STP G PK NPFGGPFGN + ++
Sbjct: 1562 IVDTQEDEMDEEAPEASQTTELSMGSFGGFGLGSTPNPGAPKTNPFGGPFGNATTTT-SN 1621
Query: 1701 SFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGAFGSAVPTQPPSQGGFGQPS 1735
FNM + PSGELF+PASF+FQ+P SQ A G+F S P+Q P+Q GFGQPS
Sbjct: 1622 PFNM-TVPSGELFKPASFNFQNPQPSQPAG--------FGSF-SVTPSQTPAQSGFGQPS 1681
BLAST of Cucsat.G9615.T1 vs. NCBI nr
Match:
XP_031741374.1 (nuclear pore complex protein NUP214 isoform X1 [Cucumis sativus] >KGN52214.2 hypothetical protein Csa_008316 [Cucumis sativus])
HSP 1 Score: 3110 bits (8064), Expect = 0.0
Identity = 1663/1663 (100.00%), Postives = 1663/1663 (100.00%), Query Frame = 0
Query: 72 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 131
MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE
Sbjct: 1 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 60
Query: 132 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 191
SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS
Sbjct: 61 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 120
Query: 192 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 251
VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY
Sbjct: 121 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180
Query: 252 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD 311
QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD
Sbjct: 181 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD 240
Query: 312 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 371
TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD
Sbjct: 241 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 300
Query: 372 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 431
IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID
Sbjct: 301 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360
Query: 432 RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 491
RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420
Query: 492 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKESREANIDHRMQVTEKIAISS 551
IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKESREANIDHRMQVTEKIAISS
Sbjct: 421 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKESREANIDHRMQVTEKIAISS 480
Query: 552 EIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPERP 611
EIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPERP
Sbjct: 481 EIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPERP 540
Query: 612 PHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSEK 671
PHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSEK
Sbjct: 541 PHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSEK 600
Query: 672 AMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGRQ 731
AMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGRQ
Sbjct: 601 AMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGRQ 660
Query: 732 ATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLDT 791
ATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLDT
Sbjct: 661 ATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLDT 720
Query: 792 LLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQ 851
LLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQ
Sbjct: 721 LLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQ 780
Query: 852 VLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNG 911
VLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNG
Sbjct: 781 VLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNG 840
Query: 912 LELNKFGGNEESQVSERALQRKFGSSRHSHSVHSLNNIMGSQLATAQLLSESLSKQLAAL 971
LELNKFGGNEESQVSERALQRKFGSSRHSHSVHSLNNIMGSQLATAQLLSESLSKQLAAL
Sbjct: 841 LELNKFGGNEESQVSERALQRKFGSSRHSHSVHSLNNIMGSQLATAQLLSESLSKQLAAL 900
Query: 972 NMESPSLKRQSATKELFESIGLTYDASFSSPNVNKIAETSSKKLLLSSDSFSSKGTSRRK 1031
NMESPSLKRQSATKELFESIGLTYDASFSSPNVNKIAETSSKKLLLSSDSFSSKGTSRRK
Sbjct: 901 NMESPSLKRQSATKELFESIGLTYDASFSSPNVNKIAETSSKKLLLSSDSFSSKGTSRRK 960
Query: 1032 QQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGIPSSEEKQFCSRTPEGAAT 1091
QQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGIPSSEEKQFCSRTPEGAAT
Sbjct: 961 QQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGIPSSEEKQFCSRTPEGAAT 1020
Query: 1092 VARPASRITSSISSSSKNAGHDSENPETPFMWNSPLQPSNTSRQKSLPLQKINVTPPSPP 1151
VARPASRITSSISSSSKNAGHDSENPETPFMWNSPLQPSNTSRQKSLPLQKINVTPPSPP
Sbjct: 1021 VARPASRITSSISSSSKNAGHDSENPETPFMWNSPLQPSNTSRQKSLPLQKINVTPPSPP 1080
Query: 1152 PVFQSSHDMLKKKNNEAHSVTSENKFTDVACPEKSKASDFFSATRSDSVQKSNINVDQKS 1211
PVFQSSHDMLKKKNNEAHSVTSENKFTDVACPEKSKASDFFSATRSDSVQKSNINVDQKS
Sbjct: 1081 PVFQSSHDMLKKKNNEAHSVTSENKFTDVACPEKSKASDFFSATRSDSVQKSNINVDQKS 1140
Query: 1212 SIFTISSKQMPTPIDSIATSNVDNQKTANVKERHTTTSPFFGSANKPESPFVGSMPSLVP 1271
SIFTISSKQMPTPIDSIATSNVDNQKTANVKERHTTTSPFFGSANKPESPFVGSMPSLVP
Sbjct: 1141 SIFTISSKQMPTPIDSIATSNVDNQKTANVKERHTTTSPFFGSANKPESPFVGSMPSLVP 1200
Query: 1272 TVDGSRKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKALPSSAAVIDLNQPPS 1331
TVDGSRKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKALPSSAAVIDLNQPPS
Sbjct: 1201 TVDGSRKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKALPSSAAVIDLNQPPS 1260
Query: 1332 TSTQLNFSSPVVSSSNSLFQAPKIVPTSPTLSSLNPTLESSKTELSVPKSNDDAEEQILS 1391
TSTQLNFSSPVVSSSNSLFQAPKIVPTSPTLSSLNPTLESSKTELSVPKSNDDAEEQILS
Sbjct: 1261 TSTQLNFSSPVVSSSNSLFQAPKIVPTSPTLSSLNPTLESSKTELSVPKSNDDAEEQILS 1320
Query: 1392 SKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQDSNVVGNAQPQQPSVAFASIPS 1451
SKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQDSNVVGNAQPQQPSVAFASIPS
Sbjct: 1321 SKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQDSNVVGNAQPQQPSVAFASIPS 1380
Query: 1452 PNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPISGGPKP 1511
PNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPISGGPKP
Sbjct: 1381 PNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPISGGPKP 1440
Query: 1512 NPFGGPFGNVNAASMTSSFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGAFG 1571
NPFGGPFGNVNAASMTSSFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGAFG
Sbjct: 1441 NPFGGPFGNVNAASMTSSFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGAFG 1500
Query: 1572 SAVPTQPPSQGGFGQPSQIGVGQQALGNVLGSFGQSRQLGPTVHGTGSGSPGGFSGGFTN 1631
SAVPTQPPSQGGFGQPSQIGVGQQALGNVLGSFGQSRQLGPTVHGTGSGSPGGFSGGFTN
Sbjct: 1501 SAVPTQPPSQGGFGQPSQIGVGQQALGNVLGSFGQSRQLGPTVHGTGSGSPGGFSGGFTN 1560
Query: 1632 AKPVGVGGFAGVGSGGGGGFGGVGGFAGAASTGGGFAGASSTAGGFAGAAGGGFGGTAGG 1691
AKPVGVGGFAGVGSGGGGGFGGVGGFAGAASTGGGFAGASSTAGGFAGAAGGGFGGTAGG
Sbjct: 1561 AKPVGVGGFAGVGSGGGGGFGGVGGFAGAASTGGGFAGASSTAGGFAGAAGGGFGGTAGG 1620
Query: 1692 FGAFGSQQVSGGFSAFGAAAAAAAAGGAGVTGKPPELFTQIRK 1734
FGAFGSQQVSGGFSAFGAAAAAAAAGGAGVTGKPPELFTQIRK
Sbjct: 1621 FGAFGSQQVSGGFSAFGAAAAAAAAGGAGVTGKPPELFTQIRK 1663
BLAST of Cucsat.G9615.T1 vs. NCBI nr
Match:
XP_031741375.1 (nuclear pore complex protein NUP214 isoform X2 [Cucumis sativus])
HSP 1 Score: 3081 bits (7989), Expect = 0.0
Identity = 1653/1663 (99.40%), Postives = 1653/1663 (99.40%), Query Frame = 0
Query: 72 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 131
MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE
Sbjct: 1 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 60
Query: 132 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 191
SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS
Sbjct: 61 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 120
Query: 192 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 251
VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY
Sbjct: 121 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180
Query: 252 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD 311
QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGN
Sbjct: 181 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGN---- 240
Query: 312 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 371
VDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD
Sbjct: 241 ------VDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 300
Query: 372 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 431
IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID
Sbjct: 301 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360
Query: 432 RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 491
RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420
Query: 492 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKESREANIDHRMQVTEKIAISS 551
IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKESREANIDHRMQVTEKIAISS
Sbjct: 421 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKESREANIDHRMQVTEKIAISS 480
Query: 552 EIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPERP 611
EIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPERP
Sbjct: 481 EIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPERP 540
Query: 612 PHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSEK 671
PHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSEK
Sbjct: 541 PHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSEK 600
Query: 672 AMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGRQ 731
AMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGRQ
Sbjct: 601 AMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGRQ 660
Query: 732 ATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLDT 791
ATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLDT
Sbjct: 661 ATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLDT 720
Query: 792 LLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQ 851
LLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQ
Sbjct: 721 LLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQ 780
Query: 852 VLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNG 911
VLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNG
Sbjct: 781 VLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNG 840
Query: 912 LELNKFGGNEESQVSERALQRKFGSSRHSHSVHSLNNIMGSQLATAQLLSESLSKQLAAL 971
LELNKFGGNEESQVSERALQRKFGSSRHSHSVHSLNNIMGSQLATAQLLSESLSKQLAAL
Sbjct: 841 LELNKFGGNEESQVSERALQRKFGSSRHSHSVHSLNNIMGSQLATAQLLSESLSKQLAAL 900
Query: 972 NMESPSLKRQSATKELFESIGLTYDASFSSPNVNKIAETSSKKLLLSSDSFSSKGTSRRK 1031
NMESPSLKRQSATKELFESIGLTYDASFSSPNVNKIAETSSKKLLLSSDSFSSKGTSRRK
Sbjct: 901 NMESPSLKRQSATKELFESIGLTYDASFSSPNVNKIAETSSKKLLLSSDSFSSKGTSRRK 960
Query: 1032 QQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGIPSSEEKQFCSRTPEGAAT 1091
QQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGIPSSEEKQFCSRTPEGAAT
Sbjct: 961 QQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGIPSSEEKQFCSRTPEGAAT 1020
Query: 1092 VARPASRITSSISSSSKNAGHDSENPETPFMWNSPLQPSNTSRQKSLPLQKINVTPPSPP 1151
VARPASRITSSISSSSKNAGHDSENPETPFMWNSPLQPSNTSRQKSLPLQKINVTPPSPP
Sbjct: 1021 VARPASRITSSISSSSKNAGHDSENPETPFMWNSPLQPSNTSRQKSLPLQKINVTPPSPP 1080
Query: 1152 PVFQSSHDMLKKKNNEAHSVTSENKFTDVACPEKSKASDFFSATRSDSVQKSNINVDQKS 1211
PVFQSSHDMLKKKNNEAHSVTSENKFTDVACPEKSKASDFFSATRSDSVQKSNINVDQKS
Sbjct: 1081 PVFQSSHDMLKKKNNEAHSVTSENKFTDVACPEKSKASDFFSATRSDSVQKSNINVDQKS 1140
Query: 1212 SIFTISSKQMPTPIDSIATSNVDNQKTANVKERHTTTSPFFGSANKPESPFVGSMPSLVP 1271
SIFTISSKQMPTPIDSIATSNVDNQKTANVKERHTTTSPFFGSANKPESPFVGSMPSLVP
Sbjct: 1141 SIFTISSKQMPTPIDSIATSNVDNQKTANVKERHTTTSPFFGSANKPESPFVGSMPSLVP 1200
Query: 1272 TVDGSRKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKALPSSAAVIDLNQPPS 1331
TVDGSRKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKALPSSAAVIDLNQPPS
Sbjct: 1201 TVDGSRKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKALPSSAAVIDLNQPPS 1260
Query: 1332 TSTQLNFSSPVVSSSNSLFQAPKIVPTSPTLSSLNPTLESSKTELSVPKSNDDAEEQILS 1391
TSTQLNFSSPVVSSSNSLFQAPKIVPTSPTLSSLNPTLESSKTELSVPKSNDDAEEQILS
Sbjct: 1261 TSTQLNFSSPVVSSSNSLFQAPKIVPTSPTLSSLNPTLESSKTELSVPKSNDDAEEQILS 1320
Query: 1392 SKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQDSNVVGNAQPQQPSVAFASIPS 1451
SKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQDSNVVGNAQPQQPSVAFASIPS
Sbjct: 1321 SKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQDSNVVGNAQPQQPSVAFASIPS 1380
Query: 1452 PNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPISGGPKP 1511
PNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPISGGPKP
Sbjct: 1381 PNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPISGGPKP 1440
Query: 1512 NPFGGPFGNVNAASMTSSFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGAFG 1571
NPFGGPFGNVNAASMTSSFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGAFG
Sbjct: 1441 NPFGGPFGNVNAASMTSSFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGAFG 1500
Query: 1572 SAVPTQPPSQGGFGQPSQIGVGQQALGNVLGSFGQSRQLGPTVHGTGSGSPGGFSGGFTN 1631
SAVPTQPPSQGGFGQPSQIGVGQQALGNVLGSFGQSRQLGPTVHGTGSGSPGGFSGGFTN
Sbjct: 1501 SAVPTQPPSQGGFGQPSQIGVGQQALGNVLGSFGQSRQLGPTVHGTGSGSPGGFSGGFTN 1560
Query: 1632 AKPVGVGGFAGVGSGGGGGFGGVGGFAGAASTGGGFAGASSTAGGFAGAAGGGFGGTAGG 1691
AKPVGVGGFAGVGSGGGGGFGGVGGFAGAASTGGGFAGASSTAGGFAGAAGGGFGGTAGG
Sbjct: 1561 AKPVGVGGFAGVGSGGGGGFGGVGGFAGAASTGGGFAGASSTAGGFAGAAGGGFGGTAGG 1620
Query: 1692 FGAFGSQQVSGGFSAFGAAAAAAAAGGAGVTGKPPELFTQIRK 1734
FGAFGSQQVSGGFSAFGAAAAAAAAGGAGVTGKPPELFTQIRK
Sbjct: 1621 FGAFGSQQVSGGFSAFGAAAAAAAAGGAGVTGKPPELFTQIRK 1653
BLAST of Cucsat.G9615.T1 vs. NCBI nr
Match:
KAA0034115.1 (nuclear pore complex protein NUP214 [Cucumis melo var. makuwa])
HSP 1 Score: 2877 bits (7459), Expect = 0.0
Identity = 1570/1693 (92.73%), Postives = 1597/1693 (94.33%), Query Frame = 0
Query: 72 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 131
MASVDSG S LIPLEDAGEGEQIVRND YFQKIGKPVPVKLGDSIFDPESPPSQP+ALSE
Sbjct: 1 MASVDSGSSPLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLGDSIFDPESPPSQPIALSE 60
Query: 132 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 191
SSGLIFVAHLSGFFVVRIKDVIASA+EIKNGGT SSVQDLSIVDVSIGKVHILAVSTDNS
Sbjct: 61 SSGLIFVAHLSGFFVVRIKDVIASAQEIKNGGTCSSVQDLSIVDVSIGKVHILAVSTDNS 120
Query: 192 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 251
VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY
Sbjct: 121 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180
Query: 252 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD 311
QGS NGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSH+FKERLSMSLLPSLGNGETD
Sbjct: 181 QGSVNGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHRFKERLSMSLLPSLGNGETD 240
Query: 312 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 371
TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLV VI+SKDGKITDVSSNKVLLSFCD
Sbjct: 241 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVLVIKSKDGKITDVSSNKVLLSFCD 300
Query: 372 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 431
IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID
Sbjct: 301 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360
Query: 432 RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 491
RNTSLPKIELQANGDDNLVMGLC+DRVSL GKVIVKVGFEDMREVSPYCILVCLTLEGEL
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCVDRVSLPGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420
Query: 492 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESK-ESREANIDHRMQVTEKIAIS 551
IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESK ESREAN+D +MQVTEKI IS
Sbjct: 421 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKKESREANVDLKMQVTEKITIS 480
Query: 552 SEIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPER 611
SEIPREK KTSNDIKSS ND+S V NIDESAIVSPEGNTKSQKVDSFI+SQSLKSSAPER
Sbjct: 481 SEIPREKVKTSNDIKSSNNDRSPVSNIDESAIVSPEGNTKSQKVDSFIHSQSLKSSAPER 540
Query: 612 PPHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSE 671
PP+ EIGNFDKPVLKFTGLGS SISGK EDVPSQPFPNVKES KRLGSTGL+AASELSSE
Sbjct: 541 PPNNEIGNFDKPVLKFTGLGSVSISGKPEDVPSQPFPNVKESQKRLGSTGLVAASELSSE 600
Query: 672 KAMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGR 731
K M FKKIDPV SV TSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGR
Sbjct: 601 KTMFFKKIDPVSSVLTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGR 660
Query: 732 QATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLD 791
Q TGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLD
Sbjct: 661 QVTGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLD 720
Query: 792 TLLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMV 851
TLLESIEESGGFMDACTAFQKSSVEALELGLASLSD CQIWRSTMNER QEVQNLFDKMV
Sbjct: 721 TLLESIEESGGFMDACTAFQKSSVEALELGLASLSDECQIWRSTMNERVQEVQNLFDKMV 780
Query: 852 QVLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQ-------------- 911
QVLSKKTYIEGIVMQ+SDSKYWEQWDRQKLSSELELKRQHILKMNQ
Sbjct: 781 QVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQVGLFLFQKPLNFSN 840
Query: 912 -----------NITNQLIELERHFNGLELNKFGGNEESQVSERALQRKFGSSRHSHSVHS 971
NITNQLIELERHFNGLELNKFGGNEESQVSERALQRKFGSSRHSHS+HS
Sbjct: 841 FRCYLYSSFFQNITNQLIELERHFNGLELNKFGGNEESQVSERALQRKFGSSRHSHSLHS 900
Query: 972 LNNIMGSQLATAQLLSESLSKQLAALNMESPSLKRQSATKELFESIGLTYDASFSSPNVN 1031
LNNIMGSQLATAQLLSESLSKQLAALNMESP LKRQSATKELFE+IGLTYDASFSSPNVN
Sbjct: 901 LNNIMGSQLATAQLLSESLSKQLAALNMESPPLKRQSATKELFETIGLTYDASFSSPNVN 960
Query: 1032 KIAETSSKKLLLSSDSFSSKGTSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTV 1091
KIA+TSSKKLLLSSDSFSSKGTSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTV
Sbjct: 961 KIADTSSKKLLLSSDSFSSKGTSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTV 1020
Query: 1092 KRMLLQGIPSSEEKQFCSRTPEGAATVARPASRITSSISSSSKNAGHDSENPETPFMWNS 1151
KRMLLQG PSSEEKQF SRTPEGAATV RPASRITSSISSSSKNAGHDSENP TPFMW S
Sbjct: 1021 KRMLLQGTPSSEEKQFRSRTPEGAATVERPASRITSSISSSSKNAGHDSENPATPFMWAS 1080
Query: 1152 PLQPSNTSRQKSLPLQKINVTPPSPPPVFQSSHDMLKKKNNEAHSVTSENKFTDVACPEK 1211
LQPSNTSRQKSLPLQK N T PSPPPVFQSSHDMLKK NN AHS TSENKFTD+ACPEK
Sbjct: 1081 VLQPSNTSRQKSLPLQKTNATAPSPPPVFQSSHDMLKKNNNAAHSATSENKFTDMACPEK 1140
Query: 1212 SKASDFFSATRSDSVQKSNINVDQKSSIFTISSKQMPTPIDSIATSNVDNQKTANVKERH 1271
SKASDFFSATRSDSVQKS INVDQKSSIFTISSKQ P P DSI TSNVDNQKTANVKERH
Sbjct: 1141 SKASDFFSATRSDSVQKSKINVDQKSSIFTISSKQTPPPEDSIGTSNVDNQKTANVKERH 1200
Query: 1272 TTTSPFFGSANKPESPFVGSMPSLVPTVDGSRKTEEKKSVTTISQSVSAPAPLNTSSSAS 1331
TTTS FGSANKPESPFVG+MPSLVPTVDG+RKTEEKKSVTTISQSVSAPAPLNTSSSAS
Sbjct: 1201 TTTSQLFGSANKPESPFVGTMPSLVPTVDGARKTEEKKSVTTISQSVSAPAPLNTSSSAS 1260
Query: 1332 TLFSGFAVSKALPSSAAV---IDLNQPPSTSTQLNFSSPVVSSSNSLFQAPKIVPTSPTL 1391
TLFSGFAVSK+LPSSAAV +DLNQP STSTQLNFS PVVS SNSLFQAPK VPTSPTL
Sbjct: 1261 TLFSGFAVSKSLPSSAAVAAVVDLNQPQSTSTQLNFS-PVVSGSNSLFQAPK-VPTSPTL 1320
Query: 1392 SSLNPTLESSKTELSVPKSNDDAEEQILSSKPGSHELKFQPSITPADKNHVEPTSKTQTV 1451
SSLNPT+ESSKTELSV KSNDDAE+Q LSSKPGSHELKFQPSITPADKNHVEPTSKTQTV
Sbjct: 1321 SSLNPTMESSKTELSVLKSNDDAEKQTLSSKPGSHELKFQPSITPADKNHVEPTSKTQTV 1380
Query: 1452 FKDVGGQDSNVVGNAQPQQPSVAFASIPSPNLTSKIFANSRNETSNAVVTQDDDMDEEAP 1511
FKDVGGQ NVVG+AQ QQPSVAFASIPS NLTSKIFANSRNETSNAVVTQDDDMDEEAP
Sbjct: 1381 FKDVGGQVPNVVGDAQAQQPSVAFASIPSQNLTSKIFANSRNETSNAVVTQDDDMDEEAP 1440
Query: 1512 ETNNNVEFNLSSLGGFGNSSTPISGGPKPNPFGGPFGNVNAASMTSSFNMASPPSGELFR 1571
ETNNNVEFNLSSLGGFGNSSTPISG PKPNPFGGPFGNVNAAS+T+SFNMASPPSGELFR
Sbjct: 1441 ETNNNVEFNLSSLGGFGNSSTPISGAPKPNPFGGPFGNVNAASVTTSFNMASPPSGELFR 1500
Query: 1572 PASFSFQSPLASQAASQPTNSVAFSGAFGSAVPTQPPSQGGFGQPSQIGVGQQALGNVLG 1631
PASFSFQSPLASQAASQPTNSVAFSGAFGSAV TQ P QGGFGQP+QIGVGQQALGNVLG
Sbjct: 1501 PASFSFQSPLASQAASQPTNSVAFSGAFGSAVATQAPPQGGFGQPAQIGVGQQALGNVLG 1560
Query: 1632 SFGQSRQLGPTVHGTGSGSPGGFSGGFTNAKPVGVGGFAGVGSGGGGGFGGVGGFAGAAS 1691
SFGQSRQLGPT+ GTGSGSPGGFSGGFTNAKPVGVGGFAGVGSGGGGGFGGVGGFAGAAS
Sbjct: 1561 SFGQSRQLGPTLPGTGSGSPGGFSGGFTNAKPVGVGGFAGVGSGGGGGFGGVGGFAGAAS 1620
Query: 1692 TGGGFAGASSTAGGFAGAAGGGFGGTAGGFGAFGSQQVSGG-FSAFGAAAAAAAAGGAGV 1734
TGGGFAGASST GGFAGAAGGGFGGTAGGFGAFGSQQVSGG FSAFG AAA GGA V
Sbjct: 1621 TGGGFAGASSTTGGFAGAAGGGFGGTAGGFGAFGSQQVSGGGFSAFGTAAA----GGASV 1680
BLAST of Cucsat.G9615.T1 vs. NCBI nr
Match:
XP_008445928.2 (PREDICTED: LOW QUALITY PROTEIN: nuclear pore complex protein NUP214 [Cucumis melo])
HSP 1 Score: 2837 bits (7354), Expect = 0.0
Identity = 1553/1668 (93.11%), Postives = 1583/1668 (94.90%), Query Frame = 0
Query: 72 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 131
MASVDSG S LIPLEDAGEGEQIVRND YFQKIGKPVPVKLGDSIFDPESPPSQP+ALSE
Sbjct: 1 MASVDSGSSPLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLGDSIFDPESPPSQPIALSE 60
Query: 132 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 191
SSGLIFVAHLSGFFVVRIKDVIASA+EIKNGGT SSVQDLSIVDVSIGKVHILAVSTDNS
Sbjct: 61 SSGLIFVAHLSGFFVVRIKDVIASAQEIKNGGTCSSVQDLSIVDVSIGKVHILAVSTDNS 120
Query: 192 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 251
VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY
Sbjct: 121 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180
Query: 252 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD 311
QGS NGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSH+FKERLSMSLLPSLGNGETD
Sbjct: 181 QGSVNGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHRFKERLSMSLLPSLGNGETD 240
Query: 312 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 371
TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLV VI+SKDGKITDVSSNKVLLSFCD
Sbjct: 241 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVLVIKSKDGKITDVSSNKVLLSFCD 300
Query: 372 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 431
IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID
Sbjct: 301 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360
Query: 432 RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 491
RNTSLPKIELQANGDDNLVMGLC+DRVSL GKVIVKVGFEDMREVSPYCILVCLTLEGEL
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCVDRVSLPGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420
Query: 492 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESK-ESREANIDHRMQVTEKIAIS 551
IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESK ESREAN+D +MQVTEKI IS
Sbjct: 421 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKKESREANVDLKMQVTEKITIS 480
Query: 552 SEIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPER 611
SEIPREK KTSNDIKSS ND+S V NIDESAIVSPEGNTKSQKVDSFI+SQSLKSSAPER
Sbjct: 481 SEIPREKVKTSNDIKSSNNDRSPVSNIDESAIVSPEGNTKSQKVDSFIHSQSLKSSAPER 540
Query: 612 PPHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSE 671
PP+ EIGNFDKPVLKFTGLGS SISGK EDVPSQPFPNVKES KRLGSTGL+AASELSSE
Sbjct: 541 PPNNEIGNFDKPVLKFTGLGSVSISGKPEDVPSQPFPNVKESQKRLGSTGLVAASELSSE 600
Query: 672 KAMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGR 731
K M FKK+ V SV TSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGR
Sbjct: 601 KTMFFKKLI-VSSVLTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGR 660
Query: 732 QATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLD 791
Q TGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLD
Sbjct: 661 QVTGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLD 720
Query: 792 TLLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMV 851
TLLESIEESGGFMDACTAFQKSSVEALELGLASLSD CQIWRSTMNER QEVQNLFDKMV
Sbjct: 721 TLLESIEESGGFMDACTAFQKSSVEALELGLASLSDECQIWRSTMNERVQEVQNLFDKMV 780
Query: 852 QVLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFN 911
QVLSKKTYIEGIVMQ+SDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFN
Sbjct: 781 QVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFN 840
Query: 912 GLELNKFGGNEESQVSERALQRKFGSSRHSHSVHSLNNIMGSQLATAQLLSESLSKQLAA 971
GLELNKFGGNEESQVSERALQRKFGSSRHSHS+HSLNNIMGSQLATAQLLSESLSKQLAA
Sbjct: 841 GLELNKFGGNEESQVSERALQRKFGSSRHSHSLHSLNNIMGSQLATAQLLSESLSKQLAA 900
Query: 972 LNMESPSLKRQSATKELFESIGLTYDASFSSPNVNKIAETSSKKLLLSSDSFSSKGTSRR 1031
LNMESP LKRQSATKELFE+IGLTYDASFSSPNVNKIA+TSSKKLLLSSDSFSSKGTSRR
Sbjct: 901 LNMESPPLKRQSATKELFETIGLTYDASFSSPNVNKIADTSSKKLLLSSDSFSSKGTSRR 960
Query: 1032 KQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGIPSSEEKQFCSRTPEGAA 1091
KQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQG PSSEEKQF SRTPEGAA
Sbjct: 961 KQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGTPSSEEKQFRSRTPEGAA 1020
Query: 1092 TVARPASRITSSISSSSKNAGHDSENPETPFMWNSPLQPSNTSRQKSLPLQKINVTPPSP 1151
TV RPASRITSSISSSSKNAGHDSENP TPFMW S LQPSNTSRQKSLPLQK N T PSP
Sbjct: 1021 TVERPASRITSSISSSSKNAGHDSENPATPFMWASVLQPSNTSRQKSLPLQKTNATAPSP 1080
Query: 1152 PPVFQSSHDMLKKKNNEAHSVTSENKFTDVACPEKSKASDFFSATRSDSVQKSNINVDQK 1211
PPVFQSSHDMLKK + +V + PEKSKASDFFSATRSDSVQKS INVDQK
Sbjct: 1081 PPVFQSSHDMLKKIIMQL-TVRLQKTNLRTWHPEKSKASDFFSATRSDSVQKSKINVDQK 1140
Query: 1212 SSIFTISSKQMPTPIDSIATSNVDNQKTANVKERHTTTSPFFGSANKPESPFVGSMPSLV 1271
SSIFTISSKQ P P DSI TSNVDNQKTANVKERHTTTS FGSANKPESPFVG+MPSLV
Sbjct: 1141 SSIFTISSKQTPPPEDSIGTSNVDNQKTANVKERHTTTSQLFGSANKPESPFVGTMPSLV 1200
Query: 1272 PTVDGSRKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKALPSSAAV---IDLN 1331
PTVDG+RKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSK+LPSSAAV +DLN
Sbjct: 1201 PTVDGARKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKSLPSSAAVAAVVDLN 1260
Query: 1332 QPPSTSTQLNFSSPVVSSSNSLFQAPKIVPTSPTLSSLNPTLESSKTELSVPKSNDDAEE 1391
QP STSTQLNFS PVVS SNSLFQAPK VPTSPTLSSLNPT+ESSKTELSV KSNDDAE+
Sbjct: 1261 QPQSTSTQLNFS-PVVSGSNSLFQAPK-VPTSPTLSSLNPTMESSKTELSVLKSNDDAEK 1320
Query: 1392 QILSSKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQDSNVVGNAQPQQPSVAFA 1451
Q LSSKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQ NVVG+AQ QQPSVAFA
Sbjct: 1321 QTLSSKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQVPNVVGDAQAQQPSVAFA 1380
Query: 1452 SIPSPNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPISG 1511
SIPS NLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPISG
Sbjct: 1381 SIPSQNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPISG 1440
Query: 1512 GPKPNPFGGPFGNVNAASMTSSFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFS 1571
PKPNPFGGPFGNVNAAS+T+SFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFS
Sbjct: 1441 APKPNPFGGPFGNVNAASVTTSFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFS 1500
Query: 1572 GAFGSAVPTQPPSQGGFGQPSQIGVGQQALGNVLGSFGQSRQLGPTVHGTGSGSPGGFSG 1631
GAFGSAV TQ P QGGFGQP+QIGVGQQALGNVLGSFGQSRQLGPT+ GTGSGSPGGFSG
Sbjct: 1501 GAFGSAVATQAPPQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPTLPGTGSGSPGGFSG 1560
Query: 1632 GFTNAKPVGVGGFAGVGSGGGGGFGGVGGFAGAASTGGGFAGASSTAGGFAGAAGGGFGG 1691
GFTNAKPVGVGGFAGVGSGGGGGFGGVGGFAGAASTGGGFAGASST GGFAGAAGGGFGG
Sbjct: 1561 GFTNAKPVGVGGFAGVGSGGGGGFGGVGGFAGAASTGGGFAGASSTTGGFAGAAGGGFGG 1620
Query: 1692 TAGGFGAFGSQQVSGG-FSAFGAAAAAAAAGGAGVTGKPPELFTQIRK 1734
TAGGFGAFGSQQVSGG FSAFG AAA GGA VTGKPPELFTQIRK
Sbjct: 1621 TAGGFGAFGSQQVSGGGFSAFGTAAA----GGASVTGKPPELFTQIRK 1660
BLAST of Cucsat.G9615.T1 vs. NCBI nr
Match:
XP_038892124.1 (nuclear pore complex protein NUP214 isoform X2 [Benincasa hispida])
HSP 1 Score: 2536 bits (6573), Expect = 0.0
Identity = 1417/1695 (83.60%), Postives = 1490/1695 (87.91%), Query Frame = 0
Query: 72 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 131
MASVDS PS+LIPLEDAGEGEQ+VRND YFQKIG+PVPVKLGDSIFDPE+PPSQPLALSE
Sbjct: 1 MASVDSRPSTLIPLEDAGEGEQVVRNDFYFQKIGRPVPVKLGDSIFDPETPPSQPLALSE 60
Query: 132 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 191
SSGLIFVAHLSGFFVVR KDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILA+STD+S
Sbjct: 61 SSGLIFVAHLSGFFVVRTKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILALSTDDS 120
Query: 192 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 251
+LAAVVA D+H+FSVQSLLDKAE P SSCSITDSS IKDFKWTRKLE++YLVLSKHGQLY
Sbjct: 121 ILAAVVARDIHLFSVQSLLDKAETPSSSCSITDSSCIKDFKWTRKLEDSYLVLSKHGQLY 180
Query: 252 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD 311
QGSANG LTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPS GNGETD
Sbjct: 181 QGSANGSLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSSGNGETD 240
Query: 312 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 371
TDFTVKVDCIKWVRADCII+GCFQ+TATGDEEDY V VIRSKDGKITDVSSNKVLLSF D
Sbjct: 241 TDFTVKVDCIKWVRADCIIMGCFQMTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 300
Query: 372 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 431
IHSGFTRDILPG+SGPCLLLSYLD CKLAIVANRLY+EDHI LLGLL EVENEVAV+NID
Sbjct: 301 IHSGFTRDILPGDSGPCLLLSYLDKCKLAIVANRLYMEDHIVLLGLLQEVENEVAVINID 360
Query: 432 RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 491
RNTSLPKIELQANGDDNLVMGLCIDRVSL GKVIV+VGFEDMREVSPYCILVCLTLEG+L
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCIDRVSLPGKVIVRVGFEDMREVSPYCILVCLTLEGDL 420
Query: 492 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSE-----SKESREANIDHRMQVTEK 551
IMFQFSSVNETEAPHETV ACD+EEDDI VP DDRS+ KE REAN D +MQV EK
Sbjct: 421 IMFQFSSVNETEAPHETVPACDEEEDDIIVPADDRSQLSSESKKEFREANNDLKMQVMEK 480
Query: 552 IAISSEIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSS 611
IAISSEIP EK K SNDIKSS NDQSLV I ESA V E NTKS+K DSFIYSQSLKSS
Sbjct: 481 IAISSEIPGEKIKISNDIKSSNNDQSLVSKIGESATVGAESNTKSRKADSFIYSQSLKSS 540
Query: 612 APERPPHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASE 671
ER +YEIGNFDKPV KF GLG SISGKS DV SQPFPNVKESTK+LGSTGL+AASE
Sbjct: 541 VLERS-NYEIGNFDKPVQKF-GLGPVSISGKSVDVHSQPFPNVKESTKKLGSTGLLAASE 600
Query: 672 LSSEKAMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLT 731
LSS+KA+ KIDPV SV NS QSS TENY PSFGTAN F GFAGKPFQPKDVPSTLT
Sbjct: 601 LSSDKAIFLNKIDPVSSVLIPNSFQSSKTENYVPSFGTANCFAGFAGKPFQPKDVPSTLT 660
Query: 732 QSGRQATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMC 791
QSGRQ GGAGKIESLPVIRSSQISLQD GKISNEKHDGS+R YSN+PLAKPMKEMC
Sbjct: 661 QSGRQVMGGAGKIESLPVIRSSQISLQDNLP-GKISNEKHDGSDRSYSNAPLAKPMKEMC 720
Query: 792 EGLDTLLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLF 851
E LD LLESIEE GGF+DACTAFQKSSVEALELGLASL D CQIW+STMNER+QEVQNLF
Sbjct: 721 EALDMLLESIEEPGGFLDACTAFQKSSVEALELGLASLLDECQIWKSTMNERAQEVQNLF 780
Query: 852 DKMVQVLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELE 911
DKM+QVLSKKTYIEGIVMQ+SDSKYWEQWDRQKLSSELELKRQHILKMNQN+TNQLIELE
Sbjct: 781 DKMIQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELE 840
Query: 912 RHFNGLELNKFGGNEESQVSERALQRKFGSSRHSHSVHSLNNIMGSQLATAQLLSESLSK 971
RHFNGLELNKFGGNEESQV+ERALQRKFGSSRHSHS+HSLNNIMGSQLA AQLLSESLSK
Sbjct: 841 RHFNGLELNKFGGNEESQVNERALQRKFGSSRHSHSLHSLNNIMGSQLAAAQLLSESLSK 900
Query: 972 QLAALNMESPSLKRQSATKELFESIGLTYDASFSSPNVNKIAETSSKKLLLSSDSFSSKG 1031
QLAALN+ESPSLKRQS TKELFE+IGLTYDASFSSPNVNKIAETSSKKLLLS+DSFS K
Sbjct: 901 QLAALNIESPSLKRQSVTKELFETIGLTYDASFSSPNVNKIAETSSKKLLLSADSFSHKD 960
Query: 1032 TSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGIPSSEEKQFCSRTP 1091
TSRRKQ SGTKNSEAETGRRRRDSLDRNLASV+PPKTTVKRMLL GIPSS+EK F S TP
Sbjct: 961 TSRRKQWSGTKNSEAETGRRRRDSLDRNLASVEPPKTTVKRMLLHGIPSSDEKLFRSHTP 1020
Query: 1092 EGAATVARPASRITSSISSSS-KNAGHDSENPETPFMWNSPLQPSNTSRQKSLPLQKINV 1151
+GAATVA PASR+TSS+SSSS KNAGHDSENP TPFMW SPLQPSN SRQKS PLQK N
Sbjct: 1021 DGAATVAWPASRLTSSMSSSSSKNAGHDSENPATPFMWASPLQPSNISRQKSQPLQKTNA 1080
Query: 1152 TPPSPPPVFQSSHDMLKKKNNEAHSVTSENKFTDVACPEKSKASDFFSATRSDSVQKSNI 1211
T PS VFQSSH+MLKK NNEA SVTSENKFT EKSKASDFFS TR+DSVQKSN
Sbjct: 1081 TAPSLS-VFQSSHEMLKKSNNEAFSVTSENKFT-----EKSKASDFFSVTRTDSVQKSNT 1140
Query: 1212 NVDQKSSIFTISSKQMPTPIDSIATSNVDNQKTANVKERHTTTSPFFGSANKPESPFVGS 1271
N+D+K SIFTISSKQM TP D I TSN+DNQKTAN KERHTTTSP FGSANKPES VG+
Sbjct: 1141 NLDKKPSIFTISSKQMATPKDFIDTSNLDNQKTANSKERHTTTSPLFGSANKPESASVGT 1200
Query: 1272 MPSLVPTVDGSRKTEEKKSVTTISQSVSAPAP--LNTSSSASTLFSGFAVSKALPSSAAV 1331
M SLVPTVD +RK EK+S+ TIS SV AP P N+ SS+STLFSGFAVSK LPSSAA
Sbjct: 1201 MSSLVPTVDEARK--EKRSLKTISPSVPAPTPARFNSPSSSSTLFSGFAVSKPLPSSAAA 1260
Query: 1332 IDLNQPPSTSTQLNFSSPVVSSSNSLFQAPKIVPTSPTLSSLNPTLESSKTELSVPKSND 1391
IDLNQP STSTQLNFSSPVVS S+SLFQA K+V TS TLSSLNP LESSK EL V KS
Sbjct: 1261 IDLNQPLSTSTQLNFSSPVVSVSDSLFQATKMVSTSSTLSSLNPILESSKKELPVSKSEG 1320
Query: 1392 DAEEQILSSKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQDSNVVGNAQPQQPS 1451
D E++ +SKP SHELKFQPSITPA+KNH+EPTSKTQTV KDVGGQ NV+G+AQPQ PS
Sbjct: 1321 DTEKKTPASKPESHELKFQPSITPANKNHLEPTSKTQTVPKDVGGQIPNVIGDAQPQPPS 1380
Query: 1452 VAFASIPSPNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSST 1511
VAFAS+PSPNLTSK N RNETSN VTQDDDMDEEAPET NNVEFNLS LGGFGNSS+
Sbjct: 1381 VAFASLPSPNLTSKTSGNGRNETSNVTVTQDDDMDEEAPETINNVEFNLSGLGGFGNSSS 1440
Query: 1512 PISGGPKPNPFGGPFGNVNAASMTSSFNMASPPSGELFRPASFSFQSPLASQAASQPTNS 1571
P+SG PKPNPFGG FGNVNA SMTSSF MASPPSGELFRPASFSFQSPLASQAAS PTNS
Sbjct: 1441 PMSGAPKPNPFGGSFGNVNATSMTSSFTMASPPSGELFRPASFSFQSPLASQAASLPTNS 1500
Query: 1572 VAFSGAFGSAVPTQPPSQGGFGQPSQIGVGQQALGNVLGSFGQSRQLGPTVHGTGSGSPG 1631
VAFSG FGSA+ TQ PSQGGFGQP+QIGVGQQALGNVLGSFGQSRQLGP++ GTGSGS G
Sbjct: 1501 VAFSGGFGSAMATQAPSQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPSLPGTGSGSLG 1560
Query: 1632 GFSGGFTNAKPVGVGGFAGVGS------------GGGGGFGGVG----GFAGAASTGGGF 1691
FSGGFT KPV VGGFAGVGS GGGGGFGGVG GFAG STGGGF
Sbjct: 1561 SFSGGFTGMKPVAVGGFAGVGSSGGSGGFAGVGSGGGGGFGGVGSAAGGFAGTISTGGGF 1620
Query: 1692 AGASSTAGGFAGAAGGGFGGTAGG--------FGAFGSQQVSGGFSAFGAAAAAAAAGGA 1734
AGAS+T GGFAGAAGGGF G AGG FGAFGSQQ SGGFSAFGAAA GGA
Sbjct: 1621 AGASATTGGFAGAAGGGFAGAAGGGFAGAAGGFGAFGSQQGSGGFSAFGAAA-----GGA 1679
BLAST of Cucsat.G9615.T1 vs. ExPASy TrEMBL
Match:
A0A0A0KV45 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G583270 PE=4 SV=1)
HSP 1 Score: 3090 bits (8012), Expect = 0.0
Identity = 1663/1704 (97.59%), Postives = 1663/1704 (97.59%), Query Frame = 0
Query: 72 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 131
MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE
Sbjct: 1 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 60
Query: 132 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 191
SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS
Sbjct: 61 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 120
Query: 192 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 251
VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY
Sbjct: 121 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180
Query: 252 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD 311
QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD
Sbjct: 181 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD 240
Query: 312 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 371
TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD
Sbjct: 241 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 300
Query: 372 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 431
IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID
Sbjct: 301 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360
Query: 432 RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 491
RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420
Query: 492 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKESREANIDHRMQVTEKIAISS 551
IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKESREANIDHRMQVTEKIAISS
Sbjct: 421 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKESREANIDHRMQVTEKIAISS 480
Query: 552 EIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPERP 611
EIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPERP
Sbjct: 481 EIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPERP 540
Query: 612 PHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSEK 671
PHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSEK
Sbjct: 541 PHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSEK 600
Query: 672 AMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGRQ 731
AMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGRQ
Sbjct: 601 AMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGRQ 660
Query: 732 ATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLDT 791
ATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLDT
Sbjct: 661 ATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLDT 720
Query: 792 LLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQ 851
LLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQ
Sbjct: 721 LLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQ 780
Query: 852 VLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNG 911
VLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNG
Sbjct: 781 VLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNG 840
Query: 912 LELNKFGGNEESQVSERALQRKFGSSRHSHSVHSLNNIMGSQLATAQLLSESLSKQLAAL 971
LELNKFGGNEESQVSERALQRKFGSSRHSHSVHSLNNIMGSQLATAQLLSESLSKQLAAL
Sbjct: 841 LELNKFGGNEESQVSERALQRKFGSSRHSHSVHSLNNIMGSQLATAQLLSESLSKQLAAL 900
Query: 972 NMESPSLKRQSATKELFESIGLTYDASFSSPNVNKIAETSSKKLLLSSDSFSSKGTSRRK 1031
NMESPSLKRQSATKELFESIGLTYDASFSSPNVNKIAETSSKKLLLSSDSFSSKGTSRRK
Sbjct: 901 NMESPSLKRQSATKELFESIGLTYDASFSSPNVNKIAETSSKKLLLSSDSFSSKGTSRRK 960
Query: 1032 QQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGIPSSEEKQFCSRTPEGAAT 1091
QQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGIPSSEEKQFCSRTPEGAAT
Sbjct: 961 QQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGIPSSEEKQFCSRTPEGAAT 1020
Query: 1092 VARPASRITSSISSSSKNAGHDSENPETPFMWNSPLQPSNTSRQKSLPLQKINVTPPSPP 1151
VARPASRITSSISSSSKNAGHDSENPETPFMWNSPLQPSNTSRQKSLPLQKINVTPPSPP
Sbjct: 1021 VARPASRITSSISSSSKNAGHDSENPETPFMWNSPLQPSNTSRQKSLPLQKINVTPPSPP 1080
Query: 1152 PVFQSSHDMLKKKNNEAHSVTSENKFTDVACPEKSKASDFFSATRSDSVQKSNINVDQKS 1211
PVFQSSHDMLKKKNNEAHSVTSENKFTDVACPEKSKASDFFSATRSDSVQKSNINVDQKS
Sbjct: 1081 PVFQSSHDMLKKKNNEAHSVTSENKFTDVACPEKSKASDFFSATRSDSVQKSNINVDQKS 1140
Query: 1212 SIFTISSKQMPTPIDSIATSNVDNQKTANVKERHTTTSPFFGSANKPESPFVGSMPSLVP 1271
SIFTISSKQMPTPIDSIATSNVDNQKTANVKERHTTTSPFFGSANKPESPFVGSMPSLVP
Sbjct: 1141 SIFTISSKQMPTPIDSIATSNVDNQKTANVKERHTTTSPFFGSANKPESPFVGSMPSLVP 1200
Query: 1272 TVDGSRKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKALPSSAAVIDLNQPPS 1331
TVDGSRKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKALPSSAAVIDLNQPPS
Sbjct: 1201 TVDGSRKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKALPSSAAVIDLNQPPS 1260
Query: 1332 TSTQLNFSSPVVSSSNSLFQAPKIVPTSPTLSSLNPTLESSKTELSVPKSNDDAEEQILS 1391
TSTQLNFSSPVVSSSNSLFQAPKIVPTSPTLSSLNPTLESSKTELSVPKSNDDAEEQILS
Sbjct: 1261 TSTQLNFSSPVVSSSNSLFQAPKIVPTSPTLSSLNPTLESSKTELSVPKSNDDAEEQILS 1320
Query: 1392 SKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQDSNVVGNAQPQQPSVAFASIPS 1451
SKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQDSNVVGNAQPQQPSVAFASIPS
Sbjct: 1321 SKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQDSNVVGNAQPQQPSVAFASIPS 1380
Query: 1452 PNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPISGGPKP 1511
PNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPISGGPKP
Sbjct: 1381 PNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPISGGPKP 1440
Query: 1512 NPFGGPFGNVNAASMTSSFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGAFG 1571
NPFGGPFGNVNAASMTSSFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGAFG
Sbjct: 1441 NPFGGPFGNVNAASMTSSFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFSGAFG 1500
Query: 1572 SAVPTQPPSQGGFGQPSQIGVGQQALGNVLGSFGQSRQLGPTVHGTGSGSPGGFSGGFTN 1631
SAVPTQPPSQGGFGQPSQIGVGQQALGNVLGSFGQSRQLGPTVHGTGSGSPGGFSGGFTN
Sbjct: 1501 SAVPTQPPSQGGFGQPSQIGVGQQALGNVLGSFGQSRQLGPTVHGTGSGSPGGFSGGFTN 1560
Query: 1632 AKPVGVGGFAGVGSGGGGGFGGVGGFAGAASTGGGFAGASSTAGGFAGAAGGGFGGTAGG 1691
AKPVGVGGFAGVGSGGGGGFGGVGGFAGAASTGGGFAGASSTAGGFAGAAGGGFGGTAGG
Sbjct: 1561 AKPVGVGGFAGVGSGGGGGFGGVGGFAGAASTGGGFAGASSTAGGFAGAAGGGFGGTAGG 1620
Query: 1692 FGAFGSQQVSGGFSAFGAAAAAAAA----------------------------------- 1734
FGAFGSQQVSGGFSAFGAAAAAAAA
Sbjct: 1621 FGAFGSQQVSGGFSAFGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 1680
BLAST of Cucsat.G9615.T1 vs. ExPASy TrEMBL
Match:
A0A5A7SY34 (Nuclear pore complex protein NUP214 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold65G001060 PE=4 SV=1)
HSP 1 Score: 2877 bits (7459), Expect = 0.0
Identity = 1570/1693 (92.73%), Postives = 1597/1693 (94.33%), Query Frame = 0
Query: 72 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 131
MASVDSG S LIPLEDAGEGEQIVRND YFQKIGKPVPVKLGDSIFDPESPPSQP+ALSE
Sbjct: 1 MASVDSGSSPLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLGDSIFDPESPPSQPIALSE 60
Query: 132 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 191
SSGLIFVAHLSGFFVVRIKDVIASA+EIKNGGT SSVQDLSIVDVSIGKVHILAVSTDNS
Sbjct: 61 SSGLIFVAHLSGFFVVRIKDVIASAQEIKNGGTCSSVQDLSIVDVSIGKVHILAVSTDNS 120
Query: 192 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 251
VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY
Sbjct: 121 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180
Query: 252 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD 311
QGS NGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSH+FKERLSMSLLPSLGNGETD
Sbjct: 181 QGSVNGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHRFKERLSMSLLPSLGNGETD 240
Query: 312 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 371
TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLV VI+SKDGKITDVSSNKVLLSFCD
Sbjct: 241 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVLVIKSKDGKITDVSSNKVLLSFCD 300
Query: 372 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 431
IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID
Sbjct: 301 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360
Query: 432 RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 491
RNTSLPKIELQANGDDNLVMGLC+DRVSL GKVIVKVGFEDMREVSPYCILVCLTLEGEL
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCVDRVSLPGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420
Query: 492 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESK-ESREANIDHRMQVTEKIAIS 551
IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESK ESREAN+D +MQVTEKI IS
Sbjct: 421 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKKESREANVDLKMQVTEKITIS 480
Query: 552 SEIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPER 611
SEIPREK KTSNDIKSS ND+S V NIDESAIVSPEGNTKSQKVDSFI+SQSLKSSAPER
Sbjct: 481 SEIPREKVKTSNDIKSSNNDRSPVSNIDESAIVSPEGNTKSQKVDSFIHSQSLKSSAPER 540
Query: 612 PPHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSE 671
PP+ EIGNFDKPVLKFTGLGS SISGK EDVPSQPFPNVKES KRLGSTGL+AASELSSE
Sbjct: 541 PPNNEIGNFDKPVLKFTGLGSVSISGKPEDVPSQPFPNVKESQKRLGSTGLVAASELSSE 600
Query: 672 KAMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGR 731
K M FKKIDPV SV TSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGR
Sbjct: 601 KTMFFKKIDPVSSVLTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGR 660
Query: 732 QATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLD 791
Q TGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLD
Sbjct: 661 QVTGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLD 720
Query: 792 TLLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMV 851
TLLESIEESGGFMDACTAFQKSSVEALELGLASLSD CQIWRSTMNER QEVQNLFDKMV
Sbjct: 721 TLLESIEESGGFMDACTAFQKSSVEALELGLASLSDECQIWRSTMNERVQEVQNLFDKMV 780
Query: 852 QVLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQ-------------- 911
QVLSKKTYIEGIVMQ+SDSKYWEQWDRQKLSSELELKRQHILKMNQ
Sbjct: 781 QVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQVGLFLFQKPLNFSN 840
Query: 912 -----------NITNQLIELERHFNGLELNKFGGNEESQVSERALQRKFGSSRHSHSVHS 971
NITNQLIELERHFNGLELNKFGGNEESQVSERALQRKFGSSRHSHS+HS
Sbjct: 841 FRCYLYSSFFQNITNQLIELERHFNGLELNKFGGNEESQVSERALQRKFGSSRHSHSLHS 900
Query: 972 LNNIMGSQLATAQLLSESLSKQLAALNMESPSLKRQSATKELFESIGLTYDASFSSPNVN 1031
LNNIMGSQLATAQLLSESLSKQLAALNMESP LKRQSATKELFE+IGLTYDASFSSPNVN
Sbjct: 901 LNNIMGSQLATAQLLSESLSKQLAALNMESPPLKRQSATKELFETIGLTYDASFSSPNVN 960
Query: 1032 KIAETSSKKLLLSSDSFSSKGTSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTV 1091
KIA+TSSKKLLLSSDSFSSKGTSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTV
Sbjct: 961 KIADTSSKKLLLSSDSFSSKGTSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTV 1020
Query: 1092 KRMLLQGIPSSEEKQFCSRTPEGAATVARPASRITSSISSSSKNAGHDSENPETPFMWNS 1151
KRMLLQG PSSEEKQF SRTPEGAATV RPASRITSSISSSSKNAGHDSENP TPFMW S
Sbjct: 1021 KRMLLQGTPSSEEKQFRSRTPEGAATVERPASRITSSISSSSKNAGHDSENPATPFMWAS 1080
Query: 1152 PLQPSNTSRQKSLPLQKINVTPPSPPPVFQSSHDMLKKKNNEAHSVTSENKFTDVACPEK 1211
LQPSNTSRQKSLPLQK N T PSPPPVFQSSHDMLKK NN AHS TSENKFTD+ACPEK
Sbjct: 1081 VLQPSNTSRQKSLPLQKTNATAPSPPPVFQSSHDMLKKNNNAAHSATSENKFTDMACPEK 1140
Query: 1212 SKASDFFSATRSDSVQKSNINVDQKSSIFTISSKQMPTPIDSIATSNVDNQKTANVKERH 1271
SKASDFFSATRSDSVQKS INVDQKSSIFTISSKQ P P DSI TSNVDNQKTANVKERH
Sbjct: 1141 SKASDFFSATRSDSVQKSKINVDQKSSIFTISSKQTPPPEDSIGTSNVDNQKTANVKERH 1200
Query: 1272 TTTSPFFGSANKPESPFVGSMPSLVPTVDGSRKTEEKKSVTTISQSVSAPAPLNTSSSAS 1331
TTTS FGSANKPESPFVG+MPSLVPTVDG+RKTEEKKSVTTISQSVSAPAPLNTSSSAS
Sbjct: 1201 TTTSQLFGSANKPESPFVGTMPSLVPTVDGARKTEEKKSVTTISQSVSAPAPLNTSSSAS 1260
Query: 1332 TLFSGFAVSKALPSSAAV---IDLNQPPSTSTQLNFSSPVVSSSNSLFQAPKIVPTSPTL 1391
TLFSGFAVSK+LPSSAAV +DLNQP STSTQLNFS PVVS SNSLFQAPK VPTSPTL
Sbjct: 1261 TLFSGFAVSKSLPSSAAVAAVVDLNQPQSTSTQLNFS-PVVSGSNSLFQAPK-VPTSPTL 1320
Query: 1392 SSLNPTLESSKTELSVPKSNDDAEEQILSSKPGSHELKFQPSITPADKNHVEPTSKTQTV 1451
SSLNPT+ESSKTELSV KSNDDAE+Q LSSKPGSHELKFQPSITPADKNHVEPTSKTQTV
Sbjct: 1321 SSLNPTMESSKTELSVLKSNDDAEKQTLSSKPGSHELKFQPSITPADKNHVEPTSKTQTV 1380
Query: 1452 FKDVGGQDSNVVGNAQPQQPSVAFASIPSPNLTSKIFANSRNETSNAVVTQDDDMDEEAP 1511
FKDVGGQ NVVG+AQ QQPSVAFASIPS NLTSKIFANSRNETSNAVVTQDDDMDEEAP
Sbjct: 1381 FKDVGGQVPNVVGDAQAQQPSVAFASIPSQNLTSKIFANSRNETSNAVVTQDDDMDEEAP 1440
Query: 1512 ETNNNVEFNLSSLGGFGNSSTPISGGPKPNPFGGPFGNVNAASMTSSFNMASPPSGELFR 1571
ETNNNVEFNLSSLGGFGNSSTPISG PKPNPFGGPFGNVNAAS+T+SFNMASPPSGELFR
Sbjct: 1441 ETNNNVEFNLSSLGGFGNSSTPISGAPKPNPFGGPFGNVNAASVTTSFNMASPPSGELFR 1500
Query: 1572 PASFSFQSPLASQAASQPTNSVAFSGAFGSAVPTQPPSQGGFGQPSQIGVGQQALGNVLG 1631
PASFSFQSPLASQAASQPTNSVAFSGAFGSAV TQ P QGGFGQP+QIGVGQQALGNVLG
Sbjct: 1501 PASFSFQSPLASQAASQPTNSVAFSGAFGSAVATQAPPQGGFGQPAQIGVGQQALGNVLG 1560
Query: 1632 SFGQSRQLGPTVHGTGSGSPGGFSGGFTNAKPVGVGGFAGVGSGGGGGFGGVGGFAGAAS 1691
SFGQSRQLGPT+ GTGSGSPGGFSGGFTNAKPVGVGGFAGVGSGGGGGFGGVGGFAGAAS
Sbjct: 1561 SFGQSRQLGPTLPGTGSGSPGGFSGGFTNAKPVGVGGFAGVGSGGGGGFGGVGGFAGAAS 1620
Query: 1692 TGGGFAGASSTAGGFAGAAGGGFGGTAGGFGAFGSQQVSGG-FSAFGAAAAAAAAGGAGV 1734
TGGGFAGASST GGFAGAAGGGFGGTAGGFGAFGSQQVSGG FSAFG AAA GGA V
Sbjct: 1621 TGGGFAGASSTTGGFAGAAGGGFGGTAGGFGAFGSQQVSGGGFSAFGTAAA----GGASV 1680
BLAST of Cucsat.G9615.T1 vs. ExPASy TrEMBL
Match:
A0A1S3BDU8 (LOW QUALITY PROTEIN: nuclear pore complex protein NUP214 OS=Cucumis melo OX=3656 GN=LOC103488807 PE=4 SV=1)
HSP 1 Score: 2837 bits (7354), Expect = 0.0
Identity = 1553/1668 (93.11%), Postives = 1583/1668 (94.90%), Query Frame = 0
Query: 72 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 131
MASVDSG S LIPLEDAGEGEQIVRND YFQKIGKPVPVKLGDSIFDPESPPSQP+ALSE
Sbjct: 1 MASVDSGSSPLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLGDSIFDPESPPSQPIALSE 60
Query: 132 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 191
SSGLIFVAHLSGFFVVRIKDVIASA+EIKNGGT SSVQDLSIVDVSIGKVHILAVSTDNS
Sbjct: 61 SSGLIFVAHLSGFFVVRIKDVIASAQEIKNGGTCSSVQDLSIVDVSIGKVHILAVSTDNS 120
Query: 192 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 251
VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY
Sbjct: 121 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180
Query: 252 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD 311
QGS NGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSH+FKERLSMSLLPSLGNGETD
Sbjct: 181 QGSVNGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHRFKERLSMSLLPSLGNGETD 240
Query: 312 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 371
TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLV VI+SKDGKITDVSSNKVLLSFCD
Sbjct: 241 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVLVIKSKDGKITDVSSNKVLLSFCD 300
Query: 372 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 431
IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID
Sbjct: 301 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360
Query: 432 RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 491
RNTSLPKIELQANGDDNLVMGLC+DRVSL GKVIVKVGFEDMREVSPYCILVCLTLEGEL
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCVDRVSLPGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420
Query: 492 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESK-ESREANIDHRMQVTEKIAIS 551
IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESK ESREAN+D +MQVTEKI IS
Sbjct: 421 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKKESREANVDLKMQVTEKITIS 480
Query: 552 SEIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPER 611
SEIPREK KTSNDIKSS ND+S V NIDESAIVSPEGNTKSQKVDSFI+SQSLKSSAPER
Sbjct: 481 SEIPREKVKTSNDIKSSNNDRSPVSNIDESAIVSPEGNTKSQKVDSFIHSQSLKSSAPER 540
Query: 612 PPHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSE 671
PP+ EIGNFDKPVLKFTGLGS SISGK EDVPSQPFPNVKES KRLGSTGL+AASELSSE
Sbjct: 541 PPNNEIGNFDKPVLKFTGLGSVSISGKPEDVPSQPFPNVKESQKRLGSTGLVAASELSSE 600
Query: 672 KAMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGR 731
K M FKK+ V SV TSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGR
Sbjct: 601 KTMFFKKLI-VSSVLTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGR 660
Query: 732 QATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLD 791
Q TGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLD
Sbjct: 661 QVTGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLD 720
Query: 792 TLLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMV 851
TLLESIEESGGFMDACTAFQKSSVEALELGLASLSD CQIWRSTMNER QEVQNLFDKMV
Sbjct: 721 TLLESIEESGGFMDACTAFQKSSVEALELGLASLSDECQIWRSTMNERVQEVQNLFDKMV 780
Query: 852 QVLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFN 911
QVLSKKTYIEGIVMQ+SDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFN
Sbjct: 781 QVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFN 840
Query: 912 GLELNKFGGNEESQVSERALQRKFGSSRHSHSVHSLNNIMGSQLATAQLLSESLSKQLAA 971
GLELNKFGGNEESQVSERALQRKFGSSRHSHS+HSLNNIMGSQLATAQLLSESLSKQLAA
Sbjct: 841 GLELNKFGGNEESQVSERALQRKFGSSRHSHSLHSLNNIMGSQLATAQLLSESLSKQLAA 900
Query: 972 LNMESPSLKRQSATKELFESIGLTYDASFSSPNVNKIAETSSKKLLLSSDSFSSKGTSRR 1031
LNMESP LKRQSATKELFE+IGLTYDASFSSPNVNKIA+TSSKKLLLSSDSFSSKGTSRR
Sbjct: 901 LNMESPPLKRQSATKELFETIGLTYDASFSSPNVNKIADTSSKKLLLSSDSFSSKGTSRR 960
Query: 1032 KQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGIPSSEEKQFCSRTPEGAA 1091
KQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQG PSSEEKQF SRTPEGAA
Sbjct: 961 KQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGTPSSEEKQFRSRTPEGAA 1020
Query: 1092 TVARPASRITSSISSSSKNAGHDSENPETPFMWNSPLQPSNTSRQKSLPLQKINVTPPSP 1151
TV RPASRITSSISSSSKNAGHDSENP TPFMW S LQPSNTSRQKSLPLQK N T PSP
Sbjct: 1021 TVERPASRITSSISSSSKNAGHDSENPATPFMWASVLQPSNTSRQKSLPLQKTNATAPSP 1080
Query: 1152 PPVFQSSHDMLKKKNNEAHSVTSENKFTDVACPEKSKASDFFSATRSDSVQKSNINVDQK 1211
PPVFQSSHDMLKK + +V + PEKSKASDFFSATRSDSVQKS INVDQK
Sbjct: 1081 PPVFQSSHDMLKKIIMQL-TVRLQKTNLRTWHPEKSKASDFFSATRSDSVQKSKINVDQK 1140
Query: 1212 SSIFTISSKQMPTPIDSIATSNVDNQKTANVKERHTTTSPFFGSANKPESPFVGSMPSLV 1271
SSIFTISSKQ P P DSI TSNVDNQKTANVKERHTTTS FGSANKPESPFVG+MPSLV
Sbjct: 1141 SSIFTISSKQTPPPEDSIGTSNVDNQKTANVKERHTTTSQLFGSANKPESPFVGTMPSLV 1200
Query: 1272 PTVDGSRKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKALPSSAAV---IDLN 1331
PTVDG+RKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSK+LPSSAAV +DLN
Sbjct: 1201 PTVDGARKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKSLPSSAAVAAVVDLN 1260
Query: 1332 QPPSTSTQLNFSSPVVSSSNSLFQAPKIVPTSPTLSSLNPTLESSKTELSVPKSNDDAEE 1391
QP STSTQLNFS PVVS SNSLFQAPK VPTSPTLSSLNPT+ESSKTELSV KSNDDAE+
Sbjct: 1261 QPQSTSTQLNFS-PVVSGSNSLFQAPK-VPTSPTLSSLNPTMESSKTELSVLKSNDDAEK 1320
Query: 1392 QILSSKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQDSNVVGNAQPQQPSVAFA 1451
Q LSSKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQ NVVG+AQ QQPSVAFA
Sbjct: 1321 QTLSSKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQVPNVVGDAQAQQPSVAFA 1380
Query: 1452 SIPSPNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPISG 1511
SIPS NLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPISG
Sbjct: 1381 SIPSQNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGFGNSSTPISG 1440
Query: 1512 GPKPNPFGGPFGNVNAASMTSSFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFS 1571
PKPNPFGGPFGNVNAAS+T+SFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFS
Sbjct: 1441 APKPNPFGGPFGNVNAASVTTSFNMASPPSGELFRPASFSFQSPLASQAASQPTNSVAFS 1500
Query: 1572 GAFGSAVPTQPPSQGGFGQPSQIGVGQQALGNVLGSFGQSRQLGPTVHGTGSGSPGGFSG 1631
GAFGSAV TQ P QGGFGQP+QIGVGQQALGNVLGSFGQSRQLGPT+ GTGSGSPGGFSG
Sbjct: 1501 GAFGSAVATQAPPQGGFGQPAQIGVGQQALGNVLGSFGQSRQLGPTLPGTGSGSPGGFSG 1560
Query: 1632 GFTNAKPVGVGGFAGVGSGGGGGFGGVGGFAGAASTGGGFAGASSTAGGFAGAAGGGFGG 1691
GFTNAKPVGVGGFAGVGSGGGGGFGGVGGFAGAASTGGGFAGASST GGFAGAAGGGFGG
Sbjct: 1561 GFTNAKPVGVGGFAGVGSGGGGGFGGVGGFAGAASTGGGFAGASSTTGGFAGAAGGGFGG 1620
Query: 1692 TAGGFGAFGSQQVSGG-FSAFGAAAAAAAAGGAGVTGKPPELFTQIRK 1734
TAGGFGAFGSQQVSGG FSAFG AAA GGA VTGKPPELFTQIRK
Sbjct: 1621 TAGGFGAFGSQQVSGGGFSAFGTAAA----GGASVTGKPPELFTQIRK 1660
BLAST of Cucsat.G9615.T1 vs. ExPASy TrEMBL
Match:
A0A5D3CWG0 (Nuclear pore complex protein NUP214 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold35G003090 PE=4 SV=1)
HSP 1 Score: 2306 bits (5977), Expect = 0.0
Identity = 1276/1388 (91.93%), Postives = 1300/1388 (93.66%), Query Frame = 0
Query: 373 HSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNIDR 432
+ FTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNIDR
Sbjct: 15 YRSFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNIDR 74
Query: 433 NTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGELI 492
NTSLPKIELQANGDDNLVMGLC+DRVSL GKVIVKVGFEDMREVSPYCILVCLTLEGELI
Sbjct: 75 NTSLPKIELQANGDDNLVMGLCVDRVSLPGKVIVKVGFEDMREVSPYCILVCLTLEGELI 134
Query: 493 MFQFSS---------------------VNETEAPHETVSACDDEEDDITVPTDDRSESK- 552
MFQFSS VNETEAPHETVSACDDEEDDITVPTDDRSESK
Sbjct: 135 MFQFSSLNIMIIAPDSLLLWVFNFSCSVNETEAPHETVSACDDEEDDITVPTDDRSESKK 194
Query: 553 ESREANIDHRMQVTEKIAISSEIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTK 612
ESREAN+D +MQVTEKI ISSEIPREK KTSNDIKSS ND+S V NIDESAIVSPEGNTK
Sbjct: 195 ESREANVDLKMQVTEKITISSEIPREKVKTSNDIKSSNNDRSPVSNIDESAIVSPEGNTK 254
Query: 613 SQKVDSFIYSQSLKSSAPERPPHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVK 672
SQKVDSFI+SQSLKSSAPERPP+ EIGNFDKPVLKFTGLGS SISGK EDVPSQPFPNVK
Sbjct: 255 SQKVDSFIHSQSLKSSAPERPPNNEIGNFDKPVLKFTGLGSVSISGKPEDVPSQPFPNVK 314
Query: 673 ESTKRLGSTGLMAASELSSEKAMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTG 732
ES KRLGSTGL+AASELSSEK M FKKIDPV SV TSNSLQSSNTENYGPSFGTANAFTG
Sbjct: 315 ESQKRLGSTGLVAASELSSEKTMFFKKIDPVSSVLTSNSLQSSNTENYGPSFGTANAFTG 374
Query: 733 FAGKPFQPKDVPSTLTQSGRQATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSE 792
FAGKPFQPKDVPSTLTQSGRQ TGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSE
Sbjct: 375 FAGKPFQPKDVPSTLTQSGRQVTGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSE 434
Query: 793 RYYSNSPLAKPMKEMCEGLDTLLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQI 852
RYYSNSPLAKPMKEMCEGLDTLLESIEESGGFMDACTAFQKSSVEALELGLASLSD CQI
Sbjct: 435 RYYSNSPLAKPMKEMCEGLDTLLESIEESGGFMDACTAFQKSSVEALELGLASLSDECQI 494
Query: 853 WRSTMNERSQEVQNLFDKMVQVLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQH 912
WRSTMNER QEVQNLFDKMVQVLSKKTYIEGIVMQ+SDSKYWEQWDRQKLSSELELKRQH
Sbjct: 495 WRSTMNERVQEVQNLFDKMVQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQH 554
Query: 913 ILKMNQNITNQLIELERHFNGLELNKFGGNEESQVSERALQRKFGSSRHSHSVHSLNNIM 972
ILKMNQNITNQLIELERHFNGLELNKFGGNEESQVSERALQRKFGSSRHSHS+HSLNNIM
Sbjct: 555 ILKMNQNITNQLIELERHFNGLELNKFGGNEESQVSERALQRKFGSSRHSHSLHSLNNIM 614
Query: 973 GSQLATAQLLSESLSKQLAALNMESPSLKRQSATKELFESIGLTYDASFSSPNVNKIAET 1032
GSQLATAQLLSESLSKQLAALNMESP LKRQSATKELFE+IGLTYDASFSSPNVNKIA+T
Sbjct: 615 GSQLATAQLLSESLSKQLAALNMESPPLKRQSATKELFETIGLTYDASFSSPNVNKIADT 674
Query: 1033 SSKKLLLSSDSFSSKGTSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLL 1092
SSKKLLLSSDSFSSKGTSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLL
Sbjct: 675 SSKKLLLSSDSFSSKGTSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLL 734
Query: 1093 QGIPSSEEKQFCSRTPEGAATVARPASRITSSISSSSKNAGHDSENPETPFMWNSPLQPS 1152
QG PSSEEKQF SRTPEGAATV RPASRITSSISSSSKNAGHDSENP TPFMW S LQPS
Sbjct: 735 QGTPSSEEKQFRSRTPEGAATVERPASRITSSISSSSKNAGHDSENPATPFMWASVLQPS 794
Query: 1153 NTSRQKSLPLQKINVTPPSPPPVFQSSHDMLKKKNNEAHSVTSENKFTDVACPEKSKASD 1212
NTSRQKSLPLQK N T PSPPPVFQSSHDMLKK NN AHS TSENKFTD+ACPEKSKASD
Sbjct: 795 NTSRQKSLPLQKTNATAPSPPPVFQSSHDMLKKNNNAAHSATSENKFTDMACPEKSKASD 854
Query: 1213 FFSATRSDSVQKSNINVDQKSSIFTISSKQMPTPIDSIATSNVDNQKTANVKERHTTTSP 1272
FFSATRSDSVQKS INVDQKSSIFTISSKQ P P DSI TSNVDNQKTANVKERHTTTS
Sbjct: 855 FFSATRSDSVQKSKINVDQKSSIFTISSKQTPPPEDSIGTSNVDNQKTANVKERHTTTSQ 914
Query: 1273 FFGSANKPESPFVGSMPSLVPTVDGSRKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSG 1332
FGSANKPESPFVG+MPSLVPTVDG+RKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSG
Sbjct: 915 LFGSANKPESPFVGTMPSLVPTVDGARKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSG 974
Query: 1333 FAVSKALPSSAAV---IDLNQPPSTSTQLNFSSPVVSSSNSLFQAPKIVPTSPTLSSLNP 1392
FAVSK+LPSSAAV +DLNQP STSTQLNFS PVVS SNSLFQAPK VPTSPTLSSLNP
Sbjct: 975 FAVSKSLPSSAAVAAVVDLNQPQSTSTQLNFS-PVVSGSNSLFQAPK-VPTSPTLSSLNP 1034
Query: 1393 TLESSKTELSVPKSNDDAEEQILSSKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVG 1452
T+ESSKTELSV KSNDDAE+Q LSSKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVG
Sbjct: 1035 TMESSKTELSVLKSNDDAEKQTLSSKPGSHELKFQPSITPADKNHVEPTSKTQTVFKDVG 1094
Query: 1453 GQDSNVVGNAQPQQPSVAFASIPSPNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNN 1512
GQ NVVG+AQ QQPSVAFASIPS NLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNN
Sbjct: 1095 GQVPNVVGDAQAQQPSVAFASIPSQNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNN 1154
Query: 1513 VEFNLSSLGGFGNSSTPISGGPKPNPFGGPFGNVNAASMTSSFNMASPPSGELFRPASFS 1572
VEFNLSSLGGFGNSSTPISG PKPNPFGGPFGNVNAAS+T+SFNMASPPSGELFRPASFS
Sbjct: 1155 VEFNLSSLGGFGNSSTPISGAPKPNPFGGPFGNVNAASVTTSFNMASPPSGELFRPASFS 1214
Query: 1573 FQSPLASQAASQPTNSVAFSGAFGSAVPTQPPSQGGFGQPSQIGVGQQALGNVLGSFGQS 1632
FQSPLASQAASQPTNSVAFSGAFGSAV TQ P QGGFGQP+QIGVGQQALGNVLGSFGQS
Sbjct: 1215 FQSPLASQAASQPTNSVAFSGAFGSAVATQAPPQGGFGQPAQIGVGQQALGNVLGSFGQS 1274
Query: 1633 RQLGPTVHGTGSGSPGGFSGGFTNAKPVGVGGFAGVGSGGGGGFGGVGGFAGAASTGGGF 1692
RQLGPT+ GTGSGSPGGFSGGFTNAKPVGVGGFAGVGSGGGGGFGGVGGFAGAASTGGGF
Sbjct: 1275 RQLGPTLPGTGSGSPGGFSGGFTNAKPVGVGGFAGVGSGGGGGFGGVGGFAGAASTGGGF 1334
Query: 1693 AGASSTAGGFAGAAGGGFGGTAGGFGAFGSQQVSGG-FSAFGAAAAAAAAGGAGVTGKPP 1734
AGASST GGFAGAAGGGFGGTAGGFGAFGSQQVSGG FSAFG AAA GGA VTGKPP
Sbjct: 1335 AGASSTTGGFAGAAGGGFGGTAGGFGAFGSQQVSGGGFSAFGTAAA----GGASVTGKPP 1394
BLAST of Cucsat.G9615.T1 vs. ExPASy TrEMBL
Match:
A0A6J1CBF2 (nuclear pore complex protein NUP214 OS=Momordica charantia OX=3673 GN=LOC111010057 PE=4 SV=1)
HSP 1 Score: 2163 bits (5605), Expect = 0.0
Identity = 1249/1727 (72.32%), Postives = 1373/1727 (79.50%), Query Frame = 0
Query: 69 LQFMASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLA 128
LQ S S+ I E+A EGE + D YF+KIG+PVPVKL DSIFD ESPPSQPLA
Sbjct: 4 LQDSTPSTSSTSTPIRFEEAEEGEHVESTDYYFEKIGEPVPVKLHDSIFDSESPPSQPLA 63
Query: 129 LSESSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVST 188
+SES GLIFVAHLSGFFV R +DVIASA+EIKNGGTGSSVQDLSI+DVS+G+VHILA+S
Sbjct: 64 VSESFGLIFVAHLSGFFVARTEDVIASAKEIKNGGTGSSVQDLSIMDVSVGRVHILALSA 123
Query: 189 DNSVLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHG 248
D+S +AAVVA D+H+FSV SLLDKA KP+ SCSITDSS IKDFKW RKLE++YLVLSKHG
Sbjct: 124 DSSTIAAVVAADIHLFSVHSLLDKAAKPFYSCSITDSSCIKDFKWIRKLESSYLVLSKHG 183
Query: 249 QLYQGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNG 308
QLYQGSANG L HVMHD DAVECSVKG+FIAVAKKDTLTIFS KFKERLSMSLLPS
Sbjct: 184 QLYQGSANGTLKHVMHDTDAVECSVKGRFIAVAKKDTLTIFSSKFKERLSMSLLPS---- 243
Query: 309 ETDTDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLS 368
+ D++F VKVDCIKWVRADCII+GCF+VTA GDEE+Y VQVIRSKDGKITDVSSN+VLLS
Sbjct: 244 DADSNFIVKVDCIKWVRADCIILGCFEVTAIGDEENYFVQVIRSKDGKITDVSSNRVLLS 303
Query: 369 FCDIHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVV 428
F IH GFTRDILP SGPCL SYL CKLAIVANR + HI LLG L EVEN+VAV+
Sbjct: 304 FQYIHPGFTRDILPVGSGPCLFSSYLGKCKLAIVANRNNTDQHIVLLGWLPEVENQVAVI 363
Query: 429 NIDRNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLE 488
+I+R+TSLP+IELQ NGDDNLVMGLCIDRVSL KV ++VG EDMREVSPYCIL+CLTLE
Sbjct: 364 DIERDTSLPRIELQENGDDNLVMGLCIDRVSLPAKVKIQVGVEDMREVSPYCILLCLTLE 423
Query: 489 GELIMFQFSSVNETEAPHETVSACDDEEDDIT-VPTDDR----SES-KESREANIDHRMQ 548
G+L+MF SS+NETE PHETVSAC+DEE+D T VP DD+ SES KE REA + M
Sbjct: 424 GKLVMFHLSSINETETPHETVSACEDEEEDDTIVPIDDQPQVSSESRKELREAMVGQ-MH 483
Query: 549 VTEKIAISSEIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQS 608
T+KI SSEIP EK SNDIK S DQS V ID+SAIVS E N+KS+KV SFIYSQ
Sbjct: 484 DTDKITTSSEIPEEKINISNDIKPSDIDQSPVSYIDKSAIVSRESNSKSEKVGSFIYSQP 543
Query: 609 LKSSAPERPPHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLM 668
LKSS E+P + EIGNF KPV KFTGLGS + SG+S DVPSQPF N KEST RLGSTGL
Sbjct: 544 LKSSILEKP-NSEIGNFGKPVQKFTGLGSVAFSGQSADVPSQPFLNAKESTLRLGSTGLQ 603
Query: 669 AASELSSEKAMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVP 728
ASELSS++AM KIDP SV NSLQS+ T+N GPSFG ANAFT F G+ FQ KDV
Sbjct: 604 DASELSSDRAMFLNKIDPASSVLPLNSLQSTKTDNLGPSFGAANAFTAFTGRSFQTKDVS 663
Query: 729 STLTQSGRQATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPM 788
STLTQ GRQ T GAGKIESLP +RSSQ+ LQD FS GK SNEKH SER YSN PLAKPM
Sbjct: 664 STLTQIGRQVTAGAGKIESLPPMRSSQVPLQDNFSLGKTSNEKHSRSERNYSNVPLAKPM 723
Query: 789 KEMCEGLDTLLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEV 848
KEMC+GLD LLESIEE GGF DACTA QKSS+EALELGLA+LSD CQIW TMNER+QE+
Sbjct: 724 KEMCDGLDMLLESIEEPGGFWDACTASQKSSIEALELGLATLSDQCQIWGRTMNERAQEI 783
Query: 849 QNLFDKMV-QVLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQ 908
QNLFDK V QV+ KKTYIEGIV Q+S S YWE WDRQ+LSSELELKRQHILK NQN+TNQ
Sbjct: 784 QNLFDKTVNQVMPKKTYIEGIVKQASHSHYWEHWDRQRLSSELELKRQHILKTNQNMTNQ 843
Query: 909 LIELERHFNGLELNKFGGNEESQVSERALQRKFGSSRHSHSVHSLNNIMGSQLATAQLLS 968
LIELERHFNGLELNKFGGN+ESQVSERALQRKFGSSRHSHS HSLNNI GSQLA AQLLS
Sbjct: 844 LIELERHFNGLELNKFGGNDESQVSERALQRKFGSSRHSHSFHSLNNITGSQLAAAQLLS 903
Query: 969 ESLSKQLAALNMESPSLKRQSATKELFESIGLTYDASFSSPNVNKIAETSSKKLLLSSDS 1028
ESLSKQ+AALN+ESPS KRQS TKELFE+IG+TYDASFSSPNVNKIAETSSKKLLLS+DS
Sbjct: 904 ESLSKQMAALNIESPSSKRQSVTKELFETIGITYDASFSSPNVNKIAETSSKKLLLSADS 963
Query: 1029 FSSKGTSRRKQQSGTKNSEAETGRRRRDSLDRNLASVDPPKTTVKRMLLQGIPSSEEKQF 1088
FSSK +SRRK +SG KNSEAETGRRRR+SLDRNLASV+PPKTTVKRMLL+GIP ++EK F
Sbjct: 964 FSSKDSSRRKLRSGMKNSEAETGRRRRESLDRNLASVEPPKTTVKRMLLEGIPLADEKHF 1023
Query: 1089 CSRTPEGAATVARPASRITSS-ISSSSKNAGHDSENPETPFMWNSPLQPSNTSRQKSLPL 1148
S TPEG ATV RPASRI SS +SSSSKNA H SENP TPFMW+SP Q SN SRQKS PL
Sbjct: 1024 RSPTPEGTATVTRPASRIASSMLSSSSKNAEHSSENPATPFMWSSPSQSSNISRQKSQPL 1083
Query: 1149 QKINVTPPSP-PPVFQSSHDMLKKKNNEAHSVTSENKFTDVACPEKSKASDFFSATRSDS 1208
+K N T PSP P V+QSSH+M KK N EA+SVTS+NKFT+ PEKSK+SDF S TRSDS
Sbjct: 1084 KKTNATAPSPLPVVYQSSHEMPKKSNTEAYSVTSDNKFTEATYPEKSKSSDFLSLTRSDS 1143
Query: 1209 VQKSNINVDQKSSIFTISSKQMPTPIDSIATSNVDNQKTANVKERHTTTSPFFGSANKPE 1268
VQKSNIN+DQKSSIF IS+ QMPT DSI TSN++ QKTANVKERHT S F SANKPE
Sbjct: 1144 VQKSNINLDQKSSIFKISNNQMPTLKDSINTSNLNGQKTANVKERHTPKSSLFESANKPE 1203
Query: 1269 SPFVGSMPSLVPTVDGSRKTEEKKSVTTISQSVSAPAPLNTSSSASTLFSGFAVSKALPS 1328
S FVG+ + VPTV G+RKTEEK S+T S SV APA LNT SSASTLFSGF+V+K+L +
Sbjct: 1204 SAFVGTASTPVPTVLGARKTEEKTSLTAFSPSVPAPALLNTPSSASTLFSGFSVTKSLTN 1263
Query: 1329 SAAVIDLNQPPSTSTQLNFSSPVVSSSNSLFQAPKIVPTSPTLSSLNPTLESSKTELSVP 1388
S A +DLN+P ST TQ NFSSP VS S+SLFQAPK+V SPT TLES K EL P
Sbjct: 1264 STAHVDLNKPLSTFTQSNFSSPAVSVSDSLFQAPKMVSPSPT------TLESKK-ELPGP 1323
Query: 1389 KSNDDAEEQILSSKPG-SHELKFQPSITPADKNHVEPTSKTQTVFKDVGGQDSNVVGNAQ 1448
KS+ D + SKP SHELK QPS+TPADKNHVEPTS +QTV KDVGG NV+
Sbjct: 1324 KSDADTPKPAPDSKPPESHELKLQPSVTPADKNHVEPTSGSQTVPKDVGGLVPNVL---- 1383
Query: 1449 PQQPSVAFASIPSPNLTSKIFANSRNETSNAVVTQDDDMDEEAPETNNNVEFNLSSLGGF 1508
QQ S AF +P+ NLTSK N +NETS+A +TQDDDMDEEAPETNN VEF+LSSLGGF
Sbjct: 1384 -QQSSAAFVPLPTLNLTSKSSTNGKNETSDAALTQDDDMDEEAPETNN-VEFSLSSLGGF 1443
Query: 1509 GNSSTPISGGPKPNPFGGPFGNVNAASMTSSFNMASPPSGELFRPASFSFQSPLASQAAS 1568
GNSSTPIS PK NPFGGPFGNVNA SM SSF MASPPSGELFRPASFSFQSPLASQAAS
Sbjct: 1444 GNSSTPISSAPKSNPFGGPFGNVNATSMNSSFTMASPPSGELFRPASFSFQSPLASQAAS 1503
Query: 1569 QPTNSVAFSGAFGSAVPTQPP--SQGGFGQPSQIGVGQQALGNVLGSFGQSRQLGPTVHG 1628
QPTNSVAFSG FGS + TQP SQGGFGQP+QIGVGQQALG VLG+FG+SRQLGP++ G
Sbjct: 1504 QPTNSVAFSGGFGSGMATQPQTSSQGGFGQPAQIGVGQQALGTVLGAFGRSRQLGPSLPG 1563
Query: 1629 TGSGSPGGFSGGFTNAKPVGVGGFAGVGSGGGGGFGGVG--------------------- 1688
T SGSP GFSGGFT KP+ GGFAGVGSG GGGFGGVG
Sbjct: 1564 TASGSPSGFSGGFTGVKPI--GGFAGVGSGSGGGFGGVGSVSGGGFGGVGSGSGGGFGAV 1623
Query: 1689 ----------------------------GFAGAASTGGGFAGASSTAGGFAGAAGGGFGG 1734
GF G A GGGFA AS GGFAGAAGGGF
Sbjct: 1624 GSSSGGGFGAVGSGNGGGFSGVGAGGGGGFGGVAPAGGGFAAASPATGGFAGAAGGGFP- 1683
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
F4I1T7 | 2.0e-242 | 38.44 | Nuclear pore complex protein NUP214 OS=Arabidopsis thaliana OX=3702 GN=NUP214 PE... | [more] |
Match Name | E-value | Identity | Description | |
XP_031741374.1 | 0.0 | 100.00 | nuclear pore complex protein NUP214 isoform X1 [Cucumis sativus] >KGN52214.2 hyp... | [more] |
XP_031741375.1 | 0.0 | 99.40 | nuclear pore complex protein NUP214 isoform X2 [Cucumis sativus] | [more] |
KAA0034115.1 | 0.0 | 92.73 | nuclear pore complex protein NUP214 [Cucumis melo var. makuwa] | [more] |
XP_008445928.2 | 0.0 | 93.11 | PREDICTED: LOW QUALITY PROTEIN: nuclear pore complex protein NUP214 [Cucumis mel... | [more] |
XP_038892124.1 | 0.0 | 83.60 | nuclear pore complex protein NUP214 isoform X2 [Benincasa hispida] | [more] |
Match Name | E-value | Identity | Description | |
A0A0A0KV45 | 0.0 | 97.59 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G583270 PE=4 SV=1 | [more] |
A0A5A7SY34 | 0.0 | 92.73 | Nuclear pore complex protein NUP214 OS=Cucumis melo var. makuwa OX=1194695 GN=E6... | [more] |
A0A1S3BDU8 | 0.0 | 93.11 | LOW QUALITY PROTEIN: nuclear pore complex protein NUP214 OS=Cucumis melo OX=3656... | [more] |
A0A5D3CWG0 | 0.0 | 91.93 | Nuclear pore complex protein NUP214 OS=Cucumis melo var. makuwa OX=1194695 GN=E5... | [more] |
A0A6J1CBF2 | 0.0 | 72.32 | nuclear pore complex protein NUP214 OS=Momordica charantia OX=3673 GN=LOC1110100... | [more] |