Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
CTTTCCCCCACAAGAAAATGCTAAAGGAAAGGAGAAAGAGAAATACATTTCATTTGCTTACCGGTTACCGCCCAAAGACACTTTGGAGTGCGATGCTCCTAAAACCCCAGGTGGAAGAACATCGGTTGGTTCCAAACCCCTTCGGCTGAGCTTCTCTATCGATTCAGAGAGGAGCATTTTGCAATTCATGGCTTCCGTTGATTCGGGACCTTCATCCTTGATTCCACTGGAGGACGCCGGCGAAGGAGAGCAAATTGTAAGGAACGATCTCTACTTCCAAAAGATCGGCAAACCTGTTCCCGTCAAGCTCGGCGACTCCATTTTTGACCCCGAAAGTCCTCCCTCTCAACCCCTTGCTCTTTCAGAGAGTTCCGGTCTCATCTTCGTTGCTCATTTGTCTGGTTGGTAATTTCAATTACTTTCCTCATTGTTGTGATTACCATTGGTTTTTCTATGGTTTTGAAGAAAATATTTCTTGTTAAGGTTTTTTTGTGGTGAGAATCAAGGATGTAATTGCTTCGGCCGAGGAGATAAAAAATGGGGGAACGGGTTCTTCTGTCCAGGACTTAAGCATTGTGGACGTTTCCATTGGAAAAGTTCACATTCTAGCAGTTTCCACTGACAACTCAGTTCTTGCTGCCGTCGTAGCTGGTGATGTTCATATTTTTTCAGTCCAGTCGCTGCTTGATAAGGTAATTATTTTCGCTGGAGCTCGTCGTTAGACCCCTAATATCATACCGTAGTTTATTTGAAACGTCAATTCAGGTATTTACATCGACGGGTAAAATGAGGCAGTCATTAGTTATATCTCTTGCGCTCCATTCTCTCATGTCATTGCTATTATGCAGGCAGAAAAACCCTATTCTTCTTGTTCAATAACTGATTCCAGTTTTATTAAAGACTTCAAATGGACCAGAAAGTTGGAAAATACTTATCTGGTTCTTTCAAAGCACGGACAGTTATATCAAGGATCGGCGAATGGGCCTCTTACACATGTAATGCACGATATTGATGCTGGTATGCTATTTATGAATACAGTAACTTGTGTATAATTCTCAGCAACAATATCAAGTGGCATTTGTTAAGTTTCTTATGTTTTATATTAAATTACTGTATACTTCAAGAAAGTGGATCTTTTTGTAACATACTTGCATGGGGAGATGCTGTCTTTCCCTGCACATTTTTCCTTCCATAATTCTCCATTTCTGAGTGCCTCTTCATAAAAGTTAAACGAGCTCTCTAAAATCTTTCTAGTCAACTCTTACAGTTACTACATCTATTGTGAAGAGCTGTTGTCCGTGTATATTTCATTTAATCATTTCAACTTTGTTTCTTATTGAAGGACCTTCTGTTCTATTTATAATTGTAAGGACGGTGACTTGTAACTGTAATTTGTAAATGAGGAATCATTGCAACCTAGGCCCGTGGTTGTTATTAGCATTCGCCCTTAGGGAAACAGCAAAGGACCATCATTGAATTGAGACTCAATTCACTGCTTGGAAATGTTGGAAGGATGACATGGGAACACTGCTCATTGGCACAAAAGTTAGGGGAGCTAGGTCAGAGATAGGAATGTGACTGAAGATAGGATATTCATAGATTGTTGGACTTAGTTAGTTTGACTATTTATCTTAGAGATCTTTATTTTTCCAGGAATTAAAACTCATTTTTACTTTTTGATACTGAACAATTTTAATTATTGCATTTTTCCCTTTCTCTTGCATGTACCCTGCTCTATGCAGTGTCTTTGGCGTATATTTTCTTCTTCTTTTGTATTCTCTTGACCCTGGTTTTCTTACTTTTTTTCCCAAAGCAATCCATTAAAATGAATTATTGTCAGCAAAACTTTGTTTGTGTAGAGCTACTATTTTTGAACTTCTTTTAAATCTTATTTTGGATTTAGTTGAATGCAGTGTGAAAGGGAAATTCATTGCTGTGGCTAAAAAGGACACTCTTACCATTTTCTCACACAAATTCAAAGAACGACTATCCATGTCACTCTTGCCGAGTTTAGGGAATGGTGAAACTGATACAGACTTTACAGTGAAGGGTTCTCCCTCTCTCTATCTCTCCTGTGTGTGTGTGTGTGTGTGTGTGTGAAAATGATATTTATAGAATATTGAAATGTTTAACAATTTTCATCATTCATTTTTTATTTGGTTGGGGGGGGGGGGGGGGGGGGATATTTCTTGTATTTTACCTTTGTGATGTGTCATATGGGGTATGCGAGCTCAGTATTTTTAGGTTGGCTACAAGTTGAGTTTGTGTTGTTAGCTTGGACTGAAGTACAAGTTGAGTTTGAGTTGTTAGCTCGGTATTTTTATAATTATGATAAACGAAAATGACCATTGAGAAGAGGGAGAGAATACTGGTGAGCATATATATTCTAACACATAAGGAGCCAAAATAAATCTCCTATTACTGGGTTTTCGGTTCATCATGGCTAGTTTGGAAATTTGCCACTTCCTTCCCCTTTTGTAATTCTTTTTCATTACCACATTATTCCATTTCCTAAATAAAATCATAGGGAAACAATTAAATGGGCATTTCAAGAGAACAAGGGATTCTAAATTCTAATTAACTCTAAATAATCCAAAAAAATTAATATAATTAAAAAAGAAATTTCTGAAAACTAAAATAAATTGGAAACAATAGTAGTGCGTACCATACAACAAAACTGATATATTTGACTTCAATTATAATCTTACATTGGTTATTCTAGAAAGTAGTTATGGTTACTATCTTTTAACTTGATTTATGGTGATTTCTCAGTTGACTGTATCAAGTGGGTCCGTGCTGATTGTATCATCATAGGATGCTTTCAAGTGACCGCAACAGGTGATGAAGAAGATTACCTTGTCCAAGTTATCAGAAGTAAAGATGGAAAAATCACTGATGTGAGTAGTTGTGATTTTTTTCCTTTCCCCTCTCCTATTCCATTTCTATTTTCATGATAATGATTGTGTTCTTTATCATGCTTGTATATATTTTTTTTAATATTTGTGAGTGTCTAGACCAACTTACGTGCTCCTCGACTAATCTCATGGAACAACCTGCTTAACCTTACAACATTAAAGTTGTAGGATATTAAATCCTAGGTAGGTGACCTACATTTCTATTATCATGCTTGTACTTAGAGATAGGAGAGACAATAAGGAGGATGATAAGGGTTGCATTTGACTTGAGATGGTGAGAGAAATTTTAGAGGATTTAAGAGGTGTAGACTCATGCTAAACTTAATGCCTTTTCAAACATTTGTGACTTAAGGACCTTTGTAATTATCACGTATCATCCTCTAGGCCCTGTTCTTTTAGACTCCTTTTCCTTGTTATGTCCATTGTTGTTTGATTCAGTTTTTTGGTTGTTTTTTTTACAAGCCATTTGTATTCTTTCATCTCTCTCTCAAGCTTGGTTTCTTGATGAAAAGACAATCTTGTACATGGGTCTTATTGATGTTTATAATTAATTGTGAAACCTCAGTTTGTTTATCAATTTCCTTGAGTTTATCTCTTTGGCAAATCATTGTGGGATATTTGGGGTGGGGGGCAGAAACAACAAAGTGTTTAGAGGTTTGGAGAGGGAGGGAGTTTCTAATGCTGTTTAGCTCTTGGTGAGGTTCCATGTTTCTCTCTCGAATTTGGTTTCAAAAACTTTTCATAATTATCTTCATCTTACTAGTTGAAAACCATTTCTTTAAAAGGCTTTTTGTGGGCTTGGTTTTTTGTATGATCTTGTATTCTTTCTTTTTTGAAAGCAATTGTTTCTATGTAATTTTTTTTTTTGGGAAATACAAACATTTCGTGGGGAAATAACTCCTTGGATGTTGAACGCAAATAATGAGCCTGATTATTGCATATGGTAATTTACAGTGGACTTCACCAGATGTACAAGATATAGAAAATTTGTCTTTAAAAACAAACCATTGTTATTGATGAGAAGAGACTAATGCTCAAAATACAATGATATAAAAAAAACAATAAAGAGACAAATCCTTCTACCATATTGTAATTTACTCAATGAATGAGACTTGTTTCTGAAAAAATGCTTCTTTTGTGTACATTCGTGGATCGTGCAAGGTTGTTACCAATGAATGTGGGGTGGTTCACTTTTGGAGCTTTTATTGTTAAACGGGAGAGATGGGATGCTTAGGAACACTGAAAAACTTATGCTATCCATTCATGCTTATGAATGGTGATAAAGTACAAGGAAACTATTTCAAATTTATTTCATTTGAGACGGGAAGAGTGGATGGAGATCACAGCTTCAATGCCTACATTTGTACTTTTCTAGATAGGGCTTTACTTATTGGCAGAAAAGTGGAATTTCACGATGGTTTCTCGTTGACTACGGCTAAAATCTTTTAAGGGTCGGGTGGCAGGCCAGGTTTCAACTTGATAAACTTATCTTAGTTGCGGCACGATGGAATTAGCCTCCCCACAATAATTTTTTTTTACCACAACCTTCCATTTCATTCGAAAACGACATGAGCTTTCTTGTTCTAGAAGGGCATCTTGTCAAAAAGTCCAATTTAAGGAGCCCTCAATTAATGAATTATATGTATTCCAATGTTTGCCAACAATCATGGAACCCTCAACTCTATCCCTCCTTTGAATCAAATTTTATGCCTTGCAAACATTTCCCTTTTTAAATGTCAATCAAGAGGGAAAAGTTGTTTAAACATCTCTTGAAGCTAGTTTCCCAAAGATTTTCTCTATTCTATAAAAAGGGAGCTTCGGTGGCAGATTGGTGGGATACAACTTAGATGGAACCTGGGATGGCTTTCGGTTGGGTCAAAGAAACGATGGAATTAGATGGTCCTTTGATGTCTCAAGCTTATTCTCCACCCTAAGAAGTTGAAGGGTTTTCGGTTGGGTCTGAGAAAGTACGGAATTAGATGATCCTTGGATGGCACAAGTTTTTTGTCCACCAAATCCTTGTTTTTTCACTGTCTCACCATTTTTTAACATGACCATGACAAAGTTGCTTTGGGAGTTTGAAATTTCAAAAAGTGATGATTTTCCTTTGGTTGCTAACATAGGAGATTAAATACTCAGGAGAAAATGCAAGCACAAAGTGTCCTTGAATTGCCTTCAACACTTCTATTTGGGATCTTTGTTTGGTATGTGCCTAGTCTTTGGACCATATCTCCTTACACTTCCTCTTGCGCCTTTTTGATTTGCATATCTGTCTATCCAAGTGGGTGGATTGGTGGTTACCTAAATCCCTCAACAATTGGAGTTTGGAAGGAACGTGTTAAATCAATCATAACTCAAAAGCTTAAATTGACGAATAAAGGTAAATTTTATATTATCAACACTCTCTTCACTTGTGGGCGTGAAATATGTTGAATACCCAACAAGTGGAAATCAATCTTTGTTAAATTTGATAGTATGAGGTTTCATCTCAAAATCAATTGGCAATGAGAGGAGTAACCCATCTATCTTATCTGGAGTATGAGTCCCTTTGGTTTTTTCAATGTGGGACTCCCAACTTCTCCAATATGCCCCCTCAAGATGGTGCCTCTTTGGGTTCACCTATCTTAGACCGAATACCTTTGGGTTTAATGGGCTCTGATACCATGTTAAATTTGAGAGTATGGGGTTCCATCTCAAAACCAATTGACAATGAGAGAAGTAGTCCATCTATCTTATATGGAGTATGAGTCCCCTTGGTTTTTCCAATTTGGGACTCCCAACTTCTCCAATAATCTTAATTGAAAAGGAAATGACATTGCATAGATTTGAATACATGACCTTGTGACCACCTTCTTTGATACCATGTTAAATCACTGGTTGACCCAAAAATGTAATTGATGGATGAAAGGTAAATGTTATATCATCTAATAGAAAAACTATATATGTAGATTACTTCGGGATCCTTTTTATGGGGTTTATGGCTTGAATTTAACAAGAGGATTTTTGAAAATCAGTCGACCTTTTGCTTCTTTCTTTCCCCACTTTGGAGTTATTGTATTTTGAGTATTAGCCTCGTTTCATCATTTCGATAAAAAGTTTTGTTTCCTTGGAAAAAATGAAGACTAGTCCACTACTTTTTCTACTTTTTGAGATTTTGTACAACTAGATGCTTCATGGTGGAGACATAGTTACGGCAAATTTTTCTGTTATGGATCCATTCCATGATCCTTCATGATTGGAAGGCTTTTTGCCGGTACCTTTCTTGAGTGGTGAAGCTTTCTTTTCCCAGCCTTTACATTTTTTTGTTTCCTTTTTTCTTTTTGTTACCTTCAAACCGTTAACTGAAGAAGGGATTTATCATGAAATATCCTATGATTCCTGCCAAACCAAATTTTGGAAAAGAGAGCTTTGATTGCATCAACTATAGTTGAGCTTTATTAGATAAACCATGTGCGGCCTTATTAGTAGAAAAGGAGTAGTTTAGGCTGAAACTTGAGTGGGATTGGCAGATGGTTGTGTGAGGGAACTTTAGCAGTCTATTGTTCAAGGGAATGATAGTTAAGTGTTTTAATATTTTCTTTTTGTGTTCATTGAATCTTTCATTTACCTGTGAATCATGTATCGTGTTATTATTATTAGTGCCCTATAAAAGTTAGTGTCCTAGAATCAAGGTCAATCATCTTGGATTAAACCGTCACCTTCTTTTTTTGGTTGTAAATTTGTTATTACTATTCTGATCCTCTGAATGAGAATGATGCAACTTTATCTAAAATTGACGGTGAAAGTTGTGAACTATCGCAGTTTTCATATATTTTTCACTAAATCGTGTTAAATGTATGCAACTTGTTTAAAACAATACAATCATAGAACAGTGTCATAAACTGATCGTCATGATGAATTGAATTTGTTATTTATTTATTTATTTTATATGTTTAGTAAATTAATTTTTGTAACTCTCTTTTGTAGGTTTCTTCAAACAAAGTTTTGTTATCATTCTGTGATATACATTCAGGTTTCACTCGTGACATTTTGCCTGGTGAAAGTGGGCCTTGTTTATTGTTGAGCTATTTGGATACATGGTATGCAGAAATTACTTGATTTGATTTCGATGCTTGAAAGTGATTTTTTCTTGAAATTCCTTATTTTTTATTACTACCTTTTCTTCCTTTAAACTTTTGGTTATCATTTCATCAATTTTTCTTTCTTCTTTTTGTTGCTGTGGGATTTTAATTATAAACCCACATATCAACTTTATTTTTTTTCTCTTTCTTGATATGGATGCAGCAAGCTTGCAATTGTTGCAAATAGGCTCTATGTGGAAGATCATATTGCATTGCTTGGTTTGTTGCTAGAGGTTGAGAATGAAGTTGCAGTTGTTAATATTGATAGGAATACCTCTCTCCCGAAGATTGAGCTTCAAGGTTAGTGATCTCGTGATTAGTGTTATGAAAGACTTCTGAAACTTTAAAATGATGTTGATTGCTGAATAGTGAATATAACCTTGAGTTAGCTATCTAAAGGCTACAGTCAGCCTGACAGCTTGTGTAGTTCAAAAGTAACTAATTTAGCATACCTTAAGTTTAAATATAAATTCTGATTTGTGAATATAAATAGTAATAAATCAAAGAAACTGATCCTAATTCTAATGAATTAAGGATTTGACCATTATATCTTAACCCTACTACATCACGTTTTATGCTATTGTTTTTGTAAGATAGTTCTACTCGTTTTTTAGTCAATGTTGGCCCTCTTGTATTTCGATTTTTTTTCTTAATGAAAGTTTGGTTTCTTGATTAGAAAAAAATAATAGTTACAATTTGGTAAATCTGAGTTATAATTTGAGACATTACATTATGTGATTATTTTATATCGATTCTTCTAATAGTTGCTTGGATTTTGTTTTCTTTTAGGGTTTATCTATTTGATTTTCCTGGTTTACTCATGCATACACAAAGTAAAATGATCATTTTTTGTTAGAAAGTTGGGGTTGGGTTGTGTAGTCAATTTTTACATACACATTTATGCATATGCTCATTGATTAGGTTTTTATTTTTAGATTCTGCAGCATCTCTTCTTCTGTGAGATGATTCTATCTCATTCTTGTTTTGTTTCAGCGAATGGAGATGATAATTTGGTTATGGGGCTGTGTATTGATCGAGTTTCTCTTCTGGGGAAGGTGATAGTTAAGGTTGGATTTGAAGATATGAGAGAAGTCTCTCCATATTGCATTCTCGTGTGTCTTACTTTAGAGGGAGAGCTCATTATGTTTCAATTTTCTAGGTAATGACTTTTATGTTTTAAACCTTGTTTGTTATTGTACCAGAGGCCCAAGTCTATTTTTACCTTAAATAGATGTTTCTATCTTTTGCTTTTTATAGCAGCCTGGATATTATGATTATTGCTCTTGATTCCCTTTTACTATTGGTGTTCAACTTTTCATGCAGTGTCAACGAAACTGAAGCTCCACATGAGACTGTTTCTGCTTGTGATGATGAGGAAGATGATATAACAGTGCCCACTGATGATCGTTCTGAATCAAAGGAGTCTAGAGAAGCAAACATAGATCATAGGATGCAAGTTACGGAAAAAATCGCAATCAGTAGTGAGATTCCTAGGGAAAAGGGTAAAACTTCAAATGACATTAAGTCTTCTAGGAATGATCAAAGTCTAGTATATAACATAGATGAGAGTGCAATTGTTAGCCCAGAGGGTAATACTAAAAGTCAGAAAGTGGATTCTTTCATTTATTCACAATCATTGAAGTCTTCTGCCCCGGAGAGACCACCTCACTATGAGATTGGGAACTTTGATAAGCCAGTTCTAAAATTTACTGGTCTCGGGTCTGCTTCTATTTCAGGAAAGTCTGAGGATGTGCCTAGCCAGCCCTTTCCCAATGTAAAAGAATCAACAAAAAGATTGGGGTCAACTGGCTTGATGGCTGCTTCTGAGTTATCCAGTGAGAAAGCAATGTCTTTTAAAAAAATTGATCCAGTACCTTCAGTCTTTACTTCAAATTCTCTTCAAAGCAGCAACACTGAGAATTATGGACCAAGTTTTGGTACAGCAAATGCTTTCACAGGTTTTGCTGGAAAACCTTTTCAACCAAAGGATGTTCCAAGTACATTAACACAAAGTGGGAGACAAGCAACGGGAGGTGCTGGTAAAATTGAATCTTTACCAGTGATACGTAGCTCACAAATATCATTGCAAGATAAGTTCTCGTCAGGGAAAATTTCTAATGAGAAACATGATGGTTCAGAGCGATATTACAGCAATTCCCCCCTGGCAAAACCAGTAAGTTCTGGACGAATTTTATTCATACAATTTGTCAATGTGAAGATATTTGAGTCTAATATAGGAATGGCTTGGTTTAAAATCGGCCATTCTTTGGCTTGCTCATACTCCTCCAATCCATCGTCTTTGGAAAAGAATCTTTGTAAAGAAAATACAGACTGTCTCCATTTTTACTAGTATAAAAGGAAAAGTGGATAAAACACTTGCATGAGCTTTCCTCTCCCTATATATATACCTCTTAGGAACATCTTGAATTAGAAAATGAGATTTCCTTTGGTTGAATGATGACCCGTGTGTGACTAGCGAAATCATGAGGCCCATGTGTTTTTATATGTGAGTTCATATTGTATTTTTGCCGTTTAATTAGTCAACAAACATCTCTCTGTTTGTGTAATTATTGGTCGTGACTAGGTAACTAAAAAACATCCCCCCCCACCATAGACTTGTTTTATTTTTTTGATAGTTGTGTTTCTCTTCTATTCAAGACATTTGTCTCAATGGGAATACCTTTATTTTTCTGTTATAATTTATTATGTTTTGAAAATTTTTGTGTCGATTTTTATTAAGATTCTTTTCATCACATCAATGAATTTATTTTATTTTTAAAATTTTATTTCCTTTTAAAACCAAGGAGAAAAGGTGTCCCACCCTCAAACTGCAGTCTCCAGCCAGCTTTAATATTGTTTCTTAATAACTTTCTCCCTTCAATGCTCCCTTTTCCCTACAGATGAAGGAAATGTGCGAAGGGTTGGACACACTTCTCGAATCTATTGAAGAGTCGGGTGGGTTCATGGATGCCTGCACTGCTTTCCAGAAAAGCTCCGTTGAAGCTTTGGAACTTGGCTTAGCCAGTCTTTCAGATGGATGCCAAATATGGAGGGTAACTGTCAATTGTTATTTTATATTTTTCAGTTTGTTAATAATTATCTCCTGTATTTTGTTAGACGGGGAATATGTTTGTTAATTAATATAGCCTTATTATTTATTTATTATTTAAAGTACTAGATTCAGGAAAGGGAGAAAAATGTAAAAAGGAAATTTATGGAGTAGACAGTGACTGCTGGGCTTCTCATCTAAGTAGTAATGGTCTGTTTAAGAGATTAGCAGTGGAAAGAACAGAGAAATGACGTTAGCTTGATGATGGAAGCATCTCATAAGAGAAATACTTGAAAAAGAAAATGTATGATATGAGAAAAGATGGCCAGTACTCCTTTCCTAGGGACTGATGGCCGTTCCTCGATACTAACTGAGAATCAACGAACATGTTTGACTAGAAAATTTAAGAAAGAGCTGAGAAACATTAGGTAATTGATTTACAAATGACTCATTCTATAAAATATTGAAAGTATAATTCAGGTTTCAAGTCTTTTCTAGGGATGTGACTAATTGAAGATTCATTACATGATTAGTTGCTTGAGTTTCTTCATGATTGTTTCCTCAGATTTGCCATTTATGTGGATTTAATGGCGGTTTCTGTAGATATATGTTGGGATAATTGTTTTAACTAGCAAAACTGCTTGAAATTTCAAATATAGCAAAATGTCACCGTCTATTAGTGATAGACGACCCTCACTAGATTTCTCTGGTGTCAAATTCTTTATGAAATTATCCTTTAGGTTTTGTGTTGCTTGCAAGGACTCTCTTTTGTGGCGATACTCCTTTTTGTTGCCTTAGCATTTTTATGGATGTCCTTTTAATATTATTTATATTATTTTTTATTTTTATACTTTTTTTTACATTTTTTCTCAATGGAAGTTTGGTTCTTTATATAAAAAATGTGTTTAAATAATGCCCTGCCCTCTCTTACCTTCCCCAAAGTTTACATTTTTTTAAATTCTTGCAATTGGAATCATATGCTTTTATTTTGGGTTCTTTGTTTAGGGCTCCTTTTCTTTTCTTTACCCTATTTTTTTCCTCGCTATCAATATAGTTATCTAACACGGGGATTTCTTTTTCCTTCTTTTCATAGAGCACAATGAATGAGCGTTCGCAGGAGGTACAAAATCTCTTTGACAAAATGGTTCAAGGTATTAGAAGCTCAATTTCTTGTTTGTTTATACATTATTGATGGTGACCCTGTTGGTGTAGAATGTTCCTTCCTGATCTATCTGATATTGGAAAAAAGCCTTTATTTAGAATGCATTGAATGTTTGAACTCAAAGAAATAGAACTCTGGAATTTCATTTACATCCTTTTCAAACATTCAGTCTAGATGTTTATAAAAAATCTTATTGGTTATCACGGGAATTTGATTGACATTTGCAAATAACAAGGCTGGAGATCTGGAGTACTGGGATTTTGTTGAAATAAGTAGCAATTATGTCAGTCCATTCATCAAGTTGACTTGGTGAAATATTAGATTGAAAATCATAATTGATTAATCTCTGGTGAAAATTTCATAGATATTTTACATTCTCAGTCTATTTGGTTTCAGTTTTGTCAAAGAAGACATACATTGAAGGTATTGTGATGCAATCTTCTGACAGCAAGTATTGGGAACAATGGGATCGTCAAAAGTTAAGTTCTGAATTAGAGCTGAAGAGACAACATATCTTAAAGATGAATCAGGTAGTCTATTCTTGTTTCGAAAGTCATTGAATTTCCCCAATTTTAGACGTTATTCATATTCTTCTCTTTTCCAGAATATAACCAACCAGTTAATTGAGTTAGAAAGACATTTTAATGGTCTTGAGCTGAATAAGTTTGGTGGAAATGAGGAAAGTCAAGTAAGTGAAAGAGCTCTTCAAAGGAAATTTGGTTCGTCAAGGTACTTCCATTTTCTTTTTCTGGCTCTTAGATGTAAGACTTTCGTAATCTTTGTTCTTTTGTTGTTTTTAATTGTATTGCTTTACTTCTATTCTTATACTTCTTTCACTACTTTCTGGAGTTTGTATCTTTTGAGCAAATCTCTTTTCATTATATCGTTGCAAAGTTTTGTTTCTAATTTTGAACATGTGAGCAAGAAAATCCTGTCTTCATGATGATTGAAATGAAGTAATTTATTATCTCTGTTATCATTCACTTTTTAGCTGGTTTTTTCTTGAGTAGCTTGACACTGGTTTCTTCATCAAAATTTGTTACAGGCATAGTCATTCAGTACATAGTTTGAATAACATTATGGGGTCTCAATTAGCAACAGCTCAACTTCTTTCTGAAAGTCTATCAAAACAATTGGCTGCACTCAATATGGAATCACCCTCTTTGAAAAGGCAGAGTGCCACGAAGGAATTGTTTGAGAGTATTGGACTTACTTATGATGCTTCTTTCAGTTCTCCAAATGTGAACAAAATTGCAGAAACTTCTAGCAAGAAGCTTTTACTTTCTTCTGATTCTTTTTCAAGTAAAGGTACATCGAGAAGAAAACAACAGAGTGGAACTAAAAATTCTGAAGCAGAGACTGGGAGAAGGAGAAGAGATTCACTTGACAGGGTACTGTTTCATTGAACGTCTATTTTTCTACTCTTACCTTTGTTTCATATGGAATTTCTTTTTGTTAATTGAAAATGATATTTATATTACATGATTAAAATAAATTTGGACTAGATGTGCCATACACAATAAAATAAAATGATGAAGTTGTTCGCTGACCACTTTTGCTAGGAAATAAATGTGCCATATCCATCCTCTAATTTTGCTATGCTTTTGTGTCCTAATATTTATTCCCTGTAAAATTTGTGTAGAACTTGGCTAGTGTTGACCCTCCAAAAACAACTGTCAAGAGGATGCTTTTGCAAGGAATTCCTTCCTCTGAGGAGAAACAATTTTGTTCTCGCACACCTGAAGGAGCTGCAACAGTTGCACGGCCTGCTAGTCGCATAACATCATCTATTTCATCGTCATCCAAAAATGCAGGTAATCTCTAAGATAAAGCATAAGCAGTAGGTTTTTATAGTCATTTGCTTGCAACTTTCACAAATTCTATAAGATCTCCTTTTGAGTTTTGAATGTTTCAGTTCGAGGGTATTAATAGGAGATTTAATAAGGGCATAGTGATAATTAGCTACCGGTGCTTTGGTTATAAATAAGATAAGTTATGTACTTGAGGAGTGGGAATCATTTTGGTATCCTTGGTAGATTTGTTGAGGGAGAGACATAGTCTTCTTGAATGGCTATTGATATTGCAAAAAGTTAATCTTTGATGTTTTCAATGTATTTTCTGTGTTTTGGTTACCTAACATCCAGCTTATATTTATTCTACATATGCAGGACATGACTCTGAGAACCCGGAGACTCCTTTTATGTGGAATAGCCCTTTACAACCATCTAATACCTCTAGACAGAAATCTCTACCTTTGCAAAAAATTAATGTGACACCACCATCTCCACCTCCAGTATTCCAATCATCACATGATATGCTGAAAAAAAAAAATAATGAAGCTCACAGTGTGACTTCAGAAAACAAATTTACAGACGTGGCGTGTCCTGAAAAGTCAAAAGCTTCTGATTTCTTCTCAGCCACTAGGAGCGACTCTGTCCAGAAATCTAATATAAACGTTGATCAGAAATCATCCATCTTTACGATATCATCTAAGCAGATGCCCACACCGATAGATTCTATTGCTACCTCTAATGTGGACAATCAGAAGACTGCTAACGTAAAGGAGAGGCATACAACTACAAGTCCATTTTTTGGATCTGCAAATAAGCCCGAATCTCCATTTGTTGGTTCGATGCCTTCTCTGGTTCCTACTGTTGATGGATCAAGAAAGACTGAAGAAAAAAAATCAGTGACAACAATTTCACAATCAGTTTCAGCACCGGCTCCGTTAAATACTTCTTCAAGTGCATCGACTTTATTTTCAGGATTTGCTGTAAGCAAAGCTCTTCCAAGTTCTGCTGCTGTTATAGATCTCAATCAACCTCCGTCAACATCAACCCAATTGAACTTCTCCTCTCCGGTTGTTTCTAGTTCTAATTCCCTATTTCAGGCACCTAAGATTGTACCAACATCACCCACTCTATCTTCTTTGAATCCTACATTGGAGTCCTCGAAAACAGAGCTATCGGTTCCGAAATCAAATGATGATGCTGAAGAGCAAATACTATCTTCGAAGCCTGGGTCTCATGAACTGAAATTTCAACCTTCTATAACACCTGCCGATAAAAATCATGTAGAGCCGACTTCTAAAACCCAGACTGTTTTCAAAGATGTGGGAGGACAGGATTCAAATGTAGTAGGGAATGCTCAACCACAACAGCCATCTGTTGCTTTTGCTTCAATACCTTCACCAAACTTAACTTCTAAGATTTTTGCAAATAGTAGAAATGAAACTTCAAATGCAGTGGTTACTCAGGACGATGATATGGACGAGGAGGCCCCAGAGACGAATAACAATGTTGAGTTTAATTTGAGCAGCTTGGGAGGATTTGGAAATAGCTCTACCCCTATATCAGGCGGTCCTAAACCAAATCCATTTGGTGGTCCATTTGGTAATGTGAATGCAGCCTCAATGACCTCTTCCTTTAATATGGCATCTCCTCCAAGTGGAGAGCTGTTCCGGCCTGCATCGTTTAGCTTCCAATCTCCATTGGCTTCACAAGCAGCATCACAACCCACAAATTCTGTTGCATTCTCTGGTGCCTTTGGCTCTGCAGTGCCTACTCAACCGCCTTCGCAAGGTGGGTTCGGTCAGCCTTCTCAGATTGGAGTAGGACAGCAAGCACTTGGTAATGTTCTTGGTTCATTTGGACAATCAAGACAGCTTGGTCCTACTGTCCATGGAACTGGTTCAGGATCACCTGGGGGTTTTAGTGGTGGCTTTACGAATGCAAAACCTGTTGGAGTTGGAGGTTTTGCAGGTGTCGGTTCAGGAGGTGGCGGCGGTTTTGGAGGTGTTGGTGGTTTTGCTGGTGCAGCCTCAACCGGTGGAGGATTTGCCGGTGCTTCCTCTACGGCAGGAGGTTTTGCAGGTGCTGCAGGTGGAGGCTTTGGAGGTACTGCAGGTGGATTTGGGGCGTTTGGCAGCCAGCAAGTAAGCGGCGGTTTCTCTGCTTTTGGTGCTGCTGCTGCTGCTGCTGCTGCTGGTGGAGCCGGAGTAACTGGAAAACCTCCTGAGCTTTTCACCCAAATTAGAAAGTAGAGTTTAGATCCACTATTTATTGATTGGGTTCATAAGCAGAGAGTAGAATGTAGATTATTTGTGTTGCACCCTTTTTCAGTTTTTAACTCTTTTTGGATCTTAGATAACCTTCACATTATTGATTAACTTCACATAATAGATTAAGAGATTAAGGCATATATATATATGTTTTTTCCCCTTTCAATATTATACTTTTTGAGGTTATTTTGACTGTTTATCTTTGACATGTACATTCAAGATGAAGTCATTTTTATTATTAGGATTTGTAGTGCAAGGATGCAAAACCAAACAATTTTCTTAT
Coding sequence (CDS)
TTTCCCCCACAAGAAAATGCTAAAGGAAAGGAGAAAGAGAAATACATTTCATTTGCTTACCGGTTACCGCCCAAAGACACTTTGGAGTGCGATGCTCCTAAAACCCCAGGTGGAAGAACATCGGTTGGTTCCAAACCCCTTCGGCTGAGCTTCTCTATCGATTCAGAGAGGAGCATTTTGCAATTCATGGCTTCCGTTGATTCGGGACCTTCATCCTTGATTCCACTGGAGGACGCCGGCGAAGGAGAGCAAATTGTAAGGAACGATCTCTACTTCCAAAAGATCGGCAAACCTGTTCCCGTCAAGCTCGGCGACTCCATTTTTGACCCCGAAAGTCCTCCCTCTCAACCCCTTGCTCTTTCAGAGAGTTCCGGTCTCATCTTCGTTGCTCATTTGTCTGGTTTTTTTGTGGTGAGAATCAAGGATGTAATTGCTTCGGCCGAGGAGATAAAAAATGGGGGAACGGGTTCTTCTGTCCAGGACTTAAGCATTGTGGACGTTTCCATTGGAAAAGTTCACATTCTAGCAGTTTCCACTGACAACTCAGTTCTTGCTGCCGTCGTAGCTGGTGATGTTCATATTTTTTCAGTCCAGTCGCTGCTTGATAAGGCAGAAAAACCCTATTCTTCTTGTTCAATAACTGATTCCAGTTTTATTAAAGACTTCAAATGGACCAGAAAGTTGGAAAATACTTATCTGGTTCTTTCAAAGCACGGACAGTTATATCAAGGATCGGCGAATGGGCCTCTTACACATGTAATGCACGATATTGATGCTGTTGAATGCAGTGTGAAAGGGAAATTCATTGCTGTGGCTAAAAAGGACACTCTTACCATTTTCTCACACAAATTCAAAGAACGACTATCCATGTCACTCTTGCCGAGTTTAGGGAATGGTGAAACTGATACAGACTTTACAGTGAAGGTTGACTGTATCAAGTGGGTCCGTGCTGATTGTATCATCATAGGATGCTTTCAAGTGACCGCAACAGGTGATGAAGAAGATTACCTTGTCCAAGTTATCAGAAGTAAAGATGGAAAAATCACTGATGTTTCTTCAAACAAAGTTTTGTTATCATTCTGTGATATACATTCAGGTTTCACTCGTGACATTTTGCCTGGTGAAAGTGGGCCTTGTTTATTGTTGAGCTATTTGGATACATGCAAGCTTGCAATTGTTGCAAATAGGCTCTATGTGGAAGATCATATTGCATTGCTTGGTTTGTTGCTAGAGGTTGAGAATGAAGTTGCAGTTGTTAATATTGATAGGAATACCTCTCTCCCGAAGATTGAGCTTCAAGCGAATGGAGATGATAATTTGGTTATGGGGCTGTGTATTGATCGAGTTTCTCTTCTGGGGAAGGTGATAGTTAAGGTTGGATTTGAAGATATGAGAGAAGTCTCTCCATATTGCATTCTCGTGTGTCTTACTTTAGAGGGAGAGCTCATTATGTTTCAATTTTCTAGTGTCAACGAAACTGAAGCTCCACATGAGACTGTTTCTGCTTGTGATGATGAGGAAGATGATATAACAGTGCCCACTGATGATCGTTCTGAATCAAAGGAGTCTAGAGAAGCAAACATAGATCATAGGATGCAAGTTACGGAAAAAATCGCAATCAGTAGTGAGATTCCTAGGGAAAAGGGTAAAACTTCAAATGACATTAAGTCTTCTAGGAATGATCAAAGTCTAGTATATAACATAGATGAGAGTGCAATTGTTAGCCCAGAGGGTAATACTAAAAGTCAGAAAGTGGATTCTTTCATTTATTCACAATCATTGAAGTCTTCTGCCCCGGAGAGACCACCTCACTATGAGATTGGGAACTTTGATAAGCCAGTTCTAAAATTTACTGGTCTCGGGTCTGCTTCTATTTCAGGAAAGTCTGAGGATGTGCCTAGCCAGCCCTTTCCCAATGTAAAAGAATCAACAAAAAGATTGGGGTCAACTGGCTTGATGGCTGCTTCTGAGTTATCCAGTGAGAAAGCAATGTCTTTTAAAAAAATTGATCCAGTACCTTCAGTCTTTACTTCAAATTCTCTTCAAAGCAGCAACACTGAGAATTATGGACCAAGTTTTGGTACAGCAAATGCTTTCACAGGTTTTGCTGGAAAACCTTTTCAACCAAAGGATGTTCCAAGTACATTAACACAAAGTGGGAGACAAGCAACGGGAGGTGCTGGTAAAATTGAATCTTTACCAGTGATACGTAGCTCACAAATATCATTGCAAGATAAGTTCTCGTCAGGGAAAATTTCTAATGAGAAACATGATGGTTCAGAGCGATATTACAGCAATTCCCCCCTGGCAAAACCAATGAAGGAAATGTGCGAAGGGTTGGACACACTTCTCGAATCTATTGAAGAGTCGGGTGGGTTCATGGATGCCTGCACTGCTTTCCAGAAAAGCTCCGTTGAAGCTTTGGAACTTGGCTTAGCCAGTCTTTCAGATGGATGCCAAATATGGAGGAGCACAATGAATGAGCGTTCGCAGGAGGTACAAAATCTCTTTGACAAAATGGTTCAAGTTTTGTCAAAGAAGACATACATTGAAGGTATTGTGATGCAATCTTCTGACAGCAAGTATTGGGAACAATGGGATCGTCAAAAGTTAAGTTCTGAATTAGAGCTGAAGAGACAACATATCTTAAAGATGAATCAGAATATAACCAACCAGTTAATTGAGTTAGAAAGACATTTTAATGGTCTTGAGCTGAATAAGTTTGGTGGAAATGAGGAAAGTCAAGTAAGTGAAAGAGCTCTTCAAAGGAAATTTGGTTCGTCAAGGTACTTCCATTTTCTTTTTCTGGCTCTTAGATGTAAGACTTTCGTAATCTTTGTTCTTTTGTTGTTTTTAATTGTATTGCTTTACTTCTATTCTTATACTTCTTTCACTACTTTCTGGAGTTTGTATCTTTTGAGCAAATCTCTTTTCATTATATCGTTGCAAAGTTTTGTTTCTAATTTTGAACATGTGAGCAAGAAAATCCTGTCTTCATGA
Protein sequence
FPPQENAKGKEKEKYISFAYRLPPKDTLECDAPKTPGGRTSVGSKPLRLSFSIDSERSILQFMASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSESSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNSVLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLYQGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETDTDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCDIHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNIDRNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGELIMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKESREANIDHRMQVTEKIAISSEIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPERPPHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSEKAMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGRQATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLDTLLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQVLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNGLELNKFGGNEESQVSERALQRKFGSSRYFHFLFLALRCKTFVIFVLLLFLIVLLYFYSYTSFTTFWSLYLLSKSLFIISLQSFVSNFEHVSKKILSS
Homology
BLAST of Cucsat.G9615.T4 vs. ExPASy Swiss-Prot
Match:
F4I1T7 (Nuclear pore complex protein NUP214 OS=Arabidopsis thaliana OX=3702 GN=NUP214 PE=1 SV=1)
HSP 1 Score: 580.5 bits (1495), Expect = 3.7e-164
Identity = 373/912 (40.90%), Postives = 522/912 (57.24%), Query Frame = 0
Query: 72 SLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSESSGLIFVAH 131
S + +E+ EG++I ND YF++IG+P+ +K D+ +D E+PPSQPLA+SE ++FVAH
Sbjct: 2 SRVEIEEDTEGDRISTNDYYFERIGEPISIKEDDAQYDLENPPSQPLAISERHAVLFVAH 61
Query: 132 LSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNSVLAAVVAGD 191
SGFFV R DVI++++ G +QDLS+VDV +G V IL++S D+S+LA VA D
Sbjct: 62 SSGFFVGRTNDVISASKNSNGNGDKVFIQDLSLVDVPVGDVRILSLSADDSILAVTVAAD 121
Query: 192 VHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLYQGSANGPLT 251
+H FSV SLL K KP S S +S F+KDF+W R +++YLVLS G+L+ G N P
Sbjct: 122 IHFFSVDSLLKKDAKPSFSYSPDESGFVKDFRWRRNDKHSYLVLSNTGKLFHGIDNAPPR 181
Query: 252 HVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETDTDFTVKVDC 311
HVM +DAVE S KG +IAVA+ ++L IFS KF E+ ++L G++D D VKVD
Sbjct: 182 HVMDAVDAVEWSSKGSYIAVAQDNSLRIFSSKFNEKRCIALSFDSWIGDSDEDCFVKVDS 241
Query: 312 IKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCDIHSGFTRDI 371
I+WVR +CI++GCFQ+ G EE+YLVQVIRS DGKI+D S+N V LSF D+ D+
Sbjct: 242 IRWVRNNCILLGCFQL-IEGREENYLVQVIRSPDGKISDGSTNLVALSFSDLFPCSMDDL 301
Query: 372 LPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLL-EVENEVAVVNIDRNTSLPKI 431
+P GP LL SY+D CKLA+ ANR +++HI LL + ++ V+VV+IDR T LP+I
Sbjct: 302 VPVGVGPHLLFSYIDQCKLAVTANRKSIDEHIVLLDWSSGDDKSAVSVVDIDRETFLPRI 361
Query: 432 ELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGELIMFQFSSV 491
LQ N DDN VMGLCIDRVS+ G V V+ G ++++E+ PY +LVCLTLEG+L+MF +SV
Sbjct: 362 GLQENNDDNTVMGLCIDRVSIEGTVNVRSGDDELKELQPYFVLVCLTLEGKLVMFNVASV 421
Query: 492 NETEAPHETVSACDDEEDDITVPTDDRSESKESRE----ANI----DHRMQVTEKIAISS 551
A +T A + +D P + SK+S E NI D + TEK +
Sbjct: 422 AGRPASSDTDLASSSDIEDAYTPLIEDDLSKQSSEKHQQLNIAVQNDQKHLNTEKFSTEQ 481
Query: 552 EIPREK--GKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQK-VDSFIYSQSLKSSAP 611
+P E K +KSS VS + N K + + + + + S
Sbjct: 482 RLPNENIFSKEFESVKSS---------------VSGDNNKKQEPYAEKPLQVEDAQQSMI 541
Query: 612 ERPPHYEIGNFDKPV----LKFTGLGSA---------SISGKSEDVPSQPFPNVKESTKR 671
R G + KF G G A I +S + Q K +
Sbjct: 542 PRLSGTSFGQLPMSLGYDTNKFAGFGPALPVSEKLQKDIFAQSNSMHLQANVESKSTAAF 601
Query: 672 LGSTGLMAA-----SELSSEKAMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTG 731
GS GL A SS+ S K + P F S S + S + TG
Sbjct: 602 FGSPGLQNAILQSPQNTSSQPWSSGKSVSPPD--FVSGPFPSMRDTQHKQS---VQSGTG 661
Query: 732 FAGKPFQPKDVPSTLTQSGR---------------QATGGAGKIESLPVIRSSQISLQDK 791
+ P KD + ++GR G KIE +P IR+SQ+S Q K
Sbjct: 662 YVNPPMSIKDKSVQVIETGRVSALSNLSPLLGQNQDTNEGVEKIEPIPSIRASQLSQQVK 721
Query: 792 FSSGK-ISNEKHDGS--------ERYYSNSPLAKPMKEMCEGLDTLLESIEESGGFMDAC 851
S K S+++H E SN P + EM +DTLL+SIE GGF D+C
Sbjct: 722 SSFEKSASHQQHKTPLSTGPLRLEHNMSNQP--SNINEMAREMDTLLQSIEGPGGFKDSC 781
Query: 852 TAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQVLSKKTYIEGIVMQS 911
KS+VE LE GL SL+ CQ W+ST++E+ E+Q+L DK +QVL+KKTY+EG+ Q+
Sbjct: 782 AFILKSNVEELEQGLESLAGKCQTWKSTIHEQQAEIQHLLDKTIQVLAKKTYMEGMYKQT 841
Query: 912 SDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNGLELNKFGGNEESQVS 930
+D++YW+ W+RQKL+ ELE KRQHI+K+N+++T+QLIELER+FN LEL+++ + V+
Sbjct: 842 ADNQYWQLWNRQKLNPELEAKRQHIMKLNKDLTHQLIELERYFNRLELDRYNEDGGHPVA 890
BLAST of Cucsat.G9615.T4 vs. NCBI nr
Match:
XP_031741374.1 (nuclear pore complex protein NUP214 isoform X1 [Cucumis sativus] >KGN52214.2 hypothetical protein Csa_008316 [Cucumis sativus])
HSP 1 Score: 1680 bits (4351), Expect = 0.0
Identity = 868/870 (99.77%), Postives = 869/870 (99.89%), Query Frame = 0
Query: 63 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 122
MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE
Sbjct: 1 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 60
Query: 123 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 182
SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS
Sbjct: 61 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 120
Query: 183 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 242
VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY
Sbjct: 121 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180
Query: 243 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD 302
QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD
Sbjct: 181 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD 240
Query: 303 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 362
TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD
Sbjct: 241 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 300
Query: 363 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 422
IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID
Sbjct: 301 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360
Query: 423 RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 482
RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420
Query: 483 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKESREANIDHRMQVTEKIAISS 542
IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKESREANIDHRMQVTEKIAISS
Sbjct: 421 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKESREANIDHRMQVTEKIAISS 480
Query: 543 EIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPERP 602
EIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPERP
Sbjct: 481 EIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPERP 540
Query: 603 PHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSEK 662
PHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSEK
Sbjct: 541 PHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSEK 600
Query: 663 AMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGRQ 722
AMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGRQ
Sbjct: 601 AMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGRQ 660
Query: 723 ATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLDT 782
ATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLDT
Sbjct: 661 ATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLDT 720
Query: 783 LLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQ 842
LLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQ
Sbjct: 721 LLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQ 780
Query: 843 VLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNG 902
VLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNG
Sbjct: 781 VLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNG 840
Query: 903 LELNKFGGNEESQVSERALQRKFGSSRYFH 932
LELNKFGGNEESQVSERALQRKFGSSR+ H
Sbjct: 841 LELNKFGGNEESQVSERALQRKFGSSRHSH 870
BLAST of Cucsat.G9615.T4 vs. NCBI nr
Match:
XP_031741375.1 (nuclear pore complex protein NUP214 isoform X2 [Cucumis sativus])
HSP 1 Score: 1651 bits (4276), Expect = 0.0
Identity = 858/870 (98.62%), Postives = 859/870 (98.74%), Query Frame = 0
Query: 63 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 122
MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE
Sbjct: 1 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 60
Query: 123 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 182
SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS
Sbjct: 61 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 120
Query: 183 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 242
VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY
Sbjct: 121 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180
Query: 243 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD 302
QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGN
Sbjct: 181 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGN---- 240
Query: 303 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 362
VDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD
Sbjct: 241 ------VDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 300
Query: 363 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 422
IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID
Sbjct: 301 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360
Query: 423 RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 482
RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420
Query: 483 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKESREANIDHRMQVTEKIAISS 542
IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKESREANIDHRMQVTEKIAISS
Sbjct: 421 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKESREANIDHRMQVTEKIAISS 480
Query: 543 EIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPERP 602
EIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPERP
Sbjct: 481 EIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPERP 540
Query: 603 PHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSEK 662
PHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSEK
Sbjct: 541 PHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSEK 600
Query: 663 AMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGRQ 722
AMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGRQ
Sbjct: 601 AMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGRQ 660
Query: 723 ATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLDT 782
ATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLDT
Sbjct: 661 ATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLDT 720
Query: 783 LLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQ 842
LLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQ
Sbjct: 721 LLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQ 780
Query: 843 VLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNG 902
VLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNG
Sbjct: 781 VLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNG 840
Query: 903 LELNKFGGNEESQVSERALQRKFGSSRYFH 932
LELNKFGGNEESQVSERALQRKFGSSR+ H
Sbjct: 841 LELNKFGGNEESQVSERALQRKFGSSRHSH 860
BLAST of Cucsat.G9615.T4 vs. NCBI nr
Match:
XP_008445928.2 (PREDICTED: LOW QUALITY PROTEIN: nuclear pore complex protein NUP214 [Cucumis melo])
HSP 1 Score: 1589 bits (4115), Expect = 0.0
Identity = 829/871 (95.18%), Postives = 843/871 (96.79%), Query Frame = 0
Query: 63 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 122
MASVDSG S LIPLEDAGEGEQIVRND YFQKIGKPVPVKLGDSIFDPESPPSQP+ALSE
Sbjct: 1 MASVDSGSSPLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLGDSIFDPESPPSQPIALSE 60
Query: 123 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 182
SSGLIFVAHLSGFFVVRIKDVIASA+EIKNGGT SSVQDLSIVDVSIGKVHILAVSTDNS
Sbjct: 61 SSGLIFVAHLSGFFVVRIKDVIASAQEIKNGGTCSSVQDLSIVDVSIGKVHILAVSTDNS 120
Query: 183 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 242
VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY
Sbjct: 121 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180
Query: 243 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD 302
QGS NGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSH+FKERLSMSLLPSLGNGETD
Sbjct: 181 QGSVNGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHRFKERLSMSLLPSLGNGETD 240
Query: 303 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 362
TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLV VI+SKDGKITDVSSNKVLLSFCD
Sbjct: 241 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVLVIKSKDGKITDVSSNKVLLSFCD 300
Query: 363 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 422
IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID
Sbjct: 301 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360
Query: 423 RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 482
RNTSLPKIELQANGDDNLVMGLC+DRVSL GKVIVKVGFEDMREVSPYCILVCLTLEGEL
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCVDRVSLPGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420
Query: 483 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESK-ESREANIDHRMQVTEKIAIS 542
IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESK ESREAN+D +MQVTEKI IS
Sbjct: 421 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKKESREANVDLKMQVTEKITIS 480
Query: 543 SEIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPER 602
SEIPREK KTSNDIKSS ND+S V NIDESAIVSPEGNTKSQKVDSFI+SQSLKSSAPER
Sbjct: 481 SEIPREKVKTSNDIKSSNNDRSPVSNIDESAIVSPEGNTKSQKVDSFIHSQSLKSSAPER 540
Query: 603 PPHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSE 662
PP+ EIGNFDKPVLKFTGLGS SISGK EDVPSQPFPNVKES KRLGSTGL+AASELSSE
Sbjct: 541 PPNNEIGNFDKPVLKFTGLGSVSISGKPEDVPSQPFPNVKESQKRLGSTGLVAASELSSE 600
Query: 663 KAMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGR 722
K M FKK+ V SV TSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGR
Sbjct: 601 KTMFFKKLI-VSSVLTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGR 660
Query: 723 QATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLD 782
Q TGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLD
Sbjct: 661 QVTGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLD 720
Query: 783 TLLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMV 842
TLLESIEESGGFMDACTAFQKSSVEALELGLASLSD CQIWRSTMNER QEVQNLFDKMV
Sbjct: 721 TLLESIEESGGFMDACTAFQKSSVEALELGLASLSDECQIWRSTMNERVQEVQNLFDKMV 780
Query: 843 QVLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFN 902
QVLSKKTYIEGIVMQ+SDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFN
Sbjct: 781 QVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFN 840
Query: 903 GLELNKFGGNEESQVSERALQRKFGSSRYFH 932
GLELNKFGGNEESQVSERALQRKFGSSR+ H
Sbjct: 841 GLELNKFGGNEESQVSERALQRKFGSSRHSH 870
BLAST of Cucsat.G9615.T4 vs. NCBI nr
Match:
KAA0034115.1 (nuclear pore complex protein NUP214 [Cucumis melo var. makuwa])
HSP 1 Score: 1587 bits (4109), Expect = 0.0
Identity = 832/896 (92.86%), Postives = 845/896 (94.31%), Query Frame = 0
Query: 63 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 122
MASVDSG S LIPLEDAGEGEQIVRND YFQKIGKPVPVKLGDSIFDPESPPSQP+ALSE
Sbjct: 1 MASVDSGSSPLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLGDSIFDPESPPSQPIALSE 60
Query: 123 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 182
SSGLIFVAHLSGFFVVRIKDVIASA+EIKNGGT SSVQDLSIVDVSIGKVHILAVSTDNS
Sbjct: 61 SSGLIFVAHLSGFFVVRIKDVIASAQEIKNGGTCSSVQDLSIVDVSIGKVHILAVSTDNS 120
Query: 183 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 242
VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY
Sbjct: 121 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180
Query: 243 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD 302
QGS NGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSH+FKERLSMSLLPSLGNGETD
Sbjct: 181 QGSVNGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHRFKERLSMSLLPSLGNGETD 240
Query: 303 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 362
TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLV VI+SKDGKITDVSSNKVLLSFCD
Sbjct: 241 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVLVIKSKDGKITDVSSNKVLLSFCD 300
Query: 363 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 422
IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID
Sbjct: 301 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360
Query: 423 RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 482
RNTSLPKIELQANGDDNLVMGLC+DRVSL GKVIVKVGFEDMREVSPYCILVCLTLEGEL
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCVDRVSLPGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420
Query: 483 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESK-ESREANIDHRMQVTEKIAIS 542
IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESK ESREAN+D +MQVTEKI IS
Sbjct: 421 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKKESREANVDLKMQVTEKITIS 480
Query: 543 SEIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPER 602
SEIPREK KTSNDIKSS ND+S V NIDESAIVSPEGNTKSQKVDSFI+SQSLKSSAPER
Sbjct: 481 SEIPREKVKTSNDIKSSNNDRSPVSNIDESAIVSPEGNTKSQKVDSFIHSQSLKSSAPER 540
Query: 603 PPHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSE 662
PP+ EIGNFDKPVLKFTGLGS SISGK EDVPSQPFPNVKES KRLGSTGL+AASELSSE
Sbjct: 541 PPNNEIGNFDKPVLKFTGLGSVSISGKPEDVPSQPFPNVKESQKRLGSTGLVAASELSSE 600
Query: 663 KAMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGR 722
K M FKKIDPV SV TSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGR
Sbjct: 601 KTMFFKKIDPVSSVLTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGR 660
Query: 723 QATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLD 782
Q TGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLD
Sbjct: 661 QVTGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLD 720
Query: 783 TLLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMV 842
TLLESIEESGGFMDACTAFQKSSVEALELGLASLSD CQIWRSTMNER QEVQNLFDKMV
Sbjct: 721 TLLESIEESGGFMDACTAFQKSSVEALELGLASLSDECQIWRSTMNERVQEVQNLFDKMV 780
Query: 843 QVLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQ-------------- 902
QVLSKKTYIEGIVMQ+SDSKYWEQWDRQKLSSELELKRQHILKMNQ
Sbjct: 781 QVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQVGLFLFQKPLNFSN 840
Query: 903 -----------NITNQLIELERHFNGLELNKFGGNEESQVSERALQRKFGSSRYFH 932
NITNQLIELERHFNGLELNKFGGNEESQVSERALQRKFGSSR+ H
Sbjct: 841 FRCYLYSSFFQNITNQLIELERHFNGLELNKFGGNEESQVSERALQRKFGSSRHSH 896
BLAST of Cucsat.G9615.T4 vs. NCBI nr
Match:
XP_038892124.1 (nuclear pore complex protein NUP214 isoform X2 [Benincasa hispida])
HSP 1 Score: 1449 bits (3751), Expect = 0.0
Identity = 762/875 (87.09%), Postives = 799/875 (91.31%), Query Frame = 0
Query: 63 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 122
MASVDS PS+LIPLEDAGEGEQ+VRND YFQKIG+PVPVKLGDSIFDPE+PPSQPLALSE
Sbjct: 1 MASVDSRPSTLIPLEDAGEGEQVVRNDFYFQKIGRPVPVKLGDSIFDPETPPSQPLALSE 60
Query: 123 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 182
SSGLIFVAHLSGFFVVR KDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILA+STD+S
Sbjct: 61 SSGLIFVAHLSGFFVVRTKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILALSTDDS 120
Query: 183 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 242
+LAAVVA D+H+FSVQSLLDKAE P SSCSITDSS IKDFKWTRKLE++YLVLSKHGQLY
Sbjct: 121 ILAAVVARDIHLFSVQSLLDKAETPSSSCSITDSSCIKDFKWTRKLEDSYLVLSKHGQLY 180
Query: 243 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD 302
QGSANG LTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPS GNGETD
Sbjct: 181 QGSANGSLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSSGNGETD 240
Query: 303 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 362
TDFTVKVDCIKWVRADCII+GCFQ+TATGDEEDY V VIRSKDGKITDVSSNKVLLSF D
Sbjct: 241 TDFTVKVDCIKWVRADCIIMGCFQMTATGDEEDYFVLVIRSKDGKITDVSSNKVLLSFRD 300
Query: 363 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 422
IHSGFTRDILPG+SGPCLLLSYLD CKLAIVANRLY+EDHI LLGLL EVENEVAV+NID
Sbjct: 301 IHSGFTRDILPGDSGPCLLLSYLDKCKLAIVANRLYMEDHIVLLGLLQEVENEVAVINID 360
Query: 423 RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 482
RNTSLPKIELQANGDDNLVMGLCIDRVSL GKVIV+VGFEDMREVSPYCILVCLTLEG+L
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCIDRVSLPGKVIVRVGFEDMREVSPYCILVCLTLEGDL 420
Query: 483 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSE-----SKESREANIDHRMQVTEK 542
IMFQFSSVNETEAPHETV ACD+EEDDI VP DDRS+ KE REAN D +MQV EK
Sbjct: 421 IMFQFSSVNETEAPHETVPACDEEEDDIIVPADDRSQLSSESKKEFREANNDLKMQVMEK 480
Query: 543 IAISSEIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSS 602
IAISSEIP EK K SNDIKSS NDQSLV I ESA V E NTKS+K DSFIYSQSLKSS
Sbjct: 481 IAISSEIPGEKIKISNDIKSSNNDQSLVSKIGESATVGAESNTKSRKADSFIYSQSLKSS 540
Query: 603 APERPPHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASE 662
ER +YEIGNFDKPV KF GLG SISGKS DV SQPFPNVKESTK+LGSTGL+AASE
Sbjct: 541 VLERS-NYEIGNFDKPVQKF-GLGPVSISGKSVDVHSQPFPNVKESTKKLGSTGLLAASE 600
Query: 663 LSSEKAMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLT 722
LSS+KA+ KIDPV SV NS QSS TENY PSFGTAN F GFAGKPFQPKDVPSTLT
Sbjct: 601 LSSDKAIFLNKIDPVSSVLIPNSFQSSKTENYVPSFGTANCFAGFAGKPFQPKDVPSTLT 660
Query: 723 QSGRQATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMC 782
QSGRQ GGAGKIESLPVIRSSQISLQD GKISNEKHDGS+R YSN+PLAKPMKEMC
Sbjct: 661 QSGRQVMGGAGKIESLPVIRSSQISLQDNLP-GKISNEKHDGSDRSYSNAPLAKPMKEMC 720
Query: 783 EGLDTLLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLF 842
E LD LLESIEE GGF+DACTAFQKSSVEALELGLASL D CQIW+STMNER+QEVQNLF
Sbjct: 721 EALDMLLESIEEPGGFLDACTAFQKSSVEALELGLASLLDECQIWKSTMNERAQEVQNLF 780
Query: 843 DKMVQVLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELE 902
DKM+QVLSKKTYIEGIVMQ+SDSKYWEQWDRQKLSSELELKRQHILKMNQN+TNQLIELE
Sbjct: 781 DKMIQVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNMTNQLIELE 840
Query: 903 RHFNGLELNKFGGNEESQVSERALQRKFGSSRYFH 932
RHFNGLELNKFGGNEESQV+ERALQRKFGSSR+ H
Sbjct: 841 RHFNGLELNKFGGNEESQVNERALQRKFGSSRHSH 872
BLAST of Cucsat.G9615.T4 vs. ExPASy TrEMBL
Match:
A0A0A0KV45 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G583270 PE=4 SV=1)
HSP 1 Score: 1680 bits (4351), Expect = 0.0
Identity = 868/870 (99.77%), Postives = 869/870 (99.89%), Query Frame = 0
Query: 63 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 122
MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE
Sbjct: 1 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 60
Query: 123 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 182
SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS
Sbjct: 61 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 120
Query: 183 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 242
VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY
Sbjct: 121 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180
Query: 243 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD 302
QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD
Sbjct: 181 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD 240
Query: 303 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 362
TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD
Sbjct: 241 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 300
Query: 363 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 422
IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID
Sbjct: 301 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360
Query: 423 RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 482
RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420
Query: 483 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKESREANIDHRMQVTEKIAISS 542
IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKESREANIDHRMQVTEKIAISS
Sbjct: 421 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKESREANIDHRMQVTEKIAISS 480
Query: 543 EIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPERP 602
EIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPERP
Sbjct: 481 EIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPERP 540
Query: 603 PHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSEK 662
PHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSEK
Sbjct: 541 PHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSEK 600
Query: 663 AMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGRQ 722
AMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGRQ
Sbjct: 601 AMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGRQ 660
Query: 723 ATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLDT 782
ATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLDT
Sbjct: 661 ATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLDT 720
Query: 783 LLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQ 842
LLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQ
Sbjct: 721 LLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQ 780
Query: 843 VLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNG 902
VLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNG
Sbjct: 781 VLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNG 840
Query: 903 LELNKFGGNEESQVSERALQRKFGSSRYFH 932
LELNKFGGNEESQVSERALQRKFGSSR+ H
Sbjct: 841 LELNKFGGNEESQVSERALQRKFGSSRHSH 870
BLAST of Cucsat.G9615.T4 vs. ExPASy TrEMBL
Match:
A0A1S3BDU8 (LOW QUALITY PROTEIN: nuclear pore complex protein NUP214 OS=Cucumis melo OX=3656 GN=LOC103488807 PE=4 SV=1)
HSP 1 Score: 1589 bits (4115), Expect = 0.0
Identity = 829/871 (95.18%), Postives = 843/871 (96.79%), Query Frame = 0
Query: 63 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 122
MASVDSG S LIPLEDAGEGEQIVRND YFQKIGKPVPVKLGDSIFDPESPPSQP+ALSE
Sbjct: 1 MASVDSGSSPLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLGDSIFDPESPPSQPIALSE 60
Query: 123 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 182
SSGLIFVAHLSGFFVVRIKDVIASA+EIKNGGT SSVQDLSIVDVSIGKVHILAVSTDNS
Sbjct: 61 SSGLIFVAHLSGFFVVRIKDVIASAQEIKNGGTCSSVQDLSIVDVSIGKVHILAVSTDNS 120
Query: 183 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 242
VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY
Sbjct: 121 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180
Query: 243 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD 302
QGS NGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSH+FKERLSMSLLPSLGNGETD
Sbjct: 181 QGSVNGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHRFKERLSMSLLPSLGNGETD 240
Query: 303 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 362
TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLV VI+SKDGKITDVSSNKVLLSFCD
Sbjct: 241 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVLVIKSKDGKITDVSSNKVLLSFCD 300
Query: 363 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 422
IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID
Sbjct: 301 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360
Query: 423 RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 482
RNTSLPKIELQANGDDNLVMGLC+DRVSL GKVIVKVGFEDMREVSPYCILVCLTLEGEL
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCVDRVSLPGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420
Query: 483 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESK-ESREANIDHRMQVTEKIAIS 542
IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESK ESREAN+D +MQVTEKI IS
Sbjct: 421 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKKESREANVDLKMQVTEKITIS 480
Query: 543 SEIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPER 602
SEIPREK KTSNDIKSS ND+S V NIDESAIVSPEGNTKSQKVDSFI+SQSLKSSAPER
Sbjct: 481 SEIPREKVKTSNDIKSSNNDRSPVSNIDESAIVSPEGNTKSQKVDSFIHSQSLKSSAPER 540
Query: 603 PPHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSE 662
PP+ EIGNFDKPVLKFTGLGS SISGK EDVPSQPFPNVKES KRLGSTGL+AASELSSE
Sbjct: 541 PPNNEIGNFDKPVLKFTGLGSVSISGKPEDVPSQPFPNVKESQKRLGSTGLVAASELSSE 600
Query: 663 KAMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGR 722
K M FKK+ V SV TSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGR
Sbjct: 601 KTMFFKKLI-VSSVLTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGR 660
Query: 723 QATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLD 782
Q TGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLD
Sbjct: 661 QVTGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLD 720
Query: 783 TLLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMV 842
TLLESIEESGGFMDACTAFQKSSVEALELGLASLSD CQIWRSTMNER QEVQNLFDKMV
Sbjct: 721 TLLESIEESGGFMDACTAFQKSSVEALELGLASLSDECQIWRSTMNERVQEVQNLFDKMV 780
Query: 843 QVLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFN 902
QVLSKKTYIEGIVMQ+SDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFN
Sbjct: 781 QVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFN 840
Query: 903 GLELNKFGGNEESQVSERALQRKFGSSRYFH 932
GLELNKFGGNEESQVSERALQRKFGSSR+ H
Sbjct: 841 GLELNKFGGNEESQVSERALQRKFGSSRHSH 870
BLAST of Cucsat.G9615.T4 vs. ExPASy TrEMBL
Match:
A0A5A7SY34 (Nuclear pore complex protein NUP214 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold65G001060 PE=4 SV=1)
HSP 1 Score: 1587 bits (4109), Expect = 0.0
Identity = 832/896 (92.86%), Postives = 845/896 (94.31%), Query Frame = 0
Query: 63 MASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLALSE 122
MASVDSG S LIPLEDAGEGEQIVRND YFQKIGKPVPVKLGDSIFDPESPPSQP+ALSE
Sbjct: 1 MASVDSGSSPLIPLEDAGEGEQIVRNDFYFQKIGKPVPVKLGDSIFDPESPPSQPIALSE 60
Query: 123 SSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVSTDNS 182
SSGLIFVAHLSGFFVVRIKDVIASA+EIKNGGT SSVQDLSIVDVSIGKVHILAVSTDNS
Sbjct: 61 SSGLIFVAHLSGFFVVRIKDVIASAQEIKNGGTCSSVQDLSIVDVSIGKVHILAVSTDNS 120
Query: 183 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 242
VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY
Sbjct: 121 VLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHGQLY 180
Query: 243 QGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNGETD 302
QGS NGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSH+FKERLSMSLLPSLGNGETD
Sbjct: 181 QGSVNGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHRFKERLSMSLLPSLGNGETD 240
Query: 303 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLSFCD 362
TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLV VI+SKDGKITDVSSNKVLLSFCD
Sbjct: 241 TDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVLVIKSKDGKITDVSSNKVLLSFCD 300
Query: 363 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 422
IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID
Sbjct: 301 IHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVVNID 360
Query: 423 RNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLEGEL 482
RNTSLPKIELQANGDDNLVMGLC+DRVSL GKVIVKVGFEDMREVSPYCILVCLTLEGEL
Sbjct: 361 RNTSLPKIELQANGDDNLVMGLCVDRVSLPGKVIVKVGFEDMREVSPYCILVCLTLEGEL 420
Query: 483 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESK-ESREANIDHRMQVTEKIAIS 542
IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESK ESREAN+D +MQVTEKI IS
Sbjct: 421 IMFQFSSVNETEAPHETVSACDDEEDDITVPTDDRSESKKESREANVDLKMQVTEKITIS 480
Query: 543 SEIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSAPER 602
SEIPREK KTSNDIKSS ND+S V NIDESAIVSPEGNTKSQKVDSFI+SQSLKSSAPER
Sbjct: 481 SEIPREKVKTSNDIKSSNNDRSPVSNIDESAIVSPEGNTKSQKVDSFIHSQSLKSSAPER 540
Query: 603 PPHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLMAASELSSE 662
PP+ EIGNFDKPVLKFTGLGS SISGK EDVPSQPFPNVKES KRLGSTGL+AASELSSE
Sbjct: 541 PPNNEIGNFDKPVLKFTGLGSVSISGKPEDVPSQPFPNVKESQKRLGSTGLVAASELSSE 600
Query: 663 KAMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGR 722
K M FKKIDPV SV TSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGR
Sbjct: 601 KTMFFKKIDPVSSVLTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVPSTLTQSGR 660
Query: 723 QATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLD 782
Q TGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLD
Sbjct: 661 QVTGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLD 720
Query: 783 TLLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMV 842
TLLESIEESGGFMDACTAFQKSSVEALELGLASLSD CQIWRSTMNER QEVQNLFDKMV
Sbjct: 721 TLLESIEESGGFMDACTAFQKSSVEALELGLASLSDECQIWRSTMNERVQEVQNLFDKMV 780
Query: 843 QVLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQ-------------- 902
QVLSKKTYIEGIVMQ+SDSKYWEQWDRQKLSSELELKRQHILKMNQ
Sbjct: 781 QVLSKKTYIEGIVMQASDSKYWEQWDRQKLSSELELKRQHILKMNQVGLFLFQKPLNFSN 840
Query: 903 -----------NITNQLIELERHFNGLELNKFGGNEESQVSERALQRKFGSSRYFH 932
NITNQLIELERHFNGLELNKFGGNEESQVSERALQRKFGSSR+ H
Sbjct: 841 FRCYLYSSFFQNITNQLIELERHFNGLELNKFGGNEESQVSERALQRKFGSSRHSH 896
BLAST of Cucsat.G9615.T4 vs. ExPASy TrEMBL
Match:
A0A6J1CBF2 (nuclear pore complex protein NUP214 OS=Momordica charantia OX=3673 GN=LOC111010057 PE=4 SV=1)
HSP 1 Score: 1213 bits (3139), Expect = 0.0
Identity = 655/880 (74.43%), Postives = 725/880 (82.39%), Query Frame = 0
Query: 60 LQFMASVDSGPSSLIPLEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLA 119
LQ S S+ I E+A EGE + D YF+KIG+PVPVKL DSIFD ESPPSQPLA
Sbjct: 4 LQDSTPSTSSTSTPIRFEEAEEGEHVESTDYYFEKIGEPVPVKLHDSIFDSESPPSQPLA 63
Query: 120 LSESSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVST 179
+SES GLIFVAHLSGFFV R +DVIASA+EIKNGGTGSSVQDLSI+DVS+G+VHILA+S
Sbjct: 64 VSESFGLIFVAHLSGFFVARTEDVIASAKEIKNGGTGSSVQDLSIMDVSVGRVHILALSA 123
Query: 180 DNSVLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHG 239
D+S +AAVVA D+H+FSV SLLDKA KP+ SCSITDSS IKDFKW RKLE++YLVLSKHG
Sbjct: 124 DSSTIAAVVAADIHLFSVHSLLDKAAKPFYSCSITDSSCIKDFKWIRKLESSYLVLSKHG 183
Query: 240 QLYQGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNG 299
QLYQGSANG L HVMHD DAVECSVKG+FIAVAKKDTLTIFS KFKERLSMSLLPS
Sbjct: 184 QLYQGSANGTLKHVMHDTDAVECSVKGRFIAVAKKDTLTIFSSKFKERLSMSLLPS---- 243
Query: 300 ETDTDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLS 359
+ D++F VKVDCIKWVRADCII+GCF+VTA GDEE+Y VQVIRSKDGKITDVSSN+VLLS
Sbjct: 244 DADSNFIVKVDCIKWVRADCIILGCFEVTAIGDEENYFVQVIRSKDGKITDVSSNRVLLS 303
Query: 360 FCDIHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVV 419
F IH GFTRDILP SGPCL SYL CKLAIVANR + HI LLG L EVEN+VAV+
Sbjct: 304 FQYIHPGFTRDILPVGSGPCLFSSYLGKCKLAIVANRNNTDQHIVLLGWLPEVENQVAVI 363
Query: 420 NIDRNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLE 479
+I+R+TSLP+IELQ NGDDNLVMGLCIDRVSL KV ++VG EDMREVSPYCIL+CLTLE
Sbjct: 364 DIERDTSLPRIELQENGDDNLVMGLCIDRVSLPAKVKIQVGVEDMREVSPYCILLCLTLE 423
Query: 480 GELIMFQFSSVNETEAPHETVSACDDEEDDIT-VPTDDR----SES-KESREANIDHRMQ 539
G+L+MF SS+NETE PHETVSAC+DEE+D T VP DD+ SES KE REA + M
Sbjct: 424 GKLVMFHLSSINETETPHETVSACEDEEEDDTIVPIDDQPQVSSESRKELREAMVGQ-MH 483
Query: 540 VTEKIAISSEIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQS 599
T+KI SSEIP EK SNDIK S DQS V ID+SAIVS E N+KS+KV SFIYSQ
Sbjct: 484 DTDKITTSSEIPEEKINISNDIKPSDIDQSPVSYIDKSAIVSRESNSKSEKVGSFIYSQP 543
Query: 600 LKSSAPERPPHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPNVKESTKRLGSTGLM 659
LKSS E+P + EIGNF KPV KFTGLGS + SG+S DVPSQPF N KEST RLGSTGL
Sbjct: 544 LKSSILEKP-NSEIGNFGKPVQKFTGLGSVAFSGQSADVPSQPFLNAKESTLRLGSTGLQ 603
Query: 660 AASELSSEKAMSFKKIDPVPSVFTSNSLQSSNTENYGPSFGTANAFTGFAGKPFQPKDVP 719
ASELSS++AM KIDP SV NSLQS+ T+N GPSFG ANAFT F G+ FQ KDV
Sbjct: 604 DASELSSDRAMFLNKIDPASSVLPLNSLQSTKTDNLGPSFGAANAFTAFTGRSFQTKDVS 663
Query: 720 STLTQSGRQATGGAGKIESLPVIRSSQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPM 779
STLTQ GRQ T GAGKIESLP +RSSQ+ LQD FS GK SNEKH SER YSN PLAKPM
Sbjct: 664 STLTQIGRQVTAGAGKIESLPPMRSSQVPLQDNFSLGKTSNEKHSRSERNYSNVPLAKPM 723
Query: 780 KEMCEGLDTLLESIEESGGFMDACTAFQKSSVEALELGLASLSDGCQIWRSTMNERSQEV 839
KEMC+GLD LLESIEE GGF DACTA QKSS+EALELGLA+LSD CQIW TMNER+QE+
Sbjct: 724 KEMCDGLDMLLESIEEPGGFWDACTASQKSSIEALELGLATLSDQCQIWGRTMNERAQEI 783
Query: 840 QNLFDKMV-QVLSKKTYIEGIVMQSSDSKYWEQWDRQKLSSELELKRQHILKMNQNITNQ 899
QNLFDK V QV+ KKTYIEGIV Q+S S YWE WDRQ+LSSELELKRQHILK NQN+TNQ
Sbjct: 784 QNLFDKTVNQVMPKKTYIEGIVKQASHSHYWEHWDRQRLSSELELKRQHILKTNQNMTNQ 843
Query: 900 LIELERHFNGLELNKFGGNEESQVSERALQRKFGSSRYFH 932
LIELERHFNGLELNKFGGN+ESQVSERALQRKFGSSR+ H
Sbjct: 844 LIELERHFNGLELNKFGGNDESQVSERALQRKFGSSRHSH 877
BLAST of Cucsat.G9615.T4 vs. ExPASy TrEMBL
Match:
A0A6J1G030 (nuclear pore complex protein NUP214 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111449495 PE=4 SV=1)
HSP 1 Score: 1132 bits (2927), Expect = 0.0
Identity = 623/914 (68.16%), Postives = 703/914 (76.91%), Query Frame = 0
Query: 63 MASVDSGPS-SLIP--LEDAGEGEQIVRNDLYFQKIGKPVPVKLGDSIFDPESPPSQPLA 122
MASVDS S SL P LED+ EGE + ND YF+KIG+PVPVKL DSIFDP SPPSQPLA
Sbjct: 1 MASVDSRHSISLTPIALEDSYEGEHVETNDYYFEKIGEPVPVKLNDSIFDPGSPPSQPLA 60
Query: 123 LSESSGLIFVAHLSGFFVVRIKDVIASAEEIKNGGTGSSVQDLSIVDVSIGKVHILAVST 182
+SES GLIFVAHLSGFFVVR DV+ASA+E+KNGGTGSS+QDLSIVDVS+GKVH+LA+S
Sbjct: 61 VSESFGLIFVAHLSGFFVVRTTDVVASAKEMKNGGTGSSIQDLSIVDVSVGKVHVLALSN 120
Query: 183 DNSVLAAVVAGDVHIFSVQSLLDKAEKPYSSCSITDSSFIKDFKWTRKLENTYLVLSKHG 242
DNS LAAVVAGDVH+FSV SLLDKAEKPY SCS TDSS IKDFKWTRK EN+YLVLSKHG
Sbjct: 121 DNSFLAAVVAGDVHLFSVDSLLDKAEKPYFSCSTTDSSCIKDFKWTRKPENSYLVLSKHG 180
Query: 243 QLYQGSANGPLTHVMHDIDAVECSVKGKFIAVAKKDTLTIFSHKFKERLSMSLLPSLGNG 302
+LYQGSA+ P H+MHDIDAVECSVKGKFIAVAKKDTL+IFS+KFKERLSMSLLPS GNG
Sbjct: 181 KLYQGSASVPFKHIMHDIDAVECSVKGKFIAVAKKDTLSIFSYKFKERLSMSLLPSSGNG 240
Query: 303 ETDTDFTVKVDCIKWVRADCIIIGCFQVTATGDEEDYLVQVIRSKDGKITDVSSNKVLLS 362
+TDTDF +KVD IKWVRADCIIIGCFQVTATGDEEDY VQVIRSKDGKITDVSSNKVLLS
Sbjct: 241 DTDTDFAMKVDSIKWVRADCIIIGCFQVTATGDEEDYFVQVIRSKDGKITDVSSNKVLLS 300
Query: 363 FCDIHSGFTRDILPGESGPCLLLSYLDTCKLAIVANRLYVEDHIALLGLLLEVENEVAVV 422
F DI+SGFT DILP E+GPCLLLSYLD CKLAIVANR + HI LLG L EVENEVAV+
Sbjct: 301 FHDIYSGFTPDILPVETGPCLLLSYLDKCKLAIVANRNNTDQHIVLLGWLQEVENEVAVI 360
Query: 423 NIDRNTSLPKIELQANGDDNLVMGLCIDRVSLLGKVIVKVGFEDMREVSPYCILVCLTLE 482
+I+R+ SLP+IELQ NGDDNLVMGLCIDRVSL GKV V+VG E++REVSPYC L+CLTLE
Sbjct: 361 DIERDKSLPRIELQDNGDDNLVMGLCIDRVSLPGKVEVQVGNEEIREVSPYCTLLCLTLE 420
Query: 483 GELIMFQFSSVNETEAPHETVSACDDEE-DDITVPTDDRSESKESREANIDHRMQVTEKI 542
G+L++F FSS NE+EA ETVSACD+EE DD VPTDD+ +
Sbjct: 421 GKLLLFHFSSANESEASDETVSACDEEEEDDTVVPTDDQPQ------------------- 480
Query: 543 AISSEIPREKGKTSNDIKSSRNDQSLVYNIDESAIVSPEGNTKSQKVDSFIYSQSLKSSA 602
+ S I DQ V +DES +++ E N KSQ++DSF +SQ LK S
Sbjct: 481 -LFSNI----------------DQRPVSEVDESPVITRESNAKSQQMDSFAFSQPLKPST 540
Query: 603 PERPPHYEIGNFDKPVLKFTGLGSASISGKSEDVPSQPFPN--VKESTKRLGS------- 662
ERP + EIGNF KPV FTGLGS + SG+S DVPSQ + ++ +G+
Sbjct: 541 LERPNN-EIGNFTKPVKSFTGLGSVAFSGQSVDVPSQSLKSSILERPNNEIGNFNKPFHK 600
Query: 663 -TGLMAASELSSEKAMSFKKIDPVPSVFTSNSL--------------------QSSNTEN 722
TGL + + M+ + + P +N + QS++ +
Sbjct: 601 FTGLGSVAFSGQSVDMANQSLKPSFLERPNNQIGNFDKPVQKFTGLGSVAFSEQSADVPS 660
Query: 723 YG----------PSFGTANAFTGFAGKPFQPKDVPSTLTQSGRQATGGAGKIESLPVIRS 782
+ S G ANAFTGFAGKPFQPKDVPSTLTQSGRQ + GAGKIESLPVI+S
Sbjct: 661 HPFLNVKESTIKQSSGAANAFTGFAGKPFQPKDVPSTLTQSGRQVSAGAGKIESLPVIQS 720
Query: 783 SQISLQDKFSSGKISNEKHDGSERYYSNSPLAKPMKEMCEGLDTLLESIEESGGFMDACT 842
SQ+SLQD FS GKISN+K DGSER Y N PLAKPM EMCEGLD LLESIEE GGF+DACT
Sbjct: 721 SQVSLQDNFSLGKISNKKQDGSERNYGNVPLAKPMTEMCEGLDMLLESIEEPGGFLDACT 780
Query: 843 AFQKSSVEALELGLASLSDGCQIWRSTMNERSQEVQNLFDKMVQVLSKKTYIEGIVMQSS 902
FQKSSVEALELGLA+LSD CQIWR TM ER+QEVQNLFD+ V+VLSKKTYIEGIV Q+S
Sbjct: 781 TFQKSSVEALELGLATLSDQCQIWRRTMTERAQEVQNLFDRTVEVLSKKTYIEGIVTQAS 840
Query: 903 DSKYWEQWDRQKLSSELELKRQHILKMNQNITNQLIELERHFNGLELNKFGGNEESQVSE 932
DS YWE WDRQKLSSELELKRQ IL+MNQN+TNQLIELERHFNGLELNKFGGNEE QV+E
Sbjct: 841 DSNYWEHWDRQKLSSELELKRQRILQMNQNMTNQLIELERHFNGLELNKFGGNEEIQVNE 877
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
F4I1T7 | 3.7e-164 | 40.90 | Nuclear pore complex protein NUP214 OS=Arabidopsis thaliana OX=3702 GN=NUP214 PE... | [more] |
Match Name | E-value | Identity | Description | |
XP_031741374.1 | 0.0 | 99.77 | nuclear pore complex protein NUP214 isoform X1 [Cucumis sativus] >KGN52214.2 hyp... | [more] |
XP_031741375.1 | 0.0 | 98.62 | nuclear pore complex protein NUP214 isoform X2 [Cucumis sativus] | [more] |
XP_008445928.2 | 0.0 | 95.18 | PREDICTED: LOW QUALITY PROTEIN: nuclear pore complex protein NUP214 [Cucumis mel... | [more] |
KAA0034115.1 | 0.0 | 92.86 | nuclear pore complex protein NUP214 [Cucumis melo var. makuwa] | [more] |
XP_038892124.1 | 0.0 | 87.09 | nuclear pore complex protein NUP214 isoform X2 [Benincasa hispida] | [more] |
Match Name | E-value | Identity | Description | |
A0A0A0KV45 | 0.0 | 99.77 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G583270 PE=4 SV=1 | [more] |
A0A1S3BDU8 | 0.0 | 95.18 | LOW QUALITY PROTEIN: nuclear pore complex protein NUP214 OS=Cucumis melo OX=3656... | [more] |
A0A5A7SY34 | 0.0 | 92.86 | Nuclear pore complex protein NUP214 OS=Cucumis melo var. makuwa OX=1194695 GN=E6... | [more] |
A0A6J1CBF2 | 0.0 | 74.43 | nuclear pore complex protein NUP214 OS=Momordica charantia OX=3673 GN=LOC1110100... | [more] |
A0A6J1G030 | 0.0 | 68.16 | nuclear pore complex protein NUP214 isoform X1 OS=Cucurbita moschata OX=3662 GN=... | [more] |