Sequences
The following sequences are available for this feature:
Gene sequence (with introns)
Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
GCCATTGTAGAACCGGAAAAAACCTCCACGCAATAAACTTTGATCTTGAATTTGATTGAGATTTAAGATACTTCATAGTTCTTCCTCGCCTTTATTCTCATTCATGGTGACTGCGAGGCAACAGAAAAACCTTGAAGAAGAAGATGAAGAAGGGTTGGGAAGGGTTAGGAAGTTAATAGATGAAAGATTCGTTAAGAAATCACCGCCGAAACCTTATGATCGGCCGCCAGATGGCATAAGAACGTCTGGAAACAATTCGTGGATCTTGAAGCTCGTTGATCCAGGTCAAAGGCTCATTTCCTCTGGTTCTCGAATGCTTTTTTCCTCCGTGATCCGAAAATTTCCTCACCATTTAACGTCTCGTGTTTCGTCTCAAGGTTTGTTATGTTAAAACTTGTGTATGATTTTCCTTATGGCTCTGTAGTGGATCATTCACAGACGTTATTACTGGAGTTGGTTTTTCTGATCATCCATCTAAGAATACTTACGTACTATATTTAAGACTGTTTTTCTTTTACGAACTTTATTAGATAAGTAGCATCTGATTTTGTTATTTCCAAGAGTAATCTGCTCGTTTATTATTATTTTTTCTTTCATGCATCAGAATCAAGCCAGTCAAGAAAGGATGACAACAAGGTCGATGTAACAGTAAGTAGATGATATGAATGTTCTTTTAAACTTTATGGCCATCGATTATTATATGCTTAGTTATAAAAAGTTAGCTATCTATATTGTCATTAATGCATCGTCCTTGATTAATCTAATTGGAAAATTTTTGTGGTATGCAAATTTACTTGCTAAGGCCCCTTTTGAGGTCCGAGTAGCAACCAACGTAGGTGATAATCGGAGTAGATCATCTGATCAATTTTTAATGATGGAGCTTGAGAAAACTTTGAAGCAAAAGACCTTCACCAGGTAACTTTTGGAGTATACATTACTTTTATGGTTCATGAGTAACATTTCAGGTAGTTCAGATTTTATTTTCTTATTGTTTTTACTTCATTTTCCTTCTATGCAGTCTTGTTTCAGTAACTTTTGGTTTGCATTGTTTCTGTTTTCTATATTTTCCATTGTAGTGAAATGTAATAGAATTTTCAAAGTTAACCATTTTTATAGACTTGTTTTTTTAGCTCTTGAGATTATTACTATGACTCATTAAAATGAAAATCACTTTGCAGGTCTGAGATTAATCATTTGACAACGTTATTGCATTCAAGAAATGGTGATTTACCCGTTGTGCATAAGGAGAAAAGTTTCAAGTTTATCTCTTCCATTCCAGAACCTAACAGGAAGGAGTTTGTAAAAATACCAAATTCAGAAGTTAGGATGGGCAGGCCATCGATTTCAACCCCCATTTTGAGTTCAAGTGTACGGCTCTTTCTTTTCTTTCCCCATTTCTTAGTGCCTTACGTTTTTTTTACCACTGTGATTTCATTTAGTGGTCAAAATGCCTTCAGGTACTTGATGGAGATATTTCTTCACCTGCAGAGGTTGCAAGGGCATACATGGGGAGTAGAGAGTCAAAAGTTTGTCCTTCAATGCGATCTCTGAGAGCTCAAGGACTTGGAAAAAATTCAACTGATTCAACTAGTTTAACCAATATGTTGCTTGCACCACCATCTATTAGTCAGGTACTTGCAGGAATTGTTCTATAACTTTAAACTCTCAAGCCCGCTCTACAGTGTACAGCATGAGTCAGTGGCCCTCTTCCAGAGTTCATACAACATTCATTAGCAAGGTTTTATGAGACCCCTAGTGTGTTTTCTTGGTCAGGCTTTCGTGTTGAGATCACTGATAACGAGAACGTATTTGAGAATTTAAGGAAGATTTTATAATCTAACCATTCTGCATGTCTCAGATGCTCTGTTTAATCGAGTTGCAGATTGAACCATTGATTGTATTTTTTTTCTACCATATGCAGGGTTTGAAACGTAGGAGCTCATTTCTTGATAATCACATCAGATCCATTGTTTCTCTGCGCAAAATTCGACAAAAACCTA
ACATTCATCTTTCAAAAGGATTAAGCTTACCTATTTCTGCTAGGCCTATTTCTGTCCCTGTAGTTGGACTTAGTTTTGATGCTTCCCAGAGCTCCAAATTTGGGAGAACTCAGAATTTTCCATCTTGTATTTGGAACTCACAACTGTCTACTAAACCAAATAAAACTTTTGCAAGAAAGTTTATTACGAACGTGGAGAGTGATAACATTCCTGGTGCAGGTAGCAGCTCTATTTATACTCTTTCAAGGTCTTCTAAGATGGCTTCTAAAATATTGGAGCAGCTTGAAAAGTTGACCTCTCCAAAGGAGAAAGTATCTACATTTAATCTACTTCCTGTTAGGGAAAAATATCACCCTAAGCTGTCACCAGCCGAAGTAGTTGGGCATCTCAAAAGTGTGAAGGATGTGGACTTGCCCAGAGACGACAAGCAGTCAAATAGTTTGCTTGGGATCTCATATCAGGGCAACCGAGAAAACACTTTCCAACATAAAGAGAAGCTGGAAAAACTGAAATCATCGGATCCTCATCCTAATCGTGATTTACTGAAGGACTATGGATCAATGGGTTCTAGTAAGGATTCCATGAATGATCAAGGAATGCCTGAATCTGCTGTGGTGAAATCTACTATTCAGCCCCCAAAAGACAAACAGGCATTTCCAATGTTGCCTGACAAGGTTTGTAACGTTTAAACTTTCTTTGTAATGGCAAAACATTTCGGCAAAAGCACTCAATAATATCTTGATGTTCTTTTTCTAATTTTCCGCATCCTCACTCATAAATCTCATGGTGAAAATCTGCTGTTGAATTTTGTCTCAATACTCTTGAACTTCCAAAAGTTTCAAATGATACTCTTAAACTTTCATAAAAAGTTCATAAATACCCTTACTTCTAATTTTGGATAGAAATCGGTAGTAGTACGTTGTTTCAAAAGTACGTATGAACTTTAAAACTTCAAAAGCCTAATTAAAATCCATACATTATTATAATTTTTTCTTCTACTTATCATTTGCTTCTTTACTACTATAATATTTGGTAGTAACGTGAGAATGTTCAATAAGATTGTTATATTTGCAGCAGTTAAATAACTATTTGAATGATCAAATATATTATCTCTGGTTTCTTTCTTTTGAATATATGTTTGTTTAAATTCTTTGAATCTTTAACCAATCTCACCACCTTTCTCAATGATAGGATAGTGTTTACCAAGATGAAAGTTCTGCCGCTAGAGTTGCACCTGCCACTGCTGAGGTTAGAGAAGGTGATGTTTCTTTGGCTGTGAGACAAACAACTGCTAATGAGTCCCTCTCTCCAGCAAGGATACAAAAACCATCTGAAGTAATAGTGGGTTCTTCTCTCTACGGAAGTTCTGATTCGGAAACTTTCGGTGACAGCATTGATGATGATATCGATACCGGACTAACTTTTCAAAATGCATCTTCACTTTGCACTTCACAGCCAGAAACTAATGATTCTTTTGGAAATAAGAATCTTCCAGAAAATAAGCAAATTGTTTCTCCAGTTTTTAGCTTTGTAAATAATGTCTCTCCACGTAAACAGCCAATCGCTAGTTCTGCTGCATTGGATATTGGTAATAAGGATGATTCTCTTACAGAATTATGTGCTGATTCTGAAAATGTCAATGAACCTTCATACCCATACACGCAGTGCAATCCAGCTTCTTCAAACGATAAGCTAGATTCCTCTTGGAGGTCAGTATATTCATTTCGCTCTTTATTAAGAAATATAGTTTTGTAGTCTTTCCATTATCTTCTATAAATTAGAGAAACATATACAAGTCTATTTTGGTTTTGAGTGGGTTGATAGCTTGTCTCCCCTTTTTATCCACGTCTCCTTTATTAATGAAAAGCTCTTTCTTATATTTATTTATTTCTTTTTTAAAAAAAGGACCTTGAATGTATGGAAAACGTTGTTGGAAGTATTTAAGATTCCTTCTAACTGGACGTGAATTTAATGGAAAGAGAATCATTTGTGAAGTTCTT
AAGGGAAGAAAAGAACATTAAAGCACGATCATCAACATATTCTAAATGATATTTGTTTGAAAACTGACATTTCATCTTTCTTCATTTCTTACAACATCTTTGCACTCGTTCACTTCTTATATCTGGAAAATTTGTTTCCTTCATGTTTACTGTCCATCCAATTGAGTTTATTGAAGAATCTGTATCATGTAACTGATTGGGATGATCTTTCAAATTTTCTTTTCACCAGGACCTGCAATGATGCATTCTCATCCTCTGTTTCCCTATCAGCTGGACTTGCATTCTCATTTAGCTCGAATCCTGGCAATCAAAGTCCAAATGATGGCCTTTCTATTTCATGTCCATCTCTATACTCTTCCTATAGTCCATCAACTGGGTTTATGAATCGAAGTTCATCCAGAAATATCTTCCTCTCTGCCCCGTATGCTATCAACAACGCTAATATAATCACAACTATGGCATCTTTATTTTCTCCAACGACTTCAGGCGCAGGAAGTTATGAAGACGAGATCAAGCAGGATGCAAGCCTACGCAATGTAAATGACACTTATTTCAGCAGCATAACTACACCTGCAAATTCTCACTACAGTATGTTCAGCTTCGGTTCTGCAGCAACACCTTCGTTTGTGACTAATCTGTTGAGTAAACCTACTGTTAGCAGTGCAACTGAGCTCAGTGCTCCGGACGTTTCTGTTGAAAAGGAATTTATAGCTAATGCGGAAAAAACATCCATGATTTTAGAATCATCCACGTCTCATGTATCATCGGGGATGGCTGGAAAAGCATCCGTCTGTTGTGGCCTTTCTTTTGGGTGCTCATCTCCTGCTTCTGAACAGTTTAATTCAGGAAATAGGCCATCAGAATTTCCCATCACTGGGTTCACTAGTGCCCACGCAACTTCAACCATCAGTACCTCCAATGTTTCTACATCTAGTACACTTCTTGAATTTGAGTCATTTACCGGGGCATCTTTCAGTTCTATACGTTGTACAACCTCAGCAGCAGCATTAGCAAATTCCACGCCTGTTTTGAGTAATTCGTATCCCAAAGTTGCTTTTAGCGTTTCTTCAGTCAATAATGACTGTGAAGAGCAGGGAACCTCCAAGGACAATGTTCCACTTTTCAGTCAAAAGCCAAAATTTTCTTTTGGCTCAGGCACATCCGAATTAACTCTCTTTCAAGTTGGAAAATTAGAGAACCAGCAGACTTTGGCCGAACCCCAAAATTCATATCCATATATGGCTGCTTCCAACAGCCTAGAAGCTAAAGCTGGAGGCAGCTTCTCCTTGAATGCTGGTGGCAGCGACAAGGCTAACCGGAGATCTGTGAAGTTCAAACGAAGGAAATAATGAGAGCAACACCGGTAGAGTTGATTTTGCAGCCTAAACTATGCAGAGCCATGCATTGAAAACAGTTAAGTACTAGTGCTAGACATACAACAGAGTCATCTATCCTGTATTTTTCTTATTTCTTATGTTTAAATGAAGAAGTAATCTCATTATAGACAGTTTCTTATGATTGATTTTTCTCTGATCAGTTCATTCAGGTTTGTGACGATGTATTTTCACTGTTATGGGATCCGCAAAGTTCACAAGACAGAATTCAGGTAAGGACAAACCTTCATAAATCCAACTCTTCTTTTTTTCTCTTACACATCCAAATGTTCTTTTCTCTAGATTCCTTTTCTTTTTCTTTTTTTTTCTTGAATTGTTTGTTGGGTCTAATACAAAGAATATAGTTTTGCATAACATGGAGTAACATCCACACACACACACACACTCCCCACCTCCCATACCAACCCACAACAGCCCCATTAGGCTTAATAATATATACCCTTTTTTCAACCAATCATTGAAAATATGCAAAACCATTCAAGGAAAGAAAAAAAAAATAGGAAGCCAAAACATTGACAAGGAGTTTATGTTCATTGAACAAACTATGTTTTCTTTTTTCTTTTTGGGAGGGTGCAAGAAAGTGAGAATACTCGTCCTTGCGTCAAA
TATAGACCAAACAAAGGGCCACATCTCTAATCAATAATCTCTTTTTCTTCTCTAAATTTGTA
mRNA sequence
GCCATTGTAGAACCGGAAAAAACCTCCACGCAATAAACTTTGATCTTGAATTTGATTGAGATTTAAGATACTTCATAGTTCTTCCTCGCCTTTATTCTCATTCATGGTGACTGCGAGGCAACAGAAAAACCTTGAAGAAGAAGATGAAGAAGGGTTGGGAAGGGTTAGGAAGTTAATAGATGAAAGATTCGTTAAGAAATCACCGCCGAAACCTTATGATCGGCCGCCAGATGGCATAAGAACGTCTGGAAACAATTCGTGGATCTTGAAGCTCGTTGATCCAGGTCAAAGGCTCATTTCCTCTGGTTCTCGAATGCTTTTTTCCTCCGTGATCCGAAAATTTCCTCACCATTTAACGTCTCGTGTTTCGTCTCAAGAATCAAGCCAGTCAAGAAAGGATGACAACAAGGTCGATGTAACAGCCCCTTTTGAGGTCCGAGTAGCAACCAACGTAGGTGATAATCGGAGTAGATCATCTGATCAATTTTTAATGATGGAGCTTGAGAAAACTTTGAAGCAAAAGACCTTCACCAGGTCTGAGATTAATCATTTGACAACGTTATTGCATTCAAGAAATGGTGATTTACCCGTTGTGCATAAGGAGAAAAGTTTCAAGTTTATCTCTTCCATTCCAGAACCTAACAGGAAGGAGTTTGTAAAAATACCAAATTCAGAAGTTAGGATGGGCAGGCCATCGATTTCAACCCCCATTTTGAGTTCAAGTGTACTTGATGGAGATATTTCTTCACCTGCAGAGGTTGCAAGGGCATACATGGGGAGTAGAGAGTCAAAAGTTTGTCCTTCAATGCGATCTCTGAGAGCTCAAGGACTTGGAAAAAATTCAACTGATTCAACTAGTTTAACCAATATGTTGCTTGCACCACCATCTATTAGTCAGGGTTTGAAACGTAGGAGCTCATTTCTTGATAATCACATCAGATCCATTGTTTCTCTGCGCAAAATTCGACAAAAACCTAACATTCATCTTTCAAAAGGATTAAGCTTACCTATTTCTGCTAGGCCTATTTCTGTCCCTGTAGTTGGACTTAGTTTTGATGCTTCCCAGAGCTCCAAATTTGGGAGAACTCAGAATTTTCCATCTTGTATTTGGAACTCACAACTGTCTACTAAACCAAATAAAACTTTTGCAAGAAAGTTTATTACGAACGTGGAGAGTGATAACATTCCTGGTGCAGGTAGCAGCTCTATTTATACTCTTTCAAGGTCTTCTAAGATGGCTTCTAAAATATTGGAGCAGCTTGAAAAGTTGACCTCTCCAAAGGAGAAAGTATCTACATTTAATCTACTTCCTGTTAGGGAAAAATATCACCCTAAGCTGTCACCAGCCGAAGTAGTTGGGCATCTCAAAAGTGTGAAGGATGTGGACTTGCCCAGAGACGACAAGCAGTCAAATAGTTTGCTTGGGATCTCATATCAGGGCAACCGAGAAAACACTTTCCAACATAAAGAGAAGCTGGAAAAACTGAAATCATCGGATCCTCATCCTAATCGTGATTTACTGAAGGACTATGGATCAATGGGTTCTAGTAAGGATTCCATGAATGATCAAGGAATGCCTGAATCTGCTGTGGTGAAATCTACTATTCAGCCCCCAAAAGACAAACAGGCATTTCCAATGTTGCCTGACAAGGATAGTGTTTACCAAGATGAAAGTTCTGCCGCTAGAGTTGCACCTGCCACTGCTGAGGTTAGAGAAGGTGATGTTTCTTTGGCTGTGAGACAAACAACTGCTAATGAGTCCCTCTCTCCAGCAAGGATACAAAAACCATCTGAAGTAATAGTGGGTTCTTCTCTCTACGGAAGTTCTGATTCGGAAACTTTCGGTGACAGCATTGATGATGATATCGATACCGGACTAACTTTTCAAAATGCATCTTCACTTTGCACTTCACAGCCAGAAACTAATGATTCTTTTGGAAATAAGAATCTTCCAGAAAATAAGCAAATTGTTTCTCCAGTTTTTAGCTTTGTAAATAATG
TCTCTCCACGTAAACAGCCAATCGCTAGTTCTGCTGCATTGGATATTGGTAATAAGGATGATTCTCTTACAGAATTATGTGCTGATTCTGAAAATGTCAATGAACCTTCATACCCATACACGCAGTGCAATCCAGCTTCTTCAAACGATAAGCTAGATTCCTCTTGGAGGACCTGCAATGATGCATTCTCATCCTCTGTTTCCCTATCAGCTGGACTTGCATTCTCATTTAGCTCGAATCCTGGCAATCAAAGTCCAAATGATGGCCTTTCTATTTCATGTCCATCTCTATACTCTTCCTATAGTCCATCAACTGGGTTTATGAATCGAAGTTCATCCAGAAATATCTTCCTCTCTGCCCCGTATGCTATCAACAACGCTAATATAATCACAACTATGGCATCTTTATTTTCTCCAACGACTTCAGGCGCAGGAAGTTATGAAGACGAGATCAAGCAGGATGCAAGCCTACGCAATGTAAATGACACTTATTTCAGCAGCATAACTACACCTGCAAATTCTCACTACAGTATGTTCAGCTTCGGTTCTGCAGCAACACCTTCGTTTGTGACTAATCTGTTGAGTAAACCTACTGTTAGCAGTGCAACTGAGCTCAGTGCTCCGGACGTTTCTGTTGAAAAGGAATTTATAGCTAATGCGGAAAAAACATCCATGATTTTAGAATCATCCACGTCTCATGTATCATCGGGGATGGCTGGAAAAGCATCCGTCTGTTGTGGCCTTTCTTTTGGGTGCTCATCTCCTGCTTCTGAACAGTTTAATTCAGGAAATAGGCCATCAGAATTTCCCATCACTGGGTTCACTAGTGCCCACGCAACTTCAACCATCAGTACCTCCAATGTTTCTACATCTAGTACACTTCTTGAATTTGAGTCATTTACCGGGGCATCTTTCAGTTCTATACGTTGTACAACCTCAGCAGCAGCATTAGCAAATTCCACGCCTGTTTTGAGTAATTCGTATCCCAAAGTTGCTTTTAGCGTTTCTTCAGTCAATAATGACTGTGAAGAGCAGGGAACCTCCAAGGACAATGTTCCACTTTTCAGTCAAAAGCCAAAATTTTCTTTTGGCTCAGGCACATCCGAATTAACTCTCTTTCAAGTTGGAAAATTAGAGAACCAGCAGACTTTGGCCGAACCCCAAAATTCATATCCATATATGGCTGCTTCCAACAGCCTAGAAGCTAAAGCTGGAGGCAGCTTCTCCTTGAATGCTGGTGGCAGCGACAAGGCTAACCGGAGATCTGTGAAGTTCAAACGAAGGAAATAATGAGAGCAACACCGGTAGAGTTGATTTTGCAGCCTAAACTATGCAGAGCCATGCATTGAAAACATTCATTCAGGTTTGTGACGATGTATTTTCACTGTTATGGGATCCGCAAAGTTCACAAGACAGAATTCAGGTAAGGACAAACCTTCATAAATCCAACTCTTCTTTTTTTCTCTTACACATCCAAATGTTCTTTTCTCTAGATTCCTTTTCTTTTTCTTTTTTTTTCTTGAATTGTTTGTTGGGTCTAATACAAAGAATATAGTTTTGCATAACATGGAGTAACATCCACACACACACACACACTCCCCACCTCCCATACCAACCCACAACAGCCCCATTAGGCTTAATAATATATACCCTTTTTTCAACCAATCATTGAAAATATGCAAAACCATTCAAGGAAAGAAAAAAAAAATAGGAAGCCAAAACATTGACAAGGAGTTTATGTTCATTGAACAAACTATGTTTTCTTTTTTCTTTTTGGGAGGGTGCAAGAAAGTGAGAATACTCGTCCTTGCGTCAAATATAGACCAAACAAAGGGCCACATCTCTAATCAATAATCTCTTTTTCTTCTCTAAATTTGTA
Coding sequence (CDS)
ATGGTGACTGCGAGGCAACAGAAAAACCTTGAAGAAGAAGATGAAGAAGGGTTGGGAAGGGTTAGGAAGTTAATAGATGAAAGATTCGTTAAGAAATCACCGCCGAAACCTTATGATCGGCCGCCAGATGGCATAAGAACGTCTGGAAACAATTCGTGGATCTTGAAGCTCGTTGATCCAGGTCAAAGGCTCATTTCCTCTGGTTCTCGAATGCTTTTTTCCTCCGTGATCCGAAAATTTCCTCACCATTTAACGTCTCGTGTTTCGTCTCAAGAATCAAGCCAGTCAAGAAAGGATGACAACAAGGTCGATGTAACAGCCCCTTTTGAGGTCCGAGTAGCAACCAACGTAGGTGATAATCGGAGTAGATCATCTGATCAATTTTTAATGATGGAGCTTGAGAAAACTTTGAAGCAAAAGACCTTCACCAGGTCTGAGATTAATCATTTGACAACGTTATTGCATTCAAGAAATGGTGATTTACCCGTTGTGCATAAGGAGAAAAGTTTCAAGTTTATCTCTTCCATTCCAGAACCTAACAGGAAGGAGTTTGTAAAAATACCAAATTCAGAAGTTAGGATGGGCAGGCCATCGATTTCAACCCCCATTTTGAGTTCAAGTGTACTTGATGGAGATATTTCTTCACCTGCAGAGGTTGCAAGGGCATACATGGGGAGTAGAGAGTCAAAAGTTTGTCCTTCAATGCGATCTCTGAGAGCTCAAGGACTTGGAAAAAATTCAACTGATTCAACTAGTTTAACCAATATGTTGCTTGCACCACCATCTATTAGTCAGGGTTTGAAACGTAGGAGCTCATTTCTTGATAATCACATCAGATCCATTGTTTCTCTGCGCAAAATTCGACAAAAACCTAACATTCATCTTTCAAAAGGATTAAGCTTACCTATTTCTGCTAGGCCTATTTCTGTCCCTGTAGTTGGACTTAGTTTTGATGCTTCCCAGAGCTCCAAATTTGGGAGAACTCAGAATTTTCCATCTTGTATTTGGAACTCACAACTGTCTACTAAACCAAATAAAACTTTTGCAAGAAAGTTTATTACGAACGTGGAGAGTGATAACATTCCTGGTGCAGGTAGCAGCTCTATTTATACTCTTTCAAGGTCTTCTAAGATGGCTTCTAAAATATTGGAGCAGCTTGAAAAGTTGACCTCTCCAAAGGAGAAAGTATCTACATTTAATCTACTTCCTGTTAGGGAAAAATATCACCCTAAGCTGTCACCAGCCGAAGTAGTTGGGCATCTCAAAAGTGTGAAGGATGTGGACTTGCCCAGAGACGACAAGCAGTCAAATAGTTTGCTTGGGATCTCATATCAGGGCAACCGAGAAAACACTTTCCAACATAAAGAGAAGCTGGAAAAACTGAAATCATCGGATCCTCATCCTAATCGTGATTTACTGAAGGACTATGGATCAATGGGTTCTAGTAAGGATTCCATGAATGATCAAGGAATGCCTGAATCTGCTGTGGTGAAATCTACTATTCAGCCCCCAAAAGACAAACAGGCATTTCCAATGTTGCCTGACAAGGATAGTGTTTACCAAGATGAAAGTTCTGCCGCTAGAGTTGCACCTGCCACTGCTGAGGTTAGAGAAGGTGATGTTTCTTTGGCTGTGAGACAAACAACTGCTAATGAGTCCCTCTCTCCAGCAAGGATACAAAAACCATCTGAAGTAATAGTGGGTTCTTCTCTCTACGGAAGTTCTGATTCGGAAACTTTCGGTGACAGCATTGATGATGATATCGATACCGGACTAACTTTTCAAAATGCATCTTCACTTTGCACTTCACAGCCAGAAACTAATGATTCTTTTGGAAATAAGAATCTTCCAGAAAATAAGCAAATTGTTTCTCCAGTTTTTAGCTTTGTAAATAATGTCTCTCCACGTAAACAGCCAATCGCTAGTTCTGCTGCATTGGATATTGGTAATAAGGATGATTCTCTTACAGAATTATGTGCTGATTCTGAAAATGTCAATGA
ACCTTCATACCCATACACGCAGTGCAATCCAGCTTCTTCAAACGATAAGCTAGATTCCTCTTGGAGGACCTGCAATGATGCATTCTCATCCTCTGTTTCCCTATCAGCTGGACTTGCATTCTCATTTAGCTCGAATCCTGGCAATCAAAGTCCAAATGATGGCCTTTCTATTTCATGTCCATCTCTATACTCTTCCTATAGTCCATCAACTGGGTTTATGAATCGAAGTTCATCCAGAAATATCTTCCTCTCTGCCCCGTATGCTATCAACAACGCTAATATAATCACAACTATGGCATCTTTATTTTCTCCAACGACTTCAGGCGCAGGAAGTTATGAAGACGAGATCAAGCAGGATGCAAGCCTACGCAATGTAAATGACACTTATTTCAGCAGCATAACTACACCTGCAAATTCTCACTACAGTATGTTCAGCTTCGGTTCTGCAGCAACACCTTCGTTTGTGACTAATCTGTTGAGTAAACCTACTGTTAGCAGTGCAACTGAGCTCAGTGCTCCGGACGTTTCTGTTGAAAAGGAATTTATAGCTAATGCGGAAAAAACATCCATGATTTTAGAATCATCCACGTCTCATGTATCATCGGGGATGGCTGGAAAAGCATCCGTCTGTTGTGGCCTTTCTTTTGGGTGCTCATCTCCTGCTTCTGAACAGTTTAATTCAGGAAATAGGCCATCAGAATTTCCCATCACTGGGTTCACTAGTGCCCACGCAACTTCAACCATCAGTACCTCCAATGTTTCTACATCTAGTACACTTCTTGAATTTGAGTCATTTACCGGGGCATCTTTCAGTTCTATACGTTGTACAACCTCAGCAGCAGCATTAGCAAATTCCACGCCTGTTTTGAGTAATTCGTATCCCAAAGTTGCTTTTAGCGTTTCTTCAGTCAATAATGACTGTGAAGAGCAGGGAACCTCCAAGGACAATGTTCCACTTTTCAGTCAAAAGCCAAAATTTTCTTTTGGCTCAGGCACATCCGAATTAACTCTCTTTCAAGTTGGAAAATTAGAGAACCAGCAGACTTTGGCCGAACCCCAAAATTCATATCCATATATGGCTGCTTCCAACAGCCTAGAAGCTAAAGCTGGAGGCAGCTTCTCCTTGAATGCTGGTGGCAGCGACAAGGCTAACCGGAGATCTGTGAAGTTCAAACGAAGGAAATAA
Protein sequence
MVTARQQKNLEEEDEEGLGRVRKLIDERFVKKSPPKPYDRPPDGIRTSGNNSWILKLVDPGQRLISSGSRMLFSSVIRKFPHHLTSRVSSQESSQSRKDDNKVDVTAPFEVRVATNVGDNRSRSSDQFLMMELEKTLKQKTFTRSEINHLTTLLHSRNGDLPVVHKEKSFKFISSIPEPNRKEFVKIPNSEVRMGRPSISTPILSSSVLDGDISSPAEVARAYMGSRESKVCPSMRSLRAQGLGKNSTDSTSLTNMLLAPPSISQGLKRRSSFLDNHIRSIVSLRKIRQKPNIHLSKGLSLPISARPISVPVVGLSFDASQSSKFGRTQNFPSCIWNSQLSTKPNKTFARKFITNVESDNIPGAGSSSIYTLSRSSKMASKILEQLEKLTSPKEKVSTFNLLPVREKYHPKLSPAEVVGHLKSVKDVDLPRDDKQSNSLLGISYQGNRENTFQHKEKLEKLKSSDPHPNRDLLKDYGSMGSSKDSMNDQGMPESAVVKSTIQPPKDKQAFPMLPDKDSVYQDESSAARVAPATAEVREGDVSLAVRQTTANESLSPARIQKPSEVIVGSSLYGSSDSETFGDSIDDDIDTGLTFQNASSLCTSQPETNDSFGNKNLPENKQIVSPVFSFVNNVSPRKQPIASSAALDIGNKDDSLTELCADSENVNEPSYPYTQCNPASSNDKLDSSWRTCNDAFSSSVSLSAGLAFSFSSNPGNQSPNDGLSISCPSLYSSYSPSTGFMNRSSSRNIFLSAPYAINNANIITTMASLFSPTTSGAGSYEDEIKQDASLRNVNDTYFSSITTPANSHYSMFSFGSAATPSFVTNLLSKPTVSSATELSAPDVSVEKEFIANAEKTSMILESSTSHVSSGMAGKASVCCGLSFGCSSPASEQFNSGNRPSEFPITGFTSAHATSTISTSNVSTSSTLLEFESFTGASFSSIRCTTSAAALANSTPVLSNSYPKVAFSVSSVNNDCEEQGTSKDNVPLFSQKPKFSFGSGTSELTLFQVGKLENQQTLAEPQNSYPYMAASNSLEAKAGGSFSLNAGGSDKANRRSVKFKRRK*
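The protein sequence above is the codon-by-codon translation of the CDS. As a minimal sketch of that relationship, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene, the hypothetical helper `translate` below reproduces the first and last residues of the protein from the corresponding ends of the CDS. The function and variable names are illustrative and not part of the database.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: this gene is translated with the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS string codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First nine codons and last five codons of the CDS listed above.
print(translate("ATGGTGACTGCGAGGCAACAGAAAAAC"))  # MVTARQQKN
print(translate("AAACGAAGGAAATAA"))              # KRRK*
```

Passing the full CDS to `translate` should yield the complete protein, including the trailing `*` for the TAA stop codon.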
Homology
BLAST of CsGy5G025550 vs. ExPASy Swiss-Prot
Match:
Q9CAF4 (Nuclear pore complex protein NUP1 OS=Arabidopsis thaliana OX=3702 GN=NUP1 PE=1 SV=1)
HSP 1 Score: 130.2 bits (326), Expect = 1.4e-28
Identity = 303/1147 (26.42%), Positives = 456/1147 (39.76%), Query Frame = 0
Query: 31 KKSPPKPYDRPPDGIRTSG-------NNSWILKLVDPGQRLISSGSRMLFSSVIRKFPHH 90
++S PYDRP +R +G W+ KLVDP QRLI+ ++ LF S+ RK
Sbjct: 29 RRSQKTPYDRPTTSVRNAGLGGGDVRGGGWLSKLVDPAQRLITYSAQRLFGSLSRKRLGS 88
Query: 91 LTSRVSSQES---------SQSRKDDNKVDVTAPFEVRVATNVGD-NRSRSSDQFLMMEL 150
+ + S E +Q K +K DV+ + D N S + +L
Sbjct: 89 GETPLQSPEQQKQLPERGVNQETKVGHKEDVSNLSMKNGLIRMEDTNASVDPPKDGFTDL 148
Query: 151 EKTLKQKTFTRSEINHLTTLLHSRNGDLPVVHKEKSFKFISSIPEPNRKEFVKIPNSEVR 210
EK L+ KTFTRSE++ LTTLL S+ D +++E+ + + P E +
Sbjct: 149 EKILQGKTFTRSEVDRLTTLLRSKAADSSTMNEEQRNEVGMVVRHPPSHERDRTHPDNGS 208
Query: 211 MGRPSISTPILSSSVLDGDISSPAEVARAYMGSRESKVCPSMRSLRAQGLGKNST----- 270
M +STP S LD I+SPA++A+AYMGSR S+V PSM LR Q ++S
Sbjct: 209 M-NTLVSTPPGSLRTLDECIASPAQLAKAYMGSRPSEVTPSMLGLRGQAGREDSVFLNRT 268
Query: 271 ------------------------------------------------DSTSLTNMLLAP 330
S + ++ A
Sbjct: 269 PFPQKSPTMSLVTKPSGQRPLENGFVTPRSRGRSAVYSMARTPYSRPQSSVKIGSLFQAS 328
Query: 331 PS-------------ISQGLKRRSSFLDNHIRSIVSLRKIRQKPNIHLSKGLSLPISARP 390
PS GLKRRSS LDN I S+ +R+IRQK N+ S+ L+LP+S P
Sbjct: 329 PSKWEESLPSGSRQGFQSGLKRRSSVLDNDIGSVGPVRRIRQKSNLS-SRSLALPVSESP 388
Query: 391 ISVPVVGLSFDASQSSKFGRTQNFPSCIWNSQLSTKPNKTFARKFITNVESDNIPGAGSS 450
+SV G + +T +K A ++IPG+ +
Sbjct: 389 LSVRANG-----------------------GEKTTHTSKDSA---------EDIPGSSFN 448
Query: 451 SIYTLSRSSKMASKILEQLEKLTSPKEKVSTFNLLPVREKYHPKLSPAEVVG-HLKSVKD 510
+ T +SS+MASKIL+QL+KL S REK KLSP+ + G LKS+++
Sbjct: 449 LVPT--KSSEMASKILQQLDKLVS------------TREKSPSKLSPSMLRGPALKSLQN 508
Query: 511 VDLPR-----DDKQSNSLLGISYQGNRENTFQHKEKLEKLKSSDPHPNRDLLKDYGSMGS 570
V+ P+ +K++NS SYQ + +E + + + D + GS
Sbjct: 509 VEAPKFLGNLPEKKANS-PDSSYQKQEIS----RESVSREVLAQSEKTGDAVDGTSKTGS 568
Query: 571 SKD-SMNDQGMPESAVVKSTIQPPKDKQAFPMLPDKDSVYQDESSAARVAPATAEVREGD 630
SKD M +G+ PPK K++F M +D + D+ A P EV E
Sbjct: 569 SKDQDMRGKGVYMPLTNSLEEHPPK-KRSFRMSAHEDFLELDDDLGAASTP--CEVAEKQ 628
Query: 631 VSLAVRQTTANESLSPARIQKPSEVIVGSSLYGSSDSE--TFGDSIDDDIDTGLTF---- 690
+ V ++ + + + PSE + +S + D+ T S++ + + + F
Sbjct: 629 NAFEVEKSHISMPIGEKPL-TPSEAMPSTSYISNGDASQGTSNGSLETERNKFVAFPIEA 688
Query: 691 ---QNASSLCTSQ----PETNDSFGNKNLPENKQI-----VSPVFSFVN-NVSPRKQPIA 750
N +S TS+ E + K E K+I P F N + SP +
Sbjct: 689 VQQSNMASEPTSKFIQGTEKSSISSGKPTSEEKRIPLEEPKKPAAVFPNISFSPPATGLL 748
Query: 751 ---SSAALDIGNKDDSLTELCADSENVNEPSYPYTQCNPASSNDKLDSSWRTCNDAFSSS 810
S A+ DI + S T T N AS + S+ T N
Sbjct: 749 NQNSGASADIKLEKTSSTAFGVSEAWAKPTESKKTFSNSASGAESSTSAAPTLN------ 808
Query: 811 VSLSAGLAFSFSSNPGNQSPNDGLSISCPSLYSSYS-----------PST--GFMNRSSS 870
G FS +N P++G S PS S S PST F +S
Sbjct: 809 -----GSIFSAGANAVTPPPSNGSLTSSPSFPPSISNIPSDNSVGDMPSTVQSFAATHNS 868
Query: 871 RNIFLSAPYAINNANIITTMASLFSPT-------------------TSGAGSYEDEIKQD 930
+IF P + N++N +T AS S T +SG S E E+K +
Sbjct: 869 SSIFGKLPTS-NDSNSQSTSASPLSSTSPFKFGQPAAPFSAPAVSESSGQISKETEVK-N 928
Query: 931 ASLRNVNDTYFSSITTPANSHYSMFSFGSA---ATPSFV---TNLLSKPTVSSAT-ELSA 990
A+ N + F + + S +F SA + P FV ++++ T++ +T SA
Sbjct: 929 ATFGNTSTFKFGGMASADQSTGIVFGAKSAENKSRPGFVFGSSSVVGGSTLNPSTAAASA 988
Query: 991 PDVSVEKEFIANAEKTSMILESSTSHVSSGMAGKASVCCGLSFGCSSPASEQFNSGNRPS 999
P+ S F + T S S S+ SV SF +S S + +
Sbjct: 989 PESSGSLIFGVTSSSTPGTETSKISASSAATNTGNSVFGTSSFAFTSSGSSMVGGVSAST 1048
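The percentages on each HSP summary line are the matched-position counts divided by the alignment length (for this hit, 303/1147 identical and 456/1147 positive positions). A small sketch of that arithmetic, using a hypothetical helper `hsp_percentages` that parses a summary line in the format shown on this page:

```python
import re

# Matches the HSP summary line format used on this page.
# "Posi?tives" accepts either spelling of the positives label.
_HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    match = _HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )


line = "Identity = 303/1147 (26.42%), Positives = 456/1147 (39.76%), Query Frame = 0"
print(hsp_percentages(line))  # (26.42, 39.76)
```

The alignment length (1147 here) counts gap columns as well, which is why it can exceed the 1061-residue query length.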
BLAST of CsGy5G025550 vs. NCBI nr
Match:
XP_011656263.1 (nuclear pore complex protein NUP1 [Cucumis sativus] >XP_031741373.1 nuclear pore complex protein NUP1 [Cucumis sativus] >KAE8648811.1 hypothetical protein Csa_009344 [Cucumis sativus])
HSP 1 Score: 2006 bits (5196), Expect = 0.0
Identity = 1061/1061 (100.00%), Positives = 1061/1061 (100.00%), Query Frame = 0
Query: 1 MVTARQQKNLEEEDEEGLGRVRKLIDERFVKKSPPKPYDRPPDGIRTSGNNSWILKLVDP 60
MVTARQQKNLEEEDEEGLGRVRKLIDERFVKKSPPKPYDRPPDGIRTSGNNSWILKLVDP
Sbjct: 1 MVTARQQKNLEEEDEEGLGRVRKLIDERFVKKSPPKPYDRPPDGIRTSGNNSWILKLVDP 60
Query: 61 GQRLISSGSRMLFSSVIRKFPHHLTSRVSSQESSQSRKDDNKVDVTAPFEVRVATNVGDN 120
GQRLISSGSRMLFSSVIRKFPHHLTSRVSSQESSQSRKDDNKVDVTAPFEVRVATNVGDN
Sbjct: 61 GQRLISSGSRMLFSSVIRKFPHHLTSRVSSQESSQSRKDDNKVDVTAPFEVRVATNVGDN 120
Query: 121 RSRSSDQFLMMELEKTLKQKTFTRSEINHLTTLLHSRNGDLPVVHKEKSFKFISSIPEPN 180
RSRSSDQFLMMELEKTLKQKTFTRSEINHLTTLLHSRNGDLPVVHKEKSFKFISSIPEPN
Sbjct: 121 RSRSSDQFLMMELEKTLKQKTFTRSEINHLTTLLHSRNGDLPVVHKEKSFKFISSIPEPN 180
Query: 181 RKEFVKIPNSEVRMGRPSISTPILSSSVLDGDISSPAEVARAYMGSRESKVCPSMRSLRA 240
RKEFVKIPNSEVRMGRPSISTPILSSSVLDGDISSPAEVARAYMGSRESKVCPSMRSLRA
Sbjct: 181 RKEFVKIPNSEVRMGRPSISTPILSSSVLDGDISSPAEVARAYMGSRESKVCPSMRSLRA 240
Query: 241 QGLGKNSTDSTSLTNMLLAPPSISQGLKRRSSFLDNHIRSIVSLRKIRQKPNIHLSKGLS 300
QGLGKNSTDSTSLTNMLLAPPSISQGLKRRSSFLDNHIRSIVSLRKIRQKPNIHLSKGLS
Sbjct: 241 QGLGKNSTDSTSLTNMLLAPPSISQGLKRRSSFLDNHIRSIVSLRKIRQKPNIHLSKGLS 300
Query: 301 LPISARPISVPVVGLSFDASQSSKFGRTQNFPSCIWNSQLSTKPNKTFARKFITNVESDN 360
LPISARPISVPVVGLSFDASQSSKFGRTQNFPSCIWNSQLSTKPNKTFARKFITNVESDN
Sbjct: 301 LPISARPISVPVVGLSFDASQSSKFGRTQNFPSCIWNSQLSTKPNKTFARKFITNVESDN 360
Query: 361 IPGAGSSSIYTLSRSSKMASKILEQLEKLTSPKEKVSTFNLLPVREKYHPKLSPAEVVGH 420
IPGAGSSSIYTLSRSSKMASKILEQLEKLTSPKEKVSTFNLLPVREKYHPKLSPAEVVGH
Sbjct: 361 IPGAGSSSIYTLSRSSKMASKILEQLEKLTSPKEKVSTFNLLPVREKYHPKLSPAEVVGH 420
Query: 421 LKSVKDVDLPRDDKQSNSLLGISYQGNRENTFQHKEKLEKLKSSDPHPNRDLLKDYGSMG 480
LKSVKDVDLPRDDKQSNSLLGISYQGNRENTFQHKEKLEKLKSSDPHPNRDLLKDYGSMG
Sbjct: 421 LKSVKDVDLPRDDKQSNSLLGISYQGNRENTFQHKEKLEKLKSSDPHPNRDLLKDYGSMG 480
Query: 481 SSKDSMNDQGMPESAVVKSTIQPPKDKQAFPMLPDKDSVYQDESSAARVAPATAEVREGD 540
SSKDSMNDQGMPESAVVKSTIQPPKDKQAFPMLPDKDSVYQDESSAARVAPATAEVREGD
Sbjct: 481 SSKDSMNDQGMPESAVVKSTIQPPKDKQAFPMLPDKDSVYQDESSAARVAPATAEVREGD 540
Query: 541 VSLAVRQTTANESLSPARIQKPSEVIVGSSLYGSSDSETFGDSIDDDIDTGLTFQNASSL 600
VSLAVRQTTANESLSPARIQKPSEVIVGSSLYGSSDSETFGDSIDDDIDTGLTFQNASSL
Sbjct: 541 VSLAVRQTTANESLSPARIQKPSEVIVGSSLYGSSDSETFGDSIDDDIDTGLTFQNASSL 600
Query: 601 CTSQPETNDSFGNKNLPENKQIVSPVFSFVNNVSPRKQPIASSAALDIGNKDDSLTELCA 660
CTSQPETNDSFGNKNLPENKQIVSPVFSFVNNVSPRKQPIASSAALDIGNKDDSLTELCA
Sbjct: 601 CTSQPETNDSFGNKNLPENKQIVSPVFSFVNNVSPRKQPIASSAALDIGNKDDSLTELCA 660
Query: 661 DSENVNEPSYPYTQCNPASSNDKLDSSWRTCNDAFSSSVSLSAGLAFSFSSNPGNQSPND 720
DSENVNEPSYPYTQCNPASSNDKLDSSWRTCNDAFSSSVSLSAGLAFSFSSNPGNQSPND
Sbjct: 661 DSENVNEPSYPYTQCNPASSNDKLDSSWRTCNDAFSSSVSLSAGLAFSFSSNPGNQSPND 720
Query: 721 GLSISCPSLYSSYSPSTGFMNRSSSRNIFLSAPYAINNANIITTMASLFSPTTSGAGSYE 780
GLSISCPSLYSSYSPSTGFMNRSSSRNIFLSAPYAINNANIITTMASLFSPTTSGAGSYE
Sbjct: 721 GLSISCPSLYSSYSPSTGFMNRSSSRNIFLSAPYAINNANIITTMASLFSPTTSGAGSYE 780
Query: 781 DEIKQDASLRNVNDTYFSSITTPANSHYSMFSFGSAATPSFVTNLLSKPTVSSATELSAP 840
DEIKQDASLRNVNDTYFSSITTPANSHYSMFSFGSAATPSFVTNLLSKPTVSSATELSAP
Sbjct: 781 DEIKQDASLRNVNDTYFSSITTPANSHYSMFSFGSAATPSFVTNLLSKPTVSSATELSAP 840
Query: 841 DVSVEKEFIANAEKTSMILESSTSHVSSGMAGKASVCCGLSFGCSSPASEQFNSGNRPSE 900
DVSVEKEFIANAEKTSMILESSTSHVSSGMAGKASVCCGLSFGCSSPASEQFNSGNRPSE
Sbjct: 841 DVSVEKEFIANAEKTSMILESSTSHVSSGMAGKASVCCGLSFGCSSPASEQFNSGNRPSE 900
Query: 901 FPITGFTSAHATSTISTSNVSTSSTLLEFESFTGASFSSIRCTTSAAALANSTPVLSNSY 960
FPITGFTSAHATSTISTSNVSTSSTLLEFESFTGASFSSIRCTTSAAALANSTPVLSNSY
Sbjct: 901 FPITGFTSAHATSTISTSNVSTSSTLLEFESFTGASFSSIRCTTSAAALANSTPVLSNSY 960
Query: 961 PKVAFSVSSVNNDCEEQGTSKDNVPLFSQKPKFSFGSGTSELTLFQVGKLENQQTLAEPQ 1020
PKVAFSVSSVNNDCEEQGTSKDNVPLFSQKPKFSFGSGTSELTLFQVGKLENQQTLAEPQ
Sbjct: 961 PKVAFSVSSVNNDCEEQGTSKDNVPLFSQKPKFSFGSGTSELTLFQVGKLENQQTLAEPQ 1020
Query: 1021 NSYPYMAASNSLEAKAGGSFSLNAGGSDKANRRSVKFKRRK 1061
NSYPYMAASNSLEAKAGGSFSLNAGGSDKANRRSVKFKRRK
Sbjct: 1021 NSYPYMAASNSLEAKAGGSFSLNAGGSDKANRRSVKFKRRK 1061
BLAST of CsGy5G025550 vs. NCBI nr
Match:
XP_008446727.1 (PREDICTED: nuclear pore complex protein NUP1 isoform X1 [Cucumis melo] >XP_008446728.1 PREDICTED: nuclear pore complex protein NUP1 isoform X1 [Cucumis melo] >KAA0034635.1 nuclear pore complex protein NUP1 isoform X1 [Cucumis melo var. makuwa])
HSP 1 Score: 1698 bits (4398), Expect = 0.0
Identity = 937/1082 (86.60%), Positives = 984/1082 (90.94%), Query Frame = 0
Query: 1 MVTARQQKN---LEEEDEEGLGRVRKLIDERFVKKSPPKPYDRPPDGIRTSGNNSWILKL 60
MVTARQQKN EE++EE LG V K IDERFVKKSP KPYDRPP+GIRT+GNNSWILKL
Sbjct: 1 MVTARQQKNPEEKEEDEEERLGTVGKFIDERFVKKSPAKPYDRPPNGIRTTGNNSWILKL 60
Query: 61 VDPGQRLISSGSRMLFSSVIRKFPHHLTSRVSSQESSQSRKDDNKVDVTAPFEVRVATNV 120
VDP QRLISSGSRMLFSSVIR FP HLTSRVSSQESSQSRKDD K DVT PFEV+VA NV
Sbjct: 61 VDPAQRLISSGSRMLFSSVIRNFPTHLTSRVSSQESSQSRKDDKKADVTGPFEVQVAFNV 120
Query: 121 GDNRSRSSDQFLMMELEKTLKQKTFTRSEINHLTTLLHSRNGDLPVVHKEKSFKFISSIP 180
GDNRSRSSDQFLMMELEKTLKQKTF+RSEI+HLTTLLHSRNGDLP V++EKSFKFISSIP
Sbjct: 121 GDNRSRSSDQFLMMELEKTLKQKTFSRSEIDHLTTLLHSRNGDLPGVNEEKSFKFISSIP 180
Query: 181 EPNRKEFVKIPNSEVRMGRPSISTPILSSSVLDGDISSPAEVARAYMGSRESKVCPSMRS 240
EPNRKEFVKIPNSEVRMGRPSIS PIL SSVLDGDISSPAEVARAYMGSRESKVCPS RS
Sbjct: 181 EPNRKEFVKIPNSEVRMGRPSISPPILCSSVLDGDISSPAEVARAYMGSRESKVCPSKRS 240
Query: 241 LRAQGLGKNSTDSTSLT------NMLLAPPSISQGLKRRSSFLDNHIRSIVSLRKIRQKP 300
LRAQGLG+NST+STSL+ NMLLAPPSIS+G KRRSSFLDNHI+SIVSLR+IRQKP
Sbjct: 241 LRAQGLGENSTNSTSLSFYSKSNNMLLAPPSISRGSKRRSSFLDNHIKSIVSLRRIRQKP 300
Query: 301 NIHLSKGLSLPISARPISVPVVGLSFDASQSSKFGRTQNFPSCIWNSQLSTKPNKTFARK 360
NIHLSKGLSLPIS VPVVGLSFDASQSSKFGRT+NFPSCIWNSQLS KPNKTFARK
Sbjct: 301 NIHLSKGLSLPIS-----VPVVGLSFDASQSSKFGRTRNFPSCIWNSQLSPKPNKTFARK 360
Query: 361 FITNVESDNIPGAGSSSIYTLSRSSKMASKILEQLEKLTSPKEKVSTFNLLPVREKYHPK 420
FITNV SDNI GA SSIYTL+RSSKMASKILEQLEKLT PKEKVSTFN LPV EKYH K
Sbjct: 361 FITNVGSDNILGASCSSIYTLTRSSKMASKILEQLEKLTPPKEKVSTFNRLPVGEKYHSK 420
Query: 421 LSPAEVVGHLKSVKDVDLPR------DDKQSNSLLGISYQGNRENTFQHKEKLEKLKSSD 480
LSP EVVGHLKSVKDVDLPR DDKQSNSLLGISYQGNREN+FQHKE+LEKLKSSD
Sbjct: 421 LSPPEVVGHLKSVKDVDLPRNEEFVYDDKQSNSLLGISYQGNRENSFQHKERLEKLKSSD 480
Query: 481 PHPNRDLLKDYGSMGSSKDSMNDQGMPESAVVKSTIQPPKDKQAFPMLPDKDSVYQDESS 540
PHP+RDLLKD GS+GS+ DSMNDQGMPESAV KSTIQPPKDKQAFPMLPD+DSV QDESS
Sbjct: 481 PHPSRDLLKDSGSIGSTNDSMNDQGMPESAVGKSTIQPPKDKQAFPMLPDEDSVDQDESS 540
Query: 541 AARVAPATAEVREGDVSLAVRQTTANESLSPARIQKPSEVIVGSSLYGSSDSETFGDSID 600
A RVAPATAEVREGDVSLAVRQTTANES+SPAR+QK SEVIVGSSL GSSDSETFGDSID
Sbjct: 541 ADRVAPATAEVREGDVSLAVRQTTANESVSPARLQKSSEVIVGSSLDGSSDSETFGDSID 600
Query: 601 DDIDTGLTFQNASSLCTSQPETNDSFGNKNLPENKQIVSPVFSFVNNVSPRKQPIASSAA 660
DDIDT LT Q ASSL TSQPE DSFGNK LPENKQIVSPVFSFVNNVSPRKQ IASS A
Sbjct: 601 DDIDTRLTVQIASSLRTSQPEAIDSFGNKILPENKQIVSPVFSFVNNVSPRKQLIASSTA 660
Query: 661 LDIGNKDDSLTELCADSENVNEPSYPYTQCNPASSNDKLDSSWRTCNDAFSSSVSLSAGL 720
LDIGNKDDSLTELCAD EN NEPSYPYTQCNPASSNDKLD SWRTCNDAFSSSVS+SAGL
Sbjct: 661 LDIGNKDDSLTELCADFENGNEPSYPYTQCNPASSNDKLDFSWRTCNDAFSSSVSVSAGL 720
Query: 721 AFSFSSNPGNQSPNDGLSISCPSLYSSYSPSTGFMNRSSSRNIFLSAPYAINNANIITTM 780
AFSFSS PG+QS N+GLSISCPSLYSSYSPSTGFMN+SSSRNIFLSAP AINN NIITT+
Sbjct: 721 AFSFSSTPGHQSLNNGLSISCPSLYSSYSPSTGFMNQSSSRNIFLSAPCAINNTNIITTL 780
Query: 781 ASLFSPTTSGAGSYEDEIKQDASLRNVNDTYFSSITTPANSHYSMFSFGSAATPSFVTNL 840
AS F+ TTSG GSY D+IK+D SLRNVNDTYFSSITTPANSHYSMFSFGSAATPSFVTNL
Sbjct: 781 ASSFASTTSGTGSY-DKIKRDESLRNVNDTYFSSITTPANSHYSMFSFGSAATPSFVTNL 840
Query: 841 LSKPTVSSATELSAPDVSVEKEFIANAEKTSMILESSTSHVSSGMAGKASVCCGLSFGCS 900
LSKPTVSSAT LSA +VSV K+FIANAE+TSMIL SS SHVSSGMAGKAS+CCGLSF CS
Sbjct: 841 LSKPTVSSATGLSAQEVSVGKKFIANAERTSMILGSSMSHVSSGMAGKASLCCGLSFECS 900
Query: 901 SPASEQFNSGNRPSEFPITGFTSAHATSTISTSNVSTSSTLLEFESFTGASFSSIRCTTS 960
SPASE+FNSG+RPSEFPIT FTSA ATSTISTSNVSTSSTLL FESFTGASFSS+RC+TS
Sbjct: 901 SPASERFNSGSRPSEFPITAFTSAPATSTISTSNVSTSSTLLGFESFTGASFSSLRCSTS 960
Query: 961 AAALANSTPVLSNSYPKVAFSVSSVNNDCEEQGTSKDNVPLFSQKPKFSFGSG------T 1020
AAALA+STPVLSNS+PKVAF VSSVNN+CEEQGTSKDNVPLFSQKPKFS GSG T
Sbjct: 961 AAALADSTPVLSNSHPKVAFKVSSVNNNCEEQGTSKDNVPLFSQKPKFSSGSGPSGSAGT 1020
Query: 1021 SELTLFQVGKLENQQTLAEPQNSYPYMAASNSLEAKAGGSFSLNAGGSDKANRRSVKFKR 1061
SELT FQVGK QQTLAEPQNSYPY+AASNSL+AK+GGSFSLNAGGSDKANRR VKFKR
Sbjct: 1021 SELTSFQVGK---QQTLAEPQNSYPYIAASNSLQAKSGGSFSLNAGGSDKANRRFVKFKR 1073
BLAST of CsGy5G025550 vs. NCBI nr
Match:
TYK09186.1 (nuclear pore complex protein NUP1 isoform X1 [Cucumis melo var. makuwa])
HSP 1 Score: 1650 bits (4273), Expect = 0.0
Identity = 921/1094 (84.19%), Positives = 969/1094 (88.57%), Query Frame = 0
Query: 1 MVTARQQKN---LEEEDEEGLGRVRKLIDERFVKKSPPKPYDRPPDGIRTSGNNSWILKL 60
MVTARQQKN EE++EE LG V K IDERFVKKSP KPYDRPP+GIRT+GNNSWILKL
Sbjct: 1 MVTARQQKNPEEKEEDEEERLGTVGKFIDERFVKKSPAKPYDRPPNGIRTTGNNSWILKL 60
Query: 61 VDPGQRLISSGSRMLFSSVIRKFPHHLTSRVSSQESSQSRKDDNKVDVTAPFEVRVATNV 120
VDP QRLISSGSRMLFSSVIR FP HLTSRVSSQESSQSRKDD K DVT PFEV+VA NV
Sbjct: 61 VDPAQRLISSGSRMLFSSVIRNFPTHLTSRVSSQESSQSRKDDKKADVTGPFEVQVAFNV 120
Query: 121 GDNRSRSSDQFLMMELEKTLKQKTFTR------------SEINHLTTLLHSRNGDLPVVH 180
GDNRSRSSDQFLMMELEKTLKQKTF+ SEI+HLTTLLHSRNGDLP V+
Sbjct: 121 GDNRSRSSDQFLMMELEKTLKQKTFSSIVSVTFGASIFWSEIDHLTTLLHSRNGDLPGVN 180
Query: 181 KEKSFKFISSIPEPNRKEFVKIPNSEVRMGRPSISTPILSSSVLDGDISSPAEVARAYMG 240
+EKSFKFISSIPEPNRKEFVKIPNSEV LDGDISSPAEVARAYMG
Sbjct: 181 EEKSFKFISSIPEPNRKEFVKIPNSEV----------------LDGDISSPAEVARAYMG 240
Query: 241 SRESKVCPSMRSLRAQGLGKNSTDSTSLT------NMLLAPPSISQGLKRRSSFLDNHIR 300
SRESKVCPS RSLRAQGLG+NST+STSL+ NMLLAPPSIS+G KRRSSFLDNHI+
Sbjct: 241 SRESKVCPSKRSLRAQGLGENSTNSTSLSFYSKSNNMLLAPPSISRGSKRRSSFLDNHIK 300
Query: 301 SIVSLRKIRQKPNIHLSKGLSLPISARPISVPVVGLSFDASQSSKFGRTQNFPSCIWNSQ 360
SIVSLR+IRQKPNIHLSKGLSLPIS VPVVGLSFDASQSSKFGRT+NFPSCIWNSQ
Sbjct: 301 SIVSLRRIRQKPNIHLSKGLSLPIS-----VPVVGLSFDASQSSKFGRTRNFPSCIWNSQ 360
Query: 361 LSTKPNKTFARKFITNVESDNIPGAGSSSIYTLSRSSKMASKILEQLEKLTSPKEKVSTF 420
LS KPNKTFARKFITNV SDNI GA SSIYTL+RSSKMASKILEQLEKLT PKEKVSTF
Sbjct: 361 LSPKPNKTFARKFITNVGSDNILGASCSSIYTLTRSSKMASKILEQLEKLTPPKEKVSTF 420
Query: 421 NLLPVREKYHPKLSPAEVVGHLKSVKDVDLPR------DDKQSNSLLGISYQGNRENTFQ 480
N LPV EKYH KLSP EVVGHLKSVKDVDLPR DDKQSNSLLGISYQGNREN+FQ
Sbjct: 421 NRLPVGEKYHSKLSPPEVVGHLKSVKDVDLPRNEEFVYDDKQSNSLLGISYQGNRENSFQ 480
Query: 481 HKEKLEKLKSSDPHPNRDLLKDYGSMGSSKDSMNDQGMPESAVVKSTIQPPKDKQAFPML 540
HKE+LEKLKSSDPHP+RDLLKD GS+GS+ DSMNDQGMPESAV KSTIQPPKDKQAFPML
Sbjct: 481 HKERLEKLKSSDPHPSRDLLKDSGSIGSTNDSMNDQGMPESAVGKSTIQPPKDKQAFPML 540
Query: 541 PDKDSVYQDESSAARVAPATAEVREGDVSLAVRQTTANESLSPARIQKPSEVIVGSSLYG 600
PD+DSV QDESSA RVAPATAEVREGDVSLAVRQTTANES+SPAR+QK SEVIVGSSL G
Sbjct: 541 PDEDSVDQDESSADRVAPATAEVREGDVSLAVRQTTANESVSPARLQKSSEVIVGSSLDG 600
Query: 601 SSDSETFGDSIDDDIDTGLTFQNASSLCTSQPETNDSFGNKNLPENKQIVSPVFSFVNNV 660
SSDSETFGDSIDDDIDT LT Q ASSL TSQPE DSFGNK LPENKQIVSPVFSFVN+V
Sbjct: 601 SSDSETFGDSIDDDIDTRLTVQIASSLRTSQPEAIDSFGNKILPENKQIVSPVFSFVNDV 660
Query: 661 SPRKQPIASSAALDIGNKDDSLTELCADSENVNEPSYPYTQCNPASSNDKLDSSWRTCND 720
SPRKQ IASS ALDIGNKDDSLTELCAD EN NEPSYPYTQCNPASSNDKLD SWRTCND
Sbjct: 661 SPRKQLIASSTALDIGNKDDSLTELCADFENGNEPSYPYTQCNPASSNDKLDFSWRTCND 720
Query: 721 AFSSSVSLSAGLAFSFSSNPGNQSPNDGLSISCPSLYSSYSPSTGFMNRSSSRNIFLSAP 780
AFSSSVS+SAGLAFSFSS PG+QS N+GLSISCPSLYSSYSPSTGFMN+SSSRNIFLSAP
Sbjct: 721 AFSSSVSVSAGLAFSFSSTPGHQSLNNGLSISCPSLYSSYSPSTGFMNQSSSRNIFLSAP 780
Query: 781 YAINNANIITTMASLFSPTTSGAGSYEDEIKQDASLRNVNDTYFSSITTPANSHYSMFSF 840
AINN NIITT+AS F+ TTSG GSY D+IK+D SLRNVNDTYFSSITTPANSHYSMFSF
Sbjct: 781 CAINNTNIITTLASSFASTTSGTGSY-DKIKRDESLRNVNDTYFSSITTPANSHYSMFSF 840
Query: 841 GSAATPSFVTNLLSKPTVSSATELSAPDVSVEKEFIANAEKTSMILESSTSHVSSGMAGK 900
GSAATPSFVTNLLSKPTVSSAT LSA +VSV K+FIANAE+TSMIL SS SHVSSGMAGK
Sbjct: 841 GSAATPSFVTNLLSKPTVSSATGLSAQEVSVGKKFIANAERTSMILGSSMSHVSSGMAGK 900
Query: 901 ASVCCGLSFGCSSPASEQFNSGNRPSEFPITGFTSAHATSTISTSNVSTSSTLLEFESFT 960
AS+CCGLSF CSSPASE+FNSG+RPSEFPIT FTSA ATSTISTSNVSTSSTLL FESFT
Sbjct: 901 ASLCCGLSFECSSPASERFNSGSRPSEFPITAFTSAPATSTISTSNVSTSSTLLGFESFT 960
Query: 961 GASFSSIRCTTSAAALANSTPVLSNSYPKVAFSVSSVNNDCEEQGTSKDNVPLFSQKPKF 1020
GASFSS+RC+TSAAALA+STPVLSNS+PKVAF VSSVNN+CEEQGTSKDNVPLFSQKPKF
Sbjct: 961 GASFSSLRCSTSAAALADSTPVLSNSHPKVAFKVSSVNNNCEEQGTSKDNVPLFSQKPKF 1020
Query: 1021 SFGSG------TSELTLFQVGKLENQQTLAEPQNSYPYMAASNSLEAKAGGSFSLNAGGS 1061
S GSG TSELT FQVGK QQTLAEPQNSYPY+AASNSL+AK+GGSFSLNAGGS
Sbjct: 1021 SSGSGPSGSAGTSELTSFQVGK---QQTLAEPQNSYPYIAASNSLQAKSGGSFSLNAGGS 1069
BLAST of CsGy5G025550 vs. NCBI nr
Match:
XP_038893389.1 (nuclear pore complex protein NUP1-like isoform X1 [Benincasa hispida])
HSP 1 Score: 1441 bits (3730), Expect = 0.0
Identity = 819/1084 (75.55%), Positives = 886/1084 (81.73%), Query Frame = 0
Query: 1 MVTARQQKNLEEEDEEGLGRVRKLIDERFVKKSPPKPYDRPPDGIRTSGNNSWILKLVDP 60
M TAR+QK+ E+ EGL V K DERFV+K P KPYDRP +RTSGNNSWILKLVDP
Sbjct: 1 MATAREQKSPVEK--EGLETVGKFRDERFVRKPPVKPYDRPLTTLRTSGNNSWILKLVDP 60
Query: 61 GQRLISSGSRMLFSSVIRKFPHHLTSRVSSQESSQSRKDDNKVDVTAPFEVRVATNVGDN 120
QRLISSGSRMLFSSVIR FPHHLTSRVSSQESSQSRKDD K +V PFEV+V TN GDN
Sbjct: 61 AQRLISSGSRMLFSSVIRNFPHHLTSRVSSQESSQSRKDDKKANVNDPFEVKVVTNEGDN 120
Query: 121 RSRSSDQFLMMELEKTLKQKTFTRSEINHLTTLLHSRNGDLPVVHKEKSFKFISSIPEPN 180
RSRSSDQ LMMELEKTLKQKTFTRSEI+HLTTLLHSRN DLPVV++EK KFISSIPE N
Sbjct: 121 RSRSSDQCLMMELEKTLKQKTFTRSEIDHLTTLLHSRNVDLPVVNEEKRLKFISSIPESN 180
Query: 181 RKEFVKIPNSEVRMGRPSISTPILSSSVLDGDISSPAEVARAYMGSRESKVCPSMRSLRA 240
RKEFVKIPNSEVRMGRP ISTPILSSSVLD DISSPAE+ARAYMGS++ KVCPSM+SLRA
Sbjct: 181 RKEFVKIPNSEVRMGRPLISTPILSSSVLDEDISSPAEIARAYMGSKQPKVCPSMQSLRA 240
Query: 241 QGLGKNSTDSTSL------TNMLLAPPSISQGLKRRSSFLDNHIRSIVSLRKIRQKPNIH 300
QGLG+NS TS+ +MLLAP S SQGLKRRSSF D HI +V LR+ RQKPNIH
Sbjct: 241 QGLGENSAGPTSILFSSKSNDMLLAPSSTSQGLKRRSSFFDKHIGPVVPLRRTRQKPNIH 300
Query: 301 LSKGLSLPISARPISVPVVGLSFDASQSSKFGRTQNFPSCIWNSQLSTKPNKTFARKFIT 360
LSKGLSLP+SARPISVP GL+FDASQSSKFGR QNFPS IWNSQL KP KTF RKF
Sbjct: 301 LSKGLSLPVSARPISVPEDGLNFDASQSSKFGRFQNFPSSIWNSQLPLKPKKTFGRKFTM 360
Query: 361 NVESDNIPGAGSSSIYTLSRSSKMASKILEQLEKLTSPKEKVSTFNLLPVREKYHPKLSP 420
NVE+ NIP AG+ SIYT SRSSK+ASKILEQL+KLT PKEK+STFN LPV EK H KLSP
Sbjct: 361 NVENHNIPVAGTGSIYTPSRSSKIASKILEQLDKLTPPKEKISTFNRLPVGEKSHAKLSP 420
Query: 421 AEVVGHLKSVKDVDLPR------DDKQSNSLLGISYQGNRENTFQHKEKLEKLKSSDPHP 480
V GHL++VKDVDLPR DDKQSNSL GISYQ NRENTFQ+ EKLEKLKSSDPHP
Sbjct: 421 LTVGGHLRNVKDVDLPRNEEFVHDDKQSNSLHGISYQENRENTFQNGEKLEKLKSSDPHP 480
Query: 481 NRDLLKDYGSMGSSKDSMNDQGMPESAVVKSTIQPPKDKQAFPMLPDKDSVYQDESSAAR 540
+ LLKD GS+GS KD MND G+P SAVVKSTI+P KDK+AFPM PDKDSV QDESSA +
Sbjct: 481 SCALLKDTGSIGSCKDCMNDLGVPASAVVKSTIRPLKDKRAFPMSPDKDSVDQDESSADK 540
Query: 541 VAPATAEVREGDVSLAVRQTTANESLSPARIQKPSEVIVGSSLYGSSDSETFGDSIDDDI 600
VAPATAE REGD+SLAVRQTTANE+L+PA+ Q S+VI+GSSL SSD +T DS DDDI
Sbjct: 541 VAPATAEAREGDISLAVRQTTANEALAPAKPQTTSQVIMGSSLNRSSDLKTSDDSFDDDI 600
Query: 601 DTGLTFQNASSLCTSQPETNDSFGNKNLPENKQIVSPVFSFVNNVSPRKQPIASSAALDI 660
D LTFQNAS LCT QPET DSFGNK+LPENKQI S VFSFVNN SP KQP ASS A D+
Sbjct: 601 DARLTFQNAS-LCTLQPETIDSFGNKDLPENKQIDSSVFSFVNNASPLKQPNASSTAFDV 660
Query: 661 GNKDDSLTELCADSENVNEPSYPYTQCNPASSNDKLDSSWRTCNDAFSSSVSLSAGLAFS 720
GNKDDSLTE CA S N +EPSYPYTQCN ASSN KLD SWRTCNDAFSSS S+SAG AFS
Sbjct: 661 GNKDDSLTESCAASANGDEPSYPYTQCNLASSNHKLDCSWRTCNDAFSSSASISAGPAFS 720
Query: 721 FSSNPGNQSPNDGLSISCPSLYSSYSPSTGFMNRSSSRNIFLSAPYAINNANIITTMASL 780
FSS P QS N GLSISCPSL+SSYSPSTGFM++SSSRNIFLSA A NNANI T+ S
Sbjct: 721 FSSTPSYQSLNSGLSISCPSLFSSYSPSTGFMSQSSSRNIFLSATCASNNANITATLPSS 780
Query: 781 FSPTTSGAGSYEDEIKQDASLRNVNDTYFSSITTPANSHYSMFSFGSAATPSFVTNLLSK 840
F P+TSG GSYED+IKQDASL NVNDTYFS ITTPANSHYSMFSF SAA PSFVTNLL
Sbjct: 781 FVPSTSGIGSYEDKIKQDASLHNVNDTYFSCITTPANSHYSMFSFNSAAIPSFVTNLLRA 840
Query: 841 PTVSSATELSAPDVSVEKEFIANAEKTSMILESSTSHVSSGMAGKASVCCGLSFGCSSPA 900
PTVS ATELSA +VS KEF AN+EKTS+IL S SHVSSGMAG CSSPA
Sbjct: 841 PTVSCATELSAEEVSAVKEFTANSEKTSVILGSPMSHVSSGMAG-----------CSSPA 900
Query: 901 SEQFNSGNRPSEFPITGFTSAHATSTISTSNVSTSSTLLEFESFTGASFSSIRCTTSAAA 960
SE FNSG+RPSEFPITGFTSA TSTI SN+STS T L FESFTGASFSS+ TTSAAA
Sbjct: 901 SELFNSGSRPSEFPITGFTSAPETSTIGKSNLSTSGTRLGFESFTGASFSSLNSTTSAAA 960
Query: 961 LANST--PVLSNSYPKVAFSVSSVNNDCEEQGTSKDNVPLFSQKP------KFSFG---S 1020
LA S+ PV+SNS+PKVAF VS NNDCEEQG SKDNVPLFSQKP FSFG +
Sbjct: 961 LAGSSSEPVMSNSHPKVAFRVSLGNNDCEEQGISKDNVPLFSQKPIPPPSSGFSFGPGSA 1020
Query: 1021 GTSELTLFQVGKLENQQTLAEPQNSYPYMAASNSLEAKAGGSFSLNAGGSDKANRRSVKF 1061
GTSEL FQVGK QQTLAEPQNSYPY+A+S+SLEAKA GSFSLNAG SDK+ RR VK
Sbjct: 1021 GTSELNPFQVGK---QQTLAEPQNSYPYIASSSSLEAKAEGSFSLNAGSSDKSKRRFVKV 1067
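Each HSP summary line in these reports gives the number of identical (and positive-scoring) positions over the aligned length, followed by the percentage. The following is a minimal Python sketch, assuming the line layout used on this page, that parses such a line and recomputes the percentages from the counts. The function name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Pattern for an HSP summary line as laid out on this page (an assumption).
# "Posi?tives" also tolerates the misspelling "Postives" seen in some reports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, length, _, pos, pos_len, _ = m.groups()
    return {
        "identical": int(ident),
        "aligned_length": int(length),
        # Percent identity = identical positions / aligned length.
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positives": int(pos),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
    }


stats = parse_hsp_stats(
    "Identity = 819/1084 (75.55%), Positives = 886/1084 (81.73%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Recomputing the percentage from the raw counts, rather than trusting the printed value, is a cheap consistency check when collecting many hits.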
BLAST of CsGy5G025550 vs. NCBI nr
Match:
XP_016900298.1 (PREDICTED: nuclear pore complex protein NUP1 isoform X2 [Cucumis melo])
HSP 1 Score: 1379 bits (3569), Expect = 0.0
Identity = 767/886 (86.57%), Positives = 805/886 (90.86%), Query Frame = 0
Query: 194 MGRPSISTPILSSSVLDGDISSPAEVARAYMGSRESKVCPSMRSLRAQGLGKNSTDSTSL 253
MGRPSIS PIL SSVLDGDISSPAEVARAYMGSRESKVCPS RSLRAQGLG+NST+STSL
Sbjct: 1 MGRPSISPPILCSSVLDGDISSPAEVARAYMGSRESKVCPSKRSLRAQGLGENSTNSTSL 60
Query: 254 T------NMLLAPPSISQGLKRRSSFLDNHIRSIVSLRKIRQKPNIHLSKGLSLPISARP 313
+ NMLLAPPSIS+G KRRSSFLDNHI+SIVSLR+IRQKPNIHLSKGLSLPIS
Sbjct: 61 SFYSKSNNMLLAPPSISRGSKRRSSFLDNHIKSIVSLRRIRQKPNIHLSKGLSLPIS--- 120
Query: 314 ISVPVVGLSFDASQSSKFGRTQNFPSCIWNSQLSTKPNKTFARKFITNVESDNIPGAGSS 373
VPVVGLSFDASQSSKFGRT+NFPSCIWNSQLS KPNKTFARKFITNV SDNI GA S
Sbjct: 121 --VPVVGLSFDASQSSKFGRTRNFPSCIWNSQLSPKPNKTFARKFITNVGSDNILGASCS 180
Query: 374 SIYTLSRSSKMASKILEQLEKLTSPKEKVSTFNLLPVREKYHPKLSPAEVVGHLKSVKDV 433
SIYTL+RSSKMASKILEQLEKLT PKEKVSTFN LPV EKYH KLSP EVVGHLKSVKDV
Sbjct: 181 SIYTLTRSSKMASKILEQLEKLTPPKEKVSTFNRLPVGEKYHSKLSPPEVVGHLKSVKDV 240
Query: 434 DLPR------DDKQSNSLLGISYQGNRENTFQHKEKLEKLKSSDPHPNRDLLKDYGSMGS 493
DLPR DDKQSNSLLGISYQGNREN+FQHKE+LEKLKSSDPHP+RDLLKD GS+GS
Sbjct: 241 DLPRNEEFVYDDKQSNSLLGISYQGNRENSFQHKERLEKLKSSDPHPSRDLLKDSGSIGS 300
Query: 494 SKDSMNDQGMPESAVVKSTIQPPKDKQAFPMLPDKDSVYQDESSAARVAPATAEVREGDV 553
+ DSMNDQGMPESAV KSTIQPPKDKQAFPMLPD+DSV QDESSA RVAPATAEVREGDV
Sbjct: 301 TNDSMNDQGMPESAVGKSTIQPPKDKQAFPMLPDEDSVDQDESSADRVAPATAEVREGDV 360
Query: 554 SLAVRQTTANESLSPARIQKPSEVIVGSSLYGSSDSETFGDSIDDDIDTGLTFQNASSLC 613
SLAVRQTTANES+SPAR+QK SEVIVGSSL GSSDSETFGDSIDDDIDT LT Q ASSL
Sbjct: 361 SLAVRQTTANESVSPARLQKSSEVIVGSSLDGSSDSETFGDSIDDDIDTRLTVQIASSLR 420
Query: 614 TSQPETNDSFGNKNLPENKQIVSPVFSFVNNVSPRKQPIASSAALDIGNKDDSLTELCAD 673
TSQPE DSFGNK LPENKQIVSPVFSFVNNVSPRKQ IASS ALDIGNKDDSLTELCAD
Sbjct: 421 TSQPEAIDSFGNKILPENKQIVSPVFSFVNNVSPRKQLIASSTALDIGNKDDSLTELCAD 480
Query: 674 SENVNEPSYPYTQCNPASSNDKLDSSWRTCNDAFSSSVSLSAGLAFSFSSNPGNQSPNDG 733
EN NEPSYPYTQCNPASSNDKLD SWRTCNDAFSSSVS+SAGLAFSFSS PG+QS N+G
Sbjct: 481 FENGNEPSYPYTQCNPASSNDKLDFSWRTCNDAFSSSVSVSAGLAFSFSSTPGHQSLNNG 540
Query: 734 LSISCPSLYSSYSPSTGFMNRSSSRNIFLSAPYAINNANIITTMASLFSPTTSGAGSYED 793
LSISCPSLYSSYSPSTGFMN+SSSRNIFLSAP AINN NIITT+AS F+ TTSG GSY D
Sbjct: 541 LSISCPSLYSSYSPSTGFMNQSSSRNIFLSAPCAINNTNIITTLASSFASTTSGTGSY-D 600
Query: 794 EIKQDASLRNVNDTYFSSITTPANSHYSMFSFGSAATPSFVTNLLSKPTVSSATELSAPD 853
+IK+D SLRNVNDTYFSSITTPANSHYSMFSFGSAATPSFVTNLLSKPTVSSAT LSA +
Sbjct: 601 KIKRDESLRNVNDTYFSSITTPANSHYSMFSFGSAATPSFVTNLLSKPTVSSATGLSAQE 660
Query: 854 VSVEKEFIANAEKTSMILESSTSHVSSGMAGKASVCCGLSFGCSSPASEQFNSGNRPSEF 913
VSV K+FIANAE+TSMIL SS SHVSSGMAGKAS+CCGLSF CSSPASE+FNSG+RPSEF
Sbjct: 661 VSVGKKFIANAERTSMILGSSMSHVSSGMAGKASLCCGLSFECSSPASERFNSGSRPSEF 720
Query: 914 PITGFTSAHATSTISTSNVSTSSTLLEFESFTGASFSSIRCTTSAAALANSTPVLSNSYP 973
PIT FTSA ATSTISTSNVSTSSTLL FESFTGASFSS+RC+TSAAALA+STPVLSNS+P
Sbjct: 721 PITAFTSAPATSTISTSNVSTSSTLLGFESFTGASFSSLRCSTSAAALADSTPVLSNSHP 780
Query: 974 KVAFSVSSVNNDCEEQGTSKDNVPLFSQKPKFSFGSG------TSELTLFQVGKLENQQT 1033
KVAF VSSVNN+CEEQGTSKDNVPLFSQKPKFS GSG TSELT FQVGK QQT
Sbjct: 781 KVAFKVSSVNNNCEEQGTSKDNVPLFSQKPKFSSGSGPSGSAGTSELTSFQVGK---QQT 840
Query: 1034 LAEPQNSYPYMAASNSLEAKAGGSFSLNAGGSDKANRRSVKFKRRK 1061
LAEPQNSYPY+AASNSL+AK+GGSFSLNAGGSDKANRR VKFKRRK
Sbjct: 841 LAEPQNSYPYIAASNSLQAKSGGSFSLNAGGSDKANRRFVKFKRRK 877
BLAST of CsGy5G025550 vs. ExPASy TrEMBL
Match:
A0A0A0KWL0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G613440 PE=4 SV=1)
HSP 1 Score: 1741 bits (4508), Expect = 0.0
Identity = 931/961 (96.88%), Positives = 931/961 (96.88%), Query Frame = 0
Query: 131 MELEKTLKQKTFTRSEINHLTTLLHSRNGDLPVVHKEKSFKFISSIPEPNRKEFVKIPNS 190
MELEKTLKQKTFTRSEINHLTTLLHSRNGDLPVVHKEKSFKFISSIPEPNRKEFVKIPNS
Sbjct: 1 MELEKTLKQKTFTRSEINHLTTLLHSRNGDLPVVHKEKSFKFISSIPEPNRKEFVKIPNS 60
Query: 191 EVRMGRPSISTPILSSSVLDGDISSPAEVARAYMGSRESKVCPSMRSLRAQGLGKNSTDS 250
EVRMGRPSISTPILSSSVLDGDISSPAEVARAYMGSRESKVCPSMRSLRAQGLGKNSTDS
Sbjct: 61 EVRMGRPSISTPILSSSVLDGDISSPAEVARAYMGSRESKVCPSMRSLRAQGLGKNSTDS 120
Query: 251 TSLTNMLLAPPSISQ------------------------------GLKRRSSFLDNHIRS 310
TSLTNMLLAPPSISQ GLKRRSSFLDNHIRS
Sbjct: 121 TSLTNMLLAPPSISQFIQHSLARFYETPSVFSWSGFRVEITDNENGLKRRSSFLDNHIRS 180
Query: 311 IVSLRKIRQKPNIHLSKGLSLPISARPISVPVVGLSFDASQSSKFGRTQNFPSCIWNSQL 370
IVSLRKIRQKPNIHLSKGLSLPISARPISVPVVGLSFDASQSSKFGRTQNFPSCIWNSQL
Sbjct: 181 IVSLRKIRQKPNIHLSKGLSLPISARPISVPVVGLSFDASQSSKFGRTQNFPSCIWNSQL 240
Query: 371 STKPNKTFARKFITNVESDNIPGAGSSSIYTLSRSSKMASKILEQLEKLTSPKEKVSTFN 430
STKPNKTFARKFITNVESDNIPGAGSSSIYTLSRSSKMASKILEQLEKLTSPKEKVSTFN
Sbjct: 241 STKPNKTFARKFITNVESDNIPGAGSSSIYTLSRSSKMASKILEQLEKLTSPKEKVSTFN 300
Query: 431 LLPVREKYHPKLSPAEVVGHLKSVKDVDLPRDDKQSNSLLGISYQGNRENTFQHKEKLEK 490
LLPVREKYHPKLSPAEVVGHLKSVKDVDLPRDDKQSNSLLGISYQGNRENTFQHKEKLEK
Sbjct: 301 LLPVREKYHPKLSPAEVVGHLKSVKDVDLPRDDKQSNSLLGISYQGNRENTFQHKEKLEK 360
Query: 491 LKSSDPHPNRDLLKDYGSMGSSKDSMNDQGMPESAVVKSTIQPPKDKQAFPMLPDKDSVY 550
LKSSDPHPNRDLLKDYGSMGSSKDSMNDQGMPESAVVKSTIQPPKDKQAFPMLPDKDSVY
Sbjct: 361 LKSSDPHPNRDLLKDYGSMGSSKDSMNDQGMPESAVVKSTIQPPKDKQAFPMLPDKDSVY 420
Query: 551 QDESSAARVAPATAEVREGDVSLAVRQTTANESLSPARIQKPSEVIVGSSLYGSSDSETF 610
QDESSAARVAPATAEVREGDVSLAVRQTTANESLSPARIQKPSEVIVGSSLYGSSDSETF
Sbjct: 421 QDESSAARVAPATAEVREGDVSLAVRQTTANESLSPARIQKPSEVIVGSSLYGSSDSETF 480
Query: 611 GDSIDDDIDTGLTFQNASSLCTSQPETNDSFGNKNLPENKQIVSPVFSFVNNVSPRKQPI 670
GDSIDDDIDTGLTFQNASSLCTSQPETNDSFGNKNLPENKQIVSPVFSFVNNVSPRKQPI
Sbjct: 481 GDSIDDDIDTGLTFQNASSLCTSQPETNDSFGNKNLPENKQIVSPVFSFVNNVSPRKQPI 540
Query: 671 ASSAALDIGNKDDSLTELCADSENVNEPSYPYTQCNPASSNDKLDSSWRTCNDAFSSSVS 730
ASSAALDIGNKDDSLTELCADSENVNEPSYPYTQCNPASSNDKLDSSWRTCNDAFSSSVS
Sbjct: 541 ASSAALDIGNKDDSLTELCADSENVNEPSYPYTQCNPASSNDKLDSSWRTCNDAFSSSVS 600
Query: 731 LSAGLAFSFSSNPGNQSPNDGLSISCPSLYSSYSPSTGFMNRSSSRNIFLSAPYAINNAN 790
LSAGLAFSFSSNPGNQSPNDGLSISCPSLYSSYSPSTGFMNRSSSRNIFLSAPYAINNAN
Sbjct: 601 LSAGLAFSFSSNPGNQSPNDGLSISCPSLYSSYSPSTGFMNRSSSRNIFLSAPYAINNAN 660
Query: 791 IITTMASLFSPTTSGAGSYEDEIKQDASLRNVNDTYFSSITTPANSHYSMFSFGSAATPS 850
IITTMASLFSPTTSGAGSYEDEIKQDASLRNVNDTYFSSITTPANSHYSMFSFGSAATPS
Sbjct: 661 IITTMASLFSPTTSGAGSYEDEIKQDASLRNVNDTYFSSITTPANSHYSMFSFGSAATPS 720
Query: 851 FVTNLLSKPTVSSATELSAPDVSVEKEFIANAEKTSMILESSTSHVSSGMAGKASVCCGL 910
FVTNLLSKPTVSSATELSAPDVSVEKEFIANAEKTSMILESSTSHVSSGMAGKASVCCGL
Sbjct: 721 FVTNLLSKPTVSSATELSAPDVSVEKEFIANAEKTSMILESSTSHVSSGMAGKASVCCGL 780
Query: 911 SFGCSSPASEQFNSGNRPSEFPITGFTSAHATSTISTSNVSTSSTLLEFESFTGASFSSI 970
SFGCSSPASEQFNSGNRPSEFPITGFTSAHATSTISTSNVSTSSTLLEFESFTGASFSSI
Sbjct: 781 SFGCSSPASEQFNSGNRPSEFPITGFTSAHATSTISTSNVSTSSTLLEFESFTGASFSSI 840
Query: 971 RCTTSAAALANSTPVLSNSYPKVAFSVSSVNNDCEEQGTSKDNVPLFSQKPKFSFGSGTS 1030
RCTTSAAALANSTPVLSNSYPKVAFSVSSVNNDCEEQGTSKDNVPLFSQKPKFSFGSGTS
Sbjct: 841 RCTTSAAALANSTPVLSNSYPKVAFSVSSVNNDCEEQGTSKDNVPLFSQKPKFSFGSGTS 900
Query: 1031 ELTLFQVGKLENQQTLAEPQNSYPYMAASNSLEAKAGGSFSLNAGGSDKANRRSVKFKRR 1061
ELTLFQVGKLENQQTLAEPQNSYPYMAASNSLEAKAGGSFSLNAGGSDKANRRSVKFKRR
Sbjct: 901 ELTLFQVGKLENQQTLAEPQNSYPYMAASNSLEAKAGGSFSLNAGGSDKANRRSVKFKRR 960
BLAST of CsGy5G025550 vs. ExPASy TrEMBL
Match:
A0A1S3BFS9 (nuclear pore complex protein NUP1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103489364 PE=4 SV=1)
HSP 1 Score: 1698 bits (4398), Expect = 0.0
Identity = 937/1082 (86.60%), Positives = 984/1082 (90.94%), Query Frame = 0
Query: 1 MVTARQQKN---LEEEDEEGLGRVRKLIDERFVKKSPPKPYDRPPDGIRTSGNNSWILKL 60
MVTARQQKN EE++EE LG V K IDERFVKKSP KPYDRPP+GIRT+GNNSWILKL
Sbjct: 1 MVTARQQKNPEEKEEDEEERLGTVGKFIDERFVKKSPAKPYDRPPNGIRTTGNNSWILKL 60
Query: 61 VDPGQRLISSGSRMLFSSVIRKFPHHLTSRVSSQESSQSRKDDNKVDVTAPFEVRVATNV 120
VDP QRLISSGSRMLFSSVIR FP HLTSRVSSQESSQSRKDD K DVT PFEV+VA NV
Sbjct: 61 VDPAQRLISSGSRMLFSSVIRNFPTHLTSRVSSQESSQSRKDDKKADVTGPFEVQVAFNV 120
Query: 121 GDNRSRSSDQFLMMELEKTLKQKTFTRSEINHLTTLLHSRNGDLPVVHKEKSFKFISSIP 180
GDNRSRSSDQFLMMELEKTLKQKTF+RSEI+HLTTLLHSRNGDLP V++EKSFKFISSIP
Sbjct: 121 GDNRSRSSDQFLMMELEKTLKQKTFSRSEIDHLTTLLHSRNGDLPGVNEEKSFKFISSIP 180
Query: 181 EPNRKEFVKIPNSEVRMGRPSISTPILSSSVLDGDISSPAEVARAYMGSRESKVCPSMRS 240
EPNRKEFVKIPNSEVRMGRPSIS PIL SSVLDGDISSPAEVARAYMGSRESKVCPS RS
Sbjct: 181 EPNRKEFVKIPNSEVRMGRPSISPPILCSSVLDGDISSPAEVARAYMGSRESKVCPSKRS 240
Query: 241 LRAQGLGKNSTDSTSLT------NMLLAPPSISQGLKRRSSFLDNHIRSIVSLRKIRQKP 300
LRAQGLG+NST+STSL+ NMLLAPPSIS+G KRRSSFLDNHI+SIVSLR+IRQKP
Sbjct: 241 LRAQGLGENSTNSTSLSFYSKSNNMLLAPPSISRGSKRRSSFLDNHIKSIVSLRRIRQKP 300
Query: 301 NIHLSKGLSLPISARPISVPVVGLSFDASQSSKFGRTQNFPSCIWNSQLSTKPNKTFARK 360
NIHLSKGLSLPIS VPVVGLSFDASQSSKFGRT+NFPSCIWNSQLS KPNKTFARK
Sbjct: 301 NIHLSKGLSLPIS-----VPVVGLSFDASQSSKFGRTRNFPSCIWNSQLSPKPNKTFARK 360
Query: 361 FITNVESDNIPGAGSSSIYTLSRSSKMASKILEQLEKLTSPKEKVSTFNLLPVREKYHPK 420
FITNV SDNI GA SSIYTL+RSSKMASKILEQLEKLT PKEKVSTFN LPV EKYH K
Sbjct: 361 FITNVGSDNILGASCSSIYTLTRSSKMASKILEQLEKLTPPKEKVSTFNRLPVGEKYHSK 420
Query: 421 LSPAEVVGHLKSVKDVDLPR------DDKQSNSLLGISYQGNRENTFQHKEKLEKLKSSD 480
LSP EVVGHLKSVKDVDLPR DDKQSNSLLGISYQGNREN+FQHKE+LEKLKSSD
Sbjct: 421 LSPPEVVGHLKSVKDVDLPRNEEFVYDDKQSNSLLGISYQGNRENSFQHKERLEKLKSSD 480
Query: 481 PHPNRDLLKDYGSMGSSKDSMNDQGMPESAVVKSTIQPPKDKQAFPMLPDKDSVYQDESS 540
PHP+RDLLKD GS+GS+ DSMNDQGMPESAV KSTIQPPKDKQAFPMLPD+DSV QDESS
Sbjct: 481 PHPSRDLLKDSGSIGSTNDSMNDQGMPESAVGKSTIQPPKDKQAFPMLPDEDSVDQDESS 540
Query: 541 AARVAPATAEVREGDVSLAVRQTTANESLSPARIQKPSEVIVGSSLYGSSDSETFGDSID 600
A RVAPATAEVREGDVSLAVRQTTANES+SPAR+QK SEVIVGSSL GSSDSETFGDSID
Sbjct: 541 ADRVAPATAEVREGDVSLAVRQTTANESVSPARLQKSSEVIVGSSLDGSSDSETFGDSID 600
Query: 601 DDIDTGLTFQNASSLCTSQPETNDSFGNKNLPENKQIVSPVFSFVNNVSPRKQPIASSAA 660
DDIDT LT Q ASSL TSQPE DSFGNK LPENKQIVSPVFSFVNNVSPRKQ IASS A
Sbjct: 601 DDIDTRLTVQIASSLRTSQPEAIDSFGNKILPENKQIVSPVFSFVNNVSPRKQLIASSTA 660
Query: 661 LDIGNKDDSLTELCADSENVNEPSYPYTQCNPASSNDKLDSSWRTCNDAFSSSVSLSAGL 720
LDIGNKDDSLTELCAD EN NEPSYPYTQCNPASSNDKLD SWRTCNDAFSSSVS+SAGL
Sbjct: 661 LDIGNKDDSLTELCADFENGNEPSYPYTQCNPASSNDKLDFSWRTCNDAFSSSVSVSAGL 720
Query: 721 AFSFSSNPGNQSPNDGLSISCPSLYSSYSPSTGFMNRSSSRNIFLSAPYAINNANIITTM 780
AFSFSS PG+QS N+GLSISCPSLYSSYSPSTGFMN+SSSRNIFLSAP AINN NIITT+
Sbjct: 721 AFSFSSTPGHQSLNNGLSISCPSLYSSYSPSTGFMNQSSSRNIFLSAPCAINNTNIITTL 780
Query: 781 ASLFSPTTSGAGSYEDEIKQDASLRNVNDTYFSSITTPANSHYSMFSFGSAATPSFVTNL 840
AS F+ TTSG GSY D+IK+D SLRNVNDTYFSSITTPANSHYSMFSFGSAATPSFVTNL
Sbjct: 781 ASSFASTTSGTGSY-DKIKRDESLRNVNDTYFSSITTPANSHYSMFSFGSAATPSFVTNL 840
Query: 841 LSKPTVSSATELSAPDVSVEKEFIANAEKTSMILESSTSHVSSGMAGKASVCCGLSFGCS 900
LSKPTVSSAT LSA +VSV K+FIANAE+TSMIL SS SHVSSGMAGKAS+CCGLSF CS
Sbjct: 841 LSKPTVSSATGLSAQEVSVGKKFIANAERTSMILGSSMSHVSSGMAGKASLCCGLSFECS 900
Query: 901 SPASEQFNSGNRPSEFPITGFTSAHATSTISTSNVSTSSTLLEFESFTGASFSSIRCTTS 960
SPASE+FNSG+RPSEFPIT FTSA ATSTISTSNVSTSSTLL FESFTGASFSS+RC+TS
Sbjct: 901 SPASERFNSGSRPSEFPITAFTSAPATSTISTSNVSTSSTLLGFESFTGASFSSLRCSTS 960
Query: 961 AAALANSTPVLSNSYPKVAFSVSSVNNDCEEQGTSKDNVPLFSQKPKFSFGSG------T 1020
AAALA+STPVLSNS+PKVAF VSSVNN+CEEQGTSKDNVPLFSQKPKFS GSG T
Sbjct: 961 AAALADSTPVLSNSHPKVAFKVSSVNNNCEEQGTSKDNVPLFSQKPKFSSGSGPSGSAGT 1020
Query: 1021 SELTLFQVGKLENQQTLAEPQNSYPYMAASNSLEAKAGGSFSLNAGGSDKANRRSVKFKR 1061
SELT FQVGK QQTLAEPQNSYPY+AASNSL+AK+GGSFSLNAGGSDKANRR VKFKR
Sbjct: 1021 SELTSFQVGK---QQTLAEPQNSYPYIAASNSLQAKSGGSFSLNAGGSDKANRRFVKFKR 1073
BLAST of CsGy5G025550 vs. ExPASy TrEMBL
Match:
A0A5A7SZK9 (Nuclear pore complex protein NUP1 isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold65G006520 PE=4 SV=1)
HSP 1 Score: 1698 bits (4398), Expect = 0.0
Identity = 937/1082 (86.60%), Positives = 984/1082 (90.94%), Query Frame = 0
Query: 1 MVTARQQKN---LEEEDEEGLGRVRKLIDERFVKKSPPKPYDRPPDGIRTSGNNSWILKL 60
MVTARQQKN EE++EE LG V K IDERFVKKSP KPYDRPP+GIRT+GNNSWILKL
Sbjct: 1 MVTARQQKNPEEKEEDEEERLGTVGKFIDERFVKKSPAKPYDRPPNGIRTTGNNSWILKL 60
Query: 61 VDPGQRLISSGSRMLFSSVIRKFPHHLTSRVSSQESSQSRKDDNKVDVTAPFEVRVATNV 120
VDP QRLISSGSRMLFSSVIR FP HLTSRVSSQESSQSRKDD K DVT PFEV+VA NV
Sbjct: 61 VDPAQRLISSGSRMLFSSVIRNFPTHLTSRVSSQESSQSRKDDKKADVTGPFEVQVAFNV 120
Query: 121 GDNRSRSSDQFLMMELEKTLKQKTFTRSEINHLTTLLHSRNGDLPVVHKEKSFKFISSIP 180
GDNRSRSSDQFLMMELEKTLKQKTF+RSEI+HLTTLLHSRNGDLP V++EKSFKFISSIP
Sbjct: 121 GDNRSRSSDQFLMMELEKTLKQKTFSRSEIDHLTTLLHSRNGDLPGVNEEKSFKFISSIP 180
Query: 181 EPNRKEFVKIPNSEVRMGRPSISTPILSSSVLDGDISSPAEVARAYMGSRESKVCPSMRS 240
EPNRKEFVKIPNSEVRMGRPSIS PIL SSVLDGDISSPAEVARAYMGSRESKVCPS RS
Sbjct: 181 EPNRKEFVKIPNSEVRMGRPSISPPILCSSVLDGDISSPAEVARAYMGSRESKVCPSKRS 240
Query: 241 LRAQGLGKNSTDSTSLT------NMLLAPPSISQGLKRRSSFLDNHIRSIVSLRKIRQKP 300
LRAQGLG+NST+STSL+ NMLLAPPSIS+G KRRSSFLDNHI+SIVSLR+IRQKP
Sbjct: 241 LRAQGLGENSTNSTSLSFYSKSNNMLLAPPSISRGSKRRSSFLDNHIKSIVSLRRIRQKP 300
Query: 301 NIHLSKGLSLPISARPISVPVVGLSFDASQSSKFGRTQNFPSCIWNSQLSTKPNKTFARK 360
NIHLSKGLSLPIS VPVVGLSFDASQSSKFGRT+NFPSCIWNSQLS KPNKTFARK
Sbjct: 301 NIHLSKGLSLPIS-----VPVVGLSFDASQSSKFGRTRNFPSCIWNSQLSPKPNKTFARK 360
Query: 361 FITNVESDNIPGAGSSSIYTLSRSSKMASKILEQLEKLTSPKEKVSTFNLLPVREKYHPK 420
FITNV SDNI GA SSIYTL+RSSKMASKILEQLEKLT PKEKVSTFN LPV EKYH K
Sbjct: 361 FITNVGSDNILGASCSSIYTLTRSSKMASKILEQLEKLTPPKEKVSTFNRLPVGEKYHSK 420
Query: 421 LSPAEVVGHLKSVKDVDLPR------DDKQSNSLLGISYQGNRENTFQHKEKLEKLKSSD 480
LSP EVVGHLKSVKDVDLPR DDKQSNSLLGISYQGNREN+FQHKE+LEKLKSSD
Sbjct: 421 LSPPEVVGHLKSVKDVDLPRNEEFVYDDKQSNSLLGISYQGNRENSFQHKERLEKLKSSD 480
Query: 481 PHPNRDLLKDYGSMGSSKDSMNDQGMPESAVVKSTIQPPKDKQAFPMLPDKDSVYQDESS 540
PHP+RDLLKD GS+GS+ DSMNDQGMPESAV KSTIQPPKDKQAFPMLPD+DSV QDESS
Sbjct: 481 PHPSRDLLKDSGSIGSTNDSMNDQGMPESAVGKSTIQPPKDKQAFPMLPDEDSVDQDESS 540
Query: 541 AARVAPATAEVREGDVSLAVRQTTANESLSPARIQKPSEVIVGSSLYGSSDSETFGDSID 600
A RVAPATAEVREGDVSLAVRQTTANES+SPAR+QK SEVIVGSSL GSSDSETFGDSID
Sbjct: 541 ADRVAPATAEVREGDVSLAVRQTTANESVSPARLQKSSEVIVGSSLDGSSDSETFGDSID 600
Query: 601 DDIDTGLTFQNASSLCTSQPETNDSFGNKNLPENKQIVSPVFSFVNNVSPRKQPIASSAA 660
DDIDT LT Q ASSL TSQPE DSFGNK LPENKQIVSPVFSFVNNVSPRKQ IASS A
Sbjct: 601 DDIDTRLTVQIASSLRTSQPEAIDSFGNKILPENKQIVSPVFSFVNNVSPRKQLIASSTA 660
Query: 661 LDIGNKDDSLTELCADSENVNEPSYPYTQCNPASSNDKLDSSWRTCNDAFSSSVSLSAGL 720
LDIGNKDDSLTELCAD EN NEPSYPYTQCNPASSNDKLD SWRTCNDAFSSSVS+SAGL
Sbjct: 661 LDIGNKDDSLTELCADFENGNEPSYPYTQCNPASSNDKLDFSWRTCNDAFSSSVSVSAGL 720
Query: 721 AFSFSSNPGNQSPNDGLSISCPSLYSSYSPSTGFMNRSSSRNIFLSAPYAINNANIITTM 780
AFSFSS PG+QS N+GLSISCPSLYSSYSPSTGFMN+SSSRNIFLSAP AINN NIITT+
Sbjct: 721 AFSFSSTPGHQSLNNGLSISCPSLYSSYSPSTGFMNQSSSRNIFLSAPCAINNTNIITTL 780
Query: 781 ASLFSPTTSGAGSYEDEIKQDASLRNVNDTYFSSITTPANSHYSMFSFGSAATPSFVTNL 840
AS F+ TTSG GSY D+IK+D SLRNVNDTYFSSITTPANSHYSMFSFGSAATPSFVTNL
Sbjct: 781 ASSFASTTSGTGSY-DKIKRDESLRNVNDTYFSSITTPANSHYSMFSFGSAATPSFVTNL 840
Query: 841 LSKPTVSSATELSAPDVSVEKEFIANAEKTSMILESSTSHVSSGMAGKASVCCGLSFGCS 900
LSKPTVSSAT LSA +VSV K+FIANAE+TSMIL SS SHVSSGMAGKAS+CCGLSF CS
Sbjct: 841 LSKPTVSSATGLSAQEVSVGKKFIANAERTSMILGSSMSHVSSGMAGKASLCCGLSFECS 900
Query: 901 SPASEQFNSGNRPSEFPITGFTSAHATSTISTSNVSTSSTLLEFESFTGASFSSIRCTTS 960
SPASE+FNSG+RPSEFPIT FTSA ATSTISTSNVSTSSTLL FESFTGASFSS+RC+TS
Sbjct: 901 SPASERFNSGSRPSEFPITAFTSAPATSTISTSNVSTSSTLLGFESFTGASFSSLRCSTS 960
Query: 961 AAALANSTPVLSNSYPKVAFSVSSVNNDCEEQGTSKDNVPLFSQKPKFSFGSG------T 1020
AAALA+STPVLSNS+PKVAF VSSVNN+CEEQGTSKDNVPLFSQKPKFS GSG T
Sbjct: 961 AAALADSTPVLSNSHPKVAFKVSSVNNNCEEQGTSKDNVPLFSQKPKFSSGSGPSGSAGT 1020
Query: 1021 SELTLFQVGKLENQQTLAEPQNSYPYMAASNSLEAKAGGSFSLNAGGSDKANRRSVKFKR 1061
SELT FQVGK QQTLAEPQNSYPY+AASNSL+AK+GGSFSLNAGGSDKANRR VKFKR
Sbjct: 1021 SELTSFQVGK---QQTLAEPQNSYPYIAASNSLQAKSGGSFSLNAGGSDKANRRFVKFKR 1073
BLAST of CsGy5G025550 vs. ExPASy TrEMBL
Match:
A0A5D3CFP1 (Nuclear pore complex protein NUP1 isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold475G001780 PE=4 SV=1)
HSP 1 Score: 1650 bits (4273), Expect = 0.0
Identity = 921/1094 (84.19%), Positives = 969/1094 (88.57%), Query Frame = 0
Query: 1 MVTARQQKN---LEEEDEEGLGRVRKLIDERFVKKSPPKPYDRPPDGIRTSGNNSWILKL 60
MVTARQQKN EE++EE LG V K IDERFVKKSP KPYDRPP+GIRT+GNNSWILKL
Sbjct: 1 MVTARQQKNPEEKEEDEEERLGTVGKFIDERFVKKSPAKPYDRPPNGIRTTGNNSWILKL 60
Query: 61 VDPGQRLISSGSRMLFSSVIRKFPHHLTSRVSSQESSQSRKDDNKVDVTAPFEVRVATNV 120
VDP QRLISSGSRMLFSSVIR FP HLTSRVSSQESSQSRKDD K DVT PFEV+VA NV
Sbjct: 61 VDPAQRLISSGSRMLFSSVIRNFPTHLTSRVSSQESSQSRKDDKKADVTGPFEVQVAFNV 120
Query: 121 GDNRSRSSDQFLMMELEKTLKQKTFTR------------SEINHLTTLLHSRNGDLPVVH 180
GDNRSRSSDQFLMMELEKTLKQKTF+ SEI+HLTTLLHSRNGDLP V+
Sbjct: 121 GDNRSRSSDQFLMMELEKTLKQKTFSSIVSVTFGASIFWSEIDHLTTLLHSRNGDLPGVN 180
Query: 181 KEKSFKFISSIPEPNRKEFVKIPNSEVRMGRPSISTPILSSSVLDGDISSPAEVARAYMG 240
+EKSFKFISSIPEPNRKEFVKIPNSEV LDGDISSPAEVARAYMG
Sbjct: 181 EEKSFKFISSIPEPNRKEFVKIPNSEV----------------LDGDISSPAEVARAYMG 240
Query: 241 SRESKVCPSMRSLRAQGLGKNSTDSTSLT------NMLLAPPSISQGLKRRSSFLDNHIR 300
SRESKVCPS RSLRAQGLG+NST+STSL+ NMLLAPPSIS+G KRRSSFLDNHI+
Sbjct: 241 SRESKVCPSKRSLRAQGLGENSTNSTSLSFYSKSNNMLLAPPSISRGSKRRSSFLDNHIK 300
Query: 301 SIVSLRKIRQKPNIHLSKGLSLPISARPISVPVVGLSFDASQSSKFGRTQNFPSCIWNSQ 360
SIVSLR+IRQKPNIHLSKGLSLPIS VPVVGLSFDASQSSKFGRT+NFPSCIWNSQ
Sbjct: 301 SIVSLRRIRQKPNIHLSKGLSLPIS-----VPVVGLSFDASQSSKFGRTRNFPSCIWNSQ 360
Query: 361 LSTKPNKTFARKFITNVESDNIPGAGSSSIYTLSRSSKMASKILEQLEKLTSPKEKVSTF 420
LS KPNKTFARKFITNV SDNI GA SSIYTL+RSSKMASKILEQLEKLT PKEKVSTF
Sbjct: 361 LSPKPNKTFARKFITNVGSDNILGASCSSIYTLTRSSKMASKILEQLEKLTPPKEKVSTF 420
Query: 421 NLLPVREKYHPKLSPAEVVGHLKSVKDVDLPR------DDKQSNSLLGISYQGNRENTFQ 480
N LPV EKYH KLSP EVVGHLKSVKDVDLPR DDKQSNSLLGISYQGNREN+FQ
Sbjct: 421 NRLPVGEKYHSKLSPPEVVGHLKSVKDVDLPRNEEFVYDDKQSNSLLGISYQGNRENSFQ 480
Query: 481 HKEKLEKLKSSDPHPNRDLLKDYGSMGSSKDSMNDQGMPESAVVKSTIQPPKDKQAFPML 540
HKE+LEKLKSSDPHP+RDLLKD GS+GS+ DSMNDQGMPESAV KSTIQPPKDKQAFPML
Sbjct: 481 HKERLEKLKSSDPHPSRDLLKDSGSIGSTNDSMNDQGMPESAVGKSTIQPPKDKQAFPML 540
Query: 541 PDKDSVYQDESSAARVAPATAEVREGDVSLAVRQTTANESLSPARIQKPSEVIVGSSLYG 600
PD+DSV QDESSA RVAPATAEVREGDVSLAVRQTTANES+SPAR+QK SEVIVGSSL G
Sbjct: 541 PDEDSVDQDESSADRVAPATAEVREGDVSLAVRQTTANESVSPARLQKSSEVIVGSSLDG 600
Query: 601 SSDSETFGDSIDDDIDTGLTFQNASSLCTSQPETNDSFGNKNLPENKQIVSPVFSFVNNV 660
SSDSETFGDSIDDDIDT LT Q ASSL TSQPE DSFGNK LPENKQIVSPVFSFVN+V
Sbjct: 601 SSDSETFGDSIDDDIDTRLTVQIASSLRTSQPEAIDSFGNKILPENKQIVSPVFSFVNDV 660
Query: 661 SPRKQPIASSAALDIGNKDDSLTELCADSENVNEPSYPYTQCNPASSNDKLDSSWRTCND 720
SPRKQ IASS ALDIGNKDDSLTELCAD EN NEPSYPYTQCNPASSNDKLD SWRTCND
Sbjct: 661 SPRKQLIASSTALDIGNKDDSLTELCADFENGNEPSYPYTQCNPASSNDKLDFSWRTCND 720
Query: 721 AFSSSVSLSAGLAFSFSSNPGNQSPNDGLSISCPSLYSSYSPSTGFMNRSSSRNIFLSAP 780
AFSSSVS+SAGLAFSFSS PG+QS N+GLSISCPSLYSSYSPSTGFMN+SSSRNIFLSAP
Sbjct: 721 AFSSSVSVSAGLAFSFSSTPGHQSLNNGLSISCPSLYSSYSPSTGFMNQSSSRNIFLSAP 780
Query: 781 YAINNANIITTMASLFSPTTSGAGSYEDEIKQDASLRNVNDTYFSSITTPANSHYSMFSF 840
AINN NIITT+AS F+ TTSG GSY D+IK+D SLRNVNDTYFSSITTPANSHYSMFSF
Sbjct: 781 CAINNTNIITTLASSFASTTSGTGSY-DKIKRDESLRNVNDTYFSSITTPANSHYSMFSF 840
Query: 841 GSAATPSFVTNLLSKPTVSSATELSAPDVSVEKEFIANAEKTSMILESSTSHVSSGMAGK 900
GSAATPSFVTNLLSKPTVSSAT LSA +VSV K+FIANAE+TSMIL SS SHVSSGMAGK
Sbjct: 841 GSAATPSFVTNLLSKPTVSSATGLSAQEVSVGKKFIANAERTSMILGSSMSHVSSGMAGK 900
Query: 901 ASVCCGLSFGCSSPASEQFNSGNRPSEFPITGFTSAHATSTISTSNVSTSSTLLEFESFT 960
AS+CCGLSF CSSPASE+FNSG+RPSEFPIT FTSA ATSTISTSNVSTSSTLL FESFT
Sbjct: 901 ASLCCGLSFECSSPASERFNSGSRPSEFPITAFTSAPATSTISTSNVSTSSTLLGFESFT 960
Query: 961 GASFSSIRCTTSAAALANSTPVLSNSYPKVAFSVSSVNNDCEEQGTSKDNVPLFSQKPKF 1020
GASFSS+RC+TSAAALA+STPVLSNS+PKVAF VSSVNN+CEEQGTSKDNVPLFSQKPKF
Sbjct: 961 GASFSSLRCSTSAAALADSTPVLSNSHPKVAFKVSSVNNNCEEQGTSKDNVPLFSQKPKF 1020
Query: 1021 SFGSG------TSELTLFQVGKLENQQTLAEPQNSYPYMAASNSLEAKAGGSFSLNAGGS 1061
S GSG TSELT FQVGK QQTLAEPQNSYPY+AASNSL+AK+GGSFSLNAGGS
Sbjct: 1021 SSGSGPSGSAGTSELTSFQVGK---QQTLAEPQNSYPYIAASNSLQAKSGGSFSLNAGGS 1069
BLAST of CsGy5G025550 vs. ExPASy TrEMBL
Match:
A0A1S4DWF1 (nuclear pore complex protein NUP1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103489364 PE=4 SV=1)
HSP 1 Score: 1379 bits (3569), Expect = 0.0
Identity = 767/886 (86.57%), Positives = 805/886 (90.86%), Query Frame = 0
Query: 194 MGRPSISTPILSSSVLDGDISSPAEVARAYMGSRESKVCPSMRSLRAQGLGKNSTDSTSL 253
MGRPSIS PIL SSVLDGDISSPAEVARAYMGSRESKVCPS RSLRAQGLG+NST+STSL
Sbjct: 1 MGRPSISPPILCSSVLDGDISSPAEVARAYMGSRESKVCPSKRSLRAQGLGENSTNSTSL 60
Query: 254 T------NMLLAPPSISQGLKRRSSFLDNHIRSIVSLRKIRQKPNIHLSKGLSLPISARP 313
+ NMLLAPPSIS+G KRRSSFLDNHI+SIVSLR+IRQKPNIHLSKGLSLPIS
Sbjct: 61 SFYSKSNNMLLAPPSISRGSKRRSSFLDNHIKSIVSLRRIRQKPNIHLSKGLSLPIS--- 120
Query: 314 ISVPVVGLSFDASQSSKFGRTQNFPSCIWNSQLSTKPNKTFARKFITNVESDNIPGAGSS 373
VPVVGLSFDASQSSKFGRT+NFPSCIWNSQLS KPNKTFARKFITNV SDNI GA S
Sbjct: 121 --VPVVGLSFDASQSSKFGRTRNFPSCIWNSQLSPKPNKTFARKFITNVGSDNILGASCS 180
Query: 374 SIYTLSRSSKMASKILEQLEKLTSPKEKVSTFNLLPVREKYHPKLSPAEVVGHLKSVKDV 433
SIYTL+RSSKMASKILEQLEKLT PKEKVSTFN LPV EKYH KLSP EVVGHLKSVKDV
Sbjct: 181 SIYTLTRSSKMASKILEQLEKLTPPKEKVSTFNRLPVGEKYHSKLSPPEVVGHLKSVKDV 240
Query: 434 DLPR------DDKQSNSLLGISYQGNRENTFQHKEKLEKLKSSDPHPNRDLLKDYGSMGS 493
DLPR DDKQSNSLLGISYQGNREN+FQHKE+LEKLKSSDPHP+RDLLKD GS+GS
Sbjct: 241 DLPRNEEFVYDDKQSNSLLGISYQGNRENSFQHKERLEKLKSSDPHPSRDLLKDSGSIGS 300
Query: 494 SKDSMNDQGMPESAVVKSTIQPPKDKQAFPMLPDKDSVYQDESSAARVAPATAEVREGDV 553
+ DSMNDQGMPESAV KSTIQPPKDKQAFPMLPD+DSV QDESSA RVAPATAEVREGDV
Sbjct: 301 TNDSMNDQGMPESAVGKSTIQPPKDKQAFPMLPDEDSVDQDESSADRVAPATAEVREGDV 360
Query: 554 SLAVRQTTANESLSPARIQKPSEVIVGSSLYGSSDSETFGDSIDDDIDTGLTFQNASSLC 613
SLAVRQTTANES+SPAR+QK SEVIVGSSL GSSDSETFGDSIDDDIDT LT Q ASSL
Sbjct: 361 SLAVRQTTANESVSPARLQKSSEVIVGSSLDGSSDSETFGDSIDDDIDTRLTVQIASSLR 420
Query: 614 TSQPETNDSFGNKNLPENKQIVSPVFSFVNNVSPRKQPIASSAALDIGNKDDSLTELCAD 673
TSQPE DSFGNK LPENKQIVSPVFSFVNNVSPRKQ IASS ALDIGNKDDSLTELCAD
Sbjct: 421 TSQPEAIDSFGNKILPENKQIVSPVFSFVNNVSPRKQLIASSTALDIGNKDDSLTELCAD 480
Query: 674 SENVNEPSYPYTQCNPASSNDKLDSSWRTCNDAFSSSVSLSAGLAFSFSSNPGNQSPNDG 733
EN NEPSYPYTQCNPASSNDKLD SWRTCNDAFSSSVS+SAGLAFSFSS PG+QS N+G
Sbjct: 481 FENGNEPSYPYTQCNPASSNDKLDFSWRTCNDAFSSSVSVSAGLAFSFSSTPGHQSLNNG 540
Query: 734 LSISCPSLYSSYSPSTGFMNRSSSRNIFLSAPYAINNANIITTMASLFSPTTSGAGSYED 793
LSISCPSLYSSYSPSTGFMN+SSSRNIFLSAP AINN NIITT+AS F+ TTSG GSY D
Sbjct: 541 LSISCPSLYSSYSPSTGFMNQSSSRNIFLSAPCAINNTNIITTLASSFASTTSGTGSY-D 600
Query: 794 EIKQDASLRNVNDTYFSSITTPANSHYSMFSFGSAATPSFVTNLLSKPTVSSATELSAPD 853
+IK+D SLRNVNDTYFSSITTPANSHYSMFSFGSAATPSFVTNLLSKPTVSSAT LSA +
Sbjct: 601 KIKRDESLRNVNDTYFSSITTPANSHYSMFSFGSAATPSFVTNLLSKPTVSSATGLSAQE 660
Query: 854 VSVEKEFIANAEKTSMILESSTSHVSSGMAGKASVCCGLSFGCSSPASEQFNSGNRPSEF 913
VSV K+FIANAE+TSMIL SS SHVSSGMAGKAS+CCGLSF CSSPASE+FNSG+RPSEF
Sbjct: 661 VSVGKKFIANAERTSMILGSSMSHVSSGMAGKASLCCGLSFECSSPASERFNSGSRPSEF 720
Query: 914 PITGFTSAHATSTISTSNVSTSSTLLEFESFTGASFSSIRCTTSAAALANSTPVLSNSYP 973
PIT FTSA ATSTISTSNVSTSSTLL FESFTGASFSS+RC+TSAAALA+STPVLSNS+P
Sbjct: 721 PITAFTSAPATSTISTSNVSTSSTLLGFESFTGASFSSLRCSTSAAALADSTPVLSNSHP 780
Query: 974 KVAFSVSSVNNDCEEQGTSKDNVPLFSQKPKFSFGSG------TSELTLFQVGKLENQQT 1033
KVAF VSSVNN+CEEQGTSKDNVPLFSQKPKFS GSG TSELT FQVGK QQT
Sbjct: 781 KVAFKVSSVNNNCEEQGTSKDNVPLFSQKPKFSSGSGPSGSAGTSELTSFQVGK---QQT 840
Query: 1034 LAEPQNSYPYMAASNSLEAKAGGSFSLNAGGSDKANRRSVKFKRRK 1061
LAEPQNSYPY+AASNSL+AK+GGSFSLNAGGSDKANRR VKFKRRK
Sbjct: 841 LAEPQNSYPYIAASNSLQAKSGGSFSLNAGGSDKANRRFVKFKRRK 877
BLAST of CsGy5G025550 vs. TAIR 10
Match:
AT3G10650.1 (BEST Arabidopsis thaliana protein match is: nucleoporin-related (TAIR:AT5G20200.1); Has 61042 Blast hits to 31782 proteins in 2093 species: Archae - 202; Bacteria - 16480; Metazoa - 16017; Fungi - 12552; Plants - 1653; Viruses - 629; Other Eukaryotes - 13509 (source: NCBI BLink). )
HSP 1 Score: 130.2 bits (326), Expect = 1.0e-29
Identity = 303/1147 (26.42%), Positives = 456/1147 (39.76%), Query Frame = 0
Query: 31 KKSPPKPYDRPPDGIRTSG-------NNSWILKLVDPGQRLISSGSRMLFSSVIRKFPHH 90
++S PYDRP +R +G W+ KLVDP QRLI+ ++ LF S+ RK
Sbjct: 29 RRSQKTPYDRPTTSVRNAGLGGGDVRGGGWLSKLVDPAQRLITYSAQRLFGSLSRKRLGS 88
Query: 91 LTSRVSSQES---------SQSRKDDNKVDVTAPFEVRVATNVGD-NRSRSSDQFLMMEL 150
+ + S E +Q K +K DV+ + D N S + +L
Sbjct: 89 GETPLQSPEQQKQLPERGVNQETKVGHKEDVSNLSMKNGLIRMEDTNASVDPPKDGFTDL 148
Query: 151 EKTLKQKTFTRSEINHLTTLLHSRNGDLPVVHKEKSFKFISSIPEPNRKEFVKIPNSEVR 210
EK L+ KTFTRSE++ LTTLL S+ D +++E+ + + P E +
Sbjct: 149 EKILQGKTFTRSEVDRLTTLLRSKAADSSTMNEEQRNEVGMVVRHPPSHERDRTHPDNGS 208
Query: 211 MGRPSISTPILSSSVLDGDISSPAEVARAYMGSRESKVCPSMRSLRAQGLGKNST----- 270
M +STP S LD I+SPA++A+AYMGSR S+V PSM LR Q ++S
Sbjct: 209 M-NTLVSTPPGSLRTLDECIASPAQLAKAYMGSRPSEVTPSMLGLRGQAGREDSVFLNRT 268
Query: 271 ------------------------------------------------DSTSLTNMLLAP 330
S + ++ A
Sbjct: 269 PFPQKSPTMSLVTKPSGQRPLENGFVTPRSRGRSAVYSMARTPYSRPQSSVKIGSLFQAS 328
Query: 331 PS-------------ISQGLKRRSSFLDNHIRSIVSLRKIRQKPNIHLSKGLSLPISARP 390
PS GLKRRSS LDN I S+ +R+IRQK N+ S+ L+LP+S P
Sbjct: 329 PSKWEESLPSGSRQGFQSGLKRRSSVLDNDIGSVGPVRRIRQKSNLS-SRSLALPVSESP 388
Query: 391 ISVPVVGLSFDASQSSKFGRTQNFPSCIWNSQLSTKPNKTFARKFITNVESDNIPGAGSS 450
+SV G + +T +K A ++IPG+ +
Sbjct: 389 LSVRANG-----------------------GEKTTHTSKDSA---------EDIPGSSFN 448
Query: 451 SIYTLSRSSKMASKILEQLEKLTSPKEKVSTFNLLPVREKYHPKLSPAEVVG-HLKSVKD 510
+ T +SS+MASKIL+QL+KL S REK KLSP+ + G LKS+++
Sbjct: 449 LVPT--KSSEMASKILQQLDKLVS------------TREKSPSKLSPSMLRGPALKSLQN 508
Query: 511 VDLPR-----DDKQSNSLLGISYQGNRENTFQHKEKLEKLKSSDPHPNRDLLKDYGSMGS 570
V+ P+ +K++NS SYQ + +E + + + D + GS
Sbjct: 509 VEAPKFLGNLPEKKANS-PDSSYQKQEIS----RESVSREVLAQSEKTGDAVDGTSKTGS 568
Query: 571 SKD-SMNDQGMPESAVVKSTIQPPKDKQAFPMLPDKDSVYQDESSAARVAPATAEVREGD 630
SKD M +G+ PPK K++F M +D + D+ A P EV E
Sbjct: 569 SKDQDMRGKGVYMPLTNSLEEHPPK-KRSFRMSAHEDFLELDDDLGAASTP--CEVAEKQ 628
Query: 631 VSLAVRQTTANESLSPARIQKPSEVIVGSSLYGSSDSE--TFGDSIDDDIDTGLTF---- 690
+ V ++ + + + PSE + +S + D+ T S++ + + + F
Sbjct: 629 NAFEVEKSHISMPIGEKPL-TPSEAMPSTSYISNGDASQGTSNGSLETERNKFVAFPIEA 688
Query: 691 ---QNASSLCTSQ----PETNDSFGNKNLPENKQI-----VSPVFSFVN-NVSPRKQPIA 750
N +S TS+ E + K E K+I P F N + SP +
Sbjct: 689 VQQSNMASEPTSKFIQGTEKSSISSGKPTSEEKRIPLEEPKKPAAVFPNISFSPPATGLL 748
Query: 751 ---SSAALDIGNKDDSLTELCADSENVNEPSYPYTQCNPASSNDKLDSSWRTCNDAFSSS 810
S A+ DI + S T T N AS + S+ T N
Sbjct: 749 NQNSGASADIKLEKTSSTAFGVSEAWAKPTESKKTFSNSASGAESSTSAAPTLN------ 808
Query: 811 VSLSAGLAFSFSSNPGNQSPNDGLSISCPSLYSSYS-----------PST--GFMNRSSS 870
G FS +N P++G S PS S S PST F +S
Sbjct: 809 -----GSIFSAGANAVTPPPSNGSLTSSPSFPPSISNIPSDNSVGDMPSTVQSFAATHNS 868
Query: 871 RNIFLSAPYAINNANIITTMASLFSPT-------------------TSGAGSYEDEIKQD 930
+IF P + N++N +T AS S T +SG S E E+K +
Sbjct: 869 SSIFGKLPTS-NDSNSQSTSASPLSSTSPFKFGQPAAPFSAPAVSESSGQISKETEVK-N 928
Query: 931 ASLRNVNDTYFSSITTPANSHYSMFSFGSA---ATPSFV---TNLLSKPTVSSAT-ELSA 990
A+ N + F + + S +F SA + P FV ++++ T++ +T SA
Sbjct: 929 ATFGNTSTFKFGGMASADQSTGIVFGAKSAENKSRPGFVFGSSSVVGGSTLNPSTAAASA 988
Query: 991 PDVSVEKEFIANAEKTSMILESSTSHVSSGMAGKASVCCGLSFGCSSPASEQFNSGNRPS 999
P+ S F + T S S S+ SV SF +S S + +
Sbjct: 989 PESSGSLIFGVTSSSTPGTETSKISASSAATNTGNSVFGTSSFAFTSSGSSMVGGVSAST 1048
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q9CAF4 | 1.4e-28 | 26.42 | Nuclear pore complex protein NUP1 OS=Arabidopsis thaliana OX=3702 GN=NUP1 PE=1 S...
Match Name | E-value | Identity | Description
XP_011656263.1 | 0.0 | 100.00 | nuclear pore complex protein NUP1 [Cucumis sativus] >XP_031741373.1 nuclear pore...
XP_008446727.1 | 0.0 | 86.60 | PREDICTED: nuclear pore complex protein NUP1 isoform X1 [Cucumis melo] >XP_00844...
TYK09186.1 | 0.0 | 84.19 | nuclear pore complex protein NUP1 isoform X1 [Cucumis melo var. makuwa]
XP_038893389.1 | 0.0 | 75.55 | nuclear pore complex protein NUP1-like isoform X1 [Benincasa hispida]
XP_016900298.1 | 0.0 | 86.57 | PREDICTED: nuclear pore complex protein NUP1 isoform X2 [Cucumis melo]
Match Name | E-value | Identity | Description
A0A0A0KWL0 | 0.0 | 96.88 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G613440 PE=4 SV=1
A0A1S3BFS9 | 0.0 | 86.60 | nuclear pore complex protein NUP1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10348...
A0A5A7SZK9 | 0.0 | 86.60 | Nuclear pore complex protein NUP1 isoform X1 OS=Cucumis melo var. makuwa OX=1194...
A0A5D3CFP1 | 0.0 | 84.19 | Nuclear pore complex protein NUP1 isoform X1 OS=Cucumis melo var. makuwa OX=1194...
A0A1S4DWF1 | 0.0 | 86.57 | nuclear pore complex protein NUP1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10348...
Match Name | E-value | Identity | Description
AT3G10650.1 | 1.0e-29 | 26.42 | BEST Arabidopsis thaliana protein match is: nucleoporin-related (TAIR:AT5G20200....
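The summary rows above share one layout: match name, E-value, percent identity and description, separated by ` | `. As a hedged illustration, the Python sketch below reads such rows into dictionaries and ranks them by identity. It assumes the delimiter does not occur inside a description, and the helper name `parse_summary_row` is hypothetical.

```python
def parse_summary_row(row):
    """Split one 'name | e-value | identity | description' summary row."""
    fields = [f.strip() for f in row.split(" | ")]
    # Only the first four fields are used; any trailing columns are ignored.
    name, evalue, identity, description = fields[:4]
    return {
        "match": name,
        "evalue": float(evalue),        # e.g. "1.0e-29" or "0.0"
        "identity_pct": float(identity),
        "description": description,
    }


rows = [
    "XP_038893389.1 | 0.0 | 75.55 | nuclear pore complex protein NUP1-like isoform X1 [Benincasa hispida]",
    "XP_016900298.1 | 0.0 | 86.57 | PREDICTED: nuclear pore complex protein NUP1 isoform X2 [Cucumis melo]",
    "AT3G10650.1 | 1.0e-29 | 26.42 | BEST Arabidopsis thaliana protein match is: nucleoporin-related (TAIR:AT5G20200....",
]

hits = [parse_summary_row(r) for r in rows]
# Rank hits from highest to lowest percent identity.
for hit in sorted(hits, key=lambda h: h["identity_pct"], reverse=True):
    print(hit["match"], hit["identity_pct"], hit["evalue"])
```

Truncated descriptions (those ending in `...`) are kept verbatim, since the full text is not available in the summary view.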