Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGAAATGGATGGTCCTTTAGATTTCGAGTCAGAAGACCCGCTTCTCAGTTCCCCTGTCGCTCTCAAGAAGAGGTCTCAATATCTCAGAATCCTCTATTTCGTGTTTTTCCTTTCCAAACCTAAGCCCATATGGGGGCTTAGGACATATGTTTCAATTTTCTTCTTGCTCTACTTTCACTGGGGTTTGTCTGATTCGGTTTGTTCCAAGGATTGTTCCGTGAATATTTCTTGTAAATTGGAAGTTTCATATTATCTCTCTTTTATATTATTGTCTTGTTTCTCTACTGTTTTCCGTTCAAATCTGTGAACTGAAGCTTAAAAATCTGTAGCAGTTCGACAGTAGCATAAATTTTATATAAAGGGAGCTAAGAAGTAGCCATTGTTGTGGTTGTGCCTTCTTTGCAACTTTGATTTGTTAGTTTTAGTAAACGTGAAGTTCTACCTCTCTGAGTCCGAGTACAAATGCTTCTGGAACAAGTGGTTTGAGAGGCTGTTGACGTCTATTTTTTATACCAATCAGGAAAATGATTATCGGGTTAGATGATCTTCTGACTGATCACTACAAGGATAAATGCAAACTCGTTGAGAAAGAATCTAGACAGGCAAAGAAGAAAAAGAACTATAATTCGGATGATGATGATTTTGGCAAAGAGGCTGTGGTGTCCCAAGTTGTTGATGAATGCCAAAATAAGGTAAGAAATGCCTAGGAGCAGAATTTTTCTGGAGTTTTCTTAGCCTTCCTTCACTTATGATCTAAGAATGTTTTGTGTAGATGAGCCAACTAGGGGGTGAGGAAGATACATCCATATGGGGCTTAAAGGTTTTTGGAGAACAGGTTTGCAGTGTAACCTTACAGCTTCCGCTATGCGTTTGTCCTGAATGCCTTTATTCAAGAGGATTTCTCTGTGATTAACAAATTGGCTGCATTTCTGTGCATGTAAATTGCAGAAAGCCCCACCAACTTTACAAAGTCCTGAACTCAAAAGTTGCCACCTTTTGCGAAATTTCCTAAATAATGAAGTTAACTCTTTGGTGAATCTCACCATGGAGAAAGGTTTGCTTTTCCTTCATGTTTGAATTCTCTGAATCTCCCTCATCATTAAGTAGTAATACTTTTATAGTTTTTCTTAATTGAATTATGATATACCCTTTATTATTGATGATATTATGTAATGACTACTAAGTTTCAAGTTGAAATTATTTTTACCCTAAACCTTATGAACTGTTGCTACATCAATGAAATTGTTTCTTTCACAAAAAAAAAAACCTTATGAACTGTTGCATTGCTACATTTTTGTTAGTTTTTGCCTGGTTTTAAGTTTAATTGATTGACTTATGGCTACTCAATGGCAAAAATATGGCAATTTGAACAATAAATTTAAAGCTTGTTGTTTGCATATACTACATAATCATTAAAAAGAGCAATAAGAGAGATTAAAAAACGAGAATGAACAACACCATGCATTTTGAATTCTGTTTGTCTTTTCTCTTGAAATATAGGAAAATACTAATTCTCACTGAATCCAGTTGTCTGACATATGCCCCTATATTGTCTNTGTGGGGGAAGACCTAACTATCTATTTGGCACTCTATTGTTATTAAATAGATTATAGAGCTTGAATTGTTTGTTTGCGCAGAGTAGGAAGAAAAACGTGTATTAGTTGACTATTTCTGTATTCTGTCCTTACAACTTTCTGGAAGTGATAAGATCATATTTGTAAGAGATAGGTGATTTGGAGGTTCTGTCTTTAGCTAACTTTTACCCTACGTCCCTACCTTGTATGCTCTTTGCCAAGCTTAGCAGTCAGCTACCAAAGCCGTGTCTGATAGTTTCTTGCAGTGCTGAATTCCCATTTCTGGGAGGAAATTATAAGAGAATGGAAATTCAGGATTGTTTGTTGCCTCTTTCTTATTTCTTCTTTAGATGGTATCATACTATTTTTATGGAATCAGAACTCTATATGTTGATGCCTCAGGCTCTCAGATACATTCACTTGT
TTTATCTGAAAGCTCTTGCAACAAAGGATTCAAGGGCCTATATCGGTGTTCTCCTAAATTTGATGTGGAGGTTTAAAACTCCAATGAAGTTCCAATTCTTTGCGAGAGTTGTTTCTTAGGAGGCTGAATTCTATTAATAAATTTCAGAGAATGCACCCTTTCTTCTATCTCTATTATCATTTGTGCGTCCTTCGAGAGAAGGCAACTGTAAACTTGGACCATGTTTTTGTGCACTGCCCTCTCGTGAGGAAGTTGTGGGAGAATTTTTAGGAGATGCTTAGACTTTCTTGGGAGCCCTTCCTTTGGTTGAAAAAAATTCAGATGATATGAGGTCCTTTGTTTTTTGTCTAAAAAAAGATGATGTACAAGTCTCACCTTATGGGTGGTTTGGGACGTAGAACTTGAGGATATAGTGTGAGGCTTCAAATAAGGTTGGAGATAGAGTTATTGTTTTGCAGCAGTTATTCTTTTTAACACCCAACAGAGTTCTATAATTTATCAATAATTTTGTAATCCACCAGCTTATAAAGTTCTCTGTAATATTCTAGTCCCTTGGTGAGGTGGAACCCCTGTTCTTTTATCCTTATTTATACATTTTTGTAGCAATTCATCTATTTTTTCTTCCAGAAAAGGAGTAGGGAAAAGGAAAACATGAAGTCCACTTAAAAAAAAAAAATTAATGATAGCAGTGGTTCATTTATTTATGCATTTTAAAATGTTGATAAGTTAGTACCTCGAAGCTGATTTTGGTATCATGAATTTTCTAAAAATCTAAAACCTTTCGCAGACAAATTAAATAAGTTGTCAATATATTTATTTGATATGTTATTGCAGGTGATGCTTTCCTTGAAGGTTTATTGGTAAATGGCTGGCTGTCAACACTTGTTTCTTTGACAGGTCGTGTAGAAAAACCAATTGCCATATGGACCTTCAATTTAAGTGAGTACATCTATAATTCCAGTCTTGTTCATGGAACTCTTTACATTAGCCTTAATTCTTTTTTATGCGTTTATTTTCATATTTCAGTGTTATATTCATCAAGAGAAGAGCTAAGAACATCAGCTTTTGATTTCTGGAGGGATATCATGTTACCTAAACATGAGGTTTCTGTTTTTTCTTTGACCACATGTTTCTTTTTATTTCTTCAACGTTTATTAATTTTTCTTAAGGCTTATCTAGAATTCTTAACTAGGTTGAACAACAGCCTCTCCAAATTGATTGGTTTCCTAACCATGCTCAATTAGGAGAAGCTCTTGAGACTTATGGATTTAGATTTGAGTGCTCATTAAATCCTGGATTTGTCCACGCTGGTAAACTTCCTTGACATCAAACTACACATTCATGAGAAAATATATGCTATACTATGAAGGGTGATTTGTTCGCTGAAGGTTATTTGATTTTTAGATACGCAAACTCAAGGTTTCGCAAAAGTGTAAATCCTTTTAATAGTTATAGTTTAAATTCCCTTATATCCAACTAGAGAAGTTTTTTGTAACTTCATTGGATAGGCTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAAAACTCAAGGTTTCGCACAAGATTTAATTCTTCCTCTATAATATGAAATCATCATAAAGTGCTGTCTCTTTAAGCACGTAGCACGTGGCATGTGTTAAAAAATCTTTGGAAATTGGAATGATAAGGCCTTTTTGGGTTTCTGATGTATGACTTTCATTTTGCTGCATTGATATTGTGTTTTAATTTACATTATGAAAACATTTTGTTAGTATGATTTTTAATTGGAGATAAGAGATTGCCTTAGCTTTGAACTTTCAATTTCACCAGATTGGACTCTAAACTAGAATAAGTAGTACAATTAATACCTCCTCATTCCTCAAAACAATATAAGTTGTTTCTAAACTAACTAATAAATCACTCAAATGCTCAAAATTAAGCCATAAAAATACCTAAATCCAACCACACAAATATTCAAATCTTGCCTC
AAAATCCTGCCAGGTTAAAAATGCTCAAACTCAATGCTGAAAATGTCCAAATCTAATTAAAAAGCACAAACACACCCTGAACAATCAATATCCACAAAAATAACCTCATCCAAGTACAATGATGCTCTTACATCACAGTTCTTTAAAATAAATAAAATTCAACTATTAGAGAGTATTAATTTCACTTAAGTTCTATGAATAAAAATTTTAAGACAAAATTTGGCTCACGATCTCAAAATGAACTTGTTCCTTCTTTGTATCAACCAATATTCATAATGGGTTTATTGGTTTAAATCATGAATGTTAATCCATTTGATTCACCACTACTATGCATTGACTCAGATGCAGCACCACATGGATGACAGCATCCAAATTTCTGAGTTGGGCTACAGAGGGGTGGATAGGAAATTATAGAGATTGATTATCCTTTTCAGTGATATCAGAGTAACTATGATCTGAAGTTTTTATAGAGTTCCTAATTTCTCCTACTCAATCTGTATCCAGTTTTTCCGACCACCAGCATACTCCTTGCCCTGTGTAAATCCTTGTTACTCATCATGACCTCAGCGTCCACGCTCAACTGCATAACATCCTTGCATTCCTCATTTACCCTTTCCCTATTATTGTGTGAGTACATTTATGTAAGGTATCCCCTTACTTTTATTGATATTCAAGAACAAGAACAATATAATCAACCCAACCAATTTGAGAGGGCCCAAGTCTCCAAGAAAGCACTATGCTATTCTTCCTTACTAAGAAATTACCTACCTCCCCCAAGGGAACATTCTCCCTTTTAACAAAACTACTCCATACGCCCAAATTGGGAGACGGAGTCCATTTATAAGGAGCTAATGCCCCATTACCTACCTCCCCCAAGGGAACATTCTCCCCTTTAACAAAATTATTCCATACGCCCAAATGGGCCCCACTCTTCTAACTAACTGGAAGTACCCCTTCTACCCCTCCTAGTATATGTCTTCACCACTGGGGTCTAACAACTTATGCTCGAGGGCCTATTGGATACCTTGTCCCAATGAAGTAACCTTATCCTCAAGGTGAGGTTTGGAAACTGAGTTTGTAAATCCTTAATAAGCTTCCATCATTTGAAGGCTAGCTGGGTTCGACTTATTTGATCAATTAGTCTACCTTAAAATTTTTTTATGATTGATACCTGATTCTAGTTAAAAGATGAAAAGAACAAGGGCTTCTTTCAAGACATTGCTCAAAAATAAACAAGGGCTTCTTTGGAACTTAGATATTAAATTAAAGAAACAAGGGCTTCTTTTAAGACATTGTTAAAAATTAAATACAATAAAGCTAGTTTCTTCTCATCCAAGAAAAGGGAAAAAAAAGAAGGCTTAAAATCATATGCAAGGTGAAGTGCCAGGGAGGGGTCATACTCAAGGGGTGCTTTTGTTCAATGCTTGATAATAAAAGTTATTTTTTATTGAAAATTGACTCTTGCGAAAATTGATGTTAAATTTTCGGAGCTCAAGAAACTCCATGAATTAATGTTTTGCTTATGAGTTCTGACCGTATGCACTCCAATGTTATGTGCTTGGTAAATGAGAAATTACTACTTTCTGAAATTAATTAGGGACTATGCTTATGAGAACAAGTCTTTCCATTCATTTGTTATTTTATGTATTTTGTTTTTTATTCCGCTTGTGTATGAGTTACTATATTTTAGAATTGTCATATAAAGTCTGACTTCAATAGCCATGTAGAAACCCCCCAAGCCCTTGCCATTTGAGTTAGTTGCTTTACTCTTTCTTAAACAAGTTATGGAATATTGTTAGGTTCTGGACGTGGAGGGCCACCTCAGAATATAAGGGCTTGGATCAAATTTATTACTATTTGTTGTCAAACAAAGTAAGAAGTCCTTTTTTCTTTTCTTTTGAAAAATGTTTGTGAGTTTAACTCTTTTGCCTTTGGCATTCGCTAATGTTTTCAATCTTGAAACTTAATGTGGCTACTGTAAAAGTAATTTTCTTCTCAAC
ATATGATTTTACTTGCACAATACCAATAAAAAAACAAGTGTGGCTTCATGTTTTGTTTTTTTTTTTTTAAATCTTTGCAGGATTAAAAAGCATATATTTACATTCTCCGAAGTTGAGCAACTAGCTGAAGCCATTATTTGTTTATTTCTAGACCGCCAGTTTCAAGGCTTATCCATGCTCTTATGTGAATGCTTACAATCGCTTATCCATTACTTCACAGATGAAGAATGGAAGGCATGTTGTGAGAAAATAGCAAAATCTCTTGTTTGCAGGTATTGGACTCTCCCTAGCGATGTAAAGTGTATACATTATATAAATTAAAGGTGTAAGTAATTCGGTTCAGTATTTTTATTTTTATTTTTTTTGTTAAAACTGACATATTGATTTTAAATGACCAAACTAATAAAATCTATCTAATCCAAACTGACTGTTTTCTAATTTTATACTTCAGAAAACTGAACTAACTGGCAAGTATAATTAAAAAAACTGAACTGAGTGACTTGGTTCTTTTAGTTTCGTCGATTTTATTTTATTTAATTAATTAATTAATTTTTTTTTTTTTTAATTTTGGCTTGCACCCAACACCACTGATACAAATGTGATTAAACATATTTACAATCAATCATATCATGACTACTTTTTTTTTTCCTGCCTCTGCTCTGCTATTGCAGGATTCCTAGGGATCTAAATTGCTTGCGAGCTGTGGAATGCATTTCTGGAGCTGATGCCCGTAGTAAATATCTGAGGAGTGCAGTTGCCTATCAAATTCTTCTTATGTGCTTTGAAACTGAGGTGGTCCTTTCCTTTTTTTTCGTTGGCTAAATATTCAGAGTTTTCTTTTGAAGCATAGCTTTGTTTACTGCTCCATAACACTCATCTTTCCTGCAAGATTATGTGTATCACATGCATATCATTTTCATATATGCTTGTTGAAATTTTTTAAGACAGTAGATTATTATCATATCCCATTTACAATATTTTGGTATGTTGTTTGAGGTGTCCTTCTATTCTGGAGAGGAATCATAGATTGTTCGGGGGATGGAGAAATCTGTAGATTATGTTTGGGATATCGTTAGATTTAACACTTCTTTATGGGGCTCGGTTTCTAAGCCCTTTTGTAATTATTCGCTAGGCCTTATTCTTTTGGATTGGAGTCCTTTTCTACAGGGGATCTCCTCTTTCAGGGCTGTTTTTTTTTATGCCCTTTTGTAGGCTTTTTTNTGGTTTTTTTTTTTGGATGCCCTTTTGTAGGCTTTTTGTAGCTCATGTATCTTTTTCATTATCCTCATAAAAAAAAACCCCTTCGAAGGGTTTTTCGTGTAGCTCCTATTTCCATATCTTGAACTCCCCGTCTCCTTCTTCCCCTTCGACCCCTTGATTTACCTCTCTTTGAAAGATTAAAATTCCGAAGAAGATCAAGTTCTTTGGGTGGCAAGTCTTACTTGGGAAAGTCAATACCATGGATTATATCCAGAGGATGTCTTCCTTTTGTCTGGGCCCGCTTTGGTGTGTTCTCTGTAGGAGTGCCTCGGTAGATCTCGACCATTTGTTGTGGACTTGTCAGTTCCCGCAGGATTTATGGTTTCATTTCTTTAGGTGTTTTGGGATGACCTGGGTTTGTACAAGGGATTGTAGGGCGATGATGAAGGATGTGCTGTTATTCCCACCTTTTCGTGACAGGGGTCGTTTTTTGTGGCAGACTTGTTTTCTAGCTATTTTATGGGGTATTTGGTTGGATAGGAATAATAGACTGTTTAGGGGATGGAGAAATCTGTGGACTATGTGTGGGAACTCATTAGATTTAATTCTTCGCTTTGGGGCTCGGTTTCTAAGGCCTTTTGTAATTATCCGTTAGGCATCATTCTTTTGGATTGGAGCCATTTTCTTTAGAAGCTTCCTCTTTCAGGGTTGTCTTTTTGTAGGCCCTTTTTGTTTGTCTTGTATCTCTTTTCATTTTTCTCAATGAAAGCGTAGTTTCTTACGAAAAAAGAACAATAGCA
TGACTATTTGTATCACAGTCAAGGTTAGGTTGAAAGTGCATATGTTAATCTTGGAACAAATGTCAATTAGCCCCTATTTATTTTTTGGTTCATACTTTTATTTTGTCTCAAATGTGGTTTAAATTTTATTTTCGTCTTTAAACTTTTTGTCTCTCAACTTTCAAAGGTTATATTTTGGTTCCTTCACACATTTATTTTGTTCGAGTCTCTCAACTTTTAAATGTTACGTTTTAGTTTCTCAACTTTTATACTTCTGTTTTGTTTTAGCCTCACATCATGTTATGTTTTAATTCCTCAACTTTCTAATGATATGTTTGGCTCTAAACATTGCATAAAATTTTTTGTTAATCCTTACCTCAAATTTTCATCAATCATTTAACAAGAATTTAAGTTTGTTCCAAAACCTGATATTTAACCTACTATATATGGTTTTAAATATTTATCTATGTATACATTTATTGATCAGACAAAGAAACAAAAGATGTATCTATGATTCTACATTAAGTTTCTTTGAAATAATTTATAAAAGGCTTAAAGTTCAAGAACTGAATGAAATAATTATGAAAGTTGAGAGAATCAATATTATGTAAATCAACACAACAACCATTATCTTTCAGGCTACTAATGAGGAAGGGGTTTTGAGATTACTCACCTCAATCACAGTTAAAGACAAAAGCTGCGACCTATTTAAGCTGTATATTTACTTGGTGTTGACGGAGAACTGGCTTGTCGGTAGTCGAACGTTAGAAGGCAAGCCGCTAATTTGTGAAATGTGGGGTCTGTTTCTCAGAAACTGCTCCTGCCAAATCACCAGTACAGATTTGAGGTCCTACGCATCAAAGGTGTGAAAAACTCTTCACAATCTTTAACTTAGATTGATAATCCATCACTTGAGGGTGATGTTTCTCAAATCAAAACTCTCTCTCGTCGCACTTTCAGGTTCGTAACAAAGCTTCATATATCCTGCAAAGCTCTTTTAACGAA
mRNA sequence
ATGGAAATGGATGGTCCTTTAGATTTCGAGTCAGAAGACCCGCTTCTCAGTTCCCCTGTCGCTCTCAAGAAGAGGAAAATGATTATCGGGTTAGATGATCTTCTGACTGATCACTACAAGGATAAATGCAAACTCGTTGAGAAAGAATCTAGACAGGCAAAGAAGAAAAAGAACTATAATTCGGATGATGATGATTTTGGCAAAGAGGCTGTGGTGTCCCAAGTTGTTGATGAATGCCAAAATAAGATGAGCCAACTAGGGGGTGAGGAAGATACATCCATATGGGGCTTAAAGGTTTTTGGAGAACAGAAAGCCCCACCAACTTTACAAAGTCCTGAACTCAAAAGTTGCCACCTTTTGCGAAATTTCCTAAATAATGAAGTTAACTCTTTGGTGAATCTCACCATGGAGAAAGGTGATGCTTTCCTTGAAGGTTTATTGGTAAATGGCTGGCTGTCAACACTTGTTTCTTTGACAGGTCGTGTAGAAAAACCAATTGCCATATGGACCTTCAATTTAATGTTATATTCATCAAGAGAAGAGCTAAGAACATCAGCTTTTGATTTCTGGAGGGATATCATGTTACCTAAACATGAGGTTGAACAACAGCCTCTCCAAATTGATTGGTTTCCTAACCATGCTCAATTAGGAGAAGCTCTTGAGACTTATGGATTTAGATTTGAGTGCTCATTAAATCCTGGATTTGTCCACGCTGGTTCTGGACGTGGAGGGCCACCTCAGAATATAAGGGCTTGGATCAAATTTATTACTATTTGTTGTCAAACAAAGATTAAAAAGCATATATTTACATTCTCCGAAGTTGAGCAACTAGCTGAAGCCATTATTTGTTTATTTCTAGACCGCCAGTTTCAAGGCTTATCCATGCTCTTATGTGAATGCTTACAATCGCTTATCCATTACTTCACAGATGAAGAATGGAAGGCATGTTGTGAGAAAATAGCAAAATCTCTTGTTTGCAGGATTCCTAGGGATCTAAATTGCTTGCGAGCTGTGGAATGCATTTCTGGAGCTGATGCCCGTAGTAAATATCTGAGGAGTGCAGTTGCCTATCAAATTCTTCTTATGTGCTTTGAAACTGAGGCTACTAATGAGGAAGGGGTTTTGAGATTACTCACCTCAATCACAGTTAAAGACAAAAGCTGCGACCTATTTAAGCTGTATATTTACTTGGTGTTGACGGAGAACTGGCTTGTCGGTAGTCGAACGTTAGAAGGCAAGCCGCTAATTTGTGAAATGTGGGGTCTGTTTCTCAGAAACTGCTCCTGCCAAATCACCAGTACAGATTTGAGGTCCTACGCATCAAAGGTTCGTAACAAAGCTTCATATATCCTGCAAAGCTCTTTTAACGAA
Coding sequence (CDS)
ATGGAAATGGATGGTCCTTTAGATTTCGAGTCAGAAGACCCGCTTCTCAGTTCCCCTGTCGCTCTCAAGAAGAGGAAAATGATTATCGGGTTAGATGATCTTCTGACTGATCACTACAAGGATAAATGCAAACTCGTTGAGAAAGAATCTAGACAGGCAAAGAAGAAAAAGAACTATAATTCGGATGATGATGATTTTGGCAAAGAGGCTGTGGTGTCCCAAGTTGTTGATGAATGCCAAAATAAGATGAGCCAACTAGGGGGTGAGGAAGATACATCCATATGGGGCTTAAAGGTTTTTGGAGAACAGAAAGCCCCACCAACTTTACAAAGTCCTGAACTCAAAAGTTGCCACCTTTTGCGAAATTTCCTAAATAATGAAGTTAACTCTTTGGTGAATCTCACCATGGAGAAAGGTGATGCTTTCCTTGAAGGTTTATTGGTAAATGGCTGGCTGTCAACACTTGTTTCTTTGACAGGTCGTGTAGAAAAACCAATTGCCATATGGACCTTCAATTTAATGTTATATTCATCAAGAGAAGAGCTAAGAACATCAGCTTTTGATTTCTGGAGGGATATCATGTTACCTAAACATGAGGTTGAACAACAGCCTCTCCAAATTGATTGGTTTCCTAACCATGCTCAATTAGGAGAAGCTCTTGAGACTTATGGATTTAGATTTGAGTGCTCATTAAATCCTGGATTTGTCCACGCTGGTTCTGGACGTGGAGGGCCACCTCAGAATATAAGGGCTTGGATCAAATTTATTACTATTTGTTGTCAAACAAAGATTAAAAAGCATATATTTACATTCTCCGAAGTTGAGCAACTAGCTGAAGCCATTATTTGTTTATTTCTAGACCGCCAGTTTCAAGGCTTATCCATGCTCTTATGTGAATGCTTACAATCGCTTATCCATTACTTCACAGATGAAGAATGGAAGGCATGTTGTGAGAAAATAGCAAAATCTCTTGTTTGCAGGATTCCTAGGGATCTAAATTGCTTGCGAGCTGTGGAATGCATTTCTGGAGCTGATGCCCGTAGTAAATATCTGAGGAGTGCAGTTGCCTATCAAATTCTTCTTATGTGCTTTGAAACTGAGGCTACTAATGAGGAAGGGGTTTTGAGATTACTCACCTCAATCACAGTTAAAGACAAAAGCTGCGACCTATTTAAGCTGTATATTTACTTGGTGTTGACGGAGAACTGGCTTGTCGGTAGTCGAACGTTAGAAGGCAAGCCGCTAATTTGTGAAATGTGGGGTCTGTTTCTCAGAAACTGCTCCTGCCAAATCACCAGTACAGATTTGAGGTCCTACGCATCAAAGGTTCGTAACAAAGCTTCATATATCCTGCAAAGCTCTTTTAACGAA
Protein sequence
MEMDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSDDDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKAPPTLQSPELKSCHLLRNFLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREELRTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGRGGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQSLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLMCFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGLFLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE
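The protein sequence above can be reproduced by reading the CDS in frame from its first base. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this feature; the helper name `translate` and the handling of ambiguous codons as `X` are illustrative choices, not part of the database.

```python
# Standard genetic code (NCBI table 1), laid out in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each of the 64 codons to its one-letter amino acid ("*" marks a stop).
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence in frame 0 (hypothetical helper).

    Codons containing non-ACGT characters (for example N) become "X";
    a trailing partial codon is ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))


# First 30 bases of the CDS listed above.
print(translate("ATGGAAATGGATGGTCCTTTAGATTTCGAG"))  # MEMDGPLDFE
```

The CDS as listed ends in GAA (E) with no terminal stop codon, so the translation carries no trailing `*`.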
Homology
BLAST of MS006864 vs. NCBI nr
Match:
XP_022152868.1 (uncharacterized protein LOC111020493 [Momordica charantia])
HSP 1 Score: 925.6 bits (2391), Expect = 1.6e-265
Identity = 454/455 (99.78%), Positives = 454/455 (99.78%), Query Frame = 0
Query: 3 MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 62
MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD
Sbjct: 1 MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60
Query: 63 DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKAPPTLQSPELKSCHLLRN 122
DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQK PPTLQSPELKSCHLLRN
Sbjct: 61 DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120
Query: 123 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 182
FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL
Sbjct: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 180
Query: 183 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR 242
RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR
Sbjct: 181 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR 240
Query: 243 GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ 302
GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ
Sbjct: 241 GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ 300
Query: 303 SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM 362
SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM
Sbjct: 301 SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM 360
Query: 363 CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL 422
CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL
Sbjct: 361 CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL 420
Query: 423 FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 458
FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE
Sbjct: 421 FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 455
BLAST of MS006864 vs. NCBI nr
Match:
XP_038895435.1 (uncharacterized protein LOC120083668 isoform X2 [Benincasa hispida])
HSP 1 Score: 820.8 bits (2119), Expect = 5.7e-234
Identity = 401/457 (87.75%), Positives = 424/457 (92.78%), Query Frame = 0
Query: 1 MEMDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYN 60
MEMDG LDFESEDPLLSSPV LKKRK IIGLDDLLTDHYKDKCKLVEKES+QAKK K Y
Sbjct: 1 MEMDGTLDFESEDPLLSSPVTLKKRKKIIGLDDLLTDHYKDKCKLVEKESKQAKKIKKYE 60
Query: 61 SDDDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKAPPTLQSPELKSCHLL 120
SDDDD GKEAVVSQVVDECQNKM+QLGGEEDTSIWGL VFGEQK P LQSPEL+SC+ L
Sbjct: 61 SDDDDLGKEAVVSQVVDECQNKMNQLGGEEDTSIWGLNVFGEQKPSPALQSPELESCNFL 120
Query: 121 RNFLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSRE 180
+ FLNNEVNSLVNLT+E GDAFLEGLLVNGWLSTLV LTGR EK +AIWTFNLMLYSSRE
Sbjct: 121 QTFLNNEVNSLVNLTVETGDAFLEGLLVNGWLSTLVYLTGRAEKSLAIWTFNLMLYSSRE 180
Query: 181 ELRTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGS 240
+LRTSA DFWRDIML +EVEQQ LQIDWFPN+AQLGEALETYG+RF+CSLNPGF+H GS
Sbjct: 181 DLRTSAGDFWRDIMLTTNEVEQQHLQIDWFPNYAQLGEALETYGYRFQCSLNPGFIHTGS 240
Query: 241 GRGGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCEC 300
RGGPPQNIRAWIKFITICCQTKIKK+IFT SEVEQLAEAIICLFLDRQFQG+++LLCEC
Sbjct: 241 RRGGPPQNIRAWIKFITICCQTKIKKNIFTSSEVEQLAEAIICLFLDRQFQGITVLLCEC 300
Query: 301 LQSLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQIL 360
LQSLIHYFTDEEW ACCEKIAKSLVCRIP DLNCLRAVECISG D RSKYLRS +AYQIL
Sbjct: 301 LQSLIHYFTDEEWNACCEKIAKSLVCRIPMDLNCLRAVECISGVDPRSKYLRSTIAYQIL 360
Query: 361 LMCFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMW 420
L+CFE EA+NEE VLR+LTSITVKDKSCDLFKLYIYLVLTENWLVGS+TLEGKPLICEMW
Sbjct: 361 LICFENEASNEEEVLRVLTSITVKDKSCDLFKLYIYLVLTENWLVGSQTLEGKPLICEMW 420
Query: 421 GLFLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 458
GLFLRNCSCQI STDLRSYASKVRNKASYILQS+F E
Sbjct: 421 GLFLRNCSCQIASTDLRSYASKVRNKASYILQSAFEE 457
BLAST of MS006864 vs. NCBI nr
Match:
XP_011653682.1 (uncharacterized protein LOC101216339 isoform X1 [Cucumis sativus] >KGN64767.1 hypothetical protein Csa_013682 [Cucumis sativus])
HSP 1 Score: 816.6 bits (2108), Expect = 1.1e-232
Identity = 397/457 (86.87%), Positives = 425/457 (93.00%), Query Frame = 0
Query: 1 MEMDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYN 60
MEMD PLDFESEDPLLSSPVALKKRK IIGLDDLLTDHYKDKCKLVEKES+ AKK+KNY+
Sbjct: 1 MEMDAPLDFESEDPLLSSPVALKKRKKIIGLDDLLTDHYKDKCKLVEKESKLAKKRKNYD 60
Query: 61 SDDDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKAPPTLQSPELKSCHLL 120
SDDDDFGKEAVVSQVVDECQNKM+QLGGEEDTSIWGL VFGEQK PP LQ+PEL+SC L
Sbjct: 61 SDDDDFGKEAVVSQVVDECQNKMNQLGGEEDTSIWGLNVFGEQKPPPALQTPELESCQFL 120
Query: 121 RNFLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSRE 180
+ FLNNEVNSLVNLT+EKGD FLEGLLVNGWLSTLVSLTG VEK +AIWTFNLMLYSSRE
Sbjct: 121 QTFLNNEVNSLVNLTVEKGDVFLEGLLVNGWLSTLVSLTGHVEKSLAIWTFNLMLYSSRE 180
Query: 181 ELRTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGS 240
LRTSA DFW+DIML +EVEQQ LQ+DWFP++AQLGEAL+TYG+RFECSLNPG +H GS
Sbjct: 181 GLRTSACDFWKDIMLTTNEVEQQHLQVDWFPSYAQLGEALDTYGYRFECSLNPGLIHTGS 240
Query: 241 GRGGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCEC 300
GRGGPPQNIRAWIKFITICCQTK+KK+IFT SEV +LAEAIICLFLDRQFQG+++LLCEC
Sbjct: 241 GRGGPPQNIRAWIKFITICCQTKVKKNIFTSSEVGRLAEAIICLFLDRQFQGITVLLCEC 300
Query: 301 LQSLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQIL 360
LQSLIHYFTDE+WKACCEKIAKSLVCRIP DLNCLRAVECISG D RSKYLRS VAYQIL
Sbjct: 301 LQSLIHYFTDEDWKACCEKIAKSLVCRIPMDLNCLRAVECISGVDPRSKYLRSTVAYQIL 360
Query: 361 LMCFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMW 420
L+CF+ EATNEE VLR+LTSITVKDKSCDLFKLYIYLVLTENWLVGSR EGKPL EMW
Sbjct: 361 LICFKNEATNEEEVLRVLTSITVKDKSCDLFKLYIYLVLTENWLVGSRMCEGKPLTREMW 420
Query: 421 GLFLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 458
GLFLRNCSCQI STDLRSYASKVRNKASYILQSSF+E
Sbjct: 421 GLFLRNCSCQIASTDLRSYASKVRNKASYILQSSFDE 457
BLAST of MS006864 vs. NCBI nr
Match:
XP_038895434.1 (uncharacterized protein LOC120083668 isoform X1 [Benincasa hispida])
HSP 1 Score: 812.8 bits (2098), Expect = 1.5e-231
Identity = 400/463 (86.39%), Positives = 424/463 (91.58%), Query Frame = 0
Query: 1 MEMDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYN 60
MEMDG LDFESEDPLLSSPV LKKRK IIGLDDLLTDHYKDKCKLVEKES+QAKK K Y
Sbjct: 1 MEMDGTLDFESEDPLLSSPVTLKKRKKIIGLDDLLTDHYKDKCKLVEKESKQAKKIKKYE 60
Query: 61 SDDDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKAPPTLQSPELKSCHLL 120
SDDDD GKEAVVSQVVDECQNKM+QLGGEEDTSIWGL VFGEQK P LQSPEL+SC+ L
Sbjct: 61 SDDDDLGKEAVVSQVVDECQNKMNQLGGEEDTSIWGLNVFGEQKPSPALQSPELESCNFL 120
Query: 121 RNFLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNL------M 180
+ FLNNEVNSLVNLT+E GDAFLEGLLVNGWLSTLV LTGR EK +AIWTFNL +
Sbjct: 121 QTFLNNEVNSLVNLTVETGDAFLEGLLVNGWLSTLVYLTGRAEKSLAIWTFNLIYFHIPV 180
Query: 181 LYSSREELRTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPG 240
LYSSRE+LRTSA DFWRDIML +EVEQQ LQIDWFPN+AQLGEALETYG+RF+CSLNPG
Sbjct: 181 LYSSREDLRTSAGDFWRDIMLTTNEVEQQHLQIDWFPNYAQLGEALETYGYRFQCSLNPG 240
Query: 241 FVHAGSGRGGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLS 300
F+H GS RGGPPQNIRAWIKFITICCQTKIKK+IFT SEVEQLAEAIICLFLDRQFQG++
Sbjct: 241 FIHTGSRRGGPPQNIRAWIKFITICCQTKIKKNIFTSSEVEQLAEAIICLFLDRQFQGIT 300
Query: 301 MLLCECLQSLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSA 360
+LLCECLQSLIHYFTDEEW ACCEKIAKSLVCRIP DLNCLRAVECISG D RSKYLRS
Sbjct: 301 VLLCECLQSLIHYFTDEEWNACCEKIAKSLVCRIPMDLNCLRAVECISGVDPRSKYLRST 360
Query: 361 VAYQILLMCFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKP 420
+AYQILL+CFE EA+NEE VLR+LTSITVKDKSCDLFKLYIYLVLTENWLVGS+TLEGKP
Sbjct: 361 IAYQILLICFENEASNEEEVLRVLTSITVKDKSCDLFKLYIYLVLTENWLVGSQTLEGKP 420
Query: 421 LICEMWGLFLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 458
LICEMWGLFLRNCSCQI STDLRSYASKVRNKASYILQS+F E
Sbjct: 421 LICEMWGLFLRNCSCQIASTDLRSYASKVRNKASYILQSAFEE 463
BLAST of MS006864 vs. NCBI nr
Match:
XP_023520957.1 (uncharacterized protein LOC111784515 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023520958.1 uncharacterized protein LOC111784515 isoform X2 [Cucurbita pepo subsp. pepo] >XP_023520965.1 uncharacterized protein LOC111784527 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 809.3 bits (2089), Expect = 1.7e-230
Identity = 390/457 (85.34%), Positives = 426/457 (93.22%), Query Frame = 0
Query: 1 MEMDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYN 60
MEMDGPLDFESEDPLLSSPV+LKKRK IIGLDDLLTDHYKDKCKLVEKES+QAKKKK Y+
Sbjct: 1 MEMDGPLDFESEDPLLSSPVSLKKRKKIIGLDDLLTDHYKDKCKLVEKESKQAKKKKKYD 60
Query: 61 SDDDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKAPPTLQSPELKSCHLL 120
SDD+DFGKEAVVSQVVDECQNKM+QLGGEEDTSIWGL VFGEQK PP LQSPELK+C+LL
Sbjct: 61 SDDEDFGKEAVVSQVVDECQNKMNQLGGEEDTSIWGLNVFGEQKPPPALQSPELKNCYLL 120
Query: 121 RNFLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSRE 180
+ FLNNEVNSLVNLT+EKG+AFLEGLLVNGWLSTLVS+T RVEKP+AIWTFNLMLYSSRE
Sbjct: 121 QTFLNNEVNSLVNLTVEKGEAFLEGLLVNGWLSTLVSVTNRVEKPLAIWTFNLMLYSSRE 180
Query: 181 ELRTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGS 240
EL TSA DFWRDIML +EV QQ L+I+WFP +AQLGEALETYG+RFECSL PGFVH GS
Sbjct: 181 ELSTSACDFWRDIMLTTNEVNQQHLRIEWFPTYAQLGEALETYGYRFECSLKPGFVHNGS 240
Query: 241 GRGGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCEC 300
GRGGPPQNIRAWIKF+T+CCQTK++KHIFT SEVE+LAEA++CLFLDRQF+G+++LLCEC
Sbjct: 241 GRGGPPQNIRAWIKFVTVCCQTKMRKHIFTSSEVERLAEAVLCLFLDRQFRGVTVLLCEC 300
Query: 301 LQSLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQIL 360
LQSL+ YFTDE+WKACC+ IAK+LV RIP DLNCLRAVECISG D RSKYLRS VAYQIL
Sbjct: 301 LQSLVRYFTDEDWKACCDNIAKTLVSRIPMDLNCLRAVECISGVDQRSKYLRSTVAYQIL 360
Query: 361 LMCFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMW 420
L+ FE EATNEE VLR+LTS+TVKDKSCDLFKLYIYLVLTENWLVGSR L+GKPLICEMW
Sbjct: 361 LIFFENEATNEEEVLRVLTSMTVKDKSCDLFKLYIYLVLTENWLVGSRMLDGKPLICEMW 420
Query: 421 GLFLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 458
GLFLR CSCQI STDLRSYASKVRNKASYILQSSF E
Sbjct: 421 GLFLRKCSCQIASTDLRSYASKVRNKASYILQSSFEE 457
BLAST of MS006864 vs. ExPASy TrEMBL
Match:
A0A6J1DG18 (uncharacterized protein LOC111020493 OS=Momordica charantia OX=3673 GN=LOC111020493 PE=4 SV=1)
HSP 1 Score: 925.6 bits (2391), Expect = 7.9e-266
Identity = 454/455 (99.78%), Positives = 454/455 (99.78%), Query Frame = 0
Query: 3 MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 62
MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD
Sbjct: 1 MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60
Query: 63 DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKAPPTLQSPELKSCHLLRN 122
DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQK PPTLQSPELKSCHLLRN
Sbjct: 61 DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120
Query: 123 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 182
FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL
Sbjct: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 180
Query: 183 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR 242
RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR
Sbjct: 181 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR 240
Query: 243 GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ 302
GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ
Sbjct: 241 GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ 300
Query: 303 SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM 362
SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM
Sbjct: 301 SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM 360
Query: 363 CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL 422
CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL
Sbjct: 361 CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL 420
Query: 423 FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 458
FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE
Sbjct: 421 FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 455
BLAST of MS006864 vs. ExPASy TrEMBL
Match:
A0A0A0LVC0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G095530 PE=4 SV=1)
HSP 1 Score: 816.6 bits (2108), Expect = 5.2e-233
Identity = 397/457 (86.87%), Positives = 425/457 (93.00%), Query Frame = 0
Query: 1 MEMDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYN 60
MEMD PLDFESEDPLLSSPVALKKRK IIGLDDLLTDHYKDKCKLVEKES+ AKK+KNY+
Sbjct: 1 MEMDAPLDFESEDPLLSSPVALKKRKKIIGLDDLLTDHYKDKCKLVEKESKLAKKRKNYD 60
Query: 61 SDDDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKAPPTLQSPELKSCHLL 120
SDDDDFGKEAVVSQVVDECQNKM+QLGGEEDTSIWGL VFGEQK PP LQ+PEL+SC L
Sbjct: 61 SDDDDFGKEAVVSQVVDECQNKMNQLGGEEDTSIWGLNVFGEQKPPPALQTPELESCQFL 120
Query: 121 RNFLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSRE 180
+ FLNNEVNSLVNLT+EKGD FLEGLLVNGWLSTLVSLTG VEK +AIWTFNLMLYSSRE
Sbjct: 121 QTFLNNEVNSLVNLTVEKGDVFLEGLLVNGWLSTLVSLTGHVEKSLAIWTFNLMLYSSRE 180
Query: 181 ELRTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGS 240
LRTSA DFW+DIML +EVEQQ LQ+DWFP++AQLGEAL+TYG+RFECSLNPG +H GS
Sbjct: 181 GLRTSACDFWKDIMLTTNEVEQQHLQVDWFPSYAQLGEALDTYGYRFECSLNPGLIHTGS 240
Query: 241 GRGGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCEC 300
GRGGPPQNIRAWIKFITICCQTK+KK+IFT SEV +LAEAIICLFLDRQFQG+++LLCEC
Sbjct: 241 GRGGPPQNIRAWIKFITICCQTKVKKNIFTSSEVGRLAEAIICLFLDRQFQGITVLLCEC 300
Query: 301 LQSLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQIL 360
LQSLIHYFTDE+WKACCEKIAKSLVCRIP DLNCLRAVECISG D RSKYLRS VAYQIL
Sbjct: 301 LQSLIHYFTDEDWKACCEKIAKSLVCRIPMDLNCLRAVECISGVDPRSKYLRSTVAYQIL 360
Query: 361 LMCFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMW 420
L+CF+ EATNEE VLR+LTSITVKDKSCDLFKLYIYLVLTENWLVGSR EGKPL EMW
Sbjct: 361 LICFKNEATNEEEVLRVLTSITVKDKSCDLFKLYIYLVLTENWLVGSRMCEGKPLTREMW 420
Query: 421 GLFLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 458
GLFLRNCSCQI STDLRSYASKVRNKASYILQSSF+E
Sbjct: 421 GLFLRNCSCQIASTDLRSYASKVRNKASYILQSSFDE 457
BLAST of MS006864 vs. ExPASy TrEMBL
Match:
A0A5D3BL50 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold2042G00060 PE=4 SV=1)
HSP 1 Score: 803.9 bits (2075), Expect = 3.5e-229
Identity = 394/457 (86.21%), Positives = 421/457 (92.12%), Query Frame = 0
Query: 1 MEMDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYN 60
MEMD PLDFESEDPLLSSPVALKKRK IIGLDDLLTDHYKDKCKLVEKE++ AKKKK Y+
Sbjct: 1 MEMDAPLDFESEDPLLSSPVALKKRKKIIGLDDLLTDHYKDKCKLVEKEAKLAKKKKKYD 60
Query: 61 SDDDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKAPPTLQSPELKSCHLL 120
SDDDDFGKEAVV+QVVD+CQNKM+QLGGEEDTSIWGL VFGEQK PP L+SPEL+SC L
Sbjct: 61 SDDDDFGKEAVVTQVVDQCQNKMNQLGGEEDTSIWGLVVFGEQKPPPALESPELESCQFL 120
Query: 121 RNFLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSRE 180
+ FLN+EVNSLVNLT+EKGDAFLEGLLVNGWLSTLVSLTGR EK +AIWTFNLMLYSSRE
Sbjct: 121 QTFLNSEVNSLVNLTVEKGDAFLEGLLVNGWLSTLVSLTGRAEKSLAIWTFNLMLYSSRE 180
Query: 181 ELRTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGS 240
+LRTSA DFW+DIML +EV Q LQIDWFPN+AQLGEALETYG+RFECSLNPG +H GS
Sbjct: 181 DLRTSACDFWKDIMLTTNEVGQH-LQIDWFPNYAQLGEALETYGYRFECSLNPGLIHTGS 240
Query: 241 GRGGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCEC 300
GRGGPPQNIRAWIKFITICCQTK KK+IFT SEVE+LAEAIICLFLDRQFQG++ LLCEC
Sbjct: 241 GRGGPPQNIRAWIKFITICCQTKNKKNIFTSSEVERLAEAIICLFLDRQFQGITELLCEC 300
Query: 301 LQSLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQIL 360
LQSLIHYFTDEEW ACCEKIAKSLVCRIP DLNCLRAVECISG D RSKYLRS VAYQIL
Sbjct: 301 LQSLIHYFTDEEWNACCEKIAKSLVCRIPMDLNCLRAVECISGVDPRSKYLRSTVAYQIL 360
Query: 361 LMCFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMW 420
L CF+ EATNEE VLR++TSITVKDK+CDLFKLYIYLVLTENWLVG RT EGKPL EMW
Sbjct: 361 LFCFKKEATNEEEVLRVVTSITVKDKNCDLFKLYIYLVLTENWLVGGRTCEGKPLTREMW 420
Query: 421 GLFLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 458
GLFLRNCSCQI STDLRSYASKVRNKASYILQSSF+E
Sbjct: 421 GLFLRNCSCQIASTDLRSYASKVRNKASYILQSSFDE 456
BLAST of MS006864 vs. ExPASy TrEMBL
Match:
A0A1S3CJ26 (uncharacterized protein LOC103501522 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103501522 PE=4 SV=1)
HSP 1 Score: 803.9 bits (2075), Expect = 3.5e-229
Identity = 394/457 (86.21%), Positives = 421/457 (92.12%), Query Frame = 0
Query: 1 MEMDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYN 60
MEMD PLDFESEDPLLSSPVALKKRK IIGLDDLLTDHYKDKCKLVEKE++ AKKKK Y+
Sbjct: 1 MEMDAPLDFESEDPLLSSPVALKKRKKIIGLDDLLTDHYKDKCKLVEKEAKLAKKKKKYD 60
Query: 61 SDDDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKAPPTLQSPELKSCHLL 120
SDDDDFGKEAVV+QVVD+CQNKM+QLGGEEDTSIWGL VFGEQK PP L+SPEL+SC L
Sbjct: 61 SDDDDFGKEAVVTQVVDQCQNKMNQLGGEEDTSIWGLVVFGEQKPPPALESPELESCQFL 120
Query: 121 RNFLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSRE 180
+ FLN+EVNSLVNLT+EKGDAFLEGLLVNGWLSTLVSLTGR EK +AIWTFNLMLYSSRE
Sbjct: 121 QTFLNSEVNSLVNLTVEKGDAFLEGLLVNGWLSTLVSLTGRAEKSLAIWTFNLMLYSSRE 180
Query: 181 ELRTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGS 240
+LRTSA DFW+DIML +EV Q LQIDWFPN+AQLGEALETYG+RFECSLNPG +H GS
Sbjct: 181 DLRTSACDFWKDIMLTTNEVGQH-LQIDWFPNYAQLGEALETYGYRFECSLNPGLIHTGS 240
Query: 241 GRGGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCEC 300
GRGGPPQNIRAWIKFITICCQTK KK+IFT SEVE+LAEAIICLFLDRQFQG++ LLCEC
Sbjct: 241 GRGGPPQNIRAWIKFITICCQTKNKKNIFTSSEVERLAEAIICLFLDRQFQGITELLCEC 300
Query: 301 LQSLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQIL 360
LQSLIHYFTDEEW ACCEKIAKSLVCRIP DLNCLRAVECISG D RSKYLRS VAYQIL
Sbjct: 301 LQSLIHYFTDEEWNACCEKIAKSLVCRIPMDLNCLRAVECISGVDPRSKYLRSTVAYQIL 360
Query: 361 LMCFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMW 420
L CF+ EATNEE VLR++TSITVKDK+CDLFKLYIYLVLTENWLVG RT EGKPL EMW
Sbjct: 361 LFCFKKEATNEEEVLRVVTSITVKDKNCDLFKLYIYLVLTENWLVGGRTCEGKPLTREMW 420
Query: 421 GLFLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 458
GLFLRNCSCQI STDLRSYASKVRNKASYILQSSF+E
Sbjct: 421 GLFLRNCSCQIASTDLRSYASKVRNKASYILQSSFDE 456
BLAST of MS006864 vs. ExPASy TrEMBL
Match:
A0A6J1HJZ9 (uncharacterized protein LOC111464803 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111464803 PE=4 SV=1)
HSP 1 Score: 801.6 bits (2069), Expect = 1.7e-228
Identity = 387/457 (84.68%), Positives = 424/457 (92.78%), Query Frame = 0
Query: 1 MEMDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYN 60
MEMDGPLDFESEDPLLSSPV+LKKRK IIGLDDLLTDHYKDKCKLVEKES+QAKKKK Y+
Sbjct: 1 MEMDGPLDFESEDPLLSSPVSLKKRKKIIGLDDLLTDHYKDKCKLVEKESKQAKKKKKYD 60
Query: 61 SDDDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKAPPTLQSPELKSCHLL 120
SDD+DFGKEAVVSQVVDECQNKM+QLGGEEDTSIWGL VF EQK PP LQSPELK+C+LL
Sbjct: 61 SDDEDFGKEAVVSQVVDECQNKMNQLGGEEDTSIWGLNVFREQKPPPALQSPELKNCYLL 120
Query: 121 RNFLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSRE 180
+ FLNNEVNSLVNLT+EKG+AFLEGLLVNGWLSTLVS+T RVEKP+AIWTFNLMLYSSRE
Sbjct: 121 QAFLNNEVNSLVNLTVEKGEAFLEGLLVNGWLSTLVSVTNRVEKPLAIWTFNLMLYSSRE 180
Query: 181 ELRTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGS 240
EL TSA DFWRDIML ++V QQ L+I+WFP +AQLGEALETYG+RFECSL PGFVH GS
Sbjct: 181 ELSTSACDFWRDIMLTTNKVNQQHLRIEWFPTYAQLGEALETYGYRFECSLKPGFVHNGS 240
Query: 241 GRGGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCEC 300
GRGGPPQNIRAWIKF+T+CCQTK+KK+IFT SEVE+LAEAI+CLFLDRQF+G+++LLCEC
Sbjct: 241 GRGGPPQNIRAWIKFVTVCCQTKMKKYIFTSSEVERLAEAILCLFLDRQFRGVTVLLCEC 300
Query: 301 LQSLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQIL 360
LQSL+HYFTDE+WKACC+ IAK+LV RIP DLNCLRAVECISG D RSKYLRS VAYQIL
Sbjct: 301 LQSLVHYFTDEDWKACCDNIAKALVSRIPMDLNCLRAVECISGVDQRSKYLRSTVAYQIL 360
Query: 361 LMCFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMW 420
L+ FE E TNEE VLR+LTS+TVKD+SCDLFKLYIYLVLTENWLVGSR L+GKPLICEMW
Sbjct: 361 LIFFENEGTNEEEVLRVLTSMTVKDRSCDLFKLYIYLVLTENWLVGSRMLDGKPLICEMW 420
Query: 421 GLFLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 458
GLFLR CSCQI STDLRSYASKVRNKASYILQS F E
Sbjct: 421 GLFLRKCSCQIASTDLRSYASKVRNKASYILQSCFEE 457
BLAST of MS006864 vs. TAIR 10
Match:
AT2G28130.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 13 growth stages; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink).)
HSP 1 Score: 497.7 bits (1280), Expect = 1.0e-140
Identity = 241/454 (53.08%), Positives = 335/454 (73.79%), Query Frame = 0
Query: 1 MEMDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYN 60
M++DGPLDFE+EDPL++ P ++KRK +IGLDDLL+D YK+K K+++K +++ K K Y+
Sbjct: 1 MDLDGPLDFENEDPLVNPPTIIEKRKKVIGLDDLLSDFYKEKSKVIDKVNKKRKVSKVYH 60
Query: 61 SDDDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKAPPTLQSPELKSCHLL 120
SDDD+ G+ +SQ V ECQN+M+++ EE+ WGL +FG+QK P +L SC LL
Sbjct: 61 SDDDEQGQVDKLSQCVVECQNQMNEIADEEENQEWGLSMFGDQKTPIPSLLVDLDSCCLL 120
Query: 121 RNFLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSRE 180
+ F+NN++N +V LT+++G F+EGLLVNGWL+ L+ GRVEK I WT N++LYSS+E
Sbjct: 121 KEFMNNQLNLVVGLTVDEGTTFIEGLLVNGWLTRLIMTCGRVEKFICKWTLNILLYSSKE 180
Query: 181 ELRTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPG--FVHA 240
+LR+SA DFW I+L +++V ++I W PN+ +L EALE+YGFR S + A
Sbjct: 181 DLRSSACDFWCSILLSQNKVNGASVEIYWLPNYQELKEALESYGFRISLSHSQDVELAEA 240
Query: 241 GSGRGGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLC 300
S GPPQNIRAW+ +T CCQ + KK IFT S+VEQ+AE ++ L LDR GLS+LL
Sbjct: 241 DSECQGPPQNIRAWLTLVTTCCQFRCKKPIFTTSQVEQIAEILVSLLLDRSLLGLSILLQ 300
Query: 301 ECLQSLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQ 360
ECL S+I F +EEW + C+KIA SL R+P+D+NCLR VE +SG DARSK+LRS++A+Q
Sbjct: 301 ECLISVIGSFKEEEWISSCKKIANSLASRVPQDINCLRVVESVSGVDARSKHLRSSIAHQ 360
Query: 361 ILLMCFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICE 420
+L++ + ++E +L L SI VK++SC+LFK+YI+LVL ENWL S +E KP++ +
Sbjct: 361 MLVVLLD-HKDSDENLLSSLMSINVKERSCNLFKMYIFLVLAENWLFSSTLVEAKPVLRD 420
Query: 421 MWGLFLRNCSCQITSTDLRSYASKVRNKASYILQ 453
MW +FLRNCSCQI STDLRSYASKVR +A+Y+LQ
Sbjct: 421 MWAVFLRNCSCQINSTDLRSYASKVRTRAAYLLQ 453
The following BLAST results are available for this feature:
BLAST of MS006864 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
XP_022152868.1 | 1.6e-265 | 99.78 | uncharacterized protein LOC111020493 [Momordica charantia]
XP_038895435.1 | 5.7e-234 | 87.75 | uncharacterized protein LOC120083668 isoform X2 [Benincasa hispida]
XP_011653682.1 | 1.1e-232 | 86.87 | uncharacterized protein LOC101216339 isoform X1 [Cucumis sativus] >KGN64767.1 hy...
XP_038895434.1 | 1.5e-231 | 86.39 | uncharacterized protein LOC120083668 isoform X1 [Benincasa hispida]
XP_023520957.1 | 1.7e-230 | 85.34 | uncharacterized protein LOC111784515 isoform X1 [Cucurbita pepo subsp. pepo] >XP...
BLAST of MS006864 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1DG18 | 7.9e-266 | 99.78 | uncharacterized protein LOC111020493 OS=Momordica charantia OX=3673 GN=LOC111020...
A0A0A0LVC0 | 5.2e-233 | 86.87 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G095530 PE=4 SV=1
A0A5D3BL50 | 3.5e-229 | 86.21 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3CJ26 | 3.5e-229 | 86.21 | uncharacterized protein LOC103501522 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1HJZ9 | 1.7e-228 | 84.68 | uncharacterized protein LOC111464803 isoform X1 OS=Cucurbita moschata OX=3662 GN...
BLAST of MS006864 vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT2G28130.1 | 1.0e-140 | 53.08 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
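The Identity and Positives percentages reported for each HSP above are the match count divided by the alignment length, rounded to two decimals (for example 454/455 gives 99.78). The following is a minimal sketch of that arithmetic, assuming the statistics line follows the `Label = matches/length (percent%)` layout shown in the reports; the function name `parse_hsp_stats` is hypothetical.

```python
import re


def parse_hsp_stats(line):
    """Recompute percentages from an HSP statistics line (hypothetical helper).

    Every "Label = matches/length" pair found in the line is converted to
    a percentage rounded to two decimals. Fields without a fraction, such
    as "Query Frame = 0", are skipped.
    """
    stats = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        stats[label] = round(100 * int(matches) / int(length), 2)
    return stats


line = "Identity = 241/454 (53.08%), Positives = 335/454 (73.79%), Query Frame = 0"
print(parse_hsp_stats(line))  # {'Identity': 53.08, 'Positives': 73.79}
```

Recomputing the values this way is a quick consistency check between the HSP reports and the Identity column of the summary tables.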