Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGATGGTCCTTTAGATTTCGAGTCAGAAGACCCGCTTCTCAGTTCCCCTGTCGCTCTCAAGAAGAGGTCTCAATATCTCAGAATCCTCTATTTCGTGTTTTTCCTTTCCAAACCTAAGCCCATATGGGGGCTTAGGACATATGTTTCAATTTTCTTCTTGCTCTACTTTCACTGGGGTTTGTCTGATTCGGTTTGTTCCAAGGATTGTTCCGTGAATATTTCTTGTAAATTGGAAGTTTCATATTATCTCTCTCTTATATTATTGTCTTGTTTCTCTACTGTTTTCCGTTCAAATCTGTGAACTGAAGCTTAAAAATCTGTAGCAGTTCGACAGTAGCATAAATTTTATATAAAGGGAGCTAAGAAGTAGCCATTGTTGTGGTTGTGCCTTCTTTGCAACTTTGATTTGTTAGTTTTAGTAAACGTGAAGTTCTACCTCTCTGAGTCCGAGTACAAATGCTTCTGGAACAAGTGGTTTGAGAGGCTGTTGACGTCTATTTTTTATACCAATCAGGAAAATGATTATCGGGTTAGATGATCTTCTGACTGATCACTACAAGGATAAATGCAAACTCGTTGAGAAAGAATCTAGACAGGCAAAGAAGAAAAAGAACTATAATTCGGATGATGATGATTTTGGCAAAGAGGCTGTGGTGTCCCAAGTTGTTGATGAATGCCAAAATAAGGTAAGAAATGCCTAGGAGCAGAATTTTTCTGGAGTTTTCTTAGCCTTCCTTCACTTATGATCTAAGAATGTTTTGTGTAGATGAGCCAACTAGGGGGTGAGGAAGATACATCCATATGGGGCTTAAAGGTTTTTGGAGAACAGGTTTGCAGTGTAACCTTACAGCTTCCGCAATGCGTTTGTCCTGAATGCCTTTATTCAAGAGGATTTCTCTGTGATTAACAAATTGGTTGCATTTCTGTGCATGTAAATTGCAGAAAACCCCACCAACTTTACAAAGTCCTGAACTCAAAAGTTGCCACCTTTTGCGAAATTTCCTAAATAATGAAGTTAACTCTTTGGTGAATCTCACCATGGAGAAAGGTTTGCTTTTCCTTCATGTTTGAATTCTCTGAATCTCCCTCATCATTAAGTAGTAATACTTTTATAGTTTTTCTTAATTGAGTTATGATATAACCTTTATTATTGATGATATTATGTAATGACTACTGAAGTTTCAAGTTGAAATTATTTTTACCCTAAACCTTATGAACTGTTGCTACATCAATGAAATTGTTTCTTTCACAAAAAAAAAAAACCTTATGAACTGTTGCATTGCTACATTTTTTTTAGTTTTTGCCTGGTTTTAAGTTTAATTGATTGACTTATGGCTACTCAATGGCAAAAATATGGCAATTTGAACAATAAATTTAAAGCTTGTTGTTTGCATATACTACATAATCATTAAAAAGAGCAATAAGAGAGATTAAAAAACGAGAATGAACAACACCATGCATTTTGAATTCTGTTTGTCTTTTCTCTTGAAATATAGGAAAATACTAATTCTCACTGAATCCAGTTGTCTGACATATGCCCCTATATTGTCTTTATCATAGAGAATAGGAAGGTGGGATGATGAAGATGCCAACCACTTGGTGAACTGGAAACTAGCTTCTTTAGCTCCCATTTGGGGGGGGGGGGGGGTGGGTTGTTGTTGTGGGGGAAGACCAAACTATCTATTTGGCACTCTATTGTTATTAAATAGATTATAGAGCTTGAATTGTTTGTTTGCGCAGAGTAGGAAGAAAAACGTGTATTAGTTGACTATTTCTGTATTCTGTCCTTACAACTTTCTGGAAGTGTAAGATCATATTTGTAAGAGATAGGTGATTTGGAGGTTCTGTCTTTAGCTAACTTTTACCCTACGTCCCTACCTTGTATGCTCTTTGCCAAGCTTAGCAGTCAGCTACCAAAGCCGTGTCTGATAGTTTCTTGCAGTGCTGAATTCCCATTTCTGGGAGGAAATTATAAGAGAATGGAAATTCAGGATTGTT
TGTTGCCTCTTTCTTATTTCTTCTTTAGATGGTATCATACTATTTTTATGGAATCAGAACTCTATATGTTGATGCCTCAGGCTCTGAGATACATTCACTTGTTTTATCTGAAAGCTCTTGCAACAAAGGATTCAAGGGCCTATATCGGTGTTCTCCTAAATTGGATGTGGAGGTTTAAAACTCCAATGAAGTTCCAATTCTTTGCGAGAGTTGTTTCTTAGGAGGCTGAATTCTATTAATAAATTTCAGAGAATGCACCCTTTCTTCTATCTCTATTATCATTTGTGCGTCCTTCGAGAGAAGGCAACTGTAAACTTGGACCATGTTTTTGTGCACTGCCCTCTCGTGAGGAAGTTGTGGGAGAATTTTTAGGAGATGCTTAGACTTTCTTGGGAGCCCTTCCTTTGGTTGAAAAAAATTCAGATGATATGAGGTCCTTTGTTTTTTGTCTAAAAAAAGATGATGTACAAGTCTCACCTTATGGGTGGTTTGGGACGTAGAACTTGAGGATATAGTGTGAGGCTTCAAATAAGGTCGGAGATAGAGTTATTGTTTTGCAGCAGTTATTCTTTTTAACACCCAACAGAGTTCTATAATTTATCAATAATTTTGTAATCCACCAGCTTATAAAGTTCTCTGTAATATTCTAGTCCCTTGGTGAGGTGGAACCCCTGTTCTTTTATCCTTATTTATACATTTTTGTAGCAATTCATCTATTTTTTCTTCCAGAAAAGGAGTAGGGAAAAGGAAAACATGAAGTCCACTTAAAAAAAAAAAAATAATGATAGCAGTGGTTCATTTATTTATGCATTTTAAAATGTTGATAAGTTAGTACCTCGAAGCTGATTTTGGTATCATGAATTTTCTAAAAATCTAAAACCTTTCGCAGACAAATTAAATAAGTTGTCAATATATTTATTTGATATGTTATTGCAGGTGATGCTTTCCTTGAAGGTTTATTGGTAAATGGCTGGCTGTCAACACTTGTTTCTTTGACAGGTCGTGTAGAAAAACCAATTGCCATATGGACCTTCAATTTAAGTGAGTACATCTATAATTCCAGTCTTGTTCATGGAACTCTTTACATTAGCCTTAATTCTTTTTTATGCGTTTATTTTCATATTTCAGTGTTATATTCATCAAGAGAAGAGCTAAGAACATCAGCTTTTGATTTCTGGAGGGATATCATGTTACCTAAACATGAGGTTTCTGTTTTTTCTTTGACCACATGTTTCTTTTTATTTCTTCAACGTTTATTAATTTTTCTTAAGGCTTATCTAGAATTCTTAACTAGGTTGAACAACAGCCTCTCCAAATTGATTGGTTTCCTAACCATGCTCAATTAGGAGAAGCTCTTGAGACTTATGGATTTAGATTTGAGTGCTCATTAAATCCTGGATTTGTCCACGCTGGTAAACTTCCTTGACATCAAACTACACATTCATGAGAAAATATATGCTATACTATGAAGGGTGATTTGTTCGCTGAAGGTTATTTGATTTTTAGATACGCAAACTCAAGGTTTCGCAAAAGTGTAAATCCTTTTAATAGTTATAGTTTAAATTCCCTTATATCCAACTAGAGAAGTTTTTTGTAACTTCATTGGATAGGCGCTTTTTTTTTTTTTTTTTTTTTTAATATATTTCACATTATCAATGAAATTTGATTCCTATCCCAAAAAAAAAAAAAAAAAAAACTCAAGGTTTCGCACAAGATTTAATTCTTCCTCTATAATATGAAATCATCATAAAGTGCTGTCTCTTTAAGCACGTAGCACGTGGCATGTGTTAAAAAATCTTTGGAAATTGGAATGATAAGGCCTTTTTGGGTTTCTGATGTATGACTTTCATTTTGCTGCATTGATATTGTGTTTTAATTTACATTATGAAAACATTTTGTTAGTATGATTTTTAATTGGAGATAAGAGATTGCCTTAGCTTTGAACTTTCAATTTCACCAGATTGGACTCTAAACTAGAATAAGTAGTACAATTAATACCT
CCTCATTCCTCAAAAGAATATAAGTTGTTTCTAAACTAACTAATAAATCACTCAAATGCTCAAAATTAAGCCATAAAAATACCTAAATCCAACCACACAAATATTCAAATCTTGCCTCAAAATCCTGCCAGGTTAAAAATGCTCAAACTCAATGCTGAAAATGTCCAAATCTAATTAAAAAGCACAAACACACCCTGAACAATCAATATCCACAAAAATAACCTCATCCAAGTACAATGATGCTCTTACATCACAGTTCTTTAAAATAAATAAAATTCAACTATTAGAGAGTATTAATTTCACTTAAGTTCTATGAATAAAAATTTTAAGACAAAATTTGGCTCACGATCTCAAAATGAACTTGTTCCTTCTTTGTATCAACCAATATTCATAATGGGTTTATTGGTTTAAATCATGAATGTTAATCCATTTGATTCACCACTACTATGCATTGACTTAGATGCAGCACCACATGGATGACAGCATCCAAATTTCTGAGTTGGGCTACAGAGGGGTGGATAGTAAATTATAGAGATTGATTATCCTTTTCAGTGATATCAGAGTAACTATGATCTGAAGTTTTTATAGAGTTCCTAATTTCTCCTACTCAATCTGTATCCAGTTTTTCCGACCACCAGCATACTCCTTGCCCTGTGTAAATCCTTGTTACTCATCATGACCTCAGCGTCCACGCCTCAACTGCATAACATCCTTGCATTCCTCATTTACCCTTTCCCTATTATTGTGTGAGTACATTTATGTAAGGTATCCCCTTACTTTTATTGATATTCAAGAACAAGAACAATATAATCAACCCAACCAATTTGAGAGGGCCCAAGTCTCCAAGAAAGCACTATGCTATTCTTCCTTACTAAGAAATTACCTACCTCCCCCAAGGGAGCATTCTCCCTTTTAACAAAACTACTCCATACGCCCAAATTGGGAGACGGAGTCCATTTATAAGGAGCTAATGCCCCATTACCTACCTCCCCCAAGGGAACATTCTCCCCTTTAACAAAACTATTCCATACGCCCAAATGGGCCCCACTCTTCTAACTAACTGGAAGTACCCCTTCTACCCCTCCTAGTATATGTCTTCACCACTGGGGTCTAACAACTTATGCTCGAGGGCCTATTGGATACCTTGTCCCAATGAAGTAACCTTATCCTCAAGGTGAGGTTTGGAAACTGAGTTTGTAAATCCTTAATAAGCTTCCATCATTTGAAGGCTAGCTGGGTTCGACTTATTTGATCAATTAGTCTACCTTAAAATTTTTTTATGATTGATACCTGATTCTAGTTAAAAGATGAAAAAAACAAGGGCTTCTTTCAAGACATTGCTCAAAAATAAACAAGGGCTTCTTTGGAACTTAGATATTAAATTAAAGAAACAAGGGCTTCTTTTAAGACATTGTTAAAAATTAAATACAATAAAGCTAGTTTCTTCTCATCCAAGAAAAGGGAAAAAAAAGAAGGCTTAAAATCATATGCAAGGTGAAGTGCCAGGGAGGGGTCATACTCAAGGGGTGCTTTTGTTCAATGCTTGATAATAAAAGTTATTTTTTATTGAAAATTGACTCTTGCGAAAATTGATGTTAAATTTTCGGAGCTCAAGAAACTCCATGAATTAATGTTTTGCTTATGAGTTCTGACCGTATGCACTCCAATGTTATGTGCTTGGAAAATGAGAAATTACTACTTTCTGAAATTAATTAGGGACTATGCTTATGAGAACAAGTCTTTCCATTCATTTGTTATTTTATGTATTTTGTTTTTTATTCCGCTTGTGTATGAGTTACTATATTTTAGAATTGTCATATAAAGTCTGACTTCAATAGCCATGTAGAAACCCCCCAAGCCCTTGCCATTTGAGTTAGTTGCTTTACTCTTTCTTACACAAGTTATGGAATATTGTTAGGTTCTGGACGTGGAGGGCCACCTCAGAATATAAGGGCTTGGATCAAATTTATTACTATTTGTTGTCAAACAAAGTAAGAAGT
CCTTTTTTCTTTTCTTTTGAAAAATGTTTGTGAGTTTAACTCTTTTGCCTTTGGCATTCGCTAATGTTTTCAATCTTGAAACTTAATGTGGCTACTGTAAAAGTAATTTTCTTCTCAACATATGATTTTACTTGCACAATACCAATAAAAAAACAAGTGTGGCTTCATGTTTTGTTTTTTTTTTTTTAAATCTTTGCAGGATTAAAAAGCATATATTTACATTCTCCGAAGTTGAGCAACTAGCTGAAGCCATTATTTGTTTATTTCTAGACCGCCAGTTTCAAGGCTTATCCATGCTCTTATGTGAATGCTTACAATCGCTTATCCATTACTTCACAGATGAAGAATGGAAGGCATGTTGTGAGAAAATAGCAAAATCTCTTGTTTGCAGGTATTGGACTCTCCCTAGTGATGTAAAGTGTATACATTATATAAATTAAAGGTGTAAGTAATTCGGTTCAGTATTTTTATTTTTATTTTTTTTGTTAAAACTGACATTGATTTTAAATGACCAAACTAATAAAATCTATCTAATCCAAACTGACTGTTTTCTAATTTTATACTTCAGAAAACTGAACTAACTGGCAAGTATAATTAAAAAAACTGAACTGAGTGACTTGGTTCTTTTAGTTTCGTCGATTTTATTTTATTTAATTAATTAATTAATTTTTTTTCTTAATTTTGGCTTGCACCCAACACCACTGATACAAATGTGATTAAACATATTTACAATCAATCATATCATGACTACTTTTTTTTTTTCCTGCCTCTGCTCTGCTATTGCAGGATTCCTAGGGATCTAAATTGCTTGCGAGCTGTGGAATGCATTTCTGGAGCTGATGCCCGTAGTAAATATCTGAGGAGTGCAGTTGCCTATCAAATTCTTCTTATGTGCTTTGAAACTGAGGTGGTCCTTTCCTTTTTTTTCGTTGGCTAAATATTCAGAGTTTTCTTTTGAAGCATAGCTTTGTTTACTGCTCCATAACACTCATCTTTCCTGCAAGATTATGTGTATCACATGCATATCATTTTCATATATGCTTGTTGAAATTTTTTAAGACAGTAGATTATTATCATATCCCATTTACAATATTCTGGTATGTTGTTTGAGGTGTCCTTCTATTCTGGAGAGGAATCATAGATTGTTCGGGGGATGGAGAAATCTGTAGATTATGTTTGGGATATCGTTAGATTTAACACTTCTTTATGGGGCTCGGTTTCTAAGCCCATTTGTAATTATTCGCTAGGCCTTATTCTTTTGGATTGGAGTCCTTTTCTACAGGGGATCTCCTCTTTCAGGGCTGTTTTTTTTTATGCCCTTTTGTAGGCTTTTTTGTAGCTTTTTTTTTGGATGCCCTTTTGTAGGCTTTTTGTAGCTCTGGTATCTTTTTCATTATCCTCATAAAAAAAACCCCTTCGAAGGGTTTTTCGTGTAGCTCCTATTTCCATATCTTGAACTCCCCGTCTCCTTCTTCCCCTTCGACCCCTTGATTTACCTCTCTTTGAAAGATTAAAATTCCGAAGAAGATCAAGTTCTTTGGGTGGCAAGTCTTACTTGGGAAAGTCAATACCATGGATTATATCCAGAGGATGTCTTCCTTTTGTCTGGGCCCGCTTTGGTGTGTTCTCTGTAGGAGTGCCTCGGTAGATCTCGACCATTTGTTGTGGACTTGTCAGTTCCCGCAGGATTTATGGTTTCATTTCTTTAGGTGTTTTGGGATGACCTGGGTTTGTACAAGGGATTGTAGGGCGATGATGAAGGATTTGCTGTTATTCCCACCTTTTCGTGACAGGCGTCGTTTTTTGTGGCAGACTTGTTTTCTAGCTATTTTATGGGGTATTTTCTAGCTATTTTATGGGGTATTTGGTTGGATAGGAATAATAGACTGTTTAGGGGATGGAGAAATCTGTGGACTATGTGTGGGAACTCATTAGATTTAATTCTTCGCTTTGGGGCTCGGTTTATAAGGCCTTTTGTAATTATCCGTTAGGCATCATTC
TTTTGGATTGGAGCCATTTTCTTTAGAAGATTCCTCTTTCAAGGTTGTCTTTTTGTAGGCCCTTTTTGTTCGTCTTGTATCTCTTTTCATTTTTCTCAATGAAAGCGTAGTTTCTTACGAAAAAAGAACAATAGCATGACTATTTGTATCACAGTCAAGGTTAGGTTGAAAGTGCATATGTTAATCTTGGAACAAATGTCAATTAGCCCGCTATTTATTTTTTGGTTCATACTTTTATTTTGTCTCAAATGTGGTTTAAATTTTATTTTTGTCTTTAAACTTTTTGTCTCTCAACTTCAAAGGTTATATTTTGGTTCCTTCACACATTTATTTTGTTCGAGTCTCTCAACTTTTAAATGTTACGTTTTAGTTTCTCAACTTTTATACTTCTGTTTTGTTTTAGCCTCACATCATGTTATGTTTTAATTCCTCAACTTTCTAATGATATGTTTGGCTCTAAACATTGCATAAAATTTTTTGTTAATCCTTACCTCAAATTTTCATCAATCATTTAACAAGAATTTAAGTTTGTTCCAAAACCTGATATTTAACCTACTATATATGGTTTTAAATATTTATCTATGTATACATTTATTGATCAGACAAAGAAACAAAAGATGTATCTATGATTCTACATTAAGTTTCTTTGAAATAATTTATAAAAGGCTTAAAGTTCAAGAACTGAATGAAATAATTATGAAAGTTGAGAGAATCAATATTATGTAAATCAACACAACAACCTTCTAACTTCACATTATCTTTCAGGCTACTAATGAGGAAGGGGTTTTGAGATTACTCACCTCAATCACAGTTAAAGACAAAAGCTGCGACCTATTTAAGCTGTATATTTACTTGGTGTTGACGGAGAACTGGCTTGTCGGTAGTCGAACGTTAGAAGGCAAGCCGCTAATTTGTGAAATGTGGGGTCTGTTTCTCAGAAACTGCTCCTGCCAAATCACCAGTACAGATTTGAGGTCCTACGCATCAAAGGTGTGAAAAACTCTTCACAATCTTTAACTTAGATTGATAATCCATCACTTGAGGGTGATGTTTCTCAAATCAAAACTCTCTCTCGTCACACTTTCAGGTTCGTAACAAAGCTTCATATATCCTGCAAAGCTCTTTTAACGAATAG
mRNA sequence
ATGGATGGTCCTTTAGATTTCGAGTCAGAAGACCCGCTTCTCAGTTCCCCTGTCGCTCTCAAGAAGAGGAAAATGATTATCGGGTTAGATGATCTTCTGACTGATCACTACAAGGATAAATGCAAACTCGTTGAGAAAGAATCTAGACAGGCAAAGAAGAAAAAGAACTATAATTCGGATGATGATGATTTTGGCAAAGAGGCTGTGGTGTCCCAAGTTGTTGATGAATGCCAAAATAAGATGAGCCAACTAGGGGGTGAGGAAGATACATCCATATGGGGCTTAAAGGTTTTTGGAGAACAGAAAACCCCACCAACTTTACAAAGTCCTGAACTCAAAAGTTGCCACCTTTTGCGAAATTTCCTAAATAATGAAGTTAACTCTTTGGTGAATCTCACCATGGAGAAAGGTGATGCTTTCCTTGAAGGTTTATTGGTAAATGGCTGGCTGTCAACACTTGTTTCTTTGACAGGTCGTGTAGAAAAACCAATTGCCATATGGACCTTCAATTTAATGTTATATTCATCAAGAGAAGAGCTAAGAACATCAGCTTTTGATTTCTGGAGGGATATCATGTTACCTAAACATGAGGTTGAACAACAGCCTCTCCAAATTGATTGGTTTCCTAACCATGCTCAATTAGGAGAAGCTCTTGAGACTTATGGATTTAGATTTGAGTGCTCATTAAATCCTGGATTTGTCCACGCTGGTTCTGGACGTGGAGGGCCACCTCAGAATATAAGGGCTTGGATCAAATTTATTACTATTTGTTGTCAAACAAAGATTAAAAAGCATATATTTACATTCTCCGAAGTTGAGCAACTAGCTGAAGCCATTATTTGTTTATTTCTAGACCGCCAGTTTCAAGGCTTATCCATGCTCTTATGTGAATGCTTACAATCGCTTATCCATTACTTCACAGATGAAGAATGGAAGGCATGTTGTGAGAAAATAGCAAAATCTCTTGTTTGCAGGATTCCTAGGGATCTAAATTGCTTGCGAGCTGTGGAATGCATTTCTGGAGCTGATGCCCGTAGTAAATATCTGAGGAGTGCAGTTGCCTATCAAATTCTTCTTATGTGCTTTGAAACTGAGGCTACTAATGAGGAAGGGGTTTTGAGATTACTCACCTCAATCACAGTTAAAGACAAAAGCTGCGACCTATTTAAGCTGTATATTTACTTGGTGTTGACGGAGAACTGGCTTGTCGGTAGTCGAACGTTAGAAGGCAAGCCGCTAATTTGTGAAATGTGGGGTCTGTTTCTCAGAAACTGCTCCTGCCAAATCACCAGTACAGATTTGAGGTCCTACGCATCAAAGGTTCGTAACAAAGCTTCATATATCCTGCAAAGCTCTTTTAACGAATAG
Coding sequence (CDS)
ATGGATGGTCCTTTAGATTTCGAGTCAGAAGACCCGCTTCTCAGTTCCCCTGTCGCTCTCAAGAAGAGGAAAATGATTATCGGGTTAGATGATCTTCTGACTGATCACTACAAGGATAAATGCAAACTCGTTGAGAAAGAATCTAGACAGGCAAAGAAGAAAAAGAACTATAATTCGGATGATGATGATTTTGGCAAAGAGGCTGTGGTGTCCCAAGTTGTTGATGAATGCCAAAATAAGATGAGCCAACTAGGGGGTGAGGAAGATACATCCATATGGGGCTTAAAGGTTTTTGGAGAACAGAAAACCCCACCAACTTTACAAAGTCCTGAACTCAAAAGTTGCCACCTTTTGCGAAATTTCCTAAATAATGAAGTTAACTCTTTGGTGAATCTCACCATGGAGAAAGGTGATGCTTTCCTTGAAGGTTTATTGGTAAATGGCTGGCTGTCAACACTTGTTTCTTTGACAGGTCGTGTAGAAAAACCAATTGCCATATGGACCTTCAATTTAATGTTATATTCATCAAGAGAAGAGCTAAGAACATCAGCTTTTGATTTCTGGAGGGATATCATGTTACCTAAACATGAGGTTGAACAACAGCCTCTCCAAATTGATTGGTTTCCTAACCATGCTCAATTAGGAGAAGCTCTTGAGACTTATGGATTTAGATTTGAGTGCTCATTAAATCCTGGATTTGTCCACGCTGGTTCTGGACGTGGAGGGCCACCTCAGAATATAAGGGCTTGGATCAAATTTATTACTATTTGTTGTCAAACAAAGATTAAAAAGCATATATTTACATTCTCCGAAGTTGAGCAACTAGCTGAAGCCATTATTTGTTTATTTCTAGACCGCCAGTTTCAAGGCTTATCCATGCTCTTATGTGAATGCTTACAATCGCTTATCCATTACTTCACAGATGAAGAATGGAAGGCATGTTGTGAGAAAATAGCAAAATCTCTTGTTTGCAGGATTCCTAGGGATCTAAATTGCTTGCGAGCTGTGGAATGCATTTCTGGAGCTGATGCCCGTAGTAAATATCTGAGGAGTGCAGTTGCCTATCAAATTCTTCTTATGTGCTTTGAAACTGAGGCTACTAATGAGGAAGGGGTTTTGAGATTACTCACCTCAATCACAGTTAAAGACAAAAGCTGCGACCTATTTAAGCTGTATATTTACTTGGTGTTGACGGAGAACTGGCTTGTCGGTAGTCGAACGTTAGAAGGCAAGCCGCTAATTTGTGAAATGTGGGGTCTGTTTCTCAGAAACTGCTCCTGCCAAATCACCAGTACAGATTTGAGGTCCTACGCATCAAAGGTTCGTAACAAAGCTTCATATATCCTGCAAAGCTCTTTTAACGAATAG
Protein sequence
MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSDDDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRNFLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREELRTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGRGGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQSLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLMCFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGLFLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE
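The protein sequence above corresponds to a translation of the coding sequence under the standard genetic code. The following is a minimal sketch of that relationship, not the pipeline used to produce this page; the names `CODON_TABLE` and `translate` are hypothetical, and the two inputs are short fragments copied from the start and end of the CDS.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 nt and last 18 nt of the CDS listed above.
print(translate("ATGGATGGTCCTTTAGATTTCGAGTCAGAAGACCCG"))  # MDGPLDFESEDP
print(translate("AGCTCTTTTAACGAATAG"))                    # SSFNE
```

Both fragments translate to the matching ends of the protein sequence (`MDGPLDFESEDP...` and `...SSFNE`), with the terminal `TAG` read as the stop codon.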
Homology
BLAST of Moc03g21040 vs. NCBI nr
Match:
XP_022152868.1 (uncharacterized protein LOC111020493 [Momordica charantia])
HSP 1 Score: 927.5 bits (2396), Expect = 4.3e-266
Identity = 455/455 (100.00%), Positives = 455/455 (100.00%), Query Frame = 0
Query: 1 MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60
MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD
Sbjct: 1 MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60
Query: 61 DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120
DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN
Sbjct: 61 DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120
Query: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 180
FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL
Sbjct: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 180
Query: 181 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR 240
RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR
Sbjct: 181 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR 240
Query: 241 GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ 300
GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ
Sbjct: 241 GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ 300
Query: 301 SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM 360
SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM
Sbjct: 301 SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM 360
Query: 361 CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL 420
CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL
Sbjct: 361 CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL 420
Query: 421 FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 456
FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE
Sbjct: 421 FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 455
BLAST of Moc03g21040 vs. NCBI nr
Match:
XP_038895435.1 (uncharacterized protein LOC120083668 isoform X2 [Benincasa hispida])
HSP 1 Score: 817.0 bits (2109), Expect = 8.1e-233
Identity = 399/455 (87.69%), Positives = 422/455 (92.75%), Query Frame = 0
Query: 1 MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60
MDG LDFESEDPLLSSPV LKKRK IIGLDDLLTDHYKDKCKLVEKES+QAKK K Y SD
Sbjct: 3 MDGTLDFESEDPLLSSPVTLKKRKKIIGLDDLLTDHYKDKCKLVEKESKQAKKIKKYESD 62
Query: 61 DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120
DDD GKEAVVSQVVDECQNKM+QLGGEEDTSIWGL VFGEQK P LQSPEL+SC+ L+
Sbjct: 63 DDDLGKEAVVSQVVDECQNKMNQLGGEEDTSIWGLNVFGEQKPSPALQSPELESCNFLQT 122
Query: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 180
FLNNEVNSLVNLT+E GDAFLEGLLVNGWLSTLV LTGR EK +AIWTFNLMLYSSRE+L
Sbjct: 123 FLNNEVNSLVNLTVETGDAFLEGLLVNGWLSTLVYLTGRAEKSLAIWTFNLMLYSSREDL 182
Query: 181 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR 240
RTSA DFWRDIML +EVEQQ LQIDWFPN+AQLGEALETYG+RF+CSLNPGF+H GS R
Sbjct: 183 RTSAGDFWRDIMLTTNEVEQQHLQIDWFPNYAQLGEALETYGYRFQCSLNPGFIHTGSRR 242
Query: 241 GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ 300
GGPPQNIRAWIKFITICCQTKIKK+IFT SEVEQLAEAIICLFLDRQFQG+++LLCECLQ
Sbjct: 243 GGPPQNIRAWIKFITICCQTKIKKNIFTSSEVEQLAEAIICLFLDRQFQGITVLLCECLQ 302
Query: 301 SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM 360
SLIHYFTDEEW ACCEKIAKSLVCRIP DLNCLRAVECISG D RSKYLRS +AYQILL+
Sbjct: 303 SLIHYFTDEEWNACCEKIAKSLVCRIPMDLNCLRAVECISGVDPRSKYLRSTIAYQILLI 362
Query: 361 CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL 420
CFE EA+NEE VLR+LTSITVKDKSCDLFKLYIYLVLTENWLVGS+TLEGKPLICEMWGL
Sbjct: 363 CFENEASNEEEVLRVLTSITVKDKSCDLFKLYIYLVLTENWLVGSQTLEGKPLICEMWGL 422
Query: 421 FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 456
FLRNCSCQI STDLRSYASKVRNKASYILQS+F E
Sbjct: 423 FLRNCSCQIASTDLRSYASKVRNKASYILQSAFEE 457
BLAST of Moc03g21040 vs. NCBI nr
Match:
XP_011653682.1 (uncharacterized protein LOC101216339 isoform X1 [Cucumis sativus] >KGN64767.1 hypothetical protein Csa_013682 [Cucumis sativus])
HSP 1 Score: 812.8 bits (2098), Expect = 1.5e-231
Identity = 395/455 (86.81%), Positives = 423/455 (92.97%), Query Frame = 0
Query: 1 MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60
MD PLDFESEDPLLSSPVALKKRK IIGLDDLLTDHYKDKCKLVEKES+ AKK+KNY+SD
Sbjct: 3 MDAPLDFESEDPLLSSPVALKKRKKIIGLDDLLTDHYKDKCKLVEKESKLAKKRKNYDSD 62
Query: 61 DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120
DDDFGKEAVVSQVVDECQNKM+QLGGEEDTSIWGL VFGEQK PP LQ+PEL+SC L+
Sbjct: 63 DDDFGKEAVVSQVVDECQNKMNQLGGEEDTSIWGLNVFGEQKPPPALQTPELESCQFLQT 122
Query: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 180
FLNNEVNSLVNLT+EKGD FLEGLLVNGWLSTLVSLTG VEK +AIWTFNLMLYSSRE L
Sbjct: 123 FLNNEVNSLVNLTVEKGDVFLEGLLVNGWLSTLVSLTGHVEKSLAIWTFNLMLYSSREGL 182
Query: 181 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR 240
RTSA DFW+DIML +EVEQQ LQ+DWFP++AQLGEAL+TYG+RFECSLNPG +H GSGR
Sbjct: 183 RTSACDFWKDIMLTTNEVEQQHLQVDWFPSYAQLGEALDTYGYRFECSLNPGLIHTGSGR 242
Query: 241 GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ 300
GGPPQNIRAWIKFITICCQTK+KK+IFT SEV +LAEAIICLFLDRQFQG+++LLCECLQ
Sbjct: 243 GGPPQNIRAWIKFITICCQTKVKKNIFTSSEVGRLAEAIICLFLDRQFQGITVLLCECLQ 302
Query: 301 SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM 360
SLIHYFTDE+WKACCEKIAKSLVCRIP DLNCLRAVECISG D RSKYLRS VAYQILL+
Sbjct: 303 SLIHYFTDEDWKACCEKIAKSLVCRIPMDLNCLRAVECISGVDPRSKYLRSTVAYQILLI 362
Query: 361 CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL 420
CF+ EATNEE VLR+LTSITVKDKSCDLFKLYIYLVLTENWLVGSR EGKPL EMWGL
Sbjct: 363 CFKNEATNEEEVLRVLTSITVKDKSCDLFKLYIYLVLTENWLVGSRMCEGKPLTREMWGL 422
Query: 421 FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 456
FLRNCSCQI STDLRSYASKVRNKASYILQSSF+E
Sbjct: 423 FLRNCSCQIASTDLRSYASKVRNKASYILQSSFDE 457
BLAST of Moc03g21040 vs. NCBI nr
Match:
XP_038895434.1 (uncharacterized protein LOC120083668 isoform X1 [Benincasa hispida])
HSP 1 Score: 808.9 bits (2088), Expect = 2.2e-230
Identity = 398/461 (86.33%), Positives = 422/461 (91.54%), Query Frame = 0
Query: 1 MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60
MDG LDFESEDPLLSSPV LKKRK IIGLDDLLTDHYKDKCKLVEKES+QAKK K Y SD
Sbjct: 3 MDGTLDFESEDPLLSSPVTLKKRKKIIGLDDLLTDHYKDKCKLVEKESKQAKKIKKYESD 62
Query: 61 DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120
DDD GKEAVVSQVVDECQNKM+QLGGEEDTSIWGL VFGEQK P LQSPEL+SC+ L+
Sbjct: 63 DDDLGKEAVVSQVVDECQNKMNQLGGEEDTSIWGLNVFGEQKPSPALQSPELESCNFLQT 122
Query: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNL------MLY 180
FLNNEVNSLVNLT+E GDAFLEGLLVNGWLSTLV LTGR EK +AIWTFNL +LY
Sbjct: 123 FLNNEVNSLVNLTVETGDAFLEGLLVNGWLSTLVYLTGRAEKSLAIWTFNLIYFHIPVLY 182
Query: 181 SSREELRTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFV 240
SSRE+LRTSA DFWRDIML +EVEQQ LQIDWFPN+AQLGEALETYG+RF+CSLNPGF+
Sbjct: 183 SSREDLRTSAGDFWRDIMLTTNEVEQQHLQIDWFPNYAQLGEALETYGYRFQCSLNPGFI 242
Query: 241 HAGSGRGGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSML 300
H GS RGGPPQNIRAWIKFITICCQTKIKK+IFT SEVEQLAEAIICLFLDRQFQG+++L
Sbjct: 243 HTGSRRGGPPQNIRAWIKFITICCQTKIKKNIFTSSEVEQLAEAIICLFLDRQFQGITVL 302
Query: 301 LCECLQSLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVA 360
LCECLQSLIHYFTDEEW ACCEKIAKSLVCRIP DLNCLRAVECISG D RSKYLRS +A
Sbjct: 303 LCECLQSLIHYFTDEEWNACCEKIAKSLVCRIPMDLNCLRAVECISGVDPRSKYLRSTIA 362
Query: 361 YQILLMCFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLI 420
YQILL+CFE EA+NEE VLR+LTSITVKDKSCDLFKLYIYLVLTENWLVGS+TLEGKPLI
Sbjct: 363 YQILLICFENEASNEEEVLRVLTSITVKDKSCDLFKLYIYLVLTENWLVGSQTLEGKPLI 422
Query: 421 CEMWGLFLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 456
CEMWGLFLRNCSCQI STDLRSYASKVRNKASYILQS+F E
Sbjct: 423 CEMWGLFLRNCSCQIASTDLRSYASKVRNKASYILQSAFEE 463
BLAST of Moc03g21040 vs. NCBI nr
Match:
XP_023520957.1 (uncharacterized protein LOC111784515 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023520958.1 uncharacterized protein LOC111784515 isoform X2 [Cucurbita pepo subsp. pepo] >XP_023520965.1 uncharacterized protein LOC111784527 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 805.4 bits (2079), Expect = 2.4e-229
Identity = 388/455 (85.27%), Positives = 424/455 (93.19%), Query Frame = 0
Query: 1 MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60
MDGPLDFESEDPLLSSPV+LKKRK IIGLDDLLTDHYKDKCKLVEKES+QAKKKK Y+SD
Sbjct: 3 MDGPLDFESEDPLLSSPVSLKKRKKIIGLDDLLTDHYKDKCKLVEKESKQAKKKKKYDSD 62
Query: 61 DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120
D+DFGKEAVVSQVVDECQNKM+QLGGEEDTSIWGL VFGEQK PP LQSPELK+C+LL+
Sbjct: 63 DEDFGKEAVVSQVVDECQNKMNQLGGEEDTSIWGLNVFGEQKPPPALQSPELKNCYLLQT 122
Query: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 180
FLNNEVNSLVNLT+EKG+AFLEGLLVNGWLSTLVS+T RVEKP+AIWTFNLMLYSSREEL
Sbjct: 123 FLNNEVNSLVNLTVEKGEAFLEGLLVNGWLSTLVSVTNRVEKPLAIWTFNLMLYSSREEL 182
Query: 181 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR 240
TSA DFWRDIML +EV QQ L+I+WFP +AQLGEALETYG+RFECSL PGFVH GSGR
Sbjct: 183 STSACDFWRDIMLTTNEVNQQHLRIEWFPTYAQLGEALETYGYRFECSLKPGFVHNGSGR 242
Query: 241 GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ 300
GGPPQNIRAWIKF+T+CCQTK++KHIFT SEVE+LAEA++CLFLDRQF+G+++LLCECLQ
Sbjct: 243 GGPPQNIRAWIKFVTVCCQTKMRKHIFTSSEVERLAEAVLCLFLDRQFRGVTVLLCECLQ 302
Query: 301 SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM 360
SL+ YFTDE+WKACC+ IAK+LV RIP DLNCLRAVECISG D RSKYLRS VAYQILL+
Sbjct: 303 SLVRYFTDEDWKACCDNIAKTLVSRIPMDLNCLRAVECISGVDQRSKYLRSTVAYQILLI 362
Query: 361 CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL 420
FE EATNEE VLR+LTS+TVKDKSCDLFKLYIYLVLTENWLVGSR L+GKPLICEMWGL
Sbjct: 363 FFENEATNEEEVLRVLTSMTVKDKSCDLFKLYIYLVLTENWLVGSRMLDGKPLICEMWGL 422
Query: 421 FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 456
FLR CSCQI STDLRSYASKVRNKASYILQSSF E
Sbjct: 423 FLRKCSCQIASTDLRSYASKVRNKASYILQSSFEE 457
BLAST of Moc03g21040 vs. ExPASy TrEMBL
Match:
A0A6J1DG18 (uncharacterized protein LOC111020493 OS=Momordica charantia OX=3673 GN=LOC111020493 PE=4 SV=1)
HSP 1 Score: 927.5 bits (2396), Expect = 2.1e-266
Identity = 455/455 (100.00%), Positives = 455/455 (100.00%), Query Frame = 0
Query: 1 MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60
MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD
Sbjct: 1 MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60
Query: 61 DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120
DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN
Sbjct: 61 DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120
Query: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 180
FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL
Sbjct: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 180
Query: 181 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR 240
RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR
Sbjct: 181 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR 240
Query: 241 GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ 300
GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ
Sbjct: 241 GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ 300
Query: 301 SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM 360
SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM
Sbjct: 301 SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM 360
Query: 361 CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL 420
CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL
Sbjct: 361 CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL 420
Query: 421 FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 456
FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE
Sbjct: 421 FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 455
BLAST of Moc03g21040 vs. ExPASy TrEMBL
Match:
A0A0A0LVC0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G095530 PE=4 SV=1)
HSP 1 Score: 812.8 bits (2098), Expect = 7.4e-232
Identity = 395/455 (86.81%), Positives = 423/455 (92.97%), Query Frame = 0
Query: 1 MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60
MD PLDFESEDPLLSSPVALKKRK IIGLDDLLTDHYKDKCKLVEKES+ AKK+KNY+SD
Sbjct: 3 MDAPLDFESEDPLLSSPVALKKRKKIIGLDDLLTDHYKDKCKLVEKESKLAKKRKNYDSD 62
Query: 61 DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120
DDDFGKEAVVSQVVDECQNKM+QLGGEEDTSIWGL VFGEQK PP LQ+PEL+SC L+
Sbjct: 63 DDDFGKEAVVSQVVDECQNKMNQLGGEEDTSIWGLNVFGEQKPPPALQTPELESCQFLQT 122
Query: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 180
FLNNEVNSLVNLT+EKGD FLEGLLVNGWLSTLVSLTG VEK +AIWTFNLMLYSSRE L
Sbjct: 123 FLNNEVNSLVNLTVEKGDVFLEGLLVNGWLSTLVSLTGHVEKSLAIWTFNLMLYSSREGL 182
Query: 181 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR 240
RTSA DFW+DIML +EVEQQ LQ+DWFP++AQLGEAL+TYG+RFECSLNPG +H GSGR
Sbjct: 183 RTSACDFWKDIMLTTNEVEQQHLQVDWFPSYAQLGEALDTYGYRFECSLNPGLIHTGSGR 242
Query: 241 GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ 300
GGPPQNIRAWIKFITICCQTK+KK+IFT SEV +LAEAIICLFLDRQFQG+++LLCECLQ
Sbjct: 243 GGPPQNIRAWIKFITICCQTKVKKNIFTSSEVGRLAEAIICLFLDRQFQGITVLLCECLQ 302
Query: 301 SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM 360
SLIHYFTDE+WKACCEKIAKSLVCRIP DLNCLRAVECISG D RSKYLRS VAYQILL+
Sbjct: 303 SLIHYFTDEDWKACCEKIAKSLVCRIPMDLNCLRAVECISGVDPRSKYLRSTVAYQILLI 362
Query: 361 CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL 420
CF+ EATNEE VLR+LTSITVKDKSCDLFKLYIYLVLTENWLVGSR EGKPL EMWGL
Sbjct: 363 CFKNEATNEEEVLRVLTSITVKDKSCDLFKLYIYLVLTENWLVGSRMCEGKPLTREMWGL 422
Query: 421 FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 456
FLRNCSCQI STDLRSYASKVRNKASYILQSSF+E
Sbjct: 423 FLRNCSCQIASTDLRSYASKVRNKASYILQSSFDE 457
BLAST of Moc03g21040 vs. ExPASy TrEMBL
Match:
A0A5D3BL50 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold2042G00060 PE=4 SV=1)
HSP 1 Score: 800.0 bits (2065), Expect = 5.0e-228
Identity = 392/455 (86.15%), Positives = 419/455 (92.09%), Query Frame = 0
Query: 1 MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60
MD PLDFESEDPLLSSPVALKKRK IIGLDDLLTDHYKDKCKLVEKE++ AKKKK Y+SD
Sbjct: 3 MDAPLDFESEDPLLSSPVALKKRKKIIGLDDLLTDHYKDKCKLVEKEAKLAKKKKKYDSD 62
Query: 61 DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120
DDDFGKEAVV+QVVD+CQNKM+QLGGEEDTSIWGL VFGEQK PP L+SPEL+SC L+
Sbjct: 63 DDDFGKEAVVTQVVDQCQNKMNQLGGEEDTSIWGLVVFGEQKPPPALESPELESCQFLQT 122
Query: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 180
FLN+EVNSLVNLT+EKGDAFLEGLLVNGWLSTLVSLTGR EK +AIWTFNLMLYSSRE+L
Sbjct: 123 FLNSEVNSLVNLTVEKGDAFLEGLLVNGWLSTLVSLTGRAEKSLAIWTFNLMLYSSREDL 182
Query: 181 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR 240
RTSA DFW+DIML +EV Q LQIDWFPN+AQLGEALETYG+RFECSLNPG +H GSGR
Sbjct: 183 RTSACDFWKDIMLTTNEVGQH-LQIDWFPNYAQLGEALETYGYRFECSLNPGLIHTGSGR 242
Query: 241 GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ 300
GGPPQNIRAWIKFITICCQTK KK+IFT SEVE+LAEAIICLFLDRQFQG++ LLCECLQ
Sbjct: 243 GGPPQNIRAWIKFITICCQTKNKKNIFTSSEVERLAEAIICLFLDRQFQGITELLCECLQ 302
Query: 301 SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM 360
SLIHYFTDEEW ACCEKIAKSLVCRIP DLNCLRAVECISG D RSKYLRS VAYQILL
Sbjct: 303 SLIHYFTDEEWNACCEKIAKSLVCRIPMDLNCLRAVECISGVDPRSKYLRSTVAYQILLF 362
Query: 361 CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL 420
CF+ EATNEE VLR++TSITVKDK+CDLFKLYIYLVLTENWLVG RT EGKPL EMWGL
Sbjct: 363 CFKKEATNEEEVLRVVTSITVKDKNCDLFKLYIYLVLTENWLVGGRTCEGKPLTREMWGL 422
Query: 421 FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 456
FLRNCSCQI STDLRSYASKVRNKASYILQSSF+E
Sbjct: 423 FLRNCSCQIASTDLRSYASKVRNKASYILQSSFDE 456
BLAST of Moc03g21040 vs. ExPASy TrEMBL
Match:
A0A1S3CJ26 (uncharacterized protein LOC103501522 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103501522 PE=4 SV=1)
HSP 1 Score: 800.0 bits (2065), Expect = 5.0e-228
Identity = 392/455 (86.15%), Positives = 419/455 (92.09%), Query Frame = 0
Query: 1 MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60
MD PLDFESEDPLLSSPVALKKRK IIGLDDLLTDHYKDKCKLVEKE++ AKKKK Y+SD
Sbjct: 3 MDAPLDFESEDPLLSSPVALKKRKKIIGLDDLLTDHYKDKCKLVEKEAKLAKKKKKYDSD 62
Query: 61 DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120
DDDFGKEAVV+QVVD+CQNKM+QLGGEEDTSIWGL VFGEQK PP L+SPEL+SC L+
Sbjct: 63 DDDFGKEAVVTQVVDQCQNKMNQLGGEEDTSIWGLVVFGEQKPPPALESPELESCQFLQT 122
Query: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 180
FLN+EVNSLVNLT+EKGDAFLEGLLVNGWLSTLVSLTGR EK +AIWTFNLMLYSSRE+L
Sbjct: 123 FLNSEVNSLVNLTVEKGDAFLEGLLVNGWLSTLVSLTGRAEKSLAIWTFNLMLYSSREDL 182
Query: 181 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR 240
RTSA DFW+DIML +EV Q LQIDWFPN+AQLGEALETYG+RFECSLNPG +H GSGR
Sbjct: 183 RTSACDFWKDIMLTTNEVGQH-LQIDWFPNYAQLGEALETYGYRFECSLNPGLIHTGSGR 242
Query: 241 GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ 300
GGPPQNIRAWIKFITICCQTK KK+IFT SEVE+LAEAIICLFLDRQFQG++ LLCECLQ
Sbjct: 243 GGPPQNIRAWIKFITICCQTKNKKNIFTSSEVERLAEAIICLFLDRQFQGITELLCECLQ 302
Query: 301 SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM 360
SLIHYFTDEEW ACCEKIAKSLVCRIP DLNCLRAVECISG D RSKYLRS VAYQILL
Sbjct: 303 SLIHYFTDEEWNACCEKIAKSLVCRIPMDLNCLRAVECISGVDPRSKYLRSTVAYQILLF 362
Query: 361 CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL 420
CF+ EATNEE VLR++TSITVKDK+CDLFKLYIYLVLTENWLVG RT EGKPL EMWGL
Sbjct: 363 CFKKEATNEEEVLRVVTSITVKDKNCDLFKLYIYLVLTENWLVGGRTCEGKPLTREMWGL 422
Query: 421 FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 456
FLRNCSCQI STDLRSYASKVRNKASYILQSSF+E
Sbjct: 423 FLRNCSCQIASTDLRSYASKVRNKASYILQSSFDE 456
BLAST of Moc03g21040 vs. ExPASy TrEMBL
Match:
A0A6J1HJZ9 (uncharacterized protein LOC111464803 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111464803 PE=4 SV=1)
HSP 1 Score: 797.7 bits (2059), Expect = 2.5e-227
Identity = 385/455 (84.62%), Positives = 422/455 (92.75%), Query Frame = 0
Query: 1 MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60
MDGPLDFESEDPLLSSPV+LKKRK IIGLDDLLTDHYKDKCKLVEKES+QAKKKK Y+SD
Sbjct: 3 MDGPLDFESEDPLLSSPVSLKKRKKIIGLDDLLTDHYKDKCKLVEKESKQAKKKKKYDSD 62
Query: 61 DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120
D+DFGKEAVVSQVVDECQNKM+QLGGEEDTSIWGL VF EQK PP LQSPELK+C+LL+
Sbjct: 63 DEDFGKEAVVSQVVDECQNKMNQLGGEEDTSIWGLNVFREQKPPPALQSPELKNCYLLQA 122
Query: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 180
FLNNEVNSLVNLT+EKG+AFLEGLLVNGWLSTLVS+T RVEKP+AIWTFNLMLYSSREEL
Sbjct: 123 FLNNEVNSLVNLTVEKGEAFLEGLLVNGWLSTLVSVTNRVEKPLAIWTFNLMLYSSREEL 182
Query: 181 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPGFVHAGSGR 240
TSA DFWRDIML ++V QQ L+I+WFP +AQLGEALETYG+RFECSL PGFVH GSGR
Sbjct: 183 STSACDFWRDIMLTTNKVNQQHLRIEWFPTYAQLGEALETYGYRFECSLKPGFVHNGSGR 242
Query: 241 GGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCECLQ 300
GGPPQNIRAWIKF+T+CCQTK+KK+IFT SEVE+LAEAI+CLFLDRQF+G+++LLCECLQ
Sbjct: 243 GGPPQNIRAWIKFVTVCCQTKMKKYIFTSSEVERLAEAILCLFLDRQFRGVTVLLCECLQ 302
Query: 301 SLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQILLM 360
SL+HYFTDE+WKACC+ IAK+LV RIP DLNCLRAVECISG D RSKYLRS VAYQILL+
Sbjct: 303 SLVHYFTDEDWKACCDNIAKALVSRIPMDLNCLRAVECISGVDQRSKYLRSTVAYQILLI 362
Query: 361 CFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMWGL 420
FE E TNEE VLR+LTS+TVKD+SCDLFKLYIYLVLTENWLVGSR L+GKPLICEMWGL
Sbjct: 363 FFENEGTNEEEVLRVLTSMTVKDRSCDLFKLYIYLVLTENWLVGSRMLDGKPLICEMWGL 422
Query: 421 FLRNCSCQITSTDLRSYASKVRNKASYILQSSFNE 456
FLR CSCQI STDLRSYASKVRNKASYILQS F E
Sbjct: 423 FLRKCSCQIASTDLRSYASKVRNKASYILQSCFEE 457
BLAST of Moc03g21040 vs. TAIR 10
Match:
AT2G28130.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 13 growth stages; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )
HSP 1 Score: 496.9 bits (1278), Expect = 1.7e-140
Identity = 241/452 (53.32%), Positives = 334/452 (73.89%), Query Frame = 0
Query: 1 MDGPLDFESEDPLLSSPVALKKRKMIIGLDDLLTDHYKDKCKLVEKESRQAKKKKNYNSD 60
+DGPLDFE+EDPL++ P ++KRK +IGLDDLL+D YK+K K+++K +++ K K Y+SD
Sbjct: 3 LDGPLDFENEDPLVNPPTIIEKRKKVIGLDDLLSDFYKEKSKVIDKVNKKRKVSKVYHSD 62
Query: 61 DDDFGKEAVVSQVVDECQNKMSQLGGEEDTSIWGLKVFGEQKTPPTLQSPELKSCHLLRN 120
DD+ G+ +SQ V ECQN+M+++ EE+ WGL +FG+QKTP +L SC LL+
Sbjct: 63 DDEQGQVDKLSQCVVECQNQMNEIADEEENQEWGLSMFGDQKTPIPSLLVDLDSCCLLKE 122
Query: 121 FLNNEVNSLVNLTMEKGDAFLEGLLVNGWLSTLVSLTGRVEKPIAIWTFNLMLYSSREEL 180
F+NN++N +V LT+++G F+EGLLVNGWL+ L+ GRVEK I WT N++LYSS+E+L
Sbjct: 123 FMNNQLNLVVGLTVDEGTTFIEGLLVNGWLTRLIMTCGRVEKFICKWTLNILLYSSKEDL 182
Query: 181 RTSAFDFWRDIMLPKHEVEQQPLQIDWFPNHAQLGEALETYGFRFECSLNPG--FVHAGS 240
R+SA DFW I+L +++V ++I W PN+ +L EALE+YGFR S + A S
Sbjct: 183 RSSACDFWCSILLSQNKVNGASVEIYWLPNYQELKEALESYGFRISLSHSQDVELAEADS 242
Query: 241 GRGGPPQNIRAWIKFITICCQTKIKKHIFTFSEVEQLAEAIICLFLDRQFQGLSMLLCEC 300
GPPQNIRAW+ +T CCQ + KK IFT S+VEQ+AE ++ L LDR GLS+LL EC
Sbjct: 243 ECQGPPQNIRAWLTLVTTCCQFRCKKPIFTTSQVEQIAEILVSLLLDRSLLGLSILLQEC 302
Query: 301 LQSLIHYFTDEEWKACCEKIAKSLVCRIPRDLNCLRAVECISGADARSKYLRSAVAYQIL 360
L S+I F +EEW + C+KIA SL R+P+D+NCLR VE +SG DARSK+LRS++A+Q+L
Sbjct: 303 LISVIGSFKEEEWISSCKKIANSLASRVPQDINCLRVVESVSGVDARSKHLRSSIAHQML 362
Query: 361 LMCFETEATNEEGVLRLLTSITVKDKSCDLFKLYIYLVLTENWLVGSRTLEGKPLICEMW 420
++ + ++E +L L SI VK++SC+LFK+YI+LVL ENWL S +E KP++ +MW
Sbjct: 363 VVLLD-HKDSDENLLSSLMSINVKERSCNLFKMYIFLVLAENWLFSSTLVEAKPVLRDMW 422
Query: 421 GLFLRNCSCQITSTDLRSYASKVRNKASYILQ 451
+FLRNCSCQI STDLRSYASKVR +A+Y+LQ
Sbjct: 423 AVFLRNCSCQINSTDLRSYASKVRTRAAYLLQ 453
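The Identity and Positives percentages in the HSP headers above are the match counts divided by the alignment length. The following is a minimal sketch of that arithmetic; the helper name `percent` is hypothetical, and the counts are taken from the HSP headers in this report.

```python
def percent(matches: int, alignment_length: int) -> str:
    """Return matches / alignment_length as a percentage with two decimals."""
    return f"{100 * matches / alignment_length:.2f}"


# Counts from the Benincasa hispida isoform X2 hit (XP_038895435.1).
print(percent(399, 455))  # 87.69 (Identity)
print(percent(422, 455))  # 92.75 (Positives)

# Counts from the Arabidopsis hit (AT2G28130.1).
print(percent(241, 452))  # 53.32 (Identity)
print(percent(334, 452))  # 73.89 (Positives)
```

The denominator is the alignment length, which includes gap columns. This is why the isoform X1 hit reports 398/461 rather than a fraction over the 455-residue query.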
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_022152868.1 | 4.3e-266 | 100.00 | uncharacterized protein LOC111020493 [Momordica charantia]
XP_038895435.1 | 8.1e-233 | 87.69 | uncharacterized protein LOC120083668 isoform X2 [Benincasa hispida]
XP_011653682.1 | 1.5e-231 | 86.81 | uncharacterized protein LOC101216339 isoform X1 [Cucumis sativus] >KGN64767.1 hy...
XP_038895434.1 | 2.2e-230 | 86.33 | uncharacterized protein LOC120083668 isoform X1 [Benincasa hispida]
XP_023520957.1 | 2.4e-229 | 85.27 | uncharacterized protein LOC111784515 isoform X1 [Cucurbita pepo subsp. pepo] >XP...
Match Name | E-value | Identity | Description
A0A6J1DG18 | 2.1e-266 | 100.00 | uncharacterized protein LOC111020493 OS=Momordica charantia OX=3673 GN=LOC111020...
A0A0A0LVC0 | 7.4e-232 | 86.81 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G095530 PE=4 SV=1
A0A5D3BL50 | 5.0e-228 | 86.15 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3CJ26 | 5.0e-228 | 86.15 | uncharacterized protein LOC103501522 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1HJZ9 | 2.5e-227 | 84.62 | uncharacterized protein LOC111464803 isoform X1 OS=Cucurbita moschata OX=3662 GN...
Match Name | E-value | Identity | Description
AT2G28130.1 | 1.7e-140 | 53.32 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...