Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCGGAGCGAAGACGAAGGGTTCGAAGATTGGGACGCTGATTTCTTGGACCAACTCATCCAAGTCGAAGAGCTTGCCATCGCCTCCACCGCCGATAATCATCTCATTACGATTCCGATCTCTTCTTCTACCTTCTGCCCCCCGCCGCCATCGCAACCGGAACCGCTACATTTTGTGCAAGCGTTTCATGACCGTCCCATTAGTTATTCGCCTCCTCGAGAACTCTCACAGAGGATCACCGGTGTCCGCTCTCCCAATGGCTTGGGCGAATGTGGTCCTTCTTCTTCGATGCTGGCTCCGTGCTTGCCTCGCCCGGACGCTGCAAAAGAACTCGAGATTTGTAATTTGAAGGTTAGATTCATGGTTAAATGTTCGAGGCTACCAGATTCCTGTATTTTTTTTACGTTGATTTGTTGTAGTGTAGTCTCTGAATAGAACGCTTACAAGTTAGAGCTTAGAAGAACTAGGTTTTGAATCGTCTGAACTGCCATTTATTTTTTCGTTCATCGTTGCCGCTCGAAATGTCTGCAAAGTGTGCTGTCTTCTTCTACCCAGAAGGAAGGAGATGTACTTGGATTAGTCTGACCGATCTCCGGGTGCTCCGAGTCTTCTGACTTCGTGCTTATGCCTAGCGAATTTTTTGACTGCATCGGATTATTGCATTTCGATCAGTTTATCTTTTGCATTTTCGCATTCTTTCCCGATCAAACGCGATAATTATCGCCGGGAACAATTTGAAGTTTTGTTATGCTTGAAATAAGGATTTCTTTCTGTGGATCATCGTCGTCTAAGTTCTTATGATTGATGTGAACATTAGACATTTTAGTATCACGTGGCCTTGCTTCTGCCAACAATCAGCAGTATTATGTATTTTGTTTTTCTCTTTTATTCTAGTGGTGGCCAGAGATTTTCATTCTTTAACCATTCTTTAACCACAATTACTTAACGTTTATAGTTGTATGCATAATGAGAAGGGGGAGTTTATGAAATGTATAATAAGTTTTTATGGTTTGTGGTGATTGGTGACAGAGGGAGCTAGGGCGCGTCTCAAAGCAACTCAAGGACTTGGTATGTTAATTAATTAATTATTACTGTTATTATTTTTTCATTTTCTATTATGCATATGAAATTATTAGAATGACCATTTCTTGTTTTCTTTTTACTTATCTTAGACAAGAAAAACCATGCGTGAATTTTGCTTTTTCTTTTATGATGAGCAAAGATTTGATAAATGATATTTGTGTAGCCAGCTCTAGGTCTCCTTGGCACCTCGAAAACTGATTCTATAAGGAAGCATAAAATTTATCCTAAAAGGCTAGCGCTCCGTTTGTGATTTTATCTTATTCAAACAAATAGAATGGCATTTGTTCACTGATTTTCTGATGGTCATTTACACGATTTTCCTGTGAATTAATTGTTTATTTCCCCGATTTGGAAACTAATATGATATATTCTTAATGAAAATTTGTTCAGTTTGCAGTTAAAATTGGTACTTATTCCTTTTGTAAAAATCAATTGTATTTATAAATTTAGGTTTCATCTAATTAAAAGAATTCAGTGAATGCTTTCAGGAACAAGAGTGTGTTGAACTCAGGAAGAAAAGAGACAGGAAAGAGGAGCAGCTTAAAGTTGTCTTTTCCAATAAAGATGAACAATTTATTGGCCATCATGGTTCAGAGAGTACAGATTTGTGAGTGAAATTGTCATTTTTTTGGAATAATATTACAAATTATGGACTGCAGTTGGCCTCTAGGCTCTAGTTATTATTTTGATTGGCAACTCAGCGGTACAAATACTGAACTTCCAACACTACTAGTATAGGGCGATTGTCATTATGCTCCTTCCCCTTTCTGTAAAACATCATAAAATGTGGTTTATTAAATCTACTCATATGAATGATGCATTGCTGCGAGAACAAATGTATCTTGACTACTGTATGAACTCTACGTGAGGTGACTATTATATTAATACATGTGTGTACCTTCTCTAATTATATTGTTACATAATATTTGTGCCTAGATGCATTCACTTTTTTTTTGGTGCAAAATCTGGCTTGGCTAACTCATGTTCTTAACATTACTACCTTGCTGTTAATTAAGAAGTTGAGACCTGGAATGAACTATAGTTTCTTATGTGCAATTATCTTCTGATGTTTATTGCTATTTCTTATTCTGCATCATCAATATAACCAGGAGAACAGCGGGGAATGATGGTGAGCATAATTCTAGGAAGATTGAAGATCTTGCTGGTGACCTTGGTGCCCCTCACATTGGTAGTTCTAAGGACGAGCTAATAATCATTTCAAGTTATTTTATGGTGCATACCAATTAACTATCTTTATTTTACTGTTTTTGCATCTACAAGTTTCTTCTAGTAGTAAAGCCATCGGCGAACAGGGAGGCCAAGCTCATAATTCTGCTGGGGAGAGAGTCAATGATAAATTACCTGCTTTTCACAACCTATCCAAGAAGCTGCAAGTATTCTGGGTCCCCGAAAGTGGCTCTAAGATGGGACAATCTTTGGTTTCAGAATTACTTTTATCATGTGAAACAGATTTTCATGTGCTTTTTGGGTGCATCAGCACGAAGTTATCCCCCAAATTTTCTGTCGATTCCCTAGCTGGGGATAACGTTTCTGATGTAGCTTTAAAGAACCCTTTGCAGTTTCTTCATGGTCTGGAAGCCGTAAAAGTATCTAATCTCTACACCACTTTGACTAAGGTATCTTGACATATTTTATCACTATATGGTTGGTCCATAAAATAGTGATAAACATGAGATCCGAACTTTTTTCATTGTTTTGAGAATTCATTTTCTAACATTATGTGTTACCAACCTAAGCATAACTCAACTTGTCGAGGAATATATCTTCAACCTATCGGAAGTTTCAATTCTCACTCTCACATGTGGTTGAACTAAAAAAAAATGATGGTTAACCTAAAAAAAATGATGTTATATATGTTAATGTTTTTAAGTAGTCATGTCCACTGTTATTTAAAAATTTCTATAATTATTTTCTTCTGTTTGTTATCTATTATTTTTTTTTTTTTTTGTAGGTAAGTAATGGAATAGTAAAGATGGAGGCATTGTTTACACCGTTACTTGATCTCTGTAATCTTGACAATGTGAGTAAGACTTGAGGATTTCCTGGTTATACAAACAAATTAATGTACGAGTAAAGATTTGAGTAGTTTTGACAGTACAAGTTGCTATTAACTTTCTCAATTTGCATGTTTCATTTTTATCAAATTCATGCCACGATTGCCTAGTTTCATTTTTTTTTCCCTCTTCCATTTATCTTCATATGCACAAAAAAATCATTGAAAGAGGGGAGGAATGACTAGCTGTTAAACCCAAGCCCAAGTTAGGGAGGTCACAGAAATGCCCTTTTCTTTCTTCTCTCTCTCCCCTTAATAATAAACAAACATAAATAATGAATGAAATTACAAGAGAGGGAAGGAGCCTCAACCAAAGATAACTAGGAAGTTACAAAAAATGCATTCAATTGGCATAAATAGAATAGAGGCTATATGGATAAGATATACGGCAAGAACAAGCAACAAACTTACGTTTTCCCTTGAAAGCACATCTATTCCTTTGTATCATATTCAGCTGAAATAATGTCATAAATTTTAATTTTAACCGAGCAAGTTTTTTCACCATTATTGACTTCTGTATTCATCAAACAATTGTGGTTATTTCTTATTCATCTAGAGAAGATAATTGATGATTATGATCAATTATATTGTAGCCTGTACAAACCTGGGAAGATCTTGTGGTTTGCATGTCTTCGGATTGATGAGTTGGTTGTTTTATGGTCACAGTTCACTATATGGTTAATACAATTGAATCTGGATTCTGAATGTTGTTATGCTTTGCAATATGGTTGTGTTTATGTCACTATTAGTTCATCAAATCTTTTCATAAAAAAATATGGTTATGTCGTTTCTTCTAAGTTGGCATTGATATTTCTTGATGAAAGCTTGGTTTTACATTTTAAAAATAATAATAATAATAAAAAAAAGACAATTGTGGCACTCTTCCTCTCTTTGAGAAATTAGTTCCTTTATTTTTGGATACAGGTCGTCATAGTTCACAGAACTCTGCATATATTGCATATGTTTGTGAAACACCTGTTTTGGTTGGAGAGGAAATCCGAAAGAAGGTACAGTGTATTTTATTTTGCAATATATGTATATTTTATCTCTCTAGAAGTATGTTGATTATGAGGCAGCAGAAATTTTATATTGGCAAATTCTCTAGGGCTATCTCAAATCTATGGCCAATCAAAAAGCTAATAGTTTGACTTTATTCTACGACCAATGAGGTTACTTGCTGTTTGAATATTTAAAAAAATTCAGGGGCAATTTATGAAAGTGTTGTCCATGGACATGGATGTTAGTCAAATAGTTCTATCATGTATCATCTATGTACTACAACCTTTTTTTTTTTGGCTGAGCTGGGAAGTTTCACCCTACTATTAATGATGGTACCAAATCTAAATTAATTTTTCTTTCTTGCTTATTTTAACACCATGTTAATTCTATGGAGTATTTAGTTGGTCGTAGAAAGAATTGAAGTTAATTATGACTTGGATATATATTTGGTGATATCAACATATTGCATGACTTCATGTGGTATTATTGTTTACGATAGGTACTCTTCTGAGTTTCCTTACTATCATTTAGGATCAAATCCTTTTTGAGTGATGCTGTAGTGTTCAGTGTTCAAGAACTTCCATGTCTTCGTTTGGTTCTTATGAACTCTGATTTGATCTTTCCAGTGCAATCTGCTCAGGAAAACAGTCATGGTTGAGGGACTTGGCTCTAGGAACAATGATTTGGATTCTCATGGATCACAGAGTGTAGAAGGTGAAGAATTTGCTGTTGTGAACATGGATGGAACATCTCATGACAGTTGCGCTCCAGCTTGCAGTAGAATCCCTGGTGCTGATATGCCGTGCAAGAACAGAAACTTGAATACGTACACAAATTTAGTTCCTCAAGTAAACTGGGTGTCTTTCTTTGAGATGATGCATCAGGTTGCTAAGACGCATTGTGTAGAATGTGTGAGGATGGAAGCAGTTTCAATCATGAATTTGATTCTGATGAGAAGTAGTACGTATATGGAGAGGGAGAAGTAAGTTTGTGCTTTTTTTAGTCTATATTTCTACGTGTCAAATAAGAAACTTCTCACATTCCATATACTGTTACAGGTTTGGTCCGGGACTTTTATTTGATAGTGTAGTGGAGTTTATCAGAAAGGAATCCGGTTCAGCTATACAAAAGCATGCTGTGCGTCTTCTATTCCTGATACTAAACTGTAAGCTTTAATAATTACAGTTCTTATTGCCCTAAACGATCCTTTTGGAACTTTCAAGCTCATGATAATTCTGCTTGATAATTATTTAAGTTTTAAACTGGTAGGGAACGTTACAAGTTCTCTCTGTTTCTTATTAAAAAAAAAAACTAGTAGGGAACGTCGGTCTCTTTTCTCATTCTCCTTGACTTAAATTTCTGAGTACATTGTATTCAGACTTCAAATCATCTTGGATTAATTTTTCATTTAGTGAATCGAGGTCATTGGTTCCTTTAATCTTCTTTTGACATCTTCCTAGGTTTAATTTTATGACCCTGAACTTTGGAACTGGATGTGGCATTTGTTTTCATTTTGAATTATTTACTTATCTTAACCTTCCGCAAGACATTGTAAATGCTTGCTATCTGGTCGCAGCTAGAGCCAAAGCCTTAGCACCGCAGCATTATAGGCCTGCTTGTGGCCTTATTTGTTTGAACTTCTAATCAATCGATCTTATCGCACAACATAAACGTGAACTAATTTTTCCATTTATATCGTTTATTTTCTCATAATGTCTGTTTTAATTTAGGTCCTACGTTTTTTGTCGCATTTTGTTCTGGTTGCATGGAGACAGAAGCTACATGTGCTGCAGACGAAAATGTGAGATCTGCTGCAGGTTTTGAGAAATTCAGAACCATCCTTCATGGCTTGGCAGATTGTCTTGCATGTTGCGGAAACGGTATTGAGGTAAGCATCTTAAACGATCCTTCCGCTGCTCCTTTGTTTTCCTTTTAATTCCCCCCTGGTGCAAAACTGAGGGCTATACTGAATCCTATTATATTGGTAGTCAAGAAACTTGAGGTTTGGAACCAAACCACTTGGGTTACTTAAAAAGGGCACTGCTACTGTTGCTCAATTCTTGCAGAGAAAATAGGGAACTTGTGAGAAGTGGTAATTAAACTTGAGGTTTGTCTTTGTTTTAGAGTTCGGTTGCCGTTGCCATGAAATTGAAGTTGATGAAAAAATTATTGTGGAGAGTGTGGAATGAGGCAACCATAGGGATTTTCAAAAGCCTATAGTGGGAAAGATCTTGTGTAGACTTGGTAAATGGAACAAACATTTTCTGTTAGAGGCAGGAAACCTACTCTTGCCGAAGCAGTCCTTGCAAAGTTTCTAACCTATTGCCATTCAATTTTTAGTATAACTGCTCTCTAGCCAAGTAGGTTACTGTTAGAAGCAGGAAAGTCATCACCGTTGACGACCTTATGTTCTCGGGCTTAACAAATTGTAAGAAATTCATATAGGTGTAATTTTTATTTTTTATAATTGATTATTGTTTCAAAAGTTCATCGTGTCATGTAGATTAGGAGGTAATGAACATTATATTATTGTGCTGAGCAACCCACGTGTTTGATGTTAGGTATATACCTTGTTATTGCAGGAGTTGAAACTTCGAAGAAACACTGTTCTTTTGCTCGCTTTTCTAGCATCGTCTGGCAAAGCTGGCTTTGAAATTCTCATAAGCAACAAGCTACCTACAGAGTCAAACTTCCTCGTGTTGATTCTTCAAGTTGTGGTTTCAGAGGTCGAGCACGAACAAAAAGTTCCACAGCCCGTAGAAATTCACGGGGAAAGGTAACTTTGATGAGCTTCTCCATATTATTGTAGATATGAAATCCCACAATTCCTCGAGAACATGCTACTCTAGAAACTCGGAAACTTCAGTTTCAATAACGCATCCATATCAAACACTCCGACACTTATCGAACACGTATTACACTTGTTAGCTCAACAGTTGAGCAGTACAAAGTTGATATAGATTCTGACACTTGTTAGATACGAATTCGAGTTTTCCGTGACATTCCAGAAATGTCATGGTTCCTCATATATGCATTCCATCAAATTAGTGCATATCAAAGATTCCTTGAGCATAAGGATGGGAGAAAATAACAATTTTTTAGAGTAAAATACATGACTTATTTCTTTTAAGCATTAAAAGCATTTATTGATTTTGATTTTCCTTCGTGTATAAAAGTAATATATACTTAAAATAAATAGTATATCTTAATAAATGTGTCCTTATCGTGTCATATGTTGTATTCGCTTCTCATGTTTGGATTTGTGCTTCTTAGATACTACTTTTCCATATACGTTTCAACATTTCTCTTCTCATCAGCTATCTTCAAATGATGTAACCGTGTTCTATTAATTTTCATTATTTAGGACTTTGCTGTTGCGGGAGGTACTTATACTTCTTAATAGACTTGCATCTCATTCGTTATACTCAGCCACAGTCTTGCGAGTGTTAACGAACAGCAGAGATATGGCCAGCCTCACCATTGATGTAATTACCAAGTTGTCCAGAAGAAACAATAGAACTTACCAATTTGACAGGAAGACAAGACAGATGAGGGAATCTGAAGTTGCCGACTTATCCCAGGTATTCAAGAAAAGAGTTCTTACATATTTGGGAAATAGCATAGTATAA
mRNA sequence
ATGCGGAGCGAAGACGAAGGGTTCGAAGATTGGGACGCTGATTTCTTGGACCAACTCATCCAAGTCGAAGAGCTTGCCATCGCCTCCACCGCCGATAATCATCTCATTACGATTCCGATCTCTTCTTCTACCTTCTGCCCCCCGCCGCCATCGCAACCGGAACCGCTACATTTTGTGCAAGCGTTTCATGACCGTCCCATTAGTTATTCGCCTCCTCGAGAACTCTCACAGAGGATCACCGGTGTCCGCTCTCCCAATGGCTTGGGCGAATGTGGTCCTTCTTCTTCGATGCTGGCTCCGTGCTTGCCTCGCCCGGACGCTGCAAAAGAACTCGAGATTTGTAATTTGAAGAGGGAGCTAGGGCGCGTCTCAAAGCAACTCAAGGACTTGGAACAAGAGTGTGTTGAACTCAGGAAGAAAAGAGACAGGAAAGAGGAGCAGCTTAAAGTTGTCTTTTCCAATAAAGATGAACAATTTATTGGCCATCATGGTTCAGAGAGTACAGATTTGAGAACAGCGGGGAATGATGGTGAGCATAATTCTAGGAAGATTGAAGATCTTGCTGGTGACCTTGGTGCCCCTCACATTGTTTCTTCTAGTAGTAAAGCCATCGGCGAACAGGGAGGCCAAGCTCATAATTCTGCTGGGGAGAGAGTCAATGATAAATTACCTGCTTTTCACAACCTATCCAAGAAGCTGCAAGTATTCTGGGTCCCCGAAAGTGGCTCTAAGATGGGACAATCTTTGGTTTCAGAATTACTTTTATCATGTGAAACAGATTTTCATGTGCTTTTTGGGTGCATCAGCACGAAGTTATCCCCCAAATTTTCTGTCGATTCCCTAGCTGGGGATAACGTTTCTGATGTAGCTTTAAAGAACCCTTTGCAGTTTCTTCATGGTCTGGAAGCCGTAAAAGTATCTAATCTCTACACCACTTTGACTAAGTGCAATCTGCTCAGGAAAACAGTCATGGTTGAGGGACTTGGCTCTAGGAACAATGATTTGGATTCTCATGGATCACAGAGTGTAGAAGGTGAAGAATTTGCTGTTGTGAACATGGATGGAACATCTCATGACAGTTGCGCTCCAGCTTGCAGTAGAATCCCTGGTGCTGATATGCCGTGCAAGAACAGAAACTTGAATACGTACACAAATTTAGTTCCTCAAGTAAACTGGGTGTCTTTCTTTGAGATGATGCATCAGGTTGCTAAGACGCATTGTGTAGAATGTGTGAGGATGGAAGCAGTTTCAATCATGAATTTGATTCTGATGAGAAGTAGTACGTATATGGAGAGGGAGAAGTTTGGTCCGGGACTTTTATTTGATAGTGTAGTGGAGTTTATCAGAAAGGAATCCGGTTCAGCTATACAAAAGCATGCTGTGCGTCTTCTATTCCTGATACTAAACTGTCCTACGTTTTTTGTCGCATTTTGTTCTGGTTGCATGGAGACAGAAGCTACATGTGCTGCAGACGAAAATGTGAGATCTGCTGCAGGTTTTGAGAAATTCAGAACCATCCTTCATGGCTTGGCAGATTGTCTTGCATGTTGCGGAAACGGTATTGAGGAGTTGAAACTTCGAAGAAACACTGTTCTTTTGCTCGCTTTTCTAGCATCGTCTGGCAAAGCTGGCTTTGAAATTCTCATAAGCAACAAGCTACCTACAGAGTCAAACTTCCTCGTGTTGATTCTTCAAGTTGTGGTTTCAGAGGTCGAGCACGAACAAAAAGTTCCACAGCCCGTAGAAATTCACGGGGAAAGGACTTTGCTGTTGCGGGAGGTACTTATACTTCTTAATAGACTTGCATCTCATTCGTTATACTCAGCCACAGTCTTGCGAGTGTTAACGAACAGCAGAGATATGGCCAGCCTCACCATTGATGTAATTACCAAGTTGTCCAGAAGAAACAATAGAACTTACCAATTTGACAGGAAGACAAGACAGATGAGGGAATCTGAAGTTGCCGACTTATCCCAGGTATTCAAGAAAAGAGTTCTTACATATTTGGGAAATAGCATAGTATAA
Coding sequence (CDS)
ATGCGGAGCGAAGACGAAGGGTTCGAAGATTGGGACGCTGATTTCTTGGACCAACTCATCCAAGTCGAAGAGCTTGCCATCGCCTCCACCGCCGATAATCATCTCATTACGATTCCGATCTCTTCTTCTACCTTCTGCCCCCCGCCGCCATCGCAACCGGAACCGCTACATTTTGTGCAAGCGTTTCATGACCGTCCCATTAGTTATTCGCCTCCTCGAGAACTCTCACAGAGGATCACCGGTGTCCGCTCTCCCAATGGCTTGGGCGAATGTGGTCCTTCTTCTTCGATGCTGGCTCCGTGCTTGCCTCGCCCGGACGCTGCAAAAGAACTCGAGATTTGTAATTTGAAGAGGGAGCTAGGGCGCGTCTCAAAGCAACTCAAGGACTTGGAACAAGAGTGTGTTGAACTCAGGAAGAAAAGAGACAGGAAAGAGGAGCAGCTTAAAGTTGTCTTTTCCAATAAAGATGAACAATTTATTGGCCATCATGGTTCAGAGAGTACAGATTTGAGAACAGCGGGGAATGATGGTGAGCATAATTCTAGGAAGATTGAAGATCTTGCTGGTGACCTTGGTGCCCCTCACATTGTTTCTTCTAGTAGTAAAGCCATCGGCGAACAGGGAGGCCAAGCTCATAATTCTGCTGGGGAGAGAGTCAATGATAAATTACCTGCTTTTCACAACCTATCCAAGAAGCTGCAAGTATTCTGGGTCCCCGAAAGTGGCTCTAAGATGGGACAATCTTTGGTTTCAGAATTACTTTTATCATGTGAAACAGATTTTCATGTGCTTTTTGGGTGCATCAGCACGAAGTTATCCCCCAAATTTTCTGTCGATTCCCTAGCTGGGGATAACGTTTCTGATGTAGCTTTAAAGAACCCTTTGCAGTTTCTTCATGGTCTGGAAGCCGTAAAAGTATCTAATCTCTACACCACTTTGACTAAGTGCAATCTGCTCAGGAAAACAGTCATGGTTGAGGGACTTGGCTCTAGGAACAATGATTTGGATTCTCATGGATCACAGAGTGTAGAAGGTGAAGAATTTGCTGTTGTGAACATGGATGGAACATCTCATGACAGTTGCGCTCCAGCTTGCAGTAGAATCCCTGGTGCTGATATGCCGTGCAAGAACAGAAACTTGAATACGTACACAAATTTAGTTCCTCAAGTAAACTGGGTGTCTTTCTTTGAGATGATGCATCAGGTTGCTAAGACGCATTGTGTAGAATGTGTGAGGATGGAAGCAGTTTCAATCATGAATTTGATTCTGATGAGAAGTAGTACGTATATGGAGAGGGAGAAGTTTGGTCCGGGACTTTTATTTGATAGTGTAGTGGAGTTTATCAGAAAGGAATCCGGTTCAGCTATACAAAAGCATGCTGTGCGTCTTCTATTCCTGATACTAAACTGTCCTACGTTTTTTGTCGCATTTTGTTCTGGTTGCATGGAGACAGAAGCTACATGTGCTGCAGACGAAAATGTGAGATCTGCTGCAGGTTTTGAGAAATTCAGAACCATCCTTCATGGCTTGGCAGATTGTCTTGCATGTTGCGGAAACGGTATTGAGGAGTTGAAACTTCGAAGAAACACTGTTCTTTTGCTCGCTTTTCTAGCATCGTCTGGCAAAGCTGGCTTTGAAATTCTCATAAGCAACAAGCTACCTACAGAGTCAAACTTCCTCGTGTTGATTCTTCAAGTTGTGGTTTCAGAGGTCGAGCACGAACAAAAAGTTCCACAGCCCGTAGAAATTCACGGGGAAAGGACTTTGCTGTTGCGGGAGGTACTTATACTTCTTAATAGACTTGCATCTCATTCGTTATACTCAGCCACAGTCTTGCGAGTGTTAACGAACAGCAGAGATATGGCCAGCCTCACCATTGATGTAATTACCAAGTTGTCCAGAAGAAACAATAGAACTTACCAATTTGACAGGAAGACAAGACAGATGAGGGAATCTGAAGTTGCCGACTTATCCCAGGTATTCAAGAAAAGAGTTCTTACATATTTGGGAAATAGCATAGTATAA
Protein sequence
MRSEDEGFEDWDADFLDQLIQVEELAIASTADNHLITIPISSSTFCPPPPSQPEPLHFVQAFHDRPISYSPPRELSQRITGVRSPNGLGECGPSSSMLAPCLPRPDAAKELEICNLKRELGRVSKQLKDLEQECVELRKKRDRKEEQLKVVFSNKDEQFIGHHGSESTDLRTAGNDGEHNSRKIEDLAGDLGAPHIVSSSSKAIGEQGGQAHNSAGERVNDKLPAFHNLSKKLQVFWVPESGSKMGQSLVSELLLSCETDFHVLFGCISTKLSPKFSVDSLAGDNVSDVALKNPLQFLHGLEAVKVSNLYTTLTKCNLLRKTVMVEGLGSRNNDLDSHGSQSVEGEEFAVVNMDGTSHDSCAPACSRIPGADMPCKNRNLNTYTNLVPQVNWVSFFEMMHQVAKTHCVECVRMEAVSIMNLILMRSSTYMEREKFGPGLLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGCMETEATCAADENVRSAAGFEKFRTILHGLADCLACCGNGIEELKLRRNTVLLLAFLASSGKAGFEILISNKLPTESNFLVLILQVVVSEVEHEQKVPQPVEIHGERTLLLREVLILLNRLASHSLYSATVLRVLTNSRDMASLTIDVITKLSRRNNRTYQFDRKTRQMRESEVADLSQVFKKRVLTYLGNSIV
Homology
BLAST of Moc06g36330 vs. NCBI nr
Match:
XP_022155903.1 (uncharacterized protein LOC111022902 isoform X1 [Momordica charantia])
HSP 1 Score: 1289.6 bits (3336), Expect = 0.0e+00
Identity = 673/720 (93.47%), Postives = 673/720 (93.47%), Query Frame = 0
Query: 1 MRSEDEGFEDWDADFLDQLIQVEELAIASTADNHLITIPISSSTFCPPPPSQPEPLHFVQ 60
MRSEDEGFEDWDADFLDQLIQVEELAIASTADNHLITIPISSSTFCPPPPSQPEPLHFVQ
Sbjct: 1 MRSEDEGFEDWDADFLDQLIQVEELAIASTADNHLITIPISSSTFCPPPPSQPEPLHFVQ 60
Query: 61 AFHDRPISYSPPRELSQRITGVRSPNGLGECGPSSSMLAPCLPRPDAAKELEICNLKREL 120
AFHDRPISYSPPRELSQRITGVRSPNGLGECGPSSSMLAPCLPRPDAAKELEICNLKREL
Sbjct: 61 AFHDRPISYSPPRELSQRITGVRSPNGLGECGPSSSMLAPCLPRPDAAKELEICNLKREL 120
Query: 121 GRVSKQLKDLEQECVELRKKRDRKEEQLKVVFSNKDEQFIGHHGSESTDLRTAGNDGEHN 180
GRVSKQLKDLEQECVELRKKRDRKEEQLKVVFSNKDEQFIGHHGSESTDLRTAGNDGEHN
Sbjct: 121 GRVSKQLKDLEQECVELRKKRDRKEEQLKVVFSNKDEQFIGHHGSESTDLRTAGNDGEHN 180
Query: 181 SRKIEDLAGDLGAPHIVSSSSKAIGEQGGQAHNSAGERVNDKLPAFHNLSKKLQVFWVPE 240
SRKIEDLAGDLGAPHIVSSSSKAIGEQGGQAHNSAGERVNDKLPAFHNLSKKLQVFWVPE
Sbjct: 181 SRKIEDLAGDLGAPHIVSSSSKAIGEQGGQAHNSAGERVNDKLPAFHNLSKKLQVFWVPE 240
Query: 241 SGSKMGQSLVSELLLSCETDFHVLFGCISTKLSPKFSVDSLAGDNVSDVALKNPLQFLHG 300
SGSKMGQSLVSELLLSCETDFHVLFGCISTKLSPKFSVDSLAGDNVSDVALKNPLQFLHG
Sbjct: 241 SGSKMGQSLVSELLLSCETDFHVLFGCISTKLSPKFSVDSLAGDNVSDVALKNPLQFLHG 300
Query: 301 LEAVKVSNLYTTLTK------------------CNL------------------------ 360
LEAVKVSNLYTTLTK CNL
Sbjct: 301 LEAVKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVVIVHRTLHILHMFVKHLFWLE 360
Query: 361 ----LRKTVMVEGLGSRNNDLDSHGSQSVEGEEFAVVNMDGTSHDSCAPACSRIPGADMP 420
RKTVMVEGLGSRNNDLDSHGSQSVEGEEFAVVNMDGTSHDSCAPACSRIPGADMP
Sbjct: 361 RKSERRKTVMVEGLGSRNNDLDSHGSQSVEGEEFAVVNMDGTSHDSCAPACSRIPGADMP 420
Query: 421 CKNRNLNTYTNLVPQVNWVSFFEMMHQVAKTHCVECVRMEAVSIMNLILMRSSTYMEREK 480
CKNRNLNTYTNLVPQVNWVSFFEMMHQVAKTHCVECVRMEAVSIMNLILMRSSTYMEREK
Sbjct: 421 CKNRNLNTYTNLVPQVNWVSFFEMMHQVAKTHCVECVRMEAVSIMNLILMRSSTYMEREK 480
Query: 481 FGPGLLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGCMETEATCAADENV 540
FGPGLLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGCMETEATCAADENV
Sbjct: 481 FGPGLLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGCMETEATCAADENV 540
Query: 541 RSAAGFEKFRTILHGLADCLACCGNGIEELKLRRNTVLLLAFLASSGKAGFEILISNKLP 600
RSAAGFEKFRTILHGLADCLACCGNGIEELKLRRNTVLLLAFLASSGKAGFEILISNKLP
Sbjct: 541 RSAAGFEKFRTILHGLADCLACCGNGIEELKLRRNTVLLLAFLASSGKAGFEILISNKLP 600
Query: 601 TESNFLVLILQVVVSEVEHEQKVPQPVEIHGERTLLLREVLILLNRLASHSLYSATVLRV 660
TESNFLVLILQVVVSEVEHEQKVPQPVEIHGERTLLLREVLILLNRLASHSLYSATVLRV
Sbjct: 601 TESNFLVLILQVVVSEVEHEQKVPQPVEIHGERTLLLREVLILLNRLASHSLYSATVLRV 660
Query: 661 LTNSRDMASLTIDVITKLSRRNNRTYQFDRKTRQMRESEVADLSQVFKKRVLTYLGNSIV 675
LTNSRDMASLTIDVITKLSRRNNRTYQFDRKTRQMRESEVADLSQVFKKRVLTYLGNSIV
Sbjct: 661 LTNSRDMASLTIDVITKLSRRNNRTYQFDRKTRQMRESEVADLSQVFKKRVLTYLGNSIV 720
BLAST of Moc06g36330 vs. NCBI nr
Match:
XP_038888976.1 (protein SENSITIVE TO UV 2 [Benincasa hispida])
HSP 1 Score: 946.8 bits (2446), Expect = 1.0e-271
Identity = 519/732 (70.90%), Postives = 572/732 (78.14%), Query Frame = 0
Query: 1 MRSEDEGFEDWDADFLDQLIQVEELAIASTADNHL---ITIPISSSTFCPPPPSQPE--- 60
M +EDEGFEDWDADFLDQLIQVEELAI+STA+NH+ I+IP SSST+ P PP QPE
Sbjct: 1 METEDEGFEDWDADFLDQLIQVEELAISSTANNHIPIPISIPSSSSTYFPLPPPQPEPEP 60
Query: 61 -PLHFVQAFHDRPISYSPPRELSQRITGVRS-----PNGLGECGPSSSMLAPCLPRPDAA 120
P H V+ FHDRPISYSPPRELSQR TG+RS PNG GE GPSSS LAPCL RPDAA
Sbjct: 61 QPQHLVEVFHDRPISYSPPRELSQRATGLRSHAIRLPNGFGEYGPSSSALAPCLHRPDAA 120
Query: 121 KELEICNLKRELGRVSKQLKDLEQECVELRKKRDRKEEQLKVVFSNKDEQFIGHHGSEST 180
KELEI +LKRELGRVSKQLKDLEQECVELRKKRD+ EEQLKVV SNKDEQ+IG SEST
Sbjct: 121 KELEIYDLKRELGRVSKQLKDLEQECVELRKKRDKNEEQLKVVSSNKDEQYIGRCVSEST 180
Query: 181 DLRTAGNDGEHNSRKIEDLAGDLGAPHIVSSSSKAIGEQGGQAHNSAGERVNDKLPAFHN 240
DLR AG DG K ED+AGDLG PH V+S KA EQ G+AH+S GER ND LPAF
Sbjct: 181 DLRVAGKDGGRTGMKSEDIAGDLGGPHTVTSRRKA-NEQVGKAHSSVGERANDDLPAFDK 240
Query: 241 LSKKLQVFWVPESGSKMGQSLVSELLLSCETDFHVLFGCISTKLSPKFSVDSLAGDNVSD 300
LSKKLQVFWVPES SK+GQ+LVSELLLSCETDF VLF IST+LSPKFSVD L GDN SD
Sbjct: 241 LSKKLQVFWVPESDSKIGQNLVSELLLSCETDFRVLFHSISTELSPKFSVDFLGGDNSSD 300
Query: 301 VALKNPLQFLHGLEAVKVSNLYTTLTK------------------CNL------------ 360
+ +QFL EA KVSNLYTTLTK CNL
Sbjct: 301 I-----VQFLRCPEAKKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVAIVHRSLHI 360
Query: 361 ----------------LRKTVMVEGLGSRNNDLDSHGSQSVEGEEFAVVNMDGTSHDSCA 420
R+TVM+ GLGSRNN +DSHGSQS EGEEFA+ NMD TSH SCA
Sbjct: 361 LHMFLKRLLWLERKSERRETVMIGGLGSRNNAVDSHGSQSAEGEEFALANMDKTSHGSCA 420
Query: 421 PACSRIPGADMPCKNRNLNTYTNLVPQVNWVSFFEMMHQVAKTHCVECVRMEAVSIMNLI 480
PA +R+PGA + CKNRNLN NLVPQ+NWV+FFE+MHQVAK H +CVR+EAVS+MNLI
Sbjct: 421 PAGTRLPGAALLCKNRNLNKNINLVPQINWVAFFEVMHQVAKRHSAKCVRIEAVSVMNLI 480
Query: 481 LMRSSTYMEREKFGPGLLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGCM 540
LMR++TY+E+EKFG LLFDSVVEFIRKESGSAIQKH VRLLFLILNCPTFFV FCSGC
Sbjct: 481 LMRNNTYLEKEKFGQALLFDSVVEFIRKESGSAIQKHGVRLLFLILNCPTFFVVFCSGCK 540
Query: 541 ETEATCAADENVRSAAGFEKFRTILHGLADCLACCGNGIEELKLRRNTVLLLAFLASSGK 600
E EAT AA+ENVR A GF+KFRTILH LADCL CCGNGIEELKLRRNT+LLLAFLASSGK
Sbjct: 541 EAEATDAAEENVRCAGGFQKFRTILHSLADCLTCCGNGIEELKLRRNTILLLAFLASSGK 600
Query: 601 AGFEILISNKLPTESNFLVLILQVVVSEVEHEQKVPQPVEIHGERTLLLREVLILLNRLA 660
GFEILISNKL TESNFL LILQV SEVE E+ VP+PVE ER LLLREVLILLNRLA
Sbjct: 601 VGFEILISNKLYTESNFLALILQVTASEVEQEKTVPEPVENLEERALLLREVLILLNRLA 660
Query: 661 SHSLYSATVLRVLTNSRDMASLTIDVITKLSRRNNRTYQFDRKTRQMRESEVADLSQVFK 675
SHSLYS TVLRVLTNSRDMASL IDV KL R+NNR +QFD K R+MRE+EV +L+QVF+
Sbjct: 661 SHSLYSGTVLRVLTNSRDMASLAIDVTNKLCRKNNRNWQFDSKKRKMRETEVVELAQVFR 720
BLAST of Moc06g36330 vs. NCBI nr
Match:
XP_022967198.1 (uncharacterized protein LOC111466806 isoform X1 [Cucurbita maxima])
HSP 1 Score: 940.3 bits (2429), Expect = 9.4e-270
Identity = 513/730 (70.27%), Postives = 569/730 (77.95%), Query Frame = 0
Query: 1 MRSEDEGFEDWDADFLDQLIQVEELAIASTADNHLITIPISSSTFCPPPP-----SQPEP 60
MRSEDEGFEDWDADFLDQLIQVEELAI+STA+N I SSST+CPPPP +PEP
Sbjct: 1 MRSEDEGFEDWDADFLDQLIQVEELAISSTANNP-NPIQCSSSTYCPPPPPPEPEPEPEP 60
Query: 61 LHFVQAFHDRPISYSPPRELSQRITG-----VRSPNGLGECGPSSSMLAPCLPRPDAAKE 120
H V+ HDR ISYSPPRELSQR G +RS GLGECGPSSS APCLP PDAAKE
Sbjct: 61 QHLVEVSHDRLISYSPPRELSQRAAGSRSHAIRSAIGLGECGPSSSAKAPCLPCPDAAKE 120
Query: 121 LEICNLKRELGRVSKQLKDLEQECVELRKKRDRKEEQLKVVFSNKDEQFIGHHGSESTDL 180
LEI NLKRELGRVSKQLK+LEQEC+ELRKKRD+KEEQL VVFSNKD+Q+I HHG E T+L
Sbjct: 121 LEISNLKRELGRVSKQLKNLEQECIELRKKRDKKEEQLNVVFSNKDKQYIAHHGPEITEL 180
Query: 181 RTAGNDGEHNSRKIEDLAGDLGAPHIVSSSSKAIGEQGGQAHNSAGERVNDKLPAFHNLS 240
R AG DG H K ED++ DLG PH V+S SKA EQG ++HNS GER +D PAF LS
Sbjct: 181 RVAGKDGGHPGIKSEDISCDLGGPHTVTSRSKA-NEQGEKSHNSVGERADDNSPAFDKLS 240
Query: 241 KKLQVFWVPESGSKMGQSLVSELLLSCETDFHVLFGCISTKLSPKFSVDSLAGDNVSDVA 300
KKLQVFWVPE SKMGQSLVSELLLSCE DFHVL+ CI T+LSPKFSV+SLAG N SDVA
Sbjct: 241 KKLQVFWVPEKDSKMGQSLVSELLLSCERDFHVLYQCIGTELSPKFSVNSLAGVNSSDVA 300
Query: 301 LKNPLQFLHGLEAVKVSNLYTTLTK------------------CNL-------------- 360
LK+PLQFLHGLE++KVSNLYTTL K CNL
Sbjct: 301 LKHPLQFLHGLESIKVSNLYTTLAKVSNGIVKMEALFTPLIDLCNLDNVAIVHRSLHILH 360
Query: 361 --------------LRKTVMVEGLGSRNNDLDSHGSQSVEGEEFAVVNMDGTSHDSCAPA 420
RKTVM+ GLG RNN +DS+GS S EGEEF+++NMD TS C+PA
Sbjct: 361 MFLKRLMWLERKSERRKTVMIGGLGPRNNVVDSYGSHSAEGEEFSLLNMDETSTGHCSPA 420
Query: 421 CSRIPGADMPCKNRNLNTYTNLVPQVNWVSFFEMMHQVAKTHCVECVRMEAVSIMNLILM 480
PGA++ KNRNLN NLVP+VNWVSFFEMMH+VAKTH EC R+EAVS+MNLILM
Sbjct: 421 GMGFPGAELLFKNRNLNKNINLVPRVNWVSFFEMMHRVAKTHSAECARLEAVSVMNLILM 480
Query: 481 RSSTYMEREKFGPGLLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGCMET 540
R++TY+EREKFG LLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGC E
Sbjct: 481 RNNTYLEREKFGQALLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGCKEA 540
Query: 541 EATCAADENVRSAAGFEKFRTILHGLADCLACCGNGIEELKLRRNTVLLLAFLASSGKAG 600
EA AA+EN R A GF+KFRTILHGL DCL C GNGI+ELKLRRNTVLLLAFL+SSGKAG
Sbjct: 541 EAADAAEENGRCAGGFQKFRTILHGLTDCLTCLGNGIQELKLRRNTVLLLAFLSSSGKAG 600
Query: 601 FEILISNKLPTESNFLVLILQVVVSEVEHEQKVPQPVEIHGERTLLLREVLILLNRLASH 660
FEIL+SN L +SNFL LILQ VVSEVE E++V + VE ER LLLREVLILLNRLASH
Sbjct: 601 FEILVSNTLHKDSNFLTLILQAVVSEVEQEKRVSEAVETLEERALLLREVLILLNRLASH 660
Query: 661 SLYSATVLRVLTNSRDMASLTIDVITKLSRRNNRTYQFDRKTRQMRESEVADLSQVFKKR 675
S+YSATVLRVLT+SRDMASLTIDV KLSR+NNR QFD K R+MRESEV DL+QVF+KR
Sbjct: 661 SVYSATVLRVLTSSRDMASLTIDVTNKLSRKNNRNCQFDGKKRKMRESEVVDLAQVFRKR 720
BLAST of Moc06g36330 vs. NCBI nr
Match:
KAG7011360.1 (hypothetical protein SDJN02_26265, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 939.9 bits (2428), Expect = 1.2e-269
Identity = 517/726 (71.21%), Postives = 567/726 (78.10%), Query Frame = 0
Query: 1 MRSEDEGFEDWDADFLDQLIQVEELAIASTADNHLITIPISSSTFC-PPPPSQPEPLHFV 60
MRSEDEGFEDWDADFLDQLIQVEELAI+STA+N I SSST+C PPPP +PEP H V
Sbjct: 1 MRSEDEGFEDWDADFLDQLIQVEELAISSTANNP-NPIQCSSSTYCPPPPPPEPEPQHLV 60
Query: 61 QAFHDRPISYSPPRELSQRITG-----VRSPNGLGECGPSSSMLAPCLPRPDAAKELEIC 120
+ HDRPISYSPPRELSQR G +RSP GLGECGPSSS LAPCLP PDAAKELEI
Sbjct: 61 EVSHDRPISYSPPRELSQRAAGLRSHSIRSPIGLGECGPSSSALAPCLPCPDAAKELEIS 120
Query: 121 NLKRELGRVSKQLKDLEQECVELRKKRDRKEEQLKVVFSNKDEQFIGHHGSESTDLRTAG 180
+LKRELGRVSKQLK+LEQECVELRKKRD+KEEQL VVFSNKD+Q+I HHG E TDLR AG
Sbjct: 121 SLKRELGRVSKQLKNLEQECVELRKKRDKKEEQLNVVFSNKDKQYIAHHGPEITDLRVAG 180
Query: 181 NDGEHNSRKIEDLAGDLGAPHIVSSSSKAIGEQGGQAHNSAGERVNDKLPAFHNLSKKLQ 240
DG H K ED + G PH V+S SKA EQG + HNS GER ND PAF LSKKLQ
Sbjct: 181 KDGGHPGIKSED---NSGGPHTVTSRSKA-NEQGEKTHNSVGERANDDSPAFDKLSKKLQ 240
Query: 241 VFWVPESGSKMGQSLVSELLLSCETDFHVLFGCISTKLSPKFSVDSLAGDNVSDVALKNP 300
VFWVPE KMGQSLVSELLLSCE DFHVLF CI T+LSPKFSV+SLAG N SDVALK+P
Sbjct: 241 VFWVPEKDFKMGQSLVSELLLSCERDFHVLFQCIGTELSPKFSVNSLAGVNSSDVALKHP 300
Query: 301 LQFLHGLEAVKVSNLYTTLTK------------------CNL------------------ 360
LQ LHG E++KVSNLYTTLTK CNL
Sbjct: 301 LQVLHGPESIKVSNLYTTLTKVSNGIVKMEALFTPLIDLCNLDNVAIVHRSLHILHMFLK 360
Query: 361 ----------LRKTVMVEGLGSRNNDLDSHGSQSVEGEEFAVVNMDGTSHDSCAPACSRI 420
RKTVM+ GLG RNN +DS+GS S EGEEF+++NMD TS C+PA
Sbjct: 361 RLMWLERKSERRKTVMIGGLGPRNNVVDSYGSHSAEGEEFSLLNMDETSTGHCSPAGMGF 420
Query: 421 PGADMPCKNRNLNTYTNLVPQVNWVSFFEMMHQVAKTHCVECVRMEAVSIMNLILMRSST 480
PGA++ KNRNLN NLVP+VNWVSFFEMMH+VAKTH EC R+EAVS+MNLILMR++T
Sbjct: 421 PGAELLFKNRNLNKNINLVPRVNWVSFFEMMHRVAKTHSAECARLEAVSVMNLILMRNNT 480
Query: 481 YMEREKFGPGLLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGCMETEATC 540
Y+EREKFG LLFDSVVEFI KESGSAIQKHAVRLLFLILNCPTFFVAFCSGC E EA
Sbjct: 481 YLEREKFGQALLFDSVVEFIGKESGSAIQKHAVRLLFLILNCPTFFVAFCSGCKEAEAAD 540
Query: 541 AADENVRSAAGFEKFRTILHGLADCLACCGNGIEELKLRRNTVLLLAFLASSGKAGFEIL 600
AA+ENVR A GF+KFRTILHGLADCL C GNGI ELKLRRNTVLLLAFL+SSGKAGFEIL
Sbjct: 541 AAEENVRCAGGFQKFRTILHGLADCLTCLGNGILELKLRRNTVLLLAFLSSSGKAGFEIL 600
Query: 601 ISNKLPTESNFLVLILQVVVSEVEHEQKVPQPVEIHGERTLLLREVLILLNRLASHSLYS 660
+SN L +SNFL LILQVVVSEVE E++V + VE ER LLLREVLILLNRLASHS+YS
Sbjct: 601 VSNTLHKDSNFLTLILQVVVSEVEQEKRVSEAVETMEERALLLREVLILLNRLASHSVYS 660
Query: 661 ATVLRVLTNSRDMASLTIDVITKLSRRNNRTYQFDRKTRQMRESEVADLSQVFKKRVLTY 675
ATVLRVLT+SRDMASLTIDV KLSR+NNR QFD K R+MRESEV DL+QVF+KRVLTY
Sbjct: 661 ATVLRVLTSSRDMASLTIDVTNKLSRKNNRNCQFDSKKRKMRESEVVDLAQVFRKRVLTY 720
BLAST of Moc06g36330 vs. NCBI nr
Match:
XP_023553684.1 (uncharacterized protein LOC111811167 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 937.9 bits (2423), Expect = 4.7e-269
Identity = 518/729 (71.06%), Postives = 566/729 (77.64%), Query Frame = 0
Query: 1 MRSEDEGFEDWDADFLDQLIQVEELAIASTADNHLITIPISSSTFCPPPP----SQPEPL 60
MRSEDEGFEDWDADFLDQLIQVEELAI+STA+N I SSST+CPPPP +PEPL
Sbjct: 1 MRSEDEGFEDWDADFLDQLIQVEELAISSTANNP-NPIQCSSSTYCPPPPPEPEPEPEPL 60
Query: 61 HFVQAFHDRPISYSPPRELSQRITG-----VRSPNGLGECGPSSSMLAPCLPRPDAAKEL 120
H V+ HDRPISYSPPRELSQR G +RSP GLGECGPSSS LAPCLP PDAAKEL
Sbjct: 61 HLVEVSHDRPISYSPPRELSQRAAGLRSHAIRSPIGLGECGPSSSALAPCLPCPDAAKEL 120
Query: 121 EICNLKRELGRVSKQLKDLEQECVELRKKRDRKEEQLKVVFSNKDEQFIGHHGSESTDLR 180
EI NLKRELGRVSKQLK+LEQECVELRKKRD+KEEQL VVFSNKD+Q+I HHG E TDLR
Sbjct: 121 EISNLKRELGRVSKQLKNLEQECVELRKKRDKKEEQLNVVFSNKDKQYIAHHGPEITDLR 180
Query: 181 TAGNDGEHNSRKIEDLAGDLGAPHIVSSSSKAIGEQGGQAHNSAGERVNDKLPAFHNLSK 240
AG DG H K ED + V+S SKA EQG + HNS GER ND PAF LSK
Sbjct: 181 VAGKDGGHPGIKSED--------NSVTSRSKA-NEQGEKTHNSVGERANDDSPAFDKLSK 240
Query: 241 KLQVFWVPESGSKMGQSLVSELLLSCETDFHVLFGCISTKLSPKFSVDSLAGDNVSDVAL 300
KLQVFWVPE KMGQSLVSELL SCE DFHVLF CI T+LSPKFSV+SLAG N SDVAL
Sbjct: 241 KLQVFWVPEKDFKMGQSLVSELLFSCERDFHVLFQCIGTELSPKFSVNSLAGVNSSDVAL 300
Query: 301 KNPLQFLHGLEAVKVSNLYTTLTK------------------CNL--------------- 360
K+PLQFLHG E++KVSNLYTTLTK CNL
Sbjct: 301 KHPLQFLHGPESIKVSNLYTTLTKVSNGIVKMEALFTPLIDLCNLDNVAIVHRSLHILHM 360
Query: 361 -------------LRKTVMVEGLGSRNNDLDSHGSQSVEGEEFAVVNMDGTSHDSCAPAC 420
RKTVM+ GLG RNN +DS+GS S EGEEF+++NMD TS C+PA
Sbjct: 361 FLKRLMWLERKSERRKTVMIGGLGPRNNVVDSYGSHSAEGEEFSLLNMDETSTGHCSPAG 420
Query: 421 SRIPGADMPCKNRNLNTYTNLVPQVNWVSFFEMMHQVAKTHCVECVRMEAVSIMNLILMR 480
PGA++ KNRNLN NLVP+VNWVSFFEMMH+VAKTH EC R+EAVS+MNLILMR
Sbjct: 421 MGFPGAELLFKNRNLNKNINLVPRVNWVSFFEMMHRVAKTHSAECARLEAVSVMNLILMR 480
Query: 481 SSTYMEREKFGPGLLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGCMETE 540
++TYMEREKFG LLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGC E E
Sbjct: 481 NNTYMEREKFGQALLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGCKEAE 540
Query: 541 ATCAADENVRSAAGFEKFRTILHGLADCLACCGNGIEELKLRRNTVLLLAFLASSGKAGF 600
A AA+ENVR A GF+KFRTILHGLADCL C GNGI+ELKLRRNTVLLLAFL+SSGKAGF
Sbjct: 541 AADAAEENVRCAGGFQKFRTILHGLADCLTCLGNGIQELKLRRNTVLLLAFLSSSGKAGF 600
Query: 601 EILISNKLPTESNFLVLILQVVVSEVEHEQKVPQPVEIHGERTLLLREVLILLNRLASHS 660
EIL+SN L SNFL LILQVVVSEVE E++V + VE ER LLLREVLILLNRLASHS
Sbjct: 601 EILVSNTLHKNSNFLSLILQVVVSEVEQEKRVSEAVETLEERALLLREVLILLNRLASHS 660
Query: 661 LYSATVLRVLTNSRDMASLTIDVITKLSRRNNRTYQFDRKTRQMRESEVADLSQVFKKRV 675
+YSATVLRVLT+SRDMASLTIDV KLSR+NNR QFD K R+MRESEV DL+QVF+KRV
Sbjct: 661 VYSATVLRVLTSSRDMASLTIDVTNKLSRKNNRNCQFDGKKRKMRESEVVDLAQVFRKRV 719
BLAST of Moc06g36330 vs. ExPASy Swiss-Prot
Match:
C8KI33 (Protein SENSITIVE TO UV 2 OS=Arabidopsis thaliana OX=3702 GN=SUV2 PE=1 SV=1)
HSP 1 Score: 278.1 bits (710), Expect = 2.6e-73
Identity = 233/705 (33.05%), Postives = 350/705 (49.65%), Query Frame = 0
Query: 1 MRSEDEGFEDWDADFLDQLIQVEELAIASTADNHLITIPISSSTFCPPPP-------SQP 60
M DE F D +FL + +E AD + P TF P PP S
Sbjct: 1 MSGNDEEFND---EFLLAIDSIE--TTLKKADMYRPLPPPYLPTFLPAPPPSTKISSSLS 60
Query: 61 EPLHFVQA---------FHDRPISYSPPRELSQRITGVRSPNGLGECGPSSSMLA--PCL 120
P+ + D +SYSPPRELSQR+ + + L + S+ + A P
Sbjct: 61 HPMQLQSSAGQQRKQIQVPDPFLSYSPPRELSQRVVSGFN-DALMDYSNSTVVTAAKPIS 120
Query: 121 P-----RPDAAKELEICNLKRELGRVSKQLKDLEQECVELRKKRDRKEEQLKVVFSNKDE 180
P R D+ K+LEI LK+EL RVSKQL D+EQEC +L+K + ++ E + +
Sbjct: 121 PTTSNRRCDSEKDLEIDRLKKELERVSKQLLDVEQECSQLKKGKSKETESRNLCADDNRG 180
Query: 181 QFIGHHGSESTDLR-----TAGNDGEHNSRKIEDLAGDLGAPHIVSSSSKAIGEQGGQAH 240
Q H S+ DL ++ N E++SR D S K G Q A+
Sbjct: 181 QCSTVHASKRIDLEPDVATSSVNHRENDSRMALD----------DKRSFKTTGVQADVAN 240
Query: 241 NSAGERVNDKLPAFHNLSKKLQVFWVPESGSKMGQSLVSELLLSCETDFHVLFGCISTKL 300
+S +LSKKL W + ++L+SELLL+C TD +LF +
Sbjct: 241 HS-------------DLSKKLLDIWRTSNYQDPRKNLISELLLACSTDLQILFSFMKIST 300
Query: 301 SPKFSVDSLAGDNVSDVALKNPLQFLHGLEAVKVSNLYTTLTKCN--LLRKTVMVEGLGS 360
P+ N + + Q LE+ KV LY+ +TK + + +VE L
Sbjct: 301 PPQEL-------NKQEAKTSSDRQSSKALESEKVYQLYSAVTKISYGFVNLKTLVEPL-- 360
Query: 361 RNNDLDSHGSQSVEGEEFAVVNMDGTSHDSCAPACSRIPGADMPCKNRNLNTYTNLVPQV 420
DL + + V+++ I G + +
Sbjct: 361 --LDLCKAETAVLVHRSLRVLHV----------LLEHICGDEKRFE---------ASWDA 420
Query: 421 NWVSFFEMMHQVAKTHCVECVRMEAVSIMNLILMRSSTYMEREKFGPGLLFDSVVEFIRK 480
NW S F++M+Q+A + V+ EA+SIMN+I+M + Y RE F +F+S+ +RK
Sbjct: 421 NWHSLFKLMNQIASKRTEQDVKQEALSIMNIIVMSTDAYTARESFVSKEVFESISLLLRK 480
Query: 481 ESGSAIQKHAVRLLFLILNCPTFFVAFCSGCMETEATCAADENVRSAAGFEKFRTILHGL 540
E G ++K A+ L +L+LNCP + F S E ++ +++ + E F I GL
Sbjct: 481 EGGLHVRKEAIHLFYLLLNCPKLYDTFDSLHEEKNSSDTENDSEGNFFALEAFGKIFEGL 540
Query: 541 ADCLACCGNGIEELKLRRNTVLLLAFLASSGKAGFEILISNKLPTESNFLVLILQVVVSE 600
ADCL E+L+L RN +++LA ASSG +G+E+L S+KLP +S+FL+LIL ++V+E
Sbjct: 541 ADCLTSPRKTSEDLELCRNVIMILALAASSGNSGYELLSSHKLPQDSSFLMLILHLLVAE 600
Query: 601 VEHEQKVPQP-VEIHGERTLLLREVLILLNRLASHSLYSATVLRVLTNSRDMASLTIDVI 660
++ E P EI RTLL+RE+LILLNRL S SAT+L+ LT SRDMASLT+D
Sbjct: 601 IDSESTEFHPKAEIFKARTLLMREILILLNRLVSGLSSSATILKELTTSRDMASLTVDAA 646
Query: 661 TKLSRRNNRTYQFDRKTRQMRESEVADLSQVFKKRVLTYLGNSIV 675
T+LSR+ N + + +MR +E+ DL+++FKKRV +LG++ +
Sbjct: 661 TRLSRKRNLLGKPESSVERMRNTEIMDLARIFKKRVFAFLGDNTI 646
BLAST of Moc06g36330 vs. ExPASy TrEMBL
Match:
A0A6J1DP62 (uncharacterized protein LOC111022902 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111022902 PE=4 SV=1)
HSP 1 Score: 1289.6 bits (3336), Expect = 0.0e+00
Identity = 673/720 (93.47%), Postives = 673/720 (93.47%), Query Frame = 0
Query: 1 MRSEDEGFEDWDADFLDQLIQVEELAIASTADNHLITIPISSSTFCPPPPSQPEPLHFVQ 60
MRSEDEGFEDWDADFLDQLIQVEELAIASTADNHLITIPISSSTFCPPPPSQPEPLHFVQ
Sbjct: 1 MRSEDEGFEDWDADFLDQLIQVEELAIASTADNHLITIPISSSTFCPPPPSQPEPLHFVQ 60
Query: 61 AFHDRPISYSPPRELSQRITGVRSPNGLGECGPSSSMLAPCLPRPDAAKELEICNLKREL 120
AFHDRPISYSPPRELSQRITGVRSPNGLGECGPSSSMLAPCLPRPDAAKELEICNLKREL
Sbjct: 61 AFHDRPISYSPPRELSQRITGVRSPNGLGECGPSSSMLAPCLPRPDAAKELEICNLKREL 120
Query: 121 GRVSKQLKDLEQECVELRKKRDRKEEQLKVVFSNKDEQFIGHHGSESTDLRTAGNDGEHN 180
GRVSKQLKDLEQECVELRKKRDRKEEQLKVVFSNKDEQFIGHHGSESTDLRTAGNDGEHN
Sbjct: 121 GRVSKQLKDLEQECVELRKKRDRKEEQLKVVFSNKDEQFIGHHGSESTDLRTAGNDGEHN 180
Query: 181 SRKIEDLAGDLGAPHIVSSSSKAIGEQGGQAHNSAGERVNDKLPAFHNLSKKLQVFWVPE 240
SRKIEDLAGDLGAPHIVSSSSKAIGEQGGQAHNSAGERVNDKLPAFHNLSKKLQVFWVPE
Sbjct: 181 SRKIEDLAGDLGAPHIVSSSSKAIGEQGGQAHNSAGERVNDKLPAFHNLSKKLQVFWVPE 240
Query: 241 SGSKMGQSLVSELLLSCETDFHVLFGCISTKLSPKFSVDSLAGDNVSDVALKNPLQFLHG 300
SGSKMGQSLVSELLLSCETDFHVLFGCISTKLSPKFSVDSLAGDNVSDVALKNPLQFLHG
Sbjct: 241 SGSKMGQSLVSELLLSCETDFHVLFGCISTKLSPKFSVDSLAGDNVSDVALKNPLQFLHG 300
Query: 301 LEAVKVSNLYTTLTK------------------CNL------------------------ 360
LEAVKVSNLYTTLTK CNL
Sbjct: 301 LEAVKVSNLYTTLTKVSNGIVKMEALFTPLLDLCNLDNVVIVHRTLHILHMFVKHLFWLE 360
Query: 361 ----LRKTVMVEGLGSRNNDLDSHGSQSVEGEEFAVVNMDGTSHDSCAPACSRIPGADMP 420
RKTVMVEGLGSRNNDLDSHGSQSVEGEEFAVVNMDGTSHDSCAPACSRIPGADMP
Sbjct: 361 RKSERRKTVMVEGLGSRNNDLDSHGSQSVEGEEFAVVNMDGTSHDSCAPACSRIPGADMP 420
Query: 421 CKNRNLNTYTNLVPQVNWVSFFEMMHQVAKTHCVECVRMEAVSIMNLILMRSSTYMEREK 480
CKNRNLNTYTNLVPQVNWVSFFEMMHQVAKTHCVECVRMEAVSIMNLILMRSSTYMEREK
Sbjct: 421 CKNRNLNTYTNLVPQVNWVSFFEMMHQVAKTHCVECVRMEAVSIMNLILMRSSTYMEREK 480
Query: 481 FGPGLLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGCMETEATCAADENV 540
FGPGLLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGCMETEATCAADENV
Sbjct: 481 FGPGLLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGCMETEATCAADENV 540
Query: 541 RSAAGFEKFRTILHGLADCLACCGNGIEELKLRRNTVLLLAFLASSGKAGFEILISNKLP 600
RSAAGFEKFRTILHGLADCLACCGNGIEELKLRRNTVLLLAFLASSGKAGFEILISNKLP
Sbjct: 541 RSAAGFEKFRTILHGLADCLACCGNGIEELKLRRNTVLLLAFLASSGKAGFEILISNKLP 600
Query: 601 TESNFLVLILQVVVSEVEHEQKVPQPVEIHGERTLLLREVLILLNRLASHSLYSATVLRV 660
TESNFLVLILQVVVSEVEHEQKVPQPVEIHGERTLLLREVLILLNRLASHSLYSATVLRV
Sbjct: 601 TESNFLVLILQVVVSEVEHEQKVPQPVEIHGERTLLLREVLILLNRLASHSLYSATVLRV 660
Query: 661 LTNSRDMASLTIDVITKLSRRNNRTYQFDRKTRQMRESEVADLSQVFKKRVLTYLGNSIV 675
LTNSRDMASLTIDVITKLSRRNNRTYQFDRKTRQMRESEVADLSQVFKKRVLTYLGNSIV
Sbjct: 661 LTNSRDMASLTIDVITKLSRRNNRTYQFDRKTRQMRESEVADLSQVFKKRVLTYLGNSIV 720
BLAST of Moc06g36330 vs. ExPASy TrEMBL
Match:
A0A6J1HUD9 (uncharacterized protein LOC111466806 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111466806 PE=4 SV=1)
HSP 1 Score: 940.3 bits (2429), Expect = 4.6e-270
Identity = 513/730 (70.27%), Postives = 569/730 (77.95%), Query Frame = 0
Query: 1 MRSEDEGFEDWDADFLDQLIQVEELAIASTADNHLITIPISSSTFCPPPP-----SQPEP 60
MRSEDEGFEDWDADFLDQLIQVEELAI+STA+N I SSST+CPPPP +PEP
Sbjct: 1 MRSEDEGFEDWDADFLDQLIQVEELAISSTANNP-NPIQCSSSTYCPPPPPPEPEPEPEP 60
Query: 61 LHFVQAFHDRPISYSPPRELSQRITG-----VRSPNGLGECGPSSSMLAPCLPRPDAAKE 120
H V+ HDR ISYSPPRELSQR G +RS GLGECGPSSS APCLP PDAAKE
Sbjct: 61 QHLVEVSHDRLISYSPPRELSQRAAGSRSHAIRSAIGLGECGPSSSAKAPCLPCPDAAKE 120
Query: 121 LEICNLKRELGRVSKQLKDLEQECVELRKKRDRKEEQLKVVFSNKDEQFIGHHGSESTDL 180
LEI NLKRELGRVSKQLK+LEQEC+ELRKKRD+KEEQL VVFSNKD+Q+I HHG E T+L
Sbjct: 121 LEISNLKRELGRVSKQLKNLEQECIELRKKRDKKEEQLNVVFSNKDKQYIAHHGPEITEL 180
Query: 181 RTAGNDGEHNSRKIEDLAGDLGAPHIVSSSSKAIGEQGGQAHNSAGERVNDKLPAFHNLS 240
R AG DG H K ED++ DLG PH V+S SKA EQG ++HNS GER +D PAF LS
Sbjct: 181 RVAGKDGGHPGIKSEDISCDLGGPHTVTSRSKA-NEQGEKSHNSVGERADDNSPAFDKLS 240
Query: 241 KKLQVFWVPESGSKMGQSLVSELLLSCETDFHVLFGCISTKLSPKFSVDSLAGDNVSDVA 300
KKLQVFWVPE SKMGQSLVSELLLSCE DFHVL+ CI T+LSPKFSV+SLAG N SDVA
Sbjct: 241 KKLQVFWVPEKDSKMGQSLVSELLLSCERDFHVLYQCIGTELSPKFSVNSLAGVNSSDVA 300
Query: 301 LKNPLQFLHGLEAVKVSNLYTTLTK------------------CNL-------------- 360
LK+PLQFLHGLE++KVSNLYTTL K CNL
Sbjct: 301 LKHPLQFLHGLESIKVSNLYTTLAKVSNGIVKMEALFTPLIDLCNLDNVAIVHRSLHILH 360
Query: 361 --------------LRKTVMVEGLGSRNNDLDSHGSQSVEGEEFAVVNMDGTSHDSCAPA 420
RKTVM+ GLG RNN +DS+GS S EGEEF+++NMD TS C+PA
Sbjct: 361 MFLKRLMWLERKSERRKTVMIGGLGPRNNVVDSYGSHSAEGEEFSLLNMDETSTGHCSPA 420
Query: 421 CSRIPGADMPCKNRNLNTYTNLVPQVNWVSFFEMMHQVAKTHCVECVRMEAVSIMNLILM 480
PGA++ KNRNLN NLVP+VNWVSFFEMMH+VAKTH EC R+EAVS+MNLILM
Sbjct: 421 GMGFPGAELLFKNRNLNKNINLVPRVNWVSFFEMMHRVAKTHSAECARLEAVSVMNLILM 480
Query: 481 RSSTYMEREKFGPGLLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGCMET 540
R++TY+EREKFG LLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGC E
Sbjct: 481 RNNTYLEREKFGQALLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGCKEA 540
Query: 541 EATCAADENVRSAAGFEKFRTILHGLADCLACCGNGIEELKLRRNTVLLLAFLASSGKAG 600
EA AA+EN R A GF+KFRTILHGL DCL C GNGI+ELKLRRNTVLLLAFL+SSGKAG
Sbjct: 541 EAADAAEENGRCAGGFQKFRTILHGLTDCLTCLGNGIQELKLRRNTVLLLAFLSSSGKAG 600
Query: 601 FEILISNKLPTESNFLVLILQVVVSEVEHEQKVPQPVEIHGERTLLLREVLILLNRLASH 660
FEIL+SN L +SNFL LILQ VVSEVE E++V + VE ER LLLREVLILLNRLASH
Sbjct: 601 FEILVSNTLHKDSNFLTLILQAVVSEVEQEKRVSEAVETLEERALLLREVLILLNRLASH 660
Query: 661 SLYSATVLRVLTNSRDMASLTIDVITKLSRRNNRTYQFDRKTRQMRESEVADLSQVFKKR 675
S+YSATVLRVLT+SRDMASLTIDV KLSR+NNR QFD K R+MRESEV DL+QVF+KR
Sbjct: 661 SVYSATVLRVLTSSRDMASLTIDVTNKLSRKNNRNCQFDGKKRKMRESEVVDLAQVFRKR 720
BLAST of Moc06g36330 vs. ExPASy TrEMBL
Match:
A0A6J1HLQ0 (uncharacterized protein LOC111464104 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111464104 PE=4 SV=1)
HSP 1 Score: 930.6 bits (2404), Expect = 3.6e-267
Identity = 513/726 (70.66%), Postives = 565/726 (77.82%), Query Frame = 0
Query: 1 MRSEDEGFEDWDADFLDQLIQVEELAIASTADNHLITIPISSSTFC-PPPPSQPEPLHFV 60
MRSEDEGFEDWDADFLDQLIQVEELAI+STA+N I SSST+C PPPP +PEP H V
Sbjct: 1 MRSEDEGFEDWDADFLDQLIQVEELAISSTANNP-NPIQCSSSTYCPPPPPPEPEPQHLV 60
Query: 61 QAFHDRPISYSPPRELSQRITG-----VRSPNGLGECGPSSSMLAPCLPRPDAAKELEIC 120
+ HDRPISYSPPRELSQR G +RSP GLGECGPSSS LAPCLP PDAAKELEI
Sbjct: 61 EVSHDRPISYSPPRELSQRAAGLRSHSIRSPIGLGECGPSSSALAPCLPCPDAAKELEIS 120
Query: 121 NLKRELGRVSKQLKDLEQECVELRKKRDRKEEQLKVVFSNKDEQFIGHHGSESTDLRTAG 180
+LKRELGRVSKQLK+LEQECVELRKKRD+KEEQL VVFSNKD+Q+I HHG E TDLR A
Sbjct: 121 SLKRELGRVSKQLKNLEQECVELRKKRDKKEEQLNVVFSNKDKQYIAHHGPEITDLRVAR 180
Query: 181 NDGEHNSRKIEDLAGDLGAPHIVSSSSKAIGEQGGQAHNSAGERVNDKLPAFHNLSKKLQ 240
DG H K ED + G PH V+S SKA EQG + HNS GER ND PAF LSKKLQ
Sbjct: 181 KDGGHPGIKNED---NSGGPHTVTSRSKA-NEQGEKTHNSVGERANDDSPAFDKLSKKLQ 240
Query: 241 VFWVPESGSKMGQSLVSELLLSCETDFHVLFGCISTKLSPKFSVDSLAGDNVSDVALKNP 300
VFWVPE KMGQSLVSELLLSCE DFHVLF CI T+LSPKFSV+SLAG N SDVALK+P
Sbjct: 241 VFWVPEKDFKMGQSLVSELLLSCERDFHVLFQCIGTELSPKFSVNSLAGVNSSDVALKHP 300
Query: 301 LQFLHGLEAVKVSNLYTTLTK------------------CNL------------------ 360
LQ LHG E++KVSNLYTTLTK CNL
Sbjct: 301 LQVLHGPESIKVSNLYTTLTKVSNGIVKMEALFTPLIDLCNLDNVAIVHRSLHILHMFLK 360
Query: 361 ----------LRKTVMVEGLGSRNNDLDSHGSQSVEGEEFAVVNMDGTSHDSCAPACSRI 420
RKTVM+ GLG RN+ +DS+GS S EGEEF+++NMD TS C+PA
Sbjct: 361 RLMWLERKSERRKTVMIGGLGPRNSVVDSYGSHSAEGEEFSLLNMDETSTGHCSPAGMGF 420
Query: 421 PGADMPCKNRNLNTYTNLVPQVNWVSFFEMMHQVAKTHCVECVRMEAVSIMNLILMRSST 480
PGA++ KNRNLN NLVP+VNWVSFFEMMH+VAK H EC R+EAVS+MNLILMR++T
Sbjct: 421 PGAELLFKNRNLNKNINLVPRVNWVSFFEMMHRVAKMHSAECARLEAVSVMNLILMRNNT 480
Query: 481 YMEREKFGPGLLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGCMETEATC 540
Y+EREKFG LLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGC E EA
Sbjct: 481 YLEREKFGQALLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGCKEAEAAD 540
Query: 541 AADENVRSAAGFEKFRTILHGLADCLACCGNGIEELKLRRNTVLLLAFLASSGKAGFEIL 600
AA+ENVR A GF+KF TILHGLADCL C GNGI ELKLRR+TVLLLAFL+SSGKAGFEIL
Sbjct: 541 AAEENVRCAGGFQKFSTILHGLADCLTCLGNGILELKLRRSTVLLLAFLSSSGKAGFEIL 600
Query: 601 ISNKLPTESNFLVLILQVVVSEVEHEQKVPQPVEIHGERTLLLREVLILLNRLASHSLYS 660
+SN L +SNFL LILQVVVSEVE E++V + VE ER LLLREVLILLNRLASHS+YS
Sbjct: 601 VSNTLHKDSNFLTLILQVVVSEVEQEKRVSEAVETMEERALLLREVLILLNRLASHSVYS 660
Query: 661 ATVLRVLTNSRDMASLTIDVITKLSRRNNRTYQFDRKTRQMRESEVADLSQVFKKRVLTY 675
ATVLRVLT+SRDMASLTIDV KLSR+NNR QFD K R+MRESEV DL+QVF+KRVLTY
Sbjct: 661 ATVLRVLTSSRDMASLTIDVTNKLSRKNNRNCQFDSKKRKMRESEVVDLAQVFRKRVLTY 720
BLAST of Moc06g36330 vs. ExPASy TrEMBL
Match:
A0A6J1HW38 (uncharacterized protein LOC111466806 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111466806 PE=4 SV=1)
HSP 1 Score: 921.0 bits (2379), Expect = 2.9e-264
Identity = 507/730 (69.45%), Postives = 563/730 (77.12%), Query Frame = 0
Query: 1 MRSEDEGFEDWDADFLDQLIQVEELAIASTADNHLITIPISSSTFCPPPP-----SQPEP 60
MRSEDEGFEDWDADFLDQLIQVEELAI+STA+N I SSST+CPPPP +PEP
Sbjct: 1 MRSEDEGFEDWDADFLDQLIQVEELAISSTANNP-NPIQCSSSTYCPPPPPPEPEPEPEP 60
Query: 61 LHFVQAFHDRPISYSPPRELSQRITG-----VRSPNGLGECGPSSSMLAPCLPRPDAAKE 120
H V+ HDR ISYSPPRELSQR G +RS GLGECGPSSS APCLP PDAAKE
Sbjct: 61 QHLVEVSHDRLISYSPPRELSQRAAGSRSHAIRSAIGLGECGPSSSAKAPCLPCPDAAKE 120
Query: 121 LEICNLKRELGRVSKQLKDLEQECVELRKKRDRKEEQLKVVFSNKDEQFIGHHGSESTDL 180
LEI NLKRELGRVSKQLK+LEQEC+ELRKKRD+KEEQL VVFSNKD+Q+I HHG E T+L
Sbjct: 121 LEISNLKRELGRVSKQLKNLEQECIELRKKRDKKEEQLNVVFSNKDKQYIAHHGPEITEL 180
Query: 181 RTAGNDGEHNSRKIEDLAGDLGAPHIVSSSSKAIGEQGGQAHNSAGERVNDKLPAFHNLS 240
R AG DG H K ED++ +S SKA EQG ++HNS GER +D PAF LS
Sbjct: 181 RVAGKDGGHPGIKSEDIS--------FTSRSKA-NEQGEKSHNSVGERADDNSPAFDKLS 240
Query: 241 KKLQVFWVPESGSKMGQSLVSELLLSCETDFHVLFGCISTKLSPKFSVDSLAGDNVSDVA 300
KKLQVFWVPE SKMGQSLVSELLLSCE DFHVL+ CI T+LSPKFSV+SLAG N SDVA
Sbjct: 241 KKLQVFWVPEKDSKMGQSLVSELLLSCERDFHVLYQCIGTELSPKFSVNSLAGVNSSDVA 300
Query: 301 LKNPLQFLHGLEAVKVSNLYTTLTK------------------CNL-------------- 360
LK+PLQFLHGLE++KVSNLYTTL K CNL
Sbjct: 301 LKHPLQFLHGLESIKVSNLYTTLAKVSNGIVKMEALFTPLIDLCNLDNVAIVHRSLHILH 360
Query: 361 --------------LRKTVMVEGLGSRNNDLDSHGSQSVEGEEFAVVNMDGTSHDSCAPA 420
RKTVM+ GLG RNN +DS+GS S EGEEF+++NMD TS C+PA
Sbjct: 361 MFLKRLMWLERKSERRKTVMIGGLGPRNNVVDSYGSHSAEGEEFSLLNMDETSTGHCSPA 420
Query: 421 CSRIPGADMPCKNRNLNTYTNLVPQVNWVSFFEMMHQVAKTHCVECVRMEAVSIMNLILM 480
PGA++ KNRNLN NLVP+VNWVSFFEMMH+VAKTH EC R+EAVS+MNLILM
Sbjct: 421 GMGFPGAELLFKNRNLNKNINLVPRVNWVSFFEMMHRVAKTHSAECARLEAVSVMNLILM 480
Query: 481 RSSTYMEREKFGPGLLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGCMET 540
R++TY+EREKFG LLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGC E
Sbjct: 481 RNNTYLEREKFGQALLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGCKEA 540
Query: 541 EATCAADENVRSAAGFEKFRTILHGLADCLACCGNGIEELKLRRNTVLLLAFLASSGKAG 600
EA AA+EN R A GF+KFRTILHGL DCL C GNGI+ELKLRRNTVLLLAFL+SSGKAG
Sbjct: 541 EAADAAEENGRCAGGFQKFRTILHGLTDCLTCLGNGIQELKLRRNTVLLLAFLSSSGKAG 600
Query: 601 FEILISNKLPTESNFLVLILQVVVSEVEHEQKVPQPVEIHGERTLLLREVLILLNRLASH 660
FEIL+SN L +SNFL LILQ VVSEVE E++V + VE ER LLLREVLILLNRLASH
Sbjct: 601 FEILVSNTLHKDSNFLTLILQAVVSEVEQEKRVSEAVETLEERALLLREVLILLNRLASH 660
Query: 661 SLYSATVLRVLTNSRDMASLTIDVITKLSRRNNRTYQFDRKTRQMRESEVADLSQVFKKR 675
S+YSATVLRVLT+SRDMASLTIDV KLSR+NNR QFD K R+MRESEV DL+QVF+KR
Sbjct: 661 SVYSATVLRVLTSSRDMASLTIDVTNKLSRKNNRNCQFDGKKRKMRESEVVDLAQVFRKR 720
BLAST of Moc06g36330 vs. ExPASy TrEMBL
Match:
A0A6J1HHI3 (uncharacterized protein LOC111464104 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111464104 PE=4 SV=1)
HSP 1 Score: 921.0 bits (2379), Expect = 2.9e-264
Identity = 510/726 (70.25%), Postives = 562/726 (77.41%), Query Frame = 0
Query: 1 MRSEDEGFEDWDADFLDQLIQVEELAIASTADNHLITIPISSSTFC-PPPPSQPEPLHFV 60
MRSEDEGFEDWDADFLDQLIQVEELAI+STA+N I SSST+C PPPP +PEP H V
Sbjct: 1 MRSEDEGFEDWDADFLDQLIQVEELAISSTANNP-NPIQCSSSTYCPPPPPPEPEPQHLV 60
Query: 61 QAFHDRPISYSPPRELSQRITG-----VRSPNGLGECGPSSSMLAPCLPRPDAAKELEIC 120
+ HDRPISYSPPRELSQR G +RSP GLGECGPSSS LAPCLP PDAAKELEI
Sbjct: 61 EVSHDRPISYSPPRELSQRAAGLRSHSIRSPIGLGECGPSSSALAPCLPCPDAAKELEIS 120
Query: 121 NLKRELGRVSKQLKDLEQECVELRKKRDRKEEQLKVVFSNKDEQFIGHHGSESTDLRTAG 180
+LKRELGRVSKQLK+LEQECVELRKKRD+KEEQL VVFSNKD+Q+I HHG E TDLR A
Sbjct: 121 SLKRELGRVSKQLKNLEQECVELRKKRDKKEEQLNVVFSNKDKQYIAHHGPEITDLRVAR 180
Query: 181 NDGEHNSRKIEDLAGDLGAPHIVSSSSKAIGEQGGQAHNSAGERVNDKLPAFHNLSKKLQ 240
DG H K ED + V+S SKA EQG + HNS GER ND PAF LSKKLQ
Sbjct: 181 KDGGHPGIKNED--------NSVTSRSKA-NEQGEKTHNSVGERANDDSPAFDKLSKKLQ 240
Query: 241 VFWVPESGSKMGQSLVSELLLSCETDFHVLFGCISTKLSPKFSVDSLAGDNVSDVALKNP 300
VFWVPE KMGQSLVSELLLSCE DFHVLF CI T+LSPKFSV+SLAG N SDVALK+P
Sbjct: 241 VFWVPEKDFKMGQSLVSELLLSCERDFHVLFQCIGTELSPKFSVNSLAGVNSSDVALKHP 300
Query: 301 LQFLHGLEAVKVSNLYTTLTK------------------CNL------------------ 360
LQ LHG E++KVSNLYTTLTK CNL
Sbjct: 301 LQVLHGPESIKVSNLYTTLTKVSNGIVKMEALFTPLIDLCNLDNVAIVHRSLHILHMFLK 360
Query: 361 ----------LRKTVMVEGLGSRNNDLDSHGSQSVEGEEFAVVNMDGTSHDSCAPACSRI 420
RKTVM+ GLG RN+ +DS+GS S EGEEF+++NMD TS C+PA
Sbjct: 361 RLMWLERKSERRKTVMIGGLGPRNSVVDSYGSHSAEGEEFSLLNMDETSTGHCSPAGMGF 420
Query: 421 PGADMPCKNRNLNTYTNLVPQVNWVSFFEMMHQVAKTHCVECVRMEAVSIMNLILMRSST 480
PGA++ KNRNLN NLVP+VNWVSFFEMMH+VAK H EC R+EAVS+MNLILMR++T
Sbjct: 421 PGAELLFKNRNLNKNINLVPRVNWVSFFEMMHRVAKMHSAECARLEAVSVMNLILMRNNT 480
Query: 481 YMEREKFGPGLLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGCMETEATC 540
Y+EREKFG LLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGC E EA
Sbjct: 481 YLEREKFGQALLFDSVVEFIRKESGSAIQKHAVRLLFLILNCPTFFVAFCSGCKEAEAAD 540
Query: 541 AADENVRSAAGFEKFRTILHGLADCLACCGNGIEELKLRRNTVLLLAFLASSGKAGFEIL 600
AA+ENVR A GF+KF TILHGLADCL C GNGI ELKLRR+TVLLLAFL+SSGKAGFEIL
Sbjct: 541 AAEENVRCAGGFQKFSTILHGLADCLTCLGNGILELKLRRSTVLLLAFLSSSGKAGFEIL 600
Query: 601 ISNKLPTESNFLVLILQVVVSEVEHEQKVPQPVEIHGERTLLLREVLILLNRLASHSLYS 660
+SN L +SNFL LILQVVVSEVE E++V + VE ER LLLREVLILLNRLASHS+YS
Sbjct: 601 VSNTLHKDSNFLTLILQVVVSEVEQEKRVSEAVETMEERALLLREVLILLNRLASHSVYS 660
Query: 661 ATVLRVLTNSRDMASLTIDVITKLSRRNNRTYQFDRKTRQMRESEVADLSQVFKKRVLTY 675
ATVLRVLT+SRDMASLTIDV KLSR+NNR QFD K R+MRESEV DL+QVF+KRVLTY
Sbjct: 661 ATVLRVLTSSRDMASLTIDVTNKLSRKNNRNCQFDSKKRKMRESEVVDLAQVFRKRVLTY 716
BLAST of Moc06g36330 vs. TAIR 10
Match:
AT5G45610.1 (protein dimerizations )
HSP 1 Score: 278.1 bits (710), Expect = 1.9e-74
Identity = 233/705 (33.05%), Postives = 350/705 (49.65%), Query Frame = 0
Query: 1 MRSEDEGFEDWDADFLDQLIQVEELAIASTADNHLITIPISSSTFCPPPP-------SQP 60
M DE F D +FL + +E AD + P TF P PP S
Sbjct: 1 MSGNDEEFND---EFLLAIDSIE--TTLKKADMYRPLPPPYLPTFLPAPPPSTKISSSLS 60
Query: 61 EPLHFVQA---------FHDRPISYSPPRELSQRITGVRSPNGLGECGPSSSMLA--PCL 120
P+ + D +SYSPPRELSQR+ + + L + S+ + A P
Sbjct: 61 HPMQLQSSAGQQRKQIQVPDPFLSYSPPRELSQRVVSGFN-DALMDYSNSTVVTAAKPIS 120
Query: 121 P-----RPDAAKELEICNLKRELGRVSKQLKDLEQECVELRKKRDRKEEQLKVVFSNKDE 180
P R D+ K+LEI LK+EL RVSKQL D+EQEC +L+K + ++ E + +
Sbjct: 121 PTTSNRRCDSEKDLEIDRLKKELERVSKQLLDVEQECSQLKKGKSKETESRNLCADDNRG 180
Query: 181 QFIGHHGSESTDLR-----TAGNDGEHNSRKIEDLAGDLGAPHIVSSSSKAIGEQGGQAH 240
Q H S+ DL ++ N E++SR D S K G Q A+
Sbjct: 181 QCSTVHASKRIDLEPDVATSSVNHRENDSRMALD----------DKRSFKTTGVQADVAN 240
Query: 241 NSAGERVNDKLPAFHNLSKKLQVFWVPESGSKMGQSLVSELLLSCETDFHVLFGCISTKL 300
+S +LSKKL W + ++L+SELLL+C TD +LF +
Sbjct: 241 HS-------------DLSKKLLDIWRTSNYQDPRKNLISELLLACSTDLQILFSFMKIST 300
Query: 301 SPKFSVDSLAGDNVSDVALKNPLQFLHGLEAVKVSNLYTTLTKCN--LLRKTVMVEGLGS 360
P+ N + + Q LE+ KV LY+ +TK + + +VE L
Sbjct: 301 PPQEL-------NKQEAKTSSDRQSSKALESEKVYQLYSAVTKISYGFVNLKTLVEPL-- 360
Query: 361 RNNDLDSHGSQSVEGEEFAVVNMDGTSHDSCAPACSRIPGADMPCKNRNLNTYTNLVPQV 420
DL + + V+++ I G + +
Sbjct: 361 --LDLCKAETAVLVHRSLRVLHV----------LLEHICGDEKRFE---------ASWDA 420
Query: 421 NWVSFFEMMHQVAKTHCVECVRMEAVSIMNLILMRSSTYMEREKFGPGLLFDSVVEFIRK 480
NW S F++M+Q+A + V+ EA+SIMN+I+M + Y RE F +F+S+ +RK
Sbjct: 421 NWHSLFKLMNQIASKRTEQDVKQEALSIMNIIVMSTDAYTARESFVSKEVFESISLLLRK 480
Query: 481 ESGSAIQKHAVRLLFLILNCPTFFVAFCSGCMETEATCAADENVRSAAGFEKFRTILHGL 540
E G ++K A+ L +L+LNCP + F S E ++ +++ + E F I GL
Sbjct: 481 EGGLHVRKEAIHLFYLLLNCPKLYDTFDSLHEEKNSSDTENDSEGNFFALEAFGKIFEGL 540
Query: 541 ADCLACCGNGIEELKLRRNTVLLLAFLASSGKAGFEILISNKLPTESNFLVLILQVVVSE 600
ADCL E+L+L RN +++LA ASSG +G+E+L S+KLP +S+FL+LIL ++V+E
Sbjct: 541 ADCLTSPRKTSEDLELCRNVIMILALAASSGNSGYELLSSHKLPQDSSFLMLILHLLVAE 600
Query: 601 VEHEQKVPQP-VEIHGERTLLLREVLILLNRLASHSLYSATVLRVLTNSRDMASLTIDVI 660
++ E P EI RTLL+RE+LILLNRL S SAT+L+ LT SRDMASLT+D
Sbjct: 601 IDSESTEFHPKAEIFKARTLLMREILILLNRLVSGLSSSATILKELTTSRDMASLTVDAA 646
Query: 661 TKLSRRNNRTYQFDRKTRQMRESEVADLSQVFKKRVLTYLGNSIV 675
T+LSR+ N + + +MR +E+ DL+++FKKRV +LG++ +
Sbjct: 661 TRLSRKRNLLGKPESSVERMRNTEIMDLARIFKKRVFAFLGDNTI 646
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022155903.1 | 0.0e+00 | 93.47 | uncharacterized protein LOC111022902 isoform X1 [Momordica charantia] | [more] |
XP_038888976.1 | 1.0e-271 | 70.90 | protein SENSITIVE TO UV 2 [Benincasa hispida] | [more] |
XP_022967198.1 | 9.4e-270 | 70.27 | uncharacterized protein LOC111466806 isoform X1 [Cucurbita maxima] | [more] |
KAG7011360.1 | 1.2e-269 | 71.21 | hypothetical protein SDJN02_26265, partial [Cucurbita argyrosperma subsp. argyro... | [more] |
XP_023553684.1 | 4.7e-269 | 71.06 | uncharacterized protein LOC111811167 [Cucurbita pepo subsp. pepo] | [more] |
Match Name | E-value | Identity | Description | |
C8KI33 | 2.6e-73 | 33.05 | Protein SENSITIVE TO UV 2 OS=Arabidopsis thaliana OX=3702 GN=SUV2 PE=1 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1DP62 | 0.0e+00 | 93.47 | uncharacterized protein LOC111022902 isoform X1 OS=Momordica charantia OX=3673 G... | [more] |
A0A6J1HUD9 | 4.6e-270 | 70.27 | uncharacterized protein LOC111466806 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1HLQ0 | 3.6e-267 | 70.66 | uncharacterized protein LOC111464104 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1HW38 | 2.9e-264 | 69.45 | uncharacterized protein LOC111466806 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1HHI3 | 2.9e-264 | 70.25 | uncharacterized protein LOC111464104 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
Match Name | E-value | Identity | Description | |
AT5G45610.1 | 1.9e-74 | 33.05 | protein dimerizations | [more] |