Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, polypeptide, CDS, three_prime_UTR
CATATTTTCCCGAAAACTATATATACTTACCTTTGTTCCCCAATTCGTGCCCTAAAAGCTCTGATTTCTCCTACTCTTAGCCGCAACATGCTAAACACTCCCGTTTTTCTTCCTTTCTCCTTTCGTTTTCTCCCACCTAATTCACTTTTCTTGACTCACTCTCCGATCCCAGTCTTTCCGGTGCGATGTCCAAATGGTTTTCTCTAAGTTCAGATTGGTATCTGTTTTCGACGGCGTTTTAGGTTCCTTTGACATGTTCGGCGGCTGCAGTGAATCGTCTTCTATTTATAGTTCGCTGTTGTTAGGGCTTCGGTGATGGATGCTCTTGAGTTGAATTATCCGGTGGATGTGGCGGCGCCGAAGCTCATGGGACCTGACGGCTCTGTTAGAACCGGGATAATAATCGAGGAGGTTCAGTTGTGCGAAGCTGATCGTGGTTCTGCCCCTCCTAGCTATTCGTTTCAGCATTTTAGCTCGTACGGTAGTCAAAAAGGTGAAACAGCCTTTTTTTTTTGCTGTAATACTGCTGTTTAATTTTTTGATTTGTTGCTGGTTTCGTATTTTTAACTGCATCCGATGGCTTAGTAACCTCCTCGCATTCCAATTGGCCAATACTGGAGGCTATGGAGAGTGATATTTGTTTCGAAAAAGAGATGATAATGAATATAGTGCTTGGATTTTGATGCATTGCTTGTTCATGAACTGACTCTATGTAGTAGGATGAAACTTACAGAGAGTTTTGAGTCACTTATTGTTGAAGACCAATGCGTTTCAGTAGCCCCATGAAGACAGTCACCACTTTGTAATAACATTTTTTCAATAAATGTGTTCCCGCGGATTTTGCGAATTGAAGTGCTTTATATAAACCCTAGGCGTTTGTGAAGATGTCTTATCCTTTTTTCTTTTTGTGCTTTTTCATTATTAATAAAGGACTCTTCTTATTAAATGTGTTCCAGTGGATTTTGAGTGAGGTTGACTTGGGATCCAGCACGCAGATTACTTAAATGTGGCAGTCATTTGGTTTTGTATTTTATTATTTTGAAAGGGAAATGTGATAAAGAAAAAATTCTGCTCTTAGGATATGGTGGCGGAAACATCAAACATTTTGCATAAGAAGAAACACCGTGTTTATTTTTAGTATTCTTTCATAGGATGGTTTATATCATCTGTCTACTGAAATTCTGAACATGGAATAAGGTCTATATGGTGGGTTTAGACAATCTTGAAGAAAAAATTGCATCTTTTTTTAACATACTTTGTATAGATGGATGACGATGATAGTTGGTTCTGGCTGCTGCATTTTTGAAAGGATTGCCATATATTTTGGTTTATTTGGACTTTCTTAAGTCTTATACGTTTTTATTCTTTGCCACATATCATCTATGTATGTGATTCCACTTCAAGTTGTTTTTTGTAAAATTTATATATTACAAGTCTCTAGCATAATCATCTCCTTTATCTGCCATTTTTTCTTTCTTCATTTTCTATATTTTAATGAAATAATGGTCTATAAATGTCGGTGTGCCAGTTGGGACGAGCTCTATCAATGATTTGGGTTCTGTTTCTTTGGATGAGATTCATGATGGGGCAGTTTCTAAGGATGGTGAAGATACACCTGAAGATTTTGAGAGTAGGAACAAAAGAAGTCACTTGTCCACTTCAAGTCCAGGAGTACAGCAACGTAAATCGTTAAAGGTCTCAAGGAGTAGTAGCAGTAGTTTATGTTCTAAGAGGCGTTTGGTTCAATTGGAAGATTCTTTATTACTAAGTGGGGCTGATGAAGTAAAAGATACGTCTGACAAGCTTGGATCATATCTTAAAAAGTGCGGCTCTCATGGTAATGTTCAGTTCCCAATTTAATGTTATAAATGTTCATCTTTTGCTATAACTTTAGACTATTCTTATCATTTCCAAATAACGCCATGCAATACATCTATTTTATATTTTCAGATAGGATCATTGTGCAAGAAATCTGATTAAAAAATAATCCAAATGTTTAG
CTATAACTAGAAGCAATAGGAACGGATAAAGAATCTAAAAATTAAAAGTAGCCTCAGTTGTTTAAAATTATTGCATGTGAATTGTTTATAAGATCCATGTGTAGAGGACCATGAAAGGCAATTCAATGAACTTTGATCCCAATTATTCAGGATGTGGTTATTTTCTTCTTCGCTATTCGAATGTTATGATTCTTTCATTTATCGTAAATTTATGACCGAGGAGGATATTTGGGAAATTTTTTGGTTACATTGGTAATGTTATCAATTGATTGATCTTACTATGTACTTTCTATTTCTAGAGTTAAGAAATCATTGTATTCATATTTTCTTGACTTAATGTATTGAAAAACTGGTGCTACAGTGGTGATTTTACTTTTTTTAAAATATTTTAAATTTGTGTAAGCTTAGAATGGGCAATATCTGTTGCATATATGTGATTGAGTTAGGATATCCTTTCGCAGTTTCTATACTTCCTATGCTTTCCACGTTTAAGTTGTTACTGATGTATACACCCCTCCNAAAAAAAAAAAAAAAAAAAAAAACAGAGAAGCCTCAATTGATGAAACAGAAAAGCAGCGTAAGCAGCAAGCGGGGTGATAAGAGAAATCTCAAAGTGTCATTGAAGACCAAATTTGATTCACTCCCTATAAATGCTGGGAATGGCTCAGCTACCGCAGGGTGCATTTTTTTCGGTATGATGATGTCCTGTGAATTATTAGTTAATGTGCTCAGAAACTTGCATTTATGTGGTACTTATGGTATTTTGAGGACCTTATGACATGTATTGAAGAAAGACTCCACTGGATAATGCAGACGTTTTAAATTGTGTAGATATACTTAGTTTGTTGTGAAGAAAATTTATATCATTGTTGTGACAGAACCTAAATTGAGGGATAACTAATAAGTCTAACCAAATCAGGACCGTTCTAATTCTATCATGTCTTGATTTGTGTGTTACTTCAATATTTTTCTGTAAAGATATTAATTGGATAGTCCTTGTAATTAAGATATCCCTGGAAAGACATTATCACATGAATAGCCTTTTGAATAAGAGGTATAGCCATTGCACATCTCGCGAAAAGATAGAAAGTGAAATGTCGGCTATTTTATTTTGACTTCCAACAGGAGTTATTCCTTGAGTATGTCTTGATGTACAAAAATGAGCTGCGTCCCAGTTTTCACAAGTGTTATAATGAAAGCAATTTCTCTTTACCTTCCTTGATTTTTTTTTTCCTTCCATCACTGTAGGTCGCTACTGGATCTTGTTTTCAGTTTTGCAAAGAGTTTAATTGCTCAGTGCTCAATTCCATTAGTTGTTACTTCCTATTATGACATAAGTAAAGAAACAAATCACTTGTTTATAATCTATTTATAATAAACGTTCTCATTGGTTTTTACTTTTGTGCACCTTACTTTGCCTTACACTGCATAAAATCATTTATGGGCCTTGTGAAATGTTTACAGGACTATATGGGCTGAAATCAGATGTTCATGATTTCACAAAGCTTACGGATGATCCCCCGCTGAATGATCTTCTTGATGGCAGTTATGACTATTCTAGTTTAAGTAAAGCCAAAGGTAAAAAAGACACAAATGTAAATGAATGTTTTCTGCAGTCAATTAGAAAAGCTTGTTCTGTTCTTCAGCTCCCATGGCCTGTCCATCCACAAAATACTGCGGAGTCAGAGAGTTGTTCTAATAGCAAACCATCCACAAGTCTAGTTAGTTCTGTTTCAAGCATGGAAGAAGGGGTGAACTTCGATGCAAAAGAACTTAGTACAACAGATGCACCTTTATTAAGCAAGGTCTGTGAGTAAGAATTTGGCTATCTCTCTGGCCTATGTGCACATGCACTCACTCTAACACATACTGTACCTAAGTGTTTGTTGTTTTTAACTTCTTGATCTGTACGCTGACTGACTATACATTTTTATTCAATTGGTTTAAATTATTTTGACATCTCTGTTATAGTTATTTGAGATCTGTCTACAACCTGATTG
AATCTATTTTTGCTGATTCTCTTATGTGGTAATTGCCGTGGCAAAATACTCAATTCTTCATCGAAAGGATATTTTTTAGTGTCCTAAACTTCAATCATAAATTCGGAGTTTCTTTCCATTATTACATTCAGATTTAGACCTTAATTTCGTTCTCTCTTGGTTTTCCATGAAGGTGCAAGATGCTTGTAGCAATTCTGAAACTTTGACCAACGTACTTGATTTTAAGTTGTACAAACCAGATGACATGTTTGTGAAATTGGGCCTTCCTCTACCAAAGGATTTGGAATCTTTGCTCCAGGATGCCAGCAAGTCCTCTGTATCTTCAAAGAATGCAACAGATTTGCGTTCTGCAAAGCAACAATCTCGTCGAGCAATCTTGCAACCATTTCCGTGGTCACATTCTTTTAATGGGCACTCTAAAGCAAATTCTGATTCATCCAAGTTTTCTGCAAATAGAACCACATGTACCGGTAGGTGGTGGCGAGTCGGAAACTTTGCCAATATCCCCACTGCTACTGCTGATTGTTTTACAAAAAACTTGGAATCATTGATTTTTAACCAGAGTTTATTTCCTTCAACAATGAGAGTTGGTCCTGATGATGGAAAGTCCTCCTCTGTATCTGTTAATCATCATCAAAGTGGATGGGATTCCCTGTCTTCTGCAACTTGTTCGAAAGCTTCCTCTATGCTTGTGGGTAAGGATTTATTGGAAGGCATTTATGTCGCGTAATTGAGATACAGTATTCACCTTTGAACCATTCTGCTATTATTGTTTATTTTACTTAAATCGTGACCACTGCTTTTGGTCTACATACAGATTCTCGAGGGAAGATGAATCGTGAGGCAAATGGTATGACCTCCTTATTTTTCTGCTATGAATCAACTAAATCTTATTCTTGATTTTTTTTTTTTTTTTTTTTTGAAGTTAGTAGTTAAGATCCAATAAATTATAAAAAAAGTTGAACTGTGTCCGTAACTGACTTTTCAATGGCACCATAAGAACTACCCACTTAATTTCTTATATCACATAAATGACTTTTCTTATGAAATACCAACAGTAACCCTACGTAAAGGAGTCTTAATGGGGTCCTTTCTCTGAAGGAACCAAAAAGAATCCACTGAAGGAAGATGCTTATTGAAGAGCTGAGGGAGAGAAATATTGGTCTTGAGAGAGAAAAGTAAGAGAGAGGAAAGTGAGATGAGGAAGAGAAGTCCTTAAGGAGTGATCTGAGTCTGTAGTTATAAACGAGAAGTTTTTTTCAGAAGTTCAGTAGTTGAAGAAAGTGCTTTGCTAGTCTGAGAATTTTTGGCAAGTACAGAGAAGCCTGCTCTAGAAAACATTAAGCGTTTTAGAGGAAACTAAATCAAATCCCTTGCATTTATTTGTTATGCATAATTCAGAAATGTTTAAAACTTGGATTCTTAATCTCCCTAGAGCTATGAGTACGAACTCCTTAGTTTGGCTAGTGTGTCCATGTCTAACACGTTTTAGACACTTCAACACTTCTGACATTTGTTGGATATGTATCAGATACTTTTTAGCACAATAGTTATGTGTTAGACTTTGGTACAAAATCAATATGAGTTCTATATTTGTCCGACACATATCGAACACTTATTAAGTATACTAAATAGACACATAGGACAAAAATCATTATGTTTGAGAGTAAGACATATCAAACTCATTTATTTAATCATATAAATGCCTTAACTTATTGACTTTGGATTTCTTTAGACACCTAACGTACACTTCTTAAGTATACTAAATACACGTATAAGACAATAATCATTAAATTTGAGAGTGAGATACATCGAACTCATTTATTTAATCTTATAAATGTCTTAACTTATTGACTTTGGATTTCTTTATATAAACGTATCCTTGCTGTGTCTATGTCATAGTTTTTTTAAATAATGATGTGTTTCTGTGATGTCATATCCACGTCCCGTTCTTCTTAGCCGTAGAGAACTGTAGATATAATATACGTCTCTCAAACATT
AGCCACTTTCCTAATCTTTTTTACAAATGCTATCTCGCTGCAGCTTGAAACTTTATGAATTTTGTTGTGTCATTTGCATGCAACTAAAGCCTAGTTGAATTTTTTGTCTTCGGCTATTGATATTGCTTCTCTACAGAGCCGCACTGTCCAAGAGTAATGGCTGCTGCTCAAACTCTCTACGATATTGCAACTCGTGCTGCATTGAGGCAAAACATAGACGGTATAGTAAAGTGGCCAAAAAAGCCTTCACAAAAGTCCATGAAGGCTCGAAAGTTGAAATCAGAGGAAACCGAAGAATTATATGCCGCACCAACCACATACGGATTATGGTCCAACAATTCATTTAAAAACGAGGGCCAATTGCATCCCTCGAAGAAGCCAAAGCCAGGAACAGTAGAGAGCAGGAGAGACATTACTCAGACAAATAATAGAAAGGGACCATTGAATTGGGCTGCGACCAAATCGAGTAGATCGTCCCCAAGTAAGTTCGTTAGAGATTCGGTTTCGGAAGCTAAACATTCAACCTCAGGCGTAGTAAAACAATCATCAATGATGCCTCCTCCAGCAACTCATTTGTCCAAGGCTAGCGAGGGACAACAAAAGACGCGAAAGTTAATGCTCATGGATTGGAAAAGAGGAGGAACAGATTAGATTGATCAAGAAACTCTACATTATTTATATATCCCTTTCCAACAAAACGCTGCTATTAAATTAGAACCACTCCTCTGGGCTAGGAGTGGCTAAACTGGTAGGTTGTTTGTGTACATAATATTGTTGTAAGTGGCTGATTGATATGATTATTGTAATGGTCCGTCCTGAATAGTGACACGCTTTCTAATTGTTGTAACATCACCTTCATTCCAGAACGCAGTAGCAAATTCCTTGCCACTGTGTGCAAAAGTATTTGACGAGTCAATAGAAATTTTAGTTCTTATTACTGTTCCCTTGTGCCCCTAGATGTGCTCTTCACCAG
mRNA sequence
CATATTTTCCCGAAAACTATATATACTTACCTTTGTTCCCCAATTCGTGCCCTAAAAGCTCTGATTTCTCCTACTCTTAGCCGCAACATGCTAAACACTCCCGTTTTTCTTCCTTTCTCCTTTCGTTTTCTCCCACCTAATTCACTTTTCTTGACTCACTCTCCGATCCCAGTCTTTCCGGTGCGATGTCCAAATGGTTTTCTCTAAGTTCAGATTGGTATCTGTTTTCGACGGCGTTTTAGGTTCCTTTGACATGTTCGGCGGCTGCAGTGAATCGTCTTCTATTTATAGTTCGCTGTTGTTAGGGCTTCGGTGATGGATGCTCTTGAGTTGAATTATCCGGTGGATGTGGCGGCGCCGAAGCTCATGGGACCTGACGGCTCTGTTAGAACCGGGATAATAATCGAGGAGGTTCAGTTGTGCGAAGCTGATCGTGGTTCTGCCCCTCCTAGCTATTCGTTTCAGCATTTTAGCTCGTACGGTAGTCAAAAAGTTGGGACGAGCTCTATCAATGATTTGGGTTCTGTTTCTTTGGATGAGATTCATGATGGGGCAGTTTCTAAGGATGGTGAAGATACACCTGAAGATTTTGAGAGTAGGAACAAAAGAAGTCACTTGTCCACTTCAAGTCCAGGAGTACAGCAACGTAAATCGTTAAAGGTCTCAAGGAGTAGTAGCAGTAGTTTATGTTCTAAGAGGCGTTTGGTTCAATTGGAAGATTCTTTATTACTAAGTGGGGCTGATGAAGTAAAAGATACGTCTGACAAGCTTGGATCATATCTTAAAAAGTGCGGCTCTCATGAGAAGCCTCAATTGATGAAACAGAAAAGCAGCGTAAGCAGCAAGCGGGGTGATAAGAGAAATCTCAAAGTGTCATTGAAGACCAAATTTGATTCACTCCCTATAAATGCTGGGAATGGCTCAGCTACCGCAGGGTGCATTTTTTTCGGACTATATGGGCTGAAATCAGATGTTCATGATTTCACAAAGCTTACGGATGATCCCCCGCTGAATGATCTTCTTGATGGCAGTTATGACTATTCTAGTTTAAGTAAAGCCAAAGGTAAAAAAGACACAAATGTAAATGAATGTTTTCTGCAGTCAATTAGAAAAGCTTGTTCTGTTCTTCAGCTCCCATGGCCTGTCCATCCACAAAATACTGCGGAGTCAGAGAGTTGTTCTAATAGCAAACCATCCACAAGTCTAGTTAGTTCTGTTTCAAGCATGGAAGAAGGGGTGAACTTCGATGCAAAAGAACTTAGTACAACAGATGCACCTTTATTAAGCAAGGTGCAAGATGCTTGTAGCAATTCTGAAACTTTGACCAACGTACTTGATTTTAAGTTGTACAAACCAGATGACATGTTTGTGAAATTGGGCCTTCCTCTACCAAAGGATTTGGAATCTTTGCTCCAGGATGCCAGCAAGTCCTCTGTATCTTCAAAGAATGCAACAGATTTGCGTTCTGCAAAGCAACAATCTCGTCGAGCAATCTTGCAACCATTTCCGTGGTCACATTCTTTTAATGGGCACTCTAAAGCAAATTCTGATTCATCCAAGTTTTCTGCAAATAGAACCACATGTACCGGTAGGTGGTGGCGAGTCGGAAACTTTGCCAATATCCCCACTGCTACTGCTGATTGTTTTACAAAAAACTTGGAATCATTGATTTTTAACCAGAGTTTATTTCCTTCAACAATGAGAGTTGGTCCTGATGATGGAAAGTCCTCCTCTGTATCTGTTAATCATCATCAAAGTGGATGGGATTCCCTGTCTTCTGCAACTTGTTCGAAAGCTTCCTCTATGCTTGTGGATTCTCGAGGGAAGATGAATCGTGAGGCAAATGAGCCGCACTGTCCAAGAGTAATGGCTGCTGCTCAAACTCTCTACGATATTGCAACTCGTGCTGCATTGAGGCAAAACATAGACGGTATAGTAAAGTGGCCAAAAAAGCCTTCACAAAAGTCCATGAAGGCTCGAAAGTTGAAATCAGAGGAAAC
CGAAGAATTATATGCCGCACCAACCACATACGGATTATGGTCCAACAATTCATTTAAAAACGAGGGCCAATTGCATCCCTCGAAGAAGCCAAAGCCAGGAACAGTAGAGAGCAGGAGAGACATTACTCAGACAAATAATAGAAAGGGACCATTGAATTGGGCTGCGACCAAATCGAGTAGATCGTCCCCAAGTAAGTTCGTTAGAGATTCGGTTTCGGAAGCTAAACATTCAACCTCAGGCGTAGTAAAACAATCATCAATGATGCCTCCTCCAGCAACTCATTTGTCCAAGGCTAGCGAGGGACAACAAAAGACGCGAAAGTTAATGCTCATGGATTGGAAAAGAGGAGGAACAGATTAGATTGATCAAGAAACTCTACATTATTTATATATCCCTTTCCAACAAAACGCTGCTATTAAATTAGAACCACTCCTCTGGGCTAGGAGTGGCTAAACTGGTAGGTTGTTTGTGTACATAATATTGTTGTAAGTGGCTGATTGATATGATTATTGTAATGGTCCGTCCTGAATAGTGACACGCTTTCTAATTGTTGTAACATCACCTTCATTCCAGAACGCAGTAGCAAATTCCTTGCCACTGTGTGCAAAAGTATTTGACGAGTCAATAGAAATTTTAGTTCTTATTACTGTTCCCTTGTGCCCCTAGATGTGCTCTTCACCAG
Coding sequence (CDS)
ATGGATGCTCTTGAGTTGAATTATCCGGTGGATGTGGCGGCGCCGAAGCTCATGGGACCTGACGGCTCTGTTAGAACCGGGATAATAATCGAGGAGGTTCAGTTGTGCGAAGCTGATCGTGGTTCTGCCCCTCCTAGCTATTCGTTTCAGCATTTTAGCTCGTACGGTAGTCAAAAAGTTGGGACGAGCTCTATCAATGATTTGGGTTCTGTTTCTTTGGATGAGATTCATGATGGGGCAGTTTCTAAGGATGGTGAAGATACACCTGAAGATTTTGAGAGTAGGAACAAAAGAAGTCACTTGTCCACTTCAAGTCCAGGAGTACAGCAACGTAAATCGTTAAAGGTCTCAAGGAGTAGTAGCAGTAGTTTATGTTCTAAGAGGCGTTTGGTTCAATTGGAAGATTCTTTATTACTAAGTGGGGCTGATGAAGTAAAAGATACGTCTGACAAGCTTGGATCATATCTTAAAAAGTGCGGCTCTCATGAGAAGCCTCAATTGATGAAACAGAAAAGCAGCGTAAGCAGCAAGCGGGGTGATAAGAGAAATCTCAAAGTGTCATTGAAGACCAAATTTGATTCACTCCCTATAAATGCTGGGAATGGCTCAGCTACCGCAGGGTGCATTTTTTTCGGACTATATGGGCTGAAATCAGATGTTCATGATTTCACAAAGCTTACGGATGATCCCCCGCTGAATGATCTTCTTGATGGCAGTTATGACTATTCTAGTTTAAGTAAAGCCAAAGGTAAAAAAGACACAAATGTAAATGAATGTTTTCTGCAGTCAATTAGAAAAGCTTGTTCTGTTCTTCAGCTCCCATGGCCTGTCCATCCACAAAATACTGCGGAGTCAGAGAGTTGTTCTAATAGCAAACCATCCACAAGTCTAGTTAGTTCTGTTTCAAGCATGGAAGAAGGGGTGAACTTCGATGCAAAAGAACTTAGTACAACAGATGCACCTTTATTAAGCAAGGTGCAAGATGCTTGTAGCAATTCTGAAACTTTGACCAACGTACTTGATTTTAAGTTGTACAAACCAGATGACATGTTTGTGAAATTGGGCCTTCCTCTACCAAAGGATTTGGAATCTTTGCTCCAGGATGCCAGCAAGTCCTCTGTATCTTCAAAGAATGCAACAGATTTGCGTTCTGCAAAGCAACAATCTCGTCGAGCAATCTTGCAACCATTTCCGTGGTCACATTCTTTTAATGGGCACTCTAAAGCAAATTCTGATTCATCCAAGTTTTCTGCAAATAGAACCACATGTACCGGTAGGTGGTGGCGAGTCGGAAACTTTGCCAATATCCCCACTGCTACTGCTGATTGTTTTACAAAAAACTTGGAATCATTGATTTTTAACCAGAGTTTATTTCCTTCAACAATGAGAGTTGGTCCTGATGATGGAAAGTCCTCCTCTGTATCTGTTAATCATCATCAAAGTGGATGGGATTCCCTGTCTTCTGCAACTTGTTCGAAAGCTTCCTCTATGCTTGTGGATTCTCGAGGGAAGATGAATCGTGAGGCAAATGAGCCGCACTGTCCAAGAGTAATGGCTGCTGCTCAAACTCTCTACGATATTGCAACTCGTGCTGCATTGAGGCAAAACATAGACGGTATAGTAAAGTGGCCAAAAAAGCCTTCACAAAAGTCCATGAAGGCTCGAAAGTTGAAATCAGAGGAAACCGAAGAATTATATGCCGCACCAACCACATACGGATTATGGTCCAACAATTCATTTAAAAACGAGGGCCAATTGCATCCCTCGAAGAAGCCAAAGCCAGGAACAGTAGAGAGCAGGAGAGACATTACTCAGACAAATAATAGAAAGGGACCATTGAATTGGGCTGCGACCAAATCGAGTAGATCGTCCCCAAGTAAGTTCGTTAGAGATTCGGTTTCGGAAGCTAAACATTCAACCTCAGGCGTAGTAAAACAATCATCAATGATGCCTCCTCCAGCAACTCATTTGTCCAAGGCTAGCGAGGGACAACAAAAGAC
GCGAAAGTTAATGCTCATGGATTGGAAAAGAGGAGGAACAGATTAG
Protein sequence
MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVQLCEADRGSAPPSYSFQHFSSYGSQKVGTSSINDLGSVSLDEIHDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSSSSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGDKRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPPLNDLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVHPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSTTDAPLLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLIFNQSLFPSTMRVGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEPHCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGQLHPSKKPKPGTVESRRDITQTNNRKGPLNWAATKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGGTD
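The protein sequence above corresponds to the conceptual translation of the coding sequence (CDS). The following is a minimal sketch of that translation, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative and are not part of this database or of any library.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (an assumption: the page does not state which translation table was used).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    # Remove whitespace so a sequence wrapped over several lines still works.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons containing ambiguous bases (e.g. N) are reported as X.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 codons of the CDS listed above.
print(translate("ATGGATGCTCTTGAGTTGAATTATCCGGTGGATGTGGCGGCGCCGAAGCTCATG"))
# prints MDALELNYPVDVAAPKLM
```

Applied to the full CDS, the same function should reproduce the protein sequence up to the terminal TAG stop codon.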
Homology
BLAST of CmaCh04G007210 vs. ExPASy TrEMBL
Match:
A0A6J1INL3 (uncharacterized protein LOC111479123 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111479123 PE=4 SV=1)
HSP 1 Score: 1325.1 bits (3428), Expect = 0.0e+00
Identity = 681/681 (100.00%), Positives = 681/681 (100.00%), Query Frame = 0
Query: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVQLCEADRGSAPPSYSFQHFSSYGSQKV 60
MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVQLCEADRGSAPPSYSFQHFSSYGSQKV
Sbjct: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVQLCEADRGSAPPSYSFQHFSSYGSQKV 60
Query: 61 GTSSINDLGSVSLDEIHDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
GTSSINDLGSVSLDEIHDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS
Sbjct: 61 GTSSINDLGSVSLDEIHDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
Query: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD
Sbjct: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
Query: 181 KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPPLNDLLDGSY 240
KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPPLNDLLDGSY
Sbjct: 181 KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPPLNDLLDGSY 240
Query: 241 DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVHPQNTAESESCSNSKPSTSLVSS 300
DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVHPQNTAESESCSNSKPSTSLVSS
Sbjct: 241 DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVHPQNTAESESCSNSKPSTSLVSS 300
Query: 301 VSSMEEGVNFDAKELSTTDAPLLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK 360
VSSMEEGVNFDAKELSTTDAPLLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK
Sbjct: 301 VSSMEEGVNFDAKELSTTDAPLLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK 360
Query: 361 DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR 420
DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR
Sbjct: 361 DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR 420
Query: 421 TTCTGRWWRVGNFANIPTATADCFTKNLESLIFNQSLFPSTMRVGPDDGKSSSVSVNHHQ 480
TTCTGRWWRVGNFANIPTATADCFTKNLESLIFNQSLFPSTMRVGPDDGKSSSVSVNHHQ
Sbjct: 421 TTCTGRWWRVGNFANIPTATADCFTKNLESLIFNQSLFPSTMRVGPDDGKSSSVSVNHHQ 480
Query: 481 SGWDSLSSATCSKASSMLVDSRGKMNREANEPHCPRVMAAAQTLYDIATRAALRQNIDGI 540
SGWDSLSSATCSKASSMLVDSRGKMNREANEPHCPRVMAAAQTLYDIATRAALRQNIDGI
Sbjct: 481 SGWDSLSSATCSKASSMLVDSRGKMNREANEPHCPRVMAAAQTLYDIATRAALRQNIDGI 540
Query: 541 VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGQLHPSKKPKPGTVESR 600
VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGQLHPSKKPKPGTVESR
Sbjct: 541 VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGQLHPSKKPKPGTVESR 600
Query: 601 RDITQTNNRKGPLNWAATKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA 660
RDITQTNNRKGPLNWAATKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA
Sbjct: 601 RDITQTNNRKGPLNWAATKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA 660
Query: 661 SEGQQKTRKLMLMDWKRGGTD 682
SEGQQKTRKLMLMDWKRGGTD
Sbjct: 661 SEGQQKTRKLMLMDWKRGGTD 681
BLAST of CmaCh04G007210 vs. ExPASy TrEMBL
Match:
A0A6J1FNQ1 (uncharacterized protein LOC111447442 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111447442 PE=4 SV=1)
HSP 1 Score: 1293.9 bits (3347), Expect = 0.0e+00
Identity = 668/679 (98.38%), Positives = 669/679 (98.53%), Query Frame = 0
Query: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVQLCEADRGSAPPSYSFQHFSSYGSQKV 60
MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEV+LCEADRGSAPPSYSFQHFSSYGSQKV
Sbjct: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGSQKV 60
Query: 61 GTSSINDLGSVSLDEIHDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
GTSSINDLGSVSLDEI DGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS
Sbjct: 61 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
Query: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD
Sbjct: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
Query: 181 KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPPLNDLLDGSY 240
KRNL VSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDP LN LLDGSY
Sbjct: 181 KRNLSVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPTLNGLLDGSY 240
Query: 241 DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVHPQNTAESESCSNSKPSTSLVSS 300
DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPV PQNTAESESCSNSKPSTSLVSS
Sbjct: 241 DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSLVSS 300
Query: 301 VSSMEEGVNFDAKELSTTDAPLLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK 360
VSSMEEGVNFDAKELSTTDAP LSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK
Sbjct: 301 VSSMEEGVNFDAKELSTTDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK 360
Query: 361 DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR 420
DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR
Sbjct: 361 DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR 420
Query: 421 TTCTGRWWRVGNFANIPTATADCFTKNLESLIFNQSLFPSTMRVGPDDGKSSSVSVNHHQ 480
TTCTGRWWRVGNFANIPTATADCFTKNLESL FNQSLFPSTMRVGPDDGKSSSVSVNHHQ
Sbjct: 421 TTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTMRVGPDDGKSSSVSVNHHQ 480
Query: 481 SGWDSLSSATCSKASSMLVDSRGKMNREANEPHCPRVMAAAQTLYDIATRAALRQNIDGI 540
SGWDSLSSATCSKASSMLVDSRGKMNREANE HCPRVMAAAQTLYDIATRAALRQNIDGI
Sbjct: 481 SGWDSLSSATCSKASSMLVDSRGKMNREANEQHCPRVMAAAQTLYDIATRAALRQNIDGI 540
Query: 541 VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGQLHPSKKPKPGTVESR 600
VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEG LHPSKKPKPGTVESR
Sbjct: 541 VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGHLHPSKKPKPGTVESR 600
Query: 601 RDITQTNNRKGPLNWAATKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA 660
RDITQTNNRKGPLNWA TKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA
Sbjct: 601 RDITQTNNRKGPLNWATTKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA 660
Query: 661 SEGQQKTRKLMLMDWKRGG 680
SEGQQKTRKLMLMDWKRGG
Sbjct: 661 SEGQQKTRKLMLMDWKRGG 679
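The Identity figure in each HSP header is the number of aligned columns with identical residues divided by the alignment length. The sketch below shows that calculation for a pair of aligned rows; `percent_identity` is a hypothetical helper, not a BLAST function. Positives are not computed here because they also count conservative substitutions scored by a substitution matrix.

```python
def percent_identity(query: str, sbjct: str) -> tuple[int, int, float]:
    """Return (identical columns, alignment length, percent identity)."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have the same length")
    # A column counts as identical only if both residues match and are not gaps.
    matches = sum(q == s and q != "-" for q, s in zip(query, sbjct))
    return matches, len(query), round(100.0 * matches / len(query), 2)


# First aligned row of the Cucurbita moschata hit above (one Q -> E substitution).
query = "MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVQLCEADRGSAPPSYSFQHFSSYGSQKV"
sbjct = "MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGSQKV"
print(percent_identity(query, sbjct))
# prints (59, 60, 98.33)
```

Concatenating all rows of an HSP before calling the function gives the whole-alignment value, for example 668 identical columns out of 679 for this hit (98.38%).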
BLAST of CmaCh04G007210 vs. ExPASy TrEMBL
Match:
A0A6J1C5T9 (uncharacterized protein LOC111008234 OS=Momordica charantia OX=3673 GN=LOC111008234 PE=4 SV=1)
HSP 1 Score: 1094.3 bits (2829), Expect = 0.0e+00
Identity = 576/685 (84.09%), Positives = 618/685 (90.22%), Query Frame = 0
Query: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVQLCEADRGSAPPSYSFQHFSSYGSQKV 60
MDA+EL YPVDVAAPKLMGPDGSVRTG+ IEEV+LCE+DR SAPPSYSFQHFSSYGSQK
Sbjct: 1 MDAVELTYPVDVAAPKLMGPDGSVRTGVTIEEVELCESDRVSAPPSYSFQHFSSYGSQKA 60
Query: 61 GTSSINDLGSVSLDEIHDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
GTSSIND+GSVSLD+I DGAVSKDGE T ED ESRNKRS L TSSPGVQQRKSLKVSRSS
Sbjct: 61 GTSSINDVGSVSLDKIPDGAVSKDGEGTSEDLESRNKRSLLFTSSPGVQQRKSLKVSRSS 120
Query: 121 SSSLCSKR-RLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSH--EKPQLMKQKSSVSSK 180
SSSLCSKR R+V+LEDSL LSGAD+VKDTSDKLGSYLKKC SH EK QL+KQKSS+SSK
Sbjct: 121 SSSLCSKRPRVVRLEDSLFLSGADDVKDTSDKLGSYLKKCSSHETEKAQLLKQKSSLSSK 180
Query: 181 RGDKRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPPLNDLLD 240
RGDKRNLKVSLKTKFDSL INAGNGSA AG F LYGLKSDVHDFTKL DDPPLNDLLD
Sbjct: 181 RGDKRNLKVSLKTKFDSLSINAGNGSAAAGSSFLALYGLKSDVHDFTKLVDDPPLNDLLD 240
Query: 241 GSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVHPQNTAESESCSNSKPSTSL 300
GSYD +SLS KGKKDTNVNECFLQS+RKACSVLQLPWPVHPQN AESE CSNSKPSTS+
Sbjct: 241 GSYDSASLSIDKGKKDTNVNECFLQSVRKACSVLQLPWPVHPQNIAESEGCSNSKPSTSI 300
Query: 301 VSSVSSMEEGVNFDAKELSTTDAPLLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLP 360
VS VSSMEEGVNFD KE TD+P L+KV+DACSNSETLTN LDFKLYKPDDMFVK+GLP
Sbjct: 301 VSYVSSMEEGVNFDVKEPIATDSPSLNKVRDACSNSETLTNPLDFKLYKPDDMFVKMGLP 360
Query: 361 LPKDLESLLQDASKSSV-SSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKF 420
LPKDLESLLQDASKSSV SSKN TDLRSAKQQSRRA+LQPFPWSHSFNGHSK+NSDSSKF
Sbjct: 361 LPKDLESLLQDASKSSVSSSKNVTDLRSAKQQSRRAMLQPFPWSHSFNGHSKSNSDSSKF 420
Query: 421 SANRTTCTGRWWRVGNFANIPTATADCFTKNLESLIFNQSLFPSTMR-VGPDDGKSSSVS 480
SANRTTC GRWWR+GNF++IP+ATADCFTK+LESL FNQSLFPSTMR VGPDD +SSSVS
Sbjct: 421 SANRTTCPGRWWRIGNFSSIPSATADCFTKDLESLTFNQSLFPSTMRVVGPDDRRSSSVS 480
Query: 481 VNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEPHCPRVMAAAQTLYDIATRAALRQ 540
VNHHQ GWDSLSSA CSKASS+LV+SRGK N EAN+ CP+V+AAA+TLYDIAT AA RQ
Sbjct: 481 VNHHQCGWDSLSSAICSKASSVLVESRGKTNYEANDQQCPKVIAAAKTLYDIATYAASRQ 540
Query: 541 NIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGQLHPSKKPKPG 600
NIDGIV+WPKKPSQKSM+ARKLKSEETEELYAAP TYGLWS+N FK+EG +H SKKPK G
Sbjct: 541 NIDGIVRWPKKPSQKSMRARKLKSEETEELYAAP-TYGLWSDNPFKSEGHMHSSKKPKLG 600
Query: 601 TVESRRDITQTNNRKGPLNWAATKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPAT 660
T ESRRD+ TN R+GPLNWA +SSRSSPSKFVRDS S+AKHSTSG+VK SSMMPPPAT
Sbjct: 601 TTESRRDLAHTNCRRGPLNWATPRSSRSSPSKFVRDSASDAKHSTSGIVKPSSMMPPPAT 660
Query: 661 H-LSKASEGQQKTRKLMLMDWKRGG 680
L K EGQQKTRKLMLMDWKRGG
Sbjct: 661 TLLCKGGEGQQKTRKLMLMDWKRGG 684
BLAST of CmaCh04G007210 vs. ExPASy TrEMBL
Match:
A0A6J1EYC2 (uncharacterized protein LOC111437570 OS=Cucurbita moschata OX=3662 GN=LOC111437570 PE=4 SV=1)
HSP 1 Score: 1059.3 bits (2738), Expect = 6.8e-306
Identity = 562/683 (82.28%), Positives = 599/683 (87.70%), Query Frame = 0
Query: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVQLCEADRGSAPPSYSFQHFSSYGSQKV 60
M ALEL PVDV KLMGPDGSVRTG+ IEEV+LCEADRGSAPPSYSFQHFSSYG +K
Sbjct: 1 MYALELTCPVDVVVSKLMGPDGSVRTGVTIEEVELCEADRGSAPPSYSFQHFSSYGCKKD 60
Query: 61 GTSSINDLGSVSLDEIHDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
GTSSINDLG VSLD++ DGAV KDGE+T EDFESRNKRSHLSTSS GVQ RK LKVSR
Sbjct: 61 GTSSINDLGPVSLDKVPDGAVFKDGENTSEDFESRNKRSHLSTSSLGVQPRKPLKVSR-G 120
Query: 121 SSSLCSKR-RLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRG 180
SSSLCSKR R+VQLED L LSGAD D SDKLGSYLKKC SHEK QL+KQKSS+SSKRG
Sbjct: 121 SSSLCSKRPRVVQLEDPLFLSGAD---DVSDKLGSYLKKCNSHEKTQLLKQKSSLSSKRG 180
Query: 181 DKRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPPLNDLLDGS 240
DKRNLKVSLKTKFDS NAGNGSA AG F GLYGLKS DFTKLTDDPPLND+LDGS
Sbjct: 181 DKRNLKVSLKTKFDSFSTNAGNGSAAAGSSFHGLYGLKSGARDFTKLTDDPPLNDILDGS 240
Query: 241 YDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVHPQNTAESESCSNSKPSTSLVS 300
YD ++LSK KGKKDTNVNECFLQSIRKACSVLQLPWPV PQN AESESCSNSKP TSLVS
Sbjct: 241 YDCANLSKDKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNMAESESCSNSKPDTSLVS 300
Query: 301 SVSSMEEGVNFDAKELSTTDAPLLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLP 360
SVSSMEE VNFD KELS TD+P L+KV+DAC+NSE LTN LDFKLYKPD MF+KLGLP+P
Sbjct: 301 SVSSMEEKVNFDVKELSATDSPSLNKVEDACNNSEPLTNALDFKLYKPDHMFMKLGLPIP 360
Query: 361 KDLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSAN 420
KDL SLLQDASKSSVSS NATDLRSAKQQSRRA+LQPF WSHSFNGHSKANSDSSKFSAN
Sbjct: 361 KDLNSLLQDASKSSVSSNNATDLRSAKQQSRRAMLQPFAWSHSFNGHSKANSDSSKFSAN 420
Query: 421 RTTCTGRWWRVGNFANIPTATADCFTKNLESLIFNQSLFPSTMRV-GPDDGKSSSVSVNH 480
RTTC GRWWRV NF+NIP+ATADCFTK+LESL FNQSLFPSTMRV GPDDG+ SS+SVNH
Sbjct: 421 RTTCLGRWWRVRNFSNIPSATADCFTKDLESLTFNQSLFPSTMRVIGPDDGR-SSISVNH 480
Query: 481 HQSGWDSLSSATCSKASSMLVDSRGKMNREANEPHCPRVMAAAQTLYDIATRAALRQNID 540
HQ GWDSLSSATCSK SS+LV+SRGKMN E+ E CPRVMAAAQTLYDIAT AALRQNID
Sbjct: 481 HQCGWDSLSSATCSKTSSVLVESRGKMNSESYEQQCPRVMAAAQTLYDIATSAALRQNID 540
Query: 541 GIVKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEG-QLHPSKKPKPGTV 600
G+V+WPKK SQKSM+ARKLKSEETEELY PTTYGLWSNNS KNEG HPSKKPK GT
Sbjct: 541 GMVRWPKKASQKSMRARKLKSEETEELYTTPTTYGLWSNNSIKNEGHHAHPSKKPKLGTT 600
Query: 601 ESRRDITQTNNRKGPLNWAATKSSRSSPSKFVRDSVSEAKHSTSGVVKQ-SSMMPPPATH 660
ESRRD+ QTN ++GPLNW +SSRSSPSKF+RDSVSEAK ST+G +KQ SSMMPPPAT
Sbjct: 601 ESRRDVAQTNCKRGPLNWTTPRSSRSSPSKFIRDSVSEAKPSTAGAIKQSSSMMPPPATL 660
Query: 661 LSKASEGQQKTRKLMLMDWKRGG 680
L KA EGQQKTRKLMLMDWKRGG
Sbjct: 661 LCKAGEGQQKTRKLMLMDWKRGG 678
BLAST of CmaCh04G007210 vs. ExPASy TrEMBL
Match:
A0A6J1JD46 (uncharacterized protein LOC111483405 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111483405 PE=4 SV=1)
HSP 1 Score: 1049.3 bits (2712), Expect = 7.0e-303
Identity = 553/666 (83.03%), Positives = 590/666 (88.59%), Query Frame = 0
Query: 18 MGPDGSVRTGIIIEEVQLCEADRGSAPPSYSFQHFSSYGSQKVGTSSINDLGSVSLDEIH 77
MGPDGSVRTG+ IEEV+LCEADRGSAPPSYSFQHFSSYGS+K GTSSINDLG VSLD++
Sbjct: 1 MGPDGSVRTGVTIEEVELCEADRGSAPPSYSFQHFSSYGSKKDGTSSINDLGPVSLDKVP 60
Query: 78 DGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSSSSSLCSKR-RLVQLEDS 137
DGAV KDGE+T EDFESRNKRSHLSTSSPGVQ RK LKVSR SSSLCSKR R+VQLED
Sbjct: 61 DGAVFKDGENTYEDFESRNKRSHLSTSSPGVQPRKPLKVSR-GSSSLCSKRPRVVQLEDP 120
Query: 138 LLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGDKRNLKVSLKTKFDSLP 197
L LSGAD D SDKLGSYLKKC SHEK QL+KQKSS+SSKRGDKRNLKVSLKTKFDS
Sbjct: 121 LFLSGAD---DVSDKLGSYLKKCNSHEKTQLLKQKSSLSSKRGDKRNLKVSLKTKFDSFS 180
Query: 198 INAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPPLNDLLDGSYDYSSLSKAKGKKDTNV 257
NAGNGSA AG F GLYGLKS DFTKLTDDPPLND+L+GSYDY++LSK KGKKDTNV
Sbjct: 181 TNAGNGSAAAGSSFHGLYGLKSGARDFTKLTDDPPLNDILNGSYDYANLSKDKGKKDTNV 240
Query: 258 NECFLQSIRKACSVLQLPWPVHPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELS 317
NECFLQSIRKACSVLQLPWPV PQN AESESCSNSKP TSLVSSVSSMEE VNFD KELS
Sbjct: 241 NECFLQSIRKACSVLQLPWPVRPQNMAESESCSNSKPDTSLVSSVSSMEEKVNFDVKELS 300
Query: 318 TTDAPLLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSS 377
TD+P L+KV+DAC NSE LTN LDFKLYKPD MF+KLGLP+PKDL SLLQDASKSSVSS
Sbjct: 301 ATDSPSLNKVEDACDNSEPLTNALDFKLYKPDHMFMKLGLPIPKDLNSLLQDASKSSVSS 360
Query: 378 KNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANI 437
NATDLRSAKQQSRRA+LQPF WSHSFNGHSKANSDSSKFSANRTTC GRWWRV NF+NI
Sbjct: 361 NNATDLRSAKQQSRRAMLQPFAWSHSFNGHSKANSDSSKFSANRTTCLGRWWRVRNFSNI 420
Query: 438 PTATADCFTKNLESLIFNQSLFPSTMR-VGPDDGKSSSVSVNHHQSGWDSLSSATCSKAS 497
P+ATADCFTK+LESL FNQSLFPSTMR VGPDDG+ SS+SVNHHQ GWDSLSSATCSK S
Sbjct: 421 PSATADCFTKDLESLTFNQSLFPSTMRVVGPDDGR-SSISVNHHQCGWDSLSSATCSKTS 480
Query: 498 SMLVDSRGKMNREANEPHCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKAR 557
S+LV+SRGKMN EANE CPRVMAAAQTLYDIAT AALRQNIDG+V+WPKK SQKSM+AR
Sbjct: 481 SVLVESRGKMNNEANEQQCPRVMAAAQTLYDIATSAALRQNIDGMVRWPKKASQKSMRAR 540
Query: 558 KLKSEETEELYAAPTTYGLWSNNSFKNEG-QLHPSKKPKPGTVESRRDITQTNNRKGPLN 617
KLKSEETEELY PTTYGLWSNNS KNEG HPSKKPK GT ESRRD+ QTN ++GPLN
Sbjct: 541 KLKSEETEELYTTPTTYGLWSNNSIKNEGHHAHPSKKPKLGTTESRRDVAQTNCKRGPLN 600
Query: 618 WAATKSSRSSPSKFVRDSVSEAKHSTSGVVKQ-SSMMPPPATHLSKASEGQQKTRKLMLM 677
W +SSRSSPSKF+RDS+SEAK ST+G +KQ SSMMPPPAT L KA EGQQ TRKLMLM
Sbjct: 601 WTTPRSSRSSPSKFIRDSISEAKPSTAGAIKQSSSMMPPPATLLCKAGEGQQNTRKLMLM 660
Query: 678 DWKRGG 680
DWKRGG
Sbjct: 661 DWKRGG 661
BLAST of CmaCh04G007210 vs. NCBI nr
Match:
XP_022979382.1 (uncharacterized protein LOC111479123 isoform X1 [Cucurbita maxima])
HSP 1 Score: 1325.1 bits (3428), Expect = 0.0e+00
Identity = 681/681 (100.00%), Positives = 681/681 (100.00%), Query Frame = 0
Query: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVQLCEADRGSAPPSYSFQHFSSYGSQKV 60
MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVQLCEADRGSAPPSYSFQHFSSYGSQKV
Sbjct: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVQLCEADRGSAPPSYSFQHFSSYGSQKV 60
Query: 61 GTSSINDLGSVSLDEIHDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
GTSSINDLGSVSLDEIHDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS
Sbjct: 61 GTSSINDLGSVSLDEIHDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
Query: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD
Sbjct: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
Query: 181 KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPPLNDLLDGSY 240
KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPPLNDLLDGSY
Sbjct: 181 KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPPLNDLLDGSY 240
Query: 241 DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVHPQNTAESESCSNSKPSTSLVSS 300
DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVHPQNTAESESCSNSKPSTSLVSS
Sbjct: 241 DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVHPQNTAESESCSNSKPSTSLVSS 300
Query: 301 VSSMEEGVNFDAKELSTTDAPLLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK 360
VSSMEEGVNFDAKELSTTDAPLLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK
Sbjct: 301 VSSMEEGVNFDAKELSTTDAPLLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK 360
Query: 361 DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR 420
DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR
Sbjct: 361 DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR 420
Query: 421 TTCTGRWWRVGNFANIPTATADCFTKNLESLIFNQSLFPSTMRVGPDDGKSSSVSVNHHQ 480
TTCTGRWWRVGNFANIPTATADCFTKNLESLIFNQSLFPSTMRVGPDDGKSSSVSVNHHQ
Sbjct: 421 TTCTGRWWRVGNFANIPTATADCFTKNLESLIFNQSLFPSTMRVGPDDGKSSSVSVNHHQ 480
Query: 481 SGWDSLSSATCSKASSMLVDSRGKMNREANEPHCPRVMAAAQTLYDIATRAALRQNIDGI 540
SGWDSLSSATCSKASSMLVDSRGKMNREANEPHCPRVMAAAQTLYDIATRAALRQNIDGI
Sbjct: 481 SGWDSLSSATCSKASSMLVDSRGKMNREANEPHCPRVMAAAQTLYDIATRAALRQNIDGI 540
Query: 541 VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGQLHPSKKPKPGTVESR 600
VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGQLHPSKKPKPGTVESR
Sbjct: 541 VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGQLHPSKKPKPGTVESR 600
Query: 601 RDITQTNNRKGPLNWAATKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA 660
RDITQTNNRKGPLNWAATKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA
Sbjct: 601 RDITQTNNRKGPLNWAATKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA 660
Query: 661 SEGQQKTRKLMLMDWKRGGTD 682
SEGQQKTRKLMLMDWKRGGTD
Sbjct: 661 SEGQQKTRKLMLMDWKRGGTD 681
BLAST of CmaCh04G007210 vs. NCBI nr
Match:
XP_023535222.1 (uncharacterized protein LOC111796709 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1296.2 bits (3353), Expect = 0.0e+00
Identity = 668/679 (98.38%), Positives = 670/679 (98.67%), Query Frame = 0
Query: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVQLCEADRGSAPPSYSFQHFSSYGSQKV 60
MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEV+LCEADRGSAPPSYSFQHFSSYGSQKV
Sbjct: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGSQKV 60
Query: 61 GTSSINDLGSVSLDEIHDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
GTSSINDLGSVSLDEI DGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS
Sbjct: 61 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
Query: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD
Sbjct: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
Query: 181 KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPPLNDLLDGSY 240
KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPPLN LLDGSY
Sbjct: 181 KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPPLNGLLDGSY 240
Query: 241 DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVHPQNTAESESCSNSKPSTSLVSS 300
DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPV PQNTAESESCSNSKPSTSLVSS
Sbjct: 241 DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSLVSS 300
Query: 301 VSSMEEGVNFDAKELSTTDAPLLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK 360
VSSMEEGVNFDAKELSTTDAP LSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK
Sbjct: 301 VSSMEEGVNFDAKELSTTDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK 360
Query: 361 DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR 420
DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR
Sbjct: 361 DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR 420
Query: 421 TTCTGRWWRVGNFANIPTATADCFTKNLESLIFNQSLFPSTMRVGPDDGKSSSVSVNHHQ 480
TTCTGRWWRVGNFANIPTATADCFTKNLESL FNQSLFPSTMRVGPDDGKSSSVSVNHHQ
Sbjct: 421 TTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTMRVGPDDGKSSSVSVNHHQ 480
Query: 481 SGWDSLSSATCSKASSMLVDSRGKMNREANEPHCPRVMAAAQTLYDIATRAALRQNIDGI 540
SGWDSLSSATCSKASSMLVD RGKMNREANE HCPRVMAAAQTLYDIATRAALRQNIDGI
Sbjct: 481 SGWDSLSSATCSKASSMLVDFRGKMNREANEQHCPRVMAAAQTLYDIATRAALRQNIDGI 540
Query: 541 VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGQLHPSKKPKPGTVESR 600
VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEG LHPSKKPKPGTVESR
Sbjct: 541 VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGHLHPSKKPKPGTVESR 600
Query: 601 RDITQTNNRKGPLNWAATKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA 660
RDITQTNNRKGPLNWA TKSSRSSPSKF+RDSVSEAKHSTSGVVKQSSMMPPPATHLSKA
Sbjct: 601 RDITQTNNRKGPLNWATTKSSRSSPSKFIRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA 660
Query: 661 SEGQQKTRKLMLMDWKRGG 680
SEGQQKTRKLMLMDWKRGG
Sbjct: 661 SEGQQKTRKLMLMDWKRGG 679
BLAST of CmaCh04G007210 vs. NCBI nr
Match:
XP_022942381.1 (uncharacterized protein LOC111447442 isoform X1 [Cucurbita moschata])
HSP 1 Score: 1293.9 bits (3347), Expect = 0.0e+00
Identity = 668/679 (98.38%), Positives = 669/679 (98.53%), Query Frame = 0
Query: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVQLCEADRGSAPPSYSFQHFSSYGSQKV 60
MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEV+LCEADRGSAPPSYSFQHFSSYGSQKV
Sbjct: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGSQKV 60
Query: 61 GTSSINDLGSVSLDEIHDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
GTSSINDLGSVSLDEI DGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS
Sbjct: 61 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
Query: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD
Sbjct: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
Query: 181 KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPPLNDLLDGSY 240
KRNL VSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDP LN LLDGSY
Sbjct: 181 KRNLSVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPTLNGLLDGSY 240
Query: 241 DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVHPQNTAESESCSNSKPSTSLVSS 300
DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPV PQNTAESESCSNSKPSTSLVSS
Sbjct: 241 DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSLVSS 300
Query: 301 VSSMEEGVNFDAKELSTTDAPLLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK 360
VSSMEEGVNFDAKELSTTDAP LSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK
Sbjct: 301 VSSMEEGVNFDAKELSTTDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK 360
Query: 361 DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR 420
DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR
Sbjct: 361 DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR 420
Query: 421 TTCTGRWWRVGNFANIPTATADCFTKNLESLIFNQSLFPSTMRVGPDDGKSSSVSVNHHQ 480
TTCTGRWWRVGNFANIPTATADCFTKNLESL FNQSLFPSTMRVGPDDGKSSSVSVNHHQ
Sbjct: 421 TTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTMRVGPDDGKSSSVSVNHHQ 480
Query: 481 SGWDSLSSATCSKASSMLVDSRGKMNREANEPHCPRVMAAAQTLYDIATRAALRQNIDGI 540
SGWDSLSSATCSKASSMLVDSRGKMNREANE HCPRVMAAAQTLYDIATRAALRQNIDGI
Sbjct: 481 SGWDSLSSATCSKASSMLVDSRGKMNREANEQHCPRVMAAAQTLYDIATRAALRQNIDGI 540
Query: 541 VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGQLHPSKKPKPGTVESR 600
VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEG LHPSKKPKPGTVESR
Sbjct: 541 VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGHLHPSKKPKPGTVESR 600
Query: 601 RDITQTNNRKGPLNWAATKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA 660
RDITQTNNRKGPLNWA TKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA
Sbjct: 601 RDITQTNNRKGPLNWATTKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA 660
Query: 661 SEGQQKTRKLMLMDWKRGG 680
SEGQQKTRKLMLMDWKRGG
Sbjct: 661 SEGQQKTRKLMLMDWKRGG 679
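The Identity and Positives figures in the HSP header of the alignment above can be reproduced from the middle ("midline") row of each block. The sketch below is a minimal illustration, assuming the usual BLAST convention: a letter in the midline marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. The function names `midline_counts` and `percent` are hypothetical helpers, not part of any BLAST tool. Because leading and trailing blanks may have been lost from the midlines in this text, the sketch pads each midline to the block length.

```python
def midline_counts(midline: str, aligned_length: int) -> tuple[int, int]:
    """Count identities and positives in one BLAST midline row.

    Assumed convention: letter = identical residue, '+' = conservative
    substitution, blank = mismatch or gap.
    """
    # Pad on the right in case trailing blanks were stripped from the text.
    padded = midline.ljust(aligned_length)
    identities = sum(ch.isalpha() for ch in padded)
    positives = identities + padded.count("+")
    return identities, positives


def percent(count: int, total: int) -> float:
    """Percentage rounded to two decimals, as shown in the HSP header."""
    return round(100.0 * count / total, 2)


# Midline of the first block (query residues 1-60) of the alignment above.
first_midline = "MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEV+LCEADRGSAPPSYSFQHFSSYGSQKV"
block_identities, block_positives = midline_counts(first_midline, 60)
print(block_identities, block_positives)  # 59 60

# Header totals for the whole HSP: 668 identities, 669 positives, length 679.
print(percent(668, 679), percent(669, 679))  # 98.38 98.53
```

Summing the per-block counts over all twelve blocks gives the header totals; the single `+` in the first block accounts for the one-residue difference between identities and positives.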
BLAST of CmaCh04G007210 vs. NCBI nr
Match:
KAG6600547.1 (hypothetical protein SDJN03_05780, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 1287.7 bits (3331), Expect = 0.0e+00
Identity = 664/679 (97.79%), Positives = 667/679 (98.23%), Query Frame = 0
Query: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVQLCEADRGSAPPSYSFQHFSSYGSQKV 60
MDALELNYPVDVAAPKLMGPDGSVRTGIII+EV+LCEADRGSAPPSYSFQHFSSYGSQKV
Sbjct: 15 MDALELNYPVDVAAPKLMGPDGSVRTGIIIDEVELCEADRGSAPPSYSFQHFSSYGSQKV 74
Query: 61 GTSSINDLGSVSLDEIHDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
GTSSINDLGSVSLDEI DGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS
Sbjct: 75 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 134
Query: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD
Sbjct: 135 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 194
Query: 181 KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPPLNDLLDGSY 240
KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPPLN LLDGSY
Sbjct: 195 KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPPLNGLLDGSY 254
Query: 241 DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVHPQNTAESESCSNSKPSTSLVSS 300
DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPV PQNTAESESCSNSKPSTSLVSS
Sbjct: 255 DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSLVSS 314
Query: 301 VSSMEEGVNFDAKELSTTDAPLLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK 360
VSSMEEGVNFDAKELS TDAP LSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK
Sbjct: 315 VSSMEEGVNFDAKELSATDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK 374
Query: 361 DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR 420
DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR
Sbjct: 375 DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR 434
Query: 421 TTCTGRWWRVGNFANIPTATADCFTKNLESLIFNQSLFPSTMRVGPDDGKSSSVSVNHHQ 480
TTCTGRWWRVGNFANIPTATADCFTKNLESL FNQSLFPSTM VGPDDGKSSSVSVNHHQ
Sbjct: 435 TTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTMGVGPDDGKSSSVSVNHHQ 494
Query: 481 SGWDSLSSATCSKASSMLVDSRGKMNREANEPHCPRVMAAAQTLYDIATRAALRQNIDGI 540
SGWDSLSSATCSKASSMLVDSRGKMNREANE HCPRVMAAAQTLYDIATRAA RQNIDGI
Sbjct: 495 SGWDSLSSATCSKASSMLVDSRGKMNREANEQHCPRVMAAAQTLYDIATRAASRQNIDGI 554
Query: 541 VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGQLHPSKKPKPGTVESR 600
VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEG LHPSKKPKPGTVESR
Sbjct: 555 VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGHLHPSKKPKPGTVESR 614
Query: 601 RDITQTNNRKGPLNWAATKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA 660
RDITQTNNRKGPLNWA TKSSRSSPSKF RDSVSEAKHSTSG+VKQSSMMPPPATHLSKA
Sbjct: 615 RDITQTNNRKGPLNWATTKSSRSSPSKFFRDSVSEAKHSTSGLVKQSSMMPPPATHLSKA 674
Query: 661 SEGQQKTRKLMLMDWKRGG 680
SEGQQKTRKLMLMDWKRGG
Sbjct: 675 SEGQQKTRKLMLMDWKRGG 693
BLAST of CmaCh04G007210 vs. NCBI nr
Match:
KAG7031186.1 (hypothetical protein SDJN02_05226 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1275.0 bits (3298), Expect = 0.0e+00
Identity = 665/707 (94.06%), Positives = 668/707 (94.48%), Query Frame = 0
Query: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVQLCEADRGSAPPSYSFQHFSSYGSQKV 60
MDALELNYPVDVAAPKLMGPDGSVRTGIII+EV+LCEADRGSAPPSYSFQHFSSYGSQKV
Sbjct: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIDEVELCEADRGSAPPSYSFQHFSSYGSQKV 60
Query: 61 GTSSINDLGSVSLDEIHDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
GTSSINDLGSVSLDEI DGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS
Sbjct: 61 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
Query: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD
Sbjct: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
Query: 181 KRNLKVSLKTKFDSLPINAGNGSATAGCIFF----------------------------G 240
KRNLKVSLKTKFDSLPINAGNGSATAGCIFF G
Sbjct: 181 KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGSHWFLLLCTLLCITQHKIIYGPCEMFTG 240
Query: 241 LYGLKSDVHDFTKLTDDPPLNDLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ 300
LYGLKSDVHDFTKLTDDPPLN LLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ
Sbjct: 241 LYGLKSDVHDFTKLTDDPPLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ 300
Query: 301 LPWPVHPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSTTDAPLLSKVQDACSN 360
LPWPV PQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELS TDAP LSKVQDACSN
Sbjct: 301 LPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSATDAPSLSKVQDACSN 360
Query: 361 SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA 420
SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA
Sbjct: 361 SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA 420
Query: 421 ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLI 480
ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESL
Sbjct: 421 ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLT 480
Query: 481 FNQSLFPSTMRVGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEP 540
FNQSLFPSTM VGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANE
Sbjct: 481 FNQSLFPSTMGVGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQ 540
Query: 541 HCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY 600
HCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY
Sbjct: 541 HCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY 600
Query: 601 GLWSNNSFKNEGQLHPSKKPKPGTVESRRDITQTNNRKGPLNWAATKSSRSSPSKFVRDS 660
GLWSNNSFKNEG LHPSKKPKPGTVESRRDITQTNNRKGPLNWA TKSSRSSPSKF RDS
Sbjct: 601 GLWSNNSFKNEGHLHPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSKFFRDS 660
Query: 661 VSEAKHSTSGVVKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGG 680
VSEAKHSTSG+VKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGG
Sbjct: 661 VSEAKHSTSGLVKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGG 707
BLAST of CmaCh04G007210 vs. TAIR 10
Match:
AT1G64050.1 (unknown protein; Has 524 Blast hits to 342 proteins in 101 species: Archae - 0; Bacteria - 106; Metazoa - 106; Fungi - 24; Plants - 25; Viruses - 0; Other Eukaryotes - 263 (source: NCBI BLink). )
HSP 1 Score: 256.1 bits (653), Expect = 7.7e-68
Identity = 242/708 (34.18%), Positives = 348/708 (49.15%), Query Frame = 0
Query: 1 MDALELNYPVDVAAP-KLMGPDGSVRTGIIIEEVQL---CEADRGSAPPSYSFQHFSSYG 60
MD L+++ PVDV+ P KLMG +G G+ + + C+ R S + S + SS
Sbjct: 1 MDGLKISCPVDVSLPAKLMGSEG-CGGGVRVSSNKADNNCDKARVSIGVNSSIERCSSAS 60
Query: 61 SQKVGTSSINDLGSVSL--------DEIHDGAVSKDGEDTPEDFESRNKRSHLSTSSPGV 120
K G S + SL D +HD +++ E +N S +++
Sbjct: 61 INKKGAGSSSGASDSSLWRKLMHSHDFVHDRLTKLRVDNSS---EPQNGYSPIASPESAE 120
Query: 121 QQRKSLKVSRSSSSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLM 180
RK K+SRSSS + +R + L D + + D DT + G C +KP ++
Sbjct: 121 SPRKRGKLSRSSSGNGTPRRTKLILLDETVRTQRD--NDTKEICGQGSTSC--LDKPFVV 180
Query: 181 KQKSSVSSKRGDKRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTD 240
KQ+SS + KRGDKR KV ++T + SAT FFG YGLK ++D TKL +
Sbjct: 181 KQRSSYNGKRGDKRISKVPVRTL-------STINSATGENAFFGAYGLKPAINDVTKLVE 240
Query: 241 DPPLNDLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVHPQNTAESESC 300
D L LL+GSY+ SL K K KK N N L ++ S+L PV Q++ E ++C
Sbjct: 241 DFSLKSLLEGSYECPSLGKDKMKKSENTNNTLLSVVKNVWSILPTKRPVQSQSSTELDTC 300
Query: 301 SNSKPSTSLVSSVSSMEEGVNFDAKELSTTDAPLLSKVQDACSNSETLTNVLDFKLYKPD 360
+ + S +++ N D +++ D L S +D C NSE + L F L
Sbjct: 301 LSRTLGSPPSSISATLPNSENID--KVNALDGDLSSSSKDHCINSEIPSTPLSFPLCDAG 360
Query: 361 DMFVKLGLPLPKDLESLLQDASKSSVSSKNATD-LRSAKQQSRRAILQPFPWSHSFNGHS 420
D+ +LGLP KDL+SLLQDASK S +SKN D RSAK + L FPWS FNG S
Sbjct: 361 DVLKRLGLPPSKDLDSLLQDASKPSHNSKNNLDQQRSAKPP--HSGLPHFPWSQPFNGSS 420
Query: 421 KANSDSSKFSANRTTCTGRWWRVGNFA-NIPTATADCFTKNLESLIFNQSLFPSTMRVGP 480
+ NS+++K +T C GRW R+ + + + P D F NLESL FNQ+L P +
Sbjct: 421 RTNSEAAKLVTCKTLCQGRWLRIADTSMSSPEGITDNFA-NLESLTFNQNLVPPLL---- 480
Query: 481 DDGKSSSVSVNHHQSGWDSLSSATCSKAS-SMLVDS-------RGKMNREANEPHCPRVM 540
K + V Q+ + + S C++AS S L +S G + E + CP+++
Sbjct: 481 ---KQTITGVKTSQTKFANTISCQCAEASVSTLQNSFFVPKEPEGSPDVEDDALSCPQLL 540
Query: 541 AAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLK--SEETEELYAAPTTYGLWS 600
AA+TL DIA ++A N +GI++WPKK SQKSMKARK K + E ++ L S
Sbjct: 541 EAARTLCDIAVQSANHDNPNGILRWPKKLSQKSMKARKSKLIEKPLERHRTTVSSIDLNS 600
Query: 601 NNSFKNEGQL-----------HPSKKPKPGTVESRRDITQTNNRKGPLNWAATKSSRSSP 660
+N+ N+ + H PKP R ++ N+K + S SSP
Sbjct: 601 SNNNNNKNHVRKDSAAEHNHHHHHHHPKP---SKRLKLSTMENKK------RSFPSSSSP 660
Query: 661 SKFVRDSVSEAKHSTSGVVKQSSMM----PPPATHLSKASEGQQKTRK 670
+ S+ KHS+S K S M PPP L K+S QK RK
Sbjct: 661 IE------SDRKHSSSSKFKNHSRMMLPPPPPTRTLQKSSTYPQKARK 666
BLAST of CmaCh04G007210 vs. TAIR 10
Match:
AT1G64050.2 (unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages. )
HSP 1 Score: 256.1 bits (653), Expect = 7.7e-68
Identity = 243/706 (34.42%), Positives = 350/706 (49.58%), Query Frame = 0
Query: 1 MDALELNYPVDVAAP-KLMGPDGSVRTGIIIEEVQL---CEADRGSAPPSYSFQHFSSYG 60
MD L+++ PVDV+ P KLMG +G G+ + + C+ R S + S + SS
Sbjct: 1 MDGLKISCPVDVSLPAKLMGSEG-CGGGVRVSSNKADNNCDKARVSIGVNSSIERCSSAS 60
Query: 61 SQKVGTSSINDLGSV------SLDEIHDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQ 120
K G+SS S+ S D +HD +++ E +N S +++
Sbjct: 61 INKKGSSSGASDSSLWRKLMHSHDFVHDRLTKLRVDNSS---EPQNGYSPIASPESAESP 120
Query: 121 RKSLKVSRSSSSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQ 180
RK K+SRSSS + +R + L D + + D DT + G C +KP ++KQ
Sbjct: 121 RKRGKLSRSSSGNGTPRRTKLILLDETVRTQRD--NDTKEICGQGSTSC--LDKPFVVKQ 180
Query: 181 KSSVSSKRGDKRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDP 240
+SS + KRGDKR KV ++T + SAT FFG YGLK ++D TKL +D
Sbjct: 181 RSSYNGKRGDKRISKVPVRTL-------STINSATGENAFFGAYGLKPAINDVTKLVEDF 240
Query: 241 PLNDLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVHPQNTAESESCSN 300
L LL+GSY+ SL K K KK N N L ++ S+L PV Q++ E ++C +
Sbjct: 241 SLKSLLEGSYECPSLGKDKMKKSENTNNTLLSVVKNVWSILPTKRPVQSQSSTELDTCLS 300
Query: 301 SKPSTSLVSSVSSMEEGVNFDAKELSTTDAPLLSKVQDACSNSETLTNVLDFKLYKPDDM 360
+ S +++ N D +++ D L S +D C NSE + L F L D+
Sbjct: 301 RTLGSPPSSISATLPNSENID--KVNALDGDLSSSSKDHCINSEIPSTPLSFPLCDAGDV 360
Query: 361 FVKLGLPLPKDLESLLQDASKSSVSSKNATD-LRSAKQQSRRAILQPFPWSHSFNGHSKA 420
+LGLP KDL+SLLQDASK S +SKN D RSAK + L FPWS FNG S+
Sbjct: 361 LKRLGLPPSKDLDSLLQDASKPSHNSKNNLDQQRSAKPP--HSGLPHFPWSQPFNGSSRT 420
Query: 421 NSDSSKFSANRTTCTGRWWRVGNFA-NIPTATADCFTKNLESLIFNQSLFPSTMRVGPDD 480
NS+++K +T C GRW R+ + + + P D F NLESL FNQ+L P +
Sbjct: 421 NSEAAKLVTCKTLCQGRWLRIADTSMSSPEGITDNFA-NLESLTFNQNLVPPLL------ 480
Query: 481 GKSSSVSVNHHQSGWDSLSSATCSKAS-SMLVDS-------RGKMNREANEPHCPRVMAA 540
K + V Q+ + + S C++AS S L +S G + E + CP+++ A
Sbjct: 481 -KQTITGVKTSQTKFANTISCQCAEASVSTLQNSFFVPKEPEGSPDVEDDALSCPQLLEA 540
Query: 541 AQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLK--SEETEELYAAPTTYGLWSNN 600
A+TL DIA ++A N +GI++WPKK SQKSMKARK K + E ++ L S+N
Sbjct: 541 ARTLCDIAVQSANHDNPNGILRWPKKLSQKSMKARKSKLIEKPLERHRTTVSSIDLNSSN 600
Query: 601 SFKNEGQL-----------HPSKKPKPGTVESRRDITQTNNRKGPLNWAATKSSRSSPSK 660
+ N+ + H PKP R ++ N+K + S SSP +
Sbjct: 601 NNNNKNHVRKDSAAEHNHHHHHHHPKP---SKRLKLSTMENKK------RSFPSSSSPIE 660
Query: 661 FVRDSVSEAKHSTSGVVKQSSMM----PPPATHLSKASEGQQKTRK 670
S+ KHS+S K S M PPP L K+S QK RK
Sbjct: 661 ------SDRKHSSSSKFKNHSRMMLPPPPPTRTLQKSSTYPQKARK 664
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
A0A6J1INL3 | 0.0e+00 | 100.00 | uncharacterized protein LOC111479123 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1FNQ1 | 0.0e+00 | 98.38 | uncharacterized protein LOC111447442 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1C5T9 | 0.0e+00 | 84.09 | uncharacterized protein LOC111008234 OS=Momordica charantia OX=3673 GN=LOC111008...
A0A6J1EYC2 | 6.8e-306 | 82.28 | uncharacterized protein LOC111437570 OS=Cucurbita moschata OX=3662 GN=LOC1114375...
A0A6J1JD46 | 7.0e-303 | 83.03 | uncharacterized protein LOC111483405 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
Match Name | E-value | Identity | Description
XP_022979382.1 | 0.0e+00 | 100.00 | uncharacterized protein LOC111479123 isoform X1 [Cucurbita maxima]
XP_023535222.1 | 0.0e+00 | 98.38 | uncharacterized protein LOC111796709 isoform X1 [Cucurbita pepo subsp. pepo]
XP_022942381.1 | 0.0e+00 | 98.38 | uncharacterized protein LOC111447442 isoform X1 [Cucurbita moschata]
KAG6600547.1 | 0.0e+00 | 97.79 | hypothetical protein SDJN03_05780, partial [Cucurbita argyrosperma subsp. sorori...
KAG7031186.1 | 0.0e+00 | 94.06 | hypothetical protein SDJN02_05226 [Cucurbita argyrosperma subsp. argyrosperma]
Match Name | E-value | Identity | Description
AT1G64050.1 | 7.7e-68 | 34.18 | unknown protein; Has 524 Blast hits to 342 proteins in 101 species: Archae - 0; ...
AT1G64050.2 | 7.7e-68 | 34.42 | unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_c...