Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTAAACACTCCCGTTTTTCTTCCTTTCTCCTTTCGTTTTCTCCCACCTAATTCACTTTTCTTGACTCACTCCCCGATCCCAGTCTTTCCGGTGCGATGTCCAAATGGTCTTCTGTAAGTTCAGATTGGTATCTGTTTTCGACGGCGTTTTAGGTTCCTTTGAGATGTTCGGCGGCTGCAGTGAATCGTCTTCTTTTTATAGTTCGCTGTTGTTAGGGCTTCGGTGATGGATGCTCTTGAGTTGAATTATCCGGTGGATGTGGCGGCGCCGAAGCTCATGGGACCTGACGGCTCTGTTAGAACCGGGATAATAATCGAGGAGGTTGAGTTGTGCGAAGCTGATCGTGGTTCTGCCCCTCCTAGCTATTCGTTTCAGCATTTTAGCTCGTACGGTAGTCAAAAAGGTGAAACAGCCTTTTTTTTTGCTGTAATACTGCTGTTTAATTTTTTGATTTGTTGCTGGTTTCGTATTTTTAACTGCATCCGATGGCTTAGTAACCTCCTCGCATTCCAATTGGCCAATACTGGAGGCCATGGAGAGTGATATTTGTTTCGAAAAAGAGATGATAATGAATATAGTGCTTGGAGTTTGATGCATTGCTTGTTCATGAACTGACTCTATGTAGTAGGATGAAAGTTACAGAGCGTTTTGAGTCACTTATTGTTGAAGACCAATGCGTTTCAGCAGCCCCATGAAGACAGCCACCACTTTGTAATAACATTTTTTCAATAAATGTGTTCCCGCGGATTTTGCGAATTGAAGTGTTTTATATAAACCCTAGGCGTTTGTGAGGATGTCTTATCCTTTTTTCTTTTTGTGCTTTTTCATTATTAATAAAGGACTCTTCTTATTAAATGTGTTCCAGTGGATTTTGAGTGAGGTTGACTTGGGATCCAGCATGCAGATTACTTAAATGTGGCAGTCGTTTGGTTTTGTATTTTATTATTTTGAAAGGGAAATGTGATAAAGAAAAAATTCTGCTCTTAGGATATGGTGGCGGAAACATCAAACATTTTGCATAAGAAGAAACACAGTGTTTTTTTTAGTATTCTTTCATAGGATGGTTTATATCATCTGTCTAGTGAAATTCTGAACATGGAATAAGGTCTATATGGTGGATTTAGACAATCTTGAAGAAAAAATTGCATCTTTTTTTTAACATACTTTCTATAGATGAATGACGATGATAGTTGGTTCTGGCTGCTGCATTTGTGAAAGGATTGTCATATATTTTGGTTTATTTGGACTTTTTTAAGTCTTCTACTTTTTTATTCTTTGCCACATATCATCTATGTATGTGATTCCACTTCGGGTTGATTTTTGTAAAATTTATATATCACAAGTCTCTAGCATATCATCTCCTTTATCTGTCATTTTTTCTTTCTTCATTTTCTATATTTTAATGAAATAATGGTCTATAAATGTCGGTGTGCCAGTTGGGACGAGCTCTATCAATGATTTGGGTTCTGTTTCTTTGGATGAGATCCCTGATGGGGCAGTTTCTAAGGATGGTGAAGATACACCTGAAGATTTTGAGAGTAGGAACAAAAGAAGTCATTTGTCCACTTCAAGTCCAGGAGTACAGCAACGTAAATCGTTAAAGGTCTCAAGGAGTAGTAGCAGTAGTTTATGTTCTAAGAGGCGTTTGGTTCAATTGGAAGATTCTTTATTACTAAGTGGGGCTGATGAAGTAAAAGATACGTCTGACAAGCTTGGATCATATCTTAAAAAGTGCGGCTCTCATGGTAATGTTCAGTTCCCAATTTAATGTTATAAATGTTCATCTTTTGCTATTACTTTAGACTATTCTTATCATTTCCAAATAACGCCATGCAATACATCTATTTCATATTTTCAGATAGGATCATTGTGCAAGAAATCTGATTAAAAAATAATCCAAATGTTTAGCTATAACTAGAAGCAATAGGAACGGATAAAGAATCTAAAAATTAAAAGTAGCCTCAGTTGTTTAAAATTATTACATGTGAATTGTTTATAAGATCCACGTGTAGAGGACCATGAAAGGCAATTCAATGAACTTTGATCCCAATTATTCAGGAAGTGGTTATTTTCTTCTTTGCTATTTGAATGTTATGATTCTTTCATTTATCGTAAATTTATGACAGAGGAGGATATTTGGGAAATTTTTTGGTTACATTGGTAATGTTATCAATTCATTGATCTTACTATGTACTTTCTATTTCTAGAGTAAAGAAATCATTGTATTCATATTTTCTTGACTTAATGTATTGAAAAACTGGTGCTACGGTGGTGATTTGACTTTTTTTAAAATATTTTAAATATGTAATATTGTGCAAGCTTAGAATGGGCAATATCTGTTGCATATATGTGATTGAGTTAGGATATCCTTTCGCATTTTCTATACTTCCTATGCTTTCCTCATTTAAGTTGTTACTGATGTATACACCCCCCCCCCCCCCCCCCCCCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCTCCACAAAAAAAAAAAAAAAAAACAGAGAAGCCTCAATTGATGAAACAGAAAAGCAGCGTAAGCAGCAAGCGGGGTGATAAGAGAAATCTCAGTGTGTCATTGAAGACCAAATTTGATTCACTCCCTATAAATGCTGGAAATGGCTCAGCTACCGCAGGGTGCATTTTTTTCGGTATGATGATGTCCTGTGAATTATTAATTAATGTGCTCAGAAACTTGCATTTATGTGGTACTTACGGTATTTTGAGGACCTTATGACATGTATTGAAGAAAGACTCCACTGGATAATGCAGACGTTTTAAATTGTGTAGATATACTTAGTTTGTTGTGAAGAAAATTTATATCATTGTTGTGACAGAACCTAAATTGAGGGATAACTAATAAGTCTAACCAAATTAGGACAGTTCTAATTCTATCATGTCTTGATTTGTGTGTTACTTCAATATTTTTCTGTGAAGATATTAATTGGATAGTCCTTGTAATTAAGGTATCTCTGGAAAGACATTATCACATGAATAGCCTTTTGAATAAGAGGTATAGCCATTGCACCTATCTCGCGAAAAGATAGAAAGTGAAATGTCGGCTATTTTATTTTGACTTCCAACAAGAGTTATTCCTTGAGTATGTCTTGATGTACAAAAATGAGTTGCGTCCCAGTTTTTACAAGTGTTATAATGAAAGCATTTTCTCTTTACCTTCTTTGATTTTTTTTTTTTCCTTCCATCACTATAGGTCGCTACTGGATCTTGTTTTCAGTTTTGCAAAGAGTTTAATTGCTCAGTGCTCAATTCCTTTAGTTGTTACTTCCTATTATGACATAAGTGAAAAAACAAATCACTTGTTTATAATCTATTTATAATAAAGGTTCTCATTGGTTTTTACTTTTGTGCACCTTACTTTGCATCACACAGCATAAAATCATTTATGGGCCTTGTGAAATGTTTACAGGACTATATGGGCTGAAATCAGATGTTCATGATTTCACAAAGCTTACGGATGATCCAACGCTGAATGGTCTTCTTGATGGCAGTTATGACTATTCTAGTTTAAGTAAAGCCAAAGGTAAAAAAGACACAAATGTAAATGAATGTTTTCTGCAGTCAATTAGAAAAGCTTGTTCTGTTCTTCAGCTCCCATGGCCCGTCCGTCCACAAAATACTGCGGAGTCAGAGAGTTGTTCTAATAGCAAACCATCCACAAGTCTAGTTAGTTCTGTTTCAAGCATGGAAGAAGGGGTGAATTTTGATGCAAAAGAACTTAGTACAACAGATGCACCTTCATTAAGCAAGGTCTGTGAGTAAGAATTTGGCTATCTCTCTGGCCTATGTGCACATGCACTCACTCTAACACATACTGTACCTTAGTGTTTGTTGTTTTTAACTTCTTGATCTGTACGCTGACTATACATTTTTATTCAATTGGTTTAAATTATTTTGACATCTCTTTTATAGTTATTTGAGATCTGTCTACAACCTGATTGAATCTATTTTTGCTGATTCTCTTATGTGGTAATTGCCGTGGCAAAATACTCAATTCTTCATCGAAAGGATATTTTTTAGTGTCCTAAACTTCAATCATAAATTCGGAGTTTCTTTCCATTATTACATTTAGATTTAGACCTTAATTTCGTTCTCTCTTGGTTTTCCATGAAGGTGCAAGATGCTTGTAGCAATTCTGAAACTTTGACCAACGTACTTGATTTTAAGTTGTACAAACCAGATGACATGTTTGTGAAATTGGGCCTTCCTCTACCAAAGGATTTGGAATCTTTGCTTCAGGATGCCAGCAAGTCCTCTGTATCTTCAAAGAATGCGACAGATTTGCGTTCAGCAAAGCAACAATCTCGTCGAGCAATCTTGCAACCATTTCCGTGGTCACATTCTTTTAATGGGCACTCTAAAGCAAATTCTGATTCATCCAAGTTCTCTGCAAATAGAACCACATGTACCGGTAGGTGGTGGCGAGTCGGAAACTTTGCCAATATCCCCACTGCTACTGCTGATTGTTTTACAAAAAACTTGGAATCATTGACTTTTAACCAGAGTTTATTTCCTTCAACAATGAGAGTTGGTCCTGATGATGGAAAGTCCTCCTCTGTATCTGTTAATCATCATCAAAGTGGATGGGATTCCCTGTCTTCTGCAACTTGTTCGAAAGCTTCCTCTATGCTTGTGGGTAAGGATTTATTGGAAGGCATTTATGTCGCGTAATTGAGATACAGTATCCACCTTTGAACCATTCTGCTATTATTGTTTATTTTACTTAAATCGTGACCACTGCTTTTGGTCTACATACAGATTCTCGAGGGAAGATGAATCGTGAGGCAAATGGTATGACCTCCTTATTTTTCTGCTATGAATCAACTAAATCTTATTCTTGATTATTTTTATTTTTATTTTTTAAAGTTAGTAGTTAAGATCCAATAAATTATAAAAAAGTTGAACTGTGTCCGTAACTGACTTTTCAATGGCACCATAAGAACTACCCACTTAATTTCTTATATCACATAAATGACTTCTCTTAATGGGGTCCTTTCTCTGAAGGAACCAAAAAGAATCCACTGAAGGAAGATGCTTATTGAAGAGCTGAGGGAGAGAAAGATTGATCTTGAGAGAGAAAAGTAAGAGGAAAGTGAGATGAGGAAGAGAAGTCCTTGAGGAGTAATCTGGGTCTGTAGTTATAAACGAGAAGTTTTTTCAGAAGTTCAGTAGTTGAAGAAAGTGCTTTGCTAGTCTGAGAATTTTTGGCAAGTACAGAGAAGCCTGCTCTAGAAAACATTAAGCGTTTTAGAGGAAACTAAATCAAATCCCTTGCATTTATTTGTTATGCATAATTCAGAAATGTTTAAAACTTGGATTCTTAATCTCCCTAGAGCTATGAGTACGAACTCCTTAGTTTGGCTAGTGTGTCCATGTCTAACACGTTTTAGACACTTCAACACTTCGACATTTGTTGGATATGTATCAGATACTCTTTGGTGCAAAATCAATATGAGTTCTATATTTGTCCGACACTTATCGAACACTTATTAAGTATACTAAATAGACACATAGAACAAAAATCATTAAGTTTGAGAGTAAGACATATCAAACTCATTTATATCATATAAATGCCTTAACTTATTGACTTCGGATTTCTTTAGACACCTATCGTACACTTCTAAAGTATACTAAATACACATATAAGACGATAATCATTAAATTTGAGAGTGAAATACATCGAACTCATTTATTTAATCTTATAAATGTCTTAACTTATTGACTTTGGATTTCTTTATATAAACGTATCCTTGATGTGTCTATGTCATAATTTTTTTAAATAATGATGTGTTTCGTGTGATGTCATATCCACGTCCCGTTCTTCTTAGCGATAGAGAACTGTAGATATAATGTACGTCTCTCAAACATTAAGCCACTTTCCTAATCTTTTTTGCGAATGCTATCTCGCTGCAGCTTGAAACTTCATGAATTTTGTTGTGTCATTTGCATGCAACTAAAGCCTAATTGAATTTTTTTGTCTTCGGCTATTGATATTGCTTCTCTACAGAGCAGCACTGTCCAAGAGTAATGGCTGCTGCTCAAACTCTCTACGATATTGCAACTCGTGCTGCATTGAGGCAAAACATAGACGGTATAGTAAAGTGGCCAAAAAAGCCTTCACAAAAGTCCATGAAGGCTCGAAAGTTGAAATCAGAGGAAACCGAAGAATTATATGCCGCACCGACCACATACGGATTATGGTCCAACAATTCATTTAAAAACGAGGGCCATTTGCATCCCTCGAAGAAGCCAAAGCCAGGAACAGTAGAGAGCAGAAGAGACATTACTCAGACAAATAATAGAAAGGGACCATTGAATTGGGCTACGACCAAATCGAGTAGATCGTCCCCAAGTAAGTTCGTTAGAGATTCGGTTTCGGAAGCTAAACATTCAACCTCCGGCGTAGTAAAACAATCATCAATGATGCCTCCTCCAGCAACTCATTTGTCCAAGGCTAGCGAGGGACAACAAAAGACACGAAAGTTAATGCTCATGGATTGGAAAAGAGGAGGAGGAACAGGTTAGATTGATCAAGAAACTCTACATTATTATATCCCTTTCCAACAAAACGCTGCTATTAAATTAGAACCACTCCTCTGGGCTTAGGTGTGGCTAAACTGGTAGGTTGTTTGTGTACATAATATTGTTGTAAGTGGCTGATTGATATGATTATTGTAATGGTCCGTCCTGAATAGTGACACGCTTTCTAATTGTTGTAACATCACATCAACTTCATTCCAGAACGCAGTAGCAAATTCCTTGCCACTGTGTGAAAAAGTATTTGACGAGTCAATAG
mRNA sequence
ATGCTAAACACTCCCGTTTTTCTTCCTTTCTCCTTTCGTTTTCTCCCACCTAATTCACTTTTCTTGACTCACTCCCCGATCCCAGTCTTTCCGGTGCGATGTCCAAATGGTCTTCTGGCTTCGGTGATGGATGCTCTTGAGTTGAATTATCCGGTGGATGTGGCGGCGCCGAAGCTCATGGGACCTGACGGCTCTGTTAGAACCGGGATAATAATCGAGGAGGTTGAGTTGTGCGAAGCTGATCGTGGTTCTGCCCCTCCTAGCTATTCGTTTCAGCATTTTAGCTCGTACGGTAGTCAAAAAGTTGGGACGAGCTCTATCAATGATTTGGGTTCTGTTTCTTTGGATGAGATCCCTGATGGGGCAGTTTCTAAGGATGGTGAAGATACACCTGAAGATTTTGAGAGTAGGAACAAAAGAAGTCATTTGTCCACTTCAAGTCCAGGAGTACAGCAACGTAAATCGTTAAAGGTCTCAAGGAGTAGTAGCAGTAGTTTATGTTCTAAGAGGCGTTTGGTTCAATTGGAAGATTCTTTATTACTAAGTGGGGCTGATGAAGTAAAAGATACGTCTGACAAGCTTGGATCATATCTTAAAAAGTGCGGCTCTCATGAGAAGCCTCAATTGATGAAACAGAAAAGCAGCGTAAGCAGCAAGCGGGGTGATAAGAGAAATCTCAGTGTGTCATTGAAGACCAAATTTGATTCACTCCCTATAAATGCTGGAAATGGCTCAGCTACCGCAGGGTGCATTTTTTTCGGACTATATGGGCTGAAATCAGATGTTCATGATTTCACAAAGCTTACGGATGATCCAACGCTGAATGGTCTTCTTGATGGCAGTTATGACTATTCTAGTTTAAGTAAAGCCAAAGGTAAAAAAGACACAAATGTAAATGAATGTTTTCTGCAGTCAATTAGAAAAGCTTGTTCTGTTCTTCAGCTCCCATGGCCCGTCCGTCCACAAAATACTGCGGAGTCAGAGAGTTGTTCTAATAGCAAACCATCCACAAGTCTAGTTAGTTCTGTTTCAAGCATGGAAGAAGGGGTGAATTTTGATGCAAAAGAACTTAGTACAACAGATGCACCTTCATTAAGCAAGGTGCAAGATGCTTGTAGCAATTCTGAAACTTTGACCAACGTACTTGATTTTAAGTTGTACAAACCAGATGACATGTTTGTGAAATTGGGCCTTCCTCTACCAAAGGATTTGGAATCTTTGCTTCAGGATGCCAGCAAGTCCTCTGTATCTTCAAAGAATGCGACAGATTTGCGTTCAGCAAAGCAACAATCTCGTCGAGCAATCTTGCAACCATTTCCGTGGTCACATTCTTTTAATGGGCACTCTAAAGCAAATTCTGATTCATCCAAGTTCTCTGCAAATAGAACCACATGTACCGGTAGGTGGTGGCGAGTCGGAAACTTTGCCAATATCCCCACTGCTACTGCTGATTGTTTTACAAAAAACTTGGAATCATTGACTTTTAACCAGAGTTTATTTCCTTCAACAATGAGAGTTGGTCCTGATGATGGAAAGTCCTCCTCTGTATCTGTTAATCATCATCAAAGTGGATGGGATTCCCTGTCTTCTGCAACTTGTTCGAAAGCTTCCTCTATGCTTGTGGATTCTCGAGGGAAGATGAATCGTGAGGCAAATGAGCAGCACTGTCCAAGAGTAATGGCTGCTGCTCAAACTCTCTACGATATTGCAACTCGTGCTGCATTGAGGCAAAACATAGACGGTATAGTAAAGTGGCCAAAAAAGCCTTCACAAAAGTCCATGAAGGCTCGAAAGTTGAAATCAGAGGAAACCGAAGAATTATATGCCGCACCGACCACATACGGATTATGGTCCAACAATTCATTTAAAAACGAGGGCCATTTGCATCCCTCGAAGAAGCCAAAGCCAGGAACAGTAGAGAGCAGAAGAGACATTACTCAGACAAATAATAGAAAGGGACCATTGAATTGGGCTACGACCAAATCGAGTAGATCGTCCCCAAGTAAGTTCGTTAGAGATTCGGTTTCGGAAGCTAAACATTCAACCTCCGGCGTAGTAAAACAATCATCAATGATGCCTCCTCCAGCAACTCATTTGTCCAAGGCTAGCGAGGGACAACAAAAGACACGAAAGTTAATGCTCATGGATTGGAAAAGAGGAGGAGGAACAGGTTAGATTGATCAAGAAACTCTACATTATTATATCCCTTTCCAACAAAACGCTGCTATTAAATTAGAACCACTCCTCTGGGCTTAGGTGTGGCTAAACTGGTAGGTTGTTTGTGTACATAATATTGTTGTAAGTGGCTGATTGATATGATTATTGTAATGGTCCGTCCTGAATAGTGACACGCTTTCTAATTGTTGTAACATCACATCAACTTCATTCCAGAACGCAGTAGCAAATTCCTTGCCACTGTGTGAAAAAGTATTTGACGAGTCAATAG
Coding sequence (CDS)
ATGCTAAACACTCCCGTTTTTCTTCCTTTCTCCTTTCGTTTTCTCCCACCTAATTCACTTTTCTTGACTCACTCCCCGATCCCAGTCTTTCCGGTGCGATGTCCAAATGGTCTTCTGGCTTCGGTGATGGATGCTCTTGAGTTGAATTATCCGGTGGATGTGGCGGCGCCGAAGCTCATGGGACCTGACGGCTCTGTTAGAACCGGGATAATAATCGAGGAGGTTGAGTTGTGCGAAGCTGATCGTGGTTCTGCCCCTCCTAGCTATTCGTTTCAGCATTTTAGCTCGTACGGTAGTCAAAAAGTTGGGACGAGCTCTATCAATGATTTGGGTTCTGTTTCTTTGGATGAGATCCCTGATGGGGCAGTTTCTAAGGATGGTGAAGATACACCTGAAGATTTTGAGAGTAGGAACAAAAGAAGTCATTTGTCCACTTCAAGTCCAGGAGTACAGCAACGTAAATCGTTAAAGGTCTCAAGGAGTAGTAGCAGTAGTTTATGTTCTAAGAGGCGTTTGGTTCAATTGGAAGATTCTTTATTACTAAGTGGGGCTGATGAAGTAAAAGATACGTCTGACAAGCTTGGATCATATCTTAAAAAGTGCGGCTCTCATGAGAAGCCTCAATTGATGAAACAGAAAAGCAGCGTAAGCAGCAAGCGGGGTGATAAGAGAAATCTCAGTGTGTCATTGAAGACCAAATTTGATTCACTCCCTATAAATGCTGGAAATGGCTCAGCTACCGCAGGGTGCATTTTTTTCGGACTATATGGGCTGAAATCAGATGTTCATGATTTCACAAAGCTTACGGATGATCCAACGCTGAATGGTCTTCTTGATGGCAGTTATGACTATTCTAGTTTAAGTAAAGCCAAAGGTAAAAAAGACACAAATGTAAATGAATGTTTTCTGCAGTCAATTAGAAAAGCTTGTTCTGTTCTTCAGCTCCCATGGCCCGTCCGTCCACAAAATACTGCGGAGTCAGAGAGTTGTTCTAATAGCAAACCATCCACAAGTCTAGTTAGTTCTGTTTCAAGCATGGAAGAAGGGGTGAATTTTGATGCAAAAGAACTTAGTACAACAGATGCACCTTCATTAAGCAAGGTGCAAGATGCTTGTAGCAATTCTGAAACTTTGACCAACGTACTTGATTTTAAGTTGTACAAACCAGATGACATGTTTGTGAAATTGGGCCTTCCTCTACCAAAGGATTTGGAATCTTTGCTTCAGGATGCCAGCAAGTCCTCTGTATCTTCAAAGAATGCGACAGATTTGCGTTCAGCAAAGCAACAATCTCGTCGAGCAATCTTGCAACCATTTCCGTGGTCACATTCTTTTAATGGGCACTCTAAAGCAAATTCTGATTCATCCAAGTTCTCTGCAAATAGAACCACATGTACCGGTAGGTGGTGGCGAGTCGGAAACTTTGCCAATATCCCCACTGCTACTGCTGATTGTTTTACAAAAAACTTGGAATCATTGACTTTTAACCAGAGTTTATTTCCTTCAACAATGAGAGTTGGTCCTGATGATGGAAAGTCCTCCTCTGTATCTGTTAATCATCATCAAAGTGGATGGGATTCCCTGTCTTCTGCAACTTGTTCGAAAGCTTCCTCTATGCTTGTGGATTCTCGAGGGAAGATGAATCGTGAGGCAAATGAGCAGCACTGTCCAAGAGTAATGGCTGCTGCTCAAACTCTCTACGATATTGCAACTCGTGCTGCATTGAGGCAAAACATAGACGGTATAGTAAAGTGGCCAAAAAAGCCTTCACAAAAGTCCATGAAGGCTCGAAAGTTGAAATCAGAGGAAACCGAAGAATTATATGCCGCACCGACCACATACGGATTATGGTCCAACAATTCATTTAAAAACGAGGGCCATTTGCATCCCTCGAAGAAGCCAAAGCCAGGAACAGTAGAGAGCAGAAGAGACATTACTCAGACAAATAATAGAAAGGGACCATTGAATTGGGCTACGACCAAATCGAGTAGATCGTCCCCAAGTAAGTTCGTTAGAGATTCGGTTTCGGAAGCTAAACATTCAACCTCCGGCGTAGTAAAACAATCATCAATGATGCCTCCTCCAGCAACTCATTTGTCCAAGGCTAGCGAGGGACAACAAAAGACACGAAAGTTAATGCTCATGGATTGGAAAAGAGGAGGAGGAACAGGTTAG
Protein sequence
MLNTPVFLPFSFRFLPPNSLFLTHSPIPVFPVRCPNGLLASVMDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGSQKVGTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSSSSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGDKRNLSVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPTLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSTTDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTMRVGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQHCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGHLHPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGGGTG
Homology
BLAST of CmoCh04G007630 vs. ExPASy TrEMBL
Match:
A0A6J1FNQ1 (uncharacterized protein LOC111447442 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111447442 PE=4 SV=1)
HSP 1 Score: 1322.4 bits (3421), Expect = 0.0e+00
Identity = 682/682 (100.00%), Postives = 682/682 (100.00%), Query Frame = 0
Query: 43 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGSQKV 102
MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGSQKV
Sbjct: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGSQKV 60
Query: 103 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 162
GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS
Sbjct: 61 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
Query: 163 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 222
SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD
Sbjct: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
Query: 223 KRNLSVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPTLNGLLDGSY 282
KRNLSVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPTLNGLLDGSY
Sbjct: 181 KRNLSVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPTLNGLLDGSY 240
Query: 283 DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSLVSS 342
DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSLVSS
Sbjct: 241 DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSLVSS 300
Query: 343 VSSMEEGVNFDAKELSTTDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK 402
VSSMEEGVNFDAKELSTTDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK
Sbjct: 301 VSSMEEGVNFDAKELSTTDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK 360
Query: 403 DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR 462
DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR
Sbjct: 361 DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR 420
Query: 463 TTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTMRVGPDDGKSSSVSVNHHQ 522
TTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTMRVGPDDGKSSSVSVNHHQ
Sbjct: 421 TTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTMRVGPDDGKSSSVSVNHHQ 480
Query: 523 SGWDSLSSATCSKASSMLVDSRGKMNREANEQHCPRVMAAAQTLYDIATRAALRQNIDGI 582
SGWDSLSSATCSKASSMLVDSRGKMNREANEQHCPRVMAAAQTLYDIATRAALRQNIDGI
Sbjct: 481 SGWDSLSSATCSKASSMLVDSRGKMNREANEQHCPRVMAAAQTLYDIATRAALRQNIDGI 540
Query: 583 VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGHLHPSKKPKPGTVESR 642
VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGHLHPSKKPKPGTVESR
Sbjct: 541 VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGHLHPSKKPKPGTVESR 600
Query: 643 RDITQTNNRKGPLNWATTKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA 702
RDITQTNNRKGPLNWATTKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA
Sbjct: 601 RDITQTNNRKGPLNWATTKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA 660
Query: 703 SEGQQKTRKLMLMDWKRGGGTG 725
SEGQQKTRKLMLMDWKRGGGTG
Sbjct: 661 SEGQQKTRKLMLMDWKRGGGTG 682
BLAST of CmoCh04G007630 vs. ExPASy TrEMBL
Match:
A0A6J1INL3 (uncharacterized protein LOC111479123 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111479123 PE=4 SV=1)
HSP 1 Score: 1291.2 bits (3340), Expect = 0.0e+00
Identity = 668/679 (98.38%), Postives = 669/679 (98.53%), Query Frame = 0
Query: 43 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGSQKV 102
MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEV+LCEADRGSAPPSYSFQHFSSYGSQKV
Sbjct: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVQLCEADRGSAPPSYSFQHFSSYGSQKV 60
Query: 103 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 162
GTSSINDLGSVSLDEI DGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS
Sbjct: 61 GTSSINDLGSVSLDEIHDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
Query: 163 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 222
SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD
Sbjct: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
Query: 223 KRNLSVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPTLNGLLDGSY 282
KRNL VSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDP LN LLDGSY
Sbjct: 181 KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPPLNDLLDGSY 240
Query: 283 DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSLVSS 342
DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPV PQNTAESESCSNSKPSTSLVSS
Sbjct: 241 DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVHPQNTAESESCSNSKPSTSLVSS 300
Query: 343 VSSMEEGVNFDAKELSTTDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK 402
VSSMEEGVNFDAKELSTTDAP LSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK
Sbjct: 301 VSSMEEGVNFDAKELSTTDAPLLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK 360
Query: 403 DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR 462
DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR
Sbjct: 361 DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR 420
Query: 463 TTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTMRVGPDDGKSSSVSVNHHQ 522
TTCTGRWWRVGNFANIPTATADCFTKNLESL FNQSLFPSTMRVGPDDGKSSSVSVNHHQ
Sbjct: 421 TTCTGRWWRVGNFANIPTATADCFTKNLESLIFNQSLFPSTMRVGPDDGKSSSVSVNHHQ 480
Query: 523 SGWDSLSSATCSKASSMLVDSRGKMNREANEQHCPRVMAAAQTLYDIATRAALRQNIDGI 582
SGWDSLSSATCSKASSMLVDSRGKMNREANE HCPRVMAAAQTLYDIATRAALRQNIDGI
Sbjct: 481 SGWDSLSSATCSKASSMLVDSRGKMNREANEPHCPRVMAAAQTLYDIATRAALRQNIDGI 540
Query: 583 VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGHLHPSKKPKPGTVESR 642
VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEG LHPSKKPKPGTVESR
Sbjct: 541 VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGQLHPSKKPKPGTVESR 600
Query: 643 RDITQTNNRKGPLNWATTKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA 702
RDITQTNNRKGPLNWA TKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA
Sbjct: 601 RDITQTNNRKGPLNWAATKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA 660
Query: 703 SEGQQKTRKLMLMDWKRGG 722
SEGQQKTRKLMLMDWKRGG
Sbjct: 661 SEGQQKTRKLMLMDWKRGG 679
BLAST of CmoCh04G007630 vs. ExPASy TrEMBL
Match:
A0A6J1C5T9 (uncharacterized protein LOC111008234 OS=Momordica charantia OX=3673 GN=LOC111008234 PE=4 SV=1)
HSP 1 Score: 1103.6 bits (2853), Expect = 0.0e+00
Identity = 582/688 (84.59%), Postives = 623/688 (90.55%), Query Frame = 0
Query: 43 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGSQKV 102
MDA+EL YPVDVAAPKLMGPDGSVRTG+ IEEVELCE+DR SAPPSYSFQHFSSYGSQK
Sbjct: 1 MDAVELTYPVDVAAPKLMGPDGSVRTGVTIEEVELCESDRVSAPPSYSFQHFSSYGSQKA 60
Query: 103 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 162
GTSSIND+GSVSLD+IPDGAVSKDGE T ED ESRNKRS L TSSPGVQQRKSLKVSRSS
Sbjct: 61 GTSSINDVGSVSLDKIPDGAVSKDGEGTSEDLESRNKRSLLFTSSPGVQQRKSLKVSRSS 120
Query: 163 SSSLCSKR-RLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSH--EKPQLMKQKSSVSSK 222
SSSLCSKR R+V+LEDSL LSGAD+VKDTSDKLGSYLKKC SH EK QL+KQKSS+SSK
Sbjct: 121 SSSLCSKRPRVVRLEDSLFLSGADDVKDTSDKLGSYLKKCSSHETEKAQLLKQKSSLSSK 180
Query: 223 RGDKRNLSVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPTLNGLLD 282
RGDKRNL VSLKTKFDSL INAGNGSA AG F LYGLKSDVHDFTKL DDP LN LLD
Sbjct: 181 RGDKRNLKVSLKTKFDSLSINAGNGSAAAGSSFLALYGLKSDVHDFTKLVDDPPLNDLLD 240
Query: 283 GSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSL 342
GSYD +SLS KGKKDTNVNECFLQS+RKACSVLQLPWPV PQN AESE CSNSKPSTS+
Sbjct: 241 GSYDSASLSIDKGKKDTNVNECFLQSVRKACSVLQLPWPVHPQNIAESEGCSNSKPSTSI 300
Query: 343 VSSVSSMEEGVNFDAKELSTTDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLP 402
VS VSSMEEGVNFD KE TD+PSL+KV+DACSNSETLTN LDFKLYKPDDMFVK+GLP
Sbjct: 301 VSYVSSMEEGVNFDVKEPIATDSPSLNKVRDACSNSETLTNPLDFKLYKPDDMFVKMGLP 360
Query: 403 LPKDLESLLQDASKSSV-SSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKF 462
LPKDLESLLQDASKSSV SSKN TDLRSAKQQSRRA+LQPFPWSHSFNGHSK+NSDSSKF
Sbjct: 361 LPKDLESLLQDASKSSVSSSKNVTDLRSAKQQSRRAMLQPFPWSHSFNGHSKSNSDSSKF 420
Query: 463 SANRTTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTMR-VGPDDGKSSSVS 522
SANRTTC GRWWR+GNF++IP+ATADCFTK+LESLTFNQSLFPSTMR VGPDD +SSSVS
Sbjct: 421 SANRTTCPGRWWRIGNFSSIPSATADCFTKDLESLTFNQSLFPSTMRVVGPDDRRSSSVS 480
Query: 523 VNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQHCPRVMAAAQTLYDIATRAALRQ 582
VNHHQ GWDSLSSA CSKASS+LV+SRGK N EAN+Q CP+V+AAA+TLYDIAT AA RQ
Sbjct: 481 VNHHQCGWDSLSSAICSKASSVLVESRGKTNYEANDQQCPKVIAAAKTLYDIATYAASRQ 540
Query: 583 NIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGHLHPSKKPKPG 642
NIDGIV+WPKKPSQKSM+ARKLKSEETEELYAAP TYGLWS+N FK+EGH+H SKKPK G
Sbjct: 541 NIDGIVRWPKKPSQKSMRARKLKSEETEELYAAP-TYGLWSDNPFKSEGHMHSSKKPKLG 600
Query: 643 TVESRRDITQTNNRKGPLNWATTKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPAT 702
T ESRRD+ TN R+GPLNWAT +SSRSSPSKFVRDS S+AKHSTSG+VK SSMMPPPAT
Sbjct: 601 TTESRRDLAHTNCRRGPLNWATPRSSRSSPSKFVRDSASDAKHSTSGIVKPSSMMPPPAT 660
Query: 703 H-LSKASEGQQKTRKLMLMDWKRGGGTG 725
L K EGQQKTRKLMLMDWKRGGGTG
Sbjct: 661 TLLCKGGEGQQKTRKLMLMDWKRGGGTG 687
BLAST of CmoCh04G007630 vs. ExPASy TrEMBL
Match:
A0A6J1EYC2 (uncharacterized protein LOC111437570 OS=Cucurbita moschata OX=3662 GN=LOC111437570 PE=4 SV=1)
HSP 1 Score: 1067.4 bits (2759), Expect = 2.7e-308
Identity = 567/683 (83.02%), Postives = 603/683 (88.29%), Query Frame = 0
Query: 43 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGSQKV 102
M ALEL PVDV KLMGPDGSVRTG+ IEEVELCEADRGSAPPSYSFQHFSSYG +K
Sbjct: 1 MYALELTCPVDVVVSKLMGPDGSVRTGVTIEEVELCEADRGSAPPSYSFQHFSSYGCKKD 60
Query: 103 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 162
GTSSINDLG VSLD++PDGAV KDGE+T EDFESRNKRSHLSTSS GVQ RK LKVSR
Sbjct: 61 GTSSINDLGPVSLDKVPDGAVFKDGENTSEDFESRNKRSHLSTSSLGVQPRKPLKVSR-G 120
Query: 163 SSSLCSKR-RLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRG 222
SSSLCSKR R+VQLED L LSGAD D SDKLGSYLKKC SHEK QL+KQKSS+SSKRG
Sbjct: 121 SSSLCSKRPRVVQLEDPLFLSGAD---DVSDKLGSYLKKCNSHEKTQLLKQKSSLSSKRG 180
Query: 223 DKRNLSVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPTLNGLLDGS 282
DKRNL VSLKTKFDS NAGNGSA AG F GLYGLKS DFTKLTDDP LN +LDGS
Sbjct: 181 DKRNLKVSLKTKFDSFSTNAGNGSAAAGSSFHGLYGLKSGARDFTKLTDDPPLNDILDGS 240
Query: 283 YDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSLVS 342
YD ++LSK KGKKDTNVNECFLQSIRKACSVLQLPWPVRPQN AESESCSNSKP TSLVS
Sbjct: 241 YDCANLSKDKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNMAESESCSNSKPDTSLVS 300
Query: 343 SVSSMEEGVNFDAKELSTTDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLP 402
SVSSMEE VNFD KELS TD+PSL+KV+DAC+NSE LTN LDFKLYKPD MF+KLGLP+P
Sbjct: 301 SVSSMEEKVNFDVKELSATDSPSLNKVEDACNNSEPLTNALDFKLYKPDHMFMKLGLPIP 360
Query: 403 KDLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSAN 462
KDL SLLQDASKSSVSS NATDLRSAKQQSRRA+LQPF WSHSFNGHSKANSDSSKFSAN
Sbjct: 361 KDLNSLLQDASKSSVSSNNATDLRSAKQQSRRAMLQPFAWSHSFNGHSKANSDSSKFSAN 420
Query: 463 RTTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTMRV-GPDDGKSSSVSVNH 522
RTTC GRWWRV NF+NIP+ATADCFTK+LESLTFNQSLFPSTMRV GPDDG+ SS+SVNH
Sbjct: 421 RTTCLGRWWRVRNFSNIPSATADCFTKDLESLTFNQSLFPSTMRVIGPDDGR-SSISVNH 480
Query: 523 HQSGWDSLSSATCSKASSMLVDSRGKMNREANEQHCPRVMAAAQTLYDIATRAALRQNID 582
HQ GWDSLSSATCSK SS+LV+SRGKMN E+ EQ CPRVMAAAQTLYDIAT AALRQNID
Sbjct: 481 HQCGWDSLSSATCSKTSSVLVESRGKMNSESYEQQCPRVMAAAQTLYDIATSAALRQNID 540
Query: 583 GIVKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEG-HLHPSKKPKPGTV 642
G+V+WPKK SQKSM+ARKLKSEETEELY PTTYGLWSNNS KNEG H HPSKKPK GT
Sbjct: 541 GMVRWPKKASQKSMRARKLKSEETEELYTTPTTYGLWSNNSIKNEGHHAHPSKKPKLGTT 600
Query: 643 ESRRDITQTNNRKGPLNWATTKSSRSSPSKFVRDSVSEAKHSTSGVVKQ-SSMMPPPATH 702
ESRRD+ QTN ++GPLNW T +SSRSSPSKF+RDSVSEAK ST+G +KQ SSMMPPPAT
Sbjct: 601 ESRRDVAQTNCKRGPLNWTTPRSSRSSPSKFIRDSVSEAKPSTAGAIKQSSSMMPPPATL 660
Query: 703 LSKASEGQQKTRKLMLMDWKRGG 722
L KA EGQQKTRKLMLMDWKRGG
Sbjct: 661 LCKAGEGQQKTRKLMLMDWKRGG 678
BLAST of CmoCh04G007630 vs. ExPASy TrEMBL
Match:
A0A6J1JD46 (uncharacterized protein LOC111483405 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111483405 PE=4 SV=1)
HSP 1 Score: 1057.4 bits (2733), Expect = 2.8e-305
Identity = 558/666 (83.78%), Postives = 594/666 (89.19%), Query Frame = 0
Query: 60 MGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGSQKVGTSSINDLGSVSLDEIP 119
MGPDGSVRTG+ IEEVELCEADRGSAPPSYSFQHFSSYGS+K GTSSINDLG VSLD++P
Sbjct: 1 MGPDGSVRTGVTIEEVELCEADRGSAPPSYSFQHFSSYGSKKDGTSSINDLGPVSLDKVP 60
Query: 120 DGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSSSSSLCSKR-RLVQLEDS 179
DGAV KDGE+T EDFESRNKRSHLSTSSPGVQ RK LKVSR SSSLCSKR R+VQLED
Sbjct: 61 DGAVFKDGENTYEDFESRNKRSHLSTSSPGVQPRKPLKVSR-GSSSLCSKRPRVVQLEDP 120
Query: 180 LLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGDKRNLSVSLKTKFDSLP 239
L LSGAD D SDKLGSYLKKC SHEK QL+KQKSS+SSKRGDKRNL VSLKTKFDS
Sbjct: 121 LFLSGAD---DVSDKLGSYLKKCNSHEKTQLLKQKSSLSSKRGDKRNLKVSLKTKFDSFS 180
Query: 240 INAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPTLNGLLDGSYDYSSLSKAKGKKDTNV 299
NAGNGSA AG F GLYGLKS DFTKLTDDP LN +L+GSYDY++LSK KGKKDTNV
Sbjct: 181 TNAGNGSAAAGSSFHGLYGLKSGARDFTKLTDDPPLNDILNGSYDYANLSKDKGKKDTNV 240
Query: 300 NECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELS 359
NECFLQSIRKACSVLQLPWPVRPQN AESESCSNSKP TSLVSSVSSMEE VNFD KELS
Sbjct: 241 NECFLQSIRKACSVLQLPWPVRPQNMAESESCSNSKPDTSLVSSVSSMEEKVNFDVKELS 300
Query: 360 TTDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSS 419
TD+PSL+KV+DAC NSE LTN LDFKLYKPD MF+KLGLP+PKDL SLLQDASKSSVSS
Sbjct: 301 ATDSPSLNKVEDACDNSEPLTNALDFKLYKPDHMFMKLGLPIPKDLNSLLQDASKSSVSS 360
Query: 420 KNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANI 479
NATDLRSAKQQSRRA+LQPF WSHSFNGHSKANSDSSKFSANRTTC GRWWRV NF+NI
Sbjct: 361 NNATDLRSAKQQSRRAMLQPFAWSHSFNGHSKANSDSSKFSANRTTCLGRWWRVRNFSNI 420
Query: 480 PTATADCFTKNLESLTFNQSLFPSTMR-VGPDDGKSSSVSVNHHQSGWDSLSSATCSKAS 539
P+ATADCFTK+LESLTFNQSLFPSTMR VGPDDG+ SS+SVNHHQ GWDSLSSATCSK S
Sbjct: 421 PSATADCFTKDLESLTFNQSLFPSTMRVVGPDDGR-SSISVNHHQCGWDSLSSATCSKTS 480
Query: 540 SMLVDSRGKMNREANEQHCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKAR 599
S+LV+SRGKMN EANEQ CPRVMAAAQTLYDIAT AALRQNIDG+V+WPKK SQKSM+AR
Sbjct: 481 SVLVESRGKMNNEANEQQCPRVMAAAQTLYDIATSAALRQNIDGMVRWPKKASQKSMRAR 540
Query: 600 KLKSEETEELYAAPTTYGLWSNNSFKNEG-HLHPSKKPKPGTVESRRDITQTNNRKGPLN 659
KLKSEETEELY PTTYGLWSNNS KNEG H HPSKKPK GT ESRRD+ QTN ++GPLN
Sbjct: 541 KLKSEETEELYTTPTTYGLWSNNSIKNEGHHAHPSKKPKLGTTESRRDVAQTNCKRGPLN 600
Query: 660 WATTKSSRSSPSKFVRDSVSEAKHSTSGVVKQ-SSMMPPPATHLSKASEGQQKTRKLMLM 719
W T +SSRSSPSKF+RDS+SEAK ST+G +KQ SSMMPPPAT L KA EGQQ TRKLMLM
Sbjct: 601 WTTPRSSRSSPSKFIRDSISEAKPSTAGAIKQSSSMMPPPATLLCKAGEGQQNTRKLMLM 660
Query: 720 DWKRGG 722
DWKRGG
Sbjct: 661 DWKRGG 661
BLAST of CmoCh04G007630 vs. NCBI nr
Match:
XP_022942381.1 (uncharacterized protein LOC111447442 isoform X1 [Cucurbita moschata])
HSP 1 Score: 1322.4 bits (3421), Expect = 0.0e+00
Identity = 682/682 (100.00%), Postives = 682/682 (100.00%), Query Frame = 0
Query: 43 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGSQKV 102
MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGSQKV
Sbjct: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGSQKV 60
Query: 103 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 162
GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS
Sbjct: 61 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
Query: 163 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 222
SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD
Sbjct: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
Query: 223 KRNLSVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPTLNGLLDGSY 282
KRNLSVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPTLNGLLDGSY
Sbjct: 181 KRNLSVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPTLNGLLDGSY 240
Query: 283 DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSLVSS 342
DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSLVSS
Sbjct: 241 DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSLVSS 300
Query: 343 VSSMEEGVNFDAKELSTTDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK 402
VSSMEEGVNFDAKELSTTDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK
Sbjct: 301 VSSMEEGVNFDAKELSTTDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK 360
Query: 403 DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR 462
DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR
Sbjct: 361 DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR 420
Query: 463 TTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTMRVGPDDGKSSSVSVNHHQ 522
TTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTMRVGPDDGKSSSVSVNHHQ
Sbjct: 421 TTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTMRVGPDDGKSSSVSVNHHQ 480
Query: 523 SGWDSLSSATCSKASSMLVDSRGKMNREANEQHCPRVMAAAQTLYDIATRAALRQNIDGI 582
SGWDSLSSATCSKASSMLVDSRGKMNREANEQHCPRVMAAAQTLYDIATRAALRQNIDGI
Sbjct: 481 SGWDSLSSATCSKASSMLVDSRGKMNREANEQHCPRVMAAAQTLYDIATRAALRQNIDGI 540
Query: 583 VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGHLHPSKKPKPGTVESR 642
VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGHLHPSKKPKPGTVESR
Sbjct: 541 VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGHLHPSKKPKPGTVESR 600
Query: 643 RDITQTNNRKGPLNWATTKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA 702
RDITQTNNRKGPLNWATTKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA
Sbjct: 601 RDITQTNNRKGPLNWATTKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA 660
Query: 703 SEGQQKTRKLMLMDWKRGGGTG 725
SEGQQKTRKLMLMDWKRGGGTG
Sbjct: 661 SEGQQKTRKLMLMDWKRGGGTG 682
BLAST of CmoCh04G007630 vs. NCBI nr
Match:
XP_023535222.1 (uncharacterized protein LOC111796709 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1315.8 bits (3404), Expect = 0.0e+00
Identity = 678/682 (99.41%), Postives = 679/682 (99.56%), Query Frame = 0
Query: 43 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGSQKV 102
MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGSQKV
Sbjct: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGSQKV 60
Query: 103 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 162
GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS
Sbjct: 61 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
Query: 163 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 222
SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD
Sbjct: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
Query: 223 KRNLSVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPTLNGLLDGSY 282
KRNL VSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDP LNGLLDGSY
Sbjct: 181 KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPPLNGLLDGSY 240
Query: 283 DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSLVSS 342
DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSLVSS
Sbjct: 241 DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSLVSS 300
Query: 343 VSSMEEGVNFDAKELSTTDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK 402
VSSMEEGVNFDAKELSTTDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK
Sbjct: 301 VSSMEEGVNFDAKELSTTDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK 360
Query: 403 DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR 462
DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR
Sbjct: 361 DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR 420
Query: 463 TTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTMRVGPDDGKSSSVSVNHHQ 522
TTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTMRVGPDDGKSSSVSVNHHQ
Sbjct: 421 TTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTMRVGPDDGKSSSVSVNHHQ 480
Query: 523 SGWDSLSSATCSKASSMLVDSRGKMNREANEQHCPRVMAAAQTLYDIATRAALRQNIDGI 582
SGWDSLSSATCSKASSMLVD RGKMNREANEQHCPRVMAAAQTLYDIATRAALRQNIDGI
Sbjct: 481 SGWDSLSSATCSKASSMLVDFRGKMNREANEQHCPRVMAAAQTLYDIATRAALRQNIDGI 540
Query: 583 VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGHLHPSKKPKPGTVESR 642
VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGHLHPSKKPKPGTVESR
Sbjct: 541 VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGHLHPSKKPKPGTVESR 600
Query: 643 RDITQTNNRKGPLNWATTKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA 702
RDITQTNNRKGPLNWATTKSSRSSPSKF+RDSVSEAKHSTSGVVKQSSMMPPPATHLSKA
Sbjct: 601 RDITQTNNRKGPLNWATTKSSRSSPSKFIRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA 660
Query: 703 SEGQQKTRKLMLMDWKRGGGTG 725
SEGQQKTRKLMLMDWKRGGGTG
Sbjct: 661 SEGQQKTRKLMLMDWKRGGGTG 682
BLAST of CmoCh04G007630 vs. NCBI nr
Match:
KAG6600547.1 (hypothetical protein SDJN03_05780, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 1312.0 bits (3394), Expect = 0.0e+00
Identity = 677/685 (98.83%), Postives = 679/685 (99.12%), Query Frame = 0
Query: 40 ASVMDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGS 99
ASVMDALELNYPVDVAAPKLMGPDGSVRTGIII+EVELCEADRGSAPPSYSFQHFSSYGS
Sbjct: 12 ASVMDALELNYPVDVAAPKLMGPDGSVRTGIIIDEVELCEADRGSAPPSYSFQHFSSYGS 71
Query: 100 QKVGTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVS 159
QKVGTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVS
Sbjct: 72 QKVGTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVS 131
Query: 160 RSSSSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSK 219
RSSSSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSK
Sbjct: 132 RSSSSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSK 191
Query: 220 RGDKRNLSVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPTLNGLLD 279
RGDKRNL VSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDP LNGLLD
Sbjct: 192 RGDKRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPPLNGLLD 251
Query: 280 GSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSL 339
GSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSL
Sbjct: 252 GSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSL 311
Query: 340 VSSVSSMEEGVNFDAKELSTTDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLP 399
VSSVSSMEEGVNFDAKELS TDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLP
Sbjct: 312 VSSVSSMEEGVNFDAKELSATDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLP 371
Query: 400 LPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFS 459
LPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFS
Sbjct: 372 LPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFS 431
Query: 460 ANRTTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTMRVGPDDGKSSSVSVN 519
ANRTTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTM VGPDDGKSSSVSVN
Sbjct: 432 ANRTTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTMGVGPDDGKSSSVSVN 491
Query: 520 HHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQHCPRVMAAAQTLYDIATRAALRQNI 579
HHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQHCPRVMAAAQTLYDIATRAA RQNI
Sbjct: 492 HHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQHCPRVMAAAQTLYDIATRAASRQNI 551
Query: 580 DGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGHLHPSKKPKPGTV 639
DGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGHLHPSKKPKPGTV
Sbjct: 552 DGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGHLHPSKKPKPGTV 611
Query: 640 ESRRDITQTNNRKGPLNWATTKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHL 699
ESRRDITQTNNRKGPLNWATTKSSRSSPSKF RDSVSEAKHSTSG+VKQSSMMPPPATHL
Sbjct: 612 ESRRDITQTNNRKGPLNWATTKSSRSSPSKFFRDSVSEAKHSTSGLVKQSSMMPPPATHL 671
Query: 700 SKASEGQQKTRKLMLMDWKRGGGTG 725
SKASEGQQKTRKLMLMDWKRGGGTG
Sbjct: 672 SKASEGQQKTRKLMLMDWKRGGGTG 696
BLAST of CmoCh04G007630 vs. NCBI nr
Match:
KAG7031186.1 (hypothetical protein SDJN02_05226 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1294.6 bits (3349), Expect = 0.0e+00
Identity = 675/710 (95.07%), Postives = 677/710 (95.35%), Query Frame = 0
Query: 43 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGSQKV 102
MDALELNYPVDVAAPKLMGPDGSVRTGIII+EVELCEADRGSAPPSYSFQHFSSYGSQKV
Sbjct: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIDEVELCEADRGSAPPSYSFQHFSSYGSQKV 60
Query: 103 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 162
GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS
Sbjct: 61 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
Query: 163 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 222
SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD
Sbjct: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
Query: 223 KRNLSVSLKTKFDSLPINAGNGSATAGCIFF----------------------------G 282
KRNL VSLKTKFDSLPINAGNGSATAGCIFF G
Sbjct: 181 KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGSHWFLLLCTLLCITQHKIIYGPCEMFTG 240
Query: 283 LYGLKSDVHDFTKLTDDPTLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ 342
LYGLKSDVHDFTKLTDDP LNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ
Sbjct: 241 LYGLKSDVHDFTKLTDDPPLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQ 300
Query: 343 LPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSTTDAPSLSKVQDACSN 402
LPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELS TDAPSLSKVQDACSN
Sbjct: 301 LPWPVRPQNTAESESCSNSKPSTSLVSSVSSMEEGVNFDAKELSATDAPSLSKVQDACSN 360
Query: 403 SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA 462
SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA
Sbjct: 361 SETLTNVLDFKLYKPDDMFVKLGLPLPKDLESLLQDASKSSVSSKNATDLRSAKQQSRRA 420
Query: 463 ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLT 522
ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLT
Sbjct: 421 ILQPFPWSHSFNGHSKANSDSSKFSANRTTCTGRWWRVGNFANIPTATADCFTKNLESLT 480
Query: 523 FNQSLFPSTMRVGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQ 582
FNQSLFPSTM VGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQ
Sbjct: 481 FNQSLFPSTMGVGPDDGKSSSVSVNHHQSGWDSLSSATCSKASSMLVDSRGKMNREANEQ 540
Query: 583 HCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY 642
HCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY
Sbjct: 541 HCPRVMAAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLKSEETEELYAAPTTY 600
Query: 643 GLWSNNSFKNEGHLHPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSKFVRDS 702
GLWSNNSFKNEGHLHPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSKF RDS
Sbjct: 601 GLWSNNSFKNEGHLHPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSKFFRDS 660
Query: 703 VSEAKHSTSGVVKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGGGTG 725
VSEAKHSTSG+VKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGGGTG
Sbjct: 661 VSEAKHSTSGLVKQSSMMPPPATHLSKASEGQQKTRKLMLMDWKRGGGTG 710
BLAST of CmoCh04G007630 vs. NCBI nr
Match:
XP_022979382.1 (uncharacterized protein LOC111479123 isoform X1 [Cucurbita maxima])
HSP 1 Score: 1291.2 bits (3340), Expect = 0.0e+00
Identity = 668/679 (98.38%), Postives = 669/679 (98.53%), Query Frame = 0
Query: 43 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVELCEADRGSAPPSYSFQHFSSYGSQKV 102
MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEV+LCEADRGSAPPSYSFQHFSSYGSQKV
Sbjct: 1 MDALELNYPVDVAAPKLMGPDGSVRTGIIIEEVQLCEADRGSAPPSYSFQHFSSYGSQKV 60
Query: 103 GTSSINDLGSVSLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 162
GTSSINDLGSVSLDEI DGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS
Sbjct: 61 GTSSINDLGSVSLDEIHDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQRKSLKVSRSS 120
Query: 163 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 222
SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD
Sbjct: 121 SSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQKSSVSSKRGD 180
Query: 223 KRNLSVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPTLNGLLDGSY 282
KRNL VSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDP LN LLDGSY
Sbjct: 181 KRNLKVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDPPLNDLLDGSY 240
Query: 283 DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSNSKPSTSLVSS 342
DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPV PQNTAESESCSNSKPSTSLVSS
Sbjct: 241 DYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVHPQNTAESESCSNSKPSTSLVSS 300
Query: 343 VSSMEEGVNFDAKELSTTDAPSLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK 402
VSSMEEGVNFDAKELSTTDAP LSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK
Sbjct: 301 VSSMEEGVNFDAKELSTTDAPLLSKVQDACSNSETLTNVLDFKLYKPDDMFVKLGLPLPK 360
Query: 403 DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR 462
DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR
Sbjct: 361 DLESLLQDASKSSVSSKNATDLRSAKQQSRRAILQPFPWSHSFNGHSKANSDSSKFSANR 420
Query: 463 TTCTGRWWRVGNFANIPTATADCFTKNLESLTFNQSLFPSTMRVGPDDGKSSSVSVNHHQ 522
TTCTGRWWRVGNFANIPTATADCFTKNLESL FNQSLFPSTMRVGPDDGKSSSVSVNHHQ
Sbjct: 421 TTCTGRWWRVGNFANIPTATADCFTKNLESLIFNQSLFPSTMRVGPDDGKSSSVSVNHHQ 480
Query: 523 SGWDSLSSATCSKASSMLVDSRGKMNREANEQHCPRVMAAAQTLYDIATRAALRQNIDGI 582
SGWDSLSSATCSKASSMLVDSRGKMNREANE HCPRVMAAAQTLYDIATRAALRQNIDGI
Sbjct: 481 SGWDSLSSATCSKASSMLVDSRGKMNREANEPHCPRVMAAAQTLYDIATRAALRQNIDGI 540
Query: 583 VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGHLHPSKKPKPGTVESR 642
VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEG LHPSKKPKPGTVESR
Sbjct: 541 VKWPKKPSQKSMKARKLKSEETEELYAAPTTYGLWSNNSFKNEGQLHPSKKPKPGTVESR 600
Query: 643 RDITQTNNRKGPLNWATTKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA 702
RDITQTNNRKGPLNWA TKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA
Sbjct: 601 RDITQTNNRKGPLNWAATKSSRSSPSKFVRDSVSEAKHSTSGVVKQSSMMPPPATHLSKA 660
Query: 703 SEGQQKTRKLMLMDWKRGG 722
SEGQQKTRKLMLMDWKRGG
Sbjct: 661 SEGQQKTRKLMLMDWKRGG 679
BLAST of CmoCh04G007630 vs. TAIR 10
Match:
AT1G64050.1 (unknown protein; Has 524 Blast hits to 342 proteins in 101 species: Archae - 0; Bacteria - 106; Metazoa - 106; Fungi - 24; Plants - 25; Viruses - 0; Other Eukaryotes - 263 (source: NCBI BLink). )
HSP 1 Score: 252.3 bits (643), Expect = 1.2e-66
Identity = 241/708 (34.04%), Postives = 349/708 (49.29%), Query Frame = 0
Query: 43 MDALELNYPVDVAAP-KLMGPDGSVRTGIIIEEVEL---CEADRGSAPPSYSFQHFSSYG 102
MD L+++ PVDV+ P KLMG +G G+ + + C+ R S + S + SS
Sbjct: 1 MDGLKISCPVDVSLPAKLMGSEG-CGGGVRVSSNKADNNCDKARVSIGVNSSIERCSSAS 60
Query: 103 SQKVGTSSINDLGSVSL--------DEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGV 162
K G S + SL D + D +++ E +N S +++
Sbjct: 61 INKKGAGSSSGASDSSLWRKLMHSHDFVHDRLTKLRVDNSS---EPQNGYSPIASPESAE 120
Query: 163 QQRKSLKVSRSSSSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLM 222
RK K+SRSSS + +R + L D + + D DT + G C +KP ++
Sbjct: 121 SPRKRGKLSRSSSGNGTPRRTKLILLDETVRTQRD--NDTKEICGQGSTSC--LDKPFVV 180
Query: 223 KQKSSVSSKRGDKRNLSVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTD 282
KQ+SS + KRGDKR V ++T + SAT FFG YGLK ++D TKL +
Sbjct: 181 KQRSSYNGKRGDKRISKVPVRTL-------STINSATGENAFFGAYGLKPAINDVTKLVE 240
Query: 283 DPTLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESC 342
D +L LL+GSY+ SL K K KK N N L ++ S+L PV+ Q++ E ++C
Sbjct: 241 DFSLKSLLEGSYECPSLGKDKMKKSENTNNTLLSVVKNVWSILPTKRPVQSQSSTELDTC 300
Query: 343 SNSKPSTSLVSSVSSMEEGVNFDAKELSTTDAPSLSKVQDACSNSETLTNVLDFKLYKPD 402
+ + S +++ N D +++ D S +D C NSE + L F L
Sbjct: 301 LSRTLGSPPSSISATLPNSENID--KVNALDGDLSSSSKDHCINSEIPSTPLSFPLCDAG 360
Query: 403 DMFVKLGLPLPKDLESLLQDASKSSVSSKNATD-LRSAKQQSRRAILQPFPWSHSFNGHS 462
D+ +LGLP KDL+SLLQDASK S +SKN D RSAK + L FPWS FNG S
Sbjct: 361 DVLKRLGLPPSKDLDSLLQDASKPSHNSKNNLDQQRSAKPP--HSGLPHFPWSQPFNGSS 420
Query: 463 KANSDSSKFSANRTTCTGRWWRVGNFA-NIPTATADCFTKNLESLTFNQSLFPSTMRVGP 522
+ NS+++K +T C GRW R+ + + + P D F NLESLTFNQ+L P +
Sbjct: 421 RTNSEAAKLVTCKTLCQGRWLRIADTSMSSPEGITDNFA-NLESLTFNQNLVPPLL---- 480
Query: 523 DDGKSSSVSVNHHQSGWDSLSSATCSKAS-SMLVDS-------RGKMNREANEQHCPRVM 582
K + V Q+ + + S C++AS S L +S G + E + CP+++
Sbjct: 481 ---KQTITGVKTSQTKFANTISCQCAEASVSTLQNSFFVPKEPEGSPDVEDDALSCPQLL 540
Query: 583 AAAQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLK--SEETEELYAAPTTYGLWS 642
AA+TL DIA ++A N +GI++WPKK SQKSMKARK K + E ++ L S
Sbjct: 541 EAARTLCDIAVQSANHDNPNGILRWPKKLSQKSMKARKSKLIEKPLERHRTTVSSIDLNS 600
Query: 643 NNSFKNEGHL-----------HPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSP 702
+N+ N+ H+ H PKP R ++ N+K + S SSP
Sbjct: 601 SNNNNNKNHVRKDSAAEHNHHHHHHHPKP---SKRLKLSTMENKK------RSFPSSSSP 660
Query: 703 SKFVRDSVSEAKHSTSGVVKQSSMM----PPPATHLSKASEGQQKTRK 712
+ S+ KHS+S K S M PPP L K+S QK RK
Sbjct: 661 IE------SDRKHSSSSKFKNHSRMMLPPPPPTRTLQKSSTYPQKARK 666
BLAST of CmoCh04G007630 vs. TAIR 10
Match:
AT1G64050.2 (unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages. )
HSP 1 Score: 252.3 bits (643), Expect = 1.2e-66
Identity = 242/706 (34.28%), Postives = 351/706 (49.72%), Query Frame = 0
Query: 43 MDALELNYPVDVAAP-KLMGPDGSVRTGIIIEEVEL---CEADRGSAPPSYSFQHFSSYG 102
MD L+++ PVDV+ P KLMG +G G+ + + C+ R S + S + SS
Sbjct: 1 MDGLKISCPVDVSLPAKLMGSEG-CGGGVRVSSNKADNNCDKARVSIGVNSSIERCSSAS 60
Query: 103 SQKVGTSSINDLGSV------SLDEIPDGAVSKDGEDTPEDFESRNKRSHLSTSSPGVQQ 162
K G+SS S+ S D + D +++ E +N S +++
Sbjct: 61 INKKGSSSGASDSSLWRKLMHSHDFVHDRLTKLRVDNSS---EPQNGYSPIASPESAESP 120
Query: 163 RKSLKVSRSSSSSLCSKRRLVQLEDSLLLSGADEVKDTSDKLGSYLKKCGSHEKPQLMKQ 222
RK K+SRSSS + +R + L D + + D DT + G C +KP ++KQ
Sbjct: 121 RKRGKLSRSSSGNGTPRRTKLILLDETVRTQRD--NDTKEICGQGSTSC--LDKPFVVKQ 180
Query: 223 KSSVSSKRGDKRNLSVSLKTKFDSLPINAGNGSATAGCIFFGLYGLKSDVHDFTKLTDDP 282
+SS + KRGDKR V ++T + SAT FFG YGLK ++D TKL +D
Sbjct: 181 RSSYNGKRGDKRISKVPVRTL-------STINSATGENAFFGAYGLKPAINDVTKLVEDF 240
Query: 283 TLNGLLDGSYDYSSLSKAKGKKDTNVNECFLQSIRKACSVLQLPWPVRPQNTAESESCSN 342
+L LL+GSY+ SL K K KK N N L ++ S+L PV+ Q++ E ++C +
Sbjct: 241 SLKSLLEGSYECPSLGKDKMKKSENTNNTLLSVVKNVWSILPTKRPVQSQSSTELDTCLS 300
Query: 343 SKPSTSLVSSVSSMEEGVNFDAKELSTTDAPSLSKVQDACSNSETLTNVLDFKLYKPDDM 402
+ S +++ N D +++ D S +D C NSE + L F L D+
Sbjct: 301 RTLGSPPSSISATLPNSENID--KVNALDGDLSSSSKDHCINSEIPSTPLSFPLCDAGDV 360
Query: 403 FVKLGLPLPKDLESLLQDASKSSVSSKNATD-LRSAKQQSRRAILQPFPWSHSFNGHSKA 462
+LGLP KDL+SLLQDASK S +SKN D RSAK + L FPWS FNG S+
Sbjct: 361 LKRLGLPPSKDLDSLLQDASKPSHNSKNNLDQQRSAKPP--HSGLPHFPWSQPFNGSSRT 420
Query: 463 NSDSSKFSANRTTCTGRWWRVGNFA-NIPTATADCFTKNLESLTFNQSLFPSTMRVGPDD 522
NS+++K +T C GRW R+ + + + P D F NLESLTFNQ+L P +
Sbjct: 421 NSEAAKLVTCKTLCQGRWLRIADTSMSSPEGITDNFA-NLESLTFNQNLVPPLL------ 480
Query: 523 GKSSSVSVNHHQSGWDSLSSATCSKAS-SMLVDS-------RGKMNREANEQHCPRVMAA 582
K + V Q+ + + S C++AS S L +S G + E + CP+++ A
Sbjct: 481 -KQTITGVKTSQTKFANTISCQCAEASVSTLQNSFFVPKEPEGSPDVEDDALSCPQLLEA 540
Query: 583 AQTLYDIATRAALRQNIDGIVKWPKKPSQKSMKARKLK--SEETEELYAAPTTYGLWSNN 642
A+TL DIA ++A N +GI++WPKK SQKSMKARK K + E ++ L S+N
Sbjct: 541 ARTLCDIAVQSANHDNPNGILRWPKKLSQKSMKARKSKLIEKPLERHRTTVSSIDLNSSN 600
Query: 643 SFKNEGHL-----------HPSKKPKPGTVESRRDITQTNNRKGPLNWATTKSSRSSPSK 702
+ N+ H+ H PKP R ++ N+K + S SSP +
Sbjct: 601 NNNNKNHVRKDSAAEHNHHHHHHHPKP---SKRLKLSTMENKK------RSFPSSSSPIE 660
Query: 703 FVRDSVSEAKHSTSGVVKQSSMM----PPPATHLSKASEGQQKTRK 712
S+ KHS+S K S M PPP L K+S QK RK
Sbjct: 661 ------SDRKHSSSSKFKNHSRMMLPPPPPTRTLQKSSTYPQKARK 664
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1FNQ1 | 0.0e+00 | 100.00 | uncharacterized protein LOC111447442 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1INL3 | 0.0e+00 | 98.38 | uncharacterized protein LOC111479123 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1C5T9 | 0.0e+00 | 84.59 | uncharacterized protein LOC111008234 OS=Momordica charantia OX=3673 GN=LOC111008... | [more] |
A0A6J1EYC2 | 2.7e-308 | 83.02 | uncharacterized protein LOC111437570 OS=Cucurbita moschata OX=3662 GN=LOC1114375... | [more] |
A0A6J1JD46 | 2.8e-305 | 83.78 | uncharacterized protein LOC111483405 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
Match Name | E-value | Identity | Description | |
XP_022942381.1 | 0.0e+00 | 100.00 | uncharacterized protein LOC111447442 isoform X1 [Cucurbita moschata] | [more] |
XP_023535222.1 | 0.0e+00 | 99.41 | uncharacterized protein LOC111796709 isoform X1 [Cucurbita pepo subsp. pepo] | [more] |
KAG6600547.1 | 0.0e+00 | 98.83 | hypothetical protein SDJN03_05780, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
KAG7031186.1 | 0.0e+00 | 95.07 | hypothetical protein SDJN02_05226 [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
XP_022979382.1 | 0.0e+00 | 98.38 | uncharacterized protein LOC111479123 isoform X1 [Cucurbita maxima] | [more] |
Match Name | E-value | Identity | Description | |
AT1G64050.1 | 1.2e-66 | 34.04 | unknown protein; Has 524 Blast hits to 342 proteins in 101 species: Archae - 0; ... | [more] |
AT1G64050.2 | 1.2e-66 | 34.28 | unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_c... | [more] |