Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
CTCCAATTGACTATCATCGCCATAGATACGAGGTTTTCCCGTGGAATCAAAGCGTGGTGCTATGGCTTCCAAACGCTCCTCCATTGTTCATCAACCTCAACCTCTACAAGCTGGCTTCCTCCACTTACCCCGCAAAAAACCTAAGAGCCTGCCGCCCTCGGATGAACTTGCATCCAAAGTTGGGGATAACATTCCGTATTATGCTGCGAAGGATCTTCGTATCAAGCGAGTTTTTTCTCCCAATTTTGATAATCGCTCTTCGCTGCCATCGGAAGGGCAGATTAGCGACAGAGAAGCGCGGATCACCCCCAATGGAACTTGTCCAAATGGAGATAGTGGAGTCGGTAAAAGCTCCATAACTGCAGAGGTACGGAATGAGAATTTTTGCAATTCTAGTGGGTATGTTGAATTGGGCGATGAGGATCGGAGATGTAGCGGGAAGATTGTGGAACGGGTGCATTCTACACCTCCTGATGCGGAGGTTATGGCTGGGGGTCTTGTGGCGGCTTCTTCAAATGGATGTCCTCGCTCAAGCAATGGGAGCGTTTTAGGGGATACTTGTGCGAAAGCTGACTGTAGAATTGACGCTGTTACGAGAACCGGATCGGTAAGTATCTTGTTTAAGTTGAATTCTCCCTCCATTGGTTTTATTTTACTATTATTGCTATTATTATTTTGCATTGATTTCTTACGAATGCTGTTCTGTTGGGTCAATCAATAGACCTCATGGGATTATTGTCATTAGTGAATGCTTGAAGAAATTAGAACTGCTAATACTCATGGCCACTTCGCACACTTATAGGTATTCTCTTGTTTATGTTGATCTAATATGGATCTTAAAATGATGAATTAGGTTCTCAAACCATGTTCTAAACGGAAGTTATTCAAGGCACCTGGTTCCATTGCCTACAAAAGATTGCTGCCCTTTTTGCCGGATGGCGATAATTGTAACTCCTGTTCCCGCTATGTTATTTACTATTTATCCAGGTTTATTTCGCACTAACTTATTTTAATTGCCAATATGCTACAGACATCCTACGAGGTGATTTGTGCTCAAAACGTGAAAAAAAATTGGAGAAGAAGGAAAATATTGAATCTAATCTGTGTAATCGTGCCAATGAGTCATCTTTTATTGATTCAGATACTAGTGTAAATAATGCAGTTTTGGCCTACGGCATATCATGTAACACTATGAAGCTAAATTCAACGCCTCCATATAATGGGGATGCTAAAAACTTTCAGAATGGCAGTGATTCAAGGAATGACCCGACTTTAGTCAAAGAGAATTCTGGTTTGAAAAGGGATGTAGTTTCTGTCTCTTCTCTTGACAAAAAGCTGACAGAGAATGGAGGGCCCTCAGAGCATCAGATAGAGGATCGATTCTCTAATGAACAATCCAAAACCTTCGTAATTGAGAGGCTTGATGGAGGAGATCCCTTTATATCTTCAAATCTTTCCTCAGAAGTAGACAACTTTAAGTCCCATGTTTCGGAGAAGTTGTGTAACAATGTTTCTGAGGATATCAAATCAGAAAATCATTCCAAGGTAGAAATCGAGATATCATCATTGGATTCTAACATTGCTTGTAACCTAGTGAAGGAAGAAAGGAAGAACGAAAAAGTGTCATGCACTCGAGGCACAGATCAAAATCTTGGTAGTTCCACTGTTGGTGAAAATGATTTCAACTTTGCTACAGAGAGTGACAAGAAATATGGCCCCTGTGTTAGAAACAAAGTGGTGAGATTCATGAAATGGTTTTACTTATATCTTTTTCTGCTACCTCCTTTAAACATTCTGCATGATTGCTTTTGGTGGGTGCCATAATTTGGAAATGATTGTTCTGCAGCCTTAGGGAATGACATCCAACTTGATCATCAAACTATTATTGTTGTTGTTTCATTATTATTATTATTATTGTGTGTGTGTGTATTACTACTAATATAATTAACTACTTATCAGGTTCGCAATCCACTTGTACAACTGAAGTCGAAATACAGCCA
AGTTTCAGTTAGCTTTCGTAGGATGCTTCCGTTCCTTACGGATCTTTTTAAAGATAACCCAGAGAACTGTATGCATCCACTCCCCCACGATAATCTTCATTCGATATAGTTGTGCTTGTTGATTCTGTTAAAGACTGATTTTTTTAACAATATTTTACATTCTCAGGTGCTTCGGGAAACATTGACTCTCCAAGACCGGAGAAAGAATTGCCAACTATGAATTTGCAATCACCGAGTTCAAATTCTCACAATTCCTGGGATAGATCAGAAGGCTTAGCATCTTGCAACATGCCATGCGATGGAAGTTTAGATACTCCCTCAATGCCTGGGTCGAATACTATGAACGAAATGGTTTGTGAAACAGAAAAAGTTCTATTGCATAATGGACTCAATGATGAACTTCTATCATCACCAAAATTACAGATGCATCACTTGCATTCTGAACAGGAGATGTTGGATACATGTATGTTGAAAGTGGACCCTCAATTGCATGATCAAGCAGTCTTATCTAGTTATGATCCGCTAACTGGAAAAGGATCTAGAATGGTCTCTCAACAATCACCAATCACTTCAGAAGTCTGCACAAATTTGACAGATAATGTTTCTGATGCCGCTAAACTTTCTGAGAGAAACAGCTTAGAACCTAATTCTTTATGCGTAGAAGGATGTGTTCCAGTAATTCGCATTAATGTTGGAAAGGGAATTCCTAAGCAAAACCCACGAGGATGCAGAGGAATCTGCAATTGTTTGAACTGTTCCTCTTTTCGTCTCCATGCTGAAAGAGCATTTGAATTTTCTAGAAATCAGCTGCAAGATGCTGAAGTAGTTGCTTCAGACTTGATGAAAGAATTGTCGTTTATCCGTGATGTGCTGGAAAAATGTTCTGATGGTGCATACGGTGATGCTGGATATTATTCAAATAAGGTAAGTGCTTAAATTAGTCATTGTGCTTTCCTACGCTTTTGCTTTGAAATTTTCCTTGCTATGAGCTATTGCTGTATTTTTGTCTTACATTTTAATTGATTGGCACTCATTTTATTTAAAAACTTCAGTATTTCATATGAGCAATAATGCACCGCTGCAATAGTATACACGTTACACCTGTAAATATCGTCTTAAGAGAGTAATTGATTCTAATAACAGGTAAGGAGTAATTACTTTAGTTAATTTCTAATTTTGATATTTATTATGGCAACGCTGATGTGAGCTCACAAACTTGAATAAGTTACTCACTTAGGAGAATTGTTACCAAACTTAATAGACTAACTATTGGAGCCACCGTAGGAGAATTGATAGTAGTTGCCAAAGTGTAGATTAATAGAATAAGTTACTAACTTATGGAGCCACCGTAGGAGAATTGATAGTAGTTGCCAAAGTGTAGATTAATAGAATAAGTTACTAACTTATGGAGCCACCGTAGGAGAATTGATAGTAGTTGCCAAAGTGTAGATTAATAGAATAAGTTACTAACTTATGGAGCCACCGTAGGAGAATTGATAGTAGTTGCCAAAGTGTAGATTAATAGAATAAGTTACTAACTTATGGAGCCACCGTAGGAGAATTGATAGTAGTTGCCAAAGTGTAGATTAATAGAATAAGTTACTAACTTATGGAGCCACCGTAGGAGAATTGATAGTAGTTGCCAAAGTGTAGATTAATAGAATAAGTTACTAACTTATGGAGCCACCGTAGGAGAATTGATAGTAGTTGCCAAAGTGTAGATTAATAGAATAAGTTACTAACTTATGGAGCCACCGTAGGAGAATTGATAGTAGTTGCCAAAGTGTAGATTAATAGAATAAGTTACTAACTTATGGAGCCACCGTAGGAGAATTGATAGTAGTTGCCAAAGTGTAGATTAATAGAATAAGTTACTAACTTATGGAGCCACCGTAGGAGAATTGATAGTAGTTGCCAAAGTGTAGATTAATAGAATAAGTTACTAACTTATGGAGCCACCGTAGGAGAATTGATAGTAGTTGCCAAAGTGTAGATTAATA
GAATAAGTTACTAACTTATGGAGCCACCGTAGGAGAATTGATAGTAGTTGCCAAAGTGTAGATTAATAGAATAAGTTACTAACTTATGGAGCCACCGTAGGAGAATTGATAGTAGTTGCCAAAGTGTAGATTAATAGAATAAGTTACTAACTTATGGAGCCACCGTAGGAGAATTGATAGTAGTTGCCAAAGTGTAGATTAATAGAATAAGTTACTAACTTATGGAGCCACCGTAGGAGAATTGATAGTAGTTGCCAAAGTGTAGATCAAGTTTTGTGCTTGTGTACTAAGTTTTATGCTTTAATTAGGAAAATTGTTTGTGGTTCTTAACTCAAGGAGAAAGTTTAGGAAGTTAGTCAATGCTTTCTCCATTATAAATACTTATGATGTATATTCTTTTCACAATCACAACAATAGAGAGAATTGATACCAAATGTTGGATTTGTCTCCCGATCACAAGATCTCTCCTCTCCAGTTTTTTTTTTTGCTTTTTTTACTTGTTTCGACATATTTTGATCGTGGGATAAATCCAACACTGGAGGCTGCAAAGTTTTCTATTTTTATTTTTTTCTTATGAAAGAACTTCTATAGTCCTACTCAGCCAGACTTTTATGTCACTCTTTCAAGCTTGCTTGCATACTCGTGTAATAACCTTGACCATGAAATATCAGAGCTCTATTTCATGCATTGCCAGGTGAAAGAAGCTTGTAGGAAAGCATCTGAAGCAGAGTTAGTTGCAAAAGACCGCCTTCTACAAATGAACTGCAAACTTGACATTCAAAGCAGAATCATGGTACACTCCATATATTAGCAATATTTATTTCTTCCATTTAAAACTATATGTTCTCTTCAATGTCAAGCCTTTGAAGATTAAGTTTGCTTAGTCATTGAAAGTTCATTGTCTTTGACAGTGCCCCCAACGACCAAATGTTAGATTTTCTAGTGAAATTAAGAAAAGAAAGATTGAAGGTGGCAAATAGAGCAACGTGATTGAAGACTTGGAATCTGAATAGTGTGACAAAAGTTACAGCCTTCACTCTTTGTGCATATGTGACGCTAGGATTGCATGAATGAATTGGATGGTCTTTCTTTCCATGTGAAAGGCATTTGCATCACAGAGTGAAAGGATTGCTTCTTGACGAGAAGGAAGAAAAAGGTTTCATTCTGGTTGACCTCTAAATATGTGGTAAGTTTCATAACCTTTTCTTTCTCTTTAAAAGAAGGATCGTATCATAACAGTAATAGAGTAAATTATGTATACTGTAGATTCTTGGTGGGTTGGTAAGGGGTGTAATATTAACTTCATTGATATGATCTTGAGTTGAAATTAAGAGTGAGTGATTTAGTCATACTATCTGCCATTGTTTATTTAGGGGAAGGTACGTAGTTGAATATTAGCAAGTTTTAAAGTAGTTTTCAAAATCTCTTTTGTACTTCATTGAATTAGTAATTGTAAGTGCGTGGAGTAGGCGGTACAATGCCTCAACATCCCTAACCAACC
mRNA sequence
CTCCAATTGACTATCATCGCCATAGATACGAGGTTTTCCCGTGGAATCAAAGCGTGGTGCTATGGCTTCCAAACGCTCCTCCATTGTTCATCAACCTCAACCTCTACAAGCTGGCTTCCTCCACTTACCCCGCAAAAAACCTAAGAGCCTGCCGCCCTCGGATGAACTTGCATCCAAAGTTGGGGATAACATTCCGTATTATGCTGCGAAGGATCTTCGTATCAAGCGAGTTTTTTCTCCCAATTTTGATAATCGCTCTTCGCTGCCATCGGAAGGGCAGATTAGCGACAGAGAAGCGCGGATCACCCCCAATGGAACTTGTCCAAATGGAGATAGTGGAGTCGGTAAAAGCTCCATAACTGCAGAGGTACGGAATGAGAATTTTTGCAATTCTAGTGGGTATGTTGAATTGGGCGATGAGGATCGGAGATGTAGCGGGAAGATTGTGGAACGGGTGCATTCTACACCTCCTGATGCGGAGGTTATGGCTGGGGGTCTTGTGGCGGCTTCTTCAAATGGATGTCCTCGCTCAAGCAATGGGAGCGTTTTAGGGGATACTTGTGCGAAAGCTGACTGTAGAATTGACGCTGTTACGAGAACCGGATCGGTTCTCAAACCATGTTCTAAACGGAAGTTATTCAAGGCACCTGGTTCCATTGCCTACAAAAGATTGCTGCCCTTTTTGCCGGATGGCGATAATTACATCCTACGAGGTGATTTGTGCTCAAAACGTGAAAAAAAATTGGAGAAGAAGGAAAATATTGAATCTAATCTGTGTAATCGTGCCAATGAGTCATCTTTTATTGATTCAGATACTAGTGTAAATAATGCAGTTTTGGCCTACGGCATATCATGTAACACTATGAAGCTAAATTCAACGCCTCCATATAATGGGGATGCTAAAAACTTTCAGAATGGCAGTGATTCAAGGAATGACCCGACTTTAGTCAAAGAGAATTCTGGTTTGAAAAGGGATGTAGTTTCTGTCTCTTCTCTTGACAAAAAGCTGACAGAGAATGGAGGGCCCTCAGAGCATCAGATAGAGGATCGATTCTCTAATGAACAATCCAAAACCTTCGTAATTGAGAGGCTTGATGGAGGAGATCCCTTTATATCTTCAAATCTTTCCTCAGAAGTAGACAACTTTAAGTCCCATGTTTCGGAGAAGTTGTGTAACAATGTTTCTGAGGATATCAAATCAGAAAATCATTCCAAGGTAGAAATCGAGATATCATCATTGGATTCTAACATTGCTTGTAACCTAGTGAAGGAAGAAAGGAAGAACGAAAAAGTGTCATGCACTCGAGGCACAGATCAAAATCTTGGTAGTTCCACTGTTGGTGAAAATGATTTCAACTTTGCTACAGAGAGTGACAAGAAATATGGCCCCTGTGTTAGAAACAAAGTGGTTCGCAATCCACTTGTACAACTGAAGTCGAAATACAGCCAAGTTTCAGTTAGCTTTCGTAGGATGCTTCCGTTCCTTACGGATCTTTTTAAAGATAACCCAGAGAACTGTGCTTCGGGAAACATTGACTCTCCAAGACCGGAGAAAGAATTGCCAACTATGAATTTGCAATCACCGAGTTCAAATTCTCACAATTCCTGGGATAGATCAGAAGGCTTAGCATCTTGCAACATGCCATGCGATGGAAGTTTAGATACTCCCTCAATGCCTGGGTCGAATACTATGAACGAAATGGTTTGTGAAACAGAAAAAGTTCTATTGCATAATGGACTCAATGATGAACTTCTATCATCACCAAAATTACAGATGCATCACTTGCATTCTGAACAGGAGATGTTGGATACATGTATGTTGAAAGTGGACCCTCAATTGCATGATCAAGCAGTCTTATCTAGTTATGATCCGCTAACTGGAAAAGGATCTAGAATGGTCTCTCAACAATCACCAATCACTTCAGAAGTCTGCACAAATTTGACAGATAATGTTTCTGATGCCGCTAAACTTTCTGAGAGAAACAGCTTAGAACCTAATT
CTTTATGCGTAGAAGGATGTGTTCCAGTAATTCGCATTAATGTTGGAAAGGGAATTCCTAAGCAAAACCCACGAGGATGCAGAGGAATCTGCAATTGTTTGAACTGTTCCTCTTTTCGTCTCCATGCTGAAAGAGCATTTGAATTTTCTAGAAATCAGCTGCAAGATGCTGAAGTAGTTGCTTCAGACTTGATGAAAGAATTGTCGTTTATCCGTGATGTGCTGGAAAAATGTTCTGATGGTGCATACGGTGATGCTGGATATTATTCAAATAAGGTGAAAGAAGCTTGTAGGAAAGCATCTGAAGCAGAGTTAGTTGCAAAAGACCGCCTTCTACAAATGAACTGCAAACTTGACATTCAAAGCAGAATCATGTGCCCCCAACGACCAAATGTTAGATTTTCTAGTGAAATTAAGAAAAGAAAGATTGAAGGTGGCAAATAGAGCAACGTGATTGAAGACTTGGAATCTGAATAGTGTGACAAAAGTTACAGCCTTCACTCTTTGTGCATATGTGACGCTAGGATTGCATGAATGAATTGGATGGTCTTTCTTTCCATGTGAAAGGCATTTGCATCACAGAGTGAAAGGATTGCTTCTTGACGAGAAGGAAGAAAAAGGTTTCATTCTGGTTGACCTCTAAATATGTGGTAAGTTTCATAACCTTTTCTTTCTCTTTAAAAGAAGGATCGTATCATAACAGTAATAGAGTAAATTATGTATACTGTAGATTCTTGGTGGGTTGGTAAGGGGTGTAATATTAACTTCATTGATATGATCTTGAGTTGAAATTAAGAGTGAGTGATTTAGTCATACTATCTGCCATTGTTTATTTAGGGGAAGGTACGTAGTTGAATATTAGCAAGTTTTAAAGTAGTTTTCAAAATCTCTTTTGTACTTCATTGAATTAGTAATTGTAAGTGCGTGGAGTAGGCGGTACAATGCCTCAACATCCCTAACCAACC
Coding sequence (CDS)
ATGGCTTCCAAACGCTCCTCCATTGTTCATCAACCTCAACCTCTACAAGCTGGCTTCCTCCACTTACCCCGCAAAAAACCTAAGAGCCTGCCGCCCTCGGATGAACTTGCATCCAAAGTTGGGGATAACATTCCGTATTATGCTGCGAAGGATCTTCGTATCAAGCGAGTTTTTTCTCCCAATTTTGATAATCGCTCTTCGCTGCCATCGGAAGGGCAGATTAGCGACAGAGAAGCGCGGATCACCCCCAATGGAACTTGTCCAAATGGAGATAGTGGAGTCGGTAAAAGCTCCATAACTGCAGAGGTACGGAATGAGAATTTTTGCAATTCTAGTGGGTATGTTGAATTGGGCGATGAGGATCGGAGATGTAGCGGGAAGATTGTGGAACGGGTGCATTCTACACCTCCTGATGCGGAGGTTATGGCTGGGGGTCTTGTGGCGGCTTCTTCAAATGGATGTCCTCGCTCAAGCAATGGGAGCGTTTTAGGGGATACTTGTGCGAAAGCTGACTGTAGAATTGACGCTGTTACGAGAACCGGATCGGTTCTCAAACCATGTTCTAAACGGAAGTTATTCAAGGCACCTGGTTCCATTGCCTACAAAAGATTGCTGCCCTTTTTGCCGGATGGCGATAATTACATCCTACGAGGTGATTTGTGCTCAAAACGTGAAAAAAAATTGGAGAAGAAGGAAAATATTGAATCTAATCTGTGTAATCGTGCCAATGAGTCATCTTTTATTGATTCAGATACTAGTGTAAATAATGCAGTTTTGGCCTACGGCATATCATGTAACACTATGAAGCTAAATTCAACGCCTCCATATAATGGGGATGCTAAAAACTTTCAGAATGGCAGTGATTCAAGGAATGACCCGACTTTAGTCAAAGAGAATTCTGGTTTGAAAAGGGATGTAGTTTCTGTCTCTTCTCTTGACAAAAAGCTGACAGAGAATGGAGGGCCCTCAGAGCATCAGATAGAGGATCGATTCTCTAATGAACAATCCAAAACCTTCGTAATTGAGAGGCTTGATGGAGGAGATCCCTTTATATCTTCAAATCTTTCCTCAGAAGTAGACAACTTTAAGTCCCATGTTTCGGAGAAGTTGTGTAACAATGTTTCTGAGGATATCAAATCAGAAAATCATTCCAAGGTAGAAATCGAGATATCATCATTGGATTCTAACATTGCTTGTAACCTAGTGAAGGAAGAAAGGAAGAACGAAAAAGTGTCATGCACTCGAGGCACAGATCAAAATCTTGGTAGTTCCACTGTTGGTGAAAATGATTTCAACTTTGCTACAGAGAGTGACAAGAAATATGGCCCCTGTGTTAGAAACAAAGTGGTTCGCAATCCACTTGTACAACTGAAGTCGAAATACAGCCAAGTTTCAGTTAGCTTTCGTAGGATGCTTCCGTTCCTTACGGATCTTTTTAAAGATAACCCAGAGAACTGTGCTTCGGGAAACATTGACTCTCCAAGACCGGAGAAAGAATTGCCAACTATGAATTTGCAATCACCGAGTTCAAATTCTCACAATTCCTGGGATAGATCAGAAGGCTTAGCATCTTGCAACATGCCATGCGATGGAAGTTTAGATACTCCCTCAATGCCTGGGTCGAATACTATGAACGAAATGGTTTGTGAAACAGAAAAAGTTCTATTGCATAATGGACTCAATGATGAACTTCTATCATCACCAAAATTACAGATGCATCACTTGCATTCTGAACAGGAGATGTTGGATACATGTATGTTGAAAGTGGACCCTCAATTGCATGATCAAGCAGTCTTATCTAGTTATGATCCGCTAACTGGAAAAGGATCTAGAATGGTCTCTCAACAATCACCAATCACTTCAGAAGTCTGCACAAATTTGACAGATAATGTTTCTGATGCCGCTAAACTTTCTGAGAGAAACAGCTTAGAACCTAATTCTTTATGCGTAGAAGGATGTGTTCCAGTAATTCGCATTAATGTTGGAAAGGGAATTCCTAA
GCAAAACCCACGAGGATGCAGAGGAATCTGCAATTGTTTGAACTGTTCCTCTTTTCGTCTCCATGCTGAAAGAGCATTTGAATTTTCTAGAAATCAGCTGCAAGATGCTGAAGTAGTTGCTTCAGACTTGATGAAAGAATTGTCGTTTATCCGTGATGTGCTGGAAAAATGTTCTGATGGTGCATACGGTGATGCTGGATATTATTCAAATAAGGTGAAAGAAGCTTGTAGGAAAGCATCTGAAGCAGAGTTAGTTGCAAAAGACCGCCTTCTACAAATGAACTGCAAACTTGACATTCAAAGCAGAATCATGTGCCCCCAACGACCAAATGTTAGATTTTCTAGTGAAATTAAGAAAAGAAAGATTGAAGGTGGCAAATAG
Protein sequence
MASKRSSIVHQPQPLQAGFLHLPRKKPKSLPPSDELASKVGDNIPYYAAKDLRIKRVFSPNFDNRSSLPSEGQISDREARITPNGTCPNGDSGVGKSSITAEVRNENFCNSSGYVELGDEDRRCSGKIVERVHSTPPDAEVMAGGLVAASSNGCPRSSNGSVLGDTCAKADCRIDAVTRTGSVLKPCSKRKLFKAPGSIAYKRLLPFLPDGDNYILRGDLCSKREKKLEKKENIESNLCNRANESSFIDSDTSVNNAVLAYGISCNTMKLNSTPPYNGDAKNFQNGSDSRNDPTLVKENSGLKRDVVSVSSLDKKLTENGGPSEHQIEDRFSNEQSKTFVIERLDGGDPFISSNLSSEVDNFKSHVSEKLCNNVSEDIKSENHSKVEIEISSLDSNIACNLVKEERKNEKVSCTRGTDQNLGSSTVGENDFNFATESDKKYGPCVRNKVVRNPLVQLKSKYSQVSVSFRRMLPFLTDLFKDNPENCASGNIDSPRPEKELPTMNLQSPSSNSHNSWDRSEGLASCNMPCDGSLDTPSMPGSNTMNEMVCETEKVLLHNGLNDELLSSPKLQMHHLHSEQEMLDTCMLKVDPQLHDQAVLSSYDPLTGKGSRMVSQQSPITSEVCTNLTDNVSDAAKLSERNSLEPNSLCVEGCVPVIRINVGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVASDLMKELSFIRDVLEKCSDGAYGDAGYYSNKVKEACRKASEAELVAKDRLLQMNCKLDIQSRIMCPQRPNVRFSSEIKKRKIEGGK
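The protein sequence above is the conceptual translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code and a reading frame that starts at the first base of the CDS, the relationship can be checked in Python. The function name `translate` and the two short fragments (copied from the first and last codons of the CDS above) are illustrative only and are not part of the database's own tooling.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons containing ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last four codons of the CDS listed above.
cds_start = "ATGGCTTCCAAACGCTCC"
cds_end = "GGTGGCAAATAG"

print(translate(cds_start))  # MASKRS
print(translate(cds_end))    # GGK
```

The two fragments translate to the first six and last three residues of the protein sequence (MASKRS ... GGK), with the final TAG codon read as the stop.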
Homology
BLAST of CmoCh08G008370 vs. ExPASy TrEMBL
Match:
A0A6J1HGP9 (uncharacterized protein LOC111464172 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111464172 PE=4 SV=1)
HSP 1 Score: 1574.3 bits (4075), Expect = 0.0e+00
Identity = 793/793 (100.00%), Positives = 793/793 (100.00%), Query Frame = 0
Query: 1 MASKRSSIVHQPQPLQAGFLHLPRKKPKSLPPSDELASKVGDNIPYYAAKDLRIKRVFSP 60
MASKRSSIVHQPQPLQAGFLHLPRKKPKSLPPSDELASKVGDNIPYYAAKDLRIKRVFSP
Sbjct: 1 MASKRSSIVHQPQPLQAGFLHLPRKKPKSLPPSDELASKVGDNIPYYAAKDLRIKRVFSP 60
Query: 61 NFDNRSSLPSEGQISDREARITPNGTCPNGDSGVGKSSITAEVRNENFCNSSGYVELGDE 120
NFDNRSSLPSEGQISDREARITPNGTCPNGDSGVGKSSITAEVRNENFCNSSGYVELGDE
Sbjct: 61 NFDNRSSLPSEGQISDREARITPNGTCPNGDSGVGKSSITAEVRNENFCNSSGYVELGDE 120
Query: 121 DRRCSGKIVERVHSTPPDAEVMAGGLVAASSNGCPRSSNGSVLGDTCAKADCRIDAVTRT 180
DRRCSGKIVERVHSTPPDAEVMAGGLVAASSNGCPRSSNGSVLGDTCAKADCRIDAVTRT
Sbjct: 121 DRRCSGKIVERVHSTPPDAEVMAGGLVAASSNGCPRSSNGSVLGDTCAKADCRIDAVTRT 180
Query: 181 GSVLKPCSKRKLFKAPGSIAYKRLLPFLPDGDNYILRGDLCSKREKKLEKKENIESNLCN 240
GSVLKPCSKRKLFKAPGSIAYKRLLPFLPDGDNYILRGDLCSKREKKLEKKENIESNLCN
Sbjct: 181 GSVLKPCSKRKLFKAPGSIAYKRLLPFLPDGDNYILRGDLCSKREKKLEKKENIESNLCN 240
Query: 241 RANESSFIDSDTSVNNAVLAYGISCNTMKLNSTPPYNGDAKNFQNGSDSRNDPTLVKENS 300
RANESSFIDSDTSVNNAVLAYGISCNTMKLNSTPPYNGDAKNFQNGSDSRNDPTLVKENS
Sbjct: 241 RANESSFIDSDTSVNNAVLAYGISCNTMKLNSTPPYNGDAKNFQNGSDSRNDPTLVKENS 300
Query: 301 GLKRDVVSVSSLDKKLTENGGPSEHQIEDRFSNEQSKTFVIERLDGGDPFISSNLSSEVD 360
GLKRDVVSVSSLDKKLTENGGPSEHQIEDRFSNEQSKTFVIERLDGGDPFISSNLSSEVD
Sbjct: 301 GLKRDVVSVSSLDKKLTENGGPSEHQIEDRFSNEQSKTFVIERLDGGDPFISSNLSSEVD 360
Query: 361 NFKSHVSEKLCNNVSEDIKSENHSKVEIEISSLDSNIACNLVKEERKNEKVSCTRGTDQN 420
NFKSHVSEKLCNNVSEDIKSENHSKVEIEISSLDSNIACNLVKEERKNEKVSCTRGTDQN
Sbjct: 361 NFKSHVSEKLCNNVSEDIKSENHSKVEIEISSLDSNIACNLVKEERKNEKVSCTRGTDQN 420
Query: 421 LGSSTVGENDFNFATESDKKYGPCVRNKVVRNPLVQLKSKYSQVSVSFRRMLPFLTDLFK 480
LGSSTVGENDFNFATESDKKYGPCVRNKVVRNPLVQLKSKYSQVSVSFRRMLPFLTDLFK
Sbjct: 421 LGSSTVGENDFNFATESDKKYGPCVRNKVVRNPLVQLKSKYSQVSVSFRRMLPFLTDLFK 480
Query: 481 DNPENCASGNIDSPRPEKELPTMNLQSPSSNSHNSWDRSEGLASCNMPCDGSLDTPSMPG 540
DNPENCASGNIDSPRPEKELPTMNLQSPSSNSHNSWDRSEGLASCNMPCDGSLDTPSMPG
Sbjct: 481 DNPENCASGNIDSPRPEKELPTMNLQSPSSNSHNSWDRSEGLASCNMPCDGSLDTPSMPG 540
Query: 541 SNTMNEMVCETEKVLLHNGLNDELLSSPKLQMHHLHSEQEMLDTCMLKVDPQLHDQAVLS 600
SNTMNEMVCETEKVLLHNGLNDELLSSPKLQMHHLHSEQEMLDTCMLKVDPQLHDQAVLS
Sbjct: 541 SNTMNEMVCETEKVLLHNGLNDELLSSPKLQMHHLHSEQEMLDTCMLKVDPQLHDQAVLS 600
Query: 601 SYDPLTGKGSRMVSQQSPITSEVCTNLTDNVSDAAKLSERNSLEPNSLCVEGCVPVIRIN 660
SYDPLTGKGSRMVSQQSPITSEVCTNLTDNVSDAAKLSERNSLEPNSLCVEGCVPVIRIN
Sbjct: 601 SYDPLTGKGSRMVSQQSPITSEVCTNLTDNVSDAAKLSERNSLEPNSLCVEGCVPVIRIN 660
Query: 661 VGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVASDLMKELSFIRDV 720
VGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVASDLMKELSFIRDV
Sbjct: 661 VGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVASDLMKELSFIRDV 720
Query: 721 LEKCSDGAYGDAGYYSNKVKEACRKASEAELVAKDRLLQMNCKLDIQSRIMCPQRPNVRF 780
LEKCSDGAYGDAGYYSNKVKEACRKASEAELVAKDRLLQMNCKLDIQSRIMCPQRPNVRF
Sbjct: 721 LEKCSDGAYGDAGYYSNKVKEACRKASEAELVAKDRLLQMNCKLDIQSRIMCPQRPNVRF 780
Query: 781 SSEIKKRKIEGGK 794
SSEIKKRKIEGGK
Sbjct: 781 SSEIKKRKIEGGK 793
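The Identity and Positives fields in each HSP header are counts over the alignment length, and the percentage in parentheses is that count divided by the length. The following is a small sketch, assuming the header line keeps the layout shown in this report; the helper name `hsp_percentages` is hypothetical, and the regular expression is only a convenience for this layout, not a general BLAST parser.

```python
import re

# Matches "Identity = a/b ... Positives = c/d". The "i" in "Positives" is
# optional because some exports of this report misspell the field name.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (percent identity, percent positives) recomputed from the counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("no Identity/Positives fields found")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return (round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2))


line = "Identity = 793/793 (100.00%), Positives = 793/793 (100.00%), Query Frame = 0"
print(hsp_percentages(line))  # (100.0, 100.0)
```

Recomputing the values this way is a quick consistency check on a copied report: a hit listed as 715/799 identities, for example, should come out at 89.49%.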
BLAST of CmoCh08G008370 vs. ExPASy TrEMBL
Match:
A0A6J1HLX0 (uncharacterized protein LOC111464172 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111464172 PE=4 SV=1)
HSP 1 Score: 1493.8 bits (3866), Expect = 0.0e+00
Identity = 761/793 (95.96%), Positives = 761/793 (95.96%), Query Frame = 0
Query: 1 MASKRSSIVHQPQPLQAGFLHLPRKKPKSLPPSDELASKVGDNIPYYAAKDLRIKRVFSP 60
MASKRSSIVHQPQPLQAGFLHLPRKKPKSLPPSDELASKVGDNIPYYAAKDLRIKRVFSP
Sbjct: 1 MASKRSSIVHQPQPLQAGFLHLPRKKPKSLPPSDELASKVGDNIPYYAAKDLRIKRVFSP 60
Query: 61 NFDNRSSLPSEGQISDREARITPNGTCPNGDSGVGKSSITAEVRNENFCNSSGYVELGDE 120
NFDNRSSLPSEGQISDREARITPNGTCPNGDSGVGKSSITAEVRNENFCNSSGYVELGDE
Sbjct: 61 NFDNRSSLPSEGQISDREARITPNGTCPNGDSGVGKSSITAEVRNENFCNSSGYVELGDE 120
Query: 121 DRRCSGKIVERVHSTPPDAEVMAGGLVAASSNGCPRSSNGSVLGDTCAKADCRIDAVTRT 180
DRRCSGKIVERVHSTPPDAEVMAGGLVAASSNGCPRSSNGSVLGDTCAKADCRIDAVTRT
Sbjct: 121 DRRCSGKIVERVHSTPPDAEVMAGGLVAASSNGCPRSSNGSVLGDTCAKADCRIDAVTRT 180
Query: 181 GSVLKPCSKRKLFKAPGSIAYKRLLPFLPDGDNYILRGDLCSKREKKLEKKENIESNLCN 240
GSVLKPCSKRKLFKAPGSIAYKRLLPFLPDGDNYILRGDLCSKREKKLEKKENIESNLCN
Sbjct: 181 GSVLKPCSKRKLFKAPGSIAYKRLLPFLPDGDNYILRGDLCSKREKKLEKKENIESNLCN 240
Query: 241 RANESSFIDSDTSVNNAVLAYGISCNTMKLNSTPPYNGDAKNFQNGSDSRNDPTLVKENS 300
RANESSFIDSDTSVNNAVLAYGISCNTMKLNSTPPYNGDAKNFQNGSDSRNDPTLVKENS
Sbjct: 241 RANESSFIDSDTSVNNAVLAYGISCNTMKLNSTPPYNGDAKNFQNGSDSRNDPTLVKENS 300
Query: 301 GLKRDVVSVSSLDKKLTENGGPSEHQIEDRFSNEQSKTFVIERLDGGDPFISSNLSSEVD 360
GLKRDVVSVSSLDKKLTENGGPSEHQIEDRFSNEQSKTFVIERLDGGDPFISSNLSSEVD
Sbjct: 301 GLKRDVVSVSSLDKKLTENGGPSEHQIEDRFSNEQSKTFVIERLDGGDPFISSNLSSEVD 360
Query: 361 NFKSHVSEKLCNNVSEDIKSENHSKVEIEISSLDSNIACNLVKEERKNEKVSCTRGTDQN 420
NFKSHVSEKLCNNVSEDIKSENHSKVEIEISSLDSNIACNLVKEERKNEKVSCTRGTDQN
Sbjct: 361 NFKSHVSEKLCNNVSEDIKSENHSKVEIEISSLDSNIACNLVKEERKNEKVSCTRGTDQN 420
Query: 421 LGSSTVGENDFNFATESDKKYGPCVRNKVVRNPLVQLKSKYSQVSVSFRRMLPFLTDLFK 480
LGSSTVGENDFNFATESDKKYGPCVRNKVVRNPLVQLKSKYSQVSVSFRRMLPFLTDLFK
Sbjct: 421 LGSSTVGENDFNFATESDKKYGPCVRNKVVRNPLVQLKSKYSQVSVSFRRMLPFLTDLFK 480
Query: 481 DNPENCASGNIDSPRPEKELPTMNLQSPSSNSHNSWDRSEGLASCNMPCDGSLDTPSMPG 540
DNPENCASGNIDSPRPEKELPTMNLQSPSSNSHNSWDRSEGLASCNMPCDGSLDTPSMPG
Sbjct: 481 DNPENCASGNIDSPRPEKELPTMNLQSPSSNSHNSWDRSEGLASCNMPCDGSLDTPSMPG 540
Query: 541 SNTMNEMVCETEKVLLHNGLNDELLSSPKLQMHHLHSEQEMLDTCMLKVDPQLHDQAVLS 600
SNTMNEM EMLDTCMLKVDPQLHDQAVLS
Sbjct: 541 SNTMNEM--------------------------------EMLDTCMLKVDPQLHDQAVLS 600
Query: 601 SYDPLTGKGSRMVSQQSPITSEVCTNLTDNVSDAAKLSERNSLEPNSLCVEGCVPVIRIN 660
SYDPLTGKGSRMVSQQSPITSEVCTNLTDNVSDAAKLSERNSLEPNSLCVEGCVPVIRIN
Sbjct: 601 SYDPLTGKGSRMVSQQSPITSEVCTNLTDNVSDAAKLSERNSLEPNSLCVEGCVPVIRIN 660
Query: 661 VGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVASDLMKELSFIRDV 720
VGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVASDLMKELSFIRDV
Sbjct: 661 VGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVASDLMKELSFIRDV 720
Query: 721 LEKCSDGAYGDAGYYSNKVKEACRKASEAELVAKDRLLQMNCKLDIQSRIMCPQRPNVRF 780
LEKCSDGAYGDAGYYSNKVKEACRKASEAELVAKDRLLQMNCKLDIQSRIMCPQRPNVRF
Sbjct: 721 LEKCSDGAYGDAGYYSNKVKEACRKASEAELVAKDRLLQMNCKLDIQSRIMCPQRPNVRF 761
Query: 781 SSEIKKRKIEGGK 794
SSEIKKRKIEGGK
Sbjct: 781 SSEIKKRKIEGGK 761
BLAST of CmoCh08G008370 vs. ExPASy TrEMBL
Match:
A0A6J1KHD4 (uncharacterized protein LOC111494390 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111494390 PE=4 SV=1)
HSP 1 Score: 1389.0 bits (3594), Expect = 0.0e+00
Identity = 715/799 (89.49%), Positives = 737/799 (92.24%), Query Frame = 0
Query: 1 MASKRSSIVHQPQPLQAGFLHLPRKKPKSLPPSDELASKVGDNIPYYAAKDLRIKRVFSP 60
MASKRSSIVHQPQ LQAGFLHLPRKKPK LPPSDELAS VGD I YYAAKDLRIKRVFSP
Sbjct: 1 MASKRSSIVHQPQSLQAGFLHLPRKKPKRLPPSDELASVVGDKISYYAAKDLRIKRVFSP 60
Query: 61 NFDNRSSLPSEGQISDREARITPNGTCPNGDSGVGKSSITAEVRNENFCNSSGYVELGDE 120
N DNRSS+PSEGQISD EA IT NGTCPNGDSGVGK SITAEVRNENFCNSSGYVELGDE
Sbjct: 61 NLDNRSSVPSEGQISDEEAPITANGTCPNGDSGVGKISITAEVRNENFCNSSGYVELGDE 120
Query: 121 DRRCSGKIVERVHSTPPDAEVMAGGLVAASSNGCPRSSNGSVLGDTCAKADCRIDAVTRT 180
DRRC+GK VE VHSTPPDAEV+AGGLVAASSNGCPRSS+GSVLGD CAKADCRID+VTRT
Sbjct: 121 DRRCNGKNVELVHSTPPDAEVLAGGLVAASSNGCPRSSHGSVLGDICAKADCRIDSVTRT 180
Query: 181 GSVLKPCSKRKLFKAPGSIAYKRLLPFLPDGDNYILRGDLCSKREKKLEKKENIESNLCN 240
GSVLKPCSKRKLFKAPGSIAYKRLLPFL DGDNYIL+GDLCSKRE LEKKENIESN CN
Sbjct: 181 GSVLKPCSKRKLFKAPGSIAYKRLLPFLLDGDNYILQGDLCSKRENNLEKKENIESNRCN 240
Query: 241 RANESSFIDSDTSVNNAVLAYGISCNTMKLNSTPPYNGDAKNFQNGSDSRNDPTLVKENS 300
RANESSF+DSDTSV A+LA+GISCNTMKLN TPP NGD KNF NGSDSRNDPTLVKENS
Sbjct: 241 RANESSFVDSDTSVKYAILAHGISCNTMKLNLTPPDNGDTKNFHNGSDSRNDPTLVKENS 300
Query: 301 GLKRDVVSVSSLDKKLTENGGPSEHQIEDRFSNEQSKTFVIERLDGGDPFISSNLSSEVD 360
GLKRDVVSVSSLDK+LTENG ++QSKTF IERLDGGDPF SSNLSSEVD
Sbjct: 301 GLKRDVVSVSSLDKRLTENG------------SQQSKTFGIERLDGGDPFTSSNLSSEVD 360
Query: 361 NFKSHVSEKLCNNVSEDIKSENHSKVEIEISSLDSNIACNLVKEERKNEKVSCTRGTDQN 420
NFKSHVSEKLCNNVS DIKSENHSK EI+ISSLDS+IACNLVKEERKNEKV CTRGTDQN
Sbjct: 361 NFKSHVSEKLCNNVSADIKSENHSKEEIKISSLDSDIACNLVKEERKNEKVLCTRGTDQN 420
Query: 421 LGSSTVGENDFNFATESDKKYGPCVRNKVVRNPLVQLKSKYSQVSVSFRRMLPFLTDLFK 480
LGSSTVGEND N ATESDKKYGPCVRNKV+RNPLVQLKSKYSQV VS+RRMLPFL DLFK
Sbjct: 421 LGSSTVGENDCNIATESDKKYGPCVRNKVIRNPLVQLKSKYSQVLVSYRRMLPFLEDLFK 480
Query: 481 DNPENCASGNIDSPRPEKELPTMNLQSPSSNSHNSWDRSEGLASCNMPCDGSLDTPSMPG 540
DNPENCAS NIDSPRPEKELPTMNLQSPSSNSHNS D+SE LASCNMPC+G+LDTPSMPG
Sbjct: 481 DNPENCASVNIDSPRPEKELPTMNLQSPSSNSHNSRDKSESLASCNMPCNGNLDTPSMPG 540
Query: 541 SNTMNEMVCETEKVLLHNGLNDELLSSPKLQMHHLHSEQEMLDTCMLKVDPQLHDQAVLS 600
NTMNEMVCETEKVLLHNGL DELLSSPKLQMHH HSEQEMLD CMLKVDPQLHDQAVLS
Sbjct: 541 LNTMNEMVCETEKVLLHNGLIDELLSSPKLQMHHFHSEQEMLDKCMLKVDPQLHDQAVLS 600
Query: 601 -----SYDPLTGKGSRMVSQQSPITSEVCTNLTDNVSDAAKLSERNSLEPNSLCVEGCV- 660
SYDPLTG+GSRMVSQQSPITSE CTNLTDNVSDAAKLSERNSLEPNSLCVEGCV
Sbjct: 601 LYAAASYDPLTGEGSRMVSQQSPITSEGCTNLTDNVSDAAKLSERNSLEPNSLCVEGCVL 660
Query: 661 PVIRINVGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVASDLMKEL 720
P RINVGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVASDLMKEL
Sbjct: 661 PESRINVGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVASDLMKEL 720
Query: 721 SFIRDVLEKCSDGAYGDAGYYSNKVKEACRKASEAELVAKDRLLQMNCKLDIQSRIMCPQ 780
SFIRDVLEKCS+GAYGDAGYYSNKVKEACRKASEAELVAKDRL QMNCKLDI SRIMCPQ
Sbjct: 721 SFIRDVLEKCSNGAYGDAGYYSNKVKEACRKASEAELVAKDRLQQMNCKLDIHSRIMCPQ 780
Query: 781 RPNVRFSSEIKKRKIEGGK 794
RPNVRFSSEIKKR+IEGGK
Sbjct: 781 RPNVRFSSEIKKREIEGGK 787
BLAST of CmoCh08G008370 vs. ExPASy TrEMBL
Match:
A0A6J1KEX1 (uncharacterized protein LOC111494390 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111494390 PE=4 SV=1)
HSP 1 Score: 1313.1 bits (3397), Expect = 0.0e+00
Identity = 685/799 (85.73%), Positives = 707/799 (88.49%), Query Frame = 0
Query: 1 MASKRSSIVHQPQPLQAGFLHLPRKKPKSLPPSDELASKVGDNIPYYAAKDLRIKRVFSP 60
MASKRSSIVHQPQ LQAGFLHLPRKKPK LPPSDELAS VGD I YYAAKDLRIKRVFSP
Sbjct: 1 MASKRSSIVHQPQSLQAGFLHLPRKKPKRLPPSDELASVVGDKISYYAAKDLRIKRVFSP 60
Query: 61 NFDNRSSLPSEGQISDREARITPNGTCPNGDSGVGKSSITAEVRNENFCNSSGYVELGDE 120
N DNRSS+PSEGQISD EA IT NGTCPNGDSGVGK SITAEVRNENFCNSSGYVELGDE
Sbjct: 61 NLDNRSSVPSEGQISDEEAPITANGTCPNGDSGVGKISITAEVRNENFCNSSGYVELGDE 120
Query: 121 DRRCSGKIVERVHSTPPDAEVMAGGLVAASSNGCPRSSNGSVLGDTCAKADCRIDAVTRT 180
DRRC+GK VE VHSTPPDAEV+AGGLVAASSNGCPRSS+GSVLGD CAKADCRID+VTRT
Sbjct: 121 DRRCNGKNVELVHSTPPDAEVLAGGLVAASSNGCPRSSHGSVLGDICAKADCRIDSVTRT 180
Query: 181 GSVLKPCSKRKLFKAPGSIAYKRLLPFLPDGDNYILRGDLCSKREKKLEKKENIESNLCN 240
GSVLKPCSKRKLFKAPGSIAYKRLLPFL DGDNYIL+GDLCSKRE LEKKENIESN CN
Sbjct: 181 GSVLKPCSKRKLFKAPGSIAYKRLLPFLLDGDNYILQGDLCSKRENNLEKKENIESNRCN 240
Query: 241 RANESSFIDSDTSVNNAVLAYGISCNTMKLNSTPPYNGDAKNFQNGSDSRNDPTLVKENS 300
RANESSF+DSDTSV A+LA+GISCNTMKLN TPP NGD KNF NGSDSRNDPTLVKENS
Sbjct: 241 RANESSFVDSDTSVKYAILAHGISCNTMKLNLTPPDNGDTKNFHNGSDSRNDPTLVKENS 300
Query: 301 GLKRDVVSVSSLDKKLTENGGPSEHQIEDRFSNEQSKTFVIERLDGGDPFISSNLSSEVD 360
GLKRDVVSVSSLDK+LTENG ++QSKTF IERLDGGDPF SSNLSSEVD
Sbjct: 301 GLKRDVVSVSSLDKRLTENG------------SQQSKTFGIERLDGGDPFTSSNLSSEVD 360
Query: 361 NFKSHVSEKLCNNVSEDIKSENHSKVEIEISSLDSNIACNLVKEERKNEKVSCTRGTDQN 420
NFKSHVSEKLCNNVS DIKSENHSK EI+ISSLDS+IACNLVKEERKNEKV CTRGTDQN
Sbjct: 361 NFKSHVSEKLCNNVSADIKSENHSKEEIKISSLDSDIACNLVKEERKNEKVLCTRGTDQN 420
Query: 421 LGSSTVGENDFNFATESDKKYGPCVRNKVVRNPLVQLKSKYSQVSVSFRRMLPFLTDLFK 480
LGSSTVGEND N ATESDKKYGPCVRNKV+RNPLVQLKSKYSQV VS+RRMLPFL DLFK
Sbjct: 421 LGSSTVGENDCNIATESDKKYGPCVRNKVIRNPLVQLKSKYSQVLVSYRRMLPFLEDLFK 480
Query: 481 DNPENCASGNIDSPRPEKELPTMNLQSPSSNSHNSWDRSEGLASCNMPCDGSLDTPSMPG 540
DNPENCAS NIDSPRPEKELPTMNLQSPSSNSHNS D+SE LASCNMPC+G+LDTPSMPG
Sbjct: 481 DNPENCASVNIDSPRPEKELPTMNLQSPSSNSHNSRDKSESLASCNMPCNGNLDTPSMPG 540
Query: 541 SNTMNEMVCETEKVLLHNGLNDELLSSPKLQMHHLHSEQEMLDTCMLKVDPQLHDQAVLS 600
NTMNEM EMLD CMLKVDPQLHDQAVLS
Sbjct: 541 LNTMNEM--------------------------------EMLDKCMLKVDPQLHDQAVLS 600
Query: 601 -----SYDPLTGKGSRMVSQQSPITSEVCTNLTDNVSDAAKLSERNSLEPNSLCVEGCV- 660
SYDPLTG+GSRMVSQQSPITSE CTNLTDNVSDAAKLSERNSLEPNSLCVEGCV
Sbjct: 601 LYAAASYDPLTGEGSRMVSQQSPITSEGCTNLTDNVSDAAKLSERNSLEPNSLCVEGCVL 660
Query: 661 PVIRINVGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVASDLMKEL 720
P RINVGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVASDLMKEL
Sbjct: 661 PESRINVGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVASDLMKEL 720
Query: 721 SFIRDVLEKCSDGAYGDAGYYSNKVKEACRKASEAELVAKDRLLQMNCKLDIQSRIMCPQ 780
SFIRDVLEKCS+GAYGDAGYYSNKVKEACRKASEAELVAKDRL QMNCKLDI SRIMCPQ
Sbjct: 721 SFIRDVLEKCSNGAYGDAGYYSNKVKEACRKASEAELVAKDRLQQMNCKLDIHSRIMCPQ 755
Query: 781 RPNVRFSSEIKKRKIEGGK 794
RPNVRFSSEIKKR+IEGGK
Sbjct: 781 RPNVRFSSEIKKREIEGGK 755
BLAST of CmoCh08G008370 vs. ExPASy TrEMBL
Match:
A0A6J1H4T6 (uncharacterized protein LOC111460150 OS=Cucurbita moschata OX=3662 GN=LOC111460150 PE=4 SV=1)
HSP 1 Score: 1014.6 bits (2622), Expect = 2.2e-292
Identity = 550/806 (68.24%), Positives = 629/806 (78.04%), Query Frame = 0
Query: 1 MASKRSSIVHQPQPLQAGFLHLPRKKPKSLPPS--DELASKVGDNIPYYAAKDLRIKRVF 60
MASKRSSIV+QP+ LQAGFLHLPRKKPK LP S +ELASK GD + + AKDLR+KRVF
Sbjct: 8 MASKRSSIVYQPRALQAGFLHLPRKKPKMLPLSQPNELASKDGDGVSDFVAKDLRLKRVF 67
Query: 61 SPNFDNRSSLPSEGQISDREARITPNGTCPNGDSGVGKSSITAEVRNENFCNSSGYVELG 120
SPN +NRSS+ S ISD+E +T NGTC N DSGVGK S EVRNENFCNS+ Y E
Sbjct: 68 SPNLENRSSVTSGELISDKEGPMTANGTCLNEDSGVGKISEITEVRNENFCNSNRYAEC- 127
Query: 121 DEDRRCSGKIVERVHSTPPDAEVMAGGLVAASSNGCPRSSNGSVLGDTCAKADCRIDAVT 180
DEDR+C+GK E++HSTPPD E +AGG VAASSNGCPRSSNG V+GD CAKADCR+D+VT
Sbjct: 128 DEDRKCNGKSGEQIHSTPPDVEFLAGGFVAASSNGCPRSSNGGVIGDNCAKADCRVDSVT 187
Query: 181 RTGSVLKPCSKRKLFKAPGSIAYKRLLPFLPDGDNYILRGDLCSKREKKLEKKENIESNL 240
RTGSVLKPCSKRKLFKAPGSIAYKR+LPFL D DN+ L D SKRE LEKKENIESNL
Sbjct: 188 RTGSVLKPCSKRKLFKAPGSIAYKRMLPFLLDSDNFTLLSDPYSKRENNLEKKENIESNL 247
Query: 241 CNRANESSFIDSDTSVNNAVLAYGISCNTMKLNSTPPYNGDAKNFQNGSDSRNDPTLVKE 300
CN AN SSF+DSDT V NAV A G +C MKLN P NGD K FQNGSD +DPTLV+E
Sbjct: 248 CNPANGSSFVDSDTCVKNAVFASGNACKIMKLNLPTPDNGDTKKFQNGSDLNSDPTLVEE 307
Query: 301 NSGLKRD-VVSVSSLDKKLTENGGPSEHQIEDRFSNEQSKTFVIERLDGGDPFISSNLSS 360
S LK+D VV S +D++ P+++ EDR S EQSKT +ERLDGG+ I S
Sbjct: 308 GSCLKKDNVVCASFIDER------PTKYDTEDRSSKEQSKTSGMERLDGGNYAI-----S 367
Query: 361 EVDNFKSHVSEKLCNNVSEDIKSENHSKVEIEISSLDSNIACNLVKEERKNEKVSCTRGT 420
E +NFKSHVSEKLCNN+SED+ E+H E+++S LDSNI CN VKEER+ EKV C+RG
Sbjct: 368 EAENFKSHVSEKLCNNISEDVNREDHFNEELKMSLLDSNIGCNPVKEERREEKVGCSRGA 427
Query: 421 DQNLGSSTVGENDFNFATESDKKYGPCVRNKVVRNPLVQLKSKYSQVSVSFRRMLPFLTD 480
DQ LGS TVGEN N ATESDKKYG VRNK+VRNPLVQLK YSQ SVS+RRMLPFL D
Sbjct: 428 DQKLGSFTVGENHCNIATESDKKYGTYVRNKMVRNPLVQLKLNYSQASVSYRRMLPFLED 487
Query: 481 LFKDNPENCASGNIDSPRPEKELPTMNLQSPSSNSHNSWDRSEGLASCNMPCDGSLDTPS 540
LFKDNPENCA GNI+ PRPEKEL TMNL SPSSNS+NS D+SE L SCNMPCDG+ D S
Sbjct: 488 LFKDNPENCALGNINCPRPEKELATMNLDSPSSNSYNSQDKSEFLVSCNMPCDGNSDALS 547
Query: 541 MPGSNTMNEMVCETEKVLLHNGLNDELL----SSPKLQMHHLHSEQEMLDTCMLKVDPQL 600
+P SN++N++VCE ++VL+ G+ND LL S PKL L S+QEML+ C LK+DPQL
Sbjct: 548 LPLSNSINDVVCEADEVLMPAGVNDILLSPPISPPKLL---LQSDQEMLEKCKLKMDPQL 607
Query: 601 HDQAVLS-----SYDPLTGKGSRMVSQQSPITSEVCTNLTDNVSDAAKLSERNSLEPNSL 660
+DQAV S SY+PLTG+GSRM S+QSP TSE CTNLT+ VSD KL ERNSL+P
Sbjct: 608 NDQAVSSSYLATSYEPLTGEGSRMTSEQSPNTSEDCTNLTEYVSDGTKLPERNSLKPIEA 667
Query: 661 CVEGCVPVIRINVGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVAS 720
C+ +P INV KGI K+NPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAE VAS
Sbjct: 668 CI---LPENHINVRKGILKRNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEEVAS 727
Query: 721 DLMKELSFIRDVLEKCSDGA-YGDAGYYSNKVKEACRKASEAELVAKDRLLQMNCKLDIQ 780
DLMKEL +R VLEK +D GDAGY+SNKVKEACRKASEAEL+AKDRLLQMN +L I
Sbjct: 728 DLMKELLLLRGVLEKYADSTKEGDAGYHSNKVKEACRKASEAELIAKDRLLQMNYELGIH 787
Query: 781 SRIMCPQRPNVRFSSEIKKRKIEGGK 794
RI C QRPNVRFSSE+++ +IE GK
Sbjct: 788 CRITCSQRPNVRFSSEVERIEIEDGK 795
BLAST of CmoCh08G008370 vs. NCBI nr
Match:
XP_022964022.1 (uncharacterized protein LOC111464172 isoform X1 [Cucurbita moschata])
HSP 1 Score: 1574.3 bits (4075), Expect = 0.0e+00
Identity = 793/793 (100.00%), Positives = 793/793 (100.00%), Query Frame = 0
Query: 1 MASKRSSIVHQPQPLQAGFLHLPRKKPKSLPPSDELASKVGDNIPYYAAKDLRIKRVFSP 60
MASKRSSIVHQPQPLQAGFLHLPRKKPKSLPPSDELASKVGDNIPYYAAKDLRIKRVFSP
Sbjct: 1 MASKRSSIVHQPQPLQAGFLHLPRKKPKSLPPSDELASKVGDNIPYYAAKDLRIKRVFSP 60
Query: 61 NFDNRSSLPSEGQISDREARITPNGTCPNGDSGVGKSSITAEVRNENFCNSSGYVELGDE 120
NFDNRSSLPSEGQISDREARITPNGTCPNGDSGVGKSSITAEVRNENFCNSSGYVELGDE
Sbjct: 61 NFDNRSSLPSEGQISDREARITPNGTCPNGDSGVGKSSITAEVRNENFCNSSGYVELGDE 120
Query: 121 DRRCSGKIVERVHSTPPDAEVMAGGLVAASSNGCPRSSNGSVLGDTCAKADCRIDAVTRT 180
DRRCSGKIVERVHSTPPDAEVMAGGLVAASSNGCPRSSNGSVLGDTCAKADCRIDAVTRT
Sbjct: 121 DRRCSGKIVERVHSTPPDAEVMAGGLVAASSNGCPRSSNGSVLGDTCAKADCRIDAVTRT 180
Query: 181 GSVLKPCSKRKLFKAPGSIAYKRLLPFLPDGDNYILRGDLCSKREKKLEKKENIESNLCN 240
GSVLKPCSKRKLFKAPGSIAYKRLLPFLPDGDNYILRGDLCSKREKKLEKKENIESNLCN
Sbjct: 181 GSVLKPCSKRKLFKAPGSIAYKRLLPFLPDGDNYILRGDLCSKREKKLEKKENIESNLCN 240
Query: 241 RANESSFIDSDTSVNNAVLAYGISCNTMKLNSTPPYNGDAKNFQNGSDSRNDPTLVKENS 300
RANESSFIDSDTSVNNAVLAYGISCNTMKLNSTPPYNGDAKNFQNGSDSRNDPTLVKENS
Sbjct: 241 RANESSFIDSDTSVNNAVLAYGISCNTMKLNSTPPYNGDAKNFQNGSDSRNDPTLVKENS 300
Query: 301 GLKRDVVSVSSLDKKLTENGGPSEHQIEDRFSNEQSKTFVIERLDGGDPFISSNLSSEVD 360
GLKRDVVSVSSLDKKLTENGGPSEHQIEDRFSNEQSKTFVIERLDGGDPFISSNLSSEVD
Sbjct: 301 GLKRDVVSVSSLDKKLTENGGPSEHQIEDRFSNEQSKTFVIERLDGGDPFISSNLSSEVD 360
Query: 361 NFKSHVSEKLCNNVSEDIKSENHSKVEIEISSLDSNIACNLVKEERKNEKVSCTRGTDQN 420
NFKSHVSEKLCNNVSEDIKSENHSKVEIEISSLDSNIACNLVKEERKNEKVSCTRGTDQN
Sbjct: 361 NFKSHVSEKLCNNVSEDIKSENHSKVEIEISSLDSNIACNLVKEERKNEKVSCTRGTDQN 420
Query: 421 LGSSTVGENDFNFATESDKKYGPCVRNKVVRNPLVQLKSKYSQVSVSFRRMLPFLTDLFK 480
LGSSTVGENDFNFATESDKKYGPCVRNKVVRNPLVQLKSKYSQVSVSFRRMLPFLTDLFK
Sbjct: 421 LGSSTVGENDFNFATESDKKYGPCVRNKVVRNPLVQLKSKYSQVSVSFRRMLPFLTDLFK 480
Query: 481 DNPENCASGNIDSPRPEKELPTMNLQSPSSNSHNSWDRSEGLASCNMPCDGSLDTPSMPG 540
DNPENCASGNIDSPRPEKELPTMNLQSPSSNSHNSWDRSEGLASCNMPCDGSLDTPSMPG
Sbjct: 481 DNPENCASGNIDSPRPEKELPTMNLQSPSSNSHNSWDRSEGLASCNMPCDGSLDTPSMPG 540
Query: 541 SNTMNEMVCETEKVLLHNGLNDELLSSPKLQMHHLHSEQEMLDTCMLKVDPQLHDQAVLS 600
SNTMNEMVCETEKVLLHNGLNDELLSSPKLQMHHLHSEQEMLDTCMLKVDPQLHDQAVLS
Sbjct: 541 SNTMNEMVCETEKVLLHNGLNDELLSSPKLQMHHLHSEQEMLDTCMLKVDPQLHDQAVLS 600
Query: 601 SYDPLTGKGSRMVSQQSPITSEVCTNLTDNVSDAAKLSERNSLEPNSLCVEGCVPVIRIN 660
SYDPLTGKGSRMVSQQSPITSEVCTNLTDNVSDAAKLSERNSLEPNSLCVEGCVPVIRIN
Sbjct: 601 SYDPLTGKGSRMVSQQSPITSEVCTNLTDNVSDAAKLSERNSLEPNSLCVEGCVPVIRIN 660
Query: 661 VGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVASDLMKELSFIRDV 720
VGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVASDLMKELSFIRDV
Sbjct: 661 VGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVASDLMKELSFIRDV 720
Query: 721 LEKCSDGAYGDAGYYSNKVKEACRKASEAELVAKDRLLQMNCKLDIQSRIMCPQRPNVRF 780
LEKCSDGAYGDAGYYSNKVKEACRKASEAELVAKDRLLQMNCKLDIQSRIMCPQRPNVRF
Sbjct: 721 LEKCSDGAYGDAGYYSNKVKEACRKASEAELVAKDRLLQMNCKLDIQSRIMCPQRPNVRF 780
Query: 781 SSEIKKRKIEGGK 794
SSEIKKRKIEGGK
Sbjct: 781 SSEIKKRKIEGGK 793
BLAST of CmoCh08G008370 vs. NCBI nr
Match:
XP_022964024.1 (uncharacterized protein LOC111464172 isoform X2 [Cucurbita moschata])
HSP 1 Score: 1493.8 bits (3866), Expect = 0.0e+00
Identity = 761/793 (95.96%), Positives = 761/793 (95.96%), Query Frame = 0
Query: 1 MASKRSSIVHQPQPLQAGFLHLPRKKPKSLPPSDELASKVGDNIPYYAAKDLRIKRVFSP 60
MASKRSSIVHQPQPLQAGFLHLPRKKPKSLPPSDELASKVGDNIPYYAAKDLRIKRVFSP
Sbjct: 1 MASKRSSIVHQPQPLQAGFLHLPRKKPKSLPPSDELASKVGDNIPYYAAKDLRIKRVFSP 60
Query: 61 NFDNRSSLPSEGQISDREARITPNGTCPNGDSGVGKSSITAEVRNENFCNSSGYVELGDE 120
NFDNRSSLPSEGQISDREARITPNGTCPNGDSGVGKSSITAEVRNENFCNSSGYVELGDE
Sbjct: 61 NFDNRSSLPSEGQISDREARITPNGTCPNGDSGVGKSSITAEVRNENFCNSSGYVELGDE 120
Query: 121 DRRCSGKIVERVHSTPPDAEVMAGGLVAASSNGCPRSSNGSVLGDTCAKADCRIDAVTRT 180
DRRCSGKIVERVHSTPPDAEVMAGGLVAASSNGCPRSSNGSVLGDTCAKADCRIDAVTRT
Sbjct: 121 DRRCSGKIVERVHSTPPDAEVMAGGLVAASSNGCPRSSNGSVLGDTCAKADCRIDAVTRT 180
Query: 181 GSVLKPCSKRKLFKAPGSIAYKRLLPFLPDGDNYILRGDLCSKREKKLEKKENIESNLCN 240
GSVLKPCSKRKLFKAPGSIAYKRLLPFLPDGDNYILRGDLCSKREKKLEKKENIESNLCN
Sbjct: 181 GSVLKPCSKRKLFKAPGSIAYKRLLPFLPDGDNYILRGDLCSKREKKLEKKENIESNLCN 240
Query: 241 RANESSFIDSDTSVNNAVLAYGISCNTMKLNSTPPYNGDAKNFQNGSDSRNDPTLVKENS 300
RANESSFIDSDTSVNNAVLAYGISCNTMKLNSTPPYNGDAKNFQNGSDSRNDPTLVKENS
Sbjct: 241 RANESSFIDSDTSVNNAVLAYGISCNTMKLNSTPPYNGDAKNFQNGSDSRNDPTLVKENS 300
Query: 301 GLKRDVVSVSSLDKKLTENGGPSEHQIEDRFSNEQSKTFVIERLDGGDPFISSNLSSEVD 360
GLKRDVVSVSSLDKKLTENGGPSEHQIEDRFSNEQSKTFVIERLDGGDPFISSNLSSEVD
Sbjct: 301 GLKRDVVSVSSLDKKLTENGGPSEHQIEDRFSNEQSKTFVIERLDGGDPFISSNLSSEVD 360
Query: 361 NFKSHVSEKLCNNVSEDIKSENHSKVEIEISSLDSNIACNLVKEERKNEKVSCTRGTDQN 420
NFKSHVSEKLCNNVSEDIKSENHSKVEIEISSLDSNIACNLVKEERKNEKVSCTRGTDQN
Sbjct: 361 NFKSHVSEKLCNNVSEDIKSENHSKVEIEISSLDSNIACNLVKEERKNEKVSCTRGTDQN 420
Query: 421 LGSSTVGENDFNFATESDKKYGPCVRNKVVRNPLVQLKSKYSQVSVSFRRMLPFLTDLFK 480
LGSSTVGENDFNFATESDKKYGPCVRNKVVRNPLVQLKSKYSQVSVSFRRMLPFLTDLFK
Sbjct: 421 LGSSTVGENDFNFATESDKKYGPCVRNKVVRNPLVQLKSKYSQVSVSFRRMLPFLTDLFK 480
Query: 481 DNPENCASGNIDSPRPEKELPTMNLQSPSSNSHNSWDRSEGLASCNMPCDGSLDTPSMPG 540
DNPENCASGNIDSPRPEKELPTMNLQSPSSNSHNSWDRSEGLASCNMPCDGSLDTPSMPG
Sbjct: 481 DNPENCASGNIDSPRPEKELPTMNLQSPSSNSHNSWDRSEGLASCNMPCDGSLDTPSMPG 540
Query: 541 SNTMNEMVCETEKVLLHNGLNDELLSSPKLQMHHLHSEQEMLDTCMLKVDPQLHDQAVLS 600
SNTMNEM EMLDTCMLKVDPQLHDQAVLS
Sbjct: 541 SNTMNEM--------------------------------EMLDTCMLKVDPQLHDQAVLS 600
Query: 601 SYDPLTGKGSRMVSQQSPITSEVCTNLTDNVSDAAKLSERNSLEPNSLCVEGCVPVIRIN 660
SYDPLTGKGSRMVSQQSPITSEVCTNLTDNVSDAAKLSERNSLEPNSLCVEGCVPVIRIN
Sbjct: 601 SYDPLTGKGSRMVSQQSPITSEVCTNLTDNVSDAAKLSERNSLEPNSLCVEGCVPVIRIN 660
Query: 661 VGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVASDLMKELSFIRDV 720
VGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVASDLMKELSFIRDV
Sbjct: 661 VGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVASDLMKELSFIRDV 720
Query: 721 LEKCSDGAYGDAGYYSNKVKEACRKASEAELVAKDRLLQMNCKLDIQSRIMCPQRPNVRF 780
LEKCSDGAYGDAGYYSNKVKEACRKASEAELVAKDRLLQMNCKLDIQSRIMCPQRPNVRF
Sbjct: 721 LEKCSDGAYGDAGYYSNKVKEACRKASEAELVAKDRLLQMNCKLDIQSRIMCPQRPNVRF 761
Query: 781 SSEIKKRKIEGGK 794
SSEIKKRKIEGGK
Sbjct: 781 SSEIKKRKIEGGK 761
BLAST of CmoCh08G008370 vs. NCBI nr
Match:
KAG7025965.1 (hypothetical protein SDJN02_12463, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1454.5 bits (3764), Expect = 0.0e+00
Identity = 745/793 (93.95%), Positives = 755/793 (95.21%), Query Frame = 0
Query: 1 MASKRSSIVHQPQPLQAGFLHLPRKKPKSLPPSDELASKVGDNIPYYAAKDLRIKRVFSP 60
MASKRSSIVHQPQPLQAGFLHLPRKKPKSLP SDELASKVGD I YYAAKDLRIKRVFSP
Sbjct: 1 MASKRSSIVHQPQPLQAGFLHLPRKKPKSLPQSDELASKVGDKISYYAAKDLRIKRVFSP 60
Query: 61 NFDNRSSLPSEGQISDREARITPNGTCPNGDSGVGKSSITAEVRNENFCNSSGYVELGDE 120
N DNRSS+ SEGQISD++ IT NG CPNGDSGVGKSSITAEVRNENFCNSSGYVELGDE
Sbjct: 61 NLDNRSSVRSEGQISDKDGPITANGICPNGDSGVGKSSITAEVRNENFCNSSGYVELGDE 120
Query: 121 DRRCSGKIVERVHSTPPDAEVMAGGLVAASSNGCPRSSNGSVLGDTCAKADCRIDAVTRT 180
DRRC+GKIVERVHSTPPDAEV+AGGLVAASSNGCPRSS+GSVLGD CAK DCRID+VTRT
Sbjct: 121 DRRCNGKIVERVHSTPPDAEVLAGGLVAASSNGCPRSSHGSVLGDICAKVDCRIDSVTRT 180
Query: 181 GSVLKPCSKRKLFKAPGSIAYKRLLPFLPDGDNYILRGDLCSKREKKLEKKENIESNLCN 240
GSVLKPCSKRKLFKAPGSIAYKRLLPFL DGDNYIL+GDLCSKREK LEKKENIESNLCN
Sbjct: 181 GSVLKPCSKRKLFKAPGSIAYKRLLPFLLDGDNYILQGDLCSKREKNLEKKENIESNLCN 240
Query: 241 RANESSFIDSDTSVNNAVLAYGISCNTMKLNSTPPYNGDAKNFQNGSDSRNDPTLVKENS 300
RANESSFIDSDTSVNNAVLAYGIS NTMKLNSTPP NGDAKNFQNGSDSRNDPTLVKENS
Sbjct: 241 RANESSFIDSDTSVNNAVLAYGISRNTMKLNSTPPDNGDAKNFQNGSDSRNDPTLVKENS 300
Query: 301 GLKRDVVSVSSLDKKLTENGGPSEHQIEDRFSNEQSKTFVIERLDGGDPFISSNLSSEVD 360
GLKRDVVSVSSLDKKLTENGGPSEHQIEDRFSNEQSKTFVIERLDGGDPFISSNLSSEVD
Sbjct: 301 GLKRDVVSVSSLDKKLTENGGPSEHQIEDRFSNEQSKTFVIERLDGGDPFISSNLSSEVD 360
Query: 361 NFKSHVSEKLCNNVSEDIKSENHSKVEIEISSLDSNIACNLVKEERKNEKVSCTRGTDQN 420
NFKSHVSEKLC+NVSED KSENHSK EI+ISSLDSNI CNLVKEERKNEKVSCTRGTDQN
Sbjct: 361 NFKSHVSEKLCHNVSEDTKSENHSKEEIKISSLDSNIGCNLVKEERKNEKVSCTRGTDQN 420
Query: 421 LGSSTVGENDFNFATESDKKYGPCVRNKVVRNPLVQLKSKYSQVSVSFRRMLPFLTDLFK 480
LGSSTVGEND NFATESDKKYGPCVRNKVVRNPLVQLKSKYSQVSVSFR
Sbjct: 421 LGSSTVGENDCNFATESDKKYGPCVRNKVVRNPLVQLKSKYSQVSVSFR----------- 480
Query: 481 DNPENCASGNIDSPRPEKELPTMNLQSPSSNSHNSWDRSEGLASCNMPCDGSLDTPSMPG 540
ASGNIDSPRPEKELPTMNLQSPSSNSHNSWDRSEGLASCNMPCDGSLDTPSMPG
Sbjct: 481 ------ASGNIDSPRPEKELPTMNLQSPSSNSHNSWDRSEGLASCNMPCDGSLDTPSMPG 540
Query: 541 SNTMNEMVCETEKVLLHNGLNDELLSSPKLQMHHLHSEQEMLDTCMLKVDPQLHDQAVLS 600
SNTMNEMVCETEKVLLHNGLNDELLSSPKLQMHHLHSEQEMLDT MLKVDPQLHDQAVLS
Sbjct: 541 SNTMNEMVCETEKVLLHNGLNDELLSSPKLQMHHLHSEQEMLDTRMLKVDPQLHDQAVLS 600
Query: 601 SYDPLTGKGSRMVSQQSPITSEVCTNLTDNVSDAAKLSERNSLEPNSLCVEGCVPVIRIN 660
SYDPLTGKGSRMVSQQSPITSE CTNLTDNVSDAAKLSERNSLEPNSLCVEGCVPVIRIN
Sbjct: 601 SYDPLTGKGSRMVSQQSPITSEGCTNLTDNVSDAAKLSERNSLEPNSLCVEGCVPVIRIN 660
Query: 661 VGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVASDLMKELSFIRDV 720
VGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVASDLMKELSFIRDV
Sbjct: 661 VGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVASDLMKELSFIRDV 720
Query: 721 LEKCSDGAYGDAGYYSNKVKEACRKASEAELVAKDRLLQMNCKLDIQSRIMCPQRPNVRF 780
LEKCSDGAYGDAGYYSNKVKEACRKASEAELVAKDRLLQMNCKLDIQSRIMCPQRPNVRF
Sbjct: 721 LEKCSDGAYGDAGYYSNKVKEACRKASEAELVAKDRLLQMNCKLDIQSRIMCPQRPNVRF 776
Query: 781 SSEIKKRKIEGGK 794
SSEIKKRKIEGGK
Sbjct: 781 SSEIKKRKIEGGK 776
BLAST of CmoCh08G008370 vs. NCBI nr
Match:
XP_023514501.1 (uncharacterized protein LOC111778760 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1453.7 bits (3762), Expect = 0.0e+00
Identity = 746/794 (93.95%), Positives = 762/794 (95.97%), Query Frame = 0
Query: 1 MASKRSSIVHQPQPLQAGFLHLPRKKPKSLPPSDELASKVGDNIPYYAAKDLRIKRVFSP 60
MASKRSSIVHQPQPLQAGFLHLPRKKPKSLP SDELASKVGDNI YYAAKDLRIKRVFSP
Sbjct: 1 MASKRSSIVHQPQPLQAGFLHLPRKKPKSLPQSDELASKVGDNISYYAAKDLRIKRVFSP 60
Query: 61 NFDNRSSLPSEGQISDREARITPNGTCPNGDSGVGKSSITAEVRNENFCNSSGYVELGDE 120
N DNRSS PSEGQISD++ IT NGTCPNGDSGVGK SITAEVRNENF NS+G VELGDE
Sbjct: 61 NLDNRSSEPSEGQISDKDGPITANGTCPNGDSGVGKISITAEVRNENFRNSNGCVELGDE 120
Query: 121 DRRCSGKIVERVHSTPPDAEVMAGGLVAASSNGCPRSSNGSVLGDTCAKADCRIDAVTRT 180
DRRC GK VE VHSTPPDAEV+AGGLVAASSNGCPRSS+GSVLGD CAKADCRID+VTRT
Sbjct: 121 DRRCDGKSVELVHSTPPDAEVLAGGLVAASSNGCPRSSHGSVLGDICAKADCRIDSVTRT 180
Query: 181 GSVLKPCSKRKLFKAPGSIAYKRLLPFLPDGDNYILRGDLCSKREKKLEKKENIESNLCN 240
GSVLKPCSKRKLFKAPGSIAYKRLLPFL DGDNYIL+GDLCSKREK L KKENIESNLCN
Sbjct: 181 GSVLKPCSKRKLFKAPGSIAYKRLLPFLLDGDNYILQGDLCSKREKNL-KKENIESNLCN 240
Query: 241 RANESSFIDSDTSVNNAVLAYGISCNTMKLNSTPPYNGDAKNFQNGSDSRNDPTLVKENS 300
RANESSFIDSDTSVNNAVLAYGISCNTMKLNSTPP NGDAKNFQNGSDSRNDPTLVKENS
Sbjct: 241 RANESSFIDSDTSVNNAVLAYGISCNTMKLNSTPPDNGDAKNFQNGSDSRNDPTLVKENS 300
Query: 301 GLKRDVVSVSSLDKKLTENGGPSEHQIEDRFSNEQSKTFVIERLDGGDPFISSNLSSEVD 360
GLKRDVVSVSSLDKKLTENGGPSE+QIEDRFSNEQSKTFVIERLDGGDPFISSNLSSEVD
Sbjct: 301 GLKRDVVSVSSLDKKLTENGGPSENQIEDRFSNEQSKTFVIERLDGGDPFISSNLSSEVD 360
Query: 361 NFKSHVSEKLCNNVSEDIKSENHSKVEIEISSLDSNIACNLVKEERKNEKVSCTRGTDQN 420
NFKSHVSEKLCNNVSEDIKSENHSK EI+ISSLDS+IACNLVKEERKNEKVSCTRGTDQ
Sbjct: 361 NFKSHVSEKLCNNVSEDIKSENHSKEEIKISSLDSDIACNLVKEERKNEKVSCTRGTDQY 420
Query: 421 LGSSTVGENDFNFATESDKKYGPCVRNKVVRNPLVQLKSKYSQVSVSFRRMLPFLTDLFK 480
LGSSTVGEND N ATESDKKYGPCVRNKVVRNPLVQLKSKY+QVSVS+RRMLPFL DLFK
Sbjct: 421 LGSSTVGENDCNIATESDKKYGPCVRNKVVRNPLVQLKSKYNQVSVSYRRMLPFLEDLFK 480
Query: 481 DNPENCASGNIDSPRPEKELPTMNLQSPSSNSHNSWDRSEGLASCNMPCDGSLDTPSMPG 540
DNPENCASGNIDSPRPEKELPTMNLQSPSSNSHNS DRSEGLASCNMPCDGSLDTPSMPG
Sbjct: 481 DNPENCASGNIDSPRPEKELPTMNLQSPSSNSHNSRDRSEGLASCNMPCDGSLDTPSMPG 540
Query: 541 SNTMNEMVCETEKVLLHNGLNDELLSSPKLQMHHLHSEQEMLDTCMLKVDPQLHDQAVLS 600
SNTMNEMVCETEKVLLHNGLNDELLSSPKLQMHHLHSEQEMLDTCMLKVDPQL+DQAVLS
Sbjct: 541 SNTMNEMVCETEKVLLHNGLNDELLSSPKLQMHHLHSEQEMLDTCMLKVDPQLYDQAVLS 600
Query: 601 SYDPLTGKGSRMVSQQSPITSEVCTNLTDNVSDAAKLSERNSLEPNSLCVEGCV-PVIRI 660
SYD LTGKGSRMVSQQSPITSE CTNLTDNVSDAAKLSERNSLEPN LCVE CV P RI
Sbjct: 601 SYDLLTGKGSRMVSQQSPITSEGCTNLTDNVSDAAKLSERNSLEPNPLCVEACVLPARRI 660
Query: 661 NVGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVASDLMKELSFIRD 720
NVGKGI KQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDA+VVASDLMKELSFIRD
Sbjct: 661 NVGKGIRKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAKVVASDLMKELSFIRD 720
Query: 721 VLEKCSDGAYGDAGYYSNKVKEACRKASEAELVAKDRLLQMNCKLDIQSRIMCPQRPNVR 780
VLEKCSDGAYGDAGY+SNKVKEACRKASEAELVAK+RLLQMNCKLDIQSRIMCPQRPNVR
Sbjct: 721 VLEKCSDGAYGDAGYHSNKVKEACRKASEAELVAKNRLLQMNCKLDIQSRIMCPQRPNVR 780
Query: 781 FSSEIKKRKIEGGK 794
FSSEIKKRKIEGGK
Sbjct: 781 FSSEIKKRKIEGGK 793
BLAST of CmoCh08G008370 vs. NCBI nr
Match:
KAG6593619.1 (hypothetical protein SDJN03_13095, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 1420.2 bits (3675), Expect = 0.0e+00
Identity = 728/793 (91.80%), Positives = 739/793 (93.19%), Query Frame = 0
Query: 1 MASKRSSIVHQPQPLQAGFLHLPRKKPKSLPPSDELASKVGDNIPYYAAKDLRIKRVFSP 60
MASKRSSIVHQPQPLQAGFLHLPRKKPKSLP SDELASKVGD I YYAAKDLRIKRVFSP
Sbjct: 1 MASKRSSIVHQPQPLQAGFLHLPRKKPKSLPQSDELASKVGDKISYYAAKDLRIKRVFSP 60
Query: 61 NFDNRSSLPSEGQISDREARITPNGTCPNGDSGVGKSSITAEVRNENFCNSSGYVELGDE 120
N DNRSS+ SEGQISD++ IT NG CPNGDSGVGKSSITAEVRN+NFCNSSGYVELGDE
Sbjct: 61 NLDNRSSVRSEGQISDKDGPITANGICPNGDSGVGKSSITAEVRNDNFCNSSGYVELGDE 120
Query: 121 DRRCSGKIVERVHSTPPDAEVMAGGLVAASSNGCPRSSNGSVLGDTCAKADCRIDAVTRT 180
DRRC+GKIVERVHSTPPDAEV+AGGLVAASSNGCPRSS+GSVLGD CAK DCRID+VTRT
Sbjct: 121 DRRCNGKIVERVHSTPPDAEVLAGGLVAASSNGCPRSSHGSVLGDICAKVDCRIDSVTRT 180
Query: 181 GSVLKPCSKRKLFKAPGSIAYKRLLPFLPDGDNYILRGDLCSKREKKLEKKENIESNLCN 240
GSVLKPCSKRKLFKAPGSIAYKRLLPFL DGDNYIL+
Sbjct: 181 GSVLKPCSKRKLFKAPGSIAYKRLLPFLLDGDNYILQ----------------------- 240
Query: 241 RANESSFIDSDTSVNNAVLAYGISCNTMKLNSTPPYNGDAKNFQNGSDSRNDPTLVKENS 300
DTSVNNAVLAYGIS NTMKLNSTPP NGDAKNFQNGSDSRNDPTLVKENS
Sbjct: 241 ----------DTSVNNAVLAYGISRNTMKLNSTPPDNGDAKNFQNGSDSRNDPTLVKENS 300
Query: 301 GLKRDVVSVSSLDKKLTENGGPSEHQIEDRFSNEQSKTFVIERLDGGDPFISSNLSSEVD 360
GLKRDVVSVSSLDKKLTENGGPSEHQIEDRFSNEQSKTFVIERLDGGDPFISSNLSSEVD
Sbjct: 301 GLKRDVVSVSSLDKKLTENGGPSEHQIEDRFSNEQSKTFVIERLDGGDPFISSNLSSEVD 360
Query: 361 NFKSHVSEKLCNNVSEDIKSENHSKVEIEISSLDSNIACNLVKEERKNEKVSCTRGTDQN 420
NFKSHVSEKLC+NVSED KSENHSK EI+ISSLDSNI CNLVKEERKNEKVSCTRGTDQN
Sbjct: 361 NFKSHVSEKLCHNVSEDTKSENHSKEEIKISSLDSNIGCNLVKEERKNEKVSCTRGTDQN 420
Query: 421 LGSSTVGENDFNFATESDKKYGPCVRNKVVRNPLVQLKSKYSQVSVSFRRMLPFLTDLFK 480
LGSSTVGEND NFATESDKKYGPCVRNKVVRNPLVQLKSKYSQVSVSFRRMLPFLTDLFK
Sbjct: 421 LGSSTVGENDCNFATESDKKYGPCVRNKVVRNPLVQLKSKYSQVSVSFRRMLPFLTDLFK 480
Query: 481 DNPENCASGNIDSPRPEKELPTMNLQSPSSNSHNSWDRSEGLASCNMPCDGSLDTPSMPG 540
NPENCASGNIDSPRPEKELPTMNLQSPSSNSHNSWDRSEGLASCNMPCDGSLDTPSMPG
Sbjct: 481 YNPENCASGNIDSPRPEKELPTMNLQSPSSNSHNSWDRSEGLASCNMPCDGSLDTPSMPG 540
Query: 541 SNTMNEMVCETEKVLLHNGLNDELLSSPKLQMHHLHSEQEMLDTCMLKVDPQLHDQAVLS 600
SNTMNEMVCETEKVLLHNGLNDELLSSPKLQMHHLHSEQEMLDT MLKVDPQLHDQAVLS
Sbjct: 541 SNTMNEMVCETEKVLLHNGLNDELLSSPKLQMHHLHSEQEMLDTRMLKVDPQLHDQAVLS 600
Query: 601 SYDPLTGKGSRMVSQQSPITSEVCTNLTDNVSDAAKLSERNSLEPNSLCVEGCVPVIRIN 660
SYDPLTGKGSRMVSQQSPITSE CTNLTDNVSDAAKLSERNSLEPNSLCVEGCVPVIRIN
Sbjct: 601 SYDPLTGKGSRMVSQQSPITSEGCTNLTDNVSDAAKLSERNSLEPNSLCVEGCVPVIRIN 660
Query: 661 VGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVASDLMKELSFIRDV 720
VGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVASDLMKELSFIRDV
Sbjct: 661 VGKGIPKQNPRGCRGICNCLNCSSFRLHAERAFEFSRNQLQDAEVVASDLMKELSFIRDV 720
Query: 721 LEKCSDGAYGDAGYYSNKVKEACRKASEAELVAKDRLLQMNCKLDIQSRIMCPQRPNVRF 780
LEKCSDGAYGDAGYYSNKVKEACRKASEAELVAKDRLLQMNCKLDIQSRIMCPQRPNVRF
Sbjct: 721 LEKCSDGAYGDAGYYSNKVKEACRKASEAELVAKDRLLQMNCKLDIQSRIMCPQRPNVRF 760
Query: 781 SSEIKKRKIEGGK 794
SSEIKKRKIEGGK
Sbjct: 781 SSEIKKRKIEGGK 760
BLAST of CmoCh08G008370 vs. TAIR 10
Match:
AT3G23740.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 9 plant structures; EXPRESSED DURING: C globular stage, F mature embryo stage, petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G14120.1); Has 155 Blast hits to 130 proteins in 48 species: Archae - 0; Bacteria - 16; Metazoa - 19; Fungi - 48; Plants - 47; Viruses - 0; Other Eukaryotes - 25 (source: NCBI BLink). )
HSP 1 Score: 140.6 bits (353), Expect = 5.5e-33
Identity = 120/340 (35.29%), Positives = 175/340 (51.47%), Query Frame = 0
Query: 465 SVSFRRMLPFLTDLFKDNPENCASGNIDSPRPEKELPTMNLQSPSSNSHNSWDRSEGLAS 524
SV++RRMLP+L D+ +DN P P+K + + SP NS + + ++ + +
Sbjct: 249 SVNYRRMLPYLKDIQEDN-----------PYPQKNTEEV-ISSPMLNSESDNEGTQEVVT 308
Query: 525 CNMPCDGSLDTPSMPGSNTMNE--MVCETEKVLLHNGLNDELLSSPKLQMHHLHSEQEML 584
N+ T S+ NE + CE V L D+ + Q+ H+ + E
Sbjct: 309 SNV-------TRESGTSSDENEEPLPCERVPVNLEQSDPDK---EQETQIKHVIPDTE-- 368
Query: 585 DTCMLKVDPQLHDQAVLSSYDPLTGKGSRMVSQQSPITSEVCTNLT--DNVSDA----AK 644
L + LSS PL G S S + + NL +N++ A AK
Sbjct: 369 --------NNLGSEIPLSS--PLVGSRSSSEVNSSALHNTFVDNLVGEENMNGAEITEAK 428
Query: 645 LS----ERNSLEPNSLCVEGCVPVI---RINVGKGIPKQNPRGCRGICNCLNCSSFRLHA 704
+S E +S + + V+ V + + KGI K++ RGCRGIC+CLNCSSFRLHA
Sbjct: 429 ISAEELEAHSSDATAELVDPSVILATPSSFSPSKGILKRSMRGCRGICSCLNCSSFRLHA 488
Query: 705 ERAFEFSRNQLQDAEVVASDLMKELSFIRDVLEKCSDGAYGDAGYYSNKVKEACRKASEA 764
ERAFEFSRNQLQD EV+ DL+ E+S +RD+LEK + + + Y ++ EA ++A EA
Sbjct: 489 ERAFEFSRNQLQDTEVMVLDLVGEISHLRDLLEKYNSADHSEP--YKSQAGEASKRACEA 548
Query: 765 ELVAKDRLLQMNCKLDIQSRIMCPQRPNVRFSSEIKKRKI 790
+AK RL QMN L I RI QR V+F+ I ++ I
Sbjct: 549 AELAKSRLHQMNDDLQIHYRIPNEQRARVKFAHYIHEKTI 552
HSP 2 Score: 47.4 bits (111), Expect = 6.3e-05
Identity = 101/402 (25.12%), Positives = 150/402 (37.31%), Query Frame = 0
Query: 35 ELASKVGDNIPYYAAKDLRIKRVFSPNFDNRSSLPSEGQISDREARITPNGTCPNGDSGV 94
++ SK+ P KD+R++RVFSP TP T
Sbjct: 55 KIDSKITPTYPRATIKDIRLRRVFSP---------------------TPIST-------- 114
Query: 95 GKSSITAEVRNENFCNSSGYVELGDEDRRCSGKIVERVHSTPPDAEVMAGGLVAASSNGC 154
+++N + L D D S + +TPPD+E++A
Sbjct: 115 ---DCECNCKDKNQTKRQNHECLCDCDNSNSDDFAQ---TTPPDSELLA----------I 174
Query: 155 PRSSNGSVLGDTCAKADCRIDAVTRTGSVLKPCSKRKLFKAPGSIAYKRLLPFLPD-GDN 214
NGSV+ + D SVL PCS+ K+FK G +YKRLLP+L D+
Sbjct: 175 SEEINGSVVN--------KSDTNLWRKSVLLPCSRPKIFKNTGPFSYKRLLPYLMQASDD 234
Query: 215 YILRGDLCSKREKKLEKKENIES--NLCNRANESSFIDSDTSVNNAVLAYGISCNTM--- 274
CSK + K +S ++ ++ + SF DTS +V+A + N
Sbjct: 235 GTSSSSRCSKSLSQNITKPVSQSMDSVYDKDSTGSFC-RDTSPLKSVIASTPNKNAAFSR 294
Query: 275 -KLNSTP---------PYNGDAKN-----FQNGSDSRNDPTLVKE--NSGLKRDVVSVSS 334
KL TP PY D + +N + + P L E N G + V S
Sbjct: 295 GKLFKTPGSVNYRRMLPYLKDIQEDNPYPQKNTEEVISSPMLNSESDNEGTQEVVTS--- 354
Query: 335 LDKKLTENGGPSEHQIEDRFSNEQSKTFVIERLDGGDPFISSNLSSEVDNFKSHVSEKLC 394
+T G S + E+ E+ V L+ DP E + HV
Sbjct: 355 ---NVTRESGTSSDENEEPLPCER----VPVNLEQSDP------DKEQETQIKHVIPDTE 386
Query: 395 NNVSEDIKSE-----NHSKVEIEISSLDSNIACNLVKEERKN 409
NN+ +I + S E+ S+L + NLV EE N
Sbjct: 415 NNLGSEIPLSSPLVGSRSSSEVNSSALHNTFVDNLVGEENMN 386
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
A0A6J1HGP9 | 0.0e+00 | 100.00 | uncharacterized protein LOC111464172 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1HLX0 | 0.0e+00 | 95.96 | uncharacterized protein LOC111464172 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1KHD4 | 0.0e+00 | 89.49 | uncharacterized protein LOC111494390 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1KEX1 | 0.0e+00 | 85.73 | uncharacterized protein LOC111494390 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1H4T6 | 2.2e-292 | 68.24 | uncharacterized protein LOC111460150 OS=Cucurbita moschata OX=3662 GN=LOC1114601...
Match Name | E-value | Identity | Description
XP_022964022.1 | 0.0e+00 | 100.00 | uncharacterized protein LOC111464172 isoform X1 [Cucurbita moschata]
XP_022964024.1 | 0.0e+00 | 95.96 | uncharacterized protein LOC111464172 isoform X2 [Cucurbita moschata]
KAG7025965.1 | 0.0e+00 | 93.95 | hypothetical protein SDJN02_12463, partial [Cucurbita argyrosperma subsp. argyro...
XP_023514501.1 | 0.0e+00 | 93.95 | uncharacterized protein LOC111778760 isoform X1 [Cucurbita pepo subsp. pepo]
KAG6593619.1 | 0.0e+00 | 91.80 | hypothetical protein SDJN03_13095, partial [Cucurbita argyrosperma subsp. sorori...
Match Name | E-value | Identity | Description
AT3G23740.1 | 5.5e-33 | 35.29 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...