Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, CDS, polypeptide
TGTCGCGTTTCCGCCCAAAAAGGATTTTAAAGCTTGAAGTGCGGACTTCTCTGCTTAAACCTCCATTTTCTCCAAGCTTCGAAACTATAGCCATGGCCGACGAAGATCCACCATCGGAGCTCTCAGTGACATTGCAGAAACCGAAACTCGACGAAATCCTGAACGTCAGTAAGGCCACCACTGAGTCCGATGTTCTGGGAGGCCTCGACTCGTCTTGCAATTGTTCCAACGAAACGAAGCCCGCTGCTGAATCCACTGCTCAAACCTCTGATGGAAGTGGTGAAAAATCGTTGGAGCTGGCCGAAGAGTTGCTGGAGAAGGGTTCCAAGGCTATTAAGGACAATGATTTCGTTGAGGCTGTCGATTGCTTCAGCCGTGCCCTTGAGATTAGGTTAGGGTTTGGTTATTTTTCACTTTCTCTTCAGTTTCTTCTTGATTATGAATTGAATTCTGGAACGAAGATGAAAGTCGAGCTTCTATTACCCTTGGAGTCTTAAAGTTGATTTGTTACAATTGCTTCACTCTGTTAGATCACATGTTTCCACTTATCATTTAGTCAGTTTTAAGATTTTCATCTTGGTAAATCTTAATTCCATTTTCTTTAGCGGTGGGGGAGCTTTTCGTTGCTCCAGATTCATTTTCGGAACTTTATCTTTCTTTTGTTGAAGATCAACCGCTCGTCAGGATCACCGAACAGTCTTGGTAGATATGAATTGAGTTTTAACGAGGTTTTGTAATGTTGTTAGGTTCTAACTTTATTTAACTTTTGCATATAAGAACTTGTTGCTCATTTTTATTTCTTGTCTATATCATTGGATGCCTGTTGCGAACATGTCTCTTTTACGATGTGAAAAGTGTATTCTTAAATCAAGCATTTGCATGCGTTTAGTTTCATCCTTTGATTTGTTTTTTCTAGCAATAATCTTGAAAATATTTCACTGCAGAGCTGCGCGATATGGTGAACTGGCTCCGGAATGTGTTAAATTGTACTACAAATATGGATGTGCTTTATTGTACAAAGCCCAGGAGGAGGCAGATCCACTGGGAGCTGTCCCAAAGAAAGAAGGTGAACCCTATCAACAATCTTACAAAGATGAATCTGTTAAGAATGTTGAAAATGGCGAGTCATCAAAAGCTTCTGTTTCCAGCAATGCTGAACTGGTGGATGGAGTTGCAGATGATGGTTGGTATCTTCCTCATGATGTCTAACTTATCTAGTTCCTAACATACATTGCCTTTCTCTTTTCCTGTTGGACACTTTCCAGTGGTGAAGGCAAGTACATTTTTGAAATTGAAAATTCTTTCATCTTCTTCCTAAGAAAGGAGGAGTTTCGGTTCAAATAAATTATTCGAAATTGAAAACTGAAGCTTGCAAACTATTCTTCTTAATTTTCTAGTCTTTTCGTTTCTTTTCGTTTCCTTGGATATTTTCTATTTTACATTATTTAAAATCTACATGAAACATGGCATGTTGCATCGTAATGAATTCCCCTTTAATATTTGACCAACTTGAAGAGGATTCCACCTTGATCTACTATTCTCTCGCCTTTCCCTTCCTTAAAGATAGAGAGGATCTTATATTATATACTTGAGCATTGAGAGCAACATGCTTCTGGTTTTATTTTATCTTTGTTTCATTGATGTATTTCTTGGTAACCTAAGCGATGAATTTGGTGCTACAGAAGCATACTGAGCATGAATAAATTGCTGCTGGTTTTATGGTGTCTAAGTTTCATTGAAGTAGTTCAACTGAATGCGAAGAGATCAATTACTTTTTTATAATGTGTTTTCACATCTTTTCAATGTATTGTTTTATTTCGATATTGAGATGTTACAAGCTCTATAGCTTTTTTTTTTCTTATGGACATCAGAATGTGATGATTTTCATTGTTTCGTGAAAAGCAGGATTTTTTATATATTGCTAAGTTAGAATGCATACACGTTAATCATTGATGACTATCATCATTTCCTGTTTAGAACACACCTACATGTTTCCTAGG
CTTCCAGCTCTAATTGGGTAGGTGGGTGGGTACCTTGGCTTCTGTGGAAGTCATTTGACTTGAGTCTAGGTGACTGACATACCTTGGCGTGGTGACTTATCGCAGTGGACATTCAAGTACCTTTCCTTATGTCTTGCATATAAGTGCCTTTACGGAAGCTTTGTCTTCCACTTTAGTTTCTTGTTTCCTCCTTTCACTTTGTCTCTATTGCTTCCTTTTATTCTCTTTTTTCAAAGTGCTTGCAGCAACATTTTATGAATTTTTATGTCCTGTAATATTTCAATATTTAGAAAGAATTACTTTTCCATGTGATATCATGCACCTCACCTCATCTAGGCTTGAATCTTCTTTTTTGCATCTCCAGTCTGGCTCCGTGCATTTTTTTTTTCATTTTTCAATCCACTTAGTGCTTTAGAAAATGCCGATCACTTTATACATTATTTTAGAATTTGATGAATTATAAATTTCTGAAGTTTTATTGTTTGAGTTGCAAATGGGAATATTAACTACAGGTCTTTAGGGATCTTTACAACTTATGTTCAACAAATTATTGATGCCTTATTGCTTTACTTCTGGATGTGTCTCCACTGTCACACTCGTCAATATTTACCTTCTCATGTACATGTTATCTTAAGGGGAAAAAAAAAAAAATAATAAAAAGAAATTAGCTGCAGTGATACTGCCACCCACCAGTTTTATTACAAAAAAAAAAAGGTTAATGCTATCATCATCCTATGTAGTCGTTTCAGAGCTTTGGGGCCCTAACTCCTAAGAAATAACTTGAATAAATTTTTGGTGATTAAGACAAGACATCTGCCTTGCAATTCTTTGCTAACTTGCTGATGTCTATCTGGCCATTTAATCTTAACTTCCCTTATGTGGTCAAAATACTGTTTTAGAAGAGTTTTTGCATTGAATCATCGTGTTTGCAACAACAGATTTTTATTGTATTCAACTTCAATAATTTTATAAATTTCTATCTATGTAATCAAGCACTTCGAGTAGAACAATTTTCTCGGGGCTTGTAGTTCACTGCTTTACACATCATTTGAATGAATTTGCTGTTTCTTGTTTTTAAAAGTACAATTATGTAGATTTGATGGTACCTACTGATGCATCAAAAAGAAGTTGTATCTTATCTTCCTTTATAAGAGTTTTGTTTGATGCATAATGTACATGGAATCTCAACTTCTATCATGTATCATTTTTACATGCATTATCTGTGAATCACTTAAATTATGAACAGTTACTAGGATGACAGAGTCGATTTTCAACTGTGAAGAGGCATGTATAAATGCTCTTCAATTTTGAATGATGGACTATCATGCTTCAACATTTTGTTCCGCTTGATTTTGATTGTGTTATTTTATTGCTGGTCTTCATTTTAAAATTATCCCCATCTCTTATCTATAAAAAATGTACATTTTAAAAAAAGTTATCTCATTAGGAGATTTTTGACCTTCCATTTTCCCTTTTTTTTTTCTTTGAGACAGTTTCCAGTAAGAAAGATCAGGATGAAGAAGAGGGTGATGATAGTGGTACTGAGGACTTAGCAGATGCAGATGAAGACGAATCTGACCTTGATTTAGCTTGGAAAATGCTAGACGTTGCCAGAGCAATAGTTGAAAAAGACTCGGGTGATACGATGGAGAAAGTGGACATATTGTCAGCCTTGGCAGAAGTTGCTCTGGAAAGAGGTACACATGCATTTTATTTGCTCTGTTGTGATACACATTCAATTGGTTTGGTTTGTGTTTCTTTGATATTATTCCCCTTTCTTTCTCCCCAGTTAGAGGGTTATCCTCTAATTTTGAAGTTCATTTAATTTTCTCTCCTTCATGTAGAGGACATTGAAACTTCCCTGAGCGACTACCAGAAAGCATTATCAATTTTAGAAAGACTTGTCGAACCTGATAATCGACAGCTTGCTGAATTGTATCCTTCTTTCAGATTCTAAATCTCATATAAGTACGTAAATAGAATATAATATATACGAAAAA
ATCTGTTTTCCAAATATTCTAACCGAGAATTCAAGATAAAAAGAAGAAGAAAAAACATCCTGTTTTCATGGTAGCCGACTTCAATTCTATTTATAATAGTAGCTTATAAATTCTGGGAGCTTGTTAGTCTGTATTTCTACCTCGTTTATAATAGTAGCTTATAAATTCTAGGAGCTTGATGATAAAAGTTTAGTTTTTAATTTAATTATATATTTCTTTTTCTTAAGAAAAAGAAAACCGTATGAATTGTTATTATTGACATTTGAACCACATCACTTCCTTTCAAGTGAGGAAAGTGTCTTTTGCAACCTTAAACTGAAACTTCCAGAAACTTCCGTGTATGCTTGTGCCTGGAGTTTGGTTCCCAGCCGCAGGAAGCCATTTCATTTTGCCAGAAGGCAATATCAATTTGCAAGTCACGTGTGATGCGGCTTACTGATGAAGTAAAGGGTATCATTGTACCGACCACAGCTTCTTCTACTTCTGGGTCAGAACCAGAGATCCCACTATCGTCCAATGACTCCCAGACTGACAATGCCAATGCTGCAACAGAGAAACAATCTGAGATTGAAATTCTATCTGGGCTTTTGGTCGAGCTAGAAAAGAAGGCGAGTTTAGATTATAACTTCTGCAAGTAACTGTGACAGAGAAACGGAAAAATCTAACCAATGTTTTGTGTTTGTGCAGCTTGAAGATCTGCAACAGCTGGCCTCAAACCCAAAGTCAATCCTCTCGGAGATCCTCGGGATAGGAGCGGCGAAGGCGAAAGTCGAAAAGATCGCTCCTCCACTTCCAGCGGTGTTGAACTCCTCACAGATGGGTTCAGCTAACAGCAATGGAGGATTTGACTCTCCAACAGTCTCAACTGCGCACACGAACGGCGCGGCTGGAGTGACGCACCTTGGCGTCGTTGGAAGAGGAGTCAAACGAGTATCAACAAATTCAGAGTCTGCTGACTCCTCCCACCCAACTAAGAAACGGGCAACAGATTCATCAACACAAGATAAAGGTGATGGCAGTTCCGCCTGA
mRNA sequence
TGTCGCGTTTCCGCCCAAAAAGGATTTTAAAGCTTGAAGTGCGGACTTCTCTGCTTAAACCTCCATTTTCTCCAAGCTTCGAAACTATAGCCATGGCCGACGAAGATCCACCATCGGAGCTCTCAGTGACATTGCAGAAACCGAAACTCGACGAAATCCTGAACGTCAGTAAGGCCACCACTGAGTCCGATGTTCTGGGAGGCCTCGACTCGTCTTGCAATTGTTCCAACGAAACGAAGCCCGCTGCTGAATCCACTGCTCAAACCTCTGATGGAAGTGGTGAAAAATCGTTGGAGCTGGCCGAAGAGTTGCTGGAGAAGGGTTCCAAGGCTATTAAGGACAATGATTTCGTTGAGGCTGTCGATTGCTTCAGCCGTGCCCTTGAGATTAGAGCTGCGCGATATGGTGAACTGGCTCCGGAATGTGTTAAATTGTACTACAAATATGGATGTGCTTTATTGTACAAAGCCCAGGAGGAGGCAGATCCACTGGGAGCTGTCCCAAAGAAAGAAGGTGAACCCTATCAACAATCTTACAAAGATGAATCTGTTAAGAATGTTGAAAATGGCGAGTCATCAAAAGCTTCTGTTTCCAGCAATGCTGAACTGGTGGATGGAGTTGCAGATGATGTTTCCAGTAAGAAAGATCAGGATGAAGAAGAGGGTGATGATAGTGGTACTGAGGACTTAGCAGATGCAGATGAAGACGAATCTGACCTTGATTTAGCTTGGAAAATGCTAGACGTTGCCAGAGCAATAGTTGAAAAAGACTCGGGTGATACGATGGAGAAAGTGGACATATTGTCAGCCTTGGCAGAAGTTGCTCTGGAAAGAGGTACACATGCATTTTATTTGCTCTGTTGTGATACACATTCAATTGGTTTGGTTTGTGTTTCTTTGATATTATTCCCCTTTCTTTCTCCCCAGTTAGAGGAGGACATTGAAACTTCCCTGAGCGACTACCAGAAAGCATTATCAATTTTAGAAAGACTTGTCGAACCTGATAATCGACAGCTTGCTGAATTAAACTTCCGTGTATGCTTGTGCCTGGAGTTTGGTTCCCAGCCGCAGGAAGCCATTTCATTTTGCCAGAAGGCAATATCAATTTGCAAGTCACGTGTGATGCGGCTTACTGATGAAGTAAAGGGTATCATTGTACCGACCACAGCTTCTTCTACTTCTGGGTCAGAACCAGAGATCCCACTATCGTCCAATGACTCCCAGACTGACAATGCCAATGCTGCAACAGAGAAACAATCTGAGATTGAAATTCTATCTGGGCTTTTGGTCGAGCTAGAAAAGAAGGCGAGTTTAGATTATAACTTCTGCAAAGAAACGGAAAAATCTAACCAATGTTTTGTGTTTGTGCAGCTTGAAGATCTGCAACAGCTGGCCTCAAACCCAAAGTCAATCCTCTCGGAGATCCTCGGGATAGGAGCGGCGAAGGCGAAAGTCGAAAAGATCGCTCCTCCACTTCCAGCGGTGTTGAACTCCTCACAGATGGGTTCAGCTAACAGCAATGGAGGATTTGACTCTCCAACAGTCTCAACTGCGCACACGAACGGCGCGGCTGGAGTGACGCACCTTGGCGTCGTTGGAAGAGGAGTCAAACGAGTATCAACAAATTCAGAGTCTGCTGACTCCTCCCACCCAACTAAGAAACGGGCAACAGATTCATCAACACAAGATAAAGGTGATGGCAGTTCCGCCTGA
Coding sequence (CDS)
ATGGCCGACGAAGATCCACCATCGGAGCTCTCAGTGACATTGCAGAAACCGAAACTCGACGAAATCCTGAACGTCAGTAAGGCCACCACTGAGTCCGATGTTCTGGGAGGCCTCGACTCGTCTTGCAATTGTTCCAACGAAACGAAGCCCGCTGCTGAATCCACTGCTCAAACCTCTGATGGAAGTGGTGAAAAATCGTTGGAGCTGGCCGAAGAGTTGCTGGAGAAGGGTTCCAAGGCTATTAAGGACAATGATTTCGTTGAGGCTGTCGATTGCTTCAGCCGTGCCCTTGAGATTAGAGCTGCGCGATATGGTGAACTGGCTCCGGAATGTGTTAAATTGTACTACAAATATGGATGTGCTTTATTGTACAAAGCCCAGGAGGAGGCAGATCCACTGGGAGCTGTCCCAAAGAAAGAAGGTGAACCCTATCAACAATCTTACAAAGATGAATCTGTTAAGAATGTTGAAAATGGCGAGTCATCAAAAGCTTCTGTTTCCAGCAATGCTGAACTGGTGGATGGAGTTGCAGATGATGTTTCCAGTAAGAAAGATCAGGATGAAGAAGAGGGTGATGATAGTGGTACTGAGGACTTAGCAGATGCAGATGAAGACGAATCTGACCTTGATTTAGCTTGGAAAATGCTAGACGTTGCCAGAGCAATAGTTGAAAAAGACTCGGGTGATACGATGGAGAAAGTGGACATATTGTCAGCCTTGGCAGAAGTTGCTCTGGAAAGAGGTACACATGCATTTTATTTGCTCTGTTGTGATACACATTCAATTGGTTTGGTTTGTGTTTCTTTGATATTATTCCCCTTTCTTTCTCCCCAGTTAGAGGAGGACATTGAAACTTCCCTGAGCGACTACCAGAAAGCATTATCAATTTTAGAAAGACTTGTCGAACCTGATAATCGACAGCTTGCTGAATTAAACTTCCGTGTATGCTTGTGCCTGGAGTTTGGTTCCCAGCCGCAGGAAGCCATTTCATTTTGCCAGAAGGCAATATCAATTTGCAAGTCACGTGTGATGCGGCTTACTGATGAAGTAAAGGGTATCATTGTACCGACCACAGCTTCTTCTACTTCTGGGTCAGAACCAGAGATCCCACTATCGTCCAATGACTCCCAGACTGACAATGCCAATGCTGCAACAGAGAAACAATCTGAGATTGAAATTCTATCTGGGCTTTTGGTCGAGCTAGAAAAGAAGGCGAGTTTAGATTATAACTTCTGCAAAGAAACGGAAAAATCTAACCAATGTTTTGTGTTTGTGCAGCTTGAAGATCTGCAACAGCTGGCCTCAAACCCAAAGTCAATCCTCTCGGAGATCCTCGGGATAGGAGCGGCGAAGGCGAAAGTCGAAAAGATCGCTCCTCCACTTCCAGCGGTGTTGAACTCCTCACAGATGGGTTCAGCTAACAGCAATGGAGGATTTGACTCTCCAACAGTCTCAACTGCGCACACGAACGGCGCGGCTGGAGTGACGCACCTTGGCGTCGTTGGAAGAGGAGTCAAACGAGTATCAACAAATTCAGAGTCTGCTGACTCCTCCCACCCAACTAAGAAACGGGCAACAGATTCATCAACACAAGATAAAGGTGATGGCAGTTCCGCCTGA
Protein sequence
MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVELEKKASLDYNFCKETEKSNQCFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA
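The protein sequence above corresponds to a codon-by-codon translation of the coding sequence. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1); the function name `translate` and the table layout are illustrative and do not come from the source page.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters outside A/C/G/T are reported as 'X'.
    A trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGGCCGACGAAGATCCACCATCG"))
# Last five codons of the CDS, ending in the TGA stop.
print(translate("GGCAGTTCCGCCTGA"))
```

Running `translate` on the full CDS string and comparing the result with the listed protein sequence is a quick consistency check between the two records.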
Homology
BLAST of CmoCh20G011750 vs. ExPASy Swiss-Prot
Match:
Q17886 (Protein NASP homolog 1 OS=Caenorhabditis elegans OX=6239 GN=nasp-1 PE=1 SV=1)
HSP 1 Score: 57.0 bits (136), Expect = 7.6e-07
Identity = 71/309 (22.98%), Positives = 123/309 (39.81%), Query Frame = 0
Query: 43 NCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAA 102
+ S ++ T + +K LA ELL G +A+K ND +A D S A E+ +
Sbjct: 16 DASGDSDEKGNGTTTEEETVEQKEKRLA-ELLAAGRRALKVNDIDKASDSLSEATELSSE 75
Query: 103 RYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESS 162
YGE Y YG A L A+EE+ L +KE +Q+ + + ENGE+
Sbjct: 76 IYGENHENTFDSLYYYGMATLELAKEESQLLKGPGEKESGDEEQAGNSDDKTDEENGETE 135
Query: 163 KASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAI 222
K E+G++SG E+ D+D+ + L+W++L+ AR I
Sbjct: 136 K-------------------------EDGEESGEEE----DDDDDTMKLSWEILETARCI 195
Query: 223 VEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEED 282
+ +SA+ E L+ A L+ H I +
Sbjct: 196 AAAKIEALEAEQSGISAIEEWNLKL---ADVLVLLGEHGIS----------------DGK 255
Query: 283 IETSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSR 342
+ D +AL+I ++ P +R++A+ + + E + + K + +R
Sbjct: 256 YTQAFEDLDRALNIQRNVLPPSSRKIAQTYILIGNACASDANYDETVQYFGKTKDVLIAR 275
Query: 343 VMRLTDEVK 352
L E++
Sbjct: 316 QTELKHELE 275
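The percentages in each HSP summary line are the identity and positive counts divided by the alignment length. The sketch below parses such a line and recomputes both values; the function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, and the pattern also tolerates the "Postives" misspelling that some report renderers emit.

```python
import re

# Matches the statistics line of an HSP block, e.g.
# "Identity = 71/309 (22.98%), Positives = 123/309 (39.81%), Query Frame = 0"
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts from an HSP statistics line and recompute percentages."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identities, length, positives, _ = (int(g) for g in match.groups())
    return {
        "identities": identities,
        "positives": positives,
        "length": length,
        # Percentages are count / alignment length, rounded to two decimals.
        "identity_pct": round(100 * identities / length, 2),
        "positive_pct": round(100 * positives / length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 71/309 (22.98%), Positives = 123/309 (39.81%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positive_pct"])
```

For the Swiss-Prot hit above, the recomputed values agree with the reported 22.98% identity and 39.81% positives.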
BLAST of CmoCh20G011750 vs. ExPASy TrEMBL
Match:
A0A6J1EJQ1 (protein HGV2 OS=Cucurbita moschata OX=3662 GN=LOC111434946 PE=4 SV=1)
HSP 1 Score: 861.3 bits (2224), Expect = 2.2e-246
Identity = 484/539 (89.80%), Positives = 484/539 (89.80%), Query Frame = 0
Query: 1 MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSD 60
MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSD
Sbjct: 1 MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSD 60
Query: 61 GSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGC 120
GSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGC
Sbjct: 61 GSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGC 120
Query: 121 ALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDV 180
ALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDV
Sbjct: 121 ALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDV 180
Query: 181 SSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSAL 240
SSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSAL
Sbjct: 181 SSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSAL 240
Query: 241 AEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERL 300
AEVALER EDIETSLSDYQKALSILERL
Sbjct: 241 AEVALER---------------------------------EDIETSLSDYQKALSILERL 300
Query: 301 VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTAS 360
VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTAS
Sbjct: 301 VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTAS 360
Query: 361 STSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVELEKKASLDYNFCKETEKSNQ 420
STSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVELEKK
Sbjct: 361 STSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVELEKK---------------- 420
Query: 421 CFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFD 480
LEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFD
Sbjct: 421 ------LEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFD 480
Query: 481 SPTVSTAHTNGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA 540
SPTVSTAHTNGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA
Sbjct: 481 SPTVSTAHTNGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA 484
BLAST of CmoCh20G011750 vs. ExPASy TrEMBL
Match:
A0A6J1I412 (protein HGV2 OS=Cucurbita maxima OX=3661 GN=LOC111470382 PE=4 SV=1)
HSP 1 Score: 849.0 bits (2192), Expect = 1.1e-242
Identity = 476/539 (88.31%), Positives = 482/539 (89.42%), Query Frame = 0
Query: 1 MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSD 60
MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGL+SSCNCSNETKP AESTAQTSD
Sbjct: 1 MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLESSCNCSNETKPGAESTAQTSD 60
Query: 61 GSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGC 120
GSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGC
Sbjct: 61 GSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGC 120
Query: 121 ALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDV 180
ALLYKAQEEADPLGAVPKKEGEP+QQSYKDESVK+ ENGESSKASVSSNAELVDGVADDV
Sbjct: 121 ALLYKAQEEADPLGAVPKKEGEPHQQSYKDESVKSAENGESSKASVSSNAELVDGVADDV 180
Query: 181 SSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSAL 240
SSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSAL
Sbjct: 181 SSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSAL 240
Query: 241 AEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERL 300
AEVALER EDIETSLSDYQKALSILERL
Sbjct: 241 AEVALER---------------------------------EDIETSLSDYQKALSILERL 300
Query: 301 VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTAS 360
VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTAS
Sbjct: 301 VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTAS 360
Query: 361 STSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVELEKKASLDYNFCKETEKSNQ 420
STSGSEPE+PLSSNDSQ+D+ANAATEKQSEIEILSGLLVELEKK
Sbjct: 361 STSGSEPEVPLSSNDSQSDHANAATEKQSEIEILSGLLVELEKK---------------- 420
Query: 421 CFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFD 480
LEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFD
Sbjct: 421 ------LEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFD 480
Query: 481 SPTVSTAHTNGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA 540
SPTVSTAHTNGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA
Sbjct: 481 SPTVSTAHTNGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA 484
BLAST of CmoCh20G011750 vs. ExPASy TrEMBL
Match:
A0A1S3C6L8 (NASP-related protein sim3 OS=Cucumis melo OX=3656 GN=LOC103497438 PE=4 SV=1)
HSP 1 Score: 715.7 bits (1846), Expect = 1.5e-202
Identity = 421/542 (77.68%), Positives = 436/542 (80.44%), Query Frame = 0
Query: 1 MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSD 60
MADEDPPSE+SVT+ KPKLDE LNVS+ TTES GGLDSSCN NE KP E TAQTSD
Sbjct: 1 MADEDPPSEVSVTVDKPKLDETLNVSEVTTESIAQGGLDSSCNSPNEKKPITEPTAQTSD 60
Query: 61 GSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGC 120
GSGEKSLELAEELLEKGSKA+KDNDF EAVDCFSRALEIRAA YGELA ECVKLYYKYGC
Sbjct: 61 GSGEKSLELAEELLEKGSKAMKDNDFNEAVDCFSRALEIRAAHYGELASECVKLYYKYGC 120
Query: 121 ALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDV 180
ALLYKAQEEADPLGAVPKKEG QS K ES K+ NGESSKASVSSNAE+VDGV DDV
Sbjct: 121 ALLYKAQEEADPLGAVPKKEG----QSDKAESAKSAVNGESSKASVSSNAEVVDGVTDDV 180
Query: 181 S---SKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDIL 240
S SKKDQDEEE DDS EDLADADEDESDLDLAWKMLDVARAIVEKDS DTMEKVDIL
Sbjct: 181 SETVSKKDQDEEETDDSDAEDLADADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDIL 240
Query: 241 SALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSIL 300
SALAEVALER EDI TSLSDYQKALSIL
Sbjct: 241 SALAEVALER---------------------------------EDIGTSLSDYQKALSIL 300
Query: 301 ERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPT 360
ERLVEPDNRQLAELNFRVCLCLEFGSQPQEAIS+CQKAISICKSRV+RLTDEVK +IVPT
Sbjct: 301 ERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISYCQKAISICKSRVVRLTDEVKSVIVPT 360
Query: 361 TASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVELEKKASLDYNFCKETEK 420
TASSTSGSEPEIPLSSN SQTDN NAATEKQSEIE LSGLLVELEKK
Sbjct: 361 TASSTSGSEPEIPLSSNGSQTDNENAATEKQSEIETLSGLLVELEKK------------- 420
Query: 421 SNQCFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNG 480
LEDLQQLASNP SILSEILGIG+AK +EKI PP+P+V NSSQMGSANSNG
Sbjct: 421 ---------LEDLQQLASNPMSILSEILGIGSAKPNIEKITPPVPSVFNSSQMGSANSNG 479
Query: 481 GFDSPTVSTAHTNGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGS 540
GFDSPTVSTAHTN GVTHLGVVGRGVKRVSTNSES D S+PTKK A D S+QDKGD S
Sbjct: 481 GFDSPTVSTAHTN---GVTHLGVVGRGVKRVSTNSESND-SNPTKKLAKDLSSQDKGDSS 479
BLAST of CmoCh20G011750 vs. ExPASy TrEMBL
Match:
A0A6J1E053 (NASP-related protein sim3 OS=Momordica charantia OX=3673 GN=LOC111025157 PE=4 SV=1)
HSP 1 Score: 706.4 bits (1822), Expect = 8.9e-200
Identity = 411/540 (76.11%), Positives = 440/540 (81.48%), Query Frame = 0
Query: 1 MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSD 60
MADE PPSE+SVT+ KPK+DE LN S+ T ES+ GG++SS NCSN+ A E+TAQTSD
Sbjct: 1 MADEGPPSEVSVTVDKPKVDESLNASEVTIESNAQGGVESSSNCSNDKNAATEATAQTSD 60
Query: 61 GSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGC 120
GSGEKSLE+AEELLEKGSKA+KDNDF EAVDCFSRALEIRAA YGELAPECVKLYYKYGC
Sbjct: 61 GSGEKSLEMAEELLEKGSKAMKDNDFNEAVDCFSRALEIRAAHYGELAPECVKLYYKYGC 120
Query: 121 ALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDV 180
ALLYKAQEEADPLGAVPKKEGE +Q+S KD SVK+ NGESSKASVSSNAELVDGV DDV
Sbjct: 121 ALLYKAQEEADPLGAVPKKEGESHQESDKDGSVKSAVNGESSKASVSSNAELVDGVTDDV 180
Query: 181 SSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSAL 240
SSKKDQDEE+ D+S EDLA+ADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSAL
Sbjct: 181 SSKKDQDEEDADNSDAEDLAEADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSAL 240
Query: 241 AEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERL 300
AEVALER EDIETSLSDYQKALSILERL
Sbjct: 241 AEVALER---------------------------------EDIETSLSDYQKALSILERL 300
Query: 301 VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTAS 360
VEPDNRQLAELNFR+CLCLEFGS+PQEAI FCQKAISICKSRV+RLTDEVK I+VPTTAS
Sbjct: 301 VEPDNRQLAELNFRLCLCLEFGSKPQEAIPFCQKAISICKSRVLRLTDEVKSILVPTTAS 360
Query: 361 STSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVELEKKASLDYNFCKETEKSNQ 420
STSGSEP LSSN SQ D NAA+EKQSEIE LSGLLVELEKK
Sbjct: 361 STSGSEPAAQLSSNVSQIDTDNAASEKQSEIETLSGLLVELEKK---------------- 420
Query: 421 CFVFVQLEDLQQLASNPKSILSEILGIGAAKAKV-EKIAPPLPAVLNSSQMGSANSNGGF 480
LEDLQQLASNPKSILSEILGIG+A++KV EK AP PA LNSSQ+ SANSNGGF
Sbjct: 421 ------LEDLQQLASNPKSILSEILGIGSARSKVNEKSAP--PAALNSSQLASANSNGGF 480
Query: 481 DSPTVSTAHTNGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA 540
DSPTVSTAHTNGA GVTHLGVVGRGVKRVSTNSESA+ S+P KK A DSS+QDKGDGSSA
Sbjct: 481 DSPTVSTAHTNGAPGVTHLGVVGRGVKRVSTNSESAE-SNPMKKPAIDSSSQDKGDGSSA 482
BLAST of CmoCh20G011750 vs. ExPASy TrEMBL
Match:
A0A0A0LMR2 (TPR_REGION domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G287190 PE=4 SV=1)
HSP 1 Score: 699.5 bits (1804), Expect = 1.1e-197
Identity = 413/543 (76.06%), Positives = 435/543 (80.11%), Query Frame = 0
Query: 1 MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSD 60
MADEDPPSE+SVT+ KPKLDE LNVS+ TTES V GGL SSCN NE KP + TAQTSD
Sbjct: 1 MADEDPPSEVSVTVDKPKLDETLNVSEVTTESIVQGGLQSSCNSPNEKKPITQPTAQTSD 60
Query: 61 GSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGC 120
SG+KSL+LAEELLEKGSKA+KDNDF EAVDCFSRALEIRAA YGELA ECVKLYYKYGC
Sbjct: 61 ESGDKSLDLAEELLEKGSKAMKDNDFNEAVDCFSRALEIRAAHYGELASECVKLYYKYGC 120
Query: 121 ALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDV 180
ALLYKAQEEADPLGAVPKKEG QS KD+SVK+ NGESSKASVSSNAE VDGV DDV
Sbjct: 121 ALLYKAQEEADPLGAVPKKEG----QSDKDDSVKSAVNGESSKASVSSNAEAVDGVTDDV 180
Query: 181 S---SKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDIL 240
S SKKD+DEEE D S EDLADADEDESDLDLAWKMLDVARAIVEKDS DTMEKVDIL
Sbjct: 181 SETVSKKDRDEEESDGSDAEDLADADEDESDLDLAWKMLDVARAIVEKDSADTMEKVDIL 240
Query: 241 SALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSIL 300
SALAEVALER EDI TSLSDYQKALSIL
Sbjct: 241 SALAEVALER---------------------------------EDIGTSLSDYQKALSIL 300
Query: 301 ERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPT 360
ERLVEPDNRQLAELNFRVCLCLEFGSQPQEAIS+CQKAISICKSRV+RLTDEVK +IVPT
Sbjct: 301 ERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISYCQKAISICKSRVVRLTDEVKSVIVPT 360
Query: 361 TASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVELEKKASLDYNFCKETEK 420
TASSTSGSEPE+PLSSN SQTDN NA TEKQSEI+ LSGLLVELEKK
Sbjct: 361 TASSTSGSEPEVPLSSNGSQTDNENATTEKQSEIDTLSGLLVELEKK------------- 420
Query: 421 SNQCFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNG 480
LEDLQQ ASNPKSILSEILGIG+AK +EKI PP+P+V NSSQMGSA+SNG
Sbjct: 421 ---------LEDLQQQASNPKSILSEILGIGSAKPNLEKITPPVPSVFNSSQMGSAHSNG 480
Query: 481 GFDSPTVSTAHTNGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATD-SSTQDKGDG 540
GFDSPTVSTAHTN GVTHLGVVGRGVKRVSTNSES D S+PTKK A D SS+QDKGD
Sbjct: 481 GFDSPTVSTAHTN---GVTHLGVVGRGVKRVSTNSESND-SNPTKKLAKDLSSSQDKGDS 480
BLAST of CmoCh20G011750 vs. NCBI nr
Match:
XP_022928044.1 (protein HGV2 [Cucurbita moschata])
HSP 1 Score: 861.3 bits (2224), Expect = 4.5e-246
Identity = 484/539 (89.80%), Positives = 484/539 (89.80%), Query Frame = 0
Query: 1 MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSD 60
MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSD
Sbjct: 1 MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSD 60
Query: 61 GSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGC 120
GSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGC
Sbjct: 61 GSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGC 120
Query: 121 ALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDV 180
ALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDV
Sbjct: 121 ALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDV 180
Query: 181 SSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSAL 240
SSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSAL
Sbjct: 181 SSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSAL 240
Query: 241 AEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERL 300
AEVALER EDIETSLSDYQKALSILERL
Sbjct: 241 AEVALER---------------------------------EDIETSLSDYQKALSILERL 300
Query: 301 VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTAS 360
VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTAS
Sbjct: 301 VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTAS 360
Query: 361 STSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVELEKKASLDYNFCKETEKSNQ 420
STSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVELEKK
Sbjct: 361 STSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVELEKK---------------- 420
Query: 421 CFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFD 480
LEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFD
Sbjct: 421 ------LEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFD 480
Query: 481 SPTVSTAHTNGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA 540
SPTVSTAHTNGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA
Sbjct: 481 SPTVSTAHTNGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA 484
BLAST of CmoCh20G011750 vs. NCBI nr
Match:
KAG7011181.1 (Nuclear autoantigenic sperm protein, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 857.4 bits (2214), Expect = 6.4e-245
Identity = 481/539 (89.24%), Positives = 483/539 (89.61%), Query Frame = 0
Query: 1 MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSD 60
MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKP AESTAQTSD
Sbjct: 36 MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPGAESTAQTSD 95
Query: 61 GSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGC 120
GSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGC
Sbjct: 96 GSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGC 155
Query: 121 ALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDV 180
ALLYKAQEEADPLGAVPKKEGEP+QQSYKDESVKNVENGESSKASVSSNAELVDGVADDV
Sbjct: 156 ALLYKAQEEADPLGAVPKKEGEPHQQSYKDESVKNVENGESSKASVSSNAELVDGVADDV 215
Query: 181 SSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSAL 240
SSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSAL
Sbjct: 216 SSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSAL 275
Query: 241 AEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERL 300
AEVALER EDIETSLSDYQKALSILERL
Sbjct: 276 AEVALER---------------------------------EDIETSLSDYQKALSILERL 335
Query: 301 VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTAS 360
VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTAS
Sbjct: 336 VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTAS 395
Query: 361 STSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVELEKKASLDYNFCKETEKSNQ 420
STSGSEPE+PLSSNDSQTDNANAATEKQSEIEILSGLLVELEKK
Sbjct: 396 STSGSEPEVPLSSNDSQTDNANAATEKQSEIEILSGLLVELEKK---------------- 455
Query: 421 CFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFD 480
LEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFD
Sbjct: 456 ------LEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFD 515
Query: 481 SPTVSTAHTNGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA 540
SPTVSTAHTNGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA
Sbjct: 516 SPTVSTAHTNGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA 519
BLAST of CmoCh20G011750 vs. NCBI nr
Match:
XP_023513004.1 (LOW QUALITY PROTEIN: protein HGV2 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 855.5 bits (2209), Expect = 2.4e-244
Identity = 480/539 (89.05%), Positives = 482/539 (89.42%), Query Frame = 0
Query: 1 MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSD 60
MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSD
Sbjct: 1 MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSD 60
Query: 61 GSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGC 120
GSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGC
Sbjct: 61 GSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGC 120
Query: 121 ALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDV 180
ALLYKAQEEADPLGAVPKKEGEP+QQSYKDESVKN ENGESSKASVSSNAELVDGVADDV
Sbjct: 121 ALLYKAQEEADPLGAVPKKEGEPHQQSYKDESVKNAENGESSKASVSSNAELVDGVADDV 180
Query: 181 SSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSAL 240
SSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDIL AL
Sbjct: 181 SSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILXAL 240
Query: 241 AEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERL 300
AEVALER EDIETSLSDYQKALSILERL
Sbjct: 241 AEVALER---------------------------------EDIETSLSDYQKALSILERL 300
Query: 301 VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTAS 360
VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTAS
Sbjct: 301 VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTAS 360
Query: 361 STSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVELEKKASLDYNFCKETEKSNQ 420
STSGSEPE+PLSSNDSQTDNANAATEKQSEIEILSGLLVELEKK
Sbjct: 361 STSGSEPEVPLSSNDSQTDNANAATEKQSEIEILSGLLVELEKK---------------- 420
Query: 421 CFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFD 480
LEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFD
Sbjct: 421 ------LEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFD 480
Query: 481 SPTVSTAHTNGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA 540
SPTVSTAHTNGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA
Sbjct: 481 SPTVSTAHTNGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA 484
BLAST of CmoCh20G011750 vs. NCBI nr
Match:
XP_022971716.1 (protein HGV2 [Cucurbita maxima])
HSP 1 Score: 849.0 bits (2192), Expect = 2.3e-242
Identity = 476/539 (88.31%), Positives = 482/539 (89.42%), Query Frame = 0
Query: 1 MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSD 60
MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGL+SSCNCSNETKP AESTAQTSD
Sbjct: 1 MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLESSCNCSNETKPGAESTAQTSD 60
Query: 61 GSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGC 120
GSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGC
Sbjct: 61 GSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGC 120
Query: 121 ALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDV 180
ALLYKAQEEADPLGAVPKKEGEP+QQSYKDESVK+ ENGESSKASVSSNAELVDGVADDV
Sbjct: 121 ALLYKAQEEADPLGAVPKKEGEPHQQSYKDESVKSAENGESSKASVSSNAELVDGVADDV 180
Query: 181 SSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSAL 240
SSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSAL
Sbjct: 181 SSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSAL 240
Query: 241 AEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERL 300
AEVALER EDIETSLSDYQKALSILERL
Sbjct: 241 AEVALER---------------------------------EDIETSLSDYQKALSILERL 300
Query: 301 VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTAS 360
VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTAS
Sbjct: 301 VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTAS 360
Query: 361 STSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVELEKKASLDYNFCKETEKSNQ 420
STSGSEPE+PLSSNDSQ+D+ANAATEKQSEIEILSGLLVELEKK
Sbjct: 361 STSGSEPEVPLSSNDSQSDHANAATEKQSEIEILSGLLVELEKK---------------- 420
Query: 421 CFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFD 480
LEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFD
Sbjct: 421 ------LEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFD 480
Query: 481 SPTVSTAHTNGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA 540
SPTVSTAHTNGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA
Sbjct: 481 SPTVSTAHTNGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA 484
BLAST of CmoCh20G011750 vs. NCBI nr
Match:
KAG6571417.1 (hypothetical protein SDJN03_30332, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 845.1 bits (2182), Expect = 3.3e-241
Identity = 474/533 (88.93%), Positives = 476/533 (89.31%), Query Frame = 0
Query: 1 MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSD 60
MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKP AESTAQTSD
Sbjct: 1 MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPGAESTAQTSD 60
Query: 61 GSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGC 120
GSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGC
Sbjct: 61 GSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGC 120
Query: 121 ALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDV 180
ALLYKAQEEADPLGAVPKKEGEP+QQSYK ESVKNVENGESSKASVSSNAELVDGVADDV
Sbjct: 121 ALLYKAQEEADPLGAVPKKEGEPHQQSYKGESVKNVENGESSKASVSSNAELVDGVADDV 180
Query: 181 SSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSAL 240
SSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSAL
Sbjct: 181 SSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSAL 240
Query: 241 AEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERL 300
AEVALER EDIETSLSDYQKALSILERL
Sbjct: 241 AEVALER---------------------------------EDIETSLSDYQKALSILERL 300
Query: 301 VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTAS 360
VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTAS
Sbjct: 301 VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTAS 360
Query: 361 STSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVELEKKASLDYNFCKETEKSNQ 420
STSGSEPE+PLSSNDSQTDNANAATEKQSEIEILSGLLVELEKK
Sbjct: 361 STSGSEPEVPLSSNDSQTDNANAATEKQSEIEILSGLLVELEKK---------------- 420
Query: 421 CFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFD 480
LEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFD
Sbjct: 421 ------LEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFD 478
Query: 481 SPTVSTAHTNGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDK 534
SPTVSTAHTNGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDK
Sbjct: 481 SPTVSTAHTNGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDK 478
BLAST of CmoCh20G011750 vs. TAIR 10
Match:
AT4G37210.1 (Tetratricopeptide repeat (TPR)-like superfamily protein)
HSP 1 Score: 379.4 bits (973), Expect = 4.8e-105
Identity = 266/548 (48.54%), Positives = 340/548 (62.04%), Query Frame = 0
Query: 5 DPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTA-QTSDGSG 64
+P +E++ TL+ P L I +AT ES V GG +S+CN AA+S A + D
Sbjct: 19 EPATEIAQTLE-PNLASI----EATVESVVQGGTESTCNNDANNNNAADSAATEVCDEER 78
Query: 65 EKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGCALL 124
EK+LE AEEL EKGS +K+NDF EAVDCFSRALEIR A YGEL EC+ YY+YG ALL
Sbjct: 79 EKTLEFAEELTEKGSVFLKENDFAEAVDCFSRALEIRVAHYGELDAECINAYYRYGLALL 138
Query: 125 YKAQEEADPLGAVPKKEGEPYQQSYKDESV-KNVENGESSKASVSSNAELVDGVADDVSS 184
KAQ EADPLG +PKKEGE Q+S ES+ +V +G+ + SS E S
Sbjct: 139 AKAQAEADPLGNMPKKEGEVQQESSNGESLAPSVVSGDPERQGSSSGQE--------GSG 198
Query: 185 KKDQDEEEGDDSGTEDLA----DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILS 244
KDQ E+G+D +DL+ DADEDESDLD+AWKMLD+AR I +K S +TMEKVDIL
Sbjct: 199 GKDQG-EDGEDCQDDDLSDADGDADEDESDLDMAWKMLDIARVITDKQSTETMEKVDILC 258
Query: 245 ALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILE 304
+LAEV+LER EDIE+SLSDY+ ALSILE
Sbjct: 259 SLAEVSLER---------------------------------EDIESSLSDYKNALSILE 318
Query: 305 RLVEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTT 364
RLVEPD+R+ AELNFR+C+CLE G QP+EAI +CQKA+ ICK+R+ RL++E+KG T
Sbjct: 319 RLVEPDSRRTAELNFRICICLETGCQPKEAIPYCQKALLICKARMERLSNEIKGASGSAT 378
Query: 365 ASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVELEKKASLDYNFCKETEKS 424
+S+ S + I SSN D +A++K+ EI L+GL +LEKK
Sbjct: 379 SSTVSEIDEGIQQSSNVPYID--KSASDKEVEIGDLAGLAEDLEKK-------------- 438
Query: 425 NQCFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGG 484
LEDL+Q A NPK +L+E++G+ +AK P A ++SS+MG+ N+N G
Sbjct: 439 --------LEDLKQQAENPKQVLAELMGMVSAKPNASDKVVPAAAEMSSSRMGTVNTNFG 492
Query: 485 FD--SPTVSTAHT-----NGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQ 540
D SPTVSTAHT A+GVTHLGVVGRGVKRV N+ S +SS +KK A + S
Sbjct: 499 KDLESPTVSTAHTGAAGGGAASGVTHLGVVGRGVKRVLMNTTSIESS-ASKKPALEFS-- 492
BLAST of CmoCh20G011750 vs. TAIR 10
Match:
AT4G37210.2 (Tetratricopeptide repeat (TPR)-like superfamily protein)
HSP 1 Score: 310.1 bits (793), Expect = 3.6e-84
Identity = 205/408 (50.25%), Positives = 261/408 (63.97%), Query Frame = 0
Query: 5 DPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTA-QTSDGSG 64
+P +E++ TL+ P L I +AT ES V GG +S+CN AA+S A + D
Sbjct: 19 EPATEIAQTLE-PNLASI----EATVESVVQGGTESTCNNDANNNNAADSAATEVCDEER 78
Query: 65 EKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGCALL 124
EK+LE AEEL EKGS +K+NDF EAVDCFSRALEIR A YGEL EC+ YY+YG ALL
Sbjct: 79 EKTLEFAEELTEKGSVFLKENDFAEAVDCFSRALEIRVAHYGELDAECINAYYRYGLALL 138
Query: 125 YKAQEEADPLGAVPKKEGEPYQQSYKDESV-KNVENGESSKASVSSNAELVDGVADDVSS 184
KAQ EADPLG +PKKEGE Q+S ES+ +V +G+ + SS E S
Sbjct: 139 AKAQAEADPLGNMPKKEGEVQQESSNGESLAPSVVSGDPERQGSSSGQE--------GSG 198
Query: 185 KKDQDEEEGDDSGTEDLA----DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILS 244
KDQ E+G+D +DL+ DADEDESDLD+AWKMLD+AR I +K S +TMEKVDIL
Sbjct: 199 GKDQG-EDGEDCQDDDLSDADGDADEDESDLDMAWKMLDIARVITDKQSTETMEKVDILC 258
Query: 245 ALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILE 304
+LAEV+LER EDIE+SLSDY+ ALSILE
Sbjct: 259 SLAEVSLER---------------------------------EDIESSLSDYKNALSILE 318
Query: 305 RLVEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTT 364
RLVEPD+R+ AELNFR+C+CLE G QP+EAI +CQKA+ ICK+R+ RL++E+KG T
Sbjct: 319 RLVEPDSRRTAELNFRICICLETGCQPKEAIPYCQKALLICKARMERLSNEIKGASGSAT 377
Query: 365 ASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVELEKKAS 407
+S+ S + I SSN D +A++K+ EI L+GL +LEKKA+
Sbjct: 379 SSTVSEIDEGIQQSSNVPYID--KSASDKEVEIGDLAGLAEDLEKKAT 377
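The percentages in the HSP statistics lines above are the match counts divided by the alignment length. The following is a minimal sketch, not part of the database output, that parses such a line and recomputes both figures. The names `HSP_STATS`, `parse_hsp_stats`, and `percent` are hypothetical helpers introduced only for illustration.

```python
import re

# Hypothetical pattern for the "Identity = a/b (p%), Positives = c/d (q%)" line.
# `Posi?tives` also accepts the "Postives" spelling that appears in some reports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return match counts, alignment length and reported percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: " + line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


def percent(matches, length):
    """Percentage of aligned columns, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


# Statistics line of the AT4G37210.2 hit, copied from the report above.
stats = parse_hsp_stats(
    "Identity = 205/408 (50.25%), Positives = 261/408 (63.97%), Query Frame = 0"
)
for key, (matches, length, reported) in stats.items():
    # Print the recomputed value next to the value reported by BLAST.
    print(key, percent(matches, length), reported)
```

The denominator is the alignment length including gap columns (548 and 408 here), not the length of either protein, which is why the shorter AT4G37210.2 alignment reports a higher identity than AT4G37210.1 despite a weaker score.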
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
Q17886 | 7.6e-07 | 22.98 | Protein NASP homolog 1 OS=Caenorhabditis elegans OX=6239 GN=nasp-1 PE=1 SV=1
Match Name | E-value | Identity | Description
A0A6J1EJQ1 | 2.2e-246 | 89.80 | protein HGV2 OS=Cucurbita moschata OX=3662 GN=LOC111434946 PE=4 SV=1
A0A6J1I412 | 1.1e-242 | 88.31 | protein HGV2 OS=Cucurbita maxima OX=3661 GN=LOC111470382 PE=4 SV=1
A0A1S3C6L8 | 1.5e-202 | 77.68 | NASP-related protein sim3 OS=Cucumis melo OX=3656 GN=LOC103497438 PE=4 SV=1
A0A6J1E053 | 8.9e-200 | 76.11 | NASP-related protein sim3 OS=Momordica charantia OX=3673 GN=LOC111025157 PE=4 SV...
A0A0A0LMR2 | 1.1e-197 | 76.06 | TPR_REGION domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G287190 ...
Match Name | E-value | Identity | Description
XP_022928044.1 | 4.5e-246 | 89.80 | protein HGV2 [Cucurbita moschata]
KAG7011181.1 | 6.4e-245 | 89.24 | Nuclear autoantigenic sperm protein, partial [Cucurbita argyrosperma subsp. argy...
XP_023513004.1 | 2.4e-244 | 89.05 | LOW QUALITY PROTEIN: protein HGV2 [Cucurbita pepo subsp. pepo]
XP_022971716.1 | 2.3e-242 | 88.31 | protein HGV2 [Cucurbita maxima]
KAG6571417.1 | 3.3e-241 | 88.93 | hypothetical protein SDJN03_30332, partial [Cucurbita argyrosperma subsp. sorori...
Match Name | E-value | Identity | Description
AT4G37210.1 | 4.8e-105 | 48.54 | Tetratricopeptide repeat (TPR)-like superfamily protein
AT4G37210.2 | 3.6e-84 | 50.25 | Tetratricopeptide repeat (TPR)-like superfamily protein
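The summary rows above are pipe-delimited, so they can be loaded with a few lines of code. This is a minimal sketch under the assumption that the first four cells of each row are match name, E-value, percent identity, and description. `parse_hit_row` is a hypothetical helper, and the sample rows are copied from the tables above.

```python
def parse_hit_row(row):
    """Split one pipe-delimited summary row into typed fields.

    Only the first four cells are used; any trailing cells are ignored.
    """
    cells = [cell.strip() for cell in row.split("|")]
    name, evalue, identity, description = cells[:4]
    return {
        "name": name,
        "evalue": float(evalue),      # e.g. "4.8e-105" parses as a float
        "identity": float(identity),  # percent identity over the alignment
        "description": description,
    }


rows = [
    "Q17886 | 7.6e-07 | 22.98 | Protein NASP homolog 1 OS=Caenorhabditis elegans OX=6239 GN=nasp-1 PE=1 SV=1",
    "AT4G37210.1 | 4.8e-105 | 48.54 | Tetratricopeptide repeat (TPR)-like superfamily protein",
    "AT4G37210.2 | 3.6e-84 | 50.25 | Tetratricopeptide repeat (TPR)-like superfamily protein",
]

hits = [parse_hit_row(row) for row in rows]

# A lower E-value means a more significant hit, so sort ascending.
for hit in sorted(hits, key=lambda h: h["evalue"]):
    print(hit["name"], hit["evalue"], hit["identity"])
```

Sorting by E-value rather than by identity is deliberate: AT4G37210.2 has the higher percent identity but over a shorter alignment, so its E-value is larger than that of AT4G37210.1.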