Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, polypeptide, CDS, three_prime_UTR
AATTTTCCCTCCATATTTGCATCTCTCGGTTGGTATTATCGTATCCCAGTCTGGCGCGCGCTTTATCGCTCTATTCCTTGGCTTCCGCTCTGCGATTTCAATCGCTTCAAGTTTTCTCGGCACCGAACTGAAACTTGAACTGAATCTCAGCTTCTTGCTCTGCGAACTGTCGAAGATGGAAGAGTATTTGCAGTACATGAAGACACTGCGCTTACAAATGAGCGGTACGTATTGGCTCCTTCAAATTCAAGTTTTAACGTACGGAATTATTTTTTGTTGCGTTATTGTGCTACTTTTGGGTTGATTCTTTATCAAACTTGACTTAGACGTGGAGGATCAAGTTTCGAAGATCTCTGTGGAAGAGCACATGCACTTTACTACTATTCGAACCATGGAGAATGATCTTACTGCTGGTCAAGTCTCTACATCTAGTCTTCTAGTTCGTTATTTGTTTATCTGAACGTTCTTCCTTTTGCACCTAGAAAAATAATTCAGCACGTACTTTGTTTACGGAGATTTTGTAGATTTATTCTCTCTTTTGTTCCTGTCTCGTCCTCGATTATTACTATTTTTTTTAATTTCGCTTCAGATCATTTCGAGGATATCTAACTGGTTATTGTGATCATTTTCTTGCTCAAAACTCCATAGCATCTAGTTTTCTTTTTGATCAAGTTTTTCCCGGGTTTCTAAATCAGTCTGATACATATCTATGGATGACTGGCTTACTTGAAAGTAGTTTTGGTTATTCATGCCTACTACAACCGTGACTATAGAAAGTGAAGCTGAAGGATTTTGCAGTTTCTTATAAAGACGGCTGAAGAATTCTTCTAATTCATTCAGAGACGCGTTTGGCCAAAAAATCTATTTTATTGCTTTTGTTATTGCCAAAAATTTCAAAGGGGTATAATAATCCTGTTTCGAGAGTGTCACGACTCATATTGGTAATATATAATTACGTGAATCGAGTAACAAAAACCTAAGAACATTCATCTTCTTCCAAGTCGATCAAACATATTGTCCATTGGACCTAATGTGATAACTAACTTCTTTGCTTTTATCTTAGGCATAAGATTGCTTACTCTTGATGCACATCCAGCCTGGTGCCATGTAATTCATAGCGTGTGTTTGAAGTGCTTTGATTTTGTCTCATTCTACTCTTGCTTTAGTCAGACTGAGATTTTCTGATATCTTGCTGATATAACTAACTCCCAATATTATCCTTGAACCATTACTTAAATCAACCAAATAGGACTGAACTATCATTTTATTTACCTATGTTTATTACTTAATCCAGTACTATTCTAGAAGTCTTTTAATATAATATGTGGTCTAATTTATTTTTTTGAGCCGGTTTCACCAATTTTTGTTCGTACATGTAGTGCTGATTAGCAGCTGTTGCCGTTTGATTACTTTCTGTTGAGAATTGGTCGGTCGAATGAACTTCGGCACGAATTATTGCATGTTAAATTCAGATTTAATTGTAGTCTTACCTTGTTTTTTTCCTAAAACTTCAATATTTCGAGCAAACTAAAATTTTTGAGTTGTAGATCGAATGGCTATAGTTTGCTTTCACTAAAGACGTCTTGATGCAGCAAAAAGTGAGTTAAAAAAACTCAAAGAAGATGCTGAGAGAATGATGCGGGCAAAGGGTGAAATATGCTCCCAGATATTAGAACAACAAAGAAAAATCACCTCTTTGGAGCATGACATATGTACACTTTCACAGGTGTCTATTCTCTGTACTGGTGTCTTGGTGCTTTCCTTGTGTCCTTTCCCTCTCGTGTAGGTCTAAAAGCTGTTTAGCAACTGCTTAGTTAGTGTAAAAGGACCATGCTGTAGCATCTACATGGTAAATAATCTATCTCCATTTCATCATTTCTCGTCGCTGACAAAAATGGAGAAAATGGAACAAAAGTTTATTTTCTCTCTCTTATTTCAGCGCTAGTATTGAATTTTCTAGGTAAATACAATGTATATGATTCAATTTAGTGGAGTCATGA
ACTAAAAAGCAAAGTCATACTGCTCAACCTCAAGTGTCCTCATGAAATGGAATTGGACTTTTCTTCTGGTCCGGCCTCAATTTTTTTTTGCTTGCCTTTAGTCGATCAAATATGTGAGCTGAGGCTGGATATTGTGCCCAACTCTTGTATATATATACGTTTAAGAGAAAAAGAGTTTCTATGATGTCTAGTTTCGTTAGACTAAAAGAACAAAGGGAACTATTTTCTCCGGTTCACTTATTCTCCTGTTTAACAAGAGGAGAAAAATGGTTGTTTAGTTTCTTTCCATTATGGATTTCAACGAATACTGTTTTGCATCCGCCTCTTGCTTGTTAAGTACTCTCAACTGTTTGATATACAGACACTCGAGCTCATTCAGCAAGAAAAAGTCAGCTTAGGAGCCAAAATTATTGAGAAGAGGTGATTGAACTATGCAGTATTGAATTTCTGGTTCTACCAATGGTGCGTGATTGTGTTATTAACAATAGTATATTTGACCATGATAGTAATTATTATGCCAAAGTTTCTGAGGACATCAGTCTCAAATTCCAAGATCAGCAGGTTGAAGTCCAAAACATATGTCATATAAATCATGAATTCTAAATCCATTATTTATGTTTTGAAATTCAGTCTATTATGCAACCTTTGGTGTAGGACTGGGTAAATGCTAACATGATCTGCGGAGAAGCGGAAGAGCACGGATTGGTATTTCTCACTAGATCATCACTCTTGTCTCTCCACTACTTATCATATTAGTAGTTATGGTATATTTGAGCTGCCCTTTCTTTGCTCTCATGAGATTAGATGGTTGTCCCTCAACTGCTCCAATAGATATCGGCGCTAGAGTTTGTACATGGGTTTAAAGAGAGTTAGTAAACAAGTATCCAAGTGATTCTTGCCGCTACAATTTAATATTCCTTCATAGGAAACTAGATGTTTTGAACCTGTCTATTTTAAATTCTATAGCTCATATATGAATTATTTTTCCTTCTCAGGTTCTAAATAGAAGCTTCCTCCATCCCTATATGTGAATTTCTAATGTCATCGTGGGAATCTATTATATTTTTCATGCTTAAGTATCATCTTATTTTACGCTCTTAATGTCATTTTTATAGCATATACTTAGTTTGGCTCTGGATCTTTCTTTCAAGAATATTTGCCCTTGATCACGACCTTTTTTCTTTTATCAACCTTTAGTTACTTGGTCCTTATCGTATTGGCTTTAAATGAGAATGTATAAAACACGAGAGATAAATATTGAGCCCTCGATCTTTTCAATAAAACCGTCATGTACTAAATCGTCAATCTAAGTGATGGAGTGAAGATAGTTCAAGTGTTCATTCACGACCCTTGGCCCTTATTCACCACGCAAGTAATTTGGAGTTAAAAGCTTGTGCTAGGGCCAATCGTTTTATTTACAACAAGATAAGGCTGAAGAAGTTTTAATCTTTAAATCATCTTTTCTAAGGTCTCTTCTGTTTGCAGGTTTTGCATTCCCACTGCGTGCAATAAGTTATTCCTTCCAGCATAACTGATTACAATTTATTTCAGGTTAAGTTTGAAACTGCTAAGCGAGGAAGTGAAACGGAAGGATCTTATGATACAGTTGGAGGTAGTTCTGGGCCTCCTATCCCCTCTTCAAGTTTTTAACTAACCATAATACGTAGAATGGTTTACTTTGGAACACTTTAGATTAAATCCATCCTATTTGTAACTAGTTTTGCTGTTAAGATATAGAATGGATGATAAAGTAGAATCGTCAGACATTTTCCAAAGTTTTTCAAACTTCACTGCCTTTAGAAATAATTTATTTTAGTTTCTGAATCAATAGCGTGGAAACATTTTATATAATGGTTTGCAAGTGAAGCTCAAATGTAAAATTTGATTATTTGAGCTTCCATTTAACAATCTAACTACTGTAGCATGAAATTCATACTAATTTTATTCCATCCTCTCTCGTAGGGATATCTGGCACTCGTATTTACTGCAGTCCTAACAAT
CTGGTACTGAACTCCTGCCATGGACTGGCGAGTGTGTTGAAGCATAATGTGCTGATCTAATGTTGGTTTACAGGTGGAAGAGAGGAAAGATTTATTGGGTAAGTTGGAATCTGCGAAAGACAAACTTAGTCAAGTTTCAAAGATGAAATGTGCAGTTGTTTTGGAGAACTCCAAGGTAATCAATGTCGTCAAAGGTTGTATGTGCGATTATAAGAAGAGTGCTTCCATTCGTAACTTCCTACCACCCGCATATATGGGAAAAGAAAGAGCATGTAAGAAGATAGGAATCATTTTTTAACTGTAGACTGTTAATTTGGACCGGCATCTATTTTTGTTTCAGATTGGACAGTCAATTGAGGAAGTTAAGAACGAATTAAATGATTTCAAGGTTAGTGTTCGTGGTGCTCATATTTATCTATTTAGTAAAATTAATTAATTTTAGCTAGGTAATCGATTTGATGGTAGTAAATAAGAGCATCAACCCTTATAATATTTAAACAACTTACGACAATGGAATCTCGAGAACAGATCACAATAGAAGACGCTATCTAGCGCAAGTCATGAAATACCTTCTACATGGACACTATCTGCTTTGTGATCTGTACATATCGTAATTCCTTATGCTATTGACTTAAGGTTGTCCCCTAACTATAGCAATAGTTTTTACAACCCAATTATGGGAGCTACGTTTCGAACATACAAATCAATAATATCCCAAATTTTCTATGAAAGCATTCTAGAATCAACCTGATTTATTATAATCTATTATTTAATACTTTTTCCTGGATAAAAACTTCAGCCAGAACTCAGAGCAATGGATGATGTTACATTGGAGGAAGAGTCCAAGGCGCTCTTATCAGATAAAGCTGGAGAAACCAAGTATTCACGGTCCCTTCAAGACCAAATTGCAAAACTGAAGGTGAATTTGAAATCAATGATTTTTTTCTTACCATTTGACGGTTGGTTTTATTATATGTATGTATATATATATATATTTATTTAATTCAACTGCACTGTGGACGTTCTTCTGTCTTTTAATAAGAAAGAGTTATGTCGTACTTTGTAGGAAATTTCGCGTGTGATTAAATGCACTTGTGGTAAGGAATACAAGGCTGGAATAAGCTCAAGTACGTGATAATGCAAACAGTCTGTTGTTTGGAAGAGTGAAGAAGAAATAGGACAAAGCTAACCACTATGCAGGTAATTGAGAGGGGAATAGATGTTTGATGGTGATCTTGTGCTAGAATCTGTCAAAACTTAAACTTCGCTGAGCGGGTAATTGCTTCTTTTGCAATGTGATATATTGTCTCTTAACGAGAGTAATGTGCTATGTTCTCAATCCTATTGCTGTGGAAAACGTTGCAACTACTCGAGCTAAGAAATACTTCTGCTTAGTTCAAAATTCTTCTTCATTGCAAAAC
mRNA sequence
AATTTTCCCTCCATATTTGCATCTCTCGGTTGGTATTATCGTATCCCAGTCTGGCGCGCGCTTTATCGCTCTATTCCTTGGCTTCCGCTCTGCGATTTCAATCGCTTCAAGTTTTCTCGGCACCGAACTGAAACTTGAACTGAATCTCAGCTTCTTGCTCTGCGAACTGTCGAAGATGGAAGAGTATTTGCAGTACATGAAGACACTGCGCTTACAAATGAGCGACGTGGAGGATCAAGTTTCGAAGATCTCTGTGGAAGAGCACATGCACTTTACTACTATTCGAACCATGGAGAATGATCTTACTGCTGGTCAAGTCTCTACATCTAGTCTTCTAATCATTTCGAGGATATCTAACTGGTTATTACGTCTTGATGCAGCAAAAAGTGAGTTAAAAAAACTCAAAGAAGATGCTGAGAGAATGATGCGGGCAAAGGGTGAAATATGCTCCCAGATATTAGAACAACAAAGAAAAATCACCTCTTTGGAGCATGACATATGTACACTTTCACAGACACTCGAGCTCATTCAGCAAGAAAAAGTCAGCTTAGGAGCCAAAATTATTGAGAAGAGTAATTATTATGCCAAAGTTTCTGAGGACATCAGTCTCAAATTCCAAGATCAGCAGGACTGGGTAAATGCTAACATGATCTGCGGAGAAGCGGAAGAGCACGGATTGGTTAAGTTTGAAACTGCTAAGCGAGGAAGTGAAACGGAAGGATCTTATGATACAGTTGGAGGGATATCTGGCACTCGTATTTACTGCAGTCCTAACAATCTGGTGGAAGAGAGGAAAGATTTATTGGGTAAGTTGGAATCTGCGAAAGACAAACTTAGTCAAGTTTCAAAGATGAAATGTGCAGTTGTTTTGGAGAACTCCAAGATTGGACAGTCAATTGAGGAAGTTAAGAACGAATTAAATGATTTCAAGCCAGAACTCAGAGCAATGGATGATGTTACATTGGAGGAAGAGTCCAAGGCGCTCTTATCAGATAAAGCTGGAGAAACCAAGTATTCACGGTCCCTTCAAGACCAAATTGCAAAACTGAAGGAAATTTCGCGTGTGATTAAATGCACTTGTGGTAAGGAATACAAGGCTGGAATAAGCTCAAGTACGTGATAATGCAAACAGTCTGTTGTTTGGAAGAGTGAAGAAGAAATAGGACAAAGCTAACCACTATGCAGGTAATTGAGAGGGGAATAGATGTTTGATGGTGATCTTGTGCTAGAATCTGTCAAAACTTAAACTTCGCTGAGCGGGTAATTGCTTCTTTTGCAATGTGATATATTGTCTCTTAACGAGAGTAATGTGCTATGTTCTCAATCCTATTGCTGTGGAAAACGTTGCAACTACTCGAGCTAAGAAATACTTCTGCTTAGTTCAAAATTCTTCTTCATTGCAAAAC
Coding sequence (CDS)
ATGGAAGAGTATTTGCAGTACATGAAGACACTGCGCTTACAAATGAGCGACGTGGAGGATCAAGTTTCGAAGATCTCTGTGGAAGAGCACATGCACTTTACTACTATTCGAACCATGGAGAATGATCTTACTGCTGGTCAAGTCTCTACATCTAGTCTTCTAATCATTTCGAGGATATCTAACTGGTTATTACGTCTTGATGCAGCAAAAAGTGAGTTAAAAAAACTCAAAGAAGATGCTGAGAGAATGATGCGGGCAAAGGGTGAAATATGCTCCCAGATATTAGAACAACAAAGAAAAATCACCTCTTTGGAGCATGACATATGTACACTTTCACAGACACTCGAGCTCATTCAGCAAGAAAAAGTCAGCTTAGGAGCCAAAATTATTGAGAAGAGTAATTATTATGCCAAAGTTTCTGAGGACATCAGTCTCAAATTCCAAGATCAGCAGGACTGGGTAAATGCTAACATGATCTGCGGAGAAGCGGAAGAGCACGGATTGGTTAAGTTTGAAACTGCTAAGCGAGGAAGTGAAACGGAAGGATCTTATGATACAGTTGGAGGGATATCTGGCACTCGTATTTACTGCAGTCCTAACAATCTGGTGGAAGAGAGGAAAGATTTATTGGGTAAGTTGGAATCTGCGAAAGACAAACTTAGTCAAGTTTCAAAGATGAAATGTGCAGTTGTTTTGGAGAACTCCAAGATTGGACAGTCAATTGAGGAAGTTAAGAACGAATTAAATGATTTCAAGCCAGAACTCAGAGCAATGGATGATGTTACATTGGAGGAAGAGTCCAAGGCGCTCTTATCAGATAAAGCTGGAGAAACCAAGTATTCACGGTCCCTTCAAGACCAAATTGCAAAACTGAAGGAAATTTCGCGTGTGATTAAATGCACTTGTGGTAAGGAATACAAGGCTGGAATAAGCTCAAGTACGTGA
Protein sequence
MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLTAGQVSTSSLLIISRISNWLLRLDAAKSELKKLKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQEKVSLGAKIIEKSNYYAKVSEDISLKFQDQQDWVNANMICGEAEEHGLVKFETAKRGSETEGSYDTVGGISGTRIYCSPNNLVEERKDLLGKLESAKDKLSQVSKMKCAVVLENSKIGQSIEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGETKYSRSLQDQIAKLKEISRVIKCTCGKEYKAGISSST
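The relationship between the coding sequence and the protein sequence listed above can be checked with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1) and reading frame 0; the names `CODON_TABLE` and `translate` are illustrative and are not part of the database. Only the first ten and the last five codons of the CDS are used here, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, to_stop=True):
    """Translate a DNA coding sequence in frame 0.

    Trailing bases that do not fill a complete codon are ignored.
    With to_stop=True, translation ends at the first stop codon,
    which is not included in the result.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS shown above.
first_codons = "ATGGAAGAGTATTTGCAGTACATGAAGACA"
last_codons = "AGCTCAAGTACGTGA"

print(translate(first_codons))  # MEEYLQYMKT
print(translate(last_codons))   # SSST
```

Both fragments agree with the start (`MEEYLQYMKT`) and the end (`SSST`) of the protein sequence; the final `TGA` codon is the stop codon and does not appear in the protein.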
Homology
BLAST of Carg22871 vs. NCBI nr
Match:
KAG7015899.1 (hypothetical protein SDJN02_21002 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 593.2 bits (1528), Expect = 1.3e-165
Identity = 314/314 (100.00%), Positives = 314/314 (100.00%), Query Frame = 0
Query: 1 MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLTAGQVSTSSLLIISRIS 60
MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLTAGQVSTSSLLIISRIS
Sbjct: 1 MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLTAGQVSTSSLLIISRIS 60
Query: 61 NWLLRLDAAKSELKKLKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ 120
NWLLRLDAAKSELKKLKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ
Sbjct: 61 NWLLRLDAAKSELKKLKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ 120
Query: 121 EKVSLGAKIIEKSNYYAKVSEDISLKFQDQQDWVNANMICGEAEEHGLVKFETAKRGSET 180
EKVSLGAKIIEKSNYYAKVSEDISLKFQDQQDWVNANMICGEAEEHGLVKFETAKRGSET
Sbjct: 121 EKVSLGAKIIEKSNYYAKVSEDISLKFQDQQDWVNANMICGEAEEHGLVKFETAKRGSET 180
Query: 181 EGSYDTVGGISGTRIYCSPNNLVEERKDLLGKLESAKDKLSQVSKMKCAVVLENSKIGQS 240
EGSYDTVGGISGTRIYCSPNNLVEERKDLLGKLESAKDKLSQVSKMKCAVVLENSKIGQS
Sbjct: 181 EGSYDTVGGISGTRIYCSPNNLVEERKDLLGKLESAKDKLSQVSKMKCAVVLENSKIGQS 240
Query: 241 IEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGETKYSRSLQDQIAKLKEISRVIKC 300
IEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGETKYSRSLQDQIAKLKEISRVIKC
Sbjct: 241 IEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGETKYSRSLQDQIAKLKEISRVIKC 300
Query: 301 TCGKEYKAGISSST 315
TCGKEYKAGISSST
Sbjct: 301 TCGKEYKAGISSST 314
BLAST of Carg22871 vs. NCBI nr
Match:
XP_023550047.1 (uncharacterized protein LOC111808353 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 519.2 bits (1336), Expect = 2.4e-143
Identity = 283/314 (90.13%), Positives = 286/314 (91.08%), Query Frame = 0
Query: 1 MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLTAGQVSTSSLLIISRIS 60
MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLT
Sbjct: 1 MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLT---------------- 60
Query: 61 NWLLRLDAAKSELKKLKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ 120
AAKSELK+LKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ
Sbjct: 61 -------AAKSELKQLKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ 120
Query: 121 EKVSLGAKIIEKSNYYAKVSEDISLKFQDQQDWVNANMICGEAEEHGLVKFETAKRGSET 180
EKVSLGAKIIEKSNYYAKVSEDISLKFQDQQDWVNANMI GEAEEH LV FETAKRGSET
Sbjct: 121 EKVSLGAKIIEKSNYYAKVSEDISLKFQDQQDWVNANMIRGEAEEHELVTFETAKRGSET 180
Query: 181 EGSYDTVGGISGTRIYCSPNNLVEERKDLLGKLESAKDKLSQVSKMKCAVVLENSKIGQS 240
EGSYDTVGGISGTRIYCSPNNLVEERKDLLGKLESAK KLSQVSKMKCAVVLEN KIGQS
Sbjct: 181 EGSYDTVGGISGTRIYCSPNNLVEERKDLLGKLESAKAKLSQVSKMKCAVVLENFKIGQS 240
Query: 241 IEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGETKYSRSLQDQIAKLKEISRVIKC 300
IEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGET+YS+SLQDQIAKLKEISRVIKC
Sbjct: 241 IEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGETEYSQSLQDQIAKLKEISRVIKC 291
Query: 301 TCGKEYKAGISSST 315
TCGKEYKAGISSST
Sbjct: 301 TCGKEYKAGISSST 291
BLAST of Carg22871 vs. NCBI nr
Match:
KAG6578320.1 (hypothetical protein SDJN03_22768, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 518.5 bits (1334), Expect = 4.1e-143
Identity = 283/314 (90.13%), Positives = 285/314 (90.76%), Query Frame = 0
Query: 1 MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLTAGQVSTSSLLIISRIS 60
MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLT
Sbjct: 1 MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLT---------------- 60
Query: 61 NWLLRLDAAKSELKKLKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ 120
AAKSELK+ KEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ
Sbjct: 61 -------AAKSELKQFKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ 120
Query: 121 EKVSLGAKIIEKSNYYAKVSEDISLKFQDQQDWVNANMICGEAEEHGLVKFETAKRGSET 180
EKVSLGAKIIEKSNYYAKVSEDISLKFQDQQDWVNANMIC EAEEHGLVKFETAKRGSET
Sbjct: 121 EKVSLGAKIIEKSNYYAKVSEDISLKFQDQQDWVNANMICREAEEHGLVKFETAKRGSET 180
Query: 181 EGSYDTVGGISGTRIYCSPNNLVEERKDLLGKLESAKDKLSQVSKMKCAVVLENSKIGQS 240
EGSYDTVGGISGTRIYCSPNNLVEERKD LESAKDKLSQVSKMKCAVVLENSKIGQS
Sbjct: 181 EGSYDTVGGISGTRIYCSPNNLVEERKD----LESAKDKLSQVSKMKCAVVLENSKIGQS 240
Query: 241 IEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGETKYSRSLQDQIAKLKEISRVIKC 300
IEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGET+YSRSLQDQIAKLKEISRVIKC
Sbjct: 241 IEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGETEYSRSLQDQIAKLKEISRVIKC 287
Query: 301 TCGKEYKAGISSST 315
TCGKEYKAGISSST
Sbjct: 301 TCGKEYKAGISSST 287
BLAST of Carg22871 vs. NCBI nr
Match:
XP_022939802.1 (uncharacterized protein LOC111445570 isoform X3 [Cucurbita moschata])
HSP 1 Score: 516.9 bits (1330), Expect = 1.2e-142
Identity = 283/314 (90.13%), Positives = 285/314 (90.76%), Query Frame = 0
Query: 1 MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLTAGQVSTSSLLIISRIS 60
MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLT
Sbjct: 1 MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLT---------------- 60
Query: 61 NWLLRLDAAKSELKKLKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ 120
AAKSELK+ KEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ
Sbjct: 61 -------AAKSELKQFKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ 120
Query: 121 EKVSLGAKIIEKSNYYAKVSEDISLKFQDQQDWVNANMICGEAEEHGLVKFETAKRGSET 180
EKVSLGAKIIEKSNYYAKVSEDISLKFQDQQDWVNANMI GEAEEHGLVKFETAKRGSET
Sbjct: 121 EKVSLGAKIIEKSNYYAKVSEDISLKFQDQQDWVNANMIRGEAEEHGLVKFETAKRGSET 180
Query: 181 EGSYDTVGGISGTRIYCSPNNLVEERKDLLGKLESAKDKLSQVSKMKCAVVLENSKIGQS 240
EGSYDTVGGISGTRIYCSPNNLVEERKD LESAKDKLSQVSKMKCAVVLENSKIGQS
Sbjct: 181 EGSYDTVGGISGTRIYCSPNNLVEERKD----LESAKDKLSQVSKMKCAVVLENSKIGQS 240
Query: 241 IEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGETKYSRSLQDQIAKLKEISRVIKC 300
IEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGET+YSRSLQDQIAKLKEISRVIKC
Sbjct: 241 IEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGETEYSRSLQDQIAKLKEISRVIKC 287
Query: 301 TCGKEYKAGISSST 315
TCGKEYKAGISSST
Sbjct: 301 TCGKEYKAGISSST 287
BLAST of Carg22871 vs. NCBI nr
Match:
XP_022993971.1 (uncharacterized protein LOC111489811 isoform X1 [Cucurbita maxima])
HSP 1 Score: 505.4 bits (1300), Expect = 3.6e-139
Identity = 277/314 (88.22%), Positives = 281/314 (89.49%), Query Frame = 0
Query: 1 MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLTAGQVSTSSLLIISRIS 60
MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLT
Sbjct: 1 MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLT---------------- 60
Query: 61 NWLLRLDAAKSELKKLKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ 120
AAKSELK+LKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ
Sbjct: 61 -------AAKSELKQLKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ 120
Query: 121 EKVSLGAKIIEKSNYYAKVSEDISLKFQDQQDWVNANMICGEAEEHGLVKFETAKRGSET 180
EKVSLGAKIIEKS YYAKVSEDISLKFQDQQDWVNANMI GEAE H LVKFETAKRGSET
Sbjct: 121 EKVSLGAKIIEKSTYYAKVSEDISLKFQDQQDWVNANMIRGEAEGHELVKFETAKRGSET 180
Query: 181 EGSYDTVGGISGTRIYCSPNNLVEERKDLLGKLESAKDKLSQVSKMKCAVVLENSKIGQS 240
EGSYDTVGGISGTRIYC+ N +VEERKDLLGKLESAK KLSQVSKMKCAVVLENSKIGQS
Sbjct: 181 EGSYDTVGGISGTRIYCNLNYVVEERKDLLGKLESAKAKLSQVSKMKCAVVLENSKIGQS 240
Query: 241 IEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGETKYSRSLQDQIAKLKEISRVIKC 300
IEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGET+YSRSLQDQIAKLKEIS VIKC
Sbjct: 241 IEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGETEYSRSLQDQIAKLKEISHVIKC 291
Query: 301 TCGKEYKAGISSST 315
TCGKEYK GIS ST
Sbjct: 301 TCGKEYKTGISLST 291
BLAST of Carg22871 vs. ExPASy TrEMBL
Match:
A0A6J1FHU2 (uncharacterized protein LOC111445570 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111445570 PE=4 SV=1)
HSP 1 Score: 516.9 bits (1330), Expect = 5.8e-143
Identity = 283/314 (90.13%), Positives = 285/314 (90.76%), Query Frame = 0
Query: 1 MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLTAGQVSTSSLLIISRIS 60
MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLT
Sbjct: 1 MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLT---------------- 60
Query: 61 NWLLRLDAAKSELKKLKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ 120
AAKSELK+ KEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ
Sbjct: 61 -------AAKSELKQFKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ 120
Query: 121 EKVSLGAKIIEKSNYYAKVSEDISLKFQDQQDWVNANMICGEAEEHGLVKFETAKRGSET 180
EKVSLGAKIIEKSNYYAKVSEDISLKFQDQQDWVNANMI GEAEEHGLVKFETAKRGSET
Sbjct: 121 EKVSLGAKIIEKSNYYAKVSEDISLKFQDQQDWVNANMIRGEAEEHGLVKFETAKRGSET 180
Query: 181 EGSYDTVGGISGTRIYCSPNNLVEERKDLLGKLESAKDKLSQVSKMKCAVVLENSKIGQS 240
EGSYDTVGGISGTRIYCSPNNLVEERKD LESAKDKLSQVSKMKCAVVLENSKIGQS
Sbjct: 181 EGSYDTVGGISGTRIYCSPNNLVEERKD----LESAKDKLSQVSKMKCAVVLENSKIGQS 240
Query: 241 IEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGETKYSRSLQDQIAKLKEISRVIKC 300
IEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGET+YSRSLQDQIAKLKEISRVIKC
Sbjct: 241 IEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGETEYSRSLQDQIAKLKEISRVIKC 287
Query: 301 TCGKEYKAGISSST 315
TCGKEYKAGISSST
Sbjct: 301 TCGKEYKAGISSST 287
BLAST of Carg22871 vs. ExPASy TrEMBL
Match:
A0A6J1JXU6 (uncharacterized protein LOC111489811 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111489811 PE=4 SV=1)
HSP 1 Score: 505.4 bits (1300), Expect = 1.7e-139
Identity = 277/314 (88.22%), Positives = 281/314 (89.49%), Query Frame = 0
Query: 1 MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLTAGQVSTSSLLIISRIS 60
MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLT
Sbjct: 1 MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLT---------------- 60
Query: 61 NWLLRLDAAKSELKKLKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ 120
AAKSELK+LKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ
Sbjct: 61 -------AAKSELKQLKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ 120
Query: 121 EKVSLGAKIIEKSNYYAKVSEDISLKFQDQQDWVNANMICGEAEEHGLVKFETAKRGSET 180
EKVSLGAKIIEKS YYAKVSEDISLKFQDQQDWVNANMI GEAE H LVKFETAKRGSET
Sbjct: 121 EKVSLGAKIIEKSTYYAKVSEDISLKFQDQQDWVNANMIRGEAEGHELVKFETAKRGSET 180
Query: 181 EGSYDTVGGISGTRIYCSPNNLVEERKDLLGKLESAKDKLSQVSKMKCAVVLENSKIGQS 240
EGSYDTVGGISGTRIYC+ N +VEERKDLLGKLESAK KLSQVSKMKCAVVLENSKIGQS
Sbjct: 181 EGSYDTVGGISGTRIYCNLNYVVEERKDLLGKLESAKAKLSQVSKMKCAVVLENSKIGQS 240
Query: 241 IEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGETKYSRSLQDQIAKLKEISRVIKC 300
IEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGET+YSRSLQDQIAKLKEIS VIKC
Sbjct: 241 IEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGETEYSRSLQDQIAKLKEISHVIKC 291
Query: 301 TCGKEYKAGISSST 315
TCGKEYK GIS ST
Sbjct: 301 TCGKEYKTGISLST 291
BLAST of Carg22871 vs. ExPASy TrEMBL
Match:
A0A6J1FNS2 (uncharacterized protein LOC111445570 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111445570 PE=4 SV=1)
HSP 1 Score: 503.8 bits (1296), Expect = 5.1e-139
Identity = 283/337 (83.98%), Positives = 285/337 (84.57%), Query Frame = 0
Query: 1 MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLTAGQVSTSSLLIISRIS 60
MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLT
Sbjct: 1 MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLT---------------- 60
Query: 61 NWLLRLDAAKSELKKLKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ 120
AAKSELK+ KEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ
Sbjct: 61 -------AAKSELKQFKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ 120
Query: 121 EKVSLGAKIIEKSNYYAKVSEDISLKFQDQQDWVNANMICGEAEEHGLVKFETAKRGSET 180
EKVSLGAKIIEKSNYYAKVSEDISLKFQDQQDWVNANMI GEAEEHGLVKFETAKRGSET
Sbjct: 121 EKVSLGAKIIEKSNYYAKVSEDISLKFQDQQDWVNANMIRGEAEEHGLVKFETAKRGSET 180
Query: 181 EGSYDTVGGISGTRIYCSPNNL-----------------------VEERKDLLGKLESAK 240
EGSYDTVGGISGTRIYCSPNNL VEERKD LESAK
Sbjct: 181 EGSYDTVGGISGTRIYCSPNNLVLNSCHGLASVLKHNVLSNVGLQVEERKD----LESAK 240
Query: 241 DKLSQVSKMKCAVVLENSKIGQSIEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGE 300
DKLSQVSKMKCAVVLENSKIGQSIEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGE
Sbjct: 241 DKLSQVSKMKCAVVLENSKIGQSIEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGE 300
Query: 301 TKYSRSLQDQIAKLKEISRVIKCTCGKEYKAGISSST 315
T+YSRSLQDQIAKLKEISRVIKCTCGKEYKAGISSST
Sbjct: 301 TEYSRSLQDQIAKLKEISRVIKCTCGKEYKAGISSST 310
BLAST of Carg22871 vs. ExPASy TrEMBL
Match:
A0A6J1JUE5 (uncharacterized protein LOC111489811 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111489811 PE=4 SV=1)
HSP 1 Score: 464.9 bits (1195), Expect = 2.6e-127
Identity = 261/314 (83.12%), Positives = 265/314 (84.39%), Query Frame = 0
Query: 1 MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLTAGQVSTSSLLIISRIS 60
MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLT
Sbjct: 1 MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLT---------------- 60
Query: 61 NWLLRLDAAKSELKKLKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ 120
AAKSELK+LKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ
Sbjct: 61 -------AAKSELKQLKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ 120
Query: 121 EKVSLGAKIIEKSNYYAKVSEDISLKFQDQQDWVNANMICGEAEEHGLVKFETAKRGSET 180
EKVSLGAKIIEKS YYAKVSEDISLKFQDQQDWVNANMI GEAE H LVKFETAKRGSET
Sbjct: 121 EKVSLGAKIIEKSTYYAKVSEDISLKFQDQQDWVNANMIRGEAEGHELVKFETAKRGSET 180
Query: 181 EGSYDTVGGISGTRIYCSPNNLVEERKDLLGKLESAKDKLSQVSKMKCAVVLENSKIGQS 240
EGSYDTVGGISGTRIYC+ N +VEERKDLLGKLESAK KLSQVSKMKCAVVLENS
Sbjct: 181 EGSYDTVGGISGTRIYCNLNYVVEERKDLLGKLESAKAKLSQVSKMKCAVVLENS----- 240
Query: 241 IEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGETKYSRSLQDQIAKLKEISRVIKC 300
KPELRAMDDVTLEEESKALLSDKAGET+YSRSLQDQIAKLKEIS VIKC
Sbjct: 241 -----------KPELRAMDDVTLEEESKALLSDKAGETEYSRSLQDQIAKLKEISHVIKC 275
Query: 301 TCGKEYKAGISSST 315
TCGKEYK GIS ST
Sbjct: 301 TCGKEYKTGISLST 275
BLAST of Carg22871 vs. ExPASy TrEMBL
Match:
A0A6J1FGW1 (uncharacterized protein LOC111445570 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111445570 PE=4 SV=1)
HSP 1 Score: 463.8 bits (1192), Expect = 5.8e-127
Identity = 267/337 (79.23%), Positives = 269/337 (79.82%), Query Frame = 0
Query: 1 MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLTAGQVSTSSLLIISRIS 60
MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLT
Sbjct: 1 MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLT---------------- 60
Query: 61 NWLLRLDAAKSELKKLKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ 120
AAKSELK+ KEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ
Sbjct: 61 -------AAKSELKQFKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ 120
Query: 121 EKVSLGAKIIEKSNYYAKVSEDISLKFQDQQDWVNANMICGEAEEHGLVKFETAKRGSET 180
EKVSLGAKIIEKSNYYAKVSEDISLKFQDQQDWVNANMI GEAEEHGLVKFETAKRGSET
Sbjct: 121 EKVSLGAKIIEKSNYYAKVSEDISLKFQDQQDWVNANMIRGEAEEHGLVKFETAKRGSET 180
Query: 181 EGSYDTVGGISGTRIYCSPNNL-----------------------VEERKDLLGKLESAK 240
EGSYDTVGGISGTRIYCSPNNL VEERKD LESAK
Sbjct: 181 EGSYDTVGGISGTRIYCSPNNLVLNSCHGLASVLKHNVLSNVGLQVEERKD----LESAK 240
Query: 241 DKLSQVSKMKCAVVLENSKIGQSIEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGE 300
DKLSQVSKMKCAVVLENS KPELRAMDDVTLEEESKALLSDKAGE
Sbjct: 241 DKLSQVSKMKCAVVLENS----------------KPELRAMDDVTLEEESKALLSDKAGE 294
Query: 301 TKYSRSLQDQIAKLKEISRVIKCTCGKEYKAGISSST 315
T+YSRSLQDQIAKLKEISRVIKCTCGKEYKAGISSST
Sbjct: 301 TEYSRSLQDQIAKLKEISRVIKCTCGKEYKAGISSST 294
BLAST of Carg22871 vs. TAIR 10
Match:
AT1G33500.1 (unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 185.7 bits (470), Expect = 5.9e-47
Identity = 123/292 (42.12%), Positives = 169/292 (57.88%), Query Frame = 0
Query: 1 MEEYLQYMKTLRLQMSDVEDQVSKISVEEHMHFTTIRTMENDLTAGQVSTSSLLIISRIS 60
MEEYLQYMKTLR QM+DVED +K+SVEE M TTI T+E D
Sbjct: 1 MEEYLQYMKTLRSQMTDVEDHAAKVSVEEQMQVTTISTLEKD------------------ 60
Query: 61 NWLLRLDAAKSELKKLKEDAERMMRAKGEICSQILEQQRKITSLEHDICTLSQTLELIQQ 120
L+ A SE K+LKE+ ++ R +GEICS ILE+QRKI+S+E D ++Q+LELI Q
Sbjct: 61 -----LEHALSETKRLKEETDQKTRTRGEICSHILEKQRKISSMESDSVNIAQSLELILQ 120
Query: 121 EKVSLGAKIIEKSNYYAKVSEDISLKFQDQQDWVNANMICGEAEEHGLVKFETAKRGSET 180
E+ SL AK++ K + Y K +E+ K ++Q+ W ++M ET ++G +
Sbjct: 121 ERDSLSAKLVSKRSNYLKTAEEARTKLEEQKGWFISHM-----------SNETGQQGHKK 180
Query: 181 EGSYDTVGGISGTRIYCSPNNLVEERKDLLGKLESAKDKLSQVSKMKCAVVLENSKIGQS 240
E + NNL+E +SA+ KL Q M+ ++ ENSKI S
Sbjct: 181 E----------------TRNNLME-------LSDSARAKLDQAKLMRSNLLQENSKIKLS 235
Query: 241 IEEVKNELNDFKPELRAMDDVTLEEESKALLSDKAGETKYSRSLQDQIAKLK 293
IE VK+++N+FKPEL ++D LEEE ALLSD++GE +Y SLQ Q KLK
Sbjct: 241 IENVKHKINEFKPELMSVDIKILEEEYTALLSDESGEAEYLSSLQSQAEKLK 235
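The percentages in each HSP statistics line above are the match counts divided by the alignment length. The following sketch parses one such line and recomputes the values; the function names `parse_hsp_stats` and `percent` are hypothetical helpers written for this illustration, and the regular expression assumes the exact line layout shown in the alignments above.

```python
import re

# Matches lines of the form
#   "Identity = 283/314 (90.13%), <label> = 286/314 (91.08%), ..."
# The second label is matched loosely as any word starting with "Pos".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w* = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract identity and positive counts from an HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, length, identity_pct, positives, _, positives_pct = match.groups()
    return {
        "identical": int(identical),
        "positives": int(positives),
        "alignment_length": int(length),
        "identity_pct": float(identity_pct),
        "positives_pct": float(positives_pct),
    }


def percent(matches, length):
    """Percentage of aligned positions, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


# Statistics line of the TAIR 10 hit AT1G33500.1.
stats = parse_hsp_stats(
    "Identity = 123/292 (42.12%), Positives = 169/292 (57.88%), Query Frame = 0"
)
print(percent(stats["identical"], stats["alignment_length"]))  # 42.12
print(percent(stats["positives"], stats["alignment_length"]))  # 57.88
```

The alignment length (292 for this hit, 314 or 337 for the Cucurbita hits) counts gap columns as well, which is why hits with inserted gaps report a lower identity for the same number of identical residues.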
The following BLAST results are available for this feature:
BLAST of Carg22871 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
KAG7015899.1 | 1.3e-165 | 100.00 | hypothetical protein SDJN02_21002 [Cucurbita argyrosperma subsp. argyrosperma]
XP_023550047.1 | 2.4e-143 | 90.13 | uncharacterized protein LOC111808353 isoform X1 [Cucurbita pepo subsp. pepo]
KAG6578320.1 | 4.1e-143 | 90.13 | hypothetical protein SDJN03_22768, partial [Cucurbita argyrosperma subsp. sorori...
XP_022939802.1 | 1.2e-142 | 90.13 | uncharacterized protein LOC111445570 isoform X3 [Cucurbita moschata]
XP_022993971.1 | 3.6e-139 | 88.22 | uncharacterized protein LOC111489811 isoform X1 [Cucurbita maxima]
BLAST of Carg22871 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1FHU2 | 5.8e-143 | 90.13 | uncharacterized protein LOC111445570 isoform X3 OS=Cucurbita moschata OX=3662 GN...
A0A6J1JXU6 | 1.7e-139 | 88.22 | uncharacterized protein LOC111489811 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1FNS2 | 5.1e-139 | 83.98 | uncharacterized protein LOC111445570 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1JUE5 | 2.6e-127 | 83.12 | uncharacterized protein LOC111489811 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1FGW1 | 5.8e-127 | 79.23 | uncharacterized protein LOC111445570 isoform X2 OS=Cucurbita moschata OX=3662 GN...
BLAST of Carg22871 vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G33500.1 | 5.9e-47 | 42.12 | unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae -...