Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CTGGCTCGCCACTCTCTGGTCTTCTCAGACGCCTGTGTTGGTGAGGTCCACTGCTATCGCTCGAGTGACCAAAAATCTCCATTGATGGTTACTCTGCTCCTACCCATTTTCGTTCCCTGAGAAGAAGCATTAAACATCTTCAACAACCAACCTCAAATTCCCTTCTGTATTTCTCTACAAATGTCATTGCCTTCTCAGATTCTCTTCAGATGTTCGAGCCATTTGAAGCTTTGTTTCTTCCCCAACTCCTTTCCTACAAACAACAGTTTCTTCTCTCTCCCAATTTCTCCTCGATCCCTCAATCAACTTCATCAATTCCATCTTCATGCACATAACAATTCAACCTCCCGCTTTCGGAGTTATTGTCAATACGGCATTGGAGCTTTCGAATCCGAAGACGTTGCGCAGAGTAACGACAAGGATGGTGGCGATTTCGATTTAGAATCGGTTCTTTTGTTTTCTGAATTGTTTTCTCTCTTTTCTTCGGCTGTTTTCTTGGTTGTTTTTGTTGTGAATTTCGTGGGTTCGAGTTCGAAGAGGGCGCTTAGGGTATTGATGGGGGATAGGGGTTTGATTTGGGGGTTTTCTCTGCTAGTGGCTACCGCTGTTCTTAACTCGTGGATTCGAAGACGGCAATGGAGACGAATTTGTGGGGGAAAACGGAGCGGTGGGTTGAAGGTGGATTGCTTGGATAGGATCGAGAAATTAGAGGAGGATTTTAGGAGCTTGACGACTGTGATTCGGGGCTTGTCTAGGAAGCTTGAGAAGTTGAGCATAAGGTTTAGGGTAACACAAAAAACTCTGATGGATCCAATTGTTGAGGTATTCTTATCTCTCTTTTTTGTTTGTGATCTGTGTTAATACTCATAACTGGTGAATTGAATTGCTTAATTCTTCTTTGGGGTTAAACCTGTTTAGCATCACTGTTCTTGTGGGTATATATATATATTTTGATTCATGGGATGGAAATTATTGTATGGTGTAGATTTTTCATTGAGCTGTTGTTTTTGTTTTGTGTTTTAGTGTGTCCGTTTTCCTGTAAGATTGTTTTGTGTTTTTGTTTGTACGCTATTAAAGACCGCAGGTTTAGCTCAAAGAAATTATGAGGACACTCGAACTTCGGCTGTGCAAGAAGATGTTCTTGAGAAAGAACTCCTTGAAATACAAAAGGTCTTACTAGCCATGCAGGTAAAAGTTCAAATGATTATTATAAACTCTGTAGTTAGTTTACATTTATTACTCTTTATATGGTGTGTTAGGAATCACGAACTTTCACAATGGTATGATATTGTCCACTCTTTGAGCATAAGTTCTTGTGACTTTGCTTTTGATTTTCCTAAAATGTCTCGTACCAATGGAGATAGGGTTCTTAATAATTCTCAACAATCCTCCACTCGAACAAAATACACTACTGTAGAACCTCCTCTGAGGCCTCCTTCTGTGGAGCTCTCAAACAACCTCTCCTTAATCGAGACTCAACTCCTTCTCTGGAGCCCTCGAATAAAGTACACCATTTGTTCAACACTTAAGTCACTTTTGACTACACCTTCGATGCTTACAACTTCTTTGTTCGACATTTGAGGATATTGACATGACTAAGTTAAGGGTATGGCTCTGATACCATGTTAGAAACCATGATCCTCCACAATGGTATGATATTGTCCACTTTGAGCATAAGCACTCATGGCTTTGCTTTTGATTTCCCCAAAAGGCCTTATACCAATGGAGATAGTGTTCCTTACTGATAAACTCATGATCTGATCTTCTTCTTAATTAGCCAACGTGGGATTACTCCTCTCAATAATCGTCTGGATGGATGTATGAGATTGGATGAGGGTTCCTTGGCTTCAATGTGTTATATATACTTATTTCACCATCAATAGGCAAACTATAGACCAGCTTTTATAATTTTTTGAGGAATCCTTAATTTTGTCTTCGGACTAATATGCAATACGTTATTCGATTATGTATGCTCTTGAGATGATCTTTTTTCATCTTGTGATTCTGAAGTTAGTCTTGCAAGAAAAGCAGGTTGATTTTGGTAAGATTTCATTGTTTTGCATGTCGATTACGCAATATCGTTCTAAGAATGAGGCTAAAGAATGAATTTTGCTAATAACTTAATGGAGTTTCTGCAATGTGATCATGTGTAGGAGCAGCAGCAAAAGCAACTTGAGCTGATTATTGCAATAGGAGAAAAAGGGAAGCTGATGGAAAGCAAACGGGCACTTGATTAAGAATGAACAAGAATGGAAAGACGCAATTCTGCCAATGAGGATTCAAAAAAAACGGGAAGCTTATGAAATCTGAGGTAGGATCCTCATAAAACATTTCCCATTCATTATCAATCTTTTTTGTTGTGATTTGAAGTTAGAACTTGTCTCTTCCATATATTTATAATTGAAAAAAGCAGCAAATTGATTAATAGAAACTTATGATTTGGGTTTAGTAGAAGAGCACTGAAATGGAATATCTACACAATAAGAAGGATTGTTTCTATACTTGTGTTCTTGTGGCTGCTGTTTTCAAGTTTTCAAGTTTTGGTTGAGCCTTTTGAGCAATGGTATGGACCGTGGAATTTGGAAGATTCATAGAGCATGATGAAGAACTTGAATTGTCTAAGAGTCAGTAGTTAAGAGTTGAGTGAATTTAGAGTCGAGACTTGGATGATCATGACATGAAAGTGTAGACAACCACCTACATTGGAAGTGAAAGAAGCAAAATGAAGGAACCGATGACATATTAGAAAAGAACGAACAATCAGTGGAGCCAACTTGAGCATAGCTTTACTCATTAAGACATATGTTCTCAATCAAGAAGTTAGAGGTTCGAGTTCTCTCACTCCTTATTGAACTCAAAAAAACATTCCATGGTGAGGGACATGTTGGAAAGGTTCTAGATATGTTCGGGTTGAATGAAATTTTCATCTTTGAGTGAACATGTTAAGAATGATAAATGCAACCTTCGGACTTGGTCTGTCATTTAACTCATTGAGTTTAAGATGTGATAATTATGGTACATAAACCCGTGTAATCATCTCTACTTGGACAGTTGCATCCCATTGATATCAATTATATAGGTTAGATTCAGCATACAGATACACTCGAAGAACCGCGTCCCATCTCGAAGGGCCTCATTCTGTCAACATCAATTGC
mRNA sequence
CTGGCTCGCCACTCTCTGGTCTTCTCAGACGCCTGTGTTGGTGAGGTCCACTGCTATCGCTCGAGTGACCAAAAATCTCCATTGATGGTTACTCTGCTCCTACCCATTTTCGTTCCCTGAGAAGAAGCATTAAACATCTTCAACAACCAACCTCAAATTCCCTTCTGTATTTCTCTACAAATGTCATTGCCTTCTCAGATTCTCTTCAGATGTTCGAGCCATTTGAAGCTTTGTTTCTTCCCCAACTCCTTTCCTACAAACAACAGTTTCTTCTCTCTCCCAATTTCTCCTCGATCCCTCAATCAACTTCATCAATTCCATCTTCATGCACATAACAATTCAACCTCCCGCTTTCGGAGTTATTGTCAATACGGCATTGGAGCTTTCGAATCCGAAGACGTTGCGCAGAGTAACGACAAGGATGGTGGCGATTTCGATTTAGAATCGGTTCTTTTGTTTTCTGAATTGTTTTCTCTCTTTTCTTCGGCTGTTTTCTTGGTTGTTTTTGTTGTGAATTTCGTGGGTTCGAGTTCGAAGAGGGCGCTTAGGGTATTGATGGGGGATAGGGGTTTGATTTGGGGGTTTTCTCTGCTAGTGGCTACCGCTGTTCTTAACTCGTGGATTCGAAGACGGCAATGGAGACGAATTTGTGGGGGAAAACGGAGCGGTGGGTTGAAGGTGGATTGCTTGGATAGGATCGAGAAATTAGAGGAGGATTTTAGGAGCTTGACGACTGTGATTCGGGGCTTGTCTAGGAAGCTTGAGAAGTTGAGCATAAGGTTTAGGGTAACACAAAAAACTCTGATGGATCCAATTGTTGAGACCGCAGGTTTAGCTCAAAGAAATTATGAGGACACTCGAACTTCGGCTGTGCAAGAAGATGTTCTTGAGAAAGAACTCCTTGAAATACAAAAGGTCTTACTAGCCATGCAGGAGCAGCAGCAAAAGCAACTTGAGCTGATTATTGCAATAGGAGAAAAAGGGAAGCTGATGGAAAGCAAACGGGCACTTGATTAAGAATGAACAAGAATGGAAAGACGCAATTCTGCCAATGAGGATTCAAAAAAAACGGGAAGCTTATGAAATCTGAGGTAGGATCCTCATAAAACATTTCCCATTCATTATCAATCTTTTTTGTTGTGATTTGAAGTTAGAACTTGTCTCTTCCATATATTTATAATTGAAAAAAGCAGCAAATTGATTAATAGAAACTTATGATTTGGGTTTAGTAGAAGAGCACTGAAATGGAATATCTACACAATAAGAAGGATTGTTTCTATACTTGTGTTCTTGTGGCTGCTGTTTTCAAGTTTTCAAGTTTTGGTTGAGCCTTTTGAGCAATGGTATGGACCGTGGAATTTGGAAGATTCATAGAGCATGATGAAGAACTTGAATTGTCTAAGAGTCAGTAGTTAAGAGTTGAGTGAATTTAGAGTCGAGACTTGGATGATCATGACATGAAAGTGTAGACAACCACCTACATTGGAAGTGAAAGAAGCAAAATGAAGGAACCGATGACATATTAGAAAAGAACGAACAATCAGTGGAGCCAACTTGAGCATAGCTTTACTCATTAAGACATATGTTCTCAATCAAGAAGTTAGAGGTTCGAGTTCTCTCACTCCTTATTGAACTCAAAAAAACATTCCATGGTGAGGGACATGTTGGAAAGGTTCTAGATATGTTCGGGTTGAATGAAATTTTCATCTTTGAGTGAACATGTTAAGAATGATAAATGCAACCTTCGGACTTGGTCTGTCATTTAACTCATTGAGTTTAAGATGTGATAATTATGGTACATAAACCCGTGTAATCATCTCTACTTGGACAGTTGCATCCCATTGATATCAATTATATAGGTTAGATTCAGCATACAGATACACTCGAAGAACCGCGTCCCATCTCGAAGGGCCTCATTCTGTCAACATCAATTGC
Coding sequence (CDS)
ATGTCATTGCCTTCTCAGATTCTCTTCAGATGTTCGAGCCATTTGAAGCTTTGTTTCTTCCCCAACTCCTTTCCTACAAACAACAGTTTCTTCTCTCTCCCAATTTCTCCTCGATCCCTCAATCAACTTCATCAATTCCATCTTCATGCACATAACAATTCAACCTCCCGCTTTCGGAGTTATTGTCAATACGGCATTGGAGCTTTCGAATCCGAAGACGTTGCGCAGAGTAACGACAAGGATGGTGGCGATTTCGATTTAGAATCGGTTCTTTTGTTTTCTGAATTGTTTTCTCTCTTTTCTTCGGCTGTTTTCTTGGTTGTTTTTGTTGTGAATTTCGTGGGTTCGAGTTCGAAGAGGGCGCTTAGGGTATTGATGGGGGATAGGGGTTTGATTTGGGGGTTTTCTCTGCTAGTGGCTACCGCTGTTCTTAACTCGTGGATTCGAAGACGGCAATGGAGACGAATTTGTGGGGGAAAACGGAGCGGTGGGTTGAAGGTGGATTGCTTGGATAGGATCGAGAAATTAGAGGAGGATTTTAGGAGCTTGACGACTGTGATTCGGGGCTTGTCTAGGAAGCTTGAGAAGTTGAGCATAAGGTTTAGGGTAACACAAAAAACTCTGATGGATCCAATTGTTGAGACCGCAGGTTTAGCTCAAAGAAATTATGAGGACACTCGAACTTCGGCTGTGCAAGAAGATGTTCTTGAGAAAGAACTCCTTGAAATACAAAAGGTCTTACTAGCCATGCAGGAGCAGCAGCAAAAGCAACTTGAGCTGATTATTGCAATAGGAGAAAAAGGGAAGCTGATGGAAAGCAAACGGGCACTTGATTAA
Protein sequence
MSLPSQILFRCSSHLKLCFFPNSFPTNNSFFSLPISPRSLNQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGGKRSGGLKVDCLDRIEKLEEDFRSLTTVIRGLSRKLEKLSIRFRVTQKTLMDPIVETAGLAQRNYEDTRTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKRALD
Homology
BLAST of Cp4.1LG14g01090 vs. NCBI nr
Match:
XP_023552629.1 (uncharacterized protein LOC111810221 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 533 bits (1372), Expect = 1.87e-190
Identity = 278/278 (100.00%), Postives = 278/278 (100.00%), Query Frame = 0
Query: 1 MSLPSQILFRCSSHLKLCFFPNSFPTNNSFFSLPISPRSLNQLHQFHLHAHNNSTSRFRS 60
MSLPSQILFRCSSHLKLCFFPNSFPTNNSFFSLPISPRSLNQLHQFHLHAHNNSTSRFRS
Sbjct: 1 MSLPSQILFRCSSHLKLCFFPNSFPTNNSFFSLPISPRSLNQLHQFHLHAHNNSTSRFRS 60
Query: 61 YCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSKR 120
YCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSKR
Sbjct: 61 YCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSKR 120
Query: 121 ALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGGKRSGGLKVDCLDRIEKLEEDF 180
ALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGGKRSGGLKVDCLDRIEKLEEDF
Sbjct: 121 ALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGGKRSGGLKVDCLDRIEKLEEDF 180
Query: 181 RSLTTVIRGLSRKLEKLSIRFRVTQKTLMDPIVETAGLAQRNYEDTRTSAVQEDVLEKEL 240
RSLTTVIRGLSRKLEKLSIRFRVTQKTLMDPIVETAGLAQRNYEDTRTSAVQEDVLEKEL
Sbjct: 181 RSLTTVIRGLSRKLEKLSIRFRVTQKTLMDPIVETAGLAQRNYEDTRTSAVQEDVLEKEL 240
Query: 241 LEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKRALD 278
LEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKRALD
Sbjct: 241 LEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKRALD 278
BLAST of Cp4.1LG14g01090 vs. NCBI nr
Match:
XP_023552630.1 (uncharacterized protein LOC111810221 isoform X2 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 525 bits (1351), Expect = 2.77e-187
Identity = 276/278 (99.28%), Postives = 276/278 (99.28%), Query Frame = 0
Query: 1 MSLPSQILFRCSSHLKLCFFPNSFPTNNSFFSLPISPRSLNQLHQFHLHAHNNSTSRFRS 60
MSLPSQILFRCSSHLKLCFFPNSFPTNNSFFSLPISPRSLNQLHQFHLHAHNNSTSRFRS
Sbjct: 1 MSLPSQILFRCSSHLKLCFFPNSFPTNNSFFSLPISPRSLNQLHQFHLHAHNNSTSRFRS 60
Query: 61 YCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSKR 120
YCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSKR
Sbjct: 61 YCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSKR 120
Query: 121 ALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGGKRSGGLKVDCLDRIEKLEEDF 180
ALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGGKRSGGLKVDCLDRIEKLEEDF
Sbjct: 121 ALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGGKRSGGLKVDCLDRIEKLEEDF 180
Query: 181 RSLTTVIRGLSRKLEKLSIRFRVTQKTLMDPIVETAGLAQRNYEDTRTSAVQEDVLEKEL 240
RSLTTVIRGLSRKLEKLSIRFRVTQKTLMDPIVETAGLAQRNYEDTRTSAVQEDVLEKEL
Sbjct: 181 RSLTTVIRGLSRKLEKLSIRFRVTQKTLMDPIVETAGLAQRNYEDTRTSAVQEDVLEKEL 240
Query: 241 LEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKRALD 278
LEIQKVLLAMQ QQKQLELIIAIGEKGKLMESKRALD
Sbjct: 241 LEIQKVLLAMQ--QQKQLELIIAIGEKGKLMESKRALD 276
BLAST of Cp4.1LG14g01090 vs. NCBI nr
Match:
KAG6577290.1 (hypothetical protein SDJN03_24864, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 509 bits (1310), Expect = 5.28e-181
Identity = 268/278 (96.40%), Postives = 271/278 (97.48%), Query Frame = 0
Query: 1 MSLPSQILFRCSSHLKLCFFPNSFPTNNSFFSLPISPRSLNQLHQFHLHAHNNSTSRFRS 60
MSLPSQILFRCSS LK C FPNSFPTNNSFFSLPISPRS NQLHQFHLHAHNNSTSRFRS
Sbjct: 1 MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFSLPISPRSFNQLHQFHLHAHNNSTSRFRS 60
Query: 61 YCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSKR 120
CQYGIGAFESE+VAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSKR
Sbjct: 61 CCQYGIGAFESENVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSKR 120
Query: 121 ALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGGKRSGGLKVDCLDRIEKLEEDF 180
ALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGGKRSGGLKVD LDRIEKLEEDF
Sbjct: 121 ALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGGKRSGGLKVDLLDRIEKLEEDF 180
Query: 181 RSLTTVIRGLSRKLEKLSIRFRVTQKTLMDPIVETAGLAQRNYEDTRTSAVQEDVLEKEL 240
RSLTTVIRGLSRKLEKL IRFRVTQKTLMDPIVETAGLAQRNYEDT+TSAVQEDVLEKEL
Sbjct: 181 RSLTTVIRGLSRKLEKLGIRFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKEL 240
Query: 241 LEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKRALD 278
LEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESK+ALD
Sbjct: 241 LEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD 278
BLAST of Cp4.1LG14g01090 vs. NCBI nr
Match:
XP_022985520.1 (uncharacterized protein LOC111483510 isoform X1 [Cucurbita maxima])
HSP 1 Score: 507 bits (1306), Expect = 2.15e-180
Identity = 266/278 (95.68%), Postives = 270/278 (97.12%), Query Frame = 0
Query: 1 MSLPSQILFRCSSHLKLCFFPNSFPTNNSFFSLPISPRSLNQLHQFHLHAHNNSTSRFRS 60
MSLPSQILFRCSSHLK C FPNSFPTNNSFFSLPISPRSLNQLHQFHLHAHNNSTSRFRS
Sbjct: 1 MSLPSQILFRCSSHLKFCCFPNSFPTNNSFFSLPISPRSLNQLHQFHLHAHNNSTSRFRS 60
Query: 61 YCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSKR 120
YCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSKR
Sbjct: 61 YCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSKR 120
Query: 121 ALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGGKRSGGLKVDCLDRIEKLEEDF 180
ALRVLMGDR LIWGF LLVATAVLNSWIRRRQWRRICG KRSGGLKVD LDRIEKLEEDF
Sbjct: 121 ALRVLMGDRVLIWGFPLLVATAVLNSWIRRRQWRRICGVKRSGGLKVDLLDRIEKLEEDF 180
Query: 181 RSLTTVIRGLSRKLEKLSIRFRVTQKTLMDPIVETAGLAQRNYEDTRTSAVQEDVLEKEL 240
RSLTTVIRG+SRKLEKL IRFRVTQKTLMDPIVETAGLAQRNYEDT+TSAVQEDVLEKEL
Sbjct: 181 RSLTTVIRGMSRKLEKLGIRFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKEL 240
Query: 241 LEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKRALD 278
LEIQKVLLAMQEQQQKQLELI+AIGEKGKL ESK+ALD
Sbjct: 241 LEIQKVLLAMQEQQQKQLELIVAIGEKGKLTESKQALD 278
BLAST of Cp4.1LG14g01090 vs. NCBI nr
Match:
KAG7015376.1 (hypothetical protein SDJN02_23011, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 507 bits (1305), Expect = 3.05e-180
Identity = 267/278 (96.04%), Postives = 270/278 (97.12%), Query Frame = 0
Query: 1 MSLPSQILFRCSSHLKLCFFPNSFPTNNSFFSLPISPRSLNQLHQFHLHAHNNSTSRFRS 60
MSLPSQILFRCSS LK C FPNSFPTNNSFFSLPISPRS NQLHQFHLHAHNNSTSRFRS
Sbjct: 1 MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFSLPISPRSFNQLHQFHLHAHNNSTSRFRS 60
Query: 61 YCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSKR 120
CQYGIGAFESE+VAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSKR
Sbjct: 61 CCQYGIGAFESENVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSKR 120
Query: 121 ALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGGKRSGGLKVDCLDRIEKLEEDF 180
ALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGGKRSGGLKVD LDRIEKLEEDF
Sbjct: 121 ALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGGKRSGGLKVDLLDRIEKLEEDF 180
Query: 181 RSLTTVIRGLSRKLEKLSIRFRVTQKTLMDPIVETAGLAQRNYEDTRTSAVQEDVLEKEL 240
RSLTTVIRGLSRKLEKL RFRVTQKTLMDPIVETAGLAQRNYEDT+TSAVQEDVLEKEL
Sbjct: 181 RSLTTVIRGLSRKLEKLGTRFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKEL 240
Query: 241 LEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKRALD 278
LEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESK+ALD
Sbjct: 241 LEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD 278
BLAST of Cp4.1LG14g01090 vs. ExPASy TrEMBL
Match:
A0A6J1JBJ5 (uncharacterized protein LOC111483510 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111483510 PE=4 SV=1)
HSP 1 Score: 507 bits (1306), Expect = 1.04e-180
Identity = 266/278 (95.68%), Postives = 270/278 (97.12%), Query Frame = 0
Query: 1 MSLPSQILFRCSSHLKLCFFPNSFPTNNSFFSLPISPRSLNQLHQFHLHAHNNSTSRFRS 60
MSLPSQILFRCSSHLK C FPNSFPTNNSFFSLPISPRSLNQLHQFHLHAHNNSTSRFRS
Sbjct: 1 MSLPSQILFRCSSHLKFCCFPNSFPTNNSFFSLPISPRSLNQLHQFHLHAHNNSTSRFRS 60
Query: 61 YCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSKR 120
YCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSKR
Sbjct: 61 YCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSKR 120
Query: 121 ALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGGKRSGGLKVDCLDRIEKLEEDF 180
ALRVLMGDR LIWGF LLVATAVLNSWIRRRQWRRICG KRSGGLKVD LDRIEKLEEDF
Sbjct: 121 ALRVLMGDRVLIWGFPLLVATAVLNSWIRRRQWRRICGVKRSGGLKVDLLDRIEKLEEDF 180
Query: 181 RSLTTVIRGLSRKLEKLSIRFRVTQKTLMDPIVETAGLAQRNYEDTRTSAVQEDVLEKEL 240
RSLTTVIRG+SRKLEKL IRFRVTQKTLMDPIVETAGLAQRNYEDT+TSAVQEDVLEKEL
Sbjct: 181 RSLTTVIRGMSRKLEKLGIRFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKEL 240
Query: 241 LEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKRALD 278
LEIQKVLLAMQEQQQKQLELI+AIGEKGKL ESK+ALD
Sbjct: 241 LEIQKVLLAMQEQQQKQLELIVAIGEKGKLTESKQALD 278
BLAST of Cp4.1LG14g01090 vs. ExPASy TrEMBL
Match:
A0A6J1EN38 (uncharacterized protein LOC111436002 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111436002 PE=4 SV=1)
HSP 1 Score: 504 bits (1298), Expect = 1.79e-179
Identity = 268/279 (96.06%), Postives = 270/279 (96.77%), Query Frame = 0
Query: 1 MSLPSQILFRCSSHLKLCFFPNSFPTNNSFF-SLPISPRSLNQLHQFHLHAHNNSTSRFR 60
MSLPSQILFRCSS LK C FPNSFPTNNSFF SLPISPRS NQLHQFHLHAHNNSTSRFR
Sbjct: 1 MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFR 60
Query: 61 SYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSK 120
SYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSK
Sbjct: 61 SYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSK 120
Query: 121 RALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGGKRSGGLKVDCLDRIEKLEED 180
RALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICG KRSGGLKVD LDRIEKLEED
Sbjct: 121 RALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEED 180
Query: 181 FRSLTTVIRGLSRKLEKLSIRFRVTQKTLMDPIVETAGLAQRNYEDTRTSAVQEDVLEKE 240
FRSLTTVIR LSRKLEKL IRFRVTQKTLMDPIVETAGLAQRNYEDT+TSAVQEDVLEKE
Sbjct: 181 FRSLTTVIRALSRKLEKLGIRFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKE 240
Query: 241 LLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKRALD 278
LLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESK+ALD
Sbjct: 241 LLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD 279
BLAST of Cp4.1LG14g01090 vs. ExPASy TrEMBL
Match:
A0A6J1J8G1 (uncharacterized protein LOC111483510 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111483510 PE=4 SV=1)
HSP 1 Score: 499 bits (1285), Expect = 1.53e-177
Identity = 264/278 (94.96%), Postives = 268/278 (96.40%), Query Frame = 0
Query: 1 MSLPSQILFRCSSHLKLCFFPNSFPTNNSFFSLPISPRSLNQLHQFHLHAHNNSTSRFRS 60
MSLPSQILFRCSSHLK C FPNSFPTNNSFFSLPISPRSLNQLHQFHLHAHNNSTSRFRS
Sbjct: 1 MSLPSQILFRCSSHLKFCCFPNSFPTNNSFFSLPISPRSLNQLHQFHLHAHNNSTSRFRS 60
Query: 61 YCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSKR 120
YCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSKR
Sbjct: 61 YCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSKR 120
Query: 121 ALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGGKRSGGLKVDCLDRIEKLEEDF 180
ALRVLMGDR LIWGF LLVATAVLNSWIRRRQWRRICG KRSGGLKVD LDRIEKLEEDF
Sbjct: 121 ALRVLMGDRVLIWGFPLLVATAVLNSWIRRRQWRRICGVKRSGGLKVDLLDRIEKLEEDF 180
Query: 181 RSLTTVIRGLSRKLEKLSIRFRVTQKTLMDPIVETAGLAQRNYEDTRTSAVQEDVLEKEL 240
RSLTTVIRG+SRKLEKL IRFRVTQKTLMDPIVETAGLAQRNYEDT+TSAVQEDVLEKEL
Sbjct: 181 RSLTTVIRGMSRKLEKLGIRFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKEL 240
Query: 241 LEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKRALD 278
LEIQKVLLAMQ QQKQLELI+AIGEKGKL ESK+ALD
Sbjct: 241 LEIQKVLLAMQ--QQKQLELIVAIGEKGKLTESKQALD 276
BLAST of Cp4.1LG14g01090 vs. ExPASy TrEMBL
Match:
A0A6J1EMR8 (uncharacterized protein LOC111436002 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111436002 PE=4 SV=1)
HSP 1 Score: 496 bits (1277), Expect = 2.64e-176
Identity = 266/279 (95.34%), Postives = 268/279 (96.06%), Query Frame = 0
Query: 1 MSLPSQILFRCSSHLKLCFFPNSFPTNNSFF-SLPISPRSLNQLHQFHLHAHNNSTSRFR 60
MSLPSQILFRCSS LK C FPNSFPTNNSFF SLPISPRS NQLHQFHLHAHNNSTSRFR
Sbjct: 1 MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFR 60
Query: 61 SYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSK 120
SYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSK
Sbjct: 61 SYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSK 120
Query: 121 RALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGGKRSGGLKVDCLDRIEKLEED 180
RALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICG KRSGGLKVD LDRIEKLEED
Sbjct: 121 RALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEED 180
Query: 181 FRSLTTVIRGLSRKLEKLSIRFRVTQKTLMDPIVETAGLAQRNYEDTRTSAVQEDVLEKE 240
FRSLTTVIR LSRKLEKL IRFRVTQKTLMDPIVETAGLAQRNYEDT+TSAVQEDVLEKE
Sbjct: 181 FRSLTTVIRALSRKLEKLGIRFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKE 240
Query: 241 LLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKRALD 278
LLEIQKVLLAMQ QQKQLELIIAIGEKGKLMESK+ALD
Sbjct: 241 LLEIQKVLLAMQ--QQKQLELIIAIGEKGKLMESKQALD 277
BLAST of Cp4.1LG14g01090 vs. ExPASy TrEMBL
Match:
A0A6J1FNU5 (uncharacterized protein LOC111446891 OS=Cucurbita moschata OX=3662 GN=LOC111446891 PE=4 SV=1)
HSP 1 Score: 380 bits (976), Expect = 4.79e-130
Identity = 209/282 (74.11%), Postives = 233/282 (82.62%), Query Frame = 0
Query: 1 MSLPSQILFRCSSHLKLCFFPNSFPTNNSFFSLPISPRSLNQLHQFHLHAHN----NSTS 60
MSL SQ LFRCS+ LK C F NS P N+ FSLPI+ R L L+QFH+H H N+ S
Sbjct: 1 MSLSSQNLFRCSNRLKFCSFTNSLPRCNTSFSLPIASRFLTNLYQFHVHTHKLQNPNNLS 60
Query: 61 RFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGS 120
RSYC YGIG ESED+ QS+D+ GGDF+LESVLLFSELFSLF+SAVFLV FVVNFVGS
Sbjct: 61 SLRSYCHYGIGVSESEDIEQSDDR-GGDFNLESVLLFSELFSLFASAVFLVGFVVNFVGS 120
Query: 121 SSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGGKRSGGLKVDCLDRIEKL 180
SSK+AL VL+GDRGL+WGF LLVAT VLN+WIRRRQWRR+CG K SGGLKV+ LDRIEKL
Sbjct: 121 SSKKALWVLIGDRGLVWGFPLLVATVVLNTWIRRRQWRRVCGEKASGGLKVNLLDRIEKL 180
Query: 181 EEDFRSLTTVIRGLSRKLEKLSIRFRVTQKTLMDPIVETAGLAQRNYEDTRTSAVQEDVL 240
EED RS TTVIR LSRKLEKL IRF VT+KT+ D I E+A LAQRN +DTRT AVQEDVL
Sbjct: 181 EEDLRSSTTVIRALSRKLEKLGIRFLVTRKTVRDSIAESAALAQRNSQDTRTLAVQEDVL 240
Query: 241 EKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKRALD 278
EKELLEIQKVLLAMQEQQQKQLELIIAIGEK KL++SK+ D
Sbjct: 241 EKELLEIQKVLLAMQEQQQKQLELIIAIGEKEKLLKSKQRHD 281
BLAST of Cp4.1LG14g01090 vs. TAIR 10
Match:
AT5G65250.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 167.5 bits (423), Expect = 1.5e-41
Identity = 107/211 (50.71%), Postives = 134/211 (63.51%), Query Frame = 0
Query: 66 IGAFESEDVAQSN---DKDGGDFDLESVLLFSELFSLFSSAVFLVVFVVNFVGSSSKRAL 125
IG+F ED + SN D FDL S + F+E + SSAV VV VN+V
Sbjct: 72 IGSF--EDSSSSNLLEDASSDGFDLGSFVSFAEALCILSSAVISVVLAVNYVVVGE---- 131
Query: 126 RVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGGKRSGGLKVDCLDRIEKLEEDFRS 185
+G + L GF LV + SW+RRRQW RIC G R + + R+EKLE+D +S
Sbjct: 132 ---IGKKVLSLGFVGLVGSVATGSWLRRRQWMRICKGARESE-GTNLIRRLEKLEKDLKS 191
Query: 186 LTTVIRGLSRKLEKLSIRFRVTQKTLMDPIVETAGLAQRNYEDTRTSAVQEDVLEKELLE 245
T+++R LSR LEKL IRFRVT+K L +PI ETA LAQ+N E TR Q+++LEKEL E
Sbjct: 192 STSIVRVLSRHLEKLGIRFRVTRKALKEPISETAALAQKNSEATRVLVAQQEILEKELGE 251
Query: 246 IQKVLLAMQEQQQKQLELIIAIGEKGKLMES 274
IQKVLLAMQEQQ+KQLELI+ I + KL ES
Sbjct: 252 IQKVLLAMQEQQRKQLELILTIAKSSKLFES 272
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_023552629.1 | 1.87e-190 | 100.00 | uncharacterized protein LOC111810221 isoform X1 [Cucurbita pepo subsp. pepo] | [more] |
XP_023552630.1 | 2.77e-187 | 99.28 | uncharacterized protein LOC111810221 isoform X2 [Cucurbita pepo subsp. pepo] | [more] |
KAG6577290.1 | 5.28e-181 | 96.40 | hypothetical protein SDJN03_24864, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_022985520.1 | 2.15e-180 | 95.68 | uncharacterized protein LOC111483510 isoform X1 [Cucurbita maxima] | [more] |
KAG7015376.1 | 3.05e-180 | 96.04 | hypothetical protein SDJN02_23011, partial [Cucurbita argyrosperma subsp. argyro... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1JBJ5 | 1.04e-180 | 95.68 | uncharacterized protein LOC111483510 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1EN38 | 1.79e-179 | 96.06 | uncharacterized protein LOC111436002 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1J8G1 | 1.53e-177 | 94.96 | uncharacterized protein LOC111483510 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1EMR8 | 2.64e-176 | 95.34 | uncharacterized protein LOC111436002 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1FNU5 | 4.79e-130 | 74.11 | uncharacterized protein LOC111446891 OS=Cucurbita moschata OX=3662 GN=LOC1114468... | [more] |
Match Name | E-value | Identity | Description | |
AT5G65250.1 | 1.5e-41 | 50.71 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |