Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, polypeptide, CDS, three_prime_UTR
GATGTAACAAAATGAATGAGTTTATGGTGAGCTAAAAATTAATACGGACAAATTTAAGGAAAAAAGTATTATTATTTTAAATAAATAACTTTGTCGGGTTACGAACCGTACGACCAATAGTCCACCAAGTCTCCCTTCTAAATTTACAAACCGTGATTCGATGCCGCGTCTTCGCCGATATGGAAATGAAATCAAGAGTGAATCATCTTCTTCACACCATTTCACTGAATCATCTTCTCTCACCTTCAAATCCGAACCAGACCTCCAATGGCGGCTTTACACTCTTATTTGTCCCGTTTCTTCTCCGATTTCCCATTTCGGATTACGAATTACTCTGAAGCAGACCTACTCATGTTCTTCGTCTTCTTCACCTTCTCCGCCCTCGGTTTCTCTCTTTATCTTCGTAAAAGGTTGCGGACGATCAGTAAGGCGAACGAGCCTGAGAGAATCGATATCGATCAGACCTGTTTGGTTCATTCGCTTCTATTCGAGAGATTACCGCCGGATTCTCCGAAATGGACGAGTTTCTTTGTTGAACAGGGCCAGGATGACCTCGATTTGAACAAGGAATTTGGAGAGTCTGGTCAGGAGGAGCAAGGGGGGAAGAGGAAGAAGAAGAGGGCAAAGAAGAAGAGGGCGAACCTGCTGCACGGCGGAGATGAGCCGGAATTGACTTTGTTGTATCCATTTACATCGTCTACTAGTGTGATTCAGAGGAAGATTAAACGGCAGTATGATGAACTTATGAAGTGTCAGGAATCAAAGGAGTTAACGTTGGCTCAGGTGCTGTATTCTTATACCTTGTCAATCATTCTTCTGTGTTATTGTCTGTGCTTCAAATAGAGTTTCCGCAGCAGTTCTTCTGGTTCGTTTTGACTCTGCCATTACCATAATTTTTGCTTGCAAGACTGCCTAGGAACAGACATCGAGTAATTCCTGGACTAAAATGACTAACCTATGTAGAATGATTGTTGTTCATCGAGTTTCCATTTGAAATCTGCTTAATCTATACGAAATGATGTTTTTGGTTCGACTGTTGTGAGGTCCTATACGGTTGGAGAGGGGAATGAAGCATTCCTTATATAAGGGTGTAGAAACCTCTCCCTAGTAGACGCGTTGTGAGGCTGACGATGATACGTAACAGGGTCAAAGTGGACAATATCTGCTAGAGGTGGACTTGAGTTATTACAAATGATACGTAACAGGCTCTGGGCGGTGTGCTAGCAAGTACGCTGGGCTCAAGGGAGGTGGATTGTGAGGTCCCCATTGATTGGAAAGGGAAATAAAACATTCCTTATAAGGTTGTGGAAACCTCTCCCTAGTAGATGCGTTTTAAAATCGTGAAGTTGAAAGTGATACGTAACAGGACAAAGCAGACAATATCTACTAGCGGTGGGCTTGGGCTATTACAATTCTGTTATGTGAGAATTCATGGTTTTAGCTGATGAATATGGAGTTTTTAGCTGCTCGAGTCCAAAACTCGATCTGTCCCTGTTATCCATAACTTTCCGATACCACTGGTTTCGTGTCGCTAGTTGAAATCGAGACTGAATCGGAGATCGAGTTCTAAGCAATTTATTTCTGTTTTCCTTTCAAATGCAGGTCACACAATTTGCCAACTGCTTAATCAACGCTAGAAGCAAGCTGCAGCACAAGTATGCTTCATATCTTTGCTTCAACAATATTAGTGTCCATTTTTTAGGAGTTTATATGCATAACTCTCCAACTATGTACACTTTTTCTACTGACAGAGCTGATGTTATCCGCCGAAAGTTCACCATAACGAAAGCTCTGCTTTATAAGGCAGATCGATCCTCCCTCGATCGGCTTCAACAGCAGGTTGGTGACATGATATGATTTGGTATATGTTGCCCAATGTTTCATCTATACTCAAATTTGTTGATCTGTTGTTGTCTTAGATATACAAGTTAGAATTGGAACAGAAAAGACTGGAGGAGGATACCTTTGTTTTTAACTGGCTTCAACAACAGCTTAAACTC
TCTCCAGCATACAAAAAGGTATTGCTGCTACCCCACTTGCTTCATTTGCACCGTTTTTCGCTGCCGATGCTTCCTAGACGTTGGAAAGTACGCTGCTAGTCTGTTGTAGCCTCTGAATCCCATCCCCCGAACATAGTTATCGCCCTTCCGATTCTGTCACCATTTTCCCATAATTGTAAGAGTCCAAGCCCACCGCTAAAAACCCTGATCATTTAACCCTAAACCCTTGAGGGGAAGCTCAAAAGTGCGTACTAGTCTGTTGTAGCCTCTGAATCCCATCCCCCGAATATAGTTATCGCCCTTCCGATTCTGTCACCCTTTTCCCATAATTGTAAGAGTCCAAGCCCACCGCTAAAAACCCTAATCATTAAATTCTAAACCCTTGAGGGAAACCCCGAAAGGGGAAGTCCAAAGAGGGAAAGCCCAAAGAGGACAATATCTACTAGCGGTGGGTTTATGCCGTTAGTTGAAGTTATAAAATAGGATATCTTCTTCCCTTCGTTTTGCTCTCTCTCTTTTCATACACACTTCAAATTGTTATTGAGAATAATGCAACAGCCAGAGCTGATGATTGCTCTGAACCAACCTTCTGATGGTCCAGATGCTGGAAATTGGTAGCTGCACGGAGTTAATGGAAAAATCTGAGAACTCGACAGAAAAGATCGATCTCGAGTCTACCGACATATCGTTTGAAGAACTATTAGCGCAGGAAAAAAAGGATTCATTTTGGTAAGACACCATCATATCTGATAGTTCATTGTTTGTAATGGATGAGGGCTTAGTAACAGTGGTTCCCCTCCAAATTTTCATGCAGGCAGAGGAATGGGAAACTGAGATCATGCTCAAGCTGATACAAGTGGTGAGCTTCCTCTTCTCTAATCTCTTCCTGTAACAAAAGAACAGATTTGCAGCATGTAAATCCAAAAAGAAGAACACTTTTTTTTTTCTTTAACCTAGAAGGCATTTTGTTAGCAGCCTCTTCTCTAATCTCTCTAATCTCTTCATTAGATGCTTTTTTGTTTATACCATCAAAACATGTAGCTGTAGTCAATCACTGGCTATTGGCTTTGCAATTATAAAATTTCAATACAATTATAC
mRNA sequence
GATGTAACAAAATGAATGAGTTTATGGTGAGCTAAAAATTAATACGGACAAATTTAAGGAAAAAAGTATTATTATTTTAAATAAATAACTTTGTCGGGTTACGAACCGTACGACCAATAGTCCACCAAGTCTCCCTTCTAAATTTACAAACCGTGATTCGATGCCGCGTCTTCGCCGATATGGAAATGAAATCAAGAGTGAATCATCTTCTTCACACCATTTCACTGAATCATCTTCTCTCACCTTCAAATCCGAACCAGACCTCCAATGGCGGCTTTACACTCTTATTTGTCCCGTTTCTTCTCCGATTTCCCATTTCGGATTACGAATTACTCTGAAGCAGACCTACTCATGTTCTTCGTCTTCTTCACCTTCTCCGCCCTCGGTTTCTCTCTTTATCTTCGTAAAAGGTTGCGGACGATCAGTAAGGCGAACGAGCCTGAGAGAATCGATATCGATCAGACCTGTTTGGTTCATTCGCTTCTATTCGAGAGATTACCGCCGGATTCTCCGAAATGGACGAGTTTCTTTGTTGAACAGGGCCAGGATGACCTCGATTTGAACAAGGAATTTGGAGAGTCTGGTCAGGAGGAGCAAGGGGGGAAGAGGAAGAAGAAGAGGGCAAAGAAGAAGAGGGCGAACCTGCTGCACGGCGGAGATGAGCCGGAATTGACTTTGTTGTATCCATTTACATCGTCTACTAGTGTGATTCAGAGGAAGATTAAACGGCAGTATGATGAACTTATGAAGTGTCAGGAATCAAAGGAGTTAACGTTGGCTCAGGTCACACAATTTGCCAACTGCTTAATCAACGCTAGAAGCAAGCTGCAGCACAAAGCTGATGTTATCCGCCGAAAGTTCACCATAACGAAAGCTCTGCTTTATAAGGCAGATCGATCCTCCCTCGATCGGCTTCAACAGCAGATGCTGGAAATTGGTAGCTGCACGGAGTTAATGGAAAAATCTGAGAACTCGACAGAAAAGATCGATCTCGAGTCTACCGACATATCGTTTGAAGAACTATTAGCGCAGGAAAAAAAGGATTCATTTTGGCAGAGGAATGGGAAACTGAGATCATGCTCAAGCTGATACAAGTGGTGAGCTTCCTCTTCTCTAATCTCTTCCTGTAACAAAAGAACAGATTTGCAGCATGTAAATCCAAAAAGAAGAACACTTTTTTTTTTCTTTAACCTAGAAGGCATTTTGTTAGCAGCCTCTTCTCTAATCTCTCTAATCTCTTCATTAGATGCTTTTTTGTTTATACCATCAAAACATGTAGCTGTAGTCAATCACTGGCTATTGGCTTTGCAATTATAAAATTTCAATACAATTATAC
Coding sequence (CDS)
ATGGCGGCTTTACACTCTTATTTGTCCCGTTTCTTCTCCGATTTCCCATTTCGGATTACGAATTACTCTGAAGCAGACCTACTCATGTTCTTCGTCTTCTTCACCTTCTCCGCCCTCGGTTTCTCTCTTTATCTTCGTAAAAGGTTGCGGACGATCAGTAAGGCGAACGAGCCTGAGAGAATCGATATCGATCAGACCTGTTTGGTTCATTCGCTTCTATTCGAGAGATTACCGCCGGATTCTCCGAAATGGACGAGTTTCTTTGTTGAACAGGGCCAGGATGACCTCGATTTGAACAAGGAATTTGGAGAGTCTGGTCAGGAGGAGCAAGGGGGGAAGAGGAAGAAGAAGAGGGCAAAGAAGAAGAGGGCGAACCTGCTGCACGGCGGAGATGAGCCGGAATTGACTTTGTTGTATCCATTTACATCGTCTACTAGTGTGATTCAGAGGAAGATTAAACGGCAGTATGATGAACTTATGAAGTGTCAGGAATCAAAGGAGTTAACGTTGGCTCAGGTCACACAATTTGCCAACTGCTTAATCAACGCTAGAAGCAAGCTGCAGCACAAAGCTGATGTTATCCGCCGAAAGTTCACCATAACGAAAGCTCTGCTTTATAAGGCAGATCGATCCTCCCTCGATCGGCTTCAACAGCAGATGCTGGAAATTGGTAGCTGCACGGAGTTAATGGAAAAATCTGAGAACTCGACAGAAAAGATCGATCTCGAGTCTACCGACATATCGTTTGAAGAACTATTAGCGCAGGAAAAAAAGGATTCATTTTGGCAGAGGAATGGGAAACTGAGATCATGCTCAAGCTGA
Protein sequence
MAALHSYLSRFFSDFPFRITNYSEADLLMFFVFFTFSALGFSLYLRKRLRTISKANEPERIDIDQTCLVHSLLFERLPPDSPKWTSFFVEQGQDDLDLNKEFGESGQEEQGGKRKKKRAKKKRANLLHGGDEPELTLLYPFTSSTSVIQRKIKRQYDELMKCQESKELTLAQVTQFANCLINARSKLQHKADVIRRKFTITKALLYKADRSSLDRLQQQMLEIGSCTELMEKSENSTEKIDLESTDISFEELLAQEKKDSFWQRNGKLRSCSS
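The protein sequence above appears to be the in-frame translation of the CDS. The following is a minimal sketch of that relationship, assuming the standard nuclear genetic code (NCBI translation table 1); the page does not state which table was used, and the names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), codons ordered
# with bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGGCGGCTTTACACTCT"))  # MAALHS
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence is a quick consistency check on the annotation.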
Homology
BLAST of Cp4.1LG12g08730 vs. NCBI nr
Match:
XP_023548104.1 (uncharacterized protein LOC111806840 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 508 bits (1309), Expect = 1.71e-180
Identity = 273/306 (89.22%), Positives = 273/306 (89.22%), Query Frame = 0
Query: 1 MAALHSYLSRFFSDFPFRITNYSEADLLMFFVFFTFSALGFSLYLRKRLRTISKANEPER 60
MAALHSYLSRFFSDFPFRITNYSEADLLMFFVFFTFSALGFSLYLRKRLRTISKANEPER
Sbjct: 1 MAALHSYLSRFFSDFPFRITNYSEADLLMFFVFFTFSALGFSLYLRKRLRTISKANEPER 60
Query: 61 IDIDQTCLVHSLLFERLPPDSPKWTSFFVEQGQDDLDLNKEFGESGQEEQGGKRKKKRAK 120
IDIDQTCLVHSLLFERLPPDSPKWTSFFVEQGQDDLDLNKEFGESGQEEQGGKRKKKRAK
Sbjct: 61 IDIDQTCLVHSLLFERLPPDSPKWTSFFVEQGQDDLDLNKEFGESGQEEQGGKRKKKRAK 120
Query: 121 KKRANLLHGGDEPELTLLYPFTSSTSVIQRKIKRQYDELMKCQESKELTLAQVTQFANCL 180
KKRANLLHGGDEPELTLLYPFTSSTSVIQRKIKRQYDELMKCQESKELTLAQVTQFANCL
Sbjct: 121 KKRANLLHGGDEPELTLLYPFTSSTSVIQRKIKRQYDELMKCQESKELTLAQVTQFANCL 180
Query: 181 INARSKLQHKADVIRRKFTITKALLYKADRSSLDRLQQQ--------------------- 240
INARSKLQHKADVIRRKFTITKALLYKADRSSLDRLQQQ
Sbjct: 181 INARSKLQHKADVIRRKFTITKALLYKADRSSLDRLQQQIYKLELEQKRLEEDTFVFNWL 240
Query: 241 ------------MLEIGSCTELMEKSENSTEKIDLESTDISFEELLAQEKKDSFWQRNGK 273
MLEIGSCTELMEKSENSTEKIDLESTDISFEELLAQEKKDSFWQRNGK
Sbjct: 241 QQQLKLSPAYKKMLEIGSCTELMEKSENSTEKIDLESTDISFEELLAQEKKDSFWQRNGK 300
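The percentages in an HSP summary line are the match count divided by the alignment length, which in BLAST output includes gap columns (here 306 columns for a 273-residue query). The following is a small sketch that recomputes the figure from the reported fraction; the function name `parse_identity` and the regular expression are illustrative assumptions, not part of any BLAST tool.

```python
import re


def parse_identity(line):
    """Return (matches, alignment_length, reported_pct, recomputed_pct)."""
    m = re.search(r"Identity = (\d+)/(\d+) \(([\d.]+)%\)", line)
    if m is None:
        raise ValueError("no Identity field found")
    matches, length = int(m.group(1)), int(m.group(2))
    reported = float(m.group(3))
    recomputed = round(100 * matches / length, 2)
    return matches, length, reported, recomputed


# Identity field from the HSP summary above.
print(parse_identity("Identity = 273/306 (89.22%)"))
```

The same function can be applied to the summary line of each hit to confirm that the reported and recomputed percentages agree.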
BLAST of Cp4.1LG12g08730 vs. NCBI nr
Match:
XP_022953217.1 (uncharacterized protein LOC111455829 isoform X1 [Cucurbita moschata])
HSP 1 Score: 479 bits (1232), Expect = 9.18e-169
Identity = 262/306 (85.62%), Positives = 264/306 (86.27%), Query Frame = 0
Query: 1 MAALHSYLSRFFSDFPFRITNYSEADLLMFFVFFTFSALGFSLYLRKRLRTISKANEPER 60
MAALHSYLSRFFSDFPFRITNYSEADLLMFFVFFTFSALGFSLYL KRLRTISKA EPER
Sbjct: 1 MAALHSYLSRFFSDFPFRITNYSEADLLMFFVFFTFSALGFSLYLCKRLRTISKAIEPER 60
Query: 61 IDIDQTCLVHSLLFERLPPDSPKWTSFFVEQGQDDLDLNKEFGESGQEEQGGKRKKKRAK 120
IDIDQT LVH LLFE LPPDSPKWTSFFVEQGQD LDLN+EFGES QEEQGGKRKKKRAK
Sbjct: 61 IDIDQTWLVHLLLFESLPPDSPKWTSFFVEQGQDVLDLNREFGESVQEEQGGKRKKKRAK 120
Query: 121 KKRANLLHGGDEPELTLLYPFTSSTSVIQRKIKRQYDELMKCQESKELTLAQVTQFANCL 180
KKRANLLHGGDEPELTLLYPFTSST+VIQRKIKRQYDELMKCQESKELTLAQV QFANCL
Sbjct: 121 KKRANLLHGGDEPELTLLYPFTSSTTVIQRKIKRQYDELMKCQESKELTLAQVRQFANCL 180
Query: 181 INARSKLQHKADVIRRKFTITKALLYKADRSSLDRLQQQ--------------------- 240
INARSKLQHKADVIRRKFTITKALLYKADRSSLDRLQQQ
Sbjct: 181 INARSKLQHKADVIRRKFTITKALLYKADRSSLDRLQQQIYKLELEQKRLEEDTLVFNWL 240
Query: 241 ------------MLEIGSCTELMEKSENSTEKIDLESTDISFEELLAQEKKDSFWQRNGK 273
MLEIGSCTELMEKSENSTEKID ESTDISFEELLAQEKKDSFWQRNGK
Sbjct: 241 QQQLKLSPAYKKMLEIGSCTELMEKSENSTEKIDPESTDISFEELLAQEKKDSFWQRNGK 300
BLAST of Cp4.1LG12g08730 vs. NCBI nr
Match:
KAG7014022.1 (hypothetical protein SDJN02_24193 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 461 bits (1187), Expect = 1.25e-161
Identity = 258/324 (79.63%), Positives = 260/324 (80.25%), Query Frame = 0
Query: 1 MAALHSYLSRFFSDFPFRITNYSEADLLMFFVFFTFSALGFSLYLRKRLRTISKANEPER 60
MAALHSYLSRFFSDFPFRITNYSEADLLMFFVFFTFSALGFSLYL KRLRTISKA PER
Sbjct: 1 MAALHSYLSRFFSDFPFRITNYSEADLLMFFVFFTFSALGFSLYLCKRLRTISKAIHPER 60
Query: 61 IDIDQTCLVHSLLFERLPPDSPKWTSFFVEQGQDDLDLNKEFGESGQEEQGGKRKKKRAK 120
IDIDQTCL HSLLFE L PDSPKWTSFFVEQGQDDLDLNKEFGES QEEQGGKRKKKR K
Sbjct: 61 IDIDQTCLAHSLLFESLLPDSPKWTSFFVEQGQDDLDLNKEFGESVQEEQGGKRKKKREK 120
Query: 121 KKRANLLHGGDEPELTLLYPFTSSTSVIQRKIKRQYDELMKCQESKELTLAQVTQFANCL 180
KKRANLLHGGDEPELTL+YPFTSST+VIQRKIKRQYDELMKCQESKELTLAQV QFANCL
Sbjct: 121 KKRANLLHGGDEPELTLMYPFTSSTTVIQRKIKRQYDELMKCQESKELTLAQVRQFANCL 180
Query: 181 INARSKLQHK------------------ADVIRRKFTITKALLYKADRSSLDRLQQQ--- 240
IN RSKLQHK ADVIRRKFTITKALL KADRSSLDRLQQQ
Sbjct: 181 INVRSKLQHKYGVYMHNSPTMYTFSTDRADVIRRKFTITKALLCKADRSSLDRLQQQIYK 240
Query: 241 ------------------------------MLEIGSCTELMEKSENSTEKIDLESTDISF 273
MLEIGSC ELMEKSENSTEKID ESTDISF
Sbjct: 241 LELEQKRLEEDTFVFNWLQQQLKLSPAYKKMLEIGSCMELMEKSENSTEKIDPESTDISF 300
BLAST of Cp4.1LG12g08730 vs. NCBI nr
Match:
XP_023548105.1 (uncharacterized protein LOC111806840 isoform X2 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 424 bits (1089), Expect = 1.56e-147
Identity = 219/223 (98.21%), Positives = 222/223 (99.55%), Query Frame = 0
Query: 1 MAALHSYLSRFFSDFPFRITNYSEADLLMFFVFFTFSALGFSLYLRKRLRTISKANEPER 60
MAALHSYLSRFFSDFPFRITNYSEADLLMFFVFFTFSALGFSLYLRKRLRTISKANEPER
Sbjct: 1 MAALHSYLSRFFSDFPFRITNYSEADLLMFFVFFTFSALGFSLYLRKRLRTISKANEPER 60
Query: 61 IDIDQTCLVHSLLFERLPPDSPKWTSFFVEQGQDDLDLNKEFGESGQEEQGGKRKKKRAK 120
IDIDQTCLVHSLLFERLPPDSPKWTSFFVEQGQDDLDLNKEFGESGQEEQGGKRKKKRAK
Sbjct: 61 IDIDQTCLVHSLLFERLPPDSPKWTSFFVEQGQDDLDLNKEFGESGQEEQGGKRKKKRAK 120
Query: 121 KKRANLLHGGDEPELTLLYPFTSSTSVIQRKIKRQYDELMKCQESKELTLAQVTQFANCL 180
KKRANLLHGGDEPELTLLYPFTSSTSVIQRKIKRQYDELMKCQESKELTLAQVTQFANCL
Sbjct: 121 KKRANLLHGGDEPELTLLYPFTSSTSVIQRKIKRQYDELMKCQESKELTLAQVTQFANCL 180
Query: 181 INARSKLQHKADVIRRKFTITKALLYKADRSSLDRLQQQMLEI 223
INARSKLQHKADVIRRKFTITKALLYKADRSSLDRLQQQ+ ++
Sbjct: 181 INARSKLQHKADVIRRKFTITKALLYKADRSSLDRLQQQIYKL 223
BLAST of Cp4.1LG12g08730 vs. NCBI nr
Match:
XP_022953218.1 (uncharacterized protein LOC111455829 isoform X2 [Cucurbita moschata])
HSP 1 Score: 397 bits (1019), Expect = 7.09e-137
Identity = 209/223 (93.72%), Positives = 214/223 (95.96%), Query Frame = 0
Query: 1 MAALHSYLSRFFSDFPFRITNYSEADLLMFFVFFTFSALGFSLYLRKRLRTISKANEPER 60
MAALHSYLSRFFSDFPFRITNYSEADLLMFFVFFTFSALGFSLYL KRLRTISKA EPER
Sbjct: 1 MAALHSYLSRFFSDFPFRITNYSEADLLMFFVFFTFSALGFSLYLCKRLRTISKAIEPER 60
Query: 61 IDIDQTCLVHSLLFERLPPDSPKWTSFFVEQGQDDLDLNKEFGESGQEEQGGKRKKKRAK 120
IDIDQT LVH LLFE LPPDSPKWTSFFVEQGQD LDLN+EFGES QEEQGGKRKKKRAK
Sbjct: 61 IDIDQTWLVHLLLFESLPPDSPKWTSFFVEQGQDVLDLNREFGESVQEEQGGKRKKKRAK 120
Query: 121 KKRANLLHGGDEPELTLLYPFTSSTSVIQRKIKRQYDELMKCQESKELTLAQVTQFANCL 180
KKRANLLHGGDEPELTLLYPFTSST+VIQRKIKRQYDELMKCQESKELTLAQV QFANCL
Sbjct: 121 KKRANLLHGGDEPELTLLYPFTSSTTVIQRKIKRQYDELMKCQESKELTLAQVRQFANCL 180
Query: 181 INARSKLQHKADVIRRKFTITKALLYKADRSSLDRLQQQMLEI 223
INARSKLQHKADVIRRKFTITKALLYKADRSSLDRLQQQ+ ++
Sbjct: 181 INARSKLQHKADVIRRKFTITKALLYKADRSSLDRLQQQIYKL 223
BLAST of Cp4.1LG12g08730 vs. ExPASy TrEMBL
Match:
A0A6J1GMM6 (uncharacterized protein LOC111455829 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111455829 PE=4 SV=1)
HSP 1 Score: 479 bits (1232), Expect = 4.44e-169
Identity = 262/306 (85.62%), Positives = 264/306 (86.27%), Query Frame = 0
Query: 1 MAALHSYLSRFFSDFPFRITNYSEADLLMFFVFFTFSALGFSLYLRKRLRTISKANEPER 60
MAALHSYLSRFFSDFPFRITNYSEADLLMFFVFFTFSALGFSLYL KRLRTISKA EPER
Sbjct: 1 MAALHSYLSRFFSDFPFRITNYSEADLLMFFVFFTFSALGFSLYLCKRLRTISKAIEPER 60
Query: 61 IDIDQTCLVHSLLFERLPPDSPKWTSFFVEQGQDDLDLNKEFGESGQEEQGGKRKKKRAK 120
IDIDQT LVH LLFE LPPDSPKWTSFFVEQGQD LDLN+EFGES QEEQGGKRKKKRAK
Sbjct: 61 IDIDQTWLVHLLLFESLPPDSPKWTSFFVEQGQDVLDLNREFGESVQEEQGGKRKKKRAK 120
Query: 121 KKRANLLHGGDEPELTLLYPFTSSTSVIQRKIKRQYDELMKCQESKELTLAQVTQFANCL 180
KKRANLLHGGDEPELTLLYPFTSST+VIQRKIKRQYDELMKCQESKELTLAQV QFANCL
Sbjct: 121 KKRANLLHGGDEPELTLLYPFTSSTTVIQRKIKRQYDELMKCQESKELTLAQVRQFANCL 180
Query: 181 INARSKLQHKADVIRRKFTITKALLYKADRSSLDRLQQQ--------------------- 240
INARSKLQHKADVIRRKFTITKALLYKADRSSLDRLQQQ
Sbjct: 181 INARSKLQHKADVIRRKFTITKALLYKADRSSLDRLQQQIYKLELEQKRLEEDTLVFNWL 240
Query: 241 ------------MLEIGSCTELMEKSENSTEKIDLESTDISFEELLAQEKKDSFWQRNGK 273
MLEIGSCTELMEKSENSTEKID ESTDISFEELLAQEKKDSFWQRNGK
Sbjct: 241 QQQLKLSPAYKKMLEIGSCTELMEKSENSTEKIDPESTDISFEELLAQEKKDSFWQRNGK 300
BLAST of Cp4.1LG12g08730 vs. ExPASy TrEMBL
Match:
A0A6J1GMT9 (uncharacterized protein LOC111455829 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111455829 PE=4 SV=1)
HSP 1 Score: 397 bits (1019), Expect = 3.43e-137
Identity = 209/223 (93.72%), Positives = 214/223 (95.96%), Query Frame = 0
Query: 1 MAALHSYLSRFFSDFPFRITNYSEADLLMFFVFFTFSALGFSLYLRKRLRTISKANEPER 60
MAALHSYLSRFFSDFPFRITNYSEADLLMFFVFFTFSALGFSLYL KRLRTISKA EPER
Sbjct: 1 MAALHSYLSRFFSDFPFRITNYSEADLLMFFVFFTFSALGFSLYLCKRLRTISKAIEPER 60
Query: 61 IDIDQTCLVHSLLFERLPPDSPKWTSFFVEQGQDDLDLNKEFGESGQEEQGGKRKKKRAK 120
IDIDQT LVH LLFE LPPDSPKWTSFFVEQGQD LDLN+EFGES QEEQGGKRKKKRAK
Sbjct: 61 IDIDQTWLVHLLLFESLPPDSPKWTSFFVEQGQDVLDLNREFGESVQEEQGGKRKKKRAK 120
Query: 121 KKRANLLHGGDEPELTLLYPFTSSTSVIQRKIKRQYDELMKCQESKELTLAQVTQFANCL 180
KKRANLLHGGDEPELTLLYPFTSST+VIQRKIKRQYDELMKCQESKELTLAQV QFANCL
Sbjct: 121 KKRANLLHGGDEPELTLLYPFTSSTTVIQRKIKRQYDELMKCQESKELTLAQVRQFANCL 180
Query: 181 INARSKLQHKADVIRRKFTITKALLYKADRSSLDRLQQQMLEI 223
INARSKLQHKADVIRRKFTITKALLYKADRSSLDRLQQQ+ ++
Sbjct: 181 INARSKLQHKADVIRRKFTITKALLYKADRSSLDRLQQQIYKL 223
BLAST of Cp4.1LG12g08730 vs. ExPASy TrEMBL
Match:
A0A0A0KCQ0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G451370 PE=4 SV=1)
HSP 1 Score: 369 bits (948), Expect = 3.01e-125
Identity = 226/348 (64.94%), Positives = 238/348 (68.39%), Query Frame = 0
Query: 1 MAALHSYLSRFFSDFPFRITNYSEADLLM-----FFVFFTFSALGFSLYLRKRLRTI--- 60
MAALHSYLSRFF +FPFRITN SE D+ M FF+FFTF L S +L KR++ I
Sbjct: 1 MAALHSYLSRFFPNFPFRITNLSEGDIPMLLLCSFFLFFTFFVLVLSFFLYKRVKKIEFG 60
Query: 61 ------SKANEPERIDI-----------DQTCLVHSLLFERLPPDSPKWTSFFVEQGQDD 120
S EPE+IDI D+TCL HSLLFE LPPDSPKW SFFVE DD
Sbjct: 61 QHQQLISNPIEPEKIDIGNSVADCGNGTDRTCLTHSLLFEILPPDSPKWASFFVEGRCDD 120
Query: 121 LDL-----NKEFGESGQEEQGGKRKKKRAKKKRANLLHG------------GDEPELTLL 180
LDL NKEFG+SGQE QGGKRKKK+AKKKRANL G G E ELTLL
Sbjct: 121 LDLKSDGLNKEFGDSGQE-QGGKRKKKKAKKKRANLQDGDENEKWGTDVGTGSEQELTLL 180
Query: 181 YPFTSSTSVIQRKIKRQYDELMKCQESKELTLAQVTQFANCLINARSKLQHKADVIRRKF 240
YPFTSSTSVIQRKIKRQYDELMKCQESKELTLAQV QFANCLINARSKLQHKADVI RKF
Sbjct: 181 YPFTSSTSVIQRKIKRQYDELMKCQESKELTLAQVRQFANCLINARSKLQHKADVIHRKF 240
Query: 241 TITKALLYKADRSSLDRLQQQ---------------------------------MLEIGS 273
TITKALLYKADRSS DRLQQQ MLEIG+
Sbjct: 241 TITKALLYKADRSSFDRLQQQIYKLELEQKRLEEDTFVYNWLQQQLKLSPAYKKMLEIGT 300
BLAST of Cp4.1LG12g08730 vs. ExPASy TrEMBL
Match:
A0A1S3CHA9 (uncharacterized protein LOC103500732 OS=Cucumis melo OX=3656 GN=LOC103500732 PE=4 SV=1)
HSP 1 Score: 367 bits (941), Expect = 3.37e-124
Identity = 226/347 (65.13%), Positives = 236/347 (68.01%), Query Frame = 0
Query: 1 MAALHSYLSRFFSDFPFRITNYSEAD----LLMFFVFFTFSALGFSLYLRKRLRT----- 60
MAALHSYLSRFF FPFRITN+SE D LL F+FFTF L S L KR++
Sbjct: 1 MAALHSYLSRFFPSFPFRITNFSEGDIPMLLLCSFLFFTFFILVLSFSLYKRVKKVEFEE 60
Query: 61 ----ISKANEPERIDI-----------DQTCLVHSLLFERLPPDSPKWTSFFVEQGQDDL 120
IS EPE+IDI D+TCL HSLLFE LPPDSPKW SFFVE DDL
Sbjct: 61 HQQLISNPIEPEKIDIGNSVTDCGNGTDRTCLTHSLLFEILPPDSPKWASFFVEGRCDDL 120
Query: 121 DL-----NKEFGESGQEEQGGKRKKKRAKKKRANLLHG------------GDEPELTLLY 180
DL NKEFG+SGQE QGGKRKKK+AKKKRANL G G E ELTLLY
Sbjct: 121 DLKGARLNKEFGDSGQE-QGGKRKKKKAKKKRANLQDGDENVKWGTDVGTGSEQELTLLY 180
Query: 181 PFTSSTSVIQRKIKRQYDELMKCQESKELTLAQVTQFANCLINARSKLQHKADVIRRKFT 240
PFTSSTSVIQRKIKRQYDELMKCQESKELTLAQV QFANCLINARSKLQHKADVI RKFT
Sbjct: 181 PFTSSTSVIQRKIKRQYDELMKCQESKELTLAQVRQFANCLINARSKLQHKADVIHRKFT 240
Query: 241 ITKALLYKADRSSLDRLQQQ---------------------------------MLEIGSC 273
ITKALLYKADRSS DRLQQQ MLEIG+C
Sbjct: 241 ITKALLYKADRSSFDRLQQQIYKLELEQKRLEEDTFVYNWLQQQLKLSPAYKKMLEIGTC 300
BLAST of Cp4.1LG12g08730 vs. ExPASy TrEMBL
Match:
A0A6J1D702 (uncharacterized protein LOC111017622 OS=Momordica charantia OX=3673 GN=LOC111017622 PE=4 SV=1)
HSP 1 Score: 354 bits (909), Expect = 2.78e-119
Identity = 220/351 (62.68%), Positives = 234/351 (66.67%), Query Frame = 0
Query: 1 MAALHSYLSRFFSDFPFRITNYSEADLLM-----FFVFFTFSALGFSLYLRKRLRTI--- 60
MA L SYLSRFF +FP RI+NYS+ DL M FFVFFTFS L S L KRLR I
Sbjct: 1 MAPLRSYLSRFFPNFPLRISNYSDGDLPMLLLCSFFVFFTFSVLVLSFSLYKRLRKIEFE 60
Query: 61 ------SKANEPERIDIDQT--------------CLVHSLLFERLPPDSPKWTSFFVEQG 120
SK +EPERIDI + CL HSLLFE LPPDSPKW S F E+G
Sbjct: 61 HHQQLISKPSEPERIDIGHSLASCGEGTDRRSPACLTHSLLFEILPPDSPKWGSLFDEEG 120
Query: 121 QDDLD-----LNKEFGESGQEEQGGKRKKKRAKKKRANLLHG------------GDEPEL 180
+DDLD LN+EFG+SGQE QGGKRKKKRAKKKRAN G E EL
Sbjct: 121 RDDLDSKGSGLNREFGDSGQE-QGGKRKKKRAKKKRANSQAEDETDNWGVDSGTGSEQEL 180
Query: 181 TLLYPFTSSTSVIQRKIKRQYDELMKCQESKELTLAQVTQFANCLINARSKLQHKADVIR 240
TLLYPFTSSTSVIQRKIK+QYDELMKCQESKELTLAQV QFANCLINARSKLQHKADVI
Sbjct: 181 TLLYPFTSSTSVIQRKIKQQYDELMKCQESKELTLAQVRQFANCLINARSKLQHKADVIH 240
Query: 241 RKFTITKALLYKADRSSLDRLQQQ---------------------------------MLE 273
RKFTITKALLYKADRSS DRLQQQ MLE
Sbjct: 241 RKFTITKALLYKADRSSFDRLQQQIYKLELEQKRLEEDTFVYNWLQQQLKLSPAYKKMLE 300
BLAST of Cp4.1LG12g08730 vs. TAIR 10
Match:
AT1G17665.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 15 plant structures; EXPRESSED DURING: 11 growth stages; Has 149 Blast hits to 146 proteins in 39 species: Archae - 0; Bacteria - 4; Metazoa - 21; Fungi - 5; Plants - 30; Viruses - 0; Other Eukaryotes - 89 (source: NCBI BLink). )
HSP 1 Score: 164.1 bits (414), Expect = 1.6e-40
Identity = 113/283 (39.93%), Positives = 154/283 (54.42%), Query Frame = 0
Query: 49 LRTISKANEPERIDIDQTCLVHSLLFERLPPDSPKWTSFFVEQGQDDLDLNKEFGESGQE 108
L IS + + + + T L +S L+E L D + +DD D E
Sbjct: 69 LSEISDEAQYQTHENEPTHLTNSRLYELLLSD----------KKEDDSD-----WEGDHV 128
Query: 109 EQGGKRKKKRAKKKRANLL---HGGD------------------------EPELTLLYPF 168
++ K+KK R KKK++++ GG+ +PE LYPF
Sbjct: 129 KKKKKKKKNRGKKKKSDIRGDESGGEKQLGEGEDGLVLNPRTDSISISENKPEFVCLYPF 188
Query: 169 TSSTSVIQRKIKRQYDELMKCQESKELTLAQVTQFANCLINARSKLQHKADVIRRKFTIT 228
TS++S QRKIK+QYD+L+KC +K LTLAQV +FANCLI A+++LQHK++VI+RKF+IT
Sbjct: 189 TSTSSATQRKIKQQYDQLVKCNNAKGLTLAQVGEFANCLIEAKNELQHKSEVIKRKFSIT 248
Query: 229 KALLYKADRSSLDRLQQQ---------------------------------MLEIGSCTE 272
KALL+KADRSS DRL+QQ +LEI + E
Sbjct: 249 KALLFKADRSSFDRLRQQIYKLEMEQKRVEEDALVYNWLQQQLKLSPAYKKVLEISASME 308
The following BLAST results are available for this feature:
BLAST of Cp4.1LG12g08730 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
XP_023548104.1 | 1.71e-180 | 89.22 | uncharacterized protein LOC111806840 isoform X1 [Cucurbita pepo subsp. pepo]
XP_022953217.1 | 9.18e-169 | 85.62 | uncharacterized protein LOC111455829 isoform X1 [Cucurbita moschata]
KAG7014022.1 | 1.25e-161 | 79.63 | hypothetical protein SDJN02_24193 [Cucurbita argyrosperma subsp. argyrosperma]
XP_023548105.1 | 1.56e-147 | 98.21 | uncharacterized protein LOC111806840 isoform X2 [Cucurbita pepo subsp. pepo]
XP_022953218.1 | 7.09e-137 | 93.72 | uncharacterized protein LOC111455829 isoform X2 [Cucurbita moschata]
BLAST of Cp4.1LG12g08730 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1GMM6 | 4.44e-169 | 85.62 | uncharacterized protein LOC111455829 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1GMT9 | 3.43e-137 | 93.72 | uncharacterized protein LOC111455829 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A0A0KCQ0 | 3.01e-125 | 64.94 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G451370 PE=4 SV=1
A0A1S3CHA9 | 3.37e-124 | 65.13 | uncharacterized protein LOC103500732 OS=Cucumis melo OX=3656 GN=LOC103500732 PE=...
A0A6J1D702 | 2.78e-119 | 62.68 | uncharacterized protein LOC111017622 OS=Momordica charantia OX=3673 GN=LOC111017...
BLAST of Cp4.1LG12g08730 vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G17665.1 | 1.6e-40 | 39.93 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...