Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAGAAGTGAAGAAATCACAAATCCCAATTCCTAATCGAAATCCCATCTATACATAACGACAATTATAATGATGGTTACGTGTTTGGATCCCTCTCTTTCCTCTAACCACCATCGCCTTCTCTGATTCAGATGCCCTTTGGGTCTAATTCAATTCCCCTTTTTCTTTTTCCCTTCTTTTCTGTACAGCCGACATTTTCCCAAACACAGCAACTCAGACTCATCCATGGCGGACGATGGCAAGGTCATTATTCTTTTCTCACTTCACACATAATTCTCATCTAACTCTTTTCATTTTCTGTTCTGATCGATATACACTTCTATGTCCTCTGTTTTTTTTTTTTAAACTATATATATATATTTTATAATTTTTGCAGAAAGTTGTGGTCATGATGCTTAAAGTGGACTTGCAGTGTGATCGTTGCTACAAGAAAGTCAAGAAAGTTCTCTGCAAATTCCCACGTGAGTTCTTTTCTATTTCTCTTTCAATTTTCTTTCTCAAAATGGATTGTTTCTTACATTTATTTTATTTTTTTCAACAACAGAAATTCGAGACCAGATTTATGATGAAAAACAAAACCTAGTGATTATCAAAGTGGTCTGTTGCAATCCTGAGAAGCTTAGAGATAAAATTTGTTGTAAGGGATGTGGGGTTATTAAAAGCATTGAAATCAAAGAGCCTGAACCTCCCAAGCCTCCTCCTCCCAAGCACGCCGATCCTCCTCCACCACCCAAAAAAGTCGATCCTCCCCCTCCTAAAAAGCCCGATCCACCCCCACCCCAAAAGGTCGATCCTCCGCCTCCGAAAAAAGCCGATCCTCCGCCTCCGAAAAAAGCCGATCCCCCACCCCCCAAAAAGCCCGACCCACCACCACCCAGTAAAGCGGCAGACCCTCCTCCACCACAAAAGGCAGCCGATCCTCCTCCGCCCAAAAAGGCAGACCCGGCGCCGCCCAAAAAGGTTGACCCACCACCCGCGAAAGCGGAGCCTCCGCCTCCGCCTCCGAAGAAGGTGGATCCTCCGCCGGTAGTGGTCCCACAGCCCAACCCGGTTCCGATACCGGTGCCGGTTCAACCGGAGCCGTACCCGGTGAACATGTGCGTGCCGGTTCCGGGTTATCCGCCGGGGTACCCTGTTGGGATTGGGGTGTGCTGTAGGCAGTGCTATGAAGGGAGGGGTGGGGGCCCATGTTATAGTGGGTTTGGTGGGACAGGCCCGTGTTGCGATGGGTGTGCTTCTGGAAGGCCCATTTACGATAGTTACGGTGGAGGGAGGCCCTGTTACGTTAGCCACTGTGAGTATCTTAACGAAGAAAATGCAAGTGGGTGCGTTGTTATGTGAGAGGGACACGTGTGACTGTTGTGGATGGTTTTCATTATATGATAATAATATATCTAAGTTCGGATGTTTAATCCCATTTTATTTTATTTTATTTTATTTTATTTTATTTTATTTTA
mRNA sequence
AAAGAAGTGAAGAAATCACAAATCCCAATTCCTAATCGAAATCCCATCTATACATAACGACAATTATAATGATGGTTACGTGTTTGGATCCCTCTCTTTCCTCTAACCACCATCGCCTTCTCTGATTCAGATGCCCTTTGGGTCTAATTCAATTCCCCTTTTTCTTTTTCCCTTCTTTTCTGTACAGCCGACATTTTCCCAAACACAGCAACTCAGACTCATCCATGGCGGACGATGGCAAGAAAGTTGTGGTCATGATGCTTAAAGTGGACTTGCAGTGTGATCGTTGCTACAAGAAAGTCAAGAAAGTTCTCTGCAAATTCCCACAAATTCGAGACCAGATTTATGATGAAAAACAAAACCTAGTGATTATCAAAGTGGTCTGTTGCAATCCTGAGAAGCTTAGAGATAAAATTTGTTGTAAGGGATGTGGGGTTATTAAAAGCATTGAAATCAAAGAGCCTGAACCTCCCAAGCCTCCTCCTCCCAAGCACGCCGATCCTCCTCCACCACCCAAAAAAGTCGATCCTCCCCCTCCTAAAAAGCCCGATCCACCCCCACCCCAAAAGGTCGATCCTCCGCCTCCGAAAAAAGCCGATCCTCCGCCTCCGAAAAAAGCCGATCCCCCACCCCCCAAAAAGCCCGACCCACCACCACCCAGTAAAGCGGCAGACCCTCCTCCACCACAAAAGGCAGCCGATCCTCCTCCGCCCAAAAAGGCAGACCCGGCGCCGCCCAAAAAGGTTGACCCACCACCCGCGAAAGCGGAGCCTCCGCCTCCGCCTCCGAAGAAGGTGGATCCTCCGCCGGTAGTGGTCCCACAGCCCAACCCGGTTCCGATACCGGTGCCGGTTCAACCGGAGCCGTACCCGGTGAACATGTGCGTGCCGGTTCCGGGTTATCCGCCGGGGTACCCTGTTGGGATTGGGGTGTGCTGTAGGCAGTGCTATGAAGGGAGGGGTGGGGGCCCATGTTATAGTGGGTTTGGTGGGACAGGCCCGTGTTGCGATGGGTGTGCTTCTGGAAGGCCCATTTACGATAGTTACGGTGGAGGGAGGCCCTGTTACGTTAGCCACTGTGAGTATCTTAACGAAGAAAATGCAAGTGGGTGCGTTGTTATGTGAGAGGGACACGTGTGACTGTTGTGGATGGTTTTCATTATATGATAATAATATATCTAAGTTCGGATGTTTAATCCCATTTTATTTTATTTTATTTTATTTTATTTTATTTTATTTTA
Coding sequence (CDS)
ATGGCGGACGATGGCAAGAAAGTTGTGGTCATGATGCTTAAAGTGGACTTGCAGTGTGATCGTTGCTACAAGAAAGTCAAGAAAGTTCTCTGCAAATTCCCACAAATTCGAGACCAGATTTATGATGAAAAACAAAACCTAGTGATTATCAAAGTGGTCTGTTGCAATCCTGAGAAGCTTAGAGATAAAATTTGTTGTAAGGGATGTGGGGTTATTAAAAGCATTGAAATCAAAGAGCCTGAACCTCCCAAGCCTCCTCCTCCCAAGCACGCCGATCCTCCTCCACCACCCAAAAAAGTCGATCCTCCCCCTCCTAAAAAGCCCGATCCACCCCCACCCCAAAAGGTCGATCCTCCGCCTCCGAAAAAAGCCGATCCTCCGCCTCCGAAAAAAGCCGATCCCCCACCCCCCAAAAAGCCCGACCCACCACCACCCAGTAAAGCGGCAGACCCTCCTCCACCACAAAAGGCAGCCGATCCTCCTCCGCCCAAAAAGGCAGACCCGGCGCCGCCCAAAAAGGTTGACCCACCACCCGCGAAAGCGGAGCCTCCGCCTCCGCCTCCGAAGAAGGTGGATCCTCCGCCGGTAGTGGTCCCACAGCCCAACCCGGTTCCGATACCGGTGCCGGTTCAACCGGAGCCGTACCCGGTGAACATGTGCGTGCCGGTTCCGGGTTATCCGCCGGGGTACCCTGTTGGGATTGGGGTGTGCTGTAGGCAGTGCTATGAAGGGAGGGGTGGGGGCCCATGTTATAGTGGGTTTGGTGGGACAGGCCCGTGTTGCGATGGGTGTGCTTCTGGAAGGCCCATTTACGATAGTTACGGTGGAGGGAGGCCCTGTTACGTTAGCCACTGTGAGTATCTTAACGAAGAAAATGCAAGTGGGTGCGTTGTTATGTGA
Protein sequence
MADDGKKVVVMMLKVDLQCDRCYKKVKKVLCKFPQIRDQIYDEKQNLVIIKVVCCNPEKLRDKICCKGCGVIKSIEIKEPEPPKPPPPKHADPPPPPKKVDPPPPKKPDPPPPQKVDPPPPKKADPPPPKKADPPPPKKPDPPPPSKAADPPPPQKAADPPPPKKADPAPPKKVDPPPAKAEPPPPPPKKVDPPPVVVPQPNPVPIPVPVQPEPYPVNMCVPVPGYPPGYPVGIGVCCRQCYEGRGGGPCYSGFGGTGPCCDGCASGRPIYDSYGGGRPCYVSHCEYLNEENASGCVVM
Homology
BLAST of Cmc12g0324891 vs. NCBI nr
Match:
XP_008464282.1 (PREDICTED: leucine-rich repeat extensin-like protein 3 [Cucumis melo])
HSP 1 Score: 553.5 bits (1425), Expect = 1.1e-153
Identity = 299/299 (100.00%), Postives = 299/299 (100.00%), Query Frame = 0
Query: 1 MADDGKKVVVMMLKVDLQCDRCYKKVKKVLCKFPQIRDQIYDEKQNLVIIKVVCCNPEKL 60
MADDGKKVVVMMLKVDLQCDRCYKKVKKVLCKFPQIRDQIYDEKQNLVIIKVVCCNPEKL
Sbjct: 1 MADDGKKVVVMMLKVDLQCDRCYKKVKKVLCKFPQIRDQIYDEKQNLVIIKVVCCNPEKL 60
Query: 61 RDKICCKGCGVIKSIEIKEPEPPKPPPPKHADPPPPPKKVDPPPPKKPDPPPPQKVDPPP 120
RDKICCKGCGVIKSIEIKEPEPPKPPPPKHADPPPPPKKVDPPPPKKPDPPPPQKVDPPP
Sbjct: 61 RDKICCKGCGVIKSIEIKEPEPPKPPPPKHADPPPPPKKVDPPPPKKPDPPPPQKVDPPP 120
Query: 121 PKKADPPPPKKADPPPPKKPDPPPPSKAADPPPPQKAADPPPPKKADPAPPKKVDPPPAK 180
PKKADPPPPKKADPPPPKKPDPPPPSKAADPPPPQKAADPPPPKKADPAPPKKVDPPPAK
Sbjct: 121 PKKADPPPPKKADPPPPKKPDPPPPSKAADPPPPQKAADPPPPKKADPAPPKKVDPPPAK 180
Query: 181 AEPPPPPPKKVDPPPVVVPQPNPVPIPVPVQPEPYPVNMCVPVPGYPPGYPVGIGVCCRQ 240
AEPPPPPPKKVDPPPVVVPQPNPVPIPVPVQPEPYPVNMCVPVPGYPPGYPVGIGVCCRQ
Sbjct: 181 AEPPPPPPKKVDPPPVVVPQPNPVPIPVPVQPEPYPVNMCVPVPGYPPGYPVGIGVCCRQ 240
Query: 241 CYEGRGGGPCYSGFGGTGPCCDGCASGRPIYDSYGGGRPCYVSHCEYLNEENASGCVVM 300
CYEGRGGGPCYSGFGGTGPCCDGCASGRPIYDSYGGGRPCYVSHCEYLNEENASGCVVM
Sbjct: 241 CYEGRGGGPCYSGFGGTGPCCDGCASGRPIYDSYGGGRPCYVSHCEYLNEENASGCVVM 299
BLAST of Cmc12g0324891 vs. NCBI nr
Match:
XP_004139513.2 (circumsporozoite protein [Cucumis sativus] >KAE8652830.1 hypothetical protein Csa_022772 [Cucumis sativus])
HSP 1 Score: 490.3 bits (1261), Expect = 1.1e-134
Identity = 279/300 (93.00%), Postives = 282/300 (94.00%), Query Frame = 0
Query: 1 MADDGKKVVVMMLKVDLQCDRCYKKVKKVLCKFPQIRDQIYDEKQNLVIIKVVCCNPEKL 60
MADDGKKVVVMMLKVDLQCDRCYKKVKKVLCKFPQIRDQIYDEKQNLVIIKVVCCNPEKL
Sbjct: 1 MADDGKKVVVMMLKVDLQCDRCYKKVKKVLCKFPQIRDQIYDEKQNLVIIKVVCCNPEKL 60
Query: 61 RDKICCKGCGVIKSIEIKEPEPPKPPPPKHADPPPPPKKVDPPPPKKPDPPPPQKVDPPP 120
RDKICCKGCGVIKSIEIKEPEPPKPPPPK AD PPPPKKVDPPP KKPDPPPPQKVDPPP
Sbjct: 61 RDKICCKGCGVIKSIEIKEPEPPKPPPPKPAD-PPPPKKVDPPPSKKPDPPPPQKVDPPP 120
Query: 121 PKKADPPPPKKADPPPPKK-PDPPPPSKAADPPPPQKAADPPPPKKADPAPPKKVDPPPA 180
PKKADPPPPKKAD PPP K DPPPP KAADPPPP+K ADPPPPKKADP PPKKVDPPP
Sbjct: 121 PKKADPPPPKKADTPPPSKAADPPPPQKAADPPPPKK-ADPPPPKKADPPPPKKVDPPPP 180
Query: 181 KAEPPPPPPKKVDPPPVVVPQPNPVPIPVPVQPEPYPVNMCVPVPGYPPGYPVGIGVCCR 240
KA PPPPKKVDPPPVVVPQP PVPIPVPVQPEPYPVNMCVPVPGYPPGYP IGVCCR
Sbjct: 181 KAN--PPPPKKVDPPPVVVPQPTPVPIPVPVQPEPYPVNMCVPVPGYPPGYP--IGVCCR 240
Query: 241 QCYEGRGGGPCYSGFGGTGPCCDGCASGRPIYDSYGGGRPCYVSHCEYLNEENASGCVVM 300
QC+EGRGGGPCYSGFGG GPCCDGCASGRPIYDSYGGGRPCYVSHCEYLNEENASGC+VM
Sbjct: 241 QCHEGRGGGPCYSGFGGPGPCCDGCASGRPIYDSYGGGRPCYVSHCEYLNEENASGCIVM 294
BLAST of Cmc12g0324891 vs. NCBI nr
Match:
KAA0042354.1 (proline-rich protein 2 [Cucumis melo var. makuwa])
HSP 1 Score: 419.5 bits (1077), Expect = 2.5e-113
Identity = 233/233 (100.00%), Postives = 233/233 (100.00%), Query Frame = 0
Query: 67 KGCGVIKSIEIKEPEPPKPPPPKHADPPPPPKKVDPPPPKKPDPPPPQKVDPPPPKKADP 126
KGCGVIKSIEIKEPEPPKPPPPKHADPPPPPKKVDPPPPKKPDPPPPQKVDPPPPKKADP
Sbjct: 6 KGCGVIKSIEIKEPEPPKPPPPKHADPPPPPKKVDPPPPKKPDPPPPQKVDPPPPKKADP 65
Query: 127 PPPKKADPPPPKKPDPPPPSKAADPPPPQKAADPPPPKKADPAPPKKVDPPPAKAEPPPP 186
PPPKKADPPPPKKPDPPPPSKAADPPPPQKAADPPPPKKADPAPPKKVDPPPAKAEPPPP
Sbjct: 66 PPPKKADPPPPKKPDPPPPSKAADPPPPQKAADPPPPKKADPAPPKKVDPPPAKAEPPPP 125
Query: 187 PPKKVDPPPVVVPQPNPVPIPVPVQPEPYPVNMCVPVPGYPPGYPVGIGVCCRQCYEGRG 246
PPKKVDPPPVVVPQPNPVPIPVPVQPEPYPVNMCVPVPGYPPGYPVGIGVCCRQCYEGRG
Sbjct: 126 PPKKVDPPPVVVPQPNPVPIPVPVQPEPYPVNMCVPVPGYPPGYPVGIGVCCRQCYEGRG 185
Query: 247 GGPCYSGFGGTGPCCDGCASGRPIYDSYGGGRPCYVSHCEYLNEENASGCVVM 300
GGPCYSGFGGTGPCCDGCASGRPIYDSYGGGRPCYVSHCEYLNEENASGCVVM
Sbjct: 186 GGPCYSGFGGTGPCCDGCASGRPIYDSYGGGRPCYVSHCEYLNEENASGCVVM 238
BLAST of Cmc12g0324891 vs. NCBI nr
Match:
KAG6583723.1 (hypothetical protein SDJN03_19655, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 399.4 bits (1025), Expect = 2.7e-107
Identity = 242/296 (81.76%), Postives = 248/296 (83.78%), Query Frame = 0
Query: 4 DGKKVVVMMLKVDLQCDRCYKKVKKVLCKFPQIRDQIYDEKQNLVIIKVVCCNPEKLRDK 63
D KK VVMMLKVDLQCDRCYKKVKKVLCKFPQIRDQIYDEKQNLVIIKVVCCNPEKLRDK
Sbjct: 3 DAKKTVVMMLKVDLQCDRCYKKVKKVLCKFPQIRDQIYDEKQNLVIIKVVCCNPEKLRDK 62
Query: 64 ICCKGCGVIKSIEIKEPEPPKPPPPKHADPPPPPKKVDPPPPKKPDPPPPQKVDPPPPKK 123
ICCKGCGVIKSIEIK P PPPPK D PPPPKK DPPPPKK DPPPP K DPPPPKK
Sbjct: 63 ICCKGCGVIKSIEIK---PADPPPPKKPD-PPPPKKADPPPPKKADPPPPAKPDPPPPKK 122
Query: 124 ADPPPPKKADPPPPKKPDPPPPSKAADPPPPQKAADPPPPKKADPAPPKKVDPPPAKAEP 183
ADPPPP KADPPPPKK DPPPP K ADPPPP+ ADPPPPKKADP PPKK DPPPA +
Sbjct: 123 ADPPPP-KADPPPPKKADPPPPKK-ADPPPPK--ADPPPPKKADPPPPKKADPPPA-PKA 182
Query: 184 PPPPPKKVDPPPVVVPQPNPVPIPVPVQPEPYPVNMCVPVPGYPPGYPVGIGVCCRQCYE 243
PPPPKKVDP P P QPEP+PVN+CVPVPGYPP YP IG+CC QCYE
Sbjct: 183 DPPPPKKVDPVP-------------PAQPEPFPVNICVPVPGYPPAYP--IGMCCSQCYE 242
Query: 244 GRGGGPCYSGFGGTGPCCDGCASGRPIYDSYGGGRPCYVSHCEYLNEENASGCVVM 300
G+GGGPCYSGFG GPCCDGCASGRPIYDSYGGGRPCYVSHCEYLNEENASGC VM
Sbjct: 243 GQGGGPCYSGFGRPGPCCDGCASGRPIYDSYGGGRPCYVSHCEYLNEENASGCSVM 274
BLAST of Cmc12g0324891 vs. NCBI nr
Match:
XP_022927590.1 (leucine-rich repeat extensin-like protein 3 isoform X5 [Cucurbita moschata])
HSP 1 Score: 399.1 bits (1024), Expect = 3.5e-107
Identity = 250/311 (80.39%), Postives = 258/311 (82.96%), Query Frame = 0
Query: 4 DGKKVVVMMLKVDLQCDRCYKKVKKVLCKFPQIRDQIYDEKQNLVIIKVVCCNPEKLRDK 63
D KK VVMMLKVDLQCDRCYKKVKKVLCKFPQIRDQIYDEKQNLVIIKVVCCNPEKLRDK
Sbjct: 3 DAKKTVVMMLKVDLQCDRCYKKVKKVLCKFPQIRDQIYDEKQNLVIIKVVCCNPEKLRDK 62
Query: 64 ICCKGCGVIKSIEIK--EPEPPK---PPPPKHADPPPPPKKVDPPPPKKPDPPPPQKVDP 123
ICCKGCGVIKSIEIK +P PPK PPPPK ADPPPP K DPPPPKK DPPPP K DP
Sbjct: 63 ICCKGCGVIKSIEIKPADPPPPKKPDPPPPKKADPPPPAKP-DPPPPKKADPPPP-KADP 122
Query: 124 PPPKKADPPPPKKADPPPPKKPDPPPPSKAADPPPPQKAADPPPPKKADPAPPKKVDPPP 183
PPPKKADPPPP KADPPPPKK DPPPP K ADPPPP+ ADPPPPKKADP PPK PPP
Sbjct: 123 PPPKKADPPPP-KADPPPPKKADPPPPKK-ADPPPPK--ADPPPPKKADPPPPKADPPPP 182
Query: 184 AKAEPP------PPPPKKVDPPPVVVPQPNPV----PIPVPVQPEPYPVNMCVPVPGYPP 243
KA+PP PPPPKK DPPP P P P+P P QPEP+PVN+CVPVPGYPP
Sbjct: 183 KKADPPPPKKADPPPPKKADPPPAPKADPPPPKKVDPVP-PAQPEPFPVNICVPVPGYPP 242
Query: 244 GYPVGIGVCCRQCYEGRGGGPCYSGFGGTGPCCDGCASGRPIYDSYGGGRPCYVSHCEYL 300
YP IG+CC QCYEG+GGGPCYSGFG GPCCDGCASGRPIYDSYGGGRPCYVSHCEYL
Sbjct: 243 AYP--IGMCCSQCYEGQGGGPCYSGFGRPGPCCDGCASGRPIYDSYGGGRPCYVSHCEYL 302
BLAST of Cmc12g0324891 vs. ExPASy Swiss-Prot
Match:
P23093 (Circumsporozoite protein OS=Plasmodium berghei (strain Anka) OX=5823 PE=3 SV=1)
HSP 1 Score: 46.2 bits (108), Expect = 7.5e-04
Identity = 79/161 (49.07%), Postives = 87/161 (54.04%), Query Frame = 0
Query: 76 EIKEPEP---PKPPPPKHADPPPPPKKVDPPPPKKPDPPPPQKVDPPPPKKADPPPPKKA 135
++K+P P P PPP + + PPPP DPPPP DPPPP DPPPP DPPPP
Sbjct: 88 KLKQPPPPPNPNDPPPPNPNDPPPPNPNDPPPPNPNDPPPPNPNDPPPPNANDPPPPNAN 147
Query: 136 DPPPPKKPDPPPPS-------KAADPPPPQKAADPPPPKKADPAPPKKVDPPPAKAEPPP 195
DP PP DP PP+ A DPPPP A DPPPP DPAPP DPPP P
Sbjct: 148 DPAPPNANDPAPPNANDPAPPNANDPPPP-NANDPPPPNPNDPAPPNANDPPPPNPNDPA 207
Query: 196 PPPKKVDPPPVVVPQPNPVPIPVP-VQPEPYPVNMCVPVPG 226
PP +P P PQP P P P P QP+P P P PG
Sbjct: 208 PPQGNNNPQPQPRPQPQPQPQPQPQPQPQPQPRPQPQPQPG 247
BLAST of Cmc12g0324891 vs. ExPASy TrEMBL
Match:
A0A1S3CL41 (leucine-rich repeat extensin-like protein 3 OS=Cucumis melo OX=3656 GN=LOC103502210 PE=4 SV=1)
HSP 1 Score: 553.5 bits (1425), Expect = 5.3e-154
Identity = 299/299 (100.00%), Postives = 299/299 (100.00%), Query Frame = 0
Query: 1 MADDGKKVVVMMLKVDLQCDRCYKKVKKVLCKFPQIRDQIYDEKQNLVIIKVVCCNPEKL 60
MADDGKKVVVMMLKVDLQCDRCYKKVKKVLCKFPQIRDQIYDEKQNLVIIKVVCCNPEKL
Sbjct: 1 MADDGKKVVVMMLKVDLQCDRCYKKVKKVLCKFPQIRDQIYDEKQNLVIIKVVCCNPEKL 60
Query: 61 RDKICCKGCGVIKSIEIKEPEPPKPPPPKHADPPPPPKKVDPPPPKKPDPPPPQKVDPPP 120
RDKICCKGCGVIKSIEIKEPEPPKPPPPKHADPPPPPKKVDPPPPKKPDPPPPQKVDPPP
Sbjct: 61 RDKICCKGCGVIKSIEIKEPEPPKPPPPKHADPPPPPKKVDPPPPKKPDPPPPQKVDPPP 120
Query: 121 PKKADPPPPKKADPPPPKKPDPPPPSKAADPPPPQKAADPPPPKKADPAPPKKVDPPPAK 180
PKKADPPPPKKADPPPPKKPDPPPPSKAADPPPPQKAADPPPPKKADPAPPKKVDPPPAK
Sbjct: 121 PKKADPPPPKKADPPPPKKPDPPPPSKAADPPPPQKAADPPPPKKADPAPPKKVDPPPAK 180
Query: 181 AEPPPPPPKKVDPPPVVVPQPNPVPIPVPVQPEPYPVNMCVPVPGYPPGYPVGIGVCCRQ 240
AEPPPPPPKKVDPPPVVVPQPNPVPIPVPVQPEPYPVNMCVPVPGYPPGYPVGIGVCCRQ
Sbjct: 181 AEPPPPPPKKVDPPPVVVPQPNPVPIPVPVQPEPYPVNMCVPVPGYPPGYPVGIGVCCRQ 240
Query: 241 CYEGRGGGPCYSGFGGTGPCCDGCASGRPIYDSYGGGRPCYVSHCEYLNEENASGCVVM 300
CYEGRGGGPCYSGFGGTGPCCDGCASGRPIYDSYGGGRPCYVSHCEYLNEENASGCVVM
Sbjct: 241 CYEGRGGGPCYSGFGGTGPCCDGCASGRPIYDSYGGGRPCYVSHCEYLNEENASGCVVM 299
BLAST of Cmc12g0324891 vs. ExPASy TrEMBL
Match:
A0A0A0LTA1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G172590 PE=4 SV=1)
HSP 1 Score: 491.5 bits (1264), Expect = 2.5e-135
Identity = 276/299 (92.31%), Postives = 278/299 (92.98%), Query Frame = 0
Query: 1 MADDGKKVVVMMLKVDLQCDRCYKKVKKVLCKFPQIRDQIYDEKQNLVIIKVVCCNPEKL 60
MADDGKKVVVMMLKVDLQCDRCYKKVKKVLCKFPQIRDQIYDEKQNLVIIKVVCCNPEKL
Sbjct: 1 MADDGKKVVVMMLKVDLQCDRCYKKVKKVLCKFPQIRDQIYDEKQNLVIIKVVCCNPEKL 60
Query: 61 RDKICCKGCGVIKSIEIKEPEPPKPPPPKHADPPPPPKKVDPPPPKKPDPPPPQKVDPPP 120
RDKICCKGCGVIKSIEIKEPEPPKPPPPK AD PPPPKKVDPPP KKPDPPPPQKVDPPP
Sbjct: 61 RDKICCKGCGVIKSIEIKEPEPPKPPPPKPAD-PPPPKKVDPPPSKKPDPPPPQKVDPPP 120
Query: 121 PKKADPPPPKKADPPPPKKPDPPPPSKAADPPPPQKAADPPPPKKADPAPPKKVDPPPAK 180
PKKADPPPPKKA D PPPSKAADPPPPQKAADPPPPKKADP PPKKVDPPP K
Sbjct: 121 PKKADPPPPKKA--------DTPPPSKAADPPPPQKAADPPPPKKADPPPPKKVDPPPPK 180
Query: 181 AEPPPPPPKKVDPPPVVVPQPNPVPIPVPVQPEPYPVNMCVPVPGYPPGYPVGIGVCCRQ 240
A PPPPKKVDPPPVVVPQP PVPIPVPVQPEPYPVNMCVPVPGYPPGYP IGVCCRQ
Sbjct: 181 AN--PPPPKKVDPPPVVVPQPTPVPIPVPVQPEPYPVNMCVPVPGYPPGYP--IGVCCRQ 240
Query: 241 CYEGRGGGPCYSGFGGTGPCCDGCASGRPIYDSYGGGRPCYVSHCEYLNEENASGCVVM 300
C+EGRGGGPCYSGFGG GPCCDGCASGRPIYDSYGGGRPCYVSHCEYLNEENASGC+VM
Sbjct: 241 CHEGRGGGPCYSGFGGPGPCCDGCASGRPIYDSYGGGRPCYVSHCEYLNEENASGCIVM 286
BLAST of Cmc12g0324891 vs. ExPASy TrEMBL
Match:
A0A5A7TG02 (Proline-rich protein 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold795G00560 PE=4 SV=1)
HSP 1 Score: 419.5 bits (1077), Expect = 1.2e-113
Identity = 233/233 (100.00%), Postives = 233/233 (100.00%), Query Frame = 0
Query: 67 KGCGVIKSIEIKEPEPPKPPPPKHADPPPPPKKVDPPPPKKPDPPPPQKVDPPPPKKADP 126
KGCGVIKSIEIKEPEPPKPPPPKHADPPPPPKKVDPPPPKKPDPPPPQKVDPPPPKKADP
Sbjct: 6 KGCGVIKSIEIKEPEPPKPPPPKHADPPPPPKKVDPPPPKKPDPPPPQKVDPPPPKKADP 65
Query: 127 PPPKKADPPPPKKPDPPPPSKAADPPPPQKAADPPPPKKADPAPPKKVDPPPAKAEPPPP 186
PPPKKADPPPPKKPDPPPPSKAADPPPPQKAADPPPPKKADPAPPKKVDPPPAKAEPPPP
Sbjct: 66 PPPKKADPPPPKKPDPPPPSKAADPPPPQKAADPPPPKKADPAPPKKVDPPPAKAEPPPP 125
Query: 187 PPKKVDPPPVVVPQPNPVPIPVPVQPEPYPVNMCVPVPGYPPGYPVGIGVCCRQCYEGRG 246
PPKKVDPPPVVVPQPNPVPIPVPVQPEPYPVNMCVPVPGYPPGYPVGIGVCCRQCYEGRG
Sbjct: 126 PPKKVDPPPVVVPQPNPVPIPVPVQPEPYPVNMCVPVPGYPPGYPVGIGVCCRQCYEGRG 185
Query: 247 GGPCYSGFGGTGPCCDGCASGRPIYDSYGGGRPCYVSHCEYLNEENASGCVVM 300
GGPCYSGFGGTGPCCDGCASGRPIYDSYGGGRPCYVSHCEYLNEENASGCVVM
Sbjct: 186 GGPCYSGFGGTGPCCDGCASGRPIYDSYGGGRPCYVSHCEYLNEENASGCVVM 238
BLAST of Cmc12g0324891 vs. ExPASy TrEMBL
Match:
A0A6J1EPE4 (leucine-rich repeat extensin-like protein 3 isoform X5 OS=Cucurbita moschata OX=3662 GN=LOC111434373 PE=4 SV=1)
HSP 1 Score: 399.1 bits (1024), Expect = 1.7e-107
Identity = 250/311 (80.39%), Postives = 258/311 (82.96%), Query Frame = 0
Query: 4 DGKKVVVMMLKVDLQCDRCYKKVKKVLCKFPQIRDQIYDEKQNLVIIKVVCCNPEKLRDK 63
D KK VVMMLKVDLQCDRCYKKVKKVLCKFPQIRDQIYDEKQNLVIIKVVCCNPEKLRDK
Sbjct: 3 DAKKTVVMMLKVDLQCDRCYKKVKKVLCKFPQIRDQIYDEKQNLVIIKVVCCNPEKLRDK 62
Query: 64 ICCKGCGVIKSIEIK--EPEPPK---PPPPKHADPPPPPKKVDPPPPKKPDPPPPQKVDP 123
ICCKGCGVIKSIEIK +P PPK PPPPK ADPPPP K DPPPPKK DPPPP K DP
Sbjct: 63 ICCKGCGVIKSIEIKPADPPPPKKPDPPPPKKADPPPPAKP-DPPPPKKADPPPP-KADP 122
Query: 124 PPPKKADPPPPKKADPPPPKKPDPPPPSKAADPPPPQKAADPPPPKKADPAPPKKVDPPP 183
PPPKKADPPPP KADPPPPKK DPPPP K ADPPPP+ ADPPPPKKADP PPK PPP
Sbjct: 123 PPPKKADPPPP-KADPPPPKKADPPPPKK-ADPPPPK--ADPPPPKKADPPPPKADPPPP 182
Query: 184 AKAEPP------PPPPKKVDPPPVVVPQPNPV----PIPVPVQPEPYPVNMCVPVPGYPP 243
KA+PP PPPPKK DPPP P P P+P P QPEP+PVN+CVPVPGYPP
Sbjct: 183 KKADPPPPKKADPPPPKKADPPPAPKADPPPPKKVDPVP-PAQPEPFPVNICVPVPGYPP 242
Query: 244 GYPVGIGVCCRQCYEGRGGGPCYSGFGGTGPCCDGCASGRPIYDSYGGGRPCYVSHCEYL 300
YP IG+CC QCYEG+GGGPCYSGFG GPCCDGCASGRPIYDSYGGGRPCYVSHCEYL
Sbjct: 243 AYP--IGMCCSQCYEGQGGGPCYSGFGRPGPCCDGCASGRPIYDSYGGGRPCYVSHCEYL 302
BLAST of Cmc12g0324891 vs. ExPASy TrEMBL
Match:
A0A6J1ELF4 (leucine-rich repeat extensin-like protein 3 isoform X8 OS=Cucurbita moschata OX=3662 GN=LOC111434373 PE=4 SV=1)
HSP 1 Score: 399.1 bits (1024), Expect = 1.7e-107
Identity = 246/305 (80.66%), Postives = 254/305 (83.28%), Query Frame = 0
Query: 4 DGKKVVVMMLKVDLQCDRCYKKVKKVLCKFPQIRDQIYDEKQNLVIIKVVCCNPEKLRDK 63
D KK VVMMLKVDLQCDRCYKKVKKVLCKFPQIRDQIYDEKQNLVIIKVVCCNPEKLRDK
Sbjct: 3 DAKKTVVMMLKVDLQCDRCYKKVKKVLCKFPQIRDQIYDEKQNLVIIKVVCCNPEKLRDK 62
Query: 64 ICCKGCGVIKSIEIKEPEPPKPPPPKHADPPPPPKKVDPPPPKKPDPPPPQKVDPPPPKK 123
ICCKGCGVIKSIEIK P PPPPK D PPPPKK DPPPP KPDPPPP+K DPPPP K
Sbjct: 63 ICCKGCGVIKSIEIK---PADPPPPKKPD-PPPPKKADPPPPAKPDPPPPKKADPPPP-K 122
Query: 124 ADPPPPKKADPPPPKKPDPPPPSKAADPPPPQKAADPPPPKKADPAPPKKVDPPPAKAEP 183
ADPPPPKKADPPPP K DPPPP K ADPPPP+ ADPPPPKKADP PPK PPP KA+P
Sbjct: 123 ADPPPPKKADPPPP-KADPPPPKK-ADPPPPK--ADPPPPKKADPPPPKADPPPPKKADP 182
Query: 184 P-----PPPPKKVDPPPVVVPQPNPV----PIPVPVQPEPYPVNMCVPVPGYPPGYPVGI 243
P PPPPKK DPPP P P P+P P QPEP+PVN+CVPVPGYPP YP I
Sbjct: 183 PPPKADPPPPKKADPPPAPKADPPPPKKVDPVP-PAQPEPFPVNICVPVPGYPPAYP--I 242
Query: 244 GVCCRQCYEGRGGGPCYSGFGGTGPCCDGCASGRPIYDSYGGGRPCYVSHCEYLNEENAS 300
G+CC QCYEG+GGGPCYSGFG GPCCDGCASGRPIYDSYGGGRPCYVSHCEYLNEENAS
Sbjct: 243 GMCCSQCYEGQGGGPCYSGFGRPGPCCDGCASGRPIYDSYGGGRPCYVSHCEYLNEENAS 295
BLAST of Cmc12g0324891 vs. TAIR 10
Match:
AT4G16380.1 (Heavy metal transport/detoxification superfamily protein )
HSP 1 Score: 133.3 bits (334), Expect = 3.3e-31
Identity = 133/308 (43.18%), Postives = 167/308 (54.22%), Query Frame = 0
Query: 1 MADDGK-KVVVMMLKVDLQCDRCYKKVKKVLCKFPQIRDQIYDEKQNLVIIKVVCCNPEK 60
MA+ GK KV +M LKVDL C +CYKKVKKVLCKFPQIRDQ++DEK N+VIIKVVCC+PE+
Sbjct: 1 MAEKGKEKVTMMKLKVDLDCAKCYKKVKKVLCKFPQIRDQLFDEKSNIVIIKVVCCSPER 60
Query: 61 LRDKICCKGCGVIKSIEIKEPEPPKPPPPKHADPPPPPKKVDPPPPKKPDPPPPQKVDPP 120
+ DK+C KG G IK+IEI EPPKPP P+ PP PK P P+KP +P
Sbjct: 61 IMDKLCSKGGGSIKTIEI--VEPPKPPQPQPQQPPQKPKDAQPKAPEKPK-------EPE 120
Query: 121 PPKKADPPPPKKADPPPPKKPDPPPPSKAADPPPPQKAADPPPPKKADPAPPKKVDPPPA 180
PK+ P K +P PK+P+ P P+K P P PAP K P PA
Sbjct: 121 KPKQ----PEKLKEPEKPKQPE--------KPKEPEKTKQPAPAPAPAPAPAAK--PAPA 180
Query: 181 KAEPPPPPPKKVDPPPVVVPQPNPVPIPVPVQPEPYPVNMCVPVPGYPPGYPVGIGVCCR 240
A P P PK+ PPP +P P G P +CC
Sbjct: 181 PAPAPAPAPKQPGPPPQAIPM-------------------------MPQGQP---AMCCG 240
Query: 241 QCYEGRGGGPCYSGFGGTGPCCDGCASGRPIYDSYGGG--------RPCYVSHCEYLNEE 300
Y+G GGP ++G+ G P C GRP+Y+S+GGG R C+V+ C+Y +EE
Sbjct: 241 PYYDGY-GGPAFNGY-GMPPQPYEC-YGRPVYESWGGGCPPPPPAYRQCHVTRCDYFSEE 254
BLAST of Cmc12g0324891 vs. TAIR 10
Match:
AT4G16380.2 (Heavy metal transport/detoxification superfamily protein )
HSP 1 Score: 84.3 bits (207), Expect = 1.8e-16
Identity = 106/273 (38.83%), Postives = 138/273 (50.55%), Query Frame = 0
Query: 35 QIRDQIYDEKQNLVIIKVVCCNPEKLRDKICCKGCGVIKSIEIKEPEPPKPPPPKHADPP 94
+IRDQ++DEK N+VIIKVVCC+PE++ DK+C KG G IK+IEI EPPKPP P+ PP
Sbjct: 15 EIRDQLFDEKSNIVIIKVVCCSPERIMDKLCSKGGGSIKTIEI--VEPPKPPQPQPQQPP 74
Query: 95 PPPKKVDPPPPKKPDPPPPQKVDPPPPKKADPPPPKKADPPPPKKPDPPPPSKAADPPPP 154
PK P P+KP +P PK+ P K +P PK+P+ P P
Sbjct: 75 QKPKDAQPKAPEKPK-------EPEKPKQ----PEKLKEPEKPKQPE--------KPKEP 134
Query: 155 QKAADPPPPKKADPAPPKKVDPPPAKAEPPPPPPKKVDPPPVVVPQPNPVPIPVPVQPEP 214
+K P P PAP K P PA A P P PK+ PPP +P
Sbjct: 135 EKTKQPAPAPAPAPAPAAK--PAPAPAPAPAPAPKQPGPPPQAIPM-------------- 194
Query: 215 YPVNMCVPVPGYPPGYPVGIGVCCRQCYEGRGGGPCYSGFGGTGPCCDGCASGRPIYDSY 274
P G P +CC Y+G GGP ++G+ G P C GRP+Y+S+
Sbjct: 195 -----------MPQGQP---AMCCGPYYDGY-GGPAFNGY-GMPPQPYEC-YGRPVYESW 233
Query: 275 GGG--------RPCYVSHCEYLNEENASGCVVM 300
GGG R C+V+ C+Y +EEN C +M
Sbjct: 255 GGGCPPPPPAYRQCHVTRCDYFSEENPQSCSIM 233
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_008464282.1 | 1.1e-153 | 100.00 | PREDICTED: leucine-rich repeat extensin-like protein 3 [Cucumis melo] | [more] |
XP_004139513.2 | 1.1e-134 | 93.00 | circumsporozoite protein [Cucumis sativus] >KAE8652830.1 hypothetical protein Cs... | [more] |
KAA0042354.1 | 2.5e-113 | 100.00 | proline-rich protein 2 [Cucumis melo var. makuwa] | [more] |
KAG6583723.1 | 2.7e-107 | 81.76 | hypothetical protein SDJN03_19655, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_022927590.1 | 3.5e-107 | 80.39 | leucine-rich repeat extensin-like protein 3 isoform X5 [Cucurbita moschata] | [more] |
Match Name | E-value | Identity | Description | |
P23093 | 7.5e-04 | 49.07 | Circumsporozoite protein OS=Plasmodium berghei (strain Anka) OX=5823 PE=3 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
A0A1S3CL41 | 5.3e-154 | 100.00 | leucine-rich repeat extensin-like protein 3 OS=Cucumis melo OX=3656 GN=LOC103502... | [more] |
A0A0A0LTA1 | 2.5e-135 | 92.31 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G172590 PE=4 SV=1 | [more] |
A0A5A7TG02 | 1.2e-113 | 100.00 | Proline-rich protein 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold7... | [more] |
A0A6J1EPE4 | 1.7e-107 | 80.39 | leucine-rich repeat extensin-like protein 3 isoform X5 OS=Cucurbita moschata OX=... | [more] |
A0A6J1ELF4 | 1.7e-107 | 80.66 | leucine-rich repeat extensin-like protein 3 isoform X8 OS=Cucurbita moschata OX=... | [more] |
Match Name | E-value | Identity | Description | |
AT4G16380.1 | 3.3e-31 | 43.18 | Heavy metal transport/detoxification superfamily protein | [more] |
AT4G16380.2 | 1.8e-16 | 38.83 | Heavy metal transport/detoxification superfamily protein | [more] |