Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, polypeptide, CDS
CCAAAGAGCAAACCGACCCGACAACGAAAAACAGAACCGTCATTTTTTCCGGCAACCGGAATGGCAACGGCAAGGCCAAGAACCGCCGGACTCGGCGCCGTCCGCCGACAATCCATGCGGGTTTACAACGAGCCCTTCAATCCCAATGTTGAAGAAAGAGAAGAAAATGAAGCCCACATTCTACTACTTGAACTTCCTGGTCCCAATTTTTTCCCTCTTTCTTCATTGTTTTGAATTTTGATTAGTTTAGTACGTAAAAGAAATATAATAGTCTATGAACTTTTATATTTACTAGAAAACATTAACGTGTGTGGTACGTGCAAAGATTATATGTAGTTTTATTTTTCAAAACTACTCTTAGAAATATATTTTTTAATAGAAAAACTTAAATATAGCTCAACTGGTTAGAGTATTATAATTTTAATTGAAAGGATTTATTTTATTAGTCTTTAAACTTTTAATTTTGTGTTCAATAAATTTTAAAATTTTCAATTTTGTGTTTAAGAGATCTTAGACCTATTTGACTTTAAAAAAAATCATATACGTAGTAAACACAAAATTAAAAATTTAAGGTCTAAAATTACACCTATATCATGATTTATTAGAAAAAAAAAATTTGAAAGTTCAGAATTTAAATTAGCTATTTTTTTTCAATACATGTCAATTACCGTTGAGTTATGCTCATTTTGACTTAAATTAGCTATTTTTAAGCCTCATTTGATTACTATTTAATTTTTATTATTTATTTTCTAAATTAAACTTGTTTTTTTTTTCTCAATTTTCTTACGTTTTCGTCATTATTAATTAATCACACGTTTAAATTCGTAATAAAATTCTAAAAATAAAAACAACTTTTTGAAGTGTTTTTAGTTTTCAAAATGGGACTTGATTTATTAAAACATGCAAAAACGTAGAAAACAAAACAAAAATTTCAGAGGTAAAAAAAAGTAAAAAACAAATGATCATCCAATGTGAGCTTTAAGATTCTCGACACAAATTAAAAGTTCATAAATCCGATAGACATAACTTAAAAGTTCGGACATTACAAAATTAAGGGTTAAACAAACACAAAAGTGAAGTTTATGAACAAACTTGTAATTTAACATTTATTATTATTGTTATTAAGAGGAACTTATTTTTTTTATTTTCGAGTCTGATTTTTGTTTTGTGATTGAATTGTTCTTCAGATTTCACCAAGCAGCACGTTAAGGTTAAGACTGAAAAAGAGGAACGGACAGTGGTGGTCACCGGAGATCGGAATGTCGGAAATAATAGATTGCTCATATTGGATAAAACTTTCCCAATTCCTCAAAATTGCGTAATCGAAAACATTGATCATAAATTACAAGATGGACTTCTCACCATTACAATGCCCAAACAAACTACGGCGGCGGCGGCGGCGACGGTGGCTCCGCCGTTCAAAGAACCAGAACAAACAGCCCCAGAAAAAGGGAGGGAAGAAATCTCGCCGGAAAACACAACTCCGCCGCCGAAAGAACCAGAACAAACAACCCCACAAAAGGGGGGCGAGGAAATCTCGCCGGAAAACGCAGCTCCGCCGCCGAAAGAACCAGAACAAACGACCCCACAAAAGGGGGGCCAGGAAACCTCGTCGGAAAACGCAGCTCCGCCGGAGACGAAGGAGGAAATTATGCAAACGGTGGAGGAAAACAAGGGAAAATCGGCAGAACTCCAAAAGCAGGCCTCGGCGAAGGCTGAGGAAGAAGCTCCGACGCCGGCGCCGGCGGTGGTGCCACCGCCTGTGAAGAGTCCGGCTGAAGGAGGTTCCGGCGAGGCTAAAACGACATCGGATGAGAAAATCAGCAGCCCAGATCAGAAACTAACGGAGAAGAAAGAAATTGAAAATCAAAACGGAGAAAAGGGGAAGGAATCTAAAACAGAGGAGGTGGGTAAGAATCGAGAAGAGCCGAAGATCGGCACCGGAAGTCGATCTCCGGGAGCGACCGGAGTTGGAAAACTTGCCGGCGGCTACACGGTCA
GAAGGATGCCGTTACTGGTGACGGCGAGTCTCGCGGCGGCGGTTGTGACATCGGTGGCGGCGTATTTCGCATATGCTTATTACGGACTGTCGTTCGCGATGGAATGA
mRNA sequence
ATGGCAACGGCAAGGCCAAGAACCGCCGGACTCGGCGCCGTCCGCCGACAATCCATGCGGGTTTACAACGAGCCCTTCAATCCCAATGTTGAAGAAAGAGAAGAAAATGAAGCCCACATTCTACTACTTGAACTTCCTGATTTCACCAAGCAGCACGTTAAGGTTAAGACTGAAAAAGAGGAACGGACAGTGGTGGTCACCGGAGATCGGAATGTCGGAAATAATAGATTGCTCATATTGGATAAAACTTTCCCAATTCCTCAAAATTGCGTAATCGAAAACATTGATCATAAATTACAAGATGGACTTCTCACCATTACAATGCCCAAACAAACTACGGCGGCGGCGGCGGCGACGGTGGCTCCGCCGTTCAAAGAACCAGAACAAACAGCCCCAGAAAAAGGGAGGGAAGAAATCTCGCCGGAAAACACAACTCCGCCGCCGAAAGAACCAGAACAAACAACCCCACAAAAGGGGGGCGAGGAAATCTCGCCGGAAAACGCAGCTCCGCCGCCGAAAGAACCAGAACAAACGACCCCACAAAAGGGGGGCCAGGAAACCTCGTCGGAAAACGCAGCTCCGCCGGAGACGAAGGAGGAAATTATGCAAACGGTGGAGGAAAACAAGGGAAAATCGGCAGAACTCCAAAAGCAGGCCTCGGCGAAGGCTGAGGAAGAAGCTCCGACGCCGGCGCCGGCGGTGGTGCCACCGCCTGTGAAGAGTCCGGCTGAAGGAGGTTCCGGCGAGGCTAAAACGACATCGGATGAGAAAATCAGCAGCCCAGATCAGAAACTAACGGAGAAGAAAGAAATTGAAAATCAAAACGGAGAAAAGGGGAAGGAATCTAAAACAGAGGAGGTGGGTAAGAATCGAGAAGAGCCGAAGATCGGCACCGGAAGTCGATCTCCGGGAGCGACCGGAGTTGGAAAACTTGCCGGCGGCTACACGGTCAGAAGGATGCCGTTACTGGTGACGGCGAGTCTCGCGGCGGCGGTTGTGACATCGGTGGCGGCGTATTTCGCATATGCTTATTACGGACTGTCGTTCGCGATGGAATGA
Coding sequence (CDS)
ATGGCAACGGCAAGGCCAAGAACCGCCGGACTCGGCGCCGTCCGCCGACAATCCATGCGGGTTTACAACGAGCCCTTCAATCCCAATGTTGAAGAAAGAGAAGAAAATGAAGCCCACATTCTACTACTTGAACTTCCTGATTTCACCAAGCAGCACGTTAAGGTTAAGACTGAAAAAGAGGAACGGACAGTGGTGGTCACCGGAGATCGGAATGTCGGAAATAATAGATTGCTCATATTGGATAAAACTTTCCCAATTCCTCAAAATTGCGTAATCGAAAACATTGATCATAAATTACAAGATGGACTTCTCACCATTACAATGCCCAAACAAACTACGGCGGCGGCGGCGGCGACGGTGGCTCCGCCGTTCAAAGAACCAGAACAAACAGCCCCAGAAAAAGGGAGGGAAGAAATCTCGCCGGAAAACACAACTCCGCCGCCGAAAGAACCAGAACAAACAACCCCACAAAAGGGGGGCGAGGAAATCTCGCCGGAAAACGCAGCTCCGCCGCCGAAAGAACCAGAACAAACGACCCCACAAAAGGGGGGCCAGGAAACCTCGTCGGAAAACGCAGCTCCGCCGGAGACGAAGGAGGAAATTATGCAAACGGTGGAGGAAAACAAGGGAAAATCGGCAGAACTCCAAAAGCAGGCCTCGGCGAAGGCTGAGGAAGAAGCTCCGACGCCGGCGCCGGCGGTGGTGCCACCGCCTGTGAAGAGTCCGGCTGAAGGAGGTTCCGGCGAGGCTAAAACGACATCGGATGAGAAAATCAGCAGCCCAGATCAGAAACTAACGGAGAAGAAAGAAATTGAAAATCAAAACGGAGAAAAGGGGAAGGAATCTAAAACAGAGGAGGTGGGTAAGAATCGAGAAGAGCCGAAGATCGGCACCGGAAGTCGATCTCCGGGAGCGACCGGAGTTGGAAAACTTGCCGGCGGCTACACGGTCAGAAGGATGCCGTTACTGGTGACGGCGAGTCTCGCGGCGGCGGTTGTGACATCGGTGGCGGCGTATTTCGCATATGCTTATTACGGACTGTCGTTCGCGATGGAATGA
Protein sequence
MATARPRTAGLGAVRRQSMRVYNEPFNPNVEEREENEAHILLLELPDFTKQHVKVKTEKEERTVVVTGDRNVGNNRLLILDKTFPIPQNCVIENIDHKLQDGLLTITMPKQTTAAAAATVAPPFKEPEQTAPEKGREEISPENTTPPPKEPEQTTPQKGGEEISPENAAPPPKEPEQTTPQKGGQETSSENAAPPETKEEIMQTVEENKGKSAELQKQASAKAEEEAPTPAPAVVPPPVKSPAEGGSGEAKTTSDEKISSPDQKLTEKKEIENQNGEKGKESKTEEVGKNREEPKIGTGSRSPGATGVGKLAGGYTVRRMPLLVTASLAAAVVTSVAAYFAYAYYGLSFAME
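The protein sequence above can be checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code (the record does not state which translation table was used); `translate` is a hypothetical helper name, and only the first and last few codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code laid out in TCAG order (assumption: standard nuclear code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First five codons and last four codons of the CDS listed above.
print(translate("ATGGCAACGGCAAGG"))  # start of the protein
print(translate("GCGATGGAATGA"))     # end of the protein, before the stop codon
```

Passing the full CDS string to `translate` should reproduce the protein sequence if the standard-code assumption holds.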
Homology
BLAST of Spg009952 vs. NCBI nr
Match:
XP_022968176.1 (proteoglycan 4 [Cucurbita maxima])
HSP 1 Score: 376.3 bits (965), Expect = 2.8e-100
Identity = 235/388 (60.57%), Positives = 269/388 (69.33%), Query Frame = 0
Query: 1 MATARPRTAGLGAVRRQSMRVYNEPFNPNVEEREENEAHILLLELPDFTKQHVKVKTEKE 60
MAT RPR+AGLGA+RRQS+R YNEPF PNV ER+ENEAHIL L+LPDF +QHVKVK E+
Sbjct: 1 MATGRPRSAGLGALRRQSLRAYNEPFTPNVVERDENEAHILQLQLPDFNEQHVKVKVEEG 60
Query: 61 ERTVVVTGDRNVGNNRLLILDKTFPIPQNCVIENIDHKLQDGLLTITMPKQTTAAAAATV 120
RTVVVTGDR + NNRLLILDKT+PIPQ+C I+ + HKL+ G LTITMPKQT A
Sbjct: 61 ARTVVVTGDRLLANNRLLILDKTYPIPQDCPIDKVHHKLEAGYLTITMPKQTAPPEAVAT 120
Query: 121 APPFKEPEQTAPEKGREEISPENTTPPPKEPEQTTPQKGGEEISPENAAPPPKEPEQTTP 180
A P K+PEQT PEKG EE +PEN TPP KEPEQ TP+KG EE SPENA+P KEPEQT+
Sbjct: 121 AAP-KDPEQTTPEKGSEETTPENATPPQKEPEQNTPEKGSEETSPENASPSQKEPEQTSL 180
Query: 181 QKGGQETSSENAAP---------------------------------PETKEEIMQTVEE 240
+KG +ETS NA P P+ + E + EE
Sbjct: 181 EKGSEETSPGNATPPPKGPKQTSVEKGSEETSPGNATPLPKEPEQTTPKKESEEISPEEE 240
Query: 241 NKGKSAELQKQASAKAEEEAPTPAPAVVPPPV---KSPAEGGSGEAKTTSDEKISSPDQK 300
+KGKSAELQK+ S KAEEEAPT AP VPPP P +G SG+ KTT DEKI +P+QK
Sbjct: 241 DKGKSAELQKKGSVKAEEEAPTQAPTEVPPPAAAKNGPVQGESGKEKTTPDEKIKNPNQK 300
Query: 301 LTEKKEIENQNGEKGKESKTEEVGKNREEPKIGTGSRSPGATGVGKLAGGYTVRRMPLLV 353
TEK ENQN EKGKESKTE+VGKN + KIGTG+ S AT K A G+T R L V
Sbjct: 301 PTEK---ENQNPEKGKESKTEKVGKNEKTGKIGTGTPSRKATTSKKHAAGFTNTRRVLSV 360
BLAST of Spg009952 vs. NCBI nr
Match:
XP_022945514.1 (proteoglycan 4 isoform X2 [Cucurbita moschata])
HSP 1 Score: 374.8 bits (961), Expect = 8.3e-100
Identity = 230/363 (63.36%), Positives = 263/363 (72.45%), Query Frame = 0
Query: 1 MATARPRTAGLGAVRRQSMRVYNEPFNPNVEEREENEAHILLLELPDFTKQHVKVKTEKE 60
MAT RPRTAGLGA+RRQS+R YNEPF PNV E++ENEAHIL LELPDF +QHVKVK E+
Sbjct: 1 MATGRPRTAGLGALRRQSLRAYNEPFTPNVVEKDENEAHILRLELPDFNEQHVKVKVEEG 60
Query: 61 ERTVVVTGDRNVGNNRLLILDKTFPIPQNCVIENIDHKLQDGLLTITMPKQTTAAAAATV 120
RTVVVTGDR + NRLLIL+KT+PIPQ+C I+ + HKL+ G L ITMPKQT AA
Sbjct: 61 ARTVVVTGDRLLATNRLLILNKTYPIPQDCSIDKVHHKLEAGFLIITMPKQTAPPAAP-- 120
Query: 121 APPFKEPEQTAPEKGREEISPENTTPPPKEPEQTTPQKGGEEISPENAAPPPKEPEQTTP 180
K+PEQ PEKG EE +PEN TPP KEPEQ TP+KG EE SP NA+PPPK P+QT+
Sbjct: 121 ----KDPEQKTPEKGSEETTPENATPPQKEPEQKTPEKGSEETSPGNASPPPKGPKQTSL 180
Query: 181 QKGGQETSSENAAPP--------ETKEEIMQTVEENKGKSAELQKQASAKAEEEAPTPAP 240
+KGG+ETS NA PP KE + EE+KGKSAELQK+ S KA EEAPTPAP
Sbjct: 181 EKGGEETSPGNATPPPKEPEQTTRKKESEEISPEEDKGKSAELQKKGSVKAGEEAPTPAP 240
Query: 241 AVVPPPV---KSPAEGGSGEAKTTSDEKISSPDQKLTEKKEIENQNGEKGKESKTEEVGK 300
VPPP P G SG+ KTT DEKI +P+QK TEK ENQN EKGKESKTE+VGK
Sbjct: 241 TEVPPPAAAKNGPVRGESGKEKTTPDEKIKNPNQKPTEK---ENQNPEKGKESKTEKVGK 300
Query: 301 NREEPKIGTGSRSPGATGVGKLAGGYTVRRMPLLVTASLAAAVVTSVAAYFAYAYYGLSF 353
N + KIGTG+ S AT K A G+T R L VTAS+AAAVVT AAY A+AYYG SF
Sbjct: 301 NEKTGKIGTGTPSQKATTGKKYAAGFTNTRRVLPVTASVAAAVVTVAAAYLAFAYYGFSF 354
BLAST of Spg009952 vs. NCBI nr
Match:
KAG6573906.1 (Inactive protein RESTRICTED TEV MOVEMENT 2, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 370.9 bits (951), Expect = 1.2e-98
Identity = 235/396 (59.34%), Positives = 267/396 (67.42%), Query Frame = 0
Query: 1 MATARPRTAGLGAVRRQSMRVYNEPFNPNVEEREENEAHILLLELPDFTKQHVKVKTEKE 60
MAT RPRTAGLGA+RRQS+R YNEPF PNV E++ENEAHIL LELPDF +QHVKVK E+
Sbjct: 1 MATGRPRTAGLGALRRQSLRAYNEPFTPNVVEKDENEAHILRLELPDFNEQHVKVKVEEG 60
Query: 61 ERTVVVTGDRNVGNNRLLILDKTFPIPQNCVIENIDHKLQDGLLTITMPKQTTAAAAATV 120
RTVVVTGDR + NRLLIL+KT+PIPQ+C I+ + HKL+ G LTITMPKQT AA
Sbjct: 61 ARTVVVTGDRLLATNRLLILNKTYPIPQDCSIDKVHHKLEAGFLTITMPKQTAPPAAVAT 120
Query: 121 A-----------------------PPFKEPEQTAPEKGREEISPENTTPPPKEPEQTTPQ 180
A PP KEPEQ PEKG EE SP N +PP KEPEQ TP+
Sbjct: 121 AAPKDPEQTTPEKGSGETTPGNASPPQKEPEQKTPEKGSEETSPGNASPPQKEPEQKTPE 180
Query: 181 KGGEEISPENAAPPPKEPEQTTPQKGGQETSSENAAP------------------PETKE 240
KG EE SP NA+PPPK P+QT+ +KG +ETS NA P PE K
Sbjct: 181 KGSEETSPGNASPPPKGPKQTSLEKGNEETSPGNATPPPKEPEQTTPKKESEEISPEMKA 240
Query: 241 EIMQTVEENKGKSAELQKQASAKAEEEAPTPAPAVVPPPV---KSPAEGGSGEAKTTSDE 300
+I + EE+KGKSAELQK+ S KAEEEAPTPAP VPPP P G SG+ KTT DE
Sbjct: 241 KIKRPEEEDKGKSAELQKKGSVKAEEEAPTPAPTEVPPPAAAKNGPVRGESGKEKTTPDE 300
Query: 301 KISSPDQKLTEKKEIENQNGEKGKESKTEEVGKNREEPKIGTGSRSPGATGVGKLAGGYT 353
KI++P+QK TEK ENQN EKGKESKTEEVGKN + KIGTG+ S A K A G T
Sbjct: 301 KINNPNQKPTEK---ENQNPEKGKESKTEEVGKNEKTGKIGTGTPSQKAIIGKKPAAGVT 360
BLAST of Spg009952 vs. NCBI nr
Match:
KAG7012971.1 (Inactive protein RESTRICTED TEV MOVEMENT 2, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 370.9 bits (951), Expect = 1.2e-98
Identity = 235/396 (59.34%), Positives = 267/396 (67.42%), Query Frame = 0
Query: 1 MATARPRTAGLGAVRRQSMRVYNEPFNPNVEEREENEAHILLLELPDFTKQHVKVKTEKE 60
MAT RPRTAGLGA+RRQS+R YNEPF PNV E++ENEAHIL LELPDF +QHVKVK E+
Sbjct: 1 MATGRPRTAGLGALRRQSLRAYNEPFTPNVVEKDENEAHILRLELPDFNEQHVKVKVEEG 60
Query: 61 ERTVVVTGDRNVGNNRLLILDKTFPIPQNCVIENIDHKLQDGLLTITMPKQTTAAAAATV 120
RTVVVTGDR + NRLLIL+KT+PIPQ+C I+ + HKL+ G LTITMPKQT AA
Sbjct: 61 ARTVVVTGDRLLATNRLLILNKTYPIPQDCSIDKVHHKLEAGFLTITMPKQTAPPAAVAT 120
Query: 121 A-----------------------PPFKEPEQTAPEKGREEISPENTTPPPKEPEQTTPQ 180
A PP KE EQ PEKG E SPEN +PP KEPEQ TP+
Sbjct: 121 AAPKDPGETTPEKGSGETTPENATPPQKEAEQKTPEKGSGETSPENASPPQKEPEQKTPE 180
Query: 181 KGGEEISPENAAPPPKEPEQTTPQKGGQETSSENAAP------------------PETKE 240
KG EE SP NA+PPPK P+QT+ +KG +ETS NA P PE K
Sbjct: 181 KGSEETSPGNASPPPKGPKQTSLEKGNEETSPGNATPPPKEPEQTTPKKESEEISPEMKA 240
Query: 241 EIMQTVEENKGKSAELQKQASAKAEEEAPTPAPAVVPPPV---KSPAEGGSGEAKTTSDE 300
+I + EE+KGKSAELQK+ S KAEEEAPTPAP VPPP P G SG+ KTT DE
Sbjct: 241 KIKRPEEEDKGKSAELQKKGSVKAEEEAPTPAPTEVPPPAAAKNGPVRGESGKEKTTPDE 300
Query: 301 KISSPDQKLTEKKEIENQNGEKGKESKTEEVGKNREEPKIGTGSRSPGATGVGKLAGGYT 353
KI++P+QK TEK ENQN EKGKESKTEEVGKN + KIGTG+ S AT K A G T
Sbjct: 301 KINNPNQKPTEK---ENQNPEKGKESKTEEVGKNEKTGKIGTGTPSQKATTGKKPAAGVT 360
BLAST of Spg009952 vs. NCBI nr
Match:
XP_023541033.1 (proteoglycan 4 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 366.7 bits (940), Expect = 2.2e-97
Identity = 233/412 (56.55%), Positives = 268/412 (65.05%), Query Frame = 0
Query: 1 MATARPRTAGLGAVRRQSMRVYNEPFNPNVEEREENEAHILLLELPDFTKQHVKVKTEKE 60
MAT RPRT GLGA+RRQS+R YNEPF PNV E++ENEAHIL LELPDF +QHVKVK E+
Sbjct: 1 MATGRPRTVGLGALRRQSLRAYNEPFTPNVVEKDENEAHILRLELPDFNEQHVKVKVEEG 60
Query: 61 ERTVVVTGDRNVGNNRLLILDKTFPIPQNCVIENIDHKLQDGLLTITMPKQTTAAAAATV 120
RTVVVTGDR + NRLLIL+KT+PIPQ+C I+ + HKL+ G LTITMPKQT AA
Sbjct: 61 ARTVVVTGDRLLATNRLLILNKTYPIPQDCSIDKVHHKLEAGFLTITMPKQTAPPAAVAT 120
Query: 121 APPFKEPEQTAPEKGREEISPENTTPPPKEPEQTTPQKGGEEISPENAAPPPKEPEQTTP 180
A P K+PEQT PEKG E +PEN TPP KEPEQ TP+KG EE SPENA+PP KEPEQ TP
Sbjct: 121 AAP-KDPEQTTPEKGSGETTPENATPPQKEPEQKTPEKGSEETSPENASPPQKEPEQITP 180
Query: 181 QKGGQETSSENAAP---------------------------------------------- 240
+KG +ETS NA+P
Sbjct: 181 EKGSEETSPGNASPPLKEPEQASLKKGSEETSPGNATPPPKGPKQTSLEKGSEETSPGNA 240
Query: 241 -----------PETKEEIMQTVEENKGKSAELQKQASAKAEEEAPTPAPAVVPPPV---K 300
P+ + E + EE+KGKSAELQK+ S KAEEEAPTPAP VPPP
Sbjct: 241 TPPPKEPEQTTPKKESEEISPEEEDKGKSAELQKKGSVKAEEEAPTPAPTEVPPPAAAKN 300
Query: 301 SPAEGGSGEAKTTSDEKISSPDQKLTEKKEIENQNGEKGKESKTEEVGKNREEPKIGTGS 353
P +G SG+ KT DEKI++P+QK TEK NQN EKGKESKTEEVGKN + KIGTG+
Sbjct: 301 GPVQGESGKEKTPPDEKINNPNQKPTEK---GNQNPEKGKESKTEEVGKNEKTGKIGTGT 360
BLAST of Spg009952 vs. ExPASy TrEMBL
Match:
A0A6J1HU50 (proteoglycan 4 OS=Cucurbita maxima OX=3661 GN=LOC111467490 PE=3 SV=1)
HSP 1 Score: 376.3 bits (965), Expect = 1.4e-100
Identity = 235/388 (60.57%), Positives = 269/388 (69.33%), Query Frame = 0
Query: 1 MATARPRTAGLGAVRRQSMRVYNEPFNPNVEEREENEAHILLLELPDFTKQHVKVKTEKE 60
MAT RPR+AGLGA+RRQS+R YNEPF PNV ER+ENEAHIL L+LPDF +QHVKVK E+
Sbjct: 1 MATGRPRSAGLGALRRQSLRAYNEPFTPNVVERDENEAHILQLQLPDFNEQHVKVKVEEG 60
Query: 61 ERTVVVTGDRNVGNNRLLILDKTFPIPQNCVIENIDHKLQDGLLTITMPKQTTAAAAATV 120
RTVVVTGDR + NNRLLILDKT+PIPQ+C I+ + HKL+ G LTITMPKQT A
Sbjct: 61 ARTVVVTGDRLLANNRLLILDKTYPIPQDCPIDKVHHKLEAGYLTITMPKQTAPPEAVAT 120
Query: 121 APPFKEPEQTAPEKGREEISPENTTPPPKEPEQTTPQKGGEEISPENAAPPPKEPEQTTP 180
A P K+PEQT PEKG EE +PEN TPP KEPEQ TP+KG EE SPENA+P KEPEQT+
Sbjct: 121 AAP-KDPEQTTPEKGSEETTPENATPPQKEPEQNTPEKGSEETSPENASPSQKEPEQTSL 180
Query: 181 QKGGQETSSENAAP---------------------------------PETKEEIMQTVEE 240
+KG +ETS NA P P+ + E + EE
Sbjct: 181 EKGSEETSPGNATPPPKGPKQTSVEKGSEETSPGNATPLPKEPEQTTPKKESEEISPEEE 240
Query: 241 NKGKSAELQKQASAKAEEEAPTPAPAVVPPPV---KSPAEGGSGEAKTTSDEKISSPDQK 300
+KGKSAELQK+ S KAEEEAPT AP VPPP P +G SG+ KTT DEKI +P+QK
Sbjct: 241 DKGKSAELQKKGSVKAEEEAPTQAPTEVPPPAAAKNGPVQGESGKEKTTPDEKIKNPNQK 300
Query: 301 LTEKKEIENQNGEKGKESKTEEVGKNREEPKIGTGSRSPGATGVGKLAGGYTVRRMPLLV 353
TEK ENQN EKGKESKTE+VGKN + KIGTG+ S AT K A G+T R L V
Sbjct: 301 PTEK---ENQNPEKGKESKTEKVGKNEKTGKIGTGTPSRKATTSKKHAAGFTNTRRVLSV 360
BLAST of Spg009952 vs. ExPASy TrEMBL
Match:
A0A6J1G164 (proteoglycan 4 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111449725 PE=3 SV=1)
HSP 1 Score: 374.8 bits (961), Expect = 4.0e-100
Identity = 230/363 (63.36%), Positives = 263/363 (72.45%), Query Frame = 0
Query: 1 MATARPRTAGLGAVRRQSMRVYNEPFNPNVEEREENEAHILLLELPDFTKQHVKVKTEKE 60
MAT RPRTAGLGA+RRQS+R YNEPF PNV E++ENEAHIL LELPDF +QHVKVK E+
Sbjct: 1 MATGRPRTAGLGALRRQSLRAYNEPFTPNVVEKDENEAHILRLELPDFNEQHVKVKVEEG 60
Query: 61 ERTVVVTGDRNVGNNRLLILDKTFPIPQNCVIENIDHKLQDGLLTITMPKQTTAAAAATV 120
RTVVVTGDR + NRLLIL+KT+PIPQ+C I+ + HKL+ G L ITMPKQT AA
Sbjct: 61 ARTVVVTGDRLLATNRLLILNKTYPIPQDCSIDKVHHKLEAGFLIITMPKQTAPPAAP-- 120
Query: 121 APPFKEPEQTAPEKGREEISPENTTPPPKEPEQTTPQKGGEEISPENAAPPPKEPEQTTP 180
K+PEQ PEKG EE +PEN TPP KEPEQ TP+KG EE SP NA+PPPK P+QT+
Sbjct: 121 ----KDPEQKTPEKGSEETTPENATPPQKEPEQKTPEKGSEETSPGNASPPPKGPKQTSL 180
Query: 181 QKGGQETSSENAAPP--------ETKEEIMQTVEENKGKSAELQKQASAKAEEEAPTPAP 240
+KGG+ETS NA PP KE + EE+KGKSAELQK+ S KA EEAPTPAP
Sbjct: 181 EKGGEETSPGNATPPPKEPEQTTRKKESEEISPEEDKGKSAELQKKGSVKAGEEAPTPAP 240
Query: 241 AVVPPPV---KSPAEGGSGEAKTTSDEKISSPDQKLTEKKEIENQNGEKGKESKTEEVGK 300
VPPP P G SG+ KTT DEKI +P+QK TEK ENQN EKGKESKTE+VGK
Sbjct: 241 TEVPPPAAAKNGPVRGESGKEKTTPDEKIKNPNQKPTEK---ENQNPEKGKESKTEKVGK 300
Query: 301 NREEPKIGTGSRSPGATGVGKLAGGYTVRRMPLLVTASLAAAVVTSVAAYFAYAYYGLSF 353
N + KIGTG+ S AT K A G+T R L VTAS+AAAVVT AAY A+AYYG SF
Sbjct: 301 NEKTGKIGTGTPSQKATTGKKYAAGFTNTRRVLPVTASVAAAVVTVAAAYLAFAYYGFSF 354
BLAST of Spg009952 vs. ExPASy TrEMBL
Match:
A0A6J1G142 (proteoglycan 4 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111449725 PE=3 SV=1)
HSP 1 Score: 356.7 bits (914), Expect = 1.1e-94
Identity = 230/405 (56.79%), Positives = 263/405 (64.94%), Query Frame = 0
Query: 1 MATARPRTAGLGAVRRQSMRVYNEPFNPNVEEREENEAHILLLELPDFTKQHVKVKTEKE 60
MAT RPRTAGLGA+RRQS+R YNEPF PNV E++ENEAHIL LELPDF +QHVKVK E+
Sbjct: 1 MATGRPRTAGLGALRRQSLRAYNEPFTPNVVEKDENEAHILRLELPDFNEQHVKVKVEEG 60
Query: 61 ERTVVVTGDRNVGNNRLLILDKTFPIPQNCVIENIDHKLQDGLLTITMPKQTTAAAA--- 120
RTVVVTGDR + NRLLIL+KT+PIPQ+C I+ + HKL+ G L ITMPKQT AA
Sbjct: 61 ARTVVVTGDRLLATNRLLILNKTYPIPQDCSIDKVHHKLEAGFLIITMPKQTAPPAAPKD 120
Query: 121 ---------------ATVAPPFKEPEQTAPEKGREEISPENTTPPPKEPEQTTPQKGGEE 180
PP KEPEQ PEKG E SPEN +P KEPEQ TP+KG EE
Sbjct: 121 PEQKTPEKGSEETTPENATPPQKEPEQKTPEKGSGETSPENASPLQKEPEQKTPEKGSEE 180
Query: 181 ISPENAAPPPKEPEQTTPQKGGQETSSENAAPP--------------------------- 240
SP NA+PPPK P+QT+ +KG +ETS NA+PP
Sbjct: 181 TSPGNASPPPKGPKQTSLKKGSEETSPGNASPPPKGPKQTSLEKGGEETSPGNATPPPKE 240
Query: 241 -----ETKEEIMQTVEENKGKSAELQKQASAKAEEEAPTPAPAVVPPPV---KSPAEGGS 300
KE + EE+KGKSAELQK+ S KA EEAPTPAP VPPP P G S
Sbjct: 241 PEQTTRKKESEEISPEEDKGKSAELQKKGSVKAGEEAPTPAPTEVPPPAAAKNGPVRGES 300
Query: 301 GEAKTTSDEKISSPDQKLTEKKEIENQNGEKGKESKTEEVGKNREEPKIGTGSRSPGATG 353
G+ KTT DEKI +P+QK TEK ENQN EKGKESKTE+VGKN + KIGTG+ S AT
Sbjct: 301 GKEKTTPDEKIKNPNQKPTEK---ENQNPEKGKESKTEKVGKNEKTGKIGTGTPSQKATT 360
BLAST of Spg009952 vs. ExPASy TrEMBL
Match:
A0A1S3BFR5 (neurofilament heavy polypeptide-like OS=Cucumis melo OX=3656 GN=LOC103489354 PE=3 SV=1)
HSP 1 Score: 310.5 bits (794), Expect = 9.3e-81
Identity = 216/398 (54.27%), Positives = 265/398 (66.58%), Query Frame = 0
Query: 1 MATARPRTAGLGAVRRQSMRVYNEPFNPNVEEREENEAHILLLELPDFTKQHVKVKTEKE 60
MAT RPR LG +RRQSMR YNEPF P+VEE +ENEAHIL L+LPDF +HV V E+E
Sbjct: 1 MATPRPRIGNLG-IRRQSMRAYNEPFTPDVEEIDENEAHILRLQLPDF--EHVNVNVERE 60
Query: 61 ERTVVVTGDRNVGNNRLLILDKTFPIPQNCVIENIDHKLQDGLLTITMPKQTT------- 120
RTVVVTGDR+V N RLLIL+KTFPIPQNC + ++HKLQDG+LTIT+ KQ T
Sbjct: 61 ARTVVVTGDRHVSNTRLLILNKTFPIPQNCKSDGVEHKLQDGVLTITILKQITEPVTAPP 120
Query: 121 AAAAATVAPP-----FKEPE-------QTAPEKGREEISPENTTPPP-----KEPE---- 180
AA + APP KEP+ ++ P+K +EEIS N +PP KEPE
Sbjct: 121 LQAAESTAPPETKAENKEPDTAALTKSESVPDKAKEEISSANVSPPETKAKIKEPEAALT 180
Query: 181 --QTTPQKGGEEISPENAAPPP-----KEP------EQTTPQKGGQETSSENAAPPETKE 240
++TP K EEIS NA+PP KEP + +TP+KG ++ S N APPE+KE
Sbjct: 181 KSESTPDKAKEEISSANASPPETKAEIKEPAAALPKDDSTPEKGREDISPGNVAPPESKE 240
Query: 241 EIMQ----TVEENKGKSAELQKQASAKA-EEEAPTPAPAVVPPPVKSPAEGGSGEAKTTS 300
I + + +++GKSA LQKQ SAKA +EEAPTPAP V P PA+ G+ +TT
Sbjct: 241 AIKEPKAAALPKDEGKSAALQKQGSAKATKEEAPTPAPLVASQP---PADRNYGKEETTL 300
Query: 301 DEKISSPDQKLTEKKEIENQNGEKGKESKTEEVGKNREEPKIGTGSRSPGATGVGKLAGG 353
D I+S + + KEIENQN EKGKESKTEEV KN E +IGTG+ SP T VGKLAGG
Sbjct: 301 DHNINSQE----KSKEIENQNPEKGKESKTEEVRKNEETAEIGTGTPSPRGTKVGKLAGG 360
BLAST of Spg009952 vs. ExPASy TrEMBL
Match:
A0A5D3CB04 (Neurofilament heavy polypeptide-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold475G001690 PE=4 SV=1)
HSP 1 Score: 292.4 bits (747), Expect = 2.6e-75
Identity = 210/398 (52.76%), Positives = 257/398 (64.57%), Query Frame = 0
Query: 1 MATARPRTAGLGAVRRQSMRVYNEPFNPNVEEREENEAHILLLELPDFTKQHVKVKTEKE 60
MAT RPR LG +RRQSMR YNEPF P+VEE +ENEAHIL L+LP E
Sbjct: 1 MATPRPRIGNLG-IRRQSMRAYNEPFTPDVEEIDENEAHILRLQLP-------------E 60
Query: 61 ERTVVVTGDRNVGNNRLLILDKTFPIPQNCVIENIDHKLQDGLLTITMPKQTT------- 120
RTVVVTGDR+V N RLLIL+KTFPIPQNC + ++HKLQDG+LTIT+ KQ T
Sbjct: 61 ARTVVVTGDRHVSNTRLLILNKTFPIPQNCKSDGVEHKLQDGVLTITILKQITEPVTAPP 120
Query: 121 AAAAATVAPP-----FKEPE-------QTAPEKGREEISPENTTPPP-----KEPE---- 180
AA + APP KEP+ ++ P+K +EEIS N +PP KEPE
Sbjct: 121 LQAAESTAPPETKAENKEPDTAALTKSESVPDKAKEEISSANVSPPETKAKIKEPEAALT 180
Query: 181 --QTTPQKGGEEISPENAAPPP-----KEP------EQTTPQKGGQETSSENAAPPETKE 240
++TP K EEIS NA+PP KEP + +TP+KG ++ S N APPE+KE
Sbjct: 181 KSESTPDKAKEEISSANASPPETKAEIKEPAAALPKDDSTPEKGREDISPGNVAPPESKE 240
Query: 241 EIMQ----TVEENKGKSAELQKQASAKA-EEEAPTPAPAVVPPPVKSPAEGGSGEAKTTS 300
I + + +++GKSA LQKQ SAKA +EEAPTPAP V P PA+ G+ +TT
Sbjct: 241 AIKEPKAAALPKDEGKSAALQKQGSAKATKEEAPTPAPLVASQP---PADRNYGKEETTL 300
Query: 301 DEKISSPDQKLTEKKEIENQNGEKGKESKTEEVGKNREEPKIGTGSRSPGATGVGKLAGG 353
D I+S + + KEIENQN EKGKESKTEEV KN E +IGTG+ SP T VGKLAGG
Sbjct: 301 DHNINSQE----KSKEIENQNPEKGKESKTEEVRKNEETAEIGTGTPSPRGTKVGKLAGG 360
BLAST of Spg009952 vs. TAIR 10
Match:
AT2G29500.1 (HSP20-like chaperones superfamily protein)
HSP 1 Score: 48.9 bits (115), Expect = 9.7e-06
Identity = 30/101 (29.70%), Positives = 57/101 (56.44%), Query Frame = 0
Query: 27 NPNVEEREENEAHILLLELPDFTKQHVKVKTEKEERTVVVTGDRNV----GNNRLLILDK 86
N V+ RE EAH+ +LP K+ VKV+ E E+ + ++G+R+V N+ +++
Sbjct: 45 NARVDWRETPEAHVFKADLPGLKKEEVKVEIE-EDSVLKISGERHVEKEDKNDTWHRVER 104
Query: 87 T-------FPIPQNCVIENIDHKLQDGLLTITMPKQTTAAA 117
+ F +P+N ++ + +++G+LT+T+PK T A
Sbjct: 105 SSGQFTRRFRLPENVKMDQVKAAMENGVLTVTVPKAETKKA 144
BLAST of Spg009952 vs. TAIR 10
Match:
AT1G07400.1 (HSP20-like chaperones superfamily protein)
HSP 1 Score: 47.0 bits (110), Expect = 3.7e-05
Identity = 28/105 (26.67%), Positives = 53/105 (50.48%), Query Frame = 0
Query: 27 NPNVEEREENEAHILLLELPDFTKQHVKVKTEKEERTVVVTGDRNVGNNRLL-------- 86
N V+ +E EAH+ +LP K+ VKV+ E ++ + ++G+R+V
Sbjct: 47 NARVDWKETAEAHVFKADLPGMKKEEVKVEIE-DDSVLKISGERHVEKEEKQDTWHRVER 106
Query: 87 ---ILDKTFPIPQNCVIENIDHKLQDGLLTITMPKQTTAAAAATV 121
+ F +P+N ++ + +++G+LT+T+PK A A V
Sbjct: 107 SSGQFSRKFKLPENVKMDQVKASMENGVLTVTVPKVEEAKKKAQV 150
BLAST of Spg009952 vs. TAIR 10
Match:
AT1G59860.1 (HSP20-like chaperones superfamily protein)
HSP 1 Score: 46.6 bits (109), Expect = 4.8e-05
Identity = 28/105 (26.67%), Positives = 53/105 (50.48%), Query Frame = 0
Query: 27 NPNVEEREENEAHILLLELPDFTKQHVKVKTEKEERTVVVTGDRNVGNNRLLI------- 86
N V+ +E EAH+ +LP K+ VKV+ E ++ + ++G+R+V
Sbjct: 45 NARVDWKETAEAHVFKADLPGMKKEEVKVEIE-DDSVLKISGERHVEKEEKQDTWHRVER 104
Query: 87 ----LDKTFPIPQNCVIENIDHKLQDGLLTITMPKQTTAAAAATV 121
+ F +P+N ++ + +++G+LT+T+PK T A V
Sbjct: 105 SSGGFSRKFRLPENVKMDQVKASMENGVLTVTVPKVETNKKKAQV 148
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_022968176.1 | 2.8e-100 | 60.57 | proteoglycan 4 [Cucurbita maxima]
XP_022945514.1 | 8.3e-100 | 63.36 | proteoglycan 4 isoform X2 [Cucurbita moschata]
KAG6573906.1 | 1.2e-98 | 59.34 | Inactive protein RESTRICTED TEV MOVEMENT 2, partial [Cucurbita argyrosperma subs...
KAG7012971.1 | 1.2e-98 | 59.34 | Inactive protein RESTRICTED TEV MOVEMENT 2, partial [Cucurbita argyrosperma subs...
XP_023541033.1 | 2.2e-97 | 56.55 | proteoglycan 4 [Cucurbita pepo subsp. pepo]
Match Name | E-value | Identity | Description
A0A6J1HU50 | 1.4e-100 | 60.57 | proteoglycan 4 OS=Cucurbita maxima OX=3661 GN=LOC111467490 PE=3 SV=1
A0A6J1G164 | 4.0e-100 | 63.36 | proteoglycan 4 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111449725 PE=3 SV=...
A0A6J1G142 | 1.1e-94 | 56.79 | proteoglycan 4 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111449725 PE=3 SV=...
A0A1S3BFR5 | 9.3e-81 | 54.27 | neurofilament heavy polypeptide-like OS=Cucumis melo OX=3656 GN=LOC103489354 PE=...
A0A5D3CB04 | 2.6e-75 | 52.76 | Neurofilament heavy polypeptide-like OS=Cucumis melo var. makuwa OX=1194695 GN=E...
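For downstream filtering, rows of the pipe-delimited summary tables can be read into records and ranked by E-value. This is a minimal sketch under the assumption that each row carries at least the four fields Match Name, E-value, Identity and Description, in that order; `parse_hit` is a hypothetical helper, and any trailing columns are ignored.

```python
def parse_hit(row: str) -> dict:
    """Split one summary row into its first four pipe-delimited fields."""
    fields = [field.strip() for field in row.split("|")]
    name, evalue, identity, description = fields[:4]
    return {
        "name": name,
        "evalue": float(evalue),      # e.g. "2.8e-100" -> 2.8e-100
        "identity": float(identity),  # percent identity
        "description": description,
    }


# Three rows copied from the NCBI nr summary, deliberately out of order.
rows = [
    "XP_023541033.1 | 2.2e-97 | 56.55 | proteoglycan 4 [Cucurbita pepo subsp. pepo]",
    "XP_022968176.1 | 2.8e-100 | 60.57 | proteoglycan 4 [Cucurbita maxima]",
    "XP_022945514.1 | 8.3e-100 | 63.36 | proteoglycan 4 isoform X2 [Cucurbita moschata]",
]

# Smaller E-values indicate more significant matches, so sort ascending.
hits = sorted((parse_hit(row) for row in rows), key=lambda hit: hit["evalue"])
for hit in hits:
    print(hit["name"], hit["evalue"], hit["identity"])
```

Ranking by E-value and by identity can disagree: XP_022945514.1 has the highest identity of these three hits but not the lowest E-value.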