Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTCCTCACCTCAAACAACATCAACACTGCCATCTAAAAATGATTTGCCTCCTCATTTTAATCGCTCTCAACTTCAGTCTCCTCGACCTCTCACAGGCCAGGCACCACAACAACCTCCCTTCCGCCGCAGTCGTCGGTACCGTCTTCTGCGACACGTGTTTTCAAGACACGTTTTCTAAGACAAGTCACTTCATTTCAGGTACTTTACCAACTAGCTCGTAGAGTTAATATGATGTTGGTTTGTTCATATATTGATGTGTACTATTTAGGCGCGACGGTGGCTGTCGAATGTGGCAATGGGGGATCGAACCCGAGTTTTAGAGACGAAGTAAAGACAGACAAAACAGGGGAATTCAAGATTCAGCTGCCAGTTTCAGTGAGGAAGATTGAGGAATGTTATGTGCGGTTAATAAGAAGCAGTGAACCATATTGTGCGGTGGCTGCAAGAGCCAAATCATCATCGCTTAAGCTCAAGTCAAGAAAACAAGGCATGCATGTGTTCTCGGCTGGATTCTTCACTTTCAAGCCTCTTAAACAGCCAAAACTTTGCAGCCATAACTCACATTTTAATGAATTTGATGACACGAAACAAGTCGTTGACTTCCCGGGGTTACCGGCTCCGATCCAGAACCCGACCGTGCCGAACGTTCCTCGGATTTACGATAACCTTCCGCCTCTACCTCTTCTTCCTGGACTTCCTCCATTGCCTCAACTTCCTCCTCTCCCTCCTCTTCCACCTCTCCCTGTTTTTCCATTGTTTCCACCAAAAAAGGATGATGAAAATGTACAAACTCCAAAGATAAGTCAAAACCCGGACCTGTTTCATCCACAAACTCTCCTTCCAATTCCATCTTTAAAGCCTTTCAGGCCACATTTTGTTATGCCTCCACATAAGCTGCGCCACCATCCTCTCACGCATGGTCCTACACCGCCCTCAGCCGCTGCCGCGGCGGGCGAGTTGGCTCCGTCCCCGCCGCTCCCTTTTTCACTCCCATCCATCCCTAACATGCCCGAGATCTCCTCACCTCCAAAGCAAACTTCTCCTTAAAAAATTATAACGACCCTAGTTTTCGAAGGACCATCGTCACAAAAATAAAACTTTAAATTAGTAATTTTTACTTGAACTGATGTACTTATTTTCTAGTACGAATTAATAATCGTGAAGGAAATTATCAATGAATTTGAGAGTAAATTGTGGGATTTGACGTTTTGGATAAGATTGATCATATAATTATTCAAATGTCAAATAGATTAAATTGGACGATTGATCCAATTGCTTTTGTTTGTTATAGATTTTGTTCTTTTGTGATCAATAATTTGAATACGTTATATTATTTTTCTAAATTTTGTTTAAAAAAAAAAAAAAATCGTTGTAAAAGGATATAAATTCGGACCTTAGAAAAAAAGAAGTGAAATCTTTTCCAAGCAAGAACAAAATTATTTTATCCTTTTTGGCACGAGACAAATTATACATTTGATTGGAGAGGGGATAATAAATGTTGTTATCGGTTGATGAAGATTCAAATCCCGAAAATAAATTTAGGTTGGAATTGGTACCAAATTCTAAATTGGTAGCAGTTCTTGGGGGAGTAGAACATAGTGGAGATTAAATTAGAGCGTTCTATCGAACAAAACCATCGAAGCAAAGGAGTCACCGGGAGCGCGATATAGAGATAAACAGAGGGAAAGAGGGGAAGAGGGCAGTAGCTTTATTTTCGCGGAATTCGTAGCTTTAATTTCATCGCCTTCAATTCCTTTTTCCAATCTAAACGCCCCTTCAAACGCGCAATCTTGATTTAGCTTGAAATCCCAATTATCTGCGTGCGTGATTTTTCCCAAAAAGTGCACGTATATATTTTCCGTATATTCTCCCAATTTCCGTCTTCCAGGTTTCTCTGTCCTCCACCCACCCACCCAC
mRNA sequence
TTCCTCACCTCAAACAACATCAACACTGCCATCTAAAAATGATTTGCCTCCTCATTTTAATCGCTCTCAACTTCAGTCTCCTCGACCTCTCACAGGCCAGGCACCACAACAACCTCCCTTCCGCCGCAGTCGTCGGTACCGTCTTCTGCGACACGTGTTTTCAAGACACGTTTTCTAAGACAAGTCACTTCATTTCAGGCGCGACGGTGGCTGTCGAATGTGGCAATGGGGGATCGAACCCGAGTTTTAGAGACGAAGTAAAGACAGACAAAACAGGGGAATTCAAGATTCAGCTGCCAGTTTCAGTGAGGAAGATTGAGGAATGTTATGTGCGGTTAATAAGAAGCAGTGAACCATATTGTGCGGTGGCTGCAAGAGCCAAATCATCATCGCTTAAGCTCAAGTCAAGAAAACAAGGCATGCATGTGTTCTCGGCTGGATTCTTCACTTTCAAGCCTCTTAAACAGCCAAAACTTTGCAGCCATAACTCACATTTTAATGAATTTGATGACACGAAACAAGTCGTTGACTTCCCGGGGTTACCGGCTCCGATCCAGAACCCGACCGTGCCGAACGTTCCTCGGATTTACGATAACCTTCCGCCTCTACCTCTTCTTCCTGGACTTCCTCCATTGCCTCAACTTCCTCCTCTCCCTCCTCTTCCACCTCTCCCTGTTTTTCCATTGTTTCCACCAAAAAAGGATGATGAAAATGTACAAACTCCAAAGATAAGTCAAAACCCGGACCTGTTTCATCCACAAACTCTCCTTCCAATTCCATCTTTAAAGCCTTTCAGGCCACATTTTGTTATGCCTCCACATAAGCTGCGCCACCATCCTCTCACGCATGGTCCTACACCGCCCTCAGCCGCTGCCGCGGCGGGCGAGTTGGCTCCGTCCCCGCCGCTCCCTTTTTCACTCCCATCCATCCCTAACATGCCCGAGATCTCCTCACCTCCAAAGCAAACTTCTCCTTAAAAAATTATAACGACCCTAGTTTTCGAAGGACCATCGTCACAAAAATAAAACTTTAAATTAGTAATTTTTACTTGAACTGATGTACTTATTTTCTAGTACGAATTAATAATCGTGAAGGAAATTATCAATGAATTTGAGAGTAAATTGTGGGATTTGACGTTTTGGATAAGATTGATCATATAATTATTCAAATGTCAAATAGATTAAATTGGACGATTGATCCAATTGCTTTTGTTTGTTATAGATTTTGTTCTTTTGTGATCAATAATTTGAATACGTTATATTATTTTTCTAAATTTTGTTTAAAAAAAAAAAAAAATCGTTGTAAAAGGATATAAATTCGGACCTTAGAAAAAAAGAAGTGAAATCTTTTCCAAGCAAGAACAAAATTATTTTATCCTTTTTGGCACGAGACAAATTATACATTTGATTGGAGAGGGGATAATAAATGTTGTTATCGGTTGATGAAGATTCAAATCCCGAAAATAAATTTAGGTTGGAATTGGTACCAAATTCTAAATTGGTAGCAGTTCTTGGGGGAGTAGAACATAGTGGAGATTAAATTAGAGCGTTCTATCGAACAAAACCATCGAAGCAAAGGAGTCACCGGGAGCGCGATATAGAGATAAACAGAGGGAAAGAGGGGAAGAGGGCAGTAGCTTTATTTTCGCGGAATTCGTAGCTTTAATTTCATCGCCTTCAATTCCTTTTTCCAATCTAAACGCCCCTTCAAACGCGCAATCTTGATTTAGCTTGAAATCCCAATTATCTGCGTGCGTGATTTTTCCCAAAAAGTGCACGTATATATTTTCCGTATATTCTCCCAATTTCCGTCTTCCAGGTTTCTCTGTCCTCCACCCACCCACCCAC
Coding sequence (CDS)
ATGATTTGCCTCCTCATTTTAATCGCTCTCAACTTCAGTCTCCTCGACCTCTCACAGGCCAGGCACCACAACAACCTCCCTTCCGCCGCAGTCGTCGGTACCGTCTTCTGCGACACGTGTTTTCAAGACACGTTTTCTAAGACAAGTCACTTCATTTCAGGCGCGACGGTGGCTGTCGAATGTGGCAATGGGGGATCGAACCCGAGTTTTAGAGACGAAGTAAAGACAGACAAAACAGGGGAATTCAAGATTCAGCTGCCAGTTTCAGTGAGGAAGATTGAGGAATGTTATGTGCGGTTAATAAGAAGCAGTGAACCATATTGTGCGGTGGCTGCAAGAGCCAAATCATCATCGCTTAAGCTCAAGTCAAGAAAACAAGGCATGCATGTGTTCTCGGCTGGATTCTTCACTTTCAAGCCTCTTAAACAGCCAAAACTTTGCAGCCATAACTCACATTTTAATGAATTTGATGACACGAAACAAGTCGTTGACTTCCCGGGGTTACCGGCTCCGATCCAGAACCCGACCGTGCCGAACGTTCCTCGGATTTACGATAACCTTCCGCCTCTACCTCTTCTTCCTGGACTTCCTCCATTGCCTCAACTTCCTCCTCTCCCTCCTCTTCCACCTCTCCCTGTTTTTCCATTGTTTCCACCAAAAAAGGATGATGAAAATGTACAAACTCCAAAGATAAGTCAAAACCCGGACCTGTTTCATCCACAAACTCTCCTTCCAATTCCATCTTTAAAGCCTTTCAGGCCACATTTTGTTATGCCTCCACATAAGCTGCGCCACCATCCTCTCACGCATGGTCCTACACCGCCCTCAGCCGCTGCCGCGGCGGGCGAGTTGGCTCCGTCCCCGCCGCTCCCTTTTTCACTCCCATCCATCCCTAACATGCCCGAGATCTCCTCACCTCCAAAGCAAACTTCTCCTTAA
Protein sequence
MICLLILIALNFSLLDLSQARHHNNLPSAAVVGTVFCDTCFQDTFSKTSHFISGATVAVECGNGGSNPSFRDEVKTDKTGEFKIQLPVSVRKIEECYVRLIRSSEPYCAVAARAKSSSLKLKSRKQGMHVFSAGFFTFKPLKQPKLCSHNSHFNEFDDTKQVVDFPGLPAPIQNPTVPNVPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDENVQTPKISQNPDLFHPQTLLPIPSLKPFRPHFVMPPHKLRHHPLTHGPTPPSAAAAAGELAPSPPLPFSLPSIPNMPEISSPPKQTSP
Homology
BLAST of Cp4.1LG01g22370 vs. NCBI nr
Match:
XP_023528636.1 (proline-rich protein 4-like [Cucurbita pepo subsp. pepo])
HSP 1 Score: 615 bits (1587), Expect = 4.39e-222
Identity = 312/312 (100.00%), Postives = 312/312 (100.00%), Query Frame = 0
Query: 1 MICLLILIALNFSLLDLSQARHHNNLPSAAVVGTVFCDTCFQDTFSKTSHFISGATVAVE 60
MICLLILIALNFSLLDLSQARHHNNLPSAAVVGTVFCDTCFQDTFSKTSHFISGATVAVE
Sbjct: 1 MICLLILIALNFSLLDLSQARHHNNLPSAAVVGTVFCDTCFQDTFSKTSHFISGATVAVE 60
Query: 61 CGNGGSNPSFRDEVKTDKTGEFKIQLPVSVRKIEECYVRLIRSSEPYCAVAARAKSSSLK 120
CGNGGSNPSFRDEVKTDKTGEFKIQLPVSVRKIEECYVRLIRSSEPYCAVAARAKSSSLK
Sbjct: 61 CGNGGSNPSFRDEVKTDKTGEFKIQLPVSVRKIEECYVRLIRSSEPYCAVAARAKSSSLK 120
Query: 121 LKSRKQGMHVFSAGFFTFKPLKQPKLCSHNSHFNEFDDTKQVVDFPGLPAPIQNPTVPNV 180
LKSRKQGMHVFSAGFFTFKPLKQPKLCSHNSHFNEFDDTKQVVDFPGLPAPIQNPTVPNV
Sbjct: 121 LKSRKQGMHVFSAGFFTFKPLKQPKLCSHNSHFNEFDDTKQVVDFPGLPAPIQNPTVPNV 180
Query: 181 PRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDENVQTPKISQNPDLFHP 240
PRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDENVQTPKISQNPDLFHP
Sbjct: 181 PRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDENVQTPKISQNPDLFHP 240
Query: 241 QTLLPIPSLKPFRPHFVMPPHKLRHHPLTHGPTPPSAAAAAGELAPSPPLPFSLPSIPNM 300
QTLLPIPSLKPFRPHFVMPPHKLRHHPLTHGPTPPSAAAAAGELAPSPPLPFSLPSIPNM
Sbjct: 241 QTLLPIPSLKPFRPHFVMPPHKLRHHPLTHGPTPPSAAAAAGELAPSPPLPFSLPSIPNM 300
Query: 301 PEISSPPKQTSP 312
PEISSPPKQTSP
Sbjct: 301 PEISSPPKQTSP 312
BLAST of Cp4.1LG01g22370 vs. NCBI nr
Match:
XP_022962607.1 (amyloid beta A4 precursor protein-binding family B member 1-interacting protein-like [Cucurbita moschata])
HSP 1 Score: 568 bits (1463), Expect = 4.52e-203
Identity = 294/319 (92.16%), Postives = 300/319 (94.04%), Query Frame = 0
Query: 1 MICLLILIALNFSLLDLSQARHHNNLPS-AAVVGTVFCDTCFQDTFSKTSHFISGATVAV 60
MICLLILIALNFS LDLSQARHHNNLPS AAVVGTVFCDTCFQDTFSK+SHFISGATVAV
Sbjct: 1 MICLLILIALNFSFLDLSQARHHNNLPSTAAVVGTVFCDTCFQDTFSKSSHFISGATVAV 60
Query: 61 ECGNGGSNPSFRDEVKTDKTGEFKIQLPVSVRKIEECYVRLIRSSEPYCAVAARAKSSSL 120
ECG+GGSNPSFRDEVKTDKTGEFKIQLPVSVRKIEECYVRLIRSSEPYCAVAARAKSSSL
Sbjct: 61 ECGDGGSNPSFRDEVKTDKTGEFKIQLPVSVRKIEECYVRLIRSSEPYCAVAARAKSSSL 120
Query: 121 KLKSRKQGMHVFSAGFFTFKPLKQPKLCSHNSHFNEFDDTKQVVDFPGLPAPIQNPTVPN 180
KLKSRKQG HVFSAGFFTFKPLK PKLCSHNSH +EFDDTKQVVDFPGLPAPIQNPTVPN
Sbjct: 121 KLKSRKQGTHVFSAGFFTFKPLKHPKLCSHNSHSSEFDDTKQVVDFPGLPAPIQNPTVPN 180
Query: 181 VPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDENVQTPKISQNPDLFH 240
VPRIYDNLPPL LLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDENVQTPKISQNPD+FH
Sbjct: 181 VPRIYDNLPPLTLLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDENVQTPKISQNPDMFH 240
Query: 241 PQTLLPIPSLKPFRPHFVMPPHKLRHHPLTHGP------TPPSAAAAAGELAPSPPLPFS 300
PQTLLPIPSLKP RPHFVMPPHKLRHHPLTHGP TP +AAA ELAPSPPLPFS
Sbjct: 241 PQTLLPIPSLKPLRPHFVMPPHKLRHHPLTHGPFSPSFSTPTPPSAAADELAPSPPLPFS 300
Query: 301 LPSIPNMPEISSPPKQTSP 312
LP IP MPEISSPPK+TSP
Sbjct: 301 LPPIPRMPEISSPPKETSP 319
BLAST of Cp4.1LG01g22370 vs. NCBI nr
Match:
KAG6602543.1 (hypothetical protein SDJN03_07776, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 567 bits (1462), Expect = 6.66e-203
Identity = 296/321 (92.21%), Postives = 302/321 (94.08%), Query Frame = 0
Query: 1 MICLLILIALNFSLLDLSQARHHNNLPS-AAVVGTVFCDTCFQDTFSKTSHFISGATVAV 60
MICLLILIALNFS LDLSQARHHNNLPS AAVVGTVFCDTCFQDTFSKTSHFISGATVAV
Sbjct: 1 MICLLILIALNFSFLDLSQARHHNNLPSTAAVVGTVFCDTCFQDTFSKTSHFISGATVAV 60
Query: 61 ECGNGGSNPSFRDEVKTDKTGEFKIQLPVSVRKIEECYVRLIRSSEPYCAVAARAKSSSL 120
ECG+GGSNPSFRDEVKTDKTGEFKIQLPVSVRKIEECYVRLIRSSEPYCAVAARAKSSSL
Sbjct: 61 ECGDGGSNPSFRDEVKTDKTGEFKIQLPVSVRKIEECYVRLIRSSEPYCAVAARAKSSSL 120
Query: 121 KLKSRKQGMHVFSAGFFTFKPLKQPKLCSHNSHFNEFDDTKQVVDFPGLPAPIQNPTVPN 180
KLKSRKQG HVFSAGFFTFKPLK PKLCSHNSH NEFDDTKQVV+FPGLPAPIQNPTVPN
Sbjct: 121 KLKSRKQGTHVFSAGFFTFKPLKHPKLCSHNSHSNEFDDTKQVVEFPGLPAPIQNPTVPN 180
Query: 181 VPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDENVQTPKISQNPDLFH 240
VPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDENVQTPKISQNPD+FH
Sbjct: 181 VPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDENVQTPKISQNPDMFH 240
Query: 241 PQTLLPIPSLKPFRPHFVMPPHKLRHHPLTHGP--------TPPSAAAAAGELAPSPPLP 300
PQTLLPIPSLKP RPHFVMPPHKLRHH LTHGP TPPSAAA ELAPSPPLP
Sbjct: 241 PQTLLPIPSLKPLRPHFVMPPHKLRHHLLTHGPFSPSFSTPTPPSAAAG-DELAPSPPLP 300
Query: 301 FSLPSIPNMPEISSPPKQTSP 312
FSLP IP++PEISSPPK+TSP
Sbjct: 301 FSLPPIPHIPEISSPPKETSP 320
BLAST of Cp4.1LG01g22370 vs. NCBI nr
Match:
XP_022990149.1 (amyloid beta A4 precursor protein-binding family B member 1-interacting protein-like [Cucurbita maxima])
HSP 1 Score: 555 bits (1429), Expect = 3.94e-198
Identity = 283/303 (93.40%), Postives = 290/303 (95.71%), Query Frame = 0
Query: 1 MICLLILIALNFSLLDLSQARHHNNLPSAA--VVGTVFCDTCFQDTFSKTSHFISGATVA 60
MICLLILIALNFS LDLSQARHHNNLPSAA VVGTVFCDTCFQDTFSKTSHFISGATVA
Sbjct: 1 MICLLILIALNFSFLDLSQARHHNNLPSAAAAVVGTVFCDTCFQDTFSKTSHFISGATVA 60
Query: 61 VECGNGGSNPSFRDEVKTDKTGEFKIQLPVSVRKIEECYVRLIRSSEPYCAVAARAKSSS 120
VECGN GSNP+FR+EVKTDKTGEFKIQLPVSVRK+EECYVRLIRSSEPYCAVAARAKSSS
Sbjct: 61 VECGNEGSNPNFREEVKTDKTGEFKIQLPVSVRKVEECYVRLIRSSEPYCAVAARAKSSS 120
Query: 121 LKLKSRKQGMHVFSAGFFTFKPLKQPKLCSHNSHFNEFDDTKQVVDFPGLPAPIQNPTVP 180
L+LKS+KQG HVFSAGFFTFKPLKQPKLCSHNSH NEFDDTKQVVDFPGLPAPIQNPTVP
Sbjct: 121 LRLKSKKQGTHVFSAGFFTFKPLKQPKLCSHNSHSNEFDDTKQVVDFPGLPAPIQNPTVP 180
Query: 181 NVPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDENVQTPKISQNPDLF 240
NVPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLP+FPLFPPKKDDENVQTPKISQNPDLF
Sbjct: 181 NVPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPIFPLFPPKKDDENVQTPKISQNPDLF 240
Query: 241 HPQTLLPIPSLKPFRPHFVMPPHKLRHHPLTHGPTPPSAAA----AAGELAPSPPLPFSL 297
HPQTLLPIPSLKP RPHFVMPPHKLRHHPLTHGPTPP+AAA A GELAPSPPLPFSL
Sbjct: 241 HPQTLLPIPSLKPLRPHFVMPPHKLRHHPLTHGPTPPTAAALAAAAGGELAPSPPLPFSL 300
BLAST of Cp4.1LG01g22370 vs. NCBI nr
Match:
KAG7033221.1 (hypothetical protein SDJN02_07275 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 420 bits (1080), Expect = 2.99e-146
Identity = 215/222 (96.85%), Postives = 216/222 (97.30%), Query Frame = 0
Query: 1 MICLLILIALNFSLLDLSQARHHNNLPS-AAVVGTVFCDTCFQDTFSKTSHFISGATVAV 60
MICLLILIALNFS LDLSQARHHNNLPS AAVVGTVFCDTCFQDTFSKTSHFISGATVAV
Sbjct: 1 MICLLILIALNFSFLDLSQARHHNNLPSTAAVVGTVFCDTCFQDTFSKTSHFISGATVAV 60
Query: 61 ECGNGGSNPSFRDEVKTDKTGEFKIQLPVSVRKIEECYVRLIRSSEPYCAVAARAKSSSL 120
ECG+GG NPSFRDEVKTDKTGEFKIQLPVSVRKIEECYVRLIRSSEPYCAVAARAKSSSL
Sbjct: 61 ECGDGGWNPSFRDEVKTDKTGEFKIQLPVSVRKIEECYVRLIRSSEPYCAVAARAKSSSL 120
Query: 121 KLKSRKQGMHVFSAGFFTFKPLKQPKLCSHNSHFNEFDDTKQVVDFPGLPAPIQNPTVPN 180
KLKSRKQG HVFSAGFFTFKPLK PKLCSHNSH NEFDDTKQVVDFPGLPAPIQNPTVPN
Sbjct: 121 KLKSRKQGTHVFSAGFFTFKPLKHPKLCSHNSHSNEFDDTKQVVDFPGLPAPIQNPTVPN 180
Query: 181 VPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPVFPLFPPKK 221
VPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPVFPLFPPKK
Sbjct: 181 VPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPVFPLFPPKK 222
BLAST of Cp4.1LG01g22370 vs. ExPASy TrEMBL
Match:
A0A6J1HD47 (amyloid beta A4 precursor protein-binding family B member 1-interacting protein-like OS=Cucurbita moschata OX=3662 GN=LOC111463006 PE=4 SV=1)
HSP 1 Score: 568 bits (1463), Expect = 2.19e-203
Identity = 294/319 (92.16%), Postives = 300/319 (94.04%), Query Frame = 0
Query: 1 MICLLILIALNFSLLDLSQARHHNNLPS-AAVVGTVFCDTCFQDTFSKTSHFISGATVAV 60
MICLLILIALNFS LDLSQARHHNNLPS AAVVGTVFCDTCFQDTFSK+SHFISGATVAV
Sbjct: 1 MICLLILIALNFSFLDLSQARHHNNLPSTAAVVGTVFCDTCFQDTFSKSSHFISGATVAV 60
Query: 61 ECGNGGSNPSFRDEVKTDKTGEFKIQLPVSVRKIEECYVRLIRSSEPYCAVAARAKSSSL 120
ECG+GGSNPSFRDEVKTDKTGEFKIQLPVSVRKIEECYVRLIRSSEPYCAVAARAKSSSL
Sbjct: 61 ECGDGGSNPSFRDEVKTDKTGEFKIQLPVSVRKIEECYVRLIRSSEPYCAVAARAKSSSL 120
Query: 121 KLKSRKQGMHVFSAGFFTFKPLKQPKLCSHNSHFNEFDDTKQVVDFPGLPAPIQNPTVPN 180
KLKSRKQG HVFSAGFFTFKPLK PKLCSHNSH +EFDDTKQVVDFPGLPAPIQNPTVPN
Sbjct: 121 KLKSRKQGTHVFSAGFFTFKPLKHPKLCSHNSHSSEFDDTKQVVDFPGLPAPIQNPTVPN 180
Query: 181 VPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDENVQTPKISQNPDLFH 240
VPRIYDNLPPL LLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDENVQTPKISQNPD+FH
Sbjct: 181 VPRIYDNLPPLTLLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDENVQTPKISQNPDMFH 240
Query: 241 PQTLLPIPSLKPFRPHFVMPPHKLRHHPLTHGP------TPPSAAAAAGELAPSPPLPFS 300
PQTLLPIPSLKP RPHFVMPPHKLRHHPLTHGP TP +AAA ELAPSPPLPFS
Sbjct: 241 PQTLLPIPSLKPLRPHFVMPPHKLRHHPLTHGPFSPSFSTPTPPSAAADELAPSPPLPFS 300
Query: 301 LPSIPNMPEISSPPKQTSP 312
LP IP MPEISSPPK+TSP
Sbjct: 301 LPPIPRMPEISSPPKETSP 319
BLAST of Cp4.1LG01g22370 vs. ExPASy TrEMBL
Match:
A0A6J1JRA2 (amyloid beta A4 precursor protein-binding family B member 1-interacting protein-like OS=Cucurbita maxima OX=3661 GN=LOC111487127 PE=4 SV=1)
HSP 1 Score: 555 bits (1429), Expect = 1.91e-198
Identity = 283/303 (93.40%), Postives = 290/303 (95.71%), Query Frame = 0
Query: 1 MICLLILIALNFSLLDLSQARHHNNLPSAA--VVGTVFCDTCFQDTFSKTSHFISGATVA 60
MICLLILIALNFS LDLSQARHHNNLPSAA VVGTVFCDTCFQDTFSKTSHFISGATVA
Sbjct: 1 MICLLILIALNFSFLDLSQARHHNNLPSAAAAVVGTVFCDTCFQDTFSKTSHFISGATVA 60
Query: 61 VECGNGGSNPSFRDEVKTDKTGEFKIQLPVSVRKIEECYVRLIRSSEPYCAVAARAKSSS 120
VECGN GSNP+FR+EVKTDKTGEFKIQLPVSVRK+EECYVRLIRSSEPYCAVAARAKSSS
Sbjct: 61 VECGNEGSNPNFREEVKTDKTGEFKIQLPVSVRKVEECYVRLIRSSEPYCAVAARAKSSS 120
Query: 121 LKLKSRKQGMHVFSAGFFTFKPLKQPKLCSHNSHFNEFDDTKQVVDFPGLPAPIQNPTVP 180
L+LKS+KQG HVFSAGFFTFKPLKQPKLCSHNSH NEFDDTKQVVDFPGLPAPIQNPTVP
Sbjct: 121 LRLKSKKQGTHVFSAGFFTFKPLKQPKLCSHNSHSNEFDDTKQVVDFPGLPAPIQNPTVP 180
Query: 181 NVPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDENVQTPKISQNPDLF 240
NVPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLP+FPLFPPKKDDENVQTPKISQNPDLF
Sbjct: 181 NVPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPIFPLFPPKKDDENVQTPKISQNPDLF 240
Query: 241 HPQTLLPIPSLKPFRPHFVMPPHKLRHHPLTHGPTPPSAAA----AAGELAPSPPLPFSL 297
HPQTLLPIPSLKP RPHFVMPPHKLRHHPLTHGPTPP+AAA A GELAPSPPLPFSL
Sbjct: 241 HPQTLLPIPSLKPLRPHFVMPPHKLRHHPLTHGPTPPTAAALAAAAGGELAPSPPLPFSL 300
BLAST of Cp4.1LG01g22370 vs. ExPASy TrEMBL
Match:
A0A6J1BYU3 (proline-rich protein 4-like OS=Momordica charantia OX=3673 GN=LOC111006489 PE=4 SV=1)
HSP 1 Score: 358 bits (919), Expect = 1.54e-120
Identity = 210/337 (62.31%), Postives = 233/337 (69.14%), Query Frame = 0
Query: 1 MICLLILIALNFSLLDLSQARHHNNLPSAAVVGTVFCDTCFQDTFSKTSHFISGATVAVE 60
M+ LLIL+ LN S DLSQARHH NLPSA +VGTVFCDTC Q+ SKTS FISGATVAVE
Sbjct: 1 MVWLLILLILNLSFCDLSQARHHKNLPSAVIVGTVFCDTCSQEKLSKTSRFISGATVAVE 60
Query: 61 CGNGGSNPSFRDEVKTDKTGEFKIQLPVSVRK----IEECYVRLIRSSEPYCAVAARAKS 120
CGNGG PSFR+EVKTDK GEFK+ LPVSV + IE C V LIRSSE YCAVAA A S
Sbjct: 61 CGNGGPKPSFREEVKTDKRGEFKVDLPVSVSRHVKTIEGCSVNLIRSSEAYCAVAAAATS 120
Query: 121 SSLKLKSRKQGMHVFSAGFFTFKPLKQPKLCSHNSHFNEFDDTKQ---VVDFPGLPAPIQ 180
SS +LKSR QG H FSAGFFTFKPLK P +C+ + N F D KQ ++D+P LP PI+
Sbjct: 121 SSFELKSRNQGTHFFSAGFFTFKPLKHPNICTQKPYSNTFHDMKQALPMLDYPALPTPIE 180
Query: 181 NPT-VPNVPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDENVQTPKIS 240
NPT VPNVPRIYDNLPPLP LP LPPLPQLPPLPPLPPLP FP+FPPKK EN K
Sbjct: 181 NPTIVPNVPRIYDNLPPLPFLPRLPPLPQLPPLPPLPPLPGFPIFPPKKTVENAPNGK-- 240
Query: 241 QNPDLFHPQTLLPIPSLKPFRPHFVMPPHKLRHHPLTHG---------PTPPSAAAA--- 300
TLLP K RPHFV+PPH+L+H PL P PP AAA
Sbjct: 241 ---------TLLP--HKKHLRPHFVLPPHRLQHPPLFPNIPSLLPFEIPPPPVAAAVGAV 300
Query: 301 AGELAPSPP-----LPFSLPSIPNMPEISSPPKQTSP 312
AG PSPP PF +P +P +P I SPP+QTSP
Sbjct: 301 AGGSVPSPPSPTSPTPFPIPPVPGLPGIPSPPRQTSP 324
BLAST of Cp4.1LG01g22370 vs. ExPASy TrEMBL
Match:
A0A5A7T850 (Major pollen allergen Lol p 11 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold169G002260 PE=4 SV=1)
HSP 1 Score: 357 bits (917), Expect = 2.20e-120
Identity = 208/336 (61.90%), Postives = 234/336 (69.64%), Query Frame = 0
Query: 1 MICLLILIALNFSLLDLSQARHHNNLPSAAVVGTVFCDTCFQDTFSKTSHFISGATVAVE 60
MI LLIL+ LNFS DLS+ARHH LPSA ++GTVFCDTCFQ+ FSKTSHFISGATVAVE
Sbjct: 1 MIWLLILLLLNFSFFDLSEARHHRKLPSAVIIGTVFCDTCFQEKFSKTSHFISGATVAVE 60
Query: 61 CGNGGSNPSFRDEVKTDKTGEFKIQLPV----SVRKIEECYVRLIRSSEPYCAVAARAKS 120
CGN G PSFR+EVKTDK GEFK+ LPV V KIEECYV I+SSEPYC VAA AKS
Sbjct: 61 CGNRGRKPSFREEVKTDKRGEFKVNLPVLVSKHVEKIEECYVESIKSSEPYCDVAATAKS 120
Query: 121 SSLKLKSRKQGMHVFSAGFFTFKPLKQPKLCSHN-SHFNEFDDTKQVV--------DFPG 180
SSL+LKS+KQ H FSAGFFTFKPLKQP LC+ + N FDD K+++ D P
Sbjct: 121 SSLQLKSKKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIIQLPPPPPFDSPN 180
Query: 181 LPAPIQNPTVPNVPRIYDNLPPLPLLPGL------PPLPQLPPLPPLPPLPVFPLFPPKK 240
LP+PIQNPTVPN PRIYDNLPPLPLLPGL PPLP LPPLPPLPPLP FP+FPPK
Sbjct: 181 LPSPIQNPTVPNAPRIYDNLPPLPLLPGLLPLPPLPPLPPLPPLPPLPPLPKFPIFPPKA 240
Query: 241 DDEN---VQTPKISQNPDLFHPQTLLPIPSLKPFR-PH-FVMPPHKLRHHPLTHGPTPPS 300
+DE ++TP S+ D F PIP +KP R PH FV+PP +L HHP PP
Sbjct: 241 NDEKNAPIETPNTSEKLDKF------PIPPIKPLRRPHYFVLPPQRLHHHPQP----PPH 300
Query: 301 AAAAAGELAPSPPLPFSLPSIPNMPEISSPPKQTSP 312
A GE IPN+ ISSP K+TSP
Sbjct: 301 VAVIGGE------------PIPNLSNISSPQKKTSP 314
BLAST of Cp4.1LG01g22370 vs. ExPASy TrEMBL
Match:
A0A0A0KXM2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G642130 PE=4 SV=1)
HSP 1 Score: 341 bits (875), Expect = 3.24e-114
Identity = 190/287 (66.20%), Postives = 214/287 (74.56%), Query Frame = 0
Query: 4 LLILIALNFSLLDLSQARHHNNLPSAAVVGTVFCDTCFQDTFSKTSHFISGATVAVECGN 63
LLIL+ LNFS DLS+ARHH LPSA VVGTVFCDTC+Q+ FSKTSHFISGATVAVECGN
Sbjct: 4 LLILLLLNFSFFDLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVECGN 63
Query: 64 GGSNPSFRDEVKTDKTGEFKIQLPV----SVRKIEECYVRLIRSSEPYCAVAARAKSSSL 123
G PSFR+EVKTDK GEFK+ LPV V+KIEECYV L++SSEPYC VAA AKSSSL
Sbjct: 64 KGPEPSFREEVKTDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKSSSL 123
Query: 124 KLKSRKQGMHVFSAGFFTFKPLKQPKLCSHN-SHFNEFDDTKQV-------VDFPGLPAP 183
+LKSRKQ H FSAGFFTFKPLKQP LC+ + N FDD K++ D P LP+P
Sbjct: 124 QLKSRKQNTHTFSAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNLPSP 183
Query: 184 IQNPTVPNVPRIYDNLPPLPLLPGLPPLPQLPPLPPLPPLPV------FPLFPPK-KDDE 243
IQ PTVP+ PRIYDNLPPLPLLPGL PLPQLPPLPPLPPLP FP+FPPK KD++
Sbjct: 184 IQIPTVPSAPRIYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKEKDEK 243
Query: 244 NV--QTPKISQNPDLFHPQTLLPIPSLKPFRP--HFVMPPHKLRHHP 267
N +TP S+ D F PIP +KP R HFV+PP +L HHP
Sbjct: 244 NAPNETPNTSEKLDKF------PIPPIKPLRKPHHFVLPPQRLHHHP 284
BLAST of Cp4.1LG01g22370 vs. TAIR 10
Match:
AT5G15780.1 (Pollen Ole e 1 allergen and extensin family protein )
HSP 1 Score: 178.3 bits (451), Expect = 9.4e-45
Identity = 140/322 (43.48%), Postives = 177/322 (54.97%), Query Frame = 0
Query: 17 LSQARHH---NNLPSAAVVGTVFCDTCFQDTFSKT-SHFISGATVAVECGNGGSNPSFRD 76
LSQ + H SA VVGTV+CDTCF FSK+ +H ISGA VAVEC + S PSFR
Sbjct: 25 LSQGQQHVMKKTRSSAVVVGTVYCDTCFNGAFSKSPNHLISGALVAVECIDENSKPSFRQ 84
Query: 77 EVKTDKTGEFKIQLPVS----VRKIEECYVRLIRSSEPYCAVAARAKSSSLK-LKSRKQG 136
EVKTDK GEFK++LP S V+KI+ C V+L+ SS+PYC++A+ A SSSLK LKS G
Sbjct: 85 EVKTDKRGEFKVKLPFSVSKHVKKIKRCSVKLLSSSQPYCSIASSATSSSLKRLKSNHHG 144
Query: 137 --MHVFSAGFFTFKPLKQPKLCSHNSHFNEFDDTKQVVDFPGLPAPIQNPTVPNVPRIYD 196
VFSAGFFTF+P QP++CS +K ++ P P P+Q+P P+
Sbjct: 145 ENTRVFSAGFFTFRPENQPEICSQKP--INLRGSKPLLPDPSFPPPLQDPPNPS------ 204
Query: 197 NLPPLPLLPGLPPLPQLP----PLPPLPPLPVFPLFPP----------KKDDENVQTPKI 256
PLP LP +PPLP LP P+P LP V PL PP KK D
Sbjct: 205 ---PLPNLPIVPPLPNLPVPKLPVPDLPLPLVPPLLPPGPQKSASLHNKKSDSLKDKKTE 264
Query: 257 SQNPDLFHPQTLLPIPSLKPFRPHF-VMPPHKLRHHPLTHGPTPPSAAAAAGELAPSPPL 313
+ P+ F P L PS+ P P +P L +PL P+PPS L P+PP
Sbjct: 265 ALKPNFFFPPNPLNPPSIIPPNPLIPSIPTPTLPPNPLI--PSPPSLPPI--PLIPTPPT 324
BLAST of Cp4.1LG01g22370 vs. TAIR 10
Match:
AT5G13140.1 (Pollen Ole e 1 allergen and extensin family protein )
HSP 1 Score: 59.7 bits (143), Expect = 4.9e-09
Identity = 61/222 (27.48%), Postives = 101/222 (45.50%), Query Frame = 0
Query: 31 VVGTVFCDTCFQDTFSKTSHFISGATVAVECGNGGSNPSFRDEVK------TDKTGEFKI 90
VVG V+CDTC +TFS+ S+F+ G V V C S+P +EV T+++G +K+
Sbjct: 41 VVGVVYCDTCSINTFSRQSYFLQGVEVHVTCRFKASSPKTAEEVNISVNRTTNRSGVYKL 100
Query: 91 QLP--------VSVRKIEECYVRLIRSSEP---YCAVAA-RAKSSSLKLKSRKQGMHVFS 150
++P + +C +++++S C++ + ++ + +KS++ + ++S
Sbjct: 101 EIPHVDGIDCVDGIAISSQCSAKILKTSSDDNGGCSIPVFQTATNEVSIKSKQDRVCIYS 160
Query: 151 AGFFTFK-PLKQPKLCSHNSHFNEFDDTKQVVDFPGLPAPIQNPTVPNVPRIYDNLPPLP 210
++K P K LC + + D K F P Y +LPPLP
Sbjct: 161 LSALSYKPPHKNTSLCGNGGKKHHRKDEKVEKKFRDSKFFWPYLAPYWFPWPYPDLPPLP 220
Query: 211 LLPGLP--PLPQLP------PLP------PLPPLPVFPLFPP 220
LP P P P LP LP P+ +P P FPP
Sbjct: 221 TLPPFPSFPFPSLPFGNPNLALPAFDWKNPVTWIPYLPRFPP 262
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_023528636.1 | 4.39e-222 | 100.00 | proline-rich protein 4-like [Cucurbita pepo subsp. pepo] | [more] |
XP_022962607.1 | 4.52e-203 | 92.16 | amyloid beta A4 precursor protein-binding family B member 1-interacting protein-... | [more] |
KAG6602543.1 | 6.66e-203 | 92.21 | hypothetical protein SDJN03_07776, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_022990149.1 | 3.94e-198 | 93.40 | amyloid beta A4 precursor protein-binding family B member 1-interacting protein-... | [more] |
KAG7033221.1 | 2.99e-146 | 96.85 | hypothetical protein SDJN02_07275 [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1HD47 | 2.19e-203 | 92.16 | amyloid beta A4 precursor protein-binding family B member 1-interacting protein-... | [more] |
A0A6J1JRA2 | 1.91e-198 | 93.40 | amyloid beta A4 precursor protein-binding family B member 1-interacting protein-... | [more] |
A0A6J1BYU3 | 1.54e-120 | 62.31 | proline-rich protein 4-like OS=Momordica charantia OX=3673 GN=LOC111006489 PE=4 ... | [more] |
A0A5A7T850 | 2.20e-120 | 61.90 | Major pollen allergen Lol p 11 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_s... | [more] |
A0A0A0KXM2 | 3.24e-114 | 66.20 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G642130 PE=4 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
AT5G15780.1 | 9.4e-45 | 43.48 | Pollen Ole e 1 allergen and extensin family protein | [more] |
AT5G13140.1 | 4.9e-09 | 27.48 | Pollen Ole e 1 allergen and extensin family protein | [more] |