Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAGAAATTCAATCGCAAATCGGAGTCCACTGGATTGAGGAAGTTTAAGAGATTGAGAAATTGTTCATTTACCTACGAACGCCGGCGGCTGCGGCGCGAGGGGCGGAGTTCGAACAGTGGCGTCGTGCGGTTACAGGAATCGAACTTCGAATTCGAGTCACGAGATGCCGAGGAAAGATCGAAGAGAGAGGCGACTGAGTATCAATTGAGTGGTGCAGAATGGTGTGCCTTGCGAGGATGATTCAAGAGGGGGCGATTGATTGAGACTTGAGAGTGAGATGAATTTCGTTCGCGGCGGGTTTCTTCTCCAAAACGAAATACAAAAAAATTTACGGCGGGAGAGAGGTGGTTTTTTATTATTATTATTTTTTATATAATATTAACAAAAATATAGTAAGGGGAATTTCAACGGTTAACAATTTATATTGCGGGTACAGTAACAAAATTGTGTAATTTCTTTTTTACCTTTGTATATCATTCTACCAGTGATATTTTATATATTGCTAGTAGTGATAGAACGGATAGATATTTTAAGATAATTATAGTCTTATTTTATGGTGGATTTGGGTTTTTTTTTAACTATATTTTCAATTAATTGAGTTTGATCATATATATATGTAGGTGTCTTATATATTAATTTTAGGTAATGCTTATAATTATATGTATATATATTTGTCAGGTGTTTTTTTAGAACGTAATTTTTTAATACAATATATGAGGTGGAAATTAAATTCACAACCTTTTGACTGAAAATATATGTGAATTATTGTTGAACTATACTCGAGTTCTATTTGTCAATTGTTATTATTTGCAACATTCAACTGTAAAAATCACTATCAAAAAATAAATAAATTGTTGAAGTCATATATTCTGACCAAAAAAGCGTGGATTTGAATCCTCACTTCATGTTGTAAAAAAAATATAAATATTTATGTTTTAAAATCACATATTAAAAGTGGGTTTACTCAACCTACTTTATTTTTTTTAGTACAACTTTATTTTTTTTTAGTACAACACGTAGGGTGAGTAGATTTGAACTTATGACCTTCTAGTCGGATATAAGTACACTATGTTCAATCTAACTCAATCTATCTTATTTTTAAAAAATAAAAACAAAATATTTTTTTTCCCTTTCAAATGCCTACCAAAATATGAAACATAGTGGGCCTTGAGAATAAGTGATTTGCTTGCATTTGACAGTGATATGGTTCTCTAGACAAATCTCTGACTTCCTTTTGTTATTTTTTGGTTTGTCATACATATTATTTTTTGCATGACACTCTTTTTATGATAAAGTGGTGTTAGTTTGACATCAAAATTGAATTAAATTATTTACGAACACAAATTTGGATTCATTTGTTCACAAATTTTGAGGCAAGTATTTTTCGGGAGACTGTCTCGAACTCTCAAATACTATCTTACTTAGATACTCAACATGTTTTATTCTCAAATTAAGCAAACGATGTTTCATTATTCACTACTTATGAACGCGAACTTTGTACTTATCTATCTACGACCACCATAAAATTCACGAGTTAAACTAAGCATGGTCCAACCAAAATGACTTTGAAGAATGAATAGCCGGAGACAAATTCGAAATATCAAATACTTTAAAAATATAAATCTACTTGAAAGTTGAAAGCAATTTTTACTTTAAAAGGCCAATGGATGATTTGATTATGCACCAAGAAGTTAACTTTAAAGATATATATAGTACAAACAAACAAAATGCCACCCTTAATTTTTGTAACCAATTAAAATATGCAAATATGAGAAGTAAAATTAAAATAATTTGGGGGAAGGGAGAGAAAATCCATTGGAGAATGAAGTGAATAAAGAGGGAATTCTGTGTTTGTAGTTGAAGTGCTGTTTATTGTCACAGATAAGTCACCAAAATAGAGCCTTTGGCCCTTTTCTCTCTGTTTTGCTGCTCCATCTCGTGAACTAACACTGACCCAACTGCCCCAAATTTGGCCATATCCCACACCAATAAAGAACCAAATTGAAATTAAATGCCAATTAAAAAACAATGGTACAAATAAAGACCCGAAGAAAAATACTCAATGTTTATCGATCAACATCTATCTTATCCCAACCAATATTAAATCTTCGATAAGTAGGACCAAAAAACGCAGAAAAGAGATAGTTTTTCTATTTTAAAACAAGTTAAAAAACCTATCAATTTTTTACTCGTAGTCGGCACGAAGATGAGTGCTGAACTATTAGAAATTAGTCAACCATCTATTCATTTTTATGGCCTCTTTAGTTGGTGGGAAAATGAACCATTCACTGCTGTTTTGGTTTGCAGTTCACTGGTTTTTGCATGATTCACATTATGAACCTTGGTGGAGTTTTGTGATATAAATTCCCATTGGTTTCTCATTTTTTCTTTACCTCAAGATGGGAACAGAACCAGTGGGATCTTCAATGGTTTGGCTTCTCATTTTACTCATTCTCAATTTGAGTTTCTGTGACCTTTCACAGGCCAGGCATCACAAGAACCTACCCTCCGCTGTCATCGTCGGTACGGTCTTTTGTGATACCTGCTCTCAAGAGAAGCTCTCAAAGACTAGTCGCTTCATCTCAGGTAGTTGGAAAGTCTAAATTTGTGTAATGAGCATAATATAAAGTGAGAAAGCTGTATTCTTTCTAAAATCACGGATCTCTGAATAGTGATTCGGAACGCTGACAACCTTAAACTTTAAAGTGAAGGGTCGTGACTAAACATGGTTTTATACGTTTGTAGGGGCAACAGTTGCTGTCGAATGTGGAAACGGAGGACCAAAACCGAGTTTTAGGGAAGAAGTGAAGACAGACAAGAGAGGGGAGTTCAAGGTTGATCTGCCAGTTTCAGTGAGCAGACATGTGAAGACGATTGAGGGATGTTCTGTGAATTTGATTAGAAGCAGCGAGGCATATTGTGCAGTGGCTGCAGCAGCAACATCATCTTCCTTTGAACTCAAATCAAGAAATCAAGGCACACATTTTTTCTCAGCTGGATTCTTCACTTTCAAGCCCCTCAAACACCCAAACATTTGCACCCAAAAGCCATATTCCAACACATTCCATGACATGAAACAAGCTCTCCCGATGCTCGACTACCCGGCTCTGCCGACCCCGATCGAGAACCCGACGATCGTGCCGAATGTTCCTCGGATTTACGACAATCTTCCTCCCCTCCCATTTCTTCCTAGACTCCCACCATTGCCTCAACTCCCTCCTCTCCCACCTCTCCCACCACTCCCAGGCTTTCCAATCTTTCCACCTAAAAAGACTGTAGAAAATGCACCAAATGGAAAGACTCTGCTCCCACACAAAAAGCATTTGAGGCCACATTTTGTTCTGCCTCCACACAGGCTGCAACACCCTCCTCTCTTCCCCAATATACCTTCTCTGCTTCCATTTGAAATTCCTCCTCCGCCCGTGGCAGCGGCAGTGGGGGCGGTGGCGGGTGGCTCGGTCCCGTCCCCGCCATCCCCGACCTCGCCGACCCCTTTTCCTATCCCTCCTGTTCCTGGCCTGCCTGGGATTCCCTCGCCTCCTAGGCAAACTTCTCCTTGA
mRNA sequence
ATGAAGAAATTCAATCGCAAATCGGAGTCCACTGGATTGAGGAAGTTTAAGAGATTGAGAAATTGTTCATTTACCTACGAACGCCGGCGGCTGCGGCGCGAGGGGCGGAGTTCGAACAGTGGCGTCGTGCGGTTACAGGAATCGAACTTCGAATTCGAGTCACGAGATGCCGAGGAAAGATCGAAGAGAGAGGCGACTGAGTATCAATTGAGTGGTGCAGAATGTTCACTGGCCAGGCATCACAAGAACCTACCCTCCGCTGTCATCGTCGGTACGGTCTTTTGTGATACCTGCTCTCAAGAGAAGCTCTCAAAGACTAGTCGCTTCATCTCAGGGGCAACAGTTGCTGTCGAATGTGGAAACGGAGGACCAAAACCGAGTTTTAGGGAAGAAGTGAAGACAGACAAGAGAGGGGAGTTCAAGGTTGATCTGCCAGTTTCAGTGAGCAGACATGTGAAGACGATTGAGGGATGTTCTGTGAATTTGATTAGAAGCAGCGAGGCATATTGTGCAGTGGCTGCAGCAGCAACATCATCTTCCTTTGAACTCAAATCAAGAAATCAAGGCACACATTTTTTCTCAGCTGGATTCTTCACTTTCAAGCCCCTCAAACACCCAAACATTTGCACCCAAAAGCCATATTCCAACACATTCCATGACATGAAACAAGCTCTCCCGATGCTCGACTACCCGGCTCTGCCGACCCCGATCGAGAACCCGACGATCGTGCCGAATGTTCCTCGGATTTACGACAATCTTCCTCCCCTCCCATTTCTTCCTAGACTCCCACCATTGCCTCAACTCCCTCCTCTCCCACCTCTCCCACCACTCCCAGGCTTTCCAATCTTTCCACCTAAAAAGACTGTAGAAAATGCACCAAATGGAAAGACTCTGCTCCCACACAAAAAGCATTTGAGGCCACATTTTGTTCTGCCTCCACACAGGCTGCAACACCCTCCTCTCTTCCCCAATATACCTTCTCTGCTTCCATTTGAAATTCCTCCTCCGCCCGTGGCAGCGGCAGTGGGGGCGGTGGCGGGTGGCTCGGTCCCGTCCCCGCCATCCCCGACCTCGCCGACCCCTTTTCCTATCCCTCCTGTTCCTGGCCTGCCTGGGATTCCCTCGCCTCCTAGGCAAACTTCTCCTTGA
Coding sequence (CDS)
ATGAAGAAATTCAATCGCAAATCGGAGTCCACTGGATTGAGGAAGTTTAAGAGATTGAGAAATTGTTCATTTACCTACGAACGCCGGCGGCTGCGGCGCGAGGGGCGGAGTTCGAACAGTGGCGTCGTGCGGTTACAGGAATCGAACTTCGAATTCGAGTCACGAGATGCCGAGGAAAGATCGAAGAGAGAGGCGACTGAGTATCAATTGAGTGGTGCAGAATGTTCACTGGCCAGGCATCACAAGAACCTACCCTCCGCTGTCATCGTCGGTACGGTCTTTTGTGATACCTGCTCTCAAGAGAAGCTCTCAAAGACTAGTCGCTTCATCTCAGGGGCAACAGTTGCTGTCGAATGTGGAAACGGAGGACCAAAACCGAGTTTTAGGGAAGAAGTGAAGACAGACAAGAGAGGGGAGTTCAAGGTTGATCTGCCAGTTTCAGTGAGCAGACATGTGAAGACGATTGAGGGATGTTCTGTGAATTTGATTAGAAGCAGCGAGGCATATTGTGCAGTGGCTGCAGCAGCAACATCATCTTCCTTTGAACTCAAATCAAGAAATCAAGGCACACATTTTTTCTCAGCTGGATTCTTCACTTTCAAGCCCCTCAAACACCCAAACATTTGCACCCAAAAGCCATATTCCAACACATTCCATGACATGAAACAAGCTCTCCCGATGCTCGACTACCCGGCTCTGCCGACCCCGATCGAGAACCCGACGATCGTGCCGAATGTTCCTCGGATTTACGACAATCTTCCTCCCCTCCCATTTCTTCCTAGACTCCCACCATTGCCTCAACTCCCTCCTCTCCCACCTCTCCCACCACTCCCAGGCTTTCCAATCTTTCCACCTAAAAAGACTGTAGAAAATGCACCAAATGGAAAGACTCTGCTCCCACACAAAAAGCATTTGAGGCCACATTTTGTTCTGCCTCCACACAGGCTGCAACACCCTCCTCTCTTCCCCAATATACCTTCTCTGCTTCCATTTGAAATTCCTCCTCCGCCCGTGGCAGCGGCAGTGGGGGCGGTGGCGGGTGGCTCGGTCCCGTCCCCGCCATCCCCGACCTCGCCGACCCCTTTTCCTATCCCTCCTGTTCCTGGCCTGCCTGGGATTCCCTCGCCTCCTAGGCAAACTTCTCCTTGA
Protein sequence
MKKFNRKSESTGLRKFKRLRNCSFTYERRRLRREGRSSNSGVVRLQESNFEFESRDAEERSKREATEYQLSGAECSLARHHKNLPSAVIVGTVFCDTCSQEKLSKTSRFISGATVAVECGNGGPKPSFREEVKTDKRGEFKVDLPVSVSRHVKTIEGCSVNLIRSSEAYCAVAAAATSSSFELKSRNQGTHFFSAGFFTFKPLKHPNICTQKPYSNTFHDMKQALPMLDYPALPTPIENPTIVPNVPRIYDNLPPLPFLPRLPPLPQLPPLPPLPPLPGFPIFPPKKTVENAPNGKTLLPHKKHLRPHFVLPPHRLQHPPLFPNIPSLLPFEIPPPPVAAAVGAVAGGSVPSPPSPTSPTPFPIPPVPGLPGIPSPPRQTSP
Homology
BLAST of Moc04g37100 vs. NCBI nr
Match:
XP_022134152.1 (proline-rich protein 4-like [Momordica charantia])
HSP 1 Score: 605.1 bits (1559), Expect = 4.1e-169
Identity = 308/313 (98.40%), Postives = 309/313 (98.72%), Query Frame = 0
Query: 70 LSGAECSLARHHKNLPSAVIVGTVFCDTCSQEKLSKTSRFISGATVAVECGNGGPKPSFR 129
LS + S ARHHKNLPSAVIVGTVFCDTCSQEKLSKTSRFISGATVAVECGNGGPKPSFR
Sbjct: 12 LSFCDLSQARHHKNLPSAVIVGTVFCDTCSQEKLSKTSRFISGATVAVECGNGGPKPSFR 71
Query: 130 EEVKTDKRGEFKVDLPVSVSRHVKTIEGCSVNLIRSSEAYCAVAAAATSSSFELKSRNQG 189
EEVKTDKRGEFKVDLPVSVSRHVKTIEGCSVNLIRSSEAYCAVAAAATSSSFELKSRNQG
Sbjct: 72 EEVKTDKRGEFKVDLPVSVSRHVKTIEGCSVNLIRSSEAYCAVAAAATSSSFELKSRNQG 131
Query: 190 THFFSAGFFTFKPLKHPNICTQKPYSNTFHDMKQALPMLDYPALPTPIENPTIVPNVPRI 249
THFFSAGFFTFKPLKHPNICTQKPYSNTFHDMKQALPMLDYPALPTPIENPTIVPNVPRI
Sbjct: 132 THFFSAGFFTFKPLKHPNICTQKPYSNTFHDMKQALPMLDYPALPTPIENPTIVPNVPRI 191
Query: 250 YDNLPPLPFLPRLPPLPQLPPLPPLPPLPGFPIFPPKKTVENAPNGKTLLPHKKHLRPHF 309
YDNLPPLPFLPRLPPLPQLPPLPPLPPLPGFPIFPPKKTVENAPNGKTLLPHKKHLRPHF
Sbjct: 192 YDNLPPLPFLPRLPPLPQLPPLPPLPPLPGFPIFPPKKTVENAPNGKTLLPHKKHLRPHF 251
Query: 310 VLPPHRLQHPPLFPNIPSLLPFEIPPPPVAAAVGAVAGGSVPSPPSPTSPTPFPIPPVPG 369
VLPPHRLQHPPLFPNIPSLLPFEIPPPPVAAAVGAVAGGSVPSPPSPTSPTPFPIPPVPG
Sbjct: 252 VLPPHRLQHPPLFPNIPSLLPFEIPPPPVAAAVGAVAGGSVPSPPSPTSPTPFPIPPVPG 311
Query: 370 LPGIPSPPRQTSP 383
LPGIPSPPRQTSP
Sbjct: 312 LPGIPSPPRQTSP 324
BLAST of Moc04g37100 vs. NCBI nr
Match:
KAG6602543.1 (hypothetical protein SDJN03_07776, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 337.8 bits (865), Expect = 1.2e-88
Identity = 199/323 (61.61%), Postives = 227/323 (70.28%), Query Frame = 0
Query: 74 ECSLARHHKNLPS-AVIVGTVFCDTCSQEKLSKTSRFISGATVAVECGNGGPKPSFREEV 133
+ S ARHH NLPS A +VGTVFCDTC Q+ SKTS FISGATVAVECG+GG PSFR+EV
Sbjct: 16 DLSQARHHNNLPSTAAVVGTVFCDTCFQDTFSKTSHFISGATVAVECGDGGSNPSFRDEV 75
Query: 134 KTDKRGEFKVDLPVSVSRHVKTIEGCSVNLIRSSEAYCAVAAAATSSSFELKSRNQGTHF 193
KTDK GEFK+ LPVS V+ IE C V LIRSSE YCAVAA A SSS +LKSR QGTH
Sbjct: 76 KTDKTGEFKIQLPVS----VRKIEECYVRLIRSSEPYCAVAARAKSSSLKLKSRKQGTHV 135
Query: 194 FSAGFFTFKPLKHPNICTQKPYSNTFHDMKQALPMLDYPALPTPIENPTIVPNVPRIYDN 253
FSAGFFTFKPLKHP +C+ +SN F D KQ ++++P LP PI+NPT VPNVPRIYDN
Sbjct: 136 FSAGFFTFKPLKHPKLCSHNSHSNEFDDTKQ---VVEFPGLPAPIQNPT-VPNVPRIYDN 195
Query: 254 LPPLPFLPRLPPLPQLPPLPPLPPLPGFPIFPPKKTVENAPNGK-----------TLL-- 313
LPPLP LP LPPLPQLPPLPPLPPLP FP+FPPKK EN K TLL
Sbjct: 196 LPPLPLLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDENVQTPKISQNPDMFHPQTLLPI 255
Query: 314 PHKKHLRPHFVLPPHRLQHPPLFPNIPSLLPFEIPPPPVAAAVGAVAGGSVPSPPSPTSP 373
P K LRPHFV+PPH+L+H L + P F P PP AAA +A P+ P
Sbjct: 256 PSLKPLRPHFVMPPHKLRH-HLLTHGPFSPSFSTPTPPSAAAGDELA---------PSPP 315
Query: 374 TPFPIPPVPGLPGIPSPPRQTSP 383
PF +PP+P +P I SPP++TSP
Sbjct: 316 LPFSLPPIPHIPEISSPPKETSP 320
BLAST of Moc04g37100 vs. NCBI nr
Match:
XP_023528636.1 (proline-rich protein 4-like [Cucurbita pepo subsp. pepo])
HSP 1 Score: 335.1 bits (858), Expect = 7.9e-88
Identity = 199/322 (61.80%), Postives = 221/322 (68.63%), Query Frame = 0
Query: 74 ECSLARHHKNLPSAVIVGTVFCDTCSQEKLSKTSRFISGATVAVECGNGGPKPSFREEVK 133
+ S ARHH NLPSA +VGTVFCDTC Q+ SKTS FISGATVAVECGNGG PSFR+EVK
Sbjct: 16 DLSQARHHNNLPSAAVVGTVFCDTCFQDTFSKTSHFISGATVAVECGNGGSNPSFRDEVK 75
Query: 134 TDKRGEFKVDLPVSVSRHVKTIEGCSVNLIRSSEAYCAVAAAATSSSFELKSRNQGTHFF 193
TDK GEFK+ LPVS V+ IE C V LIRSSE YCAVAA A SSS +LKSR QG H F
Sbjct: 76 TDKTGEFKIQLPVS----VRKIEECYVRLIRSSEPYCAVAARAKSSSLKLKSRKQGMHVF 135
Query: 194 SAGFFTFKPLKHPNICTQKPYSNTFHDMKQALPMLDYPALPTPIENPTIVPNVPRIYDNL 253
SAGFFTFKPLK P +C+ + N F D KQ ++D+P LP PI+NPT VPNVPRIYDNL
Sbjct: 136 SAGFFTFKPLKQPKLCSHNSHFNEFDDTKQ---VVDFPGLPAPIQNPT-VPNVPRIYDNL 195
Query: 254 PPLPFLPRLPPLPQLPPLPPLPPLPGFPIFPPKKTVENAPNGK-----------TLL--P 313
PPLP LP LPPLPQLPPLPPLPPLP FP+FPPKK EN K TLL P
Sbjct: 196 PPLPLLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDENVQTPKISQNPDLFHPQTLLPIP 255
Query: 314 HKKHLRPHFVLPPHRLQHPPLFPNIPSLLPFEIPPPPVAAAVGAVAGGSVPSPPSPTSPT 373
K RPHFV+PPH+L+H PL P PP AA A AG PSP P
Sbjct: 256 SLKPFRPHFVMPPHKLRHHPLTHG---------PTPPSAA---AAAGELAPSP-----PL 312
Query: 374 PFPIPPVPGLPGIPSPPRQTSP 383
PF +P +P +P I SPP+QTSP
Sbjct: 316 PFSLPSIPNMPEISSPPKQTSP 312
BLAST of Moc04g37100 vs. NCBI nr
Match:
XP_022962607.1 (amyloid beta A4 precursor protein-binding family B member 1-interacting protein-like [Cucurbita moschata])
HSP 1 Score: 332.4 bits (851), Expect = 5.1e-87
Identity = 199/325 (61.23%), Postives = 227/325 (69.85%), Query Frame = 0
Query: 74 ECSLARHHKNLPS-AVIVGTVFCDTCSQEKLSKTSRFISGATVAVECGNGGPKPSFREEV 133
+ S ARHH NLPS A +VGTVFCDTC Q+ SK+S FISGATVAVECG+GG PSFR+EV
Sbjct: 16 DLSQARHHNNLPSTAAVVGTVFCDTCFQDTFSKSSHFISGATVAVECGDGGSNPSFRDEV 75
Query: 134 KTDKRGEFKVDLPVSVSRHVKTIEGCSVNLIRSSEAYCAVAAAATSSSFELKSRNQGTHF 193
KTDK GEFK+ LPVS V+ IE C V LIRSSE YCAVAA A SSS +LKSR QGTH
Sbjct: 76 KTDKTGEFKIQLPVS----VRKIEECYVRLIRSSEPYCAVAARAKSSSLKLKSRKQGTHV 135
Query: 194 FSAGFFTFKPLKHPNICTQKPYSNTFHDMKQALPMLDYPALPTPIENPTIVPNVPRIYDN 253
FSAGFFTFKPLKHP +C+ +S+ F D KQ ++D+P LP PI+NPT VPNVPRIYDN
Sbjct: 136 FSAGFFTFKPLKHPKLCSHNSHSSEFDDTKQ---VVDFPGLPAPIQNPT-VPNVPRIYDN 195
Query: 254 LPPLPFLPRLPPLPQLPPLPPLPPLPGFPIFPPKKTVENAPNGK-----------TLL-- 313
LPPL LP LPPLPQLPPLPPLPPLP FP+FPPKK EN K TLL
Sbjct: 196 LPPLTLLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDENVQTPKISQNPDMFHPQTLLPI 255
Query: 314 PHKKHLRPHFVLPPHRLQHPPLF--PNIPSLLPFEIPPPPVAAAVGAVAGGSVPSPPSPT 373
P K LRPHFV+PPH+L+H PL P PS F P PP AAA +P+
Sbjct: 256 PSLKPLRPHFVMPPHKLRHHPLTHGPFSPS---FSTPTPPSAAA----------DELAPS 315
Query: 374 SPTPFPIPPVPGLPGIPSPPRQTSP 383
P PF +PP+P +P I SPP++TSP
Sbjct: 316 PPLPFSLPPIPRMPEISSPPKETSP 319
BLAST of Moc04g37100 vs. NCBI nr
Match:
XP_022990149.1 (amyloid beta A4 precursor protein-binding family B member 1-interacting protein-like [Cucurbita maxima])
HSP 1 Score: 327.0 bits (837), Expect = 2.1e-85
Identity = 190/309 (61.49%), Postives = 215/309 (69.58%), Query Frame = 0
Query: 74 ECSLARHHKNLPS--AVIVGTVFCDTCSQEKLSKTSRFISGATVAVECGNGGPKPSFREE 133
+ S ARHH NLPS A +VGTVFCDTC Q+ SKTS FISGATVAVECGN G P+FREE
Sbjct: 16 DLSQARHHNNLPSAAAAVVGTVFCDTCFQDTFSKTSHFISGATVAVECGNEGSNPNFREE 75
Query: 134 VKTDKRGEFKVDLPVSVSRHVKTIEGCSVNLIRSSEAYCAVAAAATSSSFELKSRNQGTH 193
VKTDK GEFK+ LPVS V+ +E C V LIRSSE YCAVAA A SSS LKS+ QGTH
Sbjct: 76 VKTDKTGEFKIQLPVS----VRKVEECYVRLIRSSEPYCAVAARAKSSSLRLKSKKQGTH 135
Query: 194 FFSAGFFTFKPLKHPNICTQKPYSNTFHDMKQALPMLDYPALPTPIENPTIVPNVPRIYD 253
FSAGFFTFKPLK P +C+ +SN F D KQ ++D+P LP PI+NPT VPNVPRIYD
Sbjct: 136 VFSAGFFTFKPLKQPKLCSHNSHSNEFDDTKQ---VVDFPGLPAPIQNPT-VPNVPRIYD 195
Query: 254 NLPPLPFLPRLPPLPQLPPLPPLPPLPGFPIFPPKKTVENAPNGK-----------TLL- 313
NLPPLP LP LPPLPQLPPLPPLPPLP FP+FPPKK EN K TLL
Sbjct: 196 NLPPLPLLPGLPPLPQLPPLPPLPPLPIFPLFPPKKDDENVQTPKISQNPDLFHPQTLLP 255
Query: 314 -PHKKHLRPHFVLPPHRLQHPPLFPNIPSLLPFEIPPPPVAAAVGAVAGGSVPSPPSPTS 368
P K LRPHFV+PPH+L+H PL P PP AAA+ A AGG + +P+
Sbjct: 256 IPSLKPLRPHFVMPPHKLRHHPLTHG---------PTPPTAAALAAAAGGEL----APSP 303
BLAST of Moc04g37100 vs. ExPASy TrEMBL
Match:
A0A6J1BYU3 (proline-rich protein 4-like OS=Momordica charantia OX=3673 GN=LOC111006489 PE=4 SV=1)
HSP 1 Score: 605.1 bits (1559), Expect = 2.0e-169
Identity = 308/313 (98.40%), Postives = 309/313 (98.72%), Query Frame = 0
Query: 70 LSGAECSLARHHKNLPSAVIVGTVFCDTCSQEKLSKTSRFISGATVAVECGNGGPKPSFR 129
LS + S ARHHKNLPSAVIVGTVFCDTCSQEKLSKTSRFISGATVAVECGNGGPKPSFR
Sbjct: 12 LSFCDLSQARHHKNLPSAVIVGTVFCDTCSQEKLSKTSRFISGATVAVECGNGGPKPSFR 71
Query: 130 EEVKTDKRGEFKVDLPVSVSRHVKTIEGCSVNLIRSSEAYCAVAAAATSSSFELKSRNQG 189
EEVKTDKRGEFKVDLPVSVSRHVKTIEGCSVNLIRSSEAYCAVAAAATSSSFELKSRNQG
Sbjct: 72 EEVKTDKRGEFKVDLPVSVSRHVKTIEGCSVNLIRSSEAYCAVAAAATSSSFELKSRNQG 131
Query: 190 THFFSAGFFTFKPLKHPNICTQKPYSNTFHDMKQALPMLDYPALPTPIENPTIVPNVPRI 249
THFFSAGFFTFKPLKHPNICTQKPYSNTFHDMKQALPMLDYPALPTPIENPTIVPNVPRI
Sbjct: 132 THFFSAGFFTFKPLKHPNICTQKPYSNTFHDMKQALPMLDYPALPTPIENPTIVPNVPRI 191
Query: 250 YDNLPPLPFLPRLPPLPQLPPLPPLPPLPGFPIFPPKKTVENAPNGKTLLPHKKHLRPHF 309
YDNLPPLPFLPRLPPLPQLPPLPPLPPLPGFPIFPPKKTVENAPNGKTLLPHKKHLRPHF
Sbjct: 192 YDNLPPLPFLPRLPPLPQLPPLPPLPPLPGFPIFPPKKTVENAPNGKTLLPHKKHLRPHF 251
Query: 310 VLPPHRLQHPPLFPNIPSLLPFEIPPPPVAAAVGAVAGGSVPSPPSPTSPTPFPIPPVPG 369
VLPPHRLQHPPLFPNIPSLLPFEIPPPPVAAAVGAVAGGSVPSPPSPTSPTPFPIPPVPG
Sbjct: 252 VLPPHRLQHPPLFPNIPSLLPFEIPPPPVAAAVGAVAGGSVPSPPSPTSPTPFPIPPVPG 311
Query: 370 LPGIPSPPRQTSP 383
LPGIPSPPRQTSP
Sbjct: 312 LPGIPSPPRQTSP 324
BLAST of Moc04g37100 vs. ExPASy TrEMBL
Match:
A0A6J1HD47 (amyloid beta A4 precursor protein-binding family B member 1-interacting protein-like OS=Cucurbita moschata OX=3662 GN=LOC111463006 PE=4 SV=1)
HSP 1 Score: 332.4 bits (851), Expect = 2.5e-87
Identity = 199/325 (61.23%), Postives = 227/325 (69.85%), Query Frame = 0
Query: 74 ECSLARHHKNLPS-AVIVGTVFCDTCSQEKLSKTSRFISGATVAVECGNGGPKPSFREEV 133
+ S ARHH NLPS A +VGTVFCDTC Q+ SK+S FISGATVAVECG+GG PSFR+EV
Sbjct: 16 DLSQARHHNNLPSTAAVVGTVFCDTCFQDTFSKSSHFISGATVAVECGDGGSNPSFRDEV 75
Query: 134 KTDKRGEFKVDLPVSVSRHVKTIEGCSVNLIRSSEAYCAVAAAATSSSFELKSRNQGTHF 193
KTDK GEFK+ LPVS V+ IE C V LIRSSE YCAVAA A SSS +LKSR QGTH
Sbjct: 76 KTDKTGEFKIQLPVS----VRKIEECYVRLIRSSEPYCAVAARAKSSSLKLKSRKQGTHV 135
Query: 194 FSAGFFTFKPLKHPNICTQKPYSNTFHDMKQALPMLDYPALPTPIENPTIVPNVPRIYDN 253
FSAGFFTFKPLKHP +C+ +S+ F D KQ ++D+P LP PI+NPT VPNVPRIYDN
Sbjct: 136 FSAGFFTFKPLKHPKLCSHNSHSSEFDDTKQ---VVDFPGLPAPIQNPT-VPNVPRIYDN 195
Query: 254 LPPLPFLPRLPPLPQLPPLPPLPPLPGFPIFPPKKTVENAPNGK-----------TLL-- 313
LPPL LP LPPLPQLPPLPPLPPLP FP+FPPKK EN K TLL
Sbjct: 196 LPPLTLLPGLPPLPQLPPLPPLPPLPVFPLFPPKKDDENVQTPKISQNPDMFHPQTLLPI 255
Query: 314 PHKKHLRPHFVLPPHRLQHPPLF--PNIPSLLPFEIPPPPVAAAVGAVAGGSVPSPPSPT 373
P K LRPHFV+PPH+L+H PL P PS F P PP AAA +P+
Sbjct: 256 PSLKPLRPHFVMPPHKLRHHPLTHGPFSPS---FSTPTPPSAAA----------DELAPS 315
Query: 374 SPTPFPIPPVPGLPGIPSPPRQTSP 383
P PF +PP+P +P I SPP++TSP
Sbjct: 316 PPLPFSLPPIPRMPEISSPPKETSP 319
BLAST of Moc04g37100 vs. ExPASy TrEMBL
Match:
A0A6J1JRA2 (amyloid beta A4 precursor protein-binding family B member 1-interacting protein-like OS=Cucurbita maxima OX=3661 GN=LOC111487127 PE=4 SV=1)
HSP 1 Score: 327.0 bits (837), Expect = 1.0e-85
Identity = 190/309 (61.49%), Postives = 215/309 (69.58%), Query Frame = 0
Query: 74 ECSLARHHKNLPS--AVIVGTVFCDTCSQEKLSKTSRFISGATVAVECGNGGPKPSFREE 133
+ S ARHH NLPS A +VGTVFCDTC Q+ SKTS FISGATVAVECGN G P+FREE
Sbjct: 16 DLSQARHHNNLPSAAAAVVGTVFCDTCFQDTFSKTSHFISGATVAVECGNEGSNPNFREE 75
Query: 134 VKTDKRGEFKVDLPVSVSRHVKTIEGCSVNLIRSSEAYCAVAAAATSSSFELKSRNQGTH 193
VKTDK GEFK+ LPVS V+ +E C V LIRSSE YCAVAA A SSS LKS+ QGTH
Sbjct: 76 VKTDKTGEFKIQLPVS----VRKVEECYVRLIRSSEPYCAVAARAKSSSLRLKSKKQGTH 135
Query: 194 FFSAGFFTFKPLKHPNICTQKPYSNTFHDMKQALPMLDYPALPTPIENPTIVPNVPRIYD 253
FSAGFFTFKPLK P +C+ +SN F D KQ ++D+P LP PI+NPT VPNVPRIYD
Sbjct: 136 VFSAGFFTFKPLKQPKLCSHNSHSNEFDDTKQ---VVDFPGLPAPIQNPT-VPNVPRIYD 195
Query: 254 NLPPLPFLPRLPPLPQLPPLPPLPPLPGFPIFPPKKTVENAPNGK-----------TLL- 313
NLPPLP LP LPPLPQLPPLPPLPPLP FP+FPPKK EN K TLL
Sbjct: 196 NLPPLPLLPGLPPLPQLPPLPPLPPLPIFPLFPPKKDDENVQTPKISQNPDLFHPQTLLP 255
Query: 314 -PHKKHLRPHFVLPPHRLQHPPLFPNIPSLLPFEIPPPPVAAAVGAVAGGSVPSPPSPTS 368
P K LRPHFV+PPH+L+H PL P PP AAA+ A AGG + +P+
Sbjct: 256 IPSLKPLRPHFVMPPHKLRHHPLTHG---------PTPPTAAALAAAAGGEL----APSP 303
BLAST of Moc04g37100 vs. ExPASy TrEMBL
Match:
A0A0A0KXM2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G642130 PE=4 SV=1)
HSP 1 Score: 315.1 bits (806), Expect = 4.1e-82
Identity = 182/273 (66.67%), Postives = 199/273 (72.89%), Query Frame = 0
Query: 74 ECSLARHHKNLPSAVIVGTVFCDTCSQEKLSKTSRFISGATVAVECGNGGPKPSFREEVK 133
+ S ARHH+ LPSAV+VGTVFCDTC QEK SKTS FISGATVAVECGN GP+PSFREEVK
Sbjct: 16 DLSEARHHRKLPSAVVVGTVFCDTCYQEKFSKTSHFISGATVAVECGNKGPEPSFREEVK 75
Query: 134 TDKRGEFKVDLPVSVSRHVKTIEGCSVNLIRSSEAYCAVAAAATSSSFELKSRNQGTHFF 193
TDKRGEFKV+LPV VS+HVK IE C V L++SSE YC VAA A SSS +LKSR Q TH F
Sbjct: 76 TDKRGEFKVNLPVLVSKHVKKIEECYVELVKSSEPYCDVAATAKSSSLQLKSRKQNTHTF 135
Query: 194 SAGFFTFKPLKHPNICTQKPYS-NTFHDMKQAL----PMLDYPALPTPIENPTIVPNVPR 253
SAGFFTFKPLK PN+C QKP + NTF DMK+ P D P LP+PI+ PT VP+ PR
Sbjct: 136 SAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIQLPPPPSYDIPNLPSPIQIPT-VPSAPR 195
Query: 254 IYDNLPPLPFLPRLPPLPQLPPLPP------LPPLPGFPIFPPK-KTVENAPN------- 313
IYDNLPPLP LP L PLPQLPPLPP LPPLP FPIFPPK K +NAPN
Sbjct: 196 IYDNLPPLPLLPGLLPLPQLPPLPPLPPLPTLPPLPKFPIFPPKEKDEKNAPNETPNTSE 255
Query: 314 --GKTLLPHKKHLRP--HFVLPPHRLQHPPLFP 324
K +P K LR HFVLPP RL H P P
Sbjct: 256 KLDKFPIPPIKPLRKPHHFVLPPQRLHHHPRLP 287
BLAST of Moc04g37100 vs. ExPASy TrEMBL
Match:
A0A5A7T850 (Major pollen allergen Lol p 11 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold169G002260 PE=4 SV=1)
HSP 1 Score: 308.5 bits (789), Expect = 3.8e-80
Identity = 193/333 (57.96%), Postives = 212/333 (63.66%), Query Frame = 0
Query: 74 ECSLARHHKNLPSAVIVGTVFCDTCSQEKLSKTSRFISGATVAVECGNGGPKPSFREEVK 133
+ S ARHH+ LPSAVI+GTVFCDTC QEK SKTS FISGATVAVECGN G KPSFREEVK
Sbjct: 16 DLSEARHHRKLPSAVIIGTVFCDTCFQEKFSKTSHFISGATVAVECGNRGRKPSFREEVK 75
Query: 134 TDKRGEFKVDLPVSVSRHVKTIEGCSVNLIRSSEAYCAVAAAATSSSFELKSRNQGTHFF 193
TDKRGEFKV+LPV VS+HV+ IE C V I+SSE YC VAA A SSS +LKS+ Q TH F
Sbjct: 76 TDKRGEFKVNLPVLVSKHVEKIEECYVESIKSSEPYCDVAATAKSSSLQLKSKKQNTHTF 135
Query: 194 SAGFFTFKPLKHPNICTQKPYS-NTFHDMKQAL-----PMLDYPALPTPIENPTIVPNVP 253
SAGFFTFKPLK PN+C QKP + NTF DMK+ + P D P LP+PI+NPT VPN P
Sbjct: 136 SAGFFTFKPLKQPNLCNQKPQNPNTFDDMKEIIQLPPPPPFDSPNLPSPIQNPT-VPNAP 195
Query: 254 RIYDNLPPLPF------LPRLPPLPQLPPLPPLPPLPGFPIFPPKKTVE-----NAPNGK 313
RIYDNLPPLP LP LPPLP LPPLPPLPPLP FPIFPPK E PN
Sbjct: 196 RIYDNLPPLPLLPGLLPLPPLPPLPPLPPLPPLPPLPKFPIFPPKANDEKNAPIETPNTS 255
Query: 314 TLL------PHKKHLRPH-FVLPPHRLQHPPLFPNIPSLLPFEIPPPPVAAAVGAVAGGS 373
L P K RPH FVLPP RL H P PPP A +G
Sbjct: 256 EKLDKFPIPPIKPLRRPHYFVLPPQRLHHHP-------------QPPPHVAVIGG----- 314
Query: 374 VPSPPSPTSPTPFPIPPVPGLPGIPSPPRQTSP 383
P+P L I SP ++TSP
Sbjct: 316 ---------------EPIPNLSNISSPQKKTSP 314
BLAST of Moc04g37100 vs. TAIR 10
Match:
AT5G15780.1 (Pollen Ole e 1 allergen and extensin family protein )
HSP 1 Score: 190.7 bits (483), Expect = 2.2e-48
Identity = 151/358 (42.18%), Postives = 191/358 (53.35%), Query Frame = 0
Query: 82 KNLPSAVIVGTVFCDTCSQEKLSKT-SRFISGATVAVECGNGGPKPSFREEVKTDKRGEF 141
K SAV+VGTV+CDTC SK+ + ISGA VAVEC + KPSFR+EVKTDKRGEF
Sbjct: 35 KTRSSAVVVGTVYCDTCFNGAFSKSPNHLISGALVAVECIDENSKPSFRQEVKTDKRGEF 94
Query: 142 KVDLPVSVSRHVKTIEGCSVNLIRSSEAYCAVAAAATSSSFE-LKSRNQG--THFFSAGF 201
KV LP SVS+HVK I+ CSV L+ SS+ YC++A++ATSSS + LKS + G T FSAGF
Sbjct: 95 KVKLPFSVSKHVKKIKRCSVKLLSSSQPYCSIASSATSSSLKRLKSNHHGENTRVFSAGF 154
Query: 202 FTFKPLKHPNICTQKPYSNTFHDMKQALPMLDYPALPTPIENPTIVPNVPRIYDNLPPLP 261
FTF+P P IC+QKP +++ + P+L P+ P P+++P N PLP
Sbjct: 155 FTFRPENQPEICSQKPI-----NLRGSKPLLPDPSFPPPLQDP----------PNPSPLP 214
Query: 262 FLPRLPPLPQLP-PLPPLPPLPGFPIFPPKKTVENAPNGKTLLPHKKH----------LR 321
LP +PPLP LP P P+P LP P+ PP + P L +KK L+
Sbjct: 215 NLPIVPPLPNLPVPKLPVPDLP-LPLVPP--LLPPGPQKSASLHNKKSDSLKDKKTEALK 274
Query: 322 PHFVLPPHRLQHP--------------------PLFPNIPSLLPFE-IPPPPVAAAVGAV 378
P+F PP+ L P PL P+ PSL P IP PP + +
Sbjct: 275 PNFFFPPNPLNPPSIIPPNPLIPSIPTPTLPPNPLIPSPPSLPPIPLIPTPPTLPTIPLL 334
BLAST of Moc04g37100 vs. TAIR 10
Match:
AT5G13140.1 (Pollen Ole e 1 allergen and extensin family protein )
HSP 1 Score: 52.4 bits (124), Expect = 9.5e-07
Identity = 63/235 (26.81%), Postives = 96/235 (40.85%), Query Frame = 0
Query: 89 IVGTVFCDTCSQEKLSKTSRFISGATVAVECGNGGPKPSFREEVK------TDKRGEFKV 148
+VG V+CDTCS S+ S F+ G V V C P EEV T++ G +K+
Sbjct: 41 VVGVVYCDTCSINTFSRQSYFLQGVEVHVTCRFKASSPKTAEEVNISVNRTTNRSGVYKL 100
Query: 149 DLP----VSVSRHVKTIEGCSVNLIRSS---EAYCAVAAAATSSS-FELKSRNQGTHFFS 208
++P + + CS ++++S C++ T+++ +KS+ +S
Sbjct: 101 EIPHVDGIDCVDGIAISSQCSAKILKTSSDDNGGCSIPVFQTATNEVSIKSKQDRVCIYS 160
Query: 209 AGFFTFK-PLKHPNIC---------TQKPYSNTFHDMKQALPMLDYPALPTPIENPTIVP 268
++K P K+ ++C + F D K P L P P
Sbjct: 161 LSALSYKPPHKNTSLCGNGGKKHHRKDEKVEKKFRDSKFFWPYLAPYWFPWP-------- 220
Query: 269 NVPRIYDNLPPLPFLPRLP--PLPQLP------PLP------PLPPLPGFPIFPP 286
Y +LPPLP LP P P P LP LP P+ +P P FPP
Sbjct: 221 -----YPDLPPLPTLPPFPSFPFPSLPFGNPNLALPAFDWKNPVTWIPYLPRFPP 262
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022134152.1 | 4.1e-169 | 98.40 | proline-rich protein 4-like [Momordica charantia] | [more] |
KAG6602543.1 | 1.2e-88 | 61.61 | hypothetical protein SDJN03_07776, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_023528636.1 | 7.9e-88 | 61.80 | proline-rich protein 4-like [Cucurbita pepo subsp. pepo] | [more] |
XP_022962607.1 | 5.1e-87 | 61.23 | amyloid beta A4 precursor protein-binding family B member 1-interacting protein-... | [more] |
XP_022990149.1 | 2.1e-85 | 61.49 | amyloid beta A4 precursor protein-binding family B member 1-interacting protein-... | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1BYU3 | 2.0e-169 | 98.40 | proline-rich protein 4-like OS=Momordica charantia OX=3673 GN=LOC111006489 PE=4 ... | [more] |
A0A6J1HD47 | 2.5e-87 | 61.23 | amyloid beta A4 precursor protein-binding family B member 1-interacting protein-... | [more] |
A0A6J1JRA2 | 1.0e-85 | 61.49 | amyloid beta A4 precursor protein-binding family B member 1-interacting protein-... | [more] |
A0A0A0KXM2 | 4.1e-82 | 66.67 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G642130 PE=4 SV=1 | [more] |
A0A5A7T850 | 3.8e-80 | 57.96 | Major pollen allergen Lol p 11 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_s... | [more] |
Match Name | E-value | Identity | Description | |
AT5G15780.1 | 2.2e-48 | 42.18 | Pollen Ole e 1 allergen and extensin family protein | [more] |
AT5G13140.1 | 9.5e-07 | 26.81 | Pollen Ole e 1 allergen and extensin family protein | [more] |