Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GCTTCAAATTGTTCTTTGTTCCTCTCTTTAACTCCTTTTTCCTTCGCCATTAAGTCTTCACCCTCACCTTCATCCCTCCATTTTTTCCCACTTCAATTCATTCCTAAACCCTTCATCAATGGCGGAATCAGATGTTCTCACACCACCGCAAAATCACTCTACTCCCTCTCCAAGTAAGTTCAATACTCATTTATTCTACAAACTTATAACCGCCATTTTCTTTCTTCTCATTCTCCCTTTAGTCCCTTCCCAAGCCCCTGAGTTTGTTAATCAAGCTTTACTCACCAGAAGCTGGGAGCTTCTCCACCTTCTTTTCGTCGGAATCGCTGTTTCTTATGGCCTTTTTAGCCGTAGAAGCGATGAAAAAGAAGATGAAATTAGTGTTTCTAAGTTTGATAATGTTCAATCTTATGTTTCTGGTTTGCTTCATGTTTCCTCTGTTTTTGATGATGAACCTGAAACTCCTTCTGCTAATGATGATGAAAATAAGGTCCAAACATGGAATAATCGGTATTTTAGAAATGAATCTGTTGTTGTTGCTGAAGAACGTCCTGTGGATAATGAGCAGAGAGTTAGGAGTGAGAAACCTTTGCTTCTCCCTGTTCGTAGTTTGAAATCTCGTGTTGTTGTAGATGATGAGTTTAGATCTAAGAAGAGAGTGAGTTCTAGAAGGTTATTGAGTAATTTGAAGAGGAGTTCGAATGTGGAGTTTGGAGGAGTGAATAATCTCGATGAAATTGACGATAAGTTGAATGAGAATTTCGTTCTTCCATCTCCGGTTCCATGGCGGTCGAGATCAGGGAGGATGGAGAAGCAAGAAGAAGCTGATAATCCTTCCATGGAGGACTCCGAATCGAATCGGATCGGTTCTAGGTCTCCTAAGCCTCAAACTTCAAAATCTTCCCGAGCCAGTGCCATTCCTCAGAAACTATCTCCTTCTCCCTCTCCATCTCCAAGGAAACCATCTCCTTCCCATAACGTGTCACCAGAATTACAGGCCAAGAGTGCAGAGGATTTGGTGAGGAAGAAGAGCTTCTACCGCTCTCCGCCTCCACCGCCGCCCCCACCCCCGCCACGTGTTCGAAGAACTTCCTCGATGAAACCAAGCTCATGGGTGAACGAGGATGATGTACCTCATCAAAAGGAATTGAGAAGAAGCTACACTAGCAAGCCCAGAACCATAACTCGTGATACGGGAGATGATACTGATATGATGATTGGTGCTAATTCTAGCGGCGAAACACAACCTAGACATTATGTTGATGGTCTATCAATGGGAAAATCTGTTAGAACAATTAGGGCTGGCGAAGCTGTGAATGAACCACCAAGAAGAGGGAGAGAATTTAGTGTGAATGATCAATTGAAGGGGAAGACGATGGTGAATGAGAATACCCATGTCCAAGATTTTGAAGAAAACCCCCTTGAGTCTCCAGATGAAGATAAAGAAGAATTAGTTGAAAAGCTAACAATGGACACCGACGTTGACGAAGACGACGACGATGACATGGAAAGCGAGGTAGAAGGGAATAGCATGGTGGGGAAGTTTATCAGGGAAGACAATGGAGAACCTTTTGATGTGAAACGGAGAAATAGAGAGGACGAAAGAGGTTCGAGTAATGAAGAAGAAGAAGAAGAAGAAGAAGCAGGAAGCTGTAGTAATATTGGCAATGATGGAGGACCGGATGTAGATAAGAAGGCTGATGAGTTCATTGCCAAATTCAGAGAGCAGATAAGGCTTCAAAGGATTGAATCATTCAAAAGATCAAGTGGACAAATACGTAAAAACACTACAAAGCAAAGTTGAAGATGAAAGATTATGTATCATATAATACCCTTCTCTTCAGTAAGTTCATCAAAATTTGAACTCCAAATCATTCAACTGACATTATAGAGTCATTTTCTAATTCTAACACTTTTTTTTTACATGATTTTCTTCTTTGATCACCAAGTTTCAATTCTCTGGAAAGTGTTTCATCTCCTCC
mRNA sequence
GCTTCAAATTGTTCTTTGTTCCTCTCTTTAACTCCTTTTTCCTTCGCCATTAAGTCTTCACCCTCACCTTCATCCCTCCATTTTTTCCCACTTCAATTCATTCCTAAACCCTTCATCAATGGCGGAATCAGATGTTCTCACACCACCGCAAAATCACTCTACTCCCTCTCCAAGTAAGTTCAATACTCATTTATTCTACAAACTTATAACCGCCATTTTCTTTCTTCTCATTCTCCCTTTAGTCCCTTCCCAAGCCCCTGAGTTTGTTAATCAAGCTTTACTCACCAGAAGCTGGGAGCTTCTCCACCTTCTTTTCGTCGGAATCGCTGTTTCTTATGGCCTTTTTAGCCGTAGAAGCGATGAAAAAGAAGATGAAATTAGTGTTTCTAAGTTTGATAATGTTCAATCTTATGTTTCTGGTTTGCTTCATGTTTCCTCTGTTTTTGATGATGAACCTGAAACTCCTTCTGCTAATGATGATGAAAATAAGGTCCAAACATGGAATAATCGGTATTTTAGAAATGAATCTGTTGTTGTTGCTGAAGAACGTCCTGTGGATAATGAGCAGAGAGTTAGGAGTGAGAAACCTTTGCTTCTCCCTGTTCGTAGTTTGAAATCTCGTGTTGTTGTAGATGATGAGTTTAGATCTAAGAAGAGAGTGAGTTCTAGAAGGTTATTGAGTAATTTGAAGAGGAGTTCGAATGTGGAGTTTGGAGGAGTGAATAATCTCGATGAAATTGACGATAAGTTGAATGAGAATTTCGTTCTTCCATCTCCGGTTCCATGGCGGTCGAGATCAGGGAGGATGGAGAAGCAAGAAGAAGCTGATAATCCTTCCATGGAGGACTCCGAATCGAATCGGATCGGTTCTAGGTCTCCTAAGCCTCAAACTTCAAAATCTTCCCGAGCCAGTGCCATTCCTCAGAAACTATCTCCTTCTCCCTCTCCATCTCCAAGGAAACCATCTCCTTCCCATAACGTGTCACCAGAATTACAGGCCAAGAGTGCAGAGGATTTGGTGAGGAAGAAGAGCTTCTACCGCTCTCCGCCTCCACCGCCGCCCCCACCCCCGCCACGTGTTCGAAGAACTTCCTCGATGAAACCAAGCTCATGGGTGAACGAGGATGATGTACCTCATCAAAAGGAATTGAGAAGAAGCTACACTAGCAAGCCCAGAACCATAACTCGTGATACGGGAGATGATACTGATATGATGATTGGTGCTAATTCTAGCGGCGAAACACAACCTAGACATTATGTTGATGGTCTATCAATGGGAAAATCTGTTAGAACAATTAGGGCTGGCGAAGCTGTGAATGAACCACCAAGAAGAGGGAGAGAATTTAGTGTGAATGATCAATTGAAGGGGAAGACGATGGTGAATGAGAATACCCATGTCCAAGATTTTGAAGAAAACCCCCTTGAGTCTCCAGATGAAGATAAAGAAGAATTAGTTGAAAAGCTAACAATGGACACCGACGTTGACGAAGACGACGACGATGACATGGAAAGCGAGGTAGAAGGGAATAGCATGGTGGGGAAGTTTATCAGGGAAGACAATGGAGAACCTTTTGATGTGAAACGGAGAAATAGAGAGGACGAAAGAGGTTCGAGTAATGAAGAAGAAGAAGAAGAAGAAGAAGCAGGAAGCTGTAGTAATATTGGCAATGATGGAGGACCGGATGTAGATAAGAAGGCTGATGAGTTCATTGCCAAATTCAGAGAGCAGATAAGGCTTCAAAGGATTGAATCATTCAAAAGATCAAGTGGACAAATACGTAAAAACACTACAAAGCAAAGTTGAAGATGAAAGATTATGTATCATATAATACCCTTCTCTTCAGTAAGTTCATCAAAATTTGAACTCCAAATCATTCAACTGACATTATAGAGTCATTTTCTAATTCTAACACTTTTTTTTTACATGATTTTCTTCTTTGATCACCAAGTTTCAATTCTCTGGAAAGTGTTTCATCTCCTCC
Coding sequence (CDS)
ATGGCGGAATCAGATGTTCTCACACCACCGCAAAATCACTCTACTCCCTCTCCAAGTAAGTTCAATACTCATTTATTCTACAAACTTATAACCGCCATTTTCTTTCTTCTCATTCTCCCTTTAGTCCCTTCCCAAGCCCCTGAGTTTGTTAATCAAGCTTTACTCACCAGAAGCTGGGAGCTTCTCCACCTTCTTTTCGTCGGAATCGCTGTTTCTTATGGCCTTTTTAGCCGTAGAAGCGATGAAAAAGAAGATGAAATTAGTGTTTCTAAGTTTGATAATGTTCAATCTTATGTTTCTGGTTTGCTTCATGTTTCCTCTGTTTTTGATGATGAACCTGAAACTCCTTCTGCTAATGATGATGAAAATAAGGTCCAAACATGGAATAATCGGTATTTTAGAAATGAATCTGTTGTTGTTGCTGAAGAACGTCCTGTGGATAATGAGCAGAGAGTTAGGAGTGAGAAACCTTTGCTTCTCCCTGTTCGTAGTTTGAAATCTCGTGTTGTTGTAGATGATGAGTTTAGATCTAAGAAGAGAGTGAGTTCTAGAAGGTTATTGAGTAATTTGAAGAGGAGTTCGAATGTGGAGTTTGGAGGAGTGAATAATCTCGATGAAATTGACGATAAGTTGAATGAGAATTTCGTTCTTCCATCTCCGGTTCCATGGCGGTCGAGATCAGGGAGGATGGAGAAGCAAGAAGAAGCTGATAATCCTTCCATGGAGGACTCCGAATCGAATCGGATCGGTTCTAGGTCTCCTAAGCCTCAAACTTCAAAATCTTCCCGAGCCAGTGCCATTCCTCAGAAACTATCTCCTTCTCCCTCTCCATCTCCAAGGAAACCATCTCCTTCCCATAACGTGTCACCAGAATTACAGGCCAAGAGTGCAGAGGATTTGGTGAGGAAGAAGAGCTTCTACCGCTCTCCGCCTCCACCGCCGCCCCCACCCCCGCCACGTGTTCGAAGAACTTCCTCGATGAAACCAAGCTCATGGGTGAACGAGGATGATGTACCTCATCAAAAGGAATTGAGAAGAAGCTACACTAGCAAGCCCAGAACCATAACTCGTGATACGGGAGATGATACTGATATGATGATTGGTGCTAATTCTAGCGGCGAAACACAACCTAGACATTATGTTGATGGTCTATCAATGGGAAAATCTGTTAGAACAATTAGGGCTGGCGAAGCTGTGAATGAACCACCAAGAAGAGGGAGAGAATTTAGTGTGAATGATCAATTGAAGGGGAAGACGATGGTGAATGAGAATACCCATGTCCAAGATTTTGAAGAAAACCCCCTTGAGTCTCCAGATGAAGATAAAGAAGAATTAGTTGAAAAGCTAACAATGGACACCGACGTTGACGAAGACGACGACGATGACATGGAAAGCGAGGTAGAAGGGAATAGCATGGTGGGGAAGTTTATCAGGGAAGACAATGGAGAACCTTTTGATGTGAAACGGAGAAATAGAGAGGACGAAAGAGGTTCGAGTAATGAAGAAGAAGAAGAAGAAGAAGAAGCAGGAAGCTGTAGTAATATTGGCAATGATGGAGGACCGGATGTAGATAAGAAGGCTGATGAGTTCATTGCCAAATTCAGAGAGCAGATAAGGCTTCAAAGGATTGAATCATTCAAAAGATCAAGTGGACAAATACGTAAAAACACTACAAAGCAAAGTTGA
Protein sequence
MAESDVLTPPQNHSTPSPSKFNTHLFYKLITAIFFLLILPLVPSQAPEFVNQALLTRSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETPSANDDENKVQTWNNRYFRNESVVVAEERPVDNEQRVRSEKPLLLPVRSLKSRVVVDDEFRSKKRVSSRRLLSNLKRSSNVEFGGVNNLDEIDDKLNENFVLPSPVPWRSRSGRMEKQEEADNPSMEDSESNRIGSRSPKPQTSKSSRASAIPQKLSPSPSPSPRKPSPSHNVSPELQAKSAEDLVRKKSFYRSPPPPPPPPPPRVRRTSSMKPSSWVNEDDVPHQKELRRSYTSKPRTITRDTGDDTDMMIGANSSGETQPRHYVDGLSMGKSVRTIRAGEAVNEPPRRGREFSVNDQLKGKTMVNENTHVQDFEENPLESPDEDKEELVEKLTMDTDVDEDDDDDMESEVEGNSMVGKFIREDNGEPFDVKRRNREDERGSSNEEEEEEEEAGSCSNIGNDGGPDVDKKADEFIAKFREQIRLQRIESFKRSSGQIRKNTTKQS*
Homology
BLAST of CSPI03G02500 vs. ExPASy TrEMBL
Match:
A0A0A0L1H7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G019360 PE=4 SV=1)
HSP 1 Score: 1050.8 bits (2716), Expect = 2.0e-303
Identity = 555/561 (98.93%), Postives = 557/561 (99.29%), Query Frame = 0
Query: 1 MAESDVLTPPQNHSTPSPSKFNTHLFYKLITAIFFLLILPLVPSQAPEFVNQALLTRSWE 60
MAESDVLTPPQNHSTPSPSKFNTHL YKLITAIFFLLILPLVPSQAPEFVNQ LLTRSWE
Sbjct: 1 MAESDVLTPPQNHSTPSPSKFNTHLLYKLITAIFFLLILPLVPSQAPEFVNQTLLTRSWE 60
Query: 61 LLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETPSAND 120
LLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETPSAND
Sbjct: 61 LLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETPSAND 120
Query: 121 DENKVQTWNNRYFRNESVVVAEERPVDNEQRVRSEKPLLLPVRSLKSRVVVDDEFRSKKR 180
DENKVQTWNNRYFRNESVVVAEERPVDNEQRVRSEKPLLLPVRSLKSRVVVDDEFRSKKR
Sbjct: 121 DENKVQTWNNRYFRNESVVVAEERPVDNEQRVRSEKPLLLPVRSLKSRVVVDDEFRSKKR 180
Query: 181 VSSRRLLSNLKRSSNVEFGGVNNLDEIDDKLNENFVLPSPVPWRSRSGRMEKQEEADNPS 240
VSSRRLLSNLKRSSNVEFGGVNNLDEIDDKLNENFVLPSPVPWRSRSGRMEKQEEADNPS
Sbjct: 181 VSSRRLLSNLKRSSNVEFGGVNNLDEIDDKLNENFVLPSPVPWRSRSGRMEKQEEADNPS 240
Query: 241 MEDSESNRIGSRSPKPQTSKSSRASAIPQKLSPSPSPSPRKPSPSHNVSPELQAKSAEDL 300
MEDSESNRIGSRSPKPQTSKSSRASAIPQ+LSPSPSPSPRKPSPSHNVSPELQAKSAEDL
Sbjct: 241 MEDSESNRIGSRSPKPQTSKSSRASAIPQRLSPSPSPSPRKPSPSHNVSPELQAKSAEDL 300
Query: 301 VRKKSFYRSPPPPPPPPPPRVRRTSSMKPSSWVNEDDVPHQKELRRSYTSKPRTITRDTG 360
VRKKSFYRSPPPPPPPPPPRVRRTSSMKPSSWVNEDDVPHQKELRRSYTSKPRTITRDTG
Sbjct: 301 VRKKSFYRSPPPPPPPPPPRVRRTSSMKPSSWVNEDDVPHQKELRRSYTSKPRTITRDTG 360
Query: 361 DDTDMMIGANSSGETQPRHYVDGLSMGKSVRTIRAGEAVNEPPRRGREFSVNDQLKGKTM 420
DDTDMMIGANSSGETQPRHYVDGLSMGKSVRTIRAGEAVNEPPRRGREFSVNDQLKGKTM
Sbjct: 361 DDTDMMIGANSSGETQPRHYVDGLSMGKSVRTIRAGEAVNEPPRRGREFSVNDQLKGKTM 420
Query: 421 VNENTHVQDFEENPLESPDEDKEELVEKLTMDTDVDEDDDDDMESEVEGNSMVGKFIRED 480
+NENTHVQDFEENPLESPDEDKEELVEKLTMDTDVDEDDDDDMESEVEGNSMVGKFIRED
Sbjct: 421 MNENTHVQDFEENPLESPDEDKEELVEKLTMDTDVDEDDDDDMESEVEGNSMVGKFIRED 480
Query: 481 NGEPFDVKRRNREDERGSSNEEEEEEEEAGSCSNIGNDGGPDVDKKADEFIAKFREQIRL 540
NGEPFDVKRRNREDERGSSN EEEEEEEAGS SNIGNDGGPDVDKKADEFIAKFREQIRL
Sbjct: 481 NGEPFDVKRRNREDERGSSN-EEEEEEEAGSSSNIGNDGGPDVDKKADEFIAKFREQIRL 540
Query: 541 QRIESFKRSSGQIRKNTTKQS 562
QRIESFKRSSGQIRKNTTKQS
Sbjct: 541 QRIESFKRSSGQIRKNTTKQS 560
BLAST of CSPI03G02500 vs. ExPASy TrEMBL
Match:
A0A5A7TIC6 (Putative Hydroxyproline-rich glycoprotein family protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1017G00520 PE=4 SV=1)
HSP 1 Score: 985.3 bits (2546), Expect = 1.0e-283
Identity = 526/568 (92.61%), Postives = 544/568 (95.77%), Query Frame = 0
Query: 1 MAESDVLTPPQNHSTPSPSKFNTHLFYKLITAIFFLLILPLVPSQAPEFVNQALLTRSWE 60
MAESDVLTPPQNHSTPSPSKFNTHLFYKL+TA+FFLLILPLVPSQAPEFVNQ LLTRSWE
Sbjct: 1 MAESDVLTPPQNHSTPSPSKFNTHLFYKLMTAVFFLLILPLVPSQAPEFVNQTLLTRSWE 60
Query: 61 LLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETPSAND 120
LLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETPSAND
Sbjct: 61 LLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETPSAND 120
Query: 121 DENKVQTWNNRYFRNESVVVAEERPVDNEQRVRSEKPLLLPVRSLKSRVVVDDEFRSKKR 180
DENKVQTWNNRYFRNESVVVAEERPV NEQRVRSEKPLLLPVRSLKSRV+VDDE RSKKR
Sbjct: 121 DENKVQTWNNRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVIVDDESRSKKR 180
Query: 181 VSSRRLLSNLKRSSNVEFGGVNNLDEIDDKLNENFVLPSPVPWRSRSGRMEKQEEADNPS 240
VSSRRLLSNLKR+SNVEFGGVNNLDEIDDKLNEN VLPSPVPWRSRSGR+EKQEEADNPS
Sbjct: 181 VSSRRLLSNLKRTSNVEFGGVNNLDEIDDKLNENVVLPSPVPWRSRSGRLEKQEEADNPS 240
Query: 241 MEDSESNRIGSRSPKPQTSKSSRASAIPQKL--SPSPSPSPRKPSPSHNVSPELQAKSAE 300
MEDSESNRIGSRSPKPQTSK+SRASAIPQKL SPSPSPSPRKPSPSHNVSPELQAKSAE
Sbjct: 241 MEDSESNRIGSRSPKPQTSKASRASAIPQKLSPSPSPSPSPRKPSPSHNVSPELQAKSAE 300
Query: 301 DLVRKKSFYRS--PPPPPPPPPPRVRRTSSMKPSSWVNEDDVPHQKELRRSYTSKPRTIT 360
DLVRKKSFYRS PPPPPPPPPPRVRRTSSMKPSSWVNEDD+PHQKELRRS+TSKPR I
Sbjct: 301 DLVRKKSFYRSPPPPPPPPPPPPRVRRTSSMKPSSWVNEDDIPHQKELRRSFTSKPRAII 360
Query: 361 RDTGDDTDMMIGANSSGETQPRHYVDGLSMGKSVRTIRAGEAVNEPPRRGREFSVNDQLK 420
RDTGDDTDMM+G+NSSGETQPR+YVD LSMGKSVRTIR GE VNEPPRRGREFSVNDQLK
Sbjct: 361 RDTGDDTDMMLGSNSSGETQPRNYVDSLSMGKSVRTIRPGEVVNEPPRRGREFSVNDQLK 420
Query: 421 GKTMVNENTHVQDFEENPLESPDEDKEELVEKLTMDTDVDEDDDDDMESEV-EGNSMVGK 480
GK M+NENTH+QDFEENP+E PDEDKEELVEKLT+DTDVD DDDDDMESE+ E NSMVGK
Sbjct: 421 GKMMMNENTHIQDFEENPIEFPDEDKEELVEKLTLDTDVD-DDDDDMESEIEENNSMVGK 480
Query: 481 FIREDNGEPFDVKRRNREDERGSSNEE--EEEEEEAGSCSNIGNDGGPDVDKKADEFIAK 540
FIREDNGEPFDVKRRNR+DERGS NEE EEEEEEAGS SNIGNDGGPDVDKKADEFIAK
Sbjct: 481 FIREDNGEPFDVKRRNRDDERGSRNEEEKEEEEEEAGSASNIGNDGGPDVDKKADEFIAK 540
Query: 541 FREQIRLQRIESFKRSSGQIRKNTTKQS 562
FREQIRLQRIES KRSSGQIRKN TKQ+
Sbjct: 541 FREQIRLQRIESIKRSSGQIRKNITKQT 567
BLAST of CSPI03G02500 vs. ExPASy TrEMBL
Match:
A0A1S4DX64 (LOW QUALITY PROTEIN: uncharacterized protein DDB_G0284459 OS=Cucumis melo OX=3656 GN=LOC103490869 PE=4 SV=1)
HSP 1 Score: 970.7 bits (2508), Expect = 2.6e-279
Identity = 520/567 (91.71%), Postives = 538/567 (94.89%), Query Frame = 0
Query: 1 MAESDVLTPPQNHSTPSPSKFNTHLFYKLITAIFFLLILPLVPSQAPEFVNQALLTRSWE 60
MAESDVLTPPQNHSTPSPSKFNTHLFYKL+TA+FFLLILPLVPSQAPEFVNQ LLTRSWE
Sbjct: 1 MAESDVLTPPQNHSTPSPSKFNTHLFYKLMTAVFFLLILPLVPSQAPEFVNQTLLTRSWE 60
Query: 61 LLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETPSAND 120
LLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETPSAND
Sbjct: 61 LLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETPSAND 120
Query: 121 DENKVQTWNNRYFRNESVVVAEERPVDNEQRVRSEKPLLLPVRSLKSRVVVDDEFRSKKR 180
DENKVQTWNNRYFRNESVVVAEERPV NEQRVRSEKPLLLPVRSLKSRV+VDDE RSKKR
Sbjct: 121 DENKVQTWNNRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVIVDDESRSKKR 180
Query: 181 VSSRRLLSNLKRSSNVEFGGVNNLDEIDDKLNENFVLPSPVPWRSRSGRMEKQEEADNPS 240
VSSRRLLSNLKR+SNVEFGGVNNLDEIDDKLNEN VLPSPVPWRSRSGR+EKQEEADNPS
Sbjct: 181 VSSRRLLSNLKRTSNVEFGGVNNLDEIDDKLNENVVLPSPVPWRSRSGRLEKQEEADNPS 240
Query: 241 MEDSESNRIGSRSPKPQTSKSSRASAIPQKL--SPSPSPSPRKPSPSHNVSPELQAKSAE 300
MEDSESNRIGSRSPKPQTSK+SRASAIPQKL SPSPSPSPRKPSPSHNVSPELQAKSAE
Sbjct: 241 MEDSESNRIGSRSPKPQTSKASRASAIPQKLSPSPSPSPSPRKPSPSHNVSPELQAKSAE 300
Query: 301 DLVRKKSFYRSPPPPPPPP-PPRVRRTSSMKPSSWVNEDDVPHQKELRRSYTSKPRTITR 360
DLVRKKSFYRSPPPPPPPP P V SMKPSSWVNEDD+PHQKELRRS+TSKPR I R
Sbjct: 301 DLVRKKSFYRSPPPPPPPPHPHHVFEELSMKPSSWVNEDDIPHQKELRRSFTSKPRAIIR 360
Query: 361 DTGDDTDMMIGANSSGETQPRHYVDGLSMGKSVRTIRAGEAVNEPPRRGREFSVNDQLKG 420
DTGDDTDMM+G+NSSGETQPR+YVD LSMGKSVRTIR GE VNEPPRRGREFSVNDQLKG
Sbjct: 361 DTGDDTDMMLGSNSSGETQPRNYVDSLSMGKSVRTIRPGEVVNEPPRRGREFSVNDQLKG 420
Query: 421 KTMVNENTHVQDFEENPLESPDEDKEELVEKLTMDTDVDEDDDDDMESEV-EGNSMVGKF 480
K M+NENTH+QDFEENP+E PDEDKEELVEKLT+DTDVD DDDDDMESE+ E NSMVGKF
Sbjct: 421 KMMMNENTHIQDFEENPIEFPDEDKEELVEKLTLDTDVD-DDDDDMESEIEENNSMVGKF 480
Query: 481 IREDNGEPFDVKRRNREDERGSSNEE--EEEEEEAGSCSNIGNDGGPDVDKKADEFIAKF 540
IREDNGEPFDVKRRNR+DERGS NEE EEEEEEAGS SNIGNDGGPDVDKKADEFIAKF
Sbjct: 481 IREDNGEPFDVKRRNRDDERGSRNEEEKEEEEEEAGSASNIGNDGGPDVDKKADEFIAKF 540
Query: 541 REQIRLQRIESFKRSSGQIRKNTTKQS 562
REQIRLQRIES KRSSGQIRKN TKQ+
Sbjct: 541 REQIRLQRIESIKRSSGQIRKNITKQT 566
BLAST of CSPI03G02500 vs. ExPASy TrEMBL
Match:
A0A6J1L1K4 (uncharacterized protein DDB_G0284459-like OS=Cucurbita maxima OX=3661 GN=LOC111500278 PE=4 SV=1)
HSP 1 Score: 731.1 bits (1886), Expect = 3.5e-207
Identity = 427/585 (72.99%), Postives = 474/585 (81.03%), Query Frame = 0
Query: 1 MAESDVLTPPQN----HSTPSPSKFNTHLFYKLITAIFFLLILPLVPSQAPEFVNQALLT 60
MAESDV P N +PSKFN+H+ YK++ AIFFL+ILPLVPSQAPEFVNQ LLT
Sbjct: 1 MAESDVPAKPPNLPPGKDQATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLT 60
Query: 61 RSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETP 120
R+WELLHLLFVGIAVSYGLFSRR+DEKED ISVS FDNVQSYVSGLLHVSSVFDDE ETP
Sbjct: 61 RTWELLHLLFVGIAVSYGLFSRRNDEKEDGISVSNFDNVQSYVSGLLHVSSVFDDEAETP 120
Query: 121 SAND------DENKVQTWNNRYFRNESVVVAEERPVDNEQRVRSEKPLLLPVRSLKSRVV 180
SAND D NKVQTW+NRYFRNES+VVAEE PV NEQRVRSEKPLLLPVRSL S+VV
Sbjct: 121 SANDESMSSSDGNKVQTWSNRYFRNESLVVAEESPVVNEQRVRSEKPLLLPVRSLNSQVV 180
Query: 181 VDDEFR----SKKRVSSRRLLSNLKRSSNVEFGGVNNLDEIDDKLNENFVLPSPVPWRSR 240
VDDE R S RVSS RLLSN KRSSN EFGG+ +L+ I+D LNEN VLPSPVPWRSR
Sbjct: 181 VDDESRTVSGSTSRVSSGRLLSNSKRSSNGEFGGL-SLEGIEDNLNENVVLPSPVPWRSR 240
Query: 241 SGRMEKQEEADNP-------SMEDSESNRIGSRSPKPQTSKSSRASAIPQKLS-PSPSPS 300
SGR E QEEADNP ME+SESN I SRS +PQTS+S +ASAI KLS PSPSP
Sbjct: 241 SGRTEVQEEADNPPVYSPAVPMEESESNWIDSRSSRPQTSRSFQASAI--KLSPPSPSPF 300
Query: 301 PRKPSPSHNVSPELQAKSAEDLVRKKSFYRS-PPPPPPPPPPRVRRTSSMKPSSWVNEDD 360
PRKPSPS NVSPEL+AKS+ED VRKKSF+ S PPPPPPPPPP VRR +SMKPSS +N++D
Sbjct: 301 PRKPSPSPNVSPELKAKSSEDSVRKKSFFPSPPPPPPPPPPPHVRRIASMKPSSLLNDND 360
Query: 361 VPHQKELRRSY-TSKPRTITRDTGDDTDMMIGANSSGETQPRHYVDGLSMGKSVRTIRAG 420
VPHQK+L+RS TSKPR RDTGDD DM++G NSS E PR+Y D LSMGKS+R IR G
Sbjct: 361 VPHQKDLKRSVTTSKPRRSIRDTGDDIDMVMGTNSSAEALPRNYDDILSMGKSIRKIRPG 420
Query: 421 EAVNEPPRRGREFSVNDQLKGKTMVNENTHVQDFEENPLESPDEDKEELVEKLTMDTDVD 480
E NEP RRGREF NDQLKGK M+++NTHVQ FEENP+E PD+DK+E VEKL M+T
Sbjct: 421 EVANEPTRRGREFGGNDQLKGK-MIDQNTHVQAFEENPIEFPDDDKKEPVEKLGMET--- 480
Query: 481 EDDDDDMESEVEGNSMVGKFIREDNGEPFDVKRRNREDERGSSNEEEEEEEEAGSCSNIG 540
DDDDDMESE E N+MVGKFIREDNGEPF+V R R++ER SSN EEAG SN+
Sbjct: 481 -DDDDDMESEEEDNNMVGKFIREDNGEPFNVNR--RDNERSSSN------EEAGGSSNLS 540
Query: 541 NDGGPDVDKKADEFIAKFREQIRLQRIESFKRSSGQIRKNTTKQS 562
NDGGPDVDKKADEFIAKFREQIRLQRIES KRS+GQIR+NT+KQS
Sbjct: 541 NDGGPDVDKKADEFIAKFREQIRLQRIESIKRSTGQIRRNTSKQS 569
BLAST of CSPI03G02500 vs. ExPASy TrEMBL
Match:
A0A6J1E6G0 (uncharacterized protein DDB_G0284459 OS=Cucurbita moschata OX=3662 GN=LOC111431041 PE=4 SV=1)
HSP 1 Score: 730.3 bits (1884), Expect = 6.0e-207
Identity = 422/587 (71.89%), Postives = 472/587 (80.41%), Query Frame = 0
Query: 1 MAESDVLTPPQN----HSTPSPSKFNTHLFYKLITAIFFLLILPLVPSQAPEFVNQALLT 60
MAESDV P N +PSKFN+H+ YK++ AIFFL+ILPLVPSQAPEFVNQ LLT
Sbjct: 1 MAESDVPAKPPNLPPRKDRATPSKFNSHILYKILMAIFFLVILPLVPSQAPEFVNQTLLT 60
Query: 61 RSWELLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETP 120
R+WELLHLLFVGIAVSYGLFSRR+DEKEDEISVS FDNVQSYVSGLLHVSSVFDDE ETP
Sbjct: 61 RTWELLHLLFVGIAVSYGLFSRRNDEKEDEISVSNFDNVQSYVSGLLHVSSVFDDEAETP 120
Query: 121 SAND------DENKVQTWNNRYFRNESVVVAEERPVDNEQRVRSEKPLLLPVRSLKSRVV 180
SAND D NKVQTW+NRYFRNESV V+EE PV NEQRVRSEKPLLLPVRSLKSRVV
Sbjct: 121 SANDESMSLSDGNKVQTWSNRYFRNESVAVSEESPVVNEQRVRSEKPLLLPVRSLKSRVV 180
Query: 181 VDDEFR----SKKRVSSRRLLSNLKRSSNVEFGGVNNLDEIDDKLNENFVLPSPVPWRSR 240
VDDE R S RVSSRRLLS+ KRSSN E GGV NL ++D NEN LPSPVPWRSR
Sbjct: 181 VDDESRTVSGSTSRVSSRRLLSDSKRSSNGEVGGV-NLGGVEDNFNENVALPSPVPWRSR 240
Query: 241 SGRMEKQEEADNP-------SMEDSESNRIGSRSPKPQTSKSSRASAI---PQKLSPSPS 300
SGR E QEEADNP ME+SESN I SRS +PQTS+SS+ASAI P SPSPS
Sbjct: 241 SGRTEVQEEADNPPMYSPAVPMEESESNWIDSRSSRPQTSRSSQASAIKLSPPSPSPSPS 300
Query: 301 PSPRKPSPSHNVSPELQAKSAEDLVRKKSFYRS-PPPPPPPPPPRVRRTSSMKPSSWVNE 360
PSPRKPSPS NVSPEL+AKS+E VRKKSF+ S PPPPPPPPPP VRR +SMKPSSW+N+
Sbjct: 301 PSPRKPSPSPNVSPELKAKSSEGSVRKKSFFPSPPPPPPPPPPPHVRRIASMKPSSWLND 360
Query: 361 DDVPHQKELRRSY-TSKPRTITRDTGDDTDMMIGANSSGETQPRHYVDGLSMGKSVRTIR 420
+DVPHQK+L+RS TSKPR+ R TGDD DM++G NSS E PR+Y D LSMGKS R IR
Sbjct: 361 NDVPHQKDLKRSVTTSKPRSSIRATGDDIDMVMGTNSSAEALPRNYDDSLSMGKSTRKIR 420
Query: 421 AGEAVNEPPRRGREFSVNDQLKGKTMVNENTHVQDFEENPLESPDEDKEELVEKLTMDTD 480
GE NEPPRRGREF DQLKGK M+++N HVQ FEENP+E P+++K+ELVEKL+M+T
Sbjct: 421 PGEVANEPPRRGREFGGYDQLKGK-MIDQNAHVQAFEENPIEFPNDNKKELVEKLSMET- 480
Query: 481 VDEDDDDDMESEVEGNSMVGKFIREDNGEPFDVKRRNREDERGSSNEEEEEEEEAGSCSN 540
DDDMES+ E N+MVGKFIREDNGEPF+V R R++ER SSN E EAGS SN
Sbjct: 481 -----DDDMESKEEDNNMVGKFIREDNGEPFNVNR--RDNERSSSN-----ELEAGSSSN 540
Query: 541 IGNDGGPDVDKKADEFIAKFREQIRLQRIESFKRSSGQIRKNTTKQS 562
+ NDGGPDVDKKADEFIAKFREQIRLQRIES KRS+GQIR+NT+KQ+
Sbjct: 541 LSNDGGPDVDKKADEFIAKFREQIRLQRIESIKRSTGQIRRNTSKQT 572
BLAST of CSPI03G02500 vs. NCBI nr
Match:
XP_011650387.1 (uncharacterized protein DDB_G0284459 [Cucumis sativus])
HSP 1 Score: 1050.8 bits (2716), Expect = 4.1e-303
Identity = 555/561 (98.93%), Postives = 557/561 (99.29%), Query Frame = 0
Query: 1 MAESDVLTPPQNHSTPSPSKFNTHLFYKLITAIFFLLILPLVPSQAPEFVNQALLTRSWE 60
MAESDVLTPPQNHSTPSPSKFNTHL YKLITAIFFLLILPLVPSQAPEFVNQ LLTRSWE
Sbjct: 1 MAESDVLTPPQNHSTPSPSKFNTHLLYKLITAIFFLLILPLVPSQAPEFVNQTLLTRSWE 60
Query: 61 LLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETPSAND 120
LLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETPSAND
Sbjct: 61 LLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETPSAND 120
Query: 121 DENKVQTWNNRYFRNESVVVAEERPVDNEQRVRSEKPLLLPVRSLKSRVVVDDEFRSKKR 180
DENKVQTWNNRYFRNESVVVAEERPVDNEQRVRSEKPLLLPVRSLKSRVVVDDEFRSKKR
Sbjct: 121 DENKVQTWNNRYFRNESVVVAEERPVDNEQRVRSEKPLLLPVRSLKSRVVVDDEFRSKKR 180
Query: 181 VSSRRLLSNLKRSSNVEFGGVNNLDEIDDKLNENFVLPSPVPWRSRSGRMEKQEEADNPS 240
VSSRRLLSNLKRSSNVEFGGVNNLDEIDDKLNENFVLPSPVPWRSRSGRMEKQEEADNPS
Sbjct: 181 VSSRRLLSNLKRSSNVEFGGVNNLDEIDDKLNENFVLPSPVPWRSRSGRMEKQEEADNPS 240
Query: 241 MEDSESNRIGSRSPKPQTSKSSRASAIPQKLSPSPSPSPRKPSPSHNVSPELQAKSAEDL 300
MEDSESNRIGSRSPKPQTSKSSRASAIPQ+LSPSPSPSPRKPSPSHNVSPELQAKSAEDL
Sbjct: 241 MEDSESNRIGSRSPKPQTSKSSRASAIPQRLSPSPSPSPRKPSPSHNVSPELQAKSAEDL 300
Query: 301 VRKKSFYRSPPPPPPPPPPRVRRTSSMKPSSWVNEDDVPHQKELRRSYTSKPRTITRDTG 360
VRKKSFYRSPPPPPPPPPPRVRRTSSMKPSSWVNEDDVPHQKELRRSYTSKPRTITRDTG
Sbjct: 301 VRKKSFYRSPPPPPPPPPPRVRRTSSMKPSSWVNEDDVPHQKELRRSYTSKPRTITRDTG 360
Query: 361 DDTDMMIGANSSGETQPRHYVDGLSMGKSVRTIRAGEAVNEPPRRGREFSVNDQLKGKTM 420
DDTDMMIGANSSGETQPRHYVDGLSMGKSVRTIRAGEAVNEPPRRGREFSVNDQLKGKTM
Sbjct: 361 DDTDMMIGANSSGETQPRHYVDGLSMGKSVRTIRAGEAVNEPPRRGREFSVNDQLKGKTM 420
Query: 421 VNENTHVQDFEENPLESPDEDKEELVEKLTMDTDVDEDDDDDMESEVEGNSMVGKFIRED 480
+NENTHVQDFEENPLESPDEDKEELVEKLTMDTDVDEDDDDDMESEVEGNSMVGKFIRED
Sbjct: 421 MNENTHVQDFEENPLESPDEDKEELVEKLTMDTDVDEDDDDDMESEVEGNSMVGKFIRED 480
Query: 481 NGEPFDVKRRNREDERGSSNEEEEEEEEAGSCSNIGNDGGPDVDKKADEFIAKFREQIRL 540
NGEPFDVKRRNREDERGSSN EEEEEEEAGS SNIGNDGGPDVDKKADEFIAKFREQIRL
Sbjct: 481 NGEPFDVKRRNREDERGSSN-EEEEEEEAGSSSNIGNDGGPDVDKKADEFIAKFREQIRL 540
Query: 541 QRIESFKRSSGQIRKNTTKQS 562
QRIESFKRSSGQIRKNTTKQS
Sbjct: 541 QRIESFKRSSGQIRKNTTKQS 560
BLAST of CSPI03G02500 vs. NCBI nr
Match:
KAA0041069.1 (putative Hydroxyproline-rich glycoprotein family protein [Cucumis melo var. makuwa] >TYK12039.1 putative Hydroxyproline-rich glycoprotein family protein [Cucumis melo var. makuwa])
HSP 1 Score: 985.3 bits (2546), Expect = 2.1e-283
Identity = 526/568 (92.61%), Postives = 544/568 (95.77%), Query Frame = 0
Query: 1 MAESDVLTPPQNHSTPSPSKFNTHLFYKLITAIFFLLILPLVPSQAPEFVNQALLTRSWE 60
MAESDVLTPPQNHSTPSPSKFNTHLFYKL+TA+FFLLILPLVPSQAPEFVNQ LLTRSWE
Sbjct: 1 MAESDVLTPPQNHSTPSPSKFNTHLFYKLMTAVFFLLILPLVPSQAPEFVNQTLLTRSWE 60
Query: 61 LLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETPSAND 120
LLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETPSAND
Sbjct: 61 LLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETPSAND 120
Query: 121 DENKVQTWNNRYFRNESVVVAEERPVDNEQRVRSEKPLLLPVRSLKSRVVVDDEFRSKKR 180
DENKVQTWNNRYFRNESVVVAEERPV NEQRVRSEKPLLLPVRSLKSRV+VDDE RSKKR
Sbjct: 121 DENKVQTWNNRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVIVDDESRSKKR 180
Query: 181 VSSRRLLSNLKRSSNVEFGGVNNLDEIDDKLNENFVLPSPVPWRSRSGRMEKQEEADNPS 240
VSSRRLLSNLKR+SNVEFGGVNNLDEIDDKLNEN VLPSPVPWRSRSGR+EKQEEADNPS
Sbjct: 181 VSSRRLLSNLKRTSNVEFGGVNNLDEIDDKLNENVVLPSPVPWRSRSGRLEKQEEADNPS 240
Query: 241 MEDSESNRIGSRSPKPQTSKSSRASAIPQKL--SPSPSPSPRKPSPSHNVSPELQAKSAE 300
MEDSESNRIGSRSPKPQTSK+SRASAIPQKL SPSPSPSPRKPSPSHNVSPELQAKSAE
Sbjct: 241 MEDSESNRIGSRSPKPQTSKASRASAIPQKLSPSPSPSPSPRKPSPSHNVSPELQAKSAE 300
Query: 301 DLVRKKSFYRS--PPPPPPPPPPRVRRTSSMKPSSWVNEDDVPHQKELRRSYTSKPRTIT 360
DLVRKKSFYRS PPPPPPPPPPRVRRTSSMKPSSWVNEDD+PHQKELRRS+TSKPR I
Sbjct: 301 DLVRKKSFYRSPPPPPPPPPPPPRVRRTSSMKPSSWVNEDDIPHQKELRRSFTSKPRAII 360
Query: 361 RDTGDDTDMMIGANSSGETQPRHYVDGLSMGKSVRTIRAGEAVNEPPRRGREFSVNDQLK 420
RDTGDDTDMM+G+NSSGETQPR+YVD LSMGKSVRTIR GE VNEPPRRGREFSVNDQLK
Sbjct: 361 RDTGDDTDMMLGSNSSGETQPRNYVDSLSMGKSVRTIRPGEVVNEPPRRGREFSVNDQLK 420
Query: 421 GKTMVNENTHVQDFEENPLESPDEDKEELVEKLTMDTDVDEDDDDDMESEV-EGNSMVGK 480
GK M+NENTH+QDFEENP+E PDEDKEELVEKLT+DTDVD DDDDDMESE+ E NSMVGK
Sbjct: 421 GKMMMNENTHIQDFEENPIEFPDEDKEELVEKLTLDTDVD-DDDDDMESEIEENNSMVGK 480
Query: 481 FIREDNGEPFDVKRRNREDERGSSNEE--EEEEEEAGSCSNIGNDGGPDVDKKADEFIAK 540
FIREDNGEPFDVKRRNR+DERGS NEE EEEEEEAGS SNIGNDGGPDVDKKADEFIAK
Sbjct: 481 FIREDNGEPFDVKRRNRDDERGSRNEEEKEEEEEEAGSASNIGNDGGPDVDKKADEFIAK 540
Query: 541 FREQIRLQRIESFKRSSGQIRKNTTKQS 562
FREQIRLQRIES KRSSGQIRKN TKQ+
Sbjct: 541 FREQIRLQRIESIKRSSGQIRKNITKQT 567
BLAST of CSPI03G02500 vs. NCBI nr
Match:
XP_016900571.1 (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein DDB_G0284459 [Cucumis melo])
HSP 1 Score: 970.7 bits (2508), Expect = 5.4e-279
Identity = 520/567 (91.71%), Postives = 538/567 (94.89%), Query Frame = 0
Query: 1 MAESDVLTPPQNHSTPSPSKFNTHLFYKLITAIFFLLILPLVPSQAPEFVNQALLTRSWE 60
MAESDVLTPPQNHSTPSPSKFNTHLFYKL+TA+FFLLILPLVPSQAPEFVNQ LLTRSWE
Sbjct: 1 MAESDVLTPPQNHSTPSPSKFNTHLFYKLMTAVFFLLILPLVPSQAPEFVNQTLLTRSWE 60
Query: 61 LLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETPSAND 120
LLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETPSAND
Sbjct: 61 LLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETPSAND 120
Query: 121 DENKVQTWNNRYFRNESVVVAEERPVDNEQRVRSEKPLLLPVRSLKSRVVVDDEFRSKKR 180
DENKVQTWNNRYFRNESVVVAEERPV NEQRVRSEKPLLLPVRSLKSRV+VDDE RSKKR
Sbjct: 121 DENKVQTWNNRYFRNESVVVAEERPVVNEQRVRSEKPLLLPVRSLKSRVIVDDESRSKKR 180
Query: 181 VSSRRLLSNLKRSSNVEFGGVNNLDEIDDKLNENFVLPSPVPWRSRSGRMEKQEEADNPS 240
VSSRRLLSNLKR+SNVEFGGVNNLDEIDDKLNEN VLPSPVPWRSRSGR+EKQEEADNPS
Sbjct: 181 VSSRRLLSNLKRTSNVEFGGVNNLDEIDDKLNENVVLPSPVPWRSRSGRLEKQEEADNPS 240
Query: 241 MEDSESNRIGSRSPKPQTSKSSRASAIPQKL--SPSPSPSPRKPSPSHNVSPELQAKSAE 300
MEDSESNRIGSRSPKPQTSK+SRASAIPQKL SPSPSPSPRKPSPSHNVSPELQAKSAE
Sbjct: 241 MEDSESNRIGSRSPKPQTSKASRASAIPQKLSPSPSPSPSPRKPSPSHNVSPELQAKSAE 300
Query: 301 DLVRKKSFYRSPPPPPPPP-PPRVRRTSSMKPSSWVNEDDVPHQKELRRSYTSKPRTITR 360
DLVRKKSFYRSPPPPPPPP P V SMKPSSWVNEDD+PHQKELRRS+TSKPR I R
Sbjct: 301 DLVRKKSFYRSPPPPPPPPHPHHVFEELSMKPSSWVNEDDIPHQKELRRSFTSKPRAIIR 360
Query: 361 DTGDDTDMMIGANSSGETQPRHYVDGLSMGKSVRTIRAGEAVNEPPRRGREFSVNDQLKG 420
DTGDDTDMM+G+NSSGETQPR+YVD LSMGKSVRTIR GE VNEPPRRGREFSVNDQLKG
Sbjct: 361 DTGDDTDMMLGSNSSGETQPRNYVDSLSMGKSVRTIRPGEVVNEPPRRGREFSVNDQLKG 420
Query: 421 KTMVNENTHVQDFEENPLESPDEDKEELVEKLTMDTDVDEDDDDDMESEV-EGNSMVGKF 480
K M+NENTH+QDFEENP+E PDEDKEELVEKLT+DTDVD DDDDDMESE+ E NSMVGKF
Sbjct: 421 KMMMNENTHIQDFEENPIEFPDEDKEELVEKLTLDTDVD-DDDDDMESEIEENNSMVGKF 480
Query: 481 IREDNGEPFDVKRRNREDERGSSNEE--EEEEEEAGSCSNIGNDGGPDVDKKADEFIAKF 540
IREDNGEPFDVKRRNR+DERGS NEE EEEEEEAGS SNIGNDGGPDVDKKADEFIAKF
Sbjct: 481 IREDNGEPFDVKRRNRDDERGSRNEEEKEEEEEEAGSASNIGNDGGPDVDKKADEFIAKF 540
Query: 541 REQIRLQRIESFKRSSGQIRKNTTKQS 562
REQIRLQRIES KRSSGQIRKN TKQ+
Sbjct: 541 REQIRLQRIESIKRSSGQIRKNITKQT 566
BLAST of CSPI03G02500 vs. NCBI nr
Match:
XP_038905604.1 (uncharacterized protein DDB_G0284459 [Benincasa hispida])
HSP 1 Score: 866.3 bits (2237), Expect = 1.4e-247
Identity = 477/568 (83.98%), Postives = 508/568 (89.44%), Query Frame = 0
Query: 1 MAESDVLTPPQNHSTPSPSKFNTHLFYKLITAIFFLLILPLVPSQAPEFVNQALLTRSWE 60
MAES+VL PP N P+PSKF+TH+ YK++TAIFFLLILPLVPSQAPEFVNQ LLTRSWE
Sbjct: 1 MAESEVLAPPLNQ--PTPSKFHTHILYKVVTAIFFLLILPLVPSQAPEFVNQTLLTRSWE 60
Query: 61 LLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETPSAND 120
LLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETPSAND
Sbjct: 61 LLHLLFVGIAVSYGLFSRRSDEKEDEISVSKFDNVQSYVSGLLHVSSVFDDEPETPSAND 120
Query: 121 ------DENKVQTWNNRYFRNESVVVAEERPVDNEQRVRSEKPLLLPVRSLKSRVVVDDE 180
DENKV TWN+RYFRNESVVVAEERP NEQRVRSEKPLLLPVRSLKSRVVVDDE
Sbjct: 121 ESISSSDENKVHTWNSRYFRNESVVVAEERPAVNEQRVRSEKPLLLPVRSLKSRVVVDDE 180
Query: 181 FRSKKRVSSRRLLSNLKRSSNVEFGGVNNLDEIDDKLNENFVLPSPVPWRSRSGRMEKQE 240
R K RVSSRRLLSNLKRSSN EFGGV +L+EI+DKLNEN VLPSPVPWRSRSGRME QE
Sbjct: 181 SRYKPRVSSRRLLSNLKRSSNGEFGGV-SLEEIEDKLNENVVLPSPVPWRSRSGRMEVQE 240
Query: 241 EADNPSMEDSESNRIGSRSPKPQTSKSSRASAIPQKLSPSPSPSPRKPSPSHNVSPELQA 300
EAD PSMEDSESNRI SRSP+PQ S+SSRASAI QK SPSPSPSPRKPSPSHNVSPE QA
Sbjct: 241 EADIPSMEDSESNRIDSRSPRPQASRSSRASAISQKPSPSPSPSPRKPSPSHNVSPESQA 300
Query: 301 KSAEDLVRKKSFYRS-PPPPPPPPPPRVRRTSSMKPSSWVNEDDVPHQKELRRSYTSKPR 360
KSAEDLVRKKSFYRS PPPPPPPPPP VRR SSMKPSSWVNE++VPHQKEL+RS TSKPR
Sbjct: 301 KSAEDLVRKKSFYRSPPPPPPPPPPPHVRRISSMKPSSWVNENNVPHQKELKRSLTSKPR 360
Query: 361 TITRDTGDDTDMMIGANSSGETQPRHYVDGLSMGKSVRTIRAGEAVNEPPRRGREFSVND 420
++ RDTGDDTD+MIGANSSGE R+YVD LSMGKSVRTIR GE VNEPPRRGREF ND
Sbjct: 361 SLIRDTGDDTDVMIGANSSGEALSRNYVDNLSMGKSVRTIRPGEVVNEPPRRGREFGGND 420
Query: 421 QLKGKTMVNENTHVQDFEENPLESPDEDKEELVEKLTMDTDVDEDDDDDMESEVEGNSMV 480
QLKGK M+++NTHVQDFEENP+E PDEDKEELVEKLTMDT D+DDDDDMESE E N+MV
Sbjct: 421 QLKGK-MMDQNTHVQDFEENPIEFPDEDKEELVEKLTMDT--DDDDDDDMESEEENNNMV 480
Query: 481 GKFIREDNGEPFDVKRRNREDERGSSNEEEEEEEEAGSCSNIGNDGGPDVDKKADEFIAK 540
G+FIREDNGEPFDVK R+R+D R SSN EEEAGS SN+ NDGGPDVDKKADEFIAK
Sbjct: 481 GRFIREDNGEPFDVKLRDRDDGRVSSN-----EEEAGSSSNMANDGGPDVDKKADEFIAK 540
Query: 541 FREQIRLQRIESFKRSSGQIRKNTTKQS 562
FREQIRLQRIES KRSSGQIR+NT KQ+
Sbjct: 541 FREQIRLQRIESIKRSSGQIRRNTAKQT 557
BLAST of CSPI03G02500 vs. NCBI nr
Match:
KAE8650053.1 (hypothetical protein Csa_010234 [Cucumis sativus])
HSP 1 Score: 813.1 bits (2099), Expect = 1.4e-231
Identity = 433/438 (98.86%), Postives = 436/438 (99.54%), Query Frame = 0
Query: 124 KVQTWNNRYFRNESVVVAEERPVDNEQRVRSEKPLLLPVRSLKSRVVVDDEFRSKKRVSS 183
+VQTWNNRYFRNESVVVAEERPVDNEQRVRSEKPLLLPVRSLKSRVVVDDEFRSKKRVSS
Sbjct: 14 QVQTWNNRYFRNESVVVAEERPVDNEQRVRSEKPLLLPVRSLKSRVVVDDEFRSKKRVSS 73
Query: 184 RRLLSNLKRSSNVEFGGVNNLDEIDDKLNENFVLPSPVPWRSRSGRMEKQEEADNPSMED 243
RRLLSNLKRSSNVEFGGVNNLDEIDDKLNENFVLPSPVPWRSRSGRMEKQEEADNPSMED
Sbjct: 74 RRLLSNLKRSSNVEFGGVNNLDEIDDKLNENFVLPSPVPWRSRSGRMEKQEEADNPSMED 133
Query: 244 SESNRIGSRSPKPQTSKSSRASAIPQKLSPSPSPSPRKPSPSHNVSPELQAKSAEDLVRK 303
SESNRIGSRSPKPQTSKSSRASAIPQ+LSPSPSPSPRKPSPSHNVSPELQAKSAEDLVRK
Sbjct: 134 SESNRIGSRSPKPQTSKSSRASAIPQRLSPSPSPSPRKPSPSHNVSPELQAKSAEDLVRK 193
Query: 304 KSFYRSPPPPPPPPPPRVRRTSSMKPSSWVNEDDVPHQKELRRSYTSKPRTITRDTGDDT 363
KSFYRSPPPPPPPPPPRVRRTSSMKPSSWVNEDDVPHQKELRRSYTSKPRTITRDTGDDT
Sbjct: 194 KSFYRSPPPPPPPPPPRVRRTSSMKPSSWVNEDDVPHQKELRRSYTSKPRTITRDTGDDT 253
Query: 364 DMMIGANSSGETQPRHYVDGLSMGKSVRTIRAGEAVNEPPRRGREFSVNDQLKGKTMVNE 423
DMMIGANSSGETQPRHYVDGLSMGKSVRTIRAGEAVNEPPRRGREFSVNDQLKGKTM+NE
Sbjct: 254 DMMIGANSSGETQPRHYVDGLSMGKSVRTIRAGEAVNEPPRRGREFSVNDQLKGKTMMNE 313
Query: 424 NTHVQDFEENPLESPDEDKEELVEKLTMDTDVDEDDDDDMESEVEGNSMVGKFIREDNGE 483
NTHVQDFEENPLESPDEDKEELVEKLTMDTDVDEDDDDDMESEVEGNSMVGKFIREDNGE
Sbjct: 314 NTHVQDFEENPLESPDEDKEELVEKLTMDTDVDEDDDDDMESEVEGNSMVGKFIREDNGE 373
Query: 484 PFDVKRRNREDERGSSNEEEEEEEEAGSCSNIGNDGGPDVDKKADEFIAKFREQIRLQRI 543
PFDVKRRNREDERGSSN EEEEEEEAGS SNIGNDGGPDVDKKADEFIAKFREQIRLQRI
Sbjct: 374 PFDVKRRNREDERGSSN-EEEEEEEAGSSSNIGNDGGPDVDKKADEFIAKFREQIRLQRI 433
Query: 544 ESFKRSSGQIRKNTTKQS 562
ESFKRSSGQIRKNTTKQS
Sbjct: 434 ESFKRSSGQIRKNTTKQS 450
BLAST of CSPI03G02500 vs. TAIR 10
Match:
AT4G16790.1 (hydroxyproline-rich glycoprotein family protein )
HSP 1 Score: 145.2 bits (365), Expect = 1.6e-34
Identity = 187/569 (32.86%), Postives = 245/569 (43.06%), Query Frame = 0
Query: 12 NHSTPSPSKFNTHLFYKLITAIFFLLILPLVPSQAPEFVNQALLTRSWELLHLLFVGIAV 71
N +P KF + +K + ++P+ SQ PE NQ TR ELLHL+FVGIAV
Sbjct: 15 NKEDQNPRKFYSRFIFKALILTVLCAVVPVFLSQTPELANQ---TRLLELLHLVFVGIAV 74
Query: 72 SYGLFSRRSDEKEDEISVSKFD---------NVQSYVSGLLHVSSVFD--DEPETPSAND 131
SYGLFSRR+ + S D N SYV +L VSSVF+ E E+ ++D
Sbjct: 75 SYGLFSRRNYDGGGGGGTSNSDHNKADHSNNNSHSYVPKILEVSSVFNVGHESESEPSDD 134
Query: 132 ---DENKVQTWNNRYFRNESVVVAEERPVDNEQRVRSEKPLLLPVRSLK-SRV---VVDD 191
D+ K QTW N+Y + + E R VD EKPLLLPVRSL SRV D+
Sbjct: 135 SSGDQRKFQTWKNKY--HMKIPEVETRFVDRVSSENREKPLLLPVRSLNYSRVSDSSGDN 194
Query: 192 EFRSKKRVSSRRLLSNLKRSSNVEFGGVNNLDEIDDKLNENFVLPSPVPWRSRSGRMEKQ 251
R +K S R LL L G +N D VLPSP+PWRSRS
Sbjct: 195 SGRWEKVRSKRELLKTL---------GDDNSD----------VLPSPIPWRSRSS----- 254
Query: 252 EEADNPSMEDSESNRIGSRSPKPQTSKSSRASAIPQKLSPSPSPSPRKPSPSHNVSPELQ 311
S S S + S + I PS SPRK +P N++ E
Sbjct: 255 ------SSSSSSSKEVESLPSVKNLTTVESQPLIKNLTPPSSFSSPRKSNPIPNLASE-- 314
Query: 312 AKSAEDLVRKKSFYRSPPPPPPPPP--PRVRRTSSMKPSSWVNEDDVPHQKELRRSYTSK 371
F+ SPPPPPPPPP P +SS K + ++ E R S K
Sbjct: 315 ------------FHPSPPPPPPPPPPLPAFYNSSSRKDHPGI------YRVERRESSVHK 374
Query: 372 PRTITRDTGDDTDMMIGANSSGETQPRHYVDGLSMGKSVRTIRAGEAVNEPPRRGREFSV 431
+ + GE P PP E+
Sbjct: 375 TKF----------------AGGEFHP--------------------PPPPPPPPPVEYYK 434
Query: 432 NDQLKGKTMVNENTHVQDFEENPLESPDEDKEELVEKLTMDTDVDEDDDDDMESEVEGNS 491
+ K + E S + K +K+ + E + D E + ++
Sbjct: 435 SPPTKFRLS----------NERRKSSEQKMKRNAPKKVWWSDPIVESKEQDTEKNDQRSN 473
Query: 492 MVGKFIRE-DNGEPFDVKRRNREDERGSSNEEEEEEEEAGSCSNIGNDGGPDVDKKADEF 551
+ K + E +NGE +R E+E E++ EEE S I N G DVDKKADEF
Sbjct: 495 LGSKAVEESENGE-----QRRGENEIHDEVEKKIVEEE--GVSEINN--GSDVDKKADEF 473
Query: 552 IAKFREQIRLQRIESFKRSSGQIRKNTTK 560
IAKFREQIRLQRIES KRS+ +I N+++
Sbjct: 555 IAKFREQIRLQRIESIKRSTNKISANSSR 473
BLAST of CSPI03G02500 vs. TAIR 10
Match:
AT3G60380.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT4G16790.1); Has 6102 Blast hits to 3981 proteins in 424 species: Archae - 6; Bacteria - 372; Metazoa - 2603; Fungi - 655; Plants - 291; Viruses - 28; Other Eukaryotes - 2147 (source: NCBI BLink). )
HSP 1 Score: 114.8 bits (286), Expect = 2.3e-25
Identity = 116/367 (31.61%), Postives = 165/367 (44.96%), Query Frame = 0
Query: 26 FYKLITAIFFLLILPLVPSQAPEFVNQALLTRSWELLHLLFVGIAVSYGLFSRRSDEKED 85
F K + FLL LPL PSQAP+FV + +LT+ WEL+HLLFVGIAV+YGLFSRR+ E
Sbjct: 33 FCKSVLFALFLLALPLFPSQAPDFVGETVLTKFWELIHLLFVGIAVAYGLFSRRNVESAV 92
Query: 86 EISVSKFDNVQ-SYVSGLLHVSSVFDD--------------------------------- 145
++ +++ D SYVS + VSSVFD+
Sbjct: 93 DLRMTRVDESSLSYVSRIFQVSSVFDEEFDDNSCEFVDVRSDESVSARASVVGKSESFVV 152
Query: 146 ---EPETPSANDDENKVQTWNNRYFRNESVVVAEERPVDNEQRVRSEKPLLLPVRSLKSR 205
E E S + N+V+ WN++YF+ +S VV RP +PL LP+R L+S
Sbjct: 153 ESGELEESSEFGETNEVRAWNSQYFQGKSKVVV-ARPAYGLDGHVVHQPLGLPIRRLRS- 212
Query: 206 VVVDDEFRSKKRVSSRRLLSNLKRSSNVEFGGVNNLDEIDDKLNENFVLP-SPVPWRSRS 265
R + + + + N E + D+ +E P SPVPW++R
Sbjct: 213 -----SLRDNAALQDKSFADSCDGAVNAEAESL----LADNFFDEVLAAPASPVPWQARP 272
Query: 266 GRM---EKQEEADNPSMEDSESNRIGSRSPKPQTSKSSRASAIPQKLSPSPSPSPRKPSP 325
M + P D I SRS +S++S AS + SPS S S S
Sbjct: 273 EMMGIGDNYPSNFQPISVDETLKSISSRSTGSSSSQTSYASQNQNRFSPSRSVSAE--SL 332
Query: 326 SHNVSPELQAKSAEDLVRKKSFYRSPPP--PPPPPPPRVRRTSSMKPSSWVNEDDVPHQK 350
+ NV ++ KS + R S P P P PP P + + + S + DD P +
Sbjct: 333 NSNVEELVKEKSRQSSSRSSSPSLPPSPSLSPSPPSPELVPNDTRRRSPELVTDDTPRRA 386
HSP 2 Score: 25.4 bits (54), Expect = 1.8e+02
Identity = 43/151 (28.48%), Postives = 71/151 (47.02%), Query Frame = 0
Query: 403 PRRGREFSVNDQLKGKTM--VNENTHVQDFEENPLESPDEDKEELVEKLTMDTDVDEDDD 462
PR R S N L+GK++ + + H +D + + S D + ++ + + ++
Sbjct: 587 PRSWRA-SSNVSLRGKSVRTIRSDRHGKDVKTDGDSSEDRAEAKVESRGRTKSRRPRQEE 646
Query: 463 DDMESEVEGNSMVGKFIREDNGEPFDVKRRNREDERGSSNEEEEEEEEAGSCSNIGNDGG 522
+ E +S EP +V + E+ EEEEE A + +
Sbjct: 647 LSIVLHQEKSSET-----RAKSEPEEVAMEEPQAEQQPEVTFEEEEEAAWESQSNASHDH 706
Query: 523 PDVDKKADEFIAKFREQIRLQRIESFKRSSG 552
+VD+KA EFIAKFREQIRLQ++ S ++ G
Sbjct: 707 NEVDRKAGEFIAKFREQIRLQKLISGEQPRG 731
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A0A0L1H7 | 2.0e-303 | 98.93 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G019360 PE=4 SV=1 | [more] |
A0A5A7TIC6 | 1.0e-283 | 92.61 | Putative Hydroxyproline-rich glycoprotein family protein OS=Cucumis melo var. ma... | [more] |
A0A1S4DX64 | 2.6e-279 | 91.71 | LOW QUALITY PROTEIN: uncharacterized protein DDB_G0284459 OS=Cucumis melo OX=365... | [more] |
A0A6J1L1K4 | 3.5e-207 | 72.99 | uncharacterized protein DDB_G0284459-like OS=Cucurbita maxima OX=3661 GN=LOC1115... | [more] |
A0A6J1E6G0 | 6.0e-207 | 71.89 | uncharacterized protein DDB_G0284459 OS=Cucurbita moschata OX=3662 GN=LOC1114310... | [more] |
Match Name | E-value | Identity | Description | |
XP_011650387.1 | 4.1e-303 | 98.93 | uncharacterized protein DDB_G0284459 [Cucumis sativus] | [more] |
KAA0041069.1 | 2.1e-283 | 92.61 | putative Hydroxyproline-rich glycoprotein family protein [Cucumis melo var. maku... | [more] |
XP_016900571.1 | 5.4e-279 | 91.71 | PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein DDB_G0284459 [Cucumis me... | [more] |
XP_038905604.1 | 1.4e-247 | 83.98 | uncharacterized protein DDB_G0284459 [Benincasa hispida] | [more] |
KAE8650053.1 | 1.4e-231 | 98.86 | hypothetical protein Csa_010234 [Cucumis sativus] | [more] |
Match Name | E-value | Identity | Description | |
AT4G16790.1 | 1.6e-34 | 32.86 | hydroxyproline-rich glycoprotein family protein | [more] |
AT3G60380.1 | 2.3e-25 | 31.61 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow... | [more] |