Homology
BLAST of CSPI05G20150 vs. ExPASy Swiss-Prot
Match:
Q9LI74 (Protein CHUP1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CHUP1 PE=1 SV=1)
HSP 1 Score: 658.7 bits (1698), Expect = 9.0e-188
Identity = 456/999 (45.65%), Positives = 599/999 (59.96%), Query Frame = 0
Query: 1 MMNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGE--DVKKNVK---QVHQKII 60
M RI VVA SIA +K+L ++ + S+NGE D +++V ++ K +
Sbjct: 1 MFVRIGFVVAASIAAVTVKRLNVKP-----SKPSKPSDNGEGGDKEQSVDPDYNLNDKNL 60
Query: 61 RGLEEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLL-----LPQRNSENWLLDD 120
+ EEEEEEE I+ +Q G SD D D EF+ LL P + +N L +
Sbjct: 61 QEEEEEEEEEVKLINSVINQTRGSFSDYLD-DDILPEFEDLLSGEIEYPLPDDDNNL--E 120
Query: 121 NRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQL 180
++E+ E + + ELERL +L+ ELEER+VKLEGEL+ G+K E+D++EL++QL
Sbjct: 121 KAEKERKYEVEMAYNDGELERLKQLVKELEEREVKLEGELLEYYGLKEQESDIVELQRQL 180
Query: 181 DAKNDDISMLNNTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTK 240
K +I MLN TI+SLQAERK L+EE+ + +++KELE R KIKELQR+IQLDANQTK
Sbjct: 181 KIKTVEIDMLNITINSLQAERKKLQEELSQNGIVRKELEVARNKIKELQRQIQLDANQTK 240
Query: 241 ERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTS 300
+LLLLKQ VS+LQ KEEEA+ K+ E+ +K KA +D EV+ ELK KNRELQHE +EL+
Sbjct: 241 GQLLLLKQHVSSLQMKEEEAMNKDTEVERKLKAVQDLEVQVMELKRKNRELQHEKRELSI 300
Query: 301 KLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLR 360
KL+ +ARI TL+ MTE++ + K REE LK NEDL+KQ+EGLQMNRFSEVEELVYLR
Sbjct: 301 KLDSAEARIATLSNMTESDKVAKVREEVNNLKHNNEDLLKQVEGLQMNRFSEVEELVYLR 360
Query: 361 WINACLRYELRNNQIPAGE-SARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNF 420
W+NACLRYELRN Q PAG+ SAR L+K+ SPKS+ KAK+LMLEYAG E G+ +TD ESN+
Sbjct: 361 WVNACLRYELRNYQTPAGKISARDLSKNLSPKSQAKAKRLMLEYAGSERGQGDTDLESNY 420
Query: 421 SHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPS------- 480
S P S D+ +N S+DSS SR SSF +KP LKK +++ SS S PS
Sbjct: 421 SQPSSPGSDDFDNASMDSSTSRFSSFSKKPGLIQKLKKWGKSKDDSSVQSSPSRSFYGGS 480
Query: 481 ---TIDSSHRWKDPLEAVMALSA--------------------ETLTLSEVRLQ---VSS 540
S ++ + PLE++M +A ET L +R Q S
Sbjct: 481 PGRLSSSMNKQRGPLESLMIRNAGESVAITTFGQVDQESPGTPETPNLPRIRTQQQASSP 540
Query: 541 RKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAK------- 600
+ +NSVA SF +MSKSV+ L +KY YK+ HKLA+ EK IK K + RA+
Sbjct: 541 GEGLNSVAASFHVMSKSVDNVLDEKYPAYKDRHKLAVEREKHIKHKADQARAERFGGNVA 600
Query: 601 --------------------SSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCE-- 660
++GD S+ + E + +NA V K+ + + +
Sbjct: 601 LPPKLAQLKEKRVVVPSVITATGDQSNESNESNEGKASENAATVTKMKLVDIEKRPPRVP 660
Query: 661 ---PDSQYDNNSTNFISS---------------------PTSSGG--------------- 720
P S STN S+ P GG
Sbjct: 661 RPPPRSAGGGKSTNLPSARPPLPGGGPPPPPPPPGGGPPPPPGGGPPPPPPPPGALGRGA 720
Query: 721 ----EVHRGSELVQFNRKMMKPE------------------------------------- 780
+VHR ELV+F + +MK E
Sbjct: 721 GGGNKVHRAPELVEFYQSLMKRESKKEGAPSLISSGTGNSSAARNNMIGEIENRSTFLLA 780
Query: 781 VKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKR 840
VK +ETQ D V +LA EVR +SF+++ED+++FV WLDE+LS LVD +L+HFDWP+
Sbjct: 781 VKADVETQGD-FVQSLATEVRASSFTDIEDLLAFVSWLDEELSFLVDERAVLKHFDWPEG 840
Query: 841 KTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDT 847
K DALREAAF YQ LMKL ++V+SFVD+P L+CE AL KM LL+KVEQSVYALL+TRD
Sbjct: 841 KADALREAAFEYQDLMKLEKQVTSFVDDPNLSCEPALKKMYKLLEKVEQSVYALLRTRDM 900
BLAST of CSPI05G20150 vs. ExPASy TrEMBL
Match:
A0A0A0KT25 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G577430 PE=4 SV=1)
HSP 1 Score: 1545.0 bits (3999), Expect = 0.0e+00
Identity = 834/845 (98.70%), Positives = 836/845 (98.93%), Query Frame = 0
Query: 2 MNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEE 61
MNRISVVVAVSIA YAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQ GLEEE
Sbjct: 1 MNRISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQ-------GLEEE 60
Query: 62 EEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDDNRKEEKVPEFL 121
EEEEANSISD TSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDDNRKEEKVPEFL
Sbjct: 61 EEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDDNRKEEKVPEFL 120
Query: 122 IENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLN 181
IENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLN
Sbjct: 121 IENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLN 180
Query: 182 NTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVS 241
NTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQR+IQLDANQTKERLLLLKQRVS
Sbjct: 181 NTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVS 240
Query: 242 TLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT 301
TLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT
Sbjct: 241 TLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT 300
Query: 302 LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELR 361
LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELR
Sbjct: 301 LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELR 360
Query: 362 NNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLE 421
NNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLE
Sbjct: 361 NNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLE 420
Query: 422 NTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALS 481
NTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALS
Sbjct: 421 NTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALS 480
Query: 482 AETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKE 541
AETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKE
Sbjct: 481 AETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKE 540
Query: 542 KVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCEPDSQYDNNSTNFIS 601
KVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMN+ISCEPDSQYDNNSTNFIS
Sbjct: 541 KVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCEPDSQYDNNSTNFIS 600
Query: 602 SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSF 661
SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSF
Sbjct: 601 SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSF 660
Query: 662 VIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCE 721
VIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCE
Sbjct: 661 VIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCE 720
Query: 722 VALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARK 781
VALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARK
Sbjct: 721 VALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARK 780
Query: 782 YMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTE 841
YMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTE
Sbjct: 781 YMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTE 838
Query: 842 TGQQN 847
TGQQN
Sbjct: 841 TGQQN 838
BLAST of CSPI05G20150 vs. ExPASy TrEMBL
Match:
A0A1S3CSZ9 (protein CHUP1, chloroplastic-like OS=Cucumis melo OX=3656 GN=LOC103504610 PE=4 SV=1)
HSP 1 Score: 1391.3 bits (3600), Expect = 0.0e+00
Identity = 771/849 (90.81%), Positives = 797/849 (93.88%), Query Frame = 0
Query: 1 MMNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGL-- 60
MMNRISVVVAVSIA YAIKQLTIRSWTSFFLP TNCSENGED KKN GL
Sbjct: 3 MMNRISVVVAVSIAAYAIKQLTIRSWTSFFLP-TNCSENGEDAKKN----------GLDE 62
Query: 61 EEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDDNRKEEKVP 120
EEEEEEEA+SI+DATSQVNGRTSDLEDGDHSSDE QV LLPQRNSENWLL +KEEKVP
Sbjct: 63 EEEEEEEASSINDATSQVNGRTSDLEDGDHSSDELQV-LLPQRNSENWLLVHYKKEEKVP 122
Query: 121 EFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDIS 180
EFL E++KIE ERLLKL+MELEERKVKLEGEL+MCDGIKYSETDVMELRKQLDAKN+DIS
Sbjct: 123 EFLTESNKIESERLLKLVMELEERKVKLEGELLMCDGIKYSETDVMELRKQLDAKNNDIS 182
Query: 181 MLNNTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQ 240
MLNNTISSLQAERKILKEEILKGALMKKELEE R KIKELQR+IQLDANQTKERLLLLKQ
Sbjct: 183 MLNNTISSLQAERKILKEEILKGALMKKELEEARDKIKELQRQIQLDANQTKERLLLLKQ 242
Query: 241 RVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKAR 300
RVSTLQAKEEEAVKKEAEL+KKQKAAKDFEVE GELKWKNRELQHE QELTSKLEVMKAR
Sbjct: 243 RVSTLQAKEEEAVKKEAELFKKQKAAKDFEVELGELKWKNRELQHEKQELTSKLEVMKAR 302
Query: 301 IKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRY 360
IKTLTKMTE+EIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRY
Sbjct: 303 IKTLTKMTESEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRY 362
Query: 361 ELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEID 420
ELRNNQIPAGESARYLNKSSSPKS+EKAKQLMLEYAG E G+ ETDHESNFSHPFS ID
Sbjct: 363 ELRNNQIPAGESARYLNKSSSPKSREKAKQLMLEYAGMEFGQEETDHESNFSHPFSFGID 422
Query: 421 NLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVM 480
NLENTSIDSSRSRTSSF EKPNSNLSLKKLIRNQGG SAVSPP SSHRWKDPLEAVM
Sbjct: 423 NLENTSIDSSRSRTSSFSEKPNSNLSLKKLIRNQGGLSAVSPPGISGSSHRWKDPLEAVM 482
Query: 481 ALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQ 540
ALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVE+SLQQKYSTYKEH+KLAIGSEKQ
Sbjct: 483 ALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEESLQQKYSTYKEHYKLAIGSEKQ 542
Query: 541 IKEKVENERAKSSGDSSSSNLEYEDISMR-KNATLVLKLAQMKMNRISCEPDSQYDNNST 600
IKEK E+E+AKSSGDSSS NLEY DISMR K+ATL LKLAQMK N+ISCEPDSQ DN+ST
Sbjct: 543 IKEKAESEKAKSSGDSSSLNLEYHDISMRKKSATLPLKLAQMK-NKISCEPDSQNDNDST 602
Query: 601 NFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMED 660
N IS+PTSSGGEVHRGSELVQFN+KMMKPEVK HMETQ DHLV+ALAMEVREA FSNMED
Sbjct: 603 NLISNPTSSGGEVHRGSELVQFNQKMMKPEVKAHMETQGDHLVVALAMEVREACFSNMED 662
Query: 661 IVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPK 720
IVSFVI LDEKLSSLVDGMEILEHFDWP RKTDALREAAFGYQKLMKLREEVSSFVDNPK
Sbjct: 663 IVSFVIRLDEKLSSLVDGMEILEHFDWPMRKTDALREAAFGYQKLMKLREEVSSFVDNPK 722
Query: 721 LTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVE 780
LTCEVALNKMNSLLDKVEQSV ALLQTRDT ISRYEELGIPIDWLLDCGVVGKIKVLCVE
Sbjct: 723 LTCEVALNKMNSLLDKVEQSVNALLQTRDTMISRYEELGIPIDWLLDCGVVGKIKVLCVE 782
Query: 781 LARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSR 840
LARKYMKRIVKEHNALSGP+KEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELR+R
Sbjct: 783 LARKYMKRIVKEHNALSGPDKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRNR 838
Query: 841 VHTETGQQN 847
VHTETGQ+N
Sbjct: 843 VHTETGQKN 838
BLAST of CSPI05G20150 vs. ExPASy TrEMBL
Match:
A0A5D3BMR7 (Protein CHUP1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1728G00210 PE=4 SV=1)
HSP 1 Score: 1277.3 bits (3304), Expect = 0.0e+00
Identity = 704/767 (91.79%), Positives = 728/767 (94.92%), Query Frame = 0
Query: 55 IRGL--EEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDDNR 114
++GL EEEEEEEA+SI+DATSQVNGRTSDLEDGDHSSDE QV LLPQRNSENWLL +
Sbjct: 20 VQGLDEEEEEEEEASSINDATSQVNGRTSDLEDGDHSSDELQV-LLPQRNSENWLLVHYK 79
Query: 115 KEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDA 174
KEEKVPEFL EN+KIE ERLLKL+MELEERKVKLEGEL+MCDGIKYSETDVMELRKQLDA
Sbjct: 80 KEEKVPEFLTENNKIESERLLKLVMELEERKVKLEGELLMCDGIKYSETDVMELRKQLDA 139
Query: 175 KNDDISMLNNTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTKER 234
KN+DISMLNNTISSLQAERKILKEEILKGALMKKELEE RGKIKELQR+IQLDANQTKER
Sbjct: 140 KNNDISMLNNTISSLQAERKILKEEILKGALMKKELEEARGKIKELQRQIQLDANQTKER 199
Query: 235 LLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKL 294
LLLLKQRVSTLQAKEEEAVKKEAEL+KKQKAAKDFEVE GELKWKNRELQHE QELTSKL
Sbjct: 200 LLLLKQRVSTLQAKEEEAVKKEAELFKKQKAAKDFEVELGELKWKNRELQHEKQELTSKL 259
Query: 295 EVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWI 354
EVMKARIKTLTKMTE+EIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWI
Sbjct: 260 EVMKARIKTLTKMTESEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWI 319
Query: 355 NACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNFSHP 414
NACLRYELRNNQIPAGESARYLNKSSSPKS+EKAKQLMLEYAG E G+ ETDHESNFSHP
Sbjct: 320 NACLRYELRNNQIPAGESARYLNKSSSPKSREKAKQLMLEYAGMEFGQEETDHESNFSHP 379
Query: 415 FSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKD 474
FS IDNLENTSIDSSRSRTSSF EKPNSNLSLKKLIRNQGG SAVSPP SSHRWKD
Sbjct: 380 FSFGIDNLENTSIDSSRSRTSSFSEKPNSNLSLKKLIRNQGGLSAVSPPGISGSSHRWKD 439
Query: 475 PLEAVMALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLA 534
PLEAVMALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVE+SLQQKYSTYKEHHKLA
Sbjct: 440 PLEAVMALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEESLQQKYSTYKEHHKLA 499
Query: 535 IGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMR-KNATLVLKLAQMKMNRISCEPDSQ 594
IGSEKQIKEK E+E+AKSSGDSSS NLEY DISMR K+ATL LKLAQMK N+ISCEPDSQ
Sbjct: 500 IGSEKQIKEKAESEKAKSSGDSSSLNLEYHDISMRKKSATLPLKLAQMK-NKISCEPDSQ 559
Query: 595 YDNNSTNFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREAS 654
DN+STN IS+PTSSGGEVHRGSELVQFN+KMMKPEVK HMETQ DHLV+ALAMEVREA
Sbjct: 560 NDNDSTNLISNPTSSGGEVHRGSELVQFNQKMMKPEVKAHMETQGDHLVVALAMEVREAC 619
Query: 655 FSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSS 714
FSNMEDIVSFVI LDEKLSSLVDGMEILEHFDWP RKTDALREAAFGYQKLMKLREEVSS
Sbjct: 620 FSNMEDIVSFVIRLDEKLSSLVDGMEILEHFDWPMRKTDALREAAFGYQKLMKLREEVSS 679
Query: 715 FVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKI 774
FVDNPKLTCEVALNKMNSLLDKVEQSV ALLQTRDT ISRYEELGIPIDWLLDCGVVGKI
Sbjct: 680 FVDNPKLTCEVALNKMNSLLDKVEQSVNALLQTRDTMISRYEELGIPIDWLLDCGVVGKI 739
Query: 775 KVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHK 819
KVLCVELARKYMKRIVKEHN LSGP+KEPNREFLLFQGVRFASRVHK
Sbjct: 740 KVLCVELARKYMKRIVKEHNGLSGPDKEPNREFLLFQGVRFASRVHK 784
BLAST of CSPI05G20150 vs. ExPASy TrEMBL
Match:
A0A6J1DWY5 (protein CHUP1, chloroplastic-like OS=Momordica charantia OX=3673 GN=LOC111024258 PE=4 SV=1)
HSP 1 Score: 857.1 bits (2213), Expect = 6.4e-245
Identity = 531/851 (62.40%), Positives = 621/851 (72.97%), Query Frame = 0
Query: 1 MMNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEE 60
+M ++ V+VAVSIA YAIKQLTIRSW+S LP TNCSENGE +KN GL +
Sbjct: 2 IMTKLGVLVAVSIAAYAIKQLTIRSWSSSALP-TNCSENGEGTEKN----------GL-D 61
Query: 61 EEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDDNRKEE-KVPE 120
EE++ NSI+ A SQV+G +SD E + LLP R+SE+ LLD N+KEE KVPE
Sbjct: 62 VEEQKGNSINGAASQVSGSSSDPELRE---------LLP-RDSESRLLDYNKKEEGKVPE 121
Query: 121 FLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISM 180
+EN+KIEL+RLLKL+MELEERKVKLE EL+M D +K ++D EL K+L+AK++D+SM
Sbjct: 122 SHMENNKIELQRLLKLVMELEERKVKLEDELLMYDRLKDGKSDGTELXKELEAKDEDMSM 181
Query: 181 LNNTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQR 240
LN TISSLQAERK L+EEI+KGA MKKELEE +GKIKELQR++QLDANQTKE L LK+R
Sbjct: 182 LNITISSLQAERKKLQEEIVKGAFMKKELEEAKGKIKELQRQLQLDANQTKEHLSSLKRR 241
Query: 241 VSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARI 300
VSTLQAKEEEAVKKEA+LY+K KAAK FE+E GELK KNR+LQ E +ELTSKLEVM+ARI
Sbjct: 242 VSTLQAKEEEAVKKEAQLYRKLKAAKGFELELGELKQKNRQLQREKEELTSKLEVMEARI 301
Query: 301 KTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYE 360
TLT +TE+EIIT+EREE +KL+ NE+L KQLEGLQMNRFSEVEELVYLRW+NACLRYE
Sbjct: 302 TTLTTLTESEIITEEREEXRKLRRANEELTKQLEGLQMNRFSEVEELVYLRWVNACLRYE 361
Query: 361 LRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNFSHP-FSSEID 420
LR+N+ GESA L+KS SPKSKEKAKQLMLEYAG G+ ETDHESNFSHP FSS I+
Sbjct: 362 LRDNETLGGESALDLSKSLSPKSKEKAKQLMLEYAGLGFGQLETDHESNFSHPTFSSGIE 421
Query: 421 NLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVM 480
+ +NTS SSRSRTSSF RWKDPLEA +
Sbjct: 422 DFDNTSSGSSRSRTSSF---------------------------------RWKDPLEAAV 481
Query: 481 ALSAETLTL-SEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAI--GS 540
A S ETLT SEV+ QVSSR SVNSVATSFQ MS+S E+S++QKYS YKEHHKL I G
Sbjct: 482 AHSTETLTTPSEVKFQVSSRNSVNSVATSFQPMSQSAEESVKQKYSAYKEHHKLNIGRGR 541
Query: 541 EKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCEPDSQYDNN 600
EKQIKEK E ER K N EP
Sbjct: 542 EKQIKEKAEKERVK--------------------------------NSCYWEP------- 601
Query: 601 STNFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNM 660
E V+F++K+MK EVK MET+ D LVM L M+V+ SF+NM
Sbjct: 602 -------------------EFVRFDQKLMKAEVKADMETEGD-LVMPLTMDVKAVSFTNM 661
Query: 661 EDIVSFVIWLDEKLSSLVD-GMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVD 720
ED+VSFVIWLD+K SSLVD + ILEHFDWP+ K+DALREAA YQ LMKL EEVSSFVD
Sbjct: 662 EDVVSFVIWLDQKTSSLVDERVMILEHFDWPEGKSDALREAALEYQNLMKLGEEVSSFVD 721
Query: 721 NPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVL 780
+PKLT EVAL M+SLL K+EQSV+A+L+ R+ IS+YEELGIP+DWLLD GVVGK+KVL
Sbjct: 722 SPKLTREVALKTMHSLLHKMEQSVHAVLRNREMAISQYEELGIPVDWLLDSGVVGKMKVL 738
Query: 781 CVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEEL 840
VELARKYMKRI+ E NALSGP KEPNREFLL QGVRFASRVH+FAGGFD +SMKAFEEL
Sbjct: 782 SVELARKYMKRILNEVNALSGPHKEPNREFLLLQGVRFASRVHQFAGGFDVESMKAFEEL 738
Query: 841 RSRVHTETGQQ 846
R+R+HTE GQ+
Sbjct: 842 RNRIHTEAGQK 738
BLAST of CSPI05G20150 vs. ExPASy TrEMBL
Match:
A0A061ECQ9 (Hydroxyproline-rich glycoprotein family protein isoform 4 OS=Theobroma cacao OX=3641 GN=TCM_011880 PE=4 SV=1)
HSP 1 Score: 709.9 bits (1831), Expect = 1.3e-200
Identity = 472/938 (50.32%), Positives = 595/938 (63.43%), Query Frame = 0
Query: 1 MMNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGE----------DVKKNVKQV 60
M+ R+ VVA SIA +A+KQL +++ S SENGE D KK
Sbjct: 1 MIVRVGFVVAASIAAFAVKQLNVKNSKS-STSLAKSSENGEASFEEHPNEGDNKKQFAYS 60
Query: 61 HQKIIR--GLEEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLL 120
+ + + G +EEEEE+ IS ++VNG D+ D D EF+ LL E L
Sbjct: 61 NDSLKKKDGEKEEEEEDVKLISSIFNRVNGSQPDIGDED-ILPEFEDLL--SGEIEYPLS 120
Query: 121 DD---NRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVME 180
D + EK+ E + N+ ELERL L+ ELEER+VKLEGEL+ G+K E+D+ E
Sbjct: 121 ADKFARAEREKIYETEMANNASELERLRNLVKELEEREVKLEGELLEYYGLKEQESDIFE 180
Query: 181 LRKQLDAKNDDISMLNNTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQRKIQLD 240
L++QL K +I MLN TISSLQ+ERK L+E+I GA +KKELE R KIKELQR+IQLD
Sbjct: 181 LKRQLKIKTVEIDMLNITISSLQSERKKLQEDIAHGASVKKELEVARNKIKELQRQIQLD 240
Query: 241 ANQTKERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHEN 300
ANQTK +LL LKQ+VS LQAKE+EA+K +AE+ KK KA K+ E+E EL+ KN+ELQHE
Sbjct: 241 ANQTKAQLLFLKQQVSGLQAKEQEAIKNDAEVEKKLKAVKELEMEVMELRRKNKELQHEK 300
Query: 301 QELTSKLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEE 360
+ELT KL+ +A+I L+ MTETEI + REE L+ NEDL+KQ+EGLQMNRFSEVEE
Sbjct: 301 RELTVKLDAAEAKIAALSNMTETEIDVRAREEVSNLRHANEDLLKQVEGLQMNRFSEVEE 360
Query: 361 LVYLRWINACLRYELRNNQIPAGE-SARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETD 420
LVYLRW+NACLRYELRN Q P G+ SAR LNKS SPKS+E AKQL+LEYAG E G+ +TD
Sbjct: 361 LVYLRWVNACLRYELRNYQTPEGKISARDLNKSLSPKSQETAKQLLLEYAGSERGQGDTD 420
Query: 421 HESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVS----- 480
ESNFSHP S+ ++L+N SI SS SR SS +KP+ LKK R++ SSAVS
Sbjct: 421 IESNFSHPSSTGSEDLDNASIYSSNSRYSSLSKKPSLIQKLKKWGRSKDDSSAVSSPARS 480
Query: 481 ----PPSTIDSSHRWKDPLEAVMALSA--------------------ETLTLSEVRLQVS 540
PS I S + PLEA+M +A ET T+ +R QVS
Sbjct: 481 LSGGSPSRISMSQHSRGPLEALMLRNAGDGVAITTFGKNEQEFTDSPETPTIPNIRTQVS 540
Query: 541 SRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAKSSGDSS 600
S S NSVATSF LMS+SV+ SL++KY YK+ HKLA+ EKQIK+K + RA+ GD S
Sbjct: 541 SGDSPNSVATSFHLMSRSVDGSLEEKYPAYKDRHKLALEREKQIKQKAQQARAERFGDKS 600
Query: 601 SSNLEYEDISMRKNATLVLKLAQMKMNRISCEPDSQYDNNSTNFISSPTSS--------- 660
+ + + E K L KLAQ+K R DS +N + S T S
Sbjct: 601 NFSSKAE---REKPVILPPKLAQIK-ERTVFPGDSSGQSNDDKAVDSQTISKMKLAHIEK 660
Query: 661 ------------GGEVHRGSELVQFNRKMMKP--------------------------EV 720
G G + P V
Sbjct: 661 RPPRVPRPPPKPAGGTSAGVNTTTTGQPPAPPPLPCALPPLPPPPPPGGPPPPPPPPGSV 720
Query: 721 KDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRK 780
K +ETQ D V +LA E+R ASF+++ED+V+FV WLDE+LS LVD +L+HFDWP+ K
Sbjct: 721 KADVETQGD-FVQSLATEIRAASFTSIEDLVAFVNWLDEELSFLVDERAVLKHFDWPEGK 780
Query: 781 TDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTT 840
DALREAAF YQ L+KL +++SSFVD+P L CE AL KM LL+KVEQSVYALL+TRD
Sbjct: 781 ADALREAAFEYQDLVKLEKQISSFVDDPSLPCEAALKKMYKLLEKVEQSVYALLRTRDMA 840
Query: 841 ISRYEELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQ 847
ISRY+E GIP++WLLD GVVGKIK+ V+LARKYMKR+ E + L+GPEKEPNREF+L Q
Sbjct: 841 ISRYKEFGIPVNWLLDSGVVGKIKLSSVQLARKYMKRVASELDLLTGPEKEPNREFILLQ 900
BLAST of CSPI05G20150 vs. NCBI nr
Match:
XP_011655490.1 (protein CHUP1, chloroplastic isoform X1 [Cucumis sativus] >XP_031741049.1 protein CHUP1, chloroplastic isoform X1 [Cucumis sativus])
HSP 1 Score: 1565.4 bits (4052), Expect = 0.0e+00
Identity = 841/845 (99.53%), Positives = 843/845 (99.76%), Query Frame = 0
Query: 2 MNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEE 61
MNRISVVVAVSIA YAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEE
Sbjct: 1 MNRISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEE 60
Query: 62 EEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDDNRKEEKVPEFL 121
EEEEANSISD TSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDDNRKEEKVPEFL
Sbjct: 61 EEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDDNRKEEKVPEFL 120
Query: 122 IENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLN 181
IENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLN
Sbjct: 121 IENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLN 180
Query: 182 NTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVS 241
NTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQR+IQLDANQTKERLLLLKQRVS
Sbjct: 181 NTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVS 240
Query: 242 TLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT 301
TLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT
Sbjct: 241 TLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT 300
Query: 302 LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELR 361
LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELR
Sbjct: 301 LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELR 360
Query: 362 NNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLE 421
NNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLE
Sbjct: 361 NNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLE 420
Query: 422 NTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALS 481
NTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALS
Sbjct: 421 NTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALS 480
Query: 482 AETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKE 541
AETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKE
Sbjct: 481 AETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKE 540
Query: 542 KVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCEPDSQYDNNSTNFIS 601
KVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMN+ISCEPDSQYDNNSTNFIS
Sbjct: 541 KVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCEPDSQYDNNSTNFIS 600
Query: 602 SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSF 661
SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSF
Sbjct: 601 SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSF 660
Query: 662 VIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCE 721
VIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCE
Sbjct: 661 VIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCE 720
Query: 722 VALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARK 781
VALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARK
Sbjct: 721 VALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARK 780
Query: 782 YMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTE 841
YMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTE
Sbjct: 781 YMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTE 840
Query: 842 TGQQN 847
TGQQN
Sbjct: 841 TGQQN 845
BLAST of CSPI05G20150 vs. NCBI nr
Match:
XP_031741050.1 (protein CHUP1, chloroplastic isoform X2 [Cucumis sativus] >KAE8648621.1 hypothetical protein Csa_008546 [Cucumis sativus])
HSP 1 Score: 1510.4 bits (3909), Expect = 0.0e+00
Identity = 814/819 (99.39%), Positives = 816/819 (99.63%), Query Frame = 0
Query: 2 MNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEE 61
MNRISVVVAVSIA YAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEE
Sbjct: 1 MNRISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEE 60
Query: 62 EEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDDNRKEEKVPEFL 121
EEEEANSISD TSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDDNRKEEKVPEFL
Sbjct: 61 EEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDDNRKEEKVPEFL 120
Query: 122 IENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLN 181
IENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLN
Sbjct: 121 IENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLN 180
Query: 182 NTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVS 241
NTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQR+IQLDANQTKERLLLLKQRVS
Sbjct: 181 NTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVS 240
Query: 242 TLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT 301
TLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT
Sbjct: 241 TLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT 300
Query: 302 LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELR 361
LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELR
Sbjct: 301 LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELR 360
Query: 362 NNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLE 421
NNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLE
Sbjct: 361 NNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLE 420
Query: 422 NTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALS 481
NTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALS
Sbjct: 421 NTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALS 480
Query: 482 AETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKE 541
AETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKE
Sbjct: 481 AETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKE 540
Query: 542 KVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCEPDSQYDNNSTNFIS 601
KVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMN+ISCEPDSQYDNNSTNFIS
Sbjct: 541 KVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCEPDSQYDNNSTNFIS 600
Query: 602 SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSF 661
SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSF
Sbjct: 601 SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSF 660
Query: 662 VIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCE 721
VIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCE
Sbjct: 661 VIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCE 720
Query: 722 VALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARK 781
VALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARK
Sbjct: 721 VALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARK 780
Query: 782 YMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFA 821
YMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHK A
Sbjct: 781 YMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKEA 819
BLAST of CSPI05G20150 vs. NCBI nr
Match:
XP_031741051.1 (protein CHUP1, chloroplastic isoform X3 [Cucumis sativus])
HSP 1 Score: 1502.3 bits (3888), Expect = 0.0e+00
Identity = 814/845 (96.33%), Positives = 816/845 (96.57%), Query Frame = 0
Query: 2 MNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEE 61
MNRISVVVAVSIA YAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEE
Sbjct: 1 MNRISVVVAVSIAAYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGLEEE 60
Query: 62 EEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDDNRKEEKVPEFL 121
EEEEANSISD TSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDDNRKEEKVPEFL
Sbjct: 61 EEEEANSISDTTSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDDNRKEEKVPEFL 120
Query: 122 IENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLN 181
IENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLN
Sbjct: 121 IENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDISMLN 180
Query: 182 NTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVS 241
NTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQR+IQLDANQTKERLLLLKQRVS
Sbjct: 181 NTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQRQIQLDANQTKERLLLLKQRVS 240
Query: 242 TLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT 301
TLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT
Sbjct: 241 TLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKT 300
Query: 302 LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELR 361
LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELR
Sbjct: 301 LTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELR 360
Query: 362 NNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLE 421
NNQIP AGKEIGEAETDHESNFSHPFSSEIDNLE
Sbjct: 361 NNQIP---------------------------AGKEIGEAETDHESNFSHPFSSEIDNLE 420
Query: 422 NTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALS 481
NTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALS
Sbjct: 421 NTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVMALS 480
Query: 482 AETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKE 541
AETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKE
Sbjct: 481 AETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKE 540
Query: 542 KVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCEPDSQYDNNSTNFIS 601
KVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMN+ISCEPDSQYDNNSTNFIS
Sbjct: 541 KVENERAKSSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNKISCEPDSQYDNNSTNFIS 600
Query: 602 SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSF 661
SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSF
Sbjct: 601 SPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMEDIVSF 660
Query: 662 VIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCE 721
VIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCE
Sbjct: 661 VIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPKLTCE 720
Query: 722 VALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARK 781
VALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARK
Sbjct: 721 VALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVELARK 780
Query: 782 YMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTE 841
YMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTE
Sbjct: 781 YMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSRVHTE 818
Query: 842 TGQQN 847
TGQQN
Sbjct: 841 TGQQN 818
BLAST of CSPI05G20150 vs. NCBI nr
Match:
XP_008467205.1 (PREDICTED: protein CHUP1, chloroplastic-like [Cucumis melo] >XP_008467206.1 PREDICTED: protein CHUP1, chloroplastic-like [Cucumis melo])
HSP 1 Score: 1391.3 bits (3600), Expect = 0.0e+00
Identity = 771/849 (90.81%), Positives = 797/849 (93.88%), Query Frame = 0
Query: 1 MMNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGEDVKKNVKQVHQKIIRGL-- 60
MMNRISVVVAVSIA YAIKQLTIRSWTSFFLP TNCSENGED KKN GL
Sbjct: 3 MMNRISVVVAVSIAAYAIKQLTIRSWTSFFLP-TNCSENGEDAKKN----------GLDE 62
Query: 61 EEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDDNRKEEKVP 120
EEEEEEEA+SI+DATSQVNGRTSDLEDGDHSSDE QV LLPQRNSENWLL +KEEKVP
Sbjct: 63 EEEEEEEASSINDATSQVNGRTSDLEDGDHSSDELQV-LLPQRNSENWLLVHYKKEEKVP 122
Query: 121 EFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDAKNDDIS 180
EFL E++KIE ERLLKL+MELEERKVKLEGEL+MCDGIKYSETDVMELRKQLDAKN+DIS
Sbjct: 123 EFLTESNKIESERLLKLVMELEERKVKLEGELLMCDGIKYSETDVMELRKQLDAKNNDIS 182
Query: 181 MLNNTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQ 240
MLNNTISSLQAERKILKEEILKGALMKKELEE R KIKELQR+IQLDANQTKERLLLLKQ
Sbjct: 183 MLNNTISSLQAERKILKEEILKGALMKKELEEARDKIKELQRQIQLDANQTKERLLLLKQ 242
Query: 241 RVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKAR 300
RVSTLQAKEEEAVKKEAEL+KKQKAAKDFEVE GELKWKNRELQHE QELTSKLEVMKAR
Sbjct: 243 RVSTLQAKEEEAVKKEAELFKKQKAAKDFEVELGELKWKNRELQHEKQELTSKLEVMKAR 302
Query: 301 IKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRY 360
IKTLTKMTE+EIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRY
Sbjct: 303 IKTLTKMTESEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRY 362
Query: 361 ELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEID 420
ELRNNQIPAGESARYLNKSSSPKS+EKAKQLMLEYAG E G+ ETDHESNFSHPFS ID
Sbjct: 363 ELRNNQIPAGESARYLNKSSSPKSREKAKQLMLEYAGMEFGQEETDHESNFSHPFSFGID 422
Query: 421 NLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKDPLEAVM 480
NLENTSIDSSRSRTSSF EKPNSNLSLKKLIRNQGG SAVSPP SSHRWKDPLEAVM
Sbjct: 423 NLENTSIDSSRSRTSSFSEKPNSNLSLKKLIRNQGGLSAVSPPGISGSSHRWKDPLEAVM 482
Query: 481 ALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQ 540
ALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVE+SLQQKYSTYKEH+KLAIGSEKQ
Sbjct: 483 ALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEESLQQKYSTYKEHYKLAIGSEKQ 542
Query: 541 IKEKVENERAKSSGDSSSSNLEYEDISMR-KNATLVLKLAQMKMNRISCEPDSQYDNNST 600
IKEK E+E+AKSSGDSSS NLEY DISMR K+ATL LKLAQMK N+ISCEPDSQ DN+ST
Sbjct: 543 IKEKAESEKAKSSGDSSSLNLEYHDISMRKKSATLPLKLAQMK-NKISCEPDSQNDNDST 602
Query: 601 NFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREASFSNMED 660
N IS+PTSSGGEVHRGSELVQFN+KMMKPEVK HMETQ DHLV+ALAMEVREA FSNMED
Sbjct: 603 NLISNPTSSGGEVHRGSELVQFNQKMMKPEVKAHMETQGDHLVVALAMEVREACFSNMED 662
Query: 661 IVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSSFVDNPK 720
IVSFVI LDEKLSSLVDGMEILEHFDWP RKTDALREAAFGYQKLMKLREEVSSFVDNPK
Sbjct: 663 IVSFVIRLDEKLSSLVDGMEILEHFDWPMRKTDALREAAFGYQKLMKLREEVSSFVDNPK 722
Query: 721 LTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKIKVLCVE 780
LTCEVALNKMNSLLDKVEQSV ALLQTRDT ISRYEELGIPIDWLLDCGVVGKIKVLCVE
Sbjct: 723 LTCEVALNKMNSLLDKVEQSVNALLQTRDTMISRYEELGIPIDWLLDCGVVGKIKVLCVE 782
Query: 781 LARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRSR 840
LARKYMKRIVKEHNALSGP+KEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELR+R
Sbjct: 783 LARKYMKRIVKEHNALSGPDKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKAFEELRNR 838
Query: 841 VHTETGQQN 847
VHTETGQ+N
Sbjct: 843 VHTETGQKN 838
BLAST of CSPI05G20150 vs. NCBI nr
Match:
TYK00280.1 (protein CHUP1 [Cucumis melo var. makuwa])
HSP 1 Score: 1277.3 bits (3304), Expect = 0.0e+00
Identity = 704/767 (91.79%), Positives = 728/767 (94.92%), Query Frame = 0
Query: 55 IRGL--EEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLLLPQRNSENWLLDDNR 114
++GL EEEEEEEA+SI+DATSQVNGRTSDLEDGDHSSDE QV LLPQRNSENWLL +
Sbjct: 20 VQGLDEEEEEEEEASSINDATSQVNGRTSDLEDGDHSSDELQV-LLPQRNSENWLLVHYK 79
Query: 115 KEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQLDA 174
KEEKVPEFL EN+KIE ERLLKL+MELEERKVKLEGEL+MCDGIKYSETDVMELRKQLDA
Sbjct: 80 KEEKVPEFLTENNKIESERLLKLVMELEERKVKLEGELLMCDGIKYSETDVMELRKQLDA 139
Query: 175 KNDDISMLNNTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTKER 234
KN+DISMLNNTISSLQAERKILKEEILKGALMKKELEE RGKIKELQR+IQLDANQTKER
Sbjct: 140 KNNDISMLNNTISSLQAERKILKEEILKGALMKKELEEARGKIKELQRQIQLDANQTKER 199
Query: 235 LLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKL 294
LLLLKQRVSTLQAKEEEAVKKEAEL+KKQKAAKDFEVE GELKWKNRELQHE QELTSKL
Sbjct: 200 LLLLKQRVSTLQAKEEEAVKKEAELFKKQKAAKDFEVELGELKWKNRELQHEKQELTSKL 259
Query: 295 EVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWI 354
EVMKARIKTLTKMTE+EIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWI
Sbjct: 260 EVMKARIKTLTKMTESEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWI 319
Query: 355 NACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNFSHP 414
NACLRYELRNNQIPAGESARYLNKSSSPKS+EKAKQLMLEYAG E G+ ETDHESNFSHP
Sbjct: 320 NACLRYELRNNQIPAGESARYLNKSSSPKSREKAKQLMLEYAGMEFGQEETDHESNFSHP 379
Query: 415 FSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPSTIDSSHRWKD 474
FS IDNLENTSIDSSRSRTSSF EKPNSNLSLKKLIRNQGG SAVSPP SSHRWKD
Sbjct: 380 FSFGIDNLENTSIDSSRSRTSSFSEKPNSNLSLKKLIRNQGGLSAVSPPGISGSSHRWKD 439
Query: 475 PLEAVMALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLA 534
PLEAVMALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVE+SLQQKYSTYKEHHKLA
Sbjct: 440 PLEAVMALSAETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEESLQQKYSTYKEHHKLA 499
Query: 535 IGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMR-KNATLVLKLAQMKMNRISCEPDSQ 594
IGSEKQIKEK E+E+AKSSGDSSS NLEY DISMR K+ATL LKLAQMK N+ISCEPDSQ
Sbjct: 500 IGSEKQIKEKAESEKAKSSGDSSSLNLEYHDISMRKKSATLPLKLAQMK-NKISCEPDSQ 559
Query: 595 YDNNSTNFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHMETQRDHLVMALAMEVREAS 654
DN+STN IS+PTSSGGEVHRGSELVQFN+KMMKPEVK HMETQ DHLV+ALAMEVREA
Sbjct: 560 NDNDSTNLISNPTSSGGEVHRGSELVQFNQKMMKPEVKAHMETQGDHLVVALAMEVREAC 619
Query: 655 FSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVSS 714
FSNMEDIVSFVI LDEKLSSLVDGMEILEHFDWP RKTDALREAAFGYQKLMKLREEVSS
Sbjct: 620 FSNMEDIVSFVIRLDEKLSSLVDGMEILEHFDWPMRKTDALREAAFGYQKLMKLREEVSS 679
Query: 715 FVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGKI 774
FVDNPKLTCEVALNKMNSLLDKVEQSV ALLQTRDT ISRYEELGIPIDWLLDCGVVGKI
Sbjct: 680 FVDNPKLTCEVALNKMNSLLDKVEQSVNALLQTRDTMISRYEELGIPIDWLLDCGVVGKI 739
Query: 775 KVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHK 819
KVLCVELARKYMKRIVKEHN LSGP+KEPNREFLLFQGVRFASRVHK
Sbjct: 740 KVLCVELARKYMKRIVKEHNGLSGPDKEPNREFLLFQGVRFASRVHK 784
BLAST of CSPI05G20150 vs. TAIR 10
Match:
AT3G25690.1 (Hydroxyproline-rich glycoprotein family protein )
HSP 1 Score: 658.7 bits (1698), Expect = 6.4e-189
Identity = 456/999 (45.65%), Positives = 599/999 (59.96%), Query Frame = 0
Query: 1 MMNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGE--DVKKNVK---QVHQKII 60
M RI VVA SIA +K+L ++ + S+NGE D +++V ++ K +
Sbjct: 1 MFVRIGFVVAASIAAVTVKRLNVKP-----SKPSKPSDNGEGGDKEQSVDPDYNLNDKNL 60
Query: 61 RGLEEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLL-----LPQRNSENWLLDD 120
+ EEEEEEE I+ +Q G SD D D EF+ LL P + +N L +
Sbjct: 61 QEEEEEEEEEVKLINSVINQTRGSFSDYLD-DDILPEFEDLLSGEIEYPLPDDDNNL--E 120
Query: 121 NRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQL 180
++E+ E + + ELERL +L+ ELEER+VKLEGEL+ G+K E+D++EL++QL
Sbjct: 121 KAEKERKYEVEMAYNDGELERLKQLVKELEEREVKLEGELLEYYGLKEQESDIVELQRQL 180
Query: 181 DAKNDDISMLNNTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTK 240
K +I MLN TI+SLQAERK L+EE+ + +++KELE R KIKELQR+IQLDANQTK
Sbjct: 181 KIKTVEIDMLNITINSLQAERKKLQEELSQNGIVRKELEVARNKIKELQRQIQLDANQTK 240
Query: 241 ERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTS 300
+LLLLKQ VS+LQ KEEEA+ K+ E+ +K KA +D EV+ ELK KNRELQHE +EL+
Sbjct: 241 GQLLLLKQHVSSLQMKEEEAMNKDTEVERKLKAVQDLEVQVMELKRKNRELQHEKRELSI 300
Query: 301 KLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLR 360
KL+ +ARI TL+ MTE++ + K REE LK NEDL+KQ+EGLQMNRFSEVEELVYLR
Sbjct: 301 KLDSAEARIATLSNMTESDKVAKVREEVNNLKHNNEDLLKQVEGLQMNRFSEVEELVYLR 360
Query: 361 WINACLRYELRNNQIPAGE-SARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNF 420
W+NACLRYELRN Q PAG+ SAR L+K+ SPKS+ KAK+LMLEYAG E G+ +TD ESN+
Sbjct: 361 WVNACLRYELRNYQTPAGKISARDLSKNLSPKSQAKAKRLMLEYAGSERGQGDTDLESNY 420
Query: 421 SHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPS------- 480
S P S D+ +N S+DSS SR SSF +KP LKK +++ SS S PS
Sbjct: 421 SQPSSPGSDDFDNASMDSSTSRFSSFSKKPGLIQKLKKWGKSKDDSSVQSSPSRSFYGGS 480
Query: 481 ---TIDSSHRWKDPLEAVMALSA--------------------ETLTLSEVRLQ---VSS 540
S ++ + PLE++M +A ET L +R Q S
Sbjct: 481 PGRLSSSMNKQRGPLESLMIRNAGESVAITTFGQVDQESPGTPETPNLPRIRTQQQASSP 540
Query: 541 RKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAK------- 600
+ +NSVA SF +MSKSV+ L +KY YK+ HKLA+ EK IK K + RA+
Sbjct: 541 GEGLNSVAASFHVMSKSVDNVLDEKYPAYKDRHKLAVEREKHIKHKADQARAERFGGNVA 600
Query: 601 --------------------SSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCE-- 660
++GD S+ + E + +NA V K+ + + +
Sbjct: 601 LPPKLAQLKEKRVVVPSVITATGDQSNESNESNEGKASENAATVTKMKLVDIEKRPPRVP 660
Query: 661 ---PDSQYDNNSTNFISS---------------------PTSSGG--------------- 720
P S STN S+ P GG
Sbjct: 661 RPPPRSAGGGKSTNLPSARPPLPGGGPPPPPPPPGGGPPPPPGGGPPPPPPPPGALGRGA 720
Query: 721 ----EVHRGSELVQFNRKMMKPE------------------------------------- 780
+VHR ELV+F + +MK E
Sbjct: 721 GGGNKVHRAPELVEFYQSLMKRESKKEGAPSLISSGTGNSSAARNNMIGEIENRSTFLLA 780
Query: 781 VKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKR 840
VK +ETQ D V +LA EVR +SF+++ED+++FV WLDE+LS LVD +L+HFDWP+
Sbjct: 781 VKADVETQGD-FVQSLATEVRASSFTDIEDLLAFVSWLDEELSFLVDERAVLKHFDWPEG 840
Query: 841 KTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDT 847
K DALREAAF YQ LMKL ++V+SFVD+P L+CE AL KM LL+KVEQSVYALL+TRD
Sbjct: 841 KADALREAAFEYQDLMKLEKQVTSFVDDPNLSCEPALKKMYKLLEKVEQSVYALLRTRDM 900
BLAST of CSPI05G20150 vs. TAIR 10
Match:
AT3G25690.2 (Hydroxyproline-rich glycoprotein family protein )
HSP 1 Score: 658.7 bits (1698), Expect = 6.4e-189
Identity = 456/999 (45.65%), Positives = 599/999 (59.96%), Query Frame = 0
Query: 1 MMNRISVVVAVSIAVYAIKQLTIRSWTSFFLPTTNCSENGE--DVKKNVK---QVHQKII 60
M RI VVA SIA +K+L ++ + S+NGE D +++V ++ K +
Sbjct: 1 MFVRIGFVVAASIAAVTVKRLNVKP-----SKPSKPSDNGEGGDKEQSVDPDYNLNDKNL 60
Query: 61 RGLEEEEEEEANSISDATSQVNGRTSDLEDGDHSSDEFQVLL-----LPQRNSENWLLDD 120
+ EEEEEEE I+ +Q G SD D D EF+ LL P + +N L +
Sbjct: 61 QEEEEEEEEEVKLINSVINQTRGSFSDYLD-DDILPEFEDLLSGEIEYPLPDDDNNL--E 120
Query: 121 NRKEEKVPEFLIENSKIELERLLKLLMELEERKVKLEGELIMCDGIKYSETDVMELRKQL 180
++E+ E + + ELERL +L+ ELEER+VKLEGEL+ G+K E+D++EL++QL
Sbjct: 121 KAEKERKYEVEMAYNDGELERLKQLVKELEEREVKLEGELLEYYGLKEQESDIVELQRQL 180
Query: 181 DAKNDDISMLNNTISSLQAERKILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTK 240
K +I MLN TI+SLQAERK L+EE+ + +++KELE R KIKELQR+IQLDANQTK
Sbjct: 181 KIKTVEIDMLNITINSLQAERKKLQEELSQNGIVRKELEVARNKIKELQRQIQLDANQTK 240
Query: 241 ERLLLLKQRVSTLQAKEEEAVKKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTS 300
+LLLLKQ VS+LQ KEEEA+ K+ E+ +K KA +D EV+ ELK KNRELQHE +EL+
Sbjct: 241 GQLLLLKQHVSSLQMKEEEAMNKDTEVERKLKAVQDLEVQVMELKRKNRELQHEKRELSI 300
Query: 301 KLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLR 360
KL+ +ARI TL+ MTE++ + K REE LK NEDL+KQ+EGLQMNRFSEVEELVYLR
Sbjct: 301 KLDSAEARIATLSNMTESDKVAKVREEVNNLKHNNEDLLKQVEGLQMNRFSEVEELVYLR 360
Query: 361 WINACLRYELRNNQIPAGE-SARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNF 420
W+NACLRYELRN Q PAG+ SAR L+K+ SPKS+ KAK+LMLEYAG E G+ +TD ESN+
Sbjct: 361 WVNACLRYELRNYQTPAGKISARDLSKNLSPKSQAKAKRLMLEYAGSERGQGDTDLESNY 420
Query: 421 SHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPS------- 480
S P S D+ +N S+DSS SR SSF +KP LKK +++ SS S PS
Sbjct: 421 SQPSSPGSDDFDNASMDSSTSRFSSFSKKPGLIQKLKKWGKSKDDSSVQSSPSRSFYGGS 480
Query: 481 ---TIDSSHRWKDPLEAVMALSA--------------------ETLTLSEVRLQ---VSS 540
S ++ + PLE++M +A ET L +R Q S
Sbjct: 481 PGRLSSSMNKQRGPLESLMIRNAGESVAITTFGQVDQESPGTPETPNLPRIRTQQQASSP 540
Query: 541 RKSVNSVATSFQLMSKSVEQSLQQKYSTYKEHHKLAIGSEKQIKEKVENERAK------- 600
+ +NSVA SF +MSKSV+ L +KY YK+ HKLA+ EK IK K + RA+
Sbjct: 541 GEGLNSVAASFHVMSKSVDNVLDEKYPAYKDRHKLAVEREKHIKHKADQARAERFGGNVA 600
Query: 601 --------------------SSGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCE-- 660
++GD S+ + E + +NA V K+ + + +
Sbjct: 601 LPPKLAQLKEKRVVVPSVITATGDQSNESNESNEGKASENAATVTKMKLVDIEKRPPRVP 660
Query: 661 ---PDSQYDNNSTNFISS---------------------PTSSGG--------------- 720
P S STN S+ P GG
Sbjct: 661 RPPPRSAGGGKSTNLPSARPPLPGGGPPPPPPPPGGGPPPPPGGGPPPPPPPPGALGRGA 720
Query: 721 ----EVHRGSELVQFNRKMMKPE------------------------------------- 780
+VHR ELV+F + +MK E
Sbjct: 721 GGGNKVHRAPELVEFYQSLMKRESKKEGAPSLISSGTGNSSAARNNMIGEIENRSTFLLA 780
Query: 781 VKDHMETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKR 840
VK +ETQ D V +LA EVR +SF+++ED+++FV WLDE+LS LVD +L+HFDWP+
Sbjct: 781 VKADVETQGD-FVQSLATEVRASSFTDIEDLLAFVSWLDEELSFLVDERAVLKHFDWPEG 840
Query: 841 KTDALREAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDT 847
K DALREAAF YQ LMKL ++V+SFVD+P L+CE AL KM LL+KVEQSVYALL+TRD
Sbjct: 841 KADALREAAFEYQDLMKLEKQVTSFVDDPNLSCEPALKKMYKLLEKVEQSVYALLRTRDM 900
BLAST of CSPI05G20150 vs. TAIR 10
Match:
AT3G25690.3 (Hydroxyproline-rich glycoprotein family protein )
HSP 1 Score: 572.0 bits (1473), Expect = 7.9e-163
Identity = 376/798 (47.12%), Positives = 483/798 (60.53%), Query Frame = 0
Query: 192 KILKEEILKGALMKKELEEGRGKIKELQRKIQLDANQTKERLLLLKQRVSTLQAKEEEAV 251
K L+EE+ + +++KELE R KIKELQR+IQLDANQTK +LLLLKQ VS+LQ KEEEA+
Sbjct: 53 KNLQEELSQNGIVRKELEVARNKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKEEEAM 112
Query: 252 KKEAELYKKQKAAKDFEVEFGELKWKNRELQHENQELTSKLEVMKARIKTLTKMTETEII 311
K+ E+ +K KA +D EV+ ELK KNRELQHE +EL+ KL+ +ARI TL+ MTE++ +
Sbjct: 113 NKDTEVERKLKAVQDLEVQVMELKRKNRELQHEKRELSIKLDSAEARIATLSNMTESDKV 172
Query: 312 TKEREEAQKLKSENEDLIKQLEGLQMNRFSEVEELVYLRWINACLRYELRNNQIPAGE-S 371
K REE LK NEDL+KQ+EGLQMNRFSEVEELVYLRW+NACLRYELRN Q PAG+ S
Sbjct: 173 AKVREEVNNLKHNNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPAGKIS 232
Query: 372 ARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAETDHESNFSHPFSSEIDNLENTSIDSSRS 431
AR L+K+ SPKS+ KAK+LMLEYAG E G+ +TD ESN+S P S D+ +N S+DSS S
Sbjct: 233 ARDLSKNLSPKSQAKAKRLMLEYAGSERGQGDTDLESNYSQPSSPGSDDFDNASMDSSTS 292
Query: 432 RTSSFREKPNSNLSLKKLIRNQGGSSAVSPPS----------TIDSSHRWKDPLEAVMAL 491
R SSF +KP LKK +++ SS S PS S ++ + PLE++M
Sbjct: 293 RFSSFSKKPGLIQKLKKWGKSKDDSSVQSSPSRSFYGGSPGRLSSSMNKQRGPLESLMIR 352
Query: 492 SA--------------------ETLTLSEVRLQ---VSSRKSVNSVATSFQLMSKSVEQS 551
+A ET L +R Q S + +NSVA SF +MSKSV+
Sbjct: 353 NAGESVAITTFGQVDQESPGTPETPNLPRIRTQQQASSPGEGLNSVAASFHVMSKSVDNV 412
Query: 552 LQQKYSTYKEHHKLAIGSEKQIKEKVENERAK---------------------------S 611
L +KY YK+ HKLA+ EK IK K + RA+ +
Sbjct: 413 LDEKYPAYKDRHKLAVEREKHIKHKADQARAERFGGNVALPPKLAQLKEKRVVVPSVITA 472
Query: 612 SGDSSSSNLEYEDISMRKNATLVLKLAQMKMNRISCE-----PDSQYDNNSTNFISS--- 671
+GD S+ + E + +NA V K+ + + + P S STN S+
Sbjct: 473 TGDQSNESNESNEGKASENAATVTKMKLVDIEKRPPRVPRPPPRSAGGGKSTNLPSARPP 532
Query: 672 ------------------PTSSGG-------------------EVHRGSELVQFNRKMMK 731
P GG +VHR ELV+F + +MK
Sbjct: 533 LPGGGPPPPPPPPGGGPPPPPGGGPPPPPPPPGALGRGAGGGNKVHRAPELVEFYQSLMK 592
Query: 732 PE-------------------------------------VKDHMETQRDHLVMALAMEVR 791
E VK +ETQ D V +LA EVR
Sbjct: 593 RESKKEGAPSLISSGTGNSSAARNNMIGEIENRSTFLLAVKADVETQGD-FVQSLATEVR 652
Query: 792 EASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREE 847
+SF+++ED+++FV WLDE+LS LVD +L+HFDWP+ K DALREAAF YQ LMKL ++
Sbjct: 653 ASSFTDIEDLLAFVSWLDEELSFLVDERAVLKHFDWPEGKADALREAAFEYQDLMKLEKQ 712
BLAST of CSPI05G20150 vs. TAIR 10
Match:
AT1G48280.1 (hydroxyproline-rich glycoprotein family protein )
HSP 1 Score: 181.8 bits (460), Expect = 2.3e-45
Identity = 96/248 (38.71%), Positives = 153/248 (61.69%), Query Frame = 0
Query: 594 NNSTNFISSPTSSGGEVHRGSELVQ---FNRKMMKPEVKDHMETQRDHLVMALAMEVREA 653
+NS N S + +V+ + NR +K +ET + + L +V
Sbjct: 296 DNSRNLSQSVNGNKSQVNSAHNSIVGEIQNRSAHLIAIKADIET-KGEFINDLIQKVLTT 355
Query: 654 SFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDALREAAFGYQKLMKLREEVS 713
FS+MED++ FV WLD++L++L D +L+HF WP++K D L+EAA Y++L KL +E+S
Sbjct: 356 CFSDMEDVMKFVDWLDKELATLADERAVLKHFKWPEKKADTLQEAAVEYRELKKLEKELS 415
Query: 714 SFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRYEELGIPIDWLLDCGVVGK 773
S+ D+P + VAL KM +LLDK EQ + L++ R +++ Y++ IP++W+LD G++ K
Sbjct: 416 SYSDDPNIHYGVALKKMANLLDKSEQRIRRLVRLRGSSMRSYQDFKIPVEWMLDSGMICK 475
Query: 774 IKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRFASRVHKFAGGFDSKSMKA 833
IK ++LA+ YM R+ E + ++E +E LL QGVRFA R H+FAGG D +++ A
Sbjct: 476 IKRASIKLAKTYMNRVANELQSARNLDRESTKEALLLQGVRFAYRTHQFAGGLDPETLCA 535
Query: 834 FEELRSRV 839
EE++ RV
Sbjct: 536 LEEIKQRV 542
BLAST of CSPI05G20150 vs. TAIR 10
Match:
AT4G18570.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )
HSP 1 Score: 169.9 bits (429), Expect = 9.0e-42
Identity = 166/568 (29.23%), Positives = 264/568 (46.48%), Query Frame = 0
Query: 283 HENQELTSKLEVMKARIKTLTKMTETEIITKEREEAQKLKSENEDLIKQLEGLQMNRFSE 342
H S V+ + + ++ E E + K KL E+ +I LE ++ E
Sbjct: 75 HATAAAASHNGVVSELRRQVEELREREALLKTENLEVKLLRESVSVIPLLESQIADKNGE 134
Query: 343 VEELVYLRWINACLRYELRNNQIPAGESARYLNKSSSPKSKEKAKQLMLEYAGKEIGEAE 402
++E LR A L +N+ E R +++EK + + K +
Sbjct: 135 IDE---LRKETARL---AEDNERLRREFDRSEEMRRECETREKEMEAEIVELRKLVSSES 194
Query: 403 TDHESNFSHPFSSEIDNLENTSIDSSRSRTSSFREKPNSNLSLKKLIRNQGGSSAVSPPS 462
DH + S F +D +++ S R S R P + + + + ++S
Sbjct: 195 DDHALSVSQRFQGLMDVSAKSNLIRSLKRVGSLRNLP------EPITNQENTNKSISSSG 254
Query: 463 TIDSSHRWKDPLEAVMALS-AETLTLSEVRLQVSSRKSVNSVATSFQLMSKSVEQSLQQK 522
D KD +E+ S +E LT S V SR V V S S+ S + +
Sbjct: 255 DADGDIYRKDEIESYSRSSNSEELTESSSLSTVRSR--VPRVPKPPPKRSISLGDSTENR 314
Query: 523 YSTYKEHH--------KLAIGSEKQIKEKVENERAKSSGDSSSSNLEYEDISMRKNATLV 582
+ + + V +L +R+ +V
Sbjct: 315 ADPPPQKSIPPPPPPPPPPLLQQPPPPPSVSKAPPPPPPPPPPKSLSIASAKVRRVPEVV 374
Query: 583 -LKLAQMKMNRISCEPDSQYDNNSTNFISSPTSSGGEVHRGSELVQFNRKMMKPEVKDHM 642
+ M+ + + DS N+ S+ ++ E NR + +K +
Sbjct: 375 EFYHSLMRRDSTNSRRDSTGGGNAAAEAILANSNARDMIGEIE----NRSVYLLAIKTDV 434
Query: 643 ETQRDHLVMALAMEVREASFSNMEDIVSFVIWLDEKLSSLVDGMEILEHFDWPKRKTDAL 702
ETQ D + L EV A+FS++ED+V FV WLD++LS LVD +L+HF+WP++K DAL
Sbjct: 435 ETQGD-FIRFLIKEVGNAAFSDIEDVVPFVKWLDDELSYLVDERAVLKHFEWPEQKADAL 494
Query: 703 REAAFGYQKLMKLREEVSSFVDNPKLTCEVALNKMNSLLDKVEQSVYALLQTRDTTISRY 762
REAAF Y L KL E S F ++P+ + AL KM +L +K+E VY+L + R++ +++
Sbjct: 495 REAAFCYFDLKKLISEASRFREDPRQSSSSALKKMQALFEKLEHGVYSLSRMRESAATKF 554
Query: 763 EELGIPIDWLLDCGVVGKIKVLCVELARKYMKRIVKEHNALSGPEKEPNREFLLFQGVRF 822
+ IP+DW+L+ G+ +IK+ V+LA KYMKR+ E A+ G P E L+ QGVRF
Sbjct: 555 KSFQIPVDWMLETGITSQIKLASVKLAMKYMKRVSAELEAIEG--GGPEEEELIVQGVRF 614
Query: 823 ASRVHKFAGGFDSKSMKAFEELRSRVHT 841
A RVH+FAGGFD+++MKAFEELR + +
Sbjct: 615 AFRVHQFAGGFDAETMKAFEELRDKARS 621
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Q9LI74 | 9.0e-188 | 45.65 | Protein CHUP1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CHUP1 PE=1 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
A0A0A0KT25 | 0.0e+00 | 98.70 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G577430 PE=4 SV=1 | [more] |
A0A1S3CSZ9 | 0.0e+00 | 90.81 | protein CHUP1, chloroplastic-like OS=Cucumis melo OX=3656 GN=LOC103504610 PE=4 S... | [more] |
A0A5D3BMR7 | 0.0e+00 | 91.79 | Protein CHUP1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1728G00210... | [more] |
A0A6J1DWY5 | 6.4e-245 | 62.40 | protein CHUP1, chloroplastic-like OS=Momordica charantia OX=3673 GN=LOC111024258... | [more] |
A0A061ECQ9 | 1.3e-200 | 50.32 | Hydroxyproline-rich glycoprotein family protein isoform 4 OS=Theobroma cacao OX=... | [more] |
Match Name | E-value | Identity | Description | |
XP_011655490.1 | 0.0e+00 | 99.53 | protein CHUP1, chloroplastic isoform X1 [Cucumis sativus] >XP_031741049.1 protei... | [more] |
XP_031741050.1 | 0.0e+00 | 99.39 | protein CHUP1, chloroplastic isoform X2 [Cucumis sativus] >KAE8648621.1 hypothet... | [more] |
XP_031741051.1 | 0.0e+00 | 96.33 | protein CHUP1, chloroplastic isoform X3 [Cucumis sativus] | [more] |
XP_008467205.1 | 0.0e+00 | 90.81 | PREDICTED: protein CHUP1, chloroplastic-like [Cucumis melo] >XP_008467206.1 PRED... | [more] |
TYK00280.1 | 0.0e+00 | 91.79 | protein CHUP1 [Cucumis melo var. makuwa] | [more] |
Match Name | E-value | Identity | Description | |
AT3G25690.1 | 6.4e-189 | 45.65 | Hydroxyproline-rich glycoprotein family protein | [more] |
AT3G25690.2 | 6.4e-189 | 45.65 | Hydroxyproline-rich glycoprotein family protein | [more] |
AT3G25690.3 | 7.9e-163 | 47.12 | Hydroxyproline-rich glycoprotein family protein | [more] |
AT1G48280.1 | 2.3e-45 | 38.71 | hydroxyproline-rich glycoprotein family protein | [more] |
AT4G18570.1 | 9.0e-42 | 29.23 | Tetratricopeptide repeat (TPR)-like superfamily protein | [more] |
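The pipe-delimited summary rows above can be read programmatically. The following is a minimal Python sketch, not part of the database's own tooling: the function name `parse_hit_row`, the dictionary keys, and the 1e-100 cutoff are illustrative assumptions. It splits a row into its four fields and keeps only the hits whose E-value falls below the chosen threshold.

```python
def parse_hit_row(row):
    """Split one 'Match Name | E-value | Identity | Description' row.

    Hypothetical helper: only the first four pipe-separated cells are
    used, so any trailing cells on the row are ignored.
    """
    cells = [cell.strip() for cell in row.split("|")]
    name, evalue, identity, description = cells[:4]
    return {
        "name": name,
        "evalue": float(evalue),      # e.g. "6.4e-189" parses as a float
        "identity": float(identity),  # percent identity as listed
        "description": description,
    }


# Three rows copied from the TAIR 10 table above.
rows = [
    "AT3G25690.1 | 6.4e-189 | 45.65 | Hydroxyproline-rich glycoprotein family protein",
    "AT1G48280.1 | 2.3e-45 | 38.71 | hydroxyproline-rich glycoprotein family protein",
    "AT4G18570.1 | 9.0e-42 | 29.23 | Tetratricopeptide repeat (TPR)-like superfamily protein",
]

hits = [parse_hit_row(row) for row in rows]

# Assumed cutoff for illustration only; choose one suited to the analysis.
strong = [hit["name"] for hit in hits if hit["evalue"] < 1e-100]
print(strong)
```

An E-value listed as 0.0e+00 parses to 0.0 and therefore passes any positive cutoff, which matches its meaning as the strongest class of hit.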