HG10023513 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10023513
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: protein CHUP1, chloroplastic
Location: Chr05: 34873317 .. 34875860 (-)
RNA-Seq Expression: HG10023513
Synteny: HG10023513
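
The location field above can be read programmatically. The following is a minimal sketch, not part of the database itself: the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Parse a string such as 'Chr05: 34873317 .. 34875860 (-)'.

    Returns (seqid, start, end, strand). The layout is inferred from
    this page only; other records may be formatted differently.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

seqid, start, end, strand = parse_location("Chr05: 34873317 .. 34875860 (-)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 2544 bp on the minus strand of Chr05.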
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAAACAAGAGGGATTTGATGAAGCCTGTATTATTCAAATTTGGGGTTGCTCTGGCTATCTCCTTTGCTGGTTTGCTCTATTCCCGATTCAGACTCGGAAATAAGAGACCTCCTCTGCCTCCTCCCTCGTCGAGTTCTTCAGGTTCTTATACTCAATCTCTTATGCTTCCTCAATTTCTTAATGTTAATTGTTAAGTCATGTATTTGGTGTTTGTGCGAGTCCGAACCTCTAACCTTTTGGTCTTAACTGATTGAGATATATTGAGCTATGTTTAGTTGGCAAGTTTTAAATCATGTAATTCATTTGCTTTTAGCAGATGATCAGGGCAATAAAGTTGACTTGGGAAGGGGAAGGGGAAGAAGACTTAGACTTGACAATCAAGGAATGAAGGCAGCAACAACAGCATCCTCTAATAATGTTGTTCTTTTTGCAGTTGATGCCTATGTAAGTTTTCTAAAGATGGTTTTTACATTTTGGTAATATGATTAACTTAGAATGTTCAAAAAGGTTGGATACGAGGTAGGTTTTTCTGAAGTTGGTTGTGCAAAATTTGTGTAATGAACCCATTTGTTGAACTGTTGGGAGTGATGGAAAATGCTAATACCTTATGATGAAAGTATCTTATTAATGAAGAAACCATCGGGACAGTTGATTTTTCTTTCTCAGTTTCTGTCCTCGGCAATTGTTTGTTTTCAGCATGTTGGTCTTTGATTAAGACATAAGCTTCACGTTATGTGTTATGGTCTTTTCTAAGCTTGTGACTTCGTTGTTTGATTTCTTTCTCAGCAAGAAATGTGTATTCCAAAAGTCAATGTTGATGATTCAAATGTTGGTCTCTGTCCTAGCAATAAGCATGGTGTAGAAAAAGATGGCTTGCTTCTCCCAGAGTTTCAGGAACTTGTCAAGGAATTTGATTTTGCTGCAGCAAATGCTGGGCTTTCTCCTAAGAAAAATGTTGACGCACCAAGGTTGGCGCTCAAAACTCCAAAAGCTTATAAGACAGTTGAGGATGATGAATATGAACAAGAGATCAGACACCTCAAAAGCAAGGTGAAAATGCTGCGAGAGAGGGAGAGGAACCTTGAGGTTCAACTACTTGAGTATTATGGTCTGAAAGAGCAAGAAACTGCAGTAATGGAGCTCCAAAATAGGTTGAAGATTAGTAACATGGAAGCCAAGCTTTTCAAACTCAAGATTGAGTCCCTTCAGGCAGATAACCGACGATTAGTGTCACAAGTTTGCGATCATGCTAAGTCAGTGTCCGACCTCGAGGCCGCAAAAGCAAAAATTAAGTTCCTCAAGAAAAAACTTAGATATGAAGCAGAACAGAACAGGGGACAGATCTTAAATCTTCAGCAAAGAGTTGTTAAGCTGCAAGATCAAGAACATAAGACAAATGAAAGCAATAAAGATGCCCAAATCAAGTTGCAAAAGATTGAAGAATTGGAGAAAGAGATAGAGGACTTGAGAAAGTCGAATTTGAGATTACAGATAGAAAATTCTGATCTGGGTCGGAGATTAGATGCTACTCAATTTCTTGCAAATTCTATTTTGGAAGACCAAGAAGTATGTTTTCTTCCTAGCATAGCTATTTTGCCTTTTTTTTTATAATTTTAAGGTGATGAAAAATTGAATGCCATCCCATCTAATTTGTGCAGAAAGAATCACTCAAAGAAGAAAGGGAGCGTTTGGCAGGAGAAAATGAGGCGTTGGCTAAGGAAATTGAGCAGCTTCAAGCACACCGGTGTGCAGATGTTGAAGAGCTAGTCTATCTTCGCTGGATTAATGCTTGCTTAAGATATGAACTGCGGAATTTTCAGCCTCCAGCAGGGAAAACAGCAGCAAGAGACCTGAGCAAAACATTAAGTCCCAAATCCAAGGAGAAAGCAAAGAAGCTCATCCTCGAATATGCAAATACAGAAGGAATTGAAGGGAAGGGCATTAACGTTGTGGATTTCGATTCAGATCAATGGTCGTCTTCACAAGCTTCCTCTCA
TACTGATCCTGGAGATCCGGATGATTCAGCTGTTGATTTTCCATCAACAACCAAAACAAGTTCAAACAAAATCAAATTCATTAGTAAACTCAGAAAACTCTTGAGGGGAAAAGGTAGTCAACAAAACCTGACTTTGTTAGCAGAAAAATCTGCAGCATCTGTAGAAGATAGTGATTCACCTCGTTACAGTTCAAGTAATTCTACTGGGACCAATGCTACTCGAGCCGAGGGGCAGGGTATTGGATACACAACTCCATCTCAGAATTCATCAAGACATTCAATGGATTTTCACAGATTGCATACCCAAAAGGAAGATGATGGAAAAACTGAGGACTCCATTAGAAGGAATAGTGATGTTGGCTACGTGAACAAGAGATTTGTTTTAGGGAGCGACCGATCGAGCAACTCATCATATAGATCTCAAAGTCAGGATGCAGAATCCACCGAAAAGTCTGAGTTGATGAAATATGCTGAAGTTTTGAAAGACACTCGAGGAGCTAAGAACCGGCCACATAGGAAGGCTGCATCCATTGGTTCGTTTTGA

mRNA sequence

ATGGAAAACAAGAGGGATTTGATGAAGCCTGTATTATTCAAATTTGGGGTTGCTCTGGCTATCTCCTTTGCTGGTTTGCTCTATTCCCGATTCAGACTCGGAAATAAGAGACCTCCTCTGCCTCCTCCCTCGTCGAGTTCTTCAGCAGATGATCAGGGCAATAAAGTTGACTTGGGAAGGGGAAGGGGAAGAAGACTTAGACTTGACAATCAAGGAATGAAGGCAGCAACAACAGCATCCTCTAATAATGTTGTTCTTTTTGCAGTTGATGCCTATCAAGAAATGTGTATTCCAAAAGTCAATGTTGATGATTCAAATGTTGGTCTCTGTCCTAGCAATAAGCATGGTGTAGAAAAAGATGGCTTGCTTCTCCCAGAGTTTCAGGAACTTGTCAAGGAATTTGATTTTGCTGCAGCAAATGCTGGGCTTTCTCCTAAGAAAAATGTTGACGCACCAAGGTTGGCGCTCAAAACTCCAAAAGCTTATAAGACAGTTGAGGATGATGAATATGAACAAGAGATCAGACACCTCAAAAGCAAGGTGAAAATGCTGCGAGAGAGGGAGAGGAACCTTGAGGTTCAACTACTTGAGTATTATGGTCTGAAAGAGCAAGAAACTGCAGTAATGGAGCTCCAAAATAGGTTGAAGATTAGTAACATGGAAGCCAAGCTTTTCAAACTCAAGATTGAGTCCCTTCAGGCAGATAACCGACGATTAGTGTCACAAGTTTGCGATCATGCTAAGTCAGTGTCCGACCTCGAGGCCGCAAAAGCAAAAATTAAGTTCCTCAAGAAAAAACTTAGATATGAAGCAGAACAGAACAGGGGACAGATCTTAAATCTTCAGCAAAGAGTTGTTAAGCTGCAAGATCAAGAACATAAGACAAATGAAAGCAATAAAGATGCCCAAATCAAGTTGCAAAAGATTGAAGAATTGGAGAAAGAGATAGAGGACTTGAGAAAGTCGAATTTGAGATTACAGATAGAAAATTCTGATCTGGGTCGGAGATTAGATGCTACTCAATTTCTTGCAAATTCTATTTTGGAAGACCAAGAAAAAGAATCACTCAAAGAAGAAAGGGAGCGTTTGGCAGGAGAAAATGAGGCGTTGGCTAAGGAAATTGAGCAGCTTCAAGCACACCGGTGTGCAGATGTTGAAGAGCTAGTCTATCTTCGCTGGATTAATGCTTGCTTAAGATATGAACTGCGGAATTTTCAGCCTCCAGCAGGGAAAACAGCAGCAAGAGACCTGAGCAAAACATTAAGTCCCAAATCCAAGGAGAAAGCAAAGAAGCTCATCCTCGAATATGCAAATACAGAAGGAATTGAAGGGAAGGGCATTAACGTTGTGGATTTCGATTCAGATCAATGGTCGTCTTCACAAGCTTCCTCTCATACTGATCCTGGAGATCCGGATGATTCAGCTGTTGATTTTCCATCAACAACCAAAACAAGTTCAAACAAAATCAAATTCATTAGTAAACTCAGAAAACTCTTGAGGGGAAAAGGTAGTCAACAAAACCTGACTTTGTTAGCAGAAAAATCTGCAGCATCTGTAGAAGATAGTGATTCACCTCGTTACAGTTCAAGTAATTCTACTGGGACCAATGCTACTCGAGCCGAGGGGCAGGGTATTGGATACACAACTCCATCTCAGAATTCATCAAGACATTCAATGGATTTTCACAGATTGCATACCCAAAAGGAAGATGATGGAAAAACTGAGGACTCCATTAGAAGGAATAGTGATGTTGGCTACGTGAACAAGAGATTTGTTTTAGGGAGCGACCGATCGAGCAACTCATCATATAGATCTCAAAGTCAGGATGCAGAATCCACCGAAAAGTCTGAGTTGATGAAATATGCTGAAGTTTTGAAAGACACTCGAGGAGCTAAGAACCGGCCACATAGGAAGGCTGCATCCATTGGTTCGTTTTGA

Coding sequence (CDS)

ATGGAAAACAAGAGGGATTTGATGAAGCCTGTATTATTCAAATTTGGGGTTGCTCTGGCTATCTCCTTTGCTGGTTTGCTCTATTCCCGATTCAGACTCGGAAATAAGAGACCTCCTCTGCCTCCTCCCTCGTCGAGTTCTTCAGCAGATGATCAGGGCAATAAAGTTGACTTGGGAAGGGGAAGGGGAAGAAGACTTAGACTTGACAATCAAGGAATGAAGGCAGCAACAACAGCATCCTCTAATAATGTTGTTCTTTTTGCAGTTGATGCCTATCAAGAAATGTGTATTCCAAAAGTCAATGTTGATGATTCAAATGTTGGTCTCTGTCCTAGCAATAAGCATGGTGTAGAAAAAGATGGCTTGCTTCTCCCAGAGTTTCAGGAACTTGTCAAGGAATTTGATTTTGCTGCAGCAAATGCTGGGCTTTCTCCTAAGAAAAATGTTGACGCACCAAGGTTGGCGCTCAAAACTCCAAAAGCTTATAAGACAGTTGAGGATGATGAATATGAACAAGAGATCAGACACCTCAAAAGCAAGGTGAAAATGCTGCGAGAGAGGGAGAGGAACCTTGAGGTTCAACTACTTGAGTATTATGGTCTGAAAGAGCAAGAAACTGCAGTAATGGAGCTCCAAAATAGGTTGAAGATTAGTAACATGGAAGCCAAGCTTTTCAAACTCAAGATTGAGTCCCTTCAGGCAGATAACCGACGATTAGTGTCACAAGTTTGCGATCATGCTAAGTCAGTGTCCGACCTCGAGGCCGCAAAAGCAAAAATTAAGTTCCTCAAGAAAAAACTTAGATATGAAGCAGAACAGAACAGGGGACAGATCTTAAATCTTCAGCAAAGAGTTGTTAAGCTGCAAGATCAAGAACATAAGACAAATGAAAGCAATAAAGATGCCCAAATCAAGTTGCAAAAGATTGAAGAATTGGAGAAAGAGATAGAGGACTTGAGAAAGTCGAATTTGAGATTACAGATAGAAAATTCTGATCTGGGTCGGAGATTAGATGCTACTCAATTTCTTGCAAATTCTATTTTGGAAGACCAAGAAAAAGAATCACTCAAAGAAGAAAGGGAGCGTTTGGCAGGAGAAAATGAGGCGTTGGCTAAGGAAATTGAGCAGCTTCAAGCACACCGGTGTGCAGATGTTGAAGAGCTAGTCTATCTTCGCTGGATTAATGCTTGCTTAAGATATGAACTGCGGAATTTTCAGCCTCCAGCAGGGAAAACAGCAGCAAGAGACCTGAGCAAAACATTAAGTCCCAAATCCAAGGAGAAAGCAAAGAAGCTCATCCTCGAATATGCAAATACAGAAGGAATTGAAGGGAAGGGCATTAACGTTGTGGATTTCGATTCAGATCAATGGTCGTCTTCACAAGCTTCCTCTCATACTGATCCTGGAGATCCGGATGATTCAGCTGTTGATTTTCCATCAACAACCAAAACAAGTTCAAACAAAATCAAATTCATTAGTAAACTCAGAAAACTCTTGAGGGGAAAAGGTAGTCAACAAAACCTGACTTTGTTAGCAGAAAAATCTGCAGCATCTGTAGAAGATAGTGATTCACCTCGTTACAGTTCAAGTAATTCTACTGGGACCAATGCTACTCGAGCCGAGGGGCAGGGTATTGGATACACAACTCCATCTCAGAATTCATCAAGACATTCAATGGATTTTCACAGATTGCATACCCAAAAGGAAGATGATGGAAAAACTGAGGACTCCATTAGAAGGAATAGTGATGTTGGCTACGTGAACAAGAGATTTGTTTTAGGGAGCGACCGATCGAGCAACTCATCATATAGATCTCAAAGTCAGGATGCAGAATCCACCGAAAAGTCTGAGTTGATGAAATATGCTGAAGTTTTGAAAGACACTCGAGGAGCTAAGAACCGGCCACATAGGAAGGCTGCATCCATTGGTTCGTTTTGA

Protein sequence

MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSADDQGNKVDLGRGRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVNVDDSNVGLCPSNKHGVEKDGLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNKDAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTTKTSSNKIKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRAEGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSNSSYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAASIGSF
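
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch that assumes the standard nuclear genetic code and uses no external library; `translate` is a hypothetical helper, not a tool provided by the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters other than A/C/G/T become 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons of the CDS listed above.
print(translate("ATGGAAAACAAGAGGGATTTGATG"))
```

Applied to the first eight codons of the CDS, the routine yields `MENKRDLM`, matching the start of the protein sequence. The last four codons (`GGTTCGTTTTGA`) give `GSF` followed by a stop, matching its end.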
Homology
BLAST of HG10023513 vs. NCBI nr
Match: XP_038898688.1 (protein CHUP1, chloroplastic [Benincasa hispida] >XP_038898689.1 protein CHUP1, chloroplastic [Benincasa hispida] >XP_038898690.1 protein CHUP1, chloroplastic [Benincasa hispida])

HSP 1 Score: 1065.1 bits (2753), Expect = 2.4e-307
Identity = 580/646 (89.78%), Positives = 609/646 (94.27%), Query Frame = 0

Query: 1   MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSADDQGNKVDLGR 60
           M++KRDLMKP+LFKFG ALAISFAG L S+FRL NKRPPL PPSSSSS DDQ +KVDLGR
Sbjct: 1   MDDKRDLMKPILFKFGFALAISFAGFLCSQFRLRNKRPPLLPPSSSSS-DDQSSKVDLGR 60

Query: 61  GRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVNVDDSNVGLCPSNKHGVEKD 120
           GRG   RLDNQG+KAAT ASS NVV FAVDAY++ CIPKVN DDSN+GL PSNKHGV+KD
Sbjct: 61  GRGP--RLDNQGLKAATAASS-NVVHFAVDAYEKKCIPKVNFDDSNIGLRPSNKHGVDKD 120

Query: 121 GLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSK 180
           G LLPEFQELVKEFDF+AANAGL PKKNV+APR  L+TPKAYKTVEDDEYEQEIRHLKSK
Sbjct: 121 G-LLPEFQELVKEFDFSAANAGLPPKKNVEAPRSGLETPKAYKTVEDDEYEQEIRHLKSK 180

Query: 181 VKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLV 240
           VK LRERERNLEVQLLEYYGLKEQETAVMELQNRLKI+NMEAKLF LKIESLQADNRRL 
Sbjct: 181 VKTLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLE 240

Query: 241 SQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNK 300
           SQVCDHAKSVSDLEAAKAKIKFLKKK+RYEAEQNRGQILNLQQRVVKLQDQEHKTNESNK
Sbjct: 241 SQVCDHAKSVSDLEAAKAKIKFLKKKIRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNK 300

Query: 301 DAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEER 360
           DAQI+LQKIEELEKEIEDLRKSNL+LQIENSDL RRLDATQFLANS+LEDQEKESLKEE 
Sbjct: 301 DAQIRLQKIEELEKEIEDLRKSNLKLQIENSDLSRRLDATQFLANSLLEDQEKESLKEEM 360

Query: 361 ERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKT 420
           ERL+ ENEAL KEIEQLQAHRCAD+EELVYLRWINACLRYELRNFQPPAGKTAARDLSKT
Sbjct: 361 ERLSRENEALTKEIEQLQAHRCADIEELVYLRWINACLRYELRNFQPPAGKTAARDLSKT 420

Query: 421 LSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPS 480
           LSPKS+EKAKKLIL+YANTEGIEGK IN+ DFDSDQWSSSQASSHTDPGDPDDSAVDFPS
Sbjct: 421 LSPKSEEKAKKLILDYANTEGIEGKSINITDFDSDQWSSSQASSHTDPGDPDDSAVDFPS 480

Query: 481 TTKTSSNKIKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRA 540
           T KTSSNK+KFISKLRKLLRGKGSQQNLTLLAEKSAASVEDS SPRYSSSNS GTNATRA
Sbjct: 481 TAKTSSNKVKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDSGSPRYSSSNSPGTNATRA 540

Query: 541 EGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSI-RRNSDVGYVNKRFVLGSDRSS 600
           EGQGIGYTTPS+NSSRHSMDFHRL++QKEDDGKTEDSI RRNSDVGYVNK+FVLGSD SS
Sbjct: 541 EGQGIGYTTPSRNSSRHSMDFHRLNSQKEDDGKTEDSIRRRNSDVGYVNKKFVLGSDESS 600

Query: 601 NSSYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAASIGSF 646
           NSSYRSQSQD ESTEKSELMKYAEVLKDTRGAKN+  RKAASIGSF
Sbjct: 601 NSSYRSQSQDTESTEKSELMKYAEVLKDTRGAKNQSQRKAASIGSF 641
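
The percentages on each HSP statistics line are the matched-column counts divided by the alignment length. As an illustrative sketch (the function name `hsp_percentages` and the regular expression are assumptions based only on the line layout shown on this page), they can be recomputed like this:

```python
import re

# Accepts both "Positives" and the misspelling "Postives" seen in some outputs.
STAT_RE = re.compile(r"Identity = (\d+)/(\d+).*?Pos\w*tives = (\d+)/(\d+)")

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = STAT_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return (round(100 * ident / ident_len, 2),
            round(100 * pos / pos_len, 2))

line = "Identity = 580/646 (89.78%), Positives = 609/646 (94.27%), Query Frame = 0"
print(hsp_percentages(line))
```

For the Benincasa hispida hit above, 580/646 and 609/646 reproduce the reported 89.78% identity and 94.27% positives.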

BLAST of HG10023513 vs. NCBI nr
Match: KAA0059471.1 (protein CHUP1 [Cucumis melo var. makuwa] >TYK03852.1 protein CHUP1 [Cucumis melo var. makuwa])

HSP 1 Score: 1038.9 bits (2685), Expect = 1.9e-299
Identity = 565/645 (87.60%), Positives = 599/645 (92.87%), Query Frame = 0

Query: 1   MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSADDQGNKVDLGR 60
           ME+K +LMKP+L KFGV LAISFA  LYSRFRL NKRPPLPPP SSSS DDQGNKV+LGR
Sbjct: 1   MEDKGNLMKPLLLKFGVVLAISFASFLYSRFRLKNKRPPLPPPLSSSS-DDQGNKVNLGR 60

Query: 61  GRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVNVDDSNVGLCPSNKHGVEKD 120
           GRG   RLDNQGMKAAT ASS NVVLFAVDAY+EMCIPKVNVDDSN+GLCPSNKHGV+KD
Sbjct: 61  GRGP--RLDNQGMKAATAASS-NVVLFAVDAYEEMCIPKVNVDDSNLGLCPSNKHGVDKD 120

Query: 121 GLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSK 180
           GLLLPEFQE VKEFD +AANA  SPKKNV+APR  L+TPKAYKTVEDDEYEQEIRHLKSK
Sbjct: 121 GLLLPEFQEHVKEFDLSAANAEFSPKKNVEAPRSGLETPKAYKTVEDDEYEQEIRHLKSK 180

Query: 181 VKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLV 240
           VKMLRERERNLE QLLEYYGLKEQETAVMELQNRLKI+NMEAKLF  KIESL+ADNRRL 
Sbjct: 181 VKMLRERERNLEFQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTFKIESLEADNRRLE 240

Query: 241 SQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNK 300
           SQVC+HAK+VSDLEAA+AKIKFLKKKLR+EAEQNR QILNLQQ+V+KLQDQEHKTNESNK
Sbjct: 241 SQVCNHAKTVSDLEAARAKIKFLKKKLRHEAEQNRRQILNLQQKVLKLQDQEHKTNESNK 300

Query: 301 DAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEER 360
           DAQIKLQKIE+LEKEIE+LRK N RLQIENSDLGRRLDATQFLANS+LEDQEKESLKEE 
Sbjct: 301 DAQIKLQKIEDLEKEIEELRKLNSRLQIENSDLGRRLDATQFLANSLLEDQEKESLKEET 360

Query: 361 ERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKT 420
           ERL  ENEAL KEIEQLQAHR ADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKT
Sbjct: 361 ERLTQENEALTKEIEQLQAHRLADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKT 420

Query: 421 LSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPS 480
           LSPKS+EKAKKLIL+YANTEG EGKGI+V DFDSDQWSSSQASSHTDPGDPDDSA +FPS
Sbjct: 421 LSPKSEEKAKKLILDYANTEGNEGKGISVTDFDSDQWSSSQASSHTDPGDPDDSAAEFPS 480

Query: 481 TTKTSSNKIKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRA 540
           T KTSSNKIKFI KL+KLLRGKGSQQNLTLLAEKSAAS+EDSDSP YSSSNSTGTNATRA
Sbjct: 481 TAKTSSNKIKFIGKLKKLLRGKGSQQNLTLLAEKSAASIEDSDSPCYSSSNSTGTNATRA 540

Query: 541 EGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSN 600
           EGQ IGY T S+NSSR+S+DF RLH+QKED+ KTEDS RRNSDVGYVNKRFVLGSD+SSN
Sbjct: 541 EGQAIGYATSSRNSSRYSIDFQRLHSQKEDEVKTEDSARRNSDVGYVNKRFVLGSDQSSN 600

Query: 601 SSYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAASIGSF 646
           SS RSQSQD ESTEKSELMKYAEVLKDTRGAKN+ HRKAASIGSF
Sbjct: 601 SSDRSQSQDTESTEKSELMKYAEVLKDTRGAKNQSHRKAASIGSF 641

BLAST of HG10023513 vs. NCBI nr
Match: XP_008462405.1 (PREDICTED: protein CHUP1, chloroplastic [Cucumis melo] >XP_008462406.1 PREDICTED: protein CHUP1, chloroplastic [Cucumis melo])

HSP 1 Score: 1035.4 bits (2676), Expect = 2.1e-298
Identity = 564/645 (87.44%), Positives = 598/645 (92.71%), Query Frame = 0

Query: 1   MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSADDQGNKVDLGR 60
           ME+K +LMKP+L KFGV LAISFA  LYSRFRL NKRPPLPPP SSSS DDQGNKV+LGR
Sbjct: 1   MEDKGNLMKPLLLKFGVVLAISFASFLYSRFRLKNKRPPLPPPLSSSS-DDQGNKVNLGR 60

Query: 61  GRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVNVDDSNVGLCPSNKHGVEKD 120
           GRG   RLDNQGMKAAT ASS NVVLFAVDAY+EMCI KVNVDDSN+GLCPSNKHGV+KD
Sbjct: 61  GRGP--RLDNQGMKAATAASS-NVVLFAVDAYEEMCIRKVNVDDSNLGLCPSNKHGVDKD 120

Query: 121 GLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSK 180
           GLLLPEFQE VKEFD +AANA  SPKKNV+APR  L+TPKAYKTVEDDEYEQEIRHLKSK
Sbjct: 121 GLLLPEFQEHVKEFDLSAANAEFSPKKNVEAPRSGLETPKAYKTVEDDEYEQEIRHLKSK 180

Query: 181 VKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLV 240
           VKMLRERERNLE QLLEYYGLKEQETAVMELQNRLKI+NMEAKLF  KIESL+ADNRRL 
Sbjct: 181 VKMLRERERNLEFQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTFKIESLEADNRRLE 240

Query: 241 SQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNK 300
           SQVC+HAK+VSDLEAA+AKIKFLKKKLR+EAEQNR QILNLQQ+V+KLQDQEHKTNESNK
Sbjct: 241 SQVCNHAKTVSDLEAARAKIKFLKKKLRHEAEQNRRQILNLQQKVLKLQDQEHKTNESNK 300

Query: 301 DAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEER 360
           DAQIKLQKIE+LEKEIE+LRK N RLQIENSDLGRRLDATQFLANS+LEDQEKESLKEE 
Sbjct: 301 DAQIKLQKIEDLEKEIEELRKLNSRLQIENSDLGRRLDATQFLANSLLEDQEKESLKEET 360

Query: 361 ERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKT 420
           ERL  ENEAL KEIEQLQAHR ADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKT
Sbjct: 361 ERLTQENEALTKEIEQLQAHRLADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKT 420

Query: 421 LSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPS 480
           LSPKS+EKAKKLIL+YANTEG EGKGI+V DFDSDQWSSSQASSHTDPGDPDDSA +FPS
Sbjct: 421 LSPKSEEKAKKLILDYANTEGNEGKGISVTDFDSDQWSSSQASSHTDPGDPDDSAAEFPS 480

Query: 481 TTKTSSNKIKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRA 540
           T KTSSNKIKFI KL+KLLRGKGSQQNLTLLAEKSAAS+EDSDSP YSSSNSTGTNATRA
Sbjct: 481 TAKTSSNKIKFIGKLKKLLRGKGSQQNLTLLAEKSAASIEDSDSPCYSSSNSTGTNATRA 540

Query: 541 EGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSN 600
           EGQ IGY T S+NSSR+S+DF RLH+QKED+ KTEDS RRNSDVGYVNKRFVLGSD+SSN
Sbjct: 541 EGQAIGYATSSRNSSRYSIDFQRLHSQKEDEVKTEDSARRNSDVGYVNKRFVLGSDQSSN 600

Query: 601 SSYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAASIGSF 646
           SS RSQSQD ESTEKSELMKYAEVLKDTRGAKN+ HRKAASIGSF
Sbjct: 601 SSDRSQSQDTESTEKSELMKYAEVLKDTRGAKNQSHRKAASIGSF 641

BLAST of HG10023513 vs. NCBI nr
Match: XP_031744947.1 (protein CHUP1, chloroplastic isoform X1 [Cucumis sativus] >XP_031744948.1 protein CHUP1, chloroplastic isoform X1 [Cucumis sativus])

HSP 1 Score: 1021.9 bits (2641), Expect = 2.4e-294
Identity = 552/645 (85.58%), Positives = 589/645 (91.32%), Query Frame = 0

Query: 1   MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSADDQGNKVDLGR 60
           ME+K +L +P+LFKFGV LAISFAG LYSRFRL NKRPPLPPPS SSSADDQGNKV+LGR
Sbjct: 1   MEDKGNLRRPILFKFGVVLAISFAGFLYSRFRLKNKRPPLPPPSYSSSADDQGNKVNLGR 60

Query: 61  GRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVNVDDSNVGLCPSNKHGVEKD 120
           GRG   RLD QG       + +NVVLFAVDAY+E CIPKVN DDSN+GLCPSNKHGV+KD
Sbjct: 61  GRGP--RLDKQG-------TPSNVVLFAVDAYEETCIPKVNFDDSNLGLCPSNKHGVDKD 120

Query: 121 GLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSK 180
           GLL PEFQEL+KEFD +AANA  S KKNV+APR  L+TPKAYKTVE+DEYEQEIR+LKSK
Sbjct: 121 GLLPPEFQELLKEFDLSAANAEFSSKKNVEAPRYGLETPKAYKTVENDEYEQEIRYLKSK 180

Query: 181 VKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLV 240
           VKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKI+NMEAKLF  KIESL+ADNRRL 
Sbjct: 181 VKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTFKIESLEADNRRLE 240

Query: 241 SQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNK 300
           SQVCDHAKSVSDLEAA+AKIKFLKKKLRYEAEQNRGQILNLQ+RV+KLQDQEHKTN+SNK
Sbjct: 241 SQVCDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQKRVLKLQDQEHKTNQSNK 300

Query: 301 DAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEER 360
           DAQIKLQKIE+LEKEIE+LRKSNLRL+IENSDLGRRLDATQFLANS+LEDQEKESLKEE 
Sbjct: 301 DAQIKLQKIEDLEKEIEELRKSNLRLEIENSDLGRRLDATQFLANSLLEDQEKESLKEET 360

Query: 361 ERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKT 420
           ERL  ENEAL KEIEQLQAHR ADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKT
Sbjct: 361 ERLTRENEALTKEIEQLQAHRLADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKT 420

Query: 421 LSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPS 480
           LSPKS+EKAKKLIL+YANTEG EGK +NV DFDSDQWSSSQASSHTDPGDPDDS  DFPS
Sbjct: 421 LSPKSEEKAKKLILDYANTEGNEGKSMNVTDFDSDQWSSSQASSHTDPGDPDDSTTDFPS 480

Query: 481 TTKTSSNKIKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRA 540
           T KT SNKIKFISKLRKLL+GKGSQQN+TLLAEKSAASVEDSDSP YS+SNSTGTNATRA
Sbjct: 481 TAKTGSNKIKFISKLRKLLKGKGSQQNMTLLAEKSAASVEDSDSPCYSTSNSTGTNATRA 540

Query: 541 EGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSN 600
           EGQ IGY TP  NSS HSMDFHRL +QKEDD K EDSIRRNSDVG VNKRFV+GSD+ S+
Sbjct: 541 EGQAIGYATPLLNSSGHSMDFHRLQSQKEDDVKIEDSIRRNSDVGCVNKRFVVGSDQLSD 600

Query: 601 SSYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAASIGSF 646
           SSYRSQ+QD ESTEKSELMKYAEVLKDTRGAKNR HRK ASIGSF
Sbjct: 601 SSYRSQNQDTESTEKSELMKYAEVLKDTRGAKNRSHRKTASIGSF 636

BLAST of HG10023513 vs. NCBI nr
Match: XP_004141788.1 (protein CHUP1, chloroplastic isoform X2 [Cucumis sativus] >KGN45575.1 hypothetical protein Csa_015974 [Cucumis sativus])

HSP 1 Score: 1015.4 bits (2624), Expect = 2.2e-292
Identity = 551/645 (85.43%), Positives = 588/645 (91.16%), Query Frame = 0

Query: 1   MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSADDQGNKVDLGR 60
           ME+K +L +P+LFKFGV LAISFAG LYSRFRL NKRPPLPPPS SSS DDQGNKV+LGR
Sbjct: 1   MEDKGNLRRPILFKFGVVLAISFAGFLYSRFRLKNKRPPLPPPSYSSS-DDQGNKVNLGR 60

Query: 61  GRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVNVDDSNVGLCPSNKHGVEKD 120
           GRG   RLD QG       + +NVVLFAVDAY+E CIPKVN DDSN+GLCPSNKHGV+KD
Sbjct: 61  GRGP--RLDKQG-------TPSNVVLFAVDAYEETCIPKVNFDDSNLGLCPSNKHGVDKD 120

Query: 121 GLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSK 180
           GLL PEFQEL+KEFD +AANA  S KKNV+APR  L+TPKAYKTVE+DEYEQEIR+LKSK
Sbjct: 121 GLLPPEFQELLKEFDLSAANAEFSSKKNVEAPRYGLETPKAYKTVENDEYEQEIRYLKSK 180

Query: 181 VKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLV 240
           VKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKI+NMEAKLF  KIESL+ADNRRL 
Sbjct: 181 VKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTFKIESLEADNRRLE 240

Query: 241 SQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNK 300
           SQVCDHAKSVSDLEAA+AKIKFLKKKLRYEAEQNRGQILNLQ+RV+KLQDQEHKTN+SNK
Sbjct: 241 SQVCDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQKRVLKLQDQEHKTNQSNK 300

Query: 301 DAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEER 360
           DAQIKLQKIE+LEKEIE+LRKSNLRL+IENSDLGRRLDATQFLANS+LEDQEKESLKEE 
Sbjct: 301 DAQIKLQKIEDLEKEIEELRKSNLRLEIENSDLGRRLDATQFLANSLLEDQEKESLKEET 360

Query: 361 ERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKT 420
           ERL  ENEAL KEIEQLQAHR ADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKT
Sbjct: 361 ERLTRENEALTKEIEQLQAHRLADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKT 420

Query: 421 LSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPS 480
           LSPKS+EKAKKLIL+YANTEG EGK +NV DFDSDQWSSSQASSHTDPGDPDDS  DFPS
Sbjct: 421 LSPKSEEKAKKLILDYANTEGNEGKSMNVTDFDSDQWSSSQASSHTDPGDPDDSTTDFPS 480

Query: 481 TTKTSSNKIKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRA 540
           T KT SNKIKFISKLRKLL+GKGSQQN+TLLAEKSAASVEDSDSP YS+SNSTGTNATRA
Sbjct: 481 TAKTGSNKIKFISKLRKLLKGKGSQQNMTLLAEKSAASVEDSDSPCYSTSNSTGTNATRA 540

Query: 541 EGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSN 600
           EGQ IGY TP  NSS HSMDFHRL +QKEDD K EDSIRRNSDVG VNKRFV+GSD+ S+
Sbjct: 541 EGQAIGYATPLLNSSGHSMDFHRLQSQKEDDVKIEDSIRRNSDVGCVNKRFVVGSDQLSD 600

Query: 601 SSYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAASIGSF 646
           SSYRSQ+QD ESTEKSELMKYAEVLKDTRGAKNR HRK ASIGSF
Sbjct: 601 SSYRSQNQDTESTEKSELMKYAEVLKDTRGAKNRSHRKTASIGSF 635

BLAST of HG10023513 vs. ExPASy Swiss-Prot
Match: Q9LI74 (Protein CHUP1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CHUP1 PE=1 SV=1)

HSP 1 Score: 209.1 bits (531), Expect = 1.4e-52
Identity = 150/388 (38.66%), Positives = 235/388 (60.57%), Query Frame = 0

Query: 120 DGLLLPEFQELVK-EFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLK 179
           D  +LPEF++L+  E ++        P  + D      +  + Y+ VE    + E+  LK
Sbjct: 85  DDDILPEFEDLLSGEIEY--------PLPDDDNNLEKAEKERKYE-VEMAYNDGELERLK 144

Query: 180 SKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRR 239
             VK L ERE  LE +LLEYYGLKEQE+ ++ELQ +LKI  +E  +  + I SLQA+ ++
Sbjct: 145 QLVKELEEREVKLEGELLEYYGLKEQESDIVELQRQLKIKTVEIDMLNITINSLQAERKK 204

Query: 240 LVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNES 299
           L  ++  +     +LE A+ KIK L+++++ +A Q +GQ+L L+Q V  LQ +E +    
Sbjct: 205 LQEELSQNGIVRKELEVARNKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKEEEAMNK 264

Query: 300 NKDAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQ---FLANSILEDQEKES 359
           + + + KL+ +++LE ++ +L++ N  LQ E  +L  +LD+ +      +++ E  +   
Sbjct: 265 DTEVERKLKAVQDLEVQVMELKRKNRELQHEKRELSIKLDSAEARIATLSNMTESDKVAK 324

Query: 360 LKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAAR 419
           ++EE   L   NE L K++E LQ +R ++VEELVYLRW+NACLRYELRN+Q PAGK +AR
Sbjct: 325 VREEVNNLKHNNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPAGKISAR 384

Query: 420 DLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSA 479
           DLSK LSPKS+ KAK+L+LEYA +E   G+G      D+D  S+    S     D D+++
Sbjct: 385 DLSKNLSPKSQAKAKRLMLEYAGSE--RGQG------DTDLESNYSQPSSPGSDDFDNAS 444

Query: 480 VDFPSTTKTS-SNKIKFISKLRKLLRGK 503
           +D  ++  +S S K   I KL+K  + K
Sbjct: 445 MDSSTSRFSSFSKKPGLIQKLKKWGKSK 455

BLAST of HG10023513 vs. ExPASy TrEMBL
Match: A0A5A7V182 (Protein CHUP1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold347G00460 PE=4 SV=1)

HSP 1 Score: 1038.9 bits (2685), Expect = 9.0e-300
Identity = 565/645 (87.60%), Positives = 599/645 (92.87%), Query Frame = 0

Query: 1   MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSADDQGNKVDLGR 60
           ME+K +LMKP+L KFGV LAISFA  LYSRFRL NKRPPLPPP SSSS DDQGNKV+LGR
Sbjct: 1   MEDKGNLMKPLLLKFGVVLAISFASFLYSRFRLKNKRPPLPPPLSSSS-DDQGNKVNLGR 60

Query: 61  GRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVNVDDSNVGLCPSNKHGVEKD 120
           GRG   RLDNQGMKAAT ASS NVVLFAVDAY+EMCIPKVNVDDSN+GLCPSNKHGV+KD
Sbjct: 61  GRGP--RLDNQGMKAATAASS-NVVLFAVDAYEEMCIPKVNVDDSNLGLCPSNKHGVDKD 120

Query: 121 GLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSK 180
           GLLLPEFQE VKEFD +AANA  SPKKNV+APR  L+TPKAYKTVEDDEYEQEIRHLKSK
Sbjct: 121 GLLLPEFQEHVKEFDLSAANAEFSPKKNVEAPRSGLETPKAYKTVEDDEYEQEIRHLKSK 180

Query: 181 VKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLV 240
           VKMLRERERNLE QLLEYYGLKEQETAVMELQNRLKI+NMEAKLF  KIESL+ADNRRL 
Sbjct: 181 VKMLRERERNLEFQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTFKIESLEADNRRLE 240

Query: 241 SQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNK 300
           SQVC+HAK+VSDLEAA+AKIKFLKKKLR+EAEQNR QILNLQQ+V+KLQDQEHKTNESNK
Sbjct: 241 SQVCNHAKTVSDLEAARAKIKFLKKKLRHEAEQNRRQILNLQQKVLKLQDQEHKTNESNK 300

Query: 301 DAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEER 360
           DAQIKLQKIE+LEKEIE+LRK N RLQIENSDLGRRLDATQFLANS+LEDQEKESLKEE 
Sbjct: 301 DAQIKLQKIEDLEKEIEELRKLNSRLQIENSDLGRRLDATQFLANSLLEDQEKESLKEET 360

Query: 361 ERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKT 420
           ERL  ENEAL KEIEQLQAHR ADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKT
Sbjct: 361 ERLTQENEALTKEIEQLQAHRLADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKT 420

Query: 421 LSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPS 480
           LSPKS+EKAKKLIL+YANTEG EGKGI+V DFDSDQWSSSQASSHTDPGDPDDSA +FPS
Sbjct: 421 LSPKSEEKAKKLILDYANTEGNEGKGISVTDFDSDQWSSSQASSHTDPGDPDDSAAEFPS 480

Query: 481 TTKTSSNKIKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRA 540
           T KTSSNKIKFI KL+KLLRGKGSQQNLTLLAEKSAAS+EDSDSP YSSSNSTGTNATRA
Sbjct: 481 TAKTSSNKIKFIGKLKKLLRGKGSQQNLTLLAEKSAASIEDSDSPCYSSSNSTGTNATRA 540

Query: 541 EGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSN 600
           EGQ IGY T S+NSSR+S+DF RLH+QKED+ KTEDS RRNSDVGYVNKRFVLGSD+SSN
Sbjct: 541 EGQAIGYATSSRNSSRYSIDFQRLHSQKEDEVKTEDSARRNSDVGYVNKRFVLGSDQSSN 600

Query: 601 SSYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAASIGSF 646
           SS RSQSQD ESTEKSELMKYAEVLKDTRGAKN+ HRKAASIGSF
Sbjct: 601 SSDRSQSQDTESTEKSELMKYAEVLKDTRGAKNQSHRKAASIGSF 641

BLAST of HG10023513 vs. ExPASy TrEMBL
Match: A0A1S3CGW9 (protein CHUP1, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103500772 PE=4 SV=1)

HSP 1 Score: 1035.4 bits (2676), Expect = 1.0e-298
Identity = 564/645 (87.44%), Positives = 598/645 (92.71%), Query Frame = 0

Query: 1   MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSADDQGNKVDLGR 60
           ME+K +LMKP+L KFGV LAISFA  LYSRFRL NKRPPLPPP SSSS DDQGNKV+LGR
Sbjct: 1   MEDKGNLMKPLLLKFGVVLAISFASFLYSRFRLKNKRPPLPPPLSSSS-DDQGNKVNLGR 60

Query: 61  GRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVNVDDSNVGLCPSNKHGVEKD 120
           GRG   RLDNQGMKAAT ASS NVVLFAVDAY+EMCI KVNVDDSN+GLCPSNKHGV+KD
Sbjct: 61  GRGP--RLDNQGMKAATAASS-NVVLFAVDAYEEMCIRKVNVDDSNLGLCPSNKHGVDKD 120

Query: 121 GLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSK 180
           GLLLPEFQE VKEFD +AANA  SPKKNV+APR  L+TPKAYKTVEDDEYEQEIRHLKSK
Sbjct: 121 GLLLPEFQEHVKEFDLSAANAEFSPKKNVEAPRSGLETPKAYKTVEDDEYEQEIRHLKSK 180

Query: 181 VKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLV 240
           VKMLRERERNLE QLLEYYGLKEQETAVMELQNRLKI+NMEAKLF  KIESL+ADNRRL 
Sbjct: 181 VKMLRERERNLEFQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTFKIESLEADNRRLE 240

Query: 241 SQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNK 300
           SQVC+HAK+VSDLEAA+AKIKFLKKKLR+EAEQNR QILNLQQ+V+KLQDQEHKTNESNK
Sbjct: 241 SQVCNHAKTVSDLEAARAKIKFLKKKLRHEAEQNRRQILNLQQKVLKLQDQEHKTNESNK 300

Query: 301 DAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEER 360
           DAQIKLQKIE+LEKEIE+LRK N RLQIENSDLGRRLDATQFLANS+LEDQEKESLKEE 
Sbjct: 301 DAQIKLQKIEDLEKEIEELRKLNSRLQIENSDLGRRLDATQFLANSLLEDQEKESLKEET 360

Query: 361 ERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKT 420
           ERL  ENEAL KEIEQLQAHR ADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKT
Sbjct: 361 ERLTQENEALTKEIEQLQAHRLADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKT 420

Query: 421 LSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPS 480
           LSPKS+EKAKKLIL+YANTEG EGKGI+V DFDSDQWSSSQASSHTDPGDPDDSA +FPS
Sbjct: 421 LSPKSEEKAKKLILDYANTEGNEGKGISVTDFDSDQWSSSQASSHTDPGDPDDSAAEFPS 480

Query: 481 TTKTSSNKIKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRA 540
           T KTSSNKIKFI KL+KLLRGKGSQQNLTLLAEKSAAS+EDSDSP YSSSNSTGTNATRA
Sbjct: 481 TAKTSSNKIKFIGKLKKLLRGKGSQQNLTLLAEKSAASIEDSDSPCYSSSNSTGTNATRA 540

Query: 541 EGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSN 600
           EGQ IGY T S+NSSR+S+DF RLH+QKED+ KTEDS RRNSDVGYVNKRFVLGSD+SSN
Sbjct: 541 EGQAIGYATSSRNSSRYSIDFQRLHSQKEDEVKTEDSARRNSDVGYVNKRFVLGSDQSSN 600

Query: 601 SSYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAASIGSF 646
           SS RSQSQD ESTEKSELMKYAEVLKDTRGAKN+ HRKAASIGSF
Sbjct: 601 SSDRSQSQDTESTEKSELMKYAEVLKDTRGAKNQSHRKAASIGSF 641

BLAST of HG10023513 vs. ExPASy TrEMBL
Match: A0A0A0K799 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G452300 PE=4 SV=1)

HSP 1 Score: 1015.4 bits (2624), Expect = 1.1e-292
Identity = 551/645 (85.43%), Positives = 588/645 (91.16%), Query Frame = 0

Query: 1   MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSADDQGNKVDLGR 60
           ME+K +L +P+LFKFGV LAISFAG LYSRFRL NKRPPLPPPS SSS DDQGNKV+LGR
Sbjct: 1   MEDKGNLRRPILFKFGVVLAISFAGFLYSRFRLKNKRPPLPPPSYSSS-DDQGNKVNLGR 60

Query: 61  GRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVNVDDSNVGLCPSNKHGVEKD 120
           GRG   RLD QG       + +NVVLFAVDAY+E CIPKVN DDSN+GLCPSNKHGV+KD
Sbjct: 61  GRGP--RLDKQG-------TPSNVVLFAVDAYEETCIPKVNFDDSNLGLCPSNKHGVDKD 120

Query: 121 GLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSK 180
           GLL PEFQEL+KEFD +AANA  S KKNV+APR  L+TPKAYKTVE+DEYEQEIR+LKSK
Sbjct: 121 GLLPPEFQELLKEFDLSAANAEFSSKKNVEAPRYGLETPKAYKTVENDEYEQEIRYLKSK 180

Query: 181 VKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLV 240
           VKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKI+NMEAKLF  KIESL+ADNRRL 
Sbjct: 181 VKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTFKIESLEADNRRLE 240

Query: 241 SQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNK 300
           SQVCDHAKSVSDLEAA+AKIKFLKKKLRYEAEQNRGQILNLQ+RV+KLQDQEHKTN+SNK
Sbjct: 241 SQVCDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQKRVLKLQDQEHKTNQSNK 300

Query: 301 DAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEER 360
           DAQIKLQKIE+LEKEIE+LRKSNLRL+IENSDLGRRLDATQFLANS+LEDQEKESLKEE 
Sbjct: 301 DAQIKLQKIEDLEKEIEELRKSNLRLEIENSDLGRRLDATQFLANSLLEDQEKESLKEET 360

Query: 361 ERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKT 420
           ERL  ENEAL KEIEQLQAHR ADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKT
Sbjct: 361 ERLTRENEALTKEIEQLQAHRLADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKT 420

Query: 421 LSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPS 480
           LSPKS+EKAKKLIL+YANTEG EGK +NV DFDSDQWSSSQASSHTDPGDPDDS  DFPS
Sbjct: 421 LSPKSEEKAKKLILDYANTEGNEGKSMNVTDFDSDQWSSSQASSHTDPGDPDDSTTDFPS 480

Query: 481 TTKTSSNKIKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRA 540
           T KT SNKIKFISKLRKLL+GKGSQQN+TLLAEKSAASVEDSDSP YS+SNSTGTNATRA
Sbjct: 481 TAKTGSNKIKFISKLRKLLKGKGSQQNMTLLAEKSAASVEDSDSPCYSTSNSTGTNATRA 540

Query: 541 EGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSN 600
           EGQ IGY TP  NSS HSMDFHRL +QKEDD K EDSIRRNSDVG VNKRFV+GSD+ S+
Sbjct: 541 EGQAIGYATPLLNSSGHSMDFHRLQSQKEDDVKIEDSIRRNSDVGCVNKRFVVGSDQLSD 600

Query: 601 SSYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAASIGSF 646
           SSYRSQ+QD ESTEKSELMKYAEVLKDTRGAKNR HRK ASIGSF
Sbjct: 601 SSYRSQNQDTESTEKSELMKYAEVLKDTRGAKNRSHRKTASIGSF 635

BLAST of HG10023513 vs. ExPASy TrEMBL
Match: A0A6J1HMC2 (protein CHUP1, chloroplastic isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111464273 PE=4 SV=1)

HSP 1 Score: 902.9 bits (2332), Expect = 7.7e-259
Identity = 503/647 (77.74%), Positives = 551/647 (85.16%), Query Frame = 0

Query: 1   MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSADDQGNKVDLGR 60
           ME K DL+KPVLFKFGV LAISFA  +YSRFR+ NKRP L PPSSSSS + + NKV+LGR
Sbjct: 2   MEEKTDLVKPVLFKFGVVLAISFASFMYSRFRIRNKRPSLAPPSSSSSDEWRNNKVELGR 61

Query: 61  GRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVNVDDSNVGLCPSNKHGVEKD 120
           GRG   +LD+Q MK AT ASSN ++L A DAY+EMCI K N DDS+ G    N H V+++
Sbjct: 62  GRGH--KLDDQTMKVATAASSNAIIL-AADAYEEMCIQKANGDDSSAGFSTGNDHIVDEE 121

Query: 121 GLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSK 180
           GLLLPEFQELVK+FD +AANAG SPKKN  A RL ++TPKAYK VE D YE EI+HLKSK
Sbjct: 122 GLLLPEFQELVKQFDLSAANAGFSPKKNAGALRLGIETPKAYKRVESDGYEHEIKHLKSK 181

Query: 181 VKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLV 240
           VKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKI+NMEAKLF LKIESLQADNRRL 
Sbjct: 182 VKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLE 241

Query: 241 SQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNK 300
           SQV D AKS SDLEAA+  IKFLKKKLR+EAEQNR QI+NLQQRV KL DQE K NES K
Sbjct: 242 SQVSDQAKSASDLEAARTTIKFLKKKLRHEAEQNREQIVNLQQRVTKLLDQECKINESTK 301

Query: 301 DAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEER 360
           + QIKLQ IE+LEKEIE+L+K+N RLQ ENSDLGRRLDATQFLANSILEDQEKESLKEER
Sbjct: 302 NDQIKLQNIEDLEKEIEELKKANSRLQKENSDLGRRLDATQFLANSILEDQEKESLKEER 361

Query: 361 ERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKT 420
           +R A ENE L KEIEQLQAHRCADVEELVYLRWINACLRYELRNFQP AGKTAARDLSKT
Sbjct: 362 DRFAQENETLTKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPAAGKTAARDLSKT 421

Query: 421 LSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPS 480
           LSPKS+ KAKKLILEYANTEGIEGK IN+ DFDSDQWSSSQASSHTDPGD D SAVD   
Sbjct: 422 LSPKSEHKAKKLILEYANTEGIEGKSINLTDFDSDQWSSSQASSHTDPGDLDYSAVDSRL 481

Query: 481 TTKTSSNKIKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRA 540
           T K SSNKIKF+SKLR LLRGK +QQ+  LL EKSAA+V D DSPRYSSS+STGTNATRA
Sbjct: 482 TAKPSSNKIKFMSKLRSLLRGKSNQQSSALLPEKSAAAVGDVDSPRYSSSHSTGTNATRA 541

Query: 541 EGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSN 600
           +G G GYTTPSQNSSR SMDFHRL++QKEDD KTEDS+RRNSDVGY+NKRFV GSDRSSN
Sbjct: 542 DGHGTGYTTPSQNSSRRSMDFHRLNSQKEDDVKTEDSLRRNSDVGYINKRFVSGSDRSSN 601

Query: 601 SSYRSQSQDAEST---EKSELMKYAEVLKDTRGAKNRPHRKAASIGS 645
           S YRS SQ+ EST   EKSEL+KYAEVLK++RG KN+  RK A + S
Sbjct: 602 SLYRSSSQETESTDKSEKSELLKYAEVLKNSRGDKNQSRRKVAPMCS 645

BLAST of HG10023513 vs. ExPASy TrEMBL
Match: A0A6J1D049 (protein CHUP1, chloroplastic isoform X3 OS=Momordica charantia OX=3673 GN=LOC111016343 PE=4 SV=1)

HSP 1 Score: 897.9 bits (2319), Expect = 2.5e-257
Identity = 512/646 (79.26%), Positives = 550/646 (85.14%), Query Frame = 0

Query: 1   MENKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKRPPLPPPSSSSSADDQGNKVDLGR 60
           ME KR+L KP+L KFGV LAISFAG LYSRFR+  KRP LPPPSSSSSA DQGNKVDL R
Sbjct: 1   MEEKRNL-KPILLKFGVVLAISFAGFLYSRFRIRKKRPRLPPPSSSSSA-DQGNKVDLSR 60

Query: 61  GRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVNVDDSNVGLCPSNKHGVEKD 120
           GRG   +LDNQ +K                  +EM IPKVNVDDSNVGLCPS+K  V+KD
Sbjct: 61  GRGP--KLDNQAIK------------------EEMYIPKVNVDDSNVGLCPSSKRSVDKD 120

Query: 121 GLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLKSK 180
           GL LPE QELVKE DF AANAGLS +KNV+A R  L+TPKAY   E D+YEQEIRHLKSK
Sbjct: 121 GLFLPELQELVKESDFPAANAGLSHEKNVEALRSGLQTPKAYNNFETDDYEQEIRHLKSK 180

Query: 181 VKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRRLV 240
           VKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKI+NMEAKLF LKIESLQADNRRL 
Sbjct: 181 VKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFTLKIESLQADNRRLE 240

Query: 241 SQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNESNK 300
           SQV DHAKSVSDLEAA+AKIKFLKKKLRYEAEQNRGQILNLQQRV KL DQE+KTNESNK
Sbjct: 241 SQVSDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQQRVAKLHDQEYKTNESNK 300

Query: 301 DAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEKESLKEER 360
           DA+IKL++IE+LEKE+EDLR SNLRLQIENSDL RRLDATQ LANSILED EKESLKEER
Sbjct: 301 DARIKLKRIEDLEKEVEDLRNSNLRLQIENSDLARRLDATQVLANSILEDPEKESLKEER 360

Query: 361 ERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKT 420
           ERL  ENE L KEIEQLQAHRCADVEELVYLRWINACLRYELRN+QP  GKTAARDLSKT
Sbjct: 361 ERLGQENENLMKEIEQLQAHRCADVEELVYLRWINACLRYELRNYQPRPGKTAARDLSKT 420

Query: 421 LSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPS 480
           LSPKS+EKAKKLILEYANTEGIEGKGIN++DFDSDQWSSSQASS T   D DDS VDF +
Sbjct: 421 LSPKSEEKAKKLILEYANTEGIEGKGINIMDFDSDQWSSSQASSLT---DQDDSYVDFQA 480

Query: 481 TTKTSSNKIKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDSDSPRYSSSNSTGTNATRA 540
           TTK SSNKIKFISKLRKLL+GK SQQN  L AEKSAAS+EDSDSPRYSSSNSTGTNATRA
Sbjct: 481 TTKPSSNKIKFISKLRKLLKGKDSQQNQALSAEKSAASMEDSDSPRYSSSNSTGTNATRA 540

Query: 541 EGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVLGSDRSSN 600
           EGQGIG    SQ+SSRHSMDF RL +Q  + GK EDS+RRNSD GY NKR VLGS+R SN
Sbjct: 541 EGQGIGSANSSQSSSRHSMDFRRLSSQ--EYGKPEDSVRRNSDGGYTNKRLVLGSNRMSN 600

Query: 601 SSYRSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRK-AASIGSF 646
           S +++ S D ES+EKSELMKYAEVLKD+ GAKNR HRK AASI S+
Sbjct: 601 SPFKTPSPDTESSEKSELMKYAEVLKDSPGAKNRSHRKGAASIDSY 619

BLAST of HG10023513 vs. TAIR 10
Match: AT1G52080.1 (actin binding protein family )

HSP 1 Score: 344.7 bits (883), Expect = 1.6e-94
Identity = 260/649 (40.06%), Positives = 368/649 (56.70%), Query Frame = 0

Query: 3   NKRDLMKPVLFKFGVALAISFAGLLYSRFRLGNKR-----PPLPPPSSSSSADDQGNKVD 62
           +KRD+   VL + G ALA+SFAG L++RFR   KR     PPLPP SS +   D  NK  
Sbjct: 6   HKRDINLLVL-QLGAALAVSFAGFLFARFRKNTKRIGPTLPPLPPHSSDNGYRDYSNKSI 65

Query: 63  LGRGRGRRLRLDNQGMKAATTASSNNVVLFAVDAYQEMCIPKVNVDDSNVGLCPSNKHGV 122
             R  G              T  ++   L  V   +E  +                    
Sbjct: 66  DRRDEG--------------TEKTDEETLIGVSPRRECDLD------------------- 125

Query: 123 EKDGLLLPEFQELVKEFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHL 182
           EKD  LLPEF+E  K+ D    +       + + PR  +  P A+ + E+ ++E EI  L
Sbjct: 126 EKDVFLLPEFEEEAKKLDLLVCD-------DCETPRSDITAPLAFPSEEEADHENEINRL 185

Query: 183 KSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKISNMEAKLFKLKIESLQADNR 242
           ++ V+ LRERER LE +LLEYY LKEQ+   MEL++RLK++ ME K+F  KI+ LQA+N 
Sbjct: 186 RNTVRALRERERCLEDKLLEYYSLKEQQKIAMELRSRLKLNQMETKVFNFKIKKLQAENE 245

Query: 243 RLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNE 302
           +L ++  +H+K + +L+ AK++++ LKKKL    +Q+  QIL+L+QRV +LQ++E K   
Sbjct: 246 KLKAECFEHSKVLLELDMAKSQVQVLKKKLNINTQQHVAQILSLKQRVARLQEEEIKAVL 305

Query: 303 SNKDAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLANSILEDQEK-ESL 362
            + +A   +Q++ +LE EI +L  +N RLQ EN +L  +L++ Q +ANS LE+ E+ E+L
Sbjct: 306 PDLEADKMMQRLRDLESEINELTDTNTRLQFENFELSEKLESVQIIANSKLEEPEEIETL 365

Query: 363 KEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAARD 422
           +E+  RL  ENE L K++EQLQ  RC D+E+LVYLRWINACLRYELR +QPPAGKT ARD
Sbjct: 366 REDCNRLRSENEELKKDVEQLQGDRCTDLEQLVYLRWINACLRYELRTYQPPAGKTVARD 425

Query: 423 LSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSH--TDPGDPDDS 482
           LS TLSP S+EKAK+LILEYA++E          + D D+WSSSQ  S   TD    DDS
Sbjct: 426 LSTTLSPTSEEKAKQLILEYAHSED---------NTDYDRWSSSQEESSMITDSMFLDDS 485

Query: 483 AVDFPSTTKT-SSNKIKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDSDSPRYSSSNST 542
           +VD    TKT  + K K + KL K+L GK ++      ++K A S E        SS++T
Sbjct: 486 SVDTLFATKTKKTGKKKLMHKLMKILHGKDTKD-----SKKRAGSSE-------PSSSNT 545

Query: 543 GTNATRAEGQGIGYTTPSQNSSRHSMDFHRLHTQKEDDGKTEDSIRRNSDVGYVNKRFVL 602
           G            ++TP Q  S HSMDF  L       GK E+   +N  V    K    
Sbjct: 546 GV-----------HSTPRQLRSTHSMDFQMLMR-----GKDEEEDFKNHIVMLRRK---- 570

Query: 603 GSDRSSNSSY-RSQSQDAESTEKSELMKYAEVLKDTRGAKNRPHRKAAS 642
            S+ + +S+Y      + +   K EL+K A+ L  +R  K + H+K+ S
Sbjct: 606 -SEAAGSSTYGEEHCLETDQNGKKELIKLADALTKSRSTK-KLHKKSVS 570

BLAST of HG10023513 vs. TAIR 10
Match: AT3G25690.1 (Hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 209.1 bits (531), Expect = 1.0e-53
Identity = 150/388 (38.66%), Positives = 235/388 (60.57%), Query Frame = 0

Query: 120 DGLLLPEFQELVK-EFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLK 179
           D  +LPEF++L+  E ++        P  + D      +  + Y+ VE    + E+  LK
Sbjct: 85  DDDILPEFEDLLSGEIEY--------PLPDDDNNLEKAEKERKYE-VEMAYNDGELERLK 144

Query: 180 SKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRR 239
             VK L ERE  LE +LLEYYGLKEQE+ ++ELQ +LKI  +E  +  + I SLQA+ ++
Sbjct: 145 QLVKELEEREVKLEGELLEYYGLKEQESDIVELQRQLKIKTVEIDMLNITINSLQAERKK 204

Query: 240 LVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNES 299
           L  ++  +     +LE A+ KIK L+++++ +A Q +GQ+L L+Q V  LQ +E +    
Sbjct: 205 LQEELSQNGIVRKELEVARNKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKEEEAMNK 264

Query: 300 NKDAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQ---FLANSILEDQEKES 359
           + + + KL+ +++LE ++ +L++ N  LQ E  +L  +LD+ +      +++ E  +   
Sbjct: 265 DTEVERKLKAVQDLEVQVMELKRKNRELQHEKRELSIKLDSAEARIATLSNMTESDKVAK 324

Query: 360 LKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAAR 419
           ++EE   L   NE L K++E LQ +R ++VEELVYLRW+NACLRYELRN+Q PAGK +AR
Sbjct: 325 VREEVNNLKHNNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPAGKISAR 384

Query: 420 DLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSA 479
           DLSK LSPKS+ KAK+L+LEYA +E   G+G      D+D  S+    S     D D+++
Sbjct: 385 DLSKNLSPKSQAKAKRLMLEYAGSE--RGQG------DTDLESNYSQPSSPGSDDFDNAS 444

Query: 480 VDFPSTTKTS-SNKIKFISKLRKLLRGK 503
           +D  ++  +S S K   I KL+K  + K
Sbjct: 445 MDSSTSRFSSFSKKPGLIQKLKKWGKSK 455

BLAST of HG10023513 vs. TAIR 10
Match: AT3G25690.2 (Hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 209.1 bits (531), Expect = 1.0e-53
Identity = 150/388 (38.66%), Positives = 235/388 (60.57%), Query Frame = 0

Query: 120 DGLLLPEFQELVK-EFDFAAANAGLSPKKNVDAPRLALKTPKAYKTVEDDEYEQEIRHLK 179
           D  +LPEF++L+  E ++        P  + D      +  + Y+ VE    + E+  LK
Sbjct: 85  DDDILPEFEDLLSGEIEY--------PLPDDDNNLEKAEKERKYE-VEMAYNDGELERLK 144

Query: 180 SKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKISNMEAKLFKLKIESLQADNRR 239
             VK L ERE  LE +LLEYYGLKEQE+ ++ELQ +LKI  +E  +  + I SLQA+ ++
Sbjct: 145 QLVKELEEREVKLEGELLEYYGLKEQESDIVELQRQLKIKTVEIDMLNITINSLQAERKK 204

Query: 240 LVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKTNES 299
           L  ++  +     +LE A+ KIK L+++++ +A Q +GQ+L L+Q V  LQ +E +    
Sbjct: 205 LQEELSQNGIVRKELEVARNKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKEEEAMNK 264

Query: 300 NKDAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQ---FLANSILEDQEKES 359
           + + + KL+ +++LE ++ +L++ N  LQ E  +L  +LD+ +      +++ E  +   
Sbjct: 265 DTEVERKLKAVQDLEVQVMELKRKNRELQHEKRELSIKLDSAEARIATLSNMTESDKVAK 324

Query: 360 LKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAAR 419
           ++EE   L   NE L K++E LQ +R ++VEELVYLRW+NACLRYELRN+Q PAGK +AR
Sbjct: 325 VREEVNNLKHNNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPAGKISAR 384

Query: 420 DLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSA 479
           DLSK LSPKS+ KAK+L+LEYA +E   G+G      D+D  S+    S     D D+++
Sbjct: 385 DLSKNLSPKSQAKAKRLMLEYAGSE--RGQG------DTDLESNYSQPSSPGSDDFDNAS 444

Query: 480 VDFPSTTKTS-SNKIKFISKLRKLLRGK 503
           +D  ++  +S S K   I KL+K  + K
Sbjct: 445 MDSSTSRFSSFSKKPGLIQKLKKWGKSK 455

BLAST of HG10023513 vs. TAIR 10
Match: AT3G25690.3 (Hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 151.8 bits (382), Expect = 1.9e-36
Identity = 104/272 (38.24%), Positives = 169/272 (62.13%), Query Frame = 0

Query: 235 DNRRLVSQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHK 294
           +++ L  ++  +     +LE A+ KIK L+++++ +A Q +GQ+L L+Q V  LQ +E +
Sbjct: 51  NDKNLQEELSQNGIVRKELEVARNKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKEEE 110

Query: 295 TNESNKDAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQ---FLANSILEDQ 354
               + + + KL+ +++LE ++ +L++ N  LQ E  +L  +LD+ +      +++ E  
Sbjct: 111 AMNKDTEVERKLKAVQDLEVQVMELKRKNRELQHEKRELSIKLDSAEARIATLSNMTESD 170

Query: 355 EKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGK 414
           +   ++EE   L   NE L K++E LQ +R ++VEELVYLRW+NACLRYELRN+Q PAGK
Sbjct: 171 KVAKVREEVNNLKHNNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPAGK 230

Query: 415 TAARDLSKTLSPKSKEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDP 474
            +ARDLSK LSPKS+ KAK+L+LEYA +E   G+G      D+D  S+    S     D 
Sbjct: 231 ISARDLSKNLSPKSQAKAKRLMLEYAGSE--RGQG------DTDLESNYSQPSSPGSDDF 290

Query: 475 DDSAVDFPSTTKTS-SNKIKFISKLRKLLRGK 503
           D++++D  ++  +S S K   I KL+K  + K
Sbjct: 291 DNASMDSSTSRFSSFSKKPGLIQKLKKWGKSK 314

BLAST of HG10023513 vs. TAIR 10
Match: AT2G36650.1 (unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 51.2 bits (121), Expect = 3.6e-06
Identity = 55/238 (23.11%), Positives = 125/238 (52.52%), Query Frame = 0

Query: 168 DEYEQEIRHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKISNMEAKLFKL 227
           ++ +QEI  LKS+ + L+ +E  +E+    +  LK+QE  ++E ++ L +   +   F+ 
Sbjct: 72  NQQKQEILSLKSRFEELQRKEYEMELHFERFCNLKDQEVMLIEHKSILSLEKAQLDFFRK 131

Query: 228 KIESLQADNRRLVSQVCDHAKSVSDLEAAKAKIKFLK---KKLRYEAEQNRGQILNLQQR 287
           ++ +++ +++R  + V  + K V +++  +++   L+   KKLR +++Q   +++N  ++
Sbjct: 132 EVLAMEEEHKRGQALVIVYLKLVGEIQELRSENGLLEGKAKKLRRKSKQ-LYRVVNESRK 191

Query: 288 VVKLQDQEHKTNESNKDAQIKLQKIEELEKEIEDLRKSNLRLQIENSDLGRRLDATQFLA 347
           ++ ++ +  K  +   + + K   ++ELE +++D+      LQ E  +L           
Sbjct: 192 IIGVEKEFLKCVD---ELETKNNIVKELEGKVKDMEAYVDVLQEEKEEL---------FM 251

Query: 348 NSILEDQEKESLKEERERLAGENEALAKEIEQLQAHRCADVEELVYLRWINACLRYEL 403
            S     E  S+++ R         + +E E+L+      V+E++ LRW NACLR+E+
Sbjct: 252 KSSNSTSEMVSVEDYRR--------IVEEYEELKKDYANGVKEVINLRWSNACLRHEV 288

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038898688.1  2.4e-307  89.78     protein CHUP1, chloroplastic [Benincasa hispida] >XP_038898689.1 protein CHUP1, ...
KAA0059471.1    1.9e-299  87.60     protein CHUP1 [Cucumis melo var. makuwa] >TYK03852.1 protein CHUP1 [Cucumis melo...
XP_008462405.1  2.1e-298  87.44     PREDICTED: protein CHUP1, chloroplastic [Cucumis melo] >XP_008462406.1 PREDICTED...
XP_031744947.1  2.4e-294  85.58     protein CHUP1, chloroplastic isoform X1 [Cucumis sativus] >XP_031744948.1 protei...
XP_004141788.1  2.2e-292  85.43     protein CHUP1, chloroplastic isoform X2 [Cucumis sativus] >KGN45575.1 hypothetic...

Match Name      E-value   Identity  Description
Q9LI74          1.4e-52   38.66     Protein CHUP1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CHUP1 PE=1 SV=1

Match Name      E-value   Identity  Description
A0A5A7V182      9.0e-300  87.60     Protein CHUP1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold347G00460 ...
A0A1S3CGW9      1.0e-298  87.44     protein CHUP1, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103500772 PE=4 SV=1
A0A0A0K799      1.1e-292  85.43     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G452300 PE=4 SV=1
A0A6J1HMC2      7.7e-259  77.74     protein CHUP1, chloroplastic isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC1114...
A0A6J1D049      2.5e-257  79.26     protein CHUP1, chloroplastic isoform X3 OS=Momordica charantia OX=3673 GN=LOC111...

Match Name      E-value   Identity  Description
AT1G52080.1     1.6e-94   40.06     actin binding protein family
AT3G25690.1     1.0e-53   38.66     Hydroxyproline-rich glycoprotein family protein
AT3G25690.2     1.0e-53   38.66     Hydroxyproline-rich glycoprotein family protein
AT3G25690.3     1.9e-36   38.24     Hydroxyproline-rich glycoprotein family protein
AT2G36650.1     3.6e-06   23.11     unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae ...

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description     Source       Source Term    Source Description            Alignment
None       No IPR available    COILS        Coil           Coil                          coord: 201..221
None       No IPR available    COILS        Coil           Coil                          coord: 295..329
None       No IPR available    COILS        Coil           Coil                          coord: 348..380
None       No IPR available    COILS        Coil           Coil                          coord: 253..291
None       No IPR available    COILS        Coil           Coil                          coord: 167..194
None       No IPR available    MOBIDB_LITE  mobidb-lite    disorder_prediction           coord: 521..557
None       No IPR available    MOBIDB_LITE  mobidb-lite    disorder_prediction           coord: 593..609
None       No IPR available    MOBIDB_LITE  mobidb-lite    disorder_prediction           coord: 455..483
None       No IPR available    MOBIDB_LITE  mobidb-lite    disorder_prediction           coord: 37..74
None       No IPR available    MOBIDB_LITE  mobidb-lite    disorder_prediction           coord: 610..630
None       No IPR available    MOBIDB_LITE  mobidb-lite    disorder_prediction           coord: 55..69
None       No IPR available    MOBIDB_LITE  mobidb-lite    disorder_prediction           coord: 590..645
None       No IPR available    MOBIDB_LITE  mobidb-lite    disorder_prediction           coord: 513..558
None       No IPR available    PANTHER      PTHR31342:SF4  ACTIN BINDING PROTEIN FAMILY  coord: 2..641
IPR040265  Protein CHUP1-like  PANTHER      PTHR31342      PROTEIN CHUP1, CHLOROPLASTIC  coord: 2..641
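
Several of the mobidb-lite disorder predictions listed above overlap one another (for example 37..74 and 55..69). As a rough sketch, assuming the coordinates are 1-based and inclusive, the following Python collapses them into non-overlapping regions. The function name `merge_intervals` is illustrative and is not part of InterProScan or MobiDB.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive (start, end) intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous region: extend it if needed.
            previous_start, previous_end = merged[-1]
            merged[-1] = (previous_start, max(previous_end, end))
        else:
            merged.append((start, end))
    return merged


# mobidb-lite coordinates copied from the InterPro table on this page.
disorder = [
    (521, 557), (593, 609), (455, 483), (37, 74),
    (610, 630), (55, 69), (590, 645), (513, 558),
]
regions = merge_intervals(disorder)
# regions == [(37, 74), (455, 483), (513, 558), (590, 645)]
```

The eight predictions reduce to four distinct regions, which is easier to compare against the coiled-coil coordinates in the same table.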

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10023513.1  HG10023513.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0009658      chloroplast organization
cellular_component  GO:0009707      chloroplast outer membrane
cellular_component  GO:0016021      integral component of membrane